Extracting an Optimal Set of Linguistic Summaries using Genetic Algorithm Combined with Greedy Strategy

Phạm Thị Lan, Nguyễn Cát Hồ, Phạm Đình Phong

  • Thi Lan Pham Department of Information Technology, Hanoi University of Education
  • Cat Ho Nguyen Institute of Theoretical and Applied Research, Duy Tan University in Da Nang and Hanoi
  • Dinh Phong Pham University of Transport and Communications
Keywords: Linguistic data summary, hedge algebras, linguistic frame of cognition, genetic algorithm, greedy strategy

Abstract

The goal of extracting linguistic data summaries is to produce summary sentences expressed in natural language which represent knowledge hidden in numerical dataset. At the most general level, human users can get a very large number of linguistic summaries. In this paper, we propose a model of genetic algorithm combined with greedy strategy to extract an optimal set of linguistic summaries based on the evaluation measures of goodness and diversity of the set of linguistic summaries. The experimental results on creep dataset have demonstrated the outperformance of the proposed model of genetic algorithm combined with greedy strategy in comparison with the existing genetic algorithm models in extracting linguistic summaries from data.

Author Biographies

Thi Lan Pham, Department of Information Technology, Hanoi University of Education

Phạm Thị Lan1, Nguyễn Cát Hồ2, 3, Phạm Đình Phong4

  1. Department of Information Technology, Hanoi University of Education
  2. Theoretical and Applied Research Institute, Duy Tan University
  3. Department of Information Technology, Duy Tan University
  4. Department of Information Technology, University of Transport and Communications

Contact: Pham Thi Lan, ptlan@hnue.edu.vn

Cat Ho Nguyen, Institute of Theoretical and Applied Research, Duy Tan University in Da Nang and Hanoi

Ho Nguyen-Cat was born in Hanoi, Vietnam, in 1941. He received a B.S. degree in mathematics from the University of Hanoi, the former VNU University of Science of Vietnam National University, Hanoi; a Dr.
degree in mathematics and mechanics from the Warsaw University, Warsaw, Poland, in 1971; and a Dr. Sc. degree in mathematics, cybernetics and computer science from the Dresden University of Technology, Dresden, Germany, in 1987. He is now a researcher at the Institute of Theoretical and Applied Research, Duy Tan University in Da Nang and Hanoi. He is the author of more than 90 articles, including more than 40 articles published in international journals and conferences. His research interests include fuzzy logic, fuzzy databases, and computing with words, especially hedge algebras as a mathematical basis for directly handling linguistic words and their applications.
Email: ncatho@gmail.com

Dinh Phong Pham, University of Transport and Communications

Phong Pham-Dinh received a Master degree in Information Technology and a Doctor of Philosophy degree in Computer Science from University of Engineering and Technology, Vietnam National University,
Hanoi in 2011 and 2018, respectively. Now, he is a lecturer at the University of Transport and Communications. His research interests include hedge algebras, fuzzy systems, soft computing, data mining, and machine learning.
Email: phongpd@utc.edu.vn

Published
2021-04-27