A Meta-Heuristic Approach for Enhancing Performance of Associative Classification
Abstract
Associative classification is a data mining approach for building accurate and easily interpretable predictive systems. It combines association rule mining with classification techniques to find a set of rules, called classification association rules (CARs), that predict the label attribute. Many associative classification algorithms exist, such as CPAR, CBA, and CMAR, but their accuracy remains low on large datasets and their running time is impractical. This paper proposes a heuristic approach that significantly improves the performance of associative classification algorithms on large data in three respects: running time, size of the rule set, and accuracy. Experimental results show that heuristically searching for an optimal data set makes associative classification more useful on big data and practical to apply.
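To make the notion of a classification association rule concrete, the following minimal sketch mines rules of the form "itemset -> class label" by brute force and keeps those that meet support and confidence thresholds. It is an illustration of the general idea only, not the heuristic method proposed in this paper nor an implementation of CPAR, CBA, or CMAR. The function name `mine_cars`, the thresholds, and the toy records are all hypothetical.

```python
from collections import Counter
from itertools import combinations


def mine_cars(rows, labels, min_support=0.3, min_confidence=0.8, max_len=2):
    """Return (itemset, label, support, confidence) tuples meeting both thresholds.

    Brute-force enumeration of itemsets up to max_len items; suitable only
    for tiny illustrative inputs.
    """
    n = len(rows)
    itemset_counts = Counter()  # records containing each itemset
    rule_counts = Counter()     # records containing each itemset AND carrying each label
    for items, label in zip(rows, labels):
        for k in range(1, max_len + 1):
            for itemset in combinations(sorted(items), k):
                itemset_counts[itemset] += 1
                rule_counts[(itemset, label)] += 1

    rules = []
    for (itemset, label), count in rule_counts.items():
        support = count / n                           # fraction of all records matching the rule
        confidence = count / itemset_counts[itemset]  # fraction of itemset matches with this label
        if support >= min_support and confidence >= min_confidence:
            rules.append((itemset, label, support, confidence))

    # Rank by confidence, then support, then itemset for a stable order.
    rules.sort(key=lambda r: (-r[3], -r[2], r[0]))
    return rules


# Hypothetical toy records (made up for illustration, not taken from any dataset).
rows = [
    {"odor=foul", "cap=flat"},
    {"odor=foul", "cap=bell"},
    {"odor=none", "cap=flat"},
    {"odor=none", "cap=bell"},
]
labels = ["poisonous", "poisonous", "edible", "edible"]

rules = mine_cars(rows, labels)
for itemset, label, support, confidence in rules:
    print(f"{set(itemset)} -> {label} (support={support:.2f}, confidence={confidence:.2f})")
```

On realistic data the number of candidate itemsets grows combinatorially, which is one reason running time and rule-set size become concerns on large datasets.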