Hybrid Grid Partitioning, Rough Set Theory, and Fuzzy Rule Generation for Enhanced Association Rule Mining on Complex Datasets
DOI:
https://doi.org/10.35335/emod.v14i3.33Keywords:
Association rule mining, Complex datasets, Hybrid grid partitioning, Rough set theory, Fuzzy rule generationAbstract
Association rule mining plays a crucial role in extracting valuable insights and patterns from complex datasets. However, traditional association rule mining algorithms often face challenges in accurately discovering relevant rules due to the complexity, uncertainty, and vagueness inherent in such datasets. In this research, we propose an integrated approach that combines hybrid grid partitioning, rough set theory, and fuzzy rule generation to enhance association rule mining on complex datasets. First, hybrid grid partitioning is employed to divide the data space into a set of refined grid cells, allowing for more precise representation of the dataset's structure. Next, rough set theory is utilized to handle uncertainty and vagueness by computing lower and upper approximations of concepts. This enables the identification of objects that share similar condition attribute values and improves the robustness of rule generation. Additionally, fuzzy rule generation is incorporated to capture nuanced relationships and patterns within the dataset. Fuzzy logic is employed to represent imprecise and subjective concepts, facilitating the discovery of deeper insights and enhancing the comprehensibility of the generated rules. The proposed approach contributes to the accuracy, interpretability, and relevance of association rules in complex datasets. By integrating multiple techniques, it addresses the limitations of traditional algorithms and provides a comprehensive framework for knowledge discovery. Experimental evaluations demonstrate the effectiveness of the proposed approach in enhancing rule discovery accuracy and interpretability compared to traditional methods. Although some limitations, such as scalability and parameter sensitivity, need to be addressed, the research's findings highlight the potential of the integrated approach for extracting valuable insights from complex datasets. The proposed methodology has broad applicability across various domains and can empower decision-making processes in areas such as market basket analysis, customer behavior analysis, bioinformatics, and web mining. Future research can focus on addressing the identified limitations and further validating the approach's effectiveness in real-world scenarios.
References
Abdel-Basset, M., Mohamed, M., Smarandache, F., & Chang, V. (2018). Neutrosophic association rule mining algorithm for big data analysis. Symmetry, 10(4), 106.
Adebayo, O. S., & Abdul Aziz, N. (2019). Improved malware detection model with apriori association rule and particle swarm optimization. Security and Communication Networks, 2019.
Aghaeipoor, F., & Eftekhari, M. (2019). EEFR-R: extracting effective fuzzy rules for regression problems, through the cooperation of association rule mining concepts and evolutionary algorithms. Soft Computing, 23, 11737-11757.
Alcalá-Fdez, J., Alcala, R., & Herrera, F. (2011). A fuzzy association rule-based classification model for high-dimensional problems with genetic rule selection and lateral tuning. IEEE Transactions on Fuzzy systems, 19(5), 857-872.
Bai, H., Ge, Y., Wang, J., Li, D., Liao, Y., & Zheng, X. (2014). A method for extracting rules from spatial data based on rough fuzzy sets. Knowledge-Based Systems, 57, 28-40.
Berger, P. A. (2004). Rough set rule induction for suitability assessment. Environmental management, 34, 546-558.
Chang, P. C., Liu, C. H., & Wang, Y. W. (2006). A hybrid model by clustering and evolving fuzzy rules for sales decision supports in printed circuit board industry. Decision Support Systems, 42(3), 1254-1269.
Chen, G., & Wei, Q. (2002). Fuzzy association rules and the extended mining algorithms. Information Sciences, 147(1-4), 201-228.
Chimphlee, W., HananAbdullah, A., Sap, M. N. M., Chimphlee, S., & Srinoy, S. (2007). A Rough-Fuzzy Hybrid Algorithm for computer intrusion detection. a a, 2, 1.
Chou, C. H., Hsieh, S. C., & Qiu, C. J. (2017). Hybrid genetic algorithm and fuzzy clustering for bankruptcy prediction. Applied Soft Computing, 56, 298-316.
Dastjerd, N. K., Sert, O. C., Ozyer, T., & Alhajj, R. (2019). Fuzzy classification methods based diagnosis of Parkinson’s disease from speech test cases. Current aging science, 12(2), 100-120.
Dehuri, S., & Ghosh, A. (2013). Revisiting evolutionary algorithms in feature selection and nonfuzzy/fuzzy rule based classification. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 3(2), 83-108.
Djenouri, Y., Belhadi, A., Fournier-Viger, P., & Fujita, H. (2018). Mining diversified association rules in big datasets: A cluster/GPU/genetic approach. Information Sciences, 459, 117-134.
Elsaid, A., Salem, R., & Abdul-Kader, H. (2017). Research Article A Dynamic Stakeholder Classification and Prioritization Based on Hybrid Rough-fuzzy Method.
Felici, G., & Vercellis, C. (Eds.). (2007). Mathematical methods for knowledge discovery and data mining. IGI Global.
Florez, G., Bridges, S. A., & Vaughn, R. B. (2002, June). An improved algorithm for fuzzy data mining for intrusion detection. In 2002 Annual Meeting of the North American Fuzzy Information Processing Society Proceedings. NAFIPS-FLINT 2002 (Cat. No. 02TH8622) (pp. 457-462). IEEE.
Goel, L., Gupta, D., & Panchal, V. K. (2012). Hybrid bio-inspired techniques for land cover feature extraction: A remote sensing perspective. Applied soft computing, 12(2), 832-849.
Gorzałczany, M. B., & Rudziński, F. (2016). A multi-objective genetic optimization for fast, fuzzy rule-based credit classification with balanced accuracy and interpretability. Applied Soft Computing, 40, 206-220.
Guo, J. Y. (2003). Rough set-based approach to data mining. Case Western Reserve University.
Han, E. H., Karypis, G., & Kumar, V. (2000). Scalable parallel data mining for association rules. IEEE Transactions on Knowledge and Data Engineering, 12(3), 337-352.
Hassanien, A. E., Abraham, A., Peters, J. F., Schaefer, G., & Henry, C. (2009). Rough sets and near sets in medical imaging: A review. IEEE Transactions on Information Technology in Biomedicine, 13(6), 955-968.
He, Y. (2006). Fuzzy-granular based data mining for effective decision support in biomedical applications.
He, Y., Tang, Y., Zhang, Y. Q., & Sunderraman, R. (2006). Adaptive Fuzzy Association Rule mining for effective decision support in biomedical applications. International journal of data mining and bioinformatics, 1(1), 3-18.
Huang, H. H., & Kuo, Y. H. (2010). Cross-lingual document representation and semantic similarity measure: A fuzzy set and rough set based approach. IEEE Transactions on Fuzzy Systems, 18(6), 1098-1111.
Hüllermeier, E. (2005). Fuzzy methods in machine learning and data mining: Status and prospects. Fuzzy sets and Systems, 156(3), 387-406.
Hung, Y. H. (2009). A neural network classifier with rough set-based feature selection to classify multiclass IC package products. Advanced Engineering Informatics, 23(3), 348-357.
Jain, V., Benyoucef, L., & Deshmukh, S. G. (2008). A new approach for evaluating agility in supply chains using fuzzy association rules mining. Engineering Applications of Artificial Intelligence, 21(3), 367-385.
Jensen, R., & Shen, Q. (2008). Computational intelligence and feature selection: rough and fuzzy approaches.
Jin, W. (2005). Fuzzy classification based on fuzzy association rule mining. North Carolina State University.
Kaya, M., & Alhajj, R. (2006). Utilizing genetic algorithms to optimize membership functions for fuzzy weighted association rules mining. Applied Intelligence, 24, 7-15.
Lee, C., Zaknich, A., & Braunl, T. (2008, June). A framework of adaptive TS type rough-fuzzy inference systems (ARFIS). In 2008 IEEE International Conference on Fuzzy Systems (IEEE World Congress on Computational Intelligence) (pp. 567-574). IEEE.
Lee, H. E., Park, K. H., & Bien, Z. Z. (2008). Iterative fuzzy clustering algorithm with supervision to construct probabilistic fuzzy rule base from numerical data. IEEE Transactions on Fuzzy Systems, 16(1), 263-277.
Li, R., & Wang, Z. O. (2004). Mining classification rules using rough sets and neural networks. European Journal of Operational Research, 157(2), 439-448.
Li, T., Ruan, D., Geert, W., Song, J., & Xu, Y. (2007). A rough sets based characteristic relation approach for dynamic attribute generalization in data mining. Knowledge-Based Systems, 20(5), 485-494.
Liang, J., Wang, F., Dang, C., & Qian, Y. (2012). A group incremental approach to feature selection applying rough set technique. IEEE Transactions on Knowledge and Data Engineering, 26(2), 294-308.
Mabu, S., Chen, C., Lu, N., Shimada, K., & Hirasawa, K. (2010). An intrusion-detection model based on fuzzy class-association-rule mining using genetic network programming. IEEE transactions on systems, man, and cybernetics, part C (Applications and Reviews), 41(1), 130-139.
Mangalampalli, A., & Pudi, V. (2009, August). Fuzzy association rule mining algorithm for fast and efficient performance on very large datasets. In 2009 IEEE international conference on fuzzy systems (pp. 1163-1168). IEEE.
Nithya, N. S., & Duraiswamy, K. (2014). Gain ratio based fuzzy weighted association rule mining classifier for medical diagnostic interface. Sadhana, 39, 39-52.
Pach, F. P., Gyenesei, A., & Abonyi, J. (2008). Compact fuzzy association rule-based classifier. Expert systems with applications, 34(4), 2406-2416.
Paranjape-Voditel, P., & Deshpande, U. (2013). A stock market portfolio recommender system based on association rule mining. Applied Soft Computing, 13(2), 1055-1063.
Patil, P. R., Sharma, Y., & Kshirasagar, M. (2016). Performance analysis of intrusion detection systems implemented using hybrid machine learning techniques. Int. J. Comput. Appl, 0975-8887.
Prabhavathy, P. (2016). Some applications of covering based rough set theory in information systems.
Raza, M. S., & Qamar, U. (2018). A heuristic based dependency calculation technique for rough set theory. Pattern Recognition, 81, 309-325.
Rebbah, M., Yemres, M. E. A., Khaldi, M., & Debakla, M. (2015). Hybrid Distribution for Association Rules Extraction on Grid Computing. Retrieved, 24th Nov.
Riza, L. S., Bergmeir, C., Herrera, F., & Benítez, J. M. (2015). frbs: Fuzzy rule-based systems for classification and regression in R. Journal of statistical software, 65, 1-30.
Salleh, M. N. M., Talpur, N., & Talpur, K. H. (2018). A modified neuro-fuzzy system using metaheuristic approaches for data classification. Artificial intelligence–emerging trends and applications’.(Ed. MAA Fernandez) pp, 29-45.
Sangaiah, A. K., Gao, X. Z., & Abraham, A. (Eds.). (2016). Handbook of Research on Fuzzy and Rough Set Theory in Organizational Decision Making. IGI Global.
Sathiyapriya, K., & Sadasivam, G. S. (2013). A survey on privacy preserving association rule mining. International Journal of Data Mining & Knowledge Management Process, 3(2), 119.
Setyohadi, D. B., Bakar, A. A., & Othman, Z. A. (2015). Optimization overlap clustering based on the hybrid rough discernibility concept and rough K-Means. Intelligent Data Analysis, 19(4), 795-823.
Sheikhan, M., & Sharifi Rad, M. (2013). Using particle swarm optimization in fuzzy association rules‐based feature selection and fuzzy ARTMAP‐based attack recognition. Security and Communication Networks, 6(7), 797-811.
Shen, Q., & Chouchoulas, A. (2000). A modular approach to generating fuzzy rules with reduced attributes for the monitoring of complex systems. Engineering Applications of Artificial Intelligence, 13(3), 263-278.
Singh, S., Garg, R., & Mishra, P. K. (2017). A comparative study of association rule mining algorithms on grid and cloud platform. arXiv preprint arXiv:1709.07594.
Sivakumar, N. R., & Alaraj, A. (2017). Path management strategy to reduce flooding of grid fisheye state routing protocol in mobile ad hoc network using fuzzy and rough set theory. International Journal of Artificial Intelligence and Soft Computing, 6(3), 187-208.
Soliman, O. S., & Adly, A. (2012, May). Bio-inspired algorithm for classification association rules. In 2012 8th International Conference on Informatics and Systems (INFOS) (pp. BIO-154). IEEE.
Suganthi, L., Iniyan, S., & Samuel, A. A. (2015). Applications of fuzzy logic in renewable energy systems–a review. Renewable and sustainable energy reviews, 48, 585-607.
Sun, Q., Li, Z., Liu, Z., & Zhou, J. (2011). Fault Diagnosis for Smart Grid by a Hybrid Method of Rough Sets and Neural Network. In Advances in Computer Science, Environment, Ecoinformatics, and Education: International Conference, CSEE 2011, Wuhan, China, August 21-22, 2011. Proceedings, Part IV (pp. 577-582). Springer Berlin Heidelberg.
Tajbakhsh, A., Rahmati, M., & Mirzaei, A. (2009). Intrusion detection using fuzzy association rules. Applied Soft Computing, 9(2), 462-469.
Tiwari, A., Tiwari, A. K., Saini, H. C., Sharma, A. K., & Yadav, A. K. (2013, July). A cloud computing using rough set theory for cloud service parameters through ontology in cloud simulator. In ACITY-2013 Conference at Chennai, in CS and IT proceedings (Vol. 10).
Tlili, R., & Slimani, Y. (2012). A novel data partitioning approach for association rule mining on grids. International Journal of Grid and Distributed Computing, 5(4), 1-20.
Udhaya Kumar, S., & Hannah Inbarani, H. (2017). PSO-based feature selection and neighborhood rough set-based classification for BCI multiclass motor imagery task. Neural Computing and Applications, 28, 3239-3258.
Valdés, J. J., & Barton, A. J. (2007). Finding relevant attributes in high dimensional data: a distributed computing hybrid data mining strategy. Transactions on Rough Sets VI: Commemorating the Life and Work of Zdzisław Pawlak, Part I, 366-396.
Verlinde, H., De Cock, M., & Boute, R. (2006). Fuzzy versus quantitative association rules: A fair data-driven comparison. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 36(3), 679-684.
Vluymans, S., D'eer, L., Saeys, Y., & Cornelis, C. (2015). Applications of Fuzzy Rough Set Theory in Machine Learning: a Survey. Fundam. Informaticae, 142(1-4), 53-86.
Wang, W., & Bridges, S. M. (2000). Genetic algorithm optimization of membership functions for mining fuzzy association rules. Department of Computer Science Mississippi State University, 2.
Wen, T., Wang, G., Guo, Q., & Ma, X. (2008, October). An Optimal Association Rule Mining Algorithm Based on Knowledge Grid. In 2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery (Vol. 2, pp. 572-575). IEEE.
Yan, S., Zhou, J., Zheng, Y., & Li, C. (2018). An improved hybrid backtracking search algorithm based T–S fuzzy model and its implementation to hydroelectric generating units. Neurocomputing, 275, 2066-2079.
Yeh, C. C., Chi, D. J., & Lin, Y. R. (2014). Going-concern prediction using hybrid random forests and rough set approach. Information Sciences, 254, 98-110.
Yeh, C. C., Chi, D. J., Lin, T. Y., & Chiu, S. H. (2016). A hybrid detecting fraudulent financial statements model using rough set theory and support vector machines. Cybernetics and Systems, 47(4), 261-276.
Zhang, J., Williams, S. O., & Wang, H. (2018). Intelligent computing system based on pattern recognition and data mining algorithms. Sustainable Computing: Informatics and Systems, 20, 192-202.
Zhang, Y., Ishibuchi, H., & Wang, S. (2017). Deep Takagi–Sugeno–Kang fuzzy classifier with shared linguistic fuzzy rules. IEEE Transactions on Fuzzy Systems, 26(3), 1535-1549.
Zhao, W., Niu, Q., Li, K., & Irwin, G. W. (2013). A hybrid learning method for constructing compact rule-based fuzzy models. IEEE transactions on cybernetics, 43(6), 1807-1821.
Zhou, S. M., & Gan, J. Q. (2007). Constructing L2-SVM-based fuzzy classifiers in high-dimensional space with automatic model selection and fuzzy rule ranking. IEEE Transactions on Fuzzy Systems, 15(3), 398-409.
Published
How to Cite
Issue
Section
License
Copyright (c) 2020 Neeta Grant Caribbean

This work is licensed under a Creative Commons Attribution 4.0 International License.
