Integrating hybrid grid partition and rough set method for fuzzy rule generation: a novel approach for accurate dataset classification

Authors

  • Luke Joseph Faculty of Science, The Catholic University of Malawi, Malawi
  • Meiser Llywellenie O'Leary The Catholic University of Malawi, Malawi
  • Bisani Zagré The Catholic University of Malawi, Malawi

DOI:

https://doi.org/10.35335/emod.v17i2.21

Keywords:

Dataset classification, Fuzzy rule generation, Grid partition, Hybrid approach, Rough set method

Abstract

Accurate dataset classification is a critical task in various domains, and combining different methodologies can enhance classification performance. This research presents a novel approach that integrates Hybrid Grid Partition and Rough Set methods for fuzzy rule generation, aiming to improve accuracy and interpretability in dataset classification. The proposed approach leverages Hybrid Grid Partition to discretize continuous attributes and Rough Set attribute reduction to identify essential attributes, enabling accurate classification while handling uncertainty and imprecision. The generated fuzzy rules provide interpretability, aiding decision-making processes and providing insights into classification factors. The approach's robustness and generalization capabilities are demonstrated through experiments on diverse datasets, indicating its potential applicability in real-world scenarios. However, limitations such as the absence of specific evaluation metrics and the need for further validation on larger datasets are acknowledged. Overall, this research contributes to accurate dataset classification by offering a novel integrated approach and highlighting areas for future investigation and refinement

References

Abdelhamid, N., Ayesh, A., & Thabtah, F. (2014). Phishing detection based associative classification data mining. Expert Systems with Applications, 41(13), 5948–5959.

Akram, M., Luqman, A., & Alcantud, J. C. R. (2022). An integrated ELECTRE-I approach for risk evaluation with hesitant Pythagorean fuzzy information. Expert Systems with Applications, 200, 116945.

Akram, M., Muhiuddin, G., & Santos-García, G. (2022). An enhanced VIKOR method for multi-criteria group decision-making with complex Fermatean fuzzy sets. Math. Biosci. Eng, 19(7), 7201–7231.

Albahri, A. S., Hamid, R. A., Alwan, J. K., Al-Qays, Z. T., Zaidan, A. A., Zaidan, B. B., Albahri, A. O. S., AlAmoodi, A. H., Khlaf, J. M., & Almahdi, E. M. (2020). Role of biological data mining and machine learning techniques in detecting and diagnosing the novel coronavirus (COVID-19): a systematic review. Journal of Medical Systems, 44, 1–11.

Almugren, N., & Alshamlan, H. (2019). A survey on hybrid feature selection methods in microarray gene expression data for cancer classification. IEEE Access, 7, 78533–78548.

Alshomrani, S., Bawakid, A., Shim, S.-O., Fernández, A., & Herrera, F. (2015). A proposal for evolutionary fuzzy systems using feature weighting: dealing with overlapping in imbalanced datasets. Knowledge-Based Systems, 73, 1–17.

Bello, M., Nápoles, G., Vanhoof, K., & Bello, R. (2021). Data quality measures based on granular computing for multi-label classification. Information Sciences, 560, 51–67.

Bezdek, J. C., & Keller, J. M. (2020). Streaming data analysis: Clustering or classification? IEEE Transactions on Systems, Man, and Cybernetics: Systems, 51(1), 91–102.

Bhagat, S. N., Rath, P. S., & Mitra, A. (2022). Automated Room Occupancy Prediction Using Fuzzy-Rough Set Theory-Based Supervised Learning. In Human-Centric Smart Computing: Proceedings of ICHCSC 2022 (pp. 329–335). Springer.

Bhattacharya, S., Maddikunta, P. K. R., Hakak, S., Khan, W. Z., Bashir, A. K., Jolfaei, A., & Tariq, U. (2020). Antlion re-sampling based deep neural network model for classification of imbalanced multimodal stroke dataset. Multimedia Tools and Applications, 1–25.

Bock, F. E., Aydin, R. C., Cyron, C. J., Huber, N., Kalidindi, S. R., & Klusemann, B. (2019). A review of the application of machine learning and data mining approaches in continuum materials mechanics. Frontiers in Materials, 6, 110.

Boulmaiz, T., Guermoui, M., Saber, M., Boutaghane, H., Abida, H., & Eslamian, S. (2023). Uncertainty analysis using fuzzy models in hydroinformatics. In Handbook of Hydroinformatics (pp. 423–434). Elsevier.

Buabeng, A., Simons, A., Frempong, N. K., & Ziggah, Y. Y. (2022). Predictive Maintenance Model Based on Multisensor Data Fusion of Hybrid Fuzzy Rough Set Theory Feature Selection and Stacked Ensemble for Fault Classification. Mathematical Problems in Engineering, 2022.

Buczak, A. L., & Guven, E. (2015). A survey of data mining and machine learning methods for cyber security intrusion detection. IEEE Communications Surveys & Tutorials, 18(2), 1153–1176.

Chimphlee, W., HananAbdullah, A., Sap, M. N. M., Chimphlee, S., & Srinoy, S. (2007). A Rough-Fuzzy Hybrid Algorithm for computer intrusion detection. A A, 2, 1.

Chowdhary, C. L., Mittal, M., Pattanaik, P. A., & Marszalek, Z. (2020). An efficient segmentation and classification system in medical images using intuitionist possibilistic fuzzy C-mean clustering and fuzzy SVM algorithm. Sensors, 20(14), 3903.

Desrochers, B. (2018). Simultaneous localization and mapping in unstructured environments: a set-membership approach. ENSTA Bretagne-École nationale supérieure de techniques avancées Bretagne.

Dey, P., & Jana, D. K. (2023). Evaluation of the convincing ability through presentation skills of pre-service management wizards using AI via T2 linguistic fuzzy logic. Journal of Computational and Cognitive Engineering, 2(2), 133–142.

Díaz-Rodríguez, N., Cadahía, O. L., Cuéllar, M. P., Lilius, J., & Calvo-Flores, M. D. (2014). Handling real-world context awareness, uncertainty and vagueness in real-time human activity tracking and recognition with a fuzzy ontology-based hybrid method. Sensors, 14(10), 18131–18171.

Dougherty, J., Kohavi, R., & Sahami, M. (1995). Supervised and unsupervised discretization of continuous features. In Machine learning proceedings 1995 (pp. 194–202). Elsevier.

Duong-Bao, N., He, J., Thi, L. N., Nguyen-Huu, K., & Lee, S.-W. (2022). A Novel Valued Tolerance Rough Set and Decision Rules Method for Indoor Positioning Using WiFi Fingerprinting. Sensors, 22(15), 5709.

Fazzolari, M., Alcalá, R., & Herrera, F. (2014). A multi-objective evolutionary method for learning granularities based on fuzzy discretization to improve the accuracy-complexity trade-off of fuzzy rule-based classification systems: D-MOFARC algorithm. Applied Soft Computing, 24, 470–481.

Goel, L., Gupta, D., & Panchal, V. K. (2012). Hybrid bio-inspired techniques for land cover feature extraction: A remote sensing perspective. Applied Soft Computing, 12(2), 832–849.

Hentout, A., Maoudj, A., & Aouache, M. (2023). A review of the literature on fuzzy-logic approaches for collision-free path planning of manipulator robots. Artificial Intelligence Review, 56(4), 3369–3444.

Houssein, E. H., Hosney, M. E., Mohamed, W. M., Ali, A. A., & Younis, E. M. G. (2023). Fuzzy-based hunger games search algorithm for global optimization and feature selection using medical data. Neural Computing and Applications, 35(7), 5251–5275.

Jagtap, S. B., & Hambarde, M. S. M. (2014). Agricultural plant leaf disease detection and diagnosis using image processing based on morphological feature extraction. IOSR J. VLSI Signal Process, 4(5), 24–30.

Janatabad, A. A., Sadeghieh, A., Lotfi, M. M., & Mostafaeipour, A. (2022). Determining the tariffs of physicians using a combined model of data mining techniques and fuzzy logic in health insurance. International Journal of Industrial Engineering, 33(1), 1–9.

Javed, K. (2014). A robust & reliable Data-driven prognostics approach based on extreme learning machine and fuzzy clustering. Université de Franche-Comté.

Jayatilake, S. M. D. A. C., & Ganegoda, G. U. (2021). Involvement of machine learning tools in healthcare decision making. Journal of Healthcare Engineering, 2021.

Ji, W., Pang, Y., Jia, X., Wang, Z., Hou, F., Song, B., Liu, M., & Wang, R. (2021). Fuzzy rough sets and fuzzy rough neural networks for feature selection: A review. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 11(3), e1402.

Jiang, F., & Sui, Y. (2015). A novel approach for discretization of continuous attributes in rough set theory. Knowledge-Based Systems, 73, 324–334.

Kang, X. (2021). Combining rough set theory and support vector regression to the sustainable form design of hybrid electric vehicle. Journal of Cleaner Production, 304, 127137.

Kavakiotis, I., Tsave, O., Salifoglou, A., Maglaveras, N., Vlahavas, I., & Chouvarda, I. (2017). Machine learning and data mining methods in diabetes research. Computational and Structural Biotechnology Journal, 15, 104–116.

Khan, M. A., & Kim, Y. (2021). Cardiac Arrhythmia Disease Classification Using LSTM Deep Learning Approach. Computers, Materials & Continua, 67(1).

Kostek, B. (2013). Soft computing in acoustics: applications of neural networks, fuzzy logic and rough sets to musical acoustics (Vol. 31). Physica.

Kotsiantis, S. B., Zaharakis, I. D., & Pintelas, P. E. (2006). Machine learning: a review of classification and combining techniques. Artificial Intelligence Review, 26(3), 159–190.

Kumar, P. R., & Ravi, V. (2007). Bankruptcy prediction in banks and firms via statistical and intelligent techniques–A review. European Journal of Operational Research, 180(1), 1–28.

Kumar, T., Sankaran, K. S., Ritonga, M., Asif, S., Kumar, C. S., Mohammad, S., Sengan, S., & Asenso, E. (2022). Research Article Fuzzy Logic and Machine Learning-Enabled Recommendation System to Predict Suitable Academic Program for Students.

Liu, H., Hussain, F., Tan, C. L., & Dash, M. (2002). Discretization: An enabling technique. Data Mining and Knowledge Discovery, 6, 393–423.

López, V., Fernández, A., García, S., Palade, V., & Herrera, F. (2013). An insight into classification with imbalanced data: Empirical results and current trends on using data intrinsic characteristics. Information Sciences, 250, 113–141.

Matkan, A. A., Hajeb, M., & Sadeghian, S. (2014). Road extraction from lidar data using support vector machine classification. Photogrammetric Engineering & Remote Sensing, 80(5), 409–422.

Mishra, S., Tripathy, H. K., Mallick, P. K., Bhoi, A. K., & Barsocchi, P. (2020). EAGA-MLP—an enhanced and adaptive hybrid classification model for diabetes diagnosis. Sensors, 20(14), 4036.

Nahaei, V. S., Novin, M. H., & Khaligh, M. A. (2021). Review and prioritization of investment projects in the Waste Management organization of Tabriz Municipality with a Rough Sets Theory approach. International Journal of Innovation in Management, Economics and Social Sciences, 1(3), 46–57.

Nanda, N. B., & Parikh, A. (2019). Hybrid approach for network intrusion detection system using random forest classifier and rough set theory for rules generation. Advanced Informatics for Computing Research: Third International Conference, ICAICR 2019, Shimla, India, June 15–16, 2019, Revised Selected Papers, Part II 3, 274–287.

Nguyen, G., Dlugolinsky, S., Bobák, M., Tran, V., López García, Á., Heredia, I., Malík, P., & Hluchý, L. (2019). Machine learning and deep learning frameworks and libraries for large-scale data mining: a survey. Artificial Intelligence Review, 52, 77–124.

Nilashi, M., Abumalloh, R. A., Alrizq, M., Almulihi, A., Alghamdi, O. A., Farooque, M., Samad, S., Mohd, S., & Ahmadi, H. (2022). A Hybrid Method to Solve Data Sparsity in Travel Recommendation Agents Using Fuzzy Logic Approach. Mathematical Problems in Engineering, 2022.

Pati, S. K., Das, A. K., & Ghosh, A. (2013). Gene selection using multi-objective genetic algorithm integrating cellular automata and rough set theory. Swarm, Evolutionary, and Memetic Computing: 4th International Conference, SEMCCO 2013, Chennai, India, December 19-21, 2013, Proceedings, Part II 4, 144–155.

Said, B., Lathamaheswari, M., Singh, P. K., Ouallane, A. A., Bakhouyi, A., Bakali, A., Talea, M., Dhital, A., & Deivanayagampillai, N. (2022). An Intelligent Traffic Control System Using Neutrosophic Sets, Rough sets, Graph Theory, Fuzzy sets and its Extended Approach: A Literature Review. Neutrosophic Sets and Systems, Vol. 50/2022: An International Journal in Information Science and Engineering, 10.

Sajjad, M., Khan, S., Jan, Z., Muhammad, K., Moon, H., Kwak, J. T., Rho, S., Baik, S. W., & Mehmood, I. (2016). Leukocytes classification and segmentation in microscopic blood smear: a resource-aware healthcare service in smart cities. IEEE Access, 5, 3475–3489.

Salleh, M. N. M., Talpur, N., & Talpur, K. H. (2018). A modified neuro-fuzzy system using metaheuristic approaches for data classification. Artificial Intelligence–Emerging Trends and Applications’.(Ed. MAA Fernandez) Pp, 29–45.

Saravi, B., Hassel, F., Ülkümen, S., Zink, A., Shavlokhova, V., Couillard-Despres, S., Boeker, M., Obid, P., & Lang, G. M. (2022). Artificial intelligence-driven prediction modeling and decision making in spine surgery using hybrid machine learning models. Journal of Personalized Medicine, 12(4), 509.

Singh, S., & Som, T. (2022). Intuitionistic Fuzzy Rough Sets: Theory to Practice. Mathematics in Computational Science and Engineering, 91–133.

Sokolova, M., & Lapalme, G. (2009). A systematic analysis of performance measures for classification tasks. Information Processing & Management, 45(4), 427–437.

Swathy, M., & Saruladha, K. (2022). A comparative study of classification and prediction of Cardio-Vascular Diseases (CVD) using Machine Learning and Deep Learning techniques. ICT Express, 8(1), 109–116.

Tabakov, M., Chlopowiec, A. B., & Chlopowiec, A. R. (2023). A Novel Classification Method Using the Takagi–Sugeno Model and a Type-2 Fuzzy Rule Induction Approach. Applied Sciences, 13(9), 5279.

Tang, J., Zhang, G., Wang, Y., Wang, H., & Liu, F. (2015). A hybrid approach to integrate fuzzy C-means based imputation method with genetic algorithm for missing traffic volume data estimation. Transportation Research Part C: Emerging Technologies, 51, 29–40.

Tran, D. T., & Huh, J.-H. (2022). Building a model to exploit association rules and analyze purchasing behavior based on rough set theory. The Journal of Supercomputing, 78(8), 11051–11091.

Vluymans, S., D’eer, L., Saeys, Y., & Cornelis, C. (2015). Applications of Fuzzy Rough Set Theory in Machine Learning: a Survey. Fundam. Informaticae, 142(1–4), 53–86.

Wang, H., Xu, Z., & Pedrycz, W. (2017). An overview on the roles of fuzzy set techniques in big data processing: Trends, challenges and opportunities. Knowledge-Based Systems, 118, 15–30.

Wang, T., & Zhou, M. (2021). Integrating rough set theory with customer satisfaction to construct a novel approach for mining product design rules. Journal of Intelligent & Fuzzy Systems, 41(1), 331–353.

Yeh, C.-C., Chi, D.-J., & Hsu, M.-F. (2010). A hybrid approach of DEA, rough set and support vector machines for business failure prediction. Expert Systems with Applications, 37(2), 1535–1541.

Yeh, C.-C., Chi, D.-J., & Lin, Y.-R. (2014). Going-concern prediction using hybrid random forests and rough set approach. Information Sciences, 254, 98–110.

Zhu, Y., & Kan, H. Y. (2022). Aviation and Airspace Management under Rough Set Theory. Mathematical Problems in Engineering.

Downloads

Published

2023-05-30

How to Cite

Joseph, L., O'Leary, M. L., & Zagré, B. (2023). Integrating hybrid grid partition and rough set method for fuzzy rule generation: a novel approach for accurate dataset classification. International Journal of Enterprise Modelling, 17(2), 88–98. https://doi.org/10.35335/emod.v17i2.21

Similar Articles

You may also start an advanced similarity search for this article.