A Novel Hybrid Approach: Grid Partition and Rough Set-Based Fuzzy Rule Generation for Accurate Dataset Classification

Authors

  • Dalzon Marie Université de Port-au-Prince, Haiti
  • Moïse Etzer Université de Port-au-Prince, Haiti

DOI:

https://doi.org/10.35335/emod.v13i3.15

Keywords:

Hybrid approach, Dataset classification, Grid partitioning, Rough set-based feature reduction, Fuzzy rule generation

Abstract

Accurate dataset classification is a fundamental task in various domains such as machine learning, pattern recognition, and data mining. This research proposes a novel hybrid approach that combines grid partitioning, rough set-based feature reduction, and fuzzy rule generation to enhance classification accuracy and interpretability. The approach begins with the partitioning of the dataset into a grid of cells, enabling localized analysis and capturing intricate patterns. Next, rough set-based feature reduction is applied to identify essential features and reduce dimensionality. This process helps overcome the curse of dimensionality commonly associated with complex datasets. Subsequently, fuzzy rule generation is employed, leveraging linguistic variables and membership functions to represent imprecise and uncertain information. This enhances interpretability by providing transparent decision-making rules. To evaluate the effectiveness of the proposed approach, comparative analysis with traditional classification methods, including decision trees, support vector machines, and neural networks, is conducted. The results demonstrate the superiority or at least comparability of the hybrid approach in terms of classification accuracy, computational complexity, and interpretability. However, it is essential to acknowledge the limitations of the research, such as the sensitivity to grid size and the interpretability-performance trade-off. Future research can focus on refining the approach by exploring optimal grid size selection methods and mitigating the interpretability-performance trade-off.The findings of this research contribute to the advancement of accurate dataset classification techniques. The proposed hybrid approach offers improved classification accuracy, handles complex datasets effectively, and enhances interpretability through fuzzy rules. The practical implications of the research span domains such as bioinformatics, IoT, and financial analysis. Overall, this research provides a foundation for further exploration, refinement, and real-world applications of the hybrid approach in accurate dataset classification scenarios.

References

Abbas, S. M., Alam, K. A., & Shamshirband, S. (2019). A soft-rough set based approach for handling contextual sparsity in context-aware video recommender systems. Mathematics, 7(8), 740.

Bai, H., Ge, Y., Wang, J., Li, D., Liao, Y., & Zheng, X. (2014). A method for extracting rules from spatial data based on rough fuzzy sets. Knowledge-Based Systems, 57, 28-40.

Bhaumik, H., Bhattacharyya, S., Nath, M. D., & Chakraborty, S. (2016). Hybrid soft computing approaches to content based video retrieval: A brief review. Applied Soft Computing, 46, 1008-1029.

Cervante, L., Xue, B., Shang, L., & Zhang, M. (2012). A dimension reduction approach to classification based on particle swarm optimisation and rough set theory. In AI 2012: Advances in Artificial Intelligence: 25th Australasian Joint Conference, Sydney, Australia, December 4-7, 2012. Proceedings 25 (pp. 313-325). Springer Berlin Heidelberg.

Chandra, D. K., & Ravi, V. (2009). Feature selection and fuzzy rule-based classifier applied to bankruptcy prediction in banks. International Journal of Information and Decision Sciences, 1(4), 343-365.

Chen, H. L., Yang, B., Liu, J., & Liu, D. Y. (2011). A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis. Expert systems with applications, 38(7), 9014-9022.

Chou, C. H., Hsieh, S. C., & Qiu, C. J. (2017). Hybrid genetic algorithm and fuzzy clustering for bankruptcy prediction. Applied Soft Computing, 56, 298-316.

Chouchoulas, A. (1999). A rough set approach to text classification. The University Of Edinburgh.

Ferone, A., & Maratea, A. (2019). Integrating rough set principles in the graded possibilistic clustering. Information Sciences, 477, 148-160.

Guo, J. Y. (2003). Rough set-based approach to data mining. Case Western Reserve University.

Hassanien, A. E., Abraham, A., Peters, J. F., Schaefer, G., & Henry, C. (2009). Rough sets and near sets in medical imaging: A review. IEEE Transactions on Information Technology in Biomedicine, 13(6), 955-968.

Hassanien, A. E., Moftah, H. M., Azar, A. T., & Shoman, M. (2014). MRI breast cancer diagnosis hybrid approach using adaptive ant-based segmentation and multilayer perceptron neural networks classifier. Applied Soft Computing, 14, 62-71.

Hung, Y. H. (2009). A neural network classifier with rough set-based feature selection to classify multiclass IC package products. Advanced Engineering Informatics, 23(3), 348-357.

Jensen, R., & Shen, Q. (2008). Computational intelligence and feature selection: rough and fuzzy approaches.

Juneja, A., Rana, B., & Agrawal, R. K. (2018). A novel fuzzy rough selection of non-linearly extracted features for schizophrenia diagnosis using fMRI. Computer methods and programs in biomedicine, 155, 139-152.

Khashei, M., Hamadani, A. Z., & Bijari, M. (2012). A fuzzy intelligent approach to the classification problem in gene expression data analysis. Knowledge-Based Systems, 27, 465-474.

Kumar, P. R., & Ravi, V. (2007). Bankruptcy prediction in banks and firms via statistical and intelligent techniques–A review. European journal of operational research, 180(1), 1-28.

Kumar, S. U., & Inbarani, H. H. (2017). Neighborhood rough set based ECG signal classification for diagnosis of cardiac diseases. Soft Computing, 21, 4721-4733.

Liang, J., Wang, F., Dang, C., & Qian, Y. (2012). A group incremental approach to feature selection applying rough set technique. IEEE Transactions on Knowledge and Data Engineering, 26(2), 294-308.

Mitra, S., Mitra, M., & Chaudhuri, B. B. (2006). An approach to a rough set based disease inference engine for ECG classification. In Rough Sets and Current Trends in Computing: 5th International Conference, RSCTC 2006 Kobe, Japan, November 6-8, 2006 Proceedings 5 (pp. 398-407). Springer Berlin Heidelberg.

Mitra, S., Mitra, M., & Chaudhuri, B. B. (2008). A rough set based approach for ecg classification. Transactions on Rough Sets IX, 157-186.

Nancy, S. G., & Appavu, S. (2016). Optimal feature selection for classification using rough set-based CGA-NN classifier. International Journal of Business Intelligence and Data Mining, 11(4), 357-378.

Own, H. S., & Abraham, A. (2012). A new weighted rough set framework based classification for Egyptian NeoNatal Jaundice. Applied Soft Computing, 12(3), 999-1005.

Panda, M., & Abraham, A. (2015). Hybrid evolutionary algorithms for classification data mining. Neural Computing and Applications, 26, 507-523.

Priyanga, S., Gauthama Raman, M. R., Jagtap, S. S., Aswin, N., Kirthivasan, K., & Shankar Sriram, V. S. (2019). An improved rough set theory based feature selection approach for intrusion detection in SCADA systems. Journal of Intelligent & Fuzzy Systems, 36(5), 3993-4003.

Raza, M. S., & Qamar, U. (2018). A heuristic based dependency calculation technique for rough set theory. Pattern Recognition, 81, 309-325.

Raza, M. S., & Qamar, U. (2018). Feature selection using rough set-based direct dependency calculation by avoiding the positive region. International Journal of Approximate Reasoning, 92, 175-197.

Salamo, M., & Lopez-Sanchez, M. (2011). Rough set based approaches to feature selection for case-based reasoning classifiers. Pattern Recognition Letters, 32(2), 280-292.

Suo, M., An, R., Zhou, D., & Li, S. (2018). Grid-clustered rough set model for self-learning and fast reduction. Pattern Recognition Letters, 106, 61-68.

Udhaya Kumar, S., & Hannah Inbarani, H. (2017). PSO-based feature selection and neighborhood rough set-based classification for BCI multiclass motor imagery task. Neural Computing and Applications, 28, 3239-3258.

Vluymans, S., D'eer, L., Saeys, Y., & Cornelis, C. (2015). Applications of Fuzzy Rough Set Theory in Machine Learning: a Survey. Fundam. Informaticae, 142(1-4), 53-86.

Xue, B., Cervante, L., Shang, L., Browne, W. N., & Zhang, M. (2014). Binary PSO and rough set theory for feature selection: A multi-objective filter based approach. International Journal of Computational Intelligence and Applications, 13(02), 1450009.

Yasodha, S., & Prakash, P. S. (2012, March). Data mining classification technique for talent management using SVM. In 2012 International Conference on Computing, Electronics and Electrical Technologies (ICCEET) (pp. 959-963). IEEE.

Yeh, C. C., Chi, D. J., Lin, T. Y., & Chiu, S. H. (2016). A hybrid detecting fraudulent financial statements model using rough set theory and support vector machines. Cybernetics and Systems, 47(4), 261-276.

Zhang, J., Deng, Z., Choi, K. S., & Wang, S. (2017). Data-driven elastic fuzzy logic system modeling: Constructing a concise system with human-like inference mechanism. IEEE Transactions on Fuzzy Systems, 26(4), 2160-2173.

Zhou, T., Lu, H., Zhang, J., & Shi, H. (2016). Pulmonary nodule detection model based on SVM and CT image feature-level fusion with rough sets. BioMed research international, 2016.

Published

2019-09-30

How to Cite

Marie, D., & Etzer, M. (2019). A Novel Hybrid Approach: Grid Partition and Rough Set-Based Fuzzy Rule Generation for Accurate Dataset Classification. International Journal of Enterprise Modelling, 13(3), 119–129. https://doi.org/10.35335/emod.v13i3.15