Enhancing Accuracy and Interpretability in Dataset Classification: Advancements in Hybrid Grid Partition and Rough Set Methods for Fuzzy Rule Generation
DOI:
https://doi.org/10.35335/emod.v13i1.5Keywords:
Dataset classification, Accuracy enhancement., Interpretability improvement, Hybrid grid partition, Rough set methodsAbstract
Accurate and interpretable classification of datasets plays a crucial role in various domains, including healthcare, finance, and image recognition. This research focuses on enhancing accuracy and interpretability in dataset classification through the integration of hybrid grid partition and rough set methods for fuzzy rule generation. The proposed mathematical model leverages the grid partition approach to handle the curse of dimensionality and reduce dataset complexity, while the rough set method identifies essential features and generates meaningful fuzzy rules. The assigned membership values to linguistic terms further enhance interpretability. The model's accuracy and interpretability were evaluated using a diabetes dataset, achieving an accuracy rate of 85% on the validation dataset and 83% on the testing dataset. Comparative analysis demonstrated competitive performance against existing methods. The iterative refinement process contributed to the model's optimization. However, limitations include dataset dependency, parameter sensitivity, and scalability. Future research directions include advanced rule pruning techniques, optimization of model parameters, handling imbalanced datasets, incorporating feature selection, robustness and scalability evaluation, comparative studies, and real-world application validation. The proposed model presents a promising approach to enhance accuracy and interpretability in dataset classification.
References
Ahmed, M. M., & Isa, N. A. M. (2017). Knowledge base to fuzzy information granule: A review from the interpretability-accuracy perspective. Applied Soft Computing, 54, 121-140.
Alonso, J. M., Castiello, C., & Mencar, C. (2015). Interpretability of fuzzy systems: Current research trends and prospects. Springer handbook of computational intelligence, 219-237.
Ang, K. K., & Quek, C. (2005). RSPOP: Rough set–based pseudo outer-product Fuzzy rule identification algorithm. Neural Computation, 17(1), 205-243.
Ang, K. K., & Quek, C. (2006, July). Rough set-based neuro-fuzzy system. In The 2006 IEEE international joint conference on neural network proceedings (pp. 742-749). IEEE.
Berger, P. A. (2004). Rough set rule induction for suitability assessment. Environmental management, 34, 546-558.
Bien, J., & Tibshirani, R. (2011). Prototype selection for interpretable classification.
Cheruku, R., Edla, D. R., Kuppili, V., & Dharavath, R. (2018). Rst-batminer: A fuzzy rule miner integrating rough set feature selection and bat optimization for detection of diabetes disease. Applied Soft Computing, 67, 764-780.
Elkano, M., Galar, M., Sanz, J. A., Fernández, A., Barrenechea, E., Herrera, F., & Bustince, H. (2014). Enhancing multiclass classification in FARC-HD fuzzy classifier: On the synergy between $ n $-dimensional overlap functions and decomposition strategies. IEEE Transactions on Fuzzy Systems, 23(5), 1562-1580.
Fazzolari, M., Alcalá, R., & Herrera, F. (2014). A multi-objective evolutionary method for learning granularities based on fuzzy discretization to improve the accuracy-complexity trade-off of fuzzy rule-based classification systems: D-MOFARC algorithm. Applied Soft Computing, 24, 470-481.
Fernandez-Riverola, F., Díaz, F., & Corchado, J. M. (2006). Reducing the memory size of a fuzzy case-based reasoning system applying rough set techniques. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 37(1), 138-146.
Gacto, M. J., Alcalá, R., & Herrera, F. (2010). Integration of an index to preserve the semantic interpretability in the multiobjective evolutionary rule selection and tuning of linguistic fuzzy systems. IEEE Transactions on Fuzzy Systems, 18(3), 515-531.
Gacto, M. J., Alcalá, R., & Herrera, F. (2011). Interpretability of linguistic fuzzy rule-based systems: An overview of interpretability measures. Information Sciences, 181(20), 4340-4360.
Ganesh Kumar, P., Kavitha, M. S., & Ahn, B. C. (2016). Automated detection of cancer associated genes using a combined fuzzy-rough-set-based f-information and water swirl algorithm of human gene expression data. Plos one, 11(12), e0167504.
García, S., Fernández, A., & Herrera, F. (2009). Enhancing the effectiveness and interpretability of decision tree and rule induction classifiers with evolutionary training set selection over imbalanced problems. Applied soft computing, 9(4), 1304-1314.
Goel, L., Gupta, D., & Panchal, V. K. (2012). Hybrid bio-inspired techniques for land cover feature extraction: A remote sensing perspective. Applied soft computing, 12(2), 832-849.
Gorzałczany, M. B., & Rudziński, F. (2016). A multi-objective genetic optimization for fast, fuzzy rule-based credit classification with balanced accuracy and interpretability. Applied Soft Computing, 40, 206-220.
Grosenick, L., Greer, S., & Knutson, B. (2008). Interpretable classifiers for FMRI improve prediction of purchases. IEEE transactions on neural systems and rehabilitation engineering, 16(6), 539-548.
Guo, J. Y. (2003). Rough set-based approach to data mining. Case Western Reserve University.
Harkness, K., Sabbagh, M., Jacobson, J., Chowdrey, N., & Chen, T. (2005). Enhanced accuracy of mental state decoding in dysphoric college students. Cognition & Emotion, 19(7), 999-1025.
Hassanien, A. E., Abraham, A., Peters, J. F., Schaefer, G., & Henry, C. (2009). Rough sets and near sets in medical imaging: A review. IEEE Transactions on Information Technology in Biomedicine, 13(6), 955-968.
Hettiarachchi, H. W. D., Hemapala, K. U., & Jayasekara, A. B. P. (2018). Review of applications of fuzzy logic in multi-agent-based control system of AC-DC hybrid microgrid. IEEE Access, 7, 1284-1299.
Hüllermeier, E. (2005). Fuzzy methods in machine learning and data mining: Status and prospects. Fuzzy sets and Systems, 156(3), 387-406.
Jensen, R., & Shen, Q. (2007). Fuzzy-rough sets assisted attribute selection. IEEE Transactions on fuzzy systems, 15(1), 73-89.
Jensen, R., & Shen, Q. (2008). Computational intelligence and feature selection: rough and fuzzy approaches.
Kothari, S., Phan, J. H., Young, A. N., & Wang, M. D. (2013). Histological image classification using biologically interpretable shape-based features. BMC medical imaging, 13, 1-17.
Lin, T. Y., & Cercone, N. (Eds.). (2012). Rough sets and data mining: Analysis of imprecise data. Springer Science & Business Media.
Lipton, Z. C. (2018). The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue, 16(3), 31-57.
Maji, P., & Pal, S. K. (2012). Rough-fuzzy pattern recognition: applications in bioinformatics and medical imaging (Vol. 3). John Wiley & Sons.
Mala, I., Akhtar, P., Ali, T. J., & Zia, S. S. (2013). Fuzzy rule based classification for heart dataset using fuzzy decision tree algorithm based on fuzzy RDBMS. World Applied Sciences Journal, 28(9), 1331-1335.
Mitra, S., & Hayashi, Y. (2000). Neuro-fuzzy rule generation: survey in soft computing framework. IEEE transactions on neural networks, 11(3), 748-768.
Pancho, D. P., Alonso, J. M., Cordón, O., Quirin, A., & Magdalena, L. (2013). FINGRAMS: visual representations of fuzzy rule-based inference for expert analysis of comprehensibility. IEEE Transactions on Fuzzy Systems, 21(6), 1133-1149.
Pawlak, Z. (1997). Rough set approach to knowledge-based decision support. European journal of operational research, 99(1), 48-57.
Polkowski, L. (Ed.). (2013). Rough sets in knowledge discovery 2: applications, case studies and software systems (Vol. 19). Physica.
Pulkkinen, P., & Koivisto, H. (2007). Identification of interpretable and accurate fuzzy classifiers and function estimators with hybrid methods. Applied Soft Computing, 7(2), 520-533.
Rini, D. P., Shamsuddin, S. M., & Yuhaniz, S. S. (2016). Particle swarm optimization for ANFIS interpretability and accuracy. Soft Computing, 20, 251-262.
Rissino, S., & Lambert-Torres, G. (2009). Rough set theory—fundamental concepts, principals, data extraction, and applications. In Data mining and knowledge discovery in real life applications. IntechOpen.
Riza, L. S., Bergmeir, C., Herrera, F., & Benítez, J. M. (2015). frbs: Fuzzy rule-based systems for classification and regression in R. Journal of statistical software, 65, 1-30.
Sangaiah, A. K., Gao, X. Z., & Abraham, A. (Eds.). (2016). Handbook of Research on Fuzzy and Rough Set Theory in Organizational Decision Making. IGI Global.
Saxena, A., Prasad, M., Gupta, A., Bharill, N., Patel, O. P., Tiwari, A., ... & Lin, C. T. (2017). A review of clustering techniques and developments. Neurocomputing, 267, 664-681.
Seo, S., Huang, J., Yang, H., & Liu, Y. (2017, August). Interpretable convolutional neural networks with dual local and global attention for review rating prediction. In Proceedings of the eleventh ACM conference on recommender systems (pp. 297-305).
Shen, Q., & Jensen, R. (2004). Selecting informative features with fuzzy-rough sets and its application for complex systems monitoring. Pattern recognition, 37(7), 1351-1363.
Shihabudheen, K. V., & Pillai, G. N. (2018). Recent advances in neuro-fuzzy system: A survey. Knowledge-Based Systems, 152, 136-162.
Suganthi, L., Iniyan, S., & Samuel, A. A. (2015). Applications of fuzzy logic in renewable energy systems–a review. Renewable and sustainable energy reviews, 48, 585-607.
Vluymans, S., D'eer, L., Saeys, Y., & Cornelis, C. (2015). Applications of Fuzzy Rough Set Theory in Machine Learning: a Survey. Fundam. Informaticae, 142(1-4), 53-86.
Walczak, B., & Massart, D. L. (1999). Rough sets theory. Chemometrics and intelligent laboratory systems, 47(1), 1-16.
Wang, X., Yang, J., Jensen, R., & Liu, X. (2006). Rough set feature selection and rule induction for prediction of malignancy degree in brain glioma. Computer methods and programs in biomedicine, 83(2), 147-156.
Zhang, D., Wang, Y., & Huang, H. (2009). Rough neural network modeling based on fuzzy rough model and its application to texture classification. Neurocomputing, 72(10-12), 2433-2443.
Zhou, S. M., & Gan, J. Q. (2008). Low-level interpretability and high-level interpretability: a unified view of data-driven interpretable fuzzy system modelling. Fuzzy sets and systems, 159(23), 3091-3131.
Zhou, T., Chung, F. L., & Wang, S. (2016). Deep TSK fuzzy classifier with stacked generalization and triplely concise interpretability guarantee for large data. IEEE Transactions on Fuzzy Systems, 25(5), 1207-1221.
Published
How to Cite
Issue
Section
License
Copyright (c) 2018 Josea Moreno Chawla, Herrera Rocío

This work is licensed under a Creative Commons Attribution 4.0 International License.
