An Enhanced Hybrid Grid Partitioning and Rough Set Approach for Fuzzy Rule Generation in Complex Dataset Problem

Authors

  • Szolga Mihaela Munteanu Technical University of Cluj-Napoca, Rumania
  • Makeithappen Daniela Ioan Technical University of Cluj-Napoca, Rumania

DOI:

https://doi.org/10.35335/emod.v14i3.35

Keywords:

Fuzzy rule generation, Complex datasets, Hybrid grid partitioning, Rough set theory, Interpretability

Abstract

This research presents an enhanced hybrid grid partitioning and rough set approach for fuzzy rule generation in complex datasets. Complex datasets pose challenges due to their high dimensionality, nonlinearity, and uncertainty, which traditional analysis techniques struggle to address. Fuzzy rule-based systems offer a promising solution, but generating effective fuzzy rules becomes increasingly difficult as the dataset complexity increases. To overcome these challenges, we propose a methodology that integrates hybrid grid partitioning and rough set theory. The hybrid partitioning technique captures global and local patterns, while rough set theory handles uncertainty and incomplete information. The approach generates accurate and interpretable fuzzy rules by optimizing the rule set's size and balancing accuracy and interpretability. Experimental evaluations demonstrate the approach's effectiveness in terms of rule accuracy, interpretability, and computational efficiency. The generated fuzzy rules provide valuable insights into complex relationships, aiding decision-making processes in various domains. The research contributes to the advancement of fuzzy rule generation techniques for complex datasets and offers a practical solution for knowledge extraction in complex systems.

References

Abebe, A. J., Solomatine, D. P., & Venneker, R. G. W. (2000). Application of adaptive fuzzy rule-based models for reconstruction of missing precipitation events. Hydrological Sciences Journal, 45(3), 425-436.

Bai, H., Ge, Y., Wang, J., Li, D., Liao, Y., & Zheng, X. (2014). A method for extracting rules from spatial data based on rough fuzzy sets. Knowledge-Based Systems, 57, 28-40.

Berger, P. A. (2004). Rough set rule induction for suitability assessment. Environmental management, 34, 546-558.

Bhatt, V., Dhakar, M., & Chaurasia, B. K. (2016). Filtered clustering based on local outlier factor in data mining. International Journal of Database Theory and Application, 9(5), 275-282.

Bioucas-Dias, J. M., Plaza, A., Camps-Valls, G., Scheunders, P., Nasrabadi, N., & Chanussot, J. (2013). Hyperspectral remote sensing data analysis and future challenges. IEEE Geoscience and remote sensing magazine, 1(2), 6-36.

Bungkomkhun, P., & Auwatanamongkol, S. (2012). Grid-based supervised clustering algorithm using greedy and gradient descent methods to build clusters. IJCSI International Journal of Computer Science.

Cao, Q., Bouqata, B., Mackenzie, P. D., Messier, D., & Salvo, J. J. (2009, October). A grid-based clustering method for mining frequent trips from large-scale, event-based telematics datasets. In 2009 IEEE International Conference on Systems, Man and Cybernetics (pp. 2996-3001). IEEE.

Casey, M. A., Veltkamp, R., Goto, M., Leman, M., Rhodes, C., & Slaney, M. (2008). Content-based music information retrieval: Current directions and future challenges. Proceedings of the IEEE, 96(4), 668-696.

Casillas, J., Cordón, O., Del Jesus, M. J., & Herrera, F. (2005). Genetic tuning of fuzzy rule deep structures preserving interpretability and its interaction with fuzzy rule set reduction. IEEE Transactions on Fuzzy Systems, 13(1), 13-29.

Chen, H. L., Yang, B., Liu, J., & Liu, D. Y. (2011). A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis. Expert systems with applications, 38(7), 9014-9022.

Chen, T., Shang, C., Su, P., & Shen, Q. (2018). Induction of accurate and interpretable fuzzy rules from preliminary crisp representation. Knowledge-Based Systems, 146, 152-166.

Cordón, O. (2011). A historical review of evolutionary learning methods for Mamdani-type fuzzy rule-based systems: Designing interpretable genetic fuzzy systems. International journal of approximate reasoning, 52(6), 894-913.

Correa, C. D., Chan, Y. H., & Ma, K. L. (2009, October). A framework for uncertainty-aware visual analytics. In 2009 IEEE symposium on visual analytics science and technology (pp. 51-58). IEEE.

Demri, S. P., & Orlowska, E. (2013). Incomplete information: Structure, inference, complexity. Springer Science & Business Media.

Destercke, S., Guillaume, S., & Charnomordic, B. (2007). Building an interpretable fuzzy rule base from data using orthogonal least squares—application to a depollution problem. Fuzzy Sets and Systems, 158(18), 2078-2094.

Dimitriou, L., Tsekeris, T., & Stathopoulos, A. (2008). Adaptive hybrid fuzzy rule-based system approach for modeling and predicting urban traffic flow. Transportation Research Part C: Emerging Technologies, 16(5), 554-573.

Duckstein, L. (1995). Fuzzy rule-based modeling with applications to geophysical, biological, and engineering systems (Vol. 8). CRC press.

Elsaid, A., Salem, R., & Abdul-Kader, H. (2017). Research Article A Dynamic Stakeholder Classification and Prioritization Based on Hybrid Rough-fuzzy Method.

Ferranti, A., Marcelloni, F., Segatori, A., Antonelli, M., & Ducange, P. (2017). A distributed approach to multi-objective evolutionary generation of fuzzy rule-based classifiers from big data. Information Sciences, 415, 319-340.

Gadaras, I., & Mikhailov, L. (2009). An interpretable fuzzy rule-based classification methodology for medical diagnosis. Artificial intelligence in medicine, 47(1), 25-41.

Goel, R., Paul, S., & Hakkani-Tür, D. (2019). Hyst: A hybrid approach for flexible and accurate dialogue state tracking. arXiv preprint arXiv:1907.00883.

Gorzałczany, M. B., & Rudziński, F. (2016). A multi-objective genetic optimization for fast, fuzzy rule-based credit classification with balanced accuracy and interpretability. Applied Soft Computing, 40, 206-220.

Hassanien, A. E. (2003). Intelligent data analysis of breast cancer based on rough set theory. International Journal on Artificial Intelligence Tools, 12(04), 465-479.

Huang, H. H., & Kuo, Y. H. (2010). Cross-lingual document representation and semantic similarity measure: A fuzzy set and rough set based approach. IEEE Transactions on Fuzzy Systems, 18(6), 1098-1111.

Jain, A. K., Prabhakar, S., & Hong, L. (1999). A multichannel approach to fingerprint classification. IEEE transactions on pattern analysis and machine intelligence, 21(4), 348-359.

Jensen, R., & Shen, Q. (2008). Computational intelligence and feature selection: rough and fuzzy approaches.

Jiang, Z., Li, T., Min, W., Qi, Z., & Rao, Y. (2017). Fuzzy c-means clustering based on weights and gene expression programming. Pattern Recognition Letters, 90, 1-7.

Jin, Y. (2000). Fuzzy modeling of high-dimensional systems: complexity reduction and interpretability improvement. IEEE Transactions on Fuzzy Systems, 8(2), 212-221.

Kumar, S. U., & Inbarani, H. H. (2017). Neighborhood rough set based ECG signal classification for diagnosis of cardiac diseases. Soft Computing, 21, 4721-4733.

Liang, J., Wang, F., Dang, C., & Qian, Y. (2012). A group incremental approach to feature selection applying rough set technique. IEEE Transactions on Knowledge and Data Engineering, 26(2), 294-308.

Magdalena, L. (2015). Fuzzy rule-based systems. Springer handbook of computational intelligence, 203-218.

Mansourypoor, F., & Asadi, S. (2017). Development of a reinforcement learning-based evolutionary fuzzy rule-based system for diabetes diagnosis. Computers in Biology and Medicine, 91, 337-352.

Mitra, S., Mitra, M., & Chaudhuri, B. B. (2006). An approach to a rough set based disease inference engine for ECG classification. In Rough Sets and Current Trends in Computing: 5th International Conference, RSCTC 2006 Kobe, Japan, November 6-8, 2006 Proceedings 5 (pp. 398-407). Springer Berlin Heidelberg.

Najafabadi, M. M., Villanustre, F., Khoshgoftaar, T. M., Seliya, N., Wald, R., & Muharemagic, E. (2015). Deep learning applications and challenges in big data analytics. Journal of big data, 2(1), 1-21.

Nguyen, L. H., & Holmes, S. (2019). Ten quick tips for effective dimensionality reduction. PLoS computational biology, 15(6), e1006907.

Nilashi, M., Ahmadi, H., Shahmoradi, L., Ibrahim, O., & Akbari, E. (2019). A predictive method for hepatitis disease diagnosis using ensembles of neuro-fuzzy technique. Journal of infection and public health, 12(1), 13-20.

Orlowska, E. (Ed.). (2013). Incomplete information: Rough set analysis (Vol. 13). Physica.

Pan, Y. F., Hou, X., & Liu, C. L. (2010). A hybrid approach to detect and localize texts in natural scene images. IEEE transactions on image processing, 20(3), 800-813.

Papageorgiou, E. I., & Salmeron, J. L. (2013). Methods and algorithms for fuzzy cognitive map-based modeling. In Fuzzy cognitive maps for applied sciences and engineering: from fundamentals to extensions and learning algorithms (pp. 1-28). Berlin, Heidelberg: Springer Berlin Heidelberg.

Polkowski, L. (2002). Rough sets. Heidelberg: Physica-Verlag.

Poongothai, S., Dharuman, C., Venkatesan, P., & Campus, R. (2017). A comparison of fuzzy genetic and neuro genetic hybrid algorithm for the classification of diabetes disease. International Journal of Pure and Applied Mathematics, 113, 208-216.

Priyanga, S., Gauthama Raman, M. R., Jagtap, S. S., Aswin, N., Kirthivasan, K., & Shankar Sriram, V. S. (2019). An improved rough set theory based feature selection approach for intrusion detection in SCADA systems. Journal of Intelligent & Fuzzy Systems, 36(5), 3993-4003.

Rajab, S., & Sharma, V. (2019). An interpretable neuro-fuzzy approach to stock price forecasting. Soft Computing, 23, 921-936.

Rey, M. I., Galende, M., Fuente, M. J., & Sainz-Palmero, G. I. (2017). Multi-objective based Fuzzy Rule Based Systems (FRBSs) for trade-off improvement in accuracy and interpretability: A rule relevance point of view. Knowledge-Based Systems, 127, 67-84.

Rissino, S., & Lambert-Torres, G. (2009). Rough set theory—fundamental concepts, principals, data extraction, and applications. In Data mining and knowledge discovery in real life applications. IntechOpen.

Riza, L. S., Bergmeir, C., Herrera, F., & Benítez, J. M. (2015). frbs: Fuzzy rule-based systems for classification and regression in R. Journal of statistical software, 65, 1-30.

Salleh, M. N. M., Talpur, N., & Talpur, K. H. (2018). A modified neuro-fuzzy system using metaheuristic approaches for data classification. Artificial intelligence–emerging trends and applications’.(Ed. MAA Fernandez) pp, 29-45.

Sedighiani, K., & HashemiKhabir, S. (2009, May). Constructing interpretable genetic fuzzy rule-based system for Breast Cancer Diagnostic. In 2009 WRI Global Congress on Intelligent Systems (Vol. 1, pp. 441-446). IEEE.

Sharmila, P., Baskaran, J., Nayanatara, C., & Maheswari, R. (2019). A hybrid technique of machine learning and data analytics for optimized distribution of renewable energy resources targeting smart energy management. Procedia Computer Science, 165, 278-284.

Shen, Q., & Chouchoulas, A. (2000). A modular approach to generating fuzzy rules with reduced attributes for the monitoring of complex systems. Engineering Applications of Artificial Intelligence, 13(3), 263-278.

Shen, Q., & Chouchoulas, A. (2002). A rough-fuzzy approach for generating classification rules. Pattern recognition, 35(11), 2425-2438.

Shen, Q., & Jensen, R. (2004). Selecting informative features with fuzzy-rough sets and its application for complex systems monitoring. Pattern recognition, 37(7), 1351-1363.

Sivakumar, N. R., & Alaraj, A. (2017). Path management strategy to reduce flooding of grid fisheye state routing protocol in mobile ad hoc network using fuzzy and rough set theory. International Journal of Artificial Intelligence and Soft Computing, 6(3), 187-208.

Smyth, P., & Goodman, R. M. (1992). An information theoretic approach to rule induction from databases. IEEE transactions on Knowledge and data engineering, 4(4), 301-316.

Srimani, P. K., & Koti, M. S. (2014). Knowledge discovery in medical data by using rough set rule induction algorithms. Indian Journal of Science and Technology, 7(7), 905.

Strehl, A., & Ghosh, J. (2002). Cluster ensembles---a knowledge reuse framework for combining multiple partitions. Journal of machine learning research, 3(Dec), 583-617.

Suganthi, L., Iniyan, S., & Samuel, A. A. (2015). Applications of fuzzy logic in renewable energy systems–a review. Renewable and sustainable energy reviews, 48, 585-607.

Sun, Q., Li, Z., Liu, Z., & Zhou, J. (2011). Fault Diagnosis for Smart Grid by a Hybrid Method of Rough Sets and Neural Network. In Advances in Computer Science, Environment, Ecoinformatics, and Education: International Conference, CSEE 2011, Wuhan, China, August 21-22, 2011. Proceedings, Part IV (pp. 577-582). Springer Berlin Heidelberg.

Tripathy, R., Bilionis, I., & Gonzalez, M. (2016). Gaussian processes with built-in dimensionality reduction: Applications to high-dimensional uncertainty propagation. Journal of Computational Physics, 321, 191-223.

Udhaya Kumar, S., & Hannah Inbarani, H. (2017). PSO-based feature selection and neighborhood rough set-based classification for BCI multiclass motor imagery task. Neural Computing and Applications, 28, 3239-3258.

Ustundag, A., Kılınç, M. S., & Cevikcan, E. (2010). Fuzzy rule-based system for the economic analysis of RFID investments. Expert systems with applications, 37(7), 5300-5306.

Wang, H., Kwong, S., Jin, Y., Wei, W., & Man, K. F. (2005). Multi-objective hierarchical genetic algorithm for interpretable fuzzy rule-based knowledge extraction. Fuzzy sets and systems, 149(1), 149-186.

Xue, J., Liu, X., & Chen, P. (2016). Rhombic grid based clustering algorithm with spiking neural P systems. Journal of Computational and Theoretical Nanoscience, 13(6), 3895-3901.

Yao, Y. Y. (2001). Information granulation and rough set approximation. International Journal of Intelligent Systems, 16(1), 87-104.

Yuan, X., Ren, D., Wang, Z., & Guo, C. (2013). Dimension projection matrix/tree: Interactive subspace visual exploration and analysis of high dimensional data. IEEE Transactions on Visualization and Computer Graphics, 19(12), 2625-2633.

Zhai, L. Y., Khoo, L. P., & Fok, S. C. (2006). Knowledge acquisition and uncertainty in fault diagnosis: A rough sets perspective. Data mining and knowledge discovery approaches based on rule induction techniques, 359-394.

Zhang, J., Feng, X., & Liu, Z. (2018). A grid-based clustering algorithm via load analysis for industrial Internet of things. IEEE Access, 6, 13117-13128.

Zhou, Q., Yang, S., Zhao, C., Cai, M., Lou, H., Luo, Y., & Hou, L. (2016). Development and implementation of a spatial unit non-overlapping water stress index for water scarcity evaluation with a moderate spatial resolution. Ecological Indicators, 69, 422-433.

Zhu, Y., Zabaras, N., Koutsourelakis, P. S., & Perdikaris, P. (2019). Physics-constrained deep learning for high-dimensional surrogate modeling and uncertainty quantification without labeled data. Journal of Computational Physics, 394, 56-81.

Published

2020-09-30

How to Cite

Munteanu, S. M., & Ioan, M. D. (2020). An Enhanced Hybrid Grid Partitioning and Rough Set Approach for Fuzzy Rule Generation in Complex Dataset Problem. International Journal of Enterprise Modelling, 14(3), 150–161. https://doi.org/10.35335/emod.v14i3.35