A hybrid approach for adaptive fuzzy network partitioning and rule generation using rough set theory

Improving data-driven decision making through accurate and interpretable rules

Authors

  • Jonhariono Sihotang Universitas Putra Abadi Langkat, Indonesia
  • Aisyah Alesha Tajik State University of Commerce, Tajikistan
  • Juliana Batubara Institute of Computer Science, Indoensia
  • Sonya Enjelina Gorat Institute of Computer Science, Indoensia
  • Firta Sari Panjaitan Institute of Computare Science, Indonesia

DOI:

https://doi.org/10.35335/emod.v16i1.54

Keywords:

Adaptive Fuzzy Network Partitioning, Data-Driven Decision Making, Hybrid Approach, Rough Set Theory, Rule Generation

Abstract

Data-driven decision making is vital in credit risk assessment and other areas. Complex datasets are hard to rule. We use adaptive fuzzy network partitioning, rough set theory, and rule generation to improve data-driven credit risk assessment. An adaptive fuzzy network partitioning algorithm is used to cluster the dataset. Each cluster instance receives fuzzy membership degrees. Next, rough set-based attribute reduction identifies credit risk assessment attributes inside each cluster. Finally, attributes are used to build accurate and understandable credit risk assessment criteria. A loan application dataset is used to test the suggested method. The results show successful loan application clustering and the creation of credit risk criteria for each cluster. Accurate predictions and interpretable rules improve credit risk assessment comprehension and decision-making. By merging adaptive fuzzy network partitioning, rough set theory, and rule generation, the hybrid methodology overcomes classic technique constraints. These methods create a comprehensive framework for credit risk assessment criteria that improves accuracy and interpretability. Financial institutions and credit providers may benefit from the approach. The proposed approach can be tested in multiple domains and extended to handle increasingly complicated datasets. Evaluating the methodology on real-world datasets and comparing it to existing methods can also reveal its practicality and efficacy. This research generates accurate and interpretable rules for data-driven credit risk assessment using a hybrid method. Adaptive fuzzy network partitioning, rough set theory, and rule generation can improve decision-making across domains

References

Abid, F. Ben, Sallem, M., & Braham, A. (2019). Robust interpretable deep learning for intelligent fault diagnosis of induction motors. IEEE Transactions on Instrumentation and Measurement, 69(6), 3506–3515.

Acosta, M. P., Vahdatikhaki, F., Santos, J., Hammad, A., & Dorée, A. G. (2021). How to bring UHI to the urban planning table? A data-driven modeling approach. Sustainable Cities and Society, 71, 102948.

Bai, C., Dhavale, D., & Sarkis, J. (2016). Complex investment decisions using rough set and fuzzy c-means: An example of investment in green supply chains. European Journal of Operational Research, 248(2), 507–521.

Bello, R., & Falcon, R. (2017). Rough sets in machine learning: a review. Thriving Rough Sets: 10th Anniversary-Honoring Professor Zdzisław Pawlak’s Life and Legacy & 35 Years of Rough Sets, 87–118.

Bruckert, S., Finzel, B., & Schmid, U. (2020). The next generation of medical decision support: a roadmap toward transparent expert companions. Frontiers in Artificial Intelligence, 3, 507973.

Buddhakulsomsiri, J., Siradeghyan, Y., Zakarian, A., & Li, X. (2006). Association rule-generation algorithm for mining automotive warranty data. International Journal of Production Research, 44(14), 2749–2770.

Bulut, H., Onan, A., & Korukoğlu, S. (2020). An improved ant-based algorithm based on heaps merging and fuzzy c-means for clustering cancer gene expression data. Sādhanā, 45, 1–17.

Chen, G., Liu, H., Yu, L., Wei, Q., & Zhang, X. (2006). A new approach to classification based on association rule mining. Decision Support Systems, 42(2), 674–689.

Cheng, C.-H., & Yang, J.-H. (2018). Fuzzy time-series model based on rough set rule induction for forecasting stock price. Neurocomputing, 302, 33–45.

Clancy, C. M., & Cronin, K. (2005). Evidence-based decision making: global evidence, local decisions. Health Affairs, 24(1), 151–162.

De’ath, G., & Fabricius, K. E. (2000). Classification and regression trees: a powerful yet simple technique for ecological data analysis. Ecology, 81(11), 3178–3192.

De Caigny, A., Coussement, K., & De Bock, K. W. (2018). A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees. European Journal of Operational Research, 269(2), 760–772.

de Zepeda, M. V. N., Meng, F., Su, J., Zeng, X.-J., & Wang, Q. (2021). Dynamic clustering analysis for driving styles identification. Engineering Applications of Artificial Intelligence, 97, 104096.

Deng, L., Hu, Y., Cheung, J. P. Y., & Luk, K. D. K. (2017). A data-driven decision support system for scoliosis prognosis. IEEE Access, 5, 7874–7884.

Duţu, L.-C., Mauris, G., & Bolon, P. (2017). A fast and accurate rule-base generation method for Mamdani fuzzy systems. IEEE Transactions on Fuzzy Systems, 26(2), 715–733.

Fan, C., Xiao, F., Yan, C., Liu, C., Li, Z., & Wang, J. (2019). A novel methodology to explain and evaluate data-driven building energy performance models based on interpretable machine learning. Applied Energy, 235, 1551–1560.

Gahegan, M., Takatsuka, M., Wheeler, M., & Hardisty, F. (2002). Introducing GeoVISTA Studio: an integrated suite of visualization and computational methods for exploration and knowledge construction in geography. Computers, Environment and Urban Systems, 26(4), 267–292.

Gorzałczany, M. B., & Rudziński, F. (2016). A multi-objective genetic optimization for fast, fuzzy rule-based credit classification with balanced accuracy and interpretability. Applied Soft Computing, 40, 206–220.

Herrera-Viedma, E., Palomares, I., Li, C.-C., Cabrerizo, F. J., Dong, Y., Chiclana, F., & Herrera, F. (2020). Revisiting fuzzy and linguistic decision making: scenarios and challenges for making wiser decisions in a better way. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 51(1), 191–208.

Hu, J., Li, T., Luo, C., Fujita, H., & Yang, Y. (2017). Incremental fuzzy cluster ensemble learning based on rough set theory. Knowledge-Based Systems, 132, 144–155.

Hu, K.-H., Chen, F.-H., Hsu, M.-F., & Tzeng, G.-H. (2021). Identifying key factors for adopting artificial intelligence-enabled auditing techniques by joint utilization of fuzzy-rough set theory and MRDM technique. Technological and Economic Development of Economy, 27(2), 459–492.

Hua, L., Gu, Y., Gu, X., Xue, J., & Ni, T. (2021). A novel brain MRI image segmentation method using an improved multi-view fuzzy c-means clustering algorithm. Frontiers in Neuroscience, 15, 662674.

Kang, X. (2021). Combining rough set theory and support vector regression to the sustainable form design of hybrid electric vehicle. Journal of Cleaner Production, 304, 127137.

Kock, A., & Georg Gemünden, H. (2016). Antecedents to decision‐making quality and agility in innovation portfolio management. Journal of Product Innovation Management, 33(6), 670–686.

Lepri, B., Oliver, N., Letouzé, E., Pentland, A., & Vinck, P. (2018). Fair, transparent, and accountable algorithmic decision-making processes: The premise, the proposed solutions, and the open challenges. Philosophy & Technology, 31, 611–627.

Leung, M. K. K., Delong, A., Alipanahi, B., & Frey, B. J. (2015). Machine learning in genomic medicine: a review of computational problems and data sets. Proceedings of the IEEE, 104(1), 176–197.

Li, D., Zhang, H., Li, T., Bouras, A., Yu, X., & Wang, T. (2021). Hybrid missing value imputation algorithms using fuzzy c-means and vaguely quantified rough set. IEEE Transactions on Fuzzy Systems, 30(5), 1396–1408.

Li, X.-H., Cao, C. C., Shi, Y., Bai, W., Gao, H., Qiu, L., Wang, C., Gao, Y., Zhang, S., & Xue, X. (2020). A survey of data-driven and knowledge-aware explainable ai. IEEE Transactions on Knowledge and Data Engineering, 34(1), 29–49.

Liu, B., Ma, Y., & Wong, C.-K. (2001). Classification using association rules: weaknesses and enhancements. Data Mining for Scientific and Engineering Applications, 591–605.

Luca, M., & Bazerman, M. H. (2021). The power of experiments: Decision making in a data-driven world. Mit Press.

Mardani, A., Nilashi, M., Antucheviciene, J., Tavana, M., Bausys, R., & Ibrahim, O. (2017). Recent fuzzy generalisations of rough sets theory: A systematic review and methodological critique of the literature. Complexity, 2017.

Marie, D., & Etzer, M. (2019). A Novel Hybrid Approach: Grid Partition and Rough Set-Based Fuzzy Rule Generation for Accurate Dataset Classification. International Journal of Enterprise Modelling, 13(3), 119–129.

Mencagli, G., Torquati, M., & Danelutto, M. (2018). Elastic-PPQ: A two-level autonomic system for spatial preference query processing over dynamic data streams. Future Generation Computer Systems, 79, 862–877.

Mishra, S., Schuetter, J., Datta-Gupta, A., & Bromhal, G. (2021). Robust data-driven machine-learning models for subsurface applications: are we there yet? Journal of Petroleum Technology, 73(03), 25–30.

Munusamy, S., & Murugesan, P. (2020). Modified dynamic fuzzy c-means clustering algorithm–Application in dynamic customer segmentation. Applied Intelligence, 50(6), 1922–1942.

Nanda, N. B., & Parikh, A. (2019). Hybrid approach for network intrusion detection system using random forest classifier and rough set theory for rules generation. Advanced Informatics for Computing Research: Third International Conference, ICAICR 2019, Shimla, India, June 15–16, 2019, Revised Selected Papers, Part II 3, 274–287.

Nasr, M., Hamdy, M., Hegazy, D., & Bahnasy, K. (2021). An efficient algorithm for unique class association rule mining. Expert Systems with Applications, 164, 113978.

Olanow, C. W., Watts, R. L., & Koller, W. C. (2001). An algorithm (decision tree) for the management of Parkinson’s disease (2001):: Treatment Guidelines. Neurology, 56(suppl 5), S1–S88.

Pacheco, F., Cerrada, M., Sánchez, R.-V., Cabrera, D., Li, C., & de Oliveira, J. V. (2017). Attribute clustering using rough set theory for feature selection in fault severity classification of rotating machinery. Expert Systems with Applications, 71, 69–86.

Provost, F., & Fawcett, T. (2013). Data science and its relationship to big data and data-driven decision making. Big Data, 1(1), 51–59.

Qin, C., Zhang, W., & Wen, J. (2016). A Rough Sets Approach to Teaching Quality Evaluation Modeling and Empirical Analysis. Revista de La Facultad de Ingeniería, 31(6), 242–252.

Rajendran, P., & Madheswaran, M. (2010). Hybrid medical image classification using association rule mining with decision tree algorithm. ArXiv Preprint ArXiv:1001.3503.

Renigier-Biłozor, M., Janowski, A., & d’Amato, M. (2019). Automated valuation model based on fuzzy and rough set theory for real estate market with insufficient source data. Land Use Policy, 87, 104021.

Saldivar, A. A. F., Goh, C., Li, Y., Yu, H., & Chen, Y. (2016). Attribute identification and predictive customisation using fuzzy clustering and genetic search for Industry 4.0 environments. 2016 10th International Conference on Software, Knowledge, Information Management & Applications (SKIMA), 79–86.

Sarker, I. H. (2021). Data science and analytics: an overview from data-driven smart computing, decision-making and applications perspective. SN Computer Science, 2(5), 377.

Savastjanov, M. N. (2021). Improving Production Planning Decisions: A Hybrid Grid Partitioning and Rough Set Approach for Fuzzy Rule Generation in Optimization Models. International Journal of Enterprise Modelling, 15(1), 13–24.

Segura‐Delgado, A., Gacto, M. J., Alcalá, R., & Alcalá‐Fdez, J. (2020). Temporal association rule mining: An overview considering the time variable as an integral or implied component. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 10(4), e1367.

Shi, J., Lei, Y., Zhou, Y., & Gong, M. (2016). Enhanced rough–fuzzy c-means algorithm with strict rough sets properties. Applied Soft Computing, 46, 827–850.

Škrjanc, I., Iglesias, J. A., Sanchis, A., Leite, D., Lughofer, E., & Gomide, F. (2019). Evolving fuzzy and neuro-fuzzy approaches in clustering, regression, identification, and classification: A survey. Information Sciences, 490, 344–368.

Tang, J., Zhang, X., Yu, T., & Liu, F. (2021). Missing traffic data imputation considering approximate intervals: A hybrid structure integrating adaptive network-based inference and fuzzy rough set. Physica A: Statistical Mechanics and Its Applications, 573, 125776.

Tarnowska, K. A., Dispoto, B. C., & Conragan, J. (2021). Explainable AI-based clinical decision support system for hearing disorders. AMIA Summits on Translational Science Proceedings, 2021, 595.

Thabtah, F. A., & Cowling, P. I. (2007). A greedy classification algorithm based on association rule. Applied Soft Computing, 7(3), 1102–1111.

Thabtah, F. A., Cowling, P., & Peng, Y. (2004). MMAC: A new multi-class, multi-label associative classification approach. Fourth IEEE International Conference on Data Mining (ICDM’04), 217–224.

Thabtah, F., Cowling, P., & Peng, Y. (2005). MCAR: multi-class classification based on association rule. The 3rd ACS/IEEE International Conference OnComputer Systems and Applications, 2005., 33.

Vidhya, K. A., & Geetha, T. V. (2017). Rough set theory for document clustering: A review. Journal of Intelligent & Fuzzy Systems, 32(3), 2165–2185.

Waghen, K., & Ouali, M.-S. (2021). Multi-level interpretable logic tree analysis: A data-driven approach for hierarchical causality analysis. Expert Systems with Applications, 178, 115035.

Wang, C., & Zheng, X. (2020). Application of improved time series Apriori algorithm by frequent itemsets in association rule data mining based on temporal constraint. Evolutionary Intelligence, 13(1), 39–49.

Zhao, R., Gu, L., & Zhu, X. (2019). Combining fuzzy C-means clustering with fuzzy rough feature selection. Applied Sciences, 9(4), 679.

Downloads

Published

2022-01-30

How to Cite

Sihotang, J., Alesha, A., Batubara, J., Gorat, S. E., & Panjaitan, F. S. (2022). A hybrid approach for adaptive fuzzy network partitioning and rule generation using rough set theory: Improving data-driven decision making through accurate and interpretable rules. International Journal of Enterprise Modelling, 16(1), 12–22. https://doi.org/10.35335/emod.v16i1.54

Most read articles by the same author(s)