Optimizing Dataset Classification with Hybrid Grid Partitioning and Rough Set-based Fuzzy Rule Generation Approach
DOI:
https://doi.org/10.35335/emod.v14i1.23Keywords:
Dataset classification, Hybrid grid partitioning, Rough set-based attribute reduction, Fuzzy rule generation, OptimizationAbstract
This research focuses on optimizing dataset classification by combining hybrid grid partitioning and rough set-based fuzzy rule generation. Traditional classification algorithms often face challenges in handling high-dimensional data, attribute redundancy, and uncertainty, leading to reduced accuracy and increased computational complexity. To address these issues, we propose an integrated approach that leverages hybrid grid partitioning for adaptive representation of the dataset, rough set-based attribute reduction for identifying relevant attributes, and fuzzy rule generation for handling uncertainty and capturing complex relationships. The hybrid grid partitioning creates multiple levels of granularity, capturing both local and global patterns. Rough set-based attribute reduction reduces dimensionality and eliminates redundant information. Fuzzy rule generation enables the handling of uncertainty and the mapping of input attributes to output classes. We present a case example of customer churn prediction in a telecommunications company to illustrate the practical relevance of the proposed approach. The optimized classification model provides insights into churn factors and enables proactive measures for customer retention. The research contributes to the field by offering an integrated framework to enhance classification accuracy, interpretability, and efficiency. The findings have the potential to benefit various industries and applications that rely on accurate classification models, improving decision-making processes and performance of intelligent systems.
References
Abbas, S. M., Alam, K. A., & Shamshirband, S. (2019). A soft-rough set based approach for handling contextual sparsity in context-aware video recommender systems. Mathematics, 7(8), 740.
Ahmed, M. M., & Isa, N. A. M. (2017). Knowledge base to fuzzy information granule: A review from the interpretability-accuracy perspective. Applied Soft Computing, 54, 121-140.
Al‐Maitah, M. (2019). Text analytics for big data using rough–fuzzy soft computing techniques. Expert Systems, 36(6), e12463.
Bhatt, R., Ramanna, S., & Peters, J. F. (2009). Software defect classification: A comparative study of rough-neuro-fuzzy hybrid approaches with linear and non-linear SVMs. Rough Set Theory: A True Landmark in Data Analysis, 213-231.
Bhaumik, H., Bhattacharyya, S., Nath, M. D., & Chakraborty, S. (2016). Hybrid soft computing approaches to content based video retrieval: A brief review. Applied Soft Computing, 46, 1008-1029.
Cervante, L., Xue, B., Shang, L., & Zhang, M. (2012). A dimension reduction approach to classification based on particle swarm optimisation and rough set theory. In AI 2012: Advances in Artificial Intelligence: 25th Australasian Joint Conference, Sydney, Australia, December 4-7, 2012. Proceedings 25 (pp. 313-325). Springer Berlin Heidelberg.
Chimphlee, W., HananAbdullah, A., Sap, M. N. M., Chimphlee, S., & Srinoy, S. (2007). A Rough-Fuzzy Hybrid Algorithm for computer intrusion detection. a a, 2, 1.
Chou, C. H., Hsieh, S. C., & Qiu, C. J. (2017). Hybrid genetic algorithm and fuzzy clustering for bankruptcy prediction. Applied Soft Computing, 56, 298-316.
Coy, C. G. (2010). A hybrid-genetic algorithm for training a Sugeno-type Fuzzy Inference System with a mutable rule base (Doctoral dissertation, University of Toledo).
Dehnavi, A. M., Sehhati, M. R., & Rabbani, H. (2013). Hybrid method for prediction of metastasis in breast cancer patients using gene expression signals. Journal of Medical Signals and Sensors, 3(2), 79.
Guo, J. Y. (2003). Rough set-based approach to data mining. Case Western Reserve University.
Hassanien, A. E., Abraham, A., Peters, J. F., Schaefer, G., & Henry, C. (2009). Rough sets and near sets in medical imaging: A review. IEEE Transactions on Information Technology in Biomedicine, 13(6), 955-968.
Hassanien, A. E., Moftah, H. M., Azar, A. T., & Shoman, M. (2014). MRI breast cancer diagnosis hybrid approach using adaptive ant-based segmentation and multilayer perceptron neural networks classifier. Applied Soft Computing, 14, 62-71.
Hor, C. L., Crossley, P., Watson, S., & Millar, D. (2009). Developing a knowledge-based system using rough set theory and genetic algorithms for substation fault diagnosis. Rough Set Theory: A True Landmark in Data Analysis, 279-320.
Huang, X., Shi, F., Gu, W., & Chen, S. (2009). SVM-based fuzzy rules acquisition system for pulsed GTAW process. Engineering Applications of Artificial Intelligence, 22(8), 1245-1255.
Jensen, R., & Shen, Q. (2008). Computational intelligence and feature selection: rough and fuzzy approaches.
Khashei, M., Hamadani, A. Z., & Bijari, M. (2012). A fuzzy intelligent approach to the classification problem in gene expression data analysis. Knowledge-Based Systems, 27, 465-474.
Kostek, B. (2013). Soft computing in acoustics: applications of neural networks, fuzzy logic and rough sets to musical acoustics (Vol. 31). Physica.
Ma, L., & Fan, S. (2017). CURE-SMOTE algorithm and hybrid algorithm for feature selection and parameter optimization based on random forests. BMC bioinformatics, 18(1), 1-18.
Mahata, K., Das, R., Das, S., & Sarkar, A. (2017). Watershed image analysis using a PSO-CA hybrid approach. In Hybrid Intelligent Techniques for Pattern Analysis and Understanding (pp. 229-246). Chapman and Hall/CRC.
Maji, P., & Pal, S. K. (2012). Rough-fuzzy pattern recognition: applications in bioinformatics and medical imaging (Vol. 3). John Wiley & Sons.
Marwala, T., Lagazio, M., Marwala, T., & Lagazio, M. (2011). Particle Swarm Optimization and Hill-Climbing Optimized Rough Sets for Modeling Interstate Conflict. Militarized Conflict Modeling Using Computational Intelligence, 147-164.
Mitra, S., Mitra, M., & Chaudhuri, B. B. (2006). An approach to a rough set based disease inference engine for ECG classification. In Rough Sets and Current Trends in Computing: 5th International Conference, RSCTC 2006 Kobe, Japan, November 6-8, 2006 Proceedings 5 (pp. 398-407). Springer Berlin Heidelberg.
Mitra, S., Mitra, M., & Chaudhuri, B. B. (2008). A rough set based approach for ecg classification. Transactions on Rough Sets IX, 157-186.
Mohanty, R., Ravi, V., & Patra, M. R. (2010). The application of intelligent and soft-computing techniques to software engineering problems: a review. International Journal of Information and Decision Sciences, 2(3), 233-272.
Mohebi, E., & Mohd. Noor Md. Sap. (2009, May). An Optimized Hybrid Kohonen Neural Network for Ambiguity Detection in Cluster Analysis Using Simulated Annealing. In ICEIS (pp. 389-401).
Mohebi, E., & Sap, M. N. M. Data Clustering using Artificial Neural Network, Rough set Theory and Optimization Techniques.
Nancy, S. G., & Appavu, S. (2016). Optimal feature selection for classification using rough set-based CGA-NN classifier. International Journal of Business Intelligence and Data Mining, 11(4), 357-378.
Pourpanah, F., Shi, Y., Lim, C. P., Hao, Q., & Tan, C. J. (2019). Feature selection based on brain storm optimization for data classification. Applied Soft Computing, 80, 761-775.
Pun, C., & Kumar, N. (2011, November). Projected Rough Fuzzy c-means clustering. In 2011 11th International Conference on Intelligent Systems Design and Applications (pp. 530-536). IEEE.
Rahman, M. M. (2006). Rough set approach to feature reduction in KDD: Evolutionary computing and data sampling. Faculty of Graduate Studies and Research, University of Regina.
Rajab, S., & Sharma, V. (2019). An interpretable neuro-fuzzy approach to stock price forecasting. Soft Computing, 23, 921-936.
Rawat, S., Patel, A., Celestino, J., & dos Santos, A. L. M. (2016). A dominance based rough set classification system for fault diagnosis in electrical smart grid environments. Artificial Intelligence Review, 46, 389-411.
Sadiq, A. S., Alkazemi, B., Mirjalili, S., Ahmed, N., Khan, S., Ali, I., ... & Ghafoor, K. Z. (2018). An efficient ids using hybrid magnetic swarm optimization in wanets. IEEE Access, 6, 29041-29053.
Salamo, M., & Lopez-Sanchez, M. (2011). Rough set based approaches to feature selection for case-based reasoning classifiers. Pattern Recognition Letters, 32(2), 280-292.
Siddique, M. M., Zheng, Y., Yang, X., & Beddoe, G. (2008, March). Reduction of false positives by extracting fuzzy rules from data for polyp detection in CTC scans. In Medical Imaging 2008: Computer-Aided Diagnosis (Vol. 6915, pp. 239-250). SPIE.
Sivakumar, N. R., & Alaraj, A. (2017). Path management strategy to reduce flooding of grid fisheye state routing protocol in mobile ad hoc network using fuzzy and rough set theory. International Journal of Artificial Intelligence and Soft Computing, 6(3), 187-208.
Slezak, D., Szczuka, M., Duentsch, I., & Yao, Y. (Eds.). (2005). Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing: 10th International Conference, RSFDGrC 2005, Regina, Canada, August 31-September 3, 2005, Proceedings, Part I (Vol. 3641). Springer.
Thangarasu, M., & Inbarani, H. H. (2016, October). MPRK algorithm for clustering the large text datasets. In 2016 IEEE International Conference on Advances in Computer Applications (ICACA) (pp. 224-229). IEEE.
Tomar, D., & Agarwal, S. (2015). Hybrid feature selection based weighted least squares twin support vector machine approach for diagnosing breast cancer, hepatitis, and diabetes. Advances in Artificial Neural Systems, 2015.
Tripathy, B. K., Seetha, H., & Murty, M. N. (2018). Uncertainty-based clustering algorithms for large data sets. In Modern Technologies for Big Data Classification and Clustering (pp. 1-33). IGI Global.
Udhaya Kumar, S., & Hannah Inbarani, H. (2017). PSO-based feature selection and neighborhood rough set-based classification for BCI multiclass motor imagery task. Neural Computing and Applications, 28, 3239-3258.
Valdés, J. J., & Barton, A. J. (2007). Finding relevant attributes in high dimensional data: a distributed computing hybrid data mining strategy. Transactions on Rough Sets VI: Commemorating the Life and Work of Zdzisław Pawlak, Part I, 366-396.
Vluymans, S., D'eer, L., Saeys, Y., & Cornelis, C. (2015). Applications of Fuzzy Rough Set Theory in Machine Learning: a Survey. Fundam. Informaticae, 142(1-4), 53-86.
Wang, Y., Wu, X., Chen, Z., Ren, F., Feng, L., & Du, Q. (2019). Optimizing the predictive ability of machine learning methods for landslide susceptibility mapping using SMOTE for Lishui City in Zhejiang Province, China. International journal of environmental research and public health, 16(3), 368.
Yasodha, S., & Prakash, P. S. (2012, March). Data mining classification technique for talent management using SVM. In 2012 International Conference on Computing, Electronics and Electrical Technologies (ICCEET) (pp. 959-963). IEEE.
Zhang, D., Wang, Y., & Huang, H. (2009). Rough neural network modeling based on fuzzy rough model and its application to texture classification. Neurocomputing, 72(10-12), 2433-2443.
Zhou, T., Chung, F. L., & Wang, S. (2016). Deep TSK fuzzy classifier with stacked generalization and triplely concise interpretability guarantee for large data. IEEE Transactions on Fuzzy Systems, 25(5), 1207-1221.
Published
How to Cite
Issue
Section
License
Copyright (c) 2020 Dall Matew Caloz

This work is licensed under a Creative Commons Attribution 4.0 International License.
