Content-Based Filtering Using TF-IDF for a Course Recommendation System for Indonesian MSME Entrepreneurs
DOI:
https://doi.org/10.35335/int.jo.emod.v20i2.172
Keywords:
Cosine Similarity, Entrepreneurs, MSMEs, Recommendation System, TF-IDF
Abstract
Micro, Small, and Medium Enterprises (MSMEs), particularly those led by female entrepreneurs, play a vital role in Indonesia’s economic development; however, digital learning platforms often lack adaptive mechanisms that align course offerings with individual learning needs. Existing platforms generally rely on manual selection or generic categorization, leaving a gap in personalized recommendation support within digital entrepreneurship education. The primary objective of this study is to develop and assess a content-based course recommendation system for Femalepreneur.id that addresses this limitation. A quantitative experimental research design was adopted, using user profile data collected through questionnaires and course descriptions obtained from the platform repository. The methodology integrates systematic text preprocessing, feature representation using Term Frequency–Inverse Document Frequency (TF-IDF), and similarity computation through cosine similarity to generate personalized recommendations. The experimental results show a Mean Precision@3 of 51.5%, a Mean Average Precision@3 (MAP@3) of 42.9%, and a Hit Ratio@3 of 90.9%. The Precision metric shows that the system can recommend up to three relevant courses, the maximum possible given the ground truth, while the Hit Ratio metric shows that the system recommends at least one relevant topic in most cases. These findings confirm the effectiveness of TF-IDF in modelling textual learning features and highlight the contribution of the proposed system in strengthening personalized digital entrepreneurship learning for female entrepreneurs.
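The pipeline summarized in the abstract (TF-IDF features, cosine similarity ranking, and top-3 evaluation) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the course texts, the user profile, the relevance labels, the whitespace tokenizer, the smoothed IDF formula log(N/df) + 1, and all function names are assumptions made for illustration, and the Average Precision normalization shown is only one common convention.

```python
import math
from collections import Counter

def tokenize(text):
    # Assumption: lowercase + whitespace split. A real pipeline would likely
    # also remove stop words and apply stemming.
    return text.lower().split()

def fit_idf(docs):
    # IDF per term over the course corpus, using log(N / df) + 1 (an assumed variant).
    n = len(docs)
    df = Counter(t for d in docs for t in set(tokenize(d)))
    return {t: math.log(n / c) + 1.0 for t, c in df.items()}

def vectorize(text, idf):
    # Sparse TF-IDF vector as {term: weight}; terms unseen in the corpus are dropped.
    toks = [t for t in tokenize(text) if t in idf]
    tf = Counter(toks)
    return {t: (c / len(toks)) * idf[t] for t, c in tf.items()}

def cosine(a, b):
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def recommend(profile, courses, k=3):
    # Rank courses by cosine similarity to the profile; ties broken by course id.
    idf = fit_idf(list(courses.values()))
    p = vectorize(profile, idf)
    scored = [(cosine(p, vectorize(text, idf)), cid) for cid, text in courses.items()]
    scored.sort(key=lambda s: (-s[0], s[1]))
    return [cid for _, cid in scored[:k]]

def precision_at_k(recs, relevant, k=3):
    return sum(1 for r in recs[:k] if r in relevant) / k

def average_precision_at_k(recs, relevant, k=3):
    # Normalized by min(|relevant|, k); other normalizations exist.
    hits, total = 0, 0.0
    for rank, r in enumerate(recs[:k], start=1):
        if r in relevant:
            hits += 1
            total += hits / rank
    denom = min(len(relevant), k)
    return total / denom if denom else 0.0

def hit_at_k(recs, relevant, k=3):
    return 1.0 if any(r in relevant for r in recs[:k]) else 0.0

# Hypothetical data, not taken from the study.
courses = {
    "c1": "digital marketing social media for small business",
    "c2": "financial bookkeeping and cash flow for small business",
    "c3": "product photography for online shop",
    "c4": "export regulations and logistics",
}
profile = "social media marketing for online shop"
relevant = {"c1", "c3"}

recs = recommend(profile, courses, k=3)
print(recs, precision_at_k(recs, relevant), average_precision_at_k(recs, relevant), hit_at_k(recs, relevant))
```

Averaging precision_at_k, average_precision_at_k, and hit_at_k over all users would give figures of the same kind as the Mean Precision@3, MAP@3, and Hit Ratio@3 reported above, under the stated assumptions.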
License
Copyright (c) 2026 Rr Octanty Mulianingtyas, Bagus Hendra Saputra, Rohani Situmorang, Galih Prakoso Rizky A

This work is licensed under a Creative Commons Attribution 4.0 International License.
