PENINGKATAN SKALABILITAS SISTEM REKOMENDASI WEBSITE BERITA MENGGUNAKAN CONTENT-BASED FILTERING DAN K-MEANS

Authors

  • Muhamad Arldi Megantara Universitas Amikom Yogyakarta
  • Ema Utami Universitas Amikom Yogyakarta

DOI:

https://doi.org/10.55123/storage.v5i1.7651

Keywords:

Content-Based Filtering, K-Means Clustering, Skalabilitas, Mean Average Precision, Sistem Rekomendasi Berita

Abstract

Perkembangan internet menghasilkan volume data yang besar termasuk website berita yang memunculkan tantangan pada model rekomendasi Content-based filtering (CBF) dengan kompleksitas komputasi linier O(N). Penelitian ini mengusulkan optimalisasi CBF dengan integrasi dengan algoritma K-Means yang akan menghasilkan partisi data dengan tujuan reduksi biaya komputasi. Digunakan 22.250 artikel berita dari antara news. Penentuan nilai cluster (K) optimal menggunakan elbow method dengan K terbaik K = 6 dan divalidasi dengan Silhouette Score dengan nilai 0,5201, yang mengindikasikan sebaran data baik. Hasil dari pengintegrasian CBF dan K-means menunjukan adanya efisiensi pada response time dan penggunaan memori hingga 6 kali lipat. Namun, ditemukan pula trade-off berupa penurunan Mean Average Precision (MAP) dari 0,89 (konvensional) menjadi 0,77 (K=6), yang masih baik pada kualitas rekomendasi. Selain itu, digunakan threshold dengan nilai 0,30 yang terbukti optimal dalam filter konten yang relevan. Kesimpulan yang dapat diambil dari penelitian adalah penggunaan partisi data mampu memberikan kualitas rekomendasi yang tetap andal namun pada beban komputasi yang ditekan.

Downloads

Download data is not yet available.

References

Afoudi, Y., Lazaar, M., & Al Achhab, M. (2021). Hybrid recommendation system combined content-based filtering and collaborative prediction using artificial neural network. Simulation Modelling Practice and Theory, 113, 102375. https://doi.org/10.1016/j.simpat.2021.102375

Ikotun, A. M., Ezugwu, A. E., Abualigah, L., Abuhaija, B., & Heming, J. (2023). K-means clustering algorithms: A comprehensive review, variants analysis, and advances in the era of big data. Information Sciences, 622, 178–210. https://doi.org/10.1016/j.ins.2022.11.139

Imbar, R. V., Adelia, Ayub, M., & Rehatta, A. (2014). Implementasi Cosine Similarity dan Algoritma Smith-Waterman untuk Mendeteksi Kemiripan Teks.

Isinkaye, F. O., Folajimi, Y. O., & Ojokoh, B. A. (2015). Recommendation systems: Principles, methods and evaluation. Egyptian Informatics Journal, 16(3), 261–273. https://doi.org/10.1016/j.eij.2015.06.005

Jan, M., Khusro, S., Alam, I., Khan, I., & Niazi, B. (2022). Interest-Based Content Clustering for Enhancing Searching and Recommendations on Smart TV. Wireless Communications and Mobile Computing, 2022, 1–14. https://doi.org/10.1155/2022/3896840

Juni Permana, A. H. J. P. & Agung Toto Wibowo. (2023). Movie Recommendation System Based on Synopsis Using Content-Based Filtering with TF-IDF and Cosine Similarity. International Journal on Information and Communication Technology (IJoICT), 9(2), 1–14. https://doi.org/10.21108/ijoict.v9i2.747

Kaufman, L., & Rousseeuw, P. J. (1990). Finding Groups in Data: An Introduction to Cluster Analysis (1st ed.). Wiley. https://doi.org/10.1002/9780470316801

Kumbhar, R., Mhamane, S., Patil, H., Patil, S., & Kale, S. (2020). Text Document Clustering Using K-means Algorithm with Dimension Reduction Techniques. 2020 5th International Conference on Communication and Electronics Systems (ICCES), 1222–1228. https://doi.org/10.1109/ICCES48766.2020.9137928

Liu, J., Song, J., Li, C., Zhu, X., & Deng, R. (2021). A Hybrid News Recommendation Algorithm Based On K-means Clustering and Collaborative Filtering. Journal of Physics: Conference Series, 1881(3), Article 3. https://doi.org/10.1088/1742-6596/1881/3/032050

Naghizadeh, A., & Metaxas, D. N. (2020). Condensed Silhouette: An Optimized Filtering Process for Cluster Selection in K-Means. Procedia Computer Science, 176, 205–214. https://doi.org/10.1016/j.procs.2020.08.022

Park, K., Hong, J. S., & Kim, W. (2020). A Methodology Combining Cosine Similarity with Classifier for Text Classification. Applied Artificial Intelligence, 34(5), 396–411. https://doi.org/10.1080/08839514.2020.1723868

Rinjeni, T. P., Indriawan, A., & Rakhmawati, N. A. (2024). Matching Scientific Article Titles using Cosine Similarity and Jaccard Similarity Algorithm. Procedia Computer Science, 234, 553–560. https://doi.org/10.1016/j.procs.2024.03.039

Roy, D., & Dutta, M. (2022). A systematic review and research perspective on recommender systems. Journal of Big Data, 9(1), 59. https://doi.org/10.1186/s40537-022-00592-5

Sukestiyarno, Y. L., Sapolo, H. A., & Sofyan, H. (2023). Application of Recommendation System on E-Learning Platform Using Content-Based Filtering with Jaccard Similarity and Cosine Similarity Algorithms. Computer Science and Mathematics. https://doi.org/10.20944/preprints202306.1672.v1

Yang, Y., Yao, H., Li, R., & Wang, S. (2021). A collaborative filtering recommendation algorithm based on user clustering with preference types. Journal of Physics: Conference Series, 1848(1), Article 1. https://doi.org/10.1088/1742-6596/1848/1/012043

Downloads

Published

2026-02-28

Issue

Section

Articles