PENERAPAN ALGORITMA RANDOM FOREST DALAM PREDIKSI CUSTOMER CHURN UNTUK MENDUKUNG STRATEGI RETENSI PELANGGAN
DOI:
https://doi.org/10.69714/84q65g10Keywords:
Customer Churn, Data Mining, Random Forest, Classification, Customer RetentionAbstract
Customer churn is a condition when customers stop using company services within a certain period. A high churn rate can negatively impact company revenue and customer loyalty, especially in telecommunications companies. This study aims to implement the Random Forest algorithm to predict customer churn in order to support customer retention strategies. The dataset used in this study is Telco Customer Churn obtained from Kaggle with a total of 7043 customer records. This research applies the CRISP-DM methodology consisting of business understanding, data understanding, data preparation, modeling, evaluation, and deployment. The preprocessing stage includes handling missing values, transforming data types, and encoding categorical data. The modeling process uses the Random Forest Classifier algorithm with an 80:20 split between training and testing data. The results show that the Random Forest model achieved an accuracy of 77% in predicting customer churn. The most influential factors affecting churn based on feature importance are TotalCharges, tenure, MonthlyCharges, and customer contract types. Based on the research results, the Random Forest algorithm can help companies identify customers with churn potential and support more effective customer retention strategies.
References
[1] A. Amin, S. Anwar, A. Adnan, M. Nawaz, K. Alawfi, A. Hussain, and K. Huang, “Customer churn prediction in telecommunication industry using data certainty,” Journal of Business Research, vol. 94, pp. 290–301, 2019, doi: 10.1016/j.jbusres.2018.03.004.
[2] S. Idris, A. Khan, and Y. S. Lee, “Intelligent churn prediction in telecom: Employing mRMR feature selection and RotBoost based ensemble classification,” Applied Intelligence, vol. 39, no. 3, pp. 659–672, 2013, doi: 10.1007/s10489-012-0438-0.
[3] L. Breiman, “Random Forests,” Machine Learning, vol. 45, no. 1, pp. 5–32, 2001, doi: 10.1023/A:1010933404324.
[4] A. C. Berson, S. Smith, and K. Thearling, “Building Data Mining Applications for CRM,” Data Mining and Knowledge Discovery, vol. 6, no. 1, pp. 85–102, 2002.
[5] M. H. Dunham, “Data Mining: Introductory and Advanced Topics,” Data Mining and Knowledge Discovery, vol. 8, no. 2, pp. 145–160, 2003.
[6] N. Ahmad, A. Hussain, and M. S. Khan, “Predicting Customer Churn Using Machine Learning Techniques,” International Journal of Advanced Computer Science and Applications, vol. 10, no. 5, pp. 1–8, 2019.
[7] Kaggle, “Telco Customer Churn Dataset.” Internet: https://www.kaggle.com/blastchar/telco-customer-churn [May 30, 2026].
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Jurnal Riset Teknik Komputer

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.









