PENERAPAN ALGORITMA RANDOM FOREST DALAM PREDIKSI CUSTOMER CHURN UNTUK MENDUKUNG STRATEGI RETENSI PELANGGAN

Authors

  • Oktaviana Putri Agung Universitas Pamulang Author
  • Fairuza Mayla Faizal Universitas Pamulang Author
  • Irenia Mascharenhas Universitas Pamulang Author

DOI:

https://doi.org/10.69714/84q65g10

Keywords:

Customer Churn, Data Mining, Random Forest, Classification, Customer Retention

Abstract

Customer churn is a condition when customers stop using company services within a certain period. A high churn rate can negatively impact company revenue and customer loyalty, especially in telecommunications companies. This study aims to implement the Random Forest algorithm to predict customer churn in order to support customer retention strategies. The dataset used in this study is Telco Customer Churn obtained from Kaggle with a total of 7043 customer records. This research applies the CRISP-DM methodology consisting of business understanding, data understanding, data preparation, modeling, evaluation, and deployment. The preprocessing stage includes handling missing values, transforming data types, and encoding categorical data. The modeling process uses the Random Forest Classifier algorithm with an 80:20 split between training and testing data. The results show that the Random Forest model achieved an accuracy of 77% in predicting customer churn. The most influential factors affecting churn based on feature importance are TotalCharges, tenure, MonthlyCharges, and customer contract types. Based on the research results, the Random Forest algorithm can help companies identify customers with churn potential and support more effective customer retention strategies.

References

[1] A. Amin, S. Anwar, A. Adnan, M. Nawaz, K. Alawfi, A. Hussain, and K. Huang, “Customer churn prediction in telecommunication industry using data certainty,” Journal of Business Research, vol. 94, pp. 290–301, 2019, doi: 10.1016/j.jbusres.2018.03.004.

[2] S. Idris, A. Khan, and Y. S. Lee, “Intelligent churn prediction in telecom: Employing mRMR feature selection and RotBoost based ensemble classification,” Applied Intelligence, vol. 39, no. 3, pp. 659–672, 2013, doi: 10.1007/s10489-012-0438-0.

[3] L. Breiman, “Random Forests,” Machine Learning, vol. 45, no. 1, pp. 5–32, 2001, doi: 10.1023/A:1010933404324.

[4] A. C. Berson, S. Smith, and K. Thearling, “Building Data Mining Applications for CRM,” Data Mining and Knowledge Discovery, vol. 6, no. 1, pp. 85–102, 2002.

[5] M. H. Dunham, “Data Mining: Introductory and Advanced Topics,” Data Mining and Knowledge Discovery, vol. 8, no. 2, pp. 145–160, 2003.

[6] N. Ahmad, A. Hussain, and M. S. Khan, “Predicting Customer Churn Using Machine Learning Techniques,” International Journal of Advanced Computer Science and Applications, vol. 10, no. 5, pp. 1–8, 2019.

[7] Kaggle, “Telco Customer Churn Dataset.” Internet: https://www.kaggle.com/blastchar/telco-customer-churn [May 30, 2026].

Downloads

Published

2026-06-18

How to Cite

PENERAPAN ALGORITMA RANDOM FOREST DALAM PREDIKSI CUSTOMER CHURN UNTUK MENDUKUNG STRATEGI RETENSI PELANGGAN (Oktaviana Putri Agung, Fairuza Mayla Faizal, & Irenia Mascharenhas, Trans.). (2026). Jurnal Riset Teknik Komputer, 3(2), 90-96. https://doi.org/10.69714/84q65g10