Customer Segmentation Using the Integration of the Recency Frequency Monetary Model and the K-Means Cluster Algorithm

Alamsyah Alamsyah(1), P. Eko Prasetyo(2), Sunyoto Sunyoto(3), Siti Harnina Bintari(4), Danang Dwi Saputro(5), Shohihatur Rohman(6), Rizka Nur Pratama(7),


(1) Department of Computer Science, Universitas Negeri Semarang
(2) Department of Development Economics, Universitas Negeri Semarang, Indonesia
(3) Department of Mechanical Engineering, Universitas Negeri Semarang, Indonesia
(4) Department of Biology, Universitas Negeri Semarang, Indonesia
(5) Department of Mechanical Engineering, Universitas Negeri Semarang, Indonesia
(6) Department of Mechanical Engineering, Universitas Negeri Semarang, Indonesia
(7) Department of Computer Science, Universitas Negeri Semarang, Indonesia

Abstract

Purpose: This research aims to do customer segmentation in retail companies by implementing the Recency Frequency Monetary (RFM) K-Means cluster model and algorithm optimized by the Elbow method.

Methods: This study uses several methods. The RFM model method was chosen to segment customers because it is one of the optimal methods for segmenting customers. The K-Means cluster algorithm method was chosen because it is easy to interpret, implement, fast in convergence, and adapt, but lacks sensitivity to the initial partitioning of the number of clusters. To help classify each category of customers and know the level of loyalty, they use a combination of the RFM model and the K-Means method. The Elbow method is used to improve the performance of the K-Means algorithm by correcting the weakness of the K-Means algorithm, which helps to choose the optimal k value to be used when clustering.

Result: This research produces customer segmentation 3 clusters with a Sum of Square Error (SSE) value of 25,829.39 and a Callinski-Harabaz Index (CHI) value of 36,625.89. The SSE and CHI values are the largest ones, so they are the optimal cluster values.

Novelty: The application of the integrated RFM model and the K-Means cluster algorithm optimized by the Elbow method can be used as a method for customer segmentation.

Keywords

Customer Segmentation; RFM, K-Means Algorithm; Elbow Method

Full Text:

PDF

References

B. E. Adiana, I. Soesanti, and A. E. Permanasari, “Analysis of Customer Segmentation Using a Combination of RFM Models and Clustering Techniques,” J. Terap. Teknol. Inf., vol. 2, no. 1, pp. 23–32, 2018.

H. Zhao and C. He, “Objective Cluster Analysis in Value-Based Customer Segmentation Method,” in 2009 Second Int. Workshop Knowl. Discov. Data Min., 2009, pp. 484–487.

Y. Chen, G. Zhang, D. Hu, and S. Wang, “Customer Segmentation in Customer Relationship Management based on Data Mining,” in Int. Conf. Program. Lang. Manuf., 2006, pp. 288–293.

I. Soesanti, “Web-based Monitoring System on the Production Process of Yogyakarta Batik Industry,” J. Theor. Appl. Inf. Technol., vol. 87, no. 1, pp. 146–152, 2016.

N. W. Wardani, G. R. Dantes, and G. Indrawan, “Prediction of Customer Churn Using the C4.5 Decision Tree Algorithm based on Customer Segmentation to Retain Customers in Retail Companies,” J. Resist. (Rekayasa Sist. Komputer), vol. 1, no. 1, pp. 16–24, 2018.

J. T. Wei, S.-Y. Lin, Y.-Z. Yang, and H.-H. Wu, “Applying Data Mining and RFM Model to Analyze Customers’ Values of a Veterinary Hospital,” in 2016 Int. Symp. Comput. Consum. Control (IS3C), 2016, pp. 481–484.

J. Wu and Z. Lin, “Research on Customer Segmentation Model by Clustering,” in Proc. 7th int. conf. Electron. commer. - ICEC ’05, 2005, p. 316.

C.-H. Cheng and Y.-S. Chen, “Classifying the Segmentation of Customer Value Via RFM Model and RS Theory,” Expert Syst. Appl., vol. 36, no. 3, pp. 4176–4184, 2009.

M. A. Berry, “Mastering Data Mining: The Art and Science of Customer Relationship Management,” Ind. Manag. Data Syst., vol. 100, no. 5, pp. 245–246, 2000.

L. Ye, C. Qiu-ru, X. Hai-xu, L. Yi-jun, and Y. Zhi-min, “Telecom Customer Segmentation with K-Means Clustering,” in 2012 7th Int. Conf. Comput. Sci. Educ. (ICCSE), 2012, pp. 648–651.

Walid and Alamsyah, “Recurrent Neural Network for Forecasting Time Series With Long Memory Pattern,” J. Phys. Conf. Ser., vol. 824, p. 012038, 2017.

A. Alamsyah, B. Prasetiyo, M. F. Al Hakim, and F. D. Pradana, “Prediction of COVID-19 Using Recurrent Neural Network Model,” Sci. J. Informatics, vol. 8, no. 1, pp. 98–103, 2021.

B. Prasetiyo, Alamsyah, and M. A. Muslim, “Analysis of Building Energy Efficiency Dataset Using Naive Bayes Classification Classifier,” J. Phys. Conf. Ser., vol. 1321, no. 3, p. 032016, 2019.

I. Dinariyah and Alamsyah, “Accuracy Enhancement in Face Recognition Using 1D-PCA &Amp; 2D-PCA based on Multilevel Reverse-Biorthogonal Wavelet Transform with KNN Classifier,” J. Phys. Conf. Ser., vol. 1918, no. 4, p. 042144, 2021.

N. Kurinjivendhan and K. Thangadurai, “Modified K-Means Algorithm and Genetic Approach for Cluster Optimization,” in 2016 Int. Conf. Data Min. Adv. Comput. (SAPIENCE), 2016, pp. 53–56.

A. K. Jain, “Data Clustering: 50 Years Beyond K-Means,” Pattern Recognit. Lett., vol. 31, no. 8, pp. 651–666, 2010.

P. Bholowalia and A. Kumar, “EBK-Means: A Clustering Technique based on Elbow Method and K-Means in WSN,” Int. J. Comput. Appl., vol. 105, no. 9, pp. 975–8887, 2014.

O. Doğan, E. Ayçin, and Z. A. Bulut, “Customer Segmentation by Using RFM Model and Clustering Methods: A Case Study in Retail Industry,” Int. J. Contemp. Econ. Adm. Sci., vol. 8, no. 1, pp. 1–19, 2018.

T. M. Kodinariya and P. R. Makwana, “Review on Determining of Cluster in K-Means,” Int. J. Adv. Res. Comput. Sci. Manag. Stud., vol. 1, no. 6, pp. 90–95, 2013.

D. Abdullah, S. Susilo, A. S. Ahmar, R. Rusli, and R. Hidayat, “The Application of K-Means Clustering for Province Clustering in Indonesia of the Risk of the COVID-19 Pandemic based on COVID-19 Data,” Qual. Quant., vol. 56, no. 3, pp. 1283–1291, 2022.

A. M. Hughes, Strategic Database Marketing. McGraw –Hill, 2005.

A. Ambarwari, Q. Jafar Adrian, and Y. Herdiyeni, “Analysis of the Effect of Data Scaling on the Performance of the Machine Learning Algorithm for Plant Identification,” J. RESTI (Rekayasa Sist. dan Teknol. Informasi), vol. 4, no. 1, pp. 117–122, 2020.

C. Yuan and H. Yang, “Research on K-value selection method of K-means clustering algorithm,” Multidiscip. Sci. J., vol. 2, no. 2, pp. 226–235, 2019.

P. N. Tan, M. Steinbach, and V. Kumar, “Data mining introduction,” 2006.

Refbacks

  • There are currently no refbacks.




Scientific Journal of Informatics (SJI)
p-ISSN 2407-7658 | e-ISSN 2460-0040
Published By Department of Computer Science Universitas Negeri Semarang
Website: https://journal.unnes.ac.id/nju/index.php/sji
Email: [email protected]

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.