Applying K-means Clustering to Create Product Recommendation System Based on Purchase Profiles

Authors

  • Roniel Venâncio Santana Universidade Federal do Ceará
  • Heráclito Lopes Jaguaribe Pontes Universidade Federal do Ceará

DOI:

https://doi.org/10.22279/navus.2020.v10.p01-14.1189

Keywords:

Recommendation system. Data science. Machine learning. Clustering. Business intelligence

Abstract

The use of predictive machine learning models for big data is today one of the main trends to be explored by data science. Its application to the business world for a search of competitive differential is directly related to Business Intelligence so companies can make more assertive decisions. Thus, this paper proposes to apply a machine learning technique to create a product recommendation system based on customers' purchase profile, modeled for a product distribution company. For this purpose, the K-means clustering algorithm was used to group customers based on their purchase profile. Finally, the recommendation system's principle is based on a comparative analysis between customers in the same cluster and based on their geographic distances to recommend that item that sells well in one point of sales but does not perform so well in another. At the end of the application 70 clusters were generated for the entire range of customers of the company focused in the present study. Each customer in each cluster received a list containing 5 recommended products based on the comparison made with their close neighbors of similar buying profile.

Downloads

Download data is not yet available.

References

Ali, H., & Lande, M. (2019). Data-Driven Decisions in Prototyping and Product Development: A Framework for Uncertainty and Decision-Making. ASME International Mechanical Engineering Congress and Exposition, Salt Lake City, Vol. 83518, p. V014T14A039.

Arunachalam, D., & Kumar, N. (2018). Benefit-based consumer segmentation and performance evaluation of clustering approaches: An evidence of data-driven decision-making. Expert Systems with Applications, 111, 11-34.

Cardoso, G. P., Sininbardi, B. G., Sobral, M. S., & de Souza, E. M. (2019). Modelo de avaliação de risco em campanhas kickstarter utilizando machine learning. Navus-Revista de Gestão e Tecnologia, 9(4), 66-79.

Cavalcante, I. M., Frazzon, E. M., Forcellini, F. A., & Ivanov, D. (2019). A supervised machine learning approach to data-driven simulation of resilient supplier selection in digital manufacturing. International Journal of Information Management, 49, 86-97.

Chiu, M. C., & Ko, L. W. (2017). Develop a personalized intelligent music selection system based on heart rate variability and machine learning. Multimedia Tools and Applications, 76(14), 15607-15639.

Diale, M., Celik, T., & Van Der Walt, C. (2019). Unsupervised feature learning for spam email filtering. Computers & Electrical Engineering, 74, 89-104.

Fernández-García, A. J., Iribarne, L., Corral, A., Criado, J., & Wang, J. Z. (2019). A recommender system for component-based applications using machine learning techniques. Knowledge-Based Systems, 164, 68-84.

Ferreira, E. de Paula, Junior, M. R. B., Isnard, P. A., de Souza França, R., & de Aguiar Filho, A. S. (2018). Gestão do conhecimento, internet das coisas e inovação: a relação dos temas e a intensidade de pesquisas realizadas. Navus-Revista de Gestão e Tecnologia, 8(3), 99-112.

Fraga, B. D., Erpen, J. G., Varvakis, G., & dos Santos, N. (2017). Business Intelligence: métodos e técnicas de gestão do conhecimento e as tendências para avanços do capital intelectual. Navus-Revista de Gestão e Tecnologia, 7(1), 43-56.

Frémal, S., & Lecron, F. (2017). Weighting strategies for a recommender system using item clustering based on genres. Expert Systems with Applications, 77, 105-113.

Govind, B. S., Tene, R., & Saideep, K. L. (2018). Novel Recommender Systems Using Personalized Sentiment Mining. IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT), Bangalore, 4, 1-5.

Irvan, M., & Terano, T. (2016). Group Recommendation System for E-Learning Communities: A Multi-agent Approach. Advances in Social Computing and Digital Education, Singapore, 7, 35-46.

Kant, S., Mahara, T., Jain, V. K., Jain, D. K., & Sangaiah, A. K. (2018). LeaderRank based k-means clustering initialization method for collaborative filtering. Computers & Electrical Engineering, 69, 598-609.

Karimi, M., Jannach, D., & Jugovac, M. (2018). News recommender systems–Survey and roads ahead. Information Processing & Management, 54(6), 1203-1227.

Liang, J., & Liu, R. (2015). Stacked denoising autoencoder and dropout together to prevent overfitting in deep neural network. 2015 8th International Congress on Image and Signal Processing (CISP), London, 22, 697-701.

Long, Q. (2018). Data-driven decision making for supply chain networks with agent-based computational experiment. Knowledge-Based Systems, 141, 55-66.

Lopez, C., Tucker, S., Salameh, T., & Tucker, C. (2018). An unsupervised machine learning method for discovering patient clusters based on genetic signatures. Journal of biomedical informatics, 85, 30-39.

Majumdar, A. (2018). Graph structured autoencoder. Neural Networks, 106, 271-280.

Nilashi, M., Bagherifard, K., Rahmani, M., & Rafe, V. (2017). A recommender system for tourism industry using cluster ensemble and prediction machine learning techniques. Computers & industrial engineering, 109, 357-368.

Parashar, A., & Goyal, D. (2016). Clustering Gait Data Using Different Machine Learning Techniques and Finding the Best Technique. International Conference on Smart Trends for Information Technology and Computer Communications, Singapore, 426-433.

Portugal, I., Alencar, P., & Cowan, D. (2018). The use of machine learning algorithms in recommender systems: A systematic review. Expert Systems with Applications, 97, 205-227.

Putriany, V., Jauhari, J., & Heroza, R. I. (2019). Item Clustering as An Input for Skin Care Product Recommended System using Content Based Filtering. Journal of Physics: Conference Series, Palembang, Vol. 1196, No. 1, p. 012004.

Sangeetha, J., & Prakash, V. S. J. (2019). Improved Feature-Specific Collaborative Filtering Model for the Aspect-Opinion Based Product Recommendation. Advances in Big Data and Cloud Computing, Singapore, 275-289.

Schenatto, K., de Souza, E. G., Bazzi, C. L., Gavioli, A., Betzek, N. M., & Beneduzzi, H. M. (2017). Normalization of data for delineating management zones. Computers and Electronics in Agriculture, 143, 238-248.

Trigueiro, F. M. C., Neto, D. A. C., de Sousa Santos, T., & Prearo, L. C. (2017). Comportamento de consumo no segmento de veículos automotores nas cidades de Cuiabá e Várzea Grande. NAVUS-Revista de Gestão e Tecnologia, 7(3), 7-18.

Wasid, M., & Ali, R. (2018). An improved recommender system based on multi-criteria clustering approach. Procedia Computer Science, 131, 93-101.

Wen, T., Bao, J., & Ding, F. (2018). QoS-Aware Web Service Recommendation Model Based on Users and Services Clustering. Proceedings of the International Conference on Information Technology and Electrical Engineering 2018, Xiamen, 18, 1-6.

Zahra, S., Ghazanfar, M. A., Khalid, A., Azam, M. A., Naeem, U., & Prugel-Bennett, A. (2015). Novel centroid selection approaches for KMeans-clustering based recommender systems. Information sciences, 320, 156-189.

Zhang, Y., Zhang, R., Wang, Y., Guo, H., Zhong, R. Y., Qu, T., & Li, Z. (2019). Big data driven decision-making for batch-based production systems. Procedia CIRP, 83, 814-818.

Published

2020-07-26

Issue

Section

Articles