Enhancing Sequence Movie Recommendation System Using Deep Learning and KMeans
Abstract
:1. Introduction
- We interpret the recommendation model using the sequential temporal of user interactions (movie ratings) to deliver a dynamic and contextual comprehension of user preferences based on the Movielens dataset. Our approach adopts a Transformer architecture, integrating multi-head attention with user demographic and movie embeddings, which allows the model to weigh various aspects of a user’s movie-watching history differently when making predictions for the next target movie. Sequential recommendations are referred to as advanced model-based CF, as this is more effective in tackling the issues with existing traditional techniques.
- Then, the model contains a KMeans clustering post process to group movies into clusters depending on their embeddings, which aids in diversifying recommendations. It also integrates movie genres as extra attributes, boosting the model’s capacity to represent varied movie qualities. The algorithm is designed to anticipate Top-N recommendations for users, employing clustering to ensure a mix of genres and preferences. The evaluation measures expand beyond typical loss functions to include precision, recall, and F1 scores, offering a comprehensive view of the model’s performance.
2. Related Work
2.1. Traditional-Based Recommendation Systems
2.2. Advanced-Based Recommendation Systems
3. Methodology of Movie Recommendation Systems
3.1. Data Processing and Sequence Creation
3.2. Model Architecture
3.2.1. Process ONE—Predicting User Ratings for Movies
- (a)
- User-Movie Embedding Layer
- (b)
- Model Training
3.2.2. Process TWO: Leveraging Predictions for Recommendations
4. Experimental Study
4.1. Implementation Detail
4.1.1. Environment Set-up of Transformer Model
4.1.2. Environment Set up of K-Means Algorithm
4.2. Evaluation Metric
4.3. Benchmark Models
- Bayesian Personalized Ranking (BPR): The model is an optimization technique applied to matrix factorization, explicitly tailored to the handling of implicit feedback to enhance the factorization process by incorporating a pairwise ranking loss function.
- NeuMF: This is a collaborative filtering model that uses user–item interactions with an MLP instead of the inner product in matrix factorization when calculating the relationship between the user and item.
- Caser: This adopts convolutional neural networks (CNNs) in horizontal and vertical dimensions to effectively model high-order Markov Chains (M.C.s) for sequential recommendations.
- GRU4Rec: This is a unidirectional recurrent neural network (RNN)-based framework that is used to capture sequential dependencies and make predictions.
- SASRec: This is a state-of-the-art sequential recommendation model leveraging self-attention blocks to predict the next item for recommendation. Additionally, it employs the dot product computation between sequential latent features of the latest item and embeddings of the target item to establish the scoring mechanism.
4.4. Results Analysis
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Djedouboum, A.C.; Abba Ari, A.A.; Gueroui, A.M.; Mohamadou, A.; Aliouat, Z. Big data collection in large-scale wireless sensor networks. Sensors 2018, 18, 4474. [Google Scholar] [CrossRef]
- Qolomany, B.; Al-Fuqaha, A.; Gupta, A.; Benhaddou, D.; Alwajidi, S.; Qadir, J.; Fong, A.C. Leveraging machine learning and big data for smart buildings: A comprehensive survey. IEEE Access 2019, 7, 90316–90356. [Google Scholar] [CrossRef]
- Guk, K.; Han, G.; Lim, J.; Jeong, K.; Kang, T.; Lim, E.K.; Jung, J. Evolution of wearable devices with real-time disease monitoring for personalized healthcare. Nanomaterials 2019, 9, 813. [Google Scholar] [CrossRef]
- Lemonde, C.; Arsenio, E.; Henriques, R. Integrative analysis of multimodal traffic data: Addressing open challenges using big data analytics in the city of Lisbon. Eur. Transp. Res. Rev. 2021, 13, 64. [Google Scholar] [CrossRef]
- Kirmani, S.; Mazid, A.; Khan, I.A.; Abid, M.A. Survey on IoT-Enabled Smart Grids: Technologies, Architectures, Applications, and Challenges. Sustainability 2022, 15, 717. [Google Scholar] [CrossRef]
- Fayyaz, Z.; Ebrahimian, M.; Nawara, D.; Ibrahim, A.; Kashef, R. Recommendation systems: Algorithms, challenges, metrics, and business opportunities. Appl. Sci. 2020, 10, 7748. [Google Scholar] [CrossRef]
- Qin, L.; Xu, X.; Li, J. A real-time professional content recommendation system for healthcare providers’ knowledge acquisition. In Big Data–BigData 2018: 7th International Congress, Held as Part of the Services Conference Federation, SCF 2018, Seattle, WA, USA, 25–30 June 2018, Proceedings; Springer International Publishing: Berlin/Heidelberg, Germany, 2018; pp. 367–371. [Google Scholar]
- Rabiu, I.; Salim, N.; Da’u, A.; Osman, A. Recommender system based on temporal models: A systematic review. Appl. Sci. 2020, 10, 2204. [Google Scholar] [CrossRef]
- Bennett, J.; Lanning, S. The netflix prize. In Proceedings of the KDD Cup and Workshop, San Jose, CA, USA, 12 August 2007; Volume 2007, p. 35. [Google Scholar]
- Covington, P.; Adams, J.; Sargin, E. Deep neural networks for youtube recommendations. In Proceedings of the 10th ACM Conference on Recommender Systems, Boston, MA, USA, 15–19 September 2016; pp. 191–198. [Google Scholar]
- Smith, B.; Linden, G. Two decades of recommender systems at Amazon.com. IEEE Internet Comput. 2017, 21, 12–18. [Google Scholar] [CrossRef]
- Cheng, H.T.; Koc, L.; Harmsen, J.; Shaked, T.; Chandra, T.; Aradhye, H.; Anderson, G.; Corrado, G.; Chai, W.; Ispir, M.; et al. Wide & deep learning for recommender systems. In Proceedings of the 1st Workshop on Deep Learning for Recommender Systems, Boston, MA, USA, 15 September 2016; pp. 7–10. [Google Scholar]
- Roy, D.; Dutta, M. A systematic review and research perspective on recommender systems. J. Big Data 2022, 9, 59. [Google Scholar] [CrossRef]
- Vilakone, P.; Xinchang, K.; Park, D.S. Movie recommendation system based on users’ personal information and movies rated using the method of k-clique and normalized discounted cumulative gain. J. Inf. Process. Syst. 2020, 16, 494–507. [Google Scholar]
- Selimi, D.; Nuci, K.P. The use of Recommender Systems in web technology and an in-depth analysis of Cold State problem. arXiv 2020, arXiv:2009.04780. [Google Scholar]
- Shambour, Q.Y.; Hussein, A.H.; Kharma, Q.M.; Abualhaj, M.M. Effective Hybrid Content-Based Collaborative Filtering Approach for Requirements Engineering. Comput. Syst. Sci. Eng. 2022, 40, 113–125. [Google Scholar] [CrossRef]
- Khanal, S.S.; Prasad PW, C.; Alsadoon, A.; Maag, A. A systematic review: Machine learning based recommendation systems for e-learning. Educ. Inf. Technol. 2020, 25, 2635–2664. [Google Scholar] [CrossRef]
- Park, S.T.; Chu, W. Pairwise preference regression for cold-start recommendation. In Proceedings of the 3rd ACM Conference on Recommender Systems, New York, NY, USA, 23–25 October 2009; pp. 21–28. [Google Scholar]
- Martins, G.B.; Papa, J.P.; Adeli, H. Deep learning techniques for recommender systems based on collaborative filtering. Expert Syst. 2020, 37, e12647. [Google Scholar] [CrossRef]
- Peng, S.; Park, D.S.; Kim, D.Y.; Yang, Y.; Siet, S.; Ugli SI, R.; Lee, H. A Modern Recommendation System Survey in the Big Data Era. In Proceedings of the International Conference on Computer Science and Its Applications and the International Conference on Ubiquitous Information Technologies and Applications, Vientiane, Laos, 19–21 December 2022; Springer Nature: Singapore, 2023; pp. 577–582. [Google Scholar]
- Pazzani, M.J.; Billsus, D. Content-based recommendation systems. In The Adaptive Web: Methods and Strategies of Web Personalization; Springer: Berlin/Heidelberg, Germany, 2007; pp. 325–341. [Google Scholar]
- Koren, Y.; Rendle, S.; Bell, R. Advances in collaborative filtering. In Recommender Systems Handbook; Springer: New York, NY, USA, 2021; pp. 91–142. [Google Scholar]
- Sarwar, B.; Karypis, G.; Konstan, J.; Riedl, J. Item-based collaborative filtering recommendation algorithms. In Proceedings of the 10th International Conference on World Wide Web, Hong Kong, China, 1–5 May 2001; pp. 285–295. [Google Scholar]
- Takács, G.; Pilászy, I.; Németh, B.; Tikk, D. Scalable collaborative filtering approaches for large recommender systems. J. Mach. Learn. Res. 2009, 10, 623–656. [Google Scholar]
- Barathy, R.; Chitra, P. Applying matrix factorization in collaborative filtering recommender systems. In Proceedings of the 6th International Conference on Advanced Computing and Communication Systems (ICACCS), Coimbatore, India, 6–7 March 2020; pp. 635–639. [Google Scholar]
- Rendle, S.; Freudenthaler, C.; Gantner, Z.; Schmidt-Thieme, L. BPR: Bayesian personalized ranking from implicit feedback. arXiv 2012, arXiv:1205.2618. [Google Scholar]
- Sun, C.; Sun, G.; Ding, Z.; Liu, Q.; Ma, Z. A News Recommendation Algorithm Based on SVD and Improved K-means. In Proceedings of the 2021 International Conference on Networking, Communications and Information Technology (NetCIT), Manchester, UK, 26–27 December 2021; IEEE: New York, NY, USA, 2021; pp. 130–134. [Google Scholar]
- Patoulia, A.A.; Kiourtis, A.; Mavrogiorgou, A.; Kyriazis, D. A comparative study of collaborative filtering in product recommendation. Emerg. Sci. J. 2023, 7, 1–15. [Google Scholar] [CrossRef]
- Zhang, S.; Yao, L.; Sun, A.; Tay, Y. Deep learning based recommender system: A survey and new perspectives. ACM Comput. Surv. (CSUR) 2019, 52, 1–38. [Google Scholar] [CrossRef]
- Xinchang, K.; Vilakone, P.; Park, D.S. Movie recommendation algorithm using social network analysis to alleviate cold-start problem. J. Inf. Process. Syst. 2019, 15, 616–631. [Google Scholar]
- Jing, H. Application of Improved K-Means Algorithm in Collaborative Recommendation System. J. Appl. Math. 2022, 2022, 2213173. [Google Scholar] [CrossRef]
- Wang, K.; Zhang, T.; Xue, T.; Lu, Y.; Na, S.G. E-commerce personalized recommendation analysis by deeply-learned clustering. J. Vis. Commun. Image Represent. 2020, 71, 102735. [Google Scholar] [CrossRef]
- Chen, J.; Li, K.; Rong, H.; Bilal, K.; Yang, N.; Li, K. A disease diagnosis and treatment recommendation system based on big data mining and cloud computing. Inf. Sci. 2018, 435, 124–149. [Google Scholar] [CrossRef]
- Rendle, S.; Freudenthaler, C.; Schmidt-Thieme, L. Factorizing personalized markov chains for next-basket recommendation. In Proceedings of the 19th International Conference on World Wide Web, Raleigh, NC, USA, 26–30 April 2010; pp. 811–820. [Google Scholar]
- Tang, J.; Wang, K. Personalized top-n sequential recommendation via convolutional sequence embedding. In Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, Marina del Rey, CA, USA, 5–9 February 2018; pp. 565–573. [Google Scholar]
- Hidasi, B.; Karatzoglou, A.; Baltrunas, L.; Tikk, D. Session-based recommendations with recurrent neural networks. arXiv 2015, arXiv:1511.06939. [Google Scholar]
- Choe, B.; Kang, T.; Jung, K. Recommendation system with hierarchical recurrent neural network for long-term time series. IEEE Access 2021, 9, 72033–72039. [Google Scholar] [CrossRef]
- Duan, J.; Zhang, P.F.; Qiu, R.; Huang, Z. Long short-term enhanced memory for sequential recommendation. World Wide Web 2023, 26, 561–583. [Google Scholar] [CrossRef]
- Kang, W.C.; McAuley, J. Self-attentive sequential recommendation. In Proceedings of the IEEE International Conference on Data Mining (ICDM), Singapore, 17–20 November 2018; pp. 197–206. [Google Scholar]
- Yu, S.; Guo, M.; Chen, X.; Qiu, J.; Sun, J. Personalized Movie Recommendations Based on a Multi-Feature Attention Mechanism with Neural Networks. Mathematics 2023, 11, 1355. [Google Scholar] [CrossRef]
- Chen, Q.; Zhao, H.; Li, W.; Huang, P.; Ou, W. Behavior sequence transformer for e-commerce recommendation in alibaba. In Proceedings of the 1st International Workshop on Deep Learning Practice for High-Dimensional Sparse Data, Anchorage, AK, USA, 5 August 2019; pp. 1–4. [Google Scholar]
- Wang, D.; Zhang, X.; Yu, D.; Xu, G.; Deng, S. Came: Content-and context-aware music embedding for recommendation. IEEE Trans. Neural Netw. Learn. Syst. 2020, 32, 1375–1388. [Google Scholar] [CrossRef] [PubMed]
- Chen, Q.; Jiang, F.; Guo, X.; Chen, J.; Sha, K.; Wang, Y. Combine temporal information in session-based recommendation with graph neural networks. Expert Syst. Appl. 2024, 238, 121969. [Google Scholar] [CrossRef]
- Mavrogiorgos, K.; Kiourtis, A.; Mavrogiorgou, A.; Kleftakis, S.; Kyriazis, D. A multi-layer approach for data cleaning in the healthcare domain. In Proceedings of the 2022 8th International Conference on Computing and Data Engineering, Bangkok, Thailand, 11–13 January 2022; pp. 22–28. [Google Scholar]
- Eskandanian, F.; Mobasher, B.; Burke, R. A clustering approach for personalizing diversity in collaborative recommender systems. In Proceedings of the 25th Conference on User Modeling, Adaptation and Personalization, Bratislava, Slovakia, 9–12 July 2017; pp. 280–284. [Google Scholar]
- Saeed, M.; Mehrdad, M.; Farahnaz, H. Optimal Diversity of Recommendation List for Recommender Systems based on the Users’ Desire Diversity. J. Inf. Sci. Theory Pract. 2019, 7, 31–39. [Google Scholar]
- Movielens, GroupLens. Retrieved 31 January 2023. Available online: https://grouplens.org/datasets/movielens/ (accessed on 8 December 2021).
User ID | Movie ID in Sequence | Sequence Rating | Target Movie ID | Target Rating | ||||||
---|---|---|---|---|---|---|---|---|---|---|
User_1 | 2926 | 2915 | 2344 | 2968 | 2.0 | 3.0 | 3.0 | 4.0 | 2968 | 4.0 |
User_2 | 1307 | 2791 | 73 | 3269 | 3.0 | 5.0 | 1.0 | 5.0 | 3269 | 5.0 |
User_3 | 420 | 3688 | 838 | 2174 | 2.0 | 1.0 | 4.0 | 3.0 | 2174 | 3.0 |
User_4 | 1704 | 541 | 1219 | 3521 | 4.0 | 2.0 | 3.0 | 2.0 | 3521 | 2.0 |
User_5 | 44 | 1608 | 2064 | 3835 | 5.0 | 1.0 | 1.0 | 4.0 | 2064 | 4.0 |
Dataset | #User | #Item | #Rating | Sparsity |
---|---|---|---|---|
100 K | 943 | 1682 | 100,000 | 93.69% |
1 M | 6040 | 3883 | 1,000,209 | 95.73% |
Dataset | Metric | BPR | NeuFM | GRU4Rec | Caser | SASRec | Our Model | Improvement |
---|---|---|---|---|---|---|---|---|
Movielens 100 K | HitRat@5 | 0.4507 | 0.5610 | 0.3701 | 0.3595 | 0.3637 | 0.5676 | 1.17% |
HitRat@10 | 0.5801 | 0.6925 | 0.5260 | 0.5111 | 0.5419 | 0.6633 | −4.21% | |
NDCG@5 NDCG@10 | 0.1656 0.1588 | 0.2325 0.2314 | 0.2325 0.2826 | 0.2371 0.2861 | 0.2390 0.2965 | 0.3714 0.3783 | 55.39% 27.58% | |
MovieLeng 1 M | HitRat@5 | 0.4780 | 0.6752 | 0.5823 | 0.5889 | 0.5743 | 0.7034 | 4.17% |
HitRat@10 NDCG@5 NDCG@10 | 0.5927 0.1927 0.1770 | 0.5808 0.2323 0.2442 | 0.6866 0.4487 0.4826 | 0.7003 0.4463 0.4825 | 0.6861 0.4428 0.4791 | 0.7309 0.5869 0.6238 | 4.36% 30.80% 29.25% |
Dataset | RMSE | MAE | Precision | Recall | F1-Score | Item Coverage | Intra-List Diversity | |
---|---|---|---|---|---|---|---|---|
K-Means | Non K-Means | |||||||
100 K | 1.0756 | 0.8741 | 0.5516 | 0.3260 | 0.4098 | 0.1165 | 0.2447 | 0.2173 |
1 M | 0.9927 | 0.8007 | 0.5838 | 0.4723 | 0.5222 | 0.3216 | 0.3007 | 0.2878 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Siet, S.; Peng, S.; Ilkhomjon, S.; Kang, M.; Park, D.-S. Enhancing Sequence Movie Recommendation System Using Deep Learning and KMeans. Appl. Sci. 2024, 14, 2505. https://doi.org/10.3390/app14062505
Siet S, Peng S, Ilkhomjon S, Kang M, Park D-S. Enhancing Sequence Movie Recommendation System Using Deep Learning and KMeans. Applied Sciences. 2024; 14(6):2505. https://doi.org/10.3390/app14062505
Chicago/Turabian StyleSiet, Sophort, Sony Peng, Sadriddinov Ilkhomjon, Misun Kang, and Doo-Soon Park. 2024. "Enhancing Sequence Movie Recommendation System Using Deep Learning and KMeans" Applied Sciences 14, no. 6: 2505. https://doi.org/10.3390/app14062505
APA StyleSiet, S., Peng, S., Ilkhomjon, S., Kang, M., & Park, D. -S. (2024). Enhancing Sequence Movie Recommendation System Using Deep Learning and KMeans. Applied Sciences, 14(6), 2505. https://doi.org/10.3390/app14062505