Applying Machine Learning in Marketing: An Analysis Using the NMF and k-Means Algorithms
Abstract
:1. Introduction
2. Methodology
2.1. Data Collection Process: Literature Search Procedure
2.2. Data Management and Storage: Database Development and Record Storage
2.3. Data Preprocessing: Text Data Cleaning
2.4. Feature Extraction: Generation and Storage of Vector Embeddings
Listing 1. Python code for embedding model and encoding parameters. |
Listing 2. Python code for calculating tokens, filtering data, and generating embeddings. |
2.5. Topic Modeling Using Non-Negative Matrix Factorization
- Data preparation: We retrieved the cleaned text data from our Supabase database to ensure a robust analysis foundation. The first row of this database is shown in Table 5.
- Word count analysis and common word identification: The initial analysis focused on the distribution of word usage in the abstract, identifying the most frequently occurring words in the dataset to guide the topic modeling process. The mean word value for abstract paper is approximately 120.59, and the mean standard deviation is approximately , indicating a wide range of values. The 25th percentile (Q1) is 94 words, the average (Q2) is words, and the 75th percentile (Q3) is words. The top words by frequency, among all papers after processing the text, are ‘market’, ‘learn’, ‘data’, ‘machine’, and ‘use’ (Figure 4).
- Vectorization of text: The NMF is an unsupervised technique, so there is no labeling of the topics in which the model is trained. It decomposes (or factorizes) high-dimensional vectors into low-dimensional representations. These lower-dimensional vectors are not negative, which means that their coefficients are also not negative. In our example, high-dimensional vectors are tf-idf weights [16], but they can be anything, including word vectors or a simple number of raw words. By converting the abstract text of each document into numerical form, we can use it to create functions (Listing 3). After processing the documents, we have a little more than 1553 unique words, so we set max-features to include only the top 750 terms per term frequency in all documents to further reduce the features.
- Model fitting and topic extraction: The only parameter required is the number of components, i.e., the number of topics that we want (Listing 4). From this point on, the model is run to obtain the topics. Table 6 shows the three topics extracted with the NMF algorithm.
- Topic assignment and dataset compilation: Documents were systematically assigned to the extracted topics and a comprehensive dataset encapsulating these associations was compiled.
- Output documentation: We meticulously documented the results of the analysis for subsequent review and reference, ensuring that findings were accessible and well organized.
Listing 3. Python code for TF-IDF vectorization of text data. |
Listing 4. Python code for fitting NMF model. |
2.6. Comparative Analysis Using K-Means Clustering
- Data retrieval and preparation: Initially, we accessed the vector embeddings stored in the Supabase database, preparing them for subsequent analysis (Listing 5).
- Matrix formation and k-means clustering: A matrix was constructed from these embeddings, upon which k-means clustering was applied. In order to construct the matrix, string embeddings were transformed into numpy arrays (Listing 6).
- Cluster assignment and analysis output: Documents were systematically assigned to their respective clusters. The clustering results were then summarized to reflect the natural groupings and their characteristics (Listing 7).
- Data export: The dataset, inclusive of cluster assignments, was exported for extended analysis and future reference (Listing 8).
Listing 5. Python code for creating a Supabase client and loading data. |
Listing 6. Python code for converting string embeddings to numpy arrays. |
Listing 7. Python code for KMeans clustering. |
Listing 8. Python code for printing KMeans clustering results and saving to CSV. |
2.7. Evaluation of Topic Modeling and Clustering Results
3. Cluster 1: Machine Learning Algorithms in Marketing
4. Cluster 2: Application of Big Data and ML in Marketing and Consumer Behavior
- Social networks: Big data helps gather real-time consumer info. ML is used to analyze reviews and spot fake ones or unusual patterns.
- Understanding consumer behavior: Big data and fuzzy support vector machines show what consumers prefer when searching for info or comparing products. This is handy for launching new products where people need to pick what suits them best. Precision marketing, with big data, targets customers accurately with tailored offers [21].
- Advertising: Big data suggests TV/Internet content and predicts what people will watch and buy, making ads more accurate. It also helps in creative decision making and predicts sales from text, audio, and video data.
- Targeted digital promotions: Companies boost email campaigns by setting clear goals, personalizing content, and tracking engagement [22].
- Direct marketing: Predicting customer behavior helps tailor messages and improve campaign effectiveness [23]. ML helps companies understand customers better and run more targeted campaigns.
- Customer experience: Technology enhances marketing value and creates better customer experiences. Meeting customer needs and expectations requires using technology to understand behavior and assess marketing impact. Big data aids in decision making and understanding consumer behavior in real time.
- E-commerce: By using machine learning models to search for similar items on competitive online platforms, sellers can easily compare prices and offers to optimize their own marketing strategy. This can help sellers attract new customers and increase sales by providing better value propositions [24].
- Dynamic prices: The potential of automated pricing information and targeting marketing messages [25].
- Brand personality: Applied to consumer data, firms can assess how consumers perceive brand personality and study the effects of brand–consumer congruence in personality on social media [26].
5. Cluster 3: Machine Learning Applied to Digital Marketing
6. Future Trends
7. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Eid, R.; El-Gohary, H. The impact of E-marketing use on small business enterprises’ marketing success. Serv. Ind. J. 2013, 33, 31–50. [Google Scholar] [CrossRef]
- Ye, G.; Hudders, L.; De Jans, S.; De Veirman, M. The value of influencer marketing for business: A bibliometric analysis and managerial implications. J. Advert. 2021, 50, 160–178. [Google Scholar] [CrossRef]
- Ngai, E.W.; Wu, Y. Machine learning in marketing: A literature review, conceptual framework, and research agenda. J. Bus. Res. 2022, 145, 35–48. [Google Scholar] [CrossRef]
- Herhausen, D.; Bernritter, S.F.; Ngai, E.W.; Kumar, A.; Delen, D. Machine learning in marketing: Recent progress and future research directions. J. Bus. Res. 2024, 170, 114254. [Google Scholar] [CrossRef]
- Akter, S.; Dwivedi, Y.K.; Sajib, S.; Biswas, K.; Bandara, R.J.; Michael, K. Algorithmic bias in machine learning-based marketing models. J. Bus. Res. 2022, 144, 201–216. [Google Scholar] [CrossRef]
- Volkmar, G.; Fischer, P.M.; Reinecke, S. Artificial intelligence and machine learning: Exploring drivers, barriers, and future developments in marketing management. J. Bus. Res. 2022, 149, 599–614. [Google Scholar] [CrossRef]
- Yazıcı, G.; Ozansoy Çadırcı, T. Creating meaningful insights from customer reviews: A methodological comparison of topic modeling algorithms and their use in marketing research. J. Mark. Anal. 2023, 1–23. [Google Scholar] [CrossRef]
- Sterne, J. Artificial Intelligence for Marketing: Practical Applications; John Wiley & Sons: Weinheim, Germany, 2017. [Google Scholar]
- Zhang, L.; Gao, Q.; Li, T. Dynamic adaptation method of business process based on hierarchical feature model. Information 2021, 12, 362. [Google Scholar] [CrossRef]
- Arasu, B.S.; Seelan, B.J.B.; Thamaraiselvan, N. A machine learning-based approach to enhancing social media marketing. Comput. Electr. Eng. 2020, 86, 106723. [Google Scholar] [CrossRef]
- Bonab, H.; Aliannejadi, M.; Vardasbi, A.; Kanoulas, E.; Allan, J. Cross-market product recommendation. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management, Queensland, Australia, 1–5 November 2021; pp. 110–119. [Google Scholar]
- Tirunillai, S.; Tellis, G.J. Mining marketing meaning from online chatter: Strategic brand analysis of big data using latent dirichlet allocation. J. Mark. Res. 2014, 51, 463–479. [Google Scholar] [CrossRef]
- Kalinić, Z.; Marinković, V.; Kalinić, L.; Liébana-Cabanillas, F. Neural network modeling of consumer satisfaction in mobile commerce: An empirical analysis. Expert Syst. Appl. 2021, 175, 114803. [Google Scholar] [CrossRef]
- Brei, V.A. Machine learning in marketing: Overview, learning strategies, applications, and future developments. Found. Trends® Mark. 2020, 14, 173–236. [Google Scholar] [CrossRef]
- Guo, Y.T.; Li, Q.Q.; Liang, C.S. The rise of nonnegative matrix factorization: Algorithms and applications. Inf. Syst. 2024, 123, 102379. [Google Scholar] [CrossRef]
- Kanakogi, K.; Washizaki, H.; Fukazawa, Y.; Ogata, S.; Okubo, T.; Kato, T.; Kanuka, H.; Hazeyama, A.; Yoshioka, N. Tracing cve vulnerability information to capec attack patterns using natural language processing techniques. Information 2021, 12, 298. [Google Scholar] [CrossRef]
- El Koufi, N.; Belangour, A.; Sdiq, M. Research on precision marketing based on big data analysis and machine learning: Case study of Morocco. Int. J. Adv. Comput. Sci. Appl. 2022, 13, 58–63. [Google Scholar]
- Saengthongrattanachot, W.; Na-udom, A.; Rungrattanaubol, J. A comparison of machine learning techniques for classification in bank marketing data. Thai J. Math. 2022, 20, 157–168. [Google Scholar]
- Zhang, M. Research on precision marketing based on consumer portrait from the perspective of machine learning. Wirel. Commun. Mob. Comput. 2022, 2022, 9408690. [Google Scholar] [CrossRef]
- Sarker, I.H. Machine learning: Algorithms, real-world applications and research directions. SN Comput. Sci. 2021, 2, 160. [Google Scholar] [CrossRef]
- Miklosik, A.; Evans, N. Impact of big data and machine learning on digital transformation in marketing: A literature review. IEEE Access 2020, 8, 101284–101292. [Google Scholar] [CrossRef]
- Ellickson, P.B.; Kar, W.; Reeder III, J.C. Estimating marketing component effects: Double machine learning from targeted digital promotions. Mark. Sci. 2023, 42, 704–728. [Google Scholar] [CrossRef]
- Choi, Y.; Choi, J.W. Assessing the predictive performance of machine learning in direct marketing response. Int. J.-Bus. Res. (IJEBR) 2023, 19, 1–12. [Google Scholar]
- Cherednichenko, O.; Ivashchenko, O.; Cibák, L.; Lincenyi, M. Item Matching Model in E-Commerce: How Users Benefit. Econ. Cult. 2023, 20, 77–90. [Google Scholar] [CrossRef]
- Thirumuruganathan, S.; Al Emadi, N.; Jung, S.g.; Salminen, J.; Robillos, D.R.; Jansen, B.J. Will they take this offer? A machine learning price elasticity model for predicting upselling acceptance of premium airline seating. Inf. Manag. 2023, 60, 103759. [Google Scholar] [CrossRef]
- Pamuksuz, U.; Yun, J.T.; Humphreys, A. A brand-new look at you: Predicting brand personality in social media networks with machine learning. J. Interact. Mark. 2021, 56, 1–15. [Google Scholar] [CrossRef]
- Kim, K.; Ahn, S.J. Rewards that undermine customer loyalty? A motivational approach to loyalty programs. Psychol. Mark. 2017, 34, 842–852. [Google Scholar] [CrossRef]
- Palomo-Domínguez, I. Del mito a la viralidad. El caso de la campaña de Cruzcampo que resucitó a Lola Flores. aDResearch Rev. Int. Investig. Comun. 2021, 26, 38–58. [Google Scholar]
- Ip, K. Revolutionising content recommendation: The impact of AI in marketing. J. AI Robot. Workplace Autom. 2023, 2, 382–389. [Google Scholar]
- Sankar, J.G. AI-driven marketing success stories: A case note of industry pioneers. In AI-Driven Marketing Research and Data Analytics; IGI Global: Hershey, PA, USA, 2024; pp. 48–66. [Google Scholar]
- Chen, Y. Comparing content marketing strategies of digital brands using machine learning. Humanit. Soc. Sci. Commun. 2023, 10, 57. [Google Scholar] [CrossRef]
- Boddu, R.S.K.; Santoki, A.A.; Khurana, S.; Koli, P.V.; Rai, R.; Agrawal, A. An analysis to understand the role of machine learning, robotics and artificial intelligence in digital marketing. Mater. Today Proc. 2022, 56, 2288–2292. [Google Scholar] [CrossRef]
- Mao, S.; Huang, R. Complexity construction of intelligent marketing strategy based on mobile computing and machine learning simulation environment. Complexity 2021, 2021, 9910834. [Google Scholar] [CrossRef]
- Salminen, J.; Yoganathan, V.; Corporan, J.; Jansen, B.J.; Jung, S.G. Machine learning approach to auto-tagging online content for content marketing efficiency: A comparative analysis between methods and content type. J. Bus. Res. 2019, 101, 203–217. [Google Scholar] [CrossRef]
- Chintalapati, S.; Pandey, S.K. Artificial intelligence in marketing: A systematic literature review. Int. J. Mark. Res. 2022, 64, 38–68. [Google Scholar] [CrossRef]
- Haleem, A.; Javaid, M.; Qadri, M.A.; Singh, R.P.; Suman, R. Artificial intelligence (AI) applications for marketing: A literature-based study. Int. J. Intell. Netw. 2022, 3, 119–132. [Google Scholar] [CrossRef]
- De Mauro, A.; Sestino, A.; Bacconi, A. Machine learning and artificial intelligence use in marketing: A general taxonomy. Ital. J. Mark. 2022, 2022, 439–457. [Google Scholar] [CrossRef]
- Guidoum, S.; Saadi, E. Digital marketing in the age of AI: View global experiences. Econ. Stud. 2024, 18, 141–151. [Google Scholar]
- Kaushal, S.; Mishra, D. Strategic implications of AI in contemporary business and Society. In Proceedings of the 2024 2nd International Conference on Disruptive Technologies (ICDT), Greater Noida, India, 15–16 March 2024; pp. 1469–1474. [Google Scholar]
- Kumar, V.; Rajan, B.; Venkatesan, R.; Lecinski, J. Understanding the role of artificial intelligence in personalized engagement marketing. Calif. Manag. Rev. 2019, 61, 135–155. [Google Scholar] [CrossRef]
- Kumar, D. The Use of Artificial Intelligence in Data Analysis and Business Intelligence. In AI and Emotional Intelligence for Modern Business Management; IGI Global: Hershey, PA, USA, 2023; pp. 311–320. [Google Scholar]
- Vidhya, V.; Donthu, S.; Veeran, L.; Lakshmi, Y.S.; Yadav, B. The intersection of AI and consumer behavior: Predictive models in modern marketing. Remit. Rev. 2023, 8, 2410–2424. [Google Scholar]
- Surendro, K. Predictive analytics for predicting customer behavior. In Proceedings of the 2019 International Conference of Artificial Intelligence and Information Technology (ICAIIT), Yogyakarta, Indonesia, 13–15 March 2019; pp. 230–233. [Google Scholar]
- Soni, V. Adopting generative AI in digital marketing campaigns: An empirical Study of drivers and barriers. Sage Sci. Rev. Appl. Mach. Learn. 2023, 6, 1–15. [Google Scholar]
- Zhang, S.; Liao, P.; Ye, H.Q.; Zhou, Z. Dynamic marketing resource allocation with two-stage decisions. J. Theor. Appl. Electron. Commer. Res. 2022, 17, 327–344. [Google Scholar] [CrossRef]
- Gołąb-Andrzejak, E. The impact of generative AI and ChatGPT on creating digital advertising campaigns. Cybern. Syst. 2023, 1–14. [Google Scholar] [CrossRef]
- Klaus, P.; Zaichkowsky, J. AI voice bots: A services marketing research agenda. J. Serv. Mark. 2020, 34, 389–398. [Google Scholar] [CrossRef]
- Skrebeca, J.; Kalniete, P.; Goldbergs, J.; Pitkevica, L.; Tihomirova, D.; Romanovs, A. Modern development trends of chatbots using artificial intelligence (AI). In Proceedings of the 2021 62nd International Scientific Conference on Information Technology and Management Science of Riga Technical University (ITMS), Riga, Latvia, 14–15 October 2021; pp. 1–6. [Google Scholar]
- Runaite, D. How Will Voice Search Optimisation Aid or Limit Digital Marketing? An End-User Perspective. Ph.D. Thesis, National College of Ireland, Dublin, Ireland, 2021. [Google Scholar]
- Israfilzade, K.; Sadili, N. Beyond interaction: Generative AI in conversational marketing-foundations, developments, and future directions. J. Life Econ. 2024, 11, 13–29. [Google Scholar] [CrossRef]
- Peres, R.; Schreier, M.; Schweidel, D.A.; Sorescu, A. Blockchain meets marketing: Opportunities, threats, and avenues for future research. Int. J. Res. Mark. 2023, 40, 1–11. [Google Scholar] [CrossRef]
- Eng, S. The new realities in digital marketing: Virtual reality, augmented reality, and extended reality. In The SAGE Handbook of Digital Marketing; SAGE Publications: New York, NY, USA, 2022; p. 276. [Google Scholar]
- Sehgal, P.; Kumar, B.; Sharma, M.; Salameh, A.A.; Kumar, S.; Asha, P. Role of IoT in transformation of marketing: A quantitative study of opportunities and challenges. Webology 2022, 18, 5838–5849. [Google Scholar]
- Baykal, B. Connected Consumers: Unleashing the Power of IoT for Personalized Marketing. In Academic Research and Evaluations in Social Sciences—IV; Bozgeyik, Y., Ed.; Özgür Publications: Gaziantep, Turkey, 2023; p. 69. [Google Scholar]
Title | Title Cleaned |
---|---|
Machine Learning in Marketing | machin learn market |
Algorithmic bias in machine learning-based marketing models | algorithm bias machin learning-bas market model |
Marketing analytics stages: Demystifying and deploying machine learning | market analyt stage demystifi deploy machin learn |
Abstract | Abstract Cleaned |
---|---|
The widespread impacts of artificial intelligence (AI) and machine learning (ML) in many segments of society have not yet been felt strongly in the marketing field. Despite such shortfall, ML offers a variety of potential benefits, including the opportunity to apply more robust methods for the generalization of scientific discoveries. Trying to reduce this shortfall, this monograph has four goals. First, to provide marketing… | widespread impact artifici intellig machin learn ml mani segment societi felt strong market field despit shortfal ml offer varieti potenti benefit includ opportun appli robust method general scientif discoveri tri reduc shortfal monograph goal provid market… |
Title Cleaned | Embedding Title |
---|---|
machin learn market | , , , …, The vector dimension is 1536. |
Abstract Cleaned | Embedding Abstract |
---|---|
widespread impact artifici intellig machin learn ml mani segment societi felt strong market field despit shortfal ml offer varieti potenti benefit includ opportun appli robust method general scientif discoveri tri reduc shortfal monograph goal provid market… | , , , …, The vector dimension is 1536 |
Year | Cited | Title | Abstract |
---|---|---|---|
2022 | 20 | algorithm bias machine learning-based market model | articl introduc algorithm bias machin learn ml base market model dramat growth algorithm decis make continu gain momentum market research stream inadequ despit devast asymmetr oppress impact algorithm bias various custom group void studi present framework identifi sourc algorithm bias market draw microfound dynam capabl use systemat literatur review indepth interview ml profession studi primari dimens design bias contextu bias applic bias correspond subdimens model data method cultur social person product price place promot synthes divers perspect use theori practic propos framework build dynam algorithm manag capabl tackl algorithm bias ml-base market decis make |
No. | Topics |
---|---|
1 | data brand media market strategi custom develop onlin social busi base analysi environ new behavior use compani analyz platform inform |
2 | ml learn research review tool intellig applic method analyt machin mak decision focus manag artifici adopt potenti |
3 | model algorithm data use consum predict method respons custom research machin learn accuraci databas direct campaign network classif decis perform |
Abstract | Topic No. | Topic |
---|---|---|
The widespread impacts of artificial intelligence (AI) and machine learning (ML) in many segments of society have not yet been felt strongly in the marketing field. Despite such shortfall, ML offers a variety of potential benefits, … | 2 | ml learn research review tool intellig applic method analyt machin mak decision focus manag artifici adopt potenti |
This article introduces algorithmic bias in machine learning (ML) based marketing models. Although the dramatic growth of algorithmic decision making continues to gain momentum in marketing, research in this stream is still inadequate despite the devastating, asymmetric and oppressive impacts of algorithmic bias on various customer groups… | 3 | model algorithm data use consum predict method respons custom research machin learn accuraci databas direct campaign network classif decis perform |
Company Name | Industry | Objective | Strategy | Results | Source |
---|---|---|---|---|---|
Starbucks | Retail | Increase customer loyalty and frequency of store visits | Use ML in their Starbucks Rewards mobile app to analyze purchasing history and customer preferences in order to provide tailored marketing offers and product recommendations | The company achieved a 16% year-over-year growth in its loyalty program participation | Kim and Ahn [27] |
Cruzcampo | Brewing | Strengthen Cruzcampo’s identity and connect with more people by tapping into the nostalgia and national pride linked to iconic Spanish artist Lola Flores | Use advanced AI and ML to digitally recreate Lola Flores’ voice and image for the ’Acento’ campaign | Throughout 2021, the campaign generated 614 million impressions, representing the number of times it reached viewers either on television or through social media | Palomo-Domínguez [28] |
Netflix | Entertainment | Maximize viewer engagement and reduce churn by personalizing content recommendations | Use ML to analyze viewer data and preferences to enhance recommendation algorithms based on user viewing patterns and habits | Higher viewer satisfaction rate and retention, reducing churn by 75% and saving Netflix approximately USD 1 billion annually in customer retention | Ip [29] |
Nike | Sportswear | Increase brand loyalty and customer engagement through innovative marketing campaigns | Implement ML and AI to analyze consumer behavior, tailor marketing content, and boost campaign effectiveness with innovative personalized designs | Customer engagement increased by 20% and social media followers by 30% across all platforms through precise targeting and personalized interactions | Sankar [30] |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Gallego, V.; Lingan, J.; Freixes, A.; Juan, A.A.; Osorio, C. Applying Machine Learning in Marketing: An Analysis Using the NMF and k-Means Algorithms. Information 2024, 15, 368. https://doi.org/10.3390/info15070368
Gallego V, Lingan J, Freixes A, Juan AA, Osorio C. Applying Machine Learning in Marketing: An Analysis Using the NMF and k-Means Algorithms. Information. 2024; 15(7):368. https://doi.org/10.3390/info15070368
Chicago/Turabian StyleGallego, Victor, Jessica Lingan, Alfons Freixes, Angel A. Juan, and Celia Osorio. 2024. "Applying Machine Learning in Marketing: An Analysis Using the NMF and k-Means Algorithms" Information 15, no. 7: 368. https://doi.org/10.3390/info15070368
APA StyleGallego, V., Lingan, J., Freixes, A., Juan, A. A., & Osorio, C. (2024). Applying Machine Learning in Marketing: An Analysis Using the NMF and k-Means Algorithms. Information, 15(7), 368. https://doi.org/10.3390/info15070368