1. Introduction
The escalating threat of dementia in aging societies poses a significant social and economic burden. As the global population ages, the number of people with dementia continues to increase. Dementia is a common geriatric condition affecting approximately 10% of individuals aged 65 years and above [
1]. It denotes a state characterized not only by a decline in cognitive function but also by impairments in language, intelligence, concentration, and judgment abilities, indicating anomalies in perceptual skills. Once dementia develops, it is not reversible and tends to either remain stable or progressively worsen [
2]. The disease places a substantial burden not only on the patients themselves but also on their family members, necessitating proactive national responses [
3]. Despite the various types and symptoms of dementia, its underlying mechanisms remain unclear, and no definitive treatments are currently available. Consequently, early diagnosis of dementia is crucial and increasingly emphasized [
4,
5].
Globally, the number of individuals diagnosed with dementia continues to rise steadily. According to the World Health Organization (WHO), it is projected that by 2050, the number of dementia patients will reach approximately 152.8 million, which is nearly three times the current figure [
6]. The economic burden of dementia is significant; in 2019, the global cost for 55.2 million individuals with dementia was estimated at USD 1.313, equating to an average cost of USD 23,796 [
7]. This financial burden is expected to increase alongside the aging population, highlighting the critical need for the establishment of early diagnosis systems for dementia.
Early diagnosis is the most effective method for managing dementia, allowing for interventions to delay its progression to severe stages. Traditionally, the diagnosis of dementia relies on the clinical expertise of physicians and often involves expensive neuroimaging assessments [
8]. This makes the early diagnosis of dementia difficult and inaccessible for many individuals. National-level initiatives, such as the Mini-Mental State Examination for Dementia Screening conducted at public health centers, aim to address this issue [
9]. However, delays in acknowledging symptoms often prevent timely visits to health centers for diagnosis.
If dementia is not diagnosed early, patients may miss the critical treatment window, losing a valuable opportunity to slow the progression of the disease. As previously mentioned, dementia is a progressive disorder that worsens over time. Without appropriate treatment and management during the early stages, the disease can quickly advance to severe stages [
2]. Failure to diagnose early means that patients may not receive the necessary treatment until the disease has significantly progressed, leading to accelerated functional decline and an increased risk of losing independence in daily activities. Additionally, in the early stages of dementia, many patients either do not notice the decline in cognitive function or dismiss mild symptoms, resulting in a diagnosis only after the disease has advanced considerably. This situation often increases the psychological and financial burden on both the patients and their families, significantly diminishing the quality of life for those affected by dementia. Therefore, early detection and proactive treatment are essential for maintaining patient functionality and improving quality of life.
Traditional dementia diagnosis methods face two main challenges: the need for individuals to visit diagnostic facilities such as hospitals or health centers, and the reliance on expensive equipment [
10]. Which solutions address these issues? A potential solution for early diagnosis is to record data generated from daily life activities utilizing equipment that is readily available, thereby overcoming reliance on expensive diagnostic tools [
11].
Lifelog data generated through Internet of Things (IoT) technologies, such as wearable devices and mobile equipment, provides a comprehensive record of an individual’s daily life activities. Although lifelog data includes information recorded on social networking sites, their most noteworthy application is in the healthcare industry [
12]. In healthcare, notable lifelog data include activity levels, sleep information, dietary habits, weight fluctuations, body mass index, and muscle mass data collected from smartphones and wearable devices [
13]. The healthcare industry aims to leverage these data to address weaknesses in medical management and provide continuously usable services.
To utilize health lifelog data in real time, the application of artificial intelligence (AI) technology, capable of classifying large volumes of data and deriving meaningful results in real time, is essential. AI technology can serve as a critical decision-making tool for the early diagnosis of dementia. Various studies have demonstrated the extensive use of AI in real-time data collection and analysis [
14]. These studies highlight the important role of AI technology in real-time data analysis and decision making. In a study utilizing lifelog data to predict diabetes and cardiovascular diseases, machine learning models demonstrated a precision of 97.1% and a recall of 96.2%, thereby validating the effectiveness of early diagnosis through lifelog data analysis [
15]. Building on these findings, digital healthcare platforms that leverage lifelog data have continued to evolve, collecting and automatically analyzing individual health data to offer personalized health management. These platforms employ AI-based deep learning modules to perform real-time analyses, making them highly effective tools for managing chronic diseases [
16]. Therefore, the application of AI technology to the real-time analysis of health lifelog data is considered a valid approach.
Dementia diagnosis is a complex process that requires specialized knowledge of various conditions and scenarios. However, based on the results of prior research, implementing an AI-based early dementia diagnosis prediction system by integrating health lifelog data with AI technology appears feasible. Therefore, in this study, we aim to develop an AI-based predictive algorithm that enables early dementia diagnosis using health lifelog data, serving as preliminary research for building an AI-based diagnostic system. The results of this study are expected to facilitate early dementia diagnosis, enabling appropriate and timely treatment, thereby slowing disease progression and improving patients’ quality of life.
4. Discussion
This study aimed to develop an algorithm for the early diagnosis of high-risk dementia groups among pre-older adult individuals using AI. By leveraging health lifelog data that are easily accessible in everyday life, this study suggests a significant potential to reduce reliance on expensive medical equipment and specialized medical professionals required by traditional diagnostic methods. This approach could be especially applicable in areas with limited access to healthcare or in economically disadvantaged environments, offering the potential to improve public health quality. The key findings of the study are summarized as follows.
First, participants’ cognitive function in this study was categorized into three groups: CN (111), MCI (51), and Dem (12). This led to an imbalance in the data between groups. Although we attempted to address this issue, it did not resolve the imbalance between MCI and Dem or the gender distribution. Consequently, patients with MCI and Dem were combined and labeled as the high-risk dementia group. Data imbalance is a common limitation in dementia prediction studies. For instance, in previous studies, the Synthetic Minority Over-sampling Technique (SMOTE) was proposed to artificially augment data for minority groups in machine learning-based research [
31]. This technique helps balance datasets by generating examples of the minority class.
In healthcare-related research, data augmentation has been employed to address the issue of insufficient sample sizes, such as in studies focused on classifying human body types using deep learning techniques [
32]. Additionally, efforts to build AI systems aimed at preventing doping among athletes have also utilized data augmentation to compensate for limited sample sizes [
33]. Following these examples from various previous studies, this research applied data augmentation to tackle the problem of data imbalance. However, in the long term, future research will need to employ more precise classification techniques and expand data collection across diverse populations.
Second, this study aimed to develop an algorithm for the early diagnosis of high-risk dementia groups using original lifelog data. Although the algorithm achieved a maximum accuracy of 0.879, the sensitivity for correctly classifying actual dementia cases was low at 0.429. This raises questions about whether this algorithm is optimal for classifying dementia, which could be a topic for discussion among researchers. However, the accurate and quick prediction of patients with dementia is crucial for its management and treatment. Precise and rapid diagnosis plays a decisive role in establishing appropriate treatment and management plans, which are essential for maintaining a patient’s quality of life and slowing the progression of dementia.
Using lifelog data collected from daily life for proactive early diagnoses of dementia would considerably aid the initiation of appropriate treatment at an early stage. For example, in previous studies, a machine learning-based system using lifelog data has detected abnormal behaviors in dementia patients. This system demonstrated the potential to monitor patient conditions in real time and identify issues early [
34]. This study underscores the importance of early diagnosis and continuous monitoring in dementia management and suggests how technological approaches can improve patient care. The use of lifelog data has expanded in various fields, not only in dementia research. These data are increasingly being applied in the context of early and proactive diagnosis, where speed is often prioritized over precision. In this regard, low-cost wearable devices play a crucial role, serving as important tools for rapid data collection and analysis. The data collection device used in this study aligns with this approach. For instance, in previous research, the authors developed a low-cost, autonomous wearable device designed to track Alzheimer’s patients. The device uses GPS and geofencing technology to monitor the patient’s location in real time and sends alerts when the patient exits a designated safe zone [
35]. Such low-cost devices help alleviate the burden on patients and their families and can be effectively utilized in regions with limited access to healthcare.
Third, we addressed the data imbalance issue by performing data augmentation. Numerous studies have proposed data augmentation as a solution to data imbalance problems [
32]. The results of this study support previous findings, showing improved performance in sensitivity after data augmentation, indicating that the ability to classify actual dementia cases as dementia improved. An increase in sensitivity implies a better identification of actual dementia cases, potentially leading to a more accurate diagnosis. Therefore, we hope that future studies will continue to explore various techniques to enhance sensitivity. Furthermore, future studies should investigate how these techniques can be applied in clinical settings to contribute to high-quality research capable of early dementia prediction.
5. Conclusions
This study aimed to develop a predictive algorithm using AI technology for the early diagnosis of high-risk dementia groups among pre-older adult individuals. Early diagnostic methods that utilize health lifelog data aim to overcome the limitations of traditional diagnostic methods. The results suggest that effectively utilizing lifelog data, which can be easily collected from daily life, not only enhances the accessibility of dementia diagnosis and enables the efficient use of medical resources but also plays a crucial role in delaying the progression of dementia.
This study focused on improving model accuracy and sensitivity by applying data augmentation techniques to overcome the limitations of previous studies, such as data imbalance issues. The improvement in sensitivity after data augmentation can enhance the reliability of AI-based diagnostic systems. Nevertheless, future research should address the issue of data imbalance, as well as efforts to improve sensitivity.
Finally, the approach used in this study suggests the potential for application not only in dementia diagnosis but also in the early diagnosis of various health conditions. The integration of AI and healthcare technology could lead to more precise and personalized medical services, further improving the quality of public health. Future research should aim to enhance the model’s predictive power using more diverse data and advanced analytical techniques and explore its applicability in real clinical settings.