Next Article in Journal
Accompanying Hemoglobin Polymerization in Red Blood Cells in Patients with Sickle Cell Disease Using Fluorescence Lifetime Imaging
Previous Article in Journal
Collateral Sensitivity to β-Lactam Antibiotics in Evolved Apramycin-Resistant MRSA
Previous Article in Special Issue
Mulberry Leaf Compounds and Gut Microbiota in Alzheimer’s Disease and Diabetes: A Study Using Network Pharmacology, Molecular Dynamics Simulation, and Cellular Assays
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Exosomal mRNA Signatures as Predictive Biomarkers for Risk and Age of Onset in Alzheimer’s Disease

by
Daniel A. Bolívar
1,†,
María I. Mosquera-Heredia
2,†,
Oscar M. Vidal
2,
Ernesto Barceló
3,4,5,
Ricardo Allegri
6,
Luis C. Morales
2,
Carlos Silvera-Redondo
2,
Mauricio Arcos-Burgos
7,
Pilar Garavito-Galofre
2,*,‡ and
Jorge I. Vélez
1,*,‡
1
Department of Industrial Engineering, Universidad del Norte, Barranquilla 081007, Colombia
2
Department of Medicine, Universidad del Norte, Barranquilla 081007, Colombia
3
Instituto Colombiano de Neuropedagogía, Barranquilla 080020, Colombia
4
Department of Health Sciences, Universidad de La Costa, Barranquilla 080002, Colombia
5
Grupo Internacional de Investigación Neuro-Conductual (GIINCO), Universidad de La Costa, Barranquilla 080002, Colombia
6
Institute for Neurological Research FLENI, Montañeses 2325, Buenos Aires C1428AQK, Argentina
7
Grupo de Investigación en Psiquiatría (GIPSI), Departamento de Psiquiatría, Instituto de Investigaciones Médicas, Facultad de Medicina, Universidad de Antioquia, Medellín 050010, Colombia
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
These authors also contributed equally to this work.
Int. J. Mol. Sci. 2024, 25(22), 12293; https://doi.org/10.3390/ijms252212293
Submission received: 11 September 2024 / Revised: 7 November 2024 / Accepted: 11 November 2024 / Published: 15 November 2024
(This article belongs to the Special Issue Molecular Advances in Alzheimer’s Disease 3.0)

Abstract

:
Alzheimer’s disease (AD) is a neurodegenerative disorder characterized by progressive cognitive decline and memory loss. While the precise causes of AD remain unclear, emerging evidence suggests that messenger RNA (mRNA) dysregulation contributes to AD pathology and risk. This study examined exosomal mRNA expression profiles of 15 individuals diagnosed with AD and 15 healthy controls from Barranquilla, Colombia. Utilizing advanced bioinformatics and machine learning (ML) techniques, we identified differentially expressed mRNAs and assessed their predictive power for AD diagnosis and AD age of onset (ADAOO). Our results showed that ENST00000331581 (CADM1) and ENST00000382258 (TNFRSF19) were significantly upregulated in AD patients. Key predictors for AD diagnosis included ENST00000311550 (GABRB3), ENST00000278765 (GGTLC1), ENST00000331581 (CADM1), ENST00000372572 (FOXJ3), and ENST00000636358 (ACY1), achieving > 90% accuracy in both training and testing datasets. For ADAOO, ENST00000340552 (LIMK2) expression correlated with a delay of ~12.6 years, while ENST00000304677 (RNASE6), ENST00000640218 (HNRNPU), ENST00000602017 (PPP5D1), ENST00000224950 (STN1), and ENST00000322088 (PPP2R1A) emerged as the most important predictors. ENST00000304677 (RNASE6) and ENST00000602017 (PPP5D1) showed promising predictive accuracy in unseen data. These findings suggest that mRNA expression profiles may serve as effective biomarkers for AD diagnosis and ADAOO, providing a cost-efficient and minimally invasive tool for early detection and monitoring. Further research is needed to validate these results in larger, diverse cohorts and explore the biological roles of the identified mRNAs in AD pathogenesis.

1. Introduction

Alzheimer’s disease (AD), the most common form of dementia [1], originates from a combination of genetic, environmental, and lifestyle factors that contribute to the accumulation of amyloid-beta (Aβ) plaques and hyperphosphorylated tau tangles in the brain [2,3,4,5]. Although aging is the primary risk factor of late-onset AD (>65 y/o) [3], alleles harbored in major and minor effect genes play a significant role in shaping the architecture of AD etiology [5,6]. Currently, AD diagnosis involves a combination of cognitive assessments, brain imaging, and biomarker analysis. However, early detection of AD remains elusive due to the subtle nature of early symptoms [5,7,8,9,10,11,12,13,14].
Messenger RNA (mRNA) transcripts are single-stranded RNA molecules that serve as intermediates between the genetic information encoded in DNA and the synthesis of proteins via translation. Analysis of brain mRNA expression has allowed researchers to identify differences between individuals with AD and healthy controls, as well as genes actively involved in AD development and progression [15,16]. This provides valuable insights into the molecular pathways and cellular processes that are dysregulated in the disease [15,17].
One significant breakthrough in AD detection has been the identification of the SRSF1 and PTBP1 proteins in regulating AD-related genes [18]. These proteins act as splicing factors, influencing the production of specific isoforms of the CD33 gene, which is associated with AD. Other studies linked mRNA expression for specific genes, such as the acetylcholinesterase (ACHE) gene, proposed as a potential biomarker for diagnosing AD and related conditions [19]. This association is crucial as it links mRNA expression to oxidative stress, a key contributor to the progression of AD.
Additionally, cellular hypoxia can influence AD development by altering pre-mRNA splicing, particularly of the Tau gene, suggesting that environmental influences can significantly impact AD progression through changes in mRNA processing [20]. Furthermore, crucial microRNA-mRNA pairs, such as miR-26a-5p/PTGS2, have been identified as essential regulators in AD, highlighting the importance of regulatory networks and post-transcriptional regulation in AD development [21,22]. Recently, the microRNA 221, which is a cerebrospinal fluid microRNA, has emerged as a promising candidate for the early detection of AD [6,23,24], suggesting that the study of mRNA has the potential to advance the development of new diagnostic tools and therapeutic strategies for AD [25,26].
Since 2020, a collaborative effort has been underway to elucidate the genetic landscape of AD susceptibility and AD age of onset (ADAOO) in Barranquilla, Colombia. This involves a comprehensive clinical, cognitive, neuropsychological, and genetic assessment of individuals with AD (cases) and healthy unrelated controls. In this report, we present the results and analysis of microarrays quantifying the expression of 16,580 mRNAs using advanced bioinformatics, data analytics, and ML techniques to identify exosomal mRNA signatures that could improve disease diagnosis, prediction, and treatment. We hypothesize that (1) mRNAs could be promising, non-invasive, and reliable novel diagnostic markers for AD in this population, and (2) these mRNA signatures could potentially allow early diagnosis, risk prediction, and the development of targeted interventions for this devastating neurodegenerative disease. We identify differentially expressed mRNAs that could serve as potential biomarkers for AD diagnosis and ADAOO. Our results suggest that integrating mRNAs with ML tools could improve early detection and monitoring. Additionally, we provide insights into the role of mRNAs within the CADM1 and TNFRSF19 genes in AD pathology. While our findings are promising, further validation in larger cohorts is essential to confirm the reliability of these biomarkers and explore their roles in disease mechanisms.

2. Results

2.1. Subjects

We collected data from 30 individuals through clinical evaluations, family histories, comprehensive neurological and neuropsychological clinical examinations, and structured interviews. Demographic data are summarized in Table 1. The Universidad del Norte Ethics Committee approved this study (Project Approval Act #188 of 23 May 2019).

2.2. mRNA Signatures Contributing to AD Susceptibility via Logistic Regression

The expression of 16,580 mRNAs was quantified in all participants, identifying 385 significantly associated with AD at a 5% nominal level. Of these, 82 mRNAs had a protective effect against AD, while 303 mRNAs were associated with an increased risk of an AD diagnosis (Figure 1a). However, none of these mRNAs were statistically significant after FDR correction.
Table 2 shows the top 10 mRNAs conferring susceptibility to AD in our sample, which are in the KRTAP5-6, TPCN2, GALM, KCNK6, CXCR5, ZNF626, STON1, C3orf22, AKNA, and SMIM5 genes. Figure 2a shows the p-value distribution across chromosomes. Note that mRNAs most significantly associated with AD susceptibility are in chromosomes 2, 11, and 19.

2.3. mRNAs Signatures Differentially Expressed Between Comparison Groups

We identified 154 differentially expressed mRNAs using Gamma regression with a Type I error of 5%; 102 mRNAs were upregulated, and 52 were downregulated in individuals with AD compared to healthy controls (Figure 1b). Table 3 shows the top 10 differentially expressed mRNAs in our sample, and Figure 2b shows the distribution of p-values across chromosomes. However, only ENST00000331581 (CADM1) and ENST00000382258 (TNFRSF19) were statistically significantly differentially expressed after FDR correction.

2.4. mRNAs Signatures Modifying ADAOO

We identified 2034 mRNAs that had a delaying effect on ADAOO ( β ^ > 0) and 1468 mRNAs that accelerated ADAOO ( β ^ < 0) in our individuals with AD, with a nominal Type I error of 5%. Table 4 shows the top 10 mRNAs associated with ADAOO in our sample. Interestingly, ENST00000257696 ( β ^ = 4.34 ; HILPDA) and ENST00000304060 ( β ^ = 4.79 ; ZNF440) delay ADAOO, whereas ENST00000263851 ( β ^ = 17.31 ; IL7), ENST00000340552 ( β ^ = 12.6 ; LIMK2), and ENST00000230658 ( β ^ = 11.05 ; ISL1) are the top accelerators. However, only ENST00000340552 (LIMK2), which accelerates AD onset by ~12.6 years (Table 4) showed a statistically significant association with ADAOO after FDR correction.

2.5. mRNAs Signatures Identified via ML

We identified several mRNAs with high accuracy for predicting AD diagnosis and ADAOO using the one-rule (OneR) ML algorithm (Table 5). Notably, the ENST00000331581 (CADM1), ENST00000372572 (FOXJ3) and ENST00000311550 (GABRB3) mRNAs independently achieved an accuracy of 95.4% for predicting AD diagnosis in the training dataset (n = 21). Regarding ADAOO, ENST00000640218 (HNRNPU), ENST00000261245 (MNAT1), and ENST00000339562 (NR4A2) exhibited a remarkable ability to accurately predict ADAOO in the training dataset (n = 11; Table 5).

2.6. ML-Based Predictive Framework of AD Diagnosis

We evaluated the performance of several ML algorithms to construct a predictive framework for AD diagnosis based on the 30 mRNAs with the highest predictive power identified via the OneR ML algorithm (Supplementary Table S2). Figure 3a summarizes their accuracy in the training dataset.
Our findings show that the rf, xgbLinear, and xgbTree ML perform exceptionally well in predicting AD diagnosis based on mRNA expression levels, achieving accuracies of 94.7%, 98%, and 99%, respectively (Table 6). Notably, the xgbTree algorithm exhibits low standard deviation and coefficient of variation. In contrast, the avNNet, hdda, and knn algorithms showed lower accuracy and more significant variability.
Further evaluation of the xgbTree algorithm confirmed its robust predictive capability for AD diagnosis (Table 7). Analysis of the ROC curve and AUC for the xgbTree algorithm across training and testing datasets suggest that this ML algorithm is capable of distinguishing individuals with AD from healthy controls and that ENST00000311550 (GABRB3) is the most significant mRNA for predicting AD (Supplementary Figure S1).

2.7. Feature Selection and Model Refinement for AD Diagnosis

We applied the OneR algorithm to enhance our ML-based approach for AD diagnosis and narrowed the predictors to the top five mRNAs. Our analysis identified that ENST00000311550 (GABRB3), ENST00000278765 (GGTLC1), ENST00000331581 (CADM1), ENST00000372572 (FOXJ3), and ENST00000636358 (ACY1) have the highest predictive power for AD diagnosis.
Subsequently, we assessed the performance of different ML algorithms based on these mRNAs (Supplementary Table S4). Interestingly, some ML algorithms achieved remarkable accuracy scores (i.e., avNNet, lda, lda2, svmLinear, svmLinear2, svmPoly, treebag, xgbLinear, and xgbTree), while others, despite showing slightly lower accuracies and higher variability, perform reasonably well (i.e., svmRadial, knn, and rf). This suggests that ML algorithms using the top 5 mRNAs identified via OneR can distinguish between individuals with AD and healthy controls in our sample. However, the xgbTree algorithm is the preferred choice. This model achieves remarkable performance in training and testing datasets, with the ROC curve and AUC values indicating that the ML-based model is robust, generalizable, and capable of accurately distinguishing AD individuals from healthy controls (Supplementary Figure S1). The ENST00000311550 (GABRB3) mRNA is the most important predictor.
To further enhance the prediction accuracy for AD diagnosis, we explored combinations of mRNAs when using the xgbTree ML algorithm. Our goal was to identify the most effective predictors for diagnosing AD. Thus, we assessed the predictive power of eight pairs of gene transcripts. Of these, the pair ENST00000311550 (GABRB3) and ENST00000331581 (CADM1) emerged as the most accurate, achieving an average accuracy of 95.8% in the training data (Supplementary Figure S2). Variable importance revealed that, under this model, ENST00000311550 (GABRB3) is the most important predictor for AD diagnosis. ENST00000278765 (GGTLC1) was included as a predictor for the final predictive model because it also demonstrated good predictive power and robust performance metrics. The final model with these three mRNAs achieved remarkable AUC, accuracy, sensitivity, specificity, and precision scores during the training and testing phases (Supplementary Figure S2).

2.8. ML-Based Predictive Framework for ADAOO

Table 8 reports the performance of several ML algorithms for predicting ADAOO based on the top 30 mRNAs identified via OneR (Table S3, Supplementary Material). Our results indicate that the pls and known algorithms demonstrated superior performance; the former achieved an RMSE of 6.519 and an MAE of 6.459, while the known achieved RMSE and MAE values of 6.817 and 6.761, respectively.
ML algorithms were clustered into three groups (Supplementary Figure S3). Variable importance analyses of the top performer algorithms revealed distinct prioritizations for predicting ADAOO. For instance, HBMT00001385713 (LONRF1), ENCT00000265279 (INSM1), ENST00000370332 (GFI1), and ENST00000257430 (APC) are pivotal variables for ADAOO prediction using rf (Figure 4a); HBMT00001385713 (LONRF1), ENST00000263736 (SRBD1), ENST00000304677 (RNASE6), and ENST00000640218 (HNRNPU) are identified as the most important by the xgbLinear algorithm (Figure 4b); and xgbTree ranks ENST00000304677 (RNASE6), ENST00000640218 (HNRNPU), ENST00000602017 (PPP5D1), ENST00000224950 (STN1), and ENST00000322088 (PPP2R1A) mRNAs as the most critical ADAOO predictors in our cohort of individuals with AD (Figure 4c).

2.9. Refining the ML-Based Model for ADAOO Prediction

We selected the top five mRNAs to construct pair combinations and tested their ADAOO predictive power for the testing dataset using the rf, xgbLinear, and xgbTree algorithms (Table 9). Among the distinct model combinations, the xgbTree algorithm with ENST00000304677 (RNASE6) and ENST00000602017 (PPP5D1) as predictors achieved the best performance (RMSE = 0.462, R2 = 0.993, MAE = 0.392; Table 9).

3. Discussion

This study explored the utility of various data analytics and machine learning (ML) techniques for identifying patterns in mRNA expression data related to Alzheimer’s disease (AD). The key findings are as follows: ML methods successfully identified differentially expressed mRNAs in AD, providing insights into their roles in disease pathogenesis. Logistic regression analysis revealed 385 differentially expressed mRNAs, 82 showing a protective effect and 303 associated with increased AD risk. Secondly, an ML-based framework predicts AD based on mRNA profiles, demonstrating promise for early detection and personalized intervention strategies. Several mRNA transcripts, including ENST00000331581 (CADM1), ENST00000372572 (FOXJ3), and ENST00000311550 (GABRB3), exhibited exceptional predictive power, accurately distinguishing AD cases from controls. Lastly, the study extended to predicting the age of AD onset (ADAOO), highlighting the potential for personalized treatment planning based on individual risk assessments. Key mRNA transcripts, such as ENST00000304677 (RNASE6) and ENST00000602017 (INPP5D), were identified as crucial predictors of ADAOO by advanced ML algorithms such as xgbTree and RF [27,28].
Logistic regression analysis revealed 385 differentially expressed mRNAs, with 82 demonstrating a protective effect and 303 associated with an increased risk of AD. Although these mRNAs did not reach statistical significance (Table 2), the biological relevance of these mRNAs could play an important role in AD pathogenesis.
We used Gamma regression to identify differentially expressed mRNAs and their association with AD and ADAOO as complementary analyses. A total of 154 differentially expressed mRNAs, 102 upregulated and 52 downregulated in individuals with AD, were identified. Of these, two mRNAs, ENST00000331581 (CADM1) and ENST00000382258 (TNFRSF19), were statistically significant after multiple testing corrections were applied (Table 3). These mRNAs provide supporting evidence of the role of the CADM1 and TNFRSF19 in AD pathogenesis. Interestingly, CADM1 is implicated in synaptic assembly and has known isoforms, such as SP3, identified in proteogenomic studies [29,30]. The detection of these isoforms in humans and mice, along with their altered expression in AD models, highlights their potential role in neurodegenerative processes [29].
In our study, an mRNA within the TNFRSF19 (TROY) gene also emerged as an important biomarker. Previous studies have linked TNFRSF19 elevated expression to both intracranial aneurysms and coronary artery disease [31,32]. Furthermore, its strong correlation with inflammatory markers and immune-related genes highlights its potential role in chronic inflammation and vascular abnormalities. Indeed, elevated expression of CADM1 and TNFRSF19 in AD models emphasizes their critical role in inflammatory processes associated with AD [31,32].
We identified that expression levels in ENST00000340552 (LIMK2) delay ADAOO by ~12 years (Table 4). LIMK2 is a protein crucial for controlling the dynamics of the cell’s internal framework, known as the actin cytoskeleton. This process is vital for shaping cell structure and movement. When activated by ROCK1, LIMK2 can modify another protein called cofilin, which normally destabilizes the actin network. By phosphorylating cofilin, LIMK2 prevents it from breaking down actin, allowing cells to maintain their shape and move effectively. This regulation of actin is essential for fundamental cellular activities like cell division, apoptosis, and cell migration [33]. Researchers have also linked abnormalities in the ROCK1/LIMK2/cofilin pathway to various types of cancer [34,35]. LIMK2 is also involved in neurodevelopmental disorders and neurodegenerative diseases, including AD, Parkinson’s, and schizophrenia [36]. Recent studies show that targeting LIMK2 in cancer and neurological disorders is promising, as LIMK2 inhibitors have shown efficacy in preclinical models [37,38,39]. Thus, identifying an mRNA regulating LIMK2 as significantly associated with delayed ADAOO emphasizes its potential as a neuroprotective factor. Given its role in actin dynamics and broad impact on cellular processes, LIMK2 represents a valuable target for therapeutic strategies aimed at delaying the onset or progression of AD.
Using the OneR ML algorithm, we identified that transcripts ENST00000331581 (CADM1), ENST00000372572 (FOXJ3), and ENST00000311550 (GABRB3) each achieved an accuracy of 95.4% for distinguishing AD cases from controls (Table 5) [36,40,41,42,43]. Similarly, the performance of an ML-based predictive framework for AD diagnosis using 16 ML algorithms and the expression levels of several mRNAs was rigorously evaluated and assessed (Figure 3). Notably, RF, xgbLinear, and xgbTree emerged as top performers (Table 6). The subsequent application of the OneR ML algorithm further refined our predictive approach by identifying five key mRNA transcripts—ENST00000311550, ENST00000278765, ENST00000331581, ENST00000372572, and ENST00000636358 (Supplementary Figures S1 and S2). The fact that the gene regulated by ENST00000311550 is particularly involved in critical biological processes related to neurodegeneration and synaptic function underscores its potential as a key biomarker for AD diagnosis [44,45,46]. Analysis of GABAergic signaling components in post-mortem human brain tissue revealed significant transcriptional downregulation of GABA receptors (GABBR2, GABRA1, GABRB3, GABRG2), GABA synthesizing enzymes (GAD1, GAD2), and other neurotransmitter receptors (GRIK1, GRIK2), implicating a disruption in the excitatory/inhibitory balance that contributes to cognitive decline in AD [44]. These findings align with previous studies linking alterations in GABAergic pathways to AD pathology, suggesting potential therapeutic targets aimed at restoring neuronal function through modulation of these pathways [46]. Moreover, insights from genetic studies in epilepsy highlight parallels in synaptic dysfunction, reinforcing the broader implications of disrupted neuronal networks in neurodegenerative diseases like AD [45,46]. In addition, genes regulated by ENST00000331581, ENST00000372572, and ENST00000636358 are implicated in essential processes such as cell adhesion, synaptic function, and neuronal signaling, which are crucial in the context of neurodegenerative diseases like AD [36,41,43,47,48].
We developed and comprehensively assessed the performance of an ML-based framework for predicting ADAOO (Table 8). Among the algorithms tested, avNNet exhibited poor performance metrics, whereas knn, pls, xgbTree, xgbLinear, and RF consistently demonstrated superior predictive accuracy with low RMSE and MAE values (Table 8). Notably, the xgbTree algorithm exhibited exceptional performance in predicting ADAOO, with an impressively low RMSE and high R2 values (Table 9). Key mRNA transcripts such as ENST00000304677 and ENST00000602017 were identified as pivotal for predicting ADAOO by xgbTree, which suggests their critical role in delineating ADAOO in our sample (Table 9). ENST00000304677, located within the RNASE6 gene, plays an important role in innate immune responses and has been linked to neuroinflammation, a characteristic feature of AD. Specifically, RNASE6 expression correlates with myeloid-derived suppressor cells (MDSCs), suggesting its involvement in immune modulation that may influence susceptibility to ADAOO. RNASE6 expression interacts with APOE-ε4 status, indicating that higher levels of RNASE6 are associated with poorer memory outcomes among APOE-ε4 carriers [27,49]. RNASE6 encodes an antimicrobial peptide involved in innate immune responses and has been identified in gene co-expression networks with other inflammatory genes implicated in AD, such as TREM2 and MS4A [50,51].
Furthermore, ENST00000602017, identified as crucial in the predictive model, regulates the Inositol polyphosphate-5-phosphatase (INPP5D) gene, also known as SHIP1, which has emerged as significant in AD pathophysiology, particularly associated with late-onset AD (LOAD). INPP5D is selectively expressed in brain microglia and has been linked to LOAD through genome-wide association studies [28]. Despite its critical role, the precise impact of INPP5D on disease onset and progression remains unclear. Differential gene expression analysis investigated INPP5D in AD, revealing its upregulation in LOAD and positive correlation with amyloid plaque density. In the 5xFAD amyloid mouse model, INPP5D expression increased with disease progression, particularly in plaque-associated microglia. Notably, depletion of microglia using the colony-stimulating factor receptor-1 antagonist PLX5622 entirely abolished the elevated Inpp5d expression levels in 5xFAD mice.
Similarly, RF revealed the significance of HBMT00001385713 and ENST00000257430 in the ML-based predictive models of ADAOO (Table 9). ENST00000257430, associated with the APC/C-Cdh1 pathway, plays a crucial role in AD pathophysiology [52]. The APC/C-Cdh1 complex, an E3 ubiquitin ligase, regulates synaptic plasticity and neuronal survival. In AD, aberrant activation of Aβ induces phosphorylation of Cdh1, disrupting the APC/C-Cdh1 complex. This disruption leads to the accumulation of substrates such as Rock2 and Cyclin B1 in affected brain regions, contributing to synaptic loss and neurotoxicity. Studies in neurons and animal models have demonstrated that maintaining normal APC/C-Cdh1 activity may mitigate Aβ-induced neurotoxic effects, suggesting potential therapeutic targets for AD [53].
In summary, our study showed that using ML algorithms to assess AD risk and ADAOO based on demographic and genetic data is promising for clinical applications, as indicated by the RMSE, MAE, and R2 performance metrics. Genetic variants are essential predictors in our ML models for AD and ADAOO. These models can facilitate personalized assessments, ultimately advancing predictive genomics and personalized medicine approaches for AD and improving individualized treatment strategies for patients at risk of developing the disease [54,55,56]. Thus, integrating mRNA biomarkers with advanced ML methods shows potential for early ADAOO detection and intervention, enhancing clinical management strategies and improving our understanding and treatment of AD.
Integrating ML and mRNA data in AD research presents a robust framework for advancing our understanding of the disease [54,55,56]. Identified mRNAs associated with AD risk, protection, and ADAOO prediction establish a solid foundation for future investigations, particularly in Latin American and Caribbean regions [6,26,57]. Validating these findings in more extensive, diverse cohorts and exploring the biological roles of the identified mRNAs could unveil novel insights into AD pathogenesis. Furthermore, these findings hold significant therapeutic potential, as targeting mRNAs linked to AD risk or protection could lead to the development of novel treatments, including gene therapies aimed at modulating mRNA expression and potentially altering disease trajectories [54,55,56].

4. Materials and Methods

4.1. Participants

We recruited 30 participants (15 with a diagnosis of AD and 15 healthy controls) at the Instituto Colombiano de Neuropedagogía (ICN) in Barranquilla, Colombia. The ICN team determined the candidates’ eligibility based on the Montreal Cognitive Assessment (MoCA) results [58] and the inclusion criteria described elsewhere [13].
Patients were classified as affected by AD if they met the DSM-V criteria [59] and had a Mini-Mental State Examination (MMSE) [60] score between 0 and 18 points. Exclusion criteria included other neurological or major psychiatric disorders, psychoactive substance use, excessive alcohol consumption, and inability to complete the clinical studies as previously described [13]. Healthy controls were non-family volunteers over 65 years old, without suspected AD, and with an MMSE score between 19 and 29. Individuals with depression, mild cognitive impairment (MCI), dementia, other neurological disorders, major psychiatric illnesses, or those using psychoactive substances or consuming excessive alcohol were excluded.

4.2. Neuropsychological Assessment

After explaining to potential participants what the study consisted of and obtaining informed consent, an exhaustive neuropsychological evaluation was performed, which included the following tests: Boston Denomination Test [61,62], Rey–Osterrieth Complex Figure [63], Rey Auditory Verbal Learning Test (RAVLT) [64], Trail Making Test (TMT) [65,66], Symbol Digit Modality Test (SDMT) [67], Stroop Color and Word Test [68], Token Test [69], Benton’s Visual Retention Test (BVRT) [70], Clock Drawing Test [71], Memory Scale subtest of the Wisconsin Card Testing Test [72], Geriatric Depression Screening Test [73], Global Deterioration Scale (GDS) [74], Barthel Functional Index [75], and Neuropsychiatric Inventory [76]. Additional data for each participant, such as age at the beginning of the study, sex, educational level, marital status, weight, and height, were also recorded through the clinical history. In participants diagnosed with AD, the AD age of onset (ADAOO) of the disease was defined as the age at onset of symptoms according to previous research [77,78].

4.3. RNA Isolation and Extraction

Blood samples were collected to isolate circulating exosomes as described elsewhere [13]. Exosomes were isolated using the Total Exosome Isolation Reagent commercial kit (catalogue #4478360, Thermo Fisher Scientific, Inc., Waltham, MA, USA) following the manufacturer’s instructions with minor modifications standardized at Universidad del Norte, Barranquilla laboratories. The resulting exosomes were characterized by scanning electron microscopy (SEM). For this purpose, exosomes were encapsulated with nanodiamond particles, and their sizes were confirmed.
For the extraction of RNA contained in exosomes, a technique based on the acid phenol–chloroform method was standardized in the laboratory of the Universidad del Norte [13]. Extracted RNA was resuspended with 50 µL of RNAse-free water and then subjected to DNase I (catalogue #EN0521, Thermo Fisher Scientific, Inc., USA) following the manufacturer’s instructions. Finally, the concentration and indexes of the readings obtained with the optical densities (ODs) 260/230 and 260/280 were measured in a NanoDrop 2000 (Thermo Fisher Scientific, Inc., USA) and corresponded to the RNA quality indexes.

4.4. mRNA Microarray Study

For mRNA identification and differential expression analysis, the 30 RNA samples (15 cases with AD and 15 healthy controls) were sent to Arraystar, Inc. (Rockville, MD, USA), where RNA quality control, labelling, and hybridization were performed according to Agilent’s single-color microarray-based gene expression analysis protocol (Agilent Technologies, Santa Clara, CA, USA) with minor modifications.

4.4.1. Quality Control

Each sample was subjected to retrotranscription to obtain complementary DNA (cDNA), amplified, and transcribed back to its complementary RNA (cRNA). In this step, amplification and incorporation of the cyanine 3 (Cy3) fluorescent dye labelling was achieved simultaneously along the entire length of the 3′ unbiased transcript using a random priming method (Arraystar Flash RNA Labelling Kit, Arraystar, Inc., Rockville, MD, USA). The labelled cRNAs were purified with the RNeasy mini kit (Qiagen, Hilden, Germany). In this step, reagent residues and the excess of cyanine not incorporated were eliminated. As a control of the amplification and labelling process of the samples, the concentration of the cRNA was obtained, and the rate of cyanine incorporation or specific activity (pmol of Cy3 per μg cRNA). Hybridization was allowed to continue if the cRNA concentration was >1.65 μg and the specific activity was >9 pmol of Cy3 per μg of cRNA. Otherwise, cRNA preparation was repeated.

4.4.2. Hybridization and Microarray Scanning

A total of 1 μg of each labelled cRNA was fragmented by adding five μL of blocking agent 10x and 1 μL of fragmentation buffer 25x. The mixture was heated to 60 °C for 30 min, and then 25 μL of hybridization buffer 2x GE was used to dilute the labelled cRNA; 50 μL of hybridization solution was dispensed onto a hybridization plate, which was then assembled with an lncRNA expression microarray plate. The plates were incubated for 17 h at 65 °C in an Agilent hybridization oven. The hybridized arrays were washed and scanned using an Agilent scanner (equipment #G2505C, Agilent Technologies, Santa Clara, CA, USA).

4.4.3. mRNA Microarray and Data Normalization

The Arraystar Human LncRNA Arrays V5 is designed to systematically profile long non-coding RNAs (lncRNAs) and the entire set of protein-coding mRNAs: about 39,317 lncRNAs (8393 Gold Standard LncRNAs and 30,924 Reliable LncRNAs) and 21,174 mRNA coding transcripts. Arraystar, Inc. maintains high-quality proprietary lncRNA transcriptome databases that extensively collect lncRNAs through all major public databases and repositories, knowledge-based mining of scientific publications, and our lncRNA discovery pipelines, which include FANTOM5 CAT (version 1), GENECODE (version 29), RefSeq (updated to November 2018), BIGTranscriptome (version 1), knownGene (updated to November 2018), lncRNAdb, LncRNAWiki, RNAdb, NRED, CLS FL, NONCODE (version 5), MiTranscriptome (version 2), and an lncRNA/mRNA discovery pipeline from more than 47 Tb RNA-seq data. Each transcript is represented by a specific exon or splice junction probe, which can identify individual transcripts accurately. Positive and negative probes for housekeeping genes were printed onto the array for hybridization quality control.
Quantile normalization and subsequent data processing were performed using the GeneSpring GX v12.1 software package (Agilent Technologies, Santa Clara, CA, USA). After normalization, mRNAs were flagged as present or marginal (“all-target value”) in at least 15 of 30 samples chosen for further analysis.

4.5. Identification of mRNAs Conferring Susceptibility to AD

mRNAs conferring susceptibility to AD were identified using Generalized Linear Models [79]. For the jth mRNA, a Logistic regression model of the form AD ~ mRNAj + Age + Sex + Schooling was fitted using the glm() function in R version 4.4.1 [80], where Age is the age of the individual at the beginning of the study and Schooling corresponds to years of education. Subsequently, we extracted the estimated regression coefficient associated with mRNAj, denoted as β ^ j , the corresponding standard error S E ^ β ^ j and the test statistic computed as t j = β ^ j S E ^ β ^ j . For interpretation purposes, β ^ j > 0 implies that the jth mRNA confers susceptibility to AD; β ^ j < 0 implies that the jth mRNA has a protective effect; and β ^ j = 0 implies that the jth mRNA does not affect AD susceptibility (j = 1, 2, …, m). Under the null hypothesis, t j ~ t n p . In our context, n = 30 and p = 5. The p-value for the jth mRNA is P j = 2 P r ( t 25 > | t j | ) . Thus, p-values P 1 ,   P 2 ,   P 3 , , P m are collected. As m is usually large, p-values were corrected for multiple testing using the false discovery rate (FDR) [81,82]. mRNAs with FDR-corrected p-values below 5% (pFDR < 0.05) were statistically significantly associated with AD susceptibility.

4.6. mRNA Differentially Expressed Between AD Groups

mRNAs differentially expressed between individuals with AD and healthy controls were identified using a Gamma regression model with an identity link of the form mRNAj ~ AD + Age + Sex + Schooling was fitted to the data as implemented in the glm() function of R. For the jth mRNA, the estimated regression coefficient associated with AD, denoted as β ^ j , the standard error S E ^ β ^ j , and the test statistic t j were extracted (for more details, see Section 4.5). For interpretation purposes, β ^ j > 0 implies that the jth mRNA is upregulated in individuals with AD; β ^ j < 0 implies that the jth mRNA is downregulated; and β ^ j = 0 implies that there is no difference in the average expression levels of the jth mRNA between the comparison groups. The p-value of AD for the jth mRNA is calculated as P j = 2 P r ( t 25 > | t j | ) . Further, the collection of p-values P 1 ,   P 2 ,   P 3 , ,   P m was corrected for multiple testing using FDR, with only those with pFDR < 0.05 considered differentially expressed between individuals with AD and healthy controls.

4.7. mRNA Associated with ADAOO

mRNAs potentially associated with ADAOO were identified using a Gamma regression model of the form ADAOO ~ mRNAj + Age + Sex + Schooling with an identity link. Next, the regression coefficient associated with mRNAj, denoted as β ^ j , as well as the standard error S E ^ β ^ j and the test statistic t j , were extracted. In this case, only individuals with AD were considered for analysis. For interpretation purposes, β ^ j > 0 implies that the jth mRNA delays ADAOO; β ^ j < 0 implies that the jth mRNA accelerates ADAOO; and β ^ j = 0 implies that the jth mRNA has no effect on ADA. The p-value for the jth mRNA is calculated as P j = 2 P r ( t 10 > | t j | ) . Therefore, mRNAs with pFDR < 0.05 were associated with ADAOO.

4.8. Identification of mRNA Signatures Relevant to AD and ADAOO Using ML

We utilized the OneR package [83,84] in R to construct a simple and interpretable rule-based predictive model for AD and ADAOO. The OneR ML algorithm generates one-rule models for each predictor in the data and selects the single most predictive attribute for predicting an outcome variable of interest [83,84]. In this case, the mRNA expression levels were included as predictor variables, and the outcome variables were AD diagnosis (0: control; 1: case) and ADAOO. For each mRNA, OneR counts how often each class (AD diagnosis or a categorized version of ADAOO) appears, finds the most frequent class, makes a rule that assigns that class to the mRNA expression level, and calculates the error of that rule.

4.9. ML-Based Predictive Framework with mRNA Signatures

We used the caret package [85,86] in R to construct predictive models of AD status (0: control; 1: case) using the expression levels of mRNAs and demographic variables (i.e., age at the beginning of the study, sex, and years of education) as predictors. This package implements a series of ML algorithms and a comprehensive framework for building, testing, and validating ML models for classification and regression [85,86].
To develop ML models for AD, we employed several algorithms: Classification and Regression Tree (CART), Bagged CART, Random Forest (RF), XGBoost (xgbTree and xgbLinear), Support Vector Machines (SVMs), Linear Discriminant Analysis (lda), K-nearest Neighbors (knn), and Model Averaged Neural Network (avNNet). These algorithms were selected for their capacity to manage complex relationships in the data and deliver robust predictions. For details on these algorithms and their parameters, see Table S1 of the Supplementary Material. The dataset (n = 30) was partitioned into training (70%, n = 21) and testing (30%, n = 9) datasets. The performance of each algorithm was evaluated using accuracy metrics derived from the cross-validation process, which emphasizes showing the models’ predicted outcomes compared to actual results [87,88]. Each ML-based model was evaluated using the accuracy, the Receiver Operating Characteristic (ROC) curve, the area under the ROC curve (AUC), sensitivity, specificity and precision. These metrics assess how well the model predictions align with actual outcomes, with higher values indicating better performance [87,89].
We constructed an ML-based predictive framework for ADAOO. In addition to the ML algorithms previously mentioned, the performance of Ridge Regression (ridge), Generalized Linear Models (GLM), Generalized Additive Models (gam) using Locally Estimated Scatterplot Smoothing (LOESS; gamLoess), Multi-Layer Perceptron (mlp), and Partial Least Squares (pls) was also assessed (Supplementary Table S1). The original dataset (n = 15) was partitioned into training (n = 11) and testing (n = 4) datasets using the same proportions. As ADOO is a numerical variable, the performance of the ML-based models was assessed using the Mean Absolute Error (MAE), Mean Squared Error (MSE), and the coefficient of determination (R2). All models for AD diagnosis or ADAOO were trained using mRNA expression levels as predictors and utilized a 10-fold cross-validation procedure. This approach was specifically designed to ensure unbiased evaluations and enhance our understanding of how the models will likely perform on future unseen data.

5. Conclusions

Alzheimer’s disease (AD) is a progressive neurodegenerative disorder characterized by cognitive decline, primarily due to the accumulation of amyloid beta (Aβ) plaques and tau tangles in the brain [90]. Current diagnostic methods often rely on clinical assessments and imaging techniques. However, a growing interest in molecular biomarkers, particularly messenger RNA (mRNA) expression profiles, may enhance diagnostic accuracy and provide insights into the disease mechanisms in AD [15,17,91].
This study provides a framework for integrating ML and mRNA expression analysis, paving the way for personalized medicine approaches in AD. By identifying specific mRNAs associated with AD diagnosis and age of onset (ADAOO), we contribute to a deeper understanding of the biological underpinnings of AD and its progression.
Our findings reveal that after false discovery rate (FDR) correction, only ENST00000331581 (CADM1) and ENST00000382258 (TNFRSF19) were statistically significantly differentially expressed between the comparison groups (Table 3). In addition, ENST00000340552 (LIMK2) was strongly associated with Alzheimer’s disease age of onset (ADAOO), accelerating AD onset by approximately 12.6 years (Table 4). Based on machine learning (ML) algorithms, the researchers identified that the expression levels of ENST00000331581 (CADM1), ENST00000372572 (FOXJ3), and ENST00000311550 (GABRB3) achieved an accuracy of 95.4% for predicting AD diagnosis (Table 5). Similarly, the expression levels of ENST00000640218 (HNRNPU), ENST00000261245 (MNAT1), and ENST00000339562 (NR4A2) showed remarkable performance in accurately predicting ADAOO (Table 5).
The use of ML algorithms combined with mRNA expression data offers a promising avenue for early diagnosis and personalized treatment strategies. Here, we further investigated the predictive power of various ML algorithms for predicting AD diagnosis and ADAOO based on mRNA expression. ENST00000311550 (GABRB3) emerged as the most significant predictor for AD diagnosis, and additional mRNAs—ENST00000278765 (GGTLC1), ENST00000331581 (CADM1), ENST00000372572 (FOXJ3), and ENST00000636358 (ACY1)—were critical predictors of AD diagnosis (Supplementary Figure S1). Notably, these mRNAs demonstrated exceptional performance, distinguishing individuals with AD from healthy controls (Supplementary Figure S1). For predicting ADAOO, ENST00000304677 (RNASE6) and ENST00000602017 (PPP5D1) have achieved the best performance metrics in the testing data, suggesting that the prediction error for ADAOO is limited to a few months (Table 9).
While our findings are promising, this study has several limitations regarding population characteristics and sample diversity. First, the relatively small sample size may restrict the generalizability of the results, requiring further validation in larger, more diverse cohorts to confirm the reliability of these biomarkers and their roles in AD. Second, the stringent inclusion and exclusion criteria may create a homogeneous sample that does not adequately represent the variability of AD in the broader population, including different clinical subtypes. Lastly, individual variability in disease progression and comorbid conditions can mask treatment effects and challenge data interpretation.
Future research should focus on several key areas to advance our understanding of AD. First, conducting larger-scale studies is essential to validate the identified mRNA biomarkers across diverse populations, ensuring their robustness and applicability in clinical settings. Second, exploring the biological roles of CADM1, TNFRSF19, and LIMK2 through functional studies will help elucidate their contributions to neurodegenerative processes and AD pathogenesis. Finally, investigating potential therapeutic strategies aimed at modulating the expression of these mRNAs or targeting their associated pathways could provide innovative approaches to delay or prevent the onset of AD. By focusing on these areas, future studies can significantly enhance diagnostic accuracy and therapeutic interventions for AD.
In summary, this study highlights the potential of ML combined with exosomal mRNA expression analysis in advancing the understanding of AD. The identified mRNA transcripts and the robust predictive models developed offer a promising avenue for more accurate and early diagnosis, ultimately leading to improved patient outcomes. Continued refinement of these models and further investigation into the underlying biological mechanisms will be crucial in translating these findings into clinical practice and driving therapeutic innovations.

Supplementary Materials

The supporting information can be downloaded at https://www.mdpi.com/article/10.3390/ijms252212293/s1.

Author Contributions

Conceptualization, M.I.M.-H., O.M.V., P.G.-G. and J.I.V.; methodology, E.B., R.A., L.C.M., O.M.V. and C.S.-R.; software, D.A.B. and J.I.V.; validation, M.I.M.-H., O.M.V., M.A.-B., P.G.-G. and J.I.V.; formal analysis, D.A.B. and J.I.V.; investigation, M.I.M.-H., O.M.V., E.B., R.A., C.S.-R., P.G.-G. and J.I.V.; resources, M.I.M.-H., P.G.-G. and J.I.V.; data curation, D.A.B., M.I.M.-H., E.B. and J.I.V.; writing—original draft preparation, D.A.B. and J.I.V.; writing—review and editing, M.I.M.-H., O.M.V., M.A.-B., P.G.-G. and J.I.V.; visualization, D.A.B., M.A.-B. and J.I.V.; supervision, P.G.-G. and J.I.V.; project administration, P.G.-G. and J.I.V.; funding acquisition, M.I.M.-H., P.G.-G. and J.I.V. All authors have read and agreed to the published version of the manuscript.

Funding

This study was financed by the Ministry of Science, Technology and Innovation of Colombia (MINCIENCIAS), project “Nuevos ARN no codificantes exosomales y su papel en la patogénesis de la Enfermedad de Alzheimer”, code 121584468097, grant 844/2019, contract 416-2020, awarded to Grupo de Genética y Medicina Molecular and Grupo de Productividad y Competitividad, Universidad del Norte, Barranquilla, Colombia.

Institutional Review Board Statement

This study was conducted according to the tenets of the Declaration of Helsinki and approved by the Ethics Committee of Universidad del Norte, Barranquilla, Colombia (Project Approval Act #188 of 23 May 2019).

Informed Consent Statement

Informed consent was obtained from all individuals who participated voluntarily in this study.

Data Availability Statement

The data presented in this study are available upon reasonable request from the corresponding authors. They are not publicly available due to the ongoing nature of the study and our commitment to protecting the privacy and confidentiality of our patients.

Acknowledgments

We are deeply grateful to all the individuals, their families, and caregivers who have generously volunteered to participate in our ongoing research on Alzheimer’s disease. M.I.M.-H. was a graduate student in Biomedical Sciences at Universidad del Norte, where she received a scholarship for her studies. DBP pursued graduate studies in Data Analytics at the same institution, benefiting from a scholarship funded by the university and the Air Force Office of Scientific Research (grant #22RT0286) as part of the Minerva Research Initiative. Portions of this research were presented in partial fulfilment of the requirements for their respective degrees. The APC was funded by Universidad del Norte, Barranquilla, Colombia.

Conflicts of Interest

The authors declare no conflicts of interest. As expected, the funders had no role in the study’s design, data collection, analysis, interpretation, manuscript writing, or decision to publish the results.

References

  1. Better, M.A. 2023 Alzheimer’s Disease Facts and Figures. Alzheimer’s Dement. 2023, 19, 1598–1695. [Google Scholar] [CrossRef]
  2. Greene, A.N.; Solomon, M.B.; Privette Vinnedge, L.M. Novel Molecular Mechanisms in Alzheimer’s Disease: The Potential Role of DEK in Disease Pathogenesis. Front. Aging Neurosci. 2022, 14, 1018180. [Google Scholar] [CrossRef] [PubMed]
  3. Öztan, G.; İşsever, H. Molecular Mechanisms and Genetics of Alzheimer’s Disease. Turk. J. Biochem. 2023, 48, 218–229. [Google Scholar] [CrossRef]
  4. Serrano-Pozo, A.; Frosch, M.P.; Masliah, E.; Hyman, B.T. Neuropathological Alterations in Alzheimer Disease. Cold Spring Harb. Perspect. Med. 2011, 1, a006189. [Google Scholar] [CrossRef]
  5. Suresh, S.; Singh, S.A.; Rushendran, R.; Vellapandian, C.; Prajapati, B. Alzheimer’s Disease: The Role of Extrinsic Factors in Its Development, an Investigation of the Environmental Enigma. Front. Neurol. 2023, 14, 1303111. [Google Scholar] [CrossRef]
  6. Ramos, C.; Aguillon, D.; Cordano, C.; Lopera, F. Genetics of Dementia: Insights from Latin America. Dement. Neuropsychol. 2020, 14, 223–236. [Google Scholar] [CrossRef]
  7. Vélez, J.I.; Lopera, F.; Silva, C.T.; Villegas, A.; Espinosa, L.G.; Vidal, O.M.; Mastronardi, C.A.; Arcos-Burgos, M. Familial Alzheimer’s Disease and Recessive Modifiers. Mol. Neurobiol. 2020, 57, 1035–1043. [Google Scholar] [CrossRef]
  8. Vélez, J.I.; Lopera, F.; Patel, H.R.; Johar, A.S.; Cai, Y.; Rivera, D.; Tobón, C.; Villegas, A.; Sepulveda-Falla, D.; Lehmann, S.G.; et al. Mutations Modifying Sporadic Alzheimer’s Disease Age of Onset. Am. J. Med. Genet. Part B Neuropsychiatr. Genet. 2016, 171, 1116–1130. [Google Scholar] [CrossRef]
  9. Fortea, J.; Pegueroles, J.; Alcolea, D.; Belbin, O.; Dols-Icardo, O.; Vaqué-Alcázar, L.; Videla, L.; Gispert, J.D.; Suárez-Calvet, M.; Johnson, S.C.; et al. APOE4 Homozygosity Represents a Distinct Genetic form of Alzheimer’s Disease. Nat. Med. 2024, 30, 1284–1291. [Google Scholar] [CrossRef]
  10. Sepulveda-Falla, D.; Chavez-Gutierrez, L.; Portelius, E.; Vélez, J.I.; Dujardin, S.; Barrera-Ocampo, A.; Dinkel, F.; Hagel, C.; Puig, B.; Mastronardi, C.; et al. A Multifactorial Model of Pathology for Age of Onset Heterogeneity in Familial Alzheimer’s Disease. Acta Neuropathol. 2021, 141, 217–233. [Google Scholar] [CrossRef]
  11. Quiroz, Y.T.; Aguillon, D.; Aguirre-Acevedo, D.C.; Vasquez, D.; Zuluaga, Y.; Baena, A.Y.; Madrigal, L.; Hincapié, L.; Sanchez, J.S.; Langella, S.; et al. APOE3 Christchurch Heterozygosity and Autosomal Dominant Alzheimer’s Disease. N. Engl. J. Med. 2024, 390, 2156–2164. [Google Scholar] [CrossRef] [PubMed]
  12. Sepulveda-Falla, D.; Vélez, J.I.; Acosta-Baena, N.; Baena, A.; Moreno, S.; Krasemann, S.; Lopera, F.; Mastronardi, C.A.; Arcos-Burgos, M. Genetic Modifiers of Cognitive Decline in PSEN1 E280A Alzheimer’s Disease. Alzheimer’s Dement. 2024, 20, 2873–2885. [Google Scholar] [CrossRef] [PubMed]
  13. Mosquera-Heredia, M.I.; Vidal, O.M.; Morales, L.C.; Silvera-Redondo, C.; Barceló, E.; Allegri, R.; Arcos-Burgos, M.; Vélez, J.I.; Garavito-Galofre, P. Long Non-Coding RNAs and Alzheimer’s Disease: Towards Personalized Diagnosis. Int. J. Mol. Sci. 2024, 25, 7641. [Google Scholar] [CrossRef] [PubMed]
  14. Vélez, J.I.; Samper, L.A.; Arcos-Holzinger, M.; Espinosa, L.G.; Isaza-Ruget, M.A.; Lopera, F.; Arcos-Burgos, M. A Comprehensive Machine Learning Framework for the Exact Prediction of the Age of Onset in Familial and Sporadic Alzheimer’s Disease. Diagnostics 2021, 11, 887. [Google Scholar] [CrossRef] [PubMed]
  15. Ghosh, A.; Mizuno, K.; Tiwari, S.S.; Proitsi, P.; Gomez Perez-Nievas, B.; Glennon, E.; Martinez-Nunez, R.T.; Giese, K.P. Alzheimer’s Disease-Related Dysregulation of MRNA Translation Causes Key Pathological Features with Ageing. Transl. Psychiatry 2020, 10, 192. [Google Scholar] [CrossRef]
  16. Riscado, M.; Baptista, B.; Sousa, F. New RNA-Based Breakthroughs in Alzheimer’s Disease Diagnosis and Therapeutics. Pharmaceutics 2021, 13, 1397. [Google Scholar] [CrossRef]
  17. Donaghy, P.C.; Cockell, S.J.; Martin-Ruiz, C.; Coxhead, J.; Kane, J.; Erskine, D.; Koss, D.; Taylor, J.-P.; Morris, C.M.; O’Brien, J.T.; et al. Blood MRNA Expression in Alzheimer’s Disease and Dementia with Lewy Bodies. Am. J. Geriatr. Psychiatry 2022, 30, 964–975. [Google Scholar] [CrossRef]
  18. van Bergeijk, P.; Seneviratne, U.; Aparicio-Prat, E.; Stanton, R.; Hasson, S.A. SRSF1 and PTBP1 Are Trans-Acting Factors That Suppress the Formation of a CD33 Splicing Isoform Linked to Alzheimer’s Disease Risk. Mol. Cell. Biol. 2019, 39, e00568-18. [Google Scholar] [CrossRef]
  19. Işık, M.; Beydemir, Ş. AChE MRNA Expression as a Possible Novel Biomarker for the Diagnosis of Coronary Artery Disease and Alzheimer’s Disease, and Its Association with Oxidative Stress. Arch. Physiol. Biochem. 2022, 128, 352–359. [Google Scholar] [CrossRef]
  20. Jakubauskienė, E.; Vilys, L.; Pečiulienė, I.; Kanopka, A. The Role of Hypoxia on Alzheimer’s Disease-Related APP and Tau MRNA Formation. Gene 2021, 766, 145146. [Google Scholar] [CrossRef]
  21. Toden, S.; Zhuang, J.; Acosta, A.D.; Karns, A.P.; Salathia, N.S.; Brewer, J.B.; Wilcock, D.M.; Aballi, J.; Nerenberg, M.; Quake, S.R.; et al. Noninvasive Characterization of Alzheimer’s Disease by Circulating, Cell-Free Messenger RNA next-Generation Sequencing. Sci. Adv. 2020, 6, eabb1654. [Google Scholar] [CrossRef] [PubMed]
  22. Xie, T.; Pei, Y.; Shan, P.; Xiao, Q.; Zhou, F.; Huang, L.; Wang, S. Identification of MiRNA–MRNA Pairs in the Alzheimer’s Disease Expression Profile and Explore the Effect of MiR-26a-5p/PTGS2 on Amyloid-β Induced Neurotoxicity in Alzheimer’s Disease Cell Model. Front. Aging Neurosci. 2022, 14, 909222. [Google Scholar] [CrossRef] [PubMed]
  23. Noor Eddin, A.; Hamsho, K.; Adi, G.; Al-Rimawi, M.; Alfuwais, M.; Abdul Rab, S.; Alkattan, K.; Yaqinuddin, A. Cerebrospinal Fluid MicroRNAs as Potential Biomarkers in Alzheimer’s Disease. Front. Aging Neurosci. 2023, 15, 1210191. [Google Scholar] [CrossRef]
  24. Phu Pham, L.H.; Chang, C.-F.; Tuchez, K.; Chen, Y. Assess Alzheimer’s Disease via Plasma Extracellular Vesicle-Derived MRNA. medRxiv 2023, 16, e70006. [Google Scholar] [CrossRef]
  25. Karaglani, M.; Gourlia, K.; Tsamardinos, I.; Chatzaki, E. Accurate Blood-Based Diagnostic Biosignatures for Alzheimer’s Disease via Automated Machine Learning. J. Clin. Med. 2020, 9, 3016. [Google Scholar] [CrossRef]
  26. Parra, M.A.; Orellana, P.; Leon, T.; Victoria, C.G.; Henriquez, F.; Gomez, R.; Avalos, C.; Damian, A.; Slachevsky, A.; Ibañez, A.; et al. Biomarkers for Dementia in Latin American Countries: Gaps and Opportunities. Alzheimer’s Dement. 2023, 19, 721–735. [Google Scholar] [CrossRef]
  27. Seto, M.; Weiner, R.L.; Dumitrescu, L.; Mahoney, E.R.; Hansen, S.L.; Janve, V.; Khan, O.A.; Liu, D.; Wang, Y.; Menon, V.; et al. RNASE6 Is a Novel Modifier of APOE-Ε4 Effects on Cognition. Neurobiol. Aging 2022, 118, 66–76. [Google Scholar] [CrossRef]
  28. Tsai, A.P.; Lin, P.B.-C.; Dong, C.; Moutinho, M.; Casali, B.T.; Liu, Y.; Lamb, B.T.; Landreth, G.E.; Oblak, A.L.; Nho, K. INPP5D Expression Is Associated with Risk for Alzheimer’s Disease and Induced by Plaque-Associated Microglia. Neurobiol. Dis. 2021, 153, 105303. [Google Scholar] [CrossRef]
  29. da Silva, E.M.G.; Santos, L.G.C.; de Oliveira, F.S.; Freitas, F.C.D.P.; Parreira, V.D.S.C.; Dos Santos, H.G.; Tavares, R.; Carvalho, P.C.; Neves-Ferreira, A.G.d.C.; Haibara, A.S.; et al. Proteogenomics Reveals Orthologous Alternatively Spliced Proteoforms in the Same Human and Mouse Brain Regions with Differential Abundance in an Alzheimer’s Disease Mouse Model. Cells 2021, 10, 1583. [Google Scholar] [CrossRef]
  30. Moiseeva, E.P.; Leyland, M.L.; Bradding, P. CADM1 Is Expressed as Multiple Alternatively Spliced Functional and Dysfunctional Isoforms in Human Mast Cells. Mol. Immunol. 2013, 53, 345–354. [Google Scholar] [CrossRef]
  31. Zhang, Q.; Li, S.; Tang, D.; Yan, L.; Chen, Z.; Tao, W.; Wang, Y.; Huang, Z.; Chen, F. TNFRSF19 (TROY) as a Plasma Biomarker for Diagnosing and Monitoring Intracranial Aneurysms Progression; Research Square: Durham, NC, USA, 2022. [Google Scholar]
  32. Feng, X.; Zhang, Y.; Du, M.; Li, S.; Ding, J.; Wang, J.; Wang, Y.; Liu, P. Identification of Diagnostic Biomarkers and Therapeutic Targets in Peripheral Immune Landscape from Coronary Artery Disease. J. Transl. Med. 2022, 20, 399. [Google Scholar] [CrossRef] [PubMed]
  33. Chong, Z.X.; Ho, W.Y.; Yeap, S.K. Decoding the Tumour-Modulatory Roles of LIMK2. Life Sci. 2024, 347, 122609. [Google Scholar] [CrossRef] [PubMed]
  34. Mardilovich, K.; Baugh, M.; Crighton, D.; Kowalczyk, D.; Gabrielsen, M.; Munro, J.; Croft, D.R.; Lourenco, F.; James, D.; Kalna, G.; et al. LIM Kinase Inhibitors Disrupt Mitotic Microtubule Organization and Impair Tumor Cell Proliferation. Oncotarget 2015, 6, 38469–38486. [Google Scholar] [CrossRef]
  35. Villalonga, E.; Mosrin, C.; Normand, T.; Girardin, C.; Serrano, A.; Žunar, B.; Doudeau, M.; Godin, F.; Bénédetti, H.; Vallée, B. LIM Kinases, LIMK1 and LIMK2, Are Crucial Node Actors of the Cell Fate: Molecular to Pathological Features. Cells 2023, 12, 805. [Google Scholar] [CrossRef]
  36. Ben Zablah, Y.; Zhang, H.; Gugustea, R.; Jia, Z. LIM-Kinases in Synaptic Plasticity, Memory, and Brain Diseases. Cells 2021, 10, 2079. [Google Scholar] [CrossRef]
  37. Kang, Y.J.; Diep, Y.N.; Tran, M.; Cho, H. Therapeutic Targeting Strategies for Early- to Late-Staged Alzheimer’s Disease. Int. J. Mol. Sci. 2020, 21, 9591. [Google Scholar] [CrossRef]
  38. Nikhil, K.; Chang, L.; Viccaro, K.; Jacobsen, M.; McGuire, C.; Satapathy, S.R.; Tandiary, M.; Broman, M.M.; Cresswell, G.; He, Y.J.; et al. Identification of LIMK2 as a Therapeutic Target in Castration Resistant Prostate Cancer. Cancer Lett. 2019, 448, 182–196. [Google Scholar] [CrossRef]
  39. Shah, K.; Cook, M. LIMK2: A Multifaceted Kinase with Pleiotropic Roles in Human Physiology and Pathologies. Cancer Lett. 2023, 565, 216207. [Google Scholar] [CrossRef]
  40. Harutyunyan, A.; Jones, N.C.; Kwan, P.; Anderson, A. Network Preservation Analysis Reveals Dysregulated Synaptic Modules and Regulatory Hubs Shared Between Alzheimer’s Disease and Temporal Lobe Epilepsy. Front. Genet. 2022, 13, 821343. [Google Scholar] [CrossRef]
  41. Kong, W.; Mou, X.; Zhi, X.; Zhang, X.; Yang, Y. Dynamic Regulatory Network Reconstruction for Alzheimer’s Disease Based on Matrix Decomposition Techniques. Comput. Math. Methods Med. 2014, 2014, 891761. [Google Scholar] [CrossRef]
  42. Lau, P.; Bossers, K.; Janky, R.; Salta, E.; Frigerio, C.S.; Barbash, S.; Rothman, R.; Sierksma, A.S.R.; Thathiah, A.; Greenberg, D.; et al. Alteration of the MicroRNA Network During the Progression of Alzheimer’s Disease. EMBO Mol. Med. 2013, 5, 1613–1634. [Google Scholar] [CrossRef] [PubMed]
  43. Shi, Y.-W.; Zhang, Q.; Cai, K.; Poliquin, S.; Shen, W.; Winters, N.; Yi, Y.-H.; Wang, J.; Hu, N.; Macdonald, R.L.; et al. Synaptic Clustering Differences due to Different GABRB3 Mutations Cause Variable Epilepsy Syndromes. Brain 2019, 142, 3028–3044. [Google Scholar] [CrossRef] [PubMed]
  44. Govindpani, K.; Turner, C.; Waldvogel, H.J.; Faull, R.L.M.; Kwakowsky, A. Impaired Expression of GABA Signaling Components in the Alzheimer’s Disease Middle Temporal Gyrus. Int. J. Mol. Sci. 2020, 21, 8704. [Google Scholar] [CrossRef] [PubMed]
  45. Hill, M.A.; Gammie, S.C. Alzheimer’s Disease Large-Scale Gene Expression Portrait Identifies Exercise as the Top Theoretical Treatment. Sci. Rep. 2022, 12, 17189. [Google Scholar] [CrossRef]
  46. Kang, J.-Q. Epileptic Mechanisms Shared by Alzheimer’s Disease: Viewed via the Unique Lens of Genetic Epilepsy. Int. J. Mol. Sci. 2021, 22, 7133. [Google Scholar] [CrossRef]
  47. Posavi, M.; Diaz-Ortiz, M.; Liu, B.; Swanson, C.R.; Skrinak, R.T.; Hernandez-Con, P.; Amado, D.A.; Fullard, M.; Rick, J.; Siderowf, A.; et al. Characterization of Parkinson’s Disease Using Blood-Based Biomarkers: A Multicohort Proteomic Analysis. PLoS Med. 2019, 16, e1002931. [Google Scholar] [CrossRef]
  48. Shibuya, Y.; Niu, Z.; Bryleva, E.Y.; Harris, B.T.; Murphy, S.R.; Kheirollah, A.; Bowen, Z.D.; Chang, C.C.Y.; Chang, T.-Y. Acyl-Coenzyme A:Cholesterol Acyltransferase 1 Blockage Enhances Autophagy in the Neurons of Triple Transgenic Alzheimer’s Disease Mouse and Reduces Human P301L-Tau Content at the Presymptomatic Stage. Neurobiol. Aging 2015, 36, 2248–2259. [Google Scholar] [CrossRef]
  49. Luckett, E.S.; Zielonka, M.; Kordjani, A.; Schaeverbeke, J.; Adamczuk, K.; De Meyer, S.; Van Laere, K.; Dupont, P.; Cleynen, I.; Vandenberghe, R. Longitudinal APOE4- and Amyloid-Dependent Changes in the Blood Transcriptome in Cognitively Intact Older Adults. Alzheimers Res. Ther. 2023, 15, 121. [Google Scholar] [CrossRef]
  50. Hu, S.; Li, S.; Ning, W.; Huang, X.; Liu, X.; Deng, Y.; Franceschi, D.; Ogbuehi, A.C.; Lethaus, B.; Savkovic, V.; et al. Identifying Crosstalk Genetic Biomarkers Linking a Neurodegenerative Disease, Parkinson’s Disease, and Periodontitis Using Integrated Bioinformatics Analyses. Front. Aging Neurosci. 2022, 14, 1032401. [Google Scholar] [CrossRef]
  51. Watson, C.N.; Begum, G.; Ashman, E.; Thorn, D.; Yakoub, K.M.; Hariri, M.A.; Nehme, A.; Mondello, S.; Kobeissy, F.; Belli, A.; et al. Co-Expression Analysis of MicroRNAs and Proteins in Brain of Alzheimer’s Disease Patients. Cells 2022, 11, 163. [Google Scholar] [CrossRef]
  52. Fuchsberger, T. The Role of APC/C-Cdh1 in Alzheimer’s Disease; Universitat de Valencia Roderic: Valencia, Spain, 2016. [Google Scholar]
  53. Lapresa, R.; Agulla, J.; Bolaños, J.P.; Almeida, A. APC/C-Cdh1-Targeted Substrates as Potential Therapies for Alzheimer’s Disease. Front. Pharmacol. 2022, 13, 1086540. [Google Scholar] [CrossRef] [PubMed]
  54. Mihaescu, R.; Detmar, S.B.; Cornel, M.C.; van der Flier, W.M.; Heutink, P.; Hol, E.M.; Rikkert, M.G.M.O.; van Duijn, C.M.; Janssens, A.C.J.W. Translational Research in Genomics of Alzheimer’s Disease: A Review of Current Practice and Future Perspectives. J. Alzheimer’s Dis. 2010, 20, 967–980. [Google Scholar] [CrossRef] [PubMed]
  55. Golriz Khatami, S.; Mubeen, S.; Hofmann-Apitius, M. Data Science in Neurodegenerative Disease: Its Capabilities, Limitations, and Perspectives. Curr. Opin. Neurol. 2020, 33, 249–254. [Google Scholar] [CrossRef] [PubMed]
  56. Hampel, H.; Vergallo, A.; Perry, G.; Lista, S. The Alzheimer Precision Medicine Initiative. J. Alzheimer’s Disease 2019, 68, 1–24. [Google Scholar] [CrossRef]
  57. Duran-Aniotz, C.; Sanhueza, J.; Grinberg, L.T.; Slachevsky, A.; Valcour, V.; Robertson, I.; Lawlor, B.; Miller, B.; Ibáñez, A. The Latin American Brain Health Institute, a Regional Initiative to Reduce the Scale and Impact of Dementia. Alzheimer’s Dement. 2022, 18, 1696–1698. [Google Scholar] [CrossRef]
  58. Nasreddine, Z.S.; Phillips, N.A.; Bédirian, V.; Charbonneau, S.; Whitehead, V.; Collin, I.; Cummings, J.L.; Chertkow, H. The Montreal Cognitive Assessment, MoCA: A Brief Screening Tool for Mild Cognitive Impairment. J. Am. Geriatr. Soc. 2005, 53, 695–699. [Google Scholar] [CrossRef]
  59. American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders; American Psychiatric Association Publishing: Washington, DC, USA, 2022; ISBN 0-89042-575-2. [Google Scholar]
  60. Folstein, M.F.; Robins, L.N.; Helzer, J.E. The Mini-Mental State Examination. Arch. Gen. Psychiatry 1983, 40, 812. [Google Scholar] [CrossRef]
  61. Allegri, R.F.; Villavicencio, A.F.; Taragano, F.E.; Rymberg, S.; Mangone, C.A.; Baumann, D. Spanish Boston Naming Test Norms. Clin. Neuropsychol. 1997, 11, 416–420. [Google Scholar] [CrossRef]
  62. Fernández, A.L.; Fulbright, R.L. Construct and Concurrent Validity of the Spanish Adaptation of the Boston Naming Test. Appl. Neuropsychol. Adult 2015, 22, 355–362. [Google Scholar] [CrossRef]
  63. Osterrieth, P.A. The Test of Copying a Complex Figure: A Contribution to the Study of Perception and Memory. Arch. Psychol. 1944, 30, 206–356. [Google Scholar]
  64. Bean, J. Rey Auditory Verbal Learning Test, Rey AVLT. In Encyclopedia of Clinical Neuropsychology; Springer: New York, NY, USA, 2011; pp. 2174–2175. [Google Scholar]
  65. Reitan, R.M. The Relation of the Trail Making Test to Organic Brain Damage. J. Consult. Psychol. 1955, 19, 393. [Google Scholar] [CrossRef] [PubMed]
  66. Reitan, R.M. Validity of the Trail Making Test as an Indicator of Organic Brain Damage. Percept. Mot. Skills 1958, 8, 271–276. [Google Scholar] [CrossRef]
  67. Smith, A. Symbol Digit Modalities Test. Clin. Neuropsychol. 1973. [Google Scholar] [CrossRef]
  68. Golden, C.J. Stroop Color and Word Test; Stoelting, Co.: Wood Dale, IL, USA, 1978. [Google Scholar]
  69. de Renzi, E.; Vignolo, L.A. The token test: A sensitive test to detect receptive disturbances in aphasics. Brain 1962, 85, 665–678. [Google Scholar] [CrossRef]
  70. Benton, A.L. Visuospatial Judgment: A Clinical Test. Arch. Neurol. 1978, 35, 364. [Google Scholar] [CrossRef]
  71. Aprahamian, I.; Martinelli, J.E.; Neri, A.L.; Yassuda, M.S. The Clock Drawing Test A Review of Its Accuracy in Screening for Dementia. Dement. Neuropsychol. 2009, 3, 74–80. [Google Scholar] [CrossRef]
  72. Grant, D.A.; Berg, E. A Behavioral Analysis of Degree of Reinforcement and Ease of Shifting to New Responses in a Weigl-Type Card-Sorting Problem. J. Exp. Psychol. 1948, 38, 404–411. [Google Scholar] [CrossRef]
  73. Brink, T.L.; Yesavage, J.A.; Lum, O.; Heersema, P.H.; Adey, M.; Rose, T.L. Screening Tests for Geriatric Depression. Clin. Gerontol. 1982, 1, 37–43. [Google Scholar] [CrossRef]
  74. Reisberg, B.; Ferris, S.H.; De Leon, M.J.; Crook, T. The Global Deterioration Scale for Assessment of Primary Degenerative Dementia. Am. J. Psychiatry 1982, 139, 1136–1139. [Google Scholar]
  75. Mahoney, F.I.; Barthel, D.W. Functional evaluation: The barthel index. Md. State Med. J. 1965, 14, 61–65. [Google Scholar]
  76. Cummings, J. The Neuropsychiatric Inventory: Development and Applications. J. Geriatr. Psychiatry Neurol. 2020, 33, 73–84. [Google Scholar] [CrossRef] [PubMed]
  77. Naj, A.C.; Jun, G.; Reitz, C.; Kunkle, B.W.; Perry, W.; Park, Y.S.; Beecham, G.W.; Rajbhandary, R.A.; Hamilton-Nelson, K.L.; Wang, L.-S.; et al. Effects of Multiple Genetic Loci on Age at Onset in Late-Onset Alzheimer Disease. JAMA Neurol. 2014, 71, 1394. [Google Scholar] [CrossRef] [PubMed]
  78. Saad, M.; Brkanac, Z.; Wijsman, E.M. Family-based Genome Scan for Age at Onset of Late-onset Alzheimer’s Disease in Whole Exome Sequencing Data. Genes Brain Behav. 2015, 14, 607–617. [Google Scholar] [CrossRef]
  79. Dunn, P.K.; Smyth, G.K. Generalized Linear Models with Examples in R; Springer: New York, NY, USA, 2018; ISBN 978-1-4419-0117-0. [Google Scholar]
  80. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2023. [Google Scholar]
  81. Benjamini, Y.; Hochberg, Y. Controlling The False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J. R. Stat. Soc. Ser. B 1995, 57, 289–300. [Google Scholar] [CrossRef]
  82. Vélez, J.I.; Correa, J.C.; Arcos-Burgos, M. A New Method for Detecting Significant p-Values with Applications to Genetic Data. Rev. Colomb. Estad. 2014, 37, 67–76. [Google Scholar] [CrossRef]
  83. Holte, R.C. Very Simple Classification Rules Perform Well on Most Commonly Used Datasets. Mach. Learn. 1993, 11, 63–90. [Google Scholar] [CrossRef]
  84. von Jouanne-Diedrich, H. OneR: One Rule Machine Learning Classification Algorithm with Enhancements. R Package Version 2.2. 2017. Available online: https://CRAN.R-project.org/package=OneR (accessed on 10 November 2024).
  85. Kuhn, M. Package ‘caret’—Classification and Regression Training; R Package Version 6.0-86; 2020. Available online: https://cran.r-project.org/web/packages/caret/index.html (accessed on 10 November 2024).
  86. Kuhn, M. Building Predictive Models in R Using the Caret Package. J. Stat. Softw. 2008, 28, 1–26. [Google Scholar] [CrossRef]
  87. Ramezan, C.A.; Warner, T.A.; Maxwell, A.E. Evaluation of Sampling and Cross-Validation Tuning Strategies for Regional-Scale Machine Learning Classification. Remote Sens. 2019, 11, 185. [Google Scholar] [CrossRef]
  88. Kuhn, M.; Johnson, K. Applied Predictive Modeling; Springer: Berlin/Heidelberg, Germany, 2013; ISBN 9781461468493. [Google Scholar]
  89. Naidu, G.; Zuva, T.; Sibanda, E.M. A Review of Evaluation Metrics in Machine Learning Algorithms; Lecture Notes in Networks and Systems; Springer: Berlin/Heidelberg, Germany, 2023; Volume 724. [Google Scholar]
  90. Gauthier, S.; Leuzy, A.; Racine, E.; Rosa-Neto, P. Diagnosis and Management of Alzheimer’s Disease: Past, Present and Future Ethical Issues. Prog. Neurobiol. 2013, 110, 102–113. [Google Scholar] [CrossRef]
  91. Tan, M.S.; Yang, Y.X.; Xu, W.; Wang, H.F.; Tan, L.; Zuo, C.T.; Dong, Q.; Tan, L.; Suckling, J.; Yu, J.T. Associations of Alzheimer’s Disease Risk Variants with Gene Expression, Amyloidosis, Tauopathy, and Neurodegeneration. Alzheimers Res. Ther. 2021, 13, 15. [Google Scholar] [CrossRef]
Figure 1. Volcano plots for mRNAs (a) conferring AD susceptibility, (b) differentially expressed mRNAs between the comparison groups, and (c) associated with ADAOO. Red lines show statistically significant mRNAs at 5%.
Figure 1. Volcano plots for mRNAs (a) conferring AD susceptibility, (b) differentially expressed mRNAs between the comparison groups, and (c) associated with ADAOO. Red lines show statistically significant mRNAs at 5%.
Ijms 25 12293 g001
Figure 2. Manhattan plots showing mRNA signatures (a) conferring susceptibility to AD (p < 0.01 threshold, red line), (b) differentially expressed between study groups (p < 2.5 × 10−6 threshold, red line), and (c) associated with ADAOO (p < 2.5 × 10−6 threshold, red line) in a sample of 15 individuals with AD from Barranquilla, Colombia.
Figure 2. Manhattan plots showing mRNA signatures (a) conferring susceptibility to AD (p < 0.01 threshold, red line), (b) differentially expressed between study groups (p < 2.5 × 10−6 threshold, red line), and (c) associated with ADAOO (p < 2.5 × 10−6 threshold, red line) in a sample of 15 individuals with AD from Barranquilla, Colombia.
Ijms 25 12293 g002
Figure 3. (a) Accuracy and 95% confidence intervals for predicting AD diagnosis using different ML algorithms based on the top 30 mRNAs identified with OneR. (b) ROC curves for the xgbTree algorithm in the training (blue) and testing (green) datasets. (c) Variable importance analysis for the xgbTree algorithm. ROC: receiver operating characteristic; AUC: area under the ROC curve.
Figure 3. (a) Accuracy and 95% confidence intervals for predicting AD diagnosis using different ML algorithms based on the top 30 mRNAs identified with OneR. (b) ROC curves for the xgbTree algorithm in the training (blue) and testing (green) datasets. (c) Variable importance analysis for the xgbTree algorithm. ROC: receiver operating characteristic; AUC: area under the ROC curve.
Ijms 25 12293 g003
Figure 4. Variable importance for the (a) rf, (b) xgbLinear, and (c) xgbTree ML algorithms for predicting ADAOO. Here, higher values are better.
Figure 4. Variable importance for the (a) rf, (b) xgbLinear, and (c) xgbTree ML algorithms for predicting ADAOO. Here, higher values are better.
Ijms 25 12293 g004
Table 1. Clinical and sociodemographic characterization of the study population.
Table 1. Clinical and sociodemographic characterization of the study population.
VariableAll
(n = 30)
Cases
(n = 15)
Controls
(n = 15)
p
Mean (SD)
Age (years)79.8 (8.7)77.5 (8.5)82.1 (8.6)0.261
Age of onset (years)72.1 (7.2)72.1 (7.2)--
MMSE19.6 (9.6)13.9 (9.5)25.2 (5.6)0.001
MoCA15.3 (11.2)5.5 (5.3)25.9 (3)<0.001
Frequency (%)
Sex 1
Female22 (73.3%)11 (73.3%)11 (73.3%)
Male8 (26.7%)4 (26.7%)4 (26.7%)
MMSE: Mini-Mental State Examination; MoCA: Montreal Cognitive Assessment; SD: standard deviation; p: p-value.
Table 2. Top 10 mRNAs conferring susceptibility to AD.
Table 2. Top 10 mRNAs conferring susceptibility to AD.
ChrTranscript IDPosition aGene β ^   ( S E ^ β ^ ) ppFDR
11ENST000003821601,718,425KRTAP5-65.27 (1.97)0.0070.999
11MICT0000006256168,830,976TPCN22.74 (1.03)0.0070.999
2ENST0000027225238,893,052GALM3.18 (1.20)0.0080.999
19ENST0000026337238,810,484KCNK64.52 (1.71)0.0080.999
11ENST00000292174118,754,475CXCR53.76 (1.45)0.0090.999
19ENST0000060144020,802,867ZNF6262.38 (0.92)0.0090.999
2ENST0000040622648,757,325STON13.74 (1.46)0.0100.999
3ENST00000318225126,268,516C3orf227.84 (3.07)0.0100.999
9ENST00000307564117,096,436AKNA2.34 (0.92)0.0100.999
17ENST0000053749473,632,675SMIM52.16 (0.85)0.0100.999
a UCSC GRCh37/hg19 coordinates. β ^ : logistic regression coefficient; Chr: chromosome; FDR: false discovery rate; p: p-value; pFDR: FDR-corrected p-value; S E ^ β ^ : estimated standard error of β ^ .
Table 3. Top 10 mRNAs differentially expressed between cases and healthy controls.
Table 3. Top 10 mRNAs differentially expressed between cases and healthy controls.
ChrTranscriptPositionGene β ^   ( S E ^ β ^ ) ppFDR
11ENST00000331581115,047,015CADM10.97 (0.16)3.34 × 10−60.027
13ENST0000038225824,153,499TNFRSF190.40 (0.06)2.24 × 10−60.027
3ENST00000318225126,268,516C3orf220.71 (0.14)2.32 × 10−50.128
17ENCT0000017532142,030,339PYY0.82 (0.18)1.74 × 10−40.692
19ENST0000035849121,688,366ZNF4290.83 (0.19)2.16 × 10−40.692
2ENST0000040622648,757,325STON10.72 (0.17)2.50 × 10−40.692
19ENST0000026337238,810,484KCNK60.82 (0.20)3.52 × 10−40.833
7ENCT000004079041,214,597ZFAND2A0.60 (0.15)6.07 × 10−40.985
1ENST00000427500155,204,350GBA0.83 (0.22)7.12 × 10−40.985
5ENST00000509437132,333,792ZCCHC100.72 (0.18)6.33 × 10−40.985
β ^ : Gamma regression coefficient based on the identity link. Other conventions as in Table 2.
Table 4. mRNAs modifying ADAOO. Conventions as in Table 3.
Table 4. mRNAs modifying ADAOO. Conventions as in Table 3.
ChrTranscriptPositionGene β ^   ( S E ^ β ^ ) ppFDR
22ENST0000034055231,644,473LIMK2−12.6 (1.06)3.04 × 10−70.005
22ENST0000021573021,213,271SNAP29−5.59 (0.76)2.50 × 10−50.096
22ENST0000021613951,176,624ACR−7.21 (1.27)2.14 × 10−40.096
5ENST0000023065850,679,225ISL1−11.05 (1.49)2.29 × 10−50.096
4ENST0000024870653,728,457RASL11B−6.18 (1.09)2.15 × 10−40.096
7ENST00000257696128,095,945HILPDA4.34 (0.76)2.00 × 10−40.096
8ENST0000026385179,645,007IL7−17.31 (2.96)1.61 × 10−40.096
13ENST0000028239728,874,481FLT1−6.21 (1.01)1.07 × 10−40.096
19ENST0000030406011,925,099ZNF4404.79 (0.79)1.15 × 10−40.096
3ENST0000032021148,488,137ATRIP−6.93 (1.08)7.47 × 10−40.096
Table 5. Top mRNAs for AD diagnosis and ADAOO via ML in the training dataset.
Table 5. Top mRNAs for AD diagnosis and ADAOO via ML in the training dataset.
Target VariableChrTranscriptPositionGene Accuracy
AD11ENST00000331581115,047,015CADM10.954
1ENST0000037257242,642,210FOXJ30.954
15ENST0000031155026,788,693GABRB30.954
17ENST0000029319072,838,162GRIN2C0.904
21ENST0000031112446,933,690SLC19A10.904
2MICT00000202802171,678,607GAD10.904
3ENCT00000296543161,062,306SPTSSB0.904
1ENST00000427500155,204,350GBA0.904
16ENST0000057168811,641,578LITAF0.904
3ENST0000063635852,017,294ACY10.904
ADAOO1ENST00000640218245,013,602HNRNPU1.000
14ENST0000026124561,201,480MNAT11.000
2ENST00000339562157,180,944NR4A21.000
14ENST0000030467721,249,210RNASE61.000
2ENST0000026373645,615,819SRBD11.000
17ENST0000039400139,533,902KRT340.900
3ENST00000264735192,958,914HRASLS0.900
20ENCT0000026527920,349,595INSM10.900
8ENST00000313269145,064,226GRINA0.900
5ENST00000257430112,073,585APC0.900
Table 6. Performance of ML-based models for AD diagnosis in the training dataset. Best results are shown in bold.
Table 6. Performance of ML-based models for AD diagnosis in the training dataset. Best results are shown in bold.
Algorithm Accuracy
MeanStandard
Deviation
Coefficient of Variation
avNNet0.7800.23730.354
hdda0.7830.24331.072
knn0.7830.23429.858
LDA0.8570.23827.796
lda20.8570.23827.796
rf0.9470.14815.683
rpart0.8470.29534.862
rpart1SE0.8470.29534.862
rpart20.8470.29534.862
svmLinear0.7870.23830.278
svmLinear20.7870.23830.278
svmPoly0.8200.22827.802
svmRadial0.8070.22728.113
treebag0.9270.22424.147
xgbLinear0.9800.14114.431
xgbTree0.9900.0717.142
Table 7. Performance metrics for predicting AD diagnosis based on the xgbTree algorithm.
Table 7. Performance metrics for predicting AD diagnosis based on the xgbTree algorithm.
Performance
Metric
Dataset
Training
(n = 21)
Testing
(n = 9)
AUC10.875
Accuracy10.875
Sensitivity11
Specificity10.75
Precision11
Table 8. Performance of ML algorithms for predicting ADAOO based on the top 30 mRNAs.
Table 8. Performance of ML algorithms for predicting ADAOO based on the top 30 mRNAs.
Algorithm Performance Measure
RMSER2MAE
avNNet71.518-71.510
gamLoess29.955128.094
glm29.955128.094
knn6.81716.761
mlp7.606-7.530
pls6.51916.459
rf7.22717.190
ridge7.83417.802
rpart7.576-7.497
rpart1SE7.576-7.497
svmLinear10.067110.060
svmPoly6.96916.887
svmRadial7.23417.168
treebag7.587-7.506
xgbLinear10.306110.235
xgbTree8.25018.155
RMSE: Root Mean Squared Error, lower is better; MAE: Mean Absolute Error, lower is better; R2: coefficient of determination, higher is better. “-” indicates that R2 values could not be estimated. Best results are shown in bold.
Table 9. Performance of refined rf, xgbLinear, and xgbTree ML models for predicting ADAOO in the testing data. Best results are shown in bold.
Table 9. Performance of refined rf, xgbLinear, and xgbTree ML models for predicting ADAOO in the testing data. Best results are shown in bold.
AlgorithmModelmRNAs CombinationRMSER2MAE
rf1HBMT00001385713, ENCT000002652792.7010.7432.156
2HBMT00001385713, ENST000003703321.6980.8941.569
3HBMT00001385713, ENST000002574300.9740.9750.840
xgbLinear1HBMT00001385713, ENST000002637363.4840.6561.747
2HBMT00001385713, ENST000003046775.5000.3032.750
3HBMT00001385713, ENST000006402182.5540.8151.278
xgbTree1ENST00000304677, ENST000006402181.5640.9791.218
2ENST00000304677, ENST000006020170.4620.9930.392
3ENST00000304677, ENST000002249500.7400.9990.613
4ENST00000304677, ENST000003220882.0540.9831.719
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Bolívar, D.A.; Mosquera-Heredia, M.I.; Vidal, O.M.; Barceló, E.; Allegri, R.; Morales, L.C.; Silvera-Redondo, C.; Arcos-Burgos, M.; Garavito-Galofre, P.; Vélez, J.I. Exosomal mRNA Signatures as Predictive Biomarkers for Risk and Age of Onset in Alzheimer’s Disease. Int. J. Mol. Sci. 2024, 25, 12293. https://doi.org/10.3390/ijms252212293

AMA Style

Bolívar DA, Mosquera-Heredia MI, Vidal OM, Barceló E, Allegri R, Morales LC, Silvera-Redondo C, Arcos-Burgos M, Garavito-Galofre P, Vélez JI. Exosomal mRNA Signatures as Predictive Biomarkers for Risk and Age of Onset in Alzheimer’s Disease. International Journal of Molecular Sciences. 2024; 25(22):12293. https://doi.org/10.3390/ijms252212293

Chicago/Turabian Style

Bolívar, Daniel A., María I. Mosquera-Heredia, Oscar M. Vidal, Ernesto Barceló, Ricardo Allegri, Luis C. Morales, Carlos Silvera-Redondo, Mauricio Arcos-Burgos, Pilar Garavito-Galofre, and Jorge I. Vélez. 2024. "Exosomal mRNA Signatures as Predictive Biomarkers for Risk and Age of Onset in Alzheimer’s Disease" International Journal of Molecular Sciences 25, no. 22: 12293. https://doi.org/10.3390/ijms252212293

APA Style

Bolívar, D. A., Mosquera-Heredia, M. I., Vidal, O. M., Barceló, E., Allegri, R., Morales, L. C., Silvera-Redondo, C., Arcos-Burgos, M., Garavito-Galofre, P., & Vélez, J. I. (2024). Exosomal mRNA Signatures as Predictive Biomarkers for Risk and Age of Onset in Alzheimer’s Disease. International Journal of Molecular Sciences, 25(22), 12293. https://doi.org/10.3390/ijms252212293

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop