Next Article in Journal
Expanding the Phenotype of the FAM149B1-Related Ciliopathy and Identification of Three Neurogenetic Disorders in a Single Family
Previous Article in Journal
FA-nf: A Functional Annotation Pipeline for Proteins from Non-Model Organisms Implemented in Nextflow
Previous Article in Special Issue
Reelin Alleviates Mesenchymal Stem Cell Senescence and Reduces Pathological α-Synuclein Expression in an In Vitro Model of Parkinson’s Disease
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Association and Gene–Gene Interactions Study of Late-Onset Alzheimer’s Disease in the Russian Population

1
Tomsk National Research Medical Center, Research Institute of Medical Genetics, 634050 Tomsk, Russia
2
Department of Neurology and Neurosurgery, Faculty of General Medicine, Siberian State Medical University, 634050 Tomsk, Russia
3
Nebbiolo Center for Clinical Trials, 634009 Tomsk, Russia
*
Author to whom correspondence should be addressed.
Genes 2021, 12(10), 1647; https://doi.org/10.3390/genes12101647
Submission received: 6 August 2021 / Revised: 11 October 2021 / Accepted: 15 October 2021 / Published: 19 October 2021
(This article belongs to the Special Issue Genetics and Genomics of Neurodegenerative Diseases)

Abstract

:
Alzheimer’s disease (AD) is a neurodegenerative disorder, and represents the most common cause of dementia. In this study, we performed several different analyses to detect loci involved in development of the late onset AD in the Russian population. DNA samples from 472 unrelated subjects were genotyped for 63 SNPs using iPLEX Assay and real-time PCR. We identified five genetic loci that were significantly associated with LOAD risk for the Russian population (TOMM40 rs2075650, APOE rs429358 and rs769449, NECTIN rs6857, APOE ε4). The results of the analysis based on comparison of the haplotype frequencies showed two risk haplotypes and one protective haplotype. The GMDR analysis demonstrated three significant models as a result: a one-factor, a two-factor and a three-factor model. A protein–protein interaction network with three subnetworks was formed for the 24 proteins. Eight proteins with a large number of interactions are identified: APOE, SORL1, APOC1, CD33, CLU, TOMM40, CNTNAP2 and CACNA1C. The present study confirms the importance of the APOE-TOMM40 locus as the main risk locus of development and progress of LOAD in the Russian population. Association analysis and bioinformatics approaches detected interactions both at the association level of single SNPs and at the level of genes and proteins.

1. Introduction

Alzheimer’s disease (AD) is a neurodegenerative disorder and the most common cause of dementia. The disease is characterized by the loss of neurons and synaptic connections in the cortex and certain subcortical areas. AD is clinically characterized by decline in progressive executive function, perceptual speed, short-term memory and other cognitive capacities [1]. Degradation of cognitive functions has a major impact on an individual’s quality of life. According to the World Alzheimer Report 2018, there are over 50 million people suffering from AD or other dementias in the world [2]. Each year, approximately 5–7 million new cases of AD are registered in the geriatric population [3].
Two forms of AD are traditionally distinguished: early onset (EOAD, onset <65 years old) and late-onset (LOAD, onset >65 years old). EOAD accounts for 1–6% of AD cases. These cases are familial and are associated with classical Mendelian patterns of inheritance. The amyloid precursor protein (APP), presenilin1 (PSEN1), and presenilin2 (PSEN2) genes are primarily implicated in EOAD. As for the later onset form of the disease, both genetic and environmental factors are considered to impact the disease risk profile. LOAD constitutes approximately 95% of AD cases [4]. Heritability for LOAD is estimated to be between 58% and 79%, on the basis of a twin study design [5]. The genetic complexity of LOAD should not be underestimated. Due to general efforts, it has become clear that pathogenesis of LOAD is multi-faceted and is not limited to one set of simple molecular interactions.
Common neurological and psychiatric disorders, including AD, schizophrenia, bipolar disorder, Parkinson’s disease are now the subject of intensive studies. Over the last few decades great progress in researching of susceptibility genetic loci to common mental and neurological diseases has been made using new technologies and methods: genome-wide association studies (GWAS), large meta-analysis, whole-exome/genome sequencing (GWAS, meta-analysis, NGS).
Alzheimer’s disease and schizophrenia are two common diseases of the brain with significant differences in neuropathology, etiology and symptoms. Data on the common links in the pathogenesis of AD and schizophrenia are accumulating [6]. Thus, statistical evidence has shown that AD and schizophrenia have a specific molecular background. Transcriptome studies have found a similar pattern of gene expression in the superior temporal gyrus in AD and schizophrenia [7]. Recently, GWASs have identified many genomic loci characterized by small effect sizes associated with LOAD: CLU, CR1, PICALM, BIN1, ABCA7, CD33 and others [8,9,10]. It is important to keep in mind that genetic association studies have often produced false findings, and thus need validation in multiple sample sets. On the other hand, it is known that most GWAS SNPs in the associated loci with disease cannot be explained on the basis of known pathological mechanisms. Next-generation sequencing has assisted the identification of rare susceptibility modifying alleles in APP, TREM2, and PLD3 [11]. However missing heritability for LOAD remains extensive. While great progress has been made with respect to understanding the genetic landscape of LOAD, the connection between the genes and variants promoting to the risk of the disease remains unclear. The data accumulation and the use of new bioinformatics approaches including those implemented in publicly available resources can help with this problem. The biological roles in metabolic pathways of all newly identified variants, as well as functionally related genes, can be assessed using ALIGATOR, KEGG, GO, STRING, GeneMANIA [12,13,14,15,16]. Exploring potentially new protein interactions in common mental and neurological diseases using bioinformatic strategies presents potential interest. Genetic risk factors could be analyzed and integrated in terms of biological pathways and functions to better understand the contribution of genetics to the general background of neurological and mental diseases. Protein interaction data, derived from the GWAS database, can be used to build protein–protein interaction (PPI) networks from genes associated with AD, schizophrenia and cognitive performance. PPI network analysis could be an effective and powerful approach to identify potential biological pathways or key genes involved in the link of neurological and mental diseases.
In addition to the complexities of LOAD associated with numerous interconnected environmental and genetic factors, the genetic architecture of LOAD differs between ethnic groups. Our prior study estimated genetic diversity of sixteen native populations of North Eurasia using a panel of genetic markers significantly associated with AD and Schizophrenia [17].
Here we follow up on our prior research in a new cohort of the AD group and controls. In the present study, we aimed to analyze 63 of the previously reported statistically significant SNP markers associated with AD in approximately 450 Russian samples from well-characterized LOAD case–control set. In addition, our focus was on the identification of the molecular and functional pathways of these SNP markers using various statistical and bioinformatics resources.

2. Materials and Methods

2.1. Samples

This study involved 185 unrelated LOAD patients (mean age 72.15 ± 7.87 years) recruited and evaluated at the Department of Neurology and Neurosurgery (Siberian State Medical University, Tomsk, Russia). Subjects were included in the study after a diagnosis of LOAD according to the International Classification of Diseases, Tenth Revision, Clinical Modification (ICD-10-CM), the diagnostic and statistical manual of mental disorders, fourth edition, text revision (DSM-IV TR) [18] and the NINCDS-ADRDA Alzheimer’s Criteria (the National Institute of Neurological and Communicative Diseases and Stroke and the Alzheimer’s Disease and Related Disorders Association) [19]. Structural imaging based on magnetic resonance is a part of the clinical assessment of patients with suspected AD. The presence of focal symmetrical medial temporal atrophy has predictive value for AD. Our study included the cases with MRI images showing the characteristic atrophy of the hippocampus. Cases had a minimum age at disease onset of 65 years. Control group consisted of 287 unrelated cognitively normal elderly Russian participants (mean age 71.8 ± 5.7 years). The same procedures for the control group were performed as for the case group to exclude the LOAD diagnosis. Accordingly, history, neurologic examinations, magnetic resonance images of brain volumes were examined for the control group. Additionally, the MoCA test was conducted for the control group to exclude mild cognitive impairment. The MoCA was developed to be more sensitive to mild cognitive impairment in geriatric populations than other screeners (like the Mini Mental State Examination-MMSE). MoCA scores ranged between 0 and 30 points, and the higher scores indicated the better cognitive performance. In our study, the control group included people with the MoCA score ≥ 23. Controls were recruited and evaluated at the Nebbiolo Center for Clinical Trials (Tomsk).
All of the studied individuals, cases and controls were of the same ethnic (Russians) and geographical origins, living in the Tomsk region of the Russian Federation. Table 1 presents the main demographic parameters of the studied groups. The study protocol was approved by the Scientific Ethics Committee of the Research Institute of Medical Genetics (Tomsk National Research Medical Center of the Russian Academy of Sciences) (protocol number 2017/108). Individuals or legal guardians signed a written informed consent form after the study objectives and procedures had been explained.

2.2. Genotyping

Genomic DNA was extracted from the peripheral venous blood using phenol–chloroform extraction method. For this study, two multiplex panels of SNP markers that showed repeated association with cognitive performance, AD or/and schizophrenia in the GWAS were formed (see Table 2). Genetic markers were selected based on certain and clear criteria. First of all, we selected loci reaching a genomic value (p < 5.00 × 10−7) for the phenotype (cognitive functions, AD and/or schizophrenia in GWAS). The next requirement concerned the type of marker: only SNPs were selected. The minor allele frequency (MAF) should have been ≥ than 5% in at least one population group of the HapMap project or the 1000 Genomes project. Selected markers should form multiplexes using the Assay Design Suite v2.0 genotyping tool (https://agenacx.com/online-tools/, accessed on 6 August 2021). All patients and controls were genotyped for 62 SNPs using iPLEX Assay following the recommended protocol by the manufacturer (Agena Bioscience™, San Diego, CA, USA). In addition to 62 SNPs, genotyped by MALDI/TOF mass spectrometry, rs7412 of APOE was genotyped by real-time PCR using the TaqMan® SNP Genotyping Assay (Applied Biosystems, Foster City, CA, USA).

2.3. Statistical Analyses

Two SNPs (SORL1 rs11218343, TCF4 rs1261117) were excluded from the final analysis because they showed genotype call rates < 87%. Per-marker genotype call rates were higher than 95% for all the rest of the markers. The genotype distributions of 3 loci (LUZP2 rs1021261, CNTNAP2 rs10273775, GRIN2B rs2160519) were not consistent with the Hardy–Weinberg equilibrium in the control group. Therefore, these three polymorphisms were excluded from further analyses.
Pearson’s chi-squared test was performed to verify the Hardy–Weinberg equilibrium of each SNP in LOAD patients and controls. The genetic variants in the control group which were not in the Hardy–Weinberg equilibrium (p < 0.05) were excluded from the analysis. Allele and genotype frequencies of each SNP were compared between LOAD cases and controls using the chi-squared (χ2) test. The odds ratios (OR) and 95% confidence intervals (CIs) were calculated to assess the relative risk. The Benjamini and Hochberg false discovery rate method was used for multiple testing corrections [20]. A p value threshold of 0.05 was used to determine significance.
The linkage disequilibrium (LD) between SNP pairs in the genomic region of APOE- TOMM40 locus was quantified using Haploview version 4.2 software [21]. Haplotype frequencies were determined using the EM algorithm. The LD block structure was determined using the Solid Spine method provided by the Haploview 4.2. The degree of genetic linkage between the 8 SNPs in study groups was estimated as Lewontin’s coefficient D’ and Pearson’s correlation coefficient r2, where no color (D’ = 0) indicates that LD is weak or nonexistent and the black (D’ = 1) indicates that there exists strong pairwise LD between SNPs (Figure 1).
Generalized multifactor dimensionality reduction (GMDR) analysis was carried out for the detection and characterization of gene–gene interactions. This is a generalized MDR framework based on the score of a generalized linear model [22]. The best interaction model was selected on the basis of maximum testing balanced accuracy (TBA) and cross-validation consistency (CVC). Permutation (p) results were considered to be statistically significant at the 0.05 level. In our study, we used open-source software GMDR (http://sourceforge.net/projects/gmdr, accessed on 5 August 2021).
The protein–protein interaction network of the studied proteins was constructed with the online analysis tool STRING v11.0 (http://www.string-db.org/, accessed on 5 August 2021). The interactions include indirect (functional) and direct (physical) associations, which are derived from four sources: experiments, genomic context, co-expression (conserved), and previous knowledge. Biological processes, cellular components and molecular functions were analyzed by using a freely available system PANTHER (Released 2020-12-18, http://www.pantherdb.org/, accessed on 5 August 2021). STRING and GO analyses included 41 genes. SNPs of these genes were genotyped in the present study (Table 2). The analyses did not include markers from intergenic regions.

3. Results

The minor allele frequencies of 58 SNPs in LOAD patients and in controls are presented in a Table 2. Almost all the allele and genotype frequencies for the investigated loci were within the range reported in other populations of the European descent in the 1000 Genomes project (https://www.ncbi.nlm.nih.gov/variation/tools/1000genomes/, accessed on 5 August 2021), with the exception of three genetic variants. The G allele of rs2075650 at TOMM40 gene has higher frequency (28%) in the Russian population in comparison with other populations of the European origin in the 1000 Genomes project (16%). The minor allele C of rs7984606 at KLHL1 gene has lower frequency (0.4%) in the Russian population than other populations of European descent in the 1000 Genomes project (1%). The allele A of DCHS2 rs1466662 has lower frequency (35%) in the Russian population than other populations of European origin in the 1000 Genomes project (69%).
The statistical power of the study was estimated for the Genetic Association Study (GAS) Power Calculator (http://csg.sph.umich.edu/abecasis/gas_power_calculator/, accessed on 5 August 2021) assuming an odds ratio (Genotype Relative Risk) of 1.5, allele frequencies 18% in AD cases, and a significance level of 0.05. The sample size of this study was estimated to have 80% power to detect a positive association for 36 variants with 18% frequency in AD cases. Power to detect an odds ratio of 1.5 or greater was 80% for six of the nine SNPs in Table 2, showing non-significant results after correction for multiple testing (FDR) for three studied variants: FBXO40 rs3772130, CR1 rs6656401, NECTIN2 rs6859. The reporting of variants with low frequency which are not sufficiently powered is justified not only by detecting associations with the disease but by estimating allele frequencies of these markers in cases and controls belonging to specific population.
Out of the 58 markers tested, nine showed significant association to LOAD (Table 3). Two variants demonstrated marginal p values for the allele and/or genotype frequency difference between the LOAD and control subjects: FBXO40 rs3772130 and NECTIN2 rs6859. After correction for multiple testing the results remained statistically significant for five genetic variants in three genes: TOMM40 rs2075650, APOE ε4, rs429358 and rs769449, NECTIN rs6857 (Table 3). All of these markers were located in the same region of the chromosome 19.
Due to the fact that it is important for the LD analysis that the markers are in the same chromosome region, we included five SNPs in the LD analysis, which in our study were significantly associated with LOAD: TOMM40 rs2075650, APOE rs7412, rs429358 and rs769449, NECTIN rs6857. All of these markers were located in the same region of chromosome 19 (the APOE-TOMM40 region). Markers TOMM40 rs157580, NECTIN rs6859, APOC1 rs4420638 were genotyped, but were not significantly associated with the LOAD in our study. These markers were also analyzed using the LD analysis, as they were also located in the APOE-TOMM40 region of chromosome 19, like the other 5 markers. The theoretically possible number of haplotypes was 256. A total of 44 haplotypes were detected in our samples. Twenty haplotypes were revealed in the control group, while 24 haplotypes were found in the cases. Fourteen identical haplotypes were observed in both groups. Identical haplotypes and their frequencies are presented in a Table 4.
Figure 2 demonstrates haplotypes with frequencies above 1% in the case and the control groups.
The number of main haplotypes (frequency above 10%) was four in the control group and three in LOAD patients (Figure 2). Results of the analysis based on comparison of the haplotype frequencies showed that risk haplotypes ATAGACCG (χ2 = 7.313, p = 0.0068; OR 1.90, 95% CI 1.19–3.05) and ATAGACCA (χ2 = 14.589, p = 0.0001; OR 17.43, 95% CI 2.24–135.62) were significantly associated with LOAD. Moreover, we identified one protective haplotype GCAAGTCA (χ2 = 6.822, p = 0.009; OR 0.63, 95% CI 0.44–0.89) generated from these eight SNPs. All of the above associations remain significant after permutation test (p < 0.05).
The structure of LD among eight SNPs in the APOE-TOMM40 locus for the cases and controls is shown in Figure 1. Analysis of LD in cases and controls demonstrated that these groups were characterized by different structures of haplotype blocks at APOE-TOMM40 locus (Figure 1a,b). In controls, the first block consisted of four SNPs (rs6859, rs6857, rs157580, rs2075650) and the second included three SNPs (rs769449, rs429358, rs7412), while in the cases, the first block comprised four SNPs (rs6857, rs157580, rs2075650, rs769449) and the second block was made up of two closely located SNPs (rs429358, rs7412). Notably, this difference demonstrates the high level of LD in case group between rs429358 and rs7412 from the APOE gene, which are formed the APOE isoforms (ε2, ε3, ε4). The haplotype block structure for control group demonstrates the blocks that include genes: Block1-NECTIN2-TOMM40, Block2–APOE.
Although not every studied variant in genes was significantly associated with AD by univariate (case–control) analysis, there were gene–gene interactions among variants using generalized multifactor dimensionality reduction (GMDR) method. GMDR combines genotypes into “high-risk” and “low-risk” groups in order to reduce multidimensional data into only one dimension. In our study, we analyzed all possible combinations of 44 genes both in cases and in control subjects. There were three significant models as a result of the GMDR analysis: one-factor, two-factor and three-factor model (Table 5).
Among these results, we selected two models (a two-factor and a three-factor) as the best SNP combinations, based on the best Testing Balanced Accuracy (TBA) in data and their high cross-validation consistency (CVC) value. Figure 3 shows the score distributions in the best models. Seven genotype combinations with an increased risk of developing LOAD and only one combination of low-risk genotypes were found for the two-factor model (Figure 3a). The combination of low-risk genotypes showed statistically significant differences between the studied groups: TCF4 rs17594526 “CC” and APOE rs429358 “TT” (p < 0.001; OR = 0.25; 95% CI: 0.14–0.45). Fourteen genotype combinations with an increased risk of developing LOAD and five combinations of low-risk genotypes were found for the three-locus model (Figure 3b). Since we did not have all genotypes for the three-locus model in GMDR analysis, we designated the data in this model as preliminary. The one-locus model was computed for APOE rs429358 and the empirical p value for prediction error using permutation testing was 0.0006, indicating the interactions among the three variants contributed to a higher risk of AD than did single variant alone (Table 5).
In the present study, we conducted a protein–protein network interaction (PPI) search with proteins of AD, schizophrenia and cognitive performance collected via the GWAS database using STRING (v11.0). This exploratory bioinformatic analysis demonstrated a potential network of interaction between proteins connected to AD, schizophrenia and cognitive performance. PPI network analysis of 41 studied genes showed statistically significant connectivity among proteins. Interaction enrichment had p-value < 1.0 × 10−16 for our gene set. This means that such enrichment demonstrates that the proteins are at least partly biologically connected, as a group. The PPI network contained 24 nodes and 39 edges (Figure 4). The STRING indicated that the proteins formed three subnetworks of interconnected gene products (Figure 4). Each subnetwork had a central protein. The first subnetwork consisted of 10 proteins with APOE in the center: CD33, CR1, CLU, RELN, SORL1, NECTIN2 (PRVL2), TOMM40, APOC, MPC2. The second subnetwork included 6 proteins with CACNA1C as the center component: GRIN2B, TENM4, CSMD1, NT5C2, POM121L2. In addition, the third protein subgroup had 8 nodes with CTNNA2 in the center part: FBXO40, CNTN4, CTNNA2, CDON, CAMD2, TCF4, CHD6. Proteins with a large number of interactions were identified (with edges from 8 to 5). The eight genes with strong connections (“multiple interactions”) were APOE, SORL1, APOC1, CD33, CLU, TOMM40, CNTNAP2 and CACNA1C (Figure 4). The largest number of connections was established for the APOE protein (8). This large number of interactions is not surprising, given the high known effect of the APOE gene for phenotypic modifications for several medical traits [23,24,25]. In addition to proteins with a large number of interactions, there are other proteins that can connect or act as bridges between subnetworks. The four “bridge” genes were APOE, RELN, CNTNAP2 and GRIN2B. It should be noted that APOE and CNTNAP2 represent both types of genes (“multiple interactions” and “bridge”).
To evaluate molecular function, cellular component, and biological process of our gene set we used bioinformatics resource Gene Ontology (GO, http://geneontology.org, accessed on 5 August 2021) databases via PANTHER (The Protein ANalysis THrough Evolutionary Relationships, http://www.pantherdb.org, accessed on 5 August 2021). This system includes correction for multiple testing for all GO categories, thus providing statistically significant evidence for their involvement in disease susceptibility. The studied gene set was enriched in three categories: molecular function (cell adhesion molecule binding), cellular component (neuron part), and biological process (nervous system development). Pathway enrichment analysis of genes using STRING is presented in Table 6.

4. Discussion

AD is the most common form of age-related neurodegenerative disease. To study such complex diseases, it is necessary to apply various types of approaches. In this study, we performed several different analyses to excavate loci involved in development of the late onset AD in the Russian population. First of all, we focused on the potential associations of SNPs previously identified in GWAS. Since not only AD is characterized by a decrease in cognitive function, we also included GWAS loci for other reported traits in our analysis, such as schizophrenia, unipolar depression, bipolar disorder, attention deficit hyperactivity disorder, autism spectrum disorder and others.
Similar study designs have been reported in other papers for various disorders: AD and Parkinson’s disease, AD and ischemic stroke, schizophrenia and bipolar disorder, and others [26,27]. A recent study demonstrated an unbiased link between polygenic risk for schizophrenia and a lower risk of psychosis in AD [28]. The most significant associations with LOAD were observed in the APOE region on chromosome 19 (p < 5 × 10−8). Outside the APOE region, the most significant association was found in CR1 gene on chromosome 1 (chr1:207611623, rs3818361, G/A, p = 6 × 10−3). This gene is one of the ten highest risk factors for developing AD. The CR1 gene encodes the complement receptor 1 (CR1), which is one of the regulators of complement activity. The CR1 might prevent excessive complement activation. The presence of CR1 on erythrocytes is noted as an important component in protecting tissues against immune-complex deposition and following disease, such as AD [29].
After correction for multiple testing (the Benjamini and Hochberg false discovery rate method), only the APOE region genetic markers were remained statistically significant. Interestingly, according to our data, the G allele frequency (28%) of rs2075650 at TOMM40 gene in our Russian sample sets is significantly higher than in other populations not only of European descent (13–16%), but also of the Asian population (10–12%). Salakhov et al. [30] found that TOMM40 gene rs2075650 was associated with low-density lipoprotein cholesterol levels in healthy men group of the Russian population (Kemerovo, Russia), and G allele frequency in the total group was 21%. It should be noted that, most of the genetic variants contributing to risk of developing AD are located in non-coding regions and, presumably, they may influence as a controller of the functional activity of closely located genes that have an activating effect on transcription [31]. We can suggest that there may be some additional genetic signal in this locus that is impact to risk AD. The study results supplement the previous data on the AD risk development genetic markers in the Russian population [32].
Within the chromosomal region 19q12-q13.33, there is an accumulation of SNPs that are probably critical for beginning and development of AD. A huge number of publications devoted to the role of this region in the progress of AD. In addition, although APOE was identified as a susceptibility factor for LOAD over 25 years ago [33], it is still not clear how the ε4 allele contributes to disease risk. Some observations indicate an independent effect of SNPs of the NECTIN2-TOMM40-APOE locus [34], while other studies showed that there are general effects of several SNPs [35]. Our data demonstrated that in both the cases and the controls rs157580 of TOMM40 gene is in high pairwise LD with the SNPs rs2075650 and rs769449 (D’ = 100). However, LOAD-affected individuals are characterized by higher pairwise LD between neighboring SNPs compared with unaffected individuals (Figure 1). Notably, the haplotype block structure for cases demonstrates that Block2 included rs429358 and rs7412 from the APOE gene, which are formed the APOE isoforms (ε2, ε3, ε4). Our study showed that the haplotypes that include the ε3 allele of APOE and the A allele of TOMM40 gene rs2075650 are associated with reduced risk for LOAD compared with haplotypes that include the ε4 allele of APOE and the G allele of rs2075650. In a recently published study on heritability of extreme longevity in human populations, it was found that the ε4 allele of APOE is associated with substantial reduction in the chance for extreme longevity [36].
Haplotypes that showed an association with a decrease in extreme longevity in Sebastiani et al. in our study demonstrated a significant association with an increased risk of developing LOAD. The possible combinations of loci that may reduce the risk of LOAD development were obtained by researching gene–gene interactions with GMDR. Approximately two-fold reduced risk for LOAD was observed for the combination of CLU rs1532278 “CC”, TCF4 rs17594526 “CC”, APOE rs429358 “TT” genotypes in the three-locus model. In the model combining TCF4 and APOE, genotype combination CC/TT could similarly be considered to protective to LOAD. Notably, our data presented intergenic interaction between the three loci (CLU, TCF4, APOE), demonstrating no significant associations in our replicative analysis. Transcription factor 4 gene (TCF4) plays an important role in nervous system development. TCF4, a basic helix–loop–helix transcription factor, is broadly expressed and is critical for normal brain development. The gene has been identified as the cause of Pitt-Hopkins syndrome (PTHS), and it has been implicated in various other neuropsychiatric diseases, including schizophrenia, autism and depression [37,38,39].
The STRING database aims to combine all known and predicted associations between proteins, including both functional associations and physical interactions. In the present study, we used this exploratory bioinformatic analysis to predict a possible PPI network between neurological and mental diseases with using GWAS database. The network shows possible PPIs between associated genes of AD, schizophrenia and cognitive performance. A protein–protein interaction network with three subnetworks was formed for 24 proteins. A more detailed analysis of subnetworks detected the traits of protein grouping for each of them. The first group (with the APOE) is characterized by the maximum number of edges (21). Proteins of this subnetwork are related by common biological processes, such as regulation (positive and negative) of various processes and molecule transport, as well as such common molecular function as binding. There were no significant GO associations for the other protein subgroup (the CACNA1C as a center node), but these proteins were connected through publication studies with psychiatric diseases, such as schizophrenia, bipolar disorder, depression, and others [40,41,42]. The grouping trait in the third subnetwork (with the CNTNAP2 in the center) is the biological processes related with the neuron development and differentiation, as well as with the regulation of these processes. In addition, all these three subsystems are connected together by the Reelin as a bridge. Reelin is a large signaling protein that is engaged in a cascade of cytoplasmic events that control the migration of neurons during brain development and it is essential for the correct development and plasticity of the cerebral cortex and regulate synaptic plasticity, neurotransmission and memory in the adult brain. Presumably, Reelin plays a significant role both in AD and in psychiatric disorders [43,44,45].
The PPI networks have limited clinical predictive value, but discover hypothetical new paths of interaction. The forecasted PPI network needs careful interpretation and warrants preclinical and clinical validations. Nevertheless, this network gives new facts that will be helpful to new studies of general molecular background between neurological and mental diseases.

5. Conclusions

In summary, genes and proteins affecting cognitive function are of special interest in the search of genetic composition of AD. The vast majority of cases of AD are viewed as sporadic results from the complex interaction of genetic risk factors and unknown environmental conditions. The present study confirmed the importance of the APOE-TOMM40 locus as the main risk locus of development and progress of LOAD. Association analysis and bioinformatics approaches detected interactions both at the association level of single SNPs and at the level of genes and proteins. Strong evidence for association with LOAD was seen for the APOE-4 polymorphism, as well as for rs429358, rs769449 and rs2075650, using various approaches: replicative association analysis, LD analysis, GMDR analysis and protein–protein interaction analysis.
Currently, priority approaches are aiming not only to search for genes, but also to establish their functional effect. This provides the possibility of using the progress made in identifying the biological effects of risk alleles to recognize new protective mechanisms against more rapid cognitive decline in AD. Identifying biological pathways that increase risk may be useful in the future for the treatment of cognitive and behavioral disorders. Nevertheless, the current findings are a step forward in in this direction. We believe that the identification of additional risk loci would be useful from the development of predictive approaches of AD.

Author Contributions

A.B., K.V., A.M. and V.S. contributed conception and design of the study; N.Z., I.Z., O.M. and L.M. organized the database; A.B. performed the statistical analysis; A.B. wrote the first draft of the manuscript; V.S. reviewed and corrected draft of the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Russian Science Foundation, grant number 16-14-00020.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki, and approved by the Scientific Ethics Committee of the Research Institute of Medical Genetics (Tomsk National Research Medical Center of the Russian Academy of Sciences) (protocol number 2017/108).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Alzheimer’s Association Report. Alzheimer’s disease facts and figures. Alzheimer’s Dement. 2019, 15, 321–387. Available online: https://alz-journals.onlinelibrary.wiley.com/doi/abs/10.1016/j.jalz.2019.01.010 (accessed on 6 August 2021).
  2. Patterson, C. World Alzheimer Report. In The State of the Art of Dementia Research: New Frontiers; Alzheimer’s Disease International: London, UK, 2018; Available online: https://www.alz.co.uk/research/WorldAlzheimerReport2018.pdf (accessed on 21 September 2018).
  3. Sosa-Ortiz, A.L.; Acosta-Castillo, I.; Prince, M.J. Epidemiology of dementias and Alzheimer’s disease. Arch. Med. Res. 2012, 43, 600–608. [Google Scholar] [CrossRef]
  4. Alzheimer’s Association Report. Alzheimer’s disease facts and figures. Alzheimer’s Dement. 2012, 8, 131–168. Available online: https://www.sciencedirect.com/science/article/abs/pii/S1552526012000325 (accessed on 6 August 2021).
  5. Gatz, M.; Reynolds, C.A.; Fratiglioni, L.; Johansson, B.; Mortimer, J.A.; Berg, S.; Fiske, A.; Pedersen, N.L. Role of genes and environments for explaining Alzheimer disease. Arch. Gen. Psychiatry 2006, 63, 168–174. [Google Scholar] [CrossRef]
  6. Prestia, A.; Boccardi, M.; Galluzzi, S.; Cavedo, E.; Adorni, A.; Soricelli, A.; Bonetti, M.; Geroldi, C.; Giannakopoulos, P.; Thompson, P.; et al. Hippocampal and amygdalar volume changes in elderly patients with Alzheimer’s disease and schizophrenia. Psychiatry Res. 2011, 192, 77–83. [Google Scholar] [CrossRef] [PubMed]
  7. Horesh, Y.; Katsel, P.; Haroutunian, V.; Domany, E. Gene expression signature is shared by patients with Alzheimer’s disease and schizophrenia at the superior temporal gyrus. Eur. J. Neurol. 2011, 18, 410–424. [Google Scholar] [CrossRef] [PubMed]
  8. Lambert, J.C.; Heath, S.; Even, G.; Campion, D.; Sleegers, K.; Hiltunen, M.; Combarros, O.; Zelenika, D.; Bullido, M.J.; Tavernier, B.; et al. Genome-wide association study identifies variants at CLU and CR1 associated with Alzheimer’s disease. Nat. Genet. 2009, 41, 1094–1099. [Google Scholar] [CrossRef] [PubMed]
  9. Harold, D.; Abraham, R.; Hollingworth, P.; Sims, R.; Gerrish, A.; Hamshere, M.L.; Pahwa, J.S.; Moskvina, V.; Dowzell, K.; Williams, A.; et al. Genome-wide association study identifies variants at CLU and PICALM associated with Alzheimer’s diseases. Nat. Genet. 2009, 41, 1088–1093. [Google Scholar] [CrossRef] [Green Version]
  10. Logue, M.W.; Schu, M.; Vardarajan, B.N.; Buros, J.; Green, R.C.; Go, R.C.P.; Griffith, P.; Obisesan, T.O.; Shatz, R.; Borenstein, A.; et al. A comprehensive genetic association study of Alzheimer disease in African Americans. Arch. Neurol. 2011, 68, 1569–1579. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  11. Cruchaga, C.; Haller, G.; Chakraverty, S.; Mayo, K.; Vallania, F.L.; Mitra, R.D.; Faber, K.; Williamson, J.; Bird, T.; Diaz-Arrastia, R.; et al. Rare variants in APP, PSEN1 and PSEN2 increase risk for AD in late-onset Alzheimer’s disease families. PLoS ONE 2012, 7, e31039. [Google Scholar] [CrossRef]
  12. Holmans, P.; Green, E.K.; Pahwa, J.S.; Ferreira, M.A.; Purcell, S.M.; Sklar, P.; Owen, M.J.; O’Donovan, M.C.; Craddock, N.; Wellcome Trust Case-Control Consortium. Gene ontology analysis of GWA study data sets provides insights into the biology of bipolar disorder. Am. J. Hum. Genet. 2009, 85, 13–24. [Google Scholar] [CrossRef] [Green Version]
  13. Kanehisa, M.; Goto, S.; Hattori, M.; Aoki-Kinoshita, K.F.; Itoh, M.; Kawashima, S.; Katayama, T.; Araki, M.; Hirakawa, M. From genomics to chemical genomics: New developments in KEGG. Nucleic Acids Res. 2006, 34, D354–D357. [Google Scholar] [CrossRef] [PubMed]
  14. Harris, M.A.; Clark, J.; Ireland, A.; Lomax, J.; Ashburner, M.; Foulger, R.; Eilbeck, K.; Lewis, S.; Marshall, B.; Mungall, C.; et al. The Gene Ontology (GO) database and informatics resource. Nucleic Acids Res. 2004, 32, D258–D261. [Google Scholar] [PubMed] [Green Version]
  15. Szklarczyk, D.; Franceschini, A.; Wyder, S.; Forslund, K.; Heller, D.; Huerta-Cepas, J.; Simonovic, M.; Roth, A.; Santos, A.; Tsafou, K.P.; et al. STRING v10: Protein–protein interaction networks, integrated over the tree of life. Nucleic Acids Res. 2015, 43, 447–452. [Google Scholar] [CrossRef]
  16. Warde-Farley, D.; Donaldson, S.L.; Comes, O.; Zuberi, K.; Badrawi, R.; Chao, P.; Franz, M.; Grouios, C.; Kazi, F.; Lopes, C.T.; et al. The GeneMANIA prediction server: Biological network integration for gene prioritization and predicting gene function. Nucleic Acids Res. 2010, 38, W214–W220. [Google Scholar] [CrossRef]
  17. Bocharova, A.V.; Stepanov, V.A. Genetic diversity of populations in North Eurasia by genetic markers associated with diseases impairing human cognitive functions. Russ. J. Genet. 2021, 57, 1082–1091. [Google Scholar] [CrossRef]
  18. American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders, 4th ed.; American Psychiatric Association: Washington, DC, USA, 2010. [Google Scholar]
  19. McKhann, G.; Drachman, D.; Folstein, M.; Katzman, R.; Price, D.; Stadlan, E.M. Clinical diagnosis of Alzheimer’s disease: Report of the NINCDS-ADRDA Work Group under the auspices of Department of Health and Human Services Task Force on Alzheimer’s Disease. Neurology 1984, 34, 939–944. [Google Scholar] [CrossRef] [Green Version]
  20. Benjamini, Y.; Hochberg, Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B 1995, 57, 289–300. [Google Scholar] [CrossRef]
  21. Barrett, J.C.; Fry, B.; Maller, J.; Daly, M.J. Haploview: Analysis and visualization of LD and haplotype maps. Bioinformatics. 2005, 21, 263–265. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Chen, G.B.; Xu, Y.; Xu, H.M.; Li, M.D.; Zhu, J.; Lou, X.Y. Practical and theoretical considerations in study design for detecting gene-gene interactions using MDR and GMDR approaches. PLoS ONE 2011, 6, e16981. [Google Scholar] [CrossRef] [PubMed]
  23. Lipnicki, D.M.; Crawford, J.; Kochan, N.A.; Trollor, J.N.; Draper, B.; Reppermund, S.; Maston, K.; Mather, K.A.; Brodaty, H.; Sachdev, P.S.; et al. Risk factors for mild cognitive impairment, dementia and mortality: The Sydney Memory and Ageing Study. J. Am. Med. Dir. Assoc. 2017, 18, 388–395. [Google Scholar] [CrossRef] [PubMed]
  24. Martínez-Martínez, A.B.; Torres-Perez, E.; Devanney, N.; Del Moral, R.; Johnson, L.A.; Arbones-Mainar, J.M. Beyond the CNS: The many peripheral roles of APOE. Neurobiol. Dis. 2020, 138, 104809. [Google Scholar] [CrossRef] [PubMed]
  25. Cai, C.; Wen, Z.; Li, L. The relationship between ApoE gene polymorphism and the efficacy of statins controlling hyperlipidemia. Am. J. Transl. Res. 2021, 13, 6772–6777. [Google Scholar] [PubMed]
  26. Liu, G.; Bao, X.; Jiang, Y.; Liao, M.; Jiang, Q.; Feng, R.; Zhang, L.; Ma, G.; Chen, Z.; Wang, G.; et al. Identifying the Association Between Alzheimer’s Disease and Parkinson’s Disease Using Genome-Wide Association Studies and Protein-Protein Interaction Network. Mol. Neurobiol. 2015, 52, 1629–1636. [Google Scholar] [CrossRef] [PubMed]
  27. Wei, C.J.; Cui, P.; Li, H.; Lang, W.J.; Liu, G.Y.; Ma, X.F. Shared genes between Alzheimer’s disease and ischemic stroke. CNS Neurosci. Ther. 2019, 25, 855–864. [Google Scholar] [CrossRef] [PubMed]
  28. DeMichele-Sweet, M.A.A.; Weamer, E.A.; Klei, L.; Vrana, D.T.; Hollingshead, D.J.; Seltman, H.J.; Sims, R.; Foroud, T.; Hernandez, I.; Moreno-Grau, S.; et al. Genetic risk for schizophrenia and psychosis in Alzheimer disease. Mol. Psychiatry 2018, 23, 963–972. [Google Scholar] [CrossRef]
  29. Johansson, J.U.; Brubaker, W.D.; Javitz, H.; Bergen, A.W.; Nishita, D.; Trigunaite, A.; Crane, A.; Ceballos, J.; Mastroeni, D.; Tenner, A.J.; et al. Peripheral complement interactions with amyloid β peptide in Alzheimer’s disease: Polymorphisms, structure, and function of complement receptor 1. Alzheimers Dement. 2018, 14, 1438–1449. [Google Scholar] [CrossRef]
  30. Salakhov, R.R.; Goncharova, I.A.; Makeeva, O.A.; Golubenko, M.V.; Kulish, E.V.; Kashtalap, V.V.; Barbarash, O.L.; Puzyrev, V.P. TOMM40 Gene Polymorphisms Association with Lipid Profile. Genetika 2014, 50, 222–229. [Google Scholar] [CrossRef]
  31. Jansen, I.E.; Savage, J.E.; Watanabe, K.; Bryois, J.; Williams, D.M.; Steinberg, S.; Sealock, J.; Karlsson, I.K.; Hägg, S.; Athanasiu, L.; et al. Genome-wide meta-analysis identifies new loci and functional pathways influencing Alzheimer’s disease risk. Nat. Genet. 2019, 51, 404–413. [Google Scholar] [CrossRef]
  32. Stepanov, V.A.; Bocharova, A.V.; Marusin, A.V.; Zhukova, N.G.; Alifirova, V.M.; Zhukova, I.A. Replicative association analysis of genetic markers of cognitive traits with Alzheimer’s disease in the Russian population. Mol. Biol. 2014, 48, 952–962. [Google Scholar] [CrossRef]
  33. Corder, E.H.; Saunders, A.M.; Strittmatter, W.J.; Schmechel, D.E.; Gaskell, P.C.; Small, G.W.; Roses, A.D.; Haines, J.L.; Pericak-Vance, M.A. Gene dose of apolipoprotein E type 4 allele and the risk of Alzheimer’s disease in late onset families. Science 1993, 261, 921–923. [Google Scholar] [CrossRef]
  34. Arpawong, T.E.; Pendleton, N.; Mekli, K.; McArdle, J.J.; Gatz, M.; Armoskus, C.; Knowles, J.A.; Prescott, C.A. Genetic variants specific to aging-related verbal memory: Insights from GWASs in a population-based cohort. PLoS ONE 2017, 12, e0182448. [Google Scholar] [CrossRef]
  35. Roses, A.D.; Lutz, M.W.; Amrine-Madsen, H.; Saunders, A.M.; Crenshaw, D.G.; Sundseth, S.S.; Huentelman, M.J.; Welsh-Bohmer, K.A.; Reiman, E.M. A TOMM40 variable-length polymorphism predicts the age of late-onset Alzheimer’s disease. Pharmacogenomics J. 2010, 10, 375–384. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. Sebastiani, P.; Gurinovich, A.; Nygaard, M.; Sasaki, T.; Sweigart, B.; Bae, H.; Andersen, S.L.; Villa, F.; Atzmon, G.; Christensen, K.; et al. APOE Alleles and Extreme Human Longevity. J. Gerontol. Ser. A Biol. Sci. Med. Sci. 2019, 74, 44–51. [Google Scholar] [CrossRef] [Green Version]
  37. Forrest, M.; Chapman, R.M.; Doyle, A.M.; Tinsley, C.L.; Waite, A.; Blake, D.J. Functional analysis of TCF4 missense mutations that cause Pitt–Hopkins syndrome. Hum. Mutat. 2012, 33, 1676–1686. [Google Scholar] [CrossRef] [PubMed]
  38. Stefansson, H.; Ophoff, R.A.; Steinberg, S.; Andreassen, O.A.; Cichon, S.; Rujescu, D.; Werge, T.; Pietiläinen, O.P.; Mors, O.; Mortensen, P.B.; et al. Common variants conferring risk of schizophrenia. Nature 2009, 460, 744–747. [Google Scholar] [CrossRef] [Green Version]
  39. Wirgenes, K.V.; Sønderby, I.E.; Haukvik, U.K.; Mattingsdal, M.; Tesli, M.; Athanasiu, L.; Sundet, K.; Røssberg, J.I.; Dale, A.M.; Brown, A.A.; et al. TCF4 sequence variants and mRNA levels are associated with neurodevelopmental characteristics in psychotic disorders. Transl. Psychiatry 2012, 2, e112. [Google Scholar] [CrossRef] [PubMed]
  40. Eckart, N.; Song, Q.; Yang, R.; Wang, R.; Zhu, H.; McCallion, A.S.; Avramopoulos, D. Functional Characterization of Schizophrenia-Associated Variation in CACNA1C. PLoS ONE 2016, 11, e0157086. [Google Scholar] [CrossRef] [Green Version]
  41. Green, E.K.; Grozeva, D.; Jones, I.; Jones, L.; Kirov, G.; Caesar, S.; Gordon-Smith, K.; Fraser, C.; Forty, L.; Russell, E.; et al. The bipolar disorder risk allele at CACNA1C also confers risk of recurrent major depression and of schizophrenia. Mol. Psychiatry 2010, 15, 1016–1022. [Google Scholar] [CrossRef] [Green Version]
  42. Cross-Disorder Group of the Psychiatric Genomics Consortium. Identification of risk loci with shared effects on five major psychiatric disorders: A genome-wide analysis. Lancet 2013, 381, 1371–1379. [Google Scholar] [CrossRef]
  43. Ishii, K.; Kubo, K.I.; Nakajima, K. Reelin and neuropsychiatric disorders. Front. Cell. Neurosci. 2016, 10, 229. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Ovadia, G.; Shifman, S. The genetic variation of RELN expression in schizophrenia and bipolar disorder. PLoS ONE 2011, 6, e19955. [Google Scholar] [CrossRef]
  45. Bufill, E.; Roura-Poch, P.; Sala-Matavera, I.; Antón, S.; Lleó, A.; Sánchez-Saudinós, B.; Tomàs-Abadal, L.; Puig, T.; Abós, J.; Bernades, S.; et al. Reelin signaling pathway genotypes and Alzheimer disease in a Spanish population. Alzheimer Dis. Assoc. Disord. 2015, 29, 169–172. [Google Scholar] [CrossRef]
Figure 1. The structure of linkage disequilibrium in the genomic region of the NECTIN2-TOMM40-APOE locus: (a) tThe structure of LD for the cases; (b) the structure of LD for the controls. Linkage disequilibrium was measured by the D’ statistic: black—a strong link (D’ = 1, LOD > 2), grey—a significant link (D’ < 1, LOD > 2), white—poor link (D’ < 1, LOD < 2). A D’ value of 100 indicates a complete LD between 2 markers.
Figure 1. The structure of linkage disequilibrium in the genomic region of the NECTIN2-TOMM40-APOE locus: (a) tThe structure of LD for the cases; (b) the structure of LD for the controls. Linkage disequilibrium was measured by the D’ statistic: black—a strong link (D’ = 1, LOD > 2), grey—a significant link (D’ < 1, LOD > 2), white—poor link (D’ < 1, LOD < 2). A D’ value of 100 indicates a complete LD between 2 markers.
Genes 12 01647 g001
Figure 2. Haplotype frequencies for the LOAD patients and the control group. * Statistically significant differences between the cases and the controls (p < 0.05).
Figure 2. Haplotype frequencies for the LOAD patients and the control group. * Statistically significant differences between the cases and the controls (p < 0.05).
Genes 12 01647 g002
Figure 3. Gene–gene interactions among TCF4 (rs17594526), APOE (rs429358) and CLU (rs1532278) loci in AD and healthy subjects: (a) two-factor model; (b) three-factor model. The score distribution of AD subjects (left black bar in boxes) and control subjects (right black bar in boxes) is shown for each genotype combination. High-risk genotype combinations are represented by dark gray shade cells, while light gray shade cells represent low-risk genotype combinations. Cells with no shading or white cells represent genotype combination for which no data is observed.
Figure 3. Gene–gene interactions among TCF4 (rs17594526), APOE (rs429358) and CLU (rs1532278) loci in AD and healthy subjects: (a) two-factor model; (b) three-factor model. The score distribution of AD subjects (left black bar in boxes) and control subjects (right black bar in boxes) is shown for each genotype combination. High-risk genotype combinations are represented by dark gray shade cells, while light gray shade cells represent low-risk genotype combinations. Cells with no shading or white cells represent genotype combination for which no data is observed.
Genes 12 01647 g003
Figure 4. Protein–protein interaction network of studied genes. The nodes and edges represent the proteins (genes) and their interactions, respectively. The PPI network contained 24 nodes and 39 edges. The STRING indicated that the proteins formed three subnetworks of interconnected gene products. Each subnetwork had a central protein: APOE, CACNA1C, CTNNA2. All these three subsystems are connected together by RELN as a bridge.
Figure 4. Protein–protein interaction network of studied genes. The nodes and edges represent the proteins (genes) and their interactions, respectively. The PPI network contained 24 nodes and 39 edges. The STRING indicated that the proteins formed three subnetworks of interconnected gene products. Each subnetwork had a central protein: APOE, CACNA1C, CTNNA2. All these three subsystems are connected together by RELN as a bridge.
Genes 12 01647 g004
Table 1. Demographic parameters of the studied groups.
Table 1. Demographic parameters of the studied groups.
ParameterPatients with LOAD,
n = 185
Control Group,
n = 287
Gender: Women120 (64.86%)200 (69.69%)
Gender: Men65 (35.14%)87 (30.31%)
Mean age72.15 ± 7.8771.8 ± 5.70
RaceCaucasoidCaucasoid
PopulationRussiansRussians
Table 2. Frequencies of the 58 SNPs analyzed for LOAD.
Table 2. Frequencies of the 58 SNPs analyzed for LOAD.
Gene.SNP IDAllele aFunctional Consequence
(NCBI)
Position (GRCh38)Minor Allele FrequencyMAF
CasesControlsHapMap
CHD6rs1010304G/Aintron variant20:414730070.060.050.06
NCAPD3rs1031381T/Cintron variant11:1342187880.400.430.42
MPC2rs10489202T/Gintron variant1:1679338410.210.200.22
CSMD1rs10503253A/Cintron variant8:43233220.230.270.26
CCDC60rs11064768G/Aintron variant12:1193807040.080.070.07
NT5C2rs11191580C/Tintron variant10:1031464540.090.090.11
LOC105378889-PRMT6rs12125971T/Cintergenic variant1:1069210210.080.080.07
LOC101928778-LOC105371627rs12140439A/Cintergenic variant1:1777537720.290.300.34
TENM4rs12290811A/Tintron variant11:793725760.120.120.14
LUZP2rs12361953G/Tintron variant11:245895840.140.140.15
CADM2rs12494658C/Tintron variant3:858253260.240.300.28
SNX29rs12922317G/Aintron variant16:119837750.400.350.35
LOC105373605rs12989701A/Cintron variant2:1271304090.100.150.15
CTNNA2rs13034462G/Tintron variant2:798923680.030.040.05
BRD1rs138880C/Aintron variant22:498249630.210.210.23
DCHS2rs1466662A/Tintron variant4:1544262410.350.350.69
CLUrs1532278T/Cintron variant8:276087980.440.390.38
TOMM40rs157580G/Aintron variant19:448920090.340.360.35
LOC730100rs1606974A/Gintron variant2:516464610.080.060.07
NKAPLrs1635T/Gmissense variant6:282598260.070.060.05
LSM1rs16887244G/Aintron variant8:381738270.230.210.21
POM121L2rs16897515A/Cmissense variant6:273102410.090.110.12
CNTN4rs17194490T/Gintron variant3:25061020.180.150.14
ARHGAP31rs17203055G/Aintron variant3:1193654840.110.100.12
TCF4rs17512836C/Tintron variant18:555277300.0080.020.05
CADM2rs17518584C/Tintron variant3:855557730.340.320.30
TCF4rs17594526T/Cintron variant18:553910070.010.020.05
GPR89P-
TRV-AAC1-5
rs17693963C/Aintergenic variant6:277423860.060.060.08
TOMM40rs2075650G/Aintron variant19:448923620.260.280.16
CLUrs2279590T/Cintron variant8:275987360.440.400.39
ZNF365rs2393895C/Aintron variant10:625790870.230.230.20
CSMD1rs2616984G/Aintron variant8:46256190.330.310.29
DNAH11rs368331G/Aintron variant7:217033560.070.060.06
FBXO40rs3772130G/Aintron variant3:1216252930.200.250.25
STK24rs3783006C/Gintron variant13:984589550.460.480.47
CR1rs3818361A/Gintron variant1:2076116230.330.250.25
CD33rs3826656G/Aintron variant19:512233570.240.240.23
APOErs429358C/Tmissense variant19:449086840.220.100.13
ACSM1rs433598T/Cintron variant16:206688840.350.370.34
APOC1rs4420638G/A500B Downstream Variant19:449196890.180.140.18
CNTN4rs4629318A/Gintron variant3:28515900.150.150.14
CDONrs472926G/Aintron variant11:1260353630.160.170.17
CACNA1Crs4765905C/Gintron variant12:22404180.320.370.37
TENM4rs530965T/Cintron variant11:793540560.490.470.49
CSMD2rs544991T/Cintron variant1:337238290.310.270.31
PICALM-
RNU6-560P
rs561655G/Aintergenic variant11:860892370.300.350.33
CHD6rs6129846T/Cintron variant20:414786740.060.050.06
NRXN3rs6574433G/Aintron variant14:783198160.450.420.42
CR1rs6656401A/Gintron variant1:2075187040.310.230.24
NECTIN2rs6857T/C3 Prime UTR Variant19:448889970.280.190.17
NECTIN2rs6859A/Gintron variant19:448787770.500.420.44
LOC105375630rs7004633G/Aintron variant8:887480820.210.190.20
RELNrs7341475A/Gintron variant7:1037643680.160.190.16
LOC105373605rs7561528A/Gintron variant2:1271320610.310.310.34
APOErs769449A/Gintron variant19:449067450.170.080.10
KLHL1rs7984606C/Aintron variant13:698815290.0030.0040.05
NKAIN2rs9491140T/Cintron variant6:1243700910.320.320.31
APOErs7412T/Cmissense variant19:449088220.080.070.07
a Minor allele/major allele.
Table 3. Association of statistically significant genetic markers with LOAD in the Russian population.
Table 3. Association of statistically significant genetic markers with LOAD in the Russian population.
GeneVariantMinor AlleleOR (95% CI)Major AlleleOR (95% CI)p Valuep Value Corrected
APOE4 *
(ε3/ε4)
rs429358
rs7412
ε42.88 (1.95–4.24)ε30.35 (0.24–0.51)5 × 10−88.62 × 10−4
APOE * rs769449A2.44 (1.62–3.66)G0.41 (0.27–0.62)1 × 10−52.5 × 10−3
APOE * rs429358C2.53 (1.75–3.67)T0.39 (0.27–0.57)5 × 10−71.7 × 10−3
TOMM40 * rs2075650G1.67 (1.21–2.29)A0.60 (0.44–0.82)2 × 10−33.4 × 10−3
NECTIN2 * rs6857T1.65 (1.21–2.26)C0.61 (0.44–0.83)2 × 10−34.3 × 10−3
CR1rs3818361A1.5 (1.12–2.00)G0.67 (0.5–0.89)6 × 10−35.2 × 10−3
CR1rs6656401A1.46 (1.09–1.96)G0.68 (0.51–0.92)0.016 × 10−3
NECTIN2rs6859A1.34 (1.03–1.74)G0.75 (0.57–0.97)0.036.8 × 10−3
FBXO40rs3772130G0.72 (0.52–0.99)A1.39 (1.01–1.91)0.047.7 × 10−3
OR odds ratio, CI confidence interval, p values corrected based on Benjamini and Hochberg method, * Statistically significant results after multiple comparison correction.
Table 4. Identical haplotypes identified in both groups.
Table 4. Identical haplotypes identified in both groups.
Nrs, GeneHaplotype Frequencies (Cases/Controls)
%
rs6859
NECTIN2
rs6857
NECTIN2
rs157580
TOMM40
rs2075650
TOMM40
rs769449
APOE
rs429358
APOE
rs7412
APOE
rs4420638
APOC1
1GCGAGTCA21.4/26.3
2ACAAGTCA15.7/16.4
3GCAAGTCA14.0/20.9
4ATAGACCG11.0/6.4
5ATAGGTCA9.1/9.9
6ACGAGTCA7.1/5.5
7GCGAGTTA5.0/3.1
8GCAAGTTA2.8/3.0
9ACAAGTCG2.3/2.7
10GTAGACCG1.4/1.0
11GTAAGCCG1.1/0.6
12GCAAACCG0.7/0.6
13ATAAGCCG0.7/0.8
14GCAAGTCG0.5/1.0
Table 5. Gene–gene interaction models by GMDR analysis.
Table 5. Gene–gene interaction models by GMDR analysis.
Best Interaction ModelTBA #CVCp ValueOR # (95% CI) #
APOE0.596/100.00062.81 (1.54–5.13)
TCF4, APOE0.6610/10<0.00014.04 (2.27–7.17)
CLU, TCF4, APOE0.6610/10<0.00015.71 (3.03–10.78)
CVC—cross validation consistency, TBA—testing balanced accuracy, # Values rounded up to 2 decimal places.
Table 6. Significantly enriched pathways of studied genes.
Table 6. Significantly enriched pathways of studied genes.
Pathway IDPathway DescriptionObserved Gene CountFalse Discovery RateMatching Proteins in Your Network (Labels)
GO:0022008neurogenesis150.00028APOE,CDON,CLU,CNTN4,CNTNAP2,CTNNA2,KLHL1,LSM1,NRXN3,RELN,SORL1,STK24,TCF4,TENM4,ZNF365
GO:0007155cell adhesion110.00036CADM2,CD33,CDON,CNTN4,CNTNAP2,CTNNA2,DCHS2,NRXN3,PVRL2,RELN,TENM4
GO:0007399nervous system development170.00036APOE,CDON,CLU,CNTN4,CNTNAP2,CTNNA2,DNAH11,GRIN2B,KLHL1,LSM1,NRXN3,RELN,SORL1,STK24,TCF4,TENM4,ZNF365
GO:0048699generation of neurons140.00036APOE,CDON,CNTN4,CNTNAP2,CTNNA2,KLHL1,LSM1,NRXN3,RELN,SORL1,STK24,TCF4,TENM4,ZNF365
GO:0032989cellular component morphogenesis100.00044CLU,CNTN4,CNTNAP2,CTNNA2,NRXN3,PVRL2,RELN,STK24,TENM4,ZNF365
GO:0050767regulation of neurogenesis100.00044APOE,CDON,CNTN4,LSM1,RELN,SORL1,STK24,TCF4,TENM4,ZNF365
GO:0048666neuron development100.00045APOE,CNTN4,CNTNAP2,CTNNA2,KLHL1,NRXN3,RELN,STK24,TENM4,ZNF365
GO:0031175neuron projection development90.00058APOE,CNTN4,CNTNAP2,CTNNA2,KLHL1,NRXN3,RELN,STK24,ZNF365
GO:0000902cell morphogenesis90.00061CLU,CNTN4,CNTNAP2,CTNNA2,NRXN3,RELN,STK24,TENM4,ZNF365
GO:0032990cell part morphogenesis80.00064CNTN4,CNTNAP2,CTNNA2,NRXN3,PVRL2,RELN,STK24,ZNF365
GO:0007417central nervous system development100.00084CDON,CLU,CNTN4,CNTNAP2,CTNNA2,GRIN2B,KLHL1,RELN,TENM4,ZNF365
GO:0007611learning or memory60.00089APOE,CNTNAP2,DNAH11,GRIN2B,NRXN3,RELN
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Bocharova, A.; Vagaitseva, K.; Marusin, A.; Zhukova, N.; Zhukova, I.; Minaycheva, L.; Makeeva, O.; Stepanov, V. Association and Gene–Gene Interactions Study of Late-Onset Alzheimer’s Disease in the Russian Population. Genes 2021, 12, 1647. https://doi.org/10.3390/genes12101647

AMA Style

Bocharova A, Vagaitseva K, Marusin A, Zhukova N, Zhukova I, Minaycheva L, Makeeva O, Stepanov V. Association and Gene–Gene Interactions Study of Late-Onset Alzheimer’s Disease in the Russian Population. Genes. 2021; 12(10):1647. https://doi.org/10.3390/genes12101647

Chicago/Turabian Style

Bocharova, Anna, Kseniya Vagaitseva, Andrey Marusin, Natalia Zhukova, Irina Zhukova, Larisa Minaycheva, Oksana Makeeva, and Vadim Stepanov. 2021. "Association and Gene–Gene Interactions Study of Late-Onset Alzheimer’s Disease in the Russian Population" Genes 12, no. 10: 1647. https://doi.org/10.3390/genes12101647

APA Style

Bocharova, A., Vagaitseva, K., Marusin, A., Zhukova, N., Zhukova, I., Minaycheva, L., Makeeva, O., & Stepanov, V. (2021). Association and Gene–Gene Interactions Study of Late-Onset Alzheimer’s Disease in the Russian Population. Genes, 12(10), 1647. https://doi.org/10.3390/genes12101647

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop