Next Article in Journal
Renin–Angiotensin–Aldosterone System: Friend or Foe—The Matter of Balance. Insight on History, Therapeutic Implications and COVID-19 Interactions
Next Article in Special Issue
Molecular Analysis of ZNF71 KRAB in Non-Small-Cell Lung Cancer
Previous Article in Journal
In Silico Analysis of Huntingtin Homologs in Lower Eukaryotes
Previous Article in Special Issue
Preclinical Evaluation of [18F]FACH in Healthy Mice and Piglets: An 18F-Labeled Ligand for Imaging of Monocarboxylate Transporters with PET
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Integrative Transcriptome-Wide Analysis of Amyotrophic Lateral Sclerosis for the Identification of Potential Genetic Markers and Drug Candidates

1
Department of Computer Engineering, Dongguk University, Seoul 04620, Korea
2
Department of Life Science, Dongguk University, Seoul 04620, Korea
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2021, 22(6), 3216; https://doi.org/10.3390/ijms22063216
Submission received: 4 March 2021 / Revised: 18 March 2021 / Accepted: 19 March 2021 / Published: 22 March 2021
(This article belongs to the Special Issue Molecular Biomarkers in Cancer and Metabolic Disease)

Abstract

:
Amyotrophic lateral sclerosis (ALS) is a neurodegenerative neuromuscular disease. Although genome-wide association studies (GWAS) have successfully identified many variants significantly associated with ALS, it is still difficult to characterize the underlying biological mechanisms inducing ALS. In this study, we performed a transcriptome-wide association study (TWAS) to identify disease-specific genes in ALS. Using the largest ALS GWAS summary statistic (n = 80,610), we identified seven novel genes using 19 tissue reference panels. We conducted a conditional analysis to verify the genes’ independence and to confirm that they are driven by genetically regulated expressions. Furthermore, we performed a TWAS-based enrichment analysis to highlight the association of important biological pathways, one in each of the four tissue reference panels. Finally, utilizing a connectivity map, a database of human cell expression profiles cultured with bioactive small molecules, we discovered functional associations between genes and drugs to identify 15 bioactive small molecules as potential drug candidates for ALS. We believe that, by integrating the largest ALS GWAS summary statistic with gene expression to identify new risk loci and causal genes, our study provides strong candidates for molecular basis experiments in ALS.

1. Introduction

Amyotrophic lateral sclerosis (ALS), also known as Lou Gehrig’s disease, affects nerve cells in the brain and spinal cord, causing motor neurons to mutate and disappear selectively and progressively. The disorder causes muscle weakness, such as myocardial lateral sclerosis and severe myocardial dysfunction, for which the causes are unclear and no definitive diagnostic tests exist so far [1]. The annual incidence of ALS is about 2 per 100,000 persons and the incidence rises considerably in people above mid-age [2].
Many studies have been conducted to identify the genetic mutations causing ALS, not only to understand the disease mechanisms but also to facilitate the design of disease modeling and treatment [3,4]. Although genome-wide association studies (GWAS) have successfully identified many variants significantly associated with ALS, it is often difficult to characterize the underlying biological mechanisms inducing ALS. Furthermore, the actual effects of loci may not completely be described because of complex factors such as linkage disequilibrium (LD) between the variants and differences in the amount of gene expression (GE) of particular genes in various tissues [5,6,7]. In order to understand the underlying biology, researchers have used expression quantitative trait loci (eQTL) studies, in which correlations between single nucleotide polymorphisms (SNPs) and GE have been studied to discover disease mechanisms at the gene level [8]. Unfortunately, not many expression-trait associations have been identified, given the sparsity of specimen availability and the prohibitive cost, especially for genes with small effect sizes.
In recent times, transcriptome-wide association study (TWAS) has emerged as a new technology to identify functionally relevant genes whose expression levels are associated with a trait of interest. Instead of measuring GE levels directly from samples, TWAS utilizes reference panels to learn a predictive model on an eQTL dataset and impute GE levels into a larger GWAS [9]. Using large-scale GWAS statistics, several TWAS successfully identified genes, even those with small effect sizes, that are significantly associated with complex traits, by increasing the statistical power in the analysis [10,11]. Several TWAS tools have been proposed, including FUSION, which is one of the representative tools [12,13,14]. To increase the power of analysis, FUSION provides reference panels to impute gene expression, which are used to calculate the causal relationship between SNPs and GE within specific tissues. In addition, in order to more accurately prioritize causal genes taking into account cis-genetic expression factors in GWAS summary statistics, FUSION analyzes by adding an LD matrix containing LD information between SNPs [15].
In this study, we used FUSION to perform TWAS on ALS and analyzed the largest ALS cohort ever to identify the candidate genes associated with ALS. The cohort consists of 20,806 ALS cases and 59,804 controls from Europe. Brain and blood-related panels from Genotype-Tissue Expression project v7 (GTEx v7) have been used as reference panels. From the analysis, we identified 27 significantly associated genes (20 genes are reported elsewhere and seven novel genes), showing that they are indirectly involved in the biological mechanisms of ALS [5,16,17]. Utilizing the LD information, we narrowed the 27 genes to 25 genes.
A conditional analysis is used to determine whether the significant TWAS gene has shown up owing to an independent association or a number of correlated single associations [18]. Through FUSION conditional analyses, which determine the independences of resulting genes, we demonstrate that several of the genome-wide significant signals from the ALS GWAS are driven by genetically regulated expression. Based on the TWAS-based gene set enrichment analysis (TWAS-GSEA), we confirmed the functional enrichment pathway of ALS. In addition, we performed drug repositioning analysis using significant TWAS genes using the connectivity map (CMap) software [19]. Finding effective treatment for ALS remains one of the biggest issues to solve. ALS is one of the most rapid neurodegenerative diseases, but so far, other than Riluzole, no drugs that slow disease progression have been reported, and numerous drugs have shown poor results in randomized controlled trials [20]. By means of the drug repositioning analysis using CMap, we detected 15 bioactive small molecules as potential drug candidates for ALS. This study is aimed at the comprehensive and in-depth characterization of the associations between cis-genetic components and ALS through comprehensive analysis in a multitude of relevant tissues using TWAS.

2. Results

2.1. Workflow of this Study

To study the genetic mechanisms of ALS, a TWAS was performed using FUSION software. Figure 1 shows the workflow of our TWAS analysis, where ALS GWAS summary statistics, reference brain and blood tissue panels, and an LD matrix have been used as the input for FUSION software (http://gusevlab.org/projects/fusion/; accessed on 15 August 2020). The FUSION returns TWAS-identified transcriptome-wide significant genes responsible for ALS risk. After the analysis, conditional analyses were conducted on the significant TWAS genes to determine whether they were actually detected as independently significant associations. TWAS-GSEA was performed using significant TWAS genes to investigate the biological effects derived from the TWAS results and to explore biological pathways involved in the pathogenesis of ALS. Drug repositioning analysis was conducted in CMap using significant TWAS genes to discover drug candidates for ALS.

2.2. Identification of Significant TWAS Genes Associated with ALS Risk

To identify risk genes contributing to the pathogenesis of ALS, we performed a TWAS using FUSION, with GWAS summary statistics of 20,806 ALS cases and 59,804 controls from Europe, 19 SNP-GE weight brain and blood panels from the GTEx consortium, and an LD matrix estimated from the 1000 Genomes Project. As ALS affects nerve cells mainly in the brain and other related organs, the 19 SNP-GE weight panels associated with brain tissue were selected for this study. All of the TWAS genes are listed in Supplementary Table S1. Applying a threshold of FDR < 0.05, 55 genes have been identified from 19 panels, of which 27 genes were uniquely identified (Figure 2 shows a Manhattan plot of TWAS associations between gene expression and ALS. Supplementary Table S2 lists 55 significant genes and their information including p-value and FDRs). Among the 55 TWAS genes, 35 were estimated to be up-regulated in ALS (Z-score > 0), whereas 20 to be down-regulated (Z-score < 0) (Supplementary Table S2). Twenty out of the 27 uniquely identified genes have been reported as ALS-associated genes in previous studies [5,21,22,23,24]. In particular, the top three of those 27 genes, i.e., C9orf72-SMCR8 complex subunit (C9orf72) (p-value = 1.65 × 10−28 and FDR = 4.03 × 10−23), sec1 family domain containing 1 (SCFD1) (p-value = 6.65 × 10−8 and FDR = 5.07 × 10−4), and ataxin 3 (ATXN3) (p-value = 3.14 × 10−6 and FDR = 6.73 × 10−3), are well-known ALS-associated genes [5,16,17]. Table 1 lists the remaining 7 TWAS genes, which are, to the best of our knowledge, novel risk genes. The most significant among them is radial spoke head 10 homolog B (RSPH10B) (Z-score = −4.68, p = 2.90 × 10−6 and FDR = 6.43 × 10−3) from the pituitary gland. Among the brain tissues panels, the most significant novel gene found was NADH:ubiquinone oxidoreductase subunit C2 (NDUFC2) (Z-score = −4.24, p-value = 2.18 × 10−5, and FDR = 3.04 × 10−2). Among the blood tissue panels, hepatoma-derived growth factor, related protein 3 (HDGFRP3) (Z-score = −4.23, p-value = 2.39 × 10−5, and FDR = 3.24 × 10−2) was the most significant novel gene found.
Next, we examined gene-tissue association from the 27 significant TWAS genes to assess the expression pattern of each gene in 19 SNP-GE weight panels (Figure 3 shows a heatmap representing the predicted expression patterns of TWAS significant associations). Of the 27 TWAS genes, 10 genes were associated with multiple tissue panels, while the other genes were detected from only one panel. Among the 10 genes, C9orf72 was detected in 12 panels, the largest number of tissue panels among all genes. The expression levels of C9orf72 were consistently estimated to be up-regulated in ALS patients in 12 panels. While most genes had consistent expression patterns across tissue panels, only SCFD1 was predicted to be up-regulated by five SNP-GE panels associated with brain tissues and to be down-regulated by four SNP-GE panels associated with blood tissues in ALS patients. Consistent with our results, a previous study for ALS showed that the regression slope of cis-eQTLs of SCFD1 was positive in brain tissues and negative in whole-blood tissues [23]. In terms of tissue panels, the SNP-GE weight panel associated with the largest number of genes, i.e., 11 TWAS associations, was the blood panel of YFS. Even though there can be interference in the tissue enrichment of TWAS associations because of different SNP-GE weight panels having different numbers of genetic features, we could suggest that this result is consistent with previous studies given that various metabolic biomarkers implicated in ALS pathogenesis have been detected in the blood [25,26]. To summarize, we identified 27 TWAS genes, including seven novel genes, as susceptibility genes contributing to the etiology of ALS and reported reliable gene-tissue associations in ALS.

2.3. Conditional Analysis Supports the TWAS Genes of ALS

TWAS reports genes with significant causal relationships based on GWAS summary statistics. A conditional analysis using LD was performed to detect independent loci. The 27 TWAS genes were grouped into 22 different discrete loci by their physical position with a +/− 500 kb window (Supplementary Table S3 and S4). The result demonstrates that 25 of 27 significant TWAS genes were identified as independent TWAS signals at 22 loci, conditional on their predicted expression (conditional p < 0.05). Myosin XIX (MYO19) and gametogenetin-binding protein 2 (GGNBP2) among the 27 TWAS genes, located at the same loci, were tested for the independence of significance. When conditioning on the predicted expression of MYO19 in the blood panel, the GWAS signals significantly (lead SNP PGWAS = 6.21 × 10−6 conditioned on MYO19 lead SNP PGWAS = 0.00369; Figure 4A) dropped. This result indicated that the expression of MYO19 may explain most GWAS signals at the locus, implying that MYO19 is a jointly significant gene (conditional p-value in blood of YFS = 8.6 × 10−6) but GGNBP2 is not (conditional p-value in blood of YFS = 0.42). NDUFC2, one of the novel risk genes from brain tissue, was also identified as an independently significant gene, not being dependent on effects from GWAS SNPs, because the majority of GWAS signals were substantially altered after conditioning on the expression of NDUFC2 in the brain cortex lead SNP PGWAS = 7.85 × 10−6, conditioned on NDUFC2 lead SNP PGWAS = 1.61 × 10−5; Figure 4B).
Furthermore, we compared 22 discrete loci containing 25 independent TWAS associations with GWAS loci reported by the GWAS catalog. Eight of the 22 loci were mapped within +/− 500 kb of the GWAS significant loci implicated in ALS Table 2. As expected, the abovementioned C9orf73, SCFD1, and ATXN3 genes were simultaneously identified to be implicated in ALS at TWAS and GWAS significant loci. The majority of TWAS genes overlap the reported genes located at the mapped GWAS loci, implying that our TWAS results are reliable. Nevertheless, there are several TWAS genes that do not correspond to GWAS-reported genes at the overlapped loci, such as ubiquitin-specific peptidase 37 (USP37), serine protease 3 (PRSS3), β-1,4-N-acetyl-galactosaminyltransferase 1 (B4GALNT1), and MYO19. One of the TWAS novel genes, USP37, was detected at significant GWAS loci and the other novel genes were not detected within +/− 500 kb of the GWAS loci. These results indicate that TWAS responsibly discovered risk genes for ALS that could not be detected by GWAS. Collectively, we identified that the majority of our TWAS genes were independently significant risk genes for ALS.

2.4. Functional Annotation of TWAS Signals

In order to explore the biological effects arising from the TWAS-identified genes of ALS, we conducted a TWAS-GSEA using TWAS signals and reference gene sets. TWAS-GSEA can analyze functional clusters of TWAS signals, minimizing the loss of signals from intermediating genes. TWAS signals were grouped by 19 different tissues and their functional categories were tested using reference gene sets, such as Hallmark and gene ontology (GO) gene sets. Four biological gene sets were significantly associated with TWAS signals in four tissue panels, i.e., hippocampus, hypothalamus, nucleus accumbens/basal ganglia, and whole-blood (FDR < 0.05) Table 3. Three gene sets, i.e., negative regulation of binding, perikaryon, and Golgi vesicle-mediated transport, were from GO, and a single gene set, KRAS signaling up, was from Hallmark. Four significantly enriched pathways contained a single TWAS significant gene (Supplementary Table S5).
Negative regulation of binding was enriched in hippocampus tissue, which is consistent with previous reports of the correlation between ALS and cytoplasmic aggregation of RNA-binding protein [27]. Golgi vesicle-mediated transport was significantly enriched with TWAS genes in whole-blood tissue, which may indicate that the Golgi apparatus plays a crucial role in the pathogenesis of ALS [28,29,30]. SCFD1, one of the significant TWAS genes, is also involved in the endoplasmic reticulum (ER)-to-Golgi transport and was reportedly mutated in a small proportion of ALS patients [31]. Notably, up-regulation of the KRAS signaling pathway was enriched in two tissues, specifically, the hypothalamus and nucleus accumbens/basal ganglia, owing to the TWAS signal. In accordance with the importance of the KRAS pathway in several neuropathies, our result showed that dysregulation of the KRAS pathway might be also involved in the pathogenesis of ALS based on genetic features [32]. In addition, we identified that TWAS signals from the nucleus accumbens/basal ganglia were enriched in the perikaryon pathway. Neurofilament accumulation in the perikaryon was previously reported as one of the most common and characteristic pathological features of ALS [33,34]. We suggest that these four biological processes together might have a genetic contribution to the pathogenesis of ALS.

2.5. Identification of Drug Candidates for ALS

To discover drug candidates that can target the 27 TWAS genes, we conducted a drug repurposing analysis upon the TWAS results using the CMap software (Supplementary Table S6). CMap performs connectivity analysis of gene expression activation between bioactive small molecules by comparing the order of gene expression with the reference database (build 02). Table 4 lists 15 bioactive small molecules that were detected as potential drug candidates for ALS (p < 0.01). The bioactive molecule with the highest enrichment score is MK-886 (enrichment score = 0.966 and p-value = 2.01 × 10−3), which is known as an inhibitor of 5-lipoxygenase (5-LOX) and could thus reduce the development of atherosclerosis [35]. Klegeris et al. reported that 5-LOX inhibitors including MK-886 were identified as effective neuroprotective agents by suppressing toxic actions derived from reactive microglia that are associated with age-related degenerative diseases including ALS [35]. Trichostatin A (enrichment score = 0.386 and p-value = 0.00), the most statistically significant drug candidate, has shown protective effects in cell types involved in ALS through increased histone acetylation [36,37]. Yoo et al. suggested that trichostatin A may have a potential therapeutic effect to slow disease progression and to enhance motor performance in ALS patients [36].
The majority of other potential drug candidates that are identified in our analysis were reported to be either directly or indirectly associated with ALS elsewhere. STOCK1N-35696 (enrichment score = 0.965 and p-value = 2.07 × 10−3) was previously suggested as a potential therapeutic agent for Alzheimer’s disease [38]. Androsterone (enrichment score = 0.813 and p-value = 2.37 × 10−3) is a neurosteroid that could have effects on brain function, acting as a positive allosteric modulator of the γ-aminobutyric acid type A (GABAA) receptor [39,40]. Picrotoxinin (enrichment score = 0.801 and p-value = 3 × 10−3) is a component of picrotoxin that has been used as a stimulant of the central nervous system [41]. Deptropine is known as a classical histamine H1 receptor antagonist and histamine targets motor neurons, glial cells, and skeletal muscles, which all express histamine receptors, in ALS [42,43]. Volonté et al. reported that histaminergic modulation might be effective in ALS because histamine-related genes were dysregulated in the cortex and spinal cord in sporadic ALS patients [43]. Folic acid (enrichment score = 0.757 and p-value = 6.60 × 10−3) with vitamin B12 was reported to delay disease onset and prolong lifespans in the ALS mouse model [44]. Adenosine phosphate (enrichment score = 0.734 and p-value = 9.83 × 10−3) is involved in adenosine homeostasis that may reportedly play important roles in ALS progression [45]. Tetracycline (enrichment score = 0.673 and p-value = 9.75 × 10−3) is reportedly one of the antibiotics that could act as neuroprotective molecules in neurological diseases such as Huntington’s disease, Parkinson’s disease, stroke, and multiple sclerosis [46]. A previous study mentioned that phentolamine (enrichment score = 0.611 and p-value = 4.59 × 10−3) was effective to reduce blood pressure in a patient with ALS [47]. Collectively, we successfully detected 15 potential drug candidates associated with 27 TWAS genes, which may contribute to the development of new therapeutic options for ALS.

3. Discussion

ALS is a disease that has been studied from various angles by several researchers. However, its exact biological mechanism remains unclear; consequently, there have been few definitive treatments for the disease. Although many genes, SNPs, and loci associated with ALS have been reported by GWAS studies, few studies have reported the functional significance of genetic components from a transcriptomic perspective. To overcome the limitation of GWAS studies, we performed a TWAS integrating the GWAS summary statistics of more than 80,000 ALS cohorts, 19 SNP-GE weights from brain and blood tissues, and the LD matrix. Since blood biomarkers have been steadily identified to explain various pathological mechanisms of ALS, we used SNP-GE weights of not only brain tissue but also blood tissue in order to provide new insights into the pathogenesis of ALS, compared with the previous ALS TWAS study [21]. Using the TWAS results, we conducted a functional enrichment analysis of TWAS signals and a drug repositioning analysis targeting significant TWAS genes.
Twenty-seven significant TWAS genes were identified as susceptibility genes for ALS in 19 different SNP-GE weight panels related to the brain. Of the 27 significant TWAS genes, seven were novel risk genes identified by this study. Several novel genes can be biologically interpreted to be associated with ALS. RSPH10B, the most statistically significant TWAS gene, was reported to play a crucial role in ciliary and flagellar axonemes and to be associated with Kartagener syndrome [48]. Several genes responsible for Kartagener syndrome highlighted the importance of dynein motors to ciliary motility in a previous study [49]. Defects in cytoplasmic dynein-mediated retrograde axonal transport were reportedly implicated in the etiology of ALS [49]. Based on this information, we believe that RSPH10B may actually affect the pathogenesis of ALS and have a shared genetic contribution to Kartagener syndrome.
NDUFC2, the most significant novel gene from brain tissue, encodes a subunit of the mitochondrial membrane respiratory chain NADH dehydrogenase that is associated with mitochondrial diseases including Parkinson’s disease [50]. A previous study reported that the expression level of NDUFC2 may contribute to the occurrence of ischemic stroke [51]. Another study on the aggregation of neurodegenerative disease proteins in cerebral ischemia revealed a previously unrecognized molecular overlap between neurodegenerative diseases such as ALS and ischemic stroke [51,52]. USP37 encodes an enzyme that breaks down ubiquitin-specific processing proteins in the body [53]. Inhibition of USP37 was reported to facilitate the proteasomal clearance of neurotoxic proteins [54]. Acting in a similar biological mechanism to USP37, USP7 was suggested to be involved in the etiology of ALS associated with proteotoxicity [55]. In light of these facts, we suggest that our TWAS successfully identified biologically interpretable novel genes implicated in the pathogenesis of ALS.
Based on the conditional analysis, 25 out of 27 TWAS significant genes were identified as independent genes. We confirmed that the seven novel TWAS genes were located at different loci and were independently significant genes. Then, 25 genes were grouped into 22 discrete loci in which previously reported GWAS genes were implicated. Eight of the 22 loci were detected to simultaneously have TWAS- and GWAS-significant signals. NDUFC2, a novel risk gene not located at the eight loci, may not be detected by GWAS because of the small gene size and other large protein-coding genes nearby. However, TWAS identified NDUFC2 as an independently significant gene, showing the power of TWAS to amplify weak signals associated with the disease.
To investigate the biological effects derived from the ALS TWAS genes at the transcriptome level, we carried out a functional enrichment analysis using TWAS-GSEA based on the TWAS results. The TWAS signals were significantly enriched in 4 gene sets that were substantially consistent with those found in previous studies. Although the TWAS novel genes were not enriched in the four biological pathways, junction plakoglobin (JUP), C9orf72, and SCFD1, which were previously reported as ALS-related genes, were included in the pathways (Supplementary Table S5).
We also performed an in silico drug repurposing analysis using CMap to discover perturbations that can reverse the expression profiles of significant TWAS genes. CMap is a valuable approach to identify new indications for existing drug products and to potentially provide opportunities to develop new therapeutic options [56,57,58]. Fifteen bioactive molecules were detected as potential drug candidates for ALS using CMap, of which most molecules were directly or indirectly associated with ALS. Trichostatin A, the most statistically significant bioactive molecule associated with ALS, was reported to inhibit histone deacetylase 6 of which overexpression disrupted the localization of p58 mediating binding of Golgi elements to microtubules [59]. It may be consistent with the TWAS-GSEA results in terms of Golgi-related etiology for ALS. Even though the potential drug candidates were substantially implicated in ALS, functional studies may be necessary to validate their actual effects under physiological conditions because the results were derived from only in silico analyses.
While the overall results from TWAS can be invaluable sources to understand the pathogenesis of ALS, there are several limitations to overcome. TWAS is a powerful approach to find risk loci affecting GE and to detect trait-associated genes; however, identifying the associations between genes and the trait does not explain the causality of disease. Thus, the genetic effects from dysregulation of significant TWAS genes need to be validated using further in vitro or in vivo studies. As the SNP-GE weight panels were used to predict the association of each eQTL and the disease in TWAS, the results of TWAS may depend on the number of features in the SNP-GE panels. Brain substantia nigra (n = 1604) and brain amygdala (n = 1837) panels have fewer features compared to the other tissue panels, which have more than 2000 features. In fact, no significant TWAS signals were identified in the brain amygdala tissue among 19 SNP-GE weight panels. Despite these limitations, we believe that our TWAS successfully identified biologically interpretable risk genes for ALS by contemplating the GE from the brain-related tissue panels. Our study can provide new biological insights into the pathogenesis of ALS and may contribute to the advancement of therapeutic options for ALS.

4. Materials and Methods

4.1. GWAS Summary Statistics of ALS

ALS GWAS summary statistics data was retrieved from the GWAS catalog (www.ebi.ac.uk, accessed on 15 August 2020; accession number: GCST005647). Details on participant ascertainment and quality control were described by Aude et al. [6]. The ALS GWAS used in this work includes only the European population (n = 20,806 cases and 59,804 controls). To use the GWAS summary statistics for TWAS analysis, the LD score (LDSC, v1.0.1) software was used to generate a sumstat-formatted file estimating SNP heritability and genetic covariance [15]. The generated file can correct the inflation values caused by GWAS, so the values predicted will be more accurate than in the original format.

4.2. Transcriptome-Wide Association Study

A TWAS was performed using FUSION software to conduct tissue-specific gene expression imputation based on the ALS GWAS summary statistics (http://gusevlab.org/projects/fusion/; accessed on 15 August 2020). TWAS incorporates ALS GWAS summary statistics into cis-eQTL information representing the relationship between SNPs and GE in specific tissues and accounts for LD to identify candidate genes associated with traits. The SNP-GE weights containing cis-eQTL information, which indicates the correlation between GE and SNP, were retrieved from the TWAS FUSION website. Four SNP-GE weights of blood tissue were obtained from four cohort datasets: Netherlands Twins Register (NTR), Young Finns Study (YFS), metabolic syndrome in men study (METSIM), and Genotype-Tissue Expression project v7 (GTEx v7, whole blood) [60,61,62,63]. Fifteen SNP-GE weights related to brain tissue were obtained from the Genotype-Tissue Expression project v7 (GTEx v7) and Common Mind Consortium (CMC). All SNP-GE weights are composed of genes being cis-regulated by SNPs within +/− 500kb of the transcription start site and having significant heritability measured using GCTA/REML with a p-value less than 0.01. An LD reference data for European populations from the 1000 Genomes project was used to contemplate the LD region [64]. FUSION calculates gene expression using a linear mixed model such as polygenic risk scores, LASSO, elastic net, and Bayesian sparse linear mixed model (BSLMM) [14,65]. It performs 5-fold cross-validation on all models to determine the best model and displays the optimized results. An FDR-corrected p-value < 0.05 was considered as a threshold for significant TWAS associations.

4.3. Conditional Analysis Using TWAS Results

To confirm whether the multiple association features are independent signals, a conditional analysis was conducted on the genomic regions where the identified significant TWAS genes are located, using FUSION software with the LD matrix [15]. The joint association p-value for each significant TWAS gene was calculated after conditioning on the expression of the gene. This analysis can be used to investigate whether the loci have truly independent associations when significant TWAS genes were identified at the loci using SNP-GE weights. Conditional analysis can also identify how much GWAS signal remains after the effects from the TWAS independent signal are removed.

4.4. Functional Enrichment Analysis

TWAS-GSEA was performed using TWAS results and SNP-GE weights to identify the enrichment of specific biological pathways in ALS [10]. All genes obtained from each tissue panel in TWAS were ranked using Z-scores and were used as pre-ranked gene sets, in order to account for even the smallest potential that may be missed by focusing only on significant genes. Gene sets of Hallmark and GO from molecular signatures database (MsigDB v7.1, http://software.broadinstitute.org/gsea/msigdb; accessed on 15 August 2020) were used as reference biological gene sets [66]. TWAS-GSEA utilizes a linear mixed model from the lme4qtl R package to test for an association between Z-scores and gene set membership while adjusting for non-biological effects and accounting for correlation between genes [67].

4.5. Drug Repositioning Analysis Using CMap

To discover drug candidates for ALS, drug repositioning analysis was conducted using CMap upon the TWAS genes statistically significantly associated with ALS. The formats of the TWAS genes were converted into the probe identifiers of Affymetrix Human genome U133a using the hgu133a.db Bioconductor annotation R package (“hgu133a2.db”). The TWAS genes were grouped as up- or down-regulated genes based on their Z-scores and the corresponding probe IDs of up- and down-regulated genes were respectively provided as down- and up-regulated genes to the CMap software. CMap (build 02) provides data on expression profiles of more than 7000 drug signatures representing 1309 bioactive compounds. The enrichment scores between CMap bioactive molecules and the genes provided as input were calculated based on the Kolmogorov–Smirnov statistical method. The range of enrichment score was represented between +1 and −1, indicating positive or negative enrichment between the bioactive molecules and the input genes. As the TWAS genes with opposite signs of their Z-scores were provided as the input of CMap, small molecules with a p-value < 0.01 and a positive enrichment score were selected as drug candidates for ALS.

Supplementary Materials

The following are available online at https://www.mdpi.com/1422-0067/22/6/3216/s1, Table S1: TWAS summary statistics, Table S2: TWAS significant genes (FDR < 0.05), Table S3: Divide the loci of TWAS novel genes based on +/−500 kb, Table S4: Conditional analysis of multi-gene loci, Table S5: TWAS-GSEA result (FDR < 0.05) and enrichment gene set, Table S6: CMap permuted result.

Author Contributions

Conceptualization, S.P. and D.K.; methodology, S.P. and D.K.; software, S.P., D.K. and J.S.; validation S.P.; formal analysis, S.P. and D.K.; investigation, S.P. and D.K.; data curation S.P.; writing-original draft preparation, S.P.; writing-review and editing, J.W.J.J.; visualization, S.P.; supervision, J.W.J.J.; project administration, J.W.J.J.; All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the MSIT(Ministry of Science and ICT), Korea, under the ITRC(Information Technology Research Center) support program(IITP-2021-2020-0-01789) supervised by the IITP(Institute for Information & Communications Technology Planning & Evaluation); by the Basics Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT & Future Planning (No. 2017R1C1B5017497) and by the Ministry of Trade, Industry and Energy(MOTIE) and Korea Institute for Advancement of Technology(KIAT) through the National Innovation Cluster R&D program(Development of the Platform as a Service for Skin Health by Using the Customized Cosmetics: P0015296).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data sharing not applicable.

Acknowledgments

This work was a part of the fulfillment of the requirements for Sungmin Park’s M.S degree.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

ALSAmyotrophic lateral sclerosis
GWASGenome-wide association studies
LDLinkage disequilibrium
CMapConnectivity map
eQTLExpression quantitative trait loci
TWASTranscriptome-wide association study
GEGene expression
C9orf72C9orf72-SMCR8 complex subunit
SCFD1Sec1 family domain containing 1
RSPH10BRadial spoke head 10 homolog B
NDUFC2NADH:ubiquinone oxidoreductase subunit C2
HDGFRP3Hepatoma-derived growth factor, related protein 3
MYO19Myosin XIX
GGNBP2Gametogenetin-binding protein 2
USP37Ubiquitin-specific peptidase 37
PRSS3Serine Protease 3
EREndoplasmic reticulum
5-LOX5-lipoxygenase
GABAAγ-aminobutyric acid type A
JUPJunction plakoglobin
MsigDBMolecular signatures database

References

  1. Kiernan, M.C.; Vucic, S.; Cheah, B.C.; Turner, M.R.; Eisen, A.; Hardiman, O.; Burrell, J.R.; Zoing, M.C. Amyotrophic lateral sclerosis. Lancet 2011, 377, 942–955. [Google Scholar] [CrossRef] [Green Version]
  2. Logroscino, G.; Traynor, B.J.; Hardiman, O.; Chio, A.; Mitchell, D.; Swingler, R.J.; Millul, A.; Benn, E.; Beghi, E. Incidence of amyotrophic lateral sclerosis in Europe. J. Neurol. Neurosurg. Psychiatry 2010, 81, 385–390. [Google Scholar] [CrossRef]
  3. Solcà, M.; Ronchi, R.; Bello-Ruiz, J.; Schmidlin, T.; Herbelin, B.; Luthi, F.; Konzelmann, M.; Beaulieu, J.Y.; Delaquaize, F.; Schnider, A.; et al. The multistep hypothesis of ALS revisited: The role of genetic mutations. Neurology 2018, 91, e635–e642. [Google Scholar] [CrossRef] [Green Version]
  4. Kapeli, K.; Martinez, F.J.; Yeo, G.W. Genetic mutations in RNA-binding proteins and their roles in ALS. Hum. Genet. 2017, 136, 1193–1214. [Google Scholar] [CrossRef] [Green Version]
  5. Benyamin, B.; He, J.; Zhao, Q.Y.; Gratten, J.; Garton, F.; Leo, P.J.; Liu, Z.J.; Mangelsdorf, M.; Al-Chalabi, A.; Anderson, L.; et al. Cross-ethnic meta-analysis identifies association of the GPX3-TNIP1 locus with amyotrophic lateral sclerosis. Nat. Commun. 2017, 8. [Google Scholar] [CrossRef]
  6. Nicolas, A.; Kenna, K.P.; Renton, A.E.; Ticozzi, N.; Faghri, F.; Chia, R.; Dominov, J.A.; Kenna, B.J.; Nalls, M.A.; Keagle, P.; et al. Genome-wide Analyses Identify KIF5A as a Novel ALS Gene. Neuron 2018, 97, 1268–12683. [Google Scholar] [CrossRef] [Green Version]
  7. Van Rheenen, W.; Shatunov, A.; Dekker, A.M.; McLaughlin, R.L.; Diekstra, F.P.; Pulit, S.L.; Van der Spek, R.A.A.; Vosa, U.; de Jong, S.; Robinson, M.R.; et al. Genome-wide association analyses identify new risk variants and the genetic architecture of amyotrophic lateral sclerosis. Nat. Genet. 2016, 48, 1043–1048. [Google Scholar] [CrossRef] [Green Version]
  8. Hormozdiari, F.; Zhu, A.; Kichaev, G.; Ju, C.J.T.; Segre, A.V.; Joo, J.W.J.; Won, H.J.; Sankararaman, S.; Pasaniuc, B.; Shifman, S.; et al. Widespread allelic heterogeneity in complex traits. Am. J. Hum. Genet. 2017, 100, 789–802. [Google Scholar] [CrossRef] [Green Version]
  9. Westra, H.J.; Peters, M.J.; Esko, T.; Yaghootkar, H.; Schurmann, C.; Kettunen, J.; Christiansen, M.W.; Fairfax, B.P.; Schramm, K.; Powell, J.E.; et al. Systematic identification of trans eQTLs as putative drivers of known disease associations. Nat. Genet. 2013, 45, 1238–1243. [Google Scholar] [CrossRef] [Green Version]
  10. Pain, O.; Pocklington, A.J.; Holmans, P.A.; Bray, N.J.; O’Brien, H.E.; Hall, L.S.; Pardinas, A.F.; O’Donovan, M.C.; Owen, M.J.; Anney, R. Novel Insight into the etiology of autism spectrum disorder gained by integrating expression data with genome-wide association statistics. Biol. Psychiat. 2019, 86, 265–273. [Google Scholar] [CrossRef] [Green Version]
  11. Hall, L.S.; Medway, C.W.; Pain, O.; Pardinas, A.F.; Rees, E.G.; Escott-Price, V.; Pocklington, A.; Bray, N.J.; Holmans, P.A.; Walters, J.T.R.; et al. A transcriptome-wide association study implicates specific pre- and post-synaptic abnormalities in schizophrenia. Hum. Mol. Genet. 2020, 29, 159–167. [Google Scholar] [CrossRef] [Green Version]
  12. Gusev, A.; Ko, A.; Shi, H.; Bhatia, G.; Chung, W.; Penninx, B.W.J.H.; Jansen, R.; de Geus, E.J.C.; Boomsma, D.I.; Wright, F.A.; et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat. Genet. 2016, 48, 245–252. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Zhu, Z.H.; Zhang, F.T.; Hu, H.; Bakshi, A.; Robinson, M.R.; Powell, J.E.; Montgomery, G.W.; Goddard, M.E.; Wray, N.R.; Visscher, P.M.; et al. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat. Genet. 2016, 48, 481–487. [Google Scholar] [CrossRef] [PubMed]
  14. Gamazon, E.R.; Wheeler, H.E.; Shah, K.P.; Mozaffari, S.V.; Aquino-Michaels, K.; Carroll, R.J.; Eyler, A.E.; Denny, J.C.; Nicolae, D.L.; Cox, N.J.; et al. A gene-based association method for mapping traits using reference transcriptome data. Nat. Genet. 2015, 47, 1091. [Google Scholar] [CrossRef] [Green Version]
  15. Bulik-Sullivan, B.K.; Loh, P.R.; Finucane, H.K.; Ripke, S.; Yang, J.; Patterson, N.; Daly, M.J.; Price, A.L.; Neale, B.M.; Grp, S.W. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 2015, 47, 291–295. [Google Scholar] [CrossRef] [Green Version]
  16. Majounie, E.; Renton, A.E.; Mok, K.; Dopper, E.G.P.; Waite, A.; Rollinson, S.; Chio, A.; Restagno, G.; Nicolaou, N.; Simon-Sanchez, J.; et al. Frequency of the C9orf72 hexanucleotide repeat expansion in patients with amyotrophic lateral sclerosis and frontotemporal dementia: A cross-sectional study. Lancet Neurol. 2012, 11, 323–330. [Google Scholar] [CrossRef]
  17. Volk, A.E.; Weishaupt, J.H.; Andersen, P.M.; Ludolph, A.C.; Kubisch, C. Current knowledge and recent insights into the genetic basis of amyotrophic lateral sclerosis. Med. Genet. Berlin 2018, 30, 252–258. [Google Scholar] [CrossRef] [Green Version]
  18. Yang, J.; Ferreira, T.; Morris, A.P.; Medland, S.E.; Madden, P.A.F.; Heath, A.C.; Martin, N.G.; Montgomery, G.W.; Weedon, M.N.; Loos, R.J.; et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat. Genet. 2012, 44, 369–375. [Google Scholar] [CrossRef]
  19. Lamb, J.; Crawford, E.D.; Peck, D.; Modell, J.W.; Blat, I.C.; Wrobel, M.J.; Lerner, J.; Brunet, J.P.; Subramanian, A.; Ross, K.N.; et al. The connectivity map: Using gene-expression signatures to connect small molecules, genes, and disease. Science 2006, 313, 1929–1935. [Google Scholar] [CrossRef] [Green Version]
  20. Mitsumoto, H.; Brooks, B.R.; Silani, V. Clinical trials in amyotrophic lateral sclerosis: Why so many negative trials and how can trials be improved? Lancet Neurol. 2014, 13, 1127–1138. [Google Scholar] [CrossRef]
  21. Xiao, L.; Yuan, Z.; Jin, S.; Wang, T.; Huang, S.; Zeng, P. Multiple-Tissue Integrative Transcriptome-Wide Association Studies Discovered New Genes Associated with Amyotrophic Lateral Sclerosis. Front. Genet. 2020, 11, 1440. [Google Scholar] [CrossRef] [PubMed]
  22. Freibaum, B.D.; Chitta, R.K.; High, A.A.; Taylor, J.P. Global Analysis of TDP-43 Interacting Proteins Reveals Strong Association with RNA Splicing and Translation Machinery. J. Proteome Res. 2010, 9, 1104–1120. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Iacoangeli, A.; Lin, T.; Al Khleifat, A.; Jones, A.R.; Opie-Martin, S.; Coleman, J.R.I.; Shatunov, A.; Sproviero, W.; Williams, K.L.; Garton, F.; et al. Genome-wide meta-analysis finds the ACSL5-ZDHHC6 Locus is associated with als and links weight loss to the disease genetics. Cell Rep. 2020, 33, 108323. [Google Scholar] [CrossRef]
  24. Dodge, J.C. Lipid Involvement in neurodegenerative diseases of the motor system: Insights from lysosomal storage diseases. Front. Mol. Neurosci. 2017, 10, 356. [Google Scholar] [CrossRef] [Green Version]
  25. Saris, C.G.J.; Horvath, S.; van Vught, P.W.J.; van Es, M.A.; Blauw, H.M.; Fuller, T.F.; Langfelder, P.; DeYoung, J.; Wokke, J.H.J.; Veldink, J.H.; et al. Weighted gene co-expression network analysis of the peripheral blood from Amyotrophic Lateral Sclerosis patients. BMC Genom. 2009, 10, 1–16. [Google Scholar] [CrossRef] [Green Version]
  26. Calvo, A.C.; Pradat, P.F.; Mendonca, D.M.F.; Manzano, R. Decoding Amyotrophic Lateral Sclerosis: Discovery of Novel Disease-Related Biomarkers and Future Perspectives in Neurodegeneration. BioMed Res. Int. 2014. [Google Scholar] [CrossRef]
  27. Hawley, Z.C.E.; Campos-Melo, D.; Strong, M.J. Novel miR-b2122 regulates several ALS-related RNA-binding proteins. Mol. Brain 2017, 10, 1–14. [Google Scholar] [CrossRef] [PubMed]
  28. Haase, G.; Rabouille, C. Golgi Fragmentation in ALS motor neurons. new mechanisms targeting microtubules, tethers, and transport vesicles. Front. Neurosci. 2015, 9, 448. [Google Scholar] [CrossRef]
  29. Sundaramoorthy, V.; Sultana, J.M.; Atkin, J.D. Golgi fragmentation in amyotrophic lateral sclerosis, an overview of possible triggers and consequences. Front. Neurosci. 2015, 9, 400. [Google Scholar] [CrossRef] [Green Version]
  30. Burk, K.; Pasterkamp, R.J. Disrupted neuronal trafficking in amyotrophic lateral sclerosis. Acta Neuropathol. 2019, 137, 859–877. [Google Scholar] [CrossRef] [Green Version]
  31. Ragagnin, A.M.G.; Shadfar, S.; Vidal, M.; Jamali, M.S.; Atkin, J.D. Motor Neuron Susceptibility in ALS/FTD. Front. Neurosci. 2019, 13, 532. [Google Scholar] [CrossRef] [Green Version]
  32. Simanshu, D.K.; Nissley, D.V.; McCormick, F. RAS proteins and their regulators in human disease. Cell 2017, 170, 17–33. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Sun, X.J.; Song, J.Y.; Huang, H.L.; Chen, H.; Qian, K. Modeling hallmark pathology using motor neurons derived from the family and sporadic amyotrophic lateral sclerosis patient-specific iPS cells. Stem Cell Res. Ther. 2018, 9, 1–7. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Xiao, S.X.; McLean, J.; Robertson, J. Neuronal intermediate filaments and ALS: A new look at an old question. BBA Mol. Basis Dis. 2006, 1762, 1001–1012. [Google Scholar] [CrossRef] [Green Version]
  35. Klegeris, A.; McGeer, P.L. Cyclooxygenase and 5-lipoxygenase inhibitors protect against mononuclear phagocyte neurotoxicity. Neurobiol. Aging 2002, 23, 787–794. [Google Scholar] [CrossRef]
  36. Yoo, Y.E.; Ko, C.P. Treatment with trichostatin A initiated after disease onset delays disease progression and increases survival in a mouse model of amyotrophic lateral sclerosis. Exp. Neurol. 2011, 231, 147–159. [Google Scholar] [CrossRef]
  37. Sanna, S.; Esposito, S.; Masala, A.; Sini, P.; Nieddu, G.; Galioto, M.; Fais, M.; Iaccarino, C.; Cestra, G.; Crosio, C. HDAC1 inhibition ameliorates TDP-43-induced cell death in vitro and in vivo. Cell Death Dis. 2020, 11, 1–14. [Google Scholar] [CrossRef]
  38. Rahman, M.R.; Islam, T.; Turanli, B.; Zaman, T.; Faruquee, H.M.; Rahman, M.M.; Mollah, M.N.H.; Nanda, R.K.; Arga, K.Y.; Gov, E.; et al. Network-based approach to identify molecular signatures and therapeutic agents in Alzheimer’s disease. Comput. Biol. Chem. 2019, 78, 431–439. [Google Scholar] [CrossRef]
  39. Li, P.; Bracamontes, J.; Katona, B.W.; Covey, D.F.; Steinbach, J.H.; Akk, G. Natural and enantiomeric ctiocholanolone interact with distinct sites on the rat alpha 1 beta 2 gamma 2L GABA(A) receptor. Mol. Pharmacol. 2007, 71, 1582–1590. [Google Scholar] [CrossRef] [Green Version]
  40. Zolkowska, D.; Dhir, A.; Krishnan, K.; Covey, D.F.; Rogawski, M.A. Anticonvulsant potencies of the enantiomers of the neurosteroids androsterone and etiocholanolone exceed those of the natural forms. Psychopharmacology 2014, 231, 3325–3332. [Google Scholar] [CrossRef] [Green Version]
  41. Carpenter, T.S.; Lau, E.Y.; Lightstone, F.C. Identification of a possible secondary picrotoxin-binding site on the GABA(A) Receptor. Chem. Res. Toxicol. 2013, 26, 1444–1454. [Google Scholar] [CrossRef]
  42. Qu, C.B.; Li, Y.; Li, Y.L.; Yu, P.F.; Li, P.F.; Donkers, J.M.; van de Graaf, S.F.J.; de Man, R.A.; Peppelenbosch, M.P.; Pan, Q.W. FDA-drug screening identifies deptropine inhibiting hepatitis E virus involving the NF-kappa B-RIPK1-caspase axis. Antivir. Res. 2019, 170, 104588. [Google Scholar] [CrossRef]
  43. Volonte, C.; Apolloni, S.; Sabatelli, M. Histamine beyond its effects on allergy: Potential therapeutic benefits for the treatment of amyotrophic lateral sclerosis (ALS). Pharmacol. Therapeut. 2019, 202, 120–131. [Google Scholar] [CrossRef]
  44. Wang, M.L.; Liu, Z.; Sun, W.N.; Yuan, Y.C.; Jiao, B.; Zhang, X.W.; Shen, L.; Jiang, H.; Xia, K.; Tang, B.S.; et al. Association between vitamins and amyotrophic lateral sclerosis: A center-based survey in mainland China. Front. Neurol. 2020, 11. [Google Scholar] [CrossRef]
  45. Cieslak, M.; Roszek, K.; Wujak, M. Purinergic implication in amyotrophic lateral sclerosisfrom pathological mechanisms to therapeutic perspectives. Purinerg. Signal. 2019, 15, 1–15. [Google Scholar] [CrossRef] [Green Version]
  46. Orsucci, D.; Mancuso, M.; Filosto, M.; Siciliano, G. Tetracyclines and neuromuscular disorders. Curr. Neuropharmacol. 2012, 10, 134–138. [Google Scholar] [CrossRef] [Green Version]
  47. Lee, H.L.; Lee, J.K. Amyotrophic lateral sclerosis with an acute hypertensive crises. Ann. Rehabil. Med. 2012, 36, 418–422. [Google Scholar] [CrossRef] [Green Version]
  48. Kott, E.; Legendre, M.; Copin, B.; Papon, J.F.; Dastot-Le Moal, F.; Montantin, G.; Duquesnoy, P.; Piterboth, W.; Amram, D.; Bassinet, L.; et al. Loss-of-function mutations in RSPH1 cause primary ciliary dyskinesia with central-complex and radial-spoke defects. Am. J. Hum. Genet. 2013, 93, 561–570. [Google Scholar] [CrossRef] [Green Version]
  49. Yamashita, S.; Migita, A.; Hayashi, K.; Hirahara, T.; Kimura, E.; Maeda, Y.; Hirano, T.; Uchino, M. Amyotrophic lateral sclerosis in a patient with Kartagener syndrome. Amyotroph. Lateral Scler. 2010, 11, 402–404. [Google Scholar] [CrossRef]
  50. Stroud, D.A.; Surgenor, E.E.; Formosa, L.E.; Reljic, B.; Frazier, A.E.; Dibley, M.G.; Osellame, L.D.; Stait, T.; Beilharz, T.H.; Thorburn, D.R.; et al. Accessory subunits are integral for assembly and function of human mitochondrial complex I. Nature 2016, 538, 123–126. [Google Scholar] [CrossRef] [Green Version]
  51. Tumer, M.R.; Goldacre, R.; Talbot, K.; Goldacre, M.J. Cerebrovascular injury as a risk factor for amyotrophic lateral sclerosis. J. Neurol. Neurosur. Psychiatry 2016, 87, 244–246. [Google Scholar] [CrossRef] [Green Version]
  52. Rubattu, S.; Di Castro, S.; Schulz, H.; Geurts, A.M.; Cotugno, M.; Bianchi, F.; Maatz, H.; Hummel, O.; Falak, S.; Stanzione, R.; et al. Ndufc2 Gene Inhibition Is Associated with Mitochondrial Dysfunction and Increased Stroke Susceptibility in an Animal Model of Complex Human Disease. J. Am. Heart Assoc. 2016, 5, e002701. [Google Scholar] [CrossRef] [Green Version]
  53. Puente, X.S.; Sanchez, L.M.; Overall, C.M.; Lopez-Otin, C. Human and mouse proteases: A comparative genomic approach. Nat. Rev. Genet. 2003, 4, 544–558. [Google Scholar] [CrossRef]
  54. Boland, B.; Yu, W.H.; Corti, O.; Mollereau, B.; Henriques, A.; Bezard, E.; Pastores, G.M.; Rubinsztein, D.C.; Nixon, R.A.; Duchen, M.R.; et al. Promoting the clearance of neurotoxic proteins in neurodegenerative disorders of ageing. Nat. Rev. Drug Discov. 2018, 17, 660–688. [Google Scholar] [CrossRef]
  55. Zhang, T.; Periz, G.; Lu, Y.N.; Wang, J.O. USP7 regulates ALS-associated proteotoxicity and quality control through the NEDD4L-SMAD pathway. Proc. Natl. Acad. Sci. USA 2020, 117, 28114–28125. [Google Scholar] [CrossRef]
  56. Kunkel, S.D.; Suneja, M.; Ebert, S.M.; Bongers, K.S.; Fox, D.K.; Malmberg, S.E.; Alipour, F.; Shields, R.K.; Adams, C.M. mRNA expression signatures of human skeletal muscle atrophy identify a natural compound that increases muscle mass. Cell Metab. 2011, 13, 627–638. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  57. Huang, L.; Zhao, S.P.; Frasor, J.M.; Dai, Y. An Integrated Bioinformatics Approach Identifies Elevated Cyclin E2 Expression and E2F activity as distinct features of tamoxifen resistant breast tumors. PLoS ONE 2011, 6, e0154756. [Google Scholar] [CrossRef] [PubMed]
  58. Siu, F.M.; Ma, D.L.; Cheung, Y.W.; Lok, C.N.; Yan, K.; Yang, Z.Q.; Yang, M.S.; Xu, S.X.; Ko, B.C.B.; He, Q.Y.; et al. Proteomic and transcriptomic study on the action of a cytotoxic saponin (Polyphyllin D): Induction of endoplasmic reticulum stress and mitochondria-mediated apoptotic pathways. Proteomics 2008, 8, 3105–3117. [Google Scholar] [CrossRef] [PubMed]
  59. Haggarty, S.J.; Koeller, K.M.; Wong, J.C.; Grozinger, C.M.; Schreiber, S.L. Domain-selective small-molecule inhibitor of histone deacetylase 6 (HDAC6)-mediated tubulin deacetylation. Proc. Natl. Acad. Sci. USA 2003, 100, 4389–4394. [Google Scholar] [CrossRef] [Green Version]
  60. Nuotio, J.; Oikonen, M.; Magnussen, C.G.; Jokinen, E.; Laitinen, T.; Hutri-Kahonen, N.; Kahonen, M.; Lehtimaki, T.; Taittonen, L.; Tossavainen, P.; et al. Cardiovascular risk factors in 2011 and secular trends since 2007: The cardiovascular risk in young finns study. Scand. J. Public Health 2014, 42, 563–571. [Google Scholar] [CrossRef] [PubMed]
  61. Raitakari, O.T.; Juonala, M.; Ronnemaa, T.; Keltikangas-Jarvinen, L.; Rasanen, L.; Pietikainen, M.; Hutri-Kahonen, N.; Taittonen, L.; Jokinen, E.; Marniemi, J.; et al. Cohort profile: The cardiovascular risk in Young Finns Study. Int. J. Epidemiol. 2008, 37, 1220–1226. [Google Scholar] [CrossRef] [Green Version]
  62. Wright, F.A.; Sullivan, P.F.; Brooks, A.I.; Zou, F.; Sun, W.; Xia, K.; Madar, V.; Jansen, R.; Chung, W.; Zhou, Y.H.; et al. Heritability and genomics of gene expression in peripheral blood. Nat. Genet. 2014, 46, 430–437. [Google Scholar] [CrossRef] [PubMed]
  63. GTEx Consortium. Genetic effects on gene expression across human tissues. Nature 2017, 550, 204–213. [Google Scholar] [CrossRef] [PubMed]
  64. Genomes Project, C.; Auton, A.; Brooks, L.D.; Durbin, R.M.; Garrison, E.P.; Kang, H.M.; Korbel, J.O.; Marchini, J.L.; McCarthy, S.; McVean, G.A.; et al. A global reference for human genetic variation. Nature 2015, 526, 68–74. [Google Scholar] [CrossRef] [Green Version]
  65. Zhou, X.; Carbonetto, P.; Stephens, M. Polygenic Modeling with Bayesian Sparse Linear Mixed Models. PLoS Genet. 2013, 9, e1003264. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  66. Liberzon, A.; Birger, C.; Thorvaldsdottir, H.; Ghandi, M.; Mesirov, J.P.; Tamayo, P. The molecular signatures database hallmark gene set collection. Cell Syst. 2015, 1, 417–425. [Google Scholar] [CrossRef] [Green Version]
  67. Ziyatdinov, A.; Vazquez-Santiago, M.; Brunel, H.; Martinez-Perez, A.; Aschard, H.; Soria, J.M. Ime4qtl: Linear mixed models with flexible covariance structure for genetic studies of related individuals. BMC Bioinform. 2018, 19, 1–5. [Google Scholar] [CrossRef]
Figure 1. Analytical workflow.
Figure 1. Analytical workflow.
Ijms 22 03216 g001
Figure 2. Manhattan plot of TWAS associations between gene expression and ALS. Each dot represents a TWAS signal. The x-axis corresponds to the chromosomal position of the genes and the y-axis corresponds to the Z-score value representing the predicted expression level of each gene in ALS. Black dashed lines indicate the transcriptome-wide significance threshold (FDR < 0.05). The names of statistically significant genes (FDR < 0.05) are labeled on the corresponding dots. Red and blue dots indicate novel risk genes and previously reported genes for ALS, respectively.
Figure 2. Manhattan plot of TWAS associations between gene expression and ALS. Each dot represents a TWAS signal. The x-axis corresponds to the chromosomal position of the genes and the y-axis corresponds to the Z-score value representing the predicted expression level of each gene in ALS. Black dashed lines indicate the transcriptome-wide significance threshold (FDR < 0.05). The names of statistically significant genes (FDR < 0.05) are labeled on the corresponding dots. Red and blue dots indicate novel risk genes and previously reported genes for ALS, respectively.
Ijms 22 03216 g002
Figure 3. Heatmap showing the predicted expression patterns of TWAS significant associations. The x-axis represents the 27 significant TWAS genes and the y-axis indicates 19 SNP-GE weights. The Z-score values are labeled on the plot and are represented by a color bar on the right.
Figure 3. Heatmap showing the predicted expression patterns of TWAS significant associations. The x-axis represents the 27 significant TWAS genes and the y-axis indicates 19 SNP-GE weights. The Z-score values are labeled on the plot and are represented by a color bar on the right.
Ijms 22 03216 g003
Figure 4. Regional association plots showing the jointly significant TWAS genes. (A) A regional association plot at chromosome 17 from blood tissue of YFS. (B) A regional association plot at chromosome 11 from the brain cortex of GTEx. Color bars on the top panel show all of the protein-coding genes and/or TWAS genes. Jointly significant genes are highlighted using green bars while marginally significant genes are highlighted using blue bars. A scatter plot at the bottom shows a Manhattan plot of the GWAS data before (gray) and after (blue) conditioning on the independently significant genes.
Figure 4. Regional association plots showing the jointly significant TWAS genes. (A) A regional association plot at chromosome 17 from blood tissue of YFS. (B) A regional association plot at chromosome 11 from the brain cortex of GTEx. Color bars on the top panel show all of the protein-coding genes and/or TWAS genes. Jointly significant genes are highlighted using green bars while marginally significant genes are highlighted using blue bars. A scatter plot at the bottom shows a Manhattan plot of the GWAS data before (gray) and after (blue) conditioning on the independently significant genes.
Ijms 22 03216 g004
Table 1. Significant TWAS novel risk genes for ALS (FDR < 0.05). GTEx, Genotype-Tissue Expression project v7; YFS, Young Finns Study; NTR, Netherlands Twin Registry.
Table 1. Significant TWAS novel risk genes for ALS (FDR < 0.05). GTEx, Genotype-Tissue Expression project v7; YFS, Young Finns Study; NTR, Netherlands Twin Registry.
GeneTissue PanelChrBest GWAS rsIDTWAS Z-ScoreTWAS p-ValueFDR
RSPH10BGTEx—Pituitary7rs709930−4.682.90 × 10−66.43 × 10−3
NDUFC2GTEx—Brain Cortex11rs665278−4.252.18 × 10−53.04 × 10−2
HDGFRP3YFS—Blood15rs13329195−4.262.39 × 10−53.24 × 10−2
USP37NTR—Blood2rs9339944.202.74 × 10−53.54 × 10−2
NDST2YFS—Blood10rs122561034.133.57 × 10−54.22 × 10−2
LCE1CGTEx—Brain Cerebellar Hemisphere1rs3126091−4.113.93 × 10−54.50 × 10−2
TRBC2NTR—Blood7rs1964986−4.094.28 × 10−54.83 × 10−2
Table 2. Independent TWAS genes mapped to GWAS loci reported by GWAS catalog.
Table 2. Independent TWAS genes mapped to GWAS loci reported by GWAS catalog.
Gene(s).ChrGene
Start Position
Gene
Stop Position
GWAS Reported GeneIndex SNPGWAS Loci
Position
USP372219,000,000219,000,000STK36, TTLL4, ZNF142rs2303565218,680,586
GPX3, TNIP15150,000,000150,000,000TNIP1rs1046331115,103,274
C9orf72927,500,00027,600,000IFNK, MOBKL2B, C9orf72rs3849943, rs384994227,543,384
PRSS3933,800,00033,800,000Intergenicrs487962832,888,522
B4GALNT11258,000,00058,000,000KIF5Ars11324797657,581,917
ATXN3, TRIP11,
RP11-529H20.6
1492,500,00092,500,000ATXN3rs1014331092,074,037
SCFD11431,100,00031,200,000SCFD1rs1013915430,678,292
MYO191734,851,47734,900,737TMEM132Ers73054734,785,087
Table 3. Biological processes significantly enriched with TWAS signals of ALS (FDR < 0.05). N mem avail denotes the number of TWAS signals involved in the reference gene set used in this analysis; N mem denotes the total number of genes in the reference gene set.
Table 3. Biological processes significantly enriched with TWAS signals of ALS (FDR < 0.05). N mem avail denotes the number of TWAS signals involved in the reference gene set used in this analysis; N mem denotes the total number of genes in the reference gene set.
PanelGeneSetN Mem AvailN MemFDR
Brain HippocampusGO: Negative regulation of binding141643.15 × 10−2
Brain HypothalamusHallmark: KRAS signaling up132001.30 × 10−2
Brain Nucleus accumbens/basal gangliaHallmark: KRAS signaling up142002.40 × 10−2
GO: Perikaryon111273.60 × 10−2
Whole-BloodGO: Post Golgi vesicle-mediated transport111043.05 × 10−2
Table 4. Bioactive molecules discovered by CMap as potential drug candidates for ALS (p < 0.01).
Table 4. Bioactive molecules discovered by CMap as potential drug candidates for ALS (p < 0.01).
CMap NamePubChem NamePubChem CIDEnrichment Scorep-Value
MK-886MK 8863651370.9662.01 × 10−3
STOCK1N-35696--0.9652.07 × 10−3
PF-00539758-00--0.9491.20 × 10−4
16,16-Dimethylprostaglandin E216,16-Dimethyl-Pge252830660.8932.38 × 10−3
BufexamacBufexamac24660.8271.35 × 10−3
AndrosteroneAndrosterone58790.8132.37 × 10−3
PicrotoxininPicrotoxinin4422920.8013 × 10−3
SulfinpyrazoneSulfinpyrazone53420.7953.46 × 10−3
DeptropineDeptropine2039110.775.43 × 10−3
Folic acidFolic acid1353986580.7576.60 × 10−3
MebendazoleMebendazole40300.7412.68 × 10−3
Adenosine phosphateAdenosine 5’-Monophosphate60830.7349.83 × 10−3
TetracyclineTetracycline546757760.6739.75 × 10−3
PhentolaminePhentolamine57750.6114.59 × 10−3
Trichostatin ATrichostatin A4447320.3860.00
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Park, S.; Kim, D.; Song, J.; Joo, J.W.J. An Integrative Transcriptome-Wide Analysis of Amyotrophic Lateral Sclerosis for the Identification of Potential Genetic Markers and Drug Candidates. Int. J. Mol. Sci. 2021, 22, 3216. https://doi.org/10.3390/ijms22063216

AMA Style

Park S, Kim D, Song J, Joo JWJ. An Integrative Transcriptome-Wide Analysis of Amyotrophic Lateral Sclerosis for the Identification of Potential Genetic Markers and Drug Candidates. International Journal of Molecular Sciences. 2021; 22(6):3216. https://doi.org/10.3390/ijms22063216

Chicago/Turabian Style

Park, Sungmin, Daeun Kim, Jaeseung Song, and Jong Wha J. Joo. 2021. "An Integrative Transcriptome-Wide Analysis of Amyotrophic Lateral Sclerosis for the Identification of Potential Genetic Markers and Drug Candidates" International Journal of Molecular Sciences 22, no. 6: 3216. https://doi.org/10.3390/ijms22063216

APA Style

Park, S., Kim, D., Song, J., & Joo, J. W. J. (2021). An Integrative Transcriptome-Wide Analysis of Amyotrophic Lateral Sclerosis for the Identification of Potential Genetic Markers and Drug Candidates. International Journal of Molecular Sciences, 22(6), 3216. https://doi.org/10.3390/ijms22063216

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop