Next Article in Journal
Greatwall-Endosulfine: A Molecular Switch that Regulates PP2A/B55 Protein Phosphatase Activity in Dividing and Quiescent Cells
Next Article in Special Issue
Knowledge Generation with Rule Induction in Cancer Omics
Previous Article in Journal
In Vitro Generation of Oocytes from Ovarian Stem Cells (OSCs): In Search of Major Evidence
Previous Article in Special Issue
RankerGUI: A Computational Framework to Compare Differential Gene Expression Profiles Using Rank Based Statistics
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Integrated Pan-Cancer Analysis and Structure-Based Virtual Screening of GPR15

1
State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, and Joint Laboratory of International Cooperation in Metabolic and Developmental Sciences, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200240, China
2
Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200030, China
3
Wuxi School of Medicine, Jiangnan University, Wuxi 214122, China
4
Peng Cheng Laboratory, Vanke Cloud City Phase I Building 8, Xili Street, Nanshan District, Shenzhen 518055, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Int. J. Mol. Sci. 2019, 20(24), 6226; https://doi.org/10.3390/ijms20246226
Submission received: 21 July 2019 / Revised: 19 November 2019 / Accepted: 4 December 2019 / Published: 10 December 2019
(This article belongs to the Special Issue Data Analysis and Integration in Cancer Research)

Abstract

:
G protein-coupled receptor 15 (GPR15, also known as BOB) is an extensively studied orphan G protein-coupled receptors (GPCRs) involving human immunodeficiency virus (HIV) infection, colonic inflammation, and smoking-related diseases. Recently, GPR15 was deorphanized and its corresponding natural ligand demonstrated an ability to inhibit cancer cell growth. However, no study reported the potential role of GPR15 in a pan-cancer manner. Using large-scale publicly available data from the Cancer Genome Atlas (TCGA) and the Genotype-Tissue Expression (GTEx) databases, we found that GPR15 expression is significantly lower in colon adenocarcinoma (COAD) and rectal adenocarcinoma (READ) than in normal tissues. Among 33 cancer types, GPR15 expression was significantly positively correlated with the prognoses of COAD, neck squamous carcinoma (HNSC), and lung adenocarcinoma (LUAD) and significantly negatively correlated with stomach adenocarcinoma (STAD). This study also revealed that commonly upregulated gene sets in the high GPR15 expression group (stratified via median) of COAD, HNSC, LUAD, and STAD are enriched in immune systems, indicating that GPR15 might be considered as a potential target for cancer immunotherapy. Furthermore, we modelled the 3D structure of GPR15 and conducted structure-based virtual screening. The top eight hit compounds were screened and then subjected to molecular dynamics (MD) simulation for stability analysis. Our study provides novel insights into the role of GPR15 in a pan-cancer manner and discovered a potential hit compound for GPR15 antagonists.

1. Introduction

G protein-coupled receptors (GPCRs), also known as seven-transmembrane domain receptors, constitute the largest family of cell signaling receptors [1]. GPCRs respond to a wide range of extracellular signals and regulate various cellular and physiological processes, including hormone regulation, vision, immune responses, neuronal communication, and behavior [2]. Overwhelming evidences have demonstrated that GPCRs and their downstream signaling targets play critical roles in cancer initiation and progression by regulating signal transduction and cellular processes (including cell proliferation, apoptosis, stress signals, immune escape, invasion, angiogenesis and metastasis, ion and nutrient transport and migration) [3,4]. It has also been demonstrated that diverse GPCRs were overexpressed in a variety of tumors [5]. Both orphan and well-characterized GPCRs have been reported to be involved in cancer development [6,7,8,9,10], which provide opportunities for the development of new strategies of cancer prevention and treatment. At present, GPCR-targeted drugs for cancer treatment are still few. The limited concrete knowledge about the role of GPCRs in cancers might be the cause of this lack of GPCR-targeted drugs as a treatment for cancer.
G protein-coupled receptor 15 (GPR15, also known as BOB) is an extensively studied orphan GPCR [11]. It is a chemokine co-receptor of human immunodeficiency virus type 1 and 2 [12] and a meditator of homing control in the large intestine and skin [8]. Dozens of studies have demonstrated the significant association between GPR15 and the immune system. For example, GPR15 was found to be expressed in memory B cells, plasmablasts, and regulatory T cell subsets [13,14]. It directs T cell homing to the developing epidermis as well as to the colon and regulates colitis [8,13,15,16]. When GPR15 controls the homing of FOXP3+ regulatory T cells (Tregs) to the large intestine lamina propria, it alleviates colonic inflammation [8]. Mounting evidences have suggested that inflammation may help tumor cells to evade the defense from the immune system [17]. The altered expression level and epigenetic regulation of GPR15 could also have a significant influence in the health status of smokers [18,19,20]. Moreover, recent studies reported that GPR15 was deorphanized and its ligand can also bind to SUSD2. The co-expression pattern of GPR15L and SUSD2 can suppress proliferation of several tumoral cell lines via G1 arrest [21,22,23]. This finding indicated that GPR15 may be actively involved in cancer progression. Therefore, it is necessary to characterize the role of GPR15 in carcinogenesis.
In this study, we firstly performed pan-cancer analysis to elucidate the potential role of GPR15 in cancers. Earlier studies using similar methods have been published to provide new insights of specific genes in carcinogenesis [24,25,26,27,28]. The expression levels of GPR15 were evaluated in 33 different cancers using the data from the Cancer Genome Atlas (TCGA) and the Genotype-Tissue Expression (GTEx) databases. The function of GPR15 was predicted by integrated network analysis. Our study identified a number of common genes that are in the GPR15 regulatory network in four cancers. We provide evidence that GPR15 acts as an immunomodulator and can be considered as a novel target for immunotherapy for the four cancers. We also predicted the 3D structure of human GPR15 and applied structure-based virtual screening (SBVS) approaches [29,30,31,32,33,34,35,36,37] to discover potential antagonists that bind to the predicted active site. These results help us to understand the role of GPR15 in carcinogenesis and its future prospective for STAD drug development.

2. Results

2.1. Pan-Cancer Mutational and Expression Landscape of GPR15

Among all of the 33 cancer types from the TCGA database, cancers with significant differential GPR15 expression (on the basis of difference of median expression between cancer samples and paired normal samples) are COAD (downregulated, p = 3.06 × 10−12) and READ (downregulated, p = 6.80 × 10−4) (Figure 1C, Figure S1). Also, COAD showed significantly lower expression in tumor tissue compared to healthy tissues from the Genotype-Tissue Expression (GTEx) project. The expression landscape of GPR15 in TCGA cohorts is shown in Figure 1B.
GPR15 showed a low mutation rate compared with hotpots oncogenes among all TCGA cohorts (Figure S2). It is most frequently mutated in uterine corpus endometrial carcinoma (UCEC), uterine carcinosarcoma (UCS), lung squamous carcinoma (LUSC), rectal adenocarcinoma (READ), and colon adenocarcinoma (COAD) (Figure 1A). We performed somatic mutations analysis on these five cancers. The mutational distribution and protein domains for GPR15 with labelled hotspots are shown in Figure S3. Most mutations in GPR15 are missense mutations while the minority mutational pattern is heterogenous, and the variant classification varies from frameshift deletion (COAD), frameshift insertion (LUSC), and nonsense mutation (LUSC, READ) to missense mutation (Figure 2). Moreover, it is worth noting that GPR15 in COAD is both hypermutated and significantly downregulated compared to that in normal tissues. This pattern implies that alterations in GPR15-meditor T-cell homing [8] may have undiscovered effects on the pathophysiology of COAD.

2.2. Integrated Network Analysis of GPR15

To obtain more functional insights for GPR15, we performed integrative network analysis on GPR15 [38] as shown in Figure 3. The network was built upon co-expression, physical interaction, genetic interaction, shared protein domains, and pathway data, where we found that the most related protein is YWHAB, and the linkage is supported by direct physical interactions. YWHAB encodes the protein 14-3-3 protein beta/alpha, which plays a role in mitogenic signaling and cell cycle machinery [39]. Integrated network analysis revealed that, apart from immunity control, GPR15 may have effects on cell growth, thereby affecting carcinogenesis. The top five GPR15-related genes with the highest scores are shown in Table 1.
Pathway analysis was conducted on the top 50 genes in the network to illustrate their biological function by the Reactome platform. We found Butyrate Response Factor 1 (BRF1) binding and tristetraprolin (TTP, ZFP36) binding as the two most significant pathways. Both pathways involve YWHAB, which further implies the close interaction between GPR15 and YWHAB. The top five most related pathways are shown in Table 2 and Table S1.

2.3. Pan-Cancer Analysis of GPR15 Expression and Prognostic Association

To evaluate GPR15 expression and prognosis in a pan-cancer manner, we used the pre-train multiple variate Cox regression model, which combined specific gene expression value and basic clinical data provided by OncoLnc [46] to identify the TCGA cohorts of which the prognosis is significant with the GPR15 expression value. We found that the prognoses of the four cancer types, COAD, HNSC, LUAD, and stomach adenocarcinoma (STAD), are possibly (p < 0.15) associated with GPR15 expression (Table 3, Table S2). In addition, based on Cox coefficients, the hazards of COAD, HNSC, and LUAD were found to be negatively associated with GPR15 expression, whereas the expression of GPR15 was positively correlated with the hazard of STAD.
Then, we stratified the patients in each cohort based on the expression median into high- and low-expression groups. Afterward, we built Kaplan Meier (KM) plots for the GPR15 group in COAD, HNSC, LUAD, and STAD separately. We found that the prognoses of COAD (p = 0.014), HNSC (p = 0.0058), LUAD (p = 0.0033), and STAD (p = 0.0092) were significantly correlated with the GPR15 expression groups (Figure 4B–E).
GPR15 can reduce the inflammation level in the large intestine by controlling T-cell homing [8]. We thus hypothesized that the high expression of GPR15 in COAD can contribute to the homing and infiltration of FOXP3+ regulatory T cells (Tregs), which in turn boost the immunity response of the tumor and results in a better prognosis. This pattern was also observed in HNSC and LUAD, which suggests that GPR15 may perform similar immunity control functions in head, neck, and lung tissues. However, the observed upregulation of GPR15 in STAD implies poorer prognosis, which may suggest the opposite effects of GPR15 on stomach tissue.

2.4. Commonly Upregulated Gene Set in High GPR15 Groups of COAD, HNSC, LUAD, and STAD

To dissect the effects of the expression of GPR15 in a genome-wide manner, we performed differential gene expression (DEG) analysis [47,48] on the GPR15 low-expression group compared to the GPR15 high-expression group in the four cancer types. We found that 357, 487, 346, and 333 genes were differentially expressed in COAD, HNSC, LUAD, and STAD, respectively (Supplementary Material). The profiles of the top 200 differential expressed genes in COAD, HNSC, LUAD, and STAD are shown in Figure S4. Interestingly, we found that 146 genes were commonly upregulated (Figure 4A). These genes are defined as a commonly upregulated gene set (CUPGS). This considerable number of CUPGS implies a shared regulatory mechanism of GPR15 in COAD, HNSC, LUAD, and STAD.
GO enrichment analysis of the CUPGS was conducted and the results are shown in Figure S5A–C. Intuitively, we found that these genes were significantly enriched in the functional category of antigen binding (p = 2.41 × 10−159), cellular component of immunoglobulin complex (p = 1.32 × 10−50), and the biological process of various immunological response processes. The results of KEGG pathway analysis are shown in Figure S5D–F, which also showed that the CUPGS enriched the categories of the B-cell receptor immunology pathway, intestinal immune network for immunoglobulin A (IgA) production, and primary immunology. The GO enrichment results are listed in Table 4. The gene-concept network for CUPGS is depicted in Figure 5. Surprisingly, besides the established function of T-cell homing of GPR15, these CUPGS were also significantly associated with B-cell meditated immunity, and these genes were upregulated in the high GPR15 expression groups, which suggests that GPR15 exerts a broader immunological impact.

2.5. Association between GPR15 Expression Levels and the Immune Cell-Infiltrating Levels in Cancer

Together, the results of the enrichment analysis revealed that the regulatory role of GPR15 in the four cancers is strongly correlated to immunity function. To support these findings, we investigated the association between GPR15 expression levels and immune cell infiltration levels in the tumor microenvironment using TIMER (Figure 6, Table S3). We found that the GPR15 expression value was significantly negatively correlated with tumor purity in all four types of cancer. As for T cells, for three types of cancer, excluding COAD, the GPR15 expression value was significantly positively correlated with CD8+ T cell infiltration. In COAD, HNSC, and STAD, the GPR15 expression value was significantly positively correlated with CD4+ T cell infiltration. Even with similar immune profiles, the prognosis of STAD is the opposite from the other three types of cancer, which implies different underlying mechanisms.

2.6. 3D Structure Modeling of GPR15

Based on all the aforementioned analyses, we hypothesized that GPR15 could be a novel target of cancer immunotherapy. Recent studies have shown that the known natural ligands of GPR15 are all agonists [21,22]. We thus put more of an emphasis on drug discovery specifically for STAD to identify potential inhibitors of GPR15. There is no crystal structure available for GPR15. Thus, we performed homology modeling for the 3D structure of human GPR15. Template-based modeling is the most common approach to explore the relationships between the three-dimensional coordinates of unknown proteins and their homologs. The GPR15 sequence was searched against the PDB-BLAST for similar template selection, and type-1 angiotensin II receptor (PDB:4YAY) was selected, with a sequence identity of 32.64% and query coverage of 30 to 317 aa (Figure S6). A total of 10 models were generated and further validated by the SAVE server. The best predicted model structures were further refined by calculation of the probability density function (pdf) and discrete optimized potential energy (DOPE). The 3D model had a DOPE score of –15,495.15, which was the lowest against the predicted other models. Also, the Ramachandran plot showed 90.9% of the residues in the allowed region that depicted the stability of the predicted model. The results of the homology modelling of GPR15 are shown in Figure 7.

2.7. Structure–Function Relationship-Based Binding Site Prediction

The structure–function relationship of GPR15 is helpful in drug design. We identified its structure–function relationship using the Cofactor server [49]. We found that TRP89, SER109, ARG172, LYS180, CYS183, TRP195, PHE257, and LYS261 residues were located in the active region in GPR15. Active site regions were largely located in the extracellular regions of seven transmembrane domains, where the potential leads can bind and play a crucial role in signal transduction. A schematic representation of the ligand binding site is shown in Figure 8A,B. Also, cross-validation of the predicted residues at the active region was further supported by the results produced in the Site Finder tool of the MOE suite. Amino acid residues within 5 Å of the active were used for the generation of the receptor grid of GPR15 that was used for virtual screening.

2.8. Virtual Screening and Molecular Docking Results

We utilized the virtual screening technique to identify potential antagonists exhibiting an adequate binding affinity. We started with a chemical database consisting of 62,500 small molecules and isolated a set of compounds satisfying the threshold of a high docking score. After the first round of filtration, we obtained 733 compounds via shape-based virtual screening. This shape-based screening approach utilizes the concept of the shape of binding pockets and electrostatic potential resemblance to select new molecules, which may show similar binding modes to the binding pocket. These 733 molecules were subjected to re-docking. Docking of the selected ligands was achieved to obtain the top conformations of the selected 733 molecules into the predicted GPR15 binding site. Finally, based on the docking scores, with the threshold values fixed between −13.00 and −8.00, only the top eight screened compounds ranked by the lowest binding energy were identified as potential antagonists for GPR15. The interactions analysis for the eight hits is given in Table 5 and Figure S7, and their 2D structures are given in Figure S8.

2.9. MD Simulations and Binding Free Energy Analysis

We performed MD simulation of the top eight potential complexes to measure the stability of the protein–ligand complex. RMSD (root-mean-square deviation) profiles of the protein are shown in Figure 9A, which indicates that all systems were stable during the entire simulation run and could be used for further analysis. The RMSD of ligand-heavy atoms was also conducted to predict the stability of the atoms in docked complexes (Figure 9B). Compounds 5−8 exhibited a consistently lower RMSD (<2.1 Å), suggesting that that these compounds formed stable complexes with GPR15. We selected four hits with lower ligand RMSD values for further interaction analysis and explored the ligand binding mode in the protein based on the occupancy of hydrogen throughout the simulation time. Compound 5 showed more than 90% salt bridge interaction with Lys261 (Figure S9) in the MD trajectories. The fluctuation in RMSD was further supported by the MM/PBSA results (Table S4), which showed that compound 5 (C34H47O6N3) had a stronger binding affinity (lowest binding free energy) among the hits with consistently lower RMSD values. Combining all the structural analyses, we identified compound 5 as a promising candidate for GPR15 inhibition.

3. Discussion and Conclusions

GPCRs are well-established crucial participants in various signal transduction pathways and are major targets in drug design. Until now, more than 134 GPCRs as targets for drugs have been approved in the United States or European Union [52]. Although the endogenous ligand is not known, O-GPCRs are still popular targets with specificity in many therapeutic approaches. There is a broad range of indications linked to orphan GPCRs, including cancers, thus O-GPCRs may be utilized as clinical therapeutic targets in cancer therapy [53].
In this study, we demonstrated a novel integrative pan-cancer analysis workflow and conducted a comprehensive analysis from upstream omics to downstream drug discovery of GPR15 in cancer. Our study reported GPR15 expression and mutation levels across all cancers; the correlation between its expression and cancer prognosis; an investigation of genes with similar GPR15 expression patterns in COAD, HNSC, LUAD, and STAD; and 3D structure modeling of GPR15 to virtually screen its antagonists.
Our study provided evidence of the associations between GPR15 expression and cancer immunity. We analyzed CUPGS in COAD, HNSC, LUAD, and STAD to investigate the functions of co-expression genes with similar GPR15 expression patterns. Nearly all CUPGS were enriched in the immune-related function. GPR15 was proven to mediate regulatory T cells (Tregs) to migrate to the large intestine and reduce inflammation in the mouse model [8], but it is preferentially expressed on human effector T cells [13]. Further research supported that GPR15-dependent human CD8+ T cells can migrate into the inflamed gut, and GPR15 can also help dendritic epidermal T cells migrate to the skin [15]. Mounting evidence suggests that GPR15 may play a role in the pathological process of chronic inflammatory diseases [54]. In normal tissue, chronic inflammation is a well-acknowledged risk factor for cancers. It could gradually generate an immunosuppressive tumor microenvironment, which allows mutated cells to evade the surveillance of the immune system, causing cancer eventually [17]. However, in malignant cancer tissue, immune infiltration, which is often depicted as “inflammation”, suggests better prognosis [55]. We found that GPR15 expression is significantly positively associated with the prognoses of COAD, HNSC, and LUAD, and significantly negatively associated with STAD. Based on clinical and transcriptomic analyses, we can hypothesize that GPR15 could influence cancer prognosis through downstream immunological effectors. Moreover, the tumor microenvironment dissection via TIMER also supports possible CD8+ T cell and CD4+ T cell infiltration mediated through GPR15.
A recent study has shown that GPR15 was deorphanized and its known natural ligands are all agonists [21,22]. From another point of view, designing inhibitors for GPR15 could provide some clues for the treatment of STAD and help the functional study of GPR15 at the molecular level for experimental biologists. Therefore, we performed virtual screening for GPR15 antagonists and predicted the protein–ligand interaction of the top eight compounds. MD simulation and free energy calculation conducted on the top eight compounds led to the discovery of the best compound, compund 5 (C34H47O6N3), which could be a hit for novel drugs targeting STAD.
Together, our analysis functionally annotated GPR15 expression in a pan-cancer manner and identified potential inhibitory agents that target GPR15. Our study provided evidence of the associations between GPR15 expression and cancer immunity. Our results provide new clues regarding GPR15′s role in carcinogenesis and new insights into cancer therapy targets. Also, this novel comprehensive omics-based workflow could be utilized for the hypothesis generation of new targets in cancer.

4. Methods

4.1. Pan-Cancer Mutational Data Retrieval

We retrieved the mutation annotation format (MAF) files of the Cancer Genome Atlas (TCGA) [27] cohorts using the R package “TCGAbiolinks” [56] on 15 October 2018. The TCGA program provided us with multiple versions of somatic mutation data sets, which were generated using different workflows, and we selected the data set “MuTect2 Variant Aggregation and Masking” [57] because it encompassed more mutations than the others. Besides, according to a comparison study of mutation callers [58], MuTect2 has the highest recall and robustness. A total number of 33 cancer types and 9914 cancer samples were included in this study. Mutational summary and landscape plots were performed by R package “maftools” [59]. The mutant rate plot of GPR15 was available from the National Cancer Institute GDC Data Portal (https://gdc.cancer.gov).

4.2. Pan-Cancer GPR15 Expression Profile Analysis

We used GEPIA [60] to interactively analyze the expression profile of GPR15 among 33 cancer types, 9736 tumors, and 8587 paired normal samples from the TCGA and the Genotype-Tissue Expression (GTEx) project [61]. We used the limma [48] backend with the threshold of log2 fold-change >1 and q-value <0.05 to detect cancer types exhibiting differential expressed GPR15 compared to matched normal samples. Differential expression of GPR15 was validated from the UALCAN database (http://ualcan.path.uab.edu/index.html). The Spearman’s correlation between the GPR15 expression value and immune infiltration level in 4 types of human cancer (COAD, HNSC, LUAD, and STAD) was calculated via TIMER (Tumor IMmune Estimation Resource) [62], and visualized by its “Gene” module.

4.3. Integrated Network Analysis of GPR15

To obtain more functional insights into GPR15, we used GeneMANIA [38] and Cytoscape [63] to perform integrated network analysis. The integrated network consists of co-expression data from Gene Expression Omnibus [64] (GEO), physical and genetic interaction data from BioGRID [40], predicted protein interaction data based on orthologue using the Interologous Interaction Database [65] (I2D), and pathway molecular interaction data from BioGRID. In an algorithmic perspective, the integrated network analysis can be divided into two parts. Firstly, we used a linear regression-based algorithm to calculate a single composite functional association network from multiple network data sources (co-expression, physical interaction, genetic interaction, shared protein domains, pathway data, and so on). Second, a variation of the Gaussian field label propagation algorithm was utilized to assign a score (the discriminant value) to each node in the network. This score reflects the computed strength of association between gene pairs [66]. We also used the Reactome [67] platform to conduct pathway analysis of the predicted genes.

4.4. Survival Analysis of GPR15 Expression

We used OncoLnc [46] to determine the cancer type o which the prognosis is potentially associated with GPR15 expression. Then, we used the R package “TCGAbiolinks” to retrieve the corresponding clinical and expression data. We stratified patients in each associated cohort into “high” and “low” groups based on the median expression value of GPR15. The Kaplan–Meier method was utilized to estimate the survival function, and we used the log-rank test to evaluate the significance between two groups. Survival analysis and corresponding visualization were performed using the R package “survival” and “survminer” [68], respectively.

4.5. Gene Differential Expression Analysis

We used standalone limma and voom [47] pipeline to identify differentially expressed genes associated with GRP15 expression, comparing tumor samples with high expression of GPR15 to low-expression ones and applied the threshold of log2 fold-change >1 and adjusted p-value <0.01 to select for biological significant genes.

4.6. Commonly Upregulated Gene Set Identification and Annotation

We extracted differential expressed genes (DEGs) that were upregulated and shared among COAD, HNSC, LUAD, and STAD, and visualized these using the R package “UpsetR” [69] and “vennerable” [70]. These genes were collectively defined as the commonly upregulated gene set (CUPGS). We used the R package “clusterProfiler” [71] to perform gene ontology [72] and Kyoto Encyclopedia of Genes and Genomes (KEGG) [73] annotation of the CUPGS.

4.7. 3D Structure Prediction and Validation of GPR15

We used BLASTP [74], which is implemented in NCBI (https://www.ncbi.nlm.nih.gov/), to search and align the best templates for 3D structural modeling of GPR15. We used BLAST-p to align GPR15 with similar PDB structures and protein sequences retrieved from the UniProt database (https://www.uniprot.org/). A template was identified from the NCBI-BLASTp program. Homology modeling was performed using the MODELLER program [75], where 10 models were built through the aligned templates, and Python scripts were executed for loop modeling and model refinement. Model selection was based on the parameters of the optimized loop, side-chain conformations, DOPE, Q-mean, Z-score, and maximum deviation. Structure refinement of the modeled GPR15 was performed using the KoBaMIN [76], a web server, in order to obtain the best conformation of the modeled structures resulting from MODELLER. Validation of the predicted model was performed using the Ramachandran plot generated by the Structure Analysis and Verification Server (SAVES) server (https://servicesn.mbi.ucla.edu/SAVES/).

4.8. Active Site Prediction

The structure–function relationship for the predicted GPR15 model was established by exploring the active-site residues using the Cofactor server [49]. This approach was used to identify the biochemical function of the predicted GPR15 model and the potential binding region for its antagonists. The cross-validation of the predicted binding site was also conducted by the Site Finder tool of the MOE software.

4.9. Screening of Potential Compounds Targeting GPR15

We screened the chemical library using the best homology model of GPR15 as the receptor structure. We performed shape-based virtual screening in the first round using the Surflex-Dock [77] module of the SYBYL software. The receptor was optimized and prepared for virtual screening with hydrogen atoms and charges added. A library of 62,500 compounds obtained from the Maybridge Library (55,975) and in-house compound library (6525) were used for the screening of compounds. The dataset compounds were converted to 3D coordinates and then minimized via the Powell method using 1000 iterations with a Tripose force field. We detected the surface in the predicted active sites and mapped an idealized active site ligand (called a protomol). Then, we applied five maximum conformations per fragment and five maximum poses per ligand with a 0.05 Å minimum (root-mean-square deviation) RMSD to dock in the defined Protomol region. The search area was set to 5 Å in the grid. We restricted the cutoff value (total score < –6) in the docking scoring function to eliminate false positive results.

4.10. Molecular Docking

Docking experiments were performed via Surflex-Dock. Top hits were selected for docking and the receptor grid box was confined around the 5 Å area of the predicted active site radii. The Lamarckian genetic algorithm, a well-known docking algorithm, was used to conduct docking by setting the default parameters with 150 initial populations with randomly placed individuals and the maximum number of generations set to 27,000. A shortlist based on the consensus scoring function (Chem score + G Score + D Score + PMF Score) was generated. We then applied a cutoff (total score <–8) for the docking score function to eliminate false positive results. The lowest free binding energy was set as the criterion for the selection of the top poses.

4.11. Molecular Dynamics (MD) Simulations

We selected the top eight hits based on stability and protein–ligand interaction analyses. The initial structures for MD simulations originated from the representative docking pose form virtual screening. The GROMACS V5.1.3 package [78] was used to perform biophysical simulation of the eight complexes for 100,000 ps (100 ns), respectively. Membrane systems were constructed using the CHARMM-GUI Membrane Builder [79]. Proteins and lipids were presented using the CHARMM36 force field [80], and ligands were assigned to the CHARMM CGenFF [81]. All the systems were solvated in a cubic water box with a distance of 10 Å between the proteins, and the TIP3P model [82] was used for water molecules. Counter ions (0.15 M NaCl) were used to keep each system electrically neutral followed by a steepest descent energy minimization (~5000 steps). Subsequently, the minimized system was equilibrated into the NVT and NPT phases for 500 ps, and all bond lengths were restrained by using the LINCS method [83] with time steps of 2 fs. The temperature was set to 310 K and the pressure was maintained at 1.01325 × 105 Pa (1 air pressure) using the Langevin piston method [84], which were controlled by a Nose-Hoover thermostat and Parrinello-Rahman barostat [85], respectively. The particle mesh Ewald algorithm (PME) [86] was utilized to compel long-range electrostatic interactions, and a 1.4-nm cut off for short-range van der Waals interactions was utilized. Sampling of the MD trajectories was carried out every 2.0 ps. Finally, 100 ns MD simulations for each system were performed for further analysis. Detailed simulations conditions are listed in Table S5.

4.12. MD Trajectories Analysis

The time course of the root-mean-square deviation (RMSD) from the respective initial structures was used to assess the stability of the proteins and ligand in different simulations. Hydrogen bonds were defined as hydrogen–acceptor at a distance less than 3.5 Å and donor–hydrogen–acceptor angle as more than 135°. Salt bridges were defined by oppositely charged atoms that were within 5 Å. All of the analyses were performed using the analysis tools implemented in GROMACS. In total, 100-ns trajectories (10,000 structure) of each MD system were analyzed after eliminating the rotational and translational movements. The trajectory images were visualized and analyzed with PyMol (https://pymol.org/2/) and VMD (http://www.ks.uiuc.edu/Research/vmd/).

4.13. Binding Free Energy Calculations

To estimate the corresponding relative binding affinities, the binding free energy for selected complexes was calculated using the molecular mechanics Poisson–Boltzmann surface area (MM-PBSA) method as implemented in the g_mmpbsa tool [87], which integrates functions from GROMACS and APBS [87]. For a protein–ligand complex, the lower the binding free energy, the higher the binding affinity. The calculation was based on the following equation:
Δ G b i n d = G c o m p l e x G p r o t e i n G l i g a n d Δ G b i n d = Δ E M M + Δ G P B + Δ G n o n p o l a r T Δ S Δ G b i n d = G e l e + G v d w + G S A + G P A ,
where ∆EMM is the sum of the van der Waals and electrostatic energy, ∆GPA is the polar solvation energy, and ∆GSA is the non-polar solvation energy. The final, binding energy, ∆Gbind, was a relative value rather than an absolute value because the vibrational entropy contribution (T∆S) was not included in our calculation. In total, 100 snapshots at an interval of 10 ps from the last 10-ns trajectories during the stable phase were extracted as sampling for the calculations.

Supplementary Materials

Supplementary materials can be found at https://www.mdpi.com/1422-0067/20/24/6226/s1.

Author Contributions

Conceptualization, Y.W., X.W. and A.C.K.; methodology, Y.W., X.W. and A.C.K.; validation, A.C.K.; formal analysis, Y.W. and X.W.; investigation, Y.W. and X.W.; data curation, Y.W., X.W. and A.C.K.; writing-original draft preparation, Y.W., X.W. and A.C.K.; writing-review and editing, D.-Q.W., Y.X., C.-D.L., Q.X. and L.S.; supervision, D.-Q.W.; project administration, D.-Q.W.; funding acquisition, D.-Q.W.

Funding

This work was supported by the funding from National Key Research Program (Contract No.2016YFA0501703), National Natural Science Foundation of China (Grant No. 31601074, 61872094, 61832019), and Shanghai Jiao Tong University School of Medicine (Contract No. YG2017ZD14).

Acknowledgments

We thank all members of the laboratory for useful discussion.

Conflicts of Interest

The authors declare no conflict of interest in this work.

Abbreviations

ACCAdrenocortical carcinoma
BLCABladder urothelial carcinoma
BRCABreast invasive carcinoma
CESCCervical squamous cell Carcinoma and endocervical adenocarcinoma
CHOLCholangiocarcinoma
COADColon adenocarcinoma
DLBCDiffuse large B-cell lymphoma
ESCAEsophageal carcinoma
GBMGlioblastoma multiforme
HNSCHead and neck squamous cell carcinoma
KICHKidney chromophobe
KIRCKidney renal clear cell carcinoma
KIRPKidney renal papillary cell carcinoma
LAMLAcute myeloid leukemia
LGGLower grade glioma
LIHCLiver hepatocellular carcinoma
LUADLung adenocarcinoma
LUSCLung squamous cell carcinoma
MESOMesothelioma
OVOvarian serous cystadenocarcinoma
PADDPancreatic adenocarcinoma
PCPGPheochromocytoma and Paraganglioma
PRADprostate adenocarcinoma
READRectum adenocarcinoma
SARCSarcoma
SKCMSkin cutaneous melanoma
STADStomach adenocarcinoma
TGCTTesticular germ cell tumors
THCAThyroid carcinoma
THYMThymoma
UCECUterine corpus endometrial carcinoma
UCSUterine carcinosarcoma
UVMUveal Melanoma

References

  1. Rosenbaum, D.M.; Rasmussen, S.G.; Kobilka, B.K. The structure and function of G-protein-coupled receptors. Nature 2009, 459, 356–363. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Deng, H.K.; Unutmaz, D.; KewalRamani, V.N.; Littman, D.R. Expression cloning of new receptors used by simian and human immunodeficiency viruses. Nature 1997, 388, 296–300. [Google Scholar] [CrossRef] [PubMed]
  3. Lynch, J.R.; Wang, J.Y. G Protein-Coupled Receptor Signaling in Stem Cells and Cancer. Int. J. Mol. Sci. 2016, 17, 707. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. O’Hayre, M.; Degese, M.S.; Gutkind, J.S. Novel insights into G protein and G protein-coupled receptor signaling in cancer. Curr. Opin. Cell Biol. 2014, 27, 126–135. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Li, S.; Huang, S.; Peng, S.B. Overexpression of G protein-coupled receptors in cancer cells: Involvement in tumor progression. Int. J. Oncol. 2005, 27, 1329–1339. [Google Scholar] [CrossRef] [PubMed]
  6. Gugger, M.; White, R.; Song, S.; Waser, B.; Cescato, R.; Riviere, P.; Reubi, J.C. GPR87 is an overexpressed G-protein coupled receptor in squamous cell carcinoma of the lung. Dis. Markers 2008, 24, 41–50. [Google Scholar] [CrossRef] [Green Version]
  7. Jin, Z.; Luo, R.; Piao, X. GPR56 and its related diseases. Prog. Mol. Biol. Transl. Sci. 2009, 89, 1–13. [Google Scholar]
  8. Kim, S.V.; Xiang, W.V.; Kwak, C.; Yang, Y.; Lin, X.W.; Ota, M.; Sarpel, U.; Rifkin, D.B.; Xu, R.; Littman, D.R. GPR15-mediated homing controls immune homeostasis in the large intestine mucosa. Science 2013, 340, 1456–1459. [Google Scholar] [CrossRef] [Green Version]
  9. Prossnitz, E.R.; Maggiolini, M. Mechanisms of estrogen signaling and gene expression via GPR30. Mol. Cell. Endocrinol. 2009, 308, 32–38. [Google Scholar] [CrossRef] [Green Version]
  10. Bar-Shavit, R.; Maoz, M.; Kancharla, A.; Nag, J.K.; Agranovich, D.; Grisaru-Granovsky, S.; Uziely, B. G Protein-Coupled Receptors in Cancer. Int. J. Mol. Sci. 2016, 17, 1320. [Google Scholar] [CrossRef] [Green Version]
  11. Heiber, M.; Marchese, A.; Nguyen, T.; Heng, H.H.Q.; George, S.R.; O’Dowd, B.F. A novel human gene encoding a G-protein-coupled receptor (GPR15) is located on chromosome 3. Genomics 1996, 32, 462–465. [Google Scholar] [CrossRef] [PubMed]
  12. Blaak, H.; Boers, P.H.M.; Gruters, R.A.; Schuitemaker, H.; Van Der Ende, M.E.; Osterhaus, A. CCR5, GPR15, and CXCR6 are major coreceptors of human immunodeficiency virus type 2 variants isolated from individuals with and without plasma viremia. J. Virol. 2005, 79, 1686–1700. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Nguyen, L.P.; Pan, J.; Dinh, T.T.; Hadeiba, H.; O’Hara, E., III; Ebtikar, A.; Hertweck, A.; Gokmen, M.R.; Lord, G.M.; Jenner, R.G.; et al. Role and species-specific expression of colon T cell homing receptor GPR15 in colitis. Nat. Immunol. 2015, 16, 207–213. [Google Scholar] [CrossRef] [PubMed]
  14. Seong, Y.; Lazarus, N.H.; Sutherland, L.; Habtezion, A.; Abramson, T.; He, X.S.; Greenberg, H.B.; Butcher, E.C. Trafficking receptor signatures define blood plasmablasts responding to tissue-specific immune challenge. JCI Insight 2017, 2, e90233. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Lahl, K.; Sweere, J.; Pan, J.; Butcher, E. Orphan chemoattractant receptor GPR15 mediates dendritic epidermal T-cell recruitment to the skin. Eur. J. Immunol. 2014, 44, 2577–2581. [Google Scholar] [CrossRef] [Green Version]
  16. Habtezion, A.; Nguyen, L.P.; Hadeiba, H.; Butcher, E.C. Leukocyte Trafficking to the Small Intestine and Colon. Gastroenterology 2016, 150, 340–354. [Google Scholar] [CrossRef] [Green Version]
  17. Nakamura, K.; Smyth, M.J. Targeting cancer-related inflammation in the era of immunotherapy. Immunol. Cell Biol. 2017, 95, 325–332. [Google Scholar] [CrossRef]
  18. Haase, T.; Muller, C.; Krause, J.; Rothemeier, C.; Stenzig, J.; Kunze, S.; Waldenberger, M.; Munzel, T.; Pfeiffer, N.; Wild, P.S.; et al. Novel DNA Methylation Sites Influence GPR15 Expression in Relation to Smoking. Biomolecules 2018, 8, 74. [Google Scholar] [CrossRef] [Green Version]
  19. Koks, G.; Uudelepp, M.L.; Limbach, M.; Peterson, P.; Reimann, E.; Koks, S. Smoking-induced expression of the GPR15 gene indicates its potential role in chronic inflammatory pathologies. Am. J. Pathol. 2015, 185, 2898–2906. [Google Scholar] [CrossRef] [Green Version]
  20. Koks, S.; Koks, G. Activation of GPR15 and its involvement in the biological effects of smoking. Exp. Biol. Med. 2017, 242, 1207–1212. [Google Scholar] [CrossRef] [Green Version]
  21. Ocón, B.; Pan, J.; Dinh, T.T.; Chen, W.; Ballet, R.; Bscheider, M.; Habtezion, A.; Tu, H.; Zabel, B.A.; Butcher, E.C. A Mucosal and Cutaneous Chemokine Ligand for the Lymphocyte Chemoattractant Receptor GPR15. Front. Immunol. 2017, 8, 1111. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Pan, W.; Cheng, Y.; Zhang, H.; Liu, B.; Mo, X.; Li, T.; Li, L.; Cheng, X.; Zhang, L.; Ji, J.; et al. CSBF/C10orf99, a novel potential cytokine, inhibits colon cancer cell growth through inducing G1 arrest. Sci. Rep. 2014, 4, 6812. [Google Scholar] [CrossRef] [PubMed]
  23. Suply, T.; Hannedouche, S.; Carte, N.; Li, J.P.; Grosshans, B.; Schaefer, M.; Raad, L.; Beck, V.; Vidal, S.; Hiou-Feige, A.; et al. A natural ligand for the orphan receptor GPR15 modulates lymphocyte recruitment to epithelia. Sci. Signal. 2017, 10, eaal0180. [Google Scholar] [CrossRef] [PubMed]
  24. Mishra, S.; Kaddi, C.D.; Wang, M.D. Pan-cancer analysis for studying cancer stage using protein and gene expression data. In Proceedings of the IEEE Engineering in Medicine and Biology Society, Orlando, FL, USA, 16–20 August 2016; pp. 2440–2443. [Google Scholar]
  25. Morris, L.G.; Riaz, N.; Desrichard, A.; Senbabaoglu, Y.; Hakimi, A.A.; Makarov, V.; Reis-Filho, J.S.; Chan, T.A. Pan-cancer analysis of intratumor heterogeneity as a prognostic determinant of survival. Oncotarget 2016, 7, 10051–10063. [Google Scholar] [CrossRef] [Green Version]
  26. Ng, J.C.F.; Quist, J.; Grigoriadis, A.; Malim, M.H.; Fraternali, F. Pan-cancer transcriptomic analysis dissects immune and proliferative functions of APOBEC3 cytidine deaminases. Nucleic Acids Res. 2019, 47, 1178–1194. [Google Scholar] [CrossRef] [Green Version]
  27. Weinstein, J.N.; Collisson, E.A.; Mills, G.B.; Shaw, K.R.M.; Ozenberger, B.A.; Ellrott, K.; Shmulevich, I.; Sander, C.; Stuart, J.M. Cancer Genome Atlas Research Network. The cancer genome atlas pan-cancer analysis project. Nat. Genet. 2013, 45, 1113. [Google Scholar] [CrossRef]
  28. Singh, A.; Goel, N.; Yogita. Integrative Analysis of Multi-Genomic Data for Kidney Renal Cell Carcinoma. Interdiscip. Sci. 2019. [Google Scholar] [CrossRef]
  29. Chen, H.; Fu, W.; Wang, Z.; Wang, X.; Lei, T.; Zhu, F.; Li, D.; Chang, S.; Xu, L.; Hou, T. Reliability of Docking-Based Virtual Screening for GPCR Ligands with Homology Modeled Structures: A Case Study of the Angiotensin II Type I Receptor. ACS Chem. Neurosci. 2019, 10, 677–689. [Google Scholar] [CrossRef]
  30. Varano, F.; Catarzi, D.; Falsini, M.; Vincenzi, F.; Pasquini, S.; Varani, K.; Colotta, V. Identification of novel thiazolo[5,4-d]pyrimidine derivatives as human A1 and A2A adenosine receptor antagonists/inverse agonists. Bioorg. Med. Chem. 2018, 26, 3688–3695. [Google Scholar] [CrossRef]
  31. Cooke, R.M.; Brown, A.J.; Marshall, F.H.; Mason, J.S. Structures of G protein-coupled receptors reveal new opportunities for drug discovery. Drug Discov. Today 2015, 20, 1355–1364. [Google Scholar] [CrossRef]
  32. Becker, O.M.; Shacham, S.; Marantz, Y.; Noiman, S. Modeling the 3D structure of GPCRs: Advances and application to drug discovery. Curr. Opin. Drug Discov. Dev. 2003, 6, 353–361. [Google Scholar]
  33. Becker, O.M.; Marantz, Y.; Shacham, S.; Inbal, B.; Heifetz, A.; Kalid, O.; Bar-Haim, S.; Warshaviak, D.; Fichman, M.; Noiman, S. G protein-coupled receptors: In silico drug discovery in 3D. Proc. Natl. Acad. Sci. USA 2004, 101, 11304–11309. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Chandra, N.; Anand, P.; Yeturu, K. Structural Bioinformatics: Deriving Biological Insights from Protein Structures. Interdiscip. Sci. Comput. Life Sci. 2010, 2, 347–366. [Google Scholar] [CrossRef] [PubMed]
  35. Kaushik, A.C.; Gautam, D.; Nangraj, A.S.; Wei, D.Q.; Sahi, S. Protection of Primary Dopaminergic Midbrain Neurons Through Impact of Small Molecules Using Virtual Screening of GPR139 Supported by Molecular Dynamic Simulation and Systems Biology. Interdiscip. Sci. Comput. Life Sci. 2019, 11, 247–257. [Google Scholar] [CrossRef] [PubMed]
  36. Mahajanakatti, A.B.; Murthy, G.; Sharma, N.; Skariyachan, S. Exploring Inhibitory Potential of Curcumin against Various Cancer Targets by in silico Virtual Screening. Interdiscip. Sci. Comput. Life Sci. 2014, 6, 13–24. [Google Scholar] [CrossRef] [PubMed]
  37. Wang, Y.J.; Wang, X.G.; Xiong, Y.; Kaushik, A.C.; Muhammad, J.; Khan, A.; Dai, H.; Wei, D.Q. New strategy for identifying potential natural HIV-1 non-nucleoside reverse transcriptase inhibitors against drug-resistance: An in silico study. J. Biomol. Struct. Dyn. 2019. [Google Scholar] [CrossRef]
  38. Warde-Farley, D.; Donaldson, S.L.; Comes, O.; Zuberi, K.; Badrawi, R.; Chao, P.; Franz, M.; Grouios, C.; Kazi, F.; Lopes, C.T.; et al. The GeneMANIA prediction server: Biological network integration for gene prioritization and predicting gene function. Nucleic Acids Res. 2010, 38, W214–W220. [Google Scholar] [CrossRef]
  39. Takihara, Y.; Matsuda, Y.; Hara, J. Role of the beta isoform of 14-3-3 proteins in cellular proliferation and oncogenic transformation. Carcinogenesis 2000, 21, 2073–2077. [Google Scholar] [CrossRef] [Green Version]
  40. Stark, C.; Breitkreutz, B.-J.; Reguly, T.; Boucher, L.; Breitkreutz, A.; Tyers, M. BioGRID: A general repository for interaction datasets. Nucleic Acids Res. 2006, 34, D535–D539. [Google Scholar] [CrossRef] [Green Version]
  41. Razick, S.; Magklaras, G.; Donaldson, I.M. iRefIndex: A consolidated protein interaction database with provenance. BMC Bioinf. 2008, 9, 405. [Google Scholar] [CrossRef] [Green Version]
  42. Tateno, H.; Matsushima, A.; Hiemori, K.; Onuma, Y.; Ito, Y.; Hasehira, K.; Nishimura, K.; Ohtaka, M.; Takayasu, S.; Nakanishi, M.; et al. Podocalyxin Is a Glycoprotein Ligand of the Human Pluripotent Stem Cell-Specific Probe rBC2LCN. Stem Cells Transl. Med. 2013, 2, 265–273. [Google Scholar] [CrossRef] [PubMed]
  43. Hannenhalli, S.; Putt, M.E.; Gilmore, J.M.; Wang, J.W.; Parmacek, M.S.; Epstein, J.A.; Morrisey, E.E.; Margulies, K.B.; Cappola, T.P. Transcriptional genomics associates FOX transcription factors with human heart failure. Circulation 2006, 114, 1269–1276. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Coelho, S.G.; Yin, L.L.; Smuda, C.; Mahns, A.; Kolbe, L.; Hearing, V.J. Photobiological implications of melanin photoprotection after UVB-induced tanning of human skin but not UVA-induced tanning. Pigment Cell Melanoma Res. 2015, 28, 210–216. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  45. Scholtysik, R.; Kreuz, M.; Hummel, M.; Rosolowski, M.; Szczepanowski, M.; Klapper, W.; Loeffler, M.; Trumper, L.; Siebert, R.; Kuppers, R.; et al. Characterization of genomic imbalances in diffuse large B-cell lymphoma by detailed SNP-chip analysis. Int. J. Cancer 2015, 136, 1033–1042. [Google Scholar] [CrossRef]
  46. Anaya, J. OncoLnc: Linking TCGA survival data to mRNAs, miRNAs, and lncRNAs. PeerJ Comput. Sci. 2016, 2, e67. [Google Scholar] [CrossRef] [Green Version]
  47. Law, C.W.; Chen, Y.; Shi, W.; Smyth, G.K. Voom: Precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol. 2014, 15, R29. [Google Scholar] [CrossRef] [Green Version]
  48. Ritchie, M.E.; Phipson, B.; Wu, D.; Hu, Y.; Law, C.W.; Shi, W.; Smyth, G.K. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015, 43, e47. [Google Scholar] [CrossRef]
  49. Zhang, C.X.; Freddolino, P.L.; Zhang, Y. COFACTOR: Improved protein function prediction by combining structure, sequence and protein-protein interaction information. Nucleic Acids Res. 2017, 45, W291–W299. [Google Scholar] [CrossRef]
  50. Liu, W.; Chun, E.; Thompson, A.A.; Chubukov, P.; Xu, F.; Katritch, V.; Han, G.W.; Roth, C.B.; Heitman, L.H.; Cherezov, V.; et al. Structural basis for allosteric regulation of GPCRs by sodium ions. Science 2012, 337, 232–236. [Google Scholar] [CrossRef] [Green Version]
  51. Pándy-Szekeres, G.; Munk, C.; Tsonkov, T.M.; Mordalski, S.; Harpsøe, K.; Hauser, A.S.; Bojarski, A.J.; Gloriam, D.E. GPCRdb in 2018: Adding GPCR structure models and ligands. Nucleic Acids Res. 2017, 46, D440–D446. [Google Scholar] [CrossRef] [Green Version]
  52. Sriram, K.; Insel, P.A. G Protein-Coupled Receptors as Targets for Approved Drugs: How Many Targets and How Many Drugs? Mol. Pharmacol. 2018, 93, 251–258. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  53. Andradas, C.; Caffarel, M.M.; Perez-Gomez, E.; Salazar, M.; Lorente, M.; Velasco, G.; Guzmán, M.; Sánchez, C. The orphan G protein-coupled receptor GPR55 promotes cancer cell proliferation via ERK. Oncogene 2011, 30, 245. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  54. Cartwright, A.; Schmutz, C.; Askari, A.; Kuiper, J.H.; Middleton, J. Orphan receptor GPR15/BOB is up-regulated in rheumatoid arthritis. Cytokine 2014, 67, 53–59. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  55. Sato, E.; Olson, S.H.; Ahn, J.; Bundy, B.; Nishikawa, H.; Qian, F.; Jungbluth, A.A.; Frosina, D.; Gnjatic, S.; Ambrosone, C. Intraepithelial CD8+ tumor-infiltrating lymphocytes and a high CD8+/regulatory T cell ratio are associated with favorable prognosis in ovarian cancer. Proc. Natl. Acad. Sci. USA 2005, 102, 18538–18543. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  56. Colaprico, A.; Silva, T.C.; Olsen, C.; Garofano, L.; Cava, C.; Garolini, D.; Sabedot, T.S.; Malta, T.M.; Pagnotta, S.M.; Castiglioni, I. TCGAbiolinks: An R/Bioconductor package for integrative analysis of TCGA data. Nucleic Acids Res. 2015, 44, e71. [Google Scholar] [CrossRef] [PubMed]
  57. McKenna, A.; Hanna, M.; Banks, E.; Sivachenko, A.; Cibulskis, K.; Kernytsky, A.; Garimella, K.; Altshuler, D.; Gabriel, S.; Daly, M. The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010, 20, 1297–1303. [Google Scholar] [CrossRef] [Green Version]
  58. Alioto, T.S.; Buchhalter, I.; Derdak, S.; Hutter, B.; Eldridge, M.D.; Hovig, E.; Heisler, L.E.; Beck, T.A.; Simpson, J.T.; Tonon, L. A comprehensive assessment of somatic mutation detection in cancer using whole-genome sequencing. Nat. Commun. 2015, 6, 10001. [Google Scholar] [CrossRef] [Green Version]
  59. Mayakonda, A.; Lin, D.C.; Assenov, Y.; Plass, C.; Koeffler, H.P. Maftools: Efficient and comprehensive analysis of somatic variants in cancer. Genome Res. 2018, 28, 1747–1756. [Google Scholar] [CrossRef] [Green Version]
  60. Tang, Z.; Li, C.; Kang, B.; Gao, G.; Li, C.; Zhang, Z. GEPIA: A web server for cancer and normal gene expression profiling and interactive analyses. Nucleic Acids Res. 2017, 45, W98–W102. [Google Scholar] [CrossRef] [Green Version]
  61. Melé, M.; Ferreira, P.G.; Reverter, F.; DeLuca, D.S.; Monlong, J.; Sammeth, M.; Young, T.R.; Goldmann, J.M.; Pervouchine, D.D.; Sullivan, T.J. The human transcriptome across tissues and individuals. Science 2015, 348, 660–665. [Google Scholar] [CrossRef] [Green Version]
  62. Li, T.; Fan, J.; Wang, B.; Traugh, N.; Chen, Q.; Liu, J.S.; Li, B.; Liu, X.S. TIMER: A Web Server for Comprehensive Analysis of Tumor-Infiltrating Immune Cells. Cancer Res. 2017, 77, e108–e110. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  63. Shannon, P.; Markiel, A.; Ozier, O.; Baliga, N.S.; Wang, J.T.; Ramage, D.; Amin, N.; Schwikowski, B.; Ideker, T. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 2003, 13, 2498–2504. [Google Scholar] [CrossRef] [PubMed]
  64. Edgar, R.; Domrachev, M.; Lash, A.E. Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res. 2002, 30, 207–210. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  65. Brown, K.R.; Jurisica, I. Online Predicted Human Interaction Database. Bioinformatics 2005, 21, 2076–2082. [Google Scholar] [CrossRef] [Green Version]
  66. Mostafavi, S.; Ray, D.; Warde-Farley, D.; Grouios, C.; Morris, Q. GeneMANIA: A real-time multiple association network integration algorithm for predicting gene function. Genome Biol. 2008, 9, S4. [Google Scholar] [CrossRef] [Green Version]
  67. Croft, D.; Mundo, A.F.; Haw, R.; Milacic, M.; Weiser, J.; Wu, G.; Caudy, M.; Garapati, P.; Gillespie, M.; Kamdar, M.R. The Reactome pathway knowledgebase. Nucleic Acids Res. 2013, 42, D472–D477. [Google Scholar] [CrossRef]
  68. Kassambara, A.; Kosinski, M.; Biecek, P. Survminer: Drawing Survival Curves Using’ggplot2’, R Package Version 03; Available online: https://rpkgs.datanovia.com/survminer/index.html (accessed on 3 September 2019).
  69. Conway, J.R.; Lex, A.; Gehlenborg, N. UpSetR: An R package for the visualization of intersecting sets and their properties. Bioinformatics 2017, 33, 2938–2940. [Google Scholar] [CrossRef] [Green Version]
  70. Swinton, J. Vennerable: Venn and Euler Area-Proportional Diagrams, R Package Version 03; Available online: https://rdrr.io/rforge/Vennerable/ (accessed on 2 May 2019).
  71. Yu, G.; Wang, L.-G.; Han, Y.; He, Q.-Y. Clusterprofiler: An R package for comparing biological themes among gene clusters. Omics J. Integr. Biol. 2012, 16, 284–287. [Google Scholar] [CrossRef]
  72. Ashburner, M.; Ball, C.A.; Blake, J.A.; Botstein, D.; Butler, H.; Cherry, J.M.; Davis, A.P.; Dolinski, K.; Dwight, S.S.; Eppig, J.T. Gene Ontology: Tool for the unification of biology. Nat. Genet. 2000, 25, 25–29. [Google Scholar] [CrossRef] [Green Version]
  73. Kanehisa, M.; Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000, 28, 27–30. [Google Scholar] [CrossRef]
  74. Altschul, S.F.; Gish, W.; Miller, W.; Myers, E.W.; Lipman, D.J. Basic local alignment search tool. J. Mol. Biol. 1990, 215, 403–410. [Google Scholar] [CrossRef]
  75. Sanchez, R.; Sali, A. Evaluation of comparative protein structure modeling by MODELLER-3. Proteins 1997, 1, 50–58. [Google Scholar] [CrossRef]
  76. Rodrigues, J.P.; Levitt, M.; Chopra, G. KoBaMIN: A knowledge-based minimization web server for protein structure refinement. Nucleic Acids Res. 2012, 40, W323–W328. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  77. Jain, A.N. Scoring noncovalent protein-ligand interactions: A continuous differentiable function tuned to compute binding affinities. J. Comput.-Aided Mol. Des. 1996, 10, 427–440. [Google Scholar] [CrossRef]
  78. Van Der Spoel, D.; Lindahl, E.; Hess, B.; Groenhof, G.; Mark, A.E.; Berendsen, H.J. GROMACS: Fast, flexible, and free. J. Comput. Chem. 2005, 26, 1701–1718. [Google Scholar] [CrossRef]
  79. Jo, S.; Lim, J.B.; Klauda, J.B.; Im, W. CHARMM-GUI Membrane Builder for mixed bilayers and its application to yeast membranes. Biophys. J. 2009, 97, 50–58. [Google Scholar] [CrossRef] [Green Version]
  80. Huang, J.; MacKerell, A.D., Jr. CHARMM36 all-atom additive protein force field: Validation based on comparison to NMR data. J. Comput. Chem. 2013, 34, 2135–2145. [Google Scholar] [CrossRef] [Green Version]
  81. Vanommeslaeghe, K.; Hatcher, E.; Acharya, C.; Kundu, S.; Zhong, S.; Shim, J.; Darian, E.; Guvench, O.; Lopes, P.; Vorobyov, I.; et al. CHARMM general force field: A force field for drug-like molecules compatible with the CHARMM all-atom additive biological force fields. J. Comput. Chem. 2010, 31, 671–690. [Google Scholar] [CrossRef] [Green Version]
  82. Jorgensen, W.L.; Chandrasekhar, J.; Madura, J.D.; Impey, R.W.; Klein, M.L. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 1983, 79, 926–935. [Google Scholar] [CrossRef]
  83. Hess, B.; Bekker, H.; Berendsen, H.J.C.; Fraaije, J.G.E.M. LINCS: A linear constraint solver for molecular simulations. J. Comput. Chem. 1997, 18, 1463–1472. [Google Scholar] [CrossRef]
  84. Feller, S.E.; Zhang, Y.; Pastor, R.W.; Brooks, B.R. Constant pressure molecular dynamics simulation: The Langevin piston method. J. Chem. Phys. 1995, 103, 4613–4621. [Google Scholar] [CrossRef]
  85. Parrinello, M.; Rahman, A. Polymorphic transitions in single crystals: A new molecular dynamics method. J. Appl. Phys. 1981, 52, 7182–7190. [Google Scholar] [CrossRef]
  86. Darden, T.; York, D.; Pedersen, L. Particle mesh Ewald: An N log (N) method for Ewald sums in large systems. J. Chem. Phys. 1993, 98, 10089–10092. [Google Scholar] [CrossRef] [Green Version]
  87. Kumari, R.; Kumar, R. Open Source Drug Discovery Consortium; Lynn, A. g_mmpbsa—A GROMACS tool for high-throughput MM-PBSA calculations. J. Chem. Inf. Model. 2014, 54, 1951–1962. [Google Scholar] [CrossRef]
Figure 1. Expression and mutational landscape of GPR15 in the Cancer Genome Atlas (TCGA) cohorts. (A). Y-axis represents mutational rates of GPR15 (simple somatic mutation) in all TCGA cohorts. The cancer types whose GPR15 mutational rate is 0 are excluded. (B). Pan-cancer expression landscape of GPR15. “T” stands for tumor tissue and “N” stands for paired normal tissue. The expression abundance is measured by log-normalized transcripts per million (TPM). The green color of the cancer type means that GPR15 is differentially expressed between tumor tissue and paired normal cell. (C) Bar graph of the gene expression profile across all tumor samples and paired normal tissues. The height of bar represents the median expression (log-normalized TPM) of certain tumor type or normal tissue.
Figure 1. Expression and mutational landscape of GPR15 in the Cancer Genome Atlas (TCGA) cohorts. (A). Y-axis represents mutational rates of GPR15 (simple somatic mutation) in all TCGA cohorts. The cancer types whose GPR15 mutational rate is 0 are excluded. (B). Pan-cancer expression landscape of GPR15. “T” stands for tumor tissue and “N” stands for paired normal tissue. The expression abundance is measured by log-normalized transcripts per million (TPM). The green color of the cancer type means that GPR15 is differentially expressed between tumor tissue and paired normal cell. (C) Bar graph of the gene expression profile across all tumor samples and paired normal tissues. The height of bar represents the median expression (log-normalized TPM) of certain tumor type or normal tissue.
Ijms 20 06226 g001
Figure 2. Mutational summary plot of uterine corpus endometrial carcinoma (UCEC), Uterine carcinosarcoma (UCS), lung squamous carcinoma (LUSC), rectal adenocarcinoma (READ), and colon adenocarcinoma (COAD).
Figure 2. Mutational summary plot of uterine corpus endometrial carcinoma (UCEC), Uterine carcinosarcoma (UCS), lung squamous carcinoma (LUSC), rectal adenocarcinoma (READ), and colon adenocarcinoma (COAD).
Ijms 20 06226 g002
Figure 3. The integrated network of GPR15. The edge color represents supporting data. Pink means the line is based on physical interactions. Purple means the line is generated from co-expression profiles. Blue means co-localization, and yellow stands for predicted interaction. Node size stands for its weight in the network.
Figure 3. The integrated network of GPR15. The edge color represents supporting data. Pink means the line is based on physical interactions. Purple means the line is generated from co-expression profiles. Blue means co-localization, and yellow stands for predicted interaction. Node size stands for its weight in the network.
Ijms 20 06226 g003
Figure 4. Differential gene expression (DEG) analysis results and Kaplan Meier (KM) plots. (A). Upset and Venn diagram of DEGs in COAD, HNSC, LUAD, and STAD. (BE). KM plots of the GPR15 expression groups in COAD, HNSC, LUAD, and STAD.
Figure 4. Differential gene expression (DEG) analysis results and Kaplan Meier (KM) plots. (A). Upset and Venn diagram of DEGs in COAD, HNSC, LUAD, and STAD. (BE). KM plots of the GPR15 expression groups in COAD, HNSC, LUAD, and STAD.
Ijms 20 06226 g004
Figure 5. Gene-concept network for commonly upregulated gene set (CUPGS). The size of the GO term stands for the number of genes in the CUPGS that is annotated based on the term. Color scale of the gene name stands for the mean fold-change in the high GPR15 expression groups compared to the low GPR15 expression groups among COAD, HNSC, LUAD, and STAD.
Figure 5. Gene-concept network for commonly upregulated gene set (CUPGS). The size of the GO term stands for the number of genes in the CUPGS that is annotated based on the term. Color scale of the gene name stands for the mean fold-change in the high GPR15 expression groups compared to the low GPR15 expression groups among COAD, HNSC, LUAD, and STAD.
Ijms 20 06226 g005
Figure 6. Correlations between expression level of GPR15 and immune cell infiltration levels in COAD, HNSC, LUAD, and STAD generated from TIMER.
Figure 6. Correlations between expression level of GPR15 and immune cell infiltration levels in COAD, HNSC, LUAD, and STAD generated from TIMER.
Ijms 20 06226 g006
Figure 7. Homology modeling result of GPR15. (A) Predicted seven transmembrane structure (7TM) of GPR15. Different colors indicate various domains. In addition, there is a disulfide bond between Cys106 and Cys183. (B) The Ramachandran plot result of GPR15.
Figure 7. Homology modeling result of GPR15. (A) Predicted seven transmembrane structure (7TM) of GPR15. Different colors indicate various domains. In addition, there is a disulfide bond between Cys106 and Cys183. (B) The Ramachandran plot result of GPR15.
Ijms 20 06226 g007
Figure 8. Ligand binding landscape of GPR15 (A) The schematic graph of the predicted binding site of GPR15. The predicted binding site in our 3D structure is a traditional orthosteric binding site in the vicinity of the highly conserved residue (TRP254, W6.48) of family A GPCRs [50]. The key residues are shown in purple-red sticks. (B) The snake diagram, generated via GPCRdb [51], of the predicted active side residues (purple) interacting with the ligands. (C) The predicted binding mode between GPR15 and ligands at the active site pocket (dashed box). The protein–ligand interactions of representative docking poses of the top eight hits are displayed around. Different ligands are represented by different colored sticks. Hydrogen bonds are illustrated by purple lines, and Pi–pi and Pi–cation interactions are marked by a black line.
Figure 8. Ligand binding landscape of GPR15 (A) The schematic graph of the predicted binding site of GPR15. The predicted binding site in our 3D structure is a traditional orthosteric binding site in the vicinity of the highly conserved residue (TRP254, W6.48) of family A GPCRs [50]. The key residues are shown in purple-red sticks. (B) The snake diagram, generated via GPCRdb [51], of the predicted active side residues (purple) interacting with the ligands. (C) The predicted binding mode between GPR15 and ligands at the active site pocket (dashed box). The protein–ligand interactions of representative docking poses of the top eight hits are displayed around. Different ligands are represented by different colored sticks. Hydrogen bonds are illustrated by purple lines, and Pi–pi and Pi–cation interactions are marked by a black line.
Ijms 20 06226 g008
Figure 9. Root-mean-square deviation (RMSD) results of the GPR15–ligand complex systems as a function of time during the MD simulations. (A) RMSDs of backbone atoms for GPR15–ligand complexes. All the complexes are stable and attain the stability soon after reaching 20 ns. (B) RMSDs of heavy atoms for screened ligands. Compound 5, 6, 7, and 8 showed lower fluctuations among the eight hits in RMSD.
Figure 9. Root-mean-square deviation (RMSD) results of the GPR15–ligand complex systems as a function of time during the MD simulations. (A) RMSDs of backbone atoms for GPR15–ligand complexes. All the complexes are stable and attain the stability soon after reaching 20 ns. (B) RMSDs of heavy atoms for screened ligands. Compound 5, 6, 7, and 8 showed lower fluctuations among the eight hits in RMSD.
Ijms 20 06226 g009
Table 1. Top five GPR15-related genes with highest scores.
Table 1. Top five GPR15-related genes with highest scores.
GeneScoreNetwork GroupNetwork Resource
YWHAB0.22097643Physical InteractionsBioGRID-small-scale-studies [40]
YWHAB0.14404532Physical InteractionsIREF-INTACT [41]
TACR10.038140558Co-expressionTateno-Hirabayashi 2013 [42]
TAS2R90.034424774Co-expressionHannenhalli-Cappola 2006 [43]
SPDYE40.031733938Co-expressionCoelho-Hearing 2015 [44]
GPR1820.029303862Co-expressionScholtysik-Kuppers 2015 [45]
Table 2. Top five pathways of genes in the integrated network.
Table 2. Top five pathways of genes in the integrated network.
Pathway IDPathway Namep-ValueEntities Found
R-HSA-450385Butyrate Response Factor 1 (BRF1) binds and destabilizes mRNA0.00231YWHAB;EXOSC1
R-HSA-450513Tristetraprolin (TTP, ZFP36) binds and destabilizes mRNA0.00231YWHAB;EXOSC1
R-HSA-525793Myogenesis0.00636MYF6;MEF2D
R-HSA-375170CDO in myogenesis0.00636MYF6;MEF2D
R-HSA-388396GPCR downstream signaling0.00967GPR15
Table 3. Top 10 potential cancer types whose prognosis is associated with GPR15.
Table 3. Top 10 potential cancer types whose prognosis is associated with GPR15.
CancerCox Coefficientp-ValueRank
STAD0.270.002269
HNSC−0.2050.006707
LUAD−0.1610.0393711
COAD−0.1590.1504299
READ−0.3280.1602696
LUSC0.070.3306956
KIRC−0.0590.48013,174
LAML0.0780.5109516
ESCA0.0210.88014,833
“Rank” stands for the expression abundance rank among all genes.
Table 4. Top 10 enriched GO terms for each GO category. BP stands for biological process, CC stands for cellular component, MF stands for molecular function.
Table 4. Top 10 enriched GO terms for each GO category. BP stands for biological process, CC stands for cellular component, MF stands for molecular function.
IDDescriptionp-AdjustCategory
GO:0006958complement activation, classical pathway2.78 × 10−120BP
GO:0002455humoral immune response mediated by circulating immunoglobulin3.03 × 10−117BP
GO:0006956complement activation2.21× 10−112BP
GO:0072376protein activation cascade1.16 × 10−107BP
GO:0016064immunoglobulin mediated immune response1.36 × 10−104BP
GO:0019724B cell mediated immunity1.66 × 10−104BP
GO:0002429immune response-activating cell surface receptor signaling pathway3.22 × 10−91BP
GO:0006959humoral immune response4.60 × 10−91BP
GO:0002768immune response-regulating cell surface receptor signaling pathway8.99 × 10−94BP
GO:0002460adaptive immune response based on somatic recombination of immune receptors1.21 × 10−91BP
GO:0019814immunoglobulin complex1.32 × 10−80CC
GO:0042571immunoglobulin complex, circulating3.13 × 10−77CC
GO:0009897external side of plasma membrane1.50 × 10−44CC
GO:0072562blood microparticle2.80 × 10−18CC
GO:0098802plasma membrane receptor complex0.513620478CC
GO:0042101T cell receptor complex0.513620478CC
GO:0008180COP9 signalosome0.721923256CC
GO:0043235receptor complex0.721923256CC
GO:0000788nuclear nucleosome0.721923256CC
GO:0005771multivesicular body0.754761177MF
GO:0003823antigen binding2.41 × 10−159MF
GO:0034987immunoglobulin receptor binding2.28 × 10−71MF
GO:0004252serine-type endopeptidase activity2.71 × 10−46MF
GO:0008236serine-type peptidase activity9.31 × 10−45MF
GO:0017171serine hydrolase activity1.40 × 10−44MF
GO:0005068transmembrane receptor protein tyrosine kinase adaptor activity0.03233891MF
GO:0042834peptidoglycan binding0.055957242MF
GO:0031210phosphatidylcholine binding0.100244352MF
GO:0050997quaternary ammonium group binding0.100244352MF
GO:0035591signaling adaptor activity0.102967022MF
Table 5. The docking score and predicted protein–ligand interaction of the top eight compounds selected in virtual screening.
Table 5. The docking score and predicted protein–ligand interaction of the top eight compounds selected in virtual screening.
Compound No.Molecular FormulaWeight (g/mol)Docking ScoreNoncovalent InteractionsResidues
C1C38H58O2N2576.91−11.632 Pi–pi, 2 H–bondTRP89, ASP91
C2C60H55O8N1918.09−11.151 Pi–pi, 2 Pi–cation, 1 H–BondLYS180, ARG172, TRP195, LYS261
C3C38H41O7N3653.77−10.792 H–BondCYS183, ARG172
C4C21H28O4N2S404.53−10.281 Pi–pi, 2 H–bondTRP89, SER109, LYS180
C5C34H47O6N3593.76−10.111 Pi–pi, 1 Salt–bridgePHE257, LYS261
C6C27H46O3418.66−8.722 H–bondARG172, LYS180
C7C20H24O4328.41−8.32 Pi–pi, 1 H–bondTRP89, TYR182, LYS180
C8C22H27O5N5S473.55−8.291 Salt–bridge,1 Pi–piLYS261, TRP89

Share and Cite

MDPI and ACS Style

Wang, Y.; Wang, X.; Xiong, Y.; Li, C.-D.; Xu, Q.; Shen, L.; Chandra Kaushik, A.; Wei, D.-Q. An Integrated Pan-Cancer Analysis and Structure-Based Virtual Screening of GPR15. Int. J. Mol. Sci. 2019, 20, 6226. https://doi.org/10.3390/ijms20246226

AMA Style

Wang Y, Wang X, Xiong Y, Li C-D, Xu Q, Shen L, Chandra Kaushik A, Wei D-Q. An Integrated Pan-Cancer Analysis and Structure-Based Virtual Screening of GPR15. International Journal of Molecular Sciences. 2019; 20(24):6226. https://doi.org/10.3390/ijms20246226

Chicago/Turabian Style

Wang, Yanjing, Xiangeng Wang, Yi Xiong, Cheng-Dong Li, Qin Xu, Lu Shen, Aman Chandra Kaushik, and Dong-Qing Wei. 2019. "An Integrated Pan-Cancer Analysis and Structure-Based Virtual Screening of GPR15" International Journal of Molecular Sciences 20, no. 24: 6226. https://doi.org/10.3390/ijms20246226

APA Style

Wang, Y., Wang, X., Xiong, Y., Li, C. -D., Xu, Q., Shen, L., Chandra Kaushik, A., & Wei, D. -Q. (2019). An Integrated Pan-Cancer Analysis and Structure-Based Virtual Screening of GPR15. International Journal of Molecular Sciences, 20(24), 6226. https://doi.org/10.3390/ijms20246226

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop