Next Article in Journal
Stem Cells: The Pursuit of Genomic Stability
Next Article in Special Issue
Structure–Biological Function Relationship Extended to Mitotic Arrest-Deficient 2-Like Protein Mad2 Native and Mutants-New Opportunity for Genetic Disorder Control
Previous Article in Journal
Chrysin Protects against Focal Cerebral Ischemia/Reperfusion Injury in Mice through Attenuation of Oxidative Stress and Inflammation
Previous Article in Special Issue
Competing Intramolecular vs. Intermolecular Hydrogen Bonds in Solution
 
 
Correction published on 5 March 2015, see Int. J. Mol. Sci. 2015, 16(3), 5072-5075.
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

3D QSAR Studies, Pharmacophore Modeling and Virtual Screening on a Series of Steroidal Aromatase Inhibitors

1
Department of Chemistry, Yunnan University, Kunming 650091, Yunnan, China
2
Department of Chemistry, School of Pharmaceutical Science & Yunnan Key Laboratory of Pharmacology for Natural Products, Kunming Medical University, Kunming 650500, Yunnan, China
*
Authors to whom correspondence should be addressed.
Int. J. Mol. Sci. 2014, 15(11), 20927-20947; https://doi.org/10.3390/ijms151120927
Submission received: 27 July 2014 / Revised: 28 September 2014 / Accepted: 22 October 2014 / Published: 14 November 2014
(This article belongs to the Special Issue Chemical Bond and Bonding 2015)

Abstract

:
Aromatase inhibitors are the most important targets in treatment of estrogen-dependent cancers. In order to search for potent steroidal aromatase inhibitors (SAIs) with lower side effects and overcome cellular resistance, comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) were performed on a series of SAIs to build 3D QSAR models. The reliable and predictive CoMFA and CoMSIA models were obtained with statistical results (CoMFA: q2 = 0.636, r2ncv = 0.988, r2pred = 0.658; CoMSIA: q2 = 0.843, r2ncv = 0.989, r2pred = 0.601). This 3D QSAR approach provides significant insights that can be used to develop novel and potent SAIs. In addition, Genetic algorithm with linear assignment of hypermolecular alignment of database (GALAHAD) was used to derive 3D pharmacophore models. The selected pharmacophore model contains two acceptor atoms and four hydrophobic centers, which was used as a 3D query for virtual screening against NCI2000 database. Six hit compounds were obtained and their biological activities were further predicted by the CoMFA and CoMSIA models, which are expected to design potent and novel SAIs.

Graphical Abstract

1. Introduction

Aromatase is a cytochrome P-450 dependent enzyme that catalyzes the aromatization of androgens to estrogens. Aromatase inhibitors (AIs) reduce the synthesis of estrogens and offer a therapeutic alternative for the treatment of estrogen-dependent cancers, such as breast cancer [1,2,3]. There are two classes of AIs, steroidal and non-steroidal compounds, which cause potent estrogen suppression [4]. The non-steroidal aromatase inhibitors (NSAIs) are mostly azole type compounds, such as the clinically used anastrozole and letrozole, which compete with the substrate for binding to the enzyme active site [5]. Among steroidal aromatase inhibitors (SAIs), formestane was widely used during the early 1990s, but it is not used nowadays because of the need to administer it by intramuscular injection. Therefore, the orally active exemestane is the main steroidal inhibitor [6]. These SAIs mimic the natural substrate androstenedione and are converted by the enzyme to reactive intermediates, which bind irreversibly to the enzyme active site, resulting in inactivation of aromatase [7]. Despite the success of the third-generation NSAIs (anastrazole and letrozole) and SAIs (exemestane), they still have some major side effects, such as increase of bone loss, joint pain, and heart problems [8]. In addition, after some years of usage they can develop cellular resistance. For these reasons, it is important to search for other potent and specific molecules with lower side effects and which can overcome the resistance phenomena.
Quantitative structure-activity relationship (QSAR) methods have been successfully employed to assist the design of new small molecule drug candidates [9,10,11,12,13,14,15,16]. Comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) are two of the most widely used three-dimensional quantitative structure-activity relationship (3D QSAR) methodologies. CoMFA calculates the energies of steric and electrostatic interactions between the compound and the probe atom at various intersections of a regular 3D lattice according to Lennard-Jones and Coulomb potentials. The resulting energies derived from these two potential functions can be contoured to offer a quantitative spatial description of the molecular properties [17]. CoMSIA introduces the Gaussian function for the distance dependence between the molecular atoms and the probe atom in order to avoid some inherent deficiencies arising from the Lennard-Jones and Coulomb potential functional forms. CoMSIA is applied to gain an insight into how steric fields, electrostatic fields, hydrophobic fields, hydrogen bond donor (HBD) and hydrogen bond acceptor (HBA) influence the activity of inhibitors [18].
Pharmacophore modeling can provide valuable insight into ligand-receptor interactions. Pharmacophore searches are the best option to find a range of chemical structures with viable features. A pharmacophore model can be considered as the ensemble of steric and electrostatic features of different compounds, which are necessary to ensure optimal supramolecular interactions with a specific biological target structure and to trigger or to block its biological response. Thus, pharmacophore modeling is the method of choice for the first round of compound selection. This ability of a pharmacophore model is used to find new classes of inhibitors when one class is known. This is known as “scaffold hopping” [19,20,21].
A series of SAIs, shown in Table 1, have been reported in the recent literatures [22,23,24,25,26,27]. To understand the structural requirements for inhibitory activity and design more potent agents, 3D QSAR studies were performed for the fist time for these SAIs using CoMFA and CoMSIA. In addition, 3D pharmacophore models were created and the selected best model was used as a 3D query for virtual screening against NCI2000 database. The biological activities of hit compounds were further predicted by using CoMFA and CoMSIA models.
Table 1. Chemical structures and bioactivity values of the steroidal aromatase inhibitors in current study.
Table 1. Chemical structures and bioactivity values of the steroidal aromatase inhibitors in current study.
CompoundGeneral StructureSubstituentsIC50 (µM)pIC50
1 b Ijms 15 20927 i001 Ijms 15 20927 i0025.25.284
2 a Ijms 15 20927 i00322.74.644
3 Ijms 15 20927 i004 6.45.194
4 Ijms 15 20927 i005–CH35.25.284
5 a–CH2CH318.14.742
6–CH2C≡CCH30.11236.950
7 b–CH2C≡CCH2CH30.01187.928
8 a–CH2C≡C(CH2)2CH30.0837.081
9–CH2C≡C(CH2)3CH30.18116.742
10–CH2C≡C(CH2)6CH32.185.662
11–CH2C≡CCH2OH0.027.699
12 b (Exemestane) Ijms 15 20927 i006 0.05017.300
13 b (Formestane) Ijms 15 20927 i007 0.04867.313
14 a Ijms 15 20927 i008 3.305.481
15 Ijms 15 20927 i009 0.186.745
16 b Ijms 15 20927 i010 0.166.796
17 Ijms 15 20927 i011 4.905.310
18 a Ijms 15 20927 i012 0.1836.738
19 b Ijms 15 20927 i013 0.1356.870
20 Ijms 15 20927 i014 0.9706.013
21 Ijms 15 20927 i015 0.2256.648
22 a Ijms 15 20927 i016 0.1456.839
23 Ijms 15 20927 i017 1.7335.761
24 Ijms 15 20927 i018 1.1505.939
25 a Ijms 15 20927 i019 0.596.229
26 Ijms 15 20927 i020 0.756.125
27 b Ijms 15 20927 i021 0.456.347
28 Ijms 15 20927 i022 0.476.328
29 a Ijms 15 20927 i0231-OH; 2-Ph7.275.138
301-OH; 2-MeOPh(p)7.135.147
311-OCOCF3; 2-Ph7.125.148
321-OCOCF3; 2-MeOPh(p)7.005.155
33 Ijms 15 20927 i0241-OCOCF3; 2-COCH3; 3-Ph6.575.182
34 a1-OCOCF3; 2-COCH3; 3-MeOPh(p)6.455.190
351-OCOCF3; 2-COCH2CH3; 3-Ph6.725.173
361-OCOCF3; 2-COCH2CH3; 3-MeOPh(p)6.615.180
371-OH; 2-H; 3-Ph6.915.161
381-OH; 2-H; 3-MeOPh(p)6.835.166
39 a1-OCOCF3; 2-CH3; 3-Ph5.815.236
401-OCOCF3; 2-CH3; 3-MeOPh(p)5.785.238
41 a1-OCOCF3; 2-Ph; 3-Ph5.675.246
421-OCOCF3; 2-Ph; 3-MeOPh(p)5.455.264
431-OH; 2-CH3; 3-Ph6.345.198
441-OH; 2-CH3; 3-MeOPh(p)6.125.213
451-OH; 2-Ph; 3-Ph6.015.221
46 a1-OH; 2-Ph; 3-MeOPh(p)5.925.228
47 Ijms 15 20927 i0251-H; 2-Ph5.235.281
481-H; 2-MeOPh(p)4.885.312
491-COCH3; 2-Ph4.915.309
501-COCH3; 2-MeOPh(p)4.895.311
51 a1-CH3; 2-Ph4.785.321
521-CH3; 2-MeOPh(p)4.565.341
531-Ph; 2-Ph4.275.370
541-Ph; 2-MeOPh(p)4.165.381
55 a Ijms 15 20927 i0261-COCH3; 2-Ph2.885.541
561-COCH3; 2-MeOPh(p)2.655.577
571-H; 2-Ph3.015.521
581-H; 2-MeOPh(p)2.915.536
591-CH3; 2-Ph2.455.611
60 a1-CH3; 2-MeOPh(p)2.115.676
611-Ph; 2-Ph1.985.703
62 b1-Ph; 2-MeOPh(p)1.825.740
63 Ijms 15 20927 i0271-CH3; 2-Ph3.515.455
641-CH3; 2-MeOPh(p)3.405.469
65 a1-Ph; 2-Ph3.345.476
661-Ph; 2-MeOPh(p)3.235.491
a Test-set compounds; and b Compounds used to generate pharmacophore models.

2. Results and Discussion

2.1. CoMFA and CoMSIA Statistical Results

The statistical parameters of standard CoMFA models constructed with steric and electrostatic fields are given in Table 2. The cross-validated coefficient (q2/r2cv), non-cross-validated correlation coefficient (r2ncv), standard error estimate (SEE) and F-statistic values (F) were computed as defined in SYBYL. The cross-validated (leave-one-out) PLS analysis shows a q2 value of 0.636 with ten components and non-cross-validated PLS analysis results in r2ncv of 0.988, SEE of 0.094 and F value of 309.026, which indicates it is a model with high quality. The corresponding field contributions of steric and electrostatic are 0.671 and 0.329, respectively, which means the steric field gives more contribution to the bioactivity than the electrostatic field does.
Table 2. Summary of CoMFA and CoMSIA results.
Table 2. Summary of CoMFA and CoMSIA results.
ComponentsCoMFACoMSIA (SA)
q2 (r2cv)0.6360.843
r2ncv0.9880.989
SEE0.0940.096
F value309.026174.304
r2pred0.6580.601
No. of compounds5050
No. of optimal components1017
Contribution
Steric0.6710.677
Electrostatic0.329
Hydrophobic
H-bond donor
H-bond acceptor0.323
Commonly, in CoMSIA, five different similarity fields (steric, electrostatic, hydrophobic, HBD, and HBA) are calculated. However, for CoMSIA models, the model with global descriptors is not the best model in all probability. Some papers [28,29] have discussed whether the five different descriptor fields in CoMSIA are totally independent of each other. The dependencies of the individual fields usually decrease the signal-to-noise ratio in the data [29] and lower the statistical significance of the results. Therefore, in our study, an optimization of 31 possible combinations of five different descriptor fields was evaluated from the values of the q2, which is shown in Figure 1. The higher the value of q2 shows, the better the model is. With the highest q2 of 0.843, CoMSIA model with the combination of steric and HBA fields (SA) was finally chosen as the best model, which indicates that the steric and HBA fields mainly contribute to the binding affinities. This CoMSIA (SA) model was obtained with seventeen optimal components. The analysis results are summarized in Table 2 (q2 = 0.843, r2ncv = 0.989, F = 174.304, SEE = 0.096). The corresponding field contributions of steric and HBA are 0.677 and 0.323, respectively, which means that the steric field provides more bioactivity contribution than the HBA field does.
Figure 1. The histogram of 31 possibilities of the CoMSIA field combinations (S = steric, E = electrostatic, H = hydrophobic, D = hydrogen bond donor, A = hydrogen bond acceptor).
Figure 1. The histogram of 31 possibilities of the CoMSIA field combinations (S = steric, E = electrostatic, H = hydrophobic, D = hydrogen bond donor, A = hydrogen bond acceptor).
Ijms 15 20927 g001

2.2. Validation of 3D QSAR Models

In order to validate the obtained 3D QSAR models, r2pred was used to determine the predictive abilities of the CoMFA and CoMSIA models from the 16 compounds (test set), which were not included in the generation of the models. The obtained r2pred of the test set is 0.658, 0.601 for the CoMFA, CoMSIA model, respectively, which indicates that both models have good predictive ability. The observed and predicted pIC50 of the training and test sets by the CoMFA and CoMSIA models are listed in Table 3, and the correlations between the observed and predicted pIC50 of training and test sets are depicted in Figure 2 for CoMFA model, Figure 3 for CoMSIA model, respectively.
Table 3. Observed and predicted pIC50 of the training and test sets using CoMFA and CoMSIA models.
Table 3. Observed and predicted pIC50 of the training and test sets using CoMFA and CoMSIA models.
CompoundObserved pIC50CoMFACoMSIA
Pred.Res.Pred.Res.
15.2845.2470.0375.357−0.073
2 a4.6445.642−0.9985.744−1.100
35.1945.1520.0425.1920.002
45.2845.1910.0935.2780.006
5 a4.7425.314−0.5725.822−1.080
66.9507.149−0.1997.064−0.114
77.9287.7780.1507.8670.061
8 a7.0817.102−0.0217.477−0.396
96.7426.804−0.0626.7110.031
105.6625.722−0.0605.678−0.016
117.6997.5980.1017.6850.014
127.3007.2720.0287.2810.019
137.3137.378−0.0657.361−0.048
14 a5.4816.542−1.0616.181−0.700
156.7456.869−0.1246.830−0.085
166.7966.7460.0506.7260.070
175.3105.348−0.0385.3070.003
18 a6.7386.6680.0707.123−0.385
196.8706.8110.0596.888−0.018
206.0135.9650.0486.062−0.049
216.6486.4360.2126.2670.381
22 a6.8396.1320.7075.9580.881
235.7616.102−0.3416.057−0.296
245.9396.008−0.0695.8880.051
25 a6.2296.236−0.0076.359−0.130
266.1256.0740.0516.151−0.026
276.3476.3010.0466.365−0.018
286.3286.2820.0466.2920.036
29 a5.1385.202−0.0645.0840.054
305.1475.186−0.0395.1220.025
315.1485.165−0.0175.1110.037
325.1555.1540.0015.1480.007
335.1825.1690.0135.184−0.002
34 a5.1905.1850.0055.1570.033
355.1735.184−0.0115.186−0.013
365.1805.187−0.0075.1600.020
375.1615.1270.0345.1460.015
385.1665.194−0.0285.1530.013
39 a5.2365.2220.0145.2330.003
405.2385.284−0.0465.2340.004
41 a5.2465.2370.0095.273−0.027
425.2645.291−0.0275.282−0.018
435.1985.1560.0425.200−0.002
445.2135.222−0.0095.2090.004
455.2215.1770.0445.236−0.015
46 a5.2285.230−0.0025.247−0.019
475.2815.2430.0385.2700.011
485.3125.3100.0025.2770.035
495.3095.2650.0445.2940.015
505.3115.2850.0265.2710.040
51 a5.3215.2790.0425.337−0.016
525.3415.3410.0005.3390.002
535.3705.3030.0675.377−0.007
545.3815.3560.0255.388−0.007
55 a5.5415.626−0.0855.603−0.062
565.5775.640−0.0635.5740.003
575.5215.594−0.0735.565−0.044
585.5365.656−0.1205.569−0.033
595.6115.633−0.0225.638−0.027
60 a5.6765.698−0.0225.6410.035
615.7035.6590.0445.6820.021
625.7405.7140.0265.6900.050
635.4555.4080.0475.475−0.020
645.4695.4690.0005.479−0.010
65 a5.4765.4420.0345.522−0.046
665.4915.4910.0005.526−0.035
a Test-set compounds.
Figure 2. Plots of observed versus predicted activities of the training set and test set molecules from CoMFA analysis.
Figure 2. Plots of observed versus predicted activities of the training set and test set molecules from CoMFA analysis.
Ijms 15 20927 g002aIjms 15 20927 g002b
Figure 3. Plots of observed versus predicted activities of the training set and test set molecules from CoMSIA analysis.
Figure 3. Plots of observed versus predicted activities of the training set and test set molecules from CoMSIA analysis.
Ijms 15 20927 g003aIjms 15 20927 g003b

2.3. CoMFA Contour Maps

The steric contour map for the CoMFA model with the most active inhibitor compound 7 is shown in Figure 4a, in which the green contours represent regions of high steric bulk tolerance (80% contribution), while the yellow contours represent regions of low steric bulk tolerance (20% contribution). It can be seen that a large green contour near C-6 of the ring B indicates a bulky group in this position is favorable to bioactivity. This is supported by the higher activity of compounds 611, which have large substituents in that position, compared with the lower activity of compounds 45, which have small substituents in that position. It also can be observed that there are two large yellow contours: one is next to C-3 of the ring A, and another is near C-15 and C-16 of the ring D, which suggests that a bulky group in the two regions will decrease inhibitory activity. It is confirmed by the fact that compound 17 with bulky substitution on C-3 of the ring A has lower bioactivity than compound 14 with small substitution in that position, and all the compounds with bulky groups near C-15 and C-16 of the ring D (compounds 13, compounds 2966) show low bioactivities (pIC50 < 6).
The CoMFA electrostatic contour map is shown in Figure 4b, in which the red areas are the regions where a negative potential is favorable to activity, while a negative potential is unfavorable in the blue areas. This figure displays two red contours around C-3 of the ring A and one red contour near C-4 of the ring A, which means that the bioactivity can be enhanced if an electronegative atom is present in these areas. It is supported that most of the studied compounds have electronegative atoms on C-3 or C-4 of the ring A. In addition, it also can be confirmed by the fact that compound 22, 24 with oxygen atom between C-3 and C-4 show higher activity than compound 21, 23 with no oxygen atom in that position, respectively.
Figure 4. CoMFA contour maps (standard deviation × coefficient) in combination with compound 7. (a) Steric contour maps: Green contours (80% contribution) refer to sterically favored regions, yellow contours (20% contribution) indicate sterically disfavored regions; and (b) Electrostatic contour maps: Blue contours (80% contribution) refer to regions where positively charged substituents are favored, Red contours (20% contribution) indicate regions where negatively charged substituents are favored.
Figure 4. CoMFA contour maps (standard deviation × coefficient) in combination with compound 7. (a) Steric contour maps: Green contours (80% contribution) refer to sterically favored regions, yellow contours (20% contribution) indicate sterically disfavored regions; and (b) Electrostatic contour maps: Blue contours (80% contribution) refer to regions where positively charged substituents are favored, Red contours (20% contribution) indicate regions where negatively charged substituents are favored.
Ijms 15 20927 g004

2.4. CoMSIA Contour Maps

The steric and HBA fields are shown in Figure 5. For each field, the favorable and disfavored contours represent 80% and 20% level contributions, respectively. Figure 5a shows the steric contour map for the CoMSIA model with the most active inhibitor compound 7, in which green regions are sterically favorable and yellow regions are sterically unfavorable. It is clear that the distribution of steric field in CoMSIA is basically consistent with CoMFA results. A little difference is that there is a yellow contour below the large green contour, which indicates a proper bulky substituent on C-6 of the ring B, is favorable to bioactivity while too large substituent in that position is unfavorable. This is supported by the case that the inhibitory activities of compounds 710 decline with the augment of the substituents on C-6 of the ring B.
Figure 5b shows the HBA contour maps, in which magenta and red contours represent areas where HBA substituents are favored and disfavored, respectively. There are there magenta contours in this figure: one is next to C-3 of the ring A, another is near C-4 of the ring A, and the third one is close to C-17 of the ring D, which indicates that the activity can be enhanced if the HBA atoms are present in these there positions. This is also consistent with the distribution of electrostatic field in CoMFA. It is supported by the case that all the studied compounds have at least one HBA atom in those positions. It also can be confirmed by the example that compound 13 (Formestane) show quite high bioactivity because of its three HBA atoms in those positions.
Figure 5. CoMSIA contour maps (standard deviation × coefficient ) in combination with compound 7. (a) Steric contour maps: Green contours refer to sterically favored regions, yellow contours indicate sterically disfavored regions; and (b) HBA contour maps: Magenta contours show regions where HBA substituents are expected; Red contours refer to areas where HBA substituents are unexpected.
Figure 5. CoMSIA contour maps (standard deviation × coefficient ) in combination with compound 7. (a) Steric contour maps: Green contours refer to sterically favored regions, yellow contours indicate sterically disfavored regions; and (b) HBA contour maps: Magenta contours show regions where HBA substituents are expected; Red contours refer to areas where HBA substituents are unexpected.
Ijms 15 20927 g005

2.5. Pharmacophore Generation

Twenty pharmacophore models were generated with default parameters after Genetic Algorithm with Linear Assignment of Hypermolecular Alignment of Database (GALAHAD) run, and their statistical values are listed in Table 4. Each of the obtained models represents a different tradeoff among the conflicting demands of maximizing steric consensus, maximizing pharmacophore consensus, and minimizing energy. All the twenty models had Pareto rank 0, which means no one model is superior to any other one. Model_03 has very high energy, which is recognized that a high-energy value is due to steric clashes [30]. Small value of energy and high values of Specificity, N_hits, Sterics and Mol_Qry are desired for the best model [31]. Therefore, Model_04 was considered to be the best model and its statistical values are shown in Table 4. This model contains two acceptor atoms and four hydrophobic centers, which is shown in Figure 6 and will be converted into a 3D query for the further virtual screening studies. This pharmacophore model is consistent with the results of CoMFA and CoMSIA. Four hydrophobic centers mean the centers of A, B, C and D rings of steroids, and two acceptor atoms next to C-3 of the ring A and C-17 of the ring D indicate that hydrogen bond acceptor atoms in these two positions can enhance the activity.
Table 4. The statistical values of pharmacophore models after GALAHAD run.
Table 4. The statistical values of pharmacophore models after GALAHAD run.
No.SpecificityN_hitsFeaturesPareto rankEnergyStericsH-bondMol_Qry
Model_015.0986054.311158.8056.9017.53
Model_023.6786020.851146.2057.0013.72
Model_035.0986012657.711151.0056.9028.55
Model_04 a5.158608.761142.6056.0041.94
Model_054.048607.431142.3055.5041.25
Model_064.048605.391147.3054.1010.74
Model_074.048607.831135.5056.6035.13
Model_084.048608.631140.0056.6013.95
Model_094.0486011.751140.4056.6017.87
Model_104.048607.231138.9056.104.67
Model_114.0486016.691146.7055.5041.25
Model_125.0986024.981149.1057.001.67
Model_134.0486012.791152.8056.600.00
Model_144.0486029.821148.8055.5035.13
Model_154.0486038.521158.6054.5011.47
Model_165.158607.251137.4054.4041.94
Model_175.0986049.491151.9056.902.81
Model_183.698607.071145.5053.0035.13
Model_193.9085028.401150.0055.9010.03
Model_204.048609.091147.4056.600.00
a The selected model (Model_04) is indicated in boldface.
Figure 6. The selected GALAHAD model includes two acceptor atoms (green) and four hydrophobic centers (cyan). The sphere sizes indicate query tolerances.
Figure 6. The selected GALAHAD model includes two acceptor atoms (green) and four hydrophobic centers (cyan). The sphere sizes indicate query tolerances.
Ijms 15 20927 g006

2.6. Virtual Screening

The obtained best GALAHAD model (Figure 6) was converted into a UNITY query, which was screened against NCI2000 database. The “flexible database search” option was implemented to perform virtual screening. Primary filters such as Lipinski’s rule of five, Van der Waals bumps, and QFIT (pharmacophoric match between query and the hit compound) were applied to reduce the dataset [32]. The screening of the pharmacophore query yielded six hit compounds that met the specific requirements. The pIC50 values of the six hit compounds were further predicted using the obtained CoMFA and CoMSIA models. Chemical structures and their predicted activity values of the hit compounds are listed in Table 5. Among the hit compounds, compounds NCI 77798 and NCI 79104 show high-predicted pIC50 values by both CoMFA and CoMSIA models (pIC50 > 6.5), which are expected to design potent and novel SAIs.
Table 5. Chemical structures and their predicted activity values of screened hit compounds.
Table 5. Chemical structures and their predicted activity values of screened hit compounds.
Hit CompoundStructurepIC50 (Predicted by CoMFA)pIC50 (Predicted by CoMSIA)
NCI 51178 Ijms 15 20927 i0286.6205.502
NCI 51181 Ijms 15 20927 i0296.4385.443
NCI 51183 Ijms 15 20927 i0306.2545.330
NCI 51184 Ijms 15 20927 i0316.1805.429
NCI 77798 Ijms 15 20927 i0326.7286.946
NCI 79104 Ijms 15 20927 i0336.5717.032

3. Experimental Section

3.1. Compounds and Biological Data

Compounds 13 [22], compounds 413 [23], compounds 1417 [24], compounds 1824 [25], compounds 2528 [26] and compounds 2966 [27] were used for this analysis, and their structures and bioactivity values are presented in Table 1. The pIC50 (−log IC50) values were used to derive 3D QSAR models. The pIC50 values of the studied compounds cover an interval of more than 3 log units. The whole data set of 66 compounds was divided into two groups in approximate ratio of 4:1; A training set with 50 compounds, a test set with 16 compounds (Table 1). The selection of the training and test sets was done manually such that low, moderate and high activity compounds were present in roughly equal proportions in both sets. The training set was used to build predictive models, while the test set was used to validate the predictive ability of the models.

3.2. Molecular Modeling and Alignment

The 3D QSAR modeling analyses, calculations and visualizations were performed using the SYBYL 7.3 molecular modeling package from Tripos Inc., St. Louis, Mo, USA, installed on Red Hat Linux workstations. Identification of the bioactive conformation is a very important step in a 3D QSAR study [33]. Among the inhibitors, the crystal structure of compound 7 bound with aromatase is available from the protein data bank (PDB code: 4GL7) [23]. Therefore, compound 7 was extracted from the complex and chosen to define the most likely binding conformation, which was modified by adding hydrogen atoms without any change of conformation and was minimized with the following steps: (i) Optimization by Steepest Descent with initial optimization of 200 simplex iterations using Tripos force field and Gasteiger-Marsili charges; (ii) Optimization by conjugate gradient; and (iii) Optimization by BFGS [34]. Three-dimensional structures of the other molecules were constructed from the compound 7. Energy minimizations of each compound were performed according to the above procedure. In the present study, compound 7 was used as a template because of its highest activity and all other compounds were aligned on the basis of the common structure shown in Figure 7. The alignment of training and test compounds is shown in Figure 8.
Figure 7. Structure of the template molecule (compound 7), common substructure is in bold.
Figure 7. Structure of the template molecule (compound 7), common substructure is in bold.
Ijms 15 20927 g008
Figure 8. Alignment of training and test set compounds on compound 7. Molecules are colored in white for common C, blue for N, red for O, green for F atoms, respectively.
Figure 8. Alignment of training and test set compounds on compound 7. Molecules are colored in white for common C, blue for N, red for O, green for F atoms, respectively.
Ijms 15 20927 g008

3.3. CoMFA and CoMSIA Models

In CoMFA, the steric fields were calculated using a Lennard-Jones potential, while the electrostatic fields were calculated using a Coulombic potential. To calculate the CoMFA fields, a 3D cubic lattice with grid spacing of 2.0 Å in X, Y and Z directions was created automatically by SYBYL. The grid pattern extended 4.0 Å units in all directions beyond the dimensions of each molecule. The steric and electrostatic probe-ligand interaction energies were calculated by using a sp3 carbon probe atom and a +1.0 charge with a distance-dependent dielectric function at each lattice point. The cut-off for energies was set to ± 30 kcal/mol and the electrostatic contributions were ignored at lattice points with maximal steric interactions [17]. In CoMSIA, five different similarity fields (steric, electrostatic, hydrophobic, HBD, and HBA) were calculated. CoMSIA models were also derived with the same lattice box and all five fields were calculated using a probe of charge +1, a radius of 1, hydrophobicity and hydrogen bonding properties of +1, and an attenuation factor of 0.3 for the Gaussian distance-dependent function [18].

3.4. Statistical Analysis

In order to derive 3D QSAR models, CoMFA and CoMSIA descriptors were used as independent variables and the pIC50 values as the dependent variables. PLS method with cross-validation (leave-one-out) was used in SYBYL to determine the optimal numbers of components by using cross-validated coefficient q2 (r2cv). After obtaining the optimal numbers of components, a PLS analysis was performed with no validation and column filtering 2.0 to generate the final model with the training set. The obtained final non-cross-validated correlation coefficient (r2ncv) is a measure of the quality of the model. The predictive capability of the 3D QSAR models was determined from the predictive correlation (r2pred). The predicted activities for the test set were obtained from the model produced by the training set.

3.5. Pharmacophore Hypothesis

The pharmacophore hypothesis was generated using GALAHAD module of SYBYL, which operates in two main stages: The ligands are aligned to each other in internal coordinate space, and then the conformations produced are aligned in Cartesian space. The feature considered in developing the pharmacophore model includes HBD atoms, HBA atoms, hydrophobic and charged centers [35,36,37]. In our study, eight compounds shown in Table 1 were selected to carry out the pharmacophore hypothesis and the genetic algorithm was used to create conformers for all molecules. The compounds selected to generate the pharmacophore hypothesis are highly active and structurally diverse.

4. Conclusions

Aromatase inhibitors have proven to be the most important targets for treatment of estrogen-dependent cancers. In order to search for more potent SAIs with lower side effects and overcome the drug resistance, 3D QSAR studies, pharmacophore modeling and virtual screening were performed. The 3D QSAR techniques, CoMFA and CoMSIA, were applied for the first time to 66 new-synthesized SAIs, and the obtained models show good statistical results, q2 = 0.636, r2ncv = 0.988, r2pred = 0.658 for CoMFA and q2 = 0.843, r2ncv = 0.989, r2pred = 0.601 for CoMSIA, which indicate that both of the models have good quality and predictive ability. CoMFA and CoMSIA contour maps show that a proper bulky group near C-6 is favorable for activity while a bulky group near C-3, C-15 and C-16 is unfavorable, and the hydrogen bond acceptor atoms on C-3, C-4 and C-17 can enhance the bioactivity. In addition, pharmacophore models were derived from eight highly active and structurally diverse compounds using GALAHAD. The obtained best pharmacophore model includes two acceptor atoms and four hydrophobic centers, which was used as a query to search NCI2000 database. Six hit compounds were obtained after the screening of the pharmacophore query, and their pIC50 values were further predicted using the obtained CoMFA and CoMSIA models, which are expected to design potent and novel SAIs. The present 3D QSAR, pharmacophore modeling, and virtual screening approach provide useful information to design and synthesize potent and novel SAIs.

Statements

As shown in a series of recent publications [38,39,40,41,42,43,44,45] in response to the call in a comprehensive review [46], user-friendly and publicly accessible web-servers represent the future direction for developing practically more useful prediction and modeling methods or demonstrating new and novel findings, we shall make efforts in our future work to provide a web-server for the approach and finding presented in this paper.

Acknowledgments

The authors gratefully acknowledge the support of this work by the Science and Technology Planning Project of Yunnan Province (No. 2011FZ096).

Author Contributions

Huiding Xie performed all experiments and data treatments; Writing was done by Huiding Xie, Kaixiong Qiu and Xiaoguang Xie; and management and submission tasks were done by Huiding Xie and Xiaoguang Xie.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Winer, E.P.; Hudis, C.; Burstein, H.J.; Wolff, A.C.; Pritchard, K.I.; Ingle, J.N.; Chlebowski, R.T.; Gelber, R.; Edge, S.B.; Gralow, J.; et al. American society of clinical oncology technology assessment on the use of aromatase inhibitors as adjuvant therapy for postmenopausal women with hormone receptor-positive breast cancer: Status report 2004. J. Clin. Oncol. 2005, 23, 619–629. [Google Scholar] [CrossRef] [PubMed]
  2. Perez, E.A. Appraising adjuvant aromatase inhibitor therapy. Oncologist 2006, 11, 1058–1069. [Google Scholar] [CrossRef] [PubMed]
  3. Jordan, V.C.; Brodie, A.M.H. Development and evolution of therapies targeted to the estrogen receptor for the treatment and prevention of breast cancer. Steroids 2007, 72, 7–25. [Google Scholar] [CrossRef] [PubMed]
  4. Brueggemeier, R.W.; Hackett, J.C.; Diaz-Cruz, E.S. Aromatase inhibitors in the treatment of breast cancer. Endocr. Rev. 2005, 26, 331–345. [Google Scholar] [CrossRef] [PubMed]
  5. Recanatini, M.; Cavalli, A.; Valenti, P. Nonsteroidal aromatase inhibitors: Recent advances. Med. Res. Rev. 2002, 22, 282–304. [Google Scholar] [CrossRef] [PubMed]
  6. Seralini, G.E.; Moslemi, S. Aromatase inhibitors: Past, present and future. Mol. Cell Endocrinol. 2001, 178, 117–131. [Google Scholar] [CrossRef] [PubMed]
  7. Hong, Y.; Rashid, R.; Chen, S. Binding features of steroidal and nonsteroidal inhibitors. Steroids 2011, 76, 802–806. [Google Scholar] [CrossRef] [PubMed]
  8. Dutta, U.; Pant, K. Aromatase inhibitors: Past, present and future in breast cancer therapy. Med. Oncol. 2008, 25, 113–124. [Google Scholar] [CrossRef] [PubMed]
  9. Du, Q.S.; Mezey, P.G.; Chou, K.C. Heuristic molecular lipophilicity potential (HMLP): A 2D QSAR study to LADH of molecular family pyrazole and derivatives. J. Comput. Chem. 2005, 26, 461–470. [Google Scholar] [CrossRef] [PubMed]
  10. Du, Q.S.; Huang, R.B.; Wei, Y.T. Multiple field three dimensional quantitative structure-activity relationship (MF-3D QSAR). J. Comput. Chem. 2008, 29, 211–219. [Google Scholar] [CrossRef] [PubMed]
  11. Du, Q.S.; Huang, R.B. Review: Recent advances in QSAR and their applications in predicting the activities of chemical molecules, peptides and proteins for drug design. Curr. Protein Pept. Sci. 2008, 9, 248–259. [Google Scholar] [CrossRef] [PubMed]
  12. Du, Q.S.; Huang, R.B.; Wei, Y.T. Fragment-based quantitative structure-activity relationship (FB QSAR) for fragment-based drug design. J. Comput. Chem. 2009, 30, 295–304. [Google Scholar] [CrossRef] [PubMed]
  13. Prado-Prado, F.J.; Gonzalez-Diaz, H.; de la Vega, O.M.; Ubeira, F.M.; Chou, K.C. Unified QSAR approach to antimicrobials. Part 3: First multi-tasking QSAR model for Input-Coded prediction, structural back-projection, and complex networks clustering of antiprotozoal compounds. Bioorg. Med. Chem. 2008, 16, 5871–5880. [Google Scholar] [CrossRef] [PubMed]
  14. Cichero, E.; Fossa, P. Docking-based 3D QSAR analyses of pyrazole derivatives as HIV-1 non-nucleoside reverse transcriptase inhibitors. J. Mol. Model. 2012, 18, 1573–1582. [Google Scholar] [CrossRef] [PubMed]
  15. Zhao, L.Z.; Liu, Y.J.; Hu, S.Y.; Zhang, H.B. 3D QSAR study of Chk1 kinase inhibitors based on docking. J. Mol. Model. 2012, 18, 3669–3694. [Google Scholar] [CrossRef] [PubMed]
  16. Chen, J.; Yu, R.; Shen, B.Z.; Xu, Y.; Liu, Y.F.; Zheng, H.; Yao, W.B. Docking-based 3D QSAR modeling of the inhibitors of IMP metallo-β-lactamase. Med. Chem. Res. 2013, 22, 1730–1739. [Google Scholar] [CrossRef]
  17. Cramer, R.D.; Patterson, D.E.; Bunce, J.D. Comparative molecular field analysis (CoMFA). 1. Effect of shape on binding of steroids to carrier proteins. J. Am. Chem. Soc. 1988, 110, 5959–5967. [Google Scholar] [CrossRef] [PubMed]
  18. Klebe, G.; Abraham, U.; Mietzner, T. Molecular similarity indices in a comparative analysis (CoMSIA) of drug molecules to correlate and predict their biological activity. J. Med. Chem. 1994, 37, 4130–4146. [Google Scholar] [CrossRef] [PubMed]
  19. Sirois, S.; Wei, D.Q.; Du, Q.S.; Chou, K.C. Virtual screening for SARS-CoV protease based on KZ7088 pharmacophore points. J. Chem. Inf. Comput. Sci. 2004, 44, 1111–1122. [Google Scholar] [CrossRef] [PubMed]
  20. Liu, L.; Ma, Y.; Wang, R.L.; Xu, W.R.; Wang, S.Q.; Chou, K.C. Find novel dual-agonist drugs for treating type 2 diabetes by means of cheminformatics. Drug Desi. Dev. Ther. 2013, 7, 279–287. [Google Scholar]
  21. Bhatt, H.G.; Patel, P.K. Pharmacophore modeling, virtual screening and 3D QSAR studies of 5-tetrahydroquinolinylidine aminoguanidine derivatives as sodium hydrogen exchanger inhibitors. Bioorg. Med. Chem. Lett. 2012, 22, 3758–3765. [Google Scholar] [CrossRef] [PubMed]
  22. Bansal, R.; Thota, S.; Karkra, N.; Minu, M.; Zimmer, C.; Hartmann, R.W. Synthesis and aromatase inhibitory activity of some new 16E-arylidenosteroids. Bioorg. Chem. 2012, 45, 36–40. [Google Scholar] [CrossRef] [PubMed]
  23. Ghosh, D.; Lo, J.; Morton, D.; Valette, D.; Xi, J.; Griswold, J.; Hubbell, S.; Egbuta, C.; Jiang, W.; An, J.; Davies, H.M. Novel aromatase inhibitors by structure-guided design. J. Med. Chem. 2012, 55, 8464–8476. [Google Scholar] [CrossRef] [PubMed]
  24. Bansal, R.; Guleria, S.; Thota, S.; Bodhankar, S.L.; Patwardhan, M.R.; Zimmer, C.; Hartmann, R.W.; Harvey, A.L. Design, synthesis and evaluation of novel 16-imidazolyl substituted steroidal derivatives possessing potent diversified pharmacological properties. Steroids 2012, 77, 621–629. [Google Scholar] [CrossRef] [PubMed]
  25. Valera, C.; Tavares da Silva, E.J.; Amaral, C.; Correia da Silva, G.; Baptista, T.; Alcaro, S.; Costa, G.; Carvalho, R.A.; Teixeira, N.A.A.; Roleira, F.M.F. New structure-activity relationships of A- and D-Ring modified steroidal aromatase inhibitors: Design, synthesis, and biochemical evaluation. J. Med. Chem. 2012, 55, 3992–4002. [Google Scholar] [CrossRef] [PubMed]
  26. Valera, C.L.; Amaral, C.; Correia-da-Silva, G.; Carvalho, R.A.; Teixeira, N.A.; Costa, S.C.; Roleira, F.M.F.; Tavares-da-Silva, E.J. Design, synthesis and biochemical studies of new 7α-allylandrostanes as aromatase inhibitors. Steroids 2013, 78, 662–669. [Google Scholar] [CrossRef] [PubMed]
  27. Abdalla, M.M.; Al-Omar, M.A.; Bhat, M.A.; Amr, A.E.; Al-Mohizea, A.M. Steroidal pyrazolines evaluated as aromatase and quinone reductase-2 inhibitors for chemoprevention of cancer. Int. J. Biol. Macromol. 2012, 50, 1127–1132. [Google Scholar] [CrossRef] [PubMed]
  28. Böhm, M.; Stürzebecher, J.; Klebe, G. Three-dimensional quantitative structure-activity relationship analyses using comparative molecular field analysis and comparative molecular similarity indices analysis to elucidate selectivity differences of inhibitors binding to trypsin, thrombin, and factor Xa. J. Med. Chem. 1999, 42, 458–477. [Google Scholar] [CrossRef] [PubMed]
  29. Norinder, U. Recent progress in CoMFA methodology and related techniques. Perspect. Drug Discov. 1998, 3, 25–39. [Google Scholar] [CrossRef]
  30. Dorfman, R.J.; Smith, K.M.; Masek, B.B.; Clark, R.D. A knowledge-based approach to generating diverse but energetically representative ensembles of ligand conformers. J. Comput. Aid. Mol. Des. 2008, 22, 681–691. [Google Scholar] [CrossRef]
  31. Caballero, J. 3D-QSAR (CoMFA and CoMSIA) and pharmacophore (GALAHAD) studies on the differential inhibition of aldose reductase by flavonoid compounds. J. Mol. Graph. Model. 2010, 29, 363–371. [Google Scholar] [CrossRef] [PubMed]
  32. Kothandan, G.; Madhavan, T.; Gadhe, C.G.; Cho, S.J. A combined 3D QSAR and pharmacophore-based virtual screening for the identification of potent p38 MAP kinase inhibitors: An in silico approach. Med. Chem. Res. 2013, 22, 1773–1787. [Google Scholar] [CrossRef]
  33. Thaimattam, R.; Daga, P.R.; Banerjee, R.; Iqbal, J. 3D QSAR studies on c-Src kinase inhibitors and docking analyses of a potent dual kinase inhibitor of c-Src and c-Abl kinases. Bioorg. Med. Chem. 2005, 13, 4704–4712. [Google Scholar] [CrossRef] [PubMed]
  34. Hu, R.J.; Barbault, F.; Delamar, M.; Zhang, R.S. Receptor- and ligand-based 3D QSAR study for a series of non-nucleoside HIV-1 reverse transcriptase inhibitors. Bioorg. Med. Chem. 2009, 17, 2400–2409. [Google Scholar] [CrossRef] [PubMed]
  35. Richmond, N.J.; Abrams, C.A.; Wolohan, P.R.N.; Abrahamian, E.; Willett, P.; Clark, R.D. GALAHAD: 1. Pharmacophore identification by hypermolecular alignment of ligands in 3D. J. Comput. Aid. Mol. Des. 2006, 20, 567–587. [Google Scholar] [CrossRef]
  36. Shepphird, J.K.; Clark, R.D. A marriage made in torsional space: Using GALAHAD models to drive pharmacophore multiplet searches. J. Comput. Aid. Mol. Des. 2006, 20, 763–771. [Google Scholar] [CrossRef]
  37. Andrade, C.H.; Salum, L.B.; Pasqualoto, K.F.M.; Ferreira, E.I.; Andricopulo, A.D. Three-dimensional quantitative structure-activity relationships for a large series of potent antitubercular agents. Lett. Drug Des. Discov. 2008, 5, 377–387. [Google Scholar] [CrossRef]
  38. Chen, W.; Feng, P.M.; Deng, E.Z.; Lin, H.; Chou, K.C. iTIS-PseTNC: A sequence-based predictor for identifying translation initiation site in human genes using pseudo trinucleotide composition. Anal. Biochem. 2014, 462, 76–83. [Google Scholar] [CrossRef] [PubMed]
  39. Ding, H.; Deng, E.Z.; Yuan, L.F.; Liu, L.; Lin, H.; Chen, W.; Chou, K.C. iCTX-Type: A sequence-based predictor for identifying the types of conotoxins in targeting ion channels. Biomed. Res. Int. 2014, 2014, 286419. [Google Scholar] [PubMed]
  40. Fan, Y.N.; Xiao, X.; Min, J.L.; Chou, K.C. iNR-Drug: Predicting the interaction of drugs with nuclear receptors in cellular networking. Int. J. Mol. Sci. 2014, 15, 4915–4937. [Google Scholar] [CrossRef] [PubMed]
  41. Guo, S.H.; Deng, E.Z.; Xu, L.Q.; Ding, H.; Lin, H.; Chen, W.; Chou, K.C. iNuc-PseKNC: A sequence-based predictor for predicting nucleosome positioning in genomes with pseudo k-tuple nucleotide composition. Bioinformatics 2014, 30, 1522–1529. [Google Scholar] [CrossRef] [PubMed]
  42. Liu, B.; Xu, J.; Lan, X.; Xu, R.; Zhou, J.; Wang, X.; Chou, K.C. iDNA-Prot|dis: Identifying DNA-binding proteins by incorporating amino acid distance-pairs and reduced alphabet profile into the general pseudo amino acid composition. PLoS One 2014, 9, e106691. [Google Scholar] [CrossRef] [PubMed]
  43. Qiu, W.R.; Xiao, X.; Lin, W.Z.; Chou, K.C. iMethyl-PseAAC: Identification of protein methylation sites via a pseudo amino acid composition approach. Biomed. Res. Int. 2014, 2014, 947416. [Google Scholar] [PubMed]
  44. Xu, Y.; Wen, X.; Shao, X.J.; Deng, N.Y.; Chou, K.C. iHyd-PseAAC: Predicting hydroxyproline and hydroxylysine in proteins by incorporating dipeptide position-specific propensity into pseudo amino acid composition. Int. J. Mol. Sci. 2014, 15, 7594–7610. [Google Scholar] [CrossRef] [PubMed]
  45. Xu, Y.; Wen, X.; Wen, L.S.; Wu, L.Y.; Deng, N.Y.; Chou, K.C. iNitro-Tyr: Prediction of nitrotyrosine sites in proteins with general pseudo amino acid composition. PLoS One 2014, 9, e105018. [Google Scholar] [CrossRef] [PubMed]
  46. Chou, K.C. Some remarks on protein attribute prediction and pseudo amino acid composition (50th Anniversary Year Review). J. Theor. Biol. 2011, 273, 236–247. [Google Scholar] [CrossRef] [PubMed]

Share and Cite

MDPI and ACS Style

Xie, H.; Qiu, K.; Xie, X. 3D QSAR Studies, Pharmacophore Modeling and Virtual Screening on a Series of Steroidal Aromatase Inhibitors. Int. J. Mol. Sci. 2014, 15, 20927-20947. https://doi.org/10.3390/ijms151120927

AMA Style

Xie H, Qiu K, Xie X. 3D QSAR Studies, Pharmacophore Modeling and Virtual Screening on a Series of Steroidal Aromatase Inhibitors. International Journal of Molecular Sciences. 2014; 15(11):20927-20947. https://doi.org/10.3390/ijms151120927

Chicago/Turabian Style

Xie, Huiding, Kaixiong Qiu, and Xiaoguang Xie. 2014. "3D QSAR Studies, Pharmacophore Modeling and Virtual Screening on a Series of Steroidal Aromatase Inhibitors" International Journal of Molecular Sciences 15, no. 11: 20927-20947. https://doi.org/10.3390/ijms151120927

APA Style

Xie, H., Qiu, K., & Xie, X. (2014). 3D QSAR Studies, Pharmacophore Modeling and Virtual Screening on a Series of Steroidal Aromatase Inhibitors. International Journal of Molecular Sciences, 15(11), 20927-20947. https://doi.org/10.3390/ijms151120927

Article Metrics

Back to TopTop