Next Article in Journal
Integrative Analysis of Single-Cell and Bulk Sequencing Data Depicting the Expression and Function of P2ry12 in Microglia Post Ischemia–Reperfusion Injury
Previous Article in Journal
Integrated QTL Mapping, Meta-Analysis, and RNA-Sequencing Reveal Candidate Genes for Maize Deep-Sowing Tolerance
Previous Article in Special Issue
Mitigation of Cardiovascular Disease and Toxicity through NRF2 Signalling
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Unraveling Structural Alerts in Marketed Drugs for Improving Adverse Outcome Pathway Framework of Drug-Induced QT Prolongation

1
College of Chemistry, Sichuan University, Chengdu 610064, China
2
Medical Big Data Center, Sichuan University, Chengdu 610064, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Int. J. Mol. Sci. 2023, 24(7), 6771; https://doi.org/10.3390/ijms24076771
Submission received: 4 March 2023 / Revised: 21 March 2023 / Accepted: 30 March 2023 / Published: 5 April 2023
(This article belongs to the Special Issue Advances in Molecular Mechanisms of Cardiotoxicity)

Abstract

:
In pharmaceutical treatment, many non-cardiac drugs carry the risk of prolonging the QT interval, which can lead to fatal cardiac complications such as torsades de points (TdP). Although the unexpected blockade of ion channels has been widely considered to be one of the main reasons for affecting the repolarization phase of the cardiac action potential and leading to QT interval prolongation, the lack of knowledge regarding chemical structures in drugs that may induce the prolongation of the QT interval remains a barrier to further understanding the underlying mechanism and developing an effective prediction strategy. In this study, we thoroughly investigated the differences in chemical structures between QT-prolonging drugs and drugs with no drug-induced QT prolongation (DIQT) concerns, based on the Drug-Induced QT Prolongation Atlas (DIQTA) dataset. Three categories of structural alerts (SAs), namely amines, ethers, and aromatic compounds, appeared in large quantities in QT-prolonging drugs, but rarely in drugs with no DIQT concerns, indicating a close association between SAs and the risk of DIQT. Moreover, using the molecular descriptors associated with these three categories of SAs as features, the structure–activity relationship (SAR) model for predicting the high risk of inducing QT interval prolongation of marketed drugs achieved recall rates of 72.5% and 80.0% for the DIQTA dataset and the FDA Adverse Event Reporting System (FAERS) dataset, respectively. Our findings may promote a better understanding of the mechanism of DIQT and facilitate research on cardiac adverse drug reactions in drug development.

1. Introduction

Cardiotoxicity is one of the major concerns in clinical pharmaceutical treatments [1,2,3]. Apart from cardiac drugs, a number of non-cardiac drugs may prolong the QT interval, which increases the likelihood of severe cardiac complications such as torsades de points (TdP) [4]. Understanding the mechanisms underlying the drug-induced prolongation of the QT interval is crucial, not only for preventing cardiovascular adverse events in clinical practice but also for developing new drugs in the pharmaceutical industry [5,6,7,8,9,10].
Monitoring the increase in electrocardiogram (ECG) data, along with other patient information such as gender, disease states, and concomitant use of other QT-prolonging drugs, is a common approach in clinical settings to assess the potential risk that patients will acquire a prolonged QT interval and develop TdP [2,11]. However, this approach is often hindered by individual differences [12,13]. In preclinical settings, the well-known mechanism of drug-induced QT prolongation (DIQT) is the inhibition activity of the repolarizing potassium current IKr that is generated by the expression of the human ether-a-gogo-related gene (hERG) [14,15,16,17,18]. The drug-mediated blockade of the hGRG gene channel will cause the inhibition of rapid delayed rectifying potassium current, resulting in a prolonged QT interval and, in severe cases, developing into TdP. Based on this mechanism, the comprehensive in vitro Proarrhythmia assay (CiPA) initiative recommends screening chemical agents that may potentially prolong the QT interval by examining their effect on individual ion currents in vitro and predicting the drug effects on repolarization and proarrhythmia risk by using in silico methods [19]. Over the past decade, various machine learning methods, such as support vector machines, random forests, and Bayesian networks, have been applied to evaluate the risk of inducing cardiotoxicity of compounds by predicting the blocking effect of compounds on hERG gene channels [20,21,22,23,24,25,26,27,28]. Additionally, deep learning algorithms have been proposed to evaluate the risk of cardiotoxicity caused by hERG gene blockers and to assess the cardiotoxicity of FDA-approved small molecular drugs in the Drugbank [29,30]. Despite the relatively high predictive performance achieved by these studies, few of them provide deep insight into the chemical fragments, also known as structural alerts (SAs), in drugs that may contribute to cardiotoxicity. It is still a challenge to further study the underlying mechanism and develop an effective strategy for drug safety evaluation.
Therefore, in this study, we aim to identify the SAs in marketed drugs that prolong the QT interval based on data from the Drug-Induced QT Prolongation Atlas (DIQTA), which we developed in our previous study [31]. In DIQTA, the drugs were well stratified into different risk levels based on their potential to cause QT interval prolongation, as determined by FDA-approved drug labeling information. We thoroughly investigated the differences in chemical structures between QT-prolonging drugs and drugs with no DIQT concerns, and identified a set of SAs closely related to the high risk of prolongation of the QT interval. The structure–activity relationship (SAR) model also performed well in identifying marketed drugs with a high risk of DIQT when using the SAs as features. Our findings will promote the development of adverse outcome pathway frameworks for the drug-induced prolongation of the QT interval and facilitate research on cardiac adverse drug reactions in drug development.

2. Results

2.1. Structural Differences between QT and Non-QT-Prolonging Drugs

We identified 24 chemical fragments (SAs) with large differences in chemical structures between QT-prolonging drugs and no-DIQT-concern drugs, as shown in Table 1. These 24 SAs can be broadly categorized into three groups: amines, ethers, and aromatic compounds. Amines, in particular, were found to be more prevalent in QT-prolonging drugs, with a proportion at least 30% higher than that of the no-DIQT-concern drugs. Among the amines, tertiary amines (ID: 1) had the highest difference in proportion between the two categories, with a proportion of 61.1% in the QT-prolonging drugs and only 12.6% in the no-DIQT-concern drugs. For tertiary aliphatic amines (ID: 3), they appeared in over 50% of QT-prolonging drugs, but in less than 10% of no-DIQT-concern drugs. These results suggest that amines, particularly tertiary amines, may be closely associated with drug-induced prolongation of the QT interval. Similarly, for aromatic compounds, Ethers (ID: 9) and Alkylarylethers (ID: 12) appeared in 47.2% and 34.0% of QT drugs, respectively, while their proportions in the no-DIQT-concern drugs were only 17.9% and 11.6%, respectively. Aryl halides (ID: 15) appeared in 37.5% and 13.7% of the QT-prolonging drugs and no-DIQT-concern drugs, respectively. In addition, Sp3-hybridized carbon atoms (2) (ID: 2) appeared in a higher proportion of QT-prolonging drugs, reaching 81.3%, while their proportion in the no-DIQT-concern drugs was 37.9%. Figure 1 illustrates the distribution of tertiary amines, alkylarylethers, and aryl halides among different therapeutic categories of QT-prolonging drugs. All three of these SAs were predominantly found in drugs classified under the therapeutic category of the nervous system (N), followed by those in the therapeutic categories of anti-infectives for systemic use (J) and antineoplastic and immunomodulating agents (L).

2.2. SAR Model Performance on DIQTA Dataset

Based on the DIQTA dataset, we used traditional machine learning methods, including logistic regression (LR), random forest (RF), support vector machine (SVM), and XGBoost, to develop the SAR model. The model was built using five-fold cross-validation, and the modeling process was repeated 1000 times to calculate the mean and variance of each prediction metric. Figure 2 shows the distribution of Matthews correlation coefficients (MCCs) and recall rates achieved by each algorithm. As shown in the figure, SVM achieved the highest MCC value and recall rate among all of the models. The detailed results are listed in Table 2, including accuracy, precision, recall score, MCC, f1 score, balanced accuracy score (BACC), the area under the ROC curve (AUC), average precision score (AP), sensitivity (SE), and specificity (SP). From the table, it can be seen that based on chemical structure information, each model achieved a predictive accuracy of more than 75% in predicting drugs with a potential risk of QT interval prolongation, indicating that each model can effectively predict the potential risk of the QT prolongation of drugs based on chemical structure information. Among them, SVM achieved the highest accuracy of 80.6%. Meanwhile, the recall rate for positive samples (drugs with risk of QT interval prolongation) was also the highest, reaching 87.0%. In addition, we also used SVM with 1000 iterations of five-fold cross-validation to build a model and predict the permutation data. As shown in Figure 2 and Table 2, for randomly labeled datasets, the prediction results of SVM were similar to the random results.
Next, we used SVM and 24 SAs with more than 20% differences between QT-prolonging drugs and no-DIQT-concern drugs as features to identify drugs with the potential risk of inducing QT interval prolongation. The model was still constructed using 1000 iterations of five-fold cross-validation. We extracted the top 50 molecular descriptors ranked by their feature importance scores in descending order in SVM modeling (see Supplementary Table S1), and selected 10 of these molecular descriptors as features that were closely associated with the 24 SAs (see Table 3). Subsequently, the ten features were used to construct a SAR model to classify QT-prolonging drugs and no-DIQT-concern drugs. Figure 3 showed the prediction results using all features and only the ten features related to SAs. As can be seen from the figure, although the predictive accuracy, recall rate, and MCC of the model decreased after reducing the number of features, they were still 69.7%, 72.5%, and 0.376, respectively, indicating that using SAs as structural features can effectively identify drugs with a potential risk of QT interval prolongation.

2.3. SAR Model Performance on FAERS Dataset

We extracted 647 marketed drugs from the FAERS database that had at least one case report of inducing QT interval prolongation. We used SVM models established in the previous step to predict the potential risk of these drugs causing QT interval prolongation. Since the drugs with no DIQT concerns were not included in this dataset, we calculated the recall rate of the models. We sorted the 647 drugs by their reporting odds ratio (ROR) in descending order and took the top N drugs to construct a data subset. We used the models to predict the potential risk of QT interval prolongation for the drugs in the data subset, with N starting at 50 and increasing by 50 drugs each time until all drugs were included. This procedure allowed us to investigate the model performance in identifying drugs with different risks of QT interval prolongation.
Figure 4 showed the predictive performance of the models using all features and the models using ten SA-related features. Similarly to the results obtained via the DIQTA dataset, the predictive performance of the model with all features was higher than that of the ten-feature model. Both models achieved an 80.0% recall rate for identifying high-risk QT-prolonging drugs (the top 50 drugs with high ROR values). As more low-risk QT-prolonging drugs were included in the data subset, the recall rate of the full-feature model and the ten-feature model decreased from 80.0% to 66.3% and 59.3%, respectively, indicating that the models performed better in identifying high-risk QT-prolonging drugs compared to identifying low-risk QT-prolonging drugs. The top ten QT-prolonging drugs with the highest ROR values are listed in Table 4, of which eight drugs were correctly predicted as being positive. However, due to the limited number of features used, the ten-feature model could not effectively identify low-risk QT-prolonging drugs.
To further validate the predictive performance of the SVM models, we selected the top 95 QT drugs with the highest ROR values as positive samples from the FAERS database and 95 drugs that had not been reported to induce QT interval prolongation as negative samples from the DIQTA database. The therapeutic categories of the 95 no-DIQT-concern drugs were selected to be as consistent as possible with those of the 95 QT-prolonging drugs. We used this dataset as an independent test set to further validate the full-feature model and the ten-feature model. The results showed that the predictive accuracy of the full-feature model and the ten-feature model were 85.3% and 73.3%, and the recall rates were 81.1% and 77.9%, respectively, indicating that both models can effectively predict the potential risk of prolonging the QT interval for marketed drugs.

3. Discussion

Cardiotoxicity is a major concern in clinical medications, and drug-induced QT interval prolongation is a common occurrence in many therapeutic treatments, which may develop into severe arrhythmias, such as TdP. It is essential to establish a reliable strategy for assessing the potential risk of inducing QT interval prolongation of drugs. FDA-approved drug labeling information is determined after reviewing data from clinical trials and post-marketing surveillance, and can reflect the severity of the risk of QT interval prolongation induced by marketed drugs. In previous studies, we stratified QT-prolonging drugs and no-DIQT-concern drugs based on FDA-approved drug labeling information and constructed the DIQTA database. In this study, we compared the differences in chemical structures between QT-prolonging drugs and no-DIQT-concern drugs in the DIQTA database and identified three categories of SAs, namely amines, ethers, and aromatic compounds, related to the high risk of QT interval prolongation. The proportion of these SAs in the structure of QT-prolonging drugs is more than 20% higher than that in the no-DIQT-concern drugs, with the proportion of amines in QT-prolonging drugs being at least 30% higher than that in non-QT-prolonging drugs (Table 1). In the DIQTA database, more than 80% (79/96) of the most-DIQT-concern drugs (drugs that might cause fatal or life-threatening arrhythmia) contained these SAs, indicating that these three categories of SAs are closely related to the occurrence of drug-induced QT interval prolongation. In previous studies, it had been reported that the presence of tertiary amines, furan rings, and acetylene functional groups may inhibit the cytochrome P450 (CYP) 3A4 pathway, leading to blocked drug metabolism and accumulation in the body, ultimately causing cardiotoxicity [3,19,32,33,34]. For instance, terfenadine, a drug used to treat allergy symptoms, may cause serious side effects including delayed cardiac repolarization and ventricular tachycardia due to the accumulation of terfenadine in the body when CYP3A4 activity was inhibited [35].
The 24 SAs related to high-risk QT interval prolongation could be useful in gaining a better understanding of the mechanisms of drug-induced QT interval prolongation. The adverse outcome pathway (AOP) for DIQT provides a conceptual framework that connects the molecular initiating events (MIEs) to adverse outcomes (AOs) through a series of key events (KEs). The compounds that contain the three categories of substances of concern identified in our study could serve as chemical initiators, which trigger downstream KEs, such as ion channel inhibition, leading to QT prolongation. The identification of chemical initiators may guide the design of more effective in vitro and in vivo assays for predicting DIQT. Moreover, our findings could be utilized to develop risk assessment strategies for drug combinations that have the potential to cause QT interval prolongation.
In subsequent analysis, we used ten structural descriptors related to these 24 SAs as features to establish an SVM model and predict the potential risk of drugs inducing the prolongation of the QT interval. For the DIQTA dataset, the ten-feature model had a prediction accuracy and recall rate of 69.7% and 72.5%, respectively, which were lower than the prediction accuracy and recall rate (80.6% and 87.0%, respectively) obtained by the full-feature model. However, for the prediction of drugs with the potentially high risk of QT prolongation in the FAERS dataset (the top 50 drugs with high ROR values), both models achieved a recall rate of 80.0%, indicating that the ten-feature model has good performance in identifying drugs with a high risk of QT prolongation. Finally, for an independent test set containing 95 QT-prolonging drugs and 95 no-DIQT-concern drugs, the ten-feature model achieved a prediction accuracy of 73.7% and a recall rate of 77.9%. These results indicated that the 24 SAs revealed in this study might facilitate artificial-intelligence-based modeling for cardiotoxicity.
Additionally, there are some caveats that warrant further discussion. First, in the DIQTA database, QT-prolonging drugs and the drugs with no DIQT concerns were stratified based on FDA-approved drug labeling. If drug labeling from other countries, such as European Union (EU) countries, were used, there may be differences in the stratification of QT-prolonging drugs, which could ultimately lead to differences in SA identification. Secondly, our study instead revealed an association of a causality between the three categories of SAs and a high risk of drug-induced prolongation of the QT interval. The presence of these three categories of SAs in drug structures does not necessarily mean that the drugs will definitely cause QT interval prolongation or other cardiac toxicity. It only indicates that the drugs may have a higher likelihood of inducing QT interval prolongation and should be further evaluated for potential safety concerns. Finally, DIQT is a complex and multifactorial phenomenon. Other patient risk factors such as age, sex, and electrolyte imbalances can still cause QT interval prolongation during pharmaceutical treatment. Even though the chemical structure of a drug is one of the main factors that induces QT interval prolongation, it is still a challenge to establish a reliable machine learning model that predicts the risk of DIQT only based on chemical structures. It can be seen in Figure 4 that the recall rate for predicting low-risk QT-prolonging drugs achieved by the ten-feature model was only 59.3%. Therefore, it may be more effective to establish a model in future studies that combines multiple features, including clinical factors, to predict the risk of DIQT.

4. Materials and Methods

4.1. Study Design

In this study, we compared the differences in chemical structures between QT-prolonging drugs and drugs with no DIQT concerns, and identified 24 SAs that were closely associated with QT interval prolongation. Next, we utilized the Mold2 package to convert the chemical structures of the drugs into molecular descriptors, and established an SAR model based on the molecular descriptors related to the 24 SAs as features to predict the potential risk of prolonging the QT interval for drugs. Finally, the model was validated using the FAERS dataset. Figure 5 depicts the process in detail.

4.2. DIQTA Dataset

The Drug-Induced QT Prolongation Atlas (DIQTA, https://www.adratlas.com/DIQTA/, accessed on 22 February 2023) [31] stratified marketed drugs into different categories based on the severity of their effects on QT interval, including most-DIQT-concern drugs, moderate-concern drugs, ambiguous drugs, and no-DIQT-concern drugs. For structure comparison and model construction, we collected 144 drugs from the most-DIQT-concern and moderate-DIQT-concern categories as positive samples and 95 drugs from the no-DIQT-concern drugs as negative samples for this study.

4.3. FAERS Dataset

The FDA Adverse Event Reporting System (FAERS) primarily contains post-marketing surveillance data related to adverse events and medication errors associated with FDA-regulated drugs [36]. We collected all reported cases from 2004 to 2021 for this study, resulting in a total of 38,405,679 drug-event pairs. Please refer to our previous studies for detailed information on the generation procedures [31,37]. We extracted 903 drugs reported to have induced QT interval prolongation and identified 647 drugs with available chemical structures for subsequent analysis.
To assess the severity of drug-induced QT interval prolongation for each drug, we calculated the reporting odds ratio (ROR) using the following formula:
ROR = a c b d
where a, b, c, and d are the numbers of cases defined in the Table 5.
Based on their ROR values, the 647 drugs were ranked in ascending order and used for SAR model validation.

4.4. Identification of Structural Alerts

After uploading SDF files for both QT-prolonging drugs and no-DIQT-concern drugs to the ToxAlert platform in the Online Chemical Database (OCHEM, https://ochem.eu/home/show.do, accessed on 22 February 2023) [38,39], we separately obtained structural alerts (SAs) present in each category of drugs. In total, we identified 602 SAs from the structures of QT-prolonging drugs, along with a list of drugs that contained each SA. We also obtained 669 SAs from the no-DIQT-concern drugs and their corresponding drug lists. By dividing the number of drugs in a specific SA’s list by the total number of drugs in the category (QT or no-DIQT-concern drugs), we were able to calculate the proportion of each SA in each of the two categories. We retained an SA if its proportion in QT-prolonging drugs was 20% higher than its proportion in non-QT-prolonging drugs. As a result, we kept 24 SAs for further analysis.

4.5. Calculation and Selection of Molecular Descriptors

The calculation of molecular descriptors was conducted using the Mold2 package (https://www.fda.gov/science-research/bioinformatics-tools/mold2, accessed on 22 February 2023), which was developed by Dr. Tong at the National Center for Toxicological Research (NCTR), US FDA [40,41,42,43]. The package provided a collection of 777 1D and 2D molecular descriptors, and we utilized it to calculate the molecular descriptors for each drug in the DIQTA and FAERS datasets. We applied a variance threshold of 0.001 to remove descriptors with low variability among the drugs, resulting in a final set of 603 descriptors for SAR modeling. The input data were uploaded to GitHub (https://github.com/LiSH7450/DIQT_model, accessed on 20 March 2023).

4.6. SAR Model Construction

We used the support vector machine (SVM) algorithm [44] to predict the risk of drug-induced QT prolongation based on the chemical structure of the drug. Based on the DIQTA dataset, the SVM model with an RBF kernel was constructed by using 603 molecular descriptors as features and five-fold cross-validation. The regularization parameter (c) and the kernel width (γ) were optimized using a grid search method. Drugs with predicted probabilities greater than 0.5 were considered to have a higher risk of causing QT interval prolongation. To assess the robustness of the model, we conducted 1000 iterations of the modeling process. To evaluate the performance of the model, we compared the performance with other machine learning models, namely random forest (RF) [45], logistic regression (LR) [46], and extreme gradient boosting (XGBoost) [47]. The parameters of these models were also optimized using a grid search method, and all models were constructed using 1000 iterations of five-fold cross-validation. Additionally, we created a permutation dataset as a negative control for SVM model construction by randomly permuting the labels of samples.

4.7. Feature Selection

We evaluated the importance of each feature in an SVM model construction procedure using the Shapley additive explanations (SHAP) algorithm, which was introduced by Lundberg and Lee [48]. This algorithm was proposed to explain a model’s output by assigning a score to each input feature, indicating how much that feature contributes to the model’s prediction for a particular instance. We extracted the top 50 features that contributed the most to the prediction results, and 10 out of the 50 features were found to be closely associated with the 24 SAs. These ten features were used to construct the SVM model. A ten-feature model was then constructed using the ten identified molecular descriptors as features and 1000 iterations of five-fold cross-validation.

4.8. Evaluation of Model Performance

To evaluate the performance of the model, we used nine performance metrics in this study, including accuracy, precision, recall score, Matthews correlation coefficient (MCC), f1 score, balanced accuracy score (BACC), the area under the ROC curve (AUC), average precision score (AP), sensitivity (SE), and specificity (SP).
accuracy = TP + TN TP + TN + FP + FN
recallrate = TP TP + FN
precision = TP TP + FP
MCC = TP × TN FP × FN TP + FP TP + FN TN + FP TN + FN
F 1 score = 2 × precision × recall   rate precision + recall   rate
BACC = TPR + TNR 2
AUC = x = 0 1 TPR FPR 1 x dx
AP = + precision x dP Y x
SE = TP TP + FN
SP = TN TN + FP
TP, TN, FP, and FN stand for true positive, true negative, false positive, and false negative, respectively. To evaluate the performance of the model, we conducted 5-fold cross-validation and averaged the results over 1000 iterations.

5. Conclusions

In this study, we discovered a total of 24 SAs that are associated with high-risk QT-prolonging drugs. The compounds containing these SAs can be used as molecular initiating events to enhance the adverse outcome pathway related to QT interval prolongation. By using these SAs as features, we can establish an SVM to effectively predict high-risk QT-prolonging drugs. Our findings are expected to significantly improve research on cardiac adverse drug reactions in drug development and reduce drug attrition due to drug-induced prolongation of the QT interval.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms24076771/s1.

Author Contributions

Conceptualization, Z.W.; methodology, Z.W., S.L. and Y.H.; formal analysis, W.L. and S.L.; investigation, W.L. and S.L.; data curation, S.L.; writing—original draft preparation, Z.W. and S.L.; writing—review and editing, W.L., J.L., M.L. and Z.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant number 21575094. The APC was funded by 21575094.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are openly available in DIQTA at https://doi.org/10.1016/j.drudis.2021.10.009, accessed on 22 February 2023, reference number [31].

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Riad, F.S.; Davis, A.M.; Moranville, M.P.; Beshai, J.F. Drug-Induced QTc Prolongation. Am. J. Cardiol. 2017, 119, 280–283. [Google Scholar] [CrossRef] [PubMed]
  2. Isbister, G.K.; Page, C.B. Drug induced QT prolongation: The measurement and assessment of the QT interval in clinical practice. Br. J. Clin. Pharmacol. 2013, 76, 48–57. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Roden, D.M. Drug therapy: Drug-induced prolongation of the QT interval. N. Engl. J. Med. 2004, 350, 1013–1022. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Mirams, G.R.; Davies, M.R.; Cui, Y.; Kohl, P.; Noble, D. Application of cardiac electrophysiology simulations to pro-arrhythmic safety testing. Br. J. Pharmacol. 2012, 167, 932–945. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Smilowitz, N.R.; Berger, J.S. Perioperative Cardiovascular Risk Assessment and Management for Noncardiac Surgery A Review. JAMA-J. Am. Med. Assoc. 2020, 324, 279–290. [Google Scholar] [CrossRef]
  6. Jabbar, A.; Pingitore, A.; Pearce, S.; Zaman, A.; Iervasi, G.; Razvi, S. Thyroid hormones and cardiovascular disease. Nat. Rev. Cardiol. 2017, 14, 39–55. [Google Scholar] [CrossRef]
  7. Curigliano, G.; Cardinale, D.; Dent, S.; Criscitiello, C.; Aseyev, O.; Lenihan, D.; Cipolla, C.M. Cardiotoxicity of Anticancer Treatments: Epidemiology, Detection, and Management. CA-Cancer J. Clin. 2016, 66, 310–325. [Google Scholar] [CrossRef] [Green Version]
  8. Kavousi, M.; Leening, M.; Nanchen, D.; Greenland, P.; Graham, I.M.; Steyerberg, E.W.; Ikram, M.A.; Stricker, B.H.; Hofman, A.; Franco, O.H. Comparison of Application of the ACC/AHA Guidelines, Adult Treatment Panel III Guidelines, and European Society of Cardiology Guidelines for Cardiovascular Disease Prevention in a European Cohort. JAMA-J. Am. Med. Assoc. 2014, 311, 1416–1423. [Google Scholar] [CrossRef]
  9. Carmeliet, P.; Jain, R.K. Molecular mechanisms and clinical applications of angiogenesis. Nature 2011, 473, 298–307. [Google Scholar] [CrossRef] [Green Version]
  10. Fermini, B.; Fossa, A.A. The impact of drug-induced qt interval prolongation on drug discovery and development. Nat. Rev. Drug Discov. 2003, 2, 439–447. [Google Scholar] [CrossRef]
  11. Thomas, S.H.L.; Behr, E.R. Pharmacological treatment of acquired QT prolongation and torsades de pointes. Br. J. Clin. Pharmacol. 2016, 81, 420–427. [Google Scholar] [CrossRef] [PubMed]
  12. Drew, B.J.; Ackerman, M.J.; Funk, M.; Gibler, W.B.; Kligfield, P.; Menon, V.; Philippides, G.J.; Roden, D.M.; Zareba, W.; Council, C.C.; et al. Prevention of Torsade de Pointes in Hospital Settings A Scientific Statement From the American Heart Association and the American College of Cardiology Foundation Endorsed by the American Association of Critical-Care Nurses and the International Society for Computerized Electrocardiology. J. Am. Coll. Cardiol. 2010, 55, 934–947. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Bednar, M.M.; Harrigan, E.P.; Anziano, R.J.; Camm, A.J.; Ruskin, J.N. The QT interval. Prog. Cardiovasc. Dis. 2001, 43, 21469. [Google Scholar] [CrossRef]
  14. Baracaldo-Santamaria, D.; Llinas-Caballero, K.; Miguel Corso-Ramirez, J.; Martin Restrepo, C.; Alberto Dominguez-Dominguez, C.; Janeth Fonseca-Mendoza, D.; Alberto Calderon-Ospina, C. Genetic and Molecular Aspects of Drug-Induced QT Interval Prolongation. Int. J. Mol. Sci. 2021, 22, 8090. [Google Scholar] [CrossRef]
  15. Lu, Z.; Wu, C.C.; Jiang, Y.; Ballou, L.M.; Clausen, C.; Cohen, I.S.; Lin, R.Z. Suppression of Phosphoinositide 3-Kinase Signaling and Alteration of Multiple Ion Currents in Drug-Induced Long QT Syndrome. Sci. Transl. Med. 2012, 4, 3623. [Google Scholar] [CrossRef] [Green Version]
  16. Wilke, R.A.; Lin, D.W.; Roden, D.M.; Watkins, P.B.; Flockhart, D.; Zineh, I.; Giacomini, K.M.; Krauss, R.M. Identifying genetic risk factors for serious adverse drug reactions: Current progress and challenges. Nat. Rev. Drug Discov. 2007, 6, 904–916. [Google Scholar] [CrossRef]
  17. Friedrichs, G.S.; Patmore, L.; Bass, A. Non-clinical evaluation of ventricular repolarization (ICH S7B): Results of an interim survey of international pharmaceutical companies. J. Pharmacol. Tox Met. 2005, 52, 6–11. [Google Scholar] [CrossRef] [PubMed]
  18. Bennett, P.B.; Yazawa, K.; Makita, N.; George, A.L. Molecular Mechanism for an Inherited Cardiac-Arrhythmia. Nature 1995, 376, 683–685. [Google Scholar] [CrossRef]
  19. Colatsky, T.; Fermini, B.; Gintant, G.; Pierson, J.B.; Sager, P.; Sekino, Y.; Strauss, D.G.; Stockbridge, N. The Comprehensive in Vitro Proarrhythmia Assay (CiPA) initiative Update on progress. J. Pharmacol. Tox Met. 2016, 81, 15–20. [Google Scholar] [CrossRef] [Green Version]
  20. Choi, K.; Balupuri, A.; Kang, N.S. The Study on the hERG Blocker Prediction Using Chemical Fingerprint Analysis. Molecules 2020, 25, 2615. [Google Scholar] [CrossRef]
  21. Cai, C.; Fang, J.; Guo, P.; Wang, Q.; Hong, H.; Moslehi, J.; Cheng, F. In Silico Pharmacoepidemiologic Evaluation of Drug-Induced Cardiovascular Complications Using Combined Classifiers. J. Chem. Inf. Model 2018, 58, 943–956. [Google Scholar] [CrossRef]
  22. Chavan, S.; Abdelaziz, A.; Wiklander, J.G.; Nicholls, I.A. A k-nearest neighbor classification of hERG K+ channel blockers. J. Comput. Aid Mol. Des. 2016, 30, 229–236. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Liu, L.; Lu, J.; Lu, Y.; Zheng, M.; Luo, X.; Zhu, W.; Jiang, H.; Chen, K. Novel Bayesian classification models for predicting compounds blocking hERG potassium channels. Acta Pharmacol. Sin. 2014, 35, 1093–1102. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Robinson, R.L.M.; Glen, R.C.; Mitchell, J.B.O. Development and Comparison of hERG Blocker Classifiers: Assessment on Different Datasets Yields Markedly Different Results. Mol. Inform. 2011, 30, 443–458. [Google Scholar] [CrossRef]
  25. Jia, L.; Sun, H. Support vector machines classification of hERG liabilities based on atom types. Bioorgan. Med. Chem. 2008, 16, 6252–6260. [Google Scholar] [CrossRef]
  26. Song, M.H.; Clark, M. Development and evaluation of an in silico model for hERG binding. J. Chem. Inf. Model 2006, 46, 392–400. [Google Scholar] [CrossRef]
  27. Sun, H. An accurate and interpretable Bayesian classification model for prediction of hERG liability. ChemMedChem 2006, 1, 315–322. [Google Scholar] [CrossRef] [PubMed]
  28. Yap, C.W.; Cai, C.Z.; Xue, Y.; Chen, Y.Z. Prediction of torsade-causing potential of drugs by support vector machine approach. Toxicol. Sci. 2004, 79, 170–177. [Google Scholar] [CrossRef] [PubMed]
  29. Karim, A.; Lee, M.; Balle, T.; Sattar, A. CardioTox net: A robust predictor for hERG channel blockade based on deep learning meta-feature ensembles. J. Cheminformatics 2021, 13, 60. [Google Scholar] [CrossRef]
  30. Cai, C.; Guo, P.; Zhou, Y.; Zhou, J.; Wang, Q.; Zhang, F.; Fang, J.; Cheng, F. Deep Learning-Based Prediction of Drug-Induced Cardiotoxicity. J. Chem. Inf. Model 2019, 59, 1073–1084. [Google Scholar] [CrossRef]
  31. Li, S.; Xu, Z.; Guo, M.; Li, M.; Wen, Z. Drug-induced QT Prolongation Atlas (DIQTA) for enhancing cardiotoxicity management. Drug Discov. Today 2022, 27, 831–837. [Google Scholar] [CrossRef]
  32. Kannankeril, P.; Roden, D.M.; Darbar, D. Drug-Induced Long QT Syndrome. Pharmacol. Rev. 2010, 62, 760–781. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Kannankeril, P.J.; Roden, D.M. Drug-induced long QT and torsade de pointes: Recent advances. Curr. Opin. Cardiol. 2007, 22, 39–43. [Google Scholar] [CrossRef] [PubMed]
  34. Zhou, S.F.; Chan, S.Y.; Goh, B.C.; Chan, E.; Duan, W.; Huang, M.; McLeod, H.L. Mechanism-based inhibition of cytochrome P450 3A4 by therapeutic drugs. Clin. Pharmacokinet. 2005, 44, 279–304. [Google Scholar] [CrossRef]
  35. Li, M.; Ramos, L.G. Drug-Induced QT Prolongation And Torsades de Pointes. Pharm. Ther. A Peer-Rev. J. Formul. Manag. 2017, 42, 473–477. [Google Scholar]
  36. Vilar, S.; Harpaz, R.; Chase, H.S.; Costanzi, S.; Rabadan, R.; Friedman, C. Facilitating adverse drug event detection in pharmacovigilance databases using molecular structure similarity: Application to rhabdomyolysis. J. Am. Med. Inform. Assn. 2011, 18, I73–I80. [Google Scholar] [CrossRef] [Green Version]
  37. Wen, Z.; Liang, Y.; Hao, Y.; Delavan, B.; Huang, R.; Mikailov, M.; Tong, W.; Li, M.; Liu, Z. Drug-Induced Rhabdomyolysis Atlas (DIRA) for idiosyncratic adverse drug reaction management. Drug Discov. Today 2019, 24, 9–15. [Google Scholar] [CrossRef]
  38. Sushko, I.; Salmina, E.; Potemkin, V.A.; Poda, G.; Tetko, I.V. ToxAlerts: A Web Server of Structural Alerts for Toxfic Chemicals and Compounds with Potential Adverse Reactions. J. Chem. Inf. Model 2012, 52, 2310–2316. [Google Scholar] [CrossRef]
  39. Sushko, I.; Novotarskyi, S.; Koerner, R.; Pandey, A.K.; Rupp, M.; Teetz, W.; Brandmaier, S.; Abdelaziz, A.; Prokopenko, V.V.; Tanchuk, V.Y.; et al. Online chemical modeling environment (OCHEM): Web platform for data storage, model development and publishing of chemical information. J. Comput. Aid Mol. Des. 2011, 25, 533–554. [Google Scholar] [CrossRef] [Green Version]
  40. Hong, H.; Harvey, B.G.; Palmese, G.R.; Stanzione, J.F.I.; Ng, H.W.; Sakkiah, S.; Tong, W.; Sadler, J.M. Experimental Data Extraction and in Silico Prediction of the Estrogenic Activity of Renewable Replacements for Bisphenol A. Int. J. Environ. Res. Public Health 2016, 13, 705. [Google Scholar] [CrossRef]
  41. Hong, H.; Rua, D.; Sakkiah, S.; Selvaraj, C.; Ge, W.; Tong, W. Consensus Modeling for Prediction of Estrogenic Activity of Ingredients Commonly Used in Sunscreen Products. Int. J. Environ. Res. Public Health 2016, 13, 958. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  42. Hong, H.; Xie, Q.; Ge, W.; Qian, F.; Fang, H.; Shi, L.; Su, Z.; Perkins, R.; Tong, W. Mold(2), molecular descriptors from 2D structures for chemoinformatics and toxicoinformatics. J. Chem. Inf. Model 2008, 48, 1337–1344. [Google Scholar] [CrossRef] [PubMed]
  43. Hong, H.X.; Xin, X.Q. Essesa, an expert system for structure elucidation from spectral-analysis: 2. Novel algorithm of perception of the linear independent smallest set of smallest rings. Anal. Chim. Acta 1992, 262, 179–191. [Google Scholar] [CrossRef]
  44. Cortes, C.; Vapnik, V. Support-vector networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
  45. Zhang, L.; Ai, H.; Chen, W.; Yin, Z.; Hu, H.; Zhu, J.; Zhao, J.; Zhao, Q.; Liu, H. CarcinoPred-EL: Novel models for predicting the carcinogenicity of chemicals using molecular fingerprints and ensemble learning methods. Sci. Rep. 2017, 7, 2118. [Google Scholar] [CrossRef] [Green Version]
  46. Park, H. An Introduction to Logistic Regression: From Basic Concepts to Interpretation with Particular Attention to Nursing Domain. J. Korean Acad. Nurs. 2013, 43, 154–164. [Google Scholar] [CrossRef] [Green Version]
  47. Chen, T.; Guestrin, C.; Assoc, C.M. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
  48. Lundberg, S.M.; Lee, S.-I. A Unified Approach to Interpreting Model Predictions. In Proceedings of the Advances in Neural Information Processing Systems; Guyon, I., Luxburg, U., Von Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., Garnett, R., Eds.; Curran Associates, Inc.: Red Hook, NY, USA, 2017; Volume 30. [Google Scholar]
Figure 1. (a) Distribution of drugs containing tertiary amines among different therapeutic categories of QT-prolonging drugs; (b) distribution of drugs containing alkylarylethers among different therapeutic categories of QT-prolonging drugs; (c) distribution of drugs containing aryl halides among different therapeutic categories of QT-prolonging drugs.
Figure 1. (a) Distribution of drugs containing tertiary amines among different therapeutic categories of QT-prolonging drugs; (b) distribution of drugs containing alkylarylethers among different therapeutic categories of QT-prolonging drugs; (c) distribution of drugs containing aryl halides among different therapeutic categories of QT-prolonging drugs.
Ijms 24 06771 g001
Figure 2. MCCs and recall rates achieved by SMV, LR, RF, and XGBoost with 1000 iterations of five-fold cross-validation and by SVM with permutation dataset. (a) The distribution of MCCs achieved by four models and SVM with permutation dataset. (b) The distribution of recall rates achieved by four models and SVM with permutation dataset.
Figure 2. MCCs and recall rates achieved by SMV, LR, RF, and XGBoost with 1000 iterations of five-fold cross-validation and by SVM with permutation dataset. (a) The distribution of MCCs achieved by four models and SVM with permutation dataset. (b) The distribution of recall rates achieved by four models and SVM with permutation dataset.
Ijms 24 06771 g002
Figure 3. The predictive results achieved by SVM model with full features and ten features related to the 24 structural alerts.
Figure 3. The predictive results achieved by SVM model with full features and ten features related to the 24 structural alerts.
Ijms 24 06771 g003
Figure 4. The trend of recall rate changed when the number of drugs increased in the dataset. The Y-axis indicates the recall rate, and the X-axis is the number of drugs used for the prediction.
Figure 4. The trend of recall rate changed when the number of drugs increased in the dataset. The Y-axis indicates the recall rate, and the X-axis is the number of drugs used for the prediction.
Ijms 24 06771 g004
Figure 5. Workflow of our study.
Figure 5. Workflow of our study.
Ijms 24 06771 g005
Table 1. Twenty-four chemical fragments (SAs) with large differences in chemical structures between QT-prolonging drugs and no-DIQT-concern drugs.
Table 1. Twenty-four chemical fragments (SAs) with large differences in chemical structures between QT-prolonging drugs and no-DIQT-concern drugs.
IDClassNameSANumber of QT-Prolonging DrugsProportion of QT-Prolonging DrugsNumber of Non-QT-Prolonging DrugsProportion of Non-QT-Prolonging DrugsDifference
1aminestertiary aminesIjms 24 06771 i001880.6111120.12630.4848
2 sp3-hybridized carbon atoms (2)Ijms 24 06771 i0021170.8125360.37890.4336
3 tertiary aliphatic aminesIjms 24 06771 i003760.527890.09470.4330
4 16-tertiary amineIjms 24 06771 i004910.6319190.20.4319
5 aminesIjms 24 06771 i0051190.8264380.40.4264
6 B3-tertiary amineIjms 24 06771 i006760.5278100.10530.4225
7 nitrogen atoms (1)Ijms 24 06771 i007780.5417120.12630.4154
8 36-CH2NIjms 24 06771 i008650.4514140.14740.3040
9ethersethersIjms 24 06771 i009680.4722170.17890.2933
10 sp3-hybridized carbon atoms (6)Ijms 24 06771 i010860.5972320.33680.2604
11 13-etherIjms 24 06771 i011650.4514200.21050.2409
12 alkylarylethersIjms 24 06771 i012490.3403110.11580.2245
13aromatic compoundsarenesIjms 24 06771 i0131270.8819560.58950.2925
14 11-AC(3-Aromatic carbon)Ijms 24 06771 i0141190.8264550.57890.2474
15 aryl halideIjms 24 06771 i015540.375130.13680.2382
16 4-aromatic carbon-alkaneIjms 24 06771 i016810.5625310.32630.2362
17 aromatichalogenIjms 24 06771 i017530.3681130.13680.2312
18 10-ACH (3-aromatic carbon)Ijms 24 06771 i0181290.8958650.68420.2116
19othersbaseIjms 24 06771 i0191260.875490.51580.3592
20 six-membered heterocycles with one heteroatom (LS)Ijms 24 06771 i020690.4792210.22110.2581
21 2-CH2 (1-Alkane group)Ijms 24 06771 i0211300.9028630.66320.2396
22 halogen derivativesIjms 24 06771 i022660.4583220.23160.2268
23 halogensIjms 24 06771 i023660.4583220.23160.2268
24 NUCIjms 24 06771 i024910.6319400.42110.2109
Table 2. Prediction results of RF, XGBoost, SVM, LR, and Permutation_Y.
Table 2. Prediction results of RF, XGBoost, SVM, LR, and Permutation_Y.
XGBoostRFLRSVMPermutation_Y
Accuracy0.758 ± 0.0160.749 ± 0.0170.770 ± 0.0190.806 ± 0.0140.540 ± 0.037
Recall score0.837 ± 0.0200.827 ± 0.0190.822 ± 0.0220.870 ± 0.0170.695 ± 0.041
Precision score0.779 ± 0.0140.773 ± 0.0150.801 ± 0.0170.820 ± 0.0140.602 ± 0.027
MCC0.488 ± 0.0340.469 ± 0.0360.517 ± 0.0390.591 ± 0.031−0.001 ± 0.084
BACC0.738 ± 0.0170.729 ± 0.0180.756 ± 0.0190.790 ± 0.0150.500 ± 0.039
F1 score0.807 ± 0.0130.799 ± 0.0140.811 ± 0.0160.844 ± 0.0120.645 ± 0.030
AUC0.738 ± 0.0170.729 ± 0.0180.756 ± 0.0190.790 ± 0.0150.500 ± 0.039
AP0.750 ± 0.0130.744 ± 0.0140.766 ± 0.0160.791 ± 0.0130.603 ± 0.019
SE0.837 ± 0.0200.827 ± 0.0190.822 ± 0.0220.870 ± 0.0170.695 ± 0.041
SP0.640 ± 0.0280.632 ± 0.0320.691 ± 0.0320.710 ± 0.0260.305 ± 0.061
Table 3. Ten of these molecular descriptors as features that were closely associated with the 24 SAs.
Table 3. Ten of these molecular descriptors as features that were closely associated with the 24 SAs.
DescriptorDescription
D718number of CH3X groups
D756number of Al-O-Ar or Ar-O-Ar or R-O-C=X groups
D661number of quaternary ammonium (aliphatic) groups
D759number of tertiary aliphatic amine groups
D627number of tertiary amides (aliphatic) groups
D130number of halogen atoms in each molecule
D647number of primary amines (aliphatic) groups *
D626number of secondary amides (aromatic) groups
D757number of Al-NH2 groups
D598number of total tertiary C-sp3
* Aliphatic primary amines have a carbon with sp3 hybridization and two hydrogen atoms connected to the nitrogen atom.
Table 4. The top ten QT-prolonging drugs with the highest ROR values.
Table 4. The top ten QT-prolonging drugs with the highest ROR values.
Drug NameOdds RatioATC Code
Doxapram398.298R07
Cisapride *272.797A03
Ibutilide *223.102C01
Tropisetron *113.799A04
Trimebutine *113.799A03
Alfacalcidol *106.213M05, A11
Bedaquiline *97.265J04
Ethionamide79.660J04
Bepridil *78.360C08
Procainamide *59.457C01
* Drugs that were predicted as positives by our model.
Table 5. Contingency table for calculating the ROR.
Table 5. Contingency table for calculating the ROR.
Cases with Current ADRCases without Current ADR
Cases with current drugsab
Cases without current drugscd
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Long, W.; Li, S.; He, Y.; Lin, J.; Li, M.; Wen, Z. Unraveling Structural Alerts in Marketed Drugs for Improving Adverse Outcome Pathway Framework of Drug-Induced QT Prolongation. Int. J. Mol. Sci. 2023, 24, 6771. https://doi.org/10.3390/ijms24076771

AMA Style

Long W, Li S, He Y, Lin J, Li M, Wen Z. Unraveling Structural Alerts in Marketed Drugs for Improving Adverse Outcome Pathway Framework of Drug-Induced QT Prolongation. International Journal of Molecular Sciences. 2023; 24(7):6771. https://doi.org/10.3390/ijms24076771

Chicago/Turabian Style

Long, Wulin, Shihai Li, Yujie He, Jinzhu Lin, Menglong Li, and Zhining Wen. 2023. "Unraveling Structural Alerts in Marketed Drugs for Improving Adverse Outcome Pathway Framework of Drug-Induced QT Prolongation" International Journal of Molecular Sciences 24, no. 7: 6771. https://doi.org/10.3390/ijms24076771

APA Style

Long, W., Li, S., He, Y., Lin, J., Li, M., & Wen, Z. (2023). Unraveling Structural Alerts in Marketed Drugs for Improving Adverse Outcome Pathway Framework of Drug-Induced QT Prolongation. International Journal of Molecular Sciences, 24(7), 6771. https://doi.org/10.3390/ijms24076771

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop