Next Article in Journal
Design and Identification of a Novel Antiviral Affinity Peptide against Fowl Adenovirus Serotype 4 (FAdV-4) by Targeting Fiber2 Protein
Previous Article in Journal
Primary, Secondary, and Tertiary Prevention of Congenital Cytomegalovirus Infection
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

DRAVP: A Comprehensive Database of Antiviral Peptides and Proteins

School of Life Science and Technology, China Pharmaceutical University, 24 Tongjiaxiang, Nanjing 210009, China
*
Authors to whom correspondence should be addressed.
Viruses 2023, 15(4), 820; https://doi.org/10.3390/v15040820
Submission received: 8 March 2023 / Revised: 18 March 2023 / Accepted: 22 March 2023 / Published: 23 March 2023
(This article belongs to the Section Viral Immunology, Vaccines, and Antivirals)

Abstract

:
Viruses with rapid replication and easy mutation can become resistant to antiviral drug treatment. With novel viral infections emerging, such as the recent COVID-19 pandemic, novel antiviral therapies are urgently needed. Antiviral proteins, such as interferon, have been used for treating chronic hepatitis C infections for decades. Natural-origin antimicrobial peptides, such as defensins, have also been identified as possessing antiviral activities, including direct antiviral effects and the ability to induce indirect immune responses to viruses. To promote the development of antiviral drugs, we constructed a data repository of antiviral peptides and proteins (DRAVP). The database provides general information, antiviral activity, structure information, physicochemical information, and literature information for peptides and proteins. Because most of the proteins and peptides lack experimentally determined structures, AlphaFold was used to predict each antiviral peptide’s structure. A free website for users (http://dravp.cpu-bioinfor.org/, accessed on 30 August 2022) was constructed to facilitate data retrieval and sequence analysis. Additionally, all the data can be accessed from the web interface. The DRAVP database aims to be a useful resource for developing antiviral drugs.

1. Introduction

Viruses with rapid replication and easy mutation can become resistant to antiviral drug treatment. In the past two decades, there have been various epidemics, including severe acute respiratory syndrome coronavirus (SARS), Middle East respiratory syndrome coronavirus (MERS), Ebola virus, Zika virus, etc. [1,2,3,4]. Additionally, there was an outbreak of the coronavirus disease 2019 (COVID-19), caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which spread globally [5]. The pandemic became a great threat to human health. Although scientific advances have successfully discovered vaccines and antiviral drugs for several kinds of viral pathogens, the emergence and re-emergence of mutant epidemics require persistent efforts to discover novel antiviral agents.
Antiviral drugs can be divided into virus-targeting antivirals and host-targeting antivirals [6]. Virus-targeting antivirals mainly play an inhibitory role in a virus’s life cycle, such as inhibiting virus replication and transcription, blocking virus adhesion to host cells, inhibiting the activity of viral enzymes, etc. [7,8,9]. Host-targeting antivirals focus on regulating the function of host factors or other cellular processes in host cells.
The cytokines involved in the innate immune response to virus invasion act as interferons (IFNs), and interleukins have been explored as modulators of viral infections for decades [10,11]. Among them, IFNs were first discovered by Isaacs in 1957 and have been in clinical use since the 1980s [12,13]. The advances in recombinant DNA technology enabled cytokine production and the detection of their protective effect against microbial pathogens in vitro and in vivo, including IL-1, IL-6, TNF, CSF, etc. [14]. Proteins bound to viral particles have also been considered to inhibit virus invasion. For example, mannose-binding lectin (MBL) has previously been shown to bind to the SARS-CoV spike protein, which is complementarily activated by the lectin pathway [15].
Peptides that possess the potential to inhibit viruses are considered antiviral peptides (AVPs) [8]. Compared with traditional small molecule drugs, AVPs are highly specific and effective, with fewer side effects and host toxicity and show great potential as antiviral drugs [16,17,18,19]. Numerous studies have indicated that AVPs exhibit excellent antiviral activities. For example, EKL1C was shown to have broad-spectrum anticoronavirus activity and potent protection of human angiotensin-converting enzyme 2 (hACE2) in transgenic mice against authentic coronavirus infection [20,21]. Currently, antiviral therapies based on peptides and peptide-related antivirals have been confirmed for human immunodeficiency virus (HIV) and hepatitis C virus (HCV). Enfuvirtide (T-20) was the first FDA-approved fusion inhibitor to treat HIV infection, and in a recent study, Ahmadi et al. recommended that enfuvirtide has the potential to enter the clinical trial phase for treating COVID-19 [22,23]. Additionally, boceprevir and telaprevir are synthetic peptides used for treating chronic hepatitis C [24,25]. Meanwhile, several peptidomimetics, peptide-like small molecules, and amino-acid-like derivatives are currently under investigation in clinical trials for treating various viral infections [26]. Brilacidin is a mimetic of defensin being tested under phase II clinical trials for treating COVID-19 (NCT04784897) [27].
Antiviral peptides can be obtained from computational, natural, and biological sources. Among them, the computational approach is an efficient and convenient method of identifying antiviral peptides. Through molecular docking and simulating peptide structures, peptides and peptidemimics with high affinities interacting with the target can be developed and designed, which is more conducive to developing clinical antiviral drugs. In addition, deep learning and machine learning methods have been widely used in identifying antiviral peptides, with various prediction models being published, such as DeepAVP [28] and AVPIden [29]. To facilitate drug development and to identify antiviral peptides, establishing a comprehensive database of antiviral peptides is necessary. In recent years, antimicrobial peptide (AMP) databases such as DBAASP [30], DRAMP [31], and APD [32] have been built and have collected a large number of AMPs with diverse biological activities, such as antiviral, anticancer, antifungal, and so on. These databases greatly contribute to designing and identifying AMPs. However, most of the common AMP databases did not fully annotate the antiviral peptides. Some special databases, such as HIPdb [33] and ACovPepDB [34], are focused on anti-HIV peptides and anticoronavirus peptides and contain relatively few entries. The special antiviral peptide databases, such as AVPdb [35], have not been updated since 2014. Additionally, antiviral proteins and clinical information have not been included in AVPdb. To promote bioinformatics and assist experimental researchers working in the field of AVPs and antiviral-protein-based therapeutics, a complete data repository of antiviral peptides and proteins was established.
In this study, we developed a comprehensive data repository of antiviral peptides and proteins (DRAVP). The database contains information related to the sequence, antiviral activity, structure, and physicochemical properties of antiviral peptides and proteins. Meanwhile, we also provided information about approved and under-investigation drugs for antiviral therapy based on peptides and proteins. The database integrates search and sequence alignment function to facilitate the data analysis. We believe that DRAVP will be an invaluable resource for identifying and designing antiviral peptides. DRAVP is freely available online at http://dravp.cpu-bioinfor.org/ (accessed on 30 August 2022).

2. Materials and Methods

2.1. Data Collection and Extraction

To develop a comprehensive database of antiviral peptides and proteins, an extensive search was carried out on published articles and public databases. To collect the antiviral peptides, “((virus[Title/Abstract] OR viral[Title/Abstract]) AND (peptide[Title/Abstract] OR peptides[Title/Abstract])) AND (inhibit*[Title/Abstract] OR block*[Title/Abstract])” were used to search PubMed. This search provided 9559 peer-reviewed articles. From these articles, only antiviral peptides with known amino acid sequences and experimental information were manually extracted. For the antiviral proteins, a text search was carried out in UniProt [36] and PubMed using keywords “antiviral protein*”. After that, the proteins with antiviral effects were screened and registered in DRAVP. Eventually, a total of 1986 antiviral peptides and 139 antiviral proteins, along with detailed information such as the sequence, source, name, modifications, IC50, EC50, etc., were systematically compiled. DRAVP collects all antiviral activity data as much as possible from articles. Additionally, the physicochemical properties of antiviral peptides were calculated using SciDBMaker [37]. For the structure information, the experimental three-dimensional structures were mainly retrieved from PDB through a sequence blast against the PDB [38] database. If an exact match was obtained, the corresponding PDB ID was assigned to the database. In addition, we also served the modifications and stereochemistry for each sequence on the detailed information page.
To support the clinical research of antiviral therapies, information about antivirals based on peptides and proteins that have been approved or are in the clinical stage was extracted from the literature and drug databases. To facilitate user access to more detailed information about clinical and drug entries, we also provided abundant external links for each entry. Additionally, the clinical trials of each entry were compiled in the database.

2.2. Structure Prediction and Evaluation

Antiviral peptides’ structures play an important role in drug development. Due to the experiment’s difficulty, we could only collect a small number of experimentally determined structures from the PDB database. For antiviral peptides and proteins lacking experimental structures in DRAVP, AlphaFold [39] was used to predict the structures. The structure parameters for prediction were set as model: monomer; and multiple sequence alignment information (MSA) database: full_dbs (all gene databases). For each entry, five structures were generated, and the best-ranking structure was selected according to the predicted local distance difference test (pLDDT) score, then displayed on the web interface using Mol* Viewer [40]. To evaluate the structure’s accuracy predicted using AlphaFold, we randomly selected 10 antiviral peptides to benchmark. The structure prediction’s accuracy was judged by comparing the predicted structures with the experimentally determined models and calculating the root-mean-square deviation (RMSD). Additionally, Whatcheck [41] and Procheck [42] were used to evaluate the predicted structures’ quality levels. Whatcheck checks the stereochemical parameters of residues in the model and compares the differences between the submitted peptide structure and the normal structure. Procheck evaluates the stereochemical quality of peptide structures by analyzing whether the Φ/Ψ angle of the amino acid residues is in a reasonable region. Both methods provide “error”, “warning”, or “pass” levels for each evaluation indicator.

2.3. Database Construction

The DRAVP web architecture was based on the platform Linux-Apache-MySQL-PHP. After collecting and screening, all the data were stored in the MySQL database as the back end. The visualization software Navicat for MySQL was used to manage the data. Additionally, HTML, PHP, and JavaScript were applied to develop the front-end web interfaces. The database is planned to be updated every three months. To expand the data in the database, a submission page was created for users to submit antiviral peptides and proteins that were not included. We will review the submitted information and add it to our database. The database construction workflow can be overviewed as shown in Figure 1.

3. Results and Discussion

3.1. Database

DRAVP is freely available at http://dravp.cpu-bioinfor.org/ (accessed on 30 August 2022). Currently, DRAVP contains 1986 antiviral peptides, 139 antiviral proteins, and 64 clinical entries. In the database, 1327 entries of antiviral peptides have predicted structures, while 40 entries have experimentally determined structures from the PDB database. The homepage of DRAVP provides a brief introduction to the database. The navigation bar and quick search tool ensure the use of the website is convenient. The main web page of the DRAVP database contains the following interfaces: Search, Browse, Blast, Statistics, and Download. In addition, users can learn more information from the help page. Figure 2 shows screenshots of the web interfaces, and a brief function description of each page is provided below.
(i)
Search page: This page can help users to more conveniently fetch the desired data. To meet different needs, a simple search and advanced search for data retrieval were set up. The simple search allows users to retrieve data from a specific field from the dropdown menu. The advanced search is carried out through a combination of multiple fields, such as sequence, target virus, or UniProt ID, to achieve more accurate data retrieval.
(ii)
Browse page: The database provides three cross-linked browse tables based on different criteria: (1) classification based on data type, including antiviral peptides, antiviral proteins, patent data, and clinical data; (2) classification based on targeted virus type, with a total of 47 different virus types; and (3) classification based on virus family, such as coronaviridae, flaviviridae, and retroviridae. Some basic information, such as DRAVP ID, peptide name, source, and target organism, is provided in the overview table.
(iii)
Blast: To facilitate the sequence analysis in database, the Blast [43] search tool was integrated into DRAVP. A user can submit sequences in the FASTA format and choose the score matrix to obtain the sequence alignment results, which contain the score, E value, identities, positives, and gaps. The sequence alignment is shown for the peptides or proteins found to be identical or similar in the database.
(iv)
Download page: All the data and website source code are freely available for users. Three types of data files are provided on this page to satisfy different requirements. Meanwhile, the predicted structure and installed package of the Blast software are also collected on the page for download.

3.2. Database Content

Table 1 shows detailed information related to antiviral peptides and proteins that are displayed on the interfaces. Each entry of the antiviral peptides includes five parts: general information, activity information, structure information, physicochemical information, and literature information. Low cytotoxicity and hemolytics are essential conditions for developing antiviral peptides as drugs. For the convenience of researchers to design antiviral peptides with high activity and low toxicity, we have collected experimental information on cytotoxicity and hemolytic in the database. All the data were manually extracted from articles. Additionally, the data’s exactitude is one of the most important criteria for database evaluation. To prevent any possible errors, we provided literature sources for each included activity data on the web interface. All the data are available on the DRAVP website. In the physicochemical information section, we show each peptide’s amino acid distribution, which is helpful for analyzing the differences and rational design of the antiviral peptides. For antiviral proteins, we also provided comment information, which includes some information such as the half-life and antiviral mechanism.
The clinical data include antiviral drugs based on peptides and proteins that are approved or in the clinical trial stage. Table 2 shows the information displayed on the web interface of the clinical entries. We also provided external links to access various databases, such as PubChem [44], DrugBank [45], and ChEMBL [46], so users can gain more comprehensive information about entries from other sources. The clinical data may serve as data mining examples to help users focus on the most promising drug candidates. At present, the drugs that are approved and under investigation in clinical trials are mainly used for treating human immunodeficiency virus (HIV) infection, hepatitis C virus (HCV) infection, and COVID-19.

3.3. Structure Verification

AlphaFold has been verified to be highly accurate in protein structure prediction, but the reliability and accuracy in peptide structure prediction remain to be evaluated. Ten entries with experimentally determined structures were randomly selected from the database to verify AlphaFold’s peptide structure prediction performance. Table 3 shows the results of the Cα RMSDs between the predicted and experimentally determined structures. From the results, we found that the antiviral peptides were predicted with good accuracy and few outliers. The minimum RMSD value was 0.460 Å, and most Cα RMSDs were less than 2 Å. Additionally, the peptides with long sequences had higher accuracies than the short peptides on predicted structures.
According to Whatcheck and Procheck’s results, the predicted structures’ qualities were at high levels (Table 4). The average error rate evaluated by Whatcheck was at a lower level (11.42%). Additionally, in the Ramachandran plot created with Procheck, most residues were distributed in the core regions. Only DRAVPe01958 had one residue (Val18) in the disallowed region, but the Cα RMSD value of the entry between the predicted structure and known structure was 0.460 Å, which explains why the predicted structure is more accurate. McDonald et al. [47] found that AlphaFold predicted α-helical, β-hairpin, and disulfide-rich peptides with high accuracy. Additionally, our research showed that antiviral peptides were mostly α-helical peptides. Therefore, AlphaFold is reliable for peptide structure prediction.
Additionally, to know each predicted structure’s exactitude, the Mol* Viewer integrates the detailed information page to display the predicted structure, which allows users to check each amino acid’s pLDDT score. Figure 3 shows some examples containing high- and low-confidence regions. Currently, DRAVP contains 1327 predicted structures and 40 experimentally determined structures from the PDB database.

3.4. Statistics

To further understand antiviral peptides’ features as a basis for peptide design, statistics were calculated in DRAVP (Figure 4). Most peptides consisted of less than 40 amino acid residues. As shown in Figure 4B, the proportion of hydrophobic residues in most peptides was 30–50%, indicating that antiviral peptides in DRAVP are less hydrophobic. By analyzing the results, we found that short peptides with low hydrophobicity were relatively easy to form into antiviral peptides, which may be useful for designing antiviral peptides. Among the antiviral peptides, the number with anti-HIV activity was largest, and there were 124 entries with anti-SARS-CoV-2 activity. According to the secondary structure statistics, the helix is the main component of secondary structures, followed by the coil, and with the strand accounting for the least, which shows that antiviral peptides prefer an α-helix in stable secondary structures. Now the database contains 64 entries of clinical data consisting of 33 proteins, 24 peptides, and other antivirals with peptide structures such as peptidomimetic. As can be seen in Figure 4E, 30 entries are approved or in clinical trials for treating COVID-19, such as Aviptadil, which acts as an interleukin-6 inhibitor for treating COVID-19 (NCT04843761) and is in a phase 3 clinical trial, and Sotrovimab, which is a monoclonal antibody approved for COVID-19 treatment. For antiviral proteins, more than half of the proteins were isolated from humans (55%), with others being extracted from mice, pigs, rats, bovines, and so on. Among these antiviral proteins, IFN and IFN-induced proteins account for the majority, such as IFN-inducible transmembrane (IFITM) proteins and Mx GTPases.
It is a rapid and efficient approach for researchers to design new antiviral peptides with improved activity using the known peptides as templates. Table 5 lists the entries with the highest antiviral activity targeting various viruses in DRAVP, such as SARS-CoV-2, HIV, EBOV, etc. By analyzing antiviral mechanisms, most of the peptides were found to play an antiviral effect by blocking viral entry into host cells, which indicates that preventing viral entry is a promising target for antiviral intervention. These findings may contribute to the rational design of novel peptides with improved activities and the clinical application of antiviral peptides.

4. Conclusions

Recently, increasing reports of viral mutation and emergencies require constant work to discover novel antiviral agents. Antiviral peptides exert both preventative and therapeutic functions against viral infection. However, peptide drugs generally have short half-lives and poor stability. To better promote the development of antiviral peptides as drugs, it was necessary to construct a database. Here, we constructed a comprehensive database of antiviral peptides and proteins that aims to be a useful resource for discovering and designing novel antiviral drugs. A user-friendly interface was established to facilitate data searching, browsing, and alignment. The database allows the user to download all the data and source code. DRAVP will continue to collect experimentally determined antiviral peptides and reupdate all existing data. We promise to regularly update the database for at least 5 years.

Author Contributions

Y.L. extracted the data, developed the database, and wrote the manuscript. Y.Z. and X.S. tested the database. T.M. extracted the data. X.L. provided constructive suggestions for this work. H.Z. guided and supervised the whole process of this work. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (grant numbers 82073767 and 82273834) and Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD2014-65). The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The source code for the DRAVP database website has been uploaded to GitHub: https://github.com/CPUDRAVP/DRAVP (accessed on 15 November 2022).

Acknowledgments

We thank the calculation support of the High-Performance Computing Center, China Pharmaceutical University.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Marston, B.J.; Dokubo, E.K.; van Steelandt, A.; Martel, L.; Williams, D.; Hersey, S.; Jambai, A.; Keita, S.; Nyenswah, T.G.; Redd, J.T. Ebola Response Impact on Public Health Programs, West Africa, 2014–2017. Emerg. Infect. Dis. 2017, 23, S25–S32. [Google Scholar] [CrossRef] [Green Version]
  2. Pardy, R.D.; Richer, M.J. Zika Virus Pathogenesis: From Early Case Reports to Epidemics. Viruses 2019, 11, 886. [Google Scholar] [CrossRef] [Green Version]
  3. Zhong, N.S.; Zheng, B.J.; Li, Y.M.; Poon, L.L.M.; Xie, Z.H.; Chan, K.H.; Li, P.H.; Tan, S.Y.; Chang, Q.; Xie, J.P.; et al. Epidemiology and cause of severe acute respiratory syndrome (SARS) in Guangdong, People’s Republic of China, in February, 2003. Lancet 2003, 362, 1353–1358. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. The Lancet. MERS-CoV: A global challenge. Lancet 2013, 381, 1960. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Chen, B.; Tian, E.K.; He, B.; Tian, L.; Han, R.; Wang, S.; Xiang, Q.; Zhang, S.; El Arnaout, T.; Cheng, W. Overview of lethal human coronaviruses. Signal. Transduct. Target. Ther. 2020, 5, 89. [Google Scholar] [CrossRef]
  6. Lou, Z.; Sun, Y.; Rao, Z. Current progress in antiviral strategies. Trends Pharmacol. Sci. 2014, 35, 86–102. [Google Scholar] [CrossRef] [PubMed]
  7. Heydari, H.; Golmohammadi, R.; Mirnejad, R.; Tebyanian, H.; Fasihi-Ramandi, M.; Moosazadeh Moghaddam, M. Antiviral peptides against Coronaviridae family: A review. Peptides 2021, 139, 170526. [Google Scholar] [CrossRef]
  8. Agarwal, G.; Gabrani, R. Antiviral Peptides: Identification and Validation. Int. J. Pept. Res. Ther. 2021, 27, 149–168. [Google Scholar] [CrossRef]
  9. Tonk, M.; Růžek, D.; Vilcinskas, A. Compelling Evidence for the Activity of Antiviral Peptides against SARS-CoV-2. Viruses 2021, 13, 912. [Google Scholar] [CrossRef]
  10. Wack, A.; Openshaw, P.; O’Garra, A. Contribution of cytokines to pathology and protection in virus infection. Curr. Opin. Virol. 2011, 1, 184–195. [Google Scholar] [CrossRef] [Green Version]
  11. Breen, E.C. Pro- and anti-inflammatory cytokines in human immunodeficiency virus infection and acquired immunodeficiency syndrome. Pharmacol. Ther. 2002, 95, 295–304. [Google Scholar] [CrossRef] [PubMed]
  12. Pang, T. Cytokines in infection. Med. J. Malays. 1993, 48, 9–11. [Google Scholar]
  13. Poynard, T.; Marcellin, P.; Lee, S.S.; Niederau, C.; Minuk, G.S.; Ideo, G.; Bain, V.; Heathcote, J.; Zeuzem, S.; Trepo, C.; et al. Randomised trial of interferon alpha2b plus ribavirin for 48 weeks or for 24 weeks versus interferon alpha2b plus placebo for 48 weeks for treatment of chronic infection with hepatitis C virus. International Hepatitis Interventional Therapy Group (IHIT). Lancet 1998, 352, 1426–1432. [Google Scholar] [CrossRef] [PubMed]
  14. Wyde, P.R.; Moore, D.K.; Pimentel, D.M.; Gilbert, B.E.; Nimrod, R.; Panet, A. Recombinant superoxide dismutase (SOD) administered by aerosol inhibits respiratory syncytial virus infection in cotton rats. Antivir. Res. 1996, 31, 173–184. [Google Scholar] [CrossRef]
  15. Stravalaci, M.; Pagani, I.; Paraboschi, E.M.; Pedotti, M.; Doni, A.; Scavello, F.; Mapelli, S.N.; Sironi, M.; Perucchini, C.; Varani, L.; et al. Recognition and inhibition of SARS-CoV-2 by humoral innate immunity pattern recognition molecules. Nat. Immunol. 2022, 23, 275–286. [Google Scholar] [CrossRef] [PubMed]
  16. Rani, P.; Kapoor, B.; Gulati, M.; Atanasov, A.G.; Alzahrani, Q.; Gupta, R. Antimicrobial peptides: A plausible approach for COVID-19 treatment. Expert. Opin. Drug. Discov. 2022, 17, 473–487. [Google Scholar] [CrossRef]
  17. Feng, M.; Fei, S.; Xia, J.; Labropoulou, V.; Swevers, L.; Sun, J. Antimicrobial Peptides as Potential Antiviral Factors in Insect Antiviral Immune Response. Front. Immunol. 2020, 11, 2030. [Google Scholar] [CrossRef]
  18. Zakaryan, H.; Chilingaryan, G.; Arabyan, E.; Serobian, A.; Wang, G. Natural antimicrobial peptides as a source of new antiviral agents. J. Gen. Virol. 2021, 102, 001661. [Google Scholar] [CrossRef]
  19. Vilas Boas, L.C.P.; Campos, M.L.; Berlanda, R.L.A.; de Carvalho Neves, N.; Franco, O.L. Antiviral peptides as promising therapeutic drugs. Cell. Mol. Life Sci. 2019, 76, 3525–3542. [Google Scholar] [CrossRef]
  20. Zhou, J.; Xu, W.; Liu, Z.; Wang, C.; Xia, S.; Lan, Q.; Cai, Y.; Su, S.; Pu, J.; Xing, L.; et al. A highly potent and stable pan-coronavirus fusion inhibitor as a candidate prophylactic and therapeutic for COVID-19 and other coronavirus diseases. Acta Pharm. Sin. B 2022, 12, 1652–1661. [Google Scholar] [CrossRef]
  21. Xia, S.; Chan, J.F.; Wang, L.; Jiao, F.; Chik, K.K.; Chu, H.; Lan, Q.; Xu, W.; Wang, Q.; Wang, C.; et al. Peptide-based pan-CoV fusion inhibitors maintain high potency against SARS-CoV-2 Omicron variant. Cell Res. 2022, 32, 404–406. [Google Scholar] [CrossRef] [PubMed]
  22. Dando, T.M.; Perry, C.M. Enfuvirtide. Drugs 2003, 63, 2755–2766. [Google Scholar] [CrossRef] [PubMed]
  23. Ahmadi, K.; Farasat, A.; Rostamian, M.; Johari, B.; Madanchi, H. Enfuvirtide, an HIV-1 fusion inhibitor peptide, can act as a potent SARS-CoV-2 fusion inhibitor: An in silico drug repurposing study. J. Biomol. Struct. Dyn. 2022, 40, 5566–5576. [Google Scholar] [CrossRef]
  24. Wilby, K.J.; Partovi, N.; Ford, J.A.; Greanya, E.; Yoshida, E.M. Review of boceprevir and telaprevir for the treatment of chronic hepatitis C. Can. J. Gastroenterol. 2012, 26, 205–210. [Google Scholar] [CrossRef]
  25. Mousavi Maleki, M.S.; Sardari, S.; Ghandehari Alavijeh, A.; Madanchi, H. Recent Patents and FDA-Approved Drugs Based on Antiviral Peptides and Other Peptide-Related Antivirals. Int. J. Pept. Res. Ther. 2023, 29, 5. [Google Scholar] [CrossRef]
  26. Mousavi Maleki, M.S.; Rostamian, M.; Madanchi, H. Antimicrobial peptides and other peptide-like therapeutics as promising candidates to combat SARS-CoV-2. Expert. Rev. Anti Infect. Ther. 2021, 19, 1205–1217. [Google Scholar] [CrossRef]
  27. Ali, W.; Elsahn, A.; Ting, D.S.J.; Dua, H.S.; Mohammed, I. Host Defence Peptides: A Potent Alternative to Combat Antimicrobial Resistance in the Era of the COVID-19 Pandemic. Antibiotics 2022, 11, 475. [Google Scholar] [CrossRef]
  28. Li, J.; Pu, Y.; Tang, J.; Zou, Q.; Guo, F. DeepAVP: A Dual-Channel Deep Neural Network for Identifying Variable-Length Antiviral Peptides. IEEE J. Biomed. Health Inform. 2020, 24, 3012–3019. [Google Scholar] [CrossRef] [PubMed]
  29. Pang, Y.; Yao, L.; Jhong, J.H.; Wang, Z.; Lee, T.Y. AVPIden: A new scheme for identification and functional prediction of antiviral peptides based on machine learning approaches. Brief. Bioinform. 2021, 22, bbab263. [Google Scholar] [CrossRef]
  30. Pirtskhalava, M.; Amstrong, A.A.; Grigolava, M.; Chubinidze, M.; Alimbarashvili, E.; Vishnepolsky, B.; Gabrielian, A.; Rosenthal, A.; Hurt, D.E.; Tartakovsky, M. DBAASP v3: Database of antimicrobial/cytotoxic activity and structure of peptides as a resource for development of new therapeutics. Nucleic Acids Res. 2021, 49, D288–D297. [Google Scholar] [CrossRef]
  31. Shi, G.; Kang, X.; Dong, F.; Liu, Y.; Zhu, N.; Hu, Y.; Xu, H.; Lao, X.; Zheng, H. DRAMP 3.0: An enhanced comprehensive data repository of antimicrobial peptides. Nucleic Acids Res. 2022, 50, D488–D496. [Google Scholar] [CrossRef]
  32. Wang, G.; Li, X.; Wang, Z. APD3: The antimicrobial peptide database as a tool for research and education. Nucleic Acids Res. 2016, 44, D1087–D1093. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Qureshi, A.; Thakur, N.; Kumar, M. HIPdb: A database of experimentally validated HIV inhibiting peptides. PLoS ONE 2013, 8, e54908. [Google Scholar] [CrossRef] [Green Version]
  34. Zhang, Q.; Chen, X.; Li, B.; Lu, C.; Yang, S.; Long, J.; Chen, H.; Huang, J.; He, B. A database of anti-coronavirus peptides. Sci. Data 2022, 9, 294. [Google Scholar] [CrossRef]
  35. Qureshi, A.; Thakur, N.; Tandon, H.; Kumar, M. AVPdb: A database of experimentally validated antiviral peptides targeting medically important viruses. Nucleic Acids Res. 2014, 42, D1147–D1153. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. The UniProt Consortium. UniProt: The universal protein knowledgebase in 2021. Nucleic Acids Res. 2021, 49, D480–D489. [Google Scholar] [CrossRef]
  37. Hammami, R.; Zouhir, A.; Naghmouchi, K.; Ben Hamida, J.; Fliss, I. SciDBMaker: New software for computer-aided design of specialized biological databases. BMC Bioinform. 2008, 9, 121. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  38. Burley, S.K.; Berman, H.M.; Kleywegt, G.J.; Markley, J.L.; Nakamura, H.; Velankar, S. Protein Data Bank (PDB): The Single Global Macromolecular Structure Archive. Methods Mol. Biol. 2017, 1607, 627–641. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  39. Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A.; et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef] [PubMed]
  40. Sehnal, D.; Bittrich, S.; Deshpande, M.; Svobodová, R.; Berka, K.; Bazgier, V.; Velankar, S.; Burley, S.K.; Koča, J.; Rose, A.S. Mol* Viewer: Modern web app for 3D visualization and analysis of large biomolecular structures. Nucleic Acids Res. 2021, 49, W431–W437. [Google Scholar] [CrossRef]
  41. Hooft, R.W.; Vriend, G.; Sander, C.; Abola, E.E. Errors in protein structures. Nature 1996, 381, 272. [Google Scholar] [CrossRef] [PubMed]
  42. Laskowski, R.A.; Rullmannn, J.A.; MacArthur, M.W.; Kaptein, R.; Thornton, J.M. AQUA and PROCHECK-NMR: Programs for checking the quality of protein structures solved by NMR. J. Biomol. NMR 1996, 8, 477–486. [Google Scholar] [CrossRef]
  43. Camacho, C.; Coulouris, G.; Avagyan, V.; Ma, N.; Papadopoulos, J.; Bealer, K.; Madden, T.L. BLAST+: Architecture and applications. BMC Bioinform. 2009, 10, 421. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Kim, S.; Chen, J.; Cheng, T.; Gindulyte, A.; He, J.; He, S.; Li, Q.; Shoemaker, B.A.; Thiessen, P.A.; Yu, B.; et al. PubChem in 2021: New data content and improved web interfaces. Nucleic Acids Res. 2021, 49, D1388–D1395. [Google Scholar] [CrossRef]
  45. Wishart, D.S.; Feunang, Y.D.; Guo, A.C.; Lo, E.J.; Marcu, A.; Grant, J.R.; Sajed, T.; Johnson, D.; Li, C.; Sayeeda, Z.; et al. DrugBank 5.0: A major update to the DrugBank database for 2018. Nucleic Acids Res. 2018, 46, D1074–D1082. [Google Scholar] [CrossRef]
  46. Mendez, D.; Gaulton, A.; Bento, A.P.; Chambers, J.; De Veij, M.; Félix, E.; Magariños, M.P.; Mosquera, J.F.; Mutowo, P.; Nowotka, M.; et al. ChEMBL: Towards direct deposition of bioassay data. Nucleic Acids Res. 2019, 47, D930–D940. [Google Scholar] [CrossRef]
  47. McDonald, E.F.; Jones, T.; Plate, L.; Meiler, J.; Gulsevin, A. Benchmarking AlphaFold2 on peptide structure prediction. Structure 2023, 31, 111–119.e112. [Google Scholar] [CrossRef]
  48. Cao, L.; Goreshnik, I.; Coventry, B.; Case, J.B.; Miller, L.; Kozodoy, L.; Chen, R.E.; Carter, L.; Walls, A.C.; Park, Y.J.; et al. De novo design of picomolar SARS-CoV-2 miniprotein inhibitors. Science 2020, 370, 426–431. [Google Scholar] [CrossRef] [PubMed]
  49. Zhu, Y.; Chong, H.; Yu, D.; Guo, Y.; Zhou, Y.; He, Y. Design and Characterization of Cholesterylated Peptide HIV-1/2 Fusion Inhibitors with Extremely Potent and Long-Lasting Antiviral Activity. J. Virol. 2019, 93, e02312-18. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  50. Yin, P.; Zhang, L.; Ye, F.; Deng, Y.; Lu, S.; Li, Y.P.; Zhang, L.; Tan, W. A screen for inhibitory peptides of hepatitis C virus identifies a novel entry inhibitor targeting E1 and E2. Sci. Rep. 2017, 7, 3976. [Google Scholar] [CrossRef] [Green Version]
  51. Schmidt, A.G.; Yang, P.L.; Harrison, S.C. Peptide inhibitors of flavivirus entry derived from the E protein stem. J. Virol. 2010, 84, 12549–12554. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  52. Yu, Y.; Cooper, C.L.; Wang, G.; Morwitzer, M.J.; Kota, K.; Tran, J.P.; Bradfute, S.B.; Liu, Y.; Shao, J.; Zhang, A.K.; et al. Engineered Human Cathelicidin Antimicrobial Peptides Inhibit Ebola Virus Infection. iScience 2020, 23, 100999. [Google Scholar] [CrossRef] [PubMed]
  53. Zeng, Z.; Zhang, R.; Hong, W.; Cheng, Y.; Wang, H.; Lang, Y.; Ji, Z.; Wu, Y.; Li, W.; Xie, Y.; et al. Histidine-rich Modification of a Scorpion-derived Peptide Improves Bioavailability and Inhibitory Activity against HSV-1. Theranostics 2018, 8, 199–211. [Google Scholar] [CrossRef] [PubMed]
  54. Ho, T.Y.; Wu, S.L.; Chen, J.C.; Wei, Y.C.; Cheng, S.E.; Chang, Y.H.; Liu, H.J.; Hsiang, C.Y. Design and biological activities of novel inhibitory peptides for SARS-CoV spike protein and angiotensin-converting enzyme 2 interaction. Antivir. Res. 2006, 69, 70–76. [Google Scholar] [CrossRef]
  55. Xia, S.; Liu, M.; Wang, C.; Xu, W.; Lan, Q.; Feng, S.; Qi, F.; Bao, L.; Du, L.; Liu, S.; et al. Inhibition of SARS-CoV-2 (previously 2019-nCoV) infection by a highly potent pan-coronavirus fusion inhibitor targeting its spike protein that harbors a high capacity to mediate membrane fusion. Cell Res. 2020, 30, 343–355. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. Overview of DRAVP database construction.
Figure 1. Overview of DRAVP database construction.
Viruses 15 00820 g001
Figure 2. Screenshots of DRAVP web interface. (a) The advanced search page, which enables the user to access the desired data through multifield combination retrieval. (b) The cross-linked browse page, which provides three classified methods to facilitate browsing: by data type, by target virus, and by virus family. (c) The BLAST analysis tool, which enables users to access the most relevant sequence by its local alignments. (d) The download page, which provides three types of files of the data and the predicted structures.
Figure 2. Screenshots of DRAVP web interface. (a) The advanced search page, which enables the user to access the desired data through multifield combination retrieval. (b) The cross-linked browse page, which provides three classified methods to facilitate browsing: by data type, by target virus, and by virus family. (c) The BLAST analysis tool, which enables users to access the most relevant sequence by its local alignments. (d) The download page, which provides three types of files of the data and the predicted structures.
Viruses 15 00820 g002
Figure 3. Example outputs containing both high- and low-confidence regions.
Figure 3. Example outputs containing both high- and low-confidence regions.
Viruses 15 00820 g003
Figure 4. Data statistics in DRAVP database. (A) The length distribution of antiviral peptides in DRAVP. (B) Hydrophobic content distribution of antiviral peptides. The hydrophobic content represents a ratio between total hydrophobic residues and total amino acids. (C) Number of antiviral peptides target various viruses. (D) Distribution of secondary structure in predicted structures. (E) Type of virus infection that the AVP used for treatment of clinical data. Among them, the number of entries used to treat COVID-19 and HIV infections is largest. (F) Source of antiviral proteins.
Figure 4. Data statistics in DRAVP database. (A) The length distribution of antiviral peptides in DRAVP. (B) Hydrophobic content distribution of antiviral peptides. The hydrophobic content represents a ratio between total hydrophobic residues and total amino acids. (C) Number of antiviral peptides target various viruses. (D) Distribution of secondary structure in predicted structures. (E) Type of virus infection that the AVP used for treatment of clinical data. Among them, the number of entries used to treat COVID-19 and HIV infections is largest. (F) Source of antiviral proteins.
Viruses 15 00820 g004
Table 1. List of fields in antiviral peptides dataset.
Table 1. List of fields in antiviral peptides dataset.
ColumnFieldsDescription
General informationDRAVP IDThe unique ID linking to the corresponding DRAVP entry
Peptide nameName of each peptide/protein
SequenceAmino acid composition of peptide/protein
Sequence lengthThe length of peptide/protein sequence
UniProt IDThe accessing link directing to external UniProt entry
SourceThe organism where the peptide/protein is extracted or isolated
Activity informationTarget organismType of targeted virus
AssayMethods for determining the antiviral activity
ActivityAntiviral activity information against viruses; contains EC50, IC50, etc.
Hemolytic activityHemolytic activity against red blood cells
CytotoxicityCytotoxicity against cells except for RBCs
Binding targetPeptide active action site
MechanismMechanism of peptides exhibit antiviral activity
Structure informationPDB IDProvide accessing link directing to the corresponding PDB entry
Predicted structure downloadProvide the predicted structure file
Linear/cyclicPeptide linear or cyclic structure
N/C-terminal modificationThe modifications of N/C-terminal according to the references
Other modificationAll bonds and special amino acids
StereochemistryThe L/D amino acids consist of peptides
Physicochemical informationFormula, mass, pI, net charge, absent amino acids, common amino acids, basic residues, acidic residues, hydrophobic residues, Boman index, hydrophobicity, aliphatic index, half-life, extinction coefficient cystines, absorbance 280 nm, and polar residues
Literature informationTitle, PubMed ID, journal information, and doi of reference articles
Table 2. List of fields in clinical dataset.
Table 2. List of fields in clinical dataset.
FieldDescription
DRAVP IDThe unique ID linking to the corresponding DRAVP entry
NameName of the entry
SequenceAmino acid composition of peptide
Molecular formulaMolecular formula of the item
Condition/diseaseDiseases used for treatment
GroupClinical trials status
TypePeptide, protein, or other types of drugs
PubChem IDThe accessing link directing to external PubChem entry
DrugBank accession numberThe accessing number in DrugBank database
DescriptionSome information related to the entry, such as antiviral mechanism, structure description, etc.
Active sequence/structureThe chemical structure or modified sequence
CHEMBL IDThe accessing link directing to CHEMBL database
UNIIThe accessing link directing to Global Substance Registration System
CASThe accessing link directing to CAS common chemistry
ReferenceThe PubMed ID of reference articles
Clinical trials informationIncluding NCT number, study title, condition/disease, status, sponsor, and phase of clinical trials
Table 3. Comparison between predicted structures and experimental structures.
Table 3. Comparison between predicted structures and experimental structures.
DRAVP IDPDB IDSequence LengthRMSD(Å)
DRAVPe002792ERI310.486
DRAVPe015602LZI181.327
DRAVPe003031ZMQ320.409
DRAVPe015981ULL171.772
DRAVPe003732JOS222.158
DRAVPe008372MLT261.507
DRAVPe013031ZMP320.519
DRAVPe002951ZA8310.680
DRAVPe019581FD4410.460
DRAVPe003766GS5132.305
Table 4. The evaluation results of predicted structures using Whatcheck and Procheck.
Table 4. The evaluation results of predicted structures using Whatcheck and Procheck.
DRAVP IDWhatcheckProcheck
Total
Metrics
ErrorError RateCore
Regions
Additional and
Generous Allowed
Regions
Disallowed
Regions
DRAVPe0027945511.11%80.0%20.0%0.0%
DRAVPe0156037616.22%92.9%7.1%0.0%
DRAVPe0030345511.11%100.0%0.0%0.0%
DRAVPe0159839512.82%100.0%0.0%0.0%
DRAVPe003734349.30%100.0%0.0%0.0%
DRAVPe008374249.52%100.0%0.0%0.0%
DRAVPe0130342511.90%96.3%3.7%0.0%
DRAVPe0029543511.63%88.9%11.1%0.0%
DRAVPe0195846510.87%96.7%0.0%3.3%
DRAVPe003764149.76%100.0%0.0%0.0%
Average 11.42%95.48%4.19%0.33%
Table 5. List of antiviral peptides with the highest activity for eight different viruses.
Table 5. List of antiviral peptides with the highest activity for eight different viruses.
DRAVP_IDVirusTargetMechanismIC50/EC50Reference
DRAVPe00482SARS-CoV-2S proteinPrevent viral entry by binding to the spike receptor-binding domain (RBD) and blocking it from binding to ACE223.54 pMCao et al. [48]
DRAVPe00788Human immunodeficiency virus (HIV-1 and HIV-2)Envelope proteinInhibit viral entry by binding to the prehairpin intermediate (PHI) conformation and competitively prevent its transition to 6-HB0.43 pM ~0.026 nMZhu et al. [49]
DRAVPe01747Hepatitis C virus (HCV)Envelope proteinInhibit viral entry at the postattachment step and block cell-to-cell transmission1–5 nMYin et al. [50]
DRAVPe01288Dengue virus (DENV1, DENV2, DENV3, DENV4)-Interact with the viral membrane bringing peptides into the low-pH endosome, where full exposure of the peptide site on the E conformational intermediate leads to the tight, specific binding that blocks membrane fusionIC90 = 0.1–4 μMSchmidt et al. [51]
DRAVPe01806Ebola virus (EBOV)-Impair cathepsin B-mediated processing of EBOV glycoprotein, thus preventing virus entry0.99 µMYu et al. [52]
DRAVPe00757Herpes simplex virus (HSV-1)-Inhibit the viral attachment and viral entry of HSV-10.67–2.88 µg/mLZeng et al. [53]
DRAVPe01977SARS-CoVACE2Inhibit viral entry by blocking the binding of S protein to ACE21.88 nMHo et al. [54]
DRAVPe00407MERS-CoVS proteinInhibit viral six-helical bundle (6-HB) formation, thereby preventing viral fusion and entry into host cells4.2 nMXia et al. [55]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, Y.; Zhu, Y.; Sun, X.; Ma, T.; Lao, X.; Zheng, H. DRAVP: A Comprehensive Database of Antiviral Peptides and Proteins. Viruses 2023, 15, 820. https://doi.org/10.3390/v15040820

AMA Style

Liu Y, Zhu Y, Sun X, Ma T, Lao X, Zheng H. DRAVP: A Comprehensive Database of Antiviral Peptides and Proteins. Viruses. 2023; 15(4):820. https://doi.org/10.3390/v15040820

Chicago/Turabian Style

Liu, Yanchao, Youzhuo Zhu, Xin Sun, Tianyue Ma, Xingzhen Lao, and Heng Zheng. 2023. "DRAVP: A Comprehensive Database of Antiviral Peptides and Proteins" Viruses 15, no. 4: 820. https://doi.org/10.3390/v15040820

APA Style

Liu, Y., Zhu, Y., Sun, X., Ma, T., Lao, X., & Zheng, H. (2023). DRAVP: A Comprehensive Database of Antiviral Peptides and Proteins. Viruses, 15(4), 820. https://doi.org/10.3390/v15040820

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop