Next Article in Journal
Aminomethylmorpholino Nucleosides as Novel Inhibitors of PARP1 and PARP2: Experimental and Molecular Modeling Analyses of Their Selectivity and Mechanism of Action
Previous Article in Journal
Glioma-Derived Exosomes and Their Application as Drug Nanoparticles
Previous Article in Special Issue
Structural Catalytic Core in Subtilisin-like Proteins and Its Comparison to Trypsin-like Serine Proteases and Alpha/Beta-Hydrolases
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Prediction of Protein Secondary Structures Based on Substructural Descriptors of Molecular Fragments

by
Oleg S. Zakharov
1,
Anastasia V. Rudik
2,
Dmitry A. Filimonov
2 and
Alexey A. Lagunin
1,2,*
1
Department of Bioinformatics, Pirogov Russian National Research Medical University, 117997 Moscow, Russia
2
Department of Bioinformatics, Institute of Biomedical Chemistry, 119121 Moscow, Russia
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2024, 25(23), 12525; https://doi.org/10.3390/ijms252312525
Submission received: 30 September 2024 / Revised: 15 November 2024 / Accepted: 19 November 2024 / Published: 21 November 2024
(This article belongs to the Special Issue Protein Structure Research 2024)

Abstract

The accurate prediction of secondary structures of proteins (SSPs) is a critical challenge in molecular biology and structural bioinformatics. Despite recent advancements, this task remains complex and demands further exploration. This study presents a novel approach to SSP prediction using atom-centric substructural multilevel neighborhoods of atoms (MNA) descriptors for protein molecular fragments. A dataset comprising over 335,000 SSPs, annotated by the Dictionary of Secondary Structure in Proteins (DSSP) software from 37,000 proteins, was constructed from Protein Data Bank (PDB) records with a resolution of 2 Å or better. Protein fragments were converted into structural formulae using the RDKit Python package and stored in SD files using the MOL V3000 format. Classification sequence–structure–property relationships (SSPR) models were developed with varying levels of MNA descriptors and a Bayesian algorithm implemented in MultiPASS software. The average prediction accuracy (AUC) for eight SSP types, calculated via leave-one-out cross-validation, was 0.902. For independent test sets (ASTRAL and CB513 datasets), the best SSPR models achieved AUC, Q3, and Q8 values of 0.860, 77.32%, 70.92% and 0.889, 78.78%, 74.74%, respectively. Based on the created models, a freely available web application MNA-PSS-Pred was developed.
Keywords: prediction of secondary structures of protein; MNA descriptors; sequence–structure–property relationships; MultiPASS prediction of secondary structures of protein; MNA descriptors; sequence–structure–property relationships; MultiPASS

Share and Cite

MDPI and ACS Style

Zakharov, O.S.; Rudik, A.V.; Filimonov, D.A.; Lagunin, A.A. Prediction of Protein Secondary Structures Based on Substructural Descriptors of Molecular Fragments. Int. J. Mol. Sci. 2024, 25, 12525. https://doi.org/10.3390/ijms252312525

AMA Style

Zakharov OS, Rudik AV, Filimonov DA, Lagunin AA. Prediction of Protein Secondary Structures Based on Substructural Descriptors of Molecular Fragments. International Journal of Molecular Sciences. 2024; 25(23):12525. https://doi.org/10.3390/ijms252312525

Chicago/Turabian Style

Zakharov, Oleg S., Anastasia V. Rudik, Dmitry A. Filimonov, and Alexey A. Lagunin. 2024. "Prediction of Protein Secondary Structures Based on Substructural Descriptors of Molecular Fragments" International Journal of Molecular Sciences 25, no. 23: 12525. https://doi.org/10.3390/ijms252312525

APA Style

Zakharov, O. S., Rudik, A. V., Filimonov, D. A., & Lagunin, A. A. (2024). Prediction of Protein Secondary Structures Based on Substructural Descriptors of Molecular Fragments. International Journal of Molecular Sciences, 25(23), 12525. https://doi.org/10.3390/ijms252312525

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop