Next Article in Journal
Optimization of Mulberry Extract Foam-Mat Drying Process Parameters
Next Article in Special Issue
Comprehensive Survey of Consensus Docking for High-Throughput Virtual Screening
Previous Article in Journal
An Efficient Approach to Large-Scale Ab Initio Conformational Energy Profiles of Small Molecules
Previous Article in Special Issue
Elucidation of Binding Features and Dissociation Pathways of Inhibitors and Modulators in SARS-CoV-2 Main Protease by Multiple Molecular Dynamics Simulations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Pathfinder-Driven Chemical Space Exploration and Multiparameter Optimization in Tandem with Glide/IFD and QSAR-Based Active Learning Approach to Prioritize Design Ideas for FEP+ Calculations of SARS-CoV-2 PLpro Inhibitors

by
Njabulo Joyfull Gumede
Department of Chemistry, Mangosuthu University of Technology, P.O. Box 12363, Jacobs 4026, South Africa
Molecules 2022, 27(23), 8569; https://doi.org/10.3390/molecules27238569
Submission received: 29 October 2022 / Revised: 25 November 2022 / Accepted: 30 November 2022 / Published: 5 December 2022

Abstract

:
A global pandemic caused by the SARS-CoV-2 virus that started in 2020 and has wreaked havoc on humanity still ravages up until now. As a result, the negative impact of travel restrictions and lockdowns has underscored the importance of our preparedness for future pandemics. The main thrust of this work was based on addressing this need by traversing chemical space to design inhibitors that target the SARS-CoV-2 papain-like protease (PLpro). Pathfinder-based retrosynthesis analysis was used to generate analogs of GRL-0617 using commercially available building blocks by replacing the naphthalene moiety. A total of 10 models were built using active learning QSAR, which achieved good statistical results such as an R2 > 0.70, Q2 > 0.64, STD Dev < 0.30, and RMSE < 0.31, on average for all models. A total of 35 ideas were further prioritized for FEP+ calculations. The FEP+ results revealed that compound 45 was the most active compound in this series with a ΔG of −7.28 ± 0.96 kcal/mol. Compound 5 exhibited a ΔG of −6.78 ± 1.30 kcal/mol. The inactive compounds in this series were compound 91 and compound 23 with a ΔG of −5.74 ± 1.06 and −3.11 ± 1.45 kcal/mol. The combined strategy employed here is envisaged to be of great utility in multiparameter lead optimization efforts, to traverse chemical space, maintaining and/or improving the potency as well as the property space of synthetically aware design ideas.

1. Introduction

A global pandemic, caused by what was initially referred to as the 2019 nCov virus, which started in 2019 in China and still ravages now, has wreaked havoc on humanity [1,2,3]. It was hypothesized that this virus is transmitted between humans through sneezing, coughing, and small saliva droplets released when an infected person speaks [4]. As a result, scientists were able to pinpoint the new virus as a member of the coronavirus family [5]. Later, a change in taxonomy was made from the 2019 nCoV, to the SARS-CoV-2 virus by the Corovidae study group of the International Committee on Taxonomy of Viruses [6]. The change was made since this new virus was derived from a similar type of virus, the severe acute respiratory syndrome (SARS-CoV), that led to a pandemic between 2002 and 2003 [1,6,7].
Coronaviruses were first discovered in the 1960s and were thought to be a group of related viruses causing common colds [4]. These common colds were found to be caused by viral strains such as HCoV 229E and HCoV OC43 [8]. The groups of these viruses are classified as α-, β-, γ-, and δ-CoVs from the point they were discovered in the 1960s [1,9]. Coronaviruses are single-stranded RNA-enveloped viruses that infect the host leading to respiratory, gastrointestinal, and neurological diseases [1,3,8,10]. CoVs throughout history have been transmitted from animals to humans through a zoonotic transmission event [1,11]. SARS-CoV-2 is different from the other related viruses in its class in that it is highly infectious, and the effect of a secondary attack is just as severe as the primary attack [12]. The main protease (MPpro) and papain-like protease (PLpro) are validated drug targets for the SARS-CoV-2 virus [13,14]. Inhibitors of the former work by blocking viral infection, while inhibitors of the latter block viral replication. Furthermore, PLpro is also important for innate immunity [13,15]. As such, PLpro is a clinically relevant drug target for its multiple roles, as it overcomes innate immunity by virtue of reversing processes related to ubiquitination and ISGylation [7,13,16]. In fact, for the successful infection of the host by the virus to occur, the cleavage of host proteins responsible for ubiquitination and ISGylation needs to occur, which is facilitated by PLpro enzyme activities [17,18].
Efforts have been made in the past to use molecular modeling to design PLpro inhibitors for SARS-CoV [19]. Ghosh et al. [20] and Ratia et al. [21] performed high-throughput screening assays in independent studies to screen commercial libraries of diverse drug-like molecules. The experimental results revealed that a non-covalent inhibitor GRL-0617, an analog of compound 7724772 (see Figure 1a), shows the highest level of potency. The lead compound 7724772 inhibits SARS-CoV PLpro with an IC50 of 20 µM, while the optimized analog GRL-0617 (see Figure 1b) shows an IC50 of 0,60 µM. Further studies revealed that GRL-0617 inhibits Vero E6 cell lines for SARS-CoV viral replication with an EC50 of 15 µM, without any form of cytotoxicity demonstrated [18,19]. Recent studies by Freitas et al. [7] have also revealed that GRL-0617 is more potent than an analog, which is compound 6 from their study, and inhibits SARS-CoV-2 PLpro with an IC50 of 2.4 µM, whilst compound 6 inhibits SARS-CoV-2 PLpro with an IC50 of 5.0 µM. Attempts have recently been made to use Artificial Intelligence/Machine Learning (AI/ML) tools by several research groups to design SARS-CoV-2 drug-like molecules [21,22,23]. In fact, ML tools such as deep learning have transformed drug discovery and development. The ML models are trained using molecules with known activities to the target of interest [21,23,24]. Arshia et al. [24] have recently employed a combined strategy of the De Novo based design of SARS-CoV-2 Mpro inhibitors employing deep learning, docking, and MD simulations successfully. Furthermore, Murugesan et al. [25] and Patel et al. [26], in independent studies, have successfully screened phytochemical compounds derived from medicinal plants and used docking and MD simulations to predict their binding affinities when bound to SARS-CoV-2 Mpro. With this in mind, the main aim of this study is to demonstrate the synergy between active learning-based Glide docking tools in tandem with Pathfinder reaction-based enumeration tools and physics-based relative binding affinity estimation tools. This study aims to optimize GRL-0617, a lead compound targeting SARS-CoV-2 PLpro, by traversing chemical space, designing its analogs, and predicting their binding affinities using FEP+. The design ideas that are predicted to be the most active will be selected and prioritized for synthesis. Further biological activities of the design ideas will be performed in the future and will be relevant as therapeutic interventions for our preparedness for future pandemics.

2. Materials and Methods

2.1. Computational Details

Schrödinger Life-Sciences Suite 2021-2 was utilized for all computational calculations. Maestro (v12.8) was used as a Graphical User Interface comprising various, diverse Schrödinger modules. Pathfinder, LigPrep, Protein Preparation Wizard, Prime, Glide, Induced Fit Docking, Desmond for Molecular Dynamics Simulation, Maximum Common Substructural (MCS) Docking, AutoQSAR, Force-field builder, and FEP+ were used.

2.2. Reaction-Based Enumeration Using Pathfinder

The 2-D sketcher panel on Maestro was used to draw the structure of GRL-0617, a lead compound with activity against the SARS-CoV-2 PLpro drug target. Pathfinder-driven enumeration was performed on GRL-0617 with a maximum depth of 1. A total of 4 pathways were generated from a commercial library of building blocks via the retrosynthesis of known coupling reactions as follows: Pathway 1—amide_coupling_1, Pathway 2—amide_coupling_2, Pathway 3—curtius_3, and Pathway 4—Suzuki_2 (see Figure S1). In the first round of enumeration using Pathway 1, reactant 1 was retained as the original reactant and reactant 2 was selected from the reactant library of N containing heterocycles. The rest of the enumerations were composed of hydrazine aryl, primary and secondary amines, acid chloride, carboxylates, isocyanates, aryl, and vinyl halides. In all the enumeration runs, Smiles Arbitrary Target Specification (SMARTS) was used to remove ligands with reactive functional groups and Pan assay Interfering Structures (PAINS) offenders affecting the design ideas were filtered. A physicochemical filtering criterion was also used to filter design ideas that fall outside of the desired drug-like chemical space. Such as design ideas with an MW between 150 and 500, LogP between −1.50 and 5.0, TPSA between 30 and 150, HBA between 0 and 12, and HBD between 0 and 5, and a maximum number of rotatable bonds less than 10 was retained.

2.3. Ligand and Protein Preparation

2.3.1. Ligand Preparation

A total of 89,529 enumerated design ideas were divided according to their enumeration circles from 1 to 10, as explained in the previous section. The 3-D coordinates of the design ideas were prepared by subjecting them to Ligprep [27]. We employed Epik [28,29,30] to generate possible ionization states, tautomers, stereoisomers, and conformers at pH 7.4. The OPLS4 forcefield was used for the restrained minimization of the resulting conformers.

2.3.2. Protein Preparation

The X-ray crystal structure of SARS-CoV-2 PLpro in complex with GRL-0617 (PDB ID: 7JIR) with a resolution of 2.09 Å was uploaded from RCSB Protein Data Bank (PDB) [31]. Bond orders were assigned, creating zero bonds to metals, creating disulfide bonds, and filling in missing side chains and loops using Prime, [32,33,34] using the default parameters. The protonation states of PLpro and GRL-0617 were simulated at pH 7.4 using Epik [28,29,30]. Hydrogen bond assignment was optimized to sample water orientations, using crystal symmetry, and minimizing hydrogens of altered species. An interactive optimizer was performed at pH 7.0 using PROPKA [29] to minimize steric hindrance. Waters within 3 Å of the co-crystallized ligand in the active site were removed. Restrained minimization was undertaken by using a customized version of OPLS4 force field [27,28,29,30,31,32,33,34,35,36,37] to converge heavy atoms at a Root-Mean-Square Deviation (RMSD) of 0.30 Å.

2.4. Glide SP Docking and Auto QSAR Active Learning Models

The prepared structure of PLpro described in Section 2.3.2 was used as an initial structure for receptor grid generation. The co-crystalized structure of GRL-0617 was used to identify the active site of PLpro for positional constraints. The hydrogen bond constraints were employed by selecting all hydrogen bonds that the ligand makes with the amino acids of PLpro. Rotatable groups of the amino acids of PLpro were selected including the excluded regions in the receptor that the ligand could not occupy. Glide docking [38], employing the SP mode, was used to place the enumerated compounds in the active site cavity of PLpro. Furthermore, Hydrogen bond constraints from the receptor grid were selected for docking. Finally, all default parameters for Glide docking were used as they were. A total of 10 rounds of Glide SP docking were performed on enumerated compounds to filter compounds that do not bind with PLpro and those that exhibited weak binding when predicted with the Auto QSAR models [39]. The Auto QSAR models using the enumerated dataset were trained, tested, and re-trained using the enumerated dataset. This process continued until some sort of convergence was attained after 10 rounds of active learning Glide SP Auto QSAR modeling.

2.5. Induced Fit Docking (IFD) to Screen the Library of Enumerated Design Ideas

Top-scoring design ideas that were screened with Glide SP were used as starting structures for IFD [40,41] calculations. The co-crystalized ligand was picked to mark the active site cavity of PLpro using an enclosing box in the prepared structure of 7JIR. A standard protocol for docking was selected, to generate 20 poses. Hydrogen bond constraints were applied and were selected from those between GRL-0617 and amino acids such as Asp164, Tyr273, and Gln269 in the active site cavity of PLpro. Default parameters for ligand conformational sampling during the first round of Glide docking were employed in IFD. Glide redocking was performed using the XP mode in the IFD panel. Rank ordering of the top-scoring poses was performed by using the procedures that we developed in prior publications [42,43].

2.6. Molecular Dynamics Simulations

Molecular dynamics simulations (MDS) [44,45] were performed on 7JIR using the Desmond package on Maestro v2.8, by setting up the system first. A predefined SPC explicit solvent system was enclosed in an orthorhombic box shape. The solvation model was placed in an enclosing box with a diameter of 10 Å. The model system was then neutralized by adding a NaCl salt solution at a concentration of 0.15 M. The minimization of the system was undertaken by using OPLS4 force field. Once the model system was created the output file was uploaded into the MDS panel on Maestro. The MD simulation time was set at 100 ns and a simulation trajectory of 100 ps with a total of 1000 frames. The NPT ensemble class was selected at a temperature of 300 K and a pressure of 1.01 bar.

2.7. Maximum Common Substructural (MCS) Docking

MCS docking [46] was performed using the MCS docking panel on Maestro to align all the structures of design ideas prioritized for FEP+, using the last 2ns frame of 7JIR generated by MDS in Section 2.6 above. The structure of GRL-0617 was extracted from the 2ns MDS frame and used as a reference compound in MCS docking. The structures of the selected design ideas were extracted as input structures from the IFD pose with the highest docking score.

2.8. Free Energy Perturbation Plus (FEP+) to Predict the Relative Binding Affinities of the Design Ideas

Prior to FEP+ [47,48,49,50] calculation, the MCS poses from Section 2.7 above were used as starting structures for FEP+ calculations. FEP+ methodology was performed by first using the last frame of MDS for 7JIR as an input structure for the protein, where 7JIR was split into the protein, ligand, and water. Then the unliganded 7JIR was added to the project table where all ligands’ outputs from the MCS docking were placed. The Force Field builder panel was used to sample the force field parameters, such as torsions, for the enumerated design ideas that are absent in OPLS4 force field. Ideally, Force Field builder employs quantum mechanical simulations available in Jaguar to fit the torsion parameters simulated and is then compared to those obtained from OPLS4 force field [51,52], since newly designed chemical entities usually contain torsion parameters that are not specifically represented by parameters in forcefield databases [53]. A ligand health check in the FEP+ panel was performed to establish whether the ligands fit the criteria of having new torsional parameters explicit. Relative binding affinity calculations were performed by using an FEP+ panel on Maestro. FEP+ is based on the Replica Exchange with Solution Tempering (REST) using an MD algorithm that utilizes OPLS4 force field and is available on the Desmond package [51,52,53]. The FEP+ simulation parameters used included the NVT ensemble, a simulation time of 20ns, and a random seed of 2007. Several λ windows started with a default of 12 λ windows, followed by core hopping with 16 λ windows, and lastly charged hopping with 16 λ windows. A cycle closure technique was used to generate a perturbation map, which was applied to attain the convergence error estimates of relative binding affinities, at a simulation window of 20 ns. GRL-0617 was used as a reference compound with known experimental IC50 to PLpro. As such, FEP+ is a well-established method for reliably predicting the relative and/or absolute binding affinities of novel compounds. In the literature, various scientists have prospectively and retrospectively demonstrated the great utility of FEP+ in hit-to-lead identification and lead optimization efforts [51,52,53,54,55,56]. Furthermore, a detailed theory behind the FEP+ method and the detailed protocols can be obtained from the original publications that were also adopted in this work [47,48,49,50,53,55,56,57,58,59,60,61].

3. Results and Discussions

3.1. Pathfinder Reaction-Based Enumeration

In the Pathfinder reaction-based enumeration panel, the structure of GRL-0617 was uploaded to rapidly explore the chemical space of the library of building blocks. The aim was to explore the structure–activity relationship and potency of the design ideas. Furthermore, the multiparameter optimization of the lead compound is also aimed at improving and/or maintaining the physicochemical and Adsorption Distribution Metabolism and Excretion (ADME) properties important in drug discovery and development. Various design test cycles (Figure 2) were performed with the aim of yielding optimum design ideas, using the method adapted from Konze et al. [54] and Ghanakoda et al. [61]. All 10 rounds of ideation involved enumerating building blocks such as Nitrogen containing-heterocycles (1st round of enumeration), primary and secondary amines (2nd round of enumeration), aryl and vinyl halides (3rd round of enumeration), acid chlorides (4th round of enumeration), carboxylates (R1) and primary and secondary amines (R2) (5th round of enumeration), carboxylates only (7th round of enumeration), Isocyanates (8th round of enumeration), and carboxylates and trifluoroborates (9th and 10th round of enumerations). Appropriate physicochemical, SMARTS, and PAINS filters were employed on the Pathfinder panel (Figure S1).

3.2. Glide SP Docking and Filtering of Enumerated Design Ideas

Glide SP docking employing hydrogen bond and core constraint was used to place the design ideas in the active site cavity of a prepared grid file for PLpro. Different rounds of Glide SP docking calculations were performed, informed by the number of enumerations runs, which generated diverse ideas with the hope of expanding the chemical space of PLpro inhibitors. Firstly, ligprep generated 30,514 output structures of Nitrogen-containing heterocycle design ideas. This was followed by filtering design ideas based on a proprietary set of SMARTS patterns and PAINS filters. This step was performed since the conversion of structures from 2D to 3D in some cases generates improper tautomers [60]. Design ideas with a formal charge of zero were retained, generating neutral drug-like design ideas including GRL-0617 and its 10 analogs with known experimental IC50 values from the literature for further Glide SP docking. The positive controls used were analogs of compound 7724772 (Figure 1a) and were designed together with GRL-0617 and tested for bioanalysis using the same assay. These were included because they already have the IC50 values, so they could train, test, re-train, and re-test the QSAR models. A total of 10 rounds of Glide SP docking facilitated by active learning QSAR models were performed. This was accomplished by training and testing QSAR model_1 up to QSAR model_10 to predict the docking scores of design ideas. The docking of Nitrogen-containing heterocycles generated 638 poses that fit the active site cavity of PLpro. The Glide SP docking of a hydrazine-aryl-containing subset generated a total of 997 poses, and 88 poses were selected. The Glide SP docking of an acid chlorides-containing subset generated a total of 996 poses, and only 65 poses were selected. Furthermore, the Glide SP docking of acid chlorides and primary and secondary amine ideas to PLpro generated 582 poses, and 55 of those were selected. The Glide SP docking of aryl and vinyl halides generated 963 poses, and 222 of those were selected. Furthermore, the Glide SP docking of carboxylates and primary and secondary amine series generated 900 poses, and 92 of those were selected. The Glide SP docking of a carboxylate-only series generated 557 poses, of which 147 were selected. The Glide SP docking screening of design ideas containing isocyanate building blocks generated 146 top-scoring poses that were selected. These 10 rounds of screening the design ideas with Glide SP docking and predicting their docking scores using various Auto QSAR models prior to docking demonstrated the capabilities of the active learning approach coupled with docking to screen a large database of enumerated compounds against the PLpro receptor (Figure 2).

3.3. Auto QSAR-Based Active Learning Models to Prioritize Ideas for Glide/IFD XP Docking

As previously discussed in Section 3.3 above, the Auto QSAR models were trained on Pathfinder-generated molecules that were filtered by Glide SP docking based on their ability to bind with PLpro. An active learning approach was adopted to develop models that were used to filter enumerated compounds and select compounds that would further be docked with a more in-depth scoring function such as IFD (Figure 2). The models were built to further assess their ability to predict the Glide SP docking scores (an independent variable) against physicochemical properties and topological descriptors (dependent variables) retrieved from Canvas in situ. Multivariate statistical methods such as Multiple Linear Regression (MLR), Partial Least Squares Regression (PLS), and Kernel-based PLS (KPLS) were employed on the enumerated compounds. The descriptors used include binary fingerprints i.e., radial, linear, dendritic, and 2-D molecular prints. These were used in combination with numeric descriptors such as topographical, physicochemical, and ligand filters. A numerical rather than a categorical model was applied to the Auto QSAR panel to generate and build the Auto QSAR models.
Table 1 details the models built using the subsets of enumerated design ideas, accompanied by their statistical parameters following an active learning approach. In this approach, all models reported in Table 1 were trained to learn to predict the docking scores of a diverse set of design ideas targeting SARS-CoV-2 PLpro. The models, Auto QSAR _model_1 up to Auto QSAR _model_10, were generated using KPLS, but they differed in terms of binary fingerprints, since the enumerated design ideas have different building blocks attached in place of the naphthalene moiety (Table 1 and Table 2). The most significant models with good R2 > 0.70, Q2 > 0.64, STDDEV < 0.30, and RMSE < 0.31 were models 6, 7, 8, and 10 (Table 1). Hence, models 8 and 10 were selected to predict the binding affinities of the compounds that remained and did not undergo Glide SP docking before all the QSAR models were built. Auto QSAR _model_1 was trained with design ideas exhibiting Glide SP docking scores ranging between −7.4 and −8.4 kcal/mol. Given that the Auto QSAR plot displayed some noise in this model, a decision was made to change the activity data to range from −6.00 and −8.9 kcal/mol for Auto QSAR _model_2 (see Figure S2a,b). An improved regression coefficient was observed for Auto QSAR _model_3. It was noticed that the dynamic range was narrow and as expected, the correlation coefficient is poor. Therefore, a decision was made to add more molecules in the low-activity range of −6.0 kcal/mol. The next model was then able to predict the activity of the compounds in this range. As can be seen for Auto QSAR _model_4 up to Auto QSAR _model_7, and even with Auto QSAR _model _9, the same upward trend was observed (see Figure S2c–h). The challenge now was to improve the ability of the models to predict the activities of the top-scoring compounds.
A plot showing activity (observed) vs. activity (predicted) is presented in Figure 3 for the most optimal active learning Auto QSAR models 8 and 10. It is evident that Auto QSAR model_8 has some noisy data in the region between −6.5 kcal/mol and −9.5 kcal/mol. Therefore, it was then important to extend the range of the docking scores from −3.5 kcal/mol to −9.5 kcal/mol in Auto QSAR _model_10. The training data was improved with little noise in the region from −5 kcal/mol to −8.5 kcal/mol. Therefore, Auto QSAR _model_10 was able to predict the docking scores of the design ideas in the region between −5 kcal/mol and −8.5 kcal/mol, though it struggled to predict the activities of compounds with docking scores less than −5 kcal/mol. This is expected, as the model was not trained in the least active design ideas. Auto QSAR_model_10 was used to predict the docking scores of all the remaining compounds from all the rounds of enumerations. A total of 1252 poses were retrieved by docking the remaining design ideas against PLpro. Next, a core-constraint IFD docking approach was performed to filter compounds that do not meet the criteria. A total of 35 poses were selected with binding modes consistent with the native binding mode of co-crystalized GRL-0617 to SARS-CoV-2 PLpro (PDB ID: 7JIR). This criterion further included selecting compounds with good docking scores, a good E-model score, and a good IFD score (Table 2), even though IFD did not manage to capture the binding mode of GRL-0617, our reference compound. Therefore, the compounds in Table 2 are ranked and ordered according to their docking scores. Each of the compounds generated a total of five poses and the top-scoring poses for each of the design ideas were selected and reported here.
The visual inspection of binding poses for decision-making purposes in drug discovery projects is very important [61]. Furthermore, the selection of the correct binding pose in docking is very important not only for decision-making purposes, but also for docking post-processing procedures such as MD simulations, and FEP+ simulations [46,51]. Therefore, in the literature, scientists have used procedures such as MD simulations employing binding pose metadynamics, [62] MD simulations, [14,63,64] and more recently, MCS docking, [50] including the most exhaustive and premium IFD-MD module [65], as docking post-processing procedures. This is aimed at modeling the correct native binding mode of a compound designed by De Novo to its prospective target.
Figure 4 details the distribution of docking scores to measure the activities of the design ideas that were prioritized for further relative binding affinity estimation using FEP+. Most of the design ideas exhibited docking scores ranging from −8 kcal/mol to −10 kcal/mol. Interestingly, even though the Auto QSAR models were trained on compounds with docking scores ranging from −3 to −10 kcal/mol, IFD was able to capture compounds with docking scores ranging from −10 to −12 kcal/mol (see Figure 4). Interestingly, the diverse set of enumerated ideas did not yield ideas that deviated from the initial physicochemical space and that had diminished potency (see Table 2). This demonstrated the applicability of the enumerated compounds to traverse chemical space and design ideas that are potent and can further be optimized against PLpro.
Concurring with this, Gentile et al. [66] proposed a deep-docking protocol that was used to dock an ultra-large library of drug-like molecules. The deep-docking scores were used to train and predict the potency of the subset of compounds from the database. Furthermore, Boyles et al. [67] performed a comparative study of training and testing a machine-learning model based on ligand poses from co-crystalized proteins against poses generated by docking scoring functions. The results revealed that the predictive power of machine learning models derived from docking poses had the same effect as poses generated from co-crystalized ligands in terms of their predictive ability of docking scores of the models [67]. Thus, this screening criteria interactively filters ideas with undesirable properties and low-scoring compounds. As a result, this approach is gaining traction in speeding up drug discovery efforts and allows for the rapid and accurate prediction of docking scores, which facilitates the prioritization of compounds for more thorough and computationally rigorous approaches like FEP+ prior to synthesis [54,60].

3.4. Relative Binding Affinity Prediction Using FEP+

Here, MDS was used to refine the PDB structure of 7JIR prior to FEP+ calculations. Since it has been reported in the literature that the bound conformation of a ligand sought from MDS improves the prediction of relative binding affinity by FEP+ [51]. As such, the 100 ns MDS frame was used to extract the bound conformation of GRL-0617 and was used as a reference compound to perform MCS docking and align the structures of the design ideas to the GRL-0617 MDS conformation. This was possible because the 100 ns MDS frame revealed that GRL-0617 was stable during the simulation window. Ligand fluctuations with respect to the receptor were below 3 Å (Figure 5a) during the 100 ns MDS trajectory. This indicates that the ligand did not move out of the active site pocket. Moreover, this further indicates that MDS was able to reproduce the native binding conformation of GRL-0617, which is similar to that exhibited by co-crystalized GRL-0617 to 7JIR. Again, the use of MCS docking has been reported to improve the relative binding affinities for FEP+ simulation [46].
Of the 35 compounds that were selected for FEP+ analysis, only 5 were calculated by FEP+, as some were not FEP+ amenable and others were not selected because of limited license tokens for FEP+. Therefore, after the ligand and protein health check was performed, the experimental IC50 value of GRL-0617 was converted to establish the extent of the change in free energy, ΔG. A blind 20 ns FEP+ simulation of the 4 design ideas that are FEP+ amenable was performed with GRL-0617 as a reference compound. FEP mapper was used to create a map with edges based on the ligand similarities (Figure S4). The MCS docking overlay of GRL-0617, and the design ideas demonstrate the structural activity relationship of the subset of compounds (Figure 5b) in the active site cavity of PLpro. The protein–ligand contacts between GRL-0617 and SARS-CoV-2 PLpro reveal the native binding mode of GRL-0617 (Figure 5c,d). It should be noted that buried waters can decrease the binding energy, as the entropy of binding is penalized by the presence of buried waters, which should be displaced by the ligand (Figure 5d). A full-cycle closure technique was used for this 15 λ simulation window employing 20ns of MD simulation as per the procedure described in Section 2.7.
Figure 6a,b detail the plots of the free energy convergence and total change in free energy (ΔG in kcal/mol) between GRL-0617 (ligand 1) and compound 45 (ligand 2), as a function of time in the solvent and complex legs. The three plots for each leg also detail the reverse, forward, and sliding window with respect to the accumulated energy during the simulation window of the 20 ns trajectory. Therefore, the perturbation between the two ligands converged with a ΔG of −82.11 kcal/mol in the solvent leg and −81.63 kcal/mol in the complex leg. Further, the bootstrapping error estimates and analytical errors of 0.061 and 0.028 kcal/mol for the solvent leg and 0.108 and 0.028 kcal/mol for the complex leg, respectively, were achieved. Wang et al. [47], Cappel et al. [46], and Schindler et al. [68] have suggested that an accuracy of 1.4 kcal/mol is suitable in a drug discovery lead optimization stage. Concurring with this, Figure S4 shows a full-circle closure perturbation map of design ideas and GRL-0617 detailing the difference in the binding free energy, ΔΔG (pink), between two ligands in an edge/node and their associated ligand similarity scores (green). As can be seen, GRL-0617 is 0.47 kcal/mol more active than compound 45, with an error estimate of 0.88 kcal/mol, while GRL-0617 is 0.97 kcal/mol more active than compound 5, with an error estimate of 1.40 kcal/mol.
On the other hand, GRL-0617 is 2.01 kcal/mol more active than compound 91, with an error estimate of 0.99 kcal/mol. Furthermore, GRL-0617 is 4.64 kcal/mol more active than compound 23, with an error estimate of 1.40 kcal/mol. On the other hand, compound 45 is 0.50 kcal/mol more active than compound 5, with an error estimate of 0.88 kcal/mol, whilst compound 5 is 1.04 kcal/mol more active than compound 91, with an error estimate of 1.40 kcal/mol. Lastly, compound 91 is 2.63 kcal/mol more active than compound 23, with an error estimate of 1.40 kcal/mol (Figure S4).
Figure 6c,d, on the other hand, detail the histogram and ligand interaction diagram for the two ligands for the 15 endpoint λ replicas. This essentially demonstrates the percentage of the interaction between ligands 1 and 2, and PLpro. The ligand interaction diagram shows the hydrogen bond between the carbonyl group of GRL-0617 and Gln269, which occurs for 100% of the simulation window. A hydrogen bond between the NH group of GRL-0617 and Asp164 occurs for 50% of the simulation time, (Figure 6d) as is shown below, even though the MD simulation did not capture this hydrogen bonding between Asp164 and the NH group of GRL-0617 (Figure 5d). However, the binding mode is maintained between the active site cavity of PLpro and GRL-0617. Furthermore, the π–π interaction network between 5-amino-2-methyl benzamide moiety and Tyr268, including the naphthalene moiety is shown. The ligand interaction diagram in Figure 6d also revealed the binding mode of compound 45 in the active site cavity of PLpro. There is a hydrogen bond between the carbonyl group in compound 45 and Gln269, which also occurs for 100% of the simulation time. There is a water-mediated hydrogen bond between the NH group of benzimidazole moiety and Gly266, which occurs for 41% of the simulation time. There is also another water-mediated hydrogen bond between the oxygen of the methyl sulfonyl moiety and Tyr268, which occurs for 21% of the simulation time. There is a further π–π interaction between Tyr268 and the benzene rings of 5-amino-2-methyl benzamide moiety and benzimidazole moieties.
The free energy convergence and total change in free energy (ΔG in kcal/mol) for compound 5 (ligand 1) and compound 91 (ligand 2) can be seen in Figure S3a,b. Again, the perturbation between the two ligands converged with a ΔG of −58.81 kcal/mol in the solvent leg and −59.34 kcal/mol in the complex leg. Further, the bootstrapping error estimates and analytical errors of 0.050 and 0.039 for the solvent leg and 0.195 and 0.039 for the complex leg, respectively, were achieved. Figure S3c,d show the histogram and ligand interaction diagram for the two ligands for the 15 endpoint λ replicas. The carbonyl group in the compound 5 hydrogen bonds with Gln269 for 100% of the simulation time and Tyr264 for 46% of the simulation time. Further, the hydrogen bond between the NH group of the benzamide moiety hydrogen bonds with Asp164 for 81% of the simulation time. The amino acid Asp164 and the Tyr273 hydrogen bond with the NH2 group in compound 5, which occurs for 51% and 61% of the simulation time, respectively. A π–π interaction network between the aromatic ring of the benzamide moiety and Tyr264 including Tyr268 is observed.
Table S1 revealed that GRL-0617 exhibited a relative binding affinity estimate of −7.75 ± 0.40 kcal/mol which is consistent with the experimental binding affinity of −7.75 kcal/mol. Therefore, this explains the accuracy of FEP+ in reproducing the experimental binding affinity of this reference compound. Compound 45 in this series was the most active compound with a ΔG of −7.28 ± 0.96 kcal/mol. Compound 5 followed next with a ΔG of −6.78 ± 1.30 kcal/mol. The inactive compounds in this series were compound 91 and compound 23 with a ΔG of −5.74 ± 1.06 and −3.11 ± 1.45. In a drug discovery and development campaign, an error estimate of >1 kcal/mol is not acceptable [47,53,68]. However, the binding affinity error estimate can be improved by increasing the simulation window and using the MDS to model the system for starting structures prior to FEP+ simulations.
Figure S5 shows buried waters in the node representation of GRL-0617 and in compound 45’s binding poses predicted by FEP+, in the active site cavity of PLpro. We suggest the use of Water map to model the effect of active site waters on the enthalpy-entropy compensation. This will in turn, allow the determination of favorable and unfavorable waters that can be displaced by inserting functional groups that displace unfavorable waters to achieve a gain in potency [69]. Several researchers in the industry and in academia have used Water map for this purpose prior to FEP+ calculations with the aim of improving the relative binding affinities of the design ideas and repurposed drugs [70], an approach that we aim to follow for future studies in this research area. As such, the design of novel chemical matter containing 2-[2-(methanesulfonyl)propan-2-yl]-1H-benzimidazole, 1,3-dimethyl-1,3-dihydro-2H-benzimidazol-2-one, 3-[(S)-amino(4-fluorophenyl)methyl]-1,2,4-oxadiazol-5(4H)-one, and (S)-1-(4-methylphenyl)-1-(1H-pyrazol-3-yl)methanamine moieties, instead of the naphthalene moiety that is in GRL-0617, was successfully performed. Therefore, a combined strategy to demonstrate the synergy between active learning-based Glide docking tools in tandem with Pathfinder reaction-based enumeration tools and physics-based relative binding affinity estimation tools was successfully employed in this study.

4. Conclusions

The work reported here describes the use of Pathfinder-driven reaction-based enumeration to traverse chemical space by inserting building blocks and replacing the naphthalene moiety of GRL-0617, a lead compound, as an inhibitor of SARS-CoV-2 PLpro. This was followed by using PAINS, SMARTS, and physicochemical filters on the enumerated ideas. A subset of randomly selected ideas was placed into the active site cavity of PLpro utilizing Glide SP, this was performed to filter ideas that do not bind with PLpro. Ideas with a docking score between −3 kcal/mol to −10.7 kcal/mol were selected to build the Auto QSAR models. The models were then used to predict the docking scores of the enumerated compounds. This process was repeated ten times, where the models were trained on compounds that were enumerated, tested, and re-scored. Interestingly, the models that were built and demonstrated good statistical results on average were models 6, 7, 8, and 10, with an R2 > 0.70, Q2 > 0.64, STDDEV < 0.30, and RMSE < 0.31. Furthermore, Auto QSAR model_10, was used to finally predict the docking scores of the candidate compounds from each of the rounds of enumerations. The top-scoring compounds were then selected for IFD calculations employing a core constraint. IFD generated a total of 1 252 poses and 35 of those poses were selected for FEP+ calculations based on visual inspection, IFD score, docking score, and E-model score.
MDS and MCS docking shed some light on the binding mode and structure–activity relationships of the design ideas through their alignment with the reference compound GRL-0617. Due to limitations with respect to license tokens for FEP+ webservices, we selected only 5 compounds to run the FEP+ calculations. The FEP+ results revealed that compound 45 and compound 5 were the most active compounds in the series with ΔG of −7.28 ± 0.96 kcal/mol, and −6.78 ± 1.30 kcal/mol. These binding affinities were due to strong hydrogen bonds, π–π interactions, and hydrophobic interactions, which were revealed by FEP+ analysis. The design ideas predicted here will be synthesized and tested in the future for their biological activities against wild-type PLpro and viral cell lines containing various splice variations, and mutants of the SARS-CoV-2 spike protein. This will in turn help in preparing for future pandemics and the associated variants of SARS-CoV-2 and related viruses. Therefore, these combined strategies for lead optimization have great potential for exploring the unexplored regions of chemical space, with the hope of designing potent compounds with desirable physicochemical properties, which is important in drug discovery and development.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/molecules27238569/s1, Figure S1: Reaction based enumeration generated pathway for GRL-0617; Figure S2: Plot of Active learning Glide SP and Auto QSAR models to predict the binding affinities of enumerated compounds (a) Auto QSAR_model_1 with an R2 of 0.4310 and a Q2 of 0.1571 (b) Auto QSAR_model_2 with an R2 of 0.6262 and a Q2 of 0.5929 (c) Auto QSAR_model_3 with an R2 of 0.6090 and a Q2 of 0.5630 (d) Auto QSAR_model_4 with an R2 of 0.6227 and a Q2 of 0.5793 (e) Auto QSAR_model_5 with an R2 of 0.6980 and a Q2 of 0.6450 (f) Auto QSAR_model_6 with an R2 of 0.7084 and a Q2 of 0.6676 (g) Auto QSAR_model_7 with an R2 of 0.7086 and a Q2 of 0.6910 (h) Auto QSAR_model_9 with an R2 of 0.6718 and a Q2 of 0.6476; Figure S3: FEP+ simulation results for relative binding affinity prediction for the perturbation cycle of compound 5 (ligand 1) and compound 45 (ligand 2). (a) Free energy convergence of the perturbation cycle in solvent leg. (b) Free energy convergence of the perturbation cycle in complex leg. (c) Histogram depicting protein-ligand interactions for endpoint λ-replicas. (d) Ligand interaction diagram detailing the type of interactions observed between the two ligands and the receptor; Figure S4: A circle closure perturbation map of design ideas and GRL-0617 detailing the difference in binding free energy, ΔΔG (pink) and its associated error estimate between two ligands in an edge/node, as well as their associated ligand similarities scores (green); Figure S5: FEP+ node representation for GRL-0617 (grey carbons) on the left and compound 45 (green carbons) on the right frame in the active site cavity of PLpro with favourable and unfavourable waters shown; Table S1: Relative binding affinity prediction and its associated predicted error in kcal/mol of the design ideas and GRL-0617 as a reference compound against SARS-CoV-2 PLpro using FEP+.

Funding

No funding was received directly to perform this work. However, the token licenses for FEP+ webservices used were purchased from an SAMRC’s Capacity Development Initiative grant.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

N.J.G. acknowledges SAMRC’s Capacity Development Initiative grant that was used to purchase the software token license utilizing webservices for FEP+. The Centre for High Performance Computing is acknowledged for providing HPC infrastructure.

Conflicts of Interest

The author declares no conflict of interest.

Sample Availability

Samples of the compounds are available on request from the authors.

References

  1. Ou, X.; Liu, Y.; Lei, X.; Li, P.; Mi, D.; Ren, L.; Guo, L.; Guo, R.; Chen, T.; Hu, J.; et al. Characterization of spike glycoprotein of SARS-CoV-2 on virus entry and its immune cross-reactivity with SARS-CoV. Nat. Commun. 2020, 11, 1620. [Google Scholar] [CrossRef] [Green Version]
  2. Crotti, L.; Arbelo, E. COVID-19 Q2 treatments, QT interval, and arrhythmic risk: The need for an international registry on arrhythmias. Heart Rhythm. 2020, 17, 1423–1424. [Google Scholar] [CrossRef] [PubMed]
  3. Ton, A.T.; Gentile, F.; Hsing, M.; Ban, F.; Cherkasov, A. Rapid Identification of Potential Inhibitors of SARS-CoV-2 Main Protease by Deep Docking of 1.3 billion Compounds. Mol. Inf. 2020, 39, 2000028. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. De Savi, C.; Hughes, D.L.; Kvaerno, L. Quest for a COVID-19 Cure by Repurposing Small-Molecule Drugs: Mechanism of Action, Clinical Development, Synthesis at Scale, and Outlook for Supply. Org. Process. Res. Dev. 2020, 24, 940–976. [Google Scholar] [CrossRef]
  5. Saha, A.; Sharma, A.R.; Bhattacharya, M.; Sharma, G.; Lee, S.S.; Chakraborty, C. Tocilizumab: A Therapeutic Option for the Treatment of Cytokine Storm Syndrome in COVID-19. Arch. Med. Res. 2020, 51, 595–597. [Google Scholar] [CrossRef]
  6. Gorbalenya, A.E.; Baker, S.C.; Baric, R.S.; de Groot, R.J.; Drosten, C.; Gulyaeva, A.A.; Haagmans, B.L.; Lauber, C.; Leontovich, A.M.; Neuman, B.W.; et al. The species severe acute respiratory syndrome related coronavirus: Classifying 2019-nCoV and naming it SARS-CoV-2. Nat. Microbiol. 2020, 536, 536–544. [Google Scholar]
  7. Freitas, B.T.; Durie, I.A.; Murray, J.; Longo, J.E.; Miller, H.C.; Crich, D.; Hogan, R.J.; Tripp, R.A.; Pegan, S.D. Characterization and Noncovalent Inhibition of the Deubiquitinase and deISGylase Activity of SARS-CoV-2 Papain-Like Protease. ACS Infect. Dis. 2020, 6, 2099–2109. [Google Scholar] [CrossRef]
  8. Eastman, R.T.; Roth, J.S.; Brimacombe, K.R.; Simeonov, A.; Shen, M.; Patnaik, S.; Hall, M.D. Remdesivir: A Review of Its Discovery and Development Leading to Emergency Use Authorization for Treatment of COVID-19. ACS Cent. Sci. 2020, 6, 672–683. [Google Scholar] [CrossRef]
  9. Liu, C.; Zhou, Q.; Li, Y.; Garner, L.V.; Watkins, S.P.; Carter, L.J.; Smoot, J.; Gregg, A.C.; Daniels, A.D.; Jervey, S.; et al. Research and Development on Therapeutic Agents and Vaccines for COVID-19 and Related Human Coronavirus Diseases. ACS Cent. Sci. 2020, 6, 315–331. [Google Scholar] [CrossRef]
  10. Sheahan, T.P.; Sims, A.C.; Leist, S.R.; Schäfer, A.; Won, J.; Brown, A.J.; Montgomery, S.A.; Hogg, A.; Babusis, D.; Clarke, M.O.; et al. Comparative therapeutic efficacy of remdesivir and combination lopinavir, ritonavir, and interferon beta against MERS-CoV. Nat. Commun. 2020, 11, 222. [Google Scholar] [CrossRef] [Green Version]
  11. Simmons, G.; Gosalia, D.N.; Rennekamp, A.J.; Reeves, J.D.; Diamond, S.L.; Bates, P. Inhibitors of cathepsin L prevent severe acute respiratory syndrome coronavirus entry. Proc. Natl. Acad. Sci. USA 2005, 33, 11876–11881. [Google Scholar] [CrossRef]
  12. Liu, T.; Luo, S.; Libby, P.; Shi, G.P. Cathepsin L-selective inhibitors: A potentially promising treatment for COVID-19 patients. Pharmacol. Ther. 2020, 213, 107587. [Google Scholar] [CrossRef] [PubMed]
  13. Li, D.; Luan, J.; Zhang, L. Molecular docking of potential SARS-CoV-2 papain-like protease Inhibitors. Biochem. Biophys. Res. Commun. 2021, 538, 72–79. [Google Scholar] [CrossRef] [PubMed]
  14. Ma, C.; Sacco, M.D.; Xia, Z.; Lambrinidis, G.; Townsend, J.A.; Hu, Y.; Meng, X.; Szeto, T.; Ba, M.; Zhang, X.; et al. Discovery of SARS-CoV-2 Papain-like Protease Inhibitors through a Combination of High-Throughput Screening and a FlipGFP-Based Reporter Assay. ACS Cent. Sci. 2021, 7, 1245–1260. [Google Scholar] [CrossRef] [PubMed]
  15. Cannalire, R.; Cerchia, C.; Beccari, A.R.; Di Leva, F.S.; Summa, V. Targeting SARS-CoV-2 Proteases and Polymerase for COVID-19 Treatment: State of the Art and Future Opportunities. J. Med. Chem. 2022, 65, 2716–2746. [Google Scholar] [CrossRef]
  16. Gao, X.; Qin, B.; Chen, P.; Zhu, K.; Hou, P.; Wojdyla, J.A.; Wang, M.; Cui, S. Crystal structure of SARS-CoV-2 papain-like Protease. Acta Pharm. Sin. B 2021, 11, 237–245. [Google Scholar] [CrossRef]
  17. Báez-Santos, Y.M.; Barraza, S.J.; Wilson, M.W.; Agius, M.P.; Mielech, A.M.; Davis, N.M.; Baker, S.C.; Larsen, S.D.; Mesecar, A.D. X-ray Structural and Biological Evaluation of a Series of Potent and Highly Selective Inhibitors of Human Coronavirus Papain-like Proteases. J. Med. Chem. 2014, 57, 2393–2412. [Google Scholar] [CrossRef] [Green Version]
  18. Ghosh, A.K.; Takayama, J.; Aubin, Y.; Ratia, K.; Chaudhuri, R.; Baez, Y.; Sleeman, K.; Coughlin, M.; Nichols, D.B.; Mulhearn, D.C.; et al. Structure-Based Design, Synthesis, and Biological Evaluation of a Series of Novel and Reversible Inhibitors for the severe acute respiratory syndrome-Coronavirus Papain-Like Protease. J. Med. Chem. 2009, 52, 5228–5240. [Google Scholar] [CrossRef] [Green Version]
  19. Ratia, K.; Pegan, S.; Takayama, J.; Sleeman, K.; Coughlin, M.; Baliji, S.; Chaudhuria, R.; Fu, W.; Prabhakar, B.S.; Johnson, M.E.; et al. A noncovalent class of papain-like protease/deubiquitinase inhibitors blocks SARS virus replication. Proc. Natl. Acad. Sci. USA 2008, 105, 16119–16124. [Google Scholar] [CrossRef] [Green Version]
  20. Chaudhuri, R.; Tang, S.; Zhao, G.; Lu, H.; Case, D.A.; Johnson, M.E. Comparison of SARS and NL63 Papain-Like Protease Binding Sites and Binding Site Dynamics: Inhibitor Design Implications. J. Mol. Biol. 2011, 414, 272–288. [Google Scholar] [CrossRef]
  21. Arshadi, A.K.; Webb, J.; Salem, M.; Cruz, E.; Calad-Thomson, S.; Ghadirian, N.; Collins, J.; Diez-Cecilia, E.; Kelly, B.; Goodarzi, H.; et al. Artificial Intelligence for COVID-19 Drug Discovery and Vaccine Development. Front. Artif. Intell. Appl. 2020, 3, 65. [Google Scholar] [CrossRef] [PubMed]
  22. Wallach, I.; Dzamba, M.; Heifets, A. AtomNet: A Deep Convolutional Neural Network for Bioactivity Prediction in Structure-Based Drug Discovery. arXiv 2015, arXiv:1510.02855. [Google Scholar]
  23. Sugiyama, M.G.; Cui, H.; Redka, D.S.; Karimzadeh, M.; Rujas, E.; Maan, H.; Hayat, S.; Cheung, K.; Misra, R.; McPhee, J.B.; et al. Multiscale interactome analysis coupled with off-target drug predictions reveals drug repurposing candidates for human coronavirus disease. Sci. Rep. 2021, 11, 23315. [Google Scholar] [CrossRef] [PubMed]
  24. Arshia, A.H.; Shadravan, S.; Solhjoo, A.; Sakhteman, A.; Sam, A. De novo design of novel protease inhibitor candidates in the treatment of SARS-CoV-2 using deep learning, docking, and molecular dynamic simulations. Comput. Biol. Med. 2021, 139, 104967. [Google Scholar] [CrossRef]
  25. Murugesan, S.; Kottekad, S.; Crasta, I.; Sreevathsan, S.; Usharani, D.; Perumal, M.K.; Mudliar, S.N. Targeting COVID-19 (SARS-CoV-2) main protease through active phytocompounds of ayurvedic medicinal plants—Emblica officinalis (Amla), Phyllanthus niruri Linn. (Bhumi Amla) and Tinospora cordifolia (Giloy)—A molecular docking and simulation study. Comput. Biol. Med. 2021, 136, 104683. [Google Scholar] [CrossRef]
  26. Patel, C.N.; Jani, S.P.; Jaiswal, D.G.; Kumar, S.P.; Mangukia, N.; Parmar, R.M.; Rawal, R.M.; Pandya, H.A. Identification of antiviral phytochemicals as a potential SARS-CoV-2 main protease (Mpro) inhibitor using docking and molecular dynamics simulations. Sci. Rep. 2021, 11, 20295. [Google Scholar] [CrossRef]
  27. LigPrep; Schrödinger, L.L.C: New York, NY, USA, 2021.
  28. Epik; Schrödinger, L.L.C: New York, NY, USA, 2021.
  29. Greenwood, J.R.; Calkins, D.; Sullivan, A.P.; Shelley, J.C. Towards the comprehensive, rapid, and accurate prediction of the favourable tautomeric states of drug-like molecules in aqueous solution. J. Comput. Aided. Mol. Des. 2010, 24, 591–604. [Google Scholar] [CrossRef]
  30. Shelley, J.C.; Cholleti, A.; Frye, L.; Greenwood, J.R.; Timlin, M.R.; Uchimaya, M. Epik: A software program for pKa prediction and protonation state generation for drug-like molecules. J. Comp. Aided. Mol. Des. 2007, 21, 681–691. [Google Scholar] [CrossRef]
  31. Osipiuk, J.; Azizi, S.A.; Dvorkin, S.; Endres, M.; Jedrzejczak, R.; Jones, K.A.; Kang, S.; Kathayat, R.S.; Kim, Y.; Lisnyak, V.G.; et al. Structure of papain-like protease from SARS-CoV-2 and its complexes with non-covalent inhibitors. Nat. Comm. 2021, 12, 743. [Google Scholar] [CrossRef]
  32. Prime; Schrödinger, L.L.C: New York, NY, USA, 2021.
  33. Jacobson, M.P.; Pincus, D.L.; Rapp, C.S.; Day, T.J.F.; Honig, B.; Shaw, D.E.; Friesner, R.A. A Hierarchical Approach to All-Atom Protein Loop Prediction. Proteins Struct. Funct. Genet. 2004, 55, 351–367. [Google Scholar] [CrossRef] [Green Version]
  34. Jacobson, M.P.; Friesner, R.A.; Xiang, Z.; Honig, B. On the Role of Crystal Packing Forces in Determining Protein Sidechain Conformations. J. Mol. Biol. 2002, 320, 597–608. [Google Scholar] [CrossRef] [PubMed]
  35. Jorgensen, W.L.; Maxwell, D.S.; Tirado-Rives, J. Development and Testing of the OPLS All-Atom Force Field on Conformational Energetics and Properties of Organic Liquids. J. Am. Chem. Soc. 1996, 118, 11225–11236. [Google Scholar] [CrossRef]
  36. Jorgensen, W.L.; Tirado-Rives, J. The OPLS [optimized potentials for liquid simulations] potential functions for proteins, energy minimizations for crystals of cyclic peptides and crambin. J. Am. Chem. Soc. 1988, 110, 1657–1666. [Google Scholar] [CrossRef] [PubMed]
  37. Roos, K.; Wu, C.; Damm, W.; Reboul, M.; Stevenson, J.M.; Lu, C.; Dahlgren, M.K.; Mondal, S.; Chen, W.; Wang, L.; et al. OPLS3e: Extending Force Field Coverage for Drug-Like Small Molecules. J. Chem. Theory Comput. 2019, 15, 1863–1874. [Google Scholar] [CrossRef]
  38. Glide; Schrödinger, L.L.C: New York, NY, USA, 2021.
  39. Dixon, S.L.; Duan, J.; Smith, E.; Von Bargen, C.D.; Sherman, W.; Repasky, M.P. AutoQSAR: An automated machine learning tool for best-practice quantitative structure-activity relationship modelling. Future. Med. Chem. 2016, 8, 1825–1839. [Google Scholar] [CrossRef]
  40. Induced Fit Docking Protocol; Schrödinger, L.L.C: New York, NY, USA, 2021.
  41. Farid, R.; Day, T.; Friesner, R.A.; Pearlstein, R.A. New insights about HERG blockade obtained from protein modeling, potential energy mapping, and docking studies. Bioorg. Med. Chem. 2006, 14, 3160–3173. [Google Scholar] [CrossRef]
  42. Gumede, N.J.; Singh, P.; Sabela, M.I.; Bisetty, K.; Escuder-Gilabert, L.; Medina-Hernández, M.J.; Sagrado, S. Experimental-like affinity constants and enantioselectivity estimates from flexible docking. J. Chem. Inf. Model. 2012, 52, 2754–2759. [Google Scholar] [CrossRef]
  43. Gumede, N.J.; Nxumalo, W.; Bisetty, K.; Escuder Gilabert, L.; Medina-Hernandez, M.J.; Sagrado, S. Prospective computational design, and in vitro bio-analytical tests of new chemical entities as potential selective CYP17A1 lyase inhibitors. Bioorg. Chem. 2020, 94, 103462. [Google Scholar] [CrossRef]
  44. Shaw, D.E. A fast, scalable method for the parallel evaluation of distance-limited pair wise particle interactions. J. Comput. Chem. 2005, 26, 1318. [Google Scholar] [CrossRef] [Green Version]
  45. Lippert, R.A.; Bowers, K.J.; Dror, R.O.; Eastwood, M.P.; Gregersen, B.A.; Klepeis, J.L.; Kolossvary, I.; Shaw, D.E. A common, avoidable source of error in molecular dynamics integrators. J. Chem. Phys. 2007, 126, 046101. [Google Scholar] [CrossRef] [Green Version]
  46. Cappel, D.; Jerome, S.; Hessler, G.; Matter, H. Impact of Different Automated Binding Pose Generation Approaches on Relative Binding Free Energy Simulations. J. Chem. Inf. Model. 2020, 60, 1432–1444. [Google Scholar] [CrossRef] [PubMed]
  47. Wang, L.; Wu, Y.; Deng, Y.; Kim, B.; Pearce, L.; Krilov, G.; Lupyan, D.; Robinson, S.; Dahlgren, M.K.; Greenwood, J.; et al. Accurate and Reliable Prediction of Relative Ligand Binding Potency in Prospective Drug Discovery by Way of a Modern Free-Energy Calculation Protocol and Force Field. J. Am. Chem. Soc. 2015, 137, 2695. [Google Scholar] [CrossRef] [PubMed]
  48. Wang, L.; Deng, Y.; Knight, J.L.; Wu, Y.; Kim, B.; Sherman, W.; Shelley, J.; Lin, T.; Abel, R. Modeling Local Structural Rearrangements Using FEP/REST: Application to Relative Binding Affinity Predictions of CDK2 Inhibitors. J. Chem. Theory Comput. 2013, 9, 1282. [Google Scholar] [CrossRef] [PubMed]
  49. Chen, W.; Deng, Y.; Russell, E.; Wu, Y.; Abel, R.; Wang, L. Accurate Calculation of Relative Binding Free Energies between Ligands with Different Net Charges. J. Chem. Theory Comput. 2018, 14, 6346. [Google Scholar] [CrossRef] [PubMed]
  50. Abel, R.; Wang, L.; Mobley, D.L.; Friesner, R.A. OPLS3: A Critical Review of Validation, Blind Testing, and Real-World Use of Alchemical Protein-Ligand Binding Free Energy Calculations. Curr. Top. Med. Chem. 2017, 17, 2577. [Google Scholar] [CrossRef] [PubMed]
  51. Fratevy, F.; Sirimulla, S. An Improved Free Energy Perturbation FEP+ Sampling Protocol for Flexible Ligand-Binding Domains. Sci. Rep. 2019, 9, 16829. [Google Scholar] [CrossRef] [Green Version]
  52. Moraca, F.; Negri, A.; de Oliveira, C.; Abel, R. Application of Free Energy Perturbation (FEP+) to Understanding Ligand Selectivity: A Case Study to Assess Selectivity Between Pairs of Phosphodiesterases (PDE’s). J. Chem. Inf. Model. 2019, 59, 2729–2740. [Google Scholar] [CrossRef]
  53. Abel, R.; Wang, L.; Harder, E.D.; Berne, B.J.; Friesner, R.A. Advancing Drug Discovery through Enhanced Free Energy Calculations. Acc. Chem. Res. 2017, 50, 1625–1632. [Google Scholar] [CrossRef]
  54. Konze, K.D.; Bos, P.H.; Dahlgren, M.K.; Leswing, K.; Tubert-Brohman, I.; Bortolato, A.; Robbason, B.; Abel, R.; Bhat, S. Reaction-Based Enumeration, Active Learning, and Free Energy Calculations to Rapidly Explore Synthetically Tractable Chemical Space and Optimize Potency of Cyclin-Dependent Kinase 2 Inhibitors. J. Chem. Inf. Model. 2019, 59, 3782–3793. [Google Scholar] [CrossRef]
  55. Paulsen, J.L.; Yu, H.S.; Sindhikara, D.; Wang, L.; Appleby, T.; Villaseñor, A.G.; Schmitz, U.; Shivakumar, D. Evaluation of Free Energy Calculations for the Prioritization of Macrocycle Synthesis. J. Chem. Inf. Model. 2020, 60, 3489–3498. [Google Scholar] [CrossRef]
  56. Kuhn, B.; Tichy, M.; Wang, L.; Robinson, S.; Martin, R.E.; Kuglstatter, A.; Benz, J.; Giroud, M.; Schirmeister, T.; Abel, R.; et al. Prospective Evaluation of Free Energy Calculations for the Prioritization of Cathepsin L Inhibitors. J. Med. Chem. 2017, 60, 2485–2497. [Google Scholar] [CrossRef] [PubMed]
  57. Cleves, A.E.; Johnson, S.R.; Jain, A.N. Synergy and Complementarity between Focused Machine Learning and Physics-Based Simulation in Affinity Prediction. J. Chem. Inf. Model. 2021, 61, 5948–5966. [Google Scholar] [CrossRef] [PubMed]
  58. Cournia, Z.; Allen, B.K.; Beuming, T.; Pearlman, D.A.; Radak, B.K.; Sherman, W. Rigorous Free Energy Simulations in Virtual Screening. J. Chem. Inf. Model. 2020, 60, 4153–4169. [Google Scholar] [CrossRef] [PubMed]
  59. Cournia, Z.; Allen, B.; Sherman, W. Relative Binding Free Energy Calculations in Drug Discovery: Recent Advances and Practical Considerations. J. Chem. Inf. Model. 2017, 57, 2911–2937. [Google Scholar] [CrossRef] [PubMed]
  60. Ghanakota, P.; Bos, P.H.; Konze, K.D.; Staker, J.; Marques, G.; Marshall, K.; Leswing, K.; Abel, R.; Bhat, S. Combining Cloud-Based Free-Energy Calculations, Synthetically Aware Enumerations, and Goal-Directed Generative Machine Learning for Rapid Large-Scale Chemical Exploration and Optimization. J. Chem. Inf. Model. 2020, 60, 4311–4325. [Google Scholar] [CrossRef]
  61. Fischer, A.; Smieško, M.; Sellner, M.; Lill, M.A. Decision Making in Structure-Based Drug Discovery: Visual Inspection of Docking Results. J. Med. Chem. 2021, 64, 2489–2500. [Google Scholar] [CrossRef]
  62. Clark, A.J.; Tiwary, P.; Borrelli, K.; Feng, S.; Miller, E.B.; Abel, R.; Friesner, R.A.; Berne, B.J. Prediction of Protein−Ligand Binding Poses via a Combination of Induced Fit Docking and Metadynamics Simulations. J. Chem. Theory Comput. 2016, 12, 2990–2998. [Google Scholar] [CrossRef]
  63. Lekgau, K.; Raphoko, L.A.; Lebepe, C.M.; Mongokoana, D.F.; Leboho, T.C.; Matsebatlela, T.M.; Gumede, N.J.; Nxumalo, W. Design, and synthesis of 6-amino-quinoxaline-alkynyl as potential aromatase (CYP19A1) inhibitors. J. Mol. Struct. 2022, 1255, 132473. [Google Scholar] [CrossRef]
  64. Singh, K.; Coopoosamy, R.M.; Gumede, N.J.; Sabiu, S. Computational insights, and in vitro validation of antibacterial potential of shikimate pathway-derived phenolic acids as NorA efflux pump inhibitors. Molecules 2022, 27, 2601. [Google Scholar] [CrossRef]
  65. Miller, E.B.; Murphy, R.B.; Sindhikara, D.; Borrelli, K.W.; Grisewood, M.J.; Ranalli, F.; Dixon, S.L.; Jerome, S.; Boyles, N.A.; Day, T.; et al. Reliable and Accurate Solution to the Induced Fit Docking Problem for Protein−Ligand Binding. J. Chem. Theory Comput. 2021, 17, 2630–2639. [Google Scholar] [CrossRef]
  66. Gentile, F.; Agrawal, V.; Hsing, M.; Ton, A.T.; Ban, F.; Norinder, U.; Gleave, M.E.; Cherkasov, A. Deep Docking: A Deep Learning Platform for Augmentation of Structure Based Drug Discovery. ACS Cent. Sci. 2020, 6, 939–949. [Google Scholar] [CrossRef] [PubMed]
  67. Boyles, F.; Deane, C.M.; Morris, G.M. Learning from Docked Ligands: Ligand-Based Features Rescue Structure-Based Scoring Functions When Trained on Docked Poses. J. Chem. Inf. Model. 2020, 62, 5329–5341. [Google Scholar] [CrossRef] [PubMed]
  68. Schindler, C.E.M.; Baumann, H.; Blum, A.; Böse, D.; Buchstaller, H.P.; Burgdorf, L.; Cappel, D.; Chekler, E.; Czodrowski, P.; Dorsch, D.; et al. Large-Scale Assessment of Binding Free Energy Calculations in Active Drug Discovery Projects. J. Chem. Inf. Model. 2020, 60, 5457–5474. [Google Scholar] [CrossRef]
  69. Peccati, F.; Jiménez-Osés, G. Enthalpy−Entropy Compensation in Biomolecular Recognition: A Computational Perspective. ACS Omega 2021, 6, 11122–11130. [Google Scholar] [CrossRef] [PubMed]
  70. Biswal, J.; Jayaprakash, P.; Rayala, S.K.; Venkatraman, G.; Rangaswamy, R.; Jeyaraman, J. WaterMap and Molecular Dynamic Simulation-Guided Discovery of Potential PAK1 Inhibitors Using Repurposing Approaches. ACS Omega 2021, 6, 26829–26845. [Google Scholar] [CrossRef]
Figure 1. Structures of compound 7724772 (a) IUPAC name 2-methyl-N-[1-(naphthalen-2-yl) ethyl] benzamide and its optimized derivative GRL-0617 (b) with an IUPAC name 5-amino-2-methyl-N-[1-(naphthalen-1-yl) ethyl] benzamide.
Figure 1. Structures of compound 7724772 (a) IUPAC name 2-methyl-N-[1-(naphthalen-2-yl) ethyl] benzamide and its optimized derivative GRL-0617 (b) with an IUPAC name 5-amino-2-methyl-N-[1-(naphthalen-1-yl) ethyl] benzamide.
Molecules 27 08569 g001
Figure 2. Pathfinder-driven reaction-based enumeration, active learning Glide QSAR modeling, IFD, and FEP+ screening workflow.
Figure 2. Pathfinder-driven reaction-based enumeration, active learning Glide QSAR modeling, IFD, and FEP+ screening workflow.
Molecules 27 08569 g002
Figure 3. Auto QSAR active learning plots for model 8 (a) with an R2 of 0.7579 and a Q2 of 0.7191 and model 10 (b) with an R2 of 0.7410 and a Q2 of 0.3868.
Figure 3. Auto QSAR active learning plots for model 8 (a) with an R2 of 0.7579 and a Q2 of 0.7191 and model 10 (b) with an R2 of 0.7410 and a Q2 of 0.3868.
Molecules 27 08569 g003
Figure 4. Histogram detailing the distribution of docking score (kcal/mol) for a diverse set of enumerated compounds measured using IFD, that were selected and prioritized for FEP+.
Figure 4. Histogram detailing the distribution of docking score (kcal/mol) for a diverse set of enumerated compounds measured using IFD, that were selected and prioritized for FEP+.
Molecules 27 08569 g004
Figure 5. MD simulation of x-ray crystallography solved structure 7JIR and MCS docking overlay of the design ideas using the binding mode of GRL-0617 as a reference compound. (a) Root-mean-square deviation of GRL-0617 in the active site cavity of SARS-CoV-2 PLpro obtained with an MD simulation time of 100 ns. (b) MCS docking overlay of design ideas selected prioritized and FEP+ amenable with GRL-0617 (grey carbon atoms) as a reference compound taken with the 100 ns MDS optimized frame. The carbon atoms of GRL-0617 are shown in grey, compound 45 in green, compound 91 in orange, compound 5 in faded green, and compound 23 in violet. The amino acids in the active site cavity were removed to provide a clear depiction of the overlay of the structures. (c) Histogram detailing the interaction between GRL-0617 and PLpro. (d) Ligand interaction diagram revealing hydrogen bonds, π–π, and hydrophobic interactions between GRL-0617 and PLpro.
Figure 5. MD simulation of x-ray crystallography solved structure 7JIR and MCS docking overlay of the design ideas using the binding mode of GRL-0617 as a reference compound. (a) Root-mean-square deviation of GRL-0617 in the active site cavity of SARS-CoV-2 PLpro obtained with an MD simulation time of 100 ns. (b) MCS docking overlay of design ideas selected prioritized and FEP+ amenable with GRL-0617 (grey carbon atoms) as a reference compound taken with the 100 ns MDS optimized frame. The carbon atoms of GRL-0617 are shown in grey, compound 45 in green, compound 91 in orange, compound 5 in faded green, and compound 23 in violet. The amino acids in the active site cavity were removed to provide a clear depiction of the overlay of the structures. (c) Histogram detailing the interaction between GRL-0617 and PLpro. (d) Ligand interaction diagram revealing hydrogen bonds, π–π, and hydrophobic interactions between GRL-0617 and PLpro.
Molecules 27 08569 g005
Figure 6. FEP+ simulation results for relative binding affinity prediction for the perturbation cycle of GRL-0617 (ligand 1) and compound 45 (ligand 2). (a) Free energy convergence of the perturbation cycle in solvent leg. (b) Free energy convergence of the perturbation cycle in complex leg. (c) Histogram depicting protein–ligand interactions for endpoint λ-replicas. (d) Ligand interaction diagram (Ligand 1 at the top and ligand 2 at the bottom) detailing the type of interactions observed between the two ligands and the receptor.
Figure 6. FEP+ simulation results for relative binding affinity prediction for the perturbation cycle of GRL-0617 (ligand 1) and compound 45 (ligand 2). (a) Free energy convergence of the perturbation cycle in solvent leg. (b) Free energy convergence of the perturbation cycle in complex leg. (c) Histogram depicting protein–ligand interactions for endpoint λ-replicas. (d) Ligand interaction diagram (Ligand 1 at the top and ligand 2 at the bottom) detailing the type of interactions observed between the two ligands and the receptor.
Molecules 27 08569 g006
Table 1. The most optimum models generated by Auto QSAR to train the enumerated design ideas to predict their docking scores, following an active learning approach to assess their ability to bind to SARS-CoV-2 PLpro.
Table 1. The most optimum models generated by Auto QSAR to train the enumerated design ideas to predict their docking scores, following an active learning approach to assess their ability to bind to SARS-CoV-2 PLpro.
Model NameModel CodeSTDEVRMSER2Q2#Factors#Training Set#Test Set
Auto QSAR _model_1KPLS_linear_100.18050.22300.43100.15711478160
Auto QSAR _model_2KPLS_radial_440.22610.23540.62620.59293816273
Auto QSAR _model_3KPLS_radial_260.22840.23980.60900.56303865289
Auto QSAR _model_4KPLS_radial_320.24740.26150.62270.57933906303
Auto QSAR _model_5KPLS_molprint2D_80.27430.29830.69800.645031073358
Auto QSAR _model_6KPLS_molprint2D_480.28530.30400.70840.667631142381
Auto QSAR _model_7KPLS_molprint2D_470.30140.30910.70860.691031252418
Auto QSAR _model_8KPLS_radial_420.28330.30530.75790.719141362454
Auto QSAR _model_9KPLS_molprint2D_150.33230.34240.67180.647631662554
Auto QSAR_model_10KPLS_linear_390.34440.52920.74100.386832391797
R2: Regression coefficient of the training set data; STDEV: Standard deviation; Q2: Regression quotient to determine the predictive ability of the test set; RMSE: Root-Mean-Square Error; # Factors: Number of multivariate factors.
Table 2. Physicochemical parameters simulated during model building of the design ideas including positive control inhibitors from the literature with their predicted docking scores obtained using Auto QSAR_model_10 and their associated IFD binding affinity parameters.
Table 2. Physicochemical parameters simulated during model building of the design ideas including positive control inhibitors from the literature with their predicted docking scores obtained using Auto QSAR_model_10 and their associated IFD binding affinity parameters.
Rank Order No.Enumeration Coupling Building Blocks/PathwayCompound IDTPSANumNumMolMolPred YPred YDockingGlideIFDExp IC50
HDonorsHAcceptorsLogPWt SDScoreE-ModelScore(uM)
1N-Heterocycles58109.98361.32394.456ND *ND *−12.39−103.831−664.209
2R1–R2_Carboxylates39110.52353.60425.529−6.860.098−10.93−95.015−664.167
3Pathway 101135.08382.64546.481−6.620.072−10.87−113.770−673.770
4N-Heterocycles12883.8333.01320.396ND *ND *−9.91−78.744−660.799
5N-Heterocycles96141.17472.20396.451ND *ND *−9.64−102.840−660.007
6Carboxylates582.05251.26310.357−6.600.075−9.44−71.375−660.109
7R1 (vinyl and aryl halides)
R2 (Carboxylates)
282.05251.26310.357−6.600.075−9.44−71.375−660.109
8R1–R2_Carboxylates36123.13373.02449.845−6.640.213−9.12−80.291−659.220
9Carboxylates175.43242.63309.369−6.680.032−8.81−63.709−658.183
1025 a ND *ND *ND *ND *ND *ND *ND *−8.69−76.159−658.5032.64
11R1–R2_Carboxylates49120.56372.61388.431−6.620.150−8.55−69.600−653.725
12Pathway 102499.42245.13448.906−6.610.122−8.53−80.814−658.802
13Primary and secondary amines3087.9243.84377.492−7.600.090−8.40−76.477−657.629
14vinyl and aryl halides_Trifluoroborates2453.71144.53403.469−6.790.065−8.39−78.794−658.165
15N-Heterocycles67103.77433.34372.428ND *ND *−8.34−90.436−658.944
16R1 (vinyl and aryl halides)
R2 (Carboxylates)
463.4132.06252.273−6.760.054−8.31−53.957−657.165
17Pathway 1027107.69444.90500.643−6.790.091−8.26−97.525−659.060
182 a ND *ND *ND *ND *ND *ND *ND *−8.25−56.924−656.5138.70
1924 a ND *ND *ND *ND *ND *ND *ND *−8.14−72.533−658.9030.56
20N-Heterocycles11683.8333.61358.804ND *ND *−8.10−75.467−658.695
21N-Heterocycles5770.91324.49377.822ND *ND *−7.98−76.563−659.233
22N-Heterocycles17295.24343.47364.449ND *ND *−7.94−54.584−656.907
23R1 (vinyl and aryl halides)
R2 (Carboxylates)
189.34340.72273.13−6.690.035−7.77−54.137−660.284
24N-Heterocycles9183.8333.13320.396ND *ND *−7.58−87.783−657.771
251 a ND *ND *ND *ND *ND *ND *ND *−7.48−59.646−655.252200
26N-Heterocycles37122.71452.45356.817ND *ND *−7.46−73.976−657.600
27Primary and secondary amines23114.01351.91342.33−7.550.077−7.29−60.112−659.115
28Hydrazine-aryl15119.39372.51378,436−7.550.068−7.28−66.860−652.823
29Primary and secondary amines2480.9243.81360.461−7.420.123−7.25−70.170−657.073
30R1–R2_ Carboxylates5130.21452.25378.388−6.620.090−7.10−59.593−661.117
31Hydrazine-aryl54110.16363.71396.882−7.550.068−6.67−57.155−653.216
32vinyl and aryl halides_Trifluoroborates1192.51373.94425.558−6.860.084−6.52−87.781−658.149
33N-Heterocycles45117.94352.66400.504ND *ND *−6.52−74.218−657.538
34N-Heterocycles20483.8334.37342.402ND *ND *−6,56−75.146−655.866
35Hydrazine-aryl35110.16362.95380.427−7.510.056−6.51−66.176−653.539
* ND—Not determined as the said compounds were part of the training set during Auto QSAR model building active learning process. a Compounds obtained from the literature with their experimental IC50 results [18,19]. TPSA—Total Polar Surface Area. LogP—Octanol/water partition coefficient. Mol Wt—Molecular Weight. Predicted Y—Predicted docking score (Y-intercept). PredY SD—Standard Deviation of the predicted docking score.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Gumede, N.J. Pathfinder-Driven Chemical Space Exploration and Multiparameter Optimization in Tandem with Glide/IFD and QSAR-Based Active Learning Approach to Prioritize Design Ideas for FEP+ Calculations of SARS-CoV-2 PLpro Inhibitors. Molecules 2022, 27, 8569. https://doi.org/10.3390/molecules27238569

AMA Style

Gumede NJ. Pathfinder-Driven Chemical Space Exploration and Multiparameter Optimization in Tandem with Glide/IFD and QSAR-Based Active Learning Approach to Prioritize Design Ideas for FEP+ Calculations of SARS-CoV-2 PLpro Inhibitors. Molecules. 2022; 27(23):8569. https://doi.org/10.3390/molecules27238569

Chicago/Turabian Style

Gumede, Njabulo Joyfull. 2022. "Pathfinder-Driven Chemical Space Exploration and Multiparameter Optimization in Tandem with Glide/IFD and QSAR-Based Active Learning Approach to Prioritize Design Ideas for FEP+ Calculations of SARS-CoV-2 PLpro Inhibitors" Molecules 27, no. 23: 8569. https://doi.org/10.3390/molecules27238569

APA Style

Gumede, N. J. (2022). Pathfinder-Driven Chemical Space Exploration and Multiparameter Optimization in Tandem with Glide/IFD and QSAR-Based Active Learning Approach to Prioritize Design Ideas for FEP+ Calculations of SARS-CoV-2 PLpro Inhibitors. Molecules, 27(23), 8569. https://doi.org/10.3390/molecules27238569

Article Metrics

Back to TopTop