Next Article in Journal
Synthesis of the Core Framework of the Cornexistins by Intramolecular Nozaki-Hiyama-Kishi Coupling
Previous Article in Journal
Development and Validation of an HPLC-ESI/MS/MS Method for the Determination of Amoxicillin, Its Major Metabolites, and Ampicillin Residues in Chicken Tissues
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Topological Water Network Analysis Around Amino Acids

Graduate School of New Drug Discovery and Development, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Korea
*
Author to whom correspondence should be addressed.
These authors contributed equally to the work.
Molecules 2019, 24(14), 2653; https://doi.org/10.3390/molecules24142653
Submission received: 24 June 2019 / Revised: 20 July 2019 / Accepted: 22 July 2019 / Published: 22 July 2019
(This article belongs to the Section Computational and Theoretical Chemistry)

Abstract

:
Water molecules play a key role in protein stability, folding, function and ligand binding. Protein hydration has been studied using free energy perturbation algorithms. However, the study of protein hydration without free energy calculation is also an active field of research. Accordingly, topological water network (TWN) analysis has been carried out instead of free energy calculation in the present work to investigate hydration of proteins. Water networks around 20 amino acids in the aqueous solution were explored through molecular dynamics (MD) simulations. These simulation results were compared with experimental observations. Water molecules from the protein data bank structures showed TWN patterns similar to MD simulations. This work revealed that TWNs are effected by the surrounding environment. TWNs could provide valuable clues about the environment around amino acid residues in the proteins. The findings from this study could be exploited for TWN-based drug discovery and development.

Graphical Abstract

1. Introduction

Proteins are composed of amino acids. Proteins in the biological environment are surrounded by water molecules which play crucial roles in the protein structure, function and dynamics [1,2]. Furthermore, they influence the binding process between biomacromolecular targets and small molecule ligands [3,4,5,6,7,8]. Water molecules within the active site of a protein can mediate protein-ligand interactions by bridging between protein and ligand or by being displaced upon complex formation. Water molecules have garnered the deserved attention in drug discovery research.
Over the past decade, considerable effort has been devoted to water-centric research. The studies involving water molecules are commonly focused on the free energy calculations. Several free energy-based algorithms are available, such as WaterMap (from Schrodinger) [9], SZMAP (from OpenEye) [10] and GIST (in Amber) [11]. However, free energy calculations are technically very demanding, time-consuming and require multiple simulations. They consider individual water molecules. These approaches are limited to discerning the energetics of hydration sites in the presence of a fixed distribution of surrounding water molecules. Hydrogen-bonded networks are often formed by water molecules in the binding site of the protein. The removal of one water molecule affects the free energies and structure of the remainder. The methods based on free energy calculations require extra simulations to account for these changes. Furthermore, the stability of the network as a whole cannot be understood well by knowing individual binding free energy of each water site. Besides, it is not possible to obtain information on cooperative effects between neighboring water molecules without additional analysis [12].
There is a paucity of water network-based research. Water networks, also known as water clusters, have been explored using infrared spectroscopy, Fourier-transform infrared spectroscopy, fluorescence spectroscopy, replica exchange molecular dynamics simulations and quantum calculations [13,14,15,16,17,18,19,20,21,22]. The experimental and theoretical studies involving water network analysis are mostly carried out on the pure water systems. Water networks arising from water molecules’ interactions with biomolecules have not been studied in detail thus far. This important research field is at an early stage of development. A recent study reported a novel procedure to quantify the disorder of extended water−water hydrogen-bond networks sampled in particle-based computer simulations. The strategy was depended on the conformational clustering of the hydrogen-bond connectivity states [23]. Another latest study showed the possibility of ligand discovery through water network analysis without free energy calculations [24]. They discovered new thermolysin inhibitors by water network analysis.
The authors have been involved in the research related to the water network analysis for several years. The authors developed an algorithm to determine the water-networks formed by hydrogen bonding without explicit free energy calculations [25,26,27,28]. Water networks form polygonal structures with 3-, 4-, 5- and 6-membered rings which the authors named as topological water networks (TWNs). As free energy calculations are technically very demanding, time-consuming and require multiple simulations [12], TWN analysis offers a better alternative to study the water networks. However, unlike free energy calculations, many studies have not been performed using the TWN analysis approach. Authors are exploring this approach in a number of different ways. Previously, the TWN results were correlated with the hydrophobic environment to explain kinase selectivity within systems including a variant of Gleevec and a series of substituted c-Jun N-terminal kinase (JNK) ligands [25]. Additionally, a method to compute the dipole moments of the water-ring network at specific positions in the adenosine triphosphate (ATP) binding pocket of kinase was proposed. The application of this method on two kinase systems (tyrosine-protein kinase ZAP-70/checkpoint kinase 1 and mitogen-activated protein kinase kinase kinase 5/3-phosphoinositide-dependent protein kinase-1) demonstrated that orientation of dipole moments played a crucial role in the protein-ligand binding mechanism [26]. TWN analysis was performed on several kinases to address the critical selectivity issue of kinase inhibitors. TWN analysis was further employed to develop a selective interleukin-1 receptor-associated kinase 4 (IRAK4) inhibitor [27]. Recently, the authors reported that TWNs influence the formation of α-strand/sheet structure in the α-synuclein [28]. This partially folded intermediate structure was proposed to be responsible for α-synuclein aggregation and fibril formation in the Parkinson’s disease. Furthermore, changes in the TWN pattern was observed under different conditions.
To date, several studies have reported the hydration pattern at the macromolecular (protein) level. However, the hydration pattern at the atomic level around the individual amino acids has not been studied yet. In this work, through molecular dynamics (MD) simulations, TWNs have been analyzed around all the amino acids at the atomic level for the first time. Furthermore, these results have been compared with TWNs of the X-ray crystallographic structures, which are available in the protein data bank (PDB). As amino acids are the components of proteins, TWN analysis around each amino acid provides novel insights into the amino acid specific hydration pattern at the atomic level.

2. Results and Discussion

2.1. Validation of MD Simulation Results for Each Amino Acid

Based on the hydrophobicity of amino acids proposed by Kyte-Doolittle (Figure 1) [29,30], MD simulation results of five representative amino acids (Ile—strong hydrophobic, Ala—weak hydrophobic, Ser—weak hydrophilic, Glu—strong hydrophilic and negative charge and Arg—strong hydrophilic and positive charge) were validated by the Ramachandran plot analysis. The Ramachandran plot has been the mainstay of protein structure validation for several years. It involves plotting of the backbone torsion angles (φ/ψ) for the amino acid residues. The φ and ψ values are plotted on the x-axis and y-axis, respectively, to predict the possible conformation of the amino acid. Torsion angles were computed using gmx rama module of GROMACS software [31]. This module selects the φ/ψ combinations from topology file and calculates these as a function of time. The Ramachandran plots for the representative amino acids are shown in Figure 2. It can be seen in the plots that torsion angles for the representative amino acids fell into well-defined regions of the Ramachandran plot. In accordance with previous studies [32,33], φ/ψ values were located in the right-handed α-helix region (lower left quadrant) and β-sheet region (upper left quadrant). It is evident from the Ramachandran plots that amino acid structures were correct. In addition, root-mean square deviation (RMSD) from the initial structure was calculated and plotted against simulation time to examine the dynamic stability of the amino acid structures. The RMSDs of all the amino acids (Supplementary Figure S1) remained stable with low values (<1 Å) throughout the MD simulation. The reasonably stabilized RMSD curves for the simulated amino acids suggest that they are suitable for further analysis.

2.2. TWN Pattern Around Amino Acids

Water molecules influence the structure and function of biomolecules. The concept of the hydration shell has been used to describe the properties of water around the biomolecules [34,35,36]. However, defining a hydration shell and differentiating it from the bulk water is challenging. It is usually defined as first water layer or sometimes a few water layers surrounding the biomolecule. The hydration shell consists of all those water molecules whose properties are significantly affected by the presence of biomolecule. The hydration shell thickness around a given site is usually provided as the distance from that site. The distance ranges of 2.5–4.0 and 5–7 Å are considered for the first and second hydration shells, respectively [37]. The hydration shell with one to two inner layers of water molecules was reported as a distance of 10 Å from the protein in an experimental study [20]. In the case of studies involving analysis of a single water molecule as one unit, it is possible to investigate the dynamics of water in multiple layers of the hydration shell. However, TWN has been explored as one unit instead of a single water molecule analysis in the present study. As explained before, TWN are formed due to hydrogen bonding among several water molecules. Water molecules participating in TWNs can be present in different layers of the shell. Consequently, in accordance with a previous study [20], only one hydration shell with a diameter of 10 Å from the amino acid coordinates was defined in this study. A distance of 10 Å was used for hydration shell as this value can include both first and second hydration shells. Water molecules present within 10 Å of the amino acid coordinates every 10 ps were extracted as well as every 5 ps for each simulated system. The TWN analysis was carried out on the extracted water molecules using the in-house Java and R codes.
Initially, the minimal distance between the TWN and amino acid atoms was calculated every 10 ps throughout the 1 ns MD simulation (Figure 3). As shown in Table 1 and Supplementary Table S1, the total number of TWNs decreased with the increase in the size of the TWN ring. The largest number of TWNs were observed for the 3-ring whereas the smallest number of TWNs were observed for 6-ring in the case of all the amino acids. This is due to the limited size of the hydration shell (distance of 10 Å) which can accommodate a low number of 6-ring TWNs as they are larger structures as compared to the 3-ring TWNs. As can be seen in Table 1, 3-ring TWNs were largely formed around the polar atoms (O or N) for hydrophilic amino acids, such as Asp, Asn and Glu, while they were mostly formed around the non-polar C atoms for hydrophobic amino acids, such as Ile and Phe. The total number of TWNs decreased with the increase in the size of the TWN ring. However, the frequency of 4- and 5-ring TWNs (Supplementary Table S1) showed similar patterns for the amino acids as the 3-ring TWNs. In the case of the residues with positively charged side chain (Lys and Arg), the TWN pattern was weaker compared to other strong hydrophilic resides, such as Gln, Asp, Asn and Glu. These results suggested that charges on the side chains of residues influence the TWN pattern. For the 6-ring (Supplementary Table S1), a somewhat similar TWN pattern was observed as obtained for the 3-, 4- and 5-rings. This might be due to the formation of very few 6-ring TWNs.
Additionally, the TWNs for backbone and side chain of each amino acid were separated. As can be seen in Table 1 and Supplementary Table S1, there is not much difference in the TWN frequency around the backbone atoms. In contrast, the side chain atoms of hydrophobic and hydrophilic residues showed considerable differences in the frequency of TWNs around them. Unlike hydrophilic amino acids, TWNs were largely formed around the side chain non-polar C atoms of hydrophobic amino acids. The side chain polar atoms (O or N) of hydrophilic amino acids showed high TWN frequency around them. However, due to the absence of polar atoms (O or N) in the side chains of hydrophobic amino acids, TWNs could not be observed around them. The separated backbone and side chain TWN results suggested that side chains were mainly responsible for the difference in the TWN pattern around the hydrophilic and hydrophobic amino acids.
In addition to 10 ps TWN analysis, the minimal distance between the TWN and amino acid atoms was calculated every 5 ps throughout the 1 ns MD simulation to check the consistency of the results. As shown in Supplementary Table S2, the 5 ps trajectories exhibited similar TWN patterns as the 10 ps trajectories. TWNs (3-, 4- and 5-ring) were mostly formed around the polar atoms (O or N) of hydrophilic amino acids, whereas they were mainly formed around the non-polar C atoms of hydrophobic amino acids. Similar to 10 ps TWN results, the weaker TWN pattern was observed for Lys and Arg which might be due to their positively charged side chains. The separated backbone and side chain TWN results showed similar patterns as observed for the 10 ps analysis. Similar TWN patterns for different time periods (10 ps and 5 ps) indicated the consistency of our results.

2.3. TWN Pattern for Residues of the PDB Structures

In addition to the individual amino acid MD simulations, TWN patterns of the bonded amino acid residues of proteins were analyzed. As discussed in the methodology section, TWN analysis was carried out on the selected X-ray crystallographic structures available in the PDB. The hydrogen atoms are missing in most of the PDB files except for extremely high resolution crystal structures. Thus, TWNs for the crystal water molecules were calculated on the basis of maximum distance of <3.5 Å between the crystal water oxygen atoms (Figure 4) instead of the energies [38,39]. The TWN results (Table 2) for the filtered PDB dataset containing 16,548 structures demonstrated a similar pattern as observed in the MD simulations.
In the MD simulations (Table 1 and Supplementary Tables S1 and S2), total number of TWNs were almost similar around both the hydrophobic and hydrophilic residues because they were exposed to surrounding water molecules in the same way in the simulation box. However, individual amino acids are not the same as amino acid residues present in a protein. As shown in Table 2 and Supplementary Table S3, the total TWNs around the hydrophilic amino acid residues were found to be comparatively more than the hydrophobic amino acid residues. This could be due to hydrophilic residues being often more exposed to bulk water compared to the hydrophobic residues in the biological systems. Although total number of TWNs around hydrophobic residues was lower, the frequency of TWNs (3-, 4- and 5-rings) near their non-polar C atoms was comparatively higher than the non-polar C atoms of hydrophilic residues. Conversely, the frequency of TWNs was relatively higher near polar atoms (O or N) of the hydrophilic residues. As observed in the MD simulations, the total number of 6-ring TWNs in PDBs was also very small and the TWN pattern was slightly disordered accordingly. The TWN results of PDB structures were found to be consistent with MD simulation results. Similar to MD simulation results, the weaker TWN pattern was observed for positively charged residues particularly for Lys. Unlike other strong hydrophilic residues, the higher TWN frequency was observed near the non-polar C atoms of Lys. This might be due to its positively charged side chain. The TWN results of both MD simulations and PDB structures indicate that the charge effects the formation of these water networks.
As can be seen in Table 2 and Supplementary Table S3, the separated backbone and side chain TWN results for PDB structures were similar to the MD simulation results. The TWN frequency around the backbone atoms were almost similar, however, the side chain atoms showed significant differences. In accordance with the MD simulation results, TWNs were mainly formed around the side chain non-polar C atoms of the hydrophobic amino acids, whereas they were largely present around the side chain polar atoms (O or N) of the hydrophilic amino acids. The side chains were mainly responsible for the difference in the TWN pattern around the hydrophilic and hydrophobic amino acids.
Previously, another research group studied the electronic structure of the amino acids. They reported the atomic charge distribution for the amino acids [40]. The average charge on the C atoms of hydrophobic residues was found to be lower than the average charge on the C atoms of the hydrophilic residues. The charge differences could be responsible for the higher TWN frequency near the non-polar C atoms of the hydrophobic residues despite the low number of total TWNs around them. However, the lower average charge was observed around the C atoms of the Lys compared to the other hydrophilic residues, such as Asp, Asn and Glu. This could be the reason for the higher TWN frequency near non-polar C atoms of Lys.

3. Materials and Methods

3.1. Preparation and Capping of Amino Acids

The 3D structures of 20 amino acids (Supplementary Figure S2) were built and the charge states were assigned by Discovery Studio 2017 R2 (BIOVIA, San Diego, CA, USA). In the case of His, both tautomeric structures namely Hsd (protonated at delta N) and Hse (protonated at epsilon N) were created. A previous study reported that capping of the termini of the amino acids ensured that the dynamics of the φ and ψ torsion angles were analogues to the dynamics within a peptide chain [32]. Accordingly, N and C termini of each amino acid were capped with acetyl (ACE) and N-methyl amide (NME) groups, respectively (Figure 5), for removing charge effects and imitating the peptide bond. The 3D structure of each amino acid was verified by a comparison with the energy minimized structure.

3.2. Molecular Dynamics (MD) Simulation

GROMACS software version 5.1 [31] with CHARMM27 all atom force field [41] was used to perform the MD simulations on all capped amino acid structures. The simulations on the charged amino acids were performed in a similar way as the neutral amino acids. Each amino acid was solvated in a cubic box of TIP3P water molecules [42] with a margin distance of 10 Å. The energy minimization of the system was carried out for 50,000 steps with the steepest descent method. The energy minimization was followed by NVT (constant number of particles, volume and temperature) and NPT (constant number of particles, pressure and temperature) equilibrations. The NVT equilibration was carried out for 100 ps at 298 K using Berendsen thermostat [43] to stabilize the temperature. The system was again equilibrated with NPT at a pressure of 1 bar for 100 ps using the Parrinello-Rahman barostat [44]. Finally, a production run was carried for 1 ns and coordinate trajectories were recorded every 5 ps. All the simulations were performed by applying periodic boundary conditions (PBC). All bond lengths were constrained using linear constraint solver (LINCS) algorithms [45]. The long-range interactions were handled using the particle mesh Ewald (PME) method [46], while short-range interactions were truncated at 10 Å.

3.3. Topological Water Network (TWN) Analysis

The authors developed an algorithm to determine the water-networks formed by hydrogen bonding without explicit free energy calculations. Water networks form polygonal structures which were named as TWNs. The computational protocol for TWN analysis is reported in their previous works [25,26,27,28]. TWNs refers to the hydrogen-bonded cyclic water-ring networks which are formed due to the hydrogen-bond interactions among the water molecules. TWNs include 3-, 4-, 5- and 6-membered rings. The potential functions considered in the TWNs involve a rigid TIP3P water model. Lennard-Jones and Coulomb potentials are commonly used to model interactions between water molecules [42]. The interaction potential energy between two water molecules is calculated using the following equation:
v ( a , b ) = i o n   a j o n   b q i q j e 2 r i j + A r o o 12 C r o o 6
where v(a, b) is the interaction potential energy between water molecules a and b, roo indicates the distance between oxygen atoms, qi and qj represent partial charges on site i and j, respectively, rij is the distance between charges qi and qj. A coulombic force between these two charges signifies the electrostatic attraction. A function that considers both attraction and repulsion simultaneously denotes the van der Waals interaction. The repulsive force of i and j is represented by parameter A whereas attraction is indicated by parameter C.
An energy criterion of −2.25 kcal mol−1 was carefully chosen to determine the hydrogen bond between water molecules as this value closely corresponds to the minimum of the pair-energy distribution of potential [42]. The parameters were selected in such a way that they produced reasonable structural and energetic results for liquid water. The parameters’ values are provided below:
A = 582,000 kcal Å12 mol−1
C = 595 kcal Å6 mol−1
qi = −0.834e, qj = 0.417e
For TWN analysis, this study initially performed 1 ns MD simulation on each amino acid structure as discussed in the previous section. Afterwards, water molecules present within 10 Å of the alpha-carbon (Cα) atoms of each residue were extracted every 10 ps as well as every 5 ps for each simulated system. Finally, the TWN analysis was carried out on the extracted water molecules using the equation provided above. Finally, the minimal distance between the TWN oxygen atom and heavy atoms of the residues was calculated. The extraction of water molecules, the TWN analysis and minimal distance calculations were performed using the in-house Java and R codes.

3.4. Analysis of PDB Structures

3.4.1. Preparation of PDB Files

All the available PDB files (total of 126,292) were downloaded from the RCSB PDB website [47]. The non-protein PDB files, such as nucleic acids (DNA and RNA) and protein structures containing no crystal water molecules were removed. The structures (16,548 PDBs) having continuous hydration layer at the protein surface, i.e., X-ray structures refined against diffraction data with resolution better than 1.6 Å, were selected for the TWN analysis [48].

3.4.2. TWN Analysis of Filtered PDB Files

As most of the 3D structures in the PDB contain no hydrogen atoms, the presence of an energetically significant hydrogen bond can be inferred when a probable donor and acceptor are within 3.5 Å of each other [38]. Accordingly, TWNs (3-, 4-, 5- and 6-membered rings) present in the PDB files were not extracted as energies but distances of <3.5 Å [39] between the crystal water oxygen atoms. Initially, all the water molecules present in the PDB structures were extracted and then distances between the crystal water oxygen atoms were analyzed to identify TWNs (distances of <3.5 Å). Then, a minimal distance between the TWN oxygen atom and heavy atoms of the residues was calculated. This study used in-house Java and R codes for the extraction of water molecules, the TWN analysis and minimal distance calculations.

4. Conclusions

The present work presents an algorithm to determine the water-networks formed by hydrogen bonding, without explicit free energy calculations. Water networks form polygonal structures with 3-, 4-, 5- and 6-membered rings which are known as TWNs. Herein, the frequency of the formation of TWNs near twenty amino acids were analyzed through MD simulations. Furthermore, the simulation results were compared with TWNs of PDBs. The results revealed that the formation of TWNs was affected by the environment around the amino acid residues in the protein. The TWN frequencies around the polar atoms (O or N) of hydrophilic residues were found to be higher compared to polar atoms (O or N) of the hydrophobic residues. On the contrary, the TWN frequencies were higher near the non-polar C atoms of the hydrophobic residues. Previously, several studies have reported hydration pattern of proteins at the macroscopic molecular level. However, to the authors’ knowledge, this is the first attempt to investigate a hydration pattern at the atomic level. Our simulations and calculations provide a novel insight into the amino acid specific hydration pattern at the atomic level for any protein. Furthermore, this work offers new perspectives to discover novel ligands as well as to optimize lead compounds in drug discovery considering a particular binding site environment.

Supplementary Materials

The following are available online, Table S1: The number of TWNs observed around various atoms of all amino acids in the MD simulations. TWN analysis was carried out on the water molecules which were extracted every 10 ps for each simulated system. Amino acids are ordered from the most hydrophobic one (Ile, on the left hand side) to the most hydrophilic one (Arg, on the right hand side), according to the Kyte-Doolitle scale. (A) 4-ring TWNs (Backbone + Side chain), (B) 4-ring TWNs (Backbone), (C) 4-ring TWNs (Side chain), (D) 5-ring TWNs (Backbone + Side chain), (E) 5-ring TWNs (Backbone), (F) 5-ring TWNs (Side chain), (G) 6-ring TWNs (Backbone + Side chain), (H) 6-ring TWNs (Backbone), (I) 6-ring TWNs (Side chain); Table S2: The number of TWNs observed around various atoms of all amino acids in the MD simulations. TWN analysis was carried out on the water molecules which were extracted every 5 ps for each simulated system. Amino acids are ordered from the most hydrophobic one (Ile, on the left hand side) to the most hydrophilic one (Arg, on the right hand side), according to the Kyte-Doolitle scale. (A) 3-ring TWNs (Backbone + Side chain), (B) 3-ring TWNs (Backbone), (C) 3-ring TWNs (Side chain), (D) 4-ring TWNs (Backbone + Side chain), (E) 4-ring TWNs (Backbone), (F) 4-ring TWNs (Side chain), (G) 5-ring TWNs (Backbone + Side chain), (H) 5-ring TWNs (Backbone), (I) 5-ring TWNs (Side chain), (J) 6-ring TWNs (Backbone + Side chain), (K) 6-ring TWNs (Backbone), (L) 6-ring TWNs (Side chain); Table S3: The number of TWNs observed around various atoms of all amino acids in the PDBs. Amino acids are ordered from the most hydrophobic one (Ile, on the left hand side) to the most hydrophilic one (Arg, on the right hand side), according to the Kyte-Doolitle scale. (A) 4-ring TWNs (Backbone + Side chain), (B) 4-ring TWNs (Backbone), (C) 4-ring TWNs (Side chain), (D) 5-ring TWNs (Backbone + Side chain), (E) 5-ring TWNs (Backbone), (F) 5-ring TWNs (Side chain), (G) 6-ring TWNs (Backbone + Side chain), (H) 6-ring TWNs (Backbone), (I) 6-ring TWNs (Side chain); Figure S1: RMSD plot for the backbone atoms of amino acids from the initial structures throughout the 1 ns MD simulation as a function of time; Figure S2: Structures of the amino acids studied in the present work.

Author Contributions

Conceptualization, N.S.K.; methodology, K.-E.C. and H.R.Y.; software, K.-E.C. and E.C.; validation, N.S.K., K.-E.C. and A.B.; formal analysis, K.-E.C.; investigation, K.-E.C.; resources, N.S.K.; data curation, A.B.; writing—original draft preparation, K.-E.C.; writing—review and editing, N.S.K. and A.B.; visualization, K.-E.C.; supervision, N.S.K.; project administration, N.S.K.; funding acquisition, N.S.K.

Funding

This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT & Future Planning(2017R1A2B4002827) and Chungnam National University.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Bellissent-Funel, M.-C.; Hassanali, A.; Havenith, M.; Henchman, R.; Pohl, P.; Sterpone, F.; Van Der Spoel, D.; Xu, Y.; Garcia, A.E. Water determines the structure and dynamics of proteins. Chem. Rev. 2016, 116, 7673–7697. [Google Scholar] [CrossRef] [PubMed]
  2. Frey, M. Water structure associated with proteins and its role in crystallization. Acta Crystallogr. D 1994, 50, 663–666. [Google Scholar] [CrossRef] [PubMed]
  3. Ladbury, J.E. Just add water! The effect of water on the specificity of protein-ligand binding sites and its potential application to drug design. Chem. Biol. 1996, 3, 973–980. [Google Scholar] [CrossRef] [Green Version]
  4. Poornima, C.; Dean, P. Hydration in drug design. 1. Multiple hydrogen-bonding features of water molecules in mediating protein-ligand interactions. J. Comput. Aided Mol. Des. 1995, 9, 500–512. [Google Scholar] [CrossRef] [PubMed]
  5. Hummer, G. Molecular binding: Under water’s influence. Nat. Chem. 2010, 2, 906. [Google Scholar] [CrossRef] [PubMed]
  6. Baron, R.; Setny, P.; McCammon, J.A. Water in cavity−ligand recognition. J. Am. Chem. Soc. 2010, 132, 12091–12097. [Google Scholar] [CrossRef] [PubMed]
  7. Baron, R.; Setny, P.; Paesani, F. Water structure, dynamics, and spectral signatures: Changes upon model cavity–Ligand recognition. J. Phys. Chem. B 2012, 116, 13774–13780. [Google Scholar] [CrossRef] [PubMed]
  8. Quiocho, F.; Wilson, D.K.; Vyas, N. Substrate specificity and affinity of a protein modulated by bound water molecules. Nature 1989, 340, 404. [Google Scholar] [CrossRef]
  9. Young, T.; Abel, R.; Kim, B.; Berne, B.J.; Friesner, R.A. Motifs for molecular recognition exploiting hydrophobic enclosure in protein–ligand binding. Proc. Natl. Acad. Sci. USA 2007, 104, 808–813. [Google Scholar] [CrossRef]
  10. Bayden, A.S.; Moustakas, D.T.; Joseph-McCarthy, D.; Lamb, M.L. Evaluating free energies of binding and conservation of crystallographic waters using SZMAP. J. Chem. Inf. Model. 2015, 55, 1552–1565. [Google Scholar] [CrossRef]
  11. Ramsey, S.; Nguyen, C.; Salomon-Ferrer, R.; Walker, R.C.; Gilson, M.K.; Kurtzman, T. Solvation thermodynamic mapping of molecular surfaces in AmberTools: GIST. J. Comput. Chem. 2016, 37, 2029–2037. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Ross, G.A.; Bodnarchuk, M.S.; Essex, J.W. Water sites, networks, and free energies with grand canonical Monte Carlo. J. Am. Chem. Soc. 2015, 137, 14930–14943. [Google Scholar] [CrossRef] [PubMed]
  13. Otto, K.E.; Xue, Z.; Zielke, P.; Suhm, M.A. The Raman spectrum of isolated water clusters. Phys. Chem. Chem. Phys. 2014, 16, 9849–9858. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Zischang, J.; Suhm, M.A. The OH stretching spectrum of warm water clusters. J. Chem. Phys. 2014, 140, 064312. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Richardson, J.O.; Pérez, C.; Lobsiger, S.; Reid, A.A.; Temelso, B.; Shields, G.C.; Kisiel, Z.; Wales, D.J.; Pate, B.H.; Althorpe, S.C. Concerted hydrogen-bond breaking by quantum tunneling in the water hexamer prism. Science 2016, 351, 1310–1313. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  16. Buck, U.; Pradzynski, C.C.; Zeuch, T.; Dieterich, J.M.; Hartke, B. A size resolved investigation of large water clusters. Phys. Chem. Chem. Phys. 2014, 16, 6859–6871. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Brown, S.E.; Götz, A.W.; Cheng, X.; Steele, R.P.; Mandelshtam, V.A.; Paesani, F. Monitoring water clusters “melt” through vibrational spectroscopy. J. Am. Chem. Soc. 2017, 139, 7082–7088. [Google Scholar] [CrossRef] [PubMed]
  18. Matsumoto, M.; Baba, A.; Ohmine, I. Topological building blocks of hydrogen bond network in water. J. Chem. Phys. 2007, 127, 134504. [Google Scholar] [CrossRef] [PubMed]
  19. Cobar, E.A.; Horn, P.R.; Bergman, R.G.; Head-Gordon, M. Examination of the hydrogen-bonding networks in small water clusters (n = 2–5, 13, 17) using absolutely localized molecular orbital energy decomposition analysis. Phys. Chem. Chem. Phys. 2012, 14, 15328–15339. [Google Scholar] [CrossRef] [PubMed]
  20. Zhang, L.; Yang, Y.; Kao, Y.-T.; Wang, L.; Zhong, D. Protein hydration dynamics and molecular mechanism of coupled water−protein fluctuations. J. Am. Chem. Soc. 2009, 131, 10677–10691. [Google Scholar] [CrossRef] [PubMed]
  21. Jung, D.H.; Yang, J.H.; Jhon, M.S. The effect of an external electric field on the structure of liquid water using molecular dynamics simulations. Chem. Phys. 1999, 244, 331–337. [Google Scholar] [CrossRef]
  22. Yu, J.Y.; Shin, J.K.; Jhon, M.S. The structure of water in human ras oncogene proteins. Int. J. Quantum Chem. 1994, 51, 241–254. [Google Scholar] [CrossRef]
  23. Rahaman, O.; Kalimeri, M.; Katava, M.; Paciaroni, A.; Sterpone, F. Configurational disorder of water hydrogen-bond network at the protein dynamical transition. J. Phys. Chem. B 2017, 121, 6792–6798. [Google Scholar] [CrossRef] [PubMed]
  24. Krimmer, S.G.; Cramer, J.; Betz, M.; Fridh, V.; Karlsson, R.; Heine, A.; Klebe, G. Rational design of thermodynamic and kinetic binding profiles by optimizing surface water networks coating protein-bound ligands. J. Med. Chem. 2016, 59, 10530–10548. [Google Scholar] [CrossRef] [PubMed]
  25. Jang, W.D.; Kim, J.-T.; Kang, N.S. The analysis of water network for kinase selectivity based on the MD simulations. J. Mol. Liq. 2014, 191, 37–41. [Google Scholar] [CrossRef]
  26. Jang, W.D.; Lee, M.H.; Kang, N.S. Quantitative assessment of kinase selectivity based the water-ring network in protein binding sites using molecular dynamics simulations. J. Mol. Liq. 2016, 221, 316–322. [Google Scholar] [CrossRef]
  27. Lee, M.; Balupuri, A.; Jung, Y.-R.; Choi, S.; Lee, A.; Cho, Y.; Kang, N.S. Design of a novel and selective IRAK4 inhibitor using topological water network analysis and molecular modeling approaches. Molecules 2018, 23, 3136. [Google Scholar] [CrossRef]
  28. Balupuri, A.; Choi, K.-E.; Kang, N.S. Computational insights into the role of α-strand/sheet in aggregation of α-synuclein. Sci. Rep. 2019, 9, 59. [Google Scholar] [CrossRef]
  29. Kyte, J.; Doolittle, R.F. A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 1982, 157, 105–132. [Google Scholar] [CrossRef] [Green Version]
  30. Schauperl, M.; Podewitz, M.; Waldner, B.J.; Liedl, K.R. Enthalpic and entropic contributions to hydrophobicity. J. Chem. Theory Comput. 2016, 12, 4600–4610. [Google Scholar] [CrossRef]
  31. Abraham, M.J.; Murtola, T.; Schulz, R.; Páll, S.; Smith, J.C.; Hess, B.; Lindahl, E. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 2015, 1, 19–25. [Google Scholar] [CrossRef]
  32. Vitalini, F.; Noé, F.; Keller, B.J. Molecular dynamics simulations data of the twenty encoded amino acids in different force fields. Data Brief. 2016, 7, 582–590. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Hovmöller, S.; Zhou, T.; Ohlson, T. Conformations of amino acids in proteins. Acta Crystallogr. D 2002, 58, 768–776. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Laage, D.; Elsaesser, T.; Hynes, J.T. Water dynamics in the hydration shells of biomolecules. Chem. Rev. 2017, 117, 10694–10725. [Google Scholar] [CrossRef] [PubMed]
  35. Kuntz, I., Jr.; Kauzmann, W. Hydration of proteins and polypeptides. Adv. Protein Chem. 1974, 28, 239–345. [Google Scholar]
  36. Bagchi, B. Water dynamics in the hydration layer around proteins and micelles. Chem. Rev. 2005, 105, 3197–3219. [Google Scholar] [CrossRef]
  37. Makarov, V.; Pettitt, B.M.; Feig, M. Solvation and hydration of proteins and nucleic acids: A theoretical view of simulation and experiment. Acc. Chem. Res. 2002, 35, 376–384. [Google Scholar] [CrossRef]
  38. Jeffrey, G.A. An Introduction to Hydrogen Bonding; Oxford University Press: New York, NY, USA, 1997; p. 303. ISBN 0-19-509549-9. [Google Scholar]
  39. Nittinger, E.; Schneider, N.; Lange, G.; Rarey, M. Evidence of water molecules--a statistical evaluation of water molecules based on electron density. J. Chem. Inf. Model. 2015, 55, 771–783. [Google Scholar] [CrossRef]
  40. Dixon, D.A.; Lipscomb, W. Electronic structure and bonding of the amino acids containing first row atoms. J. Biol. Chem. 1976, 251, 5992–6000. [Google Scholar]
  41. Bjelkmar, P.; Larsson, P.; Cuendet, M.A.; Hess, B.; Lindahl, E. Implementation of the CHARMM force field in GROMACS: Analysis of protein stability effects from correction maps, virtual interaction sites, and water models. J. Chem. Theory Comput. 2010, 6, 459–466. [Google Scholar] [CrossRef]
  42. Jorgensen, W.L.; Chandrasekhar, J.; Madura, J.D.; Impey, R.W.; Klein, M.L. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 1983, 79, 926–935. [Google Scholar] [CrossRef]
  43. Bussi, G.; Donadio, D.; Parrinello, M. Canonical sampling through velocity rescaling. J. Chem. Phys. 2007, 126, 014101. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Parrinello, M.; Rahman, A. Polymorphic transitions in single crystals: A new molecular dynamics method. J. Appl. Phys. 1981, 52, 7182–7190. [Google Scholar] [CrossRef]
  45. Hess, B.; Bekker, H.; Berendsen, H.J.; Fraaije, J.G. LINCS: A linear constraint solver for molecular simulations. J. Comput. Chem. 1997, 18, 1463–1472. [Google Scholar] [CrossRef]
  46. Darden, T.; York, D.; Pedersen, L. Particle mesh Ewald: An N⋅log (N) method for Ewald sums in large systems. J. Chem. Phys. 1993, 98, 10089–10092. [Google Scholar] [CrossRef]
  47. RCSB PDB. Available online: http://ftp.rcsb.org (accessed on 3 February 2017).
  48. Carugo, O. When proteins are completely hydrated in crystals. Int. J. Biol. Macromol. 2016, 89, 137–143. [Google Scholar] [CrossRef]
Sample Availability: Samples of the compounds are not available from the authors.
Figure 1. The hydropathy index of all amino acids according to the Kyte-Doolittle scale. The larger the value, the more hydrophobic is the amino acid. Ile is the most hydrophobic while Arg is the most hydrophilic. Amino acids are ordered from the most hydrophobic one (on the left hand side) to the most hydrophilic one (on the right hand side).
Figure 1. The hydropathy index of all amino acids according to the Kyte-Doolittle scale. The larger the value, the more hydrophobic is the amino acid. Ile is the most hydrophobic while Arg is the most hydrophilic. Amino acids are ordered from the most hydrophobic one (on the left hand side) to the most hydrophilic one (on the right hand side).
Molecules 24 02653 g001
Figure 2. The Ramachandran plots for the representative amino acids. X-axis indicate φ torsion angle whereas Y-axis indicate ψ torsion angle. The φ and ψ torsion angles were extracted from the molecular dynamics (MD) trajectories. The representative amino acids were selected on the basis of hydropathy index (Ile—strong hydrophobic, Ala—weak hydrophobic, Ser—weak hydrophilic, Glu—strong hydrophilic and negative charge, and Arg—strong hydrophilic and positive charge).
Figure 2. The Ramachandran plots for the representative amino acids. X-axis indicate φ torsion angle whereas Y-axis indicate ψ torsion angle. The φ and ψ torsion angles were extracted from the molecular dynamics (MD) trajectories. The representative amino acids were selected on the basis of hydropathy index (Ile—strong hydrophobic, Ala—weak hydrophobic, Ser—weak hydrophilic, Glu—strong hydrophilic and negative charge, and Arg—strong hydrophilic and positive charge).
Molecules 24 02653 g002
Figure 3. An illustration of a 3-ring topological water network (TWN) closely located near the O atom of an amino acid during the MD simulation. Water molecules involved in the 3-ring TWN are shown as W1-W3. TWNs derived from simulations were defined on the basis of non-covalent bond energy (−2.25 kcal mol−1).
Figure 3. An illustration of a 3-ring topological water network (TWN) closely located near the O atom of an amino acid during the MD simulation. Water molecules involved in the 3-ring TWN are shown as W1-W3. TWNs derived from simulations were defined on the basis of non-covalent bond energy (−2.25 kcal mol−1).
Molecules 24 02653 g003
Figure 4. An illustration of a 3-ring TWN closely located near one of the N atoms of Arg473 in chain A of protein data bank (PDB) 2G1T (a Bcr-Abl tyrosine kinase structure). Water molecules are represented by red spheres. Water molecules involved in the 3-ring TWN are indicated as W1-W3. TWNs in PDBs were not extracted as energies but distances of <3.5 Å between the crystal water oxygen atoms due to the absence of hydrogen atoms in most of the PDBs.
Figure 4. An illustration of a 3-ring TWN closely located near one of the N atoms of Arg473 in chain A of protein data bank (PDB) 2G1T (a Bcr-Abl tyrosine kinase structure). Water molecules are represented by red spheres. Water molecules involved in the 3-ring TWN are indicated as W1-W3. TWNs in PDBs were not extracted as energies but distances of <3.5 Å between the crystal water oxygen atoms due to the absence of hydrogen atoms in most of the PDBs.
Molecules 24 02653 g004
Figure 5. Amino acid capping at both termini. The amino acid is represented by the stick model where C, N, O and H atoms are shown in dark gray, blue, red and light gray colors, respectively. The capping regions are highlighted. Amino- and carboxyl-terminals were capped with acetyl (ACE) and N-methyl amide (NME) groups, respectively.
Figure 5. Amino acid capping at both termini. The amino acid is represented by the stick model where C, N, O and H atoms are shown in dark gray, blue, red and light gray colors, respectively. The capping regions are highlighted. Amino- and carboxyl-terminals were capped with acetyl (ACE) and N-methyl amide (NME) groups, respectively.
Molecules 24 02653 g005
Table 1. The number of 3-ring TWNs observed around various atoms of all amino acids in the MD simulations. The TWN analysis was carried out on the water molecules which were extracted every 10 ps for each simulated system. The amino acids are ordered from the most hydrophobic one (Ile, on the left hand side) to the most hydrophilic one (Arg, on the right hand side), according to the Kyte-Doolitle scale.
Table 1. The number of 3-ring TWNs observed around various atoms of all amino acids in the MD simulations. The TWN analysis was carried out on the water molecules which were extracted every 10 ps for each simulated system. The amino acids are ordered from the most hydrophobic one (Ile, on the left hand side) to the most hydrophilic one (Arg, on the right hand side), according to the Kyte-Doolitle scale.
(A) 3-ring TWNs (Backbone + Side chain)
IleValLeuPheCysMetAlaGlyThrTrpSerTyrProHsdHseGlnAspAsnGluLysArg
O86939084971041301031359617012886809912519613915611399
N252837151337286520382123338070731960275181
C230248219232141175200168154207152201221169164124129105117183158
S 9328
Total341369346331344344358336309341343352340329333322344304300347338
O,N/Total0.330.330.370.300.320.410.440.500.500.390.560.430.350.490.510.610.630.650.610.470.53
C/Total0.670.670.630.700.410.510.560.500.500.610.440.570.650.510.490.390.370.350.390.520.47
S/Total 0.270.08
(B) 3-ring TWNs (Backbone)
IleValLeuPheCysMetAlaGlyThrTrpSerTyrProHsdHseGlnAspAsnGluLysArg
O86939084971041301039196951098680998788977911399
N252837151337286520182123332427201922272824
C111115105111118104125168106112123111911059710610689104119122
Total222236232210228245283336217226239243210209223213213208210260245
O,N/Total0.500.510.550.470.480.580.560.500.510.500.490.540.570.500.570.500.500.570.500.540.50
C/Total0.500.490.450.530.520.420.440.500.490.500.510.460.430.500.430.500.500.430.500.460.50
(C) 3-ring TWNs (Side chain)
IleValLeuPheCysMetAlaGlyThrTrpSerTyrProHsdHseGlnAspAsnGluLysArg
O 44 7519 381084277
N 20 564353 38 2357
C119133114121237175 489529901306467182316136436
S 9328
Total1191331141211169975 9211510410913012011010913196908793
O,N/Total 0.480.170.720.17 0.470.390.830.820.830.860.260.61
C/Total1.001.001.001.000.200.721.00 0.520.830.280.831.000.530.610.170.180.170.140.740.39
S/Total 0.800.28
Table 2. The number of 3-ring TWNs observed around various atoms of all amino acids in the PDBs. The amino acids are ordered from the most hydrophobic one (Ile, on the left hand side) to the most hydrophilic one (Arg, on the right hand side), according to the Kyte-Doolitle scale.
Table 2. The number of 3-ring TWNs observed around various atoms of all amino acids in the PDBs. The amino acids are ordered from the most hydrophobic one (Ile, on the left hand side) to the most hydrophilic one (Arg, on the right hand side), according to the Kyte-Doolitle scale.
(A) 3-ring TWNs (Backbone + Side chain)
IleValLeuPheCysMetAlaGlyThrTrpSerTyrProHisGlnAspAsnGluLysArg
O351450456504300391812607254865313110123714400861858321853886229681100182854651744405
N377698925453133265125919126316569043579237333708130545311038882510522
C59181398057833338103559092434541850116277854503513297761505882
S 146107
Total44826556840940341230197095481115514665223815722947675516371130203133714878303601550415809
O,N/Total0.870.880.880.860.850.770.890.950.940.850.970.950.780.880.970.990.980.970.900.94
C/Total0.130.120.120.140.030.170.110.050.060.150.030.050.220.120.030.010.020.030.100.06
S/Total 0.120.05
(B) 3-ring TWNs (Backbone)
IleValLeuPheCysMetAlaGlyThrTrpSerTyrProHisGlnAspAsnGluLysArg
O3514504565043003918126072548653495912375402276558321853306256563866500551744405
N377698925453133265125919126311419043579229857213058821038883625
C13253818613176590421088229538386156625268
Total390457687467347410571538868911155563213886394314460192189367270224804610561095098
O,N/Total1.001.000.990.990.990.990.980.950.990.990.990.990.980.980.990.990.990.990.990.99
C/Total0.000.000.010.010.010.010.020.050.010.010.010.010.020.020.010.010.010.010.010.01
(C) 3-ring TWNs (Side chain)
IleValLeuPheCysMetAlaGlyThrTrpSerTyrProHisGlnAspAsnGluLysArg
O 8151 89985853 580024025615223541
N 515 34353136 3649 79429897
C57878894256027325859 88233533047915327474122902737141453814
S 146107
Total578788942560173432859 903385093286332153241829348243151007424255939510711
O,N/Total 0.900.610.960.92 0.820.960.990.970.970.850.92
C/Total1.001.001.001.000.160.751.00 0.100.390.040.081.000.180.040.010.030.030.150.08
S/Total 0.840.25

Share and Cite

MDPI and ACS Style

Choi, K.-E.; Chae, E.; Balupuri, A.; Yoon, H.R.; Kang, N.S. Topological Water Network Analysis Around Amino Acids. Molecules 2019, 24, 2653. https://doi.org/10.3390/molecules24142653

AMA Style

Choi K-E, Chae E, Balupuri A, Yoon HR, Kang NS. Topological Water Network Analysis Around Amino Acids. Molecules. 2019; 24(14):2653. https://doi.org/10.3390/molecules24142653

Chicago/Turabian Style

Choi, Kwang-Eun, Eunkyoung Chae, Anand Balupuri, Hye Ree Yoon, and Nam Sook Kang. 2019. "Topological Water Network Analysis Around Amino Acids" Molecules 24, no. 14: 2653. https://doi.org/10.3390/molecules24142653

APA Style

Choi, K. -E., Chae, E., Balupuri, A., Yoon, H. R., & Kang, N. S. (2019). Topological Water Network Analysis Around Amino Acids. Molecules, 24(14), 2653. https://doi.org/10.3390/molecules24142653

Article Metrics

Back to TopTop