Next Article in Journal
Determination of Vitamin B3 Vitamer (Nicotinamide) and Vitamin B6 Vitamers in Human Hair Using LC-MS/MS
Previous Article in Journal
Cytotoxicity of Essential Oil Cordia verbenaceae against Leishmania brasiliensis and Trypanosoma cruzi
Previous Article in Special Issue
Spectral Features of Canthaxanthin in HCP2. A QM/MM Approach
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Accurate Prediction of Absorption Spectral Shifts of Proteorhodopsin Using a Fragment-Based Quantum Mechanical Method

1
Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China
2
NYU Shanghai, 1555 Century Avenue, Shanghai 200122, China
3
NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
4
Department of Chemistry, New York University, New York, NY 10003, USA
*
Author to whom correspondence should be addressed.
Molecules 2021, 26(15), 4486; https://doi.org/10.3390/molecules26154486
Submission received: 10 May 2021 / Revised: 19 July 2021 / Accepted: 20 July 2021 / Published: 25 July 2021
(This article belongs to the Special Issue Computational Spectroscopy of Protein Chromophores and Active Sites)

Abstract

:
Many experiments have been carried out to display different colors of Proteorhodopsin (PR) and its mutants, but the mechanism of color tuning of PR was not fully elucidated. In this study, we applied the Electrostatically Embedded Generalized Molecular Fractionation with Conjugate Caps (EE-GMFCC) method to the prediction of excitation energies of PRs. Excitation energies of 10 variants of Blue Proteorhodopsin (BPR-PR105Q) in residue 105GLN were calculated with the EE-GMFCC method at the TD-B3LYP/6-31G* level. The calculated results show good correlation with the experimental values of absorption wavelengths, although the experimental wavelength range among these systems is less than 50 nm. The ensemble-averaged electric fields along the polyene chain of retinal correlated well with EE-GMFCC calculated excitation energies for these 10 PRs, suggesting that electrostatic interactions from nearby residues are responsible for the color tuning. We also utilized the GMFCC method to decompose the excitation energy contribution per residue surrounding the chromophore. Our results show that residues ASP97 and ASP227 have the largest contribution to the absorption spectral shift of PR among the nearby residues of retinal. This work demonstrates that the EE-GMFCC method can be applied to accurately predict the absorption spectral shifts for biomacromolecules.

1. Introduction

The rhodopsin proteorhodopsin and related proteins have aroused continuous and extensive interest both experimentally and theoretically among researchers, especially during the past 20 years [1,2,3,4,5]. Rhodopsin is a seven transmembrane α-helices (TM) protein which uses retinal as a chromophore. Retinal, the aldehyde of vitamin A and a polyene chromophore, is derived from β-carotene and utilized in the all-trans/13-cis configurations in microbial rhodopsins allowing certain microorganisms to convert light into metabolic energy. 11-cis/all-trans retinal acts as the molecular basis of animal vision and is attached by a Schiff base linkage to the conserved residue lysine (LYS231) sidechain in the middle of TM7. The retinal Schiff base (RSB) is protonated (RSBH+) in most cases [6].
Proteorhodopsin(PR), as a member of the microbial rhodopsin family, is a light-driven protein found in marine proteobacteria. Due to the widespread global distribution of proteobacteria in sea water, PR may have an important effect on global solar energy input in the biosphere [7,8]. The maximum absorption wavelength of PR is tuned according to the depth at which the bacteria live [9,10]. It is well known that a shorter wavelength of the light corresponds to a higher energy and larger power to penetrate deeper water. To date, many experiments have been carried out to display different colors of PR and its mutants, but the mechanism of color tuning of PR is still not fully elucidated.
Borhan and co-workers performed spectral tuning of all-trans-retinal PRs, in which one or multiple residues were mutated by rational mutagenesis, enabling the absorption maximum of the pigment in the range of 425 to 644 nm [11]. In addition, for the study of single point mutation, previous works demonstrated that one of the determinants for color tuning of PR is at position 105, GLN in Blue-absorbing PR (BPR—PR105Q) and LEU in Green-absorbing PR (GPR—PR105L) [10]. Kandori and co-workers successfully introduced several amino acid mutations at position 105 of BPR (PR105Q) and investigated the absorption properties. High-performance liquid chromatography analysis showed that the isomeric composition of the all-trans form was greater than 70% for all mutants [12].
The effect of mutations on the optical properties of PRs is usually difficult to predict, and thus it would be desirable to employ theoretical calculations to aid in the rational design of PRs with tailored photo-physical properties [13]. Unfortunately, modeling the excitation energy and other excited-state properties of PRs is incredibly challenging due to the necessity of accurate quantum mechanical (QM) excited-state calculation (which scales steeply with the system size) and the large system size involved.
Recently, our group developed the Electrostatically Embedded Generalized Molecular Fractionation with Conjugate Caps (EE-GMFCC) method for quantitatively characterizing properties of proteins with localized excitations (i.e., involving a single chromophore). The excitation energy, transition dipole moment, and oscillator strength of wild-type Green Fluorescent Protein (GFP) calculated by EE-GMFCC were found to be in excellent agreement with results of full system time-dependent density functional theory (TDDFT) [13]. In this study, we applied the EE-GMFCC method for the accurate prediction of excitation energies of PRs. We hope that this method can be used to help investigate the mechanism of color tuning in PRs, and for the rational design of its mutagenesis.

2. Computational Approaches

The ground state of microbial rhodopsin possesses an all-trans configuration for its chromophore [6]. Our previous study demonstrated that the protein environment plays an important role in the excited-state properties of GFP [13]. We therefore expect a similar dependence on protein environment for the electronic properties of the PR chromophore. To handle the full complexities of the PR chromophore-protein interactions, we use Electrostatic Embedded (EE)-GMFCC, which is an extension of GMFCC to include many-body environment effects in each fragment QM calculation using embedding charges that represent the remaining fragments [14,15].

2.1. The EE-GMFCC Method for Excited State and Its Application for PR105Q

Before applying the EE-GMFCC method to absorption spectrum calculations of PRs, we give a brief description of this quantum fragmentation approach to large biomolecular systems. The EE-GMFCC method was initially developed for the total energy calculation on the ground state of large proteins [14,15,16]. In the EE-GMFCC framework, a protein with N residues is divided into N-2 fragments with each residue capped by its neighboring residues (conjugate caps) [14,15,17]. Each fragment’s energy is computed at the QM level. Interactions between fragments that are not directly bonded are handled by including two-body interactions in larger fragments formed by two residues in close contact within a distance cutoff. Typically, three-body and higher-order interactions within the EE-GMFCC scheme are small and can be neglected. The QM energies of the conjugated caps are deducted to avoid overcounting contributions from overlapping regions of the fragments.
The ground-state energy of a protein is calculated by EE-GMFCC as follows,
E EE GMFCC Ground   State = i = 2 N 1 E ˜ ( Cap i 1 A i Cap i + 1 ) i = 2 N 2 E ˜ ( Cap i Cap i + 1 ) + i = 1 N 3 j = i + 3 | R i j | λ 2 N ( E ˜ i j E ˜ i E ˜ j ) { i = 2 N 2 m k i n l i q m ( k i ) q n ( l i ) R m ( k i ) n ( l i ) i = 1 N 3 m i j = i + 3 | R i j | λ 2 N n j q m ( i ) q n ( j ) R m ( i ) n ( j ) }
where E ˜ denotes the sum of the self-energy of a fragment and the interaction energy between the fragment and its background charges. Cap i 1 A i Cap i + 1 represents the ith residue (Ai) covalently bonded with molecular caps of Cap i 1 and Cap i + 1 . The definition of molecular caps is given in ref. [14]. The first two sums calculate the one-body (1B) QM energy of the system, which accounts for the covalently-linked three neighboring residues. The third term in Equation (1) represents the two-body (2B) QM energy corrections, arising from the two non-covalently-linked residues that are spatially in close contact. The last two terms are applied to subtract the double-counted interaction between QM region and background charges in all 1B and 2B calculations. | R i j | is the minimum distance between any two atoms from residues i and j. λ 2 is the cutoff distance of 2B. q m ( k i ) represents the charge of the mth atom of fragment ki.   R m ( k i ) n ( l i ) represents the distance between atoms m ( k i ) and n ( l i ) . For a detailed description of the total ground-state energy calculation of proteins using the EE-GMFCC method, please refer to our previous work [14].
Recently, our group extended the ground state EE-GMFCC method to make it applicable for predicting the properties of excited states of the luminescent biomolecule GFP [13]. The calculation of excitation energy, transition dipole moment, and oscillator strength using EE-GMFCC showed good agreement with the corresponding full system QM results. In this study, based on our previous works [13,18,19,20,21], the three-body QM interaction term was neglected because it only slightly improves the accuracy but with substantially more computational time. Moreover, the cutoff distance threshold for the two-body QM interactions was set to 4 Å to strike a compromise between attained accuracy and computational cost.
In its current formulation, the EE-GMFCC method is appropriate only for local excitations, wherein the dominant electronic response following excitation is localized on a single molecular unit, i.e. a chromophore. Such a local excitation can be defined by the excited state of the chromophore in isolation being qualitatively similar to its excited state in the protein environment. We expect this to be the case for PRs, because its non-standard residue LYR231 has been identified as the chromophore responsible for light absorption.
After we define the locally excited region, fragments including this region are defined as excited fragments (EF), whereas fragments excluding the excited region are defined as unexcited fragments (UEF). Then the total excited-state energy of the protein can be described as the summation of the excited state energy E EE GMFCC Excited State ( EF ) from the contributions of excited fragments and the ground-state energy E EE GMFCC Excited State ( UEF ) from unexcited fragments as follows:
E EE GMFCC Excited   State ( E F ) = i = m 1 m + 1 E ˜ ( Cap i 1 A i Cap i + 1 )   i = m 1 m E ˜ ( Cap i Cap i + 1 ) + j = 1 | R m j | λ 2 j [ m 2 , m + 2 ] N ( E ˜ m j E ˜ m E ˜ j )   { i = 2 N 2 m 0 k i n l i q m 0 ( k i ) q n ( l i ) R m 0 ( k i ) n ( l i ) i = 1 N 3 m i j = i + 3 | R i j | λ 2 N n j q m ( k i ) q n ( l i ) R m ( k i ) n ( l i ) }
Most denotations in Equation (2) (the EE-GMFCC method for excited state) are similar to the ground state calculation. Here we just point out the key differences. The superscript “prime” represents the excited-state energy of the subsystem containing the localized excitation region (residue m). In addition, for the unexcited fragments, the energy calculation is the same as the ground state. Then the total EE-GMFCC energy at the excited state is given by:
E EE GMFCC Excited   State = E EE GMFCC Excited   State ( EF ) + E EE GMFCC Excited   State ( UEF )
Here, we assume that the atomic charges of the protein at the excited state are approximated to be the same as those at the ground state. We subtract Equation (1) from Equation (3) to obtain the excitation energy of the protein based on the EE-GMFCC approach. The final expression of the EE-GMFCC excitation energy is as follows:
ω = i = m 1 m + 1 ω ( Cap i 1 A i Cap i + 1 ) i = m 1 m ω ( Cap i Cap i + 1 ) + j = 1 | R m j | λ 2 j [ m 2 , m + 2 ] N ( ω m j ω m )
In this study, m is the residue number of chromophore LYR. ω stands for the excitation energy. The QM region is given in the brackets including two kinds of fragments, namely Cap i 1 A i Cap i + 1 and Cap i Cap i + 1 , whereas the remaining part of the protein was represented by background atomic charges. The QM calculations were performed using the Terachem package [22,23,24]. Cap i 1 A i Cap i + 1 represents the QM region of the ith residue (Ai) covalently bonded with molecular caps of Cap i 1 and Cap i + 1 . The definition of molecular caps is given in ref. [14]. The first two summations of Equation (4) calculate the one-body (1B) QM contributions to the total excitation energy of the system.
There are two kinds of fragment interactions in Equation (4), which includes one-body (1B) and two-body (2B) QM interaction terms. The one-body (1B) term represents the fragment with sequentially connected two residues or three residues, whereas the two-body (2B) QM interaction term represents the interaction between two non-neighboring residues that are spatially in close contact within a distance threshold. In this study, for PRs, the 1B term includes the chromophore LYR231 and its neighboring residues, namely, Fragment(230), Fragment(231), Fragment(232), Concap(230), and Concap(231), where Fragment(230), Fragment(231), and Fragment(232) contain residues 229–231, 230–232, and 231–233, respectively, and Concap(230) and Concap(231) contain residues 230–231 and 231–232, respectively (see Figure 1 and Figure S1 of the Supplementary Materials). The 2B term in this study represents the QM interaction between LYR231 and non-neighboring residue that is within 4 Å of LYR231. Therefore, Equation (4) becomes:
ω = ω 1 B + ω 2 B = ω ( Fragment ( 230 ) ) + ω ( Fragment ( 231 ) ) + ω ( Fragment ( 232 ) ) ω ( Concap ( 230 ) ) ω ( Concap ( 231 ) ) + j = 1 | R 231 , j | 4 Å j [ 229 , 233 ] N ( ω 231 , j ω 231 )
where ω 1 B and ω 2 B denote the 1B and 2B QM interactions, respectively. The last term j = 1 | R 231 , j | 4 Å j [ 229 , 233 ] N ( ω 231 , j ω 231 ) is the sum of 2B QM corrections, namely, the excitation energy between LYR231 and its non-neighboring residue j, if the distance between any pair of atoms from LYR231 and residue j is less than or equal to 4 Å. In the 1B term, the interaction between the chromophore and non-neighboring residues is described by the embedding electrostatic charges. The interactions between the chromophore and non-neighboring residues in spatially close contact are subjected to quantum mechanical treatment in the 2B QM calculations.

2.2. Structure Preparation and Molecular Dynamics (MD) Simulation

2.2.1. Homology Modelling

In this work, the structure of Blue-absorbing PR (BPR—PR105Q) was prepared by means of a hybrid Quantum Mechanics/Molecular Mechanics (QM/MM) approach and MD simulation [25]. The initial structure was taken from the X-ray crystal structure of BPR (PR105Q) in the dark state (PDB id: 4JQ6, chain B), which we chose as the template of PR105Q (PR with GLN105). The MODELLER [26] software, was utilized for homology modelling of PR105Q. The alignment of PR105Q and chain B of 4JQ6 was performed using TM-align [27]. Residue LYS231 and retinal were combined in a protonated retinal Schiff base (RSBH+) linkage as a non-standard residue in the Amber package [28,29], which we named LYR231 in this work. The structure of BPR (PR105Q) and LYR231 are shown in Figure 2. The distances between the chromophore and all residues are given in Supplementary Figure S2 of the Supplementary Materials.

2.2.2. Force Field Construction of the Non-Standard Residue LYR231

To make direct comparison to experiment, absorption spectra of the wild-type protein and several other mutated proteins were computed under high pH conditions, such that the ASP97 residue near the Schiff base was deprotonated. For the structure of chromophore, a LYS residue jointed with retinal (named LYR) was formed by the Schiff base at the connection site. The isomeric all-trans configuration occupied more than 70% for almost all of the mutant proteins. Therefore, we chose the all-trans isomer as the initial structure of LYR (see Figure 2) [12].
We set the residue ASP97 at the deprotonated state. The experimental value of the absorption peak compared in this study was the measured value when the protein was solvated in high pH solution. The Amber18 program was utilized to obtain the force-field parameters [29]. The parameters of chromophore were created using the Generalized Amber Force Field (GAFF) [30]. The Amber ff14SB force field and the TIP3P water model [31] were used for other parts of the protein [32] and water, respectively. Force-field parameters of non-standard residue (LYR231, the chromophore) were obtained from the ANTECHAMBER module with the AM1-BCC charge model [33] using the semi-empirical quantum mechanics (sqm) method [34].

2.2.3. Molecular Dynamics (MD) Simulation

The initial protein structure was solvated in a cubic periodic box of TIP3P water molecules with each side at least 10 Å from the nearest solute atom, and the total size of the cubic box was 87.298 × 71.649 × 57.894 Å3. The counter ions Na+ were added to neutralize the entire system according to its total net charge. The molecular structure of PR105Q was optimized under molecular mechanics using two steps. First, the protein was restrained and all other molecules were relaxed. Next, the entire system was energy minimized. Each minimization procedure consisted of 10,000 steps of the steepest descent optimization, followed by 40,000 steps of the conjugate gradient optimization approach.
We used a set of configurations extracted from simulations to compute the averaged absorption spectra using QM calculations. The approach based on the sequential use of simulations and quantum mechanical calculations (denominated Sequential QM/MM) has been widely used to calculate spectroscopic properties of molecules in liquid environments [35]. A wide variety of works use this sequential method to predict absorption spectra and other properties, such as NMR and emission spectra, supplying converged results similar to the experimental results.
To obtain the ensemble-average absorption spectra, conformations of PR105Q were extracted from Quantum Mechanics/Molecular Mechanics (QM/MM) molecular dynamics (MD) simulation trajectories [28,36,37]. The QM/MM MD simulation of PR was carried out using Amber18 [38] interfaced to the Gaussian09 package [39]. The QM part, consisting of the residue LYR and residues 105 and 200, was treated with the M06-2X functional in conjunction with the 6-31G* basis set [40].
Schapiro and co-workers studied the initial excited state dynamics of GPR, and their simulations indicated that the retinal-TYR200 interaction played an important role in the outcome of the photo isomerization [25]. In this study, during classical MD simulation, the fluctuation of TYR200 had a significant influence on the excitation energy of the chromophore LYR231. Therefore, we performed MD simulation of PR using the QM/MM method, and TYR200 was included in the QM region.
The detailed procedure of MD simulation is given as follows. First, the optimized system was heated to 300 K in 50 ps. Secondly, a 500 ps equilibration run with classical MD was carried out before the final 10 ps QM/MM MD simulation for a production run at 300 K [41]. Langevin dynamics with a collision frequency of 1.0 ps−1 was used to control the temperature. A 25 Å cutoff was applied to the QM/MM electrostatic interactions, and the SHAKE algorithm was applied to restrain bonds with hydrogen atoms.
The QM/MM MD simulations were performed for PR105Q and nine other mutants [28,29]. A total of 100 snapshots were evenly extracted from the trajectory of the last ps QM/MM MD simulation with a time interval of 10 fs. Each conformation of the selected 100 snapshots was subsequently calculated by the EE-GMFCC method to obtain the vertical excitation energy (Equation (5)) [13]. Here we chose the B3LYP density functional in TDDFT calculations [42].

2.2.4. Key Residue Mutation and Fragment-Based QM Calculations for Excitation Energies

Residue 105 plays an important role in determining the color of PR, where BPR (PR105Q) and GPR (PR105L) have GLN and LEU at this position, respectively [12]. Accurate predictions of absorption spectral shifts upon point mutations are critical to the rational mutagenesis design of PR [11,43,44,45]. Here, residue 105 is a vital residue that affects the absorption spectral shift of PR. Kandori and co-workers carried out an experimental investigation in which several different point mutations were introduced for the color determining residue 105 [12].
For the QM calculations, water molecules were removed from the conformation of MD simulation trajectory. The excitation energy of the protein was calculated with the EE-GMFCC method at the TD-B3LYP/6-31G* level [46,47] using Equation (5).
Figure 3 shows the work flow of the complete computational protocol utilized in this study for model construction, MD simulation, and excitation energy calculations. We used Ambertools to complete mutation operation for PR. First, we removed sidechain atoms of residue 105, while the backbone atoms (C, N, CA, O) of residue 105 and other parts of the protein were reserved. Second, we changed the residue name of R105 to the name of the residue to be mutated. Third, the sidechain atoms of the new residue were added to the backbone of R105 by the Amber program. Finally, the added residue was energy minimized to avoid unreasonable repulsive interactions with nearby residues. The minimization process was undertaken using Amber18 [28].

3. Results and Discussion

3.1. Comparison between Calculated Excitation Energies and Experiment for Wild-Type PR and Its Mutants

From experimental investigation, residue 105 is an important amino acid that influences the absorption spectrum of the chromophore in PR. In this study, we mutated residue 105 to other amino acids for predicting the absorption spectral shift. Vertical excitation energies were calculated on mutated PRs using the EE-GMFCC method. Predicted excitation energies were compared with corresponding experimental absorption peaks.
Figure 4 shows that the correlation coefficient (R) between the EE-GMFCC results (1B + 2B) and experimental values is 0.937, and the equation of linear regression is y = 0.995x + 4.718. In contrast, the results given by EE-GMFCC without two-body (2B) QM interaction corrections yielded a worse agreement with the experiment. The correlation coefficient given by EE-GMFCC with 1B correction is merely 0.710 (see Figure 4a), indicating that, perhaps unsurprisingly, the residues which are spatially in close contact with the central chromophore have the greatest impact on the excitation energy of the chromophore in PR. From Table 1, we can see that the mean unsigned error (MUE) of the predicted excitation energies of those 10 systems decreases from 8.0 to 3.5 nm calculated by EE-GMFCC from 1B to 1B + 2B, compared to experimental values. This is direct evidence that the absorption wavelength of the chromophore is affected by its surrounding chemical environment.
One hundred snapshots extracted from the last ps of QM/MM MD trajectory were selected to calculate the average excitation energy of PR. The excitation energy distribution of these 100 conformations of PR105D is shown in Figure 5. As shown in Figure 5, the conformations with the predicted excitation energy close to the experimental absorption peak have the largest population. The most probable absorption wavelength is almost equal to the average value, which is in good agreement with the experimental data [12]. Excitation energy distributions for the other nine mutants are given in Supplementary Figure S4 of the Supplementary Materials.

3.2. Residue-Based Decomposition of Excitation Energies

Next, we studied the excitation energy contribution from each residue around the chromophore. To avoid interference from embedding charges, we used the GMFCC scheme, which turns off the background charges in each fragment QM calculation, because 1B and 2B QM interactions in EE-GMFCC include many-body environmental effects through electrostatic embedding.
The GMFCC scheme leads directly to the excitation energy contribution from each residue around the chromophore LYR231 by subtracting the single chromophore excitation energy from the two-body (residue-chromophore) excitation energy [48]. Our previous work used the GMFCC method to provide a qualitative prediction of the relative shift that each residue contributes to the excitation energy of the GFP chromophore [13]. Here, we applied the same approach, in which the 2B distance threshold was set to 4 Å between the chromophore and nearby residues. The excitation energy contribution of each residue around the chromophore was calculated by GMFCC based on 100 snapshots from QM/MM MD simulation. The excitation energies were also calculated at the TD-B3LYP/6-31G* level for consistency.
The per-residue decomposition of the average excitation energy of the PR105D protein is shown in Table 2 and Figure 6. Residues spatially close to chromophore had the greatest influence on the excitation energy of the protein. ASP97, ASP105, and ASP227 yield the largest blue shifts, whereas MET134, PHE152, and LEU135 show red shifts of PR105D. As shown in Figure 6, the most blue-shifted residue is ASP97, yielding a wavelength shift of −59.5 nm (+267.9 meV), compared to the chromophore alone, whereas the most red-shifted residue is MET134, with a wavelength shift of +6.6 nm (−26.3 meV).

3.3. The Local Electric Field along the Retinal

It is known that the polyene chain of the retinal chromophore has considerable charge-transfer character in its lowest excited state, which we confirm below. It is therefore reasonable to expect that point mutations of the protein can modulate the excitation energy of the retinal through changes in the electrostatic field. To test this idea, we investigated the electric field along the polyene chain under different point mutations. A measure of the electric field along the polyene chain is given by [49,50,51]:
E c 10 c 1 = i = 1 , i L Y R N 1 4 π ε ( q i | r i r c 10 | q i | r i r c 1 | ) | r c 10 r c 1 |
where E c 10 c 1 is the electric field induced by protein residues from C1 to C10 on the conjugate chain, which approximates the electric field along the retinal chain (see Figure 2). The term i LYR denotes that charges of residue LYR231 were excluded from the calculation of electric field. q i represents the atomic charge of the ith atom, r i denotes the coordinate vector of ith atom, and | r i r c 10 | represents for the distance between atom i and atom C10.
The 1B excitation energies calculated by the EE-GMFCC method show good correlation with the average electric field based on the Amber ff14SB force field [32] and polarized protein-specific charge (PPC) [49,52,53,54] models for the 100 snapshots extracted from the QM/MM MD simulation (see Table 3 and Figure 7). The correlation coefficient (R) between 1B excitation energies and the electric fields from the Amber ff14SB charge model is 0.829, and the equation of linear regression is y = 39.5x − 77.1. In comparison, the correlation coefficient (R) between 1B excitation energies and the electric fields from the PPC charge model is 0.771, and the equation of linear regression is y = 39.7x − 76.8. The correlations between the average electric fields (with the Amber and PPC charge models) and 2B excitation energies calculated by EE-GMFCC are shown in Supplementary Figure S7 of the Supplementary Materials, and the correlations between the average electric fields (with the Amber and PPC charge models) and experimental excitation energies are shown in Supplementary Figure S8 of the Supplementary Materials.
Figure 8 shows that the HOMO magnitude decreases from C10 to C1, whereas the LUMO magnitude decreases in the reverse direction to that of the HOMO. From natural transition orbital (NTO) analysis, the HOMO and LUMO are the dominant orbitals involved in this excitation. The electron excitation from the HOMO to LUMO of the chromophore thus has considerable charge-transfer character. Based on the direction of charge transfer along the polyene chain, we expect that increasing the environmental electric field along the conjugate chain from C10 to C1 will result in a blue shift of the electronic excitation [43]. Conversely, decreasing the magnitude of the electric field will cause a red shift of the electronic excitation. This expectation is borne out in our results: Figure 7 shows that the largest electric field between C10 and C1, with a value of 22.7 MV/cm (with the Amber force field), is observed for PR105D with a corresponding 1B excitation energy of 2.509 eV, which is larger than PR105L with 19.8 MV/cm. Similarly, the smallest electric field of 13.2 MV/cm is observed for PR105K, which has the smallest 1B excitation energy of 2.315 eV. A higher strength of the electric field corresponds to a higher excitation energy, which yields a blue shift (see Figure 9).
As shown in Figure 9, the electric fields of PR105D (22.7 MV/cm), PR105L (19.8 MV/cm), PR105V (17.0 MV/cm), and PR105K (13.2 MV/cm) decrease gradually as the excitation energy progressively decreases in the order of 2.509, 2.396, 2.390, and 2.315 eV. As discussed above, there is a certain linear correlation between the electric field and excitation energy. It is worth noting that a prediction of the excitation energy change upon point mutation based on an electric field calculation is much faster compared to the ab initio EE-GMFCC calculations, although the correlation is not perfect in this study. The possible cause of such deviation might arise from the inaccuracy of the charge model used in the electric field calculations. To investigate the effect of the charge model on the electric field calculations of PRs, we tested the PPC charge model for comparison with the Amber ff14SB charge model [32]. The PPC model takes the polarization-induced effect of the protein into consideration by assigning a polarized atomic charge to each atom using the self-consistent RESP [57,58] fitting scheme for each amino acid, with the rest of the protein acting as an electrostatic field. Here, we utilized the PPC method to refit the atomic charges of the protein, and applied the PPC charge model to the PR system for each electric field calculation. Further details of the PPC charge fitting scheme are provided in Ref. [59]. The fitted PPC charges replace the original Amber charges for each atom of the PR systems, and the electric field is calculated using Equation (6). The results of EE-GMFCC-2B and a comparison of performance between Amber and PPC charge models are given in Supplementary Figures S7 and S8 of the Supplementary Materials. In general, the performance of the PPC model is not superior to the result predicted by the Amber ff14SB charge model, indicating that a more sophisticated method might need to be applied to make electric field-excitation energy correlations for quantitative accuracy for PR systems.

4. Conclusions

In this study, we applied the Electrostatically Embedded Generalized Molecular Fractionation with Conjugate Caps (EE-GMFCC) method to predict the excitation energy of PRs. Excitation energies of wild-type PR and its nine mutants were calculated with the EE-GMFCC method at the TD-B3LYP/6-31G* level over hundreds of thermally sampled snapshots from ab initio QM/MM molecular dynamics simulations.
The calculated excitation energies show good correlation with the experimental values of absorption wavelengths despite the fact that the experimental wavelengths among these ten systems vary by less than 50 nm. The correlation coefficient (R) between the EE-GMFCC results (1B + 2B) and experimental values is 0.937. In contrast, the results calculated by EE-GMFCC without two-body QM interaction corrections yield poorer agreement with the experiment. The correlation coefficient given by EE-GMFCC with 1B corrections was merely 0.710, indicating that the residues which are spatially in close contact with the central chromophore have the greatest impact on the excitation energy of the chromophore in PR.
We also utilized the GMFCC method to decompose the excitation energy contributions of residues near the chromophore. The most blue-shifting residue is ASP97, which yields a −59.5 nm (+267.9 meV) wavelength shift in average, whereas the most red-shifting residue is MET134 with a +6.6 nm (−26.3 meV) wavelength shift. The overall spectral shift of the 2B QM correction on PR mutants was small, mainly due to a cancellation between blue-shifting and red-shifting residues.
The calculated excitation energies using the EE-GMFCC method with 1B corrections show good correlations with the predicted average electric field using the Amber and PPC charge models, and the correlation coefficients (R) between them are 0.829 and 0.771, respectively. Predicting the excitation energy change based on the average electric field could be an alternative and efficient approach for the rational design of PRs with tailored photo-physical properties. Overall, our results demonstrate that the EE-GMFCC method is a useful tool for accurately and efficiently predicting the excited-state properties of large biological systems.
In this work, a relatively short 1 ps of trajectory data was analyzed because of the two following problems. First, MD simulations using classical force fields do not fully sample the correct configurations of the retinal structure due to the low accuracy of the force field, and we found that the predicted absorption wavelengths of incorrect retinal conformations deviate substantially from the experimental values; thus, a QM/MM approach was critical. Second, QM/MM MD simulations with more than 100 atoms in the QM region are very time consuming, limiting us to trajectories of 10 ps in length, which left 1 ps of production data following equilibration. Although it would have been ideal to use more uncorrelated points for the computation of the excitation energy, given the close comparison with the experiment and the low fluctuations observed, we think that this sampling is sufficient to showcase our methodology.
It is also worth noting that the 1-ps QM/MM MD trajectory that we use for analysis will not sample long timescale protein fluctuations [60]; however, the good agreement found between theoretical excitation energies and experimental measurements suggests that >1-ps fluctuations have a minor influence on the average absorption energy. This finding is in contrast to solvated chromophores, which can have large couplings between the excitation energy and >1-ps solvation dynamics [61,62,63,64]. The origin of this disparity could be due to a relatively conserved environment of the chromophore in the protein matrix, unlike in solvent, although we cannot fully rule out a fortuitous agreement of our results with experiment. The role of long-time fluctuations of the protein on the excitation energies of the PR chromophore is an interesting open question that we hope to address in the future.

Supplementary Materials

The following are available online. The fragmentation scheme of the EE-GMFCC method; Analysis of the protein structure; Comparison between the EE-GMFCC and traditional QM/MM calculations; The convergence test of the EE-GMFCC calculations; Excitation energy distributions of 100 conformations of PR105Q and its nine mutants; Correlations between the calculated electric fields (with the Amber and PPC charge models) and excitation energies calculated by EE-GMFCC, and experimental excitation energies.

Author Contributions

Conceptualization, X.H. and W.J.G.; methodology, X.H. and W.J.G.; software, X.J. and C.S.; validation, C.S.; formal analysis, C.S., X.H. and W.J.G.; investigation, C.S., X.H. and W.J.G.; resources, X.H. and W.J.G.; data curation, C.S. and X.H.; writing—original draft preparation, C.S.; writ-ing—review and editing, C.S., X.H. and W.J.G.; visualization, C.S. and X.H.; supervision, X.H. and W.J.G.; project administration, X.H. and W.J.G.; funding acquisition, X.H. and W.J.G.. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key R&D Program of China (Grant numbers 2016YFA0501700, and 2019YFA0905201), National Natural Science Foundation of China (Grant numbers 21922301, 21761132022, 21673074, and 21603145), National Natural Science Foundation of China International Young Scientist Fund (Grant numbers 21750110439 and 21851110758), Shanghai Municipal Natural Science Foundation (Grant number 18ZR1412600), and the Fundamental Research Funds for the Central Universities. WJG thanks NYU Shanghai for startup funds.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available in Supplementary Material.

Acknowledgments

We also thank the Supercomputer Center of East China Normal University (ECNU Multifunctional Platform for Innovation 001) and NYU Shanghai for providing computer resources.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Ikuta, T.; Shihoya, W.; Sugiura, M.; Yoshida, K.; Watari, M.; Tokano, T.; Yamashita, K.; Katayama, K.; Tsunoda, S.P.; Uchihashi, T.; et al. Structural insights into the mechanism of rhodopsin phosphodiesterase. Nat. Commun. 2020, 11, 5605–5616. [Google Scholar] [CrossRef]
  2. Hontani, Y.; Broser, M.; Luck, M.; Weissenborn, J.; Kloz, M.; Hegemann, P.; Kennis, J.T.M. Dual Photoisomerization on Distinct Potential Energy Surfaces in a UV-Absorbing Rhodopsin. J. Am. Chem. Soc. 2020, 142, 11464–11473. [Google Scholar] [CrossRef]
  3. Lienard, M.A.; Bernard, G.D.; Allen, A.; Lassance, J.M.; Song, S.; Childers, R.R.; Yu, N.; Ye, D.; Stephenson, A.; Valencia-Montoya, W.A.; et al. The evolution of red color vision is linked to coordinated rhodopsin tuning in lycaenid butterflies. Proc. Natl. Acad. Sci. USA 2021, 118, 2008986118. [Google Scholar] [CrossRef]
  4. Han, C.T.; Song, J.; Chan, T.; Pruett, C.; Han, S. Electrostatic Environment of Proteorhodopsin Affects the pKa of Its Buried Primary Proton Acceptor. Biophys. J. 2020, 118, 1838–1849. [Google Scholar] [CrossRef]
  5. Hirschi, S.; Kalbermatter, D.; Ucurum, Z.; Fotiadis, D. Cryo-electron microscopic and X-ray crystallographic analysis of the light-driven proton pump proteorhodopsin reveals a pentameric assembly. J. Struct. Biol. X 2020, 4, 100024. [Google Scholar] [CrossRef]
  6. Ernst, O.P.; Lodowski, D.T.; Elstner, M.; Hegemann, P.; Brown, L.S.; Kandori, H. Microbial and animal rhodopsins: Structures, functions, and molecular mechanisms. Chem. Rev. 2014, 114, 126–163. [Google Scholar] [CrossRef]
  7. Béja, O.; Spudich, E.N.; Spudich, J.L.; Leclerc, M.; DeLong, E.F. Proteorhodopsin phototrophy in the ocean. Nature 2001, 411, 786. [Google Scholar] [CrossRef] [PubMed]
  8. Bamann, C.; Bamberg, E.; Wachtveitl, J.; Glaubitz, C. Proteorhodopsin. Biochim. Biophys. Acta Bioenerg. 2014, 1837, 614–625. [Google Scholar] [CrossRef] [Green Version]
  9. Béja, O.; Aravind, L.; Koonin, E.V.; Suzuki, M.T.; Hadd, A.; Nguyen, L.P.; Jovanovich, S.B.; Gates, C.M.; Feldman, R.A.; Spudich, J.L. Bacterial rhodopsin: Evidence for a new type of phototrophy in the sea. Science 2000, 289, 1902–1906. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  10. Man, D.; Wang, W.; Sabehi, G.; Aravind, L.; Post, A.F.; Massana, R.; Spudich, E.N.; Spudich, J.L.; Béjà, O. Diversification and spectral tuning in marine proteorhodopsins. EMBO J. 2003, 22, 1725–1731. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  11. Wang, W.; Nossoni, Z.; Berbasova, T.; Watson, C.T.; Yapici, I.; Lee, K.S.S.; Vasileiou, C.; Geiger, J.H.; Borhan, B. Tuning the electronic absorption of protein-embedded all-trans-retinal. Science 2012, 338, 1340–1343. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Ozaki, Y.; Kawashima, T.; Abe-Yoshizumi, R.; Kandori, H. A color-determining amino acid residue of proteorhodopsin. Biochemistry 2014, 53, 6032–6040. [Google Scholar] [CrossRef] [PubMed]
  13. Jin, X.; Glover, W.J.; He, X. Fragment Quantum Mechanical Method for Excited States of Proteins: Development and Application to the Green Fluorescent Protein. J. Chem. Theory Comput. 2020, 16, 5174–5188. [Google Scholar] [CrossRef] [PubMed]
  14. Wang, X.; Liu, J.; Zhang, J.Z.; He, X. Electrostatically embedded generalized molecular fractionation with conjugate caps method for full quantum mechanical calculation of protein energy. J. Phys. Chem. A 2013, 117, 7149–7161. [Google Scholar] [CrossRef] [PubMed]
  15. Jin, X.; Zhang, J.Z.; He, X. Full QM Calculation of RNA Energy Using Electrostatically Embedded Generalized Molecular Fractionation with Conjugate Caps Method. J. Phys. Chem. A 2017, 121, 2503–2514. [Google Scholar] [CrossRef]
  16. He, X.; Zhu, T.; Wang, X.; Liu, J.; Zhang, J.Z. Fragment quantum mechanical calculation of proteins and its applications. Acc. Chem. Res. 2014, 47, 2748–2757. [Google Scholar] [CrossRef]
  17. Wang, Y.; Liu, J.; Li, J.; He, X. Fragment-based quantum mechanical calculation of protein-protein binding affinities. J. Comput. Chem. 2018, 39, 1617–1628. [Google Scholar] [CrossRef]
  18. Liu, J.; Sun, H.; Glover, W.J.; He, X. Prediction of Excited-State Properties of Oligoacene Crystals Using Fragment-Based Quantum Mechanical Method. J. Phys. Chem. A 2019, 123, 5407–5417. [Google Scholar] [CrossRef]
  19. Zhang, W.; Liu, J.; Jin, X.; Gu, X.; Zeng, X.C.; He, X.; Li, H. Quantitative Prediction of Aggregation-Induced Emission: A Full Quantum Mechanical Approach to the Optical Spectra. Angew. Chem. Int. Ed Engl. 2020, 2–8. [Google Scholar] [CrossRef]
  20. Liu, J.; He, X. Fragment-based quantum mechanical approach to biomolecules, molecular clusters, molecular crystals and liquids. Phys. Chem. Chem. Phys. 2020, 22, 12341–12367. [Google Scholar] [CrossRef]
  21. Wang, Z.; Han, Y.; Li, J.; He, X. Combining the Fragmentation Approach and Neural Network Potential Energy Surfaces of Fragments for Accurate Calculation of Protein Energy. J. Phys. Chem. B 2020, 124, 3027–3035. [Google Scholar] [CrossRef]
  22. Isborn, C.M.; Luehr, N.; Ufimtsev, I.S.; Martinez, T.J. Excited-State Electronic Structure with Configuration Interaction Singles and Tamm-Dancoff Time-Dependent Density Functional Theory on Graphical Processing Units. J. Chem. Theory Comput. 2011, 7, 1814–1823. [Google Scholar] [CrossRef] [PubMed]
  23. Ufimtsev, I.S.; Martinez, T.J. Quantum Chemistry on Graphical Processing Units. 3. Analytical Energy Gradients, Geometry Optimization, and First Principles Molecular Dynamics. J. Chem. Theory Comput. 2009, 5, 2619–2628. [Google Scholar] [CrossRef] [PubMed]
  24. Titov, A.V.; Ufimtsev, I.S.; Luehr, N.; Martinez, T.J. Generating Efficient Quantum Chemistry Codes for Novel Architectures. J. Chem. Theory Comput. 2013, 9, 213–221. [Google Scholar] [CrossRef] [PubMed]
  25. Borin, V.A.; Wiebeler, C.; Schapiro, I. A QM/MM study of the initial excited state dynamics of green-absorbing proteorhodopsin. Faraday Discuss. 2018, 207, 137–152. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Eswar, N.; Eramian, D.; Webb, B.; Shen, M.-Y.; Sali, A. Protein structure modeling with MODELLER. In Structural Proteomics; Springer: San Francisco, CA, USA, 2008; pp. 145–149. [Google Scholar]
  27. Zhang, Y.; Skolnick, J. TM-align: A protein structure alignment algorithm based on the TM-score. Nucleic Acids Res. 2005, 33, 2302–2309. [Google Scholar] [CrossRef]
  28. Case, D.A.; Cheatham, T.E., 3rd; Darden, T.; Gohlke, H.; Luo, R.; Merz, K.M., Jr.; Onufriev, A.; Simmerling, C.; Wang, B.; Woods, R.J. The Amber biomolecular simulation programs. J. Comput. Chem. 2005, 26, 1668–1688. [Google Scholar] [CrossRef] [Green Version]
  29. Salomon-Ferrer, R.; Case, D.A.; Walker, R.C. An overview of the Amber biomolecular simulation package. WIREs. Comput. Mol. Sci. 2013, 3, 198–210. [Google Scholar] [CrossRef]
  30. Wang, J.; Wolf, R.M.; Caldwell, J.W.; Kollman, P.A.; Case, D.A. Development and testing of a general amber force field. J. Comput. Chem. 2004, 25, 1157–1174. [Google Scholar] [CrossRef] [PubMed]
  31. Price, D.J.; Brooks, C.L., III. A modified TIP3P water potential for simulation with Ewald summation. J. Chem. Phys. 2004, 121, 10096–10103. [Google Scholar] [CrossRef]
  32. Maier, J.A.; Martinez, C.; Kasavajhala, K.; Wickstrom, L.; Hauser, K.E.; Simmerling, C. ff14SB: Improving the Accuracy of Protein Side Chain and Backbone Parameters from ff99SB. J. Chem. Theory Comput. 2015, 11, 3696–3713. [Google Scholar] [CrossRef] [Green Version]
  33. Jakalian, A.; Bush, B.L.; Jack, D.B.; Bayly, C.I. Fast, Efficient Generation of High-Quality Atomic Charges. AM1-BCC Model: I. Method. J. Comput. Chem. 2000, 21, 132–146. [Google Scholar] [CrossRef]
  34. Walker, R.C.; Crowley, M.F.; Case, D.A. The implementation of a fast and accurate QM/MM potential method in Amber. J. Comput. Chem. 2008, 29, 1019–1031. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Coutinho, K.; Canuto, S. Advances in Quantum Chemistry; Elsevier: Amsterdam, The Netherlands, 1997; Volume 28, pp. 89–105. [Google Scholar]
  36. Tao, P.; Fisher, J.F.; Shi, Q.; Vreven, T.; Mobashery, S.; Schlegel, H.B. Matrix metalloproteinase 2 inhibition: Combined quantum mechanics and molecular mechanics studies of the inhibition mechanism of (4-phenoxyphenylsulfonyl)methylthiirane and its oxirane analogue. Biochemistry 2009, 48, 9839–9847. [Google Scholar] [CrossRef] [Green Version]
  37. Van der Kamp, M.W.; Mulholland, A.J. Combined quantum mechanics/molecular mechanics (QM/MM) methods in computational enzymology. Biochemistry 2013, 52, 2708–2728. [Google Scholar] [CrossRef]
  38. Case, D.A.; Ben-Shalom, I.Y.; Brozell, S.R.; Cerutti, D.S.; Cheatham, T.E.; Cruzeiro, V.W.D.; Darden, T.A.; Duke, R.E.; Ghoreishi, D.; Gilson, M.K.; et al. AMBER18; University of California: San Francisco, CA, USA, 2018. [Google Scholar]
  39. Frisch, M.J.; Trucks, G.W.; Schlegel, H.B.; Scuseria, G.E.; Robb, M.A.; Cheeseman, J.R.; Scalmani, G.; Barone, V.; Mennucci, B.; Petersson, G.A.; et al. Gaussian 09, RevisionA.01; Gaussian, Inc.: Wallingford, CT, USA, 2009. [Google Scholar]
  40. Zhao, Y.; Truhlar, D.G. The M06 suite of density functionals for main group thermochemistry, thermochemical kinetics, noncovalent interactions, excited states, and transition elements: Two new functionals and systematic testing of four M06-class functionals and 12 other functionals. Theor. Chem. Acc. 2007, 120, 215–241. [Google Scholar] [CrossRef] [Green Version]
  41. Adcock, S.A.; McCammon, J.A. Molecular Dynamics: Survey of Methods for Simulating the Activity of Proteins. Chem. Rev. 2006, 106, 1589–1615. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  42. Huang, S.; Zhang, Q.; Shiota, Y.; Nakagawa, T.; Kuwabara, K.; Yoshizawa, K.; Adachi, C. Computational Prediction for Singlet- and Triplet-Transition Energies of Charge-Transfer Compounds. J. Chem. Theory Comput. 2013, 9, 3872–3877. [Google Scholar] [CrossRef] [PubMed]
  43. Cheng, C.; Kamiya, M.; Uchida, Y.; Hayashi, S. Molecular Mechanism of Wide Photoabsorption Spectral Shifts of Color Variants of Human Cellular Retinol Binding Protein II. J. Am. Chem. Soc. 2015, 137, 13362–13370. [Google Scholar] [CrossRef]
  44. Chen, T.; Zheng, L.; Yuan, J.; An, Z.; Chen, R.; Tao, Y.; Li, H.; Xie, X.; Huang, W. Understanding the Control of Singlet-Triplet Splitting for Organic Exciton Manipulating: A Combined Theoretical and Experimental Approach. Sci. Rep. 2015, 5, 10923–10933. [Google Scholar] [CrossRef]
  45. Mao, J.; Do, N.N.; Scholz, F.; Reggie, L.; Mehler, M.; Lakatos, A.; Ong, Y.S.; Ullrich, S.J.; Brown, L.J.; Brown, R.C.; et al. Structural basis of the green-blue color switching in proteorhodopsin as determined by NMR spectroscopy. J. Am. Chem. Soc. 2014, 136, 17578–17590. [Google Scholar] [CrossRef]
  46. Burke, K.; Werschnik, J.; Gross, E.K. Time-dependent density functional theory: Past, present, and future. J. Chem. Phys. 2005, 123, 062206. [Google Scholar] [CrossRef] [Green Version]
  47. Chai, J.D.; Head-Gordon, M. Systematic optimization of long-range corrected hybrid density functionals. J. Chem. Phys. 2008, 128, 084106. [Google Scholar] [CrossRef]
  48. He, X.; Zhang, J.Z.H. The generalized molecular fractionation with conjugate caps/molecular mechanics method for direct calculation of protein energy. J. Chem. Phys. 2006, 124, 184703. [Google Scholar] [CrossRef]
  49. Wang, X.; He, X.; Zhang, J.Z. Predicting mutation-induced Stark shifts in the active site of a protein with a polarized force field. J. Phys. Chem. A 2013, 117, 6015–6023. [Google Scholar] [CrossRef]
  50. Wang, X.; Zhang, J.Z.; He, X. Quantum mechanical calculation of electric fields and vibrational Stark shifts at active site of human aldose reductase. J. Chem. Phys. 2015, 143, 184111. [Google Scholar] [CrossRef] [PubMed]
  51. Sandberg, D.J.; Rudnitskaya, A.N.; Gascon, J.A. QM/MM Prediction of the Stark Shift in the Active Site of a Protein. J. Chem. Theory Comput. 2012, 8, 2817–2823. [Google Scholar] [CrossRef] [PubMed]
  52. Duan, L.L.; Mei, Y.; Zhang, Q.G.; Zhang, J.Z. Intra-protein hydrogen bonding is dynamically stabilized by electronic polarization. J. Chem. Phys. 2009, 130, 115102. [Google Scholar] [CrossRef] [PubMed]
  53. Tong, Y.; Ji, C.G.; Mei, Y.; Zhang, J.Z.H. Simulation of NMR Data Reveals That Proteins’ Local Structures Are Stabilized by Electronic Polarization. J. Am. Chem. Soc. 2009, 131, 8636–8641. [Google Scholar] [CrossRef]
  54. Liu, J.; He, X.; Zhang, J.Z.H. Improving the scoring of protein-ligand binding affinity by including the effects of structural water and electronic polarization. J. Chem. Inf. Model. 2013, 53, 1306–1314. [Google Scholar] [CrossRef] [PubMed]
  55. Lu, T.; Chen, F. Multiwfn: A multifunctional wavefunction analyzer. J. Comput. Chem. 2012, 33, 580–592. [Google Scholar] [CrossRef]
  56. William, H.; Dalke, A.; Schulten, K. VMD: Visual molecular dynamics. J. Mol. Graph. 1996, 14, 33–38. [Google Scholar] [CrossRef]
  57. Cornell, W.D.; Cieplak, P.; Bayly, C.I.; Kollman, P.A. Application of RESP Charges to Calculate Conformational Energies, Hydrogen Bond Energies, and Free Energies of Solvation. J. Am. Chem. Soc. 1993, 115, 9620–9631. [Google Scholar] [CrossRef]
  58. Bayly, C.I.; Cieplak, P.; Cornell, W.; Kollman, P.A. A Well-Behaved Electrostatic Potential Based Method Using Charge Restraints for Deriving Atomic Charges: The RESP Model. J. Phys. Chem. B 1993, 97, 10269–10280. [Google Scholar] [CrossRef]
  59. Ji, C.; Mei, Y.; Zhang, J.Z. Developing polarized protein-specific charges for protein dynamics: MD free energy calculation of pKa shifts for Asp26/Asp20 in thioredoxin. Biophys. J. 2008, 95, 1080–1088. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  60. van Gunsteren, W.F.; Huenenberger, P.H.; Mark, A.E.; Smith, P.E.; Tironi, I.G. Computer simulation of protein motion. Comput. Phys. Commun. 1995, 91, 305–319. [Google Scholar] [CrossRef]
  61. Georg, H.C.; Coutinho, K.; Canuto, S. Solvent effects on the UV-visible absorption spectrum of benzophenone in water: A combined Monte Carlo quantum mechanics study including solute polarization. J. Chem. Phys. 2000, 113, 9132–9139. [Google Scholar] [CrossRef]
  62. Ali, A.; Le, T.T.B.; Striolo, A.; Cole, D.R. Salt Effects on the Structure and Dynamics of Interfacial Water on Calcite Probed by Equilibrium Molecular Dynamics Simulations. J. Phys. Chem. C 2020, 124, 24822–24836. [Google Scholar] [CrossRef]
  63. Manzoni, V.; Lyra, M.L.; Gester, R.M.; Coutinho, K.; Canuto, S. Study of the optical and magnetic properties of pyrimidine in water combining PCM and QM/MM methodologies. Phys. Chem. Chem. Phys. 2010, 12, 14023–14033. [Google Scholar] [CrossRef] [PubMed]
  64. Glover, W.J.; Larsen, R.E.; Schwartz, B.J. Simulating the formation of sodium:electron tight-contact pairs: Watching the solvation of atoms in liquids one molecule at a time. J. Phys. Chem. A 2011, 115, 5887–5894. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Graphical representation of fragments, concaps, and 2B QM interaction. The background α-helix cartoon of the PR protein with 80% transparency is described by background charges, whereas the sticks of the corresponding residues represent the QM region in the EE-GMFCC calculation. The green sticks denote the chromophore LYR231. The structures all come from the same configuration of the PR protein.
Figure 1. Graphical representation of fragments, concaps, and 2B QM interaction. The background α-helix cartoon of the PR protein with 80% transparency is described by background charges, whereas the sticks of the corresponding residues represent the QM region in the EE-GMFCC calculation. The green sticks denote the chromophore LYR231. The structures all come from the same configuration of the PR protein.
Molecules 26 04486 g001
Figure 2. (a) The top view of PR. (b) The side view of PR. In (a,b), the green moiety denotes the chromophore, and the blue moiety is residue 105. (c) The structure of LYR and all hydrogens were removed for simplicity. All atom names are marked for clarity.
Figure 2. (a) The top view of PR. (b) The side view of PR. In (a,b), the green moiety denotes the chromophore, and the blue moiety is residue 105. (c) The structure of LYR and all hydrogens were removed for simplicity. All atom names are marked for clarity.
Molecules 26 04486 g002
Figure 3. The work flow of theoretical calculations for PR105Q and its nine mutations in this study. Residue 105 was mutated to nine other residues.
Figure 3. The work flow of theoretical calculations for PR105Q and its nine mutations in this study. Residue 105 was mutated to nine other residues.
Molecules 26 04486 g003
Figure 4. The correlation between the experimental absorption wavelengths of different mutations of PR105Q and calculated results using EE-GMFCC. The 1B and 1B + 2B results are both provided. (a) 1B: only the one-body QM interactions are included. (b) 1B + 2B: both the one-body and two-body QM interactions are included (see Equation (4)). 1B and 2B represent the excitation energy contributions from the covalently-linked three neighboring residues, and the two non-covalently-linked residues that are spatially in close contact, respectively.
Figure 4. The correlation between the experimental absorption wavelengths of different mutations of PR105Q and calculated results using EE-GMFCC. The 1B and 1B + 2B results are both provided. (a) 1B: only the one-body QM interactions are included. (b) 1B + 2B: both the one-body and two-body QM interactions are included (see Equation (4)). 1B and 2B represent the excitation energy contributions from the covalently-linked three neighboring residues, and the two non-covalently-linked residues that are spatially in close contact, respectively.
Molecules 26 04486 g004
Figure 5. Distribution of calculated absorption wavelengths for one hundred conformations of PR105D extracted from the last ps of QM/MM MD trajectory. The average value of calculated results is 512 nm, which is in excellent agreement with the experimental value of 510 nm.
Figure 5. Distribution of calculated absorption wavelengths for one hundred conformations of PR105D extracted from the last ps of QM/MM MD trajectory. The average value of calculated results is 512 nm, which is in excellent agreement with the experimental value of 510 nm.
Molecules 26 04486 g005
Figure 6. Excitation energy contributions of 2B QM interactions involving residues close to chromophore are presented by the colors blue and red, where the color gradient in units of nm is scaled based on the 2B QM contribution, and the distance threshold λ for 2B correction was set to 4 Å. Residues with positive 2B QM correction have a blue shift of the absorption spectrum and are colored blue, whereas residues with negative 2B QM correction have a red shift and are colored red.
Figure 6. Excitation energy contributions of 2B QM interactions involving residues close to chromophore are presented by the colors blue and red, where the color gradient in units of nm is scaled based on the 2B QM contribution, and the distance threshold λ for 2B correction was set to 4 Å. Residues with positive 2B QM correction have a blue shift of the absorption spectrum and are colored blue, whereas residues with negative 2B QM correction have a red shift and are colored red.
Molecules 26 04486 g006
Figure 7. Correlation between the average electric field of 100 snapshots taken from QM/MM MD simulation trajectory for PR105Q and nine mutants, and the calculated 1B QM excitation energy by EE-GMFCC. (a) The average electric field calculated by the Amber ff14SB charge model. (b) The average electric field calculated by the PPC charge model.
Figure 7. Correlation between the average electric field of 100 snapshots taken from QM/MM MD simulation trajectory for PR105Q and nine mutants, and the calculated 1B QM excitation energy by EE-GMFCC. (a) The average electric field calculated by the Amber ff14SB charge model. (b) The average electric field calculated by the PPC charge model.
Molecules 26 04486 g007
Figure 8. Representative snapshot of PR105D from QM/MM MD simulation and calculated LUMO (a) and HOMO (b) orbitals at the TD-B3LYP/6-31G* level where LYR231 was subjected to quantum mechanical treatment with background charges of the remaining part of PR105D. Charge transfer process of electron excitation from HOMO to LUMO [55,56].
Figure 8. Representative snapshot of PR105D from QM/MM MD simulation and calculated LUMO (a) and HOMO (b) orbitals at the TD-B3LYP/6-31G* level where LYR231 was subjected to quantum mechanical treatment with background charges of the remaining part of PR105D. Charge transfer process of electron excitation from HOMO to LUMO [55,56].
Molecules 26 04486 g008
Figure 9. The calculated average electric fields and excitation energies of EE-GMFCC-1B for 100 snapshots extracted from QM/MM MD simulations of four representative mutations, namely, PR105D, PR105L, PR105V, and PR105K.
Figure 9. The calculated average electric fields and excitation energies of EE-GMFCC-1B for 100 snapshots extracted from QM/MM MD simulations of four representative mutations, namely, PR105D, PR105L, PR105V, and PR105K.
Molecules 26 04486 g009
Table 1. Calculated excitation energy using EE-GMFCC at the TD-B3LYP/6-31G* level for different mutations of PR. “Dev.” stands for the deviation from the experimental value. 1B and 2B denote the excitation energy calculated by 1B correction, and 1B + 2B corrections, respectively. 1B and 2B represent the excitation energy contributions from the covalently-linked three neighboring residues, and the two non-covalently-linked residues that are spatially in close contact, respectively. MUE denotes the mean unsigned error.
Table 1. Calculated excitation energy using EE-GMFCC at the TD-B3LYP/6-31G* level for different mutations of PR. “Dev.” stands for the deviation from the experimental value. 1B and 2B denote the excitation energy calculated by 1B correction, and 1B + 2B corrections, respectively. 1B and 2B represent the excitation energy contributions from the covalently-linked three neighboring residues, and the two non-covalently-linked residues that are spatially in close contact, respectively. MUE denotes the mean unsigned error.
MutationExp (nm) a1B (nm)2B (nm)Dev. (1B)Dev. (2B)
Q105D510495512−152
Q105W527504521−23−6
Q105C512507511−5−1
Q105L51651852226
PR105Q49349549623
Q105M524512528−124
Q105Y524519525−51
Q105V523520525−32
Q105I517512518−51
Q105K52953753889
Average −5.62.1
MUE 8.03.5
a Absorption wavelength of experimental value in high pH solution where ASP97 is deprotonated. [12].
Table 2. Excitation energy decomposition per residue based on an ensemble average over 100 conformations. The residues are spatially in close contact with the chromophore and their QM contributions are calculated by GMFCC at the TD-B3LYP/6-31G* level. “Ex” represents the excitation energy in eV. ΔEx represents the excitation energy difference between 2B (residue + LYR231) and LYR231. ΔWL represents the wavelength difference between 2B (residue + LYR231) and LYR231 in nm.
Table 2. Excitation energy decomposition per residue based on an ensemble average over 100 conformations. The residues are spatially in close contact with the chromophore and their QM contributions are calculated by GMFCC at the TD-B3LYP/6-31G* level. “Ex” represents the excitation energy in eV. ΔEx represents the excitation energy difference between 2B (residue + LYR231) and LYR231. ΔWL represents the wavelength difference between 2B (residue + LYR231) and LYR231 in nm.
Res. NameEx (eV)ΔEx (meV)ΔWL (nm)
LYR231 a2.23480.00.0
LEU402.2329−1.90.5
VAL682.23924.4−1.1
THR692.2322−2.60.7
ALA722.2317−3.10.8
TYR952.2326−2.20.5
ASP972.5027267.9−59.5
TRP982.2182−16.64.2
THR1012.2258−9.02.2
VAL1022.2265−8.32.1
ASP1052.277943.1−10.5
MET1342.2084−26.36.6
LEU1352.2155−19.34.8
GLY1382.23954.7−1.2
ALA1512.2183−16.44.1
PHE1522.2119−22.95.7
GLY1552.2345−0.20.1
CYS1562.2202−14.63.7
TRP1592.2239−10.92.7
TRP1972.2344−0.30.1
TYR2002.255120.3−5.0
PRO2012.2289−5.81.5
TYR2042.2308−4.01.0
TYR2232.2343−0.50.1
ASP2272.4446209.8−47.7
PHE2282.23641.6−0.4
PHE2342.2344−0.40.1
GLY2352.2341−0.70.2
a With only LYR231 included in the QM region. In other cases, each residue and LYR231 are in the QM region.
Table 3. With the Amber and PPC charge models, the calculated average electric field (in MV/cm) between C10 and C1 (see Figure 2) along the polyene chain of chromophore based on the ensemble average over 100 snapshots from QM/MM MD simulation. Experimental values (eV) of excitation energies are given for comparison. 2B represents the two-body corrected QM excitation energy, and 1B represents the one-body QM excitation energy calculated by EE-GMFCC.
Table 3. With the Amber and PPC charge models, the calculated average electric field (in MV/cm) between C10 and C1 (see Figure 2) along the polyene chain of chromophore based on the ensemble average over 100 snapshots from QM/MM MD simulation. Experimental values (eV) of excitation energies are given for comparison. 2B represents the two-body corrected QM excitation energy, and 1B represents the one-body QM excitation energy calculated by EE-GMFCC.
MutationsExp. (eV)2B (eV)1B (eV)Ave. Field a
(MV/cm)
Ave. Field b
(MV/cm)
Distance (C10-C1)
Q105D2.4362.4282.50922.725.99.849
Q105W2.3572.3822.46022.121.19.809
Q105C2.4272.4312.44520.218.39.799
Q105L2.4082.3782.39619.820.09.826
PR105Q2.5202.5022.50919.621.69.826
Q105M2.3712.3532.42219.221.19.824
Q105Y2.3712.3652.39117.518.09.830
Q105V2.3752.3652.39017.016.39.850
Q105I2.4032.3982.42716.716.79.855
Q105K2.3492.3112.31513.216.39.857
a Average electric field calculated by the Amber ff14SB charge model. b Average electric field calculated by the PPC charge model.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Shen, C.; Jin, X.; Glover, W.J.; He, X. Accurate Prediction of Absorption Spectral Shifts of Proteorhodopsin Using a Fragment-Based Quantum Mechanical Method. Molecules 2021, 26, 4486. https://doi.org/10.3390/molecules26154486

AMA Style

Shen C, Jin X, Glover WJ, He X. Accurate Prediction of Absorption Spectral Shifts of Proteorhodopsin Using a Fragment-Based Quantum Mechanical Method. Molecules. 2021; 26(15):4486. https://doi.org/10.3390/molecules26154486

Chicago/Turabian Style

Shen, Chenfei, Xinsheng Jin, William J. Glover, and Xiao He. 2021. "Accurate Prediction of Absorption Spectral Shifts of Proteorhodopsin Using a Fragment-Based Quantum Mechanical Method" Molecules 26, no. 15: 4486. https://doi.org/10.3390/molecules26154486

APA Style

Shen, C., Jin, X., Glover, W. J., & He, X. (2021). Accurate Prediction of Absorption Spectral Shifts of Proteorhodopsin Using a Fragment-Based Quantum Mechanical Method. Molecules, 26(15), 4486. https://doi.org/10.3390/molecules26154486

Article Metrics

Back to TopTop