2.1. Construction of Manβ(1,4)GlcNAcβ(1,4)GlcNAc
Mannose diacetylglucose trisaccharide is formed by a trisaccharide chain of β-1,4-linked one mannose and two acetylaminose. The structural formula of Manβ(1,4)GlcNAcβ(1,4)GlcNAc is shown in
Figure 1. The notations of the molecular groups mainly depend on the structural formula. As shown in
Figure 1, three sugar rings are labeled as M, G′, and G. Corresponding molecular groups on each sugar ring are labeled. For example, OH6
M is the hydroxyl group at position C6 on ring M. OH6′, and OH6 refers to the hydroxyl groups at position C6′ and C6 on rings G′ and G, respectively. As shown in
Figure 1, we connect the three oxygen atoms on the sugar rings (OM, OG′, and OG) as a split line. OM is labeled as the oxygen atom located on ring M of mannose. OG′ and OG are labeled as the oxygen atoms located on rings G′ and G of disaccharides with β-1,4-linked N-acetylglucosamine, respectively. The structure above the split line is defined as the upper part, and that below refers to the lower part. When hydrogen bonds (H-bonds) are properly considered in the building procedure, it is possible to construct all low-energy conformers easily and reasonably. Meanwhile, the combination of H-bonds in the upper and lower parts can effectively avoid conformer duplication. In
Figure 2, we establish a tree diagram of Manβ(1,4)GlcNAcβ(1,4)GlcNAc. Region SI is classified according to
cis and
trans glycosidic bonds.
Cis and
trans conformations are defined as follows. For the glycosidic linkage, we only consider four cases:
cis glycosidic linkage conformations for Manβ(1,4)GlcNAc with syn/syn (Φ, Ψ) ≈ −80°, 90° and
trans glycosidic linkage conformations for Manβ(1,4)GlcNAc with anti/syn (Φ, Ψ) ≈ 50°, 120°;
cis1 glycosidic linkage conformations for GlcNAcβ(1,4)GlcNAc with syn/syn (Φ
1, Ψ
1) ≈ −80°, 90° and
trans1 glycosidic linkage conformations for GlcNAcβ(1,4)GlcNAc with anti/syn (Φ
1, Ψ
1) ≈ 50°, 120°. For example, a
trans-
trans1 conformation of Manβ(1,4)GlcNAcβ(1,4)GlcNAc is shown in
Figure 1. Taking all four dihedral angles into account, four different forms are obtained for Manβ(1,4)GlcNAcβ(1,4)GlcNAc, already illustrating the structural variety of oligosaccharides. Region SII is constructed according to the structure of the H-bond. According to the H-bond types of each group of region SII, region SIII is combined by the upper and lower parts.
Next, we will illustrate the classification of each group configuration and the construction of the intra-group configuration in detail. Here, we build the basic skeleton of trisaccharides according to the position of glycosidic bonds and then construct the reasonable configurations of trisaccharides according to the H-bonds that could be formed. Among them, the skeleton of groups A and B is composed of two
trans-glycosidic bonds. The upper and lower H-bonds of conformer A1 show cooperative inter-ring H-bonds clockwise and counterclockwise, respectively. The upper H-bond of A1 (see
Figure 3 below) is OH6
M→OH6′→OH6→OG, and the lower H-bond is OH4
M→OH3
M→OH2
M→OH3′→NHCO′→OH3→NHCO→O1. Here, an arrow indicates the proton donor of the hydrogen atom directed toward the acceptor of the oxygen atom. While the upper H-bond of conformer A2 is OH6→OH6′→OH6
M, which is the counterclockwise cooperative inter-ring H-bond in the upper part. The lower H-bond of conformer A2 is consistent with conformer A1. Conformer A3 is distinguished from conformer A1 by the clockwise and counterclockwise of the lower cooperative inter-ring H-bonds of OH3′→OH2
M→OH3
M→OH4
M and NHCO′→OH3→NHCO→O1. According to the building rules, there will be a conformer in group A with the upper H-bond of OH6→OH6′→OH6
M and the lower H-bond of OH3′→OH2
M→OH3
M→OH4
M. However, the conformer will lead to high energy because OH6
M and OH4
M point to each other. Therefore, the conformer is not considered in group A. The difference between the configurations of groups B and A lies in the upper H-bonds. The upper part of group B does not have clockwise and counterclockwise cooperative inter-ring H-bonds. The upper H-bonds of conformer B1 (see
Figure 3 below) are OH6→OG′, OH6′→OM and OH6
M→OH4
M, and the lower H-bond is OH4
M→OH3
M→OH2
M→OH3′→NHCO′→OH3→NHCO→O1. Unfortunately, the clockwise cooperative inter-ring H-bonds of the lower part will lead to OH6
M and OH4
M pointing to each other. Therefore, there is only one conformer in group B. For Manβ(1,4)GlcNAcβ(1,4)GlcNAc of
trans-trans1 glycosidic linkage, the conformers are classified according to two groups containing four conformers. Then, we will build the representative structures with
trans-cis1 glycosidic linkage.
The
trans-cis1 glycosidic linkage of Manβ(1,4)GlcNAcβ(1,4)GlcNAc refers to the
trans glycosidic linkage in Manβ(1,4)GlcNAc and the
cis glycosidic linkage in GlcNAcβ(1,4)GlcNAc. As shown in
Figure 4, we construct eight groups with the difference in the upper H-bond. For group C, the upper H-bonds are OH3→OH6′ and OH6
M→OH4
M, and the lower H-bond is OH4
M→OH3
M→OH2
M→OH3′→NHCO′→OH6→OG. Because the lower clockwise cooperative inter-ring H-bond conflicts with the orientation of OH6
M, there is only one conformer in group C. The upper H-bonds of conformer D1 are OH6′→OH3→OG′ and OH6
M→OH4
M. The upper H-bonds of conformer E1 are OH3→OH6′→NHCO→O1 and OH6
M→OH4
M. The upper H-bonds of conformer F1 are OH3→OG′, OH6′→OM and OH6
M→OH4
M. Their lower H-bonds are the same as conformer C1. Therefore, there is only one conformer in groups D, E, and F because the lower clockwise cooperative inter-ring H-bond conflicts with the orientation of OH6
M.
As shown in
Figure 4, the H-bond of NHCO′→OH6 in the lower part of conformer E1 is indicated by a green dotted line. After the structural optimization, the H-bond disappears due to the competition between the upper and lower H-bonds. The upper H-bonds of conformer G1 in group G are OH3→OH6′ and OH6
M→OM. The counterclockwise cooperative H-bond of OH4
M→OH3
M→OH2
M→OH3′→NHCO′→OH6→OG is built in the lower part. Correspondingly, the clockwise cooperative H-bond of conformer G2 forms OH3′→OH2
M→OH3
M→OH4
M. The other H-bonds are the same as conformer G1. Therefore, Group G has two basic configurations. The upper H-bonds of conformer H1 are OH6′→OH3→OG′ and OH6
M→OM. The upper H-bonds of conformer I1 are OH3→OH6′→NHCO→O1 and OH6
M→OM. The upper H-bonds of conformer J1 are OH3→OG′ and OH6
M→OH6′→OG′. For conformers H1, I1, and J1, their lower H-bond is the same as G1 with counterclockwise cooperative H-bond. Conformers H2, I2, and J2 have a clockwise cooperative H-bond, which is the same as conformer G2. Therefore, each group H, I, and J, has two configurations. The initial configurations of
trans-cis1 glycosidic linkage are divided into eight groups with a total of 12 conformers.
Figure 5 shows the representative configurations of
trans-trans glycosidic linkage for Manβ(1,4)GlcNAcβ(1,4)GlcNAc. The structure difference for each group is the upper H-bond, and the difference for configurations in the same group relies on the lower H-bond. The upper H-bond of each configuration in the K group is OH6
M→OH3′→NHCO′→OH6→OG, and the difference in the configuration in the group is that the lower H-bonds of OH4
M→OH3
M→OH2
M→O′ and OH6′→OH3→NHCO→O1. Because the lower H-bonds of each configuration in the group have counterclockwise and clockwise cooperative H-bonds, there are eight configurations in the group K. In the same way, the upper H-bonds in groups L, M, and N are unchanged. The lower H-bonds have clockwise and counterclockwise cooperative H-bond directions. All three groups have eight configurations. All upper H-bonds of configurations are the same in-group O. Because the lower H-bonds of the M sugar ring in a clockwise direction conflict with the orientation of OH6
M, there is only one counterclockwise cooperative H-bond direction. Therefore, group O contains four conformers.
The initial structures of the
cis-trans1 glycosidic linkage for Manβ(1,4)GlcNAcβ(1,4)GlcNAc are shown in
Figure 6. The difference in various groups here is the upper H-bonds. The upper H-bond in group P is OH6
M→OH3′→NHCO′→OH3→NHCO→O1. The upper H-bonds in group Q are OH3→OH6
M and NHCO′→OH3→NHCO→O1. The upper H-bonds in group R are OH6
M→OH3′→OM and NHCO′→OH3→NHCO→O1. The upper H-bonds in group S are OH3′→OH6
M→ NHCO′ and OH3→O. The upper H-bonds in group T are OH6
M→OH4
M, OH3′→OM, and NHCO′→OH3→NHCO→O1. The lower H-bonds of each group are OH4
M→OH3
M→OH2
M→O′ and OH6′→OH6→OG. Due to the clockwise cooperative H-bonds of the M sugar ring in groups P, Q, R, and S, there are two conformers in each group. Since the clockwise cooperative H-bond in group T conflicts with the direction of OH6
M, there is only one configuration in this group.
2.2. Construction of Manα(1,3)Manα(1,6)Man
The structural formula of Manα(1,3)Manα(1,6)Man is shown in
Figure S1 in Supplementary Materials. The α-
D-mannose structure contains four hydroxyl groups and one hydroxymethyl group. The hydroxyl groups at positions C1 and C2 are axial, which are perpendicular to the equatorial plane. The hydroxyl groups at positions C3 and C4 are equatorial. The orientation of the hydrogen atoms in these four hydroxyl groups could be any position so that the trimannose will have more spatial preference. The notations of the molecular groups mainly depend on the structural formulas. As shown in
Figure S1, three sugar rings are labeled as M, M′ and M″. Corresponding molecular groups on each sugar ring are labeled. For example, OH6″ is the hydroxyl group at position C6 on ring M″.
Similar to the building process of Manβ(1,4)GlcNAcβ(1,4)GlcNAc, we still use the definition of
cis and
trans-glycosidic linkage to distinguish the glycosidic bonds between M and M″ rings of Manα(1,3)Manα(1,6)Man. The building tree is shown in
Figure S2 in the Supplementary Materials, first determining the type of glycosidic linkage, then determining the type of H-bond, and finally selecting the appropriate cooperative H-bond direction.
Next, we will discuss the classification and the construction of configuration in each group in detail. As shown in
Figure S3 in the Supplementary Materials, the skeleton of configuration in group A~D is composed of
cis-cis glycosidic linkage, which is distinguished by the cooperative H-bonds. The cooperative H-bonds formed in conformer A1 are OH6″→OH4′→OH3′→OH2′, OH2″→OH3″→OH4″→OH6′→OH2→O and OH4→O′. Among them, the cooperative H-bonds of OH6″→OH4′→OH3′→OH2′ and OH2″→OH3″→OH4″→OH6′→OH2→O have clockwise and counterclockwise H-bond directions. Therefore, Group A has four representative structures. The difference between configurations in groups A and B is the cooperative H-bond of OH2′→OH3′→OH4′→OH6″→OM″. Referred to the H-bond of OH2′→OH3′→OH4′→OH6″→OM″, the counterclockwise cooperative H-bond is OH6″→OH4′→OH3′→OH2′, which will be consistent with group A. Therefore, there are only two conformers in group B. The inter-ring H-bonds in groups C and D are OH4″→OH4′ and OH3″→OH6′, which are different from the inter-ring H-bonds of OH4′→OH6″ and OH4″→OH6′ in groups A and B. The H-bonds of OH4″→OH4′→OH3′→OH2′ and OH2″→OH3″→OH6′→OH2→OM are clockwise and counterclockwise, which results in four conformers in group C. The only difference between groups D and C is the direction of the hydroxymethyl group on the M″ ring. Therefore, group D also has four configurations. Here, OH4′→OH4″ torsional flexibility of H-bonds between rings in groups C and D is not as flexible as OH4′→OH6″ containing H-bonds of hydroxymethyl group in groups A and B, which will make it difficult to achieve the equilibrium state for structural optimization and cause H-bond breakage.
We build the structures of
cis-trans glycosidic linkage for Manα(1,3)Manα(1,6)Man, which are shown in
Figure S4 in the Supplementary Materials. While keeping the glycosidic bond in
cis formed by the M′ ring and the M ring unchanged, the glycosidic bond formed between M and M″ is rotated so that the glycosidic bond becomes a
trans structure, and then the cooperative H-bond configuration is built. In group E, H-bonds of OH4″→OH3″→OH2″→OH6′→OH2→OM and OH4′→OH3′→OH2′ contain clockwise and counterclockwise H-bond directions. Group E has two configurations. Groups E and F are distinguished by hydroxymethyl orientation on the M″ ring. Therefore, group F also has two configurations.
The representative configurations of
trans-cis glycosidic linkage for Manα(1,3)Manα(1,6)Man are shown in
Figure S5 in the Supplementary Materials. In group G, the intra-ring H-bond of OH4′→OH3′→OH2′→OM′ on the M′ mannose ring and OH4″→OH3″→OH2″→OM″ on the M″ mannose ring, respectively. The inter-ring H-bond of OH4→OH6′→OH6″ can be formed by the rotation of the hydroxymethyl group. The difference between groups G, H, and I is the H-bonds direction that hydroxymethyl forms. The H-bond in group H is OH6′→OH6″→OM″, and the H-bond in group I is OH6′→OH4″→OH6″. Moreover, none of the H-bonds have clockwise and counterclockwise cooperative H-bond directions. Therefore, there is only one configuration in each group. On the basis of group G, we built the configurations of group J, which are conformers built by adjusting the glycosidic bond between M and M″ mannose rings to form more stable inter-ring H-bonds. The novel inter-ring H-bond of OH4″→OM′ of group J makes the structure between M′ and M″ sugar rings stronger. Considering that the H-bonds of OH4′→OH3′→OH2′→OM′ and OH4″→OM′ conflict in the direction and affect the stability of the inter-ring H-bond, the construction of conformer J2 changes the H-bond of OH4′→OH3′→OH2′→OM′ to OH2′→OH3′→OH4′. Thus, there are two conformers in group J. On the basis of the conformers in group J, group K is constructed by selecting the hydroxymethyl group on the M″ ring to form the H-bond of OH6′→OH6″→OM″. Similar to conformer J2, the H-bond direction of OH4′→OH3′→OH2′→OM′ changes to OH2′→OH3′→OH4′ forms conformer K2.
The configurations of the
trans-trans glycosidic linkage for Manα(1,3) Manα(1,6)Man are constructed in
Figure S6 in the Supplementary Materials. While keeping the glycosidic bond in
trans formed by the M and the M′ ring unchanged, the glycosidic bond formed between M and M″ is rotated so that the glycosidic bond becomes a
trans structure. Then, the inter-ring H-bond of OH2→OH6″ between M and M″ sugar rings is built in group L. Moreover, the hydroxymethyl group on the M″ ring can form another H-bond of OH6″→OH4″ in group L. In group M, the hydroxymethyl group on the M″ ring rotates to form the H-bond of OH6″→OM″. The H-bonds in each group especially do not have clockwise and counterclockwise cooperative H-bond directions. There is only one configuration in the group L and M.
2.4. Analysis of Spectral Characteristics for Manβ(1,4)GlcNAcβ(1,4)GlcNAc and Manα(1,3) Manα(1,6)Man
Figure 7 shows the low-energy structures of Manβ(1,4)GlcNAcβ(1,4)GlcNAc within a relative energy of 15 kJ/mol, as well as the calculated IR spectra and characteristic peaks. It is worth noting that the Gibbs free energies at high temperatures possibly provide much better energetic estimates than 0 K [
39]. Since energies of all the conformers are calculated at the temperature of zero, a discussion requires considering the Gibbs free energies using the standard thermochemistry model (T = 298.15 K) to evaluate the conformational stability at the typical room temperature. As you can see from
Figure 7, the values of the Gibbs free energies are different from those at 0 K. Conformers F1 and R1 shown in
Figure 7 are basically isoenergetic at 0 K, but there is a small difference in the Gibbs free energy of 4.1 kJ/mol. The reason for the discrepancy mainly relies on both enthalpic and entropic effects between conformations whose H-bonding contents are different. Fortunately, the energetic order at 298.15 K is unchanged compared to 0 K.
A detailed analysis of the IR spectra and comparisons with the experimental IRID spectrum and predictions of DFT calculations shown in the higher panel of
Figure 7 are performed. The characteristic spectrum in our work is well consistent with the experimental value. In
Figure 7, the experimental IR spectra of Manβ(1,4)GlcNAcβ(1,4)GlcNAc are mainly concentrated at ~3350 to ~3600 cm
−1 with two strong peaks at ~3350 cm
−1 and ~3450 cm
−1. The majority of weak peaks crowd at higher wavenumbers, indicating the contribution from strongly and weakly hydrogen-bonded OH groups. In the experimental spectrum, the strong band at ~3450 cm
−1 is related to the tensile vibration mode labeled σ3′ of the lowest-energy conformation in
Figure 7. There are overlapping bands between ~3430 cm
−1 and ~3520 cm
−1, separated from the strong peaks at ~3350 cm
−1, about 80 cm
−1. The other strong, broad band at ~3350 cm
−1 is accounted for by the σ3 vibration of conformer R1 with the Gibbs free energy of 10.7 kJ/mol. The assignments of the vibrations indicated the fact that high-temperature energetics are required in particular comparison to experimental data. These bands reflect the change of H-bonds in the isomer structures of Manβ(1,4)GlcNAcβ(1,4)GlcNAc to some extent. Compared with the lowest-energy conformer, the dihedral angles related to the glycosidic bond and the cooperative H-bonds in other conformers make the strength of H-bonds different, resulting in the corresponding red and blue shifts. In addition, the proximity of energy values and spectral congestion in the IR spectra can lead to difficulties in conformational assignment.
In Ref. [
39], the initial structures were obtained by full-space conformational search, and then the low-energy configurations were selected for the optimization calculation using the M06-2X method. Similar to the lowest-energy conformer in our calculation, the two lowest-energy conformers are
trans-trans1 structures. In order to assess the reasons for the discrepancy, optimized structures’ relative and free energies are calculated using DFT (M06-2X/6-31+G*). We build the starting structures of two lowest-energy conformers in Ref. [
39], which can be found in
Figure S7 in the Supplementary Materials. The correction factors of the frequency calculation are 0.9734 (OH stretch modes) and 0.9600 (NH stretch modes). The results are shown in Supporting Information. The calculations suggest that the lowest energy configuration we built is still the minimum at 298.15 K. When the molecular structure is complex, the number of full-space conformational searches will largely affect that of initial configurations, even missing important molecular configurations. When the number of full-space conformational searches is increased, a large number of initial configurations are obtained. It is possible to ignore some low-energy configurations in the calculation process by selecting some representative initial configurations. Clearly, the configurations obtained using the TSTB sampling can comprehensively and efficiently determine the low-energy configuration of complex molecules.
Among the low-energy configurations of Manα(1,3)Manα(1,6)Man, there are three low-energy configurations below 10 kJ/mol obtained by full-space conformational search and quantum chemical calculations in Ref. [
37], while there are seven configurations with energies less than 10 kJ/mol established using the TSTB sampling. Since the energies of all the conformers in this paper are calculated at the temperature of zero, we also calculate the Gibbs free energies using the standard thermochemistry model (T = 298.15 K). Considering a 300 K stability, the energetic ordering can be strongly changed compared to 0 K. For example, conformer A1 is the lowest energy conformer at 0 K, but it is the second lowest one at 298.15 K. Conformer G1, which has a high relative energy of 20.1 kJ/mol at 0 K, turns to be the lowest conformer in Gibbs free energy. Conformer J2, the second lowest conformer in energy, ranks 5th lowest according to Gibbs free energy. In other words, at high temperatures, these conformers become a bit unstable. It is consistent with gas phase supersonic expansion experiments [
40]. Due to the high
cis-trans isomerization barrier being quite early inhibited along the expansion, it makes the observation of intrinsically unstable (at 0 K) forms provided possible because of entropic effects.
The configurations within 6 kJ/mol and two conformers with relatively high energies are shown in
Figure 8. As shown in
Figure 8, two of the computed spectra are in remarkably good agreement with the experimental spectrum, which presents two intense, strongly displaced, and broadened bands at low wavenumbers, located at ~3400 cm
−1 and ~3450 cm
−1, together with a cluster of bands lying at higher wavenumbers. Although their associated structures do not correspond to the lowest relative energy, one of them possesses the minimum free energy. The conformation with the lowest Gibbs free energy is 20.1 kJ/mol higher in relative energy. Its calculated vibrational spectrum does reproduce the two strong bands located at ~3450 cm
−1. The conformation with Gibbs free energy of 5.36 kJ/mol lies in 22.4 kJ/mol being higher in relative energy. The calculated vibrational spectrum located at ~3400 cm
−1 does reproduce the experimental band at a lower wavenumber. Moreover, the lowest free energy configuration can well match the experimental spectrum, which indicates that the conformer dominates the structure of mannotriose. Other configurations reported in Ref. [
37] are also predicted in our results. For other configurations, there are five low-energy configurations that are not reported in the literature.
2.5. Construction of Core Pentasaccharide
Considering our previous analysis of disaccharides and trisaccharides combined with the spatial structure of monosaccharides, we have a comprehensive understanding of the core pentasaccharide configuration.
Figure 9 displays a schematic diagram of the molecular structure of core pentasaccharide. In order to build core pentasaccharide in detail, we use three ways to build core pentasaccharide. In the first way, the structure block of trisaccharide is used to build core pentasaccharide, which completely remains the cooperative inter-ring H-bonds in each block. However, duplicate monosaccharides will simultaneously exit H-bonds with both sides, which will make the building process difficult. In the second way, the core pentasaccharide is built by trisaccharides and disaccharides, which do not interfere with each other. The disadvantage is that H-bonds between disaccharides and trisaccharides may be ignored. In the third way, three monosaccharides and one disaccharide are used to build the core pentasaccharide. In this way, the formation of inter-ring H-bonds can be comprehensively considered. However, the construction process is complicated.
Firstly, we use the structure unit of trisaccharide to build the core pentasaccharide. For these two trisaccharides, the previous comprehensive analysis has been carried out. Here, we directly select the low-energy configurations within 10 kJ/mol to build core pentasaccharide. The main concern is the change of glycosidic bonds and inter-ring H-bonds. Changes in intra-ring H-bonds and the direction of cooperative H-bonds are no longer considered. Three and two lowest-energy configurations are chosen for Manα(1,3)Manα(1,6)Man and Manβ(1,4)GlcNAcβ(1,4)GlcNAc, respectively. Therefore, six pentasaccharide configurations are constructed after the combination of two trisaccharides. However, there are two problems with the repeated monosaccharide in the construction process. One is that OH2 on the M ring has two directions. When the monosaccharide belongs to Manα(1,3)Manα(1,6)Man, the H-bond formed by OH2 is OH2→OM. When the M ring belongs to Manβ(1,4)GlcNAcβ(1,4)GlcNAc, the H-bond formed by OH2 is OH2→OH3′. Considering the fullness of the initial structures in the building process, both conformers are constructed. However, the configuration with the H-bond of OH2→OM is repeated with the building configuration of the core pentasaccharide in the second way. The second issue is that OH6 on the M ring is the glycosidic bond of M and M″ on Manα(1,3)Manα(1,6)Man. Hydroxyl groups on the M and G′ rings form a H-bond of OH6→OH6′ in Manβ(1,4)GlcNAcβ(1,4)GlcNAc. Because OH6 on the M ring is formed by dehydration at the position of the glycosidic bond, the lack of hydrogen will no longer form the H-bond of OH6→OH6′. So only OH6 on the M ring belongs to Manα(1,3)Manα(1,6)Man is considered. In summary, six configurations of core pentasaccharide are actually constructed, which are shown in
Figure S8 in the
Supplementary Materials.
As shown in
Figure 9, we use the structural unit of trisaccharide and disaccharide to build core pentasaccharide. According to the previous results, Manα(1,3)Manα(1,6)Man and GlcNAcβ(1,4)GlcNAc have three and two low-energy representative configurations within 10 kJ/mol, respectively. The combination results in a total of six initial configurations of core pentasaccharide. Considering that the glycosidic bond at the junction of trisaccharides and disaccharides will not affect their configuration, the glycosidic linkage is not limited to forming a new glycosidic bond type by rotation. Therefore, there will be two types of
cis and
trans glycosidic linkages. Actually, we have also built 12 configurations of core pentasaccharide. Representative configurations of
cis glycosidic linkage for core pentasaccharide are shown in
Figure S9 in the Supplementary Materials.
As shown in
Figure 9, we have the structural unit of three monosaccharides and one disaccharide to build the core pentasaccharide. Disaccharide of GlcNAcβ(1,4)GlcNAc has two low-energy representative configurations. Considering that middle monosaccharides are linked to the sugar rings of upper and lower monosaccharides and disaccharides, the functional groups that can form H-bonds are severely limited. Four types are divided to describe the construction method for the core pentasaccharide. First of all, the upper monosaccharide can form H-bonds with disaccharides and middle monosaccharides. Secondly, the lower monosaccharides can form H-bonds with disaccharides and middle monosaccharides. A total of four groups are constructed. The specific structures are shown in
Figure S10 in the Supplementary Materials. The construction method considers H-bonds between three monosaccharides, which transforms into the building problem of a mannotriose structure. It is equivalent to the construction method of trisaccharide and disaccharide.
Since the pentasaccharide contains 14 hydroxy and 2 acetamido groups, structures that optimize global H-bonds of OH···O and NH···O are to be expected in the gas phase: not surprisingly, its IRID spectrum presents a broad red-shifted quasi-continuum, ranging from ~3100 to ~3700 cm
−1. This suggests that a highly congested set of overlapping bands is associated with a large number of both strongly and weakly hydrogen-bonded OH and NHCO groups. Spectral congestion in the IRID spectrum could imply that the IR spectra are no longer conformer-selective and could actually be composite spectra made of several contributions. The vibration characteristics of core pentasaccharides are illustrated in
Figure 10. The notations and labeling of the vibrational modes mainly depend on the structural formula. As shown in
Figure 9, five sugar rings are labeled as M, M′, M″, G′, and G. The vibrational modes related to molecular groups on each sugar ring are labeled. For example, Gσ3 comes from the vibration of the hydroxyl group at position C3 on ring G.
After the initial configurations of core pentasaccharide are built, the optimized configurations are in good agreement with the initially expected structures. By good fortune, this structure of core pentasaccharide also has the lowest calculated relative energy (at 0 K and free energy at 298.15 K). Conformer A5, which has a slightly high relative energy of 0.3 kJ/mol−1, continues to be the second lowest conformer in the Gibbs free energy of 3.9 kJ/mol−1. Although poorly resolved, its contour from the experiment is in qualitative correspondence with the IR spectrum associated with its minimum free energy structure. Moreover, spectral congestion in the IRID spectrum could imply that the IR spectra are no longer conformer-selective and could be actually composite spectra made of several contributions.
Although the absorption band will displace due to the influence of chemical structure and external conditions, the band information, such as the absorption peak position, band intensity, band shape, and the presence of related peaks, can still comprehensively reflect the presence or absence of various functional groups. The vibration of the hydroxyl group is between ~3670 and ~3230 cm−1, and the vibration of the hydroxyl group in our calculation is ~3610 to ~3194 cm−1. The reason for the difference indicates the influence of H-bonds. The complex H-bonding network is often observed in a large size of the molecule. Additionally, the proximity of energy values and spectral congestion in the IR spectra can lead to difficulties in conformational assignment. Last but not least, due to the large size of the molecule and the difficulty of cooling it down during the supersonic expansion, the comparison with the experiment could imply that the IR spectra recorded are no longer conformer-selective and could be actually composite spectra made of several contributions.