1. Introduction
Terahertz (THz) spectroscopy stands as an established instrument for investigating molecular dynamics, with a special focus on biomolecules [
1,
2,
3,
4,
5,
6]. The central part of the THz spectral region extends from 0.3 THz to 3 THz (wavenumbers 10–100 cm
) and corresponds loosely to photon energies between 1 and 12 meV [
1,
2], equal to the energy of large-scale molecular vibrations that are mediated by intermolecular bonds [
4]. Covalent, ionic, or metallic bonds and, in some situations, even hydrogen bonds (H-bonds), take on the role of shaping the intramolecular arrangement of a molecule’s constituent atoms. The intermolecular forces that hold biomolecules together in the crystal lattice are typically H-bonds or weaker forces such as van der Waals, dipole–dipole, or dispersion forces. The complicated interplay between intramolecular and intermolecular forces is responsible for low-frequency vibrations of molecular crystals and creates the shape of the THz absorption spectrum. At higher frequencies, the spectra are determined primarily by intramolecular forces [
2,
7].
The THz vibrational modes are delocalized over a large number of atoms within the molecular crystal’s unit cell and are impossible to simply assign to a particular stretch or bend mode, as is possible for infrared spectroscopy, in particular above 1500 cm
. This is not a disadvantage but represents one of the strengths of THz spectroscopy, which is able to provide information on the low-frequency collective motion of highly flexible organic molecules and distinguish, for example, among structural isomers [
2]. In order to identify and analyze the low-frequency vibrational modes, ab initio simulations based on density functional theory (DFT) calculations are extremely useful. They were utilized to gain insights into the THz spectra of biological molecules. Moreover, these methods find frequent applications in the prediction of vibrational spectra for individual molecules, dimers, and clusters, and their usage is also on the rise for solid-state systems [
4,
6,
7]. However, when dealing with solid-state systems, a significant challenge persists due to the intricate nature of performing periodic DFT modeling on extensive structures such as macromolecules and molecular crystals [
8,
9]. In addition, it was shown that THz spectroscopy allows us to certify molecular structure and intermolecular H-bonds interactions within compounds, thus, offering a foundational basis for exploring emerging pharmaceutical cocrystal fields at the micro-molecular level [
10].
The presence of H-bonds and weaker forces allows biomolecules to be stable and highly flexible in their natural environments [
11]. This plays a significant role in biochemical reactions at the basis of life. Saccharides are carbohydrates or sugars that are major products of the photosynthesis of green plants. They are an essential structural component of living cells and function as the main source of energy that fuels various biochemical processes needed to maintain the life of animals and plants. Additionally, they play a relevant role in industrial applications, and they are used in the production of paper, food sectors, pharmaceuticals, and regenerative medicine [
12].
Saccharides may be classified according to the number of monomeric units that comprise them. The compounds studied in this article are: (i) the monosaccharide glucose and galactose, both with C
H
O
molecular structure but with a difference in the direction of the hydroxyl group in the carbon number 4, C(4), of the carbon skeleton; (ii) the disaccharide lactose, made of galactose and glucose units, and cellobiose, made of two glucose units, both bound together by a glycosidic linkage and with the general formula C
H
O
; (iii) the polysaccharide cellulose, an unbranched chain of D-glucose units linked also in this case by glycosidic linkages with (C
H
O
)
formula (
100). Their chemical structures are reported in
Figure 1.
Saccharides have been investigated since the beginning of THz spectroscopy [
11,
13]. The crystallized form of
-lactose monohydrate (the nomenclature given in [
14] is adopted) has become a sort of “gold standard” for THz spectroscopy due to its wide availability, low cost, and the typical absorption spectrum with two strong and narrow peaks at 0.53 and 1.37 THz [
6,
15,
16,
17,
18,
19,
20]. Another lactose polymorph,
-lactose, was studied in the frequency range of approximately 0.2–2 THz in a mixture with the
-form to quantitatively evaluate the anomer content in a non-destructive way. [
17]. The influence of water content in the lactose molecular crystal THz spectra was also studied recently, reporting complex phase transformations as a function of temperature [
6].
Using temperature-dependent THz spectroscopy, the temperature-induced molecular decomposition of
-lactose monohydrate was studied [
21], and the reaction rates and Arrhenius activation energy of the thermoactivated solid-state reaction were determined [
22]. In a recent study, the temperature dependence of the vibrational spectra and the resonance parameters of
-lactose monohydrate and other saccharides, including galactose and glucose monohydrate, was investigated. Anharmonicity in the vibrational modes appears as coupling between resonances and the creation of wide asymmetric bands, revealing a temperature-dependent square relationship for resonance eigenfrequencies in saccharides [
23,
24].
Anhydrous and monohydrate glucose spectra were experimentally investigated in the frequency range 0.3–3.7 THz [
20] and compared to solid-state DFT simulations [
25]. A 6 cm
(0.18 THz) overestimation of the predicted vibrational frequencies was observed for anhydrous glucose and interpreted as vibrational anharmonicity effects. In another study, the THz spectral transformation of monohydrate glucose under dehydration was measured in the range 0.8–2.2 THz [
26].
The THz absorption spectrum and refractive index of galactose were measured in the frequency range of 0.2–3 THz. Simulations using the Gaussian 03 package were conducted on isolated galactose molecules, and the predicted vibrational frequencies partially matched the experimental ones. The authors suggested that the non-predicted resonances might be due to interactions between galactose molecules. In a subsequent study, the absorption spectrum of galactose was measured in the range of 0.5–4 THz. The structural analyses and normal mode calculations were performed by using solid-state and gas-phase DFT simulations. The results demonstrated good agreement between experimental spectra and simulations, with differences within 0.2 THz [
27].
The THz absorption spectrum of cellobiose was measured in the 0.5–3.8 THz interval [
28]. The cellulose THz spectrum was first investigated with the main goal to determine the crystallinity of the sample [
29]. It was later studied in more detail in single freely standing paper sheets made of cotton cellulose, mainly in the I
phase, in the 0.2–3.5 THz range [
30]. By eliminating the Fabry–Perot (FP) interference effects from the transmittance data, the complex refractive index was numerically extracted [
31]. The cellulose spectrum consists of two main peaks at approximately 2.1 and 3.1 THz superimposed to the background due to the amorphous phase. A more recent study was focused on distinguishing the THz spectra of cellulose I allomorph (I
and I
), which exhibits differences in its crystal structure [
32].
None of these studies focused on the measurement of the attenuation (or absorbing) coefficient (
) or the molar attenuation coefficient (
) of the compounds according to the Lambert–Beer law
, where
T is the transmittance,
c is the concentration, and
d the thickness of the sample. Furthermore, the THz refractive index of the saccharides was rarely measured, although this measurement is easy for systems based on THz time-domain spectroscopy (THz-TDS), which allows the detection of the amplitude and phase of the coherent THz radiation [
2,
33]. The coherence of the THz radiation poses a crucial challenge when extracting optical parameters from the transmission data in THz-TDS, leading to FP interference effects due to the multiple reflections within planar samples. Several authors in the past have proposed different numerical methods to eliminate FP effects in the spectral data [
31,
34,
35,
36,
37,
38]. However, in all the previous described studies on the THz spectra of saccharides, although flat and parallel-faced samples were used, no mathematical techniques for the elimination of FP effects were employed.
In this work, we used THz-TDS to measure the transmission spectra of the previously cited seven saccharides in their solid state from 0.3 to 3.3 THz. The samples were carefully prepared as pellets made of saccharides and polyethylene (PET) powders. We eliminated the FP effects and estimated the volume fraction of air within the pellets in order to obtain a quantitative estimation of the absorption coefficient (in cm) and the refractive index as a function of the frequency. The DFT simulations were carried out by using a state-of-the-art method for the calculation of H-bonding interactions, which drives the collective motion of organic molecules. Simulation results provided the frequency of the vibrational modes and allowed us to characterize them in terms of intermolecular translations, intermolecular librations, and intramolecular vibrations.
2. Materials and Methods
2.1. Samples
We used for our experiments the following saccharides samples from Merck (Rahway, NJ, USA) (the abbreviations that will be used for the compounds from here on are indicated in brackets): anhydrous (96%) -D-Glucose (GLU), pharmaceutical secondary standard D-galactose (GAL), analytical standard -lactose monohydrate (LM), pharmaceutical secondary standard anhydrous -lactose (LA), pharmaceutical secondary standard anhydrous -lactose (LA), analytical standard -cellobiose (CBS), and Avicel® PH-101 microcrystalline cellulose I (CEL). Ultra-high molecular weight, surface-modified, 40–48 m particle size polyethylene (PET) was purchased from Sigma-Aldrich (St. Louis, MO, USA). All compounds were used without further purification.
2.2. Design of the Experiment (DOE)
The spectroscopic measurements were carried out on pellets prepared by mixing specific amounts of saccharides with PET powder at a mass ratio to obtain the highest optical absorbance within the dynamic range (
) of the THz-TDS set-up [
2]. The highest absorbance
that can be measured by the THz-TDS set-up is computed by using Equation (
1) [
2]:
where
n is the refractive index of the sample. The graphs of
and
as a function of the frequency are shown in the
Supplementary Materials. To estimate the mass ratios of saccharides to PET in the pellets that give the highest absorbance, the absorption coefficient of the main THz peaks of the saccharides investigated in this work,
, together with their refractive indexes in the THz range, were obtained from the preliminary THz-TDS experiments.
The absorption coefficient of the pellets (
) corresponding to the main saccharide peaks can be expressed as
where
is the volume fraction of saccharides, and
is the absorption coefficient of PET at the frequency of the main saccharide peaks. Thus, the maximum thickness of the pellets is given by
In practice, to avoid having a THz signal transmitted through the pellets at the same level as the noise floor of the THz-TDS set-up, a pellet thickness of
was chosen. Then, the masses of saccharides (m
) and PET (m
) in the pellets can be calculated as follows:
and
where
is the total volume of the pellet with radius
r (6.5 mm in our case) and the chosen thickness
d, and
and
are the mass densities of the saccharides and PET, respectively (values used in the calculations are shown in the
Supplementary Materials). The calculated values of the saccharides and PET mass ratios varied from approximately 1:10 to approximately 1:5.
2.3. Pellets Preparation
Seven pellets were prepared using the masses of powders obtained by the procedure described in the previous section. In addition, one pellet made of PET powder only was prepared. The powders were weighted by using an analytical scale SARTORIUS Analytic A200S (Göttingen, Germany) with a sensitivity of 0.1 mg. After weighing, each saccharide was carefully mixed with PET. The mixed powder was then transferred into the 13 mm diameter pellet die (Perkin Elmer, Waltham, MA, USA). Then, the pellet die was inserted into a manual hydraulic press and was connected to a rotary vacuum pump EDWARDS E2M2 (London, UK) with a maximum displacement of 2.8 m/h and ultimate vacuum without gas-ballast of 2.5 × 10 mbar. The vacuum pump was turned on for five minutes without applying pressure to the pellet die. Then, maintaining the rotary vacuum pump on, the pellet die was subjected to a slow growing pressure up to approximately 369 MPa, and then left at this pressure for 10 min.
Subsequently, in order to improve the homogeneity of the pellets, the pellet die was disconnected from the vacuum pump and removed from the hydraulic press and heated at approximately 80 °C, placing it in a FALC TD thermoblock for 15 min. Since the melting temperature of the PET is approximately 110 °C and the saccharides do not undergo any structural change at this temperature (for example,
LM undergoes a transition to
LA at temperatures above 93.5 °C [
39]), it is expected that the unique effect of heating is the softening of the PET particles. This allows a better filling of the voids within the powder particles, giving a decrease in the scattering of the THz radiation [
2]. The pellet die was then removed from the thermoblock and quickly placed again in the hydraulic press. Then, the pressing process was repeated at approximately 737 MPa for 10 min. By using this procedure, very homogeneous cylindrical pellets with a whitish appearance were obtained. Their thicknesses were measured by a thickness gauge with 0.01 mm accuracy ranged from 1.28 to 3.39 mm. The measured pellet masses are slightly lower (usually less than 10 mg) than the sum of the masses of saccharides and PET powders used in their preparation. This loss of mass is most likely caused by the transfer process of powders into the pellet die.
2.4. THz Time-Domain Spectroscopy Set-Up
The THz spectra were captured via transmission mode at a temperature of around 300 K, utilizing a Menlo Systems (Planegg, Germany) TERA K15 THz-TDS system. This setup featured photoconductive antennae stimulated by a femtosecond fiber-coupled laser (Menlo Systems T-Light). The femtosecond laser emitted light at a wavelength of 1560 nm, with a repetition frequency of 100 MHz and an optical pulse duration of roughly 90 fs. The THz radiation emitted by the photoconductive antenna underwent collection and focusing through a pair of polymethylpentene (TPX) lenses, each with a 50 mm focal length. The radiation transmitted through the sample pellets was then gathered and directed onto the detector antenna, employing another identical pair of TPX lenses with a 50 mm focal length.
For all measurements, the delay line scan range covered 200 ps, resulting in a spectral resolution of 5 GHz. The data points were recorded at intervals of 33.3 fs, with a scanning rate of approximately 4 Hz. Each spectrum was derived from an average of 2163 scans, each spanning roughly 0.23 s. This means that the acquisition time for each sample or reference was 500 s. The system’s spectral range extended from 0.3 to 4.4 THz. The frequency-dependent dynamic range (
) measured approximately 85 dB at 0.35 THz and approximately 22 dB at 3.5 THz (refer to
Supplementary Materials, Figures S1 and S2).
The pellets were positioned within the transmission lines, and their two flat parallel surfaces were set perpendicular to the THz beam line axis, all within a sample holder. The sample holder was a plate of 14.5 × 14.5 cm
and with a thickness of 0.6 cm with 16 circular pellet slots and an opening of 12 mm in diameter, realized in polylactic acid (PLA) by using a 3D printer ANYCUBIC i3 MEGA (Shenzhen, China). The sample holder was affixed to a bidirectional translation stage designed for mapping purposes, enabling movement within the
xy plane perpendicular to the THz beam. This configuration facilitated automated sequential measurements of all samples. Movements along the
x and
y directions were synchronized with THz measurements by using a self-made program in a LabVIEW (NI, Austin, TX, USA) environment. For each pellet, three transmission measurements were performed in different spots separated by 2 mm. The spectral measurements were performed out of the THz focus (approximately 7 mm away from the THz nominal focus) so that the THz pulse could probe a large area, thus, minimizing the pellets’ heterogeneity-related effects. The THz beam profile was carefully characterized before measurements by means of knife-edge measurements [
40,
41] at the sample position in order to know its size and assure its complete transmission through the pellets. The measured THz spot was approximately 7 mm in diameter.
Since water vapor, due to environmental humidity, absorbs THz radiation at frequencies corresponding to the molecular rotational transitions [
42], the THz set-up was placed into a closed compartment purged with dry nitrogen (N
). The relative humidity (RH) inside the compartment was measured by using a data logger HygroLog HL-20D (Rotronic, Switzerland). The spectral measurements were not carried out until the datalogger showed RH = 0 within its accuracy and the water vapor absorption lines were indistinguishable from noise up to 2.5 THz (nearly 120 min of purging before the first spectra acquisition).
2.5. THz-TDS Data Analysis
THz-TDS spectroscopy offers the capability to determine the complex refractive index of a sample, denoted as
, as a function of angular frequency
[
2]:
where
is the phase refractive index,
is the extinction coefficient,
is the absorption coefficient, and
c is the speed of light.
In our spectroscopic measurements in transmission mode, two time-domain THz pulses propagating through the pellets (sample) and through the N atmosphere (reference) were recorded, and their spectral amplitude and phase, for the sample and for reference, were calculated by a standard discrete fast Fourier transform (FFT) algorithm implemented in a Wolfram Mathematica (Champaign, IL, USA) program.
and
were compared by using the complex transfer function
, defined as
where
and
are the transfer function amplitude and phase, respectively.
The variation of
with frequency can be deduced through the solving of the inverse scattering problem for electromagnetic waves. This involves equating the experimental transfer function
(
j is a data array index) to the analytical expression of
, obtained by solving the direct electromagnetic problem.
The coherence of THz radiation in THz-TDS poses a significant challenge when extracting optical parameters from
and
due to the established FP interference effect resulting from multiple reflections within the sample [
43]. In this scenario, the analytical form of
, derived using the transfer matrix method [
44], is
where the
term takes into account the FP interference effect
Equations (
9) and (
10) are valid for radiation impinging perpendicularly on a flat sample. The extraction of the optical parameters as a function of frequency can be performed by equating the theoretical and experimental transfer functions:
=
. However, this equation can be solved only numerically.
In order to determine the optical parameters of saccharides from the THz-TDS spectral data, we have employed a numerical procedure written in the Wolfram Mathematica environment and already described in previous articles [
30,
31]. This procedure is based on the Davidenko method [
45] for finding the roots of complex functions. It establishes frequency-dependent optical parameters by solving the implicit equation derived from equating the experimental and theoretical transfer functions, incorporating the self-contained FP effects. The comprehensive elimination of residual FP oscillations inherent in the optical parameter trends is achieved solely through the optimization of sample thicknesses.
2.6. Absorption Coefficients and Refractive Indexes Calculations
The DOE, as discussed in
Section 2.2, did not incorporate the potential existence of a volume fraction of air,
, within the pellets, which could modify the optical parameters. Thus, in order to obtain more precise values of the saccharides’ absorption coefficient and refractive index, the value of
was estimated for all pellets. As already reported in
Section 2.3, the mass of the pellets was always lower than the sum of the masses of saccharides and PET used for their preparation. Then, the actual masses of saccharides and PET in the pellets,
and
, respectively, were estimated. The overall lost mass in the preparation is
, where
is the weighed mass of the pellet. It is then possible to estimate
and
by assuming that their lost masses,
, are proportional to their weighted mass percentage in the pellet:
where
.
From the knowledge of the mass densities of the saccharides and PET (values used in the calculations are shown in the
Supplementary Materials), it is then possible to calculate their actual volume fraction,
, as
Since
, it is possible to recover
. The results are shown in
Table 1.
Such values can now be used to calculate the absorption coefficients,
, and the refractive indexes,
, of the saccharides. Since the absorption coefficient and the refractive index of the pellets,
and
, respectively, are
and
by assuming
= 0 and
= 1 in the in the measured THz range, then
and
2.7. DFT Simulations
All the studied saccharides are modeled as crystals and their geometry is optimized by using a DFT plane-wave code with ultrasoft pseudopotentials that are included in the Quantum Espresso package. The convergence criterion of geometry optimization for total energy and force were both set at 1.0d-5 Ry. The theoretical THz spectra were obtained by exploiting DFT ab initio techniques. The Quantum Espresso package was used with the plane-waves basis set, within the finite displacements framework (FD code) [
46,
47]. To account for van der Waals interactions, a non-local van der Waals density functional known as vdW-DF2 was utilized. This functional builds upon the precise semilocal exchange functional, PW86 [
48], and was selected to enhance the portrayal of H-bonds [
49,
50].
Ultrasoft pseudopotentials [
51], suitable for the non-local van der Waals functionals, were generated [
52]. When using such functionals, the convergence on both wavefunctions and the charge cutoff is a delicate issue. In this study, Kohn–Sham wavefunctions were truncated using a cutoff of 40 Ry, while the charge density employed a cutoff of 800 Ry. The Irreducible Brillouin Zone was discretized using 14 k-points. The vibrational properties were evaluated through finite displacements within the DFT framework under the harmonic approximation. The calculations encompassed phonon eigenvalues and eigenfunctions at the
point, alongside the determination of Born effective charges. The simulated crystal structures and parameter information can be found in the
Supplementary Material.
2.8. Mode Analysis
The observed absorption characteristics of saccharides could arise from intermolecular vibrations facilitated by networks of H-bonds, as well as intramolecular vibrations encompassing the collective backbone motions of molecules. The intermolecular vibrations may entail both translations and rotations of molecules. In this investigation, we adopted the method outlined by F. Zhang et al. [
7] for mode analysis, which aims to categorize molecular optical phonon modes in relation to intermolecular translations, intermolecular librations, and intramolecular vibrations.
This method’s focus lies in assessing vibrations within the irreducible molecular units present within the unit cell. In specific cases, such as LA, LM, LA, CBS, and CEL, there are two molecules per unit cell, while GLU and GAL possess four molecules per unit cell. Intermolecular translations are characterized by the center of mass (COM) displacement along the crystallographic Cartesian coordinate system’s three axes: X, Y, and Z. Intermolecular libration, involving the constrained rotation of the entire molecule as a rigid body, can be deconstructed into three independent librations about the molecule’s three principal axes: x, y, and z.
For intramolecular vibrations within the
nth mode, the displacement vector of the
ith atom is determined by subtracting the contributions of all translations and principal librations from its overall motion. The amplitudes of the translations, principal librations, and intramolecular vibrations for the
nth normal mode are ascertained using the root-mean-square mass-weighted atomic displacement of all atoms within the molecule, given by the equation:
Here,
signifies the atomic displacement vectors of an irreducible molecule in the
nth normal mode, with
V representing
X,
Y, or
Z and
l pertaining to the three translations and three principal librations.
N denotes the number of atoms in the molecule. The percentage contribution
of each component within the
nth mode is computed as follows:
The analysis of the vibrational modes was performed using the atomic displacement vectors associated with each vibrational mode obtained from DFT simulations. A Fortran language program was written for this purpose.
4. Discussion
The observed spectral differences of the studied saccharides can be qualitatively explained in terms of molecular structure, H-bonding network, and the presence of water in the crystallographic structure. The pattern of absorption peaks looks similar for
LA and CBS (
Figure 2), and it is possible to appreciate an overall shift towards the higher frequencies of the spectrum of CBS compared to that of
LA. The crystal structure of
LA is nearly isomorphous with CBS. However, the H-bonding scheme around the axial O(4) atom of
LA is different from that of CBS. In particular, the
LA structure presents just one of the three H-bonds of CBS [
56,
57]. The fact that in
LA, the O(4) atom of the galactose subunit is connected through a single H-bond with the O(3’) atom of a neighboring glucose subunit can roughly explain the shift of THz peaks for
LA to lower frequencies. As shown in
Section 2.8, of the highest intensity absorption peaks of
LA and CBS (at 1.83 and 2.07 THz, respectively) approximately 60% are due to intermolecular librations.
The presence of fewer H-bonds gives a weaker overall force constant (
K) of the overall chemical bonds that regulate these intermolecular librations of the molecules. Since the oscillating mass is the same for
LA and CBS (their molecular mass is
M = 342.297 uma), according to the harmonic oscillator frequency formula
, a lower normal mode vibrational frequency
is expected. In order to estimate
K, we applied this formula to the 1.83 THz peak of
LA and the 2.07 THz peak of CBS, obtaining
K values of approximately 75 and 96 N/m, respectively. The difference is approximately 21 N/m. Since CBS possesses two additional H-bonds, by dividing this value by two, we found
K ≃ 10.5 N/m, a value within the force constant range of the H-bond, from approximately 3 to 80 N/m depending on the molecule [
58].
The THz spectra of CBS and CEL (
Figure 2a and
Table 2) appear to be very different: while CBS shows five evident peaks, CEL is characterized by two small intensity peaks at 2.15 and about 3 THz superimposed to the monotonic increase. Even if CBS is the repeating unit of CEL, the supramolecular structure of the two compounds is very different due to different intermolecular H-bond networks. The CEL supramolecular structure is characterized by a network of inter- and intramolecular H-bonds, which give great stiffness to CEL chains made up of hundreds to thousands of glucose units [
59]. CBS consists of two glucose units linked by a
-(1,4
) glycosidic bond, and its structure is, therefore, more flexible than that of CEL. These structural differences may explain the presence of the several vibrational peaks in the CBS spectrum compared to that of CEL.
GLU and GAL are stereoisomers of each other. The main structural difference is the opposite orientation of the OH groups at carbon atom C(4) in the pyranose ring. Despite this, GLU and GAL exhibit very different THz spectra. The rearrangement of the hydroxyl group changes the potential energy distribution, and this most likely causes differences in the intermolecular vibrational motions, giving very distinct THz spectra. A GLU unit linked to a GAL unit through
-(1,4) glucosidic linkage forms
LA. If the THz spectra of GLU and GAL are compared with the THz spectrum of
LA (
Figure 3), very marked differences are evident. The absorption spectrum of
LA shows five peaks at 1.19, 1.86, 2.20, 2.38, and at approximately 2.68 THz (see also
Section 3.3). The most evident peaks were at 1.19, 1.86, and approximately 2.68 THz. These insights underscore the inadequacy of a single-molecule approach for modeling spectra within the THz range. This is due to the presence of vibrational modes that extend across multiple molecules, in contrast to the mid-infrared spectroscopic range where vibrations tend to be localized around individual or a few molecular bonds [
6]. A relevant contribution to the THz spectra comes from the interactions between the molecular units. This properly explains the complexity of the low-energy dynamics of these compounds. Conversely, it can serve as an experimental tool to investigate and differentiate the crystalline structure of a molecular crystal specimen.
In order to obtain quantitative data, the peaks at 0.53 THz and 1.37 THz in
LM were subjected to Lorentzian curve fitting, yielding full-width-at-half-maximum (FWHM) values of (0.021 ± 0.002) THz and (0.047 ± 0.002) THz, respectively. This analysis draws upon both classical and quantum physics, where a Lorentzian line shape signifies homogeneous damping, indicating that the molecular oscillators accountable for absorption experience equivalent scattering or damping mechanisms, regardless of their individual characteristics or states [
15]. Mechanistically, this dynamic is often governed by an exponential decay over time, characterized by a time constant
linked to the Lorentzian linewidth via
. Consequently, for the vibrations responsible for the 0.53 THz band,
amounts to 15.1 ps, while for the 1.37 THz band, it stands at 6.8 ps. These
values notably exceed the room-temperature damping times associated with molecular vibrations, including collective ones (such as phonons) in solids [
15]. This suggests that the spectral features at 0.53 THz and 1.37 THz stem from a collective vibrational transition that evades the typically rapid scattering mechanisms. One plausible explanation lies in the significant reduction of phonon–phonon scattering, attributed to a relatively subdued lactose crystal intermolecular potential energy. This diminished potential energy likely arises from the presence of water molecules within the
LM crystal structure [
6]. The comparison of the spectra of
LM,
LA, and
LA shows the importance of water molecules in affecting the THz vibrational spectra by modifying the intermolecular interactions mediated by H-bonds.
The qualitative analyses of the THz spectra show the complexity of the observed features and the need for computational simulations in order to assign the experimental peaks to optically active vibrational modes and to obtain information on their vibrational characters. In the comparison between the experimental and calculated spectra, it must be considered that the theoretical method is able to determine the normal mode frequencies of crystalline samples in the harmonic oscillator approximation. This methodology forms the foundation for associating THz absorption peaks with crystalline samples [
4]. Nevertheless, even with a grasp of the normal modes within a molecular crystalline system, attributing these modes remains intricate due to the presence of anharmonic effects. These effects induce changes in phonon energies and concurrently curtail their lifetimes. As a consequence, the observed THz peaks generally exhibit frequencies that deviate from those predicted by the harmonic model, accompanied by significantly broader line shapes. Hence, anticipating a flawless alignment between a THz spectrum and a normal mode simulation grounded in the harmonic approximation would be unrealistic, even when employing the most precise modeling methodologies [
4]. In light of anharmonicity’s impact on both band frequency and line shape, the accuracy of normal mode assignments would be optimized by utilizing the THz spectra acquired at the lowest attainable temperature to mitigate the anharmonic effects. However, the frequency shifts caused by temperature variations seldom surpass 0.1 THz [
4,
5,
11].
Figures S12–S18 of the Supplementary Materials show that the vibrations within the molecules mostly involve bending vibrations of H atoms and rocking motion of the O-H group. Usually, these vibrations appear at higher frequencies, but due to interactions with vibrations between molecules, they also appear at lower frequencies [
4]. This mixing of intermolecular vibrations and the H and O-H vibrations was observed previously in other studies, such as the combination of vibrations and torsion in certain molecules. This mixing likely extends to the vibrations responsible for hydrogen bond formation in the O(N)-H groups [
4,
7]. Therefore, no clear demarcation exists between the predominant intermolecular librations and intramolecular vibrations for the studied saccharides. The distinctive frequency domains of intramolecular vibrations entirely coincide with those of intermolecular librations as a consequence of the strong mixing of these two categories of vibrations that takes place in the studied THz range [
7].
5. Conclusions
We conducted a detailed spectroscopic study of seven saccharides (GLU, GAL, LM, LA, LA, CBS, and CEL) by using a THz-TDS system. The THz-TDS data analysis was carefully performed to eliminate the FP interference effect, and the actual volume fraction of samples within the pellets was evaluated to recover the absorption coefficient (cm) and refractive index of the saccharides. The experimental results were compared with the theoretical spectra obtained by ab initio techniques based on DFT considering H-bonding interactions with an enhanced version of the van der Waals non-local density functional vdW-DF2.
The analysis of the saccharides’ low-energy vibrational modes followed the methodology introduced by F. Zhang et al. [
7]. This facilitated the calculation of the proportional contributions of intermolecular translations, intermolecular librations, and intramolecular vibrations for every mode within the saccharides. The low-energy vibrations of the studied saccharides are mainly due to intermolecular librations and intramolecular vibrations, with a small percentage of intermolecular translations. The convergence of the characteristic frequency ranges between intermolecular librations and intramolecular vibrations implies the flexibility of the molecular backbone and a significant amalgamation of these two vibration classes within these frequency domains.
Saccharides, and in particular, lactose, are one of the most used pharmaceutical excipients. In fact, approximately 70% of pharmaceutical dosage forms contain lactose as a filler, carrier, excipient, or binder agent. Therefore, we believe that our analysis of the vibrational modes in the THz region could be used in the design and synthesis pharmaceutical processes.