Next Article in Journal
Advances on Cellulose Manufacture in Biphasic Reaction Media
Previous Article in Journal
Polarization of Melatonin-Modulated Colostrum Macrophages in the Presence of Breast Tumor Cell Lines
Previous Article in Special Issue
Nuclear Magnetic Resonance Relaxation Pathways in Electrolytes for Energy Storage
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Application of 2D NMR Spectroscopy in Combination with Chemometric Tools for Classification of Natural Lignins

by
Anna V. Faleva
*,
Ilya A. Grishanovich
,
Nikolay V. Ul’yanovskii
and
Dmitry S. Kosyakov
*
Laboratory of Natural Compounds Chemistry and Bioanalytics, Core Facility Center “Arktika”, M.V. Lomonosov Northern (Arctic) Federal University, Northern Dvina Emb. 17, 163002 Arkhangelsk, Russia
*
Authors to whom correspondence should be addressed.
Int. J. Mol. Sci. 2023, 24(15), 12403; https://doi.org/10.3390/ijms241512403
Submission received: 30 June 2023 / Revised: 31 July 2023 / Accepted: 2 August 2023 / Published: 3 August 2023
(This article belongs to the Special Issue Modern NMR Characterization of Materials)

Abstract

:
Lignin is considered a promising renewable source of valuable chemical compounds and a feedstock for the production of various materials. Its suitability for certain directions of processing is determined by the chemical structure of its macromolecules. Its formation depends on botanical origin, isolation procedure and other factors. Due to the complexity of the chemical composition, revealing the structural differences between lignins of various origins is a challenging task and requires the use of the most informative methods for obtaining and processing data. In the present study, a combination of two-dimensional nuclear magnetic resonance (2D NMR) spectroscopy and multivariate analysis of heteronuclear single quantum coherence (HSQC) spectra is proposed. Principal component analysis and hierarchical cluster analysis techniques demonstrated the possibility to effectively classify lignins at the level of belonging to classes and families of plants, and in some cases individual species, with an error rate for data classification of 2.3%. The reverse transformation of loading plots into the corresponding HSQC loading spectra allowed for structural information to be obtained about the latent components of lignins and their structural fragments (biomarkers) responsible for certain differences. As a result of the analysis of 34 coniferous, deciduous, and herbaceous lignins, 10 groups of key substructures were established. In addition to syringyl, guaiacyl, and p-hydroxyphenyl monomeric units, they include various terminal substructures: dihydroconiferyl alcohol, balanopholin, cinnamic acids, and tricin. It was shown that, in some cases, the substructures formed during the partial destruction of biopolymer macromolecules also have a significant effect on the classification of lignins of various origins.

1. Introduction

Being the second most abundant biopolymer in nature, lignin has attracted increasing attention from researchers as a renewable source of valuable chemical compounds and a feedstock for the production of various materials. Physical-chemical properties of lignin and thus its suitability for certain directions of processing are determined by the chemical structure of its macromolecules. They are formed as a result of enzymatic dehydrogenative polymerization of p-coumaryl, coniferyl, and synapyl alcohols (monolignols). The latter act as precursors for p-hydroxyphenyl (H), guaiacyl (G), and syringyl (S) phenylpropane structural units (PPU), respectively. An absence of genetic control of the polymerization process and a great diversity of bonds between the structural units of the macromolecule determine the irregularity of the polymer chains and the dependence of the resulting structure on many factors. Despite the fact that there are a large number of publications devoted to the structure of lignins of various origins [1,2,3,4,5], many aspects in this field still remain unclear due to its exceptional complexity. Naturally, the structure of lignin is primarily determined by the botanical origin of the biopolymer [5], but it also depends on the procedure used for isolating lignin preparations from the plant material [1,6,7,8,9]. At the same time, a number of recent studies showed that lignin structure can be additionally complicated due to the incorporation of non-typical moieties, such as catechin-type phenylpropane units (C) [4] and flavonoids (for example, tricin) [5,10].
In this regard, a detailed classification of lignins should be based on the most complete consideration of both major and minor structural features of preparations obtained from various plant materials. This is a very difficult and non-trivial task. Its solution is challenging and requires the use of the most sophisticated analytical techniques. Among them, high-resolution mass spectrometry (HRMS) and 2D NMR spectroscopy certainly dominate [11,12]. The latter has obvious advantages, making it possible to identify various structures in the lignin macromolecule without preliminary degradation of the biopolymer [13]. However, the complexity of NMR spectra of lignin and strong overlap of spectral peaks even in 2D NMR experiments makes the “manual” interpretation of the data very difficult. In this regard, most of the works available in the literature are focused on the search and quantification of the known and most abundant structural units using expert analysis of individual spectra and neglect the identification of minor fragments. However, they can play a key role in a detailed classification of lignins and deep understanding of their transformations in various biological and technological processes. To overcome this problem, chemometric approaches to the data mining and treatment, primarily hierarchical cluster analysis (HCA), principal component analysis (PCA), and partial least squares projection to latent structure (PLS), should be implemented. In addition to being widely used in metabolomic studies [14], they have proven themselves useful for revealing differences between lignin preparations based on FT-IR spectra [15,16]. Recently, Lancefield et al. [17] reported the use of PCA and PLS for the FT-IR analysis of 54 lignin samples differing in origin and fractionation procedure.
Surprisingly, there are still no data in the literature on the use of multivariate analysis (MVA) of 2D NMR data to study the structural differences of lignins. Nevertheless, this approach has already been successfully used to discriminate wines based on their polyphenolic composition [18], as well as identify differences between extracts of the poplar phloem and in the chemical composition of normal and tension wood [19,20]. This is primarily due to the complexity of processing 2D NMR spectra for their subsequent analysis using conventional MVA software, as well as difficulties in the subsequent extraction of information about specific compounds or structural fragments that contribute to certain differences. The latter factor is the reason why the majority of publications available in the literature in this field are limited to sample classification purposes. At the same time, methods for solving such problems are quite well known in the literature [21] and have been further developed for wood components by Hedenström et al. [19]. They proposed a procedure for MVA of frequency domain 2D NMR data without any need for peak picking or integration prior to analysis, where the loading’s plot can be visualized as pseudo-HSQC spectra to identify potential biomarkers.
The aim of this study is to expand the scope of this approach for the classification of lignins by botanical origin and to determine the key substructures (biomarkers) for their differentiation by PCA and HCA analysis of 2D NMR (HSQC) spectra. As an example, preparations of dioxane lignin isolated from coniferous and deciduous wood, as well as herbaceous plants, were studied.

2. Results and Discussion

2.1. General PCA-Based Classification of Lignins

Differences in the chemical composition and structure of 34 dioxane lignin preparations (Table 1), including 12 coniferous, 10 deciduous, and 12 grass lignin samples, were visualized using MVA of pre-processed 1H-13C HSQC spectra according to the procedure described in Section 3. In the case of spruce, pine, juniper, and larch, plant material samples obtained from different trees were used for isolation of lignin preparations to estimate the intra-species variability of the lignin structure.
The obtained PCA results showed that the observed data variations can be described by a total of 19 principal components. Based on the residual variance curve, it was found that four of them are sufficient to describe 70% of the differences (Figure S1), while PC1, PC2, PC3, and PC4 account for 39.9, 16.4, 8.5, and 6.0% of the total variance, respectively. The score plot for PC1 versus PC2 (Figure 1a) clearly demonstrates the distribution of the studied lignins into the three distinct clusters along the PC1 axis. The accumulation of points belonging to deciduous (hardwoods) lignins on the right side (large positive values of PC1) of the graph is observed, while the points on the left side (large negative values of PC1) correspond to coniferous (softwoods) lignins. The third cluster, related mainly to grass (herbs) lignins, is located near the center with a shift towards lignins isolated from hardwood samples. Replacing PC2 on the y-axis with PC3 made it possible to better differentiate grass lignins highlighting the areas corresponding to monocots and dicotyledons (Figure 1b). The score plot in the PC1-PC4 coordinates (Figure 1c) provided a complete separation of the lignins isolated from deciduous hardwood and grass (herbs).
The loading plots illustrating in detail the key structures responsible for the differences between the clusters in four principal components (PC1-PC4) were converted back to the corresponding HSQC NMR spectra (Supplementary Figures S2–S5), which allowed the main structures responsible for differences between the studied lignins to be revealed (Table 2).
As expected, the main differences between lignins are in the ratio of syringyl and guaiacyl monomeric units (S/G), namely, in the presence of S-structures in the composition of deciduous lignins, and vice versa, the predominance of G-structures in coniferous lignins. In the case of herbaceous lignins, the important role is played by the presence of H-type structures, which make a significant contribution to PC2. However, the composition of the main dimeric structures also largely affects the differentiation of lignins. In particular, fragments of β-aryl ether and resinol predominate in the composition of deciduous lignins, while the substructures of phenylcoumarane and secoisolariciresinol types are characteristic of coniferous lignin. This is in good agreement with the literature data. It is worth noting that the signals belonging to Hibbert’s ketone and its isomers are observed as negative peaks for PC1, PC2, and PC3 and thus dominate in the composition of coniferous lignins. This may indicate that the destruction of β-O-4 bonds in them during the isolation procedure takes place to a greater extent.
The differentiation of grass lignins is not so unambiguous. In particular, the dioxan-lignin of the dicotyledonous plant Sosnowsky’s hogweed (Heracleum) is located in the cluster of lignins isolated from hardwood. This may indicate that its chemical composition is distinguished with a significant proportion of S-structures and the absence of cinnamic acids which are characteristic of grass lignins. Another illustrative example is the lignin of saxifrage [22], which is located separately on score plots in PC1-PC2 and PC1-PC3 coordinates and goes beyond the confidence interval of herbaceous lignins area. This is due to the intense signals belonging to the H-units, fatty acids, and acetylated β-aryl ethers (Supplementary Figures S3 and S4).
Based on the data presented in Figure S4 (Supplementary Material), it can be seen that the cross-peaks of the flavonoid tricin, p-coumaric, and ferulic acids, as well as arabinofuranose, make the greatest contribution to the differentiation of grass lignins along the PC3 axis. The latter compound forms an ester bond with ferulic acid and along with tricin can be considered a main biomarker of lignins in cereal straw.
In the aforementioned differentiation of lignins of monocotyledonous and dicotyledonous herbaceous plants in the PC1-PC3 coordinates (Figure 1b), cattail (Typha) lignin which unexpectedly falls into the zone of dicotyledonous plants can be considered an exception. This is explained by the absence in its structure of flavonoid tricin fragments characteristic of monocots [23]. It is also noteworthy that some lignins isolated from dicotyledonous grasses are close to the cluster of deciduous lignins, which indicates a substantial similarity in their structure.
The combination of PC1 and PC4 for constructing the score plot (Figure 1c) proved to be most suitable to distinguish herbaceous and deciduous lignins. This suggests that the main part of the PC4 is contributed by the signals of substructures responsible for this classification. As can be seen from the loading plot HSQC spectrum (Supplementary Figure S5), the cross-peaks related to S/G/H monomeric units have no decisive effect on the separation of lignins along the PC4 axis, and the most intense contours are observed for signals which are characteristic of dimeric structures and some of their degradation products.

2.2. PCA-Based Discrimination of Coniferous Lignins

The results described above show that the coniferous lignins not only differ significantly from the lignins of hardwood and herbaceous plants but are also characterized by a significantly lower structural variability compared to the latter. In this regard, the differences in the composition and structure of dioxane lignins of coniferous trees belonging to different species were analyzed in more detail. The results of the PCA showed that 100% of the variation in the data can be described by 11 principal components, of which PC1 and PC2 account for 29.3 and 21.7%, respectively (Supplementary Figure S1). Combining the data obtained from the score plots and loading spectra, it was possible to establish the particular substructures which are characteristic of each species under study (spruce, pine, juniper, larch).
Obviously, when using the score plot in PC1-PC2 coordinates (Figure 2a), a clear separation is observed only between the spruce lignin, which is distinguished with highly positive PC1 values, and all other preparations. For the latter, intraspecies variations turned out to be comparable with inter-species differences. For example, pine and juniper lignins, despite belonging to different families, demonstrated very similar patterns indicating the identity of their main substructures. The HSQC spectrum extracted from the loading plot (Supplementary Figure S6) demonstrated that the main contribution to the special position of spruce lignin along the PC1 coordinate is made by Hibbert’s ketones, methyl-substituted phenylcoumarone, as well as the structures of vanillin and acetovanillone (Supplementary Table S1). On the other hand, cross peaks of H-type aromatic units, β-aryl ethers, and phenylcoumarane are characteristic of other coniferous lignins, the points of which are located on the left side of the score plot.
Based on the obtained results, two reasons can be suggested to explain the observed picture. First of all, the absence of H-type structures in the composition of spruce lignins may be due to the fact that the age of the plants selected for the isolation of spruce lignin preparations was about 80 years, while other representatives of coniferous trees were aged 25–30 years. Another reason may be associated with inherent limitations of the method caused by the lignin isolation procedure. In the latter, hydrochloric acid solution (0.7%) in dioxane was used as a mild hydrolytic agent facilitating the release of lignin. However, it also contributes to undesirable side processes of acid-catalyzed destruction and transformation of most labile structures in biopolymer macromolecules. They result in the formation of larger amounts of Hibbert’s ketones and other degradation products. Thus, to avoid possible classification errors, it is necessary to know which of the detected structures belong to intact lignin and which were formed during its isolation from a plant material. In the case of differentiation of lignins by classes, this factor may not have a significant effect, while a more detailed analysis, such as classification of coniferous lignins by families, is greatly complicated.
Of all the samples, only larch lignin preparations were not grouped together. An analysis of the PC2 loading HSQC spectrum (Supplementary Figure S7) makes it possible to establish the reason for such a strong displacement of the Larch 1 sample, which is located on the border of the confidence ellipse on the left side of the PC2 axis. The key structural differences in this case are due to the presence in this sample of taxifolin fragments not observed in other lignin preparations. The dramatic differences between individual lignin samples within the same tree species may be associated with the predominant contribution of random impurities in PC1-PC2, as well as the phasing error of some cross peaks [19], likely associated with residual solvents, which were not completely removed during the pre-processing of the spectra. In addition, these differences may be explained by the side reactions in the delignification process such as the destruction of the β-O-4 bond under mild conditions of acidolysis [24]. However, as previously noted in [16], the relative amounts of degradation products depend on the chemical composition of plant material, which in turn depends on the plant species.
The complete separation of the pine, larch, and juniper lignin areas on the score plots was not achieved either in the PC1-PC2 coordinates, or even when using other principal components with the highest contribution (up to PC6). This means that lignins of coniferous tree species have a fairly similar structure; therefore, a distinctive feature for each of the lignins may not be significant for the tested principal components. To this end, other PCs with less contribution to the description of all variations were analyzed revealing more subtle differences in the biopolymer structure. As a result, it was found that the signals described by PC7 (Figure 2b) may be partially responsible for the clustering of softwood lignins by species (families). In particular, they provide clear separation of juniper lignin from the preparations isolated from larch and pine.
The data obtained from the loading spectrum of PC7 (Supplementary Figure S8) showed that the distinguishing feature of juniper lignin is the absence of divanillyltetrahydrofuran and secoisolariciresinol fragments in their structure, as well as a large proportion of β-O-4 bonds and Hibbert’s ketones. In turn, pine lignin samples located in the area at the low PC7 values contained the mentioned structures and their composition is dominated by fragments of dibenzodioxocin, methyl-substituted phenylcoumarone, and other degradation fragments, including vanillin and vanillic acid (Supplementary Table S1).
Summarizing the results of PCA analysis and structural information from HSQC NMR loading spectra of lignins of various origins, the following classification of the studied biopolymer preparations presented in Figure 3 as a block scheme [25] can be proposed based on the specific substructures (biomarkers) identified in them.

2.3. Hierarchical Clustering of Lignins

In general, HCA analysis of the 2D HSQC NMR spectra of the studied lignins showed the same pattern as PCA. However, it allowed for a more detailed and clearer clustering of preparations isolated from the plants of various species and families (Figure 4).
Two large clusters are observed on the dendrogram, one of which, in turn, is divided into two separate subclusters of deciduous and herbaceous lignins. Another subcluster involves mainly coniferous lignin preparations. This differentiation is obviously caused by the ratio of the main types of PPU. It should also be noted that 3 of the 34 analyzed samples were differentiated separately. Two of them belong to cereal straw lignin, which is explained by the presence of flavonoid-type substructures and other impurities in them. In addition, it was shown that aspen lignin is not a part of the hardwood lignin subcluster, which is explained by the presence in its structure of fragments that were largely modified during isolation procedure.
In addition, it can be seen that the differentiation between the lignins of dicotyledonous and monocotyledonous grasses was not clear enough. This is evidenced by the clustering of wheat and cattail lignins together with representatives of dicotyledonous grass lignins. However, subclustering of lignins by families within the cluster of hardwood lignins attracted the most attention. It is known that most of them have a similar composition of substructures, differing only in their quantitative ratio. The exceptions are willow, aspen, and poplar lignins, which contain structures of p-hydroxybenzoates (Figure 3). The cluster of coniferous lignins also undergoes differentiation, but the differences in this case are not so predictable and require a more detailed study.

3. Materials and Methods

3.1. Plant Material and Dioxane Lignin Isolation

The samples of saxifrage (Saxifraga oppositifolia L.) stems were obtained from Piramida settlement (Svalbard, Norway). Other plants were harvested in the Primorskii district of the Arkhangelsk region (Russia). At least three samples of each plant species from different sites were averaged prior to lignin isolation. Dioxane lignins were isolated from plant tissues (xylem of woody plants, grass aerial part, cereal straw) by the Pepper’s method [26], involving a mild acidolysis of lignocellulosic biomass in an inert atmosphere and extraction of lignin in water-dioxane medium. Before isolation of lignin, the plant biomass (xylem of woody plants, cereal stalks and aerial parts of other herbaceous plants) was crushed in a ZM 200 centrifugal mill (Retsch, Haan, Germany) to a particle size of <1 mm, vacuum dried at 40 °C, and subjected to exhaustive extraction with acetone in a Soxhlet apparatus to remove low-molecular extractives. A complete list of the obtained lignin preparations, their elemental compositions and molecular weight characteristics, as well as the attained yields are presented in Table 1. Determination of the carbon and nitrogen content were carried on an elemental (CHNS) analyzer EA-3000 (EuroVector, Pavia, Italy). The calculation of the oxygen content was carried out by the difference. Data are reported in weight percent as the average of the three replicates. The mean square deviation of the random component of the measurement error was 0.3% for C and 0.1% for H. Number-average (Mn) and weight-average (Mw) molecular masses were determined by size-exclusion chromatography on an LC-20 chromatographic system (Shimadzu, Kyoto, Japan) consisted of an LC-20AD pump, a DGU-5A vacuum degasser, an STO-30A column thermostat, an SIL-30AC autosampler, and an SPD-M20A diode array UV-VIS spectrophotometric detector. The separation was carried out on at 40 °C on an MCX column, 300 × 8 mm, pore size 1000 Å (PSS, Mainz, Germany). Aqueous solution of sodium hydroxide (0.1 M) was used as a sample solvent and mobile phase.

3.2. Sample Analysis Using 2D 1H-13C HSQC NMR

In total, 50–80 mg of the dry lignin powder was dissolved in 0.55 mL of DMSO-d6 (Deutero GmbH, Kastellaun, Germany) and transferred to a 5 mm NMR tube. The 1H-13C HSQC spectra were recorded on a Bruker AVANCE III 600 MHz spectrometer (Bruker Biospin, Rheinstetten, Germany) using the Bruker library hsqcedetgpsisp2.3 pulse sequence. The experiments were carried out at 298 K using the following acquisition parameters: size of FID—1024 (F2) and 256 (F1), number of scans—32, relaxation delay—2 s, spectral width—15 ppm (F2) and 239 ppm (F1), transmitter offset—5.719 ppm (F2) and 98.2 ppm (F1), 1JC-H = 145 Hz. After zero filling, the resulting data matrix size was 1024 (F2) × 1024 (F1). The spectra were processed using Bruker’s Topspin software version 3.2.

3.3. Spectral Processing and PCA/HCA Analysis

Despite the fact that the assignment of the main cross-peaks is well described in the literature, visual comparison of data and integration of peaks related to the main substructures does not give a complete picture and does not allow us to establish a specific set of fragments characteristic of a particular lignin. The solution to this problem is possible by analyzing the data array of 2D HSQC spectra using multidimensional analysis. In this work, a combination of PCA/HCA methods was used. An overview of the procedure is shown in Figure 5.
The initial data was the HSQC spectrum data matrix in the range δC/δH 0-210/0-10 ppm, which includes all cross-peaks characterizing the composition and structure of lignin preparations. Preparation and preprocessing of NMR spectra for multivariate analysis was performed in accordance with protocol described by Hedenström et al. [19]. The obtained spectral data were analyzed by the PCA in order to get the most complete picture of the differences between the spectra and, consequently, the differences in the composition and structure of lignin preparations. Based on the data of the residual dispersion curve, an optimal set of principal components was determined, which were subject to further analysis and the construction of a scores and loadings plots. The latter were presented in the format of pseudo-HSQC spectra according to Hedenström et al. [19]. The sets of cross-peaks observed on these spectra and their intensity allowed us to draw conclusions about the contributions of certain fragments to the classification of lignins.
The scores used to construct scores plots were used as a distance matrix, allowing us to confirm the clustering of the studied lignin samples using the hierarchical method (HCA).
POKY software (version 20220114) was used for preliminary modeling of HSQC spectra [27]. For multidimensional analysis, pre-processing of all 2D 1H-13C HSQC spectra was performed in MATLAB R2021a software (Mathworks Inc., Natick, MA, USA), as described in Ref. [19]. PCA analysis was performed by MarkerView software version 1.2.1 (ABSciex, Toronto, ON, Canada) using Pareto scaling. The choice of this scaling option is based on the fact that it preserves the shapes of spectral lines better when loading pseudo-spectra [14]. Subsequently, the load vectors containing spectral information strongly correlating with the main differences were converted back into a 2D NMR load spectrum using the MATLAB R2021a software. HCA was performed based on the data of the main components using the OriginPro 2019b software (OriginLab Corp., Northampton, MA, USA). The error rate for cross-validation of training data was 2.27%.

3.4. Lignin Substructure Assignments

Qualitative analysis of 2D HSQC NMR spectra of the studied lignins and pseudo-NMR spectra representing loading plots of the main components was carried out by comparison with data from the literature sources [5,9,22,28,29,30,31,32]. The correctness of the identification was confirmed by the presence of all characteristic cross-peaks of the indicated fragments and the compliance of their chemical shifts with the literature data. The maximum permissible deviation was at 1 and 5 ppm for 1H and 13C dimensions, respectively.

4. Conclusions

The use of 2D NMR spectroscopy in combination with PCA and HCA makes it possible to effectively classify lignins of various botanical origins at the level of belonging to classes and families of plants, and, in some cases, individual species. The reverse transformation of loading plots into the corresponding HSQC NMR loading spectra provides structural information about the latent components of lignin and its structural fragments that cause certain differences. As a result of the study of 34 coniferous, deciduous, and herbaceous lignins, 10 groups of main substructures that determine the differences between lignins were established. In addition to syringyl, guaiacyl, and p-hydroxyphenyl PPU, they include various terminal substructures: dihydroconiferyl alcohol, balanopholin, cinnamic acids, and tricin. It was shown that, in some cases, the substructures formed during the partial destruction of biopolymer macromolecules also have a significant effect on the classification of lignins of various origins.
To the best of our knowledge, this study represents the first attempt to implement 2D NMR spectroscopy in the detailed classification of lignins and identification of related biomarkers in their structure using MVA. Along with the demonstrated advantages of the proposed approach, it is necessary to note its natural limitations revealed in our study. They are primarily related to the difficulty of interpretation and assignment of signals in 2D NMR spectra and the presence of overlapping peaks. In our opinion, overcoming this problem is possible by combining 2D NMR and high-resolution mass-spectrometry data, which may be a part of future research. It should also be focused on the introduction of the PLS-based approaches and expansion of the range of studied lignins, including the technical preparations obtained during the industrial processing of biomass.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms241512403/s1.

Author Contributions

Conceptualization, A.V.F. and D.S.K.; methodology, D.S.K. and A.V.F.; formal analysis, A.V.F., N.V.U. and I.A.G.; investigation, A.V.F., N.V.U. and I.A.G.; writing—original draft preparation, A.V.F. and I.A.G.; writing—review and editing, D.S.K.; visualization, I.A.G.; supervision, D.S.K.; funding acquisition, D.S.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Russian Science Foundation, grant number 21-73-20275.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available in the article and Supplementary Materials.

Acknowledgments

This study was performed using an instrumentation of the Core Facility Center “Arktika” of the Lomonosov Northern (Arctic) Federal University. Russian Science Foundation, grant number 21-73-20275, funded this research. The authors thank M. Hedenström for providing the MATLAB script and W. Lee for providing the POKY software.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Shakeel, U.; Li, X.; Wang, B.; Geng, F.; Rehman, M.S.U.; Zhang, K.; Xu, J. Structural characterizations of lignins extracted under same severity using different acids. Int. J. Biol. Macromol. 2022, 194, 204–212. [Google Scholar] [CrossRef]
  2. Rencoret, J.; Gutiérrez, A.; Marques, G.; del Río, J.C.; Tobimatsu, Y.; Lam, P.Y.; Pérez-Boada, M.; Ruiz-Dueñas, F.J.; Barrasa, J.M.; Martínez, A.T. New Insights on Structures Forming the Lignin-Like Fractions of Ancestral Plants. Front. Plant Sci. 2021, 12, 740923. [Google Scholar] [CrossRef]
  3. Wannid, P.; Hararak, B.; Padee, S.; Klinsukhon, W.; Suwannamek, N.; Raita, M.; Khao-on, K.; Prahsarn, C. Structural and Thermal Characteristics of Novel Organosolv Lignins Extracted from Thai Biomass Residues: A Guide for Processing. J. Polym. Environ. 2022, 30, 2739–2750. [Google Scholar] [CrossRef]
  4. He, M.-K.; He, Y.-L.; Li, Z.-Q.; Zhao, L.-N.; Zhang, S.-Q.; Liu, H.-M.; Qin, Z. Structural characterization of lignin and lignin-carbohydrate complex (LCC) of sesame hull. Int. J. Biol. Macromol. 2022, 209 Pt A, 258–267. [Google Scholar] [CrossRef]
  5. Faleva, A.V.; Pikovskoi, I.I.; Pokryshkin, S.A.; Chukhchin, D.G.; Kosyakov, D.S. Features of the Chemical Composition and Structure of Birch Phloem Dioxane Lignin: A Comprehensive Study. Polymers 2022, 14, 964. [Google Scholar] [CrossRef]
  6. Hasanov, I.; Shanmugam, S.; Kikas, T. Extraction and isolation of lignin from ash tree (Fraxinus exselsior) with protic ionic liquids (PILs). Chemosphere 2022, 290, 133297. [Google Scholar] [CrossRef]
  7. Feng, C.; Zhu, J.; Cao, L.; Yan, L.; Qin, C.; Liang, C.; Yao, S. Acidolysis mechanism of lignin from bagasse during p-toluenesulfonic acid treatment. Ind. Crops Prod. 2022, 176, 114374. [Google Scholar] [CrossRef]
  8. Chen, L.; Liang, Z.; Zhang, X.; Zhang, L.; Wang, S.; Chen, C.; Zeng, L.; Min, D. A facile and novel lignin isolation procedure—Methanolic hydrochloric acid treatment at ambient temperature. Int. J. Biol. Macromol. 2022, 222, 1423–1432. [Google Scholar] [CrossRef] [PubMed]
  9. Belesov, A.V.; Ladesov, A.V.; Pikovskoi, I.I.; Faleva, A.V.; Kosyakov, D.S. Characterization of Ionic Liquid Lignins Isolated from Spruce Wood with 1-Butyl-3-methylimidazolium Acetate and Methyl Sulfate and Their Binary Mixtures with DMSO. Molecules 2020, 25, 2479. [Google Scholar] [CrossRef] [PubMed]
  10. Rencoret, J.; Rosado, M.J.; Kim, H.; Timokhin, V.I.; Gutiérrez, A.; Bausch, F.; Rosenau, T.; Potthast, A.; Ralph, J.; del Río, J.C. Flavonoids naringenin chalcone, naringenin, dihydrotricin, and tricin are lignin monomers in papyrus. Plant Physiol. 2022, 188, 208–219. [Google Scholar] [CrossRef]
  11. Lupoi, J.S.; Singh, S.; Parthasarathi, R.; Simmons, B.A.; Henry, R.J. Recent innovations in analytical methods for the qualitative and quantitative assessment of lignin. Renew. Sustain. Energy Rev. 2015, 49, 871–906. [Google Scholar] [CrossRef] [Green Version]
  12. Kosyakov, D.S.; Pikovskoi, I.I.; Ul’yanovskii, N.V. Dopant-assisted atmospheric pressure photoionization Orbitrap mass spectrometry—An approach to molecular characterization of lignin oligomers. Anal. Chim. Acta 2021, 1179, 338836. [Google Scholar] [CrossRef] [PubMed]
  13. Ralph, J.; Landucci, L.L. NMR of Lignins. In Lignin and Lignans: Advances in Chemistry, 2nd ed.; Heitner, C., Dimmel, D.R., Schmidt, J.A., Eds.; CRC Press (Taylor & Francis Group): Boca Raton, FL, USA, 2010; pp. 137–234. [Google Scholar]
  14. Worley, B.; Powers, R. Multivariate Analysis in Metabolomics. Curr. Metabolomics 2012, 1, 92–107. [Google Scholar] [CrossRef]
  15. Boeriu, C.G.; Bravo, D.; Gosselink, R.J.A.; van Dam, J.E.G. Characterisation of structure-dependent functional properties of lignin with infrared spectroscopy. Ind. Crops Prod. 2004, 20, 205–218. [Google Scholar] [CrossRef]
  16. Sammons, R.J.; Harper, D.P.; Labbé, N.; Bozell, J.J.; Elder, T.; Rials, T.G. Characterization of organosolv lignins using thermal and FT-IR spectroscopic analysis. BioResources 2013, 8, 2752–2767. [Google Scholar] [CrossRef]
  17. Lancefield, C.S.; Constant, S.; de Peinder, P.; Bruijnincx, P.C.A. Linkage Abundance and Molecular Weight Characteristics of Technical Lignins by Attenuated Total Reflection-FTIR Spectroscopy Combined with Multivariate Analysis. ChemSusChem 2019, 12, 1139–1146. [Google Scholar] [CrossRef] [Green Version]
  18. Masoum, S.; Bouveresse, D.J.-R.; Vercauteren, J.; Jalali-Heravi, M.; Rutledge, D.N. Discrimination of wines based on 2D NMR spectra using learning vector quantization neural networks and partial least squares discriminant analysis. Anal. Chim. Acta 2006, 558, 144–149. [Google Scholar] [CrossRef]
  19. Hedenström, M.; Wiklund, S.; Sundberg, B.; Edlund, U. Visualization and interpretation of OPLS models based on 2D NMR data. Chemom. Intell. Lab. Syst. 2008, 92, 110–117. [Google Scholar] [CrossRef]
  20. Hedenström, M.; Wiklund-Lindström, S.; Öman, T.; Lu, F.; Gerber, L.; Schatz, P.; Sundberg, B.; Ralph, J. Identification of lignin and polysaccharide modifications in populus wood by chemometric analysis of 2D NMR spectra from dissolved cell walls. Mol. Plant 2009, 2, 933–942. [Google Scholar] [CrossRef]
  21. Pedersen, H.T.; Dyrby, M.; Engelsen, S.B.; Bro, R. Application of Multi-Way Analysis to 2D NMR Data. Annu. Rep. NMR Spectrosc. 2006, 59, 207–233. [Google Scholar] [CrossRef]
  22. Faleva, A.V.; Kozhevnikov, A.Y.; Pokryshkin, S.A.; Belesov, A.V.; Pikovskoi, I.I. Structural characterization of the lignin from Saxifraga (Saxifraga oppositifolia L.) stems. Int. J. Biol. Macromol. 2020, 155, 656–665. [Google Scholar] [CrossRef]
  23. Lan, W.; Lu, F.; Regner, M.; Zhu, Y.; Rencoret, J.; Ralph, S.A.; Zakai, U.I.; Morreel, K.; Boerjan, W.; Ralph, J. Tricin, a flavonoid monomer in monocot lignification. Plant Physiol. 2015, 167, 1284–1295. [Google Scholar] [CrossRef] [PubMed]
  24. Evstigneyev, E.I.; Kalugina, A.V.; Ivanov, A.Y.; Vasilyev, A.V. Contents of α-O-4 and β-O-4 Bonds in Native Lignin and Isolated Lignin Preparations. J. Wood Chem. Technol. 2017, 37, 294–306. [Google Scholar] [CrossRef]
  25. Hamany Djande, C.Y.; Piater, L.A.; Steenkamp, P.A.; Tugizimana, F.; Dubery, I.A. A metabolomics approach and chemometric tools for differentiation of barley cultivars and biomarker discovery. Metabolites 2021, 11, 578. [Google Scholar] [CrossRef] [PubMed]
  26. Pepper, J.M.; Baylis, P.E.T.; Adler, E. The isolation and properties of lignins obtained by the acidolysis of spruce and aspen woods in dioxane-water medium. Can. J. Chem. 1959, 37, 1241–1248. [Google Scholar] [CrossRef]
  27. Lee, W.; Rahimi, M.; Lee, Y.; Chiu, A. POKY: A software suite for multidimensional NMR and 3D structure calculation of biomolecules. Bioinformatics 2021, 37, 3041–3042. [Google Scholar] [CrossRef]
  28. Ralph, S.A.; Ralph, J.; Landucci, L. NMR Database of Lignin and Cell Wall Model Compounds; US Forest Products Laboratory: Madison, WI, USA, 2004; Available online: https://www.glbrc.org/databases_and_software/nmrdatabase/NMR_DataBase_2009_Complete.pdf (accessed on 23 October 2022).
  29. Zhang, L.; Henriksson, G.; Gellerstedt, G. The formation of β-β structures in lignin biosynthesis—Are there two different pathways? Org. Biomol. Chem. 2003, 1, 3621–3624. [Google Scholar] [CrossRef]
  30. Faleva, A.V.; Kozhevnikov, A.Y.; Pokryshkin, S.A.; Falev, D.I.; Shestakov, S.L.; Popova, J.A. Structural characteristics of different softwood lignins according to 1D and 2D NMR spectroscopy. J. Wood Chem. Technol. 2020, 40, 178–189. [Google Scholar] [CrossRef]
  31. Pikovskoi, I.I.; Kosyakov, D.S.; Faleva, A.V.; Shavrina, I.S.; Kozhevnikov, A.Y.; Ul’yanovskii, N.V. Study of the sedge (Cárex) lignin by high-resolution mass spectrometry and NMR spectroscopy. Russ. Chem. Bull. 2020, 69, 2004–2012. [Google Scholar] [CrossRef]
  32. Karmanov, A.P.; Kocheva, L.S.; Belyy, V.A. Topological structure and antioxidant properties of macromolecules of lignin of hogweed Heracleum sosnowskyi Manden. Polymer 2020, 202, 122756. [Google Scholar] [CrossRef]
Figure 1. Score plots of PC2 (a), PC3 (b), and PC4 (c) versus PC1 as a result of PCA analysis of 1H-13C HSQC-NMR spectra of the studied lignin preparations: softwood lignin (green circles); hardwood lignin (blue circles); herbs lignin (yellow circles).
Figure 1. Score plots of PC2 (a), PC3 (b), and PC4 (c) versus PC1 as a result of PCA analysis of 1H-13C HSQC-NMR spectra of the studied lignin preparations: softwood lignin (green circles); hardwood lignin (blue circles); herbs lignin (yellow circles).
Ijms 24 12403 g001
Figure 2. Score plots of PC2 (a) and PC7 (b) versus PC1 as a result of PCA analysis of 1H-13C HSQC-NMR spectra of the coniferous lignin preparations: juniper lignin (light blue circles); larch lignin (yellow circles); pine lignin (purple circles); spruce lignin (orange circles).
Figure 2. Score plots of PC2 (a) and PC7 (b) versus PC1 as a result of PCA analysis of 1H-13C HSQC-NMR spectra of the coniferous lignin preparations: juniper lignin (light blue circles); larch lignin (yellow circles); pine lignin (purple circles); spruce lignin (orange circles).
Ijms 24 12403 g002
Figure 3. Block diagram of key substructures obtained as a result of the analysis of PCA-based HSQC NMR loading spectra of lignins of various biological origins.
Figure 3. Block diagram of key substructures obtained as a result of the analysis of PCA-based HSQC NMR loading spectra of lignins of various biological origins.
Ijms 24 12403 g003
Figure 4. Dendrogram obtained by hierarchical cluster analysis of 2D HSQC NMR spectra of the studied lignins of various biological origins: hardwood lignin (red lines); herbs lignin (green lines); softwood lignin (light blue lines); lignin containing flavonoids in its structure (purple lines).
Figure 4. Dendrogram obtained by hierarchical cluster analysis of 2D HSQC NMR spectra of the studied lignins of various biological origins: hardwood lignin (red lines); herbs lignin (green lines); softwood lignin (light blue lines); lignin containing flavonoids in its structure (purple lines).
Ijms 24 12403 g004
Figure 5. Overview of the procedure for multivariate analysis of 2D NMR data: (1) Each spectrum was processed by the POKY software in order to change the phase of negative cross-peaks to positive. (2) Each pre-processing spectrum is converted to a row vector and placed in a new data matrix X described in [19]. (3) Scores and loadings resulting from multivariate analysis of matrix X are performed. (4) Data from scores plot are used to hierarchical cluster analysis. (5) The loadings, initially represented as line plots, are converted to 2D loading spectra by reversing the unfolding procedure described in (2).
Figure 5. Overview of the procedure for multivariate analysis of 2D NMR data: (1) Each spectrum was processed by the POKY software in order to change the phase of negative cross-peaks to positive. (2) Each pre-processing spectrum is converted to a row vector and placed in a new data matrix X described in [19]. (3) Scores and loadings resulting from multivariate analysis of matrix X are performed. (4) Data from scores plot are used to hierarchical cluster analysis. (5) The loadings, initially represented as line plots, are converted to 2D loading spectra by reversing the unfolding procedure described in (2).
Ijms 24 12403 g005
Table 1. Lignin preparations and their main characteristics.
Table 1. Lignin preparations and their main characteristics.
PlantFamilyYield,
%
Elemental
Composition, %
Molecular Weight,
g mol−1
S/G/H Content, %Main Substructures *,
(Per 100 Aromatic Units)
COHMnMwSGHABCD
Softwood
Spruce (Picea abies)Pinaceae1567.026.66.4300084000.699.00.313.58.34.12.1
1162.630.17.3180046001.099.00.011.37.84.21.7
1261.930.67.5230052000.898.90.312.27.94.01.9
Cedar pine (Pinus cembra)1064.229.06.898041000.095.44.616.39.95.71.5
863.828.47.895036001.395.92.816.78.23.42.6
1063.628.77.7115043001.495.33.216.08.53.42.9
Larch (Larix sibirica)1061.232.06.895038000.898.80.420.88.53.43.1
963.829.27.0130041001.398.70.014.87.44.30.9
661.731.17.2175046002.193.64.319.89.33.23.3
Juniper (Juníperus commúnis)Cupressaceae863.030.07.0230058001.197.31.613.47.93.61.3
764.028.57.595042000.896.52.717.48.73.62.0
563.428.97.8160048001.496.12.519.69.33.83.3
Hardwood
Apricot (Prúnus armeníaca)Rosaceae1057.235.57.32100680079.317.92.830.31.710.46.0
Plum (Prúnus doméstica)1055.037.97.11900720080.316.73.036.01.48.40.0
Bird cherry (Prúnus pádus)1156.835.87.41350370066.033.80.234.62.89.823.4
Rowan (Sórbus aucupária)1055.737.07.3740550081.318.70.029.30.97.86.6
Birch (Bétula pubéscens)Betulaceae1559.933.26.91700440073.726.50.040.02.19.37.3
Karelian birch (Betula pendula var. Carelica)1057.934.27.9730350068.131.60.338.02.79.15.3
Lilac (Syringa vulgaris)Oleaceae759.732.67.71900480064.235.70.232.33.39.05.0
Willow (Sálix babylónica)Salicaceae1059.732.67.71020570063.534.22.332.94.29.65.6
Aspen (Pópulus trémula)1557.632.08.6680270067.327.15.624.92.18.115.9
Poplar (Populus alba)1559.533.07.5970340046.832.520.829.73.13.93.4
Monocots
Cattail (Týpha latifólia)Typhaceae558.233.97.9270010,05040.255.34.537.75.86.05.4
Sedge (Cárex heleonastes)Cyperaceae559.232.46.7300200041.442.815.924.63.32.40.0
Wheat (Tríticum)Poaceae459.530.99.6780300041.750.47.929.53.92.44.2
Wheatgrass (Elytrígia répens)758.932.46.8300205029.856.713.525.14.82.00.0
Dicotyledons
Hogweed Sosnowskyi (Heracleum)Apiaceae660.28.531.31800350055.943.30.843.14.811.36.2
Willowherb (Epilóbium)Onagraceae459.433.57.11430289037.337.025.823.72.60.00.0
Nettle (Urtíca dióica)Urticaceae460.337.52.21400170034.555.310.231.97.07.23.7
Saxifraga (Saxifrága oppositifólia)Saxifragaceae0.460.532.17.41500250014.048.637.522.53.80.00.0
Thistles (Cárduus)Asteraceae4 58.730.39.2730353058.137.94.039.64.011.52.9
Burdock (Árctium)457.731.68.4950280050.748.40.939.74.28.53.4
False flax (Camēlina)Brassicaceae0.459.029.09.2780473050.246.93.035.35.611.62.7
Orache (Atriplex)Amaranthaceae257.332.68.5390295063.036.20.838.93.515.13.3
* A—β-aryl ether, B—phenylcoumarane, C—resinol, D—1,3-dioxane. Mn—molecular masses number-average, Mw—molecular masses weight-average.
Table 2. Substructures responsible for the main differences between lignins along PC1-PC4 axes (“+” and “−” denote positive and negative correlations, respectively).
Table 2. Substructures responsible for the main differences between lignins along PC1-PC4 axes (“+” and “−” denote positive and negative correlations, respectively).
SubstructureLabelPC1PC2PC3PC4
Main substructures
Syringyl PPUS+−/+
Guaiacyl PPUGn/d *
p-Hydroxyphenyl PPUHn/d+n/d
p-hydroxybenzoatepB+n/dn/d+
β-aryl ether (S)A+n/d
β-aryl ether (G)n/d
β-aryl ether (H)+n/dn/d
PhenylcoumaraneB+
SecoisolariciresinolScn/dn/dn/d
DibenzodioxocinDn/dn/dn/d
ResinolC+
Dihydroconiferyl alcoholDCA
CinnamaldehydeJ
Ferulic acidFan/d++n/d
p-coumaric acidpCAn/d++n/d
BalanopholinBFn/d++
Substructures formed during isolation
Methyl substituted phenylcoumaronePn/d+
3,4-divanylyltetrahydrofuranDin/dn/dn/d
1,3-dioxane structure1,3D +
Hibbert ketoneHkn/d
α-hydroxypropiovanillone n/dn/d+
AcetovanilloneAVn/d+
VanillinVn/d+
Other
SugarsSugars+++
Fatty acids (+Acetate)Fatty acids (Ac)++− (+)
ArabinofuranoseArn/d++n/d
TricinTn/d+++
* Not detected (n/d).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Faleva, A.V.; Grishanovich, I.A.; Ul’yanovskii, N.V.; Kosyakov, D.S. Application of 2D NMR Spectroscopy in Combination with Chemometric Tools for Classification of Natural Lignins. Int. J. Mol. Sci. 2023, 24, 12403. https://doi.org/10.3390/ijms241512403

AMA Style

Faleva AV, Grishanovich IA, Ul’yanovskii NV, Kosyakov DS. Application of 2D NMR Spectroscopy in Combination with Chemometric Tools for Classification of Natural Lignins. International Journal of Molecular Sciences. 2023; 24(15):12403. https://doi.org/10.3390/ijms241512403

Chicago/Turabian Style

Faleva, Anna V., Ilya A. Grishanovich, Nikolay V. Ul’yanovskii, and Dmitry S. Kosyakov. 2023. "Application of 2D NMR Spectroscopy in Combination with Chemometric Tools for Classification of Natural Lignins" International Journal of Molecular Sciences 24, no. 15: 12403. https://doi.org/10.3390/ijms241512403

APA Style

Faleva, A. V., Grishanovich, I. A., Ul’yanovskii, N. V., & Kosyakov, D. S. (2023). Application of 2D NMR Spectroscopy in Combination with Chemometric Tools for Classification of Natural Lignins. International Journal of Molecular Sciences, 24(15), 12403. https://doi.org/10.3390/ijms241512403

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop