Next Article in Journal
HOX-Gene Cluster Organization and Genome Duplications in Fishes and Mammals: Transcript Variant Distribution along the Anterior–Posterior Axis
Next Article in Special Issue
Walnut Shell Biowaste Valorization via HTC Process for the Removal of Some Emerging Pharmaceutical Pollutants from Aqueous Solutions
Previous Article in Journal
Cane Vinasses Contain Bioactive Concentrations of Auxin and Abscisic Acid in Their Composition
Previous Article in Special Issue
Proof of Concept for the Detection with Custom Printed Electrodes of Enterobactin as a Marker of Escherichia coli
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Development of Honey Recognition Models Based on the Association between ATR-IR Spectroscopy and Advanced Statistical Tools

by
Maria David
,
Ariana Raluca Hategan
,
Camelia Berghian-Grosan
and
Dana Alina Magdas
*
National Institute for Research and Development of Isotopic and Molecular Technologies, 67-103 Donat Street, 400293 Cluj-Napoca, Romania
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2022, 23(17), 9977; https://doi.org/10.3390/ijms23179977
Submission received: 3 August 2022 / Revised: 24 August 2022 / Accepted: 29 August 2022 / Published: 1 September 2022

Abstract

:
The newly developed prediction models, having the aim to classify Romanian honey samples by associating ATR-FTIR spectral data and the statistical method, PLS-DA, led to reliable differentiations among the samples, in terms of botanical and geographical origin and harvesting year. Based on this approach, 105 out of 109 honey samples were correctly attributed, leading to true positive rates of 95% and 97% accuracy for the harvesting differentiation model. For the botanical origin classification, 83% of the investigated samples were correctly predicted, when four honey varieties were simultaneously discriminated. The geographical assessment was achieved in a percentage of 91% for the Transylvanian samples and 85% of those produced in other regions, with overall accuracy of 88% in the cross-validation procedure. The signals, based on which the best classification models were achieved, allowed the identification of the most significant compounds for each performed discrimination.

Graphical Abstract

1. Introduction

Honey is a natural sweet substance made by Apis mellifera bees, either from the nectar of plants, from the secretion of plants or from the excretion of insects. It has been consumed since ancient times and it is known to have high nutritional value. Honey mainly consists of carbohydrates such as fructose, glucose and sucrose, whose concentrations depend on the botanical origin. Non-volatile compounds such as sugars [1,2], amino acids [3,4], proteins, minerals [1] and phenolic compounds [5] contribute to the taste and color of the honey, while the volatile compounds confer its aroma [6].
Since honey supply is lower than the demand, it is susceptible to adulteration. Monofloral honey, being the most appreciated, is the main target of adulteration by mixing multiple types of multifloral honey. The quality, aroma and physical properties of honey within the same floral source vary due to seasonal climatic variations or the geographical location of the apiary [3]. Therefore, an important issue has been to identify pure honey and verify its authenticity, by developing analytical approaches to permit the verification of the quality specification.
The composition of honey changes over time, and the most important modifications are related to its concentrations of dextrose, levulose, maltose, sucrose, higher sugars, diastase and total acidity, which lead to darkening and loss of aroma and flavor [7]. Furthermore, during storage, the Maillard reaction and/or caramelization can occur, when the concentration of monosaccharides decreases and the levels of organic acids, 5-hydroxymehyl-2-furaldehyde (HMF) and furosine increase [8,9].
The most popular method to authenticate the botanical origin of honey is melissopalynological analysis based on microscopic identification of the pollen type [10]. However, this method is time-consuming and implies many processing steps. Therefore, there is an ongoing need to develop faster, practical methods to determine the botanical or geographical origin of the honey. In recent years, the Fourier transform infrared spectroscopy (FT-IR) has been frequently used in the food industry for the detection and quantification of adulteration [11,12], because of its fast response time and wide spectral range of analysis. Moreover, if the spectrometer is equipped with an attenuated total reflectance (ATR) sample technique, it is nondestructive and does not require any sample manipulation [6,13].
Near-infrared and mid-infrared spectroscopy in corroboration with chemometric methods were extended, developed and successfully used for the quality control and botanical origin assessment of honey samples [14,15,16,17]. Several methods have been developed using partial least squares discriminant analysis (PLS-DA) for the detection of adulterated honey, by determining the sugar syrup content [18,19] or for sugar quantification in honey, such as glucose, fructose, sucrose and maltose [10,12]. Good results in differentiating the botanical origin of 11 honey types using linear discriminant analysis (LDA), having a correct classification rate ranging from 70 to 100%, were reported in the literature by Ruoff and co-workers [10]. Moreover, a good botanical classification of honey samples from Turkey by using principal component analysis (PCA) and hierarchical cluster analysis (HCA) was obtained by Gok and co-workers [18]. The near- and mid-infrared spectra of raw honey samples were also employed for the classification of honey samples with different botanical origins, leading to accuracies greater than 96% when PLS-DA and PCA were applied [19].
For the geographical authentication, Formosa et al. used spectral transformations, variable selection and PLS-DA to successfully classify the provenance of 21 local and 49 non-local honey samples [20]. Other reported studies present the use of chemometric tools (PLS, PCA, LDA) in NIR spectroscopic studies to differentiate honey according to its geographical origins [21,22]. All these studies employed the use of spectral transformations prior to multivariate analysis.
This study proposes a new approach for the development of prediction models to classify a honey set formed by 109 Romanian samples belonging to four botanical origins that were produced in two consecutive harvesting years. The association between ATR-FTIR spectral data and a supervised statistical method, PLS-DA, led to reliable differentiations among the samples in terms of the year of harvest, botanical origin and geographical provenance. The novelty of the present work is highlighted by the construction of a highly precise discrimination model capable of distinguishing honey belonging to 2020 production from that harvested in 2021. According to our knowledge, to date, there are no reported data related to the possibility of easily differentiating the harvesting year of honey based on the association of IR spectroscopy and advanced statistical models. The practical importance of this classification is related to the detection of the possible unfair practices related to the re-utilization of unsold honey by labeling it as recently harvested. In order to maximize the prediction rate of our developed differentiation models, special emphasis was given to the applied data preprocessing step, a fact that also proved to be a very effective step in the models’ prediction performance.

2. Results

All honey spectra were recorded in the spectral range 550–4000 cm−1. For a clear understanding of the samples’ characteristics, Figure 1 contains the FTIR spectra of acacia, colza, honeydew and linden honey species (Figure 1a) and acacia samples collected in the 2020 and 2021 years, respectively (Figure 1b).
As has been demonstrated over the years in the literature [18,23,24,25,26,27], six representative regions can be emphasized in the FTIR spectra of honey (Figure 1a). Most of these bands are associated with carbohydrates and O-H stretching vibrations (both from water and carbohydrates), but the overlapping with the peaks originating from some minor compounds such as amino acids, proteins or various organic acids can be also considered in some spectral regions (i.e., spectral range from 1170 to 1540 cm−1, 1550 to 1750 cm−1 or 2800 to 3000 cm−1) [13,26,27,28,29].
Moreover, visible differences are observed in almost all IR regions of the investigated honey varieties (Figure 1a) and especially in three domains, namely 950 to 1170 cm−1, 1170 to 1540 cm−1 and 3000 to 3670 cm−1, when studying the harvesting year influences (Figure 1b). Because the honey differentiations, according to some predefined criteria, such as harvesting year and botanical and geographical origin, are mainly realized based on the minor constituents of the honey—both organic and inorganic compounds—for the classification models’ development, only the fingerprint region between 550 and 1775 cm−1 was further considered, in order to reduce the dimensionality of the data set.

3. Discussion

For the development of the most effective recognition models, the obtained experimental spectra were preprocessed using autoscale. Then, a feature selection procedure that aimed to reduce the number of input variables only to those that had the highest discrimination potential was applied. The final differentiation models were constructed by applying the PLS-DA supervised method on the input space corresponding to only those previously identified meaningful markers. A comparison of the models’ performance showed that the best recognition models were obtained when a preprocessing step was performed and when only the relevant markers were used for the development of the models.
The previously reported studies emphasized the potential of IR spectroscopy in the differentiation of the botanical and geographical origins of honey [19,20,21], but according to our knowledge, no studies related to the differentiation of honey harvesting years have been performed.

3.1. Harvesting Year Differentiation

For the development of the model for harvesting year differentiation, a total of 109 honey samples, 60 from 2020 and 49 from 2021, were considered. The best obtained recognition model was constructed by taking into consideration 63 markers (i.e., spectral points), as resulted from the model-based feature selection method (Figure 2). Six latent variables were used to construct the differentiation model, the selection of which was made with the aim of minimizing the cross-validation error average (Figure 3). Through this classifier, 105 out of 109 honey samples were assigned to the correct class, leading to true positive rates of 95% and 97% for the groups of 2020 and 2021, respectively. The scores, ROC and RMSECV plots associated with the prediction model are presented in Figure 4 and Figure 5.
The markers based on which the best classification was achieved were situated in the following spectral ranges: 550–700, 1050–1100 and a band around 1419 cm−1 (Figure 2). An important observation pertained to the fact that the number corresponding to these relevant markers was considerably lower (i.e., 63) as compared to the initial dimensionality of the entire spectra (i.e., 1246).
An important spectral range that allowed the present classification is represented by the region between 550 and 700 cm−1. This region is known as the “crystalline region” and contains the exocyclic deformations (CCO) [24], or it can be associated with the M-O vibrations [30], being able to offer information about the presence of some metals that are directly related to environmental conditions (i.e., soils, pollution issues).
Another spectral region that contained markers for the harvesting year differentiation is the one situated between 1050 and 1100 cm−1. This region is specific for the C-O stretching vibration in carbohydrates such as sucrose, glucose and fructose [31]. This range also indicates the presence of C-C stretching modes and O-H vibrations, characteristic of organic acids, carotenes and polyphenols [12,32]. Monosaccharide concentrations can be considered as an important feature for the storage time of honey, since their concentration decreases with time due to spontaneous chemical degradation, such as the non-enzymatic Maillard reaction [9].
A meaningful marker for the present classification was represented by the band around 1419 cm−1, attributed to a combination of O-H bending of the C-O-H group and C-H bending vibrations of alkenes [12]. The structure of some organic acids contains -CH=CH- bonds, which can contribute to this band, e.g., in the case of fumaric acid (COOH-CH=CHCOOH. The presence of fumaric acid is an indicator of the deterioration of honey because of storage and aging [7,12].

3.2. Botanical Differentiation

The botanical classification model was constructed based on the entire set of honey samples that corresponded to the four floral types: acacia (41 samples), colza (18), honeydew (20) and linden (30). The most accurate obtained PLS-DA classifier was built on 146 features that were selected as significant markers for the botanical differentiation of the honey samples (Figure 6). Eighteen components proved to be the optimal selection for the number of latent variables (Figure 7), leading to 83% accuracy in the cross-validation evaluation procedure. Therefore, 91 samples were correctly predicted (34 acacia, 14 colza, 11 honeydew, 24 linden samples) according to their botanical class. The score plot associated with the botanical differentiation models is illustrated in Figure 8. Moreover, the RMSECV and ROC plots are presented in Figure 9a,b.
The markers based on which the best classification was achieved were situated in the spectral ranges between 550 and 1000, and 1550 and 1700 cm−1.
The spectral area 550–700 cm−1 can be also associated with the skeletal vibrations of carbohydrates, being known as the “crystalline region”, containing the exocyclic deformations (CCO) [24]. However, few influences for botanical discrimination can be also identified in the region below 700 cm−1 (Figure 6).
As has been illustrated in Figure 1a, the spectral area from 700 to 900 cm−1 is specific to the anomeric forms of saccharides, being called the fingerprint or “anomeric region”. By analyzing the bands from this region, it was possible to identify the α and β anomers in monosaccharides or higher saccharides [24]. The peaks from 950 to 1000 cm−1 correspond mainly to the C-O stretching of carbohydrates, and allow differentiation according to the presence of glucose, fructose and sucrose in the honey samples. These two regions have the most significant contribution to the botanical discrimination of samples, being in accordance with the fact that various types of honey contain different amounts of glucose and fructose, respectively. This region can also indicate the presence of C-O stretching modes characteristic of polyphenols [32], which is an indicator of floral origin, as flower honey is characterized by high concentration values of glucose and fructose and low polyphenol content, whereas honeydew honey has a lower concentration of glucose and fructose and higher polyphenol content compared with other varieties of honey [33].
The spectral region around 1530–1730 cm−1 represents an overlapped signal involving the O-H deformation vibration of water and the C=O stretching from carbohydrates, proteins, amino acids or organic acids [18,27,29,30]. The region of 1600–1700 cm−1 was also involved in the botanical differentiation in previously reported studies [18,26], and its efficiency can be explained through the presence of proteins and some interactions, i.e., water–carbohydrates and water–proteins, which are directly related to the floral origin. However, in this range, the water content has to be considered as having a significant influence on the IR spectra.
All these identified meaningful spectral regions from the current work were previously used for developing models in order to differentiate the botanical origins of honey samples [10,12,18,19,26].

3.3. Geographical Discrimination

The PLS-DA models designed for geographical differentiation were based on 54 samples: 34 from Transylvania and 20 honey samples from other regions. Once again, the recognition model that proved to have the highest discrimination capability corresponded to a reduced input space—namely, only 28 spectral points were taken into account (Figure 10). Regarding the selection for the number of latent variables, seven components were chosen for the construction of the PLS-DA model (Figure 11). Here, 88% accuracy was achieved after conducting the cross-validation procedure, leading to the correct prediction of 91% of the Transylvanian samples and 85% of the honey samples produced in the other regions. Satisfactory separation of the samples in terms of the geographical origin is reflected through the score plot associated with the first three latent variables (Figure 12). Moreover, the RMSECV and ROC plots associated with the developed classification model are presented in Figure 13a,b.
The markers based on which the best classification was achieved are illustrated in Figure 10, being mainly situated in the region 550–700 cm−1, and, besides these, a few narrow bands around 710, 810, 870 and 1180 cm−1 also proved to have discrimination potential. The spectral region 500–700 cm−1 is characteristic of the skeletal vibrations of carbohydrates, but these markers can be also indicative of the soil composition [34] and can reveal the differences in the mineral content of the soil. Thus, the peaks found in this area correspond to M-O vibrations and could give information about the presence of some metals, such as aluminum, copper, lead, iron, tin [30], manganese [35], lithium [36] or strontium [37], which could be seen as possible markers of geographical origin. Manganese, lithium and strontium elements have been identified as elemental markers in various food commodities from the Transylvania area, such as milk [38], carrots [39] and cheese [40]. These elements proved to be powerful discriminators of Transylvanian products as compared to those that originated from other geographical regions.
The signals from the region between 810 and 890 cm−1 are characteristic of the anomeric vibrations of carbohydrates or could be due to the C-H out-of-plane deformation vibrations [28], being important for honey analysis, while the band at 1180 cm−1 could be associated with C-O stretching vibrations from phenols [30].

4. Materials and Methods

4.1. Sample Description

For this study, 109 authentic honey samples collected during the 2020 and 2021 harvest, directly from producers located in the most important melliferous regions of Romania, were analyzed and the obtained analytical data were further processed.
The honey floral origin, year of harvest and geographical distribution are presented in detail in Table 1. With respect to the geographical origin, it was necessary to take into account the apiculture practices in Romania. Because, for some flowers (e.g., acacia), the blooming period is very short and differs from one Romanian region to another, moving the hives in order to increase honey production is a common practice. This habit leads to the imprecise geographical origin of some honey samples. For this reason, only the samples having an exact region of collection (i.e., Transylvania) were explicitly considered to belong to a precise area.
All collected honey samples were stored in the dark, in screw-cap jars, at room temperature (18–25 °C) until the spectral acquisition. Prior to the spectral collection, the crystalized honey samples were heated overnight (37 °C) and manually stirred to ensure their homogeneity.

4.2. Vibrational Spectroscopy Analysis and ATR-FTIR Spectroscopy

The infrared spectra of the honey samples were recorded with a Jasco 4100 FTIR spectrometer in attenuated total reflectance (ATR) mode with a ZnSe crystal. The detection system consisted of a DTGS detector, having a spectral resolution of 4 cm−1. At room temperature, samples were placed on the crystal. Each spectrum was recorded in the spectral range 550–4000 cm−1, by averaging 16 scans. The air spectrum was taken as background, and between measurements, the ATR crystal was carefully cleaned with acetone.

4.3. Statistical Treatments

For the development of the honey classification models, a supervised statistical method was applied, namely partial least squares discriminant analysis (PLS-DA). PLS-DA represents a classification methodology derived from PLS regression and it involves building a regression model between the data matrix containing the input variables (i.e., measurements) and a dummy matrix that illustrates the adherence of each sample from the data set to a predefined class. Therefore, the goal is the identification of the latent variables (i.e., PLS components) that both capture variance and attain correlation [41,42]. The evaluation of the developed discrimination models was performed by means of the venetian blinds cross-validation technique, having the number of data splits set to ten. Therefore, the performance metrics illustrating the classification ability of the honey differentiation are the results of an unbiased evaluation methodology. In the framework of the present study, the number of latent variables to keep for the construction of the PLS-DA models was chosen with the aim of minimizing the cross-validation classification error average.
The identification of the features having the highest classification power was achieved by applying a feature selection tool that relies on a comparison of the root-mean-square error of cross-validation values (RMSECV) of PLS models built on different groups of variables. Firstly, from the entire set of attributes, the ones having the lowest Variable Importance in Projection (VIP) and selectivity ratio (SR) values were eliminated from the input space; furthermore, the same procedure was followed until the performance of the PLS-DA model did not continue to improve [43].
The pretreatment technique that was used in the present work was autoscale. Through autoscale, the original matrix corresponding to the experimental data set is transformed so that each column has an average of zero and a standard deviation of one.
All of the previously mentioned data processing approaches (i.e., the application of preprocessing methods, the construction of the PLS-DA models, the selection of the most relevant variables for the development of the honey classification models) were conducted under the SOLO 8.9.1 (2021) software (Eigenvector Research, Manson, WA, USA).

5. Conclusions

This study proposes a new approach for the development of prediction models aiming to classify Romanian honey according to its harvesting year, botanical origin and geographical provenance by associating ATR-FTIR spectral data with analytical methods (i.e., PLS-DA). By applying, as a preprocessing step, the autoscale method, it was observed that the models gave a better classification. Dimensionality reduction through the application of the PLS-DA supervised method led, in most cases, to a drastic improvement in the classification rate. Thus, for the model developed for the harvest recognition, 105 out of 109 honey samples were correctly attributed, leading to true positive rates of 95% and 97% for the harvesting years 2020 and 2021, respectively. The markers based on which the best classification was achieved revealed that monosaccharaides and organic acids (e.g., fumaric acid) are important characteristics during the storage of honey.
The developed model for botanical origin classification presented a correct classification rate of 84% when four honey varieties were simultaneously classified. The markers selected for this classification can be explained by the differences in polyphenol concentration, water content and the interaction with the carbohydrates and proteins, whose concentrations depend on the floral origin.
The developed model for the geographical provenance allowed the correct classification of 91% of the Transylvania samples and 85% of those produced in other regions. Thus, the overall accuracy of the model developed for geographical origin recognition was 88% in the cross-validation evaluation technique.
The employment in the model construction of an extended honey sample set having a more balanced distribution could lead to the development of more robust recognition models. This subject represents a future work direction.

Author Contributions

Conceptualization, M.D., A.R.H., C.B.-G. and D.A.M.; methodology, A.R.H., D.A.M. and C.B.-G.; validation, C.B.-G.; formal analysis, C.B.-G.; data curation, M.D.; writing—original draft preparation, M.D. and A.R.H.; writing—review and editing, C.B.-G. and D.A.M.; supervision, D.A.M.; project administration, D.A.M. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by a grant from the Romanian Ministry of Education and Research, CNCS–UEFISCDI, project number PN-III-P4-ID-PCE-2020-0644 (Contract no. 7PCE/2021), within PNCDI III.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Gomez-Barez, J.A.; Garcia-Villanova, R.J.; Elvira-Garcia, S.; Rivas-Pala, T.; Gonzalez-Paramas, A.M.; Sanchez-Sanchez, J. Geographical discrimination of honeys through the employment of sugar patterns and common chemical quality parameters. Eur. Food Res. Technol. 2000, 210, 437–444. [Google Scholar] [CrossRef]
  2. Serra Bonvehi, J.; Ventura Coll, F. Characterization of Citrus Honey (Citrms spp.) Produced in Spain. J. Agric. Food Chem. 1995, 43, 2053–2057. [Google Scholar] [CrossRef]
  3. Davies, A.M.C. Amino acid analysis of honey from eleven countries. J. Apic. Res. 1975, 14, 29–39. [Google Scholar] [CrossRef]
  4. Amiot, M.J.; Aubert, S.; Gonnet, M.; Tacchini, M. Honey phenolic compounds: A preliminary study on their identification and quantitation by families. Apidologie 1989, 20, 115–125. [Google Scholar] [CrossRef]
  5. Anklam, E. A review of the analytical methods to determine the geographical and botanical origin of honey. Food Chem. 1998, 63, 549–562. [Google Scholar] [CrossRef]
  6. Chin, N.L.; Sowndhararajan, K. A review on Analytical Methods for Honey Classification, Identification and Authentication. In Honey Analysis—New Advances and Challenges; de Alencar Arnaut de Toledo, V., Dechechi Chambó, E., Eds.; IntechOpen: London, UK, 2020. [Google Scholar]
  7. White, J.W.; Riethof, M.L.; Subers, M.H.; Kushnir, I. Composition of American Honeys. US Dep. Agric. 1962, 1261, 1–124. [Google Scholar]
  8. Mashhadi, A.; Bavali, A.; Mokhtari, F. Assay of honey freshness by a novel optical technique. Sci. Rep. 2022, 12, 901. [Google Scholar] [CrossRef]
  9. Castro-Vázquez, L.; Díaz-Maroto, M.C.; González-Viñas, M.A.; de la Fuente, E.; Pérez-Coello, M.S. Influence of Storage Conditions on Chemical Composition and Sensory Properties of Citrus Honey. J. Agric. Food Chem. 2008, 56, 1999–2006. [Google Scholar] [CrossRef]
  10. Ruoff, K.; Iglesias, M.T.; Luginbühl, W.; Bosset, J.O.; Bogdanov, S.; Amadò, R. Quantitative analysis of physical and chemical measurands in honey by mid-infrared spectrometry. Eur. Food Res. Technol. 2006, 223, 22–29. [Google Scholar] [CrossRef]
  11. Rodriguez-Saona, L.E.; Allendorf, M.E. Use of FTIR for rapid authentication and detection of adulteration of food. Annu. Rev. Food Sci. Technol. 2011, 2, 467–483. [Google Scholar] [CrossRef]
  12. Valand, R.; Tanna, S.; Lawson, G.; Bengtström, L. A review of Fourier Transform Infrared (FTIR) spectroscopy used in food adulteration and authenticity investigations. Food. Addit. Contam. Part A 2020, 37, 19–38. [Google Scholar] [CrossRef] [PubMed]
  13. de Almeida-Muradian, L.B.; Stramm, K.M.; Estevinho, L.M. Efficiency of the FT-IR ATR spectrometry for the prediction of the physicochemical characteristics of Meliponasubnitida honey and study of the temperature’s effect on those properties. Int. J. Food Sci. Technol. 2014, 49, 188–195. [Google Scholar] [CrossRef]
  14. Tewari, J.C.; Irudayaraj, J.M.K. Floral Classification of Honey Using Mid-Infrared Spectroscopy and Surface Acoustic Wave Based z-Nose Sensor. J. Agric. Food Chem. 2005, 53, 6955–6966. [Google Scholar] [CrossRef]
  15. Bisutti, V.; Merlanti, R.; Serva, L.; Lucatello, L.; Mirisola, M.; Balzan, S.; Capolongo, F. Multivariate and machine learning approaches for honey botanical origin authentication using near infrared spectroscopy. J. Near Infrared Spectrosc. 2019, 27, 65–74. [Google Scholar] [CrossRef]
  16. Chen, L.; Wang, J.; Ye, Z.; Zhao, J.; Xue, X.; Heyden, Y.V.; Sun, Q. Classification of Chinese honeys according to their floral origin by near infrared spectroscopy. Food Chem. 2012, 135, 338–342. [Google Scholar] [CrossRef]
  17. Riswahyuli, Y.; Rohman, A.; Setyabudi, F.M.C.S.; Raharjo, S. Indonesian wild honey authenticity analysis using attenuated total reflectance-fourier transform infrared (ATR-FTIR) spectroscopy combined with multivariate statistical techniques. Heliyon 2020, 6, e03662. [Google Scholar] [CrossRef]
  18. Gok, S.; Severcan, M.; Goormaghtigh, E.; Kandemir, I.; Severcan, F. Differentiation of Anatolian honey samples from different botanical origins by ATR-FTIR spectroscopy using multivariate analysis. Food Chem. 2015, 170, 234–240. [Google Scholar] [CrossRef]
  19. Gan, Z.; Yang, Y.; Li, J.; Wen, X.; Zhu, M.; Jiang, Y.; Ni, Y. Using sensor and spectral analysis to classify botanical origin and determine adulteration of raw honey. J. Food Eng. 2016, 178, 151–158. [Google Scholar] [CrossRef]
  20. Formosa, J.P.; Frederick, L.; David, M.; Claude, F. Application of ATR-FT-MIR for Tracing the Geographical Origin of Honey Produced in the Maltese Islands. Foods 2020, 9, 710. [Google Scholar] [CrossRef]
  21. Woodcock, T.; Downey, G.; O’Donnell, C.P. Near infrared spectral fingerprinting for confirmation of claimed PDO provenance of honey. Food Chem. 2009, 114, 742–746. [Google Scholar] [CrossRef]
  22. Guyon, F.; Logodin, E.; Magdas, D.A.; Gaillard, L. Potential of FTIR- ATR diamond in discriminating geographical and botanical origins of honeys from France and Romania. Talanta Open 2021, 3, 100022. [Google Scholar] [CrossRef]
  23. Gallardo-Velázquez, T.; Osorio-Revilla, G.; Zuñiga-de Loa, M.; Rivera-Espinoza, Y. Application of FTIR-HATR spectroscopy and multivariate analysis to the quantification of adulterants in Mexican honeys. Food Res. Int. 2009, 42, 313–318. [Google Scholar] [CrossRef]
  24. Mathlouthi, M.; Koenig, J.L. Vibrational spectra of carbohydrates. Adv. Carbohydr. Chem. Biochem. 1986, 44, 7–89. [Google Scholar] [CrossRef] [PubMed]
  25. Tewari, J.; Irudayaraj, J. Quantification of saccharides in multiple floral honeys using Fourier transform infrared microattenuated total reflectance spectroscopy. J. Agric. Food Chem. 2004, 52, 3237–3243. [Google Scholar] [CrossRef] [PubMed]
  26. Pauliuc, D.; Ciursă, P.; Ropciuc, S.; Dranca, F.; Oroian, M. Physicochemical parameters prediction and authentication of different monofloral honeys based on FTIR spectra. J. Food Compos. Anal. 2021, 102, 104021. [Google Scholar] [CrossRef]
  27. Stuart Tipson, R. Infrared Spectroscopy of Carbohydrates: A Review of the Literature; United States Government Printing Office: Washington, DC, USA, 1968. [Google Scholar]
  28. Sivakesava, S.; Irudayaraj, J. Prediction of inverted cane sugar adulteration of honey by Fourier transform infrared spectroscopy. J. Food Sci. 2001, 66, 972–978. [Google Scholar] [CrossRef]
  29. Philip, D. Honey mediated green synthesis of gold nanoparticles. Spectrochim. Acta Part A 2009, 73, 650–653. [Google Scholar] [CrossRef]
  30. Socrates, G. Infrared and Raman Characteristic Group Frequencies: Tables and Charts; John Wiley & Sons: Hoboken, NJ, USA, 2004. [Google Scholar]
  31. Horvatinec, J.; Svecnjak, L. Infrared (FTIR) spectral features of honey bee (Apis mellifera L.) hemolymph. J. Cent. Eur. Agric. 2020, 21, 37–41. [Google Scholar] [CrossRef]
  32. Masek, A.; Chrzescijanska, E.; Kosmalska, A.; Zaborski, M. Characteristics of compounds in hops using cyclic voltammetry, UV–VIS, FTIR and GC–MS analysis. Food Chem. 2014, 156, 353–361. [Google Scholar] [CrossRef]
  33. Sanz, M.L.; Gonzalez, M.; de Lorenzo, C.; Sanz, J.; Martínez-Castro, I. A contribution to the differentiation between nectar honey and honeydew honey. Food Chem. 2005, 91, 313–317. [Google Scholar] [CrossRef]
  34. Chauhan, R.; Kumar, R.; Sharma, V. Soil forensics: A spectroscopic examination of trace evidence. Microchem. J. 2018, 139, 74–84. [Google Scholar] [CrossRef]
  35. Elsa, C.; Arya, R.; Baskar, G. Synthesis of manganese dioxide nanoparticles using co-precipitation method and its antimicrobial activity. Int. J. Mod. Sci. Technol. 2016, 01, 17–22. [Google Scholar]
  36. Wijaya, O.; Hartmann, P.; Younesi, R.; Markovits, I.I.E.; Rinaldi, A.; Janek, J.; Yazami, R. Gamma fluorinated ether as an additive for enhanced oxygen activity in Li–O2 batteries. J. Mater. Chem. A 2015, 37, 19061–19067. [Google Scholar] [CrossRef]
  37. Tabah, B.; Nagvenkar, A.P.; Perkas, N.; Gedanken, A. Solar-Heated Sustainable Biodiesel Production from Waste Cooking Oil Using a Sonochemically Deposited SrO Catalyst on Microporous Activated Carbon. Energy Fuels 2017, 31, 6228–6239. [Google Scholar] [CrossRef]
  38. Magdas, D.A.; Feher, I.; Dehelean, A.; Cristea, G.; Magdas, T.M.; Puscas, R.; Marincaş, O. Isotopic and elemental markers for geographical origin and organically grown carrots discrimination. Food Chem. 2018, 267, 231–239. [Google Scholar] [CrossRef]
  39. Magdas, D.A.; Dehelean, A.; Feher, I.; Cristea, G.; Puscas, R.; Dan, S.D.; Cordea, D.V. Discrimination markers for the geographical and species origin of raw milk within Romania. Int. Dairy J. 2016, 61, 135–141. [Google Scholar] [CrossRef]
  40. Magdas, D.A.; Feher, I.; Cristea, G.; Voica, C.; Tabaran, A.M.; Mihaiu, M.; Cordea, D.V.; Bâlteanu, V.A.; Dan, S.D. Geographical origin and species differentiation of Transylvanian cheese. Comparative study of isotopic and elemental profiling vs. DNA results. Food Chem. 2019, 277, 307–313. [Google Scholar] [CrossRef]
  41. Wise, B.M.; Shaver, J.M.; Gallagher, N.B.; Windig, W.; Bro, R.; Koch., R.S. Chemometrics Tutorial for PLS_Toolbox Solo; Eigenvector Research, Inc.: Wenatchee, WA, USA, 2006. [Google Scholar]
  42. Brereton, R.G.; Lloyd, G.R. Partial least squares discriminant analysis: Taking the magic away. J. Chemom. 2014, 28, 213–225. [Google Scholar] [CrossRef]
  43. (20) Eigenvector Research, Inc. Eigenvector Research Wiki. Available online: https://wiki.eigenvector.com/index.php?title=Selectvars (accessed on 15 March 2022).
Figure 1. ATR-FTIR experimental spectra of (a) honey varieties, (b) different harvesting years of acacia honey.
Figure 1. ATR-FTIR experimental spectra of (a) honey varieties, (b) different harvesting years of acacia honey.
Ijms 23 09977 g001
Figure 2. Plotted spectra of all investigated samples (for their differentiation distinct colors were used). The vertical lines marked the spectral points having high differentiation power for the harvesting year discrimination of the investigated honey samples.
Figure 2. Plotted spectra of all investigated samples (for their differentiation distinct colors were used). The vertical lines marked the spectral points having high differentiation power for the harvesting year discrimination of the investigated honey samples.
Ijms 23 09977 g002
Figure 3. Cross-validation (CV) and calibration (Cal.) classification error average as function of the number of latent variables used for the construction of the PLS-DA harvesting year differentiation model.
Figure 3. Cross-validation (CV) and calibration (Cal.) classification error average as function of the number of latent variables used for the construction of the PLS-DA harvesting year differentiation model.
Ijms 23 09977 g003
Figure 4. Score plot corresponding to the developed PLS-DA model for the harvesting year differentiation of the honey samples.
Figure 4. Score plot corresponding to the developed PLS-DA model for the harvesting year differentiation of the honey samples.
Ijms 23 09977 g004
Figure 5. (a) ROC curves containing estimated (blue), cross-validated (green) values and model threshold (red circle), and (b) overlapped RMSECV plots for 2020 (RMSECV 1—blue) and 2021 (RMSECV 2—green) groups associated with the PLS-DA model developed for harvesting year differentiation.
Figure 5. (a) ROC curves containing estimated (blue), cross-validated (green) values and model threshold (red circle), and (b) overlapped RMSECV plots for 2020 (RMSECV 1—blue) and 2021 (RMSECV 2—green) groups associated with the PLS-DA model developed for harvesting year differentiation.
Ijms 23 09977 g005
Figure 6. Plotted spectra of all investigated samples (for their differentiation distinct colors were used). The vertical lines marked the spectral points having high differentiation power for the botanical discrimination of the investigated honey samples.
Figure 6. Plotted spectra of all investigated samples (for their differentiation distinct colors were used). The vertical lines marked the spectral points having high differentiation power for the botanical discrimination of the investigated honey samples.
Ijms 23 09977 g006
Figure 7. Cross-validation (CV) and calibration (Cal.) classification error average as function of the number of latent variables used for the PLS-DA model aiming at the botanical differentiation of the honey samples.
Figure 7. Cross-validation (CV) and calibration (Cal.) classification error average as function of the number of latent variables used for the PLS-DA model aiming at the botanical differentiation of the honey samples.
Ijms 23 09977 g007
Figure 8. Score plot (LV1 vs. LV2 vs. LV3) associated with the PLS-DA model developed for the simultaneous differentiation of the four botanical honey classes (i.e., honeydew, colza, acacia and linden).
Figure 8. Score plot (LV1 vs. LV2 vs. LV3) associated with the PLS-DA model developed for the simultaneous differentiation of the four botanical honey classes (i.e., honeydew, colza, acacia and linden).
Ijms 23 09977 g008
Figure 9. (a) ROC curves containing estimated (blue), cross-validated (green) values and model threshold (red circle) and (b) RMSECV for each of the botanical honey classes (i.e., honeydew—1, colza—2, acacia—3 and linden—4) in PLS-DA model.
Figure 9. (a) ROC curves containing estimated (blue), cross-validated (green) values and model threshold (red circle) and (b) RMSECV for each of the botanical honey classes (i.e., honeydew—1, colza—2, acacia—3 and linden—4) in PLS-DA model.
Ijms 23 09977 g009aIjms 23 09977 g009b
Figure 10. Plotted spectra of all investigated samples (for their differentiation distinct colors were used). The vertical lines marked the spectral points having high differentiation power for the geographical discrimination of the investigated honey samples.
Figure 10. Plotted spectra of all investigated samples (for their differentiation distinct colors were used). The vertical lines marked the spectral points having high differentiation power for the geographical discrimination of the investigated honey samples.
Ijms 23 09977 g010
Figure 11. Cross-validation (CV) and calibration (Cal.) classification error average as function of the number of latent variables used to construct the PLS-DA model for the geographical differentiation of honey.
Figure 11. Cross-validation (CV) and calibration (Cal.) classification error average as function of the number of latent variables used to construct the PLS-DA model for the geographical differentiation of honey.
Ijms 23 09977 g011
Figure 12. PLS-DA score plot corresponding to the geographical differentiation of honey.
Figure 12. PLS-DA score plot corresponding to the geographical differentiation of honey.
Ijms 23 09977 g012
Figure 13. (a) ROC curves containing estimated (blue), cross-validated (green) values and model threshold (red circle), and (b) overlapped RMSECV plots for Transylvania (RMSECV 1—blue) and Others (RMSECV 2—green) classes associated with the PLS-DA model developed.
Figure 13. (a) ROC curves containing estimated (blue), cross-validated (green) values and model threshold (red circle), and (b) overlapped RMSECV plots for Transylvania (RMSECV 1—blue) and Others (RMSECV 2—green) classes associated with the PLS-DA model developed.
Ijms 23 09977 g013
Table 1. The honey botanical origin, year of harvest and geographical distribution of the authentic honey samples.
Table 1. The honey botanical origin, year of harvest and geographical distribution of the authentic honey samples.
Botanical OriginSample NumberHarvesting YearGeographical Origin
20202021TransylvaniaOthers
Acacia412021146
Linden30191176
Colza1881054
Honeydew2013784
Total10960493420
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

David, M.; Hategan, A.R.; Berghian-Grosan, C.; Magdas, D.A. The Development of Honey Recognition Models Based on the Association between ATR-IR Spectroscopy and Advanced Statistical Tools. Int. J. Mol. Sci. 2022, 23, 9977. https://doi.org/10.3390/ijms23179977

AMA Style

David M, Hategan AR, Berghian-Grosan C, Magdas DA. The Development of Honey Recognition Models Based on the Association between ATR-IR Spectroscopy and Advanced Statistical Tools. International Journal of Molecular Sciences. 2022; 23(17):9977. https://doi.org/10.3390/ijms23179977

Chicago/Turabian Style

David, Maria, Ariana Raluca Hategan, Camelia Berghian-Grosan, and Dana Alina Magdas. 2022. "The Development of Honey Recognition Models Based on the Association between ATR-IR Spectroscopy and Advanced Statistical Tools" International Journal of Molecular Sciences 23, no. 17: 9977. https://doi.org/10.3390/ijms23179977

APA Style

David, M., Hategan, A. R., Berghian-Grosan, C., & Magdas, D. A. (2022). The Development of Honey Recognition Models Based on the Association between ATR-IR Spectroscopy and Advanced Statistical Tools. International Journal of Molecular Sciences, 23(17), 9977. https://doi.org/10.3390/ijms23179977

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop