Next Article in Journal
Red and White Meat Intake in Relation to Gut Flora in Obese and Non-Obese Arab Females
Next Article in Special Issue
Thorough Characterization of ETHQB3.5, a QTL Involved in Melon Fruit Climacteric Behavior and Aroma Volatile Composition
Previous Article in Journal
Effect of Adding Matricaria recutita L., Cymbopogon citratus, or Mentha piperita L. Extracts to Fermented Orange Beverage: Sensory Evaluation, Physicochemical Characterization, and Prediction of Toxic Risks and Biological Activity In Silico
Previous Article in Special Issue
Soil Amendment and Storage Effect the Quality of Winter Melons (Benincasa hispida (Thunb) Cogn.) and Their Juice
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Hyperspectral Imaging with Machine Learning Approaches for Assessing Soluble Solids Content of Tribute Citru

School of Information Engineering, Huzhou University, Huzhou 313000, China
*
Author to whom correspondence should be addressed.
Foods 2023, 12(2), 247; https://doi.org/10.3390/foods12020247
Submission received: 8 November 2022 / Revised: 22 December 2022 / Accepted: 24 December 2022 / Published: 5 January 2023
(This article belongs to the Special Issue Postharvest Management of Fruits and Vegetables Series II)

Abstract

:
Tribute Citru is a natural citrus hybrid with plenty of vitamins and nutrients. Fruits’ soluble solids content (SSC) is a critical quality index. This study used hyperspectral imaging at two spectral ranges (400–1000 nm and 900–1700 nm) to determine SSC in Tribute Citru. Partial least squares regression (PLSR) and support vector regression (SVR) models were established in order to determine SSC using the spectral information of the calyx and blossom ends. The average spectra of both ends as well as their fusion was studied. The successive projections algorithm (SPA) and the correlation coefficient analysis (CCA) were used to examine the differences in characteristic wavelengths between the two ends. Most models achieved performances with the correlation coefficient of the training, validation, and testing sets over 0.6. Results showed that differences in the performances among the models using the one-sided and two-sided spectral information. No particular regulation could be found for the differences in model performances and characteristic wavelengths. The results illustrated that the sampling side was an influencing factor but not the determinant factor for SSC determination. These results would help with the development of real-world applications for citrus quality inspection without concerning the sampling sides and the spectral ranges.

1. Introduction

China is now the country with the largest production of citrus. Citrus is a popular fruit not only for its unique taste but also for its high nutritional content. There are many excellent cultivars of citrus throughout the world. As one of them, Tribute Citru, also known as emperor citrus or Gong citrus, is a kind of citrus with high contents of carbohydrates, fiber, protein, calcium, etc. [1].
The soluble solids content (SSC) is a crucial indicator for evaluating the quality of Tribute Citru. SSC is the primary and most important quality parameter of concern to consumers. The common method of measuring SSC is to use a refractometer to measure the squeezed juice, which is time-consuming and damaging to the sample. Moreover, it cannot meet the demands for large-scale non-destructive measurement. Thus, rapid and non-destructive analytical techniques are needed.
Visible/near-infrared spectroscopy is a rapid and non-destructive technique that does not produce chemical contamination or damage to the sample(s) and has been applied in detecting fruit SSC [2,3,4]. Hyperspectral imaging technology combines imaging and spectroscopy technology, which can obtain spatial information at continuous spectral bands with high spectral resolution. Many research papers have been published based upon using hyperspectral imaging technology to detect SSC in fruits [5,6,7,8,9].
There are variations to the physiochemical characteristics within different parts of the fruit. VIS/NIR spectroscopy conducts point scan. When collecting spectral information of fruits, the different sampling points along with the equator or other representative regions were generally collected [10,11,12,13]. Some studies have also discussed the influence of spectra acquisition from different areas for SSC measurement [14,15,16].
The impact of variation within the samples should be considered in order to create a more accurate and robust SSC evaluation method. Hyperspectral imaging (HSI) can acquire the image of one side of the sample, covering the variations within the side of the sample. However, most studies acquired hyperspectral images of only one side of the samples [5,6,7]. Rare studies have considered using different sides of the fruits for SSC determination [8,9,17].
The SSC values were generally obtained to establish prediction models by averaging the measured values of different sample parts. The spectrum of each sample was also averaged by the spectra of different scanning points. Thus, it would be better if both the SSC value and the spectrum represent the corresponding sample. Given that different areas have both variations and similarities, it was worth investigating whether measuring both sides of samples for HSI was necessary. On the other hand, researchers have shown that hyperspectral imaging at different spectral ranges could be used to determine the SSC of fruits. The spectral ranges were also investigated to explore the influence of sampling sides.
In this study, hyperspectral imaging was used to determine SSC in Tribute Citru. The specific objectives were to (1) explore and compare the performances of hyperspectral imaging for SSC determination at two different spectral ranges (400–1000 nm and 900–1700 nm); (2) explore and compare the performances of sampling sides (the calyx end and the blossom end) for SSC determination; (3) explore the performances for SSC determination using the average spectra of the two sides and fused spectra of the two sides; (4) explore the characteristic wavelengths for SSC determination using the spectra of two sides to evaluate the performance consistency by the two sides.

2. Materials and Methods

2.1. Sample Preparation

Tribute Citru fruits were purchased from a local orchard in Liuzhou, Guangxi Province, China. A total of 458 samples of Tribute Citru fruits with similar sizes and no apparent damage on the surface were harvested on 22 November 2021. Before image acquisition, the fruits were stored at transported for four days and stored in the experimental room at a relatively stable temperature (15 ± 1 ℃) for three days. All the samples were numbered sequentially and awaited the acquisition of hyperspectral images and subsequent SSC measurement. Figure 1 shows the RGB (red, green, and blue) images of citrus taken by the hyperspectral imaging system. The image of the fruit blossom end and the calyx end are shown in Figure 1a,b, respectively.

2.2. SSC Measurement

The experimental arrangement for measuring SSC of Tribute Citru included a WIGGENS BR0035 Digital Brix Refractometer (WIGGENS Technology Ltd., Beijing, China) with an accuracy of 0.1 and °Brix range of 0–35%. For each of the 458 samples, the fruit was cut into three pieces by the axis of the calyx and the blossom. Each piece was squeezed, and the juice was used to measure the SSC value. To avoid contamination, the refractometer was washed with distilled water after each measurement and dried with clean paper before the next reading. The average SSC value of the three pieces was used as the SSC of the sample.

2.3. Hyperspectral Imaging Systems

In this study, hyperspectral imaging systems at two spectral ranges were used for image acquisition of citrus. The two hyperspectral imaging systems differed in the hyperspectral camera. One hyperspectral camera was FX10 (Spectral Imaging Ltd., Oulu, Finland) with a spectral range of 400–1000 nm (VIS/NIR), and the other hyperspectral camera was FX17 (Spectral Imaging Ltd., Oulu, Finland) with a spectral range of 900–1700 nm. The two hyperspectral imaging systems shared the same light source (covering the spectral range of 35–2500 nm), moving platform, computer, and image acquisition software. First, the hyperspectral images of all the samples were acquired by FX10 camera. Next, the camera was changed to FX17 without changing the other hardware, and the hyperspectral images of the samples were then acquired by FX17 camera. The hyperspectral images were generated by pushbroom imaging (line scanning).

2.4. Hyperspectral Image Acquisition and Spectra Extraction

2.4.1. Hyperspectral Image Acquisition

The samples were placed on the mobile platform in sequence for hyperspectral image acquisition, as shown in Figure 1. The system parameters were adjusted for hyperspectral image acquisition. For FX10, the moving speed of the mobile platform was 17 mm/s, and the exposure time was set as 13.59 ms. For FX17, the moving speed of the mobile platform was set as 25 mm/s, and the exposure time was set as 8.26 ms. The difference in speed was because the imaging time of the two cameras was inconsistent. The hyperspectral camera will scan the calyx and blossom ends of Tribute Citru as side A and side B samples, respectively.

2.4.2. Spectra Extraction

Eight or fewer fruits were acquired in one hyperspectral image. Each fruit within the hyperspectral image was first cut as sub-hyperspectral images. The ENVI 4.7 software (ITT, Visual Information Solutions, Boulder, CO, USA) was used to cut each of the 458 samples from the original hyperspectral images, after which each sub-hyperspectral image contained only one fruit. The entire fruit region was defined as the region of interest (ROI), and the average spectrum of all pixel-wise spectra within the ROI was extracted.
Due to the noise at the beginning and end of the original spectra, the wavelength range of 487–974 nm was selected for FX10 and 933–1690 nm for FX17 of each sample. To eliminate the background interference and reduce the sample shape influence, wavelet transformation and area normalization preprocessing algorithms were used to preprocess the spectra. The spectral dataset of the fruit calyx end was named SP-A, and the spectral dataset of the fruit blossom end was named SP-B. By measuring the two sides of the samples, information of the entire fruit could be obtained, and more variations could be covered.

2.5. Data Analysis Methods

2.5.1. Regression Models

Support vector regression (SVR) is an important applied branch of support vector machine (SVM), a supervised learning algorithm for predicting discrete values [18]. The goal of SVR is to construct a hyperplane to fit the samples to minimize the overall distance between all samples and the hyperplane. SVR assumes that an error between the output value and the predicted value can be tolerated, and only the loss is calculated when the absolute value of the difference between the two is larger than the error. Therefore, SVR has a certain degree of tolerance to outliers, robustness, and excellent generalization ability. SVR can solve both linear and non-linear issues. For non-linear problems, kernel functions are crucial to map the original data into high-dimensional spaces. In this study, the radial basis function (RBF) was used as the kernel function. The regularization parameters and kernel function coefficients were optimized by the grid-search method in the range of [10−6, 106].
Partial least squares regression (PLSR) is used to seek the underlying linear relationship between two matrices (independent matrix X and dependent matrix Y), which is a hidden variable approach to modeling the covariance structure in these two vector spaces. In order to explain the multidimensional direction with the biggest variance in the Y-space, the PLSR model tries to identify the multidimensional direction of the X-space [19]. PLSR decomposes both variables X and Y to extract factors, then the latent factors are arranged in descending order of their correlation. In contrast to conventional mulblossomle linear regression (MLR) models, PLSR can be constructed with mulblossomle correlated independent variables. The model can also be built if the number of samples is less than that of variables.

2.5.2. Wavelength Selection Methods

Successive projections algorithm (SPA) is a forward feature variable selection method. By projecting wavelengths onto other wavelengths, SPA compares the projection vectors, and the wavelength with the largest projection vector is selected into the candidate subset of characteristic wavelengths. The regression model is then used to evaluate the performances of different subsets. Therefore, SPA selects the combination of variables that contains the least redundant information and the least covariance [20].
Correlation coefficient analysis (CCA) is used to evaluate the degree of linear correlation between the independent and dependent variables [21]. Its value range is [−1, 1]. A positive value means positive correlation, a negative value means negative correlation, and a larger absolute value means a higher linear correlation. In this study, the correlation coefficient between each wavelength and the SSC content was examined in order to identify the characteristic wavelengths for SSC determination. The Pearson correlation coefficient was used in this study shown as Equation (1):
ρ x , y = c o v x , y σ x σ y
where ρx,y stands for the correlation coefficient between the variables x and y, cov(x,y) stands for the covariance between the variables x and y, and σx and σy stand for the standard deviation of x and y.

2.6. Model Evaluation and Software

Six indicators were used to evaluate model performance, including correlation coefficient and root mean square errors of calibration (Rc and RMSEC), correlation coefficient and root mean square errors of validation (Rv and RMSEV), and correlation coefficient and root mean square errors of prediction (Rp and RMSEP) were assessed. Generally, the closer Rc, Rv, and Rp are to 1, the higher the stability and fit of the model, while the closer RMSEC, RMSEV, and RMSEP are to 0, the stronger the predictive ability of the model is.
In this study, the ENVI 4.7 software was used to cut the image of each fruit from hyperspectral images. Matlab R2017b (The Math Works, Natick, MA, USA) software was applied to group the samples and select effective wavelengths by SPA and CCA. The SVR model was developed on PyCharm Community Edition (version 2021.2.1) (JetBrains, Prague, Czech Republic) with Python (version 3.7.12), and the PLSR model was conducted on the Unscrambler X10.1 software (CAMO AS, Oslo, Norway). Specifically, the Origin 2017 64Bit (OriginLab, Northampton, MA, USA) was used to draw spectrograms and dotted line drawings.

3. Results

3.1. Outlier Removal

In this study, 458 fruits were studied. To further establish prediction models, outliers were first identified and removed. The outliers were identified in two steps. First, since SSC value of each sample was averaged by three manual measurements. To avoid manual measurement errors, the average SSC of each fruit was first calculated, and the difference between each measurement and the average value was then calculated. Indeed, there was no particular criterion for this kind of outlier removal. In this study, when the difference between the measurement and the averaged value was larger than 15%, the sample was regarded as an outlier sample. In total, 38 samples were identified as outliers in this step. Second, the remaining 420 samples were all used to build a PLSR model, and the predictive error of each sample was calculated. In the end, 20 samples with larger predictive errors were manually identified as outliers (with a negative impact on model performances) in the second step. Next, the 400 samples were subsequently randomly divided into the calibration set (300 samples), the validation set (50 samples), and the prediction set (50 samples) at a ratio of 6:1:1. It should be noted that outlier removal was conducted using the spectra of the calyx end. As for the two hyperspectral imaging systems, the identified outlier samples were different due to the fact that the PLSR model performed differently. The statistical analysis of SSC content in different sets is shown in Table 1.

3.2. Spectral Profiles

Figure 2 shows the VIS/NIR reflectance spectra (obtained by FX10) and NIR reflectance spectra (obtained by NIR), with the horizontal coordinates indicating the 487–974 nm bands and the 933–1690 nm bands and the vertical coordinates indicating the reflectance of the spectra. Figure 2a,b shows the VIS/NIR spectra of the fruit calyx end (abbreviated as FX10-SP-A) and the fruit blossom end (abbreviated as FX10-SP-B) after removing the outlier samples. Figure 2c,d shows the spectral curves of the two sampling sides at the NIR range (FX17-SP-A and FX17-SP-B, respectively) after removing the outlier samples. It could be seen that the trends of the SP-A and SP-B curves were quite similar for each hyperspectral imaging system. From the spectral profiles, exploring the differences between the two sides for SSC measurement was difficult.

3.3. Regression Models

PLSR and SVR were used to establish regression models with the FX10 and FX17 datasets of the fruit calyx and blossom ends. The regression results are shown in Table 2.
For the SVR model and PLSR model, the samples were shot in the same order on the fruit calyx end and fruit blossom end. For both sides of the fruits, the samples in the calibration, validation, and prediction sets were the same. No samples were in the three sets at the same time.
For spectra extracted from the FX10 hyperspectral images, SVR and PLSR models obtained close results for each sampling side. Regression models for the calyx end (FX10-SP-A) showed slightly better performances than those for the blossom end (FX10-SP-B), with Rc, Rv, and Rp all over 0.7, as shown in Figure 3. However, the differences in the performances between the two sides were not significant. For spectra extracted from the FX17 hyperspectral images, SVR and PLSR models also obtained close results for each sampling side. The overall prediction performances of the models for the calyx end (FX17-SP-A) were close to those for the blossom end (FX17-SP-B). The overall results of the regression models for the two sides showed differences in the performances between the sides, and the differences were insignificant. These results matched the literature [17].
It was worth investigating whether it was better to use the information of both sides instead of a single side. Thus, two different approaches to use the spectral information of the two sides were implemented. First, the average spectrum of the spectra of the two sides was calculated for each sample. Second, the low-level fusion of the spectrum of each side by directly concatenating the two spectra ranges was conducted. The results of models using average spectra and low-level fusion are shown in Table 2. For the average spectra of FX10, the PLSR model showed slightly better performances than the SVR model. For the average spectra of FX17, the SVR model showed slightly better performances than the PLSR model. For the low-level fusion of FX10, the PLSR model outperformed the SVR model. For low-level fusion of FX17, SVR, and PLSR models obtained close results.
For different datasets, the performances of the PLSR and SVR models varied. For the two sides, the performances of the same regression model were close. Moreover, the models using the average spectra of the two sides and the low-level fusion did not show significant improvement over the models using spectral profiles of a single side.
Indeed, the differences in the performances of models using a single side illustrated that there were variations of spectral profiles between the two sides. The close performances of models showed that these variations were not significant. It was a fact that both sides were in the same fruit, and the differences in spectral profiles might not be significant unless the color and maturity degree of the two sides were significantly different. When building regression models using spectra, it would be better to collect enough representative samples covering the variations of spectral profiles and SSC values. In this study, the performances of the two sides for SSC measurement did not show significant differences using different regression models, indicating no significant variations in spectral profiles. On the other hand, the two spectral ranges obtained close performances with differences, which matched the previous studies [17,22,23,24].
The close results of models using the spectra of a single side and the spectra of two sides (by averaging and fusing) indicated that the spectral profiles were similar within one fruit. In this study, the color and maturity degree of the two sides was close, resulting in the similarity of the spectral profiles in the visible and NIR ranges, respectively. Thus, as for the hyperspectral image, acquiring one side of the fruits might be enough to represent the sample unless the internal quality varied significantly within one fruit.

3.4. Analysis of Characteristic Wavelengths

The performances of models using the spectral information of a single side were close. In order to further explore the influence of spectral variations on characteristic wavelengths for SSC measurement, two variable selection methods, SPA and CCA, were used to select the characteristic wavelengths. The characteristic wavelengths of the two sides selected by SPA are shown in Table 3.
As shown in Table 3, the number of selected wavelengths accounted for 10%, 6%, 19% and 20% of the total number of wavelengths at the two spectral ranges, respectively. Figure 4 shows the characteristic wavelengths selected by SPA. Although the number of characteristic wavelengths for FX10-SP-A was more than that for FX10-SP-B, they were quite similar for the selected wavelengths. The characteristic wavelengths at 516 and 765 nm of FX10 were the same for the two sides, and the characteristic wavelengths at 545, 803, 877, and 902 nm of FX10-SP-A and the characteristic wavelengths at 540, 800, 872, and 905 nm of FX10-SP-B were close. A similar phenomenon could be found for FX17 datasets. The characteristic wavelengths at 1111, 1139, 1251, 1413, and 1427 nm were the same for both sides of FX17. The characteristic wavelengths at 1202, 1230, 1342, 1462, and 1611 nm of FX17-SP-A and 1209, 1237, 1335, 1455, and 1604 nm of FX17-SP-B were quite close.
Figure 5 shows the correlation coefficient between each wavelength and the corresponding SSC values. Similarities could be found for the correlation coefficient curves of both sides. The peaks of the correlation coefficient curves of the two sides indicated the larger correlation coefficients, and these wavelengths had the potential to contribute more to SSC measurement. The wavelengths with higher correlation coefficients were 529–569 nm, 620–680 nm, 699–724 nm, and 952–974 nm for both FX10-SP-A and FX10-SP-B. For FX17, the correlation coefficient curves of both sides showed similar trends. The wavelengths at 1153–1279 nm and 1398–1547 nm showed higher correlation coefficients.
The wavelengths in the visible range (400–780 nm) were mainly related to the color information of the fruits. The wavelength bands in the range of 780–1100 nm can be assigned as second and third overtone of C–H stretching and the second overtone of O–H stretching [25]. The wavelengths between 1200–1389 nm can be attributed to the third and second overtone of C–H stretch [25]. The wavelengths between 1400 nm and 1440 nm were related to water [26]. The wavelengths in the range of 1460–1600 nm might be related to cellulose [27]. The wavelengths between 1569 nm and 1604 nm can also be attributed to the O–H stretching of the first overtone in carbohydrates [28]. The wavelength of 1647 nm was attributed to the C–H stretching modes of aromatic C–H [29].
According to Table 3, Figure 2, Figure 4 and Figure 5, it can be seen that the spectral data of the two sides of Tribute Citru were similar, and the characteristic wavelengths that contributed more to the SSC detection were also quite similar. These results also indicated that a single side of the fruit might be enough for the SSC measurement of Tribute Citru.
SVR and PLSR were built using the characteristic wavelengths identified by SPA and CCA. The results are shown in Table 4. For characteristic wavelengths of FX10 selected by SPA, the correlation coefficients of the SVR and PLSR models of the calyx and blossom ends were around 0.6. Similar results could be observed for characteristic wavelengths of FX17 selected by SPA.
For the characteristic wavelengths of FX10 identified by the CCA method, the wavelength ranges of 529–569 nm, 620–680 nm, 699–724 nm, and 952–974 nm with similar trends of the calyx and blossom ends were selected to build PLSR and SVR models. Relatively poor performances were obtained. The correlation coefficients of the training sets of the SVR and PLSR models were below 0.7. For the characteristic wavelengths of FX17 identified by the CCA method, the bands at 1153–1279 nm and 1398–1547 nm with similar trends of the calyx and blossom ends were selected to build PLSR and SVR models. The results were not good enough.
Compared with the results in Table 3, the regression models using full range spectra performed slightly better than the corresponding models using identified characteristic wavelengths. Although characteristic wavelength selection can reduce the number of input variables and simplify the models, some useful information may also be lost.

4. Discussion

SSC is the most important quality attribute of concern to consumers. This study used hyperspectral imaging at two different spectral ranges to estimate the SSC in Tribute Citru, and the effect of the sampling sides of fruits was also investigated.
The physicochemical properties within one fruit varied in different regions resulting in differences in the corresponding spectral profiles. As for point-scan near-infrared spectroscopy, spectral profiles were generally acquired from different sampling regions, and corresponding SSC values of the same sampling area were measured. Guthire et al. (2006) showed the importance of the uniformity of the spectra measurement and quality attributes measurement [30]. The influence of sampling regions has also been discussed [14,15,16]. Unlike point-scan near-infrared spectroscopy, hyperspectral imaging can acquire images of one side of the fruit facing the camera, and each pixel contains a spectrum. Based on this characteristic, HSI was generally used to acquire images of only one side of the fruits [5,6,7]. An issue should be addressed that there might also be variations of the physicochemical properties between different sides of the fruits. In this study, hyperspectral images of the two different sides of Tribute Citru fruits were acquired and compared. The overall results of the SSC prediction of the two sides showed insignificant differences using different regression methods. Further analysis of the characteristic wavelengths for SSC determination by SPA and CCA showed great similarities between the two sides. Similar results could be found in previous research, and the performances of SSC prediction using the hyperspectral images of two opposite sides of plum showed insignificant differences [17].
Two approaches to using the information of the two sides were explored, including averaging the spectra of the two sides and concatenating the spectra of the two sides. The results did not show significant improvements using the two approaches compared with models using one side. Some reasons for this might be that averaging of the spectra of the two sides could use the information of the entire fruit, and some of the variations might be reduced by averaging; concatenating the spectra of the two sides might keep the variations; and some redundant information was used, due to the fact that the spectral features of the two sides were similar.
Relatively poor performances were obtained for the SSC determination of Tribute Citru. Some other studies have also obtained results that were not good enough [31,32,33]. Various factors affected the prediction performances (such as samples, instruments, and experimental operations, etc.), and more efforts should be made to improve the prediction performances.
This study showed that either side (the calyx and blossom ends) of the Tribute Citru could be used for SSC measurement of fruits. Although there were variations in the spectral profiles and physicochemical properties between the two sides of the Tribute Citru, future real-world applications can be developed by acquiring the hyperspectral images of either side of the fruits, which will reduce the cost of model development and online detection.

5. Conclusions

In this study, the influence of the sampling side of fruit for SSC determination using HSI was explored. The average of the spectra of the two sampling sides (the calyx end and the blossom end) and the fusion (direct concatenation) of the spectra of the two sampling sides were also investigated. The overall results of the two regression methods (PLSR and SVR) illustrated that the sampling sides might not significantly affect the prediction performances. Moreover, the average of the spectra of the two sampling sides and the fusion of the spectra of the two sampling sides did not significantly improve the prediction performances. The performances of the two regression methods validated the results. The characteristic wavelengths for the SSC prediction using different sampling sides by the same characteristic wavelength selection method were quite similar, indicating that sampling on either side (the calyx end and the blossom end) might be used to predict SSC. The performances of hyperspectral imaging at two different spectral ranges showed no significant differences, illustrating that both could be used for the SSC prediction of Tribute Citru. In future studies, more samples should be used to develop more robust and accurate models with machine learning methods, such as deep learning. The real-world applications of hyperspectral imaging to inspect quality and safety of Tribute Citru might then be developed in the future by scanning either side (the calyx end and the blossom end) of the samples at different spectral ranges.

Author Contributions

Conceptualization, C.Z.; data curation, C.Z.; formal analysis, C.L.; funding acquisition, H.Q.; investigation, C.L., M.H. and Z.C.; methodology, C.Z.; project administration, H.Q.; resources, C.Z.; software, C.L., Z.C. and J.Z.; supervision, H.Q.; Validation, C.L. and M.H.; visualization, C.Z.; writing—original draft, C.L.; writing—review and editing, C.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by Zhejiang Province Key Laboratory of Smart Management and Application of Modern Agricultural Resources under Grant 2020E10017.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data can be requested by contacting the corresponding authors via E-mail.

Acknowledgments

We would like to thank Guangyan Wen, Qian Li, and Manjing Zhang from the School of Information Engineering, Huzhou University, for their help in this research.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Li, G.; Liu, S.; Zhou, Q.; Han, J.; Qian, C.; Li, Y.; Meng, X.; Gao, X.; Zhou, T.; Li, P. Effect of Response Surface Methodology-Optimized Ultrasound-Assisted Pretreatment Extraction on the Composition of Essential Oil Released From Tribute citrus Peels. Front. Nutr. 2022, 9, 840780. [Google Scholar] [CrossRef]
  2. Liu, Y.; Chen, X.; Ouyang, A. Nondestructive determination of pear internal quality indices by visible and near-infrared spectrometry. LWT-FOOD 2008, 41, 1720–1725. [Google Scholar] [CrossRef]
  3. Jamshidi, B.; Minaei, S.; Mohajerani, E.; Ghassemian, H. Prediction of Soluble Solids in Oranges Using Visible/Near-Infrared Spectroscopy: Effect of Peel. Int. J. Food Prop. 2014, 17, 1460–1468. [Google Scholar] [CrossRef]
  4. Masithoh, R.E.; Pahlawan, M.F.R.; Wati, R.K. Non-destructive determination of SSC and pH of banana using a modular Vis/NIR spectroscopy: Comparison of Partial Least Square (PLS) and Principle Component Regression (PCR). In Proceedings of the IOP Conference Series: Earth and Environmental Science, Yogyakarta, Indonesia, 13–14 October 2020. [Google Scholar] [CrossRef]
  5. Zhang, H.; Zhan, B.; Pan, F.; Luo, W. Determination of soluble solids content in oranges using visible and near infrared full transmittance hyperspectral imaging with comparative analysis of models. Postharvest Biol. Technol. 2020, 163, 111148. [Google Scholar] [CrossRef]
  6. Wei, X.; He, J.; Zheng, S.; Ye, D. Modeling for SSC and firmness detection of persimmon based on NIR hyperspectral imaging by sample partitioning and variables selection. Infrared Phys. Technol. 2020, 105, 103099. [Google Scholar] [CrossRef]
  7. Zhang, D.; Xu, Y.; Huang, W.; Tian, X.; Xia, Y.; Xu, L.; Fan, S. Nondestructive measurement of soluble solids content in apple using near infrared hyperspectral imaging coupled with wavelength selection algorithm. Infrared Phys. Technol. 2019, 98, 297–304. [Google Scholar] [CrossRef]
  8. Gabrielli, M.; Lançon-Verdier, V.; Picouet, P.; Maury, C. Hyperspectral Imaging to Characterize Table Grapes. Chemosensors 2021, 9, 71. [Google Scholar] [CrossRef]
  9. Ma, T.; Xia, Y.; Inagaki, T.; Tsuchikawa, S. Non-destructive and fast method of mapping the distribution of the soluble solids content and pH in kiwifruit using object rotation near-infrared hyperspectral imaging approach. Postharvest Biol. Technol. 2021, 174, 111440. [Google Scholar] [CrossRef]
  10. Zou, X.; Zhao, J.; Li, Y. Selection of the efficient wavelength regions in FT-NIR spectroscopy for determination of SSC of ‘Fuji’ apple based on BiPLS and FiPLS models. Vib. Spectrosc. 2007, 44, 220–227. [Google Scholar] [CrossRef]
  11. Pissard, A.; Marques, E.J.N.; Dardenne, P.; Lateur, M.; Pasquini, C.; Pimentel, M.F.; Fernández Pierna, J.A.; Baeten, V. Evaluation of a handheld ultra-compact NIR spectrometer for rapid and non-destructive determination of apple fruit quality. Postharvest Biol. Technol. 2021, 172, 111375. [Google Scholar] [CrossRef]
  12. Liu, C.; Yang, S.X.; Li, X.; Xu, L.; Deng, L. Noise level penalizing robust Gaussian process regression for NIR spectroscopy quantitative analysis. Chemom. Intell. Lab. Syst. 2020, 201, 104014. [Google Scholar] [CrossRef]
  13. Liu, Y.; Zhang, Y.; Jiang, X.; Liu, H. Detection of the quality of juicy peach during storage by visible/near infrared spectroscopy. Vib. Spectrosc. 2020, 111, 103152. [Google Scholar] [CrossRef]
  14. Cayuela, J.A. Vis/NIR soluble solids prediction in intact oranges (Citrus sinensis L.) cv. Valencia Late by reflectance. Postharvest Biol. Technol. 2008, 47, 75–80. [Google Scholar] [CrossRef]
  15. Xu, X.; Mo, J.; Xie, L.; Ying, Y. Influences of Detection Position and Double Detection Regions on Determining Soluble Solids Content (SSC) for Apples Using Online Visible/Near-Infrared (Vis/NIR) Spectroscopy. Food Anal. Method 2019, 12, 2078–2085. [Google Scholar] [CrossRef]
  16. Castrignanò, A.; Buttafuoco, G.; Malegori, C.; Genorini, E.; Iorio, R.; Sblossomic, M.; Girone, G.; Venezia, A. Assessing the Feasibility of a Miniaturized Near-Infrared Spectrometer in Determining Quality Attributes of San Marzano Tomato. Food Anal. Method 2019, 12, 1497–1510. [Google Scholar] [CrossRef]
  17. Li, B.; Cobo-Medina, M.; Lecourt, J.; Harrison, N.; Harrison, R.J.; Cross, J.V. Application of hyperspectral imaging for nondestructive measurement of plum quality attributes. Postharvest Biol. Technol. 2018, 141, 8–15. [Google Scholar] [CrossRef]
  18. Huang, F.-H.; Liu, Y.-H.; Sun, X.; Yang, H. Quality inspection of nectarine based on hyperspectral imaging technology. Syst. Sci. Control Eng. 2021, 9, 350–357. [Google Scholar] [CrossRef]
  19. Rajkumar, P.; Wang, N.; Eimasry, G.; Raghavan, G.S.V.; Gariepy, Y. Studies on banana fruit quality and maturity stages using hyperspectral imaging. J. Food Eng. 2012, 108, 194–200. [Google Scholar] [CrossRef]
  20. Yuan, Y.; Wang, W.; Chu, X.; Xi, M.-j. Selection of Characteristic Wavelengths Using SPA and Qualitative Discrimination of Mildew Degree of Corn Kernels Based on SVM. Spectrosc. Spectr. Anal. 2016, 36, 226–230. [Google Scholar] [CrossRef]
  21. Jiang, W.; Lu, C.; Zhang, Y.; Ju, W.; Wang, J.; Xiao, M. Molecular spectroscopic wavelength selection using combined interval partial least squares and correlation coefficient optimization. Anal. Methods 2019, 11, 3108–3116. [Google Scholar] [CrossRef]
  22. Ye, W.; Yan, T.; Zhang, C.; Duan, L.; Chen, W.; Song, H.; Zhang, Y.; Xu, W.; Gao, P. Detection of Pesticide Residue Level in Grape Using Hyperspectral Imaging with Machine Learning. Foods 2022, 11, 1609. [Google Scholar] [CrossRef] [PubMed]
  23. Zhu, H.; Chu, B.; Fan, Y.; Tao, X.; Yin, W.; He, Y. Hyperspectral Imaging for Predicting the Internal Quality of Kiwifruits Based on Variable Selection Algorithms and Chemometric Models. Sci. Rep. 2017, 7, 7845. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Yan, T.; Duan, L.; Chen, X.; Gao, P.; Xu, W. Application and interpretation of deep learning methods for the geographical origin identification of Radix Glycyrrhizae using hyperspectral imaging. RSC Adv. 2020, 10, 41936–41945. [Google Scholar] [CrossRef]
  25. Daneshvar, A.; Tigabu, M.; Karimidoost, A.; Oden, P.C. Single seed Near Infrared Spectroscopy discriminates viable and non-viable seeds of Juniperus polycarpos. Silva Fennica 2015, 49, 1334. [Google Scholar] [CrossRef] [Green Version]
  26. Büning-Pfaue, H. Analysis of water in food by near infrared spectroscopy. Food Chem. 2003, 82, 107–115. [Google Scholar] [CrossRef]
  27. Jochemsen, A.; Alfredsen, G.; Burud, I. Hyperspectral imaging as a tool for profiling basidiomycete decay of Pinus sylvestris L. Int. Biodeterior. Biodegrad. 2022, 174, 105464. [Google Scholar] [CrossRef]
  28. Bowler, A.L.; Ozturk, S.; Rady, A.; Watson, N. Domain Adaptation for In-Line Allergen Classification of Agri-Food Powders Using Near-Infrared Spectroscopy. Sensors 2022, 22, 7239. [Google Scholar] [CrossRef]
  29. Okparanma, R.N.; Mouazen, A.M. Visible and Near-Infrared Spectroscopy Analysis of a Polycyclic Aromatic Hydrocarbon in Soils. Sci. World J. 2013, 2013, 160360. [Google Scholar] [CrossRef] [Green Version]
  30. Guthrie, J.A.; Liebenberg, C.J.; Walsh, K.B. NIR model development and robustness in prediction of melon fruit total soluble solids. Aust. J. Agric. Res. 2006, 57, 411. [Google Scholar] [CrossRef]
  31. Sánchez, M.-T.; De la Haba, M.-J.; Pérez-Marín, D. Internal and external quality assessment of mandarins on-tree and at harvest using a portable NIR spectrophotometer. Comput. Electron. Agr. 2013, 92, 66–74. [Google Scholar] [CrossRef]
  32. Wang, A.; Hu, D.; Xie, L. Comparison of detection modes in terms of the necessity of visible region (VIS) and influence of the peel on soluble solids content (SSC) determination of calyx orange using VIS–SWNIR spectroscopy. J. Food Eng. 2014, 126, 126–132. [Google Scholar] [CrossRef]
  33. Fu, X.; Wang, X.; Rao, X. An LED-based spectrally tuneable light source for visible and near-infrared spectroscopy analysis: A case study for sugar content estimation of citrus. Biosyst. Eng. 2017, 163, 87–93. [Google Scholar] [CrossRef]
Figure 1. RGB images of Tribute Citru. (a): the fruit blossom end; (b) the fruit calyx end.
Figure 1. RGB images of Tribute Citru. (a): the fruit blossom end; (b) the fruit calyx end.
Foods 12 00247 g001
Figure 2. VIS/NIR spectra and NIR spectra of the fruit calyx and blossom ends. (a) VIS/NIR spectra of the fruit calyx end (FX10-SP-A); (b) VIS/NIR spectra of the fruit blossom end (FX10-SP-B); (c) NIR spectra of the fruit calyx end (FX17-SP-A); (d) NIR spectra of the fruit blossom end (FX17-SP-B).
Figure 2. VIS/NIR spectra and NIR spectra of the fruit calyx and blossom ends. (a) VIS/NIR spectra of the fruit calyx end (FX10-SP-A); (b) VIS/NIR spectra of the fruit blossom end (FX10-SP-B); (c) NIR spectra of the fruit calyx end (FX17-SP-A); (d) NIR spectra of the fruit blossom end (FX17-SP-B).
Foods 12 00247 g002
Figure 3. Scatter plots of the measured SSC values and predicted SSC values by SVR. (a) FX10-SP-A of calibration set; (b) FX10-SP-A of validation set; (c) FX10-SP-A of prediction set.
Figure 3. Scatter plots of the measured SSC values and predicted SSC values by SVR. (a) FX10-SP-A of calibration set; (b) FX10-SP-A of validation set; (c) FX10-SP-A of prediction set.
Foods 12 00247 g003
Figure 4. Characteristic wavelengths obtained with SPA. (a) Selected wavelengths of VIS/NIR spectra including SP-A and SP-B; (b) selected wavelengths of NIR spectra including SP-A and SP-B. (Note: Given that the average spectra of the two sides partially overlap and are not convenient for observation, the reflectance values of the SP-B spectra of FX10 were averaged and increased by 0.01, and the reflectance values of the SP-B spectra of FX17 were averaged and increased by 0.001).
Figure 4. Characteristic wavelengths obtained with SPA. (a) Selected wavelengths of VIS/NIR spectra including SP-A and SP-B; (b) selected wavelengths of NIR spectra including SP-A and SP-B. (Note: Given that the average spectra of the two sides partially overlap and are not convenient for observation, the reflectance values of the SP-B spectra of FX10 were averaged and increased by 0.01, and the reflectance values of the SP-B spectra of FX17 were averaged and increased by 0.001).
Foods 12 00247 g004
Figure 5. Correlation coefficient corresponding to the wavelengths. (a) Correlation coefficient curves of VIS/NIR averaging spectra (FX10); (b) correlation coefficient curves of NIR averaging spectra (FX17).
Figure 5. Correlation coefficient corresponding to the wavelengths. (a) Correlation coefficient curves of VIS/NIR averaging spectra (FX10); (b) correlation coefficient curves of NIR averaging spectra (FX17).
Foods 12 00247 g005
Table 1. Statistical analysis of reference values of samples in the calibration, validation, and prediction sets.
Table 1. Statistical analysis of reference values of samples in the calibration, validation, and prediction sets.
CameraCalibration Set (°Brix)Validation Set (°Brix)Prediction Set (°Brix)
NumberMinMaxNumberMinMaxNumberMinMax
FX1030010.214.65010.615.05010.714.2
FX1730010.114.65010.514.65010.614.4
Table 2. Results and parameters of the calibration, validation, and prediction sets by SVR and PLSR models.
Table 2. Results and parameters of the calibration, validation, and prediction sets by SVR and PLSR models.
DatasetModelModel ParameterCalibration SetValidation SetPrediction Set
Rc kRMSEC n
(°Brix)
Rv lRMSEV o
(°Brix)
Rp mRMSEP p
(°Brix)
FX10-SP-A aSVR iGamma: 1000.0
C: 106
Eps: 10−5
0.8100.4350.7500.5110.7050.53
PLSR jFactor: 190.7860.4510.7690.4970.7310.51
FX10-SP-B bSVRGamma: 10.0
C: 106
Eps: 10−5
0.6540.5620.7020.5480.6720.557
PLSRFactor: 160.7440.4870.7050.5580.6550.571
FX17-SP-A cSVRGamma: 10,000.0
C: 1000.0
Eps: 10−5
0.7250.4970.7440.5780.5880.567
PLSRFactor: 110.7380.4860.7520.5610.6390.514
FX17-SP-B dSVRGamma: 100.0
C: 106
Eps: 0.001
0.7000.5160.7690.5440.5290.591
PLSRFactor: 80.6850.5240.7720.5400.5960.556
FX10-average eSVRGamma: 100.0
C: 106
Eps: 0.001
0.7260.5030.7330.5230.5130.658
PLSRFactor: 170.7820.4550.7510.5150.7360.515
FX17-average fSVRGamma: 10,000.0
C: 104
Eps: 0.001
0.7960.4370.7650.5460.7200.484
PLSRFactor: 110.7470.4790.7730.5400.6110.540
FX10-fusion gSVRGamma: 1000.0
C: 105
Eps: 0.001
0.8820.3440.7070.5660.5290.675
PLSRFactor: 180.7530.4800.7110.5390.6800.552
FX17-fusion hSVRGamma: 10.0
C: 106
Eps: 0.0001
0.7080.5100.7700.5430.5990.534
PLSRFactor: 120.7200.4990.7340.5820.5970.547
a FX10-SP-A: data of the calyx end obtained by FX10 camera; b FX10-SP-B: data of the blossom end obtained by FX10 camera; c FX17-SP-A: data of the calyx end obtained by FX17 camera; d FX17-SP-B: data of the blossom end obtained by FX17 camera; e FX10-average: the average spectra of the two sides obtained by FX10 camera; f FX17-average: the average spectra of the two sides obtained by FX17 camera; g FX10-fusion: the fusion spectra of the two sides obtained by FX10 camera; h FX17-fusion: the fusion spectra of the two sides obtained by FX17 camera; i SVR: support vector regression; j PLSR: partial least squares regression; k Rc: correlation coefficient of calibration; l Rv: correlation coefficient of validation; m Rp: correlation coefficient of prediction; n RMSEC: root mean square errors of calibration; o RMSEV: root mean square errors of validation; p RMSEP: root mean square errors of prediction.
Table 3. Characteristic wavelengths selected by SPA for the datasets of the two sides of the two hyperspectral image systems.
Table 3. Characteristic wavelengths selected by SPA for the datasets of the two sides of the two hyperspectral image systems.
DatasetsNumberCharacteristic Wavelengths (nm)
FX10-SP-A18494, 516, 545, 574, 596, 674, 699, 704, 707, 713, 721, 743, 754, 765, 803, 877, 902, 944
FX10-SP-B10516, 526, 540, 548, 765, 800, 817, 872, 905, 927
FX17-SP-A19993, 1111, 1139, 1153, 1167, 1202, 1230, 1244, 1251, 1265, 1342, 1363, 1384, 1413, 1427, 1462, 1505, 1554, 1611
FX17-SP-B201020, 1111, 1139, 1146, 1188, 1209, 1237, 1251, 1286, 1321, 1335, 1356, 1377, 1413, 1427, 1455, 1490, 1604, 1626, 1647
Table 4. Results and parameters of the calibration, validation, and prediction sets by SVR and PLSR models using identified characteristic wavelengths.
Table 4. Results and parameters of the calibration, validation, and prediction sets by SVR and PLSR models using identified characteristic wavelengths.
DatasetModelModel ParameterCalibration SetValidation SetPrediction Set
Rc kRMSEC n (°Brix)Rv lRMSEV o (°Brix)Rp mRMSEP p (°Brix)
SPA-FX10-SP-A aSVR iGamma: 105
C: 105
Eps: 10−3
0.8910.3330.8240.4740.5961.459
PLSR jFactor: 150.7710.4650.7670.4950.6780.544
SPA-FX10-SP-B bSVRGamma: 1000.0
C: 106
Eps: 10−5
0.6830.5350.7040.5440.6360.592
PLSRFactor: 90.7040.5180.6750.5640.6640.568
SPA-FX17-SP-A cSVRGamma: 104
C: 103
Eps: 10−5
0.6360.5580.7370.5880.5460.561
PLSRFactor: 140.7430.4820.7180.5920.6700.498
SPA-FX17-SP-B dSVRGamma: 103
C: 105
Eps: 10−3
0.6810.5290.7770.5410.5960.540
PLSRFactor: 80.6890.5220.7740.5370.5940.559
CCA-FX10-SP-A eSVRGamma: 10
C: 106
Eps: 10−5
0.5710.6030.6860.5600.4560.650
PLSRFactor: 140.6850.5310.6990.5530.6720.564
CCA-FX10-SP-B fSVRGamma: 103
C: 105
Eps: 10−3
0.667 0.550 0.604 0.609 0.663 0.560
PLSRFactor: 150.7050.5170.6650.6070.6290.594
CCA-FX17-SP-A gSVRGamma: 105
C: 103
Eps: 10−4
0.714 0.507 0.794 0.533 0.555 0.565
PLSRFactor: 90.6990.5140.7580.5530.6580.499
CCA-FX17-SP-B hSVRGamma: 104
C: 105
Eps: 10−3
0.6970.5170.7780.5340.5190.626
PLSRFactor: 90.6680.5350.7880.5270.5060.644
a SPA-FX10-SP-A: data of the calyx end obtained by FX10 camera using SPA; b SPA-FX10-SP-B: data of the blossom end obtained by FX10 camera using SPA; c SPA-FX17-SP-A: data of the calyx end obtained by FX17 camera using SPA; d SPA-FX17-SP-B: data of the blossom end obtained by FX17 camera using SPA; e CCA-FX10-SP-A: data of the calyx end obtained by FX10 camera using CCA; f CCA-FX10-SP-B: data of the blossom end obtained by FX10 camera using CCA; g CCA-FX17-SP-A: data of the calyx end obtained by FX17camera using CCA; h CCA-FX17-SP-B: data of the blossom end obtained by FX17 camera using CCA; i SVR: support vector regression; j PLSR: partial least squares regression; k Rc: correlation coefficient of calibration; l Rv: correlation coefficient of validation; m Rp: correlation coefficient of prediction; n RMSEC: root mean square errors of calibration; o RMSEV: root mean square errors of validation; p RMSEP: root mean square errors of prediction.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, C.; He, M.; Cai, Z.; Qi, H.; Zhang, J.; Zhang, C. Hyperspectral Imaging with Machine Learning Approaches for Assessing Soluble Solids Content of Tribute Citru. Foods 2023, 12, 247. https://doi.org/10.3390/foods12020247

AMA Style

Li C, He M, Cai Z, Qi H, Zhang J, Zhang C. Hyperspectral Imaging with Machine Learning Approaches for Assessing Soluble Solids Content of Tribute Citru. Foods. 2023; 12(2):247. https://doi.org/10.3390/foods12020247

Chicago/Turabian Style

Li, Cheng, Mengyu He, Zeyi Cai, Hengnian Qi, Jianhong Zhang, and Chu Zhang. 2023. "Hyperspectral Imaging with Machine Learning Approaches for Assessing Soluble Solids Content of Tribute Citru" Foods 12, no. 2: 247. https://doi.org/10.3390/foods12020247

APA Style

Li, C., He, M., Cai, Z., Qi, H., Zhang, J., & Zhang, C. (2023). Hyperspectral Imaging with Machine Learning Approaches for Assessing Soluble Solids Content of Tribute Citru. Foods, 12(2), 247. https://doi.org/10.3390/foods12020247

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop