Next Article in Journal
New Insights into the Design and Application of a Passive Acoustic Monitoring System for the Assessment of the Good Environmental Status in Spanish Marine Waters
Next Article in Special Issue
Influence of the Flow Rate in an Automated Microfluidic Electronic Tongue Tested for Sucralose Differentiation
Previous Article in Journal
A Wearable Device for Breathing Frequency Monitoring: A Pilot Study on Patients with Muscular Dystrophy
Previous Article in Special Issue
Towards a Multi-Enzyme Capacitive Field-Effect Biosensor by Comparative Study of Drop-Coating and Nano-Spotting Technique
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Evaluation of Discrimination Performance in Case for Multiple Non-Discriminated Samples: Classification of Honeys by Fluorescent Fingerprinting

by
Elizaveta A. Rukosueva
1,
Valeria A. Belikova
2,
Ivan N. Krylov
1,
Vladislav S. Orekhov
1,
Evgenii V. Skorobogatov
1,
Andrei V. Garmash
1 and
Mikhail K. Beklemishev
1,*
1
Department of Chemistry, M.V.Lomonosov Moscow State University, GSP-1, Leninskie Gory, 1–3, 119991 Moscow, Russia
2
Laboratory of Multivariate Analysis and Global Modeling, Samara State Technical University, 244 Molodogvardeyskaya str., 443100 Samara, Russia
*
Author to whom correspondence should be addressed.
Sensors 2020, 20(18), 5351; https://doi.org/10.3390/s20185351
Submission received: 20 July 2020 / Revised: 8 September 2020 / Accepted: 15 September 2020 / Published: 18 September 2020
(This article belongs to the Special Issue Multisensor Systems and Signal Processing in Analytical Chemistry)

Abstract

:
In this study we develop a variant of fluorescent sensor array technique based on addition of fluorophores to samples. A correct choice of fluorophores is critical for the successful application of the technique, which calls for the necessity of comparing different discrimination protocols. We used 36 honey samples from different sources to which various fluorophores were added (tris-(2,2′-bipyridyl) dichlororuthenium(II) (Ru(bpy)32+), zinc(II) 8-hydroxyquinoline-5-sulfonate (8-Ox-Zn), and thiazole orange in the presence of two types of deoxyribonucleic acid). The fluorescence spectra were obtained within 400–600 nm and treated by principal component analysis (PCA). No fluorophore allowed for the discrimination of all samples. To evaluate the discrimination performance of fluorophores, we introduced crossing number (CrN) calculated as the number of mutual intersections of confidence ellipses in the PCA scores plots, and relative position (RP) characterized by the pairwise mutual location of group centers and their most distant points. CrN and RP parameters correlated with each other, with total sensitivity (TS) calculated by Mahalanobis distances, and with the overall rating based on all metrics, with coefficients of correlation over 0.7. Most of the considered parameters gave the first place in the discrimination performance to Ru(bpy)32+ fluorophore.

1. Introduction

Fluorescence spectrum of a sample comprises a specific fingerprint, which can be used for classification purposes. Most fluorimetric fingerprinting classification methods use intrinsic fluorescence of samples [1,2]. A less developed approach is based on adding fluorophores to samples [3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22], which promises their better discrimination because of the interaction of fluorophore(s) with non-fluorescent components of the sample and corresponding changes in the spectrum. If more than one fluorophore is added to samples, we can talk about fluorescent sensor arrays [4,5,8,10,11,15,17,20].
A key issue in the development of the “add-a-fluorophore” strategy is selection of fluorophores, since their chemical nature determines the possibility of fluorescence quenching/dequenching by the sample components. To make a proper selection of fluorophore(s), a method of comparing the discrimination results is quite necessary. In case for incomplete discrimination, the simplest characteristic of performance is the number of discriminated groups. However, sometimes it does not reveal the difference between various added fluorophores. It is important to develop new techniques for assessing discrimination performance: effective, convenient, and at the same time simple.
In this work we are suggesting two measures of discrimination: first is calculation of the number of intersections of confidence ellipses in the principal component analysis scores plots (crossing number, CrN), and second is relative position (RP) characterizing the pairwise mutual location of group centers and their most distant points. The suggested parameters are compared with the standard metrics for the multiclass classification: total sensitivity (TS), that equals the ratio of correctly classified samples to the total number of samples, calculated using various methods (Mahalanobis distances between classes, quadratic discriminant analysis and support vector machine), and also the number of discriminated groups of samples in the PCA scores plots (No of groups).
In this study we classified 36 samples of honey of different origin. A number of examples of successful application of fluorescent fingerprinting technique to the discrimination of honeys is published [23]; honeys were identified and classified using 3D fluorescence in combination with parallel factor analysis [24,25], front-face fluorescence spectroscopy using factorial discriminant analysis [26] and cluster analysis [27], principal component and linear discriminant analysis [28]. No publications on adding fluorophores to honey have been found.
In an attempt to improve the discrimination of honeys with respect to using only their intrinsic emission, we added several fluorophores to the samples. No complete discrimination in one step was achieved with any fluorophore because of a large number of samples of similar nature. A large number of non-discriminated samples complicated the selection of the fluorophore allowing for the most complete discrimination. To solve this problem, we applied the suggested performance criteria.

2. Materials and Methods

2.1. Reagents and Fluorophores

We used the metal complexes proved efficient in the previous fingerprint studies as fluorophores for the middle-wave part of the visible spectrum [3]: zinc(II) complex with 8-hydroxyquinoline-5-sulfonic acid (8-Ox-Zn) (Scheme 1c) obtained by mixing equal volumes of aqueous solutions of Zn(II) chloride and the named acid (0.001 mol·L−1 each), and tris(2,2′-bipyridyl)dichlororuthenium(II) (Ru(bpy)32+) (Aldrich) (Scheme 1a) dissolved in water to obtain a 0.001 mol L−1 solution and diluted to 1 × 10−5 mol·L−1 prior to use. Also, thiazole orange dye was used as a fluorophore emitting in the long-wave portion of the spectrum, and its fluorescence was enhanced by intercalating it [29] into deoxyribonucleic acid (DNA). DNA was obtained from two sources: a sodium salt from herring testes from Aldrich (DNA-1), and medicinal drug «Derinat» (ZAO FP “Technomedservis”), which contained 2.5 g DNA and 1 g NaCl in 1 L of water (DNA-2). DNA-1 was dissolved in water to obtain 7.2 × 10−3 mol·L−1 (counting the average molar weight of a nucleotide pair as 670 g/mol). DNA-1 and DNA-2 solutions were diluted to 7.2 × 10−5 mol·L−1 with water prior to use. Thiazole orange (Scheme 1b) was used as a 2.1 × 10−4 mol·L−1 solution in water. Other reagents were of analytical reagent grade and used as received. Millipore water (18 MΩ cm) was used to prepare all solutions.

2.2. Honey Samples

The honey samples (Nos 1–36) were produced in Russia between 2016 and 2018. The information on the botanical origin of honeys was obtained from the label (for the samples purchased in the local stores) or conveyed by the farmer (for the farm honey samples). The farm samples included polyfloral honeys 1, 2, 3, 4, 27, 32, 33, 34, 35, 36; clover (trefoil) honeys 14, 15, 16; fireweed (Chamaenerion angustifolium) honeys 17, 18, 19, 28, 31; linden (Tilia) honeys 20, 21, 22, 23, linden and raspberries honey 24; hippophae (Rhamnus) honey 25; white acacia honey 26; lavender honey 29; honey with galipot 30. The samples from the stores included polyfloral honeys 5, 6, 7, 8, 9, 12, 13, linden (Tilia) honey 10, and buckwheat (Fagopyrum esculentum) honey 11. Honeys were dissolved at 0.1 g/mL in water in 10-mL vials and stored at 4°C until analysis.

2.3. Procedures and Instrumentation

Aliquots of honey solutions (usually 50 μL) were placed in the wells of a 96-well polystyrene plate (Thermo Scientific Nunc F96 MicroWell, white, cat. No 136101) and one of the following fluorophore solutions (see Section 2.1) was added: Ru(bpy)32+—50 μL; 8-Ox-Zn—100 μL; thiazole orange was mixed with DNA in different ratios. The obtained mixtures were diluted with distilled water to obtain a total volume of 330 μL. No fluorophores were added when registering the intrinsic emission of the samples. Each sample was measured in five parallel runs.
Agilent Cary Eclipse spectrofluorimeter with a microplate reader accessory was used to obtain emission spectra upon excitation at different wavelength for each fluorophore. Typically, we excited samples at 320 and 360 nm. The emission range was 400 to 800 nm, and both the excitation and emission slits were set at 10 nm.

2.4. Data Treatment

Complexity of choosing most efficient fluorophore is related with multidimensionality of initial data (several fluorophores, its respective data sets, included measurements of each sample from predetermined set on wide wavelengths region). To simplify the task, a dimension reduction method is applied to each dataset (principal component analysis; scores of two first principal component are used). After this each data set can be represented by a set of points on a plane (each point is corresponded one measurement) and the task might be reduced to assessment of relative position of groups of points (which corresponded one type of samples). Besides, we have proposed to use special indexes, as described below.

2.4.1. Data Pretreatment

The raw spectral data were normalized, which involved the calculation of the formal degree of quenching (Q) as (I − I0)/I0, where I0 is the average emission intensity of the blank at the corresponding wavelength. Before application of each statistical method, the data were decomposed into two principal components.

2.4.2. Principal Component Analysis (PCA)

In the data matrix for each of the 36 studied honeys (5 parallel wells for each) and 6 blanks, each row contained 301 intensities corresponding to wavelengths of the spectrum. The data matrix was centered and subjected to PCA using the Unscrambler X (CAMO Software, https://www.camo.com/unscrambler). Typically, two PCs were sufficient to reach a residual dispersion of ≤5%. The result was visualized as the scores plot (PC1 vs. PC2) constructed using Origin 8.1 spreadsheet (OriginLab Corp., https://www.originlab.com/index.aspx?go=PRODUCTS/Origin). Each experiment (an emission spectrum of a well) yielded one point in the scores plot. Different points belonging to the same sample were similarly shaped and colored in the Figures.

2.4.3. Assessment of Discrimination Performance Using Confidence Ellipses

Confidence ellipses for each individual honey were constructed according to the Student’s distribution using the results of the five parallel measurements with a confidence level of 80% and plotted on the scores plots. The criterion for the separation of two honey samples was Mahalanobis distance (DM) value equal to 4, which corresponded to visual non-intersection of the ellipses. The DM were calculated using the R language-based software.
As an easy way to characterize the discrimination performance, we counted the number of samples separated in the scores plot (the “Number of groups” parameter). The samples were considered separated if their confidence ellipses did not intersect.
Another technique involves manual counting the number of intersections of the confidence ellipses. We considered the number of such intersections as a measure of discrimination performance. The ellipses may intersect in a number of ways, for which the crossing numbers were postulated (Figure 1). If one ellipse lied completely inside another, then the number of intersections was postulated to be equal to 4 to account for the high degree of overlapping.

2.4.4. Assessment of Discrimination Performance by Relative Position (RP) Index

Let us consider two classes each of which is characterized by its center and boundary. Above we described the data set by an ellipse covering 95% of samples; in practical situations, when the distribution law is unknown, we suggest using the simplest method of constructing the border: a convex polygon built on the boundary points of the group. For illustration (Figure 2), we use two classes of points marked in red and green. We distinguish three variants of relative position of the groups: separation (Figure 2a,b), intersection (Figure 2c), and merging (Figure 2d). The separation variant is characterized by the distance between the centers of mass: for example, in Figure 2a the distance is greater than in Figure 2b. When the classes are merged, their centers lie inside the intersection region, and when the classes just intersected, their centers lie outside the intersection region.
RP parameter is calculated using four key points: the center of the first group C1; the center of the second group C2; point D1, characterizing the first group scattering with respect to the location of the second group; point D2, characterizing the second group scattering relatively the location of the first group (Figure 3).
The ratio of the lengths of the segments C1D2 to C1D1 has the following property: In case of separation of groups, this ratio exceeds unity; in the case of intersection it is less than unity. In order to take into account the case of group merging (Figure 3b), the value of the scalar projection onto the vector C2C1 (or C1C2) is used instead of the length of the segment, because it includes information about finding the point D2 (D1) relative to C1 (C2) containing in the sign of the number. Accordingly, in the case of merging groups, the ratio of such projections C1D2 and C1B1 will be less than zero. Thus, to interpret the value, two easily remembered thresholds, 0 and 1, are used.
Points C1 and C2 are calculated by taking the average value; points D1 and D2 are selected from the following conditions: vector C1D1 is the projection of vector C1A1 onto C1C2, where A1 is the boundary point of the first class with the smallest projection of C2A1 onto C2C1. D2 point is chosen similarly with respect to the second class. To assess the relative position of the first group relative to the second, it is proposed to use the following RP value (formula 1):
RP 12 = scalar   projection   of   C 1 A 2   on   C 1 C 2 scalar   projection   of   C 1 A 1   on   C 1 C 2
In case of separation of groups, the RP12 value is greater than 1. The larger is RP, the further the groups are located. In case of intersection, the RP value will be more than 0, but less than 1. A value less than 0 corresponds to the case of merging groups. This representation is intuitive and easy to use.
If the groups have a different level of significance, then the RP value can be calculated for the most significant group. If the groups have the same significance level, then use the smallest value from RP12 and RP21.
In an ideal case, when the boundaries of the groups set the boundary to only uniformly located points, the case of merger differs from the intersection by the number of points lying inside the intersection region. In case for crossing, the number of such points is greater than zero, but (approximately) less than half of the class volume. When the classes merge, more than approximately half of the points lie in the intersection area.
To extend the measure to N groups, the measure value for each pair of groups can be calculated and the average value taken (Formula 2):
RP = 1 0.5 N ( N 1 ) i = 1 N j = 1 , j i N min ( RP i j , RP j i )

2.4.5. Assessment of Discrimination Performance by Standard Metrics

Validity index, or total sensitivity, was calculated as number of correctly identified samples from the training dataset (determined using Mahalanobis distances between samples and their classes, marking samples matching more than one class as potentially misclassified) divided by the number of all samples [30]. The cross-validation procedure was performed automatically by Unscrambler X software.
Discriminant analysis (DA) and support vector machine (SVM) methods. Linear discriminant analysis (LDA), as the name suggests, produces a linear decision boundary between two classes. In this paper, the distance from the center of the groups is calculated using the Mahalanobis distance (LDA-Mah). This metric is only valid if the two classes have similar variance–covariance matrices, otherwise the pooled covariance matrix would be inaccurate. Quadratic discriminant analysis (QDA) is another distance metric similar to the LDA; however, the distance to each class is calculated using the sample variance–covariance matrix of each class. The boundary produced by QDA is a quadratic curve, which may consist of two separate sections of boundary lines [31]. SVM is a classification method based on statistical learning, wherein a function that describes a hyperplane for optimal separation of classes is determined. In SVMs the classification rule is determined by only a small number of training set samples called support vectors (SVs). A kernel function is used to map from the original space to the feature space and can represent many forms, thus providing the ability to handle nonlinear classification cases [32]. In this paper, the input space was transformed into feature space by means of the linear, 3rd degree polynomial and radial basis function (RBF) kernels.

3. Results

3.1. Discrimination of Honeys by Intrinsic Fluorescence

We studied 36 samples of honey, purchased from local supermarkets or from apiary farmers. The samples had different consistency and color, supposing different spectral features. The fluorescence spectra of all samples are shown in Figure 4a. All spectra are different. For an excitation wavelength of 320 nm, the intrinsic emission of honeys was observed in the range of 400–600 nm. The peak at 420 nm, which is present in all spectra, is generated by the material of the plate and was not taken into account in calculations, so the scores plots were calculated for 430–550 nm. To assess the possibility of discriminating honeys by their intrinsic fluorescence, a plot of PC1-PC2 was constructed (Figure 4b): several samples are well discriminated, while the others represent one large group. Thus, honeys can be divided into 11 groups according to their intrinsic fluorescence: some of those are individual honeys and other contain several unseparated samples.

3.2. Fluorophores Used in the Discrimination of Honeys

The method of “fluorescent eye” is based on the interaction of fluorophores with the components of the samples. Previously, we studied a number of fluorophores in this method and found those showing favorable results include metal complexes: 8-Ox-Zn and Ru(bpy)32+ [3,13]. In this study we also use a fluorescent dye thiazole orange (TO) that exhibits most intense emission being intercalated into DNA [33]. Moreover, DNA may form molecular complexes with marker compounds of the samples, bringing them to a closer proximity with the fluorophore.

3.2.1. Thiazole Orange (TO) as Fluorophore

Appreciable emission of TO in the region higher than 500 nm is only observed in the presence of DNA (Figure 5). When considering the whole emission spectrum, the best discrimination in terms of the number of discriminated groups was achieved by only adding DNA without the dye to the honey (11 groups, Table S1 in Supplementary Information). However, adding thiazole orange to that system and considering only the long-wave emission peak allowed for separating the honeys into 12 groups (Figure 6; for the selection of the TO: DNA ratio, see Table S1). These two systems (honey + DNA-2 by whole spectrum and honey + TO + DNA-2 by the long-wave peak) were chosen for further consideration; DNA-1 and DNA-2 without TO were also studied as blanks.

3.2.2. Metal Complexes as Fluorophores

We studied the complexes 8-Ox-Zn and Ru(bpy)32+ earlier and found them to be useful in solving classification problems of different samples [3]. Ru(bpy)32+ added to all honey samples discriminates them into 14 classes (Figure 7a). The distances between the classes are significantly higher compared to using 8-Ox-Zn which discriminates the samples into 9 classes (Figure 7b).

3.2.3. Reflection Photometry

Absorbance measurement is a technique alternative to fluorescence in array sensing. Some honey samples are noticeably colored, which offers a possibility to distinguish them by their own refection signal in visible region under illumination by a tungsten lamp. Samples with and without added Ru(bpy)32+ were studied in Camag visualizer, and the resulting images were treated to obtain the scores plots (Figure S2 in Supplementary Information). The number of discriminated groups did not exceed 5, which implies that photometry is inferior to fluorescence for discrimination purposes.

3.3. Discrimination Performance

In this study we tried to reach most complete discrimination of individual samples by adding fluorophores. When using that strategy, it is desirable to have a method of comparing fluorophores by their efficiency of discrimination. This is a challenging task when not all samples are discriminated. We counted the groups of points in the scores plots whose confidence ellipses did not intersect [14] to obtain the “number of groups” value (Table 1). However, in the case of plots similar to those shown in Figure 7 the number of groups does not give the correct information. For example, in Figure 7b many confidence ellipses intersect, for which reason these samples should be assigned to one group, while in fact many samples are discriminated pair-wise (for example, samples 11 and 22, 36 and 25 are clearly separated, though belonging to one large cluster of groups).

3.3.1. Discrimination Performance Estimation Methods Suggested in This Study

As can be seen from the above, it is complicated to assess the discrimination performance when the samples are not well separated, or only a part of the samples forms individual classes. In an attempt to overcome these difficulties, we have suggested two characteristics of discrimination performance.
The first one is based on counting the crossing number (CrN) of the confidence ellipses over the entire scores plot (Figure 1, the obtained CrN values are listed in Table 1). Lesser CrN values correspond to better performance. Probably, more precise results would be obtained if we estimated the overlap areas of the confidence ellipses, but such calculations would require sophisticated programming work. Another parameter is relative position (RP) of groups (Figure 2 and Figure 3). Higher RP values imply better discrimination. The RP values for the studied fluorophores are also shown in Table 1.

3.3.2. Estimation of Discrimination Performance by Standard Techniques

Total sensitivity (TS). This metric shows whether the point belongs to a given class and may serve as a measure of the classification performance. Having calculated the distances between each vector and each class, each sample was either assigned to a single class, marked as belonging to multiple classes or left unowned. A point was considered belonging to a class if the Mahalanobis distance DM from the point to the class was less than or equal to 4. Based on that information, TS values by Mahalanobis distance were calculated (Table 1), only counting samples belonging to a single class.
Discriminant analysis and support-vector machine. Linear discriminant analysis (LDA) is only applicable to classes with similar-looking covariance matrices; therefore, we used the discriminant analysis methods applicable for independent covariance matrix estimation: quadratic discriminant analysis (QDA) and LDA-Mah, based on the calculation of the Mahalanobis distances between classes. The support-vector machine (SVM) method is widely used to classify objects with different predefined algorithms; we used linear, 3rd degree polynomial, and radial basis function [34]. Discrimination performance of LDA-Mah and SVM methods was evaluated by calculating total sensitivity on the training dataset as a measure of correctly identified samples (TS, Section 2.4). Before proceeding to QDA or SVM, the data were decomposed into principle components. Table 1 presents the results of the sensitivity calculation for the test data set.

4. Discussion

Adding fluorophores to samples proved to be an efficient way of improving the discrimination performance using fluorescent fingerprinting when the samples are not intensely fluorescent or are not well discriminated by their intrinsic fluorescence [3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22]. The discrimination performance should be quantitatively characterized. The simplest intuitive way of counting the number of discriminated groups faces difficulties in case of a large number of non-discriminated classes (Section 3.3, Figure 4b and Figure 7). A recognized measure of accuracy is total sensitivity (TS) that characterizes the percentage of correctly assigned samples. As alternative characteristics of performance, we suggested the CrN and RP parameters. The first metric can be calculated manually, provided the confidence ellipses have been constructed; the second is calculated in several steps including centers, borders, and scalar projections calculation (implementation as Matlab function is given in the Supplementary Information).
Addressing the problem of selection of the most efficient fluorophore, we can see that Ru(bpy)32+ takes the top positions among other fluorophores, judging by CrN and RP parameters (CrN = 112, RP = 44), the number of groups, and many variants of TS (Table 1). Next efficient fluorophores are TO–DNA-1 and DNA-1 without the dye. The other fluorophores tend to show less favorable metrics than honey by intrinsic fluorescence, which implies that addition of fluorophores worsens the discrimination of samples.
To account for all metrics and compare the fluorophores quantitatively, we calculated the overall rating of fluorophores. To obtain the rating value, the individual places of fluorophores with respect to all metrics were summed up (Table 2). The best rating (minimum sum of places) is also with Ru(bpy)32+.
It is also useful to know how the calculated parameters relate to each other. As can be seen from Table 1, the highest correlations with coefficients of 0.71–0.89 are between the new parameters CrN and RP, as well as between them and TS by Mahalanobis distance. The total sensitivity of a classification method obtained for the training dataset essentially describes the degree of separation between classes, with higher separation providing more freedom to the classification method in choosing the inter-class threshold, eventually allowing for perfect discrimination. Since the CrN parameter roughly estimates the class overlap (assuming the classes to be sampled from multivariate normal distributions), while the RP metric directly measures the distance between classes (accounting for within-class spread), these three metrics can be considered similar. The overall rating also shows a considerable correlation with CrN (r = 0.87) and RP (r = 0.89). Other metrics show a lower correlation with CrN and RP (r = 0.5–0.7). On the whole, all considered metrics show moderate correlation with each other, which implies that the best way of choosing the most efficient discrimination should involve using the greatest possible number of metrics. The proposed parameters CrN and RP have shown their efficiency and require further evaluation when assessing the discrimination performance.

5. Conclusions

Intrinsic fluorescence of 36 honey samples did not allow for their complete discrimination using PCA. Adding various fluorophores to these samples seemingly improved the discrimination; however, a quantitative assessment of the discrimination performance was needed. The task was complicated by the incomplete separation of classes. We used multiple metrics for evaluation of the discrimination performance, including standard ones (total sensitivity calculated by various methods, the number of discriminated groups) and the parameters suggested in this study (crossing number of confidence ellipses, CrN, and relative position of classes, RP). Those allowed for choosing one of the fluorophores (tris-(2,2′-bipyridyl) dichlororuthenium(II)) as the most efficient. However, the classification model developed in this study for the honey samples was only used to demonstrate the feasibility of using the suggested CrN and RP parameters for the evaluation of the discrimination efficiency. Correlation coefficients between the metrics confirmed that they are essentially independent and can be used together or in combination with standard metrics (the overall rating of fluorophores can be applied for that purpose).

Supplementary Materials

The following are available online at https://www.mdpi.com/1424-8220/20/18/5351/s1, Figure S1: Fluorescence spectra of honey samples with added fluorophores: (a) intercalated thiazole orange, (b) Ru(bpy)32+. Figure S2. Scores plots of the visible region reflection images of honey samples in the 96-well plates. Table S1. The number of discriminated groups of points in the scores plots of 23 honey samples with added thiazole orange (TO) intercalated into DNA-2 (TO–DNA-2 fluorophore); standard formulas for the calculation of confidence ellipses; Matlab function for relative position (RP) calculation.

Author Contributions

Conceptualization, M.K.B., V.S.B., and A.V.G.; software, V.A.B.; investigation, E.A.R., V.S.B., E.V.S., and V.S.O.; interpretation and discussion, A.V.G. and I.N.K.; writing—original draft preparation, E.A.R. and V.S.B.; writing—review and editing, M.K.B. and I.N.K.; supervision, M.K.B.; funding acquisition, M.K.B. and V.A.B. All authors have read and agreed to the published version of the manuscript.

Funding

The work of E.A.R., V.S.O., E.V.S. and M.K.B. was supported by the Russian Science Foundation (grant No. 20-13-00330). The work of V.A.B. who developed the algorithms for class distance calculation (RP) was supported by the Ministry of Education and Science of the Russian Federation within the framework of state task No. 0778-2020-0005.

Acknowledgments

The authors thank Alexey L. Pomerantsev for discussion.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Sádecká, J.; Tóthová, J. Fluorescence spectroscopy and chemometrics in the food classification—A review. Czech J. Food Sci. 2007, 25, 159–173. [Google Scholar]
  2. Karoui, R.; Blecker, C. Fluorescence spectroscopy measurement for quality assessment of food systems—A review. Food Bioproc. Technol. 2011, 4, 364–386. [Google Scholar] [CrossRef]
  3. Rukosueva, E.A.; Dobrolyubov, E.; Goryacheva, I.Y.; Beklemishev, M.K. Discrimination of whiskies using an “add-a-fluorophore” fluorescent fingerprinting strategy. Microchem. J. 2019, 145, 397–405. [Google Scholar] [CrossRef]
  4. Han, J.; Cheng, H.; Wang, B.; Braun, M.S.; Fan, X.; Bender, M.; Huang, W.; Domhan, C.; Mier, W.; Lindner, T.; et al. A Polymer/Peptide Complex-Based Sensor Array That Discriminates Bacteria in Urine. Angew. Chem. Int. Ed. 2017, 56, 15246–15251. [Google Scholar] [CrossRef] [PubMed]
  5. Wang, B.; Han, J.; Bojanowski, N.M.; Bender, M.; Ma, C.; Seehafer, K.; Herrmann, A.; Bunz, U.H.F. Bunz, an Optimized Sensor Array Identifies All Natural Amino Acids. ACS Sens. 2018, 3, 1562–1568. [Google Scholar] [CrossRef] [PubMed]
  6. Han, J.; Wang, B.; Bender, M.; Seehafer, K.; Bunz, U.H.F. Poly(p-phenyleneethynylene)-based tongues discriminate fruit juices. Analyst 2017, 142, 537–543. [Google Scholar] [CrossRef]
  7. Han, J.; Bender, M.; Seehafer, K.; Bunz, U.H.F. Identification of White Wines by using Two Oppositely Charged Poly(p-phenyleneethynylene)s Individually and in Complex. Angew. Chem. Int. Ed. 2016, 55, 7689–7692. [Google Scholar] [CrossRef]
  8. Miranda, O.R.; You, C.-C.; Phillips, R.; Kim, I.-B.; Ghosh, P.S.; Bunz, U.H.F.; Rotello, V. Array-Based Sensing of Proteins Using Conjugated Polymers. J. Am. Chem. Soc. 2007, 129, 9856–9857. [Google Scholar] [CrossRef] [Green Version]
  9. Wu, D.; Schanze, K. Protein Induced Aggregation of Conjugated Polyelectrolytes Probed with Fluorescence Correlation Spectroscopy: Application to Protein Identification. ACS Appl. Mater. Interf. 2014, 6, 7643–7651. [Google Scholar] [CrossRef]
  10. Wu, Y.; Tan, Y.; Wu, J.; Chen, S.; Chen, Y.Z.; Zhou, X.; Jiang, Y.; Tan, C. Fluorescence Array-Based Sensing of Metal Ions Using Conjugated Polyelectrolytes. ACS Appl. Mater. Interf. 2015, 7, 6882–6888. [Google Scholar] [CrossRef]
  11. Han, J.; Ma, C.; Wang, B.; Bender, M.; Bojanowski, M.; Hergert, M.; Seehafer, K.; Herrmann, A.; Bunz, U.H.F. A Hypothesis-Free Sensor Array Discriminates Whiskies for Brand, Age, and Taste. Chem 2017, 2, 817–824. [Google Scholar] [CrossRef] [Green Version]
  12. Han, J.; Wang, B.; Bender, M.; Kushida, S.; Seehafer, K.; Bunz, U.H.F. Bunz, Poly(aryleneethynylene) Tongue Identifies Nonsteroidal Anti-inflammatory Drugs in Water: A Test Case for Combating Counterfeit Drugs. ACS Appl. Mater. Interf. 2017, 9, 790–797. [Google Scholar] [CrossRef] [PubMed]
  13. Divyanin, N.N.; Razina, A.V.; Rukosueva, E.A.; Garmash, A.V.; Beklemishev, M.K. Discrimination of 2-3-component mixtures of organic analytes by a “fluorescent tongue”. Microchem. J. 2017, 135, 48–54. [Google Scholar] [CrossRef]
  14. Divyanin, N.N.; Rukosueva, E.A.; Garmash, A.V.; Beklemishev, M.K. Recognition of Model Analyte Mixtures in the Presence of Blood Plasma Using a Mixture of Fluorophores (“Fluorescent Tongue”). J. Anal. Chem. 2018, 73, 1195–1201. [Google Scholar] [CrossRef]
  15. Sun, S.; Jiang, K.; Qian, S.; Wang, Y.; Lin, H. Applying Carbon Dots-Metal Ions Ensembles as a Multichannel Fluorescent Sensor Array: Detection and Discrimination of Phosphate Anions. Anal. Chem. 2017, 89, 5542–5548. [Google Scholar] [CrossRef]
  16. Kong, H.; Lu, Y.; Wang, H.; Wen, F.; Zhang, S.; Zhang, X. Protein Discrimination Using Fluorescent Gold Nanoparticles on Plasmonic Substrates. Anal. Chem. 2012, 84, 4258–4261. [Google Scholar] [CrossRef]
  17. Cao, Y.; Zhang, L.; Huang, X.; Xin, Y.; Ding, L. Discrimination of Metalloproteins by a Mini Sensor Array Based on Bispyrene Fluorophore/Surfactant Aggregate Ensembles. ACS Appl. Mater. Interf. 2016, 8, 35650–35659. [Google Scholar] [CrossRef]
  18. Köstereli, Z.; Scopelliti, R.; Severin, K. Pattern-based sensing of aminoglycosides with fluorescent amphiphiles. Chem. Sci. 2014, 5, 2456–2460. [Google Scholar] [CrossRef]
  19. Cao, Y.; Ding, L.; Hu, W.; Peng, J.; Fang, Y. A surfactant-modulated fluorescent sensor with pattern recognition capability: Sensing and discriminating multiple heavy metal ions in aqueous solution. J. Mater. Chem. A 2014, 43, 1–8. [Google Scholar] [CrossRef]
  20. Melnikov, A.G.; Melnikov, G.V.; Varezhnikov, A.S.; Efremova, V.V.; Kuenbaeva, V.R.; Dykin, V.S.; Sysoev, V.V. The fluorescencent multi-sensor array to selectively detect a presence of various heavy metals in aqua solutions. In Proceedings of the 2016 International Conference on Actual Problems of Electron Devices Engineering (APEDE), Saratov, Russia, 22–23 September 2016; pp. 1–3. [Google Scholar] [CrossRef]
  21. Abdel-Ghany, M.F.; Hussein, L.A.; Ayad, M.F.; Youssef, M.M. Youssef, Investigation of different spectrophotometric and chemometric methods for determination of entacapone, levodopa and carbidopa in ternary mixture. Spectr. Acta. A 2017, 171, 236–245. [Google Scholar] [CrossRef]
  22. Gitlina, A.Y.; Surkova, A.; Ivonina, M.V.; Sizov, V.V.; Petrovskii, S.K.; Legin, A.; Starova, G.L.; Koshevoy, I.O.; Grachova, E.V.; Kirsanov, D.O. Kirsanov. Cyclometalated Ir(III) complexes as tuneable multiband light sources for optical multisensor systems: Feasibility study. Dyes. Pigm. 2020, 180, 108428. [Google Scholar] [CrossRef]
  23. Siddiqui, A.J.; Musharraf, S.G.; Choudhary, M.I.; Rahman, A.-U. Application of analytical methods in authentication and adulteration of honey. Food Chem. 2017, 217, 687–698. [Google Scholar] [CrossRef] [PubMed]
  24. Lenhardt, L.; Bro, R.; Zeković, I.; Dramicanin, T.; Dramićanin, M.D. Fluorescence spectroscopy coupled with PARAFAC and PLS DA for characterization and classification of honey. Food Chem. 2015, 175, 284–291. [Google Scholar] [CrossRef] [PubMed]
  25. Dramicanin, T.; Lenhardt, L.; Zeković, I.; Dramićanin, M.D. Detection of Adulterated Honey by Fluorescence Excitation-Emission Matrices. J. Spectrosc. 2018, 8395212. [Google Scholar] [CrossRef] [Green Version]
  26. Karoui, R.; Dufour, E.; Bosset, J.-O.; De Baerdemaeker, J. The use of front face fluorescence spectroscopy to classify the botanical origin of honey samples produced in Switzerland. Food Chem. 2007, 101, 314–323. [Google Scholar] [CrossRef]
  27. Ali, H.; Khan, S.; Ullah, R.; Khan, B. Fluorescence fingerprints of Sidr honey in comparison with uni/polyforal honey samples. Europ. Food Res. Technol. 2020, 246, 1829–1837. [Google Scholar] [CrossRef]
  28. Ruoff, K.; Luginbühl, W.; Künzli, R.; Bogdanov, S.; Bosset, J.O.; Von Der Ohe, K.; Von Der Ohe, W.; Amadò, R. Authentication of the Botanical and Geographical Origin of Honey by Front-Face Fluorescence Spectroscopy. J. Agric. Food Chem. 2006, 54, 6858–6866. [Google Scholar] [CrossRef]
  29. Jacobsen, J.P.; Pedersen, J.B.; Hansen, L.F.; Wemmer, D.E. Site selective bis-intercalation of a homodimeric thiazole orange dye in DNA oligonucleotides. Nucl. Acids Res. 1995, 23, 753–760. [Google Scholar] [CrossRef] [Green Version]
  30. Parikh, R.; Mathai, A.; Parikh, S.; Sekhar, G.C.; Thomas, R. Understanding and using sensitivity, specificity and predictive values. Indian J. Ophthalmol. 2008, 56, 45–50. [Google Scholar] [CrossRef]
  31. Dixon, S.J.; Brereton, R.G. Comparison of performance of five common classifiers represented as boundary methods: Euclidean distance to centroids, linear discriminant analysis, quadratic discriminant analysis, learning vector quantization and support vector machines, as dependent on data structure. Chemom. Intell. Lab. Syst. 2009, 95, 1–17. [Google Scholar]
  32. Amari, S.-I.; Wu, S. Improving support vector machine classifiers by modifying kernel functions. Neural Netw. 1999, 12, 783–789. [Google Scholar] [CrossRef]
  33. Boger, D.L.; Tse, W.C. Thiazole Orange as the Fluorescent Intercalator in a High Resolution FID Assay for Determining DNA Binding Affinity and Sequence Selectivity of Small Molecules. Bioorg. Med. Chem. 2001, 9, 2511–2518. [Google Scholar] [CrossRef]
  34. Gumus, E.; Kilic, N.; Sertbaş, A.; Ucan, O.N. Evaluation of face recognition techniques using PCA, wavelets and SVM. Expert Syst. Appl. 2010, 37, 6404–6408. [Google Scholar] [CrossRef]
Scheme 1. Structures of fluorophores used in this study. (a) tris(2,2′-Bipyridyl)dichlororuthenium(II); (b) Thiazole orange; (c) 8-Hydroxyquinoline-5-sulfonic acid.
Scheme 1. Structures of fluorophores used in this study. (a) tris(2,2′-Bipyridyl)dichlororuthenium(II); (b) Thiazole orange; (c) 8-Hydroxyquinoline-5-sulfonic acid.
Sensors 20 05351 sch001
Figure 1. Numbers of intersections of confidence ellipses (crossing numbers, CrN) for various intersection patterns: (A) 2; (B) 4; (C) 4 (postulated).
Figure 1. Numbers of intersections of confidence ellipses (crossing numbers, CrN) for various intersection patterns: (A) 2; (B) 4; (C) 4 (postulated).
Sensors 20 05351 g001
Figure 2. Examples of relative positions of the datasets containing “red” and “green” classes: (a,b) separation, (c) intersection, (d) merging.
Figure 2. Examples of relative positions of the datasets containing “red” and “green” classes: (a,b) separation, (c) intersection, (d) merging.
Sensors 20 05351 g002
Figure 3. Key points (A1, A2, C1, C2, D1, D2) used in the calculation of relative position (RP) parameter characterizing the distance between the “green” and “red” classes: (a) separated classes, (b) merged classes.
Figure 3. Key points (A1, A2, C1, C2, D1, D2) used in the calculation of relative position (RP) parameter characterizing the distance between the “green” and “red” classes: (a) separated classes, (b) merged classes.
Sensors 20 05351 g003
Figure 4. (a) Fluorescence spectra of honey samples without added fluorophores, λex = 320 nm; (b) scores plots of these emission spectra.
Figure 4. (a) Fluorescence spectra of honey samples without added fluorophores, λex = 320 nm; (b) scores plots of these emission spectra.
Sensors 20 05351 g004
Figure 5. Fluorescence spectra of thiazole orange (TO) intercalated into DNA-2 (λex = 320 nm; the molar ratio of DNA-2 to TO is shown in the legend).
Figure 5. Fluorescence spectra of thiazole orange (TO) intercalated into DNA-2 (λex = 320 nm; the molar ratio of DNA-2 to TO is shown in the legend).
Sensors 20 05351 g005
Figure 6. The scores plots of the emission spectra of honeys with added thiazole orange intercalated into DNA-2 in a ratio of 1:53 (TO–DNA-2).
Figure 6. The scores plots of the emission spectra of honeys with added thiazole orange intercalated into DNA-2 in a ratio of 1:53 (TO–DNA-2).
Sensors 20 05351 g006
Figure 7. The scores plots of the emission spectra of 36 honey samples with added: (a) Ru(bpy)32+, (b) 8-Ox-Zn.
Figure 7. The scores plots of the emission spectra of 36 honey samples with added: (a) Ru(bpy)32+, (b) 8-Ox-Zn.
Sensors 20 05351 g007
Table 1. Performance characteristics in the discrimination of 36 honey samples and their correlation with the crossing number (CrN) and relative position (RP) values.
Table 1. Performance characteristics in the discrimination of 36 honey samples and their correlation with the crossing number (CrN) and relative position (RP) values.
FluorophoreTotal SensitivityNumber of Discriminated Groups #RPCrN
by Mahalanobis DistanceQDALDA-MahSVM LinSVM Polyn 3SVM Rad Bas Func
Ru(bpy)32+86.169729483861444112
TO–DNA-180.667758681831126112
DNA-180.669758383391226183
No fluorophores *80.672698381421130178
DNA-273.369696767331224208
8-Ox-Zn63.350477881751217192
TO–DNA-259.453588169721119239
r with CrN **0.790.490.540.700.690.490.420.711.00
r with RP ***0.830.670.440.620.670.190.701.000.71
* Honey without adding fluorophores. ** Correlation coefficient of the metric with CrN parameter. *** Correlation coefficient of the metric with RP parameter. # Number of groups discriminated in the scores plots using DM = 4 as criterium.
Table 2. Places of fluorophores by metric values (TS, CrN, RP etc.,) in the discrimination of 36 honey samples and their overall rating values of fluorophores calculated as the total sum of places by each metric.
Table 2. Places of fluorophores by metric values (TS, CrN, RP etc.,) in the discrimination of 36 honey samples and their overall rating values of fluorophores calculated as the total sum of places by each metric.
FluorophoreTotal SensitivityNumber of Discriminated Groups #RPCrNRating **
by Mahalanobis DistanceQDALDA-MahSVM LinSVM Polyn 3SVM rad bas Func
Ru(bpy)32+13311.51111.514
TO–DNA-1351.524263.51.528.5
DNA-1331.53.51.5633.5429
No fluorophores *314.53.54562332
DNA-2534.577735647.5
8-Ox-Zn67764337548
TO–DNA-276656466753
* Honey without adding fluorophores. ** The sum of places shown in this Table for each fluorophore, including CrN and RP. If two fluorophores showed the same result, for example, being in the 1st and 2nd place, the place shown for them was 1.5, and if three fluorophores were in 5, 6, and 7th place, they were all shown in the 6th place, etc. Green background denotes 1st or 1st–2nd place, blue—2nd–3rd–4th place, grey—the last place (7th) # See Table 1.

Share and Cite

MDPI and ACS Style

Rukosueva, E.A.; Belikova, V.A.; Krylov, I.N.; Orekhov, V.S.; Skorobogatov, E.V.; Garmash, A.V.; Beklemishev, M.K. Evaluation of Discrimination Performance in Case for Multiple Non-Discriminated Samples: Classification of Honeys by Fluorescent Fingerprinting. Sensors 2020, 20, 5351. https://doi.org/10.3390/s20185351

AMA Style

Rukosueva EA, Belikova VA, Krylov IN, Orekhov VS, Skorobogatov EV, Garmash AV, Beklemishev MK. Evaluation of Discrimination Performance in Case for Multiple Non-Discriminated Samples: Classification of Honeys by Fluorescent Fingerprinting. Sensors. 2020; 20(18):5351. https://doi.org/10.3390/s20185351

Chicago/Turabian Style

Rukosueva, Elizaveta A., Valeria A. Belikova, Ivan N. Krylov, Vladislav S. Orekhov, Evgenii V. Skorobogatov, Andrei V. Garmash, and Mikhail K. Beklemishev. 2020. "Evaluation of Discrimination Performance in Case for Multiple Non-Discriminated Samples: Classification of Honeys by Fluorescent Fingerprinting" Sensors 20, no. 18: 5351. https://doi.org/10.3390/s20185351

APA Style

Rukosueva, E. A., Belikova, V. A., Krylov, I. N., Orekhov, V. S., Skorobogatov, E. V., Garmash, A. V., & Beklemishev, M. K. (2020). Evaluation of Discrimination Performance in Case for Multiple Non-Discriminated Samples: Classification of Honeys by Fluorescent Fingerprinting. Sensors, 20(18), 5351. https://doi.org/10.3390/s20185351

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop