1. Introduction
The new concept of using an array of ion-selective electrodes instead of individual ones in combination with multivariate data-processing techniques emerged for the first time in the 1980s [
1,
2]. This approach allows the simultaneous quantification of several compounds in multicomponent media even when the selectivity of the available sensors is not sufficient. Furthermore, for sensors to be usable in the arrays, they do not need to be highly selective. The main requirements are sensitivity to the analytes of interest and cross-sensitivity [
1,
3]. Later on these multisensor systems have been named taste sensors or electronic tongues (ET) since their functioning principle mimics the sensory system of mammals, primarily olfaction and gustation [
4,
5,
6,
7]. Though several types of chemical sensors, such as voltametric, mass and optical ones, were used for the development of the electronic tongue, potentiometric chemical sensors remain the most common [
6].
The performance of the ET can be tuned by varying the number and types of sensors in the array and by data-processing methods [
8]. Thus, the composition of the sensor array gains particular importance, and sensors should be carefully selected depending on the analytical task. In the first studies on the multisensor systems, sensor arrays were constructed by including one or two sensors that were selective towards each of the analytes plus one or more generic (non-selective or cross-sensitive) ones. Another approach consists of the characterization of several sensors in a set of multicomponent solutions containing all analytes of interest at different concentration levels, followed by the calculation of the multivariate calibration model and variable selection [
9]. This approach is not always feasible in practice as it would result in measuring the responses of dozens of sensors in dozens or hundreds of complex mixtures, depending on the number of analytes. Thus, sensor selection is usually performed empirically by analyzing sensor responses in the individual solutions of analytes or, in the case of classification tasks, by trial and error, with neither method ensuring optimized sensor array composition.
To address this question, several attempts to formalize sensor selection for the electronic tongue have been proposed recently. Several works propose the use of Principal Component Analysis (PCA) for the simultaneous assessment of sensitivity and the reproducibility of sensors, permitting the selection of the most suitable ones for particular analytical tasks [
10,
11,
12]. Measurements with several sensors are carried out in the individual solutions of analytes with varying or equal concentrations, and the PCA model is calculated using sensor responses. In 2020, Sarma et al. [
10] proposed a visual examination of the PCA scores and loading plots for selecting sensors discriminating between samples. In other studies [
11,
12,
13], PCA was employed in combination with different clustering indices calculated using PCA scores, such as F factor, Dunn, Davies–Bouldin, Silhouette, and Calinski–Harabasz. This approach has the undisputable advantage of simplicity as it is based on a small number of measurements, relatively simple data-processing procedures, and a straightforward criterion for sensor selection. Though it does not always ensure the selection of the optimum sensor array, it is useful for indicating the most discriminating and cross-sensitive sensors. While this approach has proved to be useful for impedimetric [
10] and voltametric sensors [
11,
13], its applicability to potentiometric sensors is questionable, as estimating the cross-sensitivity of the latter without measurements in mixed solutions is not possible.
A sensor selection method developed specifically for potentiometric sensors consisting of the calculation of simulated sensor responses in mixed solutions and selection of the optimum sensor array configuration using a genetic algorithm and/or Fisher information criterion was proposed by Sibug-Torres et al. in 2019 [
14,
15]. Simulated sensor responses were calculated using the general equation for mixed-ion response involving monovalent and divalent ions for ion exchange and ionophore-based potentiometric sensor membranes [
16]. These works rely on extensive libraries of potentiometric sensor characteristics, including sensitivities and selectivity coefficients, which are available in the literature. The advantage of using a simulated dataset is the possibility of generating sensor responses in a large number of mixed solutions using sensor parameters, including sensitivity and selectivity coefficients that are determined in the course of sensor characterization. The approach proposed in [
14,
15] afforded optimum sensor array configurations with simulated data; however, its efficiency was not confirmed using experimental data.
The present study aims to develop a method for sensor selection for the electronic tongue using a simulated dataset. As a case study, the quantification of paralytic shellfish toxins (PSTs) using potentiometric chemical sensors was chosen. PSTs are a group of phytotoxins produced by some species of marine and freshwater phytoplankton that provoke paralytic shellfish poisoning in humans [
17]. The accumulation of PSTs in filter-feeding bivalves can occur during the proliferation of toxic phytoplankton or harmful algal blooms (HABs) [
18]. PSTs comprise more than 60 compounds sharing a tetrahydropurine ring (
Table 1) [
19] but with different substitutions at positions N1 (R1 side chain), C11 (R2 and R3 side chains), and C13 (R4 side chain). Structures of three PST groups (carbamoyl, decarbamoyl, and N-sulfocarbamoyl), classified according to their R4 side chain, are shown in
Table 1 [
20].
Specific toxin profiles observed in bivalves depend primarily on the toxin-producing phytoplankton but also on the bivalve species. The dinoflagellate
Gymnodinium catenatum, which is prevalent along the Atlantic coast of Portugal and Spain, the Gulf of Mexico, Venezuela, Chile, and Argentina, produces a toxin profile essentially characterized by N-sulfocarbamoyl group PSTs [
21]. In contrast, the dinoflagellate
Alexandrium minutum, common in Northern Europe, including the UK, Norway and Iceland, mainly produces carbamoyl PSTs [
22]. A similar carbamoyl profile is observed in
Alexandrium catenella [
23,
24]. Additionally, certain bivalve species, such as
Spisula solida, produce enzymes capable of hydrolyzing PSTs transforming carbamoyl and N-sulfocarbamoyl toxins into decarbamoyl analogs [
20,
25].
In our previous work, a series of potentiometric chemical sensors with solid inner contact and plasticized polyvinylchloride (PVC) membranes containing different ionophores were developed for the detection of three PSTs commonly found in Portuguese waters: dcSTX, GTX5 and C1+2 [
26]. However, developed sensors displayed cross-sensitivity to all three toxins and low selectivity, making simultaneous quantification of individual PSTs challenging. Taking advantage of sensor cross-sensitivity, our group developed, for the first time, an electronic tongue based on six potentiometric sensors for simultaneous quantification of four PSTs in model solutions and bivalve extracts [
27]. However, the reduced accuracy of quantitation of N-sulfocarbamoyl toxins (GTX5 and C1+2) remained an issue as the sensors exhibited low selectivity to these toxins in the presence of dcSTX. Moreover, the electronic tongue was developed for only the three most prevalent toxins of the Portuguese coast. In the present study, a range of potentiometric chemical sensors including those developed earlier plus new compositions were characterized in the solutions of eight PSTs representative of
G. catenatum and
A. minutum toxin profiles. The first group of toxins included dcSTX, GTX5, GTX6, C1+2, dcGTX2+3 and dcNEO, while the second included STX and GTX2+3. Sensor parameters, sensitivity and selectivity coefficients, were used for calculating simulated sensor responses in mixed toxin solutions. By applying Lasso regularization to the simulated data set, sensor selection for quantifying two groups of toxins was carried out aiming to minimize the quantification error and the number of sensors in the array. The optimization results were validated using experimental data, i.e., sensor responses measured in the mixed solutions of STX and GTX2+3, and dcSTX, GTX5, GTX6, and C1+2.
4. Conclusions
A simple and rapid methodology for a priori selection of the optimized array of potentiometric chemical sensors has been described. The proposed method employs simulated sensor responses in multicomponent solutions and utilizes Lasso regularization. Simulated sensor responses were calculated using sensor parameters, slopes of the electrode function and selectivity coefficients, determined in individual analyte solutions. The use of simulated data allows to streamline the optimization process eliminating the labor, time and resource-consuming step of making measurements with the sensor array in a large number of mixed solutions.
Quantification of PSTs corresponding to two widespread toxin profiles in bivalves has been selected as a case study. From the initial array of eight sensors, reduced sensor arrays for the quantification of four and two toxins representative of the two profiles, respectively, and total sample toxicity were selected. The sensor selection sought to minimize errors in the toxin concentration quantification while simultaneously reducing the number of sensors in the array. Since Lasso regularization produces a range of models with different numbers of non-zero regression coefficients, it was possible to select a sensor array configuration that represents a compromise between quantification errors and the number of sensors.
The potential of the proposed methodology was evaluated using experimental data measured in mixed solutions of two groups of toxins. Improved performance of the a priori optimized sensor array was observed for almost all studied toxins and total toxicity, with the exception of GTX5 and GTX6. Overall, the proposed methodology was demonstrated to be simple and efficient in selecting sensors for the electronic tongue.