Next Article in Journal
Wearable Activity Trackers in the Management of Rheumatic Diseases: Where Are We in 2020?
Next Article in Special Issue
Characterization of Biodegraded Ignitable Liquids by Headspace–Ion Mobility Spectrometry
Previous Article in Journal
Design and Validation of Android Smartphone Based Wireless Structural Vibration Monitoring System
Previous Article in Special Issue
Analysis of Phenolic Content in Grape Seeds and Skins by Means of a Bio-Electronic Tongue
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Optimization of Sensors to be Used in a Voltammetric Electronic Tongue Based on Clustering Metrics

Sensors and Biosensors Group, Department of Chemistry, Universitat Autònoma de Barcelona, 08193 Barcelona, Spain
*
Author to whom correspondence should be addressed.
Sensors 2020, 20(17), 4798; https://doi.org/10.3390/s20174798
Submission received: 31 July 2020 / Revised: 20 August 2020 / Accepted: 21 August 2020 / Published: 25 August 2020
(This article belongs to the Special Issue Multisensor Systems for Chemical Analysis)

Abstract

:
Herein we investigate the usage of principal component analysis (PCA) and canonical variate analysis (CVA), in combination with the F factor clustering metric, for the a priori tailored selection of the optimal sensor array for a given electronic tongue (ET) application. The former allows us to visually compare the performance of the different sensors, while the latter allows us to numerically assess the impact that the inclusion/removal of the different sensors has on the discrimination ability of the ET. The proposed methodology is based on the measurement of a pure stock solution of each of the compounds under study, and the posterior analysis by PCA/CVA with stepwise iterative removal of the sensors that demote the clustering when retained as part of the array. To illustrate and assess the potential of such an approach, the quantification of paracetamol, ascorbic acid, and uric acid mixtures were chosen as the study case. Initially, an array of eight different electrodes was considered, from which an optimal array of four sensors was derived to build the quantitative ANN model. Finally, the performance of the optimized ET was benchmarked against the results previously reported for the analysis of the same mixtures, showing improved performance.

1. Introduction

Electronic tongues (ETs) are analytical systems based on the combination of an array of sensors with low-selectivity and/or cross-response features in order to obtain some added value in the generation of analytical information. These are coupled with advanced chemometric tools that allow the interpretation and extraction of meaningful data from the complex readings [1,2]. Thus, the selection of the sensor array that will comprise the ET is a key step that will highly influence the performance of the system [3].
Despite its importance, most of the papers dealing with ETs focus on the developed application itself or the data treatment stage, but very few report on the choice of the sensors. The challenge here arises on the a priori selection of the best combination of sensors that can carry out the desired qualitative or quantitative task given the difficulty to assess the cross-reactivity shown between them and the impact that this will have in the final model. In this direction, one common approach that has been taken in the case of potentiometric sensors is the inclusion of at least one or two ion-selective sensors (ISEs) towards the compounds of interest plus some generic ones; however, this does not guarantee that the optimal array is selected [1]. Another approach would be the complete sensor characterization, which consists in carrying out the multi-analyte calibration, from which the surface response plots are built and the binary selectivity coefficients are calculated (e.g., Reference [4]). However, this is a tedious task even for the two-analyte case, and it does become not feasible for more complex mixtures as it may require hundreds of samples. Furthermore, such a task becomes even more complex when, e.g., voltammetric sensors are used instead of potentiometric ones due to the higher complexity and dimensionality of its response. Alternatively, another approach that has been employed is the use of feature selection methods upon measurement of all the samples, and to carry out the a posteriori removal of the variables and/or sensors that do contribute less to the classification or quantification task (e.g., usage of genetic algorithms or other pruning methods [5]). Although this approach does probably provide the best outcome possible, it is also more tedious and requires instruments capable of simultaneously measuring a large number of channels.
In the same direction, we had also reported previously on the usage of the autocorrelation between the signals of the different sensors that form the sensor array as an objective criterion for the selection or removal of redundant sensors in voltammetric arrays [6,7]. Although it provides a measure of sensors’ response redundancy, it is purely based on the signal, but not on the cross-reactivity towards the analytes of interest. More recently, principal component analysis (PCA) taking the transposed data matrix has also been suggested as a guiding method to select the best sensing units to compose the array of an ET [8]. However, the authors themselves conclude that further experiments are required to confirm the potential of the method.
Therefore, the development of a simple methodology that allows the a priori selection of the optimal sensor array to carry out a specific application is of utmost interest. In this direction, herein we propose the usage of PCA in combination with some clustering metrics as a tool to carry out such selection. The former allows us to visually compare the performance of the different sensors, while the latter allows us to numerically assess the impact that the inclusion/removal of the different sensors does have in the discrimination ability of the ET towards the compounds of interest.
In order to demonstrate and illustrate such a procedure, the simultaneous quantification of paracetamol (PA), ascorbic acid (AA), and uric acid (UA) mixtures was chosen as the study case. These mixtures correspond to a common case in the pharmaceutical field where the determination of paracetamol in the presence of ascorbic acid is attempted. The latter is usually present as an excipient, whereas the inclusion of uric acid is motivated as some studies suggest that ascorbic acid intake is related to uric acid concentration in serum [9,10]. This particular case was chosen as this mixture has already been previously analyzed in our laboratories employing different sensors arrays, thus providing us with guidance on which performance could be expected and whether previous results could be improved or not by tailoring the electrode choice to particular cases.
In this direction, the present work aims to demonstrate the advantages derived from the tailored selection of the sensor array for each ET developed application. Upon measurement of stock solutions of the different active pharmaceutical ingredients (APIs), those were transformed by using PCA/CVA, which in combination with the F factor metric allowed the selection of the different sensors. Next, a quantitative model was built by means of artificial neural networks (ANNs) to achieve the simultaneous determination of the three APIs, the performance of which was benchmarked against previously reported ETs.

2. Materials and Methods

2.1. Reagents and Apparatus

All reagents were of analytical reagent grade and were used as received without any further purification. All the buffer solutions were prepared in ultrapure water (18.2 MΩ·cm) purified by a MilliQ System (Millipore, Billerica, MA, USA). Potassium hydrogenphosphate, potassium dihydrogenphosphate and sodium chloride, which were used for the preparation of the phosphate buffer, were purchased from Merck (Darmstadt, Germany).
The active pharmaceutical ingredients (APIs), paracetamol and uric acid, were purchased from Sigma-Aldrich (St. Louis, MO, USA), whereas ascorbic acid was purchased from Panreac Química SLU (Barcelona, Spain). For the modification and preparation of the electrodes, we used nanoparticles of bismuth (III) oxide, titanium (IV) oxide, zinc (II) oxide and tin (IV) oxide, plus cobalt (II) phthalocyanine and polypyrrole purchased from Sigma-Aldrich (St. Louis, MO, USA), while Prussian blue was obtained from Acros Organics (Geel, Belgium). Graphite powder (particle size < 50 µm) used for the construction of the electrodes was purchased from BDH Laboratory Supplies (Poole, UK), and Epotek H77 resin and the corresponding hardener were obtained from Epoxy Technologies (Billerica, MA, USA).
All the electrochemical measurements were carried out in a PGSTAT 30 Autolab potentiostat (EcoChemie, The Netherlands) with GPES 4.7 version software (EcoChemie). Voltammetric measurements were conducted using a conventional three-electrode cell configuration where a combined electrode (Crison 5261, Barcelona, Spain), made up of a metallic platinum wire and an Ag/AgCl electrode was used as both the auxiliary and reference electrode.

2.2. Sensor Array

An array of seven different modified graphite epoxy composite (GEC) electrodes was initially prepared to be evaluated as the working electrodes that will form the ET [11]. Briefly, for the construction of the working electrodes, first of all, a shaped copper disc was soldered to an electrical connector, and then introduced into a 6 mm internal diameter PVC tube, resulting in a cylindrical cavity. A paste was then made by mixing 15% of graphite powder, 2% of the specific modifier, and the epoxy resin and the hardener (in the ratio 20:3 w/w). Next, this paste is loaded into the cavity of the PVC tube and cured at 80 °C for 3 days. Afterwards, using emery papers of decreasing grain size, electrode surfaces were polished until a flat shiny surface appeared. One of the main advantages of such electrodes is that re-polishing of the surface using emery paper allows the regeneration of the electrode, recovering any loss of their response.
In this manner, seven different GEC electrodes, each of them modified with cobalt (II) phthalocyanine (CoPc), polypyrrole (PPy), Prussian blue (PB), oxide nanoparticles of bismuth (Bi2O3), titanium (TiO2), zinc (ZnO) and tin (SnO2), were prepared. Additionally, a Pt disc electrode was also prepared and used to complete the eight-sensor array. The Pt disc electrode was constructed by soldering a Pt wire (99.95% purity, 1 mm diameter) to an electrical connector and then introducing the connector into a PVC tube. The wire was then coated in epoxy resin (exposing only the wire cross-section) and cured at 80 °C for 3 days.
Such modifiers were selected taking into account previous studies with ETs. Prussian blue is well-known to be an electron mediator in the development of many biosensors, which has also been used in the development of ETs [12,13,14]. The usage of nanoparticles has emerged as an alternative to the respective bulk metals given its higher surface/mass ratio and improved electrochemical properties [13,15,16]. Similarly, conducting polymers such as polypyrrole have electrocatalytic and antifouling properties [17,18,19], while phthalocyanines are reported to be efficient electrocatalysts in the determination of many important inorganic, organic, or biological compounds [20,21]. Lastly, the use of bare metal electrodes corresponds to one of the more common choices for the development of voltammetric ETs [22,23,24].

2.3. Samples Preparation and Voltammetric Measurements

All APIs stock solutions were prepared in 0.05 M phosphate buffer at pH 7.0 with 0.1 M KCl as saline background/supporting electrolyte. For the optimization of the sensor array, 250 μM stock solution of each of the APIs were measured separately in order to investigate their electrochemical behavior with the different sensors. Upon selection of the optimal sensor array, calibration curves for the APIs were built and their analytical response was further characterized in terms of linearity, sensitivity, limit of detection (LOD), reproducibility, etc. To this aim, solutions of increasing concentration of each API were prepared from the stock solutions in phosphate buffer and measured under the below conditions.
For the simultaneous analysis of APIs mixtures, a set of samples consisting of mixtures of the three compounds were prepared. Samples were divided into two subsets: the training subset based on a tilted 33 factorial design (27 samples), which was used to build the quantitative model [25], and the testing subset with samples randomly distributed along the experimental domain (11 samples) that was used to assess the actual performance of the built models.
The electrochemical behavior of all the APIs and their mixtures was assessed by recording a complete cyclic voltammogram between −0.7 V and +1.2 V vs. Ag/AgCl with a step potential of 10 mV and a scan rate of 100 mV·s−1, without the application of any pre-conditioning potential or accumulation time. Furthermore, to avoid any fouling effect or drifts during the measurements, a blank measurement in phosphate buffer was carried out after each measurement.
These conditions were used for all the experiments, except during initial experiments for the selection of the sensor array, that in order to ensure that the potential window was wide enough to see any possible peak, voltammetric measurements were carried out in the range −1.5 V to +1.5 V, but keeping all the other conditions unaltered. Similarly, as a 6-channel multipotentiostat was employed, final measurements were carried simultaneously, but some of the initial measurements for sensor selection had to be carried sequentially in groups.

2.4. Chemometric Analysis

Data analysis was carried out in Matlab 7.1 (MathWorks, Natick, MA, USA) by specific routines developed by the authors using the Statistics, Wavelet, and Neural network toolboxes [26,27,28]. Final representation and analysis of the data were done with the aid of Sigmaplot (Systat Software Inc., San Jose, CA, USA).
Recorded voltammograms were first compressed with discrete wavelet transform (DWT), which allowed us to decrease the dimensionality of the data while preserving the relevant information [26]. Next, PCA and canonical variate analysis (CVA) were used for the qualitative analysis of the data and a tailored selection of the ET array. Finally, ANNs were used to build the simultaneous quantitation models of the ternary mixtures.
For the selection of the optimal sensor array, data was submitted to PCA and CVA, and the clustering observed was evaluated by means of the F factor [29,30,31]. The F factor is defined as the ratio of variances between different clusters and the sum of internal variance in all clusters (Equation (1)). It can be used as a standard procedure to measure the capability of a particular sensor to discriminate between different classes of samples in ET applications. Since the F factor compares the variability between classes to variation within classes, with increasing F values the discrimination between different classes becomes easier.
F = i = 1 k n i z i ¯ z = 2 k 1 i = 1 k j = 1 n i z j i z i ¯ 2 i = 1 k n i k
where k is the number of classes, i the number of following class, j the following number of the sample in i-th class, ni the number of samples in i-th class, and zji the sensor response for j-th sample in i-th class, and z i ¯ and z =   are the mean value of a sensor response in a particular class of samples and the mean value of sensor response for all samples, respectively. These can be defined as:
z i ¯ = j = 1 n i z j i n i
z = = i = 1 k j = 1 n i z j i i = 1 k n i

3. Results and Discussion

3.1. Selection of Sensors

The aim of this work is to demonstrate the advantages derived from the tailored selection of the optimal sensor array for each case when developing ET applications, and to assess whether this can be done a priori taking only single measurements of the stock of each of the compounds that we aim to analyze (Figure 1).
To this aim, we took as a study case the analysis of three different APIs (PA, AA and UA), and attempted to choose the most appropriate electrodes for their electrochemical analysis. When choosing the optimal electrodes, it has to be kept in mind how important the cross-responses of the different electrodes towards the analytes of interest and among them are in ET applications. That is, each electrode shows a differentiated response between each of the analytes, and the different sensors show a differentiated response between them, so that they jointly allow us to differentiate the different compounds [3].
In this direction, the first step was to assess the voltammetric responses of each of the electrodes towards the individual compounds. For doing so, five replicate samples of the 250 µM solutions of each of the APIs were prepared, and the voltammetric measurements were carried as described in Section 2.3. An extract of the responses obtained for the different electrodes is shown in Figure 2. As it can be seen, the voltammetric profiles of each of the sensors are found to be different, with all of them showing distinct responses for each of the compounds.
To objectively carry out the selection of the final sensor array for the quantitative task, the next step was the compression of the data with DWT and its analysis by means of PCA and CVA. Firstly, the 2D score plots were obtained, and from those the F factor was calculated. This process was done with the 8-sensor array and repeated by removing one sensor at a time, similar to what could be considered a leave-one-out process, but from the sensors’ side (Figure 1). Next, the F values for each of the iterations were compared, and the sensor that when excluded led to the higher increase of the F index was then discarded. This process was repeated, reducing one-by-one the selected sensors, until the F value decayed (Figure 3).
In this manner, from the eight sensors initially considered, the first one that was discarded was the one modified with SnO2 nanoparticles, which led to a significant improvement in the clustering (the F value increased from 4.19 to 5.19). As stated, this process was repeated, taking the remaining seven sensors as reference for the F value, and leaving out again one sensor at a time for the calculations; the Bi2O3 nanoparticle modified sensor was discarded this time. In the next iteration, TiO2 modifier was discarded as no significant increase/decrease in the F value was observed. Thus, it was considered that its inclusion/exclusion does not demote the performance of the system. Lastly, the cobalt(II) phtalocyanine modified sensor was also discarded, whereas in the next iteration no further sensors were removed, as even the exclusion of the one with a larger F value implied a decrease of this parameter. In this manner, the selected sensor array was formed by four electrodes: a metallic Pt sensor and GECs modified with ZnO nanoparticles, Prussian blue, and polypyrrole, and the maximum F value achieved was 5.75.
The resulting PCA score plot obtained with the reduced sensor array is shown in Figure 4B, with an accumulated explained variance of ca. 82.3 %; a large value that reflects how most of the variance contained in the original data is now summarized with only these two coordinates (PCs). More importantly, we can observe how clear clusters are obtained for each of the compounds and how easily these can be distinguished from each other. Hence, based on the voltammetric profiles and the PCA score plot, we can say that the initial selection of sensors based on the F factor calculation seems to be a satisfactory step towards the electrochemical quantification of the three APIs. Figure 4A, on the other hand, shows the departing point with the eight-sensors array, and where the clustering is just preliminarily worked out. As an additional comment, it is also evident how the optimization of the array is also able to amend the drift content incorporated in the original set of sensors.
Lastly, the analysis of the loadings biplot is not very significant in this case as what we aim to evaluate is not only which features are more relevant, but which improve more the discrimination capabilities of the ET. However, it is quite well known that since PCA is focusing only on the data variance, external factors, e.g., drift on the sensors response, can dominate it. Therefore, such information cannot be only obtained from the scores and loadings plot, but from the F factor. Moreover, the large number of variables registered when voltammetric sensors are used (hundreds for each of the sensors) makes the interpretation of the loadings plot difficult/cumbersome. However, a numerical inspection of the loadings contribution from the magnitude of its vector allowed us to confirm that in both cases all the sensors contribute significantly to the obtained scores plot, with most of the loadings very close to the correlation circle: ca. 81% of them with a magnitude as big as 0.6 times the one of the loading with the highest contribution for (A), and 72% for (B). In terms of sensors, the number of variables for each of them that contribute to those percentages are 15 to 30% for (B), and 2.2 to 17% for (A) (data not shown).

3.2. Characterization of the Analytical Response

Upon selection of the reduced sensor array, the next step was the characterization of the voltammetric responses of each of the electrodes towards the individual APIs. On the one side, we wanted to confirm that, actually, different sensitivity was shown by the different sensors, as well as assess its linear range to restrict the experimental domain for the quantitative experiment. On the other side, we wanted to also assess sensors’ repeatability, as this is a critical parameter when working with ETs, which require performing a large number of consecutive measurements.

3.2.1. Calibration Curves

Prior to building the quantitative model for the mixtures’ analysis, the linear range and sensitivity of the different electrodes were evaluated. Individual calibration curves were built for each of the selected four sensors towards each of the APIs. To do so, stocks of increasing concentration within 0 to 500 µM for paracetamol and uric acid, and 0 to 2000 µM for ascorbic acid were prepared and measured as described in Section 2.3. From the recorded voltammograms, the maximum peak height was taken and plotted against its concentration (data not shown). From those, the fitted regression equations are summarized in Table 1. As represented by the large value of the coefficient of determination (R2), it can be concluded that good linearity was obtained within the above ranges for all the APIs and sensors. Hence, those same ranges were selected for the final quantitative model. Moreover, we can confirm how different sensitivities (calibration slopes) are obtained for each of the sensors and compounds, a situation that we aimed for with the selection of the sensor array (i.e., to avoid having redundant sensors or sensors that do not contribute to the discrimination of the different compounds).

3.2.2. Stability Measurements

Similarly, since building the quantitative model requires a considerable number of consecutive measurements with the sensor array, it is important to assess whether the repeatability within consecutive measurements for all the sensors is good enough. In this direction, a sample containing 100 µM of each of the APIs was prepared, and the performance of the sensor was evaluated by calculating the relative standard deviation (RSD) obtained after 18 consecutive measurements without changing the sample, equipment, and analyst.
Before the sample measurement, a blank buffer measurement was taken, repeating this cycle up to 18 times. In this manner, we could ensure that no loss of signal was obtained from the measurement of the sample mixture, while the measurement of the blank allowed us to ensure that no drifts on the baseline were being observed. After each cycle, cleaning of the electrode surfaces was carried out in phosphate buffer by recording a cyclic voltammogram under the same conditions. RSD% values of all the sensors were found to be below 4%, indicating good stability of the electrode responses. More specifically, the repeatability expressed as the percentage of RSD over 18 consecutive measurements of blank/standard cycles for the different electrodes forming the final array were: 2.4% for ZnO, 1.99% for PB, 2.4% for PPy and 3.6% for the Pt sensor.

3.3. Quantitative Analysis of APIs Mixtures

Upon selection of the optimal sensor array for the analysis of paracetamol, ascorbic acid, and uric acid, the next step was to evaluate the performance of such an array to achieve the simultaneous determination of their mixtures. To this aim, the set of samples described in Section 2.3 were measured under the same conditions as in previous experiments, recording a complete cyclic voltammogram for each of the electrodes (Figure 5). As could be expected from Figure 2, although a different voltammetric response is obtained for each of the considered compounds individually, there is a clear overlap when mixtures of those are analyzed simultaneously. Therefore, not being possible to achieve its quantification via univariate regression, requiring the aid of chemometric models; an approach that is possible thanks to the cross-response shown by the selected sensors, i.e., the different sensitivity shown by each of the electrodes towards each of the compounds (Table 1). This is a desirable condition for the proper performance of any multisensory array analysis system.
However, before building the quantification model, and especially if ANNs are to be used [32], a preprocessing step to reduce the high dimensionality of the data is required. The benefits derived from such a step are particularly critical with ANNs, as this contributes to preventing the under-determination problem encountered with an oversized ANN with excessively complex data, while it also reduces significantly the time and memory required for its modelling. Moreover, in general, it also leads to models with better performance and generalization ability as it avoids redundancy in the input data and reduces their complexity with the risk of overfitting [26]. In our case, this compression was achieved by means of DWT, using the Daubechies wavelet mother function and a fourth decomposition level. This allowed us to reduce the initial 1696 data points (currents corresponding to 424 polarization potentials × 4 sensors) down to 132 coefficients, representing a reduction of 92.2% without any loss of relevant information (r > 0.99 and fc > 0.95 in the compressed vs. original signal comparison).
For the selection of the neural network topology, a systematic study was carried out in which the number of neurons in the hidden layer, as well as the transfer functions between the input and hidden layer and between the hidden and output layers, were varied. The different ANN models were built employing the data of the training subset, and the selection of the optimal one was chosen from the comparison of the performance towards the testing subset. This data division helped to ensure that more unbiased data was obtained and to detect possible models that were being over-fitted, and consequently to better assess the accuracy of the model. The final ANN architecture had 132 neurons in the input layer (corresponding to the DWT coeffs., 33 × 4 sensors), 6 neurons and purelin transfer function in the hidden layer, and 3 neurons (one for each of the analytes) and satlins transfer function in the output layer.
The comparison graphs of predicted vs. expected concentrations as well as the fitted linear regressions for the three determined species for the chosen ANN model are shown in Figure 6. As can be seen, a very satisfactory trend is obtained for all the cases, with regression lines almost overlapping or very close to the ideal ones. In order to better evaluate the goodness of the comparison, the regression parameters were also calculated and are summarized in Table 2. The obtained values are very close to the ideal values of slope (1), intercept (0), and correlation coefficient (1).
In this direction, to ensure that the former are statistically within the confidence intervals of the calculated regression parameters, the joint confidence intervals were calculated and plotted (Figure 7), as this allows to rapidly detect whether there are or not differences between the actual and predicted values at a certain significance level, judging simultaneously the goodness of slope and intercept [33]. In this case, the ideal point (1,0) is within the ellipsoidal confidence intervals for the three species, both for the training and testing subset, which allows us to state that there are no significant differences between the actual concentration and the values predicted by the model.
Despite the results for the training subset are more precise than those for the testing one, remarkable accuracy is obtained in both cases. The higher confidence intervals for the testing subset correspond to the usual behavior, which can be explained by two factors. On the one side, those samples are not used at all during the modelling stage, and consequently represent a more realistic metric of the model performance. On the other side, the lower number of samples of the testing subset in comparison to the training one results in higher tabulated t and F values that ultimately lead to higher confidence intervals. Similarly, the larger concentration range for ascorbic acid also results in a larger uncertainty for the intercept.
Finally, in order to benchmark the performance of the current proposed ET which is based on an optimized sensor array, its performance is compared to the one reported in previous works in which the same mixtures were analyzed (Table 2) [12,23,34]. To this aim, the root mean square error (RMSE) and its normalization (NRMSE) were also calculated to obtain a global metric of the system performance. Although some differences in the performance might be due to the different data treatment employed (e.g., PLS vs ANNs), it can be ruled out that a significant improvement has been achieved with the reduced sensor array. It can be seen that the smallest total NRMSE is obtained in the work reported here, but also, as could be expected from this, the improvement in the slopes and correlation coefficients is also highly significant.

4. Conclusions

The application of a simple methodology for the selection of the optimal voltammetric sensor array prior to carrying out a quantitative application has been demonstrated. The proposed approach is based on the combination of PCA/CVA to assess the cross-response of the different sensors with the F factor to numerically carry out the selection of the sensors from the scores plot.
To illustrate the potential of the methodology, the discrimination and quantification of three different APIs have been demonstrated. In this work, we initially selected an array of eight different electrodes, and from those, only four sensors were selected for the quantitative application. Next, the performance to carry out the quantitative determination of the three APIs was attempted by building a DWT-ANN model. The performance of the model was very satisfactory, and huge improvement was observed when benchmarked against other reported ETs attempting the quantification of the same mixtures. This confirms the potential advantages derived from the current approach, which allows the a priori selection of the potential best sensor array based on its cross-response features, the implicit reduction of the number of sensors (with the added advantage of instrumentation simplicity), and an improvement in the modelling performance (without requiring a posteriori pruning of the most relevant sensors).
Nevertheless, despite the good performance shown here, it has to be considered that there are many other clustering indexes, and that those are not universal. Therefore, future work has to focus on the comparison between different indexes and the suitability for different applications.

Author Contributions

Conceptualization, M.d.V.; methodology, M.d.V. and X.C.; software, X.C.; investigation, M.S. and N.R.; writing—original draft preparation, X.C. and M.S.; writing—review and editing, M.d.V.; funding acquisition, M.d.V. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Spanish Ministry of Science and Innovation, MCINN (Madrid) through project PID2019-107102RB-C21C. Manel del Valle thanks the support from Generalitat de Catalunya through the program ICREA Academia. Munmi Sarma thanks the support of the Government of Catalonia Secretariat for Universities and Research of the Ministry of Economy and Knowledge.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Del Valle, M. Electronic Tongues Employing Electrochemical Sensors. Electroanalysis 2010, 22, 1539–1555. [Google Scholar] [CrossRef] [Green Version]
  2. Ciosek, P.; Wroblewski, W. Sensor arrays for liquid sensing—Electronic tongue systems. Analyst 2007, 132, 963–978. [Google Scholar] [CrossRef] [PubMed]
  3. Legin, A.; Kirsanov, D.; del Valle, M. Avoiding nonsense in electronic taste sensing. TrAC Trend. Anal. Chem. 2019, 121, 115675. [Google Scholar] [CrossRef]
  4. Mimendia, A.; Legin, A.; Merkoçi, A.; del Valle, M. Use of Sequential Injection Analysis to construct a potentiometric electronic tongue: Application to the multidetermination of heavy metals. Sensor Actuat. B Chem. 2010, 146, 420–426. [Google Scholar] [CrossRef]
  5. Dias, L.G.; Veloso, A.C.A.; Sousa, M.E.B.C.; Estevinho, L.; Machado, A.A.S.C.; Peres, A.M. A novel approach for honey pollen profile assessment using an electronic tongue and chemometric tools. Anal. Chim. Acta 2015, 900, 36–45. [Google Scholar] [CrossRef] [Green Version]
  6. Cetó, X.; Apetrei, C.; del Valle, M.; Rodríguez-Méndez, M.L. Evaluation of red wines antioxidant capacity by means of a voltammetric e-tongue with an optimized sensor array. Electrochim. Acta 2014, 120, 180–186. [Google Scholar] [CrossRef]
  7. Cetó, X.; González-Calabuig, A.; Capdevila, J.; Puig-Pujol, A.; del Valle, M. Instrumental measurement of wine sensory descriptors using a voltammetric electronic tongue. Sens. Actuators B Chem. 2015, 207 Pt B, 1053–1059. [Google Scholar] [CrossRef] [Green Version]
  8. Giacometti, J.A.; Shimizu, F.M.; Carr, O.; Oliveira, O.N. A Guiding Method to Select and Reduce the Number of Sensing Units in Electronic Tongues. In Proceedings of the 2016 IEEE SENSORS, Orlando, FL, USA, 30 October–3 November 2016; pp. 1–3. [Google Scholar]
  9. Huang, H.-Y.; Appel, L.J.; Choi, M.J.; Gelber, A.C.; Charleston, J.; Norkus, E.P.; Miller, E.R., III. The effects of vitamin C supplementation on serum concentrations of uric acid: Results of a randomized controlled trial. Arthritis Rheum. 2005, 52, 1843–1847. [Google Scholar] [CrossRef]
  10. Juraschek, S.P.; Miller, E.R., III; Gelber, A.C. Effect of oral vitamin C supplementation on serum uric acid: A meta-analysis of randomized controlled trials. Arthrit. Care Res. 2011, 63, 1295–1306. [Google Scholar] [CrossRef] [Green Version]
  11. Alegret, S.; Alonso, J.; Bartrolí, J.; Cespedes, F.; Martínez Fàbregas, E.; del Valle, M. Amperometric biosensors based on bulk-modified epoxy graphite biocomposites. Sens. Mater. 1996, 8, 147–153. [Google Scholar]
  12. Ortiz-Aguayo, D.; Bonet-San-Emeterio, M.; del Valle, M. Simultaneous Voltammetric Determination of Acetaminophen, Ascorbic Acid and Uric Acid by Use of Integrated Array of Screen-Printed Electrodes and Chemometric Tools. Sensors 2019, 19, 3286. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Medina-Plaza, C.; García-Hernández, C.; de Saja, J.A.; Fernández-Escudero, J.A.; Barajas, E.; Medrano, G.; García-Cabezón, C.; Martin-Pedrosa, F.; Rodriguez-Mendez, M.L. The advantages of disposable screen-printed biosensors in a bioelectronic tongue for the analysis of grapes. LWT Food Sci. Technol. 2015, 62, 940–947. [Google Scholar] [CrossRef] [Green Version]
  14. Paixão, T.R.L.C.; Bertotti, M. Fabrication of disposable voltammetric electronic tongues by using Prussian Blue films electrodeposited onto CD-R gold surfaces and recognition of milk adulteration. Sensor Actuat. B Chem. 2009, 137, 266–273. [Google Scholar] [CrossRef]
  15. Metters, J.P.; Kadara, R.O.; Banks, C.E. New directions in screen printed electroanalytical sensors: An overview of recent developments. Analyst 2011, 136, 1067–1076. [Google Scholar] [CrossRef]
  16. Sarma, M.; del Valle, M. Improved Sensing of Capsaicin with TiO2 Nanoparticles Modified Epoxy Graphite Electrode. Electroanalysis 2020, 32, 230–237. [Google Scholar] [CrossRef]
  17. Arrieta, A.A.; Apetrei, C.; Rodriguez-Mendez, M.L.; de Saja, J.A. Voltammetric sensor array based on conducting polymer-modified electrodes for the discrimination of liquids. Electrochim. Acta 2004, 49, 4543–4551. [Google Scholar] [CrossRef]
  18. Tsakova, V.; Seeber, R. Conducting polymers in electrochemical sensing: Factors influencing the electroanalytical signal. Anal. Bioanal. Chem. 2016, 408, 7231–7241. [Google Scholar] [CrossRef]
  19. Domínguez, R.B.; Moreno-Barón, L.; Muñoz, R.; Gutiérrez, J.M. Voltammetric electronic tongue and support vector machines for identification of selected features in Mexican coffee. Sensors 2014, 14, 17770–17785. [Google Scholar] [CrossRef]
  20. Rodriguez-Mendez, M.L.; García-Hernandez, C.; Medina-Plaza, C.; García-Cabezón, C.; Saja, J.A.d. Multisensor systems based on phthalocyanines for monitoring the quality of grapes. J. Porphyr. Phthalocya. 2016, 20, 889–894. [Google Scholar] [CrossRef]
  21. Medina-Plaza, C.; de Saja, J.A.; Rodriguez-Mendez, M.L. Bioelectronic tongue based on lipidic nanostructured layers containing phenol oxidases and lutetium bisphthalocyanine for the analysis of grapes. Biosens. Bioelectron. 2014, 57, 276–283. [Google Scholar] [CrossRef] [Green Version]
  22. Winquist, F.; Wide, P.; Lundström, I. An electronic tongue based on voltammetry. Anal. Chim. Acta 1997, 357, 21–31. [Google Scholar] [CrossRef]
  23. Gutés, A.; Calvo, D.; Céspedes, F.; del Valle, M. Automatic sequential injection analysis electronic tongue with integrated reference electrode for the determination of ascorbic acid, uric acid and paracetamol. Microchim. Acta 2007, 157, 1–6. [Google Scholar] [CrossRef]
  24. Cetó, X.; Pérez, S. Voltammetric electronic tongue for vinegar fingerprinting. Talanta 2020, 219, 121253. [Google Scholar] [CrossRef]
  25. Cetó, X.; Céspedes, F.; Pividori, M.I.; Gutiérrez, J.M.; del Valle, M. Resolution of phenolic antioxidant mixtures employing a voltammetric bio-electronic tongue. Analyst 2012, 137, 349–356. [Google Scholar] [CrossRef]
  26. Cetó, X.; Céspedes, F.; del Valle, M. Comparison of methods for the processing of voltammetric electronic tongues data. Microchim. Acta 2013, 180, 319–330. [Google Scholar] [CrossRef]
  27. The MathWorks Inc. Statistics and Machine Learning Toolbox Toolbox: User’s Guide; MathWorks: Natick, MA, USA, 2020. [Google Scholar]
  28. Beale, M.H.; Hagan, M.T.; Demuth, H.B. Neural Network Toolbox: User’s Guide; MathWorks: Natick, MA, USA, 2012. [Google Scholar]
  29. Ciosek, P.; Brzózka, Z.; Wróblewski, W. Classification of beverages using a reduced sensor array. Sensor Actuat. B Chem. 2004, 103, 76–83. [Google Scholar] [CrossRef]
  30. Kutyła-Olesiuk, A.; Zaborowski, M.; Prokaryn, P.; Ciosek, P. Monitoring of beer fermentation based on hybrid electronic tongue. Bioelectrochemistry 2012, 87, 104–113. [Google Scholar] [CrossRef]
  31. Sorvin, M.; Belyakova, S.; Stoikov, I.; Shamagsumova, R.; Evtugyn, G. Solid-Contact Potentiometric Sensors and Multisensors Based on Polyaniline and Thiacalixarene Receptors for the Analysis of Some Beverages and Alcoholic Drinks. Front. Chem. 2018, 6, 134. [Google Scholar] [CrossRef] [Green Version]
  32. Despagne, F.; Massart, D.L. Neural networks in multivariate calibration. Analyst 1998, 123, 157R–178R. [Google Scholar] [CrossRef]
  33. Cetó, X.; Voelcker, N.H.; Prieto-Simón, B. Bioelectronic tongues: New trends and applications in water and food analysis. Biosens. Bioelectron. 2016, 79, 608–626. [Google Scholar] [CrossRef]
  34. Cetó, X.; Gutés, A.; del Valle, M. Simple data preprocessing method for voltammetric electronic tongues using artificial neural networks. Acta Manilana 2013, 61, 39–49. [Google Scholar]
Figure 1. Schematic representation of the methodology followed for the a priori selection of the optimal sensor array. Briefly, stock solutions of each of the analytes are measured with all the considered sensors, obtaining a voltammogram for each of them. Next, those are submitted to PCA/CVA, and the clustering is evaluated by means of the F factor. This is repeated, leaving out of the analysis each of the sensors of the array (one at a time), and the one that leads to the higher improvement is removed. The whole process is repeated until a decrease in the F factor is observed after discarding one of the sensors. Finally, with the selected sensor array, the quantitative application is carried out.
Figure 1. Schematic representation of the methodology followed for the a priori selection of the optimal sensor array. Briefly, stock solutions of each of the analytes are measured with all the considered sensors, obtaining a voltammogram for each of them. Next, those are submitted to PCA/CVA, and the clustering is evaluated by means of the F factor. This is repeated, leaving out of the analysis each of the sensors of the array (one at a time), and the one that leads to the higher improvement is removed. The whole process is repeated until a decrease in the F factor is observed after discarding one of the sensors. Finally, with the selected sensor array, the quantitative application is carried out.
Sensors 20 04798 g001
Figure 2. Voltammograms obtained for the three APIs (250 µM in phosphate buffer) using the GECs modified with (A) SnO2, (B) Prussian Blue, (C) ZnO, (D) PPy, (E) CoPc, (F) TiO2 and (G) Bi2O3, and (H) the metallic Pt electrode.
Figure 2. Voltammograms obtained for the three APIs (250 µM in phosphate buffer) using the GECs modified with (A) SnO2, (B) Prussian Blue, (C) ZnO, (D) PPy, (E) CoPc, (F) TiO2 and (G) Bi2O3, and (H) the metallic Pt electrode.
Sensors 20 04798 g002aSensors 20 04798 g002b
Figure 3. (Top) Color map of the variation of the F values after iterative exclusion of the different electrodes; the Y-axis shows the sensor being left out for the calculation of the F factor and the X-axis shows the successive iterations for the selection of the less significant sensor. (Bottom) Bar plot of the changes of the F values after exclusion of the sensor that leads to the biggest F value at each iteration. In both cases, the color of the plot codifies the F factor values as per the color bar. Iteration 5 does not produce any further improvement, and then the process is stopped.
Figure 3. (Top) Color map of the variation of the F values after iterative exclusion of the different electrodes; the Y-axis shows the sensor being left out for the calculation of the F factor and the X-axis shows the successive iterations for the selection of the less significant sensor. (Bottom) Bar plot of the changes of the F values after exclusion of the sensor that leads to the biggest F value at each iteration. In both cases, the color of the plot codifies the F factor values as per the color bar. Iteration 5 does not produce any further improvement, and then the process is stopped.
Sensors 20 04798 g003
Figure 4. Score plot obtained from the DWT-PCA of five replicas of individual APIs using (A) the eight-sensor array or (B) the selected four-sensor array. () Buffer, () paracetamol, () ascorbic acid and () uric acid. Ellipses plotted correspond to 95% confidence limits for each of the clusters.
Figure 4. Score plot obtained from the DWT-PCA of five replicas of individual APIs using (A) the eight-sensor array or (B) the selected four-sensor array. () Buffer, () paracetamol, () ascorbic acid and () uric acid. Ellipses plotted correspond to 95% confidence limits for each of the clusters.
Sensors 20 04798 g004
Figure 5. Representative voltammograms obtained for certain arbitrary mixtures of the different APIs (the concentration for each compound is indicated in the legend) with the four-sensor selected array: GECs modified with (A) ZnO, (B) PPy and (C) Prussian Blue, and (D) the metallic Pt electrode.
Figure 5. Representative voltammograms obtained for certain arbitrary mixtures of the different APIs (the concentration for each compound is indicated in the legend) with the four-sensor selected array: GECs modified with (A) ZnO, (B) PPy and (C) Prussian Blue, and (D) the metallic Pt electrode.
Sensors 20 04798 g005
Figure 6. Modeling ability of the optimized DWT-ANN. Comparison graphs of obtained vs. expected concentrations for (A) paracetamol, (B) ascorbic acid, and (C) uric acid, for both the training (●, solid line) and testing subsets (○, dotted line). The dashed line corresponds to the ideal comparison line (y = x).
Figure 6. Modeling ability of the optimized DWT-ANN. Comparison graphs of obtained vs. expected concentrations for (A) paracetamol, (B) ascorbic acid, and (C) uric acid, for both the training (●, solid line) and testing subsets (○, dotted line). The dashed line corresponds to the ideal comparison line (y = x).
Sensors 20 04798 g006
Figure 7. Joint confidence intervals for the three species: (●, solid line) paracetamol, (, dash-dotted line) ascorbic acid, and (, dashed line) uric acid, and both for (A) training and (B) testing subsets. Also, the ideal point (1,0) is plotted (x); intervals are calculated at the 95% confidence level.
Figure 7. Joint confidence intervals for the three species: (●, solid line) paracetamol, (, dash-dotted line) ascorbic acid, and (, dashed line) uric acid, and both for (A) training and (B) testing subsets. Also, the ideal point (1,0) is plotted (x); intervals are calculated at the 95% confidence level.
Sensors 20 04798 g007
Table 1. Calibration data (y vs. x) for the individual calibrations of paracetamol, ascorbic acid, and uric acid employing the final sensors in the array.
Table 1. Calibration data (y vs. x) for the individual calibrations of paracetamol, ascorbic acid, and uric acid employing the final sensors in the array.
Electrode modifier ZnOPrussian BluePolypyrroleMetallic Pt
ParacetamolEquationy = 0.0598x + 0.613y = 0.0629x + 10.2y = 0.0857x + 29.4y = 0.0027x + 1.08
R20.99710.99930.99890.9589
LOD1 (μM)28.413.917.259.4
Ascorbic acidEquationy = 0.0152x + 0.989y = 0.0261x + 6.93y = 0.0304x + 21.0y = 0.0015x + 0.634
R20.99770.99900.99870.9624
LOD1 (μM)59.368.577.5417
Uric acidEquationy = 0.0452x + 1.12y = 0.0499x + 10.8y = 0.0668x + 31.0y = 0.0028x + 1.02
R20.99880.99830.99950.9967
LOD1 (μM)17.923.59.3528.3
1 Calculated from three times the standard error of the regression.
Table 2. Reported values and current results of the fitted regression lines for the comparison between obtained vs. expected values for the different sets of samples and the three considered APIs. Intervals are calculated at the 95% confidence level.
Table 2. Reported values and current results of the fitted regression lines for the comparison between obtained vs. expected values for the different sets of samples and the three considered APIs. Intervals are calculated at the 95% confidence level.
CompoundSlopeIntercept (μM)R2RMSE1 (μM)Total NRMSE1Sensor ArrayRef.
training subset (n = 33)Bare GEC plus metallic Pt and Au electrodes[23]
Paracetamol0.942 ± 0.03132 ± 210.96822
Ascorbic acid0.933 ± 0.04036 ± 250.9472
Uric acid0.873 ± 0.04658 ± 250.9232
testing subset (n = 15)
Paracetamol0.895 ± 0.10582 ± 710.84822
Ascorbic acid0.919 ± 0.08165 ± 410.9082
Uric acid0.871 ± 0.138−8 ± 860.7532
training subset (n = 33)Bare GEC plus metallic Pt and Au electrodes[34]
Paracetamol0.981 ± 0.03213 ± 240.992290.0257
Ascorbic acid0.990 ± 0.0316 ± 170.99325
Uric acid0.981 ± 0.0279 ± 160.99423
testing subset (n = 15)
Paracetamol0.990 ± 0.143−2 ± 800.945970.101
Ascorbic acid1.009 ± 0.136−28 ± 780.95266
Uric acid0.992 ± 0.20836 ± 1250.89173
training subset (n = 27)SPCEs modified with CoPc, PB, graphite and CuO[12]
Paracetamol1.000 ± 0.0820 ± 250.962291.00
Ascorbic acid1.000 ± 0.0890 ± 250.95531
Uric acid1.000 ± 0.1040 ± 310.94036
testing subset (n = 12)
Paracetamol1.021 ± 0.219−13 ± 280.915321.03
Ascorbic acid1.073 ± 0.422−3 ± 540.76271
Uric acid1.044 ± 0.334−32 ± 360.82944
training subset (n = 27)GECs modified with ZnO, PB, and PPy plus Pt metallic electrodeThis work
Paracetamol0.996 ± 0.0060.9 ± 1.90.99982.430.00378
Ascorbic acid0.999 ± 0.0041.1 ± 4.60.99995.86
Uric acid0.996 ± 0.0041.1 ± 1.20.99991.64
testing subset (n = 11)
Paracetamol1.021 ± 0.1349 ± 360.97126.40.0368
Ascorbic acid1.017 ± 0.049−20 ± 570.99631.2
Uric acid0.999 ± 0.0961 ± 270.98416.2
1 RMSE: root mean square error; NRMSE: normalized root mean square error; 2 data not available; GEC: graphite epoxy composite; SPCE: screen printed carbon electrode.

Share and Cite

MDPI and ACS Style

Sarma, M.; Romero, N.; Cetó, X.; Valle, M.d. Optimization of Sensors to be Used in a Voltammetric Electronic Tongue Based on Clustering Metrics. Sensors 2020, 20, 4798. https://doi.org/10.3390/s20174798

AMA Style

Sarma M, Romero N, Cetó X, Valle Md. Optimization of Sensors to be Used in a Voltammetric Electronic Tongue Based on Clustering Metrics. Sensors. 2020; 20(17):4798. https://doi.org/10.3390/s20174798

Chicago/Turabian Style

Sarma, Munmi, Noelia Romero, Xavier Cetó, and Manel del Valle. 2020. "Optimization of Sensors to be Used in a Voltammetric Electronic Tongue Based on Clustering Metrics" Sensors 20, no. 17: 4798. https://doi.org/10.3390/s20174798

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop