Application of MOS Gas Sensors Coupled with Chemometrics Methods to Predict the Amount of Sugar and Carbohydrates in Potatoes

Five potato varieties were studied using an electronic nose with nine MOS sensors. Parameters measured included carbohydrate content, sugar level, and the toughness of the potatoes. Routine tests were carried out while the signals for each potato were measured, simultaneously, using an electronic nose. The signals obtained indicated the concentration of various chemical components. In addition to support vector machines (SVMs that were used for the classification of the samples, chemometric methods, such as the partial least squares regression (PLSR) method, the principal component regression (PCR) method, and the multiple linear regression (MLR) method, were used to create separate regression models for sugar and carbohydrates. The predictive power of the regression models was characterized by a coefficient of determination (R2), a root-mean-square error of prediction (RMSEP), and offsets. PLSR was able to accurately model the relationship between the smells of different types of potatoes, sugar, and carbohydrates. The highest and lowest accuracy of models for predicting sugar and carbohydrates was related to Marfona potatoes and Sprite cultivar potatoes. In general, in all cultivars, the accuracy in predicting the amount of carbohydrates was somewhat better than the accuracy in predicting the amount of sugar. Moreover, the linear function had 100% accuracy for training and validation in the C-SVM method for classification of five potato groups. The electronic nose could be used as a fast and non-destructive method for detecting different potato varieties. Researchers in the food industry will find this method extremely useful in selecting the desired product and samples.


Introduction
The potato plant is one of the most important crops grown in the world. It is a rich source of nutrients, such as carbohydrates and protein. This product is native to South America and originated in Peru [1]. In the food industry, potatoes have a variety of applications, including baked potatoes, fried potatoes, potato crisps, potato starch, and dried fried potatoes [2].
As expectations for high-quality foods increase, it is increasingly important to determine their specifications correctly, efficiently, and purposefully. After harvesting and The olfactory machine is a system that differs in structure and approach from other methods, such as image processing and neural networking, and it is very effective in determining the quality of a variety and its classification. Moreover, olfactory systems can be used for most agricultural products, as they are characterized by odor [14,15].
Today, olfactory systems are automated, non-destructive, and cost-effective. They are ideally used for routine control and quality assurance in the food industry and for products that are related to the food industry. This technology was developed to facilitate automatic inspections using odor-based techniques; an electronic nose is a non-destructive method for measuring qualitative parameters [4].
In the present study, an electronic nose (e-nose) is used in combination with the SVM method to classify five potato cultivars. Three calibration methods-PCR, MLR, and PLSRare also employed to predict the sugar and carbohydrate content. This approach has not been reported in any previous studies on the prediction of sugar and carbohydrate contents of different products.

Sample Preparation
Five potato varieties were prepared, including Agria, Sprite, Sante, Marfona, and Jelly. The data measured were the sugar and carbohydrate contents and the toughness. At the same time, the samples were evaluated with the electronic nose to determine their volatile content.

Electronic Nose Instrument
An electronic nose's sensor array detects and differentiates between complex aromas of VOC emissions from biological systems. This analytical device is composed of a set of sensors, in an array, that react to volatile gases or vapors produced and released by the vegetative tissues of a biological sample. Each sensor in the sensor array responds to the VOC components that are present in a sample analyte and produces sensor outputs as electrical signals. The sensor responses are then sent to a computer system that uses multivariate data analysis methods to distinguish between the differences in the sensor data responses to the VOCs detected in the plant sample headspace, enabling unknown aromas to be classified and identified.
The electronic nose used in this study was manufactured by the Biosystems Engineering Department at Mohaghegh Ardabili University ( Figure 1). The detection system consists of sensors placed inside the measurement chamber. Commercial sensors, such as metal oxide sensors (MOSs), are the most common ones available for the detection of volatile organic compounds; nine metal oxide semiconductor sensors (MOSs) have been tested [4,16]. The names of the sensors and the main applications are MQ3 (alcohol), MQ4 (urban gases and methane), MQ8 (hydrogen), MQ9 (CO and combustible gas), MQ135 (steam ammonia, benzene, and sulfide), MQ136 (sulfur dioxide), TGS813 (CH4, C3H8, and C4H10), TGS822 (steam organic solvents), and TGS2620 (alcohol and steam organic solvents). Sensors of this type have different selectivity abilities and sensitivities, but when placed in a sensor matrix, they produce a unique chemical image of the gas mixture (or "fingerprint"). The obtained data are transferred to a data acquisition system that is responsible for processing the digital signals.

Sampling
Several potato samples of each variety (two to four potatoes) were placed in a sample container for 2 h. Each sample was tested immediately after harvest, so that the conditions for performing the tests were the same. Each potato cultivar was tested on one day. Then, the sample chamber was saturated with the odor of potatoes, and data collection was performed [17]. The data were collected by the olfactory machine in such a way that clean air was initially passed through the sensor chamber for 100 s to remove other odors. The odors of the samples were then sucked out of the specimen chamber by a pump for 100 s. then directed to the sensors. Finally, to prepare the samples for further data collection, clean air was injected into the sensor chamber for 100 s. In response to the potato odors, the output voltage of the sensors was changed and data were recorded at one-second intervals and stored in the computer. Equation (2) was used to correct the baseline to eliminate noise and possible deviations, as well as to normalize the sensors' response [18]: where Y S (t) is the normalized response, X S (0) is the baseline, and X S (t) is the sensor response.
7, x FOR PEER REVIEW 4 of 20 Figure 1. Schematic of an artificial olfactory (e-nose) system. The components of this system consist of the following parts: (a) air filter (activated charcoal to remove ambient-air VOC hydrocarbons), (b) sample headspace chamber, (c) solenoid air valves, (d) diaphragm pump, (e) e-nose sensor array chamber, (f) data acquisition recorder and wireless transmission card, and (g) personal computer (PC).

Sampling
Several potato samples of each variety (two to four potatoes) were placed in a sample container for 2 h. Each sample was tested immediately after harvest, so that the conditions for performing the tests were the same. Each potato cultivar was tested on one day. Then, the sample chamber was saturated with the odor of potatoes, and data collection was performed [17]. The data were collected by the olfactory machine in such a way that clean air was initially passed through the sensor chamber for 100 s to remove other odors. The odors of the samples were then sucked out of the specimen chamber by a pump for 100 s. then directed to the sensors. Finally, to prepare the samples for further data collection, clean air was injected into the sensor chamber for 100 s. In response to the potato odors, the output voltage of the sensors was changed and data were recorded at one-second intervals and stored in the computer. Equation (2) was used to correct the baseline to eliminate noise and possible deviations, as well as to normalize the sensors' response [18]: (1) Figure 1. Schematic of an artificial olfactory (e-nose) system. The components of this system consist of the following parts: (a) air filter (activated charcoal to remove ambient-air VOC hydrocarbons), (b) sample headspace chamber, (c) solenoid air valves, (d) diaphragm pump, (e) e-nose sensor array chamber, (f) data acquisition recorder and wireless transmission card, and (g) personal computer (PC).

Pattern Recognition System
The final and most important element of the electronic nose system is a pattern recognition function that assigns a series of received signals to one of the pattern classes or predicts concentrations using an appropriate mathematical calibration model. The most common models include the multiple linear regression (MLR), principal component regression (PCR), and partial least squares regression (PLSR) models. A number of explanatory and dependent variables are used to construct these models; calibration methods aim to build models that allow the quantification of traits or characteristics based on explanations of variables [19].

Sugar and Carbohydrate Extraction
The carbohydrate content (µg/mL) of the samples was extracted using a Nano spectrophotometer (Nanodrop) with a volume of 1000 µL and a standard glucose curve and cuvette [20]. The standard curve had a coefficient of determination of 0.9955 and the relationship was y = 0.0031x − 0.0211. For each sample, 30 replicates of data were collected and the wavelength of absorbance was used to calculate the amount of carbohydrate (by replacing the wavelength with the standard curve). In the 30 replicates, the sugar content of each sample was determined using a liquid refractometer, model BPTR100. This involved placing the potato juices on a refractometer at room temperature and measuring their sugar content in Brix. See [4] for further explanation.

Determination of Toughness
The toughness of the samples was determined using a proprietary tension/compression testing machine (Universal Testing Machine/STM 20, Santam Company, Tehran, Iran), which was equipped with a Bongshin load cell (model DBBP-100; Korea) with a capacity of 100 kg. First, the samples were separated from the potatoes by a cutting cylinder (15 mm long and 12 mm in diameter). The compressive tests were performed in a way that the potato samples in their most stable state were loaded quasi-elastically between two parallel plates and compressed under pre-set conditions until rupture occurred. Each potato variety was loaded at 10, 40, and 70 mm/min in seven replicates. When a sudden decrease in force occurred, as the rupture point was detected, the loading was stopped; after performing each experiment, the force-deformation curve and its related data were saved in Excel Software. Then, based on the rupture force, deformation, and specimen size, the toughness was calculated using the following equation [21]: where KIc is the toughness (MPa), E is the rupture energy (J), d is the deformation (mm), r is the specimen radius (mm), and h is the specimen height (mm).

Analysis of Variance
The obtained values for the sugar, carbohydrate, and toughness content of the five potato cultivars were analyzed using MSTATC software [Michigan State University, USA]. The statistical analyses were conducted using a completely randomized factorial test. The means were compared using Duncan's multiple range test at an 0.01 p-value level.

Chemometric Analysis
Chemometrics uses multivariate statistics to extract useful information from complex analytical data. To model and explain the relationship between odor data and chemical properties (sugar, carbohydrate, and toughness), multivariate calibration models using the PLSR, MLR, and PCR methods were used in this study. A one-variable calibration includes only one dependent variable and one independent variable. In this study, x was considered as the independent variable (output data from the sensors) and y was considered as the dependent variable (samples measured in the laboratory, i.e., percentage of sugar, carbohydrate, and toughness). The simplest condition is determined as follows: The coefficients "a and b" can be calculated using the partial least squares regression method. In this model, "b 0 " is the intercept and the coefficient, "a" is the beta coefficient, and "e" is the calibration error due to the measuring of the response error. This model is called the classical model. The one-variable method is used when the sample is simple and free from any disturbance. The calibration methods include the following univariate methods: partial least square regression (PLSR), multiple linear regression (MLR), and principal component regression (PCR).
The MLR model establishes a linear relationship between a dependent variable (y) and a set of multiple explanatory variables (x). This model can be used when the number of variables is less than the number of samples and there is a poor relationship between them. Otherwise, it is impossible to estimate the regression coefficient "a" using the least-squares method [19]. The MLR model is used to calculate the sensor regression coefficients to minimize the total squared deviations (gradient descent) between the predicted quality indicators and the actual quality indicators [21].
The PCR model reduces the number of explanatory variables by selecting several principal components (PCs) instead of the main variables. The main idea of this method is to establish the relationship between the PCs and the expected characteristic of the sample. It is possible to apply this method in two stages. The first stage involves identifying the principal components using principal component analysis (PCA). Thus, we obtain unrelated matrices of variables. In the second Partial least squares regression (PLSR), also known as PLS, is a new type of multivariate statistical analysis method. It is the most common method for developing multidimensional calibration models. It can process linear data and reduce the number of calibration samples required by the gold standard in chemistry [22]. When the dependent variables have a higher linear correlation, PLS can be more effective. PLS is a two-line model based on the matrix x (independent variables) and Y (dependent variables), which can be considered as external and internal relations [21].
A support vector machine (SVM) is a supervised learning model with a classification algorithm based on statistical theory. An SVM can nonlinearly plot data that cannot be linearly separated in a small space into a large space, using a kernel function. A hyperplane is created in a high dimensional space to maximize the distance between two classes and classify data in the high dimensional space. Radial basis functions (RBFs) are the most common kernel functions that lead to a high classification function. Since an SVM minimizes structural risks, it is considered a good classifier for nonlinear data and small samples [14,23].
In this study, the results from nine sensors were used as independent variables x, with sugar, carbohydrates, and toughness as dependent variables y. Unscrambler X 10.4 software (Unscrambler version X 10.4, CAMO, Trondheim, Norway) was used. First, PCR and PLSR analyses were performed; then, after determining the optimal parameters, PCR and PLSR models were written using the MLR model, considering the optimal parameters as independent variables.

Model Performance
To quantify the predictive power of the models, the coefficient of determination (R 2 ) and the root-mean-square error (RMSE) were calculated. In this experiment, the performance of the models was evaluated using the cross-validation technique. This technique is usually used when a data set is small and limited. In using this technique, a single observation is selected from all the items as a validation item, and the predictive performance is extracted from the other data. This process is repeated for all the observations, so that all the samples are used once as a validation item [24]. Thus, the model is calibrated with as many samples as possible. In addition, all data are used for model calibration and validation.

Sugar and Carbohydrate Content of Potato Varieties
The sugar and carbohydrate content and the toughness of five different potato varieties were measured using a refractometer, the Schiegel method [4], and an STM device. The related data were analyzed using Mstatc software. The results are presented in Table 1: ANOVA results for the sugar and carbohydrate content and toughness of the five different potato varieties was significant at 0.01, with the following coefficients of variation: 0.27%, 7.67%, and 2.28%, respectively. Table 2 illustrates the average Brix sugar (gram of sugar per 100 g of solution), carbohydrates (µg/mL), and toughness (MPa) of the potato varieties. According to Table 2, the Sprite variety had the highest sugar content (8.15 Brix) and the Agria and Jeli varieties recorded the lowest sugar content, with 6.18 and 6.12 Brix, respectively. The highest carbohydrate contents were observed in the Sprite, Sante, and Jelly varieties, with 276.7, 266, and 237 µg/mL, respectively, while the lowest values were recorded in the Marfona and Agria varieties. At a loading rate of 10 mm/min, the Marafona variety showed the highest value for toughness. According to the results, the Agria and Sprite varieties showed lower toughness. This may be explained by the fact that the higher the carbohydrate content in a variety, the crispier and less firm the variety. The letters a-f describe significant differences between the obtained means. Means with the same letters do not differ significantly. V1, V2 and V3 describe loading speeds of 10, 40, and 70 mm per minute, receptively.
The sugar content of different varieties varies because starch (the main ingredient in potatoes) is hydrolyzed differently as the plants respire. The lower the starch content of a variety, the less sugar it contains. In addition, the chemical composition varies depending on the potato variety, the soil, the climate, and agronomic factors. Generally, it is believed that potatoes with more sugar are suitable for the French fry industry, while potatoes with medium sugar content are suitable for deep frying [24]. The highest amount of carbohydrates was observed in three potato cultivars, Jelly, Sante, and Sprite, and the Sprite cultivar had the highest amount of sugar. The difference in sugar content of different types of potatoes is due to the difference in the hydrolysis of starch, which is caused by the respiration of the product. The lower the starch content in a cultivar, the lower the sugar.
Gumul, et al. [25], measured the sugar content of five different potato varieties. They found that the lower the sugar content of the different potato varieties, the lower the quality of the product, because at high temperatures the sugar reacts with maillard, leading to the formation of potential substances that are hazardous to human health. Figure 2 demonstrates the output response of the sensors to the odor of the potato samples. Radar diagrams were used to observe the differences in patterns (fingerprints) between different potato cultivars. The average output data of electronic nose sensors, during 100 s of measurements, were plotted as a radar diagram following normalization using Equation (1). Using this diagram, it is possible to visualize the difference between the response patterns of the sensors to the odor of each potato cultivar. According to the radar graph, there is a little similarity in the fingerprints of different potato cultivars. Accordingly, the highest odor is related to the Sprite, Sante, and Jelly cultivars. These three cultivars also had the highest carbohydrate content. It is likely that the reason for the greater odor of these three cultivars is their higher carbohydrate content. The gas sensors MOS, TGS813, and MQ135 showed the highest response to sample odor, while the sensors MQ136, MQ4, and MQ9 showed the lowest response.

Response Sensors
that potatoes with more sugar are suitable for the French fry industry, while potatoes with medium sugar content are suitable for deep frying [24]. The highest amount of carbohydrates was observed in three potato cultivars, Jelly, Sante, and Sprite, and the Sprite cultivar had the highest amount of sugar. The difference in sugar content of different types of potatoes is due to the difference in the hydrolysis of starch, which is caused by the respiration of the product. The lower the starch content in a cultivar, the lower the sugar.
Gumul, et al. [25], measured the sugar content of five different potato varieties. They found that the lower the sugar content of the different potato varieties, the lower the quality of the product, because at high temperatures the sugar reacts with maillard, leading to the formation of potential substances that are hazardous to human health. Figure 2 demonstrates the output response of the sensors to the odor of the potato samples. Radar diagrams were used to observe the differences in patterns (fingerprints) between different potato cultivars. The average output data of electronic nose sensors, during 100 s of measurements, were plotted as a radar diagram following normalization using Equation (1). Using this diagram, it is possible to visualize the difference between the response patterns of the sensors to the odor of each potato cultivar. According to the radar graph, there is a little similarity in the fingerprints of different potato cultivars. Accordingly, the highest odor is related to the Sprite, Sante, and Jelly cultivars. These three cultivars also had the highest carbohydrate content. It is likely that the reason for the greater odor of these three cultivars is their higher carbohydrate content. The gas sensors MOS, TGS813, and MQ135 showed the highest response to sample odor, while the sensors MQ136, MQ4, and MQ9 showed the lowest response.  The gas sensors, MOS, TGS813, and MQ135, showed the highest response to sample odor, while the sensors MQ136, MQ4, and MQ9 showed the lowest response.

Prediction of Quality Parameters of Carbohydrate and Sugar Based on PLSR and PCR
The relationship between electronic nose signals and the prediction of sugar and carbohydrate content was described by the PLSR and PCR models. The performance of the PLSR and PCR models for classification prediction was evaluated using R2 and RMSE. The root-mean-square error validation (RMSEval) of the root was chosen as a numerical tool to select the optimal model ( Table 3). The highest accuracy of prediction of the sugar and carbohydrate parameters was related to the Marfona cultivar. Both models, PCR and PLSR, were able to predict the amounts of sugar and carbohydrates with an accuracy of about 0.90, with only one optimal factor. Both models were very accurate in predicting the amount of carbohydrates and sugars.

Prediction of Quality Parameters of Carbohydrate, Sugar and Toughness Based on MLR
The multiple linear regression (MLR) method lies between the PLSR and PCR methods in terms of mean squared error; it was used to build the PLSR and PCR models. After analyzing and determining the optimal factors, the PCR and PLSR models were considered as independent variables and the MLR model was used. The MLR model described the relationship between the sensor signals and the carbohydrate and sugar parameters. The remaining variables were all significant at the 0.01 level. Then, different statistical methods were used to test the final equations for stability and validity. Three criteria were used to select the correct equation for further analysis: R 2 , RMSE, and the number of descriptors in the model. The leverage correction method, which uses cross-validation, was used instead of the PLSR and PCR validation methods.
For the best MLR models, the standard error of prediction is low, the F and R 2 values are high, the predictive power is high, and the descriptors are minimal. Correlation plots are shown in Figure 3 as a visual method to evaluate the models' fit with experimental data. Equations (4)- (7) X3 was not significant at the 0.01 level. The values obtained for the model were R 2 = 0.771 and RMSE = 6.667; p < 10 −4 .
Equations (8)-(11) represent the model obtained by the MLR model for the Agria cultivar based on the optimal factors of the PCR and PLSR models for predicting the amount of sugar and carbohydrate. Correlation plots are shown in Figure 4.   (11) X4 and X6 were not significant at the 0.01 level. The values obtained for the model were R 2 = 0.827 and RMSE = 4.745; p < 10 −4 .
In addition, Equations (12)-(15) represent the model obtained by the MLR method for the Jelly cultivar, based on the optimal factors of the PCR and PLSR models for predicting the amount of sugar and carbohydrate. Correlation plots are shown in Figure 5.
Sugar based on PCR = 6.674 + 6.769 X2 + 7.617 X3 + 0.002 (12) Molecules 2022, 27, x FOR PEER REVIEW 11 of 20 In addition, Equations (12)-(15) represent the model obtained by the MLR method for the Jelly cultivar, based on the optimal factors of the PCR and PLSR models for pre-   Equations (16)- (19) represent the model obtained by the MLR method for the Sante cultivar, based on the optimal factors of the PCR and PLSR models for predicting the amount of sugar and carbohydrate. Correlation plots are shown in Figure 6. Equations (16)- (19) represent the model obtained by the MLR method for the Sante cultivar, based on the optimal factors of the PCR and PLSR models for predicting the amount of sugar and carbohydrate. Correlation plots are shown in Figure 6.

Support Vector Machine (SVM)
To classify the samples, two methods were used-the C-SVM and the Nu-SVMwhich contain four kernel functions: the radial basis function, the sigmoid function, the polynomial function, and the linear function. These methods have penalty coefficients of "C and Nu", and a kernel coefficient of "γ", which is used by a network search algorithm. Seventy percent of the data were used for training and 30 percent were used for testing; all input weights were one. The results of the SVM method are summarized in Table 4. 1 Statistical analysis models used for data analysis: Nu-SVM = Nu-penalty coefficient support vector machine; C-SVM = C-penalty coefficient support vector machine. γ = core coefficient.
For the C-SVM method, the linear function had an accuracy of 100% in both training and validation for the five groups of potatoes. Figure 8 presents the classification accuracy results for the linear function, with 100% accuracy. For the C-SVM method, the linear function had an accuracy of 100% in both training and validation for the five groups of potatoes. Figure 8 presents the classification accuracy results for the linear function, with 100% accuracy.

Discussion
The e-nose tested in this study provided effective discriminations among certain potato cultivars based on analyses of sample volatiles using specific statistical methods for e-nose data analysis, as indicated in subsequent discussions. Radar diagrams were used to observe the differences in patterns (fingerprints) between different potato cultivars. Such diagrams are used in various applications in the food industry (especially in the electronic nose). The gas sensors MOS, TGS813, and MQ135 showed the highest responses

Discussion
The e-nose tested in this study provided effective discriminations among certain potato cultivars based on analyses of sample volatiles using specific statistical methods for e-nose data analysis, as indicated in subsequent discussions. Radar diagrams were used to observe the differences in patterns (fingerprints) between different potato cultivars. Such diagrams are used in various applications in the food industry (especially in the electronic nose). The gas sensors MOS, TGS813, and MQ135 showed the highest responses to the sample odors, while the sensors MQ136, MQ4, and MQ9 showed the lowest responses. Almost all the sensors showed a higher response for carbohydrates than for sugars. Knowing the response power of each sensor to different VOCs can help determine the different characteristics of the products; the most relevant and effective sensors with the maximum response to VOCs can be selected in the design of the electronic nose sensor array. The selection of the sensors with the suitable responses helps in the consideration of the transient response, to reduce the response time of the system. Moreover, knowing the role of the higher-level sensors in the data processing phase can reduce the complexity of the analysis, because sometimes additional variables in the data can lead to some problems, such as overtraining in data analysis [26]. For example, the results of research by Kiani, Minaei and Ghasemi-Varnamkhasti [11] showed that the fingerprint of mint leaves changes with drying. In addition, as in our research, they determined the efficiency of the sensors using radar diagrams and introduced the most effective and least effective ones during the drying time. In another study, Ran, et al. [27] used radar diagrams to investigate the effect of sensors on the electronic nose, and also to investigate electronic language in detecting the physicochemical properties of chicken powder. In another study, Wang, et al. [28] examined the performance of electronic nose sensors in detecting volatile compounds in pork at different cooking times/temperatures; they found that the sensors that were in SHS (superheated steam) mode performed better than other sensors. Huang, et al. [29] used an electronic nose to detect different types of volatile compounds in eels, for three modes of curing, steaming, and grilling; according to the radar diagram in their research, the fingerprints in all three modes of cooking were very similar. They also found that after cooking, the intensity of the smell of the samples intensified and the efficiency of the sensors changed.
The relationship between electronic nose signals and the predictions of amounts of sugar and carbohydrates was described by the PLSR and PCR models, and the MLR method was used to build the PLSR and PCR models. The highest accuracy of prediction of the sugar and carbohydrate parameters was related to the Marfona cultivar. The PCR and PLSR models were able to predict the amounts of sugar and carbohydrates with an accuracy of about 0.90. The lowest accuracies of prediction of the sugar and carbohydrate parameters were related to the Sprite cultivar. The PCR and PLSR models were able to predict the amounts of sugar and carbohydrates with an accuracy of about 0.80. The highest accuracy of the model in predicting the amounts of sugar and carbohydrates was related to the Marfona cultivar, with an accuracy of about 88% for predicting the amount of sugar and an accuracy of about 92% for predicting the amount of carbohydrates. In general, in all cultivars, the accuracy of predicting the amount of carbohydrates was somewhat better than the accuracy of predicting the amount of sugar. Since the sugar and carbohydrate models have good performance criteria, they can be considered reliable and acceptable models for prediction. A good correlation was found between the experimental and theoretical predictions for carbohydrates and sugars.
Abu-Khalaf [30] studied the quality parameters of olive oil using PLS models to analyze the chemical data and the EN. The results illustrated that the EN could model the acidity parameter with good performance. The correlation coefficients obtained, using the PLS model, for the calibration and validation of acidity were 0.87 and 0.87" respectively. Zhang, et al. [31] reported similar results using the PLSR method and an electronic nose for grapes, with R2 of 0.93. Zhou and Zeng [12] applied partial least squares spectroscopy and linear audit analysis (PLS-LDA) and found similar results, with an R 2 of 0.96.
The results of Kiani, Minaei and Ghasemi-Varnamkhasti [11] with an electronic nose and PLS, PCR, and MLR models, used in estimating the drying time of mint leaves, were very consistent with the results of our research. With these models, they were able to estimate the drying time of mint leaves with 93% accuracy. Zaki Dizaji, et al. [32] used the electronic nose technique and the PLS, PCR, and MLR models to predict the quality of sugarcane syrup based on the purity and the percentage of refined sugar. They stated that the accuracy of the PLS method for this purpose was 77 and 71%, respectively; they also reported that the accuracy of the PCR method was 72% and 68%, respectively. They further stated that the accuracy of the PLS method for this purpose was 82% and 74%, respectively. In another study conducted by Zhou, Fan, Tan, Peng, Cai and Zhang [12] to predict the amount of linalool in Osmanthus perfume, using an electronic nose, R2 for the PCR and MLR methods was reported at 0.736 and 0.895, respectively, and the RMSE was reported at 3.98 and 10.10, respectively. Gu, et al. [33] investigated the contamination of milled rice with an electronic nose based on the PLSR and SVM models and reported accuracies of 0.913-0.877 and 0.983-0.924, respectively. Du, et al. [34] evaluated the ability of an electronic nose to identify the SSC and TA of red meat kiwi fruit using the PLSR and SVM models; their results were R2 > 0.90 and R2 = 0.99, respectively.
Xu, et al. [35] evaluated the quality of tea using an electronic nose, an electronic tongue, and electronic vision, with the PLSR method. They were able to predict the amount of amino acids with an accuracy of 0.865, the amount of catechins with an accuracy of 0.888, the amount of polyphenols with an accuracy of 0.668, and the amount of caffeine with an accuracy of 0.791. Another study conducted by Zhang, et al. [36] to evaluate flavor improvement in Antarctic krill defluoridated hydrolysate by maillard reaction, using a PLSR sensor array and model, claimed that the PLSR results showed a very high correlation (100%) between the variables, which were the sensory properties and the volatile compounds. Zhang, et al. [37] predicted some quality indicators of kiwifruit using an electronic nose and chemometrics. According to their report, the PLSR and SVM models predicted the ripeness, SSC, and firmness of kiwifruit with an accuracy of 0.9928, 0.9143, and 0.9290, respectively. In a study conducted by Gu, et al. [38] on the rapid detection of infection levels in milled rice, using the PLSR method and the electronic nose method, rice infection levels were able to be estimated, with R2 = 0.864 and RMSE = 0.235. In another study conducted by Tian, et al. [39] on the effect of storage time and packaging methods on the freshness of dried Lycium, using chemometrics and electronic nose methods, it was concluded that the accuracy and capability of the PLS and MLR models in this field were equal to 0.9316 and 0.9330, respectively.
In other studies, similar results have been reported by some researchers. To predict the contamination of Sitophilus granarius in wheat grain by chemometrics and electronic nose methods, researchers reported that the MLR method can detect the uric acid content and protein content with (a) R2 = 0.958 and, RMSE = 1.401 and (b) R2 = 0.978 and RMSE = 0.275, respectively [40].An electronic nose was used for the early detection of Botrytis Cinerea infection in tomatoes, and it was able to detect infection in tomato plants with a PCA-based MLR model with 87.35% accuracy and an RMSE = 07.01 [41]. An electronic nose was also used to detect cheating in mutton mixed with pork. The accuracy of the results obtained with the MLR and PLS models for the electronic nose and electronic language was reported to be 91% and 98%, respectively [42]. Finally, to classify Sitophilus oryzae contamination in rice grain with the electronic nose, the MLR method reportedly estimated protein and uric acid levels of 0.972 and 0.997, respectively, and also reported RMSE of 2.08 and 1.05, respectively [43].
In the SVM model, based on a five-group classification, the classification accuracy was more than 94% for all models. In addition, in the C-SVM method with linear functions, the highest classification accuracy for learning and validation data was 100%. Previously, Karami, Rasekh, and Mirzaee-Ghaleh [7], determined the shelf life of edible oil using an electronic nose. They were able to correctly classify the oil samples with 100% accuracy using the C-SVM method. Gorji-Chakespari et al. [33] reported 99% accuracy in classifying Damascus rose essential oil by the SVM method. Karami, Rasekh, and Mirzaee-Ghaleh [8] also found 98% and 97% accuracies in training and validating SVM methods with linear kernel functions for oil oxidation detection. In a study conducted by Ghasemi-Varnamkhasti et al. [34] to describe the freshness of strawberries in polymer packaging, accuracies of 86.4% and 50.6%, respectively, were reported for training and validation using the C-SVM method, while accuracies of 85.2% and 55.6%, respectively, were reported for training and validation using the Nu-SVM and radial kernel function method. Rasekhet al. [35] classified edible essential oils into two groups and six groups using the SVM method and achieved a classification accuracy of 100% in all models for the two-group classification, i.e., the classification of essential oils of medicinal plants from fruit extracts. For the sixgroup classification, based on the type of extract, the polynomial and RBF functions had a classification accuracy of 98.9 in the C-SVM method, while in the Nu-SVM method the classification accuracy of linear functions and RBF was 100% in learning and 98.9% in validation. Similar results have been reported for other crops, such as grape leaves [44], fruit juices [45], essential oils [4,16], coffee bean [46], corn [47], and cucumbers [48].

Conclusions
A portable e-nose device, consisting of nine MOS gas sensors used in combination with an SVM statistical method, was used in this study to classify five potato cultivars. The classification of the potato cultivars was accomplished with 100% accuracy. These methods could possibly be used to indirectly detect other mineral and nutritional elements in agricultural products, following slightly more extensive studies to confirm the actual levels of elements in agricultural products, which could be correlated with the e-nose sensor array outputs. In addition, the PCR, PLSR, and MLR methods were used to predict the amount of sugar and carbohydrate in potatoes. The results of this study showed that the electronic nose is a suitable tool for predicting the amount of carbohydrate and sugar in potatoes, which can be generalized by these models to other cultivars as well. The carbohydrate and sugar models showed high predictive performance and exhibited the characteristics of a good model. This technique will be very useful in identifying potato varieties for factories and processing plants

Data Availability Statement:
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.