Article

IoT System for Gluten Prediction in Flour Samples Using NIRS Technology, Deep and Machine Learning Techniques

by Oscar Jossa-Bastidas 1,*, Ainhoa Osa Sanchez 1, Leire Bravo-Lamas 2 and Begonya Garcia-Zapirain 1

1 eVIDA Research Group, University of Deusto, 48007 Bilbao, Spain
2 Food Technology Department, Leartiker S. Coop., Xemein Etorbidea 12, 48270 Markina-Xemein, Spain
* Author to whom correspondence should be addressed.
Electronics 2023, 12(8), 1916; https://doi.org/10.3390/electronics12081916
Submission received: 13 March 2023 / Revised: 8 April 2023 / Accepted: 12 April 2023 / Published: 18 April 2023

Abstract

Gluten is a natural complex protein present in a variety of cereal grains, including species of wheat, barley, rye, triticale, and oat cultivars. When someone suffering from celiac disease ingests it, the immune system starts attacking its own tissues. Prevalence studies suggest that approximately 1% of the population may have gluten-related disorders during their lifetime; the scientific community has therefore studied different methods to detect this protein. There are multiple commercial quantitative methods for gluten detection, such as enzyme-linked immunosorbent assays (ELISAs), polymerase chain reactions, and advanced proteomic methods. ELISA-based methods are the most widely used, but despite being reliable, they also have certain constraints, such as the long periods they take to detect the protein. This study focuses on developing a novel, rapid, and budget-friendly IoT system that uses Near-infrared spectroscopy (NIRS) technology together with Deep and Machine Learning algorithms to predict the presence or absence of gluten in flour samples. A total of 12,053 samples were collected from 3 different types of flour (rye, corn, and oats) using an IoT prototype portable solution composed of a Raspberry Pi 4 and the DLPNIRNANOEVM infrared sensor. The proposed solution can collect, store, and predict new samples, and it is connected through a real-time serverless architecture designed in Amazon Web Services. The results showed that the XGBoost classifier reached an Accuracy of 94.52% and an F2-score of 92.87%, whereas the Deep Neural Network had an Accuracy of 91.77% and an F2-score of 96.06%. The findings also showed that it is possible to achieve high-performance results by only using the 1422–1583 nm wavelength range. The IoT prototype portable solution presented in this study not only provides a valuable contribution to the state of the art in the use of NIRS + Artificial Intelligence in the food industry, but also represents a first step towards the development of technologies that can improve the quality of life of people with food intolerances.

1. Introduction

Gluten is a protein formed by prolamin and glutelin that is found in different percentages in wheat, barley, rye, and possibly in some cultivars of oats due to cross-contamination [1]. Gluten gives elasticity to dough and a glue-like consistency and spongy properties to pasta, bread, and cakes because of the interaction between prolamin and glutelin in the presence of water and energy. However, prolamin is rich in glutamine and proline, amino acids known for being indigestible. They also prevent a complete enzymatic breakdown of gluten in consumers’ intestines, increasing the concentration of toxic oligopeptides [2]. Celiac disease (CD), one of the five major illnesses associated with gluten [3], is a multifactorial autoimmune disease triggered by gluten intake in genetically predisposed individuals, and it has been on the rise in the past 15–20 years [4,5]. The resulting small intestinal inflammatory process is accompanied by the production of specific antibodies against gliadin (the prolamin of wheat) and tissue transglutaminase (TG), leading to a variety of gastrointestinal and extraintestinal manifestations of a wide range of severity [4,6]. Despite the prevalence of CD rising to 1% worldwide, with some studies reporting a fourfold increase in the past 50 years, CD remains an often underdiagnosed condition [7,8,9].
With the number of people suffering from gluten-related disorders increasing, health authorities have approved labeling regulations to protect consumers. European Commission Implementing Regulation (EU) No 828/2014 [10] indicates that gluten-free products should contain <20 mg/kg of gluten and very low gluten products <100 mg/kg. While these levels of detection can be quantified using highly sensitive methods, such as instrumental analytical techniques (capillary electrophoresis, PCR, QC-PCR, RP-HPLC, LC-MS, and MALDI-TOF-MS), these also imply elevated costs and specialized training [11,12,13]. The latest methodologies for quantifying gluten are based on DNA instead of protein extraction. Although they allow gluten quantification, they are expensive and have limitations with highly processed foods [12].
With gluten concentration control being essential for the food industry, the need for reliable, simple-to-use, and affordable analytical methods has led to the majority of commercialized gluten detection kits being based on immunochemical methodologies (ELISA). In fact, an AOAC Official Method has validated one of these kits to measure gluten in various types and forms of food [14]. Although some authors have detected limitations and variations of 20 ppm among results when using ELISA commercial kits for gluten quantification on heated and processed food [12,13,15], others have noticed some improvements when applying the methods to dairy products [16] and plant seeds [17], with results that allow quantification down to 5.5 ppm. The ELISA method is quicker and easier to perform than instrumental techniques; however, it still takes time (from 30 min to 2.5 h [13]) and requires laboratory equipment. All the aforementioned analytical methods target the analyte (protein) directly; however, in the last few years, indirect analysis based on AI methods has been proposed more frequently [18]. These methods rely on large databases, offering quick and cheap solutions to the food industry [19].
Near-infrared spectroscopy (NIRS) sensors have been proposed in some studies to analyze different proteins, including gluten. In a recent study, a Fourier transform infrared spectrometry technique was used to determine the total wheat protein and the wet gluten present in wheat flour. The authors obtained an external validation $R^2$ of 82% and concluded that it is possible to predict the gluten protein content of wheat flour using NIRS [20]. Additionally, a recent systematic review concluded that NIRS is an excellent technique for analyzing the quality and safety of flour and that it allows non-destructive analysis [21]. Similarly, a non-invasive and quick method was developed to determine the authenticity of vegetable protein powders and classify possible adulterations using NIRS and chemometric tools [22]. That study investigated three potential powder adulterants: whey, wheat, and soy protein. The authors used one-class partial discriminant analysis for the authentication and partial least squares discriminant analysis (PLS-DA) for the classification of the adulterants. In total, 144 adulterated samples and 14 pure plant-based protein powder samples were analyzed. The proposed PLS-DA model achieved 100% sensitivity and specificity in the prediction set when authenticating pure plant-based protein powder samples and classifying adulterants [22]. In another study, NIRS was used in the 900–2250 nm wavelength range to assess the protein content of potato flour noodles in a non-destructive manner. The performance of the model was evaluated through partial least squares regression (PLSR), and the results indicated a high degree of accuracy, with an $R^2$ value of 0.8925 and an RMSE value of 0.1385% for the prediction set [23]. In another investigation, NIRS technology and PLSR were used to predict the purity of almond flour using six samples of authentic almond flour. The study utilized three different NIRS devices: a MicroNIR™ working in the 950–1650 nm wavelength range; a DLP NIRscan™ Nano working in the 900–1700 nm wavelength range; and a NeoSpectra FT-NIR operating in the 1350–2500 nm wavelength range. The classification results achieved 100% sensitivity and more than 95% specificity, with an $R^2$ value of 0.90 [24].
Machine Learning (ML) and Deep Learning (DL) methods have been used in many application fields, including image recognition, audio classification, and, lately, natural language processing [25,26,27]. These Artificial Intelligence (AI) algorithms have also been combined with the potential of NIRS technology to solve problems in the food industry, in most cases related to the evaluation of food samples. In a recent study, NIRS technology and ML models were used to evaluate the quality traits of sourdough bread flour made from six different flour sources [28]. The results showed that the ML models were able to classify the type of wheat used for the flours with an Accuracy and precision of 96.3% and 99.4%, respectively, by using NIRS and a low-cost electronic nose. In another study, a rapid and non-destructive classification of six different Amaranthus species was conducted using Visible and Near-infrared (Vis-NIR) spectra in the 400–1075 nm range and chemometric approaches [29]. The authors evaluated four different preprocessing methods to find the optimal preprocessing technique with the highest classification Accuracy. The different preprocessing and modeling combinations showed classification accuracies from 71% to 99.7% after cross-validation. The combination of Savitzky-Golay preprocessing and a Support Vector Machine (SVM) showed a maximum mean classification Accuracy of 99.7% for the discrimination of Amaranthus species. The authors concluded that Vis-NIR spectroscopy, in combination with appropriate preprocessing and AI methods, can be used to effectively classify Amaranthus species. Similarly, a portable low-cost spectrophotometric device was designed to classify 9 different food types of powder or flake structure. The device worked in the Vis-NIR region and employed ML algorithms for the data classification [30]. Eighteen features belonging to each sample were collected in the 410–940 nm optical spectral region. The SVM and a Convolutional Neural Network (CNN) achieved Accuracies of 97% and 95%, respectively. A recent investigation used feature selection techniques to determine the most important wavelength ranges for gluten detection in flour samples. The highest Accuracy obtained was 84.42%, achieved by selecting the 1089–1325 nm wavelength range and using the Random Forest classifier [31].
The main objective of this study, and its contribution to the state of the art, was to develop an innovative, rapid, and budget-friendly IoT system using NIRS technology and ML and DL algorithms for detecting the presence or absence of gluten in three types of flour samples.
The manuscript is organized as follows. Section 2 explains the materials and methods employed in the study, including the preparation and collection of the data, the implementation of the hardware and software, and the theory of the methods used. In Section 3, the results are presented and analyzed in three subsections that include the hyperparameter tuning methodology used for the ML and DL methods, and their classification results. Finally, Section 4 presents the discussion and conclusions, highlighting the major findings of the study.

2. Materials and Methods

2.1. Data Preparation

Among the foods that could respond adequately to NIRS technology, we selected commercial flours, as flour presents a homogeneous color and is easy to work with.
We acquired commercial rye, corn, and oat flours in February 2022 from two different Spanish brands, as seen in Table 1. They were analyzed twice in the Food Technology Department of Leartiker S. Coop. using a sandwich ELISA commercial kit (Gluten (gliadin), BioSystems, Barcelona, Spain). High concentrations of gluten were quantified in rye flour (average of 22.4 g/kg), while very low concentrations were found in the oat flour (13.4 ppm) and corn samples (3.0 ppm) from El Granero, as seen in Table 1. However, the oat flour samples from El Alcavaran contained 113 ppm more gluten than those from El Granero. Thus, the corn flour samples from both brands and the oat flour samples from El Granero can be labeled as gluten-free products (<20 ppm [10]).
We then adulterated each flour with commercial gluten (El Granero, Spain; set no. GT301131) for the training of the NIRS sensors. For each kg of flour, 100 g of gluten was added and mixed (Thermomix TM6, Vorwerk, Wuppertal, Germany) for 15 min. The mixing procedure was validated in a previous step, where corn flour from the APASA brand was adulterated to 20 ppm of gluten. Afterward, we took 4 different samples from the cup (upper, bottom, two intermediate, and lateral zones) and analyzed them twice using the ELISA commercial kit (Gluten (gliadin), BioSystems, Barcelona, Spain). Finally, we divided each flour’s adulterated samples, with concentrations of 0 (control) and 100 g of gluten per kg, into subsamples and stored them in labeled, airtight zip bags that were sent to the facilities of the University of Deusto.

2.2. Implementation

Once the samples were prepared, we proceeded with the data collection, data analysis, and creation of the ML and DL prediction models. Figure 1 shows the hardware and software architecture used to collect the data and the communication among them.

Hardware and Software Description

Each stage of the hardware and software architecture shown in Figure 1a is explained below.
  • NIRS sensor and embedded system: as seen in Figure 1a, the IoT prototype portable solution is composed of a Raspberry Pi 4 (the Raspberry Pi is a small, low-cost computer that uses a screen, a keyboard, and a mouse and can be used by people of all ages to learn how to program in languages such as Scratch and Python [32]) and the DLPNIRNANOEVM, a compact evaluation module used for NIRS [33]. The DLPNIRNANOEVM sensor works in the 900–1700 nm wavelength range with a resolution of 4 nm, so one measurement provides a total of 228 variables. The output of the sensor was the intensity. The DLPNIRNANOEVM sensor was connected to the Raspberry Pi 4 over USB, and Python scripts were designed for collecting data from the flour samples. To activate the sensor, we created an HTTP endpoint for receiving different requests using JSON files. This endpoint receives different information, including the sensor’s name and id, as well as the action to execute, with different parameters such as the sensor’s state and the duration of data collection. The endpoint then enables an AWS Lambda function that changes the status of the shadow in the AWS IoT microservice, which initiates data collection using the Message Queuing Telemetry Transport (MQTT) protocol (MQTT is a standard messaging protocol based on publish/subscribe communication, which is ideal for connecting remote devices [34]). A hedged sketch of this activation flow is given at the end of this subsection.
  • Data storing and data analysis: the data was received by AWS IoT and forwarded to AWS Lambda, which had several functionalities: managing the logic requests to the database, making the ML and DL predictions, and exposing the endpoints to the final user (client). The data was stored using AWS DynamoDB. The full explanation of this architecture is given in [35].
  • For the data analysis, we employed ML and DL algorithms programmed using the Python programming language, version 3.9. On the one hand, the ML techniques were trained using the ml.m4.xlarge instance available in AWS SageMaker [36], equipped with 4 vCPUs and 16 GiB of memory. On the other hand, the DL models were trained using a local machine equipped with an NVIDIA GeForce RTX 2060 SUPER graphics card with 8 GB of VRAM and 16 GB of shared GPU memory. We trained the DL algorithms on an eVIDA local machine to take advantage of the power of the graphics card, but also because this made it easier to manipulate the system files, which was useful for the custom tuning methodology explained later.
  • Visualization platform: it was designed using the Django framework and communicates with the AWS platform using HTTP requests. The functionalities of this platform are the visualization of gluten measures, the collection of new samples, and the visualization of new predictions. It is worth mentioning that the explanation of the visualization platform is out of the scope of this study; therefore, we do not go into details.
To make a new prediction, the IoT prototype portable solution takes approximately 30–60 s, from the collection of the flour sample until the data is predicted as containing or lacking gluten and visualized in the platform.
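As an illustration of the activation flow described above, the following is a minimal sketch of the Lambda function behind the HTTP endpoint, assuming a boto3 environment; the thing name, payload fields, and helper names are hypothetical and not taken from the actual implementation.

```python
# Hypothetical sketch: Lambda handler that receives the JSON activation
# request and updates the AWS IoT device shadow so the Raspberry Pi
# (subscribed over MQTT) starts collecting spectra.
import json
import boto3

iot_data = boto3.client("iot-data")

def lambda_handler(event, context):
    # e.g. {"sensor_name": "nirs-rpi4", "action": "collect", "duration_s": 30}
    body = json.loads(event["body"])
    shadow = {
        "state": {
            "desired": {
                "collecting": body["action"] == "collect",
                "duration_s": body.get("duration_s", 30),  # collection window
            }
        }
    }
    # Changing the desired state triggers a delta message over MQTT.
    iot_data.update_thing_shadow(
        thingName=body["sensor_name"],
        payload=json.dumps(shadow).encode("utf-8"),
    )
    return {"statusCode": 200, "body": json.dumps({"status": "requested"})}
```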

2.3. Data Collection Procedure

In Figure 1b, the blue and green boxes show the different types of flour with 0 g/kg and 100 g/kg of gluten concentration, respectively. The data collection was carried out over approximately 2 months by 4 researchers of the eVIDA research team. We followed an internal protocol to ensure a correct data collection procedure, which is explained below:
  • For each new measurement, the operator had to change the collecting plate (capacity ≈ 400 mg) (4 in Figure 1b) and then take the flour samples from different locations of the bag (1 or 2 in the same figure). Hence, he/she scooped flour from the top, bottom, center, and lateral sides of the bag to randomize the data collection process as much as possible. Finally, the operator put the samples on the collecting plate (4 in Figure 1b).
  • Once the sample was on the collecting plate, the operator flattened the flour, trying to keep its surface smooth. This was because, after some experiments, we realized that when the surface was not smooth, the data collection was not consistent.
  • To avoid cross-contamination of the samples, the operator had to wear different gloves when collecting the data from different flour types. Furthermore, it was necessary to use different spoons and collecting plates for each type of flour. The gloves were thrown away at the end of the day.
  • The collection time per sample was approximately 30 s; during this time window, the DLPNIRNANOEVM sensor measured the exposed sample and forwarded the data to the AWS platform.
  • All the samples were collected with the same sensor and embedded system. To measure the data, it was necessary to design and print a 3D mechanical system, shown on the right in Figure 1b: it is composed of a black 3D case at the top (containing the DLPNIRNANOEVM) and a blue box at the bottom (containing the Raspberry Pi 4). It was used to keep the sensor rigid during the measuring process, but also to collect the data in a dark environment.

2.4. Classification Framework

Figure 2 shows the pipeline we followed for the creation of a model able to predict the presence of gluten in the flour samples. Please note that when we refer to the “absence of gluten”, we are specifically alluding to flour samples that have a doping level of 0 g/kg of gluten.

2.4.1. Input Data

In the present study, we focused our efforts on a dichotomous (binary) case. Thus, we collected samples from 3 different types of flour (rye, corn, and oats) doped with 0 g/kg and 100 g/kg of gluten concentration, as explained before.
The sensor provided 228 variables (intensity), one per wavelength, from 900 nm to 1700 nm. We collected a total of 12,053 observations and split the dataset as follows: 70% (8438 observations) for the training set, 15% (1807 observations) for the validation set, and 15% (1808 observations) for the testing set. It is important to note that the testing set was kept apart from the others from the very beginning of the collection process by using different bags. Thus, it was totally unknown to the ML and DL algorithms.
The three different types of flour samples (rye, corn, and oat) from the training dataset are represented by the mean of their observations in Figure 3. The mean of the flour samples doped with 0 g/kg is, on average, higher than the mean of the samples doped with 100 g/kg. However, at low (near 900 nm) and high (near 1700 nm) wavelengths the means overlap.

2.4.2. Preprocessing

The preprocessing stage was divided into three phases: the cleaning of the data, the standardization, and the feature selection procedure.
  • Data cleaning: despite the good quality of the data provided by the DLPNIRNANOEVM sensor, we checked that it met the following requirements. First, we checked for valid data, verifying that the variable names and values met the required formats. Second, we checked for complete data, looking for NaN values and replacing them with valid values. Third, we checked for consistent data, looking for outlier values in the dataset. Finally, we checked for unique data, deleting duplicate values.
  • Standardization: once we cleaned the data, the next step was standardizing it. We applied row standardization, given that all the columns had the same unit. During this process, the variables were standardized by removing the mean and scaling to unit variance [37]. The standard score is given by (1).
$z = \frac{x - u}{\sigma}$ (1)
where x is the wavelength value, u is the mean of all the wavelength variables, and σ is the standard deviation.
  • Feature selection: based on our previous study [31], we selected three wavelength ranges to train the ML and DL algorithms (1089–1325 nm, 1239–1353 nm, and 1422–1583 nm), as well as the whole 900–1700 nm spectrum. The purpose was to corroborate the possibility of predicting the presence or absence of gluten in the flour by selecting only some of the wavelength variables. A minimal preprocessing sketch follows this list.
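The following is a minimal sketch of this preprocessing pipeline, assuming `X` is an (n_samples × 228) NumPy array of raw intensities; the wavelength grid and range bounds come from the text, while the variable names are illustrative.

```python
# Sketch of cleaning, row standardization (Equation (1)), and wavelength
# range selection; X is assumed to hold the raw sensor intensities.
import numpy as np
import pandas as pd

wavelengths = np.linspace(900, 1700, 228)      # nominal DLPNIRNANOEVM grid
df = pd.DataFrame(X, columns=np.round(wavelengths, 1))

# Cleaning: replace NaNs with column means and drop duplicate observations.
df = df.fillna(df.mean()).drop_duplicates()

# Row standardization: all columns share the same unit, so z = (x - u) / sigma
# is computed per observation.
z = df.sub(df.mean(axis=1), axis=0).div(df.std(axis=1), axis=0)

# Feature selection: keep one candidate range, e.g. 1422-1583 nm.
mask = (wavelengths >= 1422) & (wavelengths <= 1583)
z_range = z.loc[:, mask]
```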

2.4.3. Classification Models

2.4.3.1. Support Vector Machine

SVM is a supervised learning method used for regression, classification, outlier detection, and feature selection problems [38]. The main objective of an SVM algorithm is to create an optimal hyperplane that separates the classes as much as possible. In a two-dimensional space, the hyperplane is a line; in a three-dimensional space, it is a two-dimensional plane; and, in general, the SVM creates $(n-1)$-dimensional hyperplanes in $\mathbb{R}^n$, where n is the dimension or number of features [39]. A kernel function is needed to map the data.
Given a set of n observations, with x representing the training data and y the class label, as seen in (2):
$S = \{(x_1, y_1), \ldots, (x_n, y_n)\}$ (2)
The decision function for nonlinear data is given by (3) [39], where m is the bias parameter and α determines the maximal margin classifier, a parameter related to the input vector.
$f(x) = \operatorname{sgn}\left(\sum_{i=1}^{N} \alpha_i\, y_i\, K(x_i, x) + m\right)$ (3)
where K is the kernel function. In this study, we used the Polynomial, Radial basis function, and Sigmoid kernels.
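As a hedged sketch of how such classifiers can be set up with scikit-learn, the snippet below compares the three kernels used in this study; the hyperparameter values and the `X_train`/`y_train` arrays are placeholders, not the tuned configuration reported later.

```python
# Sketch: SVM with the three kernels evaluated in this study.
from sklearn.svm import SVC
from sklearn.metrics import fbeta_score

for kernel in ("poly", "rbf", "sigmoid"):
    clf = SVC(C=1.0, gamma="scale", kernel=kernel)
    clf.fit(X_train, y_train)          # standardized spectra, 0/1 gluten labels
    y_pred = clf.predict(X_val)
    print(kernel, fbeta_score(y_val, y_pred, beta=2))
```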

2.4.3.2. Extreme Gradient Boosting (XGBoost)

The XGBoost algorithm is an implementation of gradient-boosted decision trees designed for speed and high performance. The term “Gradient Boosting” was introduced by Friedman in 2001, and the technique is applied to structured and tabular data [40]. There are C++, Python, R, Java, and other implementation libraries of this algorithm, proposed by Tianqi Chen [41].
XGBoost takes advantage of the second-order Taylor expansion of the loss function and adds a regularization term to balance the complexity of the model and the decline of the loss function. Equation (4) is used to calculate a prediction from a dataset with n examples and m features.
$\hat{y}_i = \phi(x_i) = \sum_{k=1}^{K} f_k(x_i), \quad f_k \in \mathcal{F}$ (4)
where $\mathcal{F} = \{ f(x) = w_{q(x)} \mid q: \mathbb{R}^{m} \to T,\ w \in \mathbb{R}^{T} \}$ is the space of the trees, q is the structure of each tree, and T is the number of leaves of the tree. Therefore, each f is an independent tree structure. The regularized objective to be minimized is given in (5).
$\mathcal{L}(\phi) = \sum_{i} l(\hat{y}_i, y_i) + \sum_{k} \Omega(f_k)$ (5)
where $\Omega(f) = \gamma T + \frac{1}{2}\lambda \lVert w \rVert^{2}$.
In this case, l is a convex loss function and Ω penalizes the complexity of the model. To improve the objective, the prediction for instance i at iteration t is added and a second-order approximation is applied, obtaining (6) [41].
$\mathcal{L}^{(t)} \simeq \sum_{i=1}^{n} \left[ l\left(y_i, \hat{y}_i^{(t-1)}\right) + g_i f_t(x_i) + \frac{1}{2} h_i f_t^{2}(x_i) \right] + \Omega(f_t)$ (6)
where $g_i = \partial_{\hat{y}^{(t-1)}} l\left(y_i, \hat{y}^{(t-1)}\right)$ and $h_i = \partial^{2}_{\hat{y}^{(t-1)}} l\left(y_i, \hat{y}^{(t-1)}\right)$ are the first- and second-order gradient statistics of the loss function. Removing the constant terms, defining $I_j = \{ i \mid q(x_i) = j \}$ as the instance set of leaf j [41], and also defining $w_j^{*} = -\frac{\sum_{i \in I_j} g_i}{\sum_{i \in I_j} h_i + \lambda}$, (7) is finally obtained. This can be used as a scoring function to measure the quality of a tree.
$\mathcal{L}^{(t)}(q) = -\frac{1}{2} \sum_{j=1}^{T} \frac{\left(\sum_{i \in I_j} g_i\right)^{2}}{\sum_{i \in I_j} h_i + \lambda} + \gamma T$ (7)
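A minimal sketch of training such a model with the Python xgboost library is given below; the hyperparameter values are placeholders (the tuned values are reported in Section 3.1.2 and Table 5), and the data arrays are assumed to exist.

```python
# Sketch: XGBoost with the four hyperparameters tuned in this study
# (alpha, lambda, max_depth, num_round).
import xgboost as xgb

dtrain = xgb.DMatrix(X_train, label=y_train)
dval = xgb.DMatrix(X_val, label=y_val)

params = {
    "objective": "binary:logistic",
    "alpha": 0.1,       # L1 regularization
    "lambda": 100.0,    # L2 regularization on weights
    "max_depth": 3,     # maximum tree depth
}
booster = xgb.train(params, dtrain, num_boost_round=500,
                    evals=[(dval, "validation")])
y_prob = booster.predict(dval)   # probability of the gluten class
```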

2.4.3.3. Deep Neural Network

The functionality of a DNN is mainly described by two processes: forward propagation and backpropagation. Deep feedforward networks, also known as multilayer perceptrons (MLPs), are the quintessential DNN models and have the objective of approximating a function f* [42]. These models are called feedforward because the computation starts from the input x and goes through the intermediate computations until obtaining the output y. Feedforward networks are represented by a composition of multivariate functions, as in (8).
$f = g \circ f_k \circ \cdots \circ f_2 \circ f_1$ (8)
where k is the last hidden layer and g is the output function. The function maps its inputs into the outputs, $f : \mathbb{R}^m \to \mathbb{R}^s$, where m is the dimension of the input x and s is the dimension of the output. Each hidden layer is composed of multivariate functions, as in (9).
$f_i(x) = a(w_i x + b_i)$ (9)
Replacing (9) in (8).
$f(x) = g\left(a\left(w_k \cdots a\left(w_2\, a(w_1 x + b_1) + b_2\right) \cdots + b_k\right)\right)$ (10)
where $f_i$ are the hidden layer functions, w is the weight, b is the bias, and a is the activation function. Each linear combination plus the bias produces the output of the node.
As stated before, the forward process takes the input x and propagates it through the hidden layers until obtaining $\hat{y}$. This process produces a scalar cost $J(\theta)$. Backpropagation then propagates information backward through the network in order to calculate the gradient [42]. Suppose that $x \in \mathbb{R}^m$, $y \in \mathbb{R}^n$, g maps $\mathbb{R}^m$ to $\mathbb{R}^n$, and f maps $\mathbb{R}^n$ to $\mathbb{R}$. If $y = g(x)$ and $z = f(y) = f(g(x))$, then, using the chain rule, (11) holds.
$\frac{\partial z}{\partial x_i} = \sum_{j} \frac{\partial z}{\partial y_j}\, \frac{\partial y_j}{\partial x_i}$ (11)
$\nabla_x z = \left(\frac{\partial y}{\partial x}\right)^{\top} \nabla_y z$ (12)
In (12), (11) is rewritten in vector notation, where $\frac{\partial y}{\partial x}$ is the $n \times m$ Jacobian matrix of g. The gradient with respect to a variable x is obtained by multiplying the (transposed) Jacobian matrix by the gradient $\nabla_y z$. Thus, backpropagation amounts to performing such a Jacobian-gradient product for each operation in the graph [42].
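For illustration, a DNN of this kind can be sketched in Keras as below; the number of hidden layers, units, optimizer, and learning rate are placeholders standing in for the values tuned in Section 3.2.

```python
# Sketch: multilayer perceptron with k hidden layers of 10 units each;
# the framework performs the forward pass and backpropagates the scalar
# cost J(theta) during fit().
import tensorflow as tf

def build_mlp(n_features, hidden_layers=6):
    model = tf.keras.Sequential()
    model.add(tf.keras.Input(shape=(n_features,)))
    for _ in range(hidden_layers):
        model.add(tf.keras.layers.Dense(10, activation="tanh"))
    model.add(tf.keras.layers.Dense(1, activation="sigmoid"))  # gluten yes/no
    model.compile(
        optimizer=tf.keras.optimizers.Adadelta(learning_rate=0.1),
        loss="binary_crossentropy",
        metrics=["accuracy"],
    )
    return model
```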

2.4.3.4. Hyperparameter Tuning Methodology for DNN

For the DNN hyperparameter tuning, we propose a methodology based on the operating principles of the Hill Climbing and Grid Search methods. The idea of the proposed methodology is to tune the hyperparameters in a desired order, tuning each hyperparameter individually and seeking the highest metrics in each iteration. We decided to use this methodology because the literature lacks a method that considers the order in which the hyperparameters should be tuned. It is worth mentioning that we are not suggesting a general specific order for the hyperparameters in this study; rather, the methodology is designed to investigate the hyperparameter order in future investigations. Likewise, we still do not refer to the methodology as a new framework, because it lacks a time-efficient implementation and is, at this point in time, only implemented using sequential logic.
Table 2 shows the pseudocode for the methodology proposed. The following abbreviations were used to simplify the process.
  • Hyperparameters matrix ($HM = (hm_{ij});\ 1 \le i \le l,\ 1 \le j \le n$): a matrix of hyperparameters and their values. The hyperparameters are tuned in the row order (i) indicated.
  • Filter ($f(hm) = (f(hm_i))$): a vector of thresholds (one per hyperparameter) that limit the number of models passing to the next $hm_i$ iteration (see Table 2, step 3.1.2).
  • Metrics (m): a vector of the metrics used to evaluate the performance of the models. The metrics determine the score for each model.
  • Model values ($MV = (mv_{kj});\ 1 \le k \le f(hm_i),\ 1 \le j \le n$): a matrix that contains the best hyperparameter values for each model selected during each $hm_i$ iteration (see Table 2, step 3.2).
  • Final model values ($fmv$): a vector of the hyperparameter values of the best model selected at the end of the process.
  • Score ($s$): a vector of the scores for each ij iteration. The vector is rewritten for each j iteration.
  • Random vector ($r$): a random vector.
Here, i is the hyperparameter being iterated, j indexes its values, l is the number of hyperparameters, and n is the number of hyperparameter values. Furthermore, we follow standard matrix notation: matrices are in bold capital letters, vectors are in bold lowercase, and scalars are in lowercase.
The algorithm takes as inputs the HM matrix and the $f(hm_i)$ and m vectors. The values of the HM matrix are randomly initialized. The algorithm then looks for the best model values MV for each hyperparameter $hm_i$; to do this, it trains the DNN, calculates the metrics, and sets a score for each model. The set_score method used in this study consists of giving the models a rating number for the selected metrics m; however, it could be replaced by any other method to score the models. Once all the values have been tested, the algorithm sorts the s vector and selects the top $f(hm_i)$ models. The $mv$ vector containing the values is assigned to the MV matrix, and the k value is reset to zero (i.e., the algorithm only keeps the $mv$ values of the current iteration and removes the values of past $hm_i$ iterations). For the second iteration, the procedures from 3.1.1 to 3.1.2 are executed again, but this time considering the best $mv$ values from the previous $hm_i$ iteration. Finally, the vector of values at the end of the $hm_i$ iterations is assigned to the $fmv$ vector, and those are the tuned values at the end of the algorithm. It is worth noting that the last filter value is $f(hm_l) = 1$, because only one model is selected.
For a better understanding, we also illustrate the tuning process in Figure 4. The red color represents the inputs, while the blue one represents variables calculated during the process. In Figure 4, each hyperparameter $hm_1, hm_2, \ldots, hm_l$ of the matrix HM is represented as a green box. During each $hm_i$ iteration, the MV matrix is updated depending on the score calculated, the filters $f(hm_1), f(hm_2), \ldots, f(hm_l)$, and the metrics m. It can be noted in Figure 4 that the order of the hyperparameters matters. Thus, during the first iteration, the $hm_1$ hyperparameter is tuned; subsequently, in the second iteration, the $hm_2$ hyperparameter is tuned using the hyperparameter values found in the previous iteration $hm_1$. The algorithm therefore tunes the hyperparameters during each iteration until the end of the process. In Figure 4, the Experiment Numbers ($EN_i$) correspond to the number of executed experiments for each hyperparameter. $EN_i$ was multiplied by three because we repeated each training three times and computed the mean of the three runs for each result. This was done to reduce the randomness in the results.
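To make the procedure concrete, the following is a compact sketch of the sequential tuning loop described by Table 2 and Figure 4, assuming a `train_and_score(config, metrics)` helper that trains the DNN three times and returns the mean score; all names are illustrative.

```python
# Sketch of the proposed sequential hyperparameter tuning methodology:
# tune one hyperparameter at a time, in a fixed order, keeping the best
# f(hm_i) models at each iteration (the MV matrix) for the next one.
import random

def tune(hm, filters, metrics, train_and_score):
    # hm: list of (name, candidate_values) in tuning order (the HM matrix)
    # filters: number of models kept after each hyperparameter (f(hm_i))
    # Random initialization of every hyperparameter value.
    survivors = [{name: random.choice(list(vals)) for name, vals in hm}]
    for (name, values), keep in zip(hm, filters):
        scored = []
        for base in survivors:              # best models from iteration i-1
            for value in values:            # try every value of hm_i
                config = {**base, name: value}
                scored.append((train_and_score(config, metrics), config))
        scored.sort(key=lambda item: item[0], reverse=True)
        survivors = [cfg for _, cfg in scored[:keep]]   # MV for this step
    return survivors[0]                      # fmv: final model values
```

With, for example, `hm = [("hidden_layers", range(1, 11)), ("optimizer", ["sgd", "adam", "adadelta"]), ...]` and a final filter of 1, the loop ends with a single tuned model, as in the text.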

2.4.4. Output

The output $\hat{y}$ of the ML and DL models is the label indicating whether the flour sample contains gluten or not. In this study, “1” represents the presence of gluten and “0” represents its absence. To measure the performance of the classifiers, we calculated the Accuracy and F2-score metrics. We are particularly interested in the F2-score, since it gives more weight to recall than to precision, which is particularly important considering that overlooked False Negatives (FN) are more detrimental than False Positives (FP) in the prediction of the presence of gluten. For instance, wrongly predicting the absence of gluten in the flour would cause the user to consume gluten and suffer potential adverse health effects. The equations for the Accuracy and F2-score are given by (13) and (14), respectively.
$\mathrm{ACC} = \frac{TP + TN}{TP + FP + TN + FN}$ (13)
$F_2 = \frac{TP}{TP + 0.2\,FP + 0.8\,FN}$ (14)
The true positives ( T P ) represent the flour samples that were correctly predicted with the presence of gluten, while the true negatives ( T N ) represent the samples that were correctly predicted with the absence of gluten. The F P represents the samples that were predicted with the presence of gluten but did not have it, and finally, the F N represents the samples that were predicted with the absence of gluten but actually had it.
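These metrics can be computed directly or, for the F2-score, with scikit-learn's `fbeta_score` with beta = 2; the snippet below is a small sketch assuming `y_test` and `y_pred` are 0/1 NumPy arrays.

```python
# Accuracy and F2-score as in Equations (13) and (14).
import numpy as np
from sklearn.metrics import accuracy_score, fbeta_score

acc = accuracy_score(y_test, y_pred)
f2 = fbeta_score(y_test, y_pred, beta=2)   # weights recall over precision

# Equivalent manual computation from the confusion counts:
tp = np.sum((y_pred == 1) & (y_test == 1))
fp = np.sum((y_pred == 1) & (y_test == 0))
fn = np.sum((y_pred == 0) & (y_test == 1))
tn = np.sum((y_pred == 0) & (y_test == 0))
acc_manual = (tp + tn) / (tp + fp + tn + fn)
f2_manual = tp / (tp + 0.2 * fp + 0.8 * fn)
```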

3. Results

This section provides the results obtained for the ML and DL algorithms. We focused our efforts on obtaining the best models for each proposed wavelength range for both ML and DL methods. The ML results for each wavelength range were compared because the models were trained under similar conditions (same tuning method, same machine, and the same amount of training). Likewise, the DL methods were compared across wavelength ranges, and we show the results of the proposed hyperparameter tuning methodology. However, despite employing a similar amount of training for the ML and DL models, it is not possible to make a fair comparison between them, because they were trained using different machines. It is important to mention that Sections 3.1 and 3.2 on hyperparameter tuning only present the results obtained for the validation set.

3.1. Machine Learning Hyperparameter Tuning

The AWS SageMaker hyperparameter tuning tool was used to adjust the hyperparameters of the ML algorithms. The hyperparameter tuning experiments for SVM and XGBoost were executed 10 times, with each execution comprising 100 iterations. The selected tuning method was the Bayesian optimization process available in AWS SageMaker [43].
The Bayesian optimization algorithm differs from grid search or random search in that it considers all the historical evaluations [44]. Mathematically, for an objective function $f : X \to \mathbb{R}$ defined over the domain X, the problem can be stated as in Equation (15) [45,46].
$x^{*} \in \underset{x \in X}{\arg\max}\, f(x)$ (15)
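For reference, launching such a Bayesian search with the SageMaker Python SDK could look like the sketch below; the estimator, metric regex, and ranges are illustrative assumptions, not the exact configuration used in this study (the actual ranges are given in Table 3 and Table 4).

```python
# Hedged sketch: Bayesian hyperparameter tuning job in AWS SageMaker.
from sagemaker.tuner import (CategoricalParameter, ContinuousParameter,
                             HyperparameterTuner)

tuner = HyperparameterTuner(
    estimator=svm_estimator,   # a SageMaker estimator wrapping the SVM script
    objective_metric_name="validation:f2",
    metric_definitions=[{"Name": "validation:f2",
                         "Regex": "validation-f2: ([0-9\\.]+)"}],
    hyperparameter_ranges={
        "C": ContinuousParameter(1e-3, 1e6),
        "gamma": CategoricalParameter(["scale", "auto"]),
        "kernel": CategoricalParameter(["poly", "rbf", "sigmoid"]),
    },
    strategy="Bayesian",       # considers all historical evaluations
    max_jobs=100,
    max_parallel_jobs=4,
)
tuner.fit({"train": train_s3_uri, "validation": val_s3_uri})
```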

3.1.1. SVM

We considered the following hyperparameters for this study: C, the regularization parameter; gamma, which defines the reach of the influence of a single training example (low values meaning far and high values meaning close); and the kernel function that transforms the input data into a high-dimensional space. The ranges of the parameters used for the tuning are shown in Table 3.
The results below show the impact of applying hyperparameter optimization in the different wavelength ranges. After the optimization procedure, the maximum Accuracy and F2-score were approximately 95% for the first two wavelength ranges and 80% for the third one. Furthermore, the results suggest that the models for the 900–1700 nm wavelength range were overfitting.
Additionally, Figure 5 shows the bivariate distribution for C and training time. In this figure, a darker red color indicates a higher concentration of repeated values, while a lighter red color represents the opposite. It can be appreciated that the Bayesian optimization considers the historical results during the training, as we expected. This means that, after the first execution iteration, where the hyperparameter combinations were random, the Bayesian optimization algorithm found regions where it executed experiments more frequently. This behavior was repeated for the four wavelength ranges, as observed in Figure 5, but it can be most clearly appreciated in the 1089–1325 nm wavelength range, where most of the experiments were executed at values close to $1 \times 10^6$ and where the training times were low, most of them under 100 s. For the 900–1700 nm wavelength range, the C values with higher scores may be assumed to be close to 0–5000, as shown by the darker red color in the bottom-left of the same figure. For the 1239–1353 nm wavelength range, there are two possible regions of C values: the first between 0 and 5000 and the second between 650,000 and 800,000. However, the second one should be preferred because of the shorter training times. Finally, for the 1422–1583 nm wavelength range, the C values to be selected are close to 200,000 due to the low average training times and high Accuracy and F2-score.
As seen in Figure 5e,f, the sigmoid kernel did not work well; therefore, it is best to choose either the poly or rbf kernels, whose Accuracies sometimes exceed 90%. For gamma, both types are eligible, since the Accuracy results are similar for all wavelength ranges.

3.1.2. XGBoost

Four hyperparameters were considered for this study: alpha, the L1 regularization term; lambda, the L2 regularization term on weights; max_depth, the maximum depth of a tree; and num_round, the number of boosting rounds. The ranges of the parameters used for the tuning are shown in Table 4.
Figure 6 shows the Accuracy and F2-score for the model with the highest performance (out of 10 repetitions) generated by the Bayesian optimization. In this case, to analyze the four numerical hyperparameters without making the analysis too extensive, we only included the graph for one wavelength range (1422–1583 nm), since the others were similar. The maximum Accuracy and F2-score were 95%.
In the case of max_depth, the best values may be assumed to be close to 2–4 or 10, as shown by the darker red color and the blue line in Figure 6a. Furthermore, the training time is low, staying under 120 s for most of the values. In the case of lambda, there is no clear tendency, so many values could be selected. However, as can be seen in Figure 6b, there is a slightly greater proportion of darker tones around the values 0–200, with shorter training times (under 70 s); hence, these values could be selected. In the case of alpha, a tendency can be observed in the results: as alpha approaches zero, the F2-score and Accuracy increase. Finally, as for lambda, while many values can be selected for num_round, the ones around 0–500 or 3000–4000 could be chosen due to the high repetition of experiments.
Table 5 shows the hyperparameters selected by the Bayesian optimization method. As can be seen, the selected hyperparameters coincide with the best values analyzed in Figure 5 and Figure 6 for both ML methods in all the wavelength ranges.

3.2. Deep Learning Hyperparameter Tuning

For the hyperparameter tuning of the DNNs, we used the proposed methodology explained in Section 2.4.3.4. Figure 7 shows the structure, the hyperparameter values, and the other parameters used for the tuning methodology.
As shown in Figure 7, the order followed during the tuning methodology was: (1) hidden layers, (2) optimizer, (3) learning rate, (4) epochs, (5) activation function, and (6) custom loss function. The metrics used to rate the experiments during each $hm_i$ iteration were Accuracy, F2-score, and training time. The total number of experiments executed was 381. The tuning methodology implemented in Python allows the user to select the order of the hyperparameters, which we chose considering some studies in the literature and after executing a few experiments. We only included the most relevant experiments; otherwise, the study would have become very long.
The number of hidden layers was the first hyperparameter we selected to be tuned by the methodology. Secondly, the optimizer and learning rate were selected based on [47], which suggests that tuning the learning rate first can save a lot of experiments. Subsequently, the epochs were tuned to adjust underfitting or overfitting models. The activation function was selected next. At this point, expecting to have a model with good performance, we selected a custom loss function and slightly modified it to penalize FN more heavily, considering the importance of these values in the prediction of the presence of gluten, as explained in Section 2.4.4.
  • Hidden layers
We did a few experiments and realized that, when increasing the number of hidden layers, the performance showed no major changes while the training time increased considerably. This suggests that our binary classification problem can be solved without using hundreds of hidden layers. Furthermore, a literature review reported that, across different studies, good results were obtained using no more than 10 hidden layers [48]. Therefore, we provided the tuning methodology with a vector from 1 to 10, with all the hidden layers having 10 neurons each. Figure 8 shows the score metrics obtained in the $hm_1$ iteration by the hidden layers hyperparameter in the four wavelength ranges.
The first aspect to highlight from Figure 8 is that, as the number of hidden layers increases, the training time increases, as expected. Regarding the Accuracy and F2-score, in the 900–1700 nm and 1089–1325 nm wavelength ranges, the higher values are located between 6 and 8 hidden layers, but the scores, in general, are very similar. In the 1422–1583 nm wavelength range, however, the Accuracy and F2-score slightly increase with the number of hidden layers.
  • Optimizer
We selected three different optimizers with their default learning rates. They were tuned taking into account the hidden layers selected in the $hm_1$ iteration.
Figure 9 shows the experiments performed for all the wavelength ranges using three different optimizers: SGD, Adam, and Adadelta. The black lines on top of the bars correspond to the error with a 95% confidence interval. This figure makes it very clear that the Adam optimizer did not work well, obtaining results under 60% for Accuracy and under 50% for F2-score. It can also be noted that the results for SGD and Adadelta are similar. Regarding wavelength ranges, the two highest results are for the 900–1700 nm and 1422–1583 nm wavelength ranges, with slightly better performance for the first one.
  • Learning rate
Once the proposed tuning methodology selected the SGD and Adadelta optimizers in the $hm_2$ iteration, 5 values from $1 \times 10^{-5}$ to 0.1 were evaluated. Figure 10 shows the Accuracy and training time obtained for each wavelength range. We did not graph the F2-score because it overlapped with the Accuracy in the shaded zones, making the visualization difficult. The shaded zones represent the mean and 95% confidence interval.
The first aspect to mention is that the Accuracy scores observed in Figure 10 are higher than those obtained in the previous iterations $hm_1$ (hidden layers tuned) and $hm_2$ (optimizer tuned), achieving Accuracies and F2-scores over 90% in the 900–1700 nm and 1422–1583 nm wavelength ranges. Thus, the improvements in performance are remarkable for each wavelength range. Another aspect to highlight is that the variability for the SGD optimizer was higher than for the Adadelta optimizer, increasing when the learning rate values were higher, as seen in Figure 10c,d. The conclusion is that the best results were obtained with the Adadelta optimizer and high learning rate values. For the training time, there are no remarkable differences in comparison with the previous $hm_i$ iterations.
  • Epochs
All the previous $hm_i$ iterations were executed using 100 epochs. When checking the 135 experiments for the epochs hyperparameter, we observed that most of the models were underfitting and a few others needed more epochs to finish learning. It is worth clarifying that all these graphs were obtained at the end of the training by the proposed tuning methodology; therefore, underfitting and overfitting models may still exist. Unfortunately, this is an aspect that our methodology cannot avoid, as we did not consider the loss curves in the metrics m used to rate the models.
Figure 11 shows the result of increasing the epochs for the 900–1700 nm wavelength range, the only one we included because the results for the others were very similar. As can be seen in this figure, when increasing the epochs, the Accuracy, F2-score, and stability of the curves improve. However, if this value is increased too much, for example to 400 epochs, the model overfits.
  • Activation function
The next hyperparameter to tune was the activation function. The best results were obtained with the tanh activation function for all the wavelength ranges; therefore, there are no remarkable differences to show, because the experiments executed in the previous $hm_i$ iterations already used this activation function.
  • Loss function
At this point, expecting to have a model with good performance, we selected a custom loss function and slightly modified it to penalize FN. To modify the loss function, we only introduced the weights α and β into the binary cross-entropy equation, as shown in (16).
$\mathrm{Loss} = -\frac{1}{n} \sum_{i=1}^{n} \left( \alpha\, Y_i \log \hat{Y}_i + \beta\, (1 - Y_i) \log (1 - \hat{Y}_i) \right)$ (16)
The above equation gives more weight to the predictions of FN than FP. After a few experiments, we realized that the loss function worked well for values of β over 0.7 and α = 1 − β. Therefore, we executed some experiments around these values, which are shown in Figure 12.
On the one hand, if the value of β is very high, the FN have more importance in the prediction, as seen in the F2-score in Figure 12. On the other hand, when β starts decreasing, the F2-score and Accuracy become almost the same. Thus, it is necessary to find a tradeoff for how much the FN can be penalized without losing too much prediction quality on the FP. A sketch of this weighted loss is given below.
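The following is a minimal Keras sketch, with β on the (1 − Y) term and α = 1 − β as in Equation (16); the β value shown is only an example within the range explored above.

```python
# Sketch: weighted binary cross-entropy of Equation (16) as a custom
# Keras loss; beta and alpha = 1 - beta weight the two error terms.
import tensorflow as tf

def weighted_bce(beta=0.8):
    alpha = 1.0 - beta
    def loss(y_true, y_pred):
        # Clip predictions away from 0 and 1 for numerical stability.
        y_pred = tf.clip_by_value(y_pred, 1e-7, 1.0 - 1e-7)
        per_sample = -(alpha * y_true * tf.math.log(y_pred)
                       + beta * (1.0 - y_true) * tf.math.log(1.0 - y_pred))
        return tf.reduce_mean(per_sample)
    return loss

# Usage: model.compile(optimizer="adadelta", loss=weighted_bce(beta=0.8))
```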
  • Summary of the proposed tuning methodology
We summarized the whole procedure in one figure, including the loss function, Accuracy, and F2-score curves.
Figure 13 shows the best results for the 6 $hm_i$ iterations in the 900–1700 nm wavelength range. In the $hm_1$ iteration (hidden layers tuned), the F2-score and the loss were good during the training process but very unstable for the validation process. Subsequently, in the $hm_2$ iteration (optimizer tuned), the F2-score and loss improved in stability; the score of the former slightly increased, while that of the latter slightly decreased. In the $hm_3$ iteration (learning rate tuned), there was a remarkable increase in the F2-score. Afterward, in the $hm_4$ iteration (epochs tuned), the F2-score and the loss function improved, with the latter continuously declining; however, the model seems slightly overfitted. In the $hm_5$ iteration (activation function tuned), there is no difference with respect to $hm_4$. Finally, in the $hm_6$ iteration (loss function tuned), where the custom loss function was used, the F2-score is slightly higher than the Accuracy.
Table 6 shows the hyperparameters selected for all the wavelength ranges after completing the proposed tuning methodology. It can be appreciated that the hyperparameters are similar for all the wavelength ranges, having the same optimizer and activation function and similar learning rates. The hidden layers and loss function have different values for each wavelength range.

3.3. Classification Results

Table 7 shows the best Accuracy and F2-score results for the SVM and XGBoost methods in each wavelength range evaluated. The metric scores were obtained by predicting the testing set, which contained 1808 observations of flour samples of rye, corn, and oats, as detailed in Section 2.4.1. The models used the hyperparameters from Table 5.
In Table 7, it can be seen that the best model for SVM was in the 1422–1583 nm wavelength range, achieving an Accuracy of 88.93% and an F2-score of 89.28%. It is worth mentioning that, despite the SVM reaching an Accuracy of 91.31% and an F2-score of 90.79% in the 900–1700 nm wavelength range, this model is not considered the best one because it used all the features during training and could be overfitting, as seen in Figure 6a. The best results among the ML models were obtained by the XGBoost model in the 1422–1583 nm wavelength range, achieving an Accuracy of 94.52% and an F2-score of 92.87%. This model also has the advantage of using only 50 features for training.
Table 8 shows the classification results of the DNN models with the highest scores (1st row) and of the DNN models after completing the $hm_6$ iteration of the proposed tuning methodology (2nd row). The testing set evaluated was the same as for the ML methods.
As seen in Table 8, the highest Accuracy and F2-score are in the 900–1700 nm and 1422–1583 nm wavelength ranges. For the 1089–1325 nm, 1239–1353 nm, and 1422–1583 nm wavelength ranges, the highest Accuracy and F2-score results (1st row) are the same as those obtained after completing the tuning methodology (2nd row). However, for the 900–1700 nm wavelength range, the best Accuracy and F2-score (listed in the first row) were achieved in the $hm_4$ iteration (epochs tuned); therefore, the last 2 $hm_i$ iterations did not contribute to improving the model.
The training time for most of the experiments was higher for the DNN in comparison to the ML methods, as expected. However, this does not pose a problem for the prediction of the presence of gluten in new samples because the models are already trained, and the forward propagation time was <1 s for all the experiments. This is very positive for the implementation of a rapid gluten detection technique.
Finally, considering the ML and DL experiments, the best model was the DNN in the 1422–1583 nm wavelength range, achieving an Accuracy of 91.77% and an F2-score of 96.06% in the prediction of the presence or absence of gluten in three different types of flour.

4. Discussion and Conclusions

This study presents a rapid, innovative, budget-friendly, and accurate IoT solution to predict the presence (doped with 100 g/kg + naturally contained gluten) and the absence (0 g/kg + naturally contained gluten) of gluten in 3 different types of flour samples (rye, corn, and oats). The development of the IoT prototype required a difficult and lengthy process comprising the data collection, the development of a serverless architecture for storage and data analysis, the creation of AI models to make predictions, and the visualization of the results. The results section showed very optimistic performance metrics in the prediction of the absence or presence of gluten. The best DL model obtained 91.77% Accuracy and a 96.06% F2-score; similarly, among the ML models, the best result was obtained by the XGBoost model, achieving an Accuracy of 94.52% and an F2-score of 92.87%. Both models achieved high performance; however, we selected the DL models because we wanted to prioritize the F2-score, giving more importance to the FN. This was done with the users of the prototype in mind, for whom it is more important to identify when the prototype wrongly predicts the samples as lacking gluten (output of the DNN: zero) when they actually contain it. For instance, in a real-life scenario, the wrong prediction of the absence of gluten in the flour would cause the user to consume gluten and suffer potential adverse health effects. While the FPs are also important, the erroneous prediction of the presence of gluten does not pose a risk to the user, as he/she would simply not eat flour that is in reality gluten-free.
Regarding the proposed tuning methodology for the DNNs, one aspect to highlight is that we achieved good results while executing a low number of experiments (327) out of the total number of possible combinations (3750) of the 28 hyperparameter values evaluated. Furthermore, the total number of experiments could be reduced to 109 without repetition in the DNN training.
We cannot make a fair comparison in terms of Accuracy and F2-score between our results and those of other studies due to the differences in the evaluation metrics and analytical methods employed. While other studies make use of instrumental analyses such as chromatography or enzyme immunoassays such as ELISA, direct methodologies where the analyte (protein) is the target to be measured, in our approach (NIRS + AI) we used the data obtained from the electromagnetic spectrum emitted by the food matrix to quantify the target analyte indirectly. Nevertheless, we can highlight the differences in execution in terms of time and complexity of the experiment. Regarding the time needed to make a prediction, our IoT prototype portable solution provided optimistic results, with the DL and ML models needing less than 1 s to predict an observation. Together with the duration of the collection in the platform (measuring time + storing time), which varies between 30–60 s, the time needed to predict the presence of gluten in a new rye, corn, or oat flour sample of ≈400 mg adds up to <1 min. Regarding the training time, the best DNN model needed ≈575.81 s to complete the training. However, this will not be an issue for the final application, because the models would already be trained. In terms of the complexity of the experiment, our IoT prototype portable solution also has a great ease-of-use advantage, because the user only needs the kit (Raspberry Pi 4, DLPNIRNANOEVM sensor, 3D mechanical system, and collecting plate) and a device with an internet connection (mobile, tablet, or PC), and simply has to put the samples on the collecting plate and make a few clicks to obtain the prediction. In contrast, other analytical methods take significantly more time, counting from the sample preparation until the data recovery: around 0.5–2.5 h for ELISA [13], ≥0.5 h for HPLC [49], and >3 h for PCR [50]. Other studies have achieved very good results while decreasing the detection time. To illustrate, a recent study designed an optical nanosensor for rapid detection of the gluten content of samples containing wheat. The authors obtained determination coefficients (R²) of 0.995 for the folic acid-based carbon dots molecularly imprinted polymer (FA-CDs-MIP) and 0.903 for the FA-CDs non-imprinted polymer (FA-CDs-NIP), within a gluten detection range of 0.36 to 2.20 µM [51]. They reported a response time of less than 4 min for the fluorescent nanosensor. In another investigation, a real-time artificial intelligence-based method was employed to detect adulterated lentil flour samples that contained trace levels of wheat (gluten) or pistachios (nuts) [19]. The authors used images taken with a simple reflex NIKON camera, model D5100, to train the network, with a total of 2200 images collected in a well-lit room without any spotlight illuminating the samples. Despite the difference in the input data and food samples employed, this study is one of the most similar to ours, as it achieved an accuracy of 96.4% in the classification of wheat flour, showing the potential of AI-based detection. At first sight, the method seems to be accurate and particularly quick, with the photo capture and the prediction of the CNN networks being almost instantaneous (the authors mention seconds).
However, in contrast with our study, it raises several concerns regarding its implementation in a production environment, such as the difficult setup when using a large camera, the users involved, and the changing environmental conditions depending on the light to which the samples are exposed. Other authors conducted a multiclass classification study using ML approaches and Fourier-Transform Infrared (FTIR) spectroscopy to detect and quantify cross-contact gluten in flour [52]. The samples used were non-gluten (corn flour) and gluten-containing flours (wheat, barley, and rye), with a total of 640 samples (200 × 3 contaminated samples and 10 × 4 pure samples). The supervised classification method was Partial Least Squares Discriminant Analysis (PLSDA), achieving true positive rates (TPR) of 0.875, 0.8125, 0.9333, and 1.0 for the barley, wheat, rye, and corn flour classes, respectively. While the analytical method employed was different, the study showed optimistic results in addressing problems of multiclass classification and quantification of gluten in flour, a topic we plan to investigate in our future research. With regard to the time and resources employed, the authors stated that the study was developed in non-real time and that the samples were prepared using an FTIR spectrometer (Nicolet iS 50, Waltham, MA, USA), laboratory equipment that by its nature presents limitations, such as the need for user training and difficult transportation, among others. This highlights the value of the method employed in our study, which can be easily taken anywhere by the user and does not require any specific training. Finally, with the best results achieved in the present study (the DNN model in the 1422–1583 nm wavelength range), we improved on the results of our previous study, where we employed the Random Forest classifier [31], increasing the Accuracy by 5% and the F2-score by 9%. It is worth mentioning that the main objective of that study was to find the best wavelength ranges using feature selection techniques.
Our study has several limitations. First, it can only classify the presence or absence of gluten in samples with gluten concentrations of 0 g/kg or 100 g/kg. The IoT prototype portable solution therefore has a high chance of failure with intermediate concentrations, for example 50 g/kg, whereas other experiments can quantify gluten in samples below 100 mg/kg using highly sensitive methods, such as enzyme immunoassay techniques or instrumental analytical techniques (capillary electrophoresis, PCR, QC-PCR, RP-HPLC, LC-MS, and MALDI-TOF-MS). However, those methods also imply elevated costs and specialized training [12,13,15]. While our prototype cannot be considered low budget, with overall costs amounting to 1200 euros, it is still less expensive than other methods [53,54,55,56,57] (around 1000–17,000 euros) and does not require specialized training. The methodology employed in our study has further limitations. First, a few preliminary experiments are required to decide the order of the hyperparameters and their initial values. For example, in this study we used the tanh activation function from the hm_i iteration onwards, because it showed better Accuracy than the other activation functions. These preliminary experiments are not run to obtain the highest results; they are only used to choose the initial hyperparameters. The lack of an optimized code implementation of our proposed tuning methodology constitutes another limitation: because it ignores aspects such as vectorization and modular programming, a fair comparison with existing methods such as Random Search, Bayesian hyperparameter tuning, or Grid Search is not possible.
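For context, the sketch below shows what a standard baseline of the kind mentioned above could look like: scikit-learn's Random Search over the SVM search space of Table 3. It is a hypothetical comparison setup, not an experiment carried out in this study; X_train and y_train stand for the preprocessed spectra and their labels.

```python
# Hypothetical Random Search baseline over the Table 3 SVM search space.
from scipy.stats import loguniform
from sklearn.model_selection import RandomizedSearchCV
from sklearn.svm import SVC

param_distributions = {
    "C": loguniform(1e-6, 1e6),           # limits taken from Table 3
    "kernel": ["poly", "rbf", "sigmoid"],
    "gamma": ["scale", "auto"],
}

search = RandomizedSearchCV(
    SVC(),
    param_distributions,
    n_iter=50,            # evaluation budget comparable to the Bayesian tuner
    scoring="accuracy",
    cv=5,
    n_jobs=-1,
    random_state=42,
)
# search.fit(X_train, y_train)  # X_train, y_train: preprocessed spectra/labels
```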
As for future steps, we plan first to address a multiclass problem that classifies gluten content into different levels, and then to apply DL and ML regression to quantify the amount of gluten in the samples examined. We also intend to implement the proposed tuning methodology as a full Python framework, making it possible to validate it on different datasets and to compare its performance with existing hyperparameter tuning frameworks.
The binary classification results obtained in this study are promising for the future design of real-time IoT devices that enable rapid detection of the presence or absence of gluten and can be used by any layperson. This study thus contributes to the state of the art of NIRS + Artificial Intelligence applied to the food industry and is aligned with the requirements of Industry 4.0.

Author Contributions

Conceptualization, O.J.-B.; methodology, O.J.-B. and A.O.S.; software, O.J.-B. and A.O.S.; validation, O.J.-B., A.O.S., L.B.-L. and B.G.-Z.; formal analysis, O.J.-B., A.O.S., L.B.-L. and B.G.-Z.; investigation, O.J.-B. and A.O.S.; resources, L.B.-L.; data curation, O.J.-B. and A.O.S.; writing—original draft preparation, O.J.-B. and A.O.S.; writing—review and editing, O.J.-B., A.O.S., L.B.-L. and B.G.-Z.; visualization, O.J.-B. and A.O.S.; supervision, B.G.-Z.; project administration, L.B.-L. and B.G.-Z.; funding acquisition, L.B.-L. and B.G.-Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Department of Economic Development, Sustainability and Environment of the Basque Government, grant number KK-2021/00035 (ELKARTEK), and eVIDA research group IT1536-22.

Data Availability Statement

Data are contained within the article.

Acknowledgments

The authors would like to thank A. Olaneta and L. Zudaire from Leartiker S. Coop. for their support in the preparation and analysis of the flour samples and in the literature review, as well as Unai Sainz Lugarezaresti, Guillermo Yedra Doria, and Uxue García Ugarte from the eVIDA Lab of the University of Deusto for their support in the development of the 3D mechanical system and the data collection.

Conflicts of Interest

The authors have no conflict of interest to declare.

References

  1. Schalk, K.; Lexhaller, B.; Koehler, P.; Scherf, K. Isolation and Characterization of Gluten Protein Types from Wheat, Rye, Barley and Oats for Use as Reference Materials. PLoS ONE 2017, 12, e0172819. [Google Scholar] [CrossRef] [PubMed]
  2. Calabriso, N.; Scoditti, E.; Massaro, M.; Maffia, M.; Chieppa, M.; Laddomada, B.; Carluccio, M.A. Non-Celiac Gluten Sensitivity and Protective Role of Dietary Polyphenols. Nutrients 2022, 14, 2679. [Google Scholar] [CrossRef] [PubMed]
  3. Akhondi, H.; Ross, A.B. Gluten Associated Medical Problems. In StatPearls; StatPearls Publishing: Treasure Island, FL, USA, 2022. [Google Scholar]
  4. Klemm, N.; Gooderham, M.J.; Papp, K. Could It Be Gluten? Additional Skin Conditions Associated with Celiac Disease. Int. J. Dermatol. 2022, 61, 33–38. [Google Scholar] [CrossRef] [PubMed]
  5. Lebwohl, B.; Sanders, D.S.; Green, P.H.R. Coeliac Disease. Lancet 2018, 391, 70–81. [Google Scholar] [CrossRef] [PubMed]
  6. Guandalini, S.; Dhawan, A.; Branski, D. Textbook of Pediatric Gastroenterology, Hepatology and Nutrition: A Comprehensive Guide to Practice; Springer: Berlin/Heidelberg, Germany, 2016; ISBN 978-3-319-17168-5. [Google Scholar]
  7. Durazzo, M.; Ferro, A.; Brascugli, I.; Mattivi, S.; Fagoonee, S.; Pellicano, R. Extra-Intestinal Manifestations of Celiac Disease: What Should We Know in 2022? J. Clin. Med. 2022, 11, 258. [Google Scholar] [CrossRef] [PubMed]
  8. Seidita, A.; Mansueto, P.; Compagnoni, S.; Castellucci, D.; Soresi, M.; Chiarello, G.; Cavallo, G.; De Carlo, G.; Nigro, A.; Chiavetta, M.; et al. Anemia in Celiac Disease: Prevalence, Associated Clinical and Laboratory Features, and Persistence after Gluten-Free Diet. J. Pers. Med. 2022, 12, 1582. [Google Scholar] [CrossRef]
  9. Fasano, A.; Berti, I.; Gerarduzzi, T.; Not, T.; Colletti, R.B.; Drago, S.; Elitsur, Y.; Green, P.H.R.; Guandalini, S.; Hill, I.D.; et al. Prevalence of Celiac Disease in At-Risk and Not-At-Risk Groups in the United States: A Large Multicenter Study. Arch. Intern. Med. 2003, 163, 286–292. [Google Scholar] [CrossRef]
  10. No 828/2014; Commission Implementing Regulation (EU) No 828/2014 of 30 July 2014 on the Requirements for the Provision of Information to Consumers on the Absence or Reduced Presence of Gluten in Food. Commission Implementing Regulation (EU). Official Journal of the European Union (EU): Brussels, Belgium, 2014.
  11. Scherf, K.A.; Poms, R.E. Recent Developments in Analytical Methods for Tracing Gluten. J. Cereal Sci. 2016, 67, 112–122. [Google Scholar] [CrossRef]
  12. Panda, R.; Garber, E.A.E. Detection and Quantitation of Gluten in Fermented-Hydrolyzed Foods by Antibody-Based Methods: Challenges, Progress, and a Potential Path Forward. Front. Nutr. 2019, 6, 97. [Google Scholar] [CrossRef]
  13. Osorio, C.E.; Mejías, J.H.; Rustgi, S. Gluten Detection Methods and Their Critical Role in Assuring Safe Diets for Celiac Patients. Nutrients 2019, 11, 2920. [Google Scholar] [CrossRef]
  14. Lacorn, M.; Dubois, T.; Weiss, T.; Zimmermann, L.; Schinabeck, T.-M.; Loos-Theisen, S.; Scherf, K. Determination of Gliadin as a Measure of Gluten in Food by R5 Sandwich ELISA RIDASCREEN® Gliadin Matrix Extension: Collaborative Study 2012.01. J. AOAC Int. 2022, 105, 442–455. [Google Scholar] [CrossRef] [PubMed]
  15. Amnuaycheewa, P.; Niemann, L.; Goodman, R.E.; Baumert, J.L.; Taylor, S.L. Challenges in Gluten Analysis: A Comparison of Four Commercial Sandwich ELISA Kits. Foods 2022, 11, 706. [Google Scholar] [CrossRef] [PubMed]
  16. Panda, R. Validated Multiplex-Competitive ELISA Using Gluten-Incurred Yogurt Calibrant for the Quantitation of Wheat Gluten in Fermented Dairy Products. Anal. Bioanal. Chem. 2022, 414, 8047–8062. [Google Scholar] [CrossRef] [PubMed]
  17. Huang, X.; Ahola, H.; Daly, M.; Nitride, C.; Mills, E.C.; Sontag-Strohm, T. Quantification of Barley Contaminants in Gluten-Free Oats by Four Gluten ELISA Kits. J. Agric. Food Chem. 2022, 70, 2366–2373. [Google Scholar] [CrossRef] [PubMed]
  18. Khoury, P.; Srinivasan, R.; Kakumanu, S.; Ochoa, S.; Keswani, A.; Sparks, R.; Rider, N.L. A Framework for Augmented Intelligence in Allergy and Immunology Practice and Research—A Work Group Report of the AAAAI Health Informatics, Technology, and Education Committee. J. Allergy Clin. Immunol. Pract. 2022, 10, 1178–1188. [Google Scholar] [CrossRef] [PubMed]
  19. Pradana-López, S.; Pérez-Calabuig, A.M.; Otero, L.; Cancilla, J.C.; Torrecilla, J.S. Is My Food Safe?—AI-Based Classification of Lentil Flour Samples with Trace Levels of Gluten or Nuts. Food Chem. 2022, 386, 132832. [Google Scholar] [CrossRef]
  20. Liu, Q.; Zhang, W.; Zhang, B.; Du, C.; Wei, N.; Liang, D.; Sun, K.; Tu, K.; Peng, J.; Pan, L. Determination of Total Protein and Wet Gluten in Wheat Flour by Fourier Transform Infrared Photoacoustic Spectroscopy with Multivariate Analysis. J. Food Compos. Anal. 2022, 106, 104349. [Google Scholar] [CrossRef]
  21. Zhang, S.; Liu, S.; Shen, L.; Chen, S.; He, L.; Liu, A. Application of Near-Infrared Spectroscopy for the Nondestructive Analysis of Wheat Flour: A Review. Curr. Res. Food Sci. 2022, 5, 1305–1312. [Google Scholar] [CrossRef]
  22. De Géa Neves, M.; Poppi, R.J.; Breitkreitz, M.C. Authentication of Plant-Based Protein Powders and Classification of Adulterants as Whey, Soy Protein, and Wheat Using FT-NIR in Tandem with OC-PLS and PLS-DA Models. Food Control 2022, 132, 108489. [Google Scholar] [CrossRef]
  23. Zhang, J.; Guo, Z.; Ren, Z.; Wang, S.; Yin, X.; Zhang, D.; Wang, C.; Zheng, H.; Du, J.; Ma, C. A Non-Destructive Determination of Protein Content in Potato Flour Noodles Using near-Infrared Hyperspectral Imaging Technology. Infrared Phys. Technol. 2023, 130, 104595. [Google Scholar] [CrossRef]
  24. Netto, J.M.; Honorato, F.A.; Celso, P.G.; Pimentel, M.F. Authenticity of Almond Flour Using Handheld near Infrared Instruments and One Class Classifiers. J. Food Compos. Anal. 2023, 115, 104981. [Google Scholar] [CrossRef]
  25. Jojoa Acosta, M.F.; Caballero Tovar, L.Y.; Garcia-Zapirain, M.B.; Percybrooks, W.S. Melanoma Diagnosis Using Deep Learning Techniques on Dermatoscopic Images. BMC Med. Imaging 2021, 21, 6. [Google Scholar] [CrossRef] [PubMed]
  26. Scarpiniti, M.; Comminiello, D.; Uncini, A.; Lee, Y.-C. Deep Recurrent Neural Networks for Audio Classification in Construction Sites. In Proceedings of the 2020 28th European Signal Processing Conference (EUSIPCO), Amsterdam, The Netherlands, 18–21 January 2021; pp. 810–814. [Google Scholar]
  27. Jojoa, M.; Lazaro, E.; Garcia-Zapirain, B.; Gonzalez, M.J.; Urizar, E. The Impact of COVID 19 on University Staff and Students from Iberoamerica: Online Learning and Teaching Experience. Int. J. Environ. Res. Public Health 2021, 18, 5820. [Google Scholar] [CrossRef]
  28. Gonzalez Viejo, C.; Harris, N.M.; Fuentes, S. Quality Traits of Sourdough Bread Obtained by Novel Digital Technologies and Machine Learning Modelling. Fermentation 2022, 8, 516. [Google Scholar] [CrossRef]
  29. Sohn, S.-I.; Oh, Y.-J.; Pandian, S.; Lee, Y.-H.; Zaukuu, J.-L.Z.; Kang, H.-J.; Ryu, T.-H.; Cho, W.-S.; Cho, Y.-S.; Shin, E.-K. Identification of Amaranthus Species Using Visible-Near-Infrared (Vis-NIR) Spectroscopy and Machine Learning Methods. Remote Sens. 2021, 13, 4149. [Google Scholar] [CrossRef]
  30. Heydarov, S.; Aydin, M.; Faydaci, C.; Tuna, S.; Ozturk, S. Low-Cost VIS/NIR Range Hand-Held and Portable Photospectrometer and Evaluation of Machine Learning Algorithms for Classification Performance. Eng. Sci. Technol. Int. J. 2023, 37, 101302. [Google Scholar] [CrossRef]
  31. Jossa-Bastidas, O.; Sainz Lugarezaresti, U.; Osa Sanchez, A.; Yedra Doria, G.; Garcia-Zapirain, B. Gluten Analysis Composition Using Nir Spectroscopy and Artificial Intelligence Techniques. Telematique 2022, 21, 7487–7496. [Google Scholar]
  32. What Is a Raspberry Pi? Available online: https://www.raspberrypi.org/help/what-is-a-raspberry-pi/ (accessed on 6 March 2023).
  33. DLPNIRNANOEVM Evaluation Board|TI.Com. Available online: https://www.ti.com/tool/DLPNIRNANOEVM (accessed on 6 March 2023).
  34. MQTT—The Standard for IoT Messaging. Available online: https://mqtt.org/ (accessed on 6 March 2023).
  35. Osa Sanchez, A.; García Ugarte, U.; Gil Herrera, M.J.; Jossa-Bastidas, O.; Garcia-Zapirain, B. Design and Implementation of Food Quality System Using a Serverless Architecture: Case Study of Gluten Intolerance. Telematique 2022, 21, 7475–7486. [Google Scholar]
  36. Amazon SageMaker Pricing—Machine Learning—Amazon Web Services. Available online: https://aws.amazon.com/es/sagemaker/pricing/ (accessed on 6 March 2023).
  37. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-Learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  38. Hafdaoui, H.; Boudjelthia, E.A.K.; Chahtou, A.; Bouchakour, S.; Belhaouas, N. Analyzing the Performance of Photovoltaic Systems Using Support Vector Machine Classifier. Sustain. Energy Grids Netw. 2022, 29, 100592. [Google Scholar] [CrossRef]
  39. Rana, A.; Vaidya, P.; Gupta, G. A Comparative Study of Quantum Support Vector Machine Algorithm for Handwritten Recognition with Support Vector Machine Algorithm. Mater. Today Proc. 2022, 56, 2025–2030. [Google Scholar] [CrossRef]
  40. Pan, S.; Zheng, Z.; Guo, Z.; Luo, H. An Optimized XGBoost Method for Predicting Reservoir Porosity Using Petrophysical Logs. J. Pet. Sci. Eng. 2022, 208, 109520. [Google Scholar] [CrossRef]
  41. Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
  42. Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, MA, USA, 2016. [Google Scholar]
  43. How Hyperparameter Tuning Works—Amazon SageMaker. Available online: https://docs.aws.amazon.com/sagemaker/latest/dg/automatic-model-tuning-how-it-works.html (accessed on 6 March 2023).
  44. Du, L.; Gao, R.; Suganthan, P.N.; Wang, D.Z.W. Bayesian Optimization Based Dynamic Ensemble for Time Series Forecasting. Inf. Sci. 2022, 591, 155–175. [Google Scholar] [CrossRef]
  45. Bai, T.; Li, Y.; Shen, Y.; Zhang, X.; Zhang, W.; Cui, B. Transfer Learning for Bayesian Optimization: A Survey. arXiv 2023, arXiv:2302.05927. [Google Scholar]
  46. Garnett, R. Bayesian Optimization; Cambridge University Press: Cambridge, UK, 2023. [Google Scholar]
  47. Greff, K.; Srivastava, R.K.; Koutník, J.; Steunebrink, B.R.; Schmidhuber, J. LSTM: A Search Space Odyssey. IEEE Trans. Neural Netw. Learn. Syst. 2017, 28, 2222–2232. [Google Scholar] [CrossRef]
  48. Uzair, M.; Jamil, N. Effects of Hidden Layers on the Efficiency of Neural Networks. In Proceedings of the 2020 IEEE 23rd International Multitopic Conference (INMIC), Bahawalpur, Pakistan, 5–7 November 2020; pp. 1–6. [Google Scholar]
  49. Wesley, I.J.; Larroque, O.; Osborne, B.G.; Azudin, N.; Allen, H.; Skerritt, J.H. Measurement of Gliadin and Glutenin Content of Flour by NIR Spectroscopy. J. Cereal Sci. 2001, 34, 125–133. [Google Scholar] [CrossRef]
  50. Výrostková, J.; Regecová, I.; Zigo, F.; Marcinčák, S.; Kožárová, I.; Kováčová, M.; Bertová, D. Detection of Gluten in Gluten-Free Foods of Plant Origin. Foods 2022, 11, 2011. [Google Scholar] [CrossRef]
  51. Karamdoust, S.; Milani-Hosseini, M.-R.; Faridbod, F. Simple Detection of Gluten in Wheat-Containing Food Samples of Celiac Diets with a Novel Fluorescent Nanosensor Made of Folic Acid-Based Carbon Dots through Molecularly Imprinted Technique. Food Chem. 2023, 410, 135383. [Google Scholar] [CrossRef]
  52. Okeke, A. Fourier Transform Infrared Spectroscopy (As A Rapid Method) Coupled with Machine Learning Approaches for Detection And Quantification of Gluten Contaminations in Grain-Based Foods. Master’s Thesis, Biosystems and Agricultural Engineering, Michigan State University, East Lansing, MI, USA, 2020. [Google Scholar] [CrossRef]
  53. NIRQuest512-2.5 Spectrometer|Ocean Insight. Available online: https://www.oceaninsight.com/products/spectrometers/near-infrared/nirquest2.5/ (accessed on 7 March 2023).
  54. Solid Scanner. Available online: https://www.solidscanner.com/en/produkt/buy-solid-scanner/ (accessed on 7 March 2023).
  55. SR-4N1000-25 Spectrometer|Ocean Insight. Available online: https://www.oceaninsight.com/products/spectrometers/general-purpose-spectrometer/ocean-sr4-series-spectrometers/sr-4n1000-25/ (accessed on 7 March 2023).
  56. Microplate Readers: Multi-Mode and Absorbance Readers Products|BioTek. Available online: https://www.biotek.com/products/detection/ (accessed on 10 March 2023).
  57. AGILENT HP 1200 HPLC System—Buy at the Best Price. Available online: https://es.bimedis.com/agilent-1200-hplc-system-m400841 (accessed on 10 March 2023).
Figure 1. Software architecture, hardware, flour samples, and 3D mechanical system used. (a) Serverless architecture. (b) Flour samples, NIRS sensor, and IoT prototype portable solution.
Figure 2. Hardware and software architecture.
Figure 3. Mean of the flour samples doped with 0 g/kg (blue line) and 100 g/kg of gluten (orange line).
Figure 4. Graph procedure of the proposed tuning methodology.
Figure 5. (a–d) Accuracy, F2-score, and time vs. the C hyperparameter. (e,f) Accuracy vs. the kernel and gamma hyperparameters.
Figure 6. Hyperparameters of the best XGBoost model in the 1422–1583 nm wavelength range.
Figure 7. Graph procedure of the proposed tuning methodology and the values used.
Figure 8. Accuracy, F2-score, and Training time vs. Hidden Layers.
Figure 9. Accuracy, F2-score, and Training time vs. Optimizer.
Figure 10. Accuracy and Training time vs. Learning rate.
Figure 11. Accuracy and F2-score vs. epochs.
Figure 12. Accuracy, F2-score, and loss vs. α and β loss function parameters.
Figure 13. Summary of the proposed tuning methodology.
Table 1. The specifications and natural gluten content of the commercial flours used for the preparation of the samples.

Type of Flour | Brand | Specifications | Natural Gluten Content (ppm)
Rye | El Alcavaran | Whole. Set no. 30620184/189/21 | 33,650 ± 4413.0
Rye | El Granero | Organic production. Whole. Set no. HC091231 | 11,108 ± 409.41
Corn | APASA | Set no. 024228 | <LOD
Corn | El Granero | Organic production. Whole. Set no. HM281031 | 3.0210 ± 0.1909 *
Oat | El Alcavaran | Organic production. Whole. Set no. A-40320172-040/21 | 126.02 ± 89.095
Oat | El Granero | Organic production. Whole. Set no. HAI161231 | 13.402 ± 0.4755

LOD, limit of detection. *, <LOQ (limit of quantification, 4 ppm).
Table 2. Pseudocode of the proposed tuning methodology.

1. Input: HM, f_hm, m
2. Initialize MV = (r)
3. for hm_i in HM
   3.1. for mv_k in MV
        3.1.1. for hm_ij in hm_i
               3.1.1.1. train_DNN(mv_kj, hm_ij)
               3.1.1.2. m_val <- calculate(m)
               3.1.1.3. s[s_kj] <- set_score(m_val)
        3.1.2. (mv_11, mv_12, ..., mv_f_hmij) <- select_max_f_hmi(s_11, s_12, ..., s_1j)
   3.2. MV <- (mv_11, mv_12, ..., mv_kj); k = 0
4. Output: f_mv <- MV
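A compact Python reading of the Table 2 procedure is sketched below, under our interpretation of the notation: hyperparameters are tuned one at a time, and the winning value for each is frozen before moving on to the next. For readability, the sketch keeps only the single best value per hyperparameter (i.e., f_hm = 1), whereas the pseudocode allows the top f_hm candidates to be carried forward; train_dnn_and_score is a placeholder for training the DNN with a given configuration and returning the metric m.

```python
# A plain (unvectorized) Python reading of the Table 2 loop. Each
# hyperparameter hm_i is explored with all other values held at the
# current best configuration; the winning value is then frozen.
def sequential_tuning(hm, train_dnn_and_score, initial_values):
    """hm: ordered dict {hyperparameter name: list of candidate values};
    initial_values: starting configuration (the vector r in Table 2);
    train_dnn_and_score: trains the DNN with a configuration and
    returns the evaluation metric m (e.g., Accuracy)."""
    best = dict(initial_values)
    for name, candidates in hm.items():          # step 3: for hm_i in HM
        scored = []
        for value in candidates:                 # step 3.1.1: for hm_ij in hm_i
            config = {**best, name: value}
            score = train_dnn_and_score(config)  # steps 3.1.1.1-3.1.1.3
            scored.append((score, value))
        # steps 3.1.2/3.2: keep the best-scoring value for this hyperparameter
        best[name] = max(scored, key=lambda t: t[0])[1]
    return best                                  # step 4: output f_mv

# Candidate lists are illustrative, loosely based on the values in Table 6:
search_space = {
    "hidden_layers": [3, 4, 6, 8],
    "learning_rate": [0.01, 0.1],
    "epochs": [300, 400, 500],
}
# best = sequential_tuning(search_space, train_dnn_and_score, {...})
```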
Table 3. Search space of the SVM method.

Hyperparameter | Lower Limit | Upper Limit | Categorical Values
C | 0.000001 | 1,000,000 | -
kernel | - | - | poly, rbf, sigmoid
gamma | - | - | scale, auto
Table 4. Search space of the XGBoost method.

Hyperparameter | Lower Limit | Upper Limit
alpha | 0 | 1000
lambda | 0 | 1000
max_depth | 0 | 10
num_round | 1 | 4000
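As an illustration of how the Table 3 and Table 4 search spaces translate into code, the sketch below expresses them for a local Bayesian optimizer (scikit-optimize). The study itself used Amazon SageMaker's built-in Bayesian tuning [43], so this is an assumed local equivalent rather than the original setup.

```python
# Search spaces from Tables 3 and 4, written for scikit-optimize.
# These space objects can be passed to skopt's gp_minimize or BayesSearchCV.
from skopt.space import Categorical, Integer, Real

svm_space = [
    Real(1e-6, 1e6, prior="log-uniform", name="C"),  # Table 3 limits
    Categorical(["poly", "rbf", "sigmoid"], name="kernel"),
    Categorical(["scale", "auto"], name="gamma"),
]

xgb_space = [
    Real(0.0, 1000.0, name="alpha"),     # L1 regularization term
    Real(0.0, 1000.0, name="lambda"),    # L2 regularization term
    Integer(0, 10, name="max_depth"),
    Integer(1, 4000, name="num_round"),  # number of boosting rounds
]
```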
Table 5. Hyperparameters selected for the ML methods by applying the Bayesian optimization method.

Model | 900–1700 nm | 1089–1325 nm | 1239–1353 nm | 1422–1583 nm
SVM | C = 3732.752, gamma = scale, kernel = rbf | C = 985,957.277, gamma = scale, kernel = rbf | C = 752,630.194, gamma = auto, kernel = poly | C = 237,515.537, gamma = auto, kernel = rbf
XGBoost | alpha = 0.0, lambda = 0.0, max_depth = 6, num_round = 729 | alpha = 0.0128, lambda = 0.982, max_depth = 10, num_round = 1599 | alpha = 1.283, lambda = 60.669, max_depth = 8, num_round = 142 | alpha = 0.0, lambda = 150.466, max_depth = 3, num_round = 4000
Table 6. The hyperparameters selected after the proposed tuning methodology.

Wavelength Range | Hidden Layers | Optimizer | Learning Rate | Epochs | Activation Function | Loss Function
900–1700 nm | 6 | Adadelta | 0.01 | 400 | tanh | α = 0.175, β = 0.825
1089–1325 nm | 4 | Adadelta | 0.1 | 500 | tanh | α = 0.2, β = 0.8
1239–1353 nm | 3 | Adadelta | 0.1 | 300 | tanh | α = 0.3, β = 0.7
1422–1583 nm | 8 | Adadelta | 0.1 | 500 | tanh | α = 0.15, β = 0.85
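To show how the Table 6 configurations map onto an actual network, the sketch below builds the best 1422–1583 nm model in Keras. The hidden-layer width and the exact form of the α/β loss are not reported in the table, so both are assumptions here: the loss is modelled as a per-class weighted binary cross-entropy, with α applied to gluten-absent samples and β to gluten-present ones.

```python
# Keras sketch of the best 1422-1583 nm DNN (Table 6): 8 hidden layers,
# tanh activations, Adadelta with learning rate 0.1. The layer width (64)
# and N_FEATURES are assumptions, as is the form of the weighted loss.
import tensorflow as tf

ALPHA, BETA = 0.15, 0.85  # alpha/beta from Table 6, 1422-1583 nm row
N_FEATURES = 228          # illustrative; number of retained wavelengths

def weighted_bce(y_true, y_pred):
    # Assumed alpha/beta loss: binary cross-entropy weighted per class
    # (alpha for gluten-absent, beta for gluten-present samples).
    # y_true is expected with shape (batch, 1).
    y_true = tf.cast(y_true, tf.float32)
    bce = tf.keras.losses.binary_crossentropy(y_true, y_pred)  # (batch,)
    weights = tf.reshape(BETA * y_true + ALPHA * (1.0 - y_true), tf.shape(bce))
    return tf.reduce_mean(weights * bce)

model = tf.keras.Sequential()
model.add(tf.keras.layers.InputLayer(input_shape=(N_FEATURES,)))
for _ in range(8):  # 8 hidden layers, per Table 6
    model.add(tf.keras.layers.Dense(64, activation="tanh"))
model.add(tf.keras.layers.Dense(1, activation="sigmoid"))

model.compile(
    optimizer=tf.keras.optimizers.Adadelta(learning_rate=0.1),
    loss=weighted_bce,
    metrics=["accuracy"],
)
# model.fit(X_train, y_train, epochs=500)  # epochs from Table 6
```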
Table 7. Classification results of the ML models.

Model | 900–1700 nm | 1089–1325 nm | 1239–1353 nm | 1422–1583 nm
SVM | ACC = 0.9131, F2 = 0.9445, TT = 72 s | ACC = 0.7863, F2 = 0.8966, TT = 117 s | ACC = 0.7814, F2 = 0.8963, TT = 128 s | ACC = 0.8893, F2 = 0.8550, TT = 97 s
XGBoost | ACC = 0.7769, F2 = 0.8675, TT = 383 s | ACC = 0.7625, F2 = 0.8603, TT = 854 s | ACC = 0.5755, F2 = 0.6658, TT = 143 s | ACC = 0.9452, F2 = 0.9287, TT = 1026 s

ACC, Accuracy; F2, F2-score; TT, Training time.
Table 8. Classification results for the DNN model.

Model | 900–1700 nm | 1089–1325 nm | 1239–1353 nm | 1422–1583 nm
Higher classification results for DNN | ACC = 0.9503, F2 = 0.9447, TT = 575.8121 s | ACC = 0.7089, F2 = 0.8936, TT = 586.9577 s | ACC = 0.7020, F2 = 0.8998, TT = 582.0159 s | ACC = 0.9177, F2 = 0.9606, TT = 575.8121 s
DNN after completing the tuning methodology | ACC = 0.9064, F2 = 0.9370, TT = 521.2503 s | ACC = 0.7089, F2 = 0.8936, TT = 586.9577 s | ACC = 0.7020, F2 = 0.8998, TT = 582.0159 s | ACC = 0.9177, F2 = 0.9606, TT = 766.5958 s

ACC, Accuracy; F2, F2-score; TT, Training time.
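Finally, the reported metrics can be reproduced from test-set predictions as shown below: F2 is the F-beta score with beta = 2, which weights recall more heavily than precision, a sensible choice here since failing to flag gluten in a sample is presumably the costlier error. The y_true and y_pred vectors are placeholders.

```python
# Reproducing the Accuracy and F2-score reported in Tables 7 and 8
# from a set of test-set predictions (placeholder vectors).
from sklearn.metrics import accuracy_score, fbeta_score

y_true = [1, 0, 1, 1, 0, 1]  # ground truth: 1 = gluten present
y_pred = [1, 0, 1, 0, 0, 1]  # model output

acc = accuracy_score(y_true, y_pred)
f2 = fbeta_score(y_true, y_pred, beta=2)  # F2 = F-beta with beta = 2
print(f"Accuracy = {acc:.4f}, F2-score = {f2:.4f}")
```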
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
