Estimation of Free Fatty Acids in Stored Paddy Rice Using Multiple-Kernel Support Vector Regression

: Grain quality changes during the storage period, and an important grain quality indictor is the free fatty acid (FFA) content. Understanding real-time change of FFA content in stored grain is signiﬁcant for grain storage safety. However, the FFA content requires manual detection with time-consuming and complex procedures. Thus, this paper is dedicated to developing a method to estimate FFA content in stored grain accurately. We proposed a machine learning approach—multiple-kernel support vector regression—to complete this goal, which improved the accuracy and robustness of the FFA estimation. The e ﬀ ectiveness of the proposed approach was validated by the grain storage data collected from northeast China. To show the merits of the proposed method, several prevailing prediction methods, such as single-kernel support vector regression, multiple linear regression, and back propagation neural network, were introduced for comparative purposes, and several quantitative statistical indexes were adopted to evaluate the performance of di ﬀ erent models. The results showed that the proposed approach can achieve a high accuracy with mean absolute error of 0.341 mg KOH / 100 g, root mean square error of 0.442 mg KOH / 100 g, and mean absolute percentage error of 2.026%. Among the four models tested, the multiple-kernel support vector regression model performed best and made the most robust forecasts of FFA content in stored grain.


Introduction
Food security has been a research hotspot all over the world due to its impact on the environment, economy, and society [1,2]. Approximately a third of food produced in the world is wasted every year [3]. As a result of poor grain storage management, the grain's quality and nutritional value deteriorate rapidly during storage [4]. Ensuring that the process of grain storage is of high quality is particularly important. The free fatty acid (FFA) content, a sensitivity indicator of the quality changes, is often employed as a measure of deterioration of stored grain [5,6]. In the process of grain storage, an increase of FFA content in stored grain has been observed during storage, which is attributed to the role of lipase hydrolysis [5,7,8]. This can impact the physical properties of rice in terms of its textural, flavor, composition, and eating quality [9][10][11]. Generally, FFA increases with storage period and the quality of stored grain reduces concomitantly [4,12,13]. Grain will deteriorate and become inedible when its FFA content reaches a certain standard value [7,14]. It is reasonable to use the fatty acid value as a quality index during wheat flour storage. Thus, it is of significance to study the changes in FFA accumulation, as they are directly related to the quality loss of stored grain.
In order to detect FFAs in food, researchers have developed a variety of accurate and reliable methods, such as classic titration with Fourier transform infrared spectroscopy [15] and gas chromatography [16]. In addition, a new method based on a homemade olfactory visualization sensor has been proposed to realize the quantitative determination of FFAs during rice storage [7]. The detectability of FFAs allows us to analyze changes of food quality. There have been many studies involving various grains, such as rice [6,17] and wheat [18], that have investigated the changing regularity of FFA during storage. Some mathematical equations [19,20] were developed to predict the FFA content during storage. These models estimate the FFA content of grain by establishing the relationship between FFA values and the storage period. Therefore, it is necessary to develop an efficient and rapid method to achieve an accurate prediction of FFA during storage.
In fact, the FFA content during grain storage is affected by many factors. Therefore, it is not possible to use the relationship between storage period and FFA content to predict the FFA value. Machine learning (ML) methods can help solve problems involving multiple independent variables, whether responses are linear or not. So far, the development of ML has provided a new idea for grain storage security, and has been used in many applications, such as FFA determination [21], grain storage loss analysis [22], and stored grain insect detection [23]. The successful application of ML in grain storage is of great significance to grain storage safety.
In ML, artificial neural networks (ANN) have been widely used for a variety of tasks [24,25]. However, ANN suffers from its weak generalization ability and over-fitting. On the contrary, support vector regression (SVR) has a better generalization ability and exhibits better prediction accuracy due to its implementation of the structural risk minimization principle which considers both the training error and the capacity of the regression model [26]. However, the kernel function and hyperparameters of SVR have profound impacts on the results, and it is time-consuming to determine the kernel and its hyperparameters [27]. Several researchers have proposed multiple kernel learning (MKL) to deal with these problems [28,29]. In this paper, we propose a model by means of multiple kernel support vector regression to predict the FFA content in stored grain during storage.

Study Area
Northeast China accounted for about 16-20% of China's total paddy rice output. The study sites were located in the Heilongjiang, Jilin, and Liaoning provinces of northeast China. The experimental data were collected from 7 grain reserve depots, including 3 in Heilongjiang province, 3 in Jilin province, and 1 in Liaoning province ( Figure 1).

Determination of FFA and Moisture
The FFA content of stored grain was measured by the national standard GB/T 20569-2006 (Guidelines for evaluation of paddy storage character). The fatty acids in paddy rice were extracted with anhydrous ethanol at room temperature, and then titrated with potassium hydroxide standard solution, after which the fatty acid value was calculated. The standard stipulated that each sample should be determined twice by the same inspector and the average value was taken as the result, and the difference between the two measured values should not be more than 2 mg KOH/100 g.
The moisture content of stored grain was measured by the national standard GB 5009.3-2016 (National Food Safety Standard-Determination of Moisture Content in Foods). Based on the physical properties of water in paddy rice, the weight lost during drying was determined by volatilization at 101.3 kPa and temperature 101-105 • C; then, the moisture content was calculated by weighing values before and after drying. The standard stipulated that each sample should be tested twice by the same inspector and the average value was taken as the result, and the absolute difference between the two measured values must not exceed 10% of the arithmetic mean.

Temperature Measurement System
The temperature was measured by a digital wireless monitoring system. A temperature monitoring system is shown in Figure 2. This system generally includes temperature sensors, temperature measuring cables, and a computer monitoring terminal. A set of temperature sensors were deployed in the granary. These sensors were encapsulated in cables and the cables were inserted into the grain pile at certain places. In addition, one digital temperature and humidity sensor was arranged at the central position over the grain surface in the granary to detect the temperature of the granary. The wire bus communication protocol was used between the computer monitoring terminal and the sensors to transmit the control command and report sensory data, and finally, the collected data were stored in the remote-control computer. Generally, the detection time of the grain temperature was from 09:00 a.m. to 10:00 a.m. every day, when the temperature was close to the daily mean temperature. As the temperature in the granary fluctuated little throughout the day, all data were sampled once a day.

Preliminary Analysis
During the storage process, the granary was a large time lag system, and the grain pile can form a "cold core" (Figure 3); this phenomenon was caused by the cycle of convection currents when the ambient air outside the granary was warm and the grain was cold. The grain pile can form a "warm core" in winter when the grain core was warm, and the outside edges was colder. To research the effects of temperature distribution on FFA, we selected a tall granary with a paddy rice pile that was 59 m in length, 19 m wide, and 6 m high (No. 6 Warehouse at Grain Reserve Depot of Shenyang, Liaoning, China). We took samples every three months from the end of warehousing (October 2017). An electric suction sampler was used to carry out sampling at four heights of 0, 1.9, 3.8, 5.7 m from the grain surface. Eighteen locations distributed evenly were sampled in each layer. Approximately 5 kg stored grain was sampled in each location, and each sample was poured from the sampler and individually bagged. The average moisture of the upper layer (UL), second layer (SL), third layer (TL), and lower layer (LL) of the stored grain was then be obtained, as shown in Figure 4.

Determination of FFA and Moisture
The FFA content of stored grain was measured by the national standard GB/T 20569-2006 (Guidelines for evaluation of paddy storage character). The fatty acids in paddy rice were extracted with anhydrous ethanol at room temperature, and then titrated with potassium hydroxide standard The structure of a digital wireless grain condition monitoring system. The width of the granary was generally 18-36 m and the length was 36-60 m. The headspace above the grain surface was not less than 1.8 m. The wall material was a reinforced concrete structure and the roof was made of a heat preserving material. In the granary, the temperature sensors layout was as follows: The distance between the rows and columns of the horizontal temperature measuring cables should be no more than 5 m; the distance between the vertical sensors should be no more than 2 m; and the distance from the cables to the grain surface, granary bottom, and granary wall should be within 0.3 m to 0.5 m.

Preliminary Analysis
During the storage process, the granary was a large time lag system, and the grain pile can form a "cold core" (Figure 3); this phenomenon was caused by the cycle of convection currents when the ambient air outside the granary was warm and the grain was cold. The grain pile can form a "warm core" in winter when the grain core was warm, and the outside edges was colder. To research the effects of temperature distribution on FFA, we selected a tall granary with a paddy rice pile that was 59 m in length, 19 m wide, and 6 m high (No. 6 Warehouse at Grain Reserve Depot of Shenyang, Liaoning, China). We took samples every three months from the end of warehousing (October 2017). An electric suction sampler was used to carry out sampling at four heights of 0, 1.9, 3.8, 5.7 m from the grain surface. Eighteen locations distributed evenly were sampled in each layer. Approximately 5 kg stored grain was sampled in each location, and each sample was poured from the sampler and individually bagged. The average moisture of the upper layer (UL), second layer (SL), third layer (TL), and lower layer (LL) of the stored grain was then be obtained, as shown in Figure 4. The structure of a digital wireless grain condition monitoring system. The width of the granary was generally 18-36 m and the length was 36-60 m. The headspace above the grain surface was not less than 1.8 m. The wall material was a reinforced concrete structure and the roof was made of a heat preserving material. In the granary, the temperature sensors layout was as follows: The distance between the rows and columns of the horizontal temperature measuring cables should be no more than 5 m; the distance between the vertical sensors should be no more than 2 m; and the distance from the cables to the grain surface, granary bottom, and granary wall should be within 0.3 m to 0.5 m.
Appl. Sci. 2020, 10, x FOR PEER REVIEW 5 of 16 Figure 3. Schematic diagram of "cold core" in granary.    Table 1 shows the average temperature of four layers at each three-month interval. As the depth of the grain layer increased, the average temperature reduced. Figure 4 and Table 1 show that during the storage period, the FFA in stored grain at the UL, SL, and TL increased at a rate significantly higher than that of the LL. In particular, the FFA in stored grain of the UL increased by 20.4%, while that of the LL increased by 11.3%. Furthermore, the greater the distance from the grain surface, the slower the increase of FFA in stored grain. This indicates that the storage conditions are better in the lower layers of the grain pile, that is, the lower the storage temperature, the better the conditions for long-term storage of grain.   Table 1 shows the average temperature of four layers at each three-month interval. As the depth of the grain layer increased, the average temperature reduced. Figure 4 and Table 1 show that during the storage period, the FFA in stored grain at the UL, SL, and TL increased at a rate significantly higher than that of the LL. In particular, the FFA in stored grain of the UL increased by 20.4%, while that of the LL increased by 11.3%. Furthermore, the greater the distance from the grain surface, the slower the increase of FFA in stored grain. This indicates that the storage conditions are better in the lower layers of the grain pile, that is, the lower the storage temperature, the better the conditions for long-term storage of grain.  Figure 5 shows the change in average moisture of the grain in each layer during storage. The reason for the sudden drop of moisture of stored grain in the third month was that aeration was carried out in the second month after the end of warehousing. On the whole, the moisture content in all layers decreased during storage, while the moisture in the UL and LL decreased faster than that in the SL and TL, because the UL was in direct contact with the ambient air in the headspace of the granary, and the LL was closest to the ventilation equipment. Table 1 and Figure 5 show that the LL is the region with low temperature and low moisture, while the UL is the region with high temperature and low moisture. This results in significant differences of fatty acid changes in each layer. Therefore, moisture content is related to FFA accumulation. all layers decreased during storage, while the moisture in the UL and LL decreased faster than that in the SL and TL, because the UL was in direct contact with the ambient air in the headspace of the granary, and the LL was closest to the ventilation equipment. Table 1 and Figure 5 show that the LL is the region with low temperature and low moisture, while the UL is the region with high temperature and low moisture. This results in significant differences of fatty acid changes in each layer. Therefore, moisture content is related to FFA accumulation.

Effective Accumulated Temperature of Grain Storage
From the analysis in Section 2.4, the changes of FFA content in stored grain are significantly affected by the stored temperature. This is because the temperature affects not only the types of insect present, but also the types and yields of the metabolites formed [30][31][32]. Hence, we introduce the effective accumulated temperature (EAT) of grain storage which can be calculated by where SEAT is the EAT of grain storage (°C d), d is the d th day, H is the total days (d), T0 is the threshold temperature (°C), and Td is the actual mean temperature of the day above threshold temperature (°C). Suitable storage temperature and moisture levels promote the germination and development of fungi, while the extensive activities of the fungi can produce heat and moisture, which in turn accelerates fungal growth and may lead to the emergence of other biological entities [33,34]. To calculate the accumulated temperature, we considered the minimum activity temperature of fungi in stored grain as the threshold temperature, which might be more favorable to the predicted results.
Research shows that molds can still be detected when the stored temperature is −8 °C [35], so −8 °C

Effective Accumulated Temperature of Grain Storage
From the analysis in Section 2.4, the changes of FFA content in stored grain are significantly affected by the stored temperature. This is because the temperature affects not only the types of insect present, but also the types and yields of the metabolites formed [30][31][32]. Hence, we introduce the effective accumulated temperature (EAT) of grain storage which can be calculated by where S EAT is the EAT of grain storage ( • C d), d is the d th day, H is the total days (d), T 0 is the threshold temperature ( • C), and T d is the actual mean temperature of the day above threshold temperature ( • C). Suitable storage temperature and moisture levels promote the germination and development of fungi, while the extensive activities of the fungi can produce heat and moisture, which in turn accelerates fungal growth and may lead to the emergence of other biological entities [33,34]. To calculate the accumulated temperature, we considered the minimum activity temperature of fungi in stored grain as the threshold temperature, which might be more favorable to the predicted results. Research shows that molds can still be detected when the stored temperature is −8 • C [35], so −8 • C was set as the threshold temperature in this paper. In addition, the stored grain temperature changes little over two consecutive days, so we consider the temperature measured by the sensor as T d of the day.

Support Vector Regression
Support vector machine (SVM) is an ML algorithm which is a discriminative classifier proposed for binary classification problems and is based on statistical learning theory [26,36]. Given the training set S= {(x 1 , y 1 ), . . . , (x m , y m )} of points x i ∈ d with the corresponding labels y i ∈ {−1, +1}, SVM classifiers attempt to find a classification hyperplane induced from the maximum margin principle and predicted to belong to a category based on the side of the gap on which they fall. Similar to SVM classification, the basic concept of SVM regression is to nonlinearly map the original data x into a high-dimensional feature space, and to solve a linear regression problem in this feature space. In a regression problem, each y i is the desired target, or output, value for the input vector x i . For SVR, a margin of tolerance (epsilon) is set in approximation to the SVM which would have already requested from the problem [37,38]. The regression function is expressed as: where ., . indicates the inner product of the involved arguments, w is the weight vector, φ(x) is the nonlinear mapping function, and b is the threshold. Based on the principle of minimizing the regularized risk, the objective function and constraints for SVR can be defined as follows: where C is a parameter which adjusts the tradeoff between the regression error and the regularization on f, l is the number of training patterns, ξ i ξ i ) is slack variables allowing for errors around the regression function, and ε ≥ 0 is the parameter in the ε-insensitive loss function and controls the accuracy of the regressor.
By adding Lagrangian multipliers α,α, the quadratic programming problem can be optimized as a dual problem. Then, the dual problem of Equation (3) can be written as where α = {α 1 , . . . , α l } andα = {α 1 , . . . ,α l } are the dual variables, and K(x i ,x j ) is a kernel function which represents the inner product φ(x i ), φ(x j ) . The sequential minimal optimization (SMO) [39] algorithm can be used to solve Equation (4). By solving α,α and b in Equation (4) using the KKT (Kurash-Kuhn-Tucker) condition, the regression function of Equation (1) becomes: Several kernel functions have been used successfully [37,40], and the most common of which are listed as follows: Linear kernel: Polynomial kernel: Sigmoid kernel: Appl. Sci. 2020, 10, 6555 8 of 16 Gaussian kernel:

Multiple-Kernel Support Vector Regression (MKSVR)
SVR based on the single kernel function (SKSVR) has been applied effectively in many fields [41,42]. However, when a dataset contains heterogeneous information, using a single kernel may not perform as well as expected [29,43,44]. Therefore, researchers proposed kernel fusion methods, i.e., MKL, which can help to deal with this problem [45]. MKL provides a more flexible framework than a single kernel function and enhances the interpretability of decision function [46].
The main idea of multiple kernel learning is to combine several kernel functions into a single function. One of the simplest and most common approaches is to consider convex combinations of multiple basic kernel functions: where M denotes the total number of kernels, and d m is a weight of the kernel. These weights are considered a vector of weights µ = [µ 1 , µ 2 , . . . , µ M ] T . To avoid over-fitting, we require the sum of weights to reach unity to restrict the range of the search space [47]. By referring to Equation (1), the objective function and constraints for MKSVR become: where Φ is the vector of function mappings. Similar to Equation (4), by introducing the Lagrangian, Equation (11) can be converted to the following Wolfe dual form: Appl. Sci. 2020, 10, 6555 9 of 16 The final regression estimation function is:

SimpleMKL for SVR
The SimpleMKL [48] algorithm was introduced to solve the optimization problem and has been used in many applications [27,49]. In this method, function J(d) is defined as the optimal objective value of problem of MKSVR. Due to the strong duality, J(µ) is also the objective value of the dual problem: First, by simple differentiation of the dual function (11) with respect to µ m , we have: Let q be the index of the largest element of vector µ; the differentiation of J(µ) with respect to µ q is: The descent direction D of gradients is computed by the following formula: where p is the index of an element of vector µ, which satisfies the condition that µ p > 0 and p is not the index of the largest component of µ. Then, we find the descent direction Dof gradients and update µas: where τ is the step size. Note that the stopping criterions can be performed according to the duality gap, KKT conditions, and the variation of µ between two consecutive steps or, even more simply, on a maximal number of iterations.

Performance Metrics
The performance of the model was evaluated by the measures of deviation between actual and theoretical values. The measures we used were the mean absolute error (MAE), mean absolute percentage error (MAPE), and root mean square error (RMSE). They can be computed as: where N represents the total number of FFA data which are needed to be estimated, y p (t) and y(t) are the estimated and true value of the tth sample.

Results
In this section, we perform an experiment on the prediction of FFA in stored grain to evaluate the efficacy of the proposed MKSVR approach and compare it with several ML methods, including SKSVR, multiple linear regression (MLR), and back propagation neural network (BPNN). The criteria MAE, MAPE, and RMSE were employed to evaluate model performance. All experiments were run on a Windows 10 computer with Intel 2.50 GHz, 8 GB RAM, and MATLAB R2016a environment. The experiments were repeated 20 times to evaluate the robustness of the algorithms.

Data Description and Preprocessing
Details of storage factors are shown in the Table 2. We used the first 12 storage factors as the input characteristic parameters, and the output parameter "detected FFA" is also the stored grain quality parameter to be predicted in this paper. It should be noted that all experimental data are from paddy rice during the storage in this paper. Detected FFA when sampling. mg KOH/100 g a Calculation method of average temperature: the temperatures for the ten days (include the day of sampling) prior to sampling were extracted, and then calculated the average temperature for these ten days.
As the temperature was collected by the digital wireless monitoring system, which comes from different grain reserve depots where equipment faults, weather and human factors that cause a lack or abnormality of the collected data often occur, we used the linear interpolation method [50] to complete the missing data and the Pauta criterion [51] to eliminate abnormal data. Finally, a total of 258 experimental data were obtained.
We used the "mapminmax" algorithm in MATLAB to scale inputs and targets so that they fell in the range 0-1. To avoid systematic differences between the training set and the test set, which leads to sample representativeness issues [52], we randomly selected 80% of the data as the training set, and the remaining 20% of the data as the test set.

Parameter Settings and Steps of Experiments
The results for the experiments depended on properly setting the types of the basis kernels and their parameters [53]. For SKSVR, we chose Gaussian kernel and performed the particle swarm optimization (PSO) [54] algorithm and ten-fold cross-validation procedure to find the optimal parameters for the data sets. The parameter ε was set to 0.001, the acceleration coefficients were c1 = c2 = 2, the population size was 30, the inertia weight was 0.9, the maximum number of iterations was 100, and parameters C and σ were forced to lie in the following intervals: C ∈ [10 0 , 10 3 ], σ ∈ [10 −2 , 10 2 ]. For the BPNN, the structural parameter was 10-15-1, where the number of the input layer neurons was 10, the number of hidden layer neurons was 15, the number of output layer neurons was 1, the learning rate was 0.01, and it iterated 5000 times. For MKSVR, we used the SimpleMKL Toolbox (http://asi.insa-rouen.fr/enseignants/~arakoto/code/mklindex.html) to implement algorithms, and the multiple kernel function was composed of 32 different basis kernels including 28 Gaussian kernels with parameter σ ∈ {0.01, 0.02, . . . , 0.09, 0.1, 0.2, . . . , 0.9, 1, 2, . . . , 9, 10} and 4 polynomial kernels with parameter s ∈ {1, 2, 3, 4}.
The detailed steps of the procedure we followed are listed as follows: Step 1: Import the pre-prepared dataset of grain storage parameters.
Step 3: Train the MLR model, ANN model, SKSVR model (by PSO algorithm), and MKSVR model (by SimpleMKL algorithm) with the training set.
Step 4: Use the test set to test the performances of the MLR model, ANN model, SKSVR model, and MKSVR model.
Step 5: Repeat steps 2-5 until the number of experiment repetitions is reached (20 times).

Comparison Results
Our experimental results from 20 experiments are shown in Table 3. The average values of MAE, MAPE, and RMSE for each algorithm were computed and are shown with the form of average ± standard deviation (SD). For each criterion, the algorithm with the minimum average and SD is considered to be the best on this criterion. criterion of BPNN were slightly smaller than SVR, the SD of each criterion of BPNN was higher than that of SVR. By considering both the average and the SD of each criterion, the performance of SKSVR and BPNN are very similar, and are at an acceptable level. However, the MLR model shows extremely poor performance for each criterion.
By analyzing Table 2, we find that the MKSVR prediction model is superior to the SKSVR and BPNN models, as well as the MLR models, and it can better predict the FFA content of grain during storage. As can be seen from the SD, the prediction model also has better accuracy and robustness.

Discussion
In China, grains are usually stored for 3-5 years after harvest. During the period of storage, the quality of the grains will decline over time. Grain quality is affected by many physical and biological factors during storage [33,55]. An increase of FFA content in stored grain has been observed during storage, which is related to the temperature distribution-the higher the storage temperature, the faster the fatty acid value increases [5,56]. Figure 6 shows the temperature distribution of a cross section of the stored grain on 26 July 2018 in Section 2.4. It can be clearly seen that a "cold core" similar to that in Figure 3 is formed inside the stored grain pile, which leads to significant differences in FFA content. In view of the effect of temperature on FFA accumulation, we included the effective accumulated temperature of grain storage as the input feature of ML, for the reason that the effect of temperature on FFA accumulation can be considered as reducing the error of the prediction results. The theory of accumulated temperature has been widely used in many fields [57][58][59]. The value of threshold temperature has a crucial significance [58,60], so we take the developmental zero temperature of the biological entities including insects and molds inside the stored grain as the threshold temperature in this paper.

MKSVR
0.341 ± 0.038 2.026 ± 0.237 0.442 ± 0.037 As can be seen in Table 3, the MKSVR model proposed in this paper achieves the best results. Compared with the MLR, BPNN, and SKSVR, the average predicting errors of MKSVR were improved. Compared with the MLR model, MAE dropped by 1.018, MAPE dropped by 5.569%, and RMSE dropped by 1.260. Compared with the BPNN model, MAE decreased by 0.090, MAPE decreased by 0.505%, and RMSE decreased by 0.132. Compared with the SKSVR model, MAE decreased by 0.134, MAPE decreased by 0.778%, and RMSE decreased by 0.132. Although the average values of each criterion of BPNN were slightly smaller than SVR, the SD of each criterion of BPNN was higher than that of SVR. By considering both the average and the SD of each criterion, the performance of SKSVR and BPNN are very similar, and are at an acceptable level. However, the MLR model shows extremely poor performance for each criterion.
By analyzing Table 2, we find that the MKSVR prediction model is superior to the SKSVR and BPNN models, as well as the MLR models, and it can better predict the FFA content of grain during storage. As can be seen from the SD, the prediction model also has better accuracy and robustness.

Discussion
In China, grains are usually stored for 3-5 years after harvest. During the period of storage, the quality of the grains will decline over time. Grain quality is affected by many physical and biological factors during storage [33,55]. An increase of FFA content in stored grain has been observed during storage, which is related to the temperature distribution-the higher the storage temperature, the faster the fatty acid value increases [5,56]. Figure 6 shows the temperature distribution of a cross section of the stored grain on 26 July 2018 in Section 2.4. It can be clearly seen that a "cold core" similar to that in Figure 3 is formed inside the stored grain pile, which leads to significant differences in FFA content. In view of the effect of temperature on FFA accumulation, we included the effective accumulated temperature of grain storage as the input feature of ML, for the reason that the effect of temperature on FFA accumulation can be considered as reducing the error of the prediction results. The theory of accumulated temperature has been widely used in many fields [57][58][59]. The value of threshold temperature has a crucial significance [58,60], so we take the developmental zero temperature of the biological entities including insects and molds inside the stored grain as the threshold temperature in this paper.  In fact, we obtained little data of the FFA content in stored grain because of the sparsity of sample locations, which are not representative of all stored grain in a granary. In such cases, temperature sensors arranged inside the grain pile are invaluable assets for the estimation of grain quality. Generally, the number of temperature sensors is much larger than that of sample locations. For example, a total of 417 sensors were arranged (one temperature and humidity sensor, and 416 temperature sensors); however, there were only 72 locations when sampling. Therefore, a wider range of grain quality monitoring can be achieved through the temperature sensors, and continuous monitoring can be conducted. Sampling consumes labor, and the subsequent quality determination still requires a lot of manpower and material resources.
Prior to this, researchers have established mathematical regression models between FFA and storage time for FFA estimation during grain storage [19,20], but these models are not very applicable once storage conditions change. To solve the problem associated with traditional mathematical methods not being able to fully reveal the essential characteristics of the grain during storage, our aim was to train the storage data set using ML to predict the FFA in stored grain. This work, of course, depends on the data set collected, that is, if the data set is larger, we typically obtain a better model in the case of selecting input features appropriately; then, we can obtain a good prediction effect on any given storage factors. The results from this paper demonstrate the applicability of the MKSVR model in realizing the real-time monitoring of grain quality changes, which can reduce any economic losses caused by grain quality reductions.

Conclusions
Here, we proposed an MKSVR approach for FFA estimation during grain storage. To validate the performance of MKSVR, ANN, MLR, and SKSVR were also applied to the FFA estimation. The performance of models was assessed by MAE, RMSE, and MAPE. All of the experiments were conducted using real data. The experimental results showed that the proposed MKSVR model outperformed the other models when estimating FFA content in stored grain. The smallest mean value of MAE was 0.341 mg KOH/100 g, the smallest RMSE was 0.442 mg KOH/100 g, and the smallest MAPE was 2.026%. The proposed MKSVR method integrates linear and nonlinear relations between FFA content and storage factors, effectively improving the predictive power and robustness of the model. Specifically, it provides technical support for the precise control of the stored grain quality and also plays a certain role in the management of grain storage.
In further study, there is a need to formulate a model that is more practical to suit the full range of grain types and qualities. This requires a larger number of samples and a greater number of independent variables in the analysis. We will collect more grain storage data, such as weather conditions, aeration, and humidity, and increase the number of input feature parameters of the model to improve its accuracy. We will analyze the influence of the collected characteristic parameters on fatty acid accumulation, then select characteristic parameters with a significant influence to retrain the model to try to improve the models.