A Novel Isomap-SVR Soft Sensor Model and Its Application in Rotary Kiln Calcination Zone Temperature Prediction

: Soft sensing technology has been proved to be an e ﬀ ective tool for the online estimation of unmeasured or variables that are di ﬃ cult to directly measure. The performance of a soft sensor depends heavily on its convergence speed and generalization ability to a great extent. Based on this idea, we propose a new soft sensor model, Isomap-SVR. First, the sample data set is divided into training set and testing set by using self-organizing map (SOM) neural network to ensure the fairness and symmetry of data segmentation. Isometric feature mapping (Isomap) method is used for dimensionality reduction of the model input data, which could not only reduce the structure complexity of the proposed model but speed up learning speed, and then the Support Vector Machine Regression (SVR) is applied to the regression model. A novel bat algorithm based on Cauchy mutation and L é vy ﬂight strategy is used to optimize parameters of Isomap and SVR to improve the accuracy of the proposed model. Finally, the model is applied to the prediction of the temperature of rotary kiln calcination zone, which is di ﬃ cult to measure directly. The simulation results show that the proposed soft sensor modeling method has higher learning speed and better generalization ability. Compared with other algorithms, this algorithm has obvious advantages and is an e ﬀ ective modeling method.


Introduction
In petrochemical, chemical, iron, and steel metallurgy and other process industries, due to economic or technical constraints, there are often some process variables that cannot be measured online or are difficult to measure using real sensors. However, these variables are usually key process parameters or quality indicators. These variables are usually determined by off-line sample analysis in the laboratory, so it is difficult to use them in the feedback signal of online control system. Online monitoring of these variables is an important guarantee to ensure safety, smooth production, efficient operation, and greater economic benefits. The main problem of quality control in the process industry is how to realize the estimation of these unpredictable variables online. Soft sensors are usually used to solve these problems. Soft sensor technology uses the idea of indirect measurement, multiple regression analysis, support vector machine, neural network, and other technical means to estimate the dominant variable through a certain correlation between the auxiliary variable and the dominant variable. It has the advantages of less investment, simple maintenance, and fast response.
Rotary kiln is the main production equipment of iron ore pellet, and the temperature of calcination zone is one of the key technical indexes to determine the quality of finished pellet. Increasing the temperature control level of calcination zone is of great significance to improving the quality of finished pellet. However, when the rotary kiln is running, Thermocouple signal transmission slip-ring is prone to poor contact and the temperature measurement signal is unstable. In addition, the thermocouple has an extremely short service life under the strong oxidation and high temperature environment. Therefore, to replace the faulty thermocouple, the rotary kiln must stop running, which seriously affects the production. Other non-contact temperature measurement methods (such as infrared temperature measurement, etc.) have extremely low accuracy under the harsh environment. Therefore, it is of great significance to establish a soft sensor model to predict the temperature of rotary kiln calcining zone. Due to the complex mechanism of pellet roasting, strong coupling, and nonlinear characteristics, conventional mechanism modeling and regression analysis methods are difficult to establish an accurate temperature measurement model. Intelligent modeling methods such as neural network, support vector machine and rough set are effective methods to solve the problem of soft measurement of key process variables in complex nonlinear systems. For soft sensor of calcining zone temperature of rotary kiln, literature [1] established a soft sensor model of calcining zone temperature of rotary kiln by using an improved limit learning machine. Literature [2] adopted a soft sensor model of calcining zone temperature of rotary kiln based on multi-model fusion technology based on least square support vector machine, and optimized the model structure with PSO algorithm. Literature [3] used t-s fuzzy neural network and model transfer method to establish a soft sensor model of calcining zone temperature of rotary kiln. Support Vector machine (SVR) is an effective tool for nonlinear modeling of complex systems. Currently, SVR algorithm has been widely used in the research field of soft sensing. Literature [4] used Gaussian SVR and cat mapping prediction, cloud model, and hybrid particle swarm optimization algorithm to establish the urban traffic flow prediction model, achieving better prediction effect than the SVR prediction method alone. Literature [5] established a soft sensor model of ethylene polymerization process by using PCA method. First, PCA method was used to reduce the dimension of sample data, and then particle swarm optimization algorithm was used to optimize the parameters of the SVR model. Literature [6] studies a time-dependent regression algorithm for online support vector machines (SVM), which overcomes the time-varying characteristics of the object. Simulation and industrial applications show that it has good accuracy. Literature [7] studied an evolutionary fuzzy SVR soft sensor model and successfully applied it to the prediction of welding residual stress. Soft sensor modeling is essentially a data-driven modeling method. Data preprocessing and model parameter selection will affect the model accuracy. The SVR algorithm has been improved and the prediction accuracy has been improved in the above literature; however, the data preprocessing and the model optimization process have not been combined, so the model accuracy still has some room for improvement. This paper proposes an Isomap-SVR soft sensing model based on improved bat algorithm optimization and applies the soft sensor model to the prediction of rotary kiln calcination zone temperature. The model uses Isomap method to reduce the dimension of the sample data. The parameter K of the Isomap and the structure of SVR model are joint optimized using an improved bat algorithm. The simulation results verify the effectiveness of the proposed method.

Pellet Production Process
Pellet production equipment is generally composed of grate, rotary kiln [8] and ring cooler. The pellets with diameters of 9~16 mm obtained in the pelleting process were preheated in the grate and then entered the rotary kiln from the kiln tail. The ball will move towards the kiln head with the rotation of the rotary kiln. The calcination flame is injected from the kiln head to the kiln tail, and the raw ball is solidified by the roasting of the high temperature flame and the action of the heat radiation in the kiln [9]. After calcination, the pellets enter the ring cooler from the kiln head for cooling, and the hot smoke generated by the ring cooler returns to the grate as a heat source for preheating the raw pellets. In normal operation of rotary kiln, the temperature of calcination belt should be controlled at 1250 • C, to ensure the quality of pellets and avoid the damage of the kiln body. The temperature Symmetry 2020, 12, 167 3 of 15 of rotary kiln calcination zone is difficult to be measured stably. Currently, the fuel quantity and other operating parameters of the system are generally adjusted by manual fire observation. As a result, the product quality is greatly affected by human factors, and the mass of finished pellet [10] fluctuates greatly.

Soft Sensor Model Structure
The soft sensor method is an effective solution to the problem of temperature prediction of rotary kiln calcination zone [11]. In this paper, based on the mechanism of rotary kiln process characteristics and heat balance [12], we select 13 process variable related to the rotary kiln calcination zone temperature which are easily detected as the soft sensor input. They are coal injection amount, chain grate machine material thickness, the chain grate machine preheating section of the hood I temperature, chain grate machine preheating section II hood temperature, kiln head temperature and temperature of preheater, ring I period of air temperature, cold machine ring cooler II period of air temperature, drum cover specific temperature, convulsions hood temperature, cold chain grate machine speed, rotating speed of rotary kiln and ring pattern in speed. The output of the soft sensor model is the temperature of calcination zone, and the temperature signal obtained when the slip-ring thermocouple temperature transmitter works steadily. The lower the data dimension, the simpler the model structure, in order to simplify the model structure and realize the joint optimization of parameters K of the Isomap algorithm and the structure SVR model, we have designed the model structure as shown in Figure 1. raw pellets. In normal operation of rotary kiln, the temperature of calcination belt should be controlled at 1250 °C, to ensure the quality of pellets and avoid the damage of the kiln body. The temperature of rotary kiln calcination zone is difficult to be measured stably. Currently, the fuel quantity and other operating parameters of the system are generally adjusted by manual fire observation. As a result, the product quality is greatly affected by human factors, and the mass of finished pellet [10] fluctuates greatly.

Soft Sensor Model Structure
The soft sensor method is an effective solution to the problem of temperature prediction of rotary kiln calcination zone [11]. In this paper, based on the mechanism of rotary kiln process characteristics and heat balance [12], we select 13 process variable related to the rotary kiln calcination zone temperature which are easily detected as the soft sensor input. They are coal injection amount, chain grate machine material thickness, the chain grate machine preheating section of the hood I temperature, chain grate machine preheating section II hood temperature, kiln head temperature and temperature of preheater, ring I period of air temperature, cold machine ring cooler II period of air temperature, drum cover specific temperature, convulsions hood temperature, cold chain grate machine speed, rotating speed of rotary kiln and ring pattern in speed. The output of the soft sensor model is the temperature of calcination zone, and the temperature signal obtained when the slip-ring thermocouple temperature transmitter works steadily. The lower the data dimension, the simpler the model structure, in order to simplify the model structure and realize the joint optimization of parameters K of the Isomap algorithm and the structure SVR model, we have designed the model structure as shown in Figure 1.

Data Dimension Reduction Processing Based on Isomap
Compared with traditional linear dimension reduction methods such as PCA [13] and MDS [14], Isomap is a globally optimized nonlinear dimension reduction algorithm. It replaces the Euclidean distance by geodesic distance, and extracts hidden nonlinear manifolds from high-dimensional data while retaining the inherent geometric characteristics of data. It is easier to achieve nonlinear

Data Dimension Reduction Processing Based on Isomap
Compared with traditional linear dimension reduction methods such as PCA [13] and MDS [14], Isomap is a globally optimized nonlinear dimension reduction algorithm. It replaces the Euclidean distance by geodesic distance, and extracts hidden nonlinear manifolds from high-dimensional data while retaining the inherent geometric characteristics of data. It is easier to achieve nonlinear reduction of high-dimensional data [15,16]. The implementation steps of Isomap [16] algorithm are as follows: Step 1: For input sample X = {x i } S i=1 , x i ∈ R p , p and S are dimension and rows of sample X, respectively. Set the neighborhood parameter K, Constructing k-Neighborhood Graph G. Calculate the Euclidean distance between each sample point and the remaining sample points D E ij (i j), if x j is one of the closet to k points of x i , E ij is an edge of G and the length is D E ij , finally the neighborhood weighting graph G is obtained.
Step 2: Calculating Geodetic Distance Matrix D M . The geodesic distance on a manifold is approximated by the shortest distance between the points, Obtain the geodetic distance matrix D M = D G , the shortest distance D G ij can be obtained by Floyd iterative algorithm [17,18].
Step 3: Construct q dimension embedding. Use MDS Method to Find Transform Matrix τ G . where: The eigenvalues of τ G are sorted in descending order. If the matrix of the eigenvectors corresponding to the previous q eigenvalues is U, the q dimension embedding result of the output is as follows: In the Isomap algorithm, the neighborhood parameter K determines the result of dimensionality reduction, and then affects the accuracy of the soft sensor model. The value of K must be selected reasonably.

SVR Algorithm
The core idea of Support Vector Machine Regression (SVR) [19] for nonlinear prediction is to find the optimal hyperplane so that the distance between sample data and the plane is the shortest. For the nonlinear sample set x i , y i N i=1 (y i is the output of x i , N is the number of samples). SVR solves the nonlinear relationship between x i and y i by mapping input vector x i into high-dimensional feature space through nonlinear mapping φ(·) and linear regression [20]. Its high-dimensional linear regression function is: ω is a high-dimensional weight vector, b is offset. If the penalty coefficient C is introduced, the relaxation factors ξ i and ξ * i , and the insensitive parameters ε, ω and b can be obtained by minimizing the objective function: The Lagrange multiplier is introduced to solve the quadratic programming problem with inequality constraints, Available: Symmetry 2020, 12, 167

of 15
In the formula: α i and α i are Lagrangian coefficients obtained by Lagrange multiplier [21] method, K x i , x j is a nonlinear kernel function, In this paper, the Gauss function [22] is chosen as the kernel function: The main parameters affecting the accuracy of SVR model are penalty coefficient C and kernel function width δ.

Bat Algorithm
Bat Algorithm (BA) is a new intelligent optimization algorithm proposed by Yang in 2010 to simulate the bat echolocation behavior. It has been successfully applied in many function optimization problems [23]. For the optimization problem that the solution variable is X = (x 1 , x 2 , · · · , x d ) T and the objective function is f (X). The implementation steps of the BA algorithm are as follows: Step 1: Initial population number M, maximum pulse loudness A 0 , maximum pulse frequency R 0 , position moving range and initial velocity V 0 of each bat, search pulse frequency range [f min , f max ], loudness attenuation factor α, frequency enhancement factor γ, maximum iteration number T and search precision ε of the algorithm. The initial position vector X 0 of each bat is randomly determined. Set the current iterations t = 1.
Step 2: Calculate the current optimal solution vector X t * according to the fitness function. Update the search pulse frequency, speed, and position of each bat. The update formulas are as follows: where β is a M dimensional random vector uniformly distributed on [0, 1], f i is an acoustic frequency.
Step 3: Generate a random parameters R rand between [0, 1]. For each bat, if R rand > R i , performs a local search, and X t i performs a random perturbation to generate a new solution: where: η is a M-dimensional random vector uniformly distributed on [−1, 1], A t is the average loudness of the current bat population.
Step 4: Generate a random parameters , accept the new solution newX t i , then update R t i and A t i according to the following formula: where: α is the loudness attenuation coefficient, γ is the frequency increase coefficient.
Step 5: If the minimum fitness meets the optimization requirement or the number of iterations is greater than T, the optimal fitness value and the optimal solution are output. Otherwise t = t + 1, and return to Step 2.
From the perspective of the algorithm itself, the basic BA achieves position update through speed update. There is no effective mechanism to escape from the local extremum. The algorithm is easy to premature and the search accuracy is poor [24][25][26].

Lévy Flight Strategy
To solve the shortcomings of the basic BA, this paper first introduces Lévy flight mechanism to improve the speed update formula of the basic BA. The 3D figure of Lévy flight process is shown in Figure 2.
Step 5: If the minimum fitness meets the optimization requirement or the number of iterations is greater than , the optimal fitness value and the optimal solution are output. Otherwise t = t + 1, and return to Step 2.
From the perspective of the algorithm itself, the basic BA achieves position update through speed update. There is no effective mechanism to escape from the local extremum. The algorithm is easy to premature and the search accuracy is poor [24][25][26].

Lévy Flight Strategy
To solve the shortcomings of the basic BA, this paper first introduces Lévy flight mechanism to improve the speed update formula of the basic BA. The 3D figure of Lévy flight process is shown in Figure 2.  From Figure 2, it can be observed that frequently small steps and occasional large step occur alternately during the process of Lévy flight, which will help the algorithm jump out of local minima.
Formula (15) is a new velocity update formula instead of Formula (10).
where X t b is the current optimal value and X t * is the global optimal value. c and µ are random parameters between [0, 1], sign() is a symbol function, symbol ⊕ is point multiplication, and random step length Lévy is from Lévy distribution: Formula (15) takes the optimal value of the current generation as one of the parameters of speed update, which can effectively reduce the attraction intensity of bats to the global optimal value. At the same time, the Lévy flight mechanism can effectively avoid the premature of the algorithm.

Cauchy Mutation Strategy
Secondly, there is no mutation mechanism in BA. At the later stage of search, almost all individuals will gather around the optimal solution. If this solution is local optimal solution, the algorithm will fall into local minimum. In this paper, adaptive mutation mechanism is introduced to avoid premature.
The specific method is that if the fitness value does not change within K (K = 3) generation, it means that the algorithm falls into local optimum, and then produces a random number ω rand between [0, 1], If ω rand < p t (Cauchy mutation probability), the individual bat carries out Cauchy mutation according to the formula (17).
The C(0, 1) in the formula is the standard Cauchy distribution. Since the Cauchy distribution is more prone to generate random numbers far from the origin than the Gaussian distribution, the range of mutation is larger, and it is easier to jump out of the local optimum. The Cauchy mutation probability is: The probability of mutation increases as the increase of iterations t, which is more conducive to overcoming the local optimum.
Finally, to overcome the shortcomings of poor local search accuracy of BA, a variable step size local search strategy is proposed. Formula (12) is modified to: where T is the max iterations of the algorithm, and t is the current iterations. As the increase of iterations, the step length of local search will be smaller and smaller, and the precision of local search will be improved continuously. In this paper, the improved bat algorithm is named IBA. The proposed IBA is evaluated on 6 typical test functions which listed in Table 1. It compared with two classical optimization algorithm (i.e., BA and Particle swarm optimization algorithm (PSO) [27]), and two other recently improved BAs (i.e., EBA [28] and dBA [29]).

Name
Function  Table 1.
The parameters settings of the algorithms used in the comparisons are listed in Table 2.
Ref. [28] ξ init = 0.6, n = 3 dBA Ref. [29] To make a fair comparison, all algorithms were tested using the same population size M = 30 and the same maximum number of iteration T = 500. For each trail, each algorithm was run 30 times independently on each test function. The convergence comparisons of all algorithms are graphically shown in Figure 3.  It can be seen from Figure 3 that the convergence speed and accuracy of the IBA algorithm are significantly better than that of BA, PSO, EBA, and dBA algorithm on f1 to f5. IBA has satisfactory performance on f6, and only loses in dBA algorithm. It indicates that the overall performance of IBA is much better than other algorithms.  It can be seen from Figure 3 that the convergence speed and accuracy of the IBA algorithm are significantly better than that of BA, PSO, EBA, and dBA algorithm on f1 to f5. IBA has satisfactory performance on f6, and only loses in dBA algorithm. It indicates that the overall performance of IBA is much better than other algorithms. It can be seen from Figure 3 that the convergence speed and accuracy of the IBA algorithm are significantly better than that of BA, PSO, EBA, and dBA algorithm on f 1 to f 5 . IBA has satisfactory performance on f 6 , and only loses in dBA algorithm. It indicates that the overall performance of IBA is much better than other algorithms.

Soft Sensor Model Optimization Based on IBA Algorithm
As mentioned above, parameter K of the Isomap algorithm, penalty coefficient C and kernel function width δ of the SVR model determines the performance of the soft sensor model. In this paper, the parameters K, C and δ are of the proposed soft sensor model are obtained by the IBA. The optimization process is as follows: Step 1: Initialize the parameters of the BA, randomly generate the initial position i of each bat, use the initial position of each bat as the parameter of the soft sensor model, apply the training data to train the SVR model, and calculate the fitness value. To ensure the robustness and fitting ability of the model and avoid overlearning, we select the determination coefficient R 2 and 5-fold cross verification accuracy Q 2 cv5 as fitness function, namely: where Step 2: Enter the BA iterative search process. An optimized search of the bat's speed and position is based on the IBA.
Step 3: Termination condition judgment: If the number of iterations is greater than the specified maximum number of iterations or the fitness value meets the requirements, the algorithm ends and outputs the optimal bat position (the optimal parameters K, C and δ). Otherwise return to Step 2 to continue the search. Figure 4 shows the overall implementation flow chart of the algorithm. Step 2: Enter the BA iterative search process. An optimized search of the bat's speed and position is based on the IBA.
Step 3: Termination condition judgment: If the number of iterations is greater than the specified maximum number of iterations or the fitness value meets the requirements, the algorithm ends and outputs the optimal bat position (the optimal parameters K, C and δ). Otherwise return to Step 2 to continue the search. Figure 4 shows the overall implementation flow chart of the algorithm.

Simulation Analysis
To verify the effectiveness of the proposed method, we collected the production data of a 2 million ton/year pellet production line. First, we use 3δ criterion to eliminate [30] the anomaly data, and then standardize them [31]. Finally, we obtain 500 sets of modeling data.
(1) Simulation process and result in an experiment

Simulation Analysis
To verify the effectiveness of the proposed method, we collected the production data of a 2 million ton/year pellet production line. First, we use 3δ criterion to eliminate [30] the anomaly data, and then standardize them [31]. Finally, we obtain 500 sets of modeling data.
(1) Simulation process and result in an experiment a. Modeling dataset classification To ensure the fairness of the model, the training data space is required to cover the testing data space as much as possible. Therefore, we use SOM neural network with clustering function [32] to divide the sample data set. First, a SOM neural network is established, the SOM neural network has 13 inputs and 16 hidden layer nodes. After 300 times of iterative training, the classification results of SOM network are shown in Figure 5. Then, we randomly select 80% of the data in each hidden node of SOM neural network as model training data and the rest as model test data. Figure 6 shows the distribution of training data, test data, and modeling data using SOM classification method in an experiment. It can be seen from Figure 5 that the space of training data basically covers that of the test data.

b. Joint Optimization of Soft Sensor Models
Parameter K of the Isomap algorithm, penalty coefficient C and kernel function width of the SVR model determines the performance of the soft sensor model. When SVR model is trained, the allowable error of the model = 1.25 (that is, the values can satisfy R 2 and Q 2 cv5 are both greater than 0.8). When using BA to optimize parameters of Isomap and SVR, the range of is [1,100], the range of is [1,100], and that of is [1,100]. The initial parameters of the BA are: Bat population Then, we randomly select 80% of the data in each hidden node of SOM neural network as model training data and the rest as model test data. Figure 6 shows the distribution of training data, test data, and modeling data using SOM classification method in an experiment. Then, we randomly select 80% of the data in each hidden node of SOM neural network as model training data and the rest as model test data. Figure 6 shows the distribution of training data, test data, and modeling data using SOM classification method in an experiment. It can be seen from Figure 5 that the space of training data basically covers that of the test data. Parameter K of the Isomap algorithm, penalty coefficient C and kernel function width δ of the SVR model determines the performance of the soft sensor model. When SVR model is trained, the allowable error of the model ε = 1.25 (that is, the values can satisfy R 2 and Q 2 cv5 are both greater than 0.8). When using BA to optimize parameters of Isomap and SVR, the range of K is [1,100], the range of C is [1,100], and that of δ is [1,100]. The initial parameters of the BA are: Bat population size M = 30, algorithm maximum search times N = 100, loudness attenuation coefficient α = 0.95, search frequency enhancement coefficient γ = 0.95, search frequency range [−2, 2], bat speed range [−5, 5], bat individual position range [1,100]. When optimizing the model, we select variables whose standard deviation of output variables of Isomap algorithm is greater than 0.05 as SVR model inputs.
After optimization, we get the following results: The best fitness value is 1.34, model fitting accuracy index R 2 = 0.887 and model robustness index   It can be seen from Figures 7 and 8 that the prediction accuracy of the proposed algorithm is significantly higher than that of the SVR model optimized by the basic Bat algorithm. The   It can be seen from Figures 7 and 8 that the prediction accuracy of the proposed algorithm is significantly higher than that of the SVR model optimized by the basic Bat algorithm. The temperature prediction error of proposed algorithm is basically distributed in [−22.5 C, 22.5 C]. In production, the measuring range of thermocouple is 0~1500 °C, and the accuracy level is 1.5, so the It can be seen from Figures 7 and 8 that the prediction accuracy of the proposed algorithm is significantly higher than that of the SVR model optimized by the basic Bat algorithm. The temperature prediction error of proposed algorithm is basically distributed in [−22.5 C, 22.5 C]. In production, the measuring range of thermocouple is 0~1500 • C, and the accuracy level is 1.5, so the maximum measurement error is 22.5. For this experiment, the prediction error of the proposed algorithm basically meets the actual measurement requirements.
(2) Statistical analysis a. Compared with other algorithms To verify the advantages of the proposed algorithm, we compared it with other algorithms (Each algorithm running 30 times independently, and SOM neural network is used to divide the sample data into training sample and test sample for each run), and the results are shown in Table 3. , the better robustness of the model. Q 2 ext represents the prediction ability of the model for independent external test samples, and the value range is [0, 1]. The larger the is Q 2 ext , the stronger external prediction ability of the model. The formula of Q 2 ext is: where V is the number of test samples; y(i) is the actual output of test samples,ŷ(i) is the model predictive output of test samples; y is the average output of training samples. Min_Q 2 ext , Max_Q 2 ext and Mean_Q 2 ext , represent the maximum, minimum, and average values of Q 2 ext of each algorithm in 30 experiments, respectively. From the statistical results in Table 1, we can draw the following conclusions: (1) For Isomap data segmentation method, the value of R 2 , Q 2 CV5 and Q 2 ext of the IBA-SVR algorithm proposed in this paper are better than those of the comparison algorithm. Therefore, the optimization performance of IBA algorithm for SVR model is better than that of other comparison algorithms.
(2) By comparing the dimensionality reduction methods of Isomap and PCA data under IBA-SVR model, we can see that the model performance of dimensionality reduction using Isomap algorithm is better. (3) The joint to optimization of Isomap and SVR model using IBA algorithm effectively improves the prediction accuracy of the model. 98.6% of the prediction results meet the accuracy requirements of level 1.5 instruments, and 97.7% of the prediction results meet the accuracy requirements of level 2.5 instruments.

b. Comparison of Data Segmentation Methods
To prove the superiority of SOM neural network in segmenting sample data, we compared it with the method of randomly segmenting under the proposed Isomap-SVR model. We have carried out 30 experiments, and the experimental results are shown in Table 4. From the statistical results in Table 4, we can draw the following conclusions: (1) Considering the fitting ability, different data segmentation methods have little influence on the fitting ability of the model. (2) Considering the robustness of the model, the model established by the method of randomly segmented data fluctuates more than that established by the method of SOM neural network. (3) Considering the external prediction ability of the model, the external prediction ability of the model established by the method of SOM neural network segmentation data is more stable.
Therefore, the SOM neural network data segmentation method is more suitable to build a stable soft sensor model of rotary kiln temperature.

Conclusions
In this paper, a novel Isomap-SVR soft sensor modeling method is presented and the calcination zone temperature of the rotary kiln is chosen for simulation. The sample data of rotary kiln system are divided into training data and test data by SOM neural network, which ensures the fairness of modeling data. An IBA is studied, and its advantages are verified by a set of representative test functions. It is used to jointly optimize parameters of Isomap and SVR models to ensure the rationality of parameters of the soft sensor model. The simulation results show that the proposed algorithm has the advantages of good fitting ability, strong robustness, and generalization ability compared with the comparison algorithms. The algorithm has a good application prospect. Applying the algorithm to the prediction of the calcination zone temperature of the rotary kiln can obtain a higher accuracy, which has an important guiding significance for the safe production and optimal operation of the pellet production process.