3.2. Results of Characteristic Variables Extraction
In order to evaluate the performance of the proposed algorithm, the variables selected for the IVPA method are shown in
Figure 7.
The light green dots indicate the selected variables. As shown in
Figure 7, IVPA selects fewer variables, and the selected variables cover the main region of the absorption peak of i-C
4H
10. The spectra of five alkane gases at a concentration of 500 ppm were used as input variables. A predictive model was established to estimate the concentrations of the five gases based on the variables selected by the IVPA method. The prediction results are shown in
Table 1. The table indicates that IVPA shows the highest cross-sensitivity to n-C
4H
10, which is attributed to the similar molecular structures of n-C
4H
10 and i-C
4H
10, resulting in similar absorption spectral shapes. Conversely, the cross-sensitivity to CH
4 is relatively low, primarily because methane exhibits significantly weaker absorbance in the wavenumber range below 3000 cm
−1, where the variables selected for the IVPA method are concentrated. It is worth noting that IVPA has the most accurate prediction for i-C
4H
10, with a maximum cross-sensitivity of 1.02% and a minimum cross-sensitivity of 0.11% for the other four gases. These results indicate that this method can effectively extract and analyze variables with significant spectral overlap.
3.3. Quantitative Analysis Model of Gases with Distinct Absorption Peaks
For gases with distinct absorption peaks, such as CO, CO
2, SF
6, C
2H
4, C
2H
2, and C
3H
6, the distinct absorption peaks and troughs were selected for quantitative analysis. The absorbance spectra of the six gases are shown in
Figure 8. In the figure, the concentration of SF
6 is 500 ppm, the concentration of CO is 7000 ppm, the concentration of CO
2 is 5000 ppm, and the concentration of the other three gases is 3000 ppm.
It can be seen from
Figure 8 that distinct absorption peaks are present for all six gases. It can also be observed that SF
6 exhibits the highest absorbance around 1000 cm
−1. At the same concentration, its absorption coefficient is approximately one order of magnitude higher than those of the other five gases. As a result, SF
6 has the lowest detection limit among the six gases analyzed. Although SF
6 shows strong absorbance near 1000 cm
−1, this region suffers from significant spectral overlap with C
2H
4 and C
3H
6, making it unsuitable for accurate feature extraction. Fortunately, SF
6 also presents a distinct but weak absorption peak at 614 cm
−1. Therefore, the absorbance at this wavenumber was chosen as the characteristic variable for SF
6 in the quantitative model. Taking SF
6 as an example, six quantitative methods for gases with distinct absorption peaks were introduced. Three spectral points around the distinct absorption peak of SF
6 at 614 cm
−1 were selected, as shown in
Figure 9.
As can be seen from
Figure 9, the selected three spectral lines are exactly equally spaced. Therefore, the characteristic variable for SF
6, denoted as
, is defined as follows:
In Equation (3), A denotes the absorbance, and the subscript indicates the wavenumber; for example, A
614 refers to the absorbance at 614 cm
−1. As shown in Equation (3), the selected characteristic variable is not affected by baseline shifts, such as horizontal translation or linear tilt, thereby enhancing the stability and robustness of the system. This is because the distances between the three selected spectral lines are equal. When the baseline moves or tilts, the first term and the second term in Equation (3) have opposite signs and equal amplitudes, which cancel each other out. If the distances between the three selected spectral lines are not equal, the selected spectral lines are 610, 614, and 616 cm
−1, respectively. The following expression can be constructed:
To verify the performance of the eigenvariables selected for SF
6, the spectra of 11 target gases (CH
4, C
2H
6, C
3H
8, n-C
4H
10, i-C
4H
10, CO, CO
2, C
2H
4, C
2H
2, C
3H
6, SF
6) with a concentration of 500 ppm were substituted into Equation (4), and the resulting characteristic value vector is [−0.00024, −0.00009, −0.00001, −0.00017, −0.00004, 0.00033, −0.00022, −0.00025, 0.00004, −0.00016, 0.0676]. It can be seen that the characteristic variable of SF
6 has the highest sensitivity to CO, which is 0.49%. The maximum cross-sensitivity of SF
6 to the other 10 gases is less than 0.5%, which indicates that the cross-sensitivity of the selected characteristic quantity to other gases is low, so this characteristic quantity can be selected to analyze SF
6. Meanwhile, the spectra of ten single-component SF
6 with volume percentage concentrations of 5 ppm, 10 ppm, 20 ppm, 50 ppm, 100 ppm, 200 ppm, 500 ppm, 1000 ppm, 2000 ppm, and 5000 ppm were substituted into Equation (4), and the corresponding eigenvalue vectors were obtained as 0.00085, 0.0017, 0.0034, 0.0082, 0.0160, 0.0307, 0.0676, 0.1127, 0.1811, and 0.2246. At this point, curve fitting methods such as spline fitting, polynomial fitting, etc., could be used to obtain the function relationship between the characteristic quantity and concentration. The fitting results obtained using the three different methods are presented in
Table 2.
Table 2 presents the fitting results between the characteristic quantities and SF
6 concentrations obtained using three different methods. It can be seen that both the quartic polynomial and cubic spline functions provide good fitting performance, while the results from quadratic polynomial fitting are comparatively less accurate. Therefore, the quartic polynomial was selected for the quantitative analysis of SF
6, and its polynomial expression is given as follows:
The scanned sample spectrum was first used in Equation (4) to extract the characteristic value
, which was then input into Equation (5) to predict the SF
6 concentration. In addition, in the actual analysis, the corresponding concentration information could be obtained according to the characteristic quantity of each gas, and then the obtained characteristic quantity was subtracted from the product of cross-sensitivity and gas concentration (i.e., the compensation method), and then the compensated characteristic quantity was substituted into Equation (5), which could obtain a more accurate analysis result. The selected spectral lines for each gas are summarized in
Table 3.
The repeatability test results for gases with distinct absorption peasks are shown in
Table 4.
3.4. Quantitative Analysis of Gases with Severe Spectral Overlap
For gas mixtures with severely overlapping absorption spectra, such as alkanes, including CH
4, C
2H
6, C
3H
8, i-C
4H
10, and C
4H
10, the spectral data were first subjected to feature selection using the variable selection method described in
Section 2.2. Subsequently, the selected features were used to develop predictive models using four different algorithms: partial least squares (PLS), backpropagation neural network (BPNN), Support Vector Machine (SVM), and Least Squares Support Vector Machine (LSSVM). Among the models, the BP neural network model demonstrated the best predictive performance. The reproducibility test results for spectral overlap are shown in
Table 5.
As can be seen from
Table 4 and
Table 5, when the gas concentration is low, the relative standard deviation value obtained by the FTIR spectrometer is relatively large; for example, when the concentration of C
2H
2 is 5 ppm in the standard gas, the calculated relative standard deviation reaches 3.54%. This is because when the gas concentration is low, the average concentration of the gas obtained will also be very low, and the relative standard deviation is the ratio of the standard deviation to the average value, so the relative standard deviation will be larger when the concentration of the gas is measured at a lower level. The relative standard deviation tends to be larger for gases present at lower concentrations. It can also be seen that the standard deviations of the 10 gases are less than 10 ppm except for CO
2. The standard deviation for CO
2 is 77.07 ppm. This is because the standard gas has the highest concentration of CO
2 at 20,000 ppm. The value of 77.07 ppm is very small in relation to 20,000 ppm, and the relative standard deviation of CO
2 is only 0.36%. Thus, these test results indicate the excellent repeatability performance of the analyzer. In addition, the errors in the indicated values of the 11 characteristic gases are calculated, and it can be seen that the quoted errors of the 11 characteristic gases are less than 3‰, while the relative errors are less than 10 percent.
Simultaneously, a gas chromatograph (GC), used as the reference instrument, was placed in a temporary cabin approximately 50 meters away from the wellhead to ensure safety, as it employs a hydrogen flame ionization detector. The GC system was equipped with a silica capillary column (2 mm in diameter, 4 m in length), with nitrogen as the carrier gas at a flow rate of 30 mL/min. Both the injector and detector temperatures were maintained at 60 °C. The FTIR and GC instruments were connected via pipeline, allowing the gas extracted from the wellhead to first pass through the FTIR system for analysis and then be transferred to the GC for subsequent measurement.
The detection limits of the 11 characteristic gases using the FTIR spectrometer are shown in
Table 6.
Furthermore, the indication error of the FTIR analyzer was evaluated based on data collected from two coal mine field sites—Liujia Coal Mine and Fengshuigou Coal Mine in Chifeng City, Inner Mongolia. The GC results served as the reference standard. Since the GC system is capable of quantifying only eight gas species—oxygen (O2), nitrogen (N2), CH4, C2H6, C2H4, C2H2, CO, and CO2—the error analysis focused on six of these gases.
As presented in
Table 7, both the indication error and relative error of the developed online analyzer for coal spontaneous combustion characteristic gases satisfy the analytical performance criteria established at the outset of the project. Specifically, for gas concentrations ranging from 0 to 3% of the F.S., the indication error remains below 3‰ F.S.; and for concentrations between 3% and 100% F.S., the relative error is maintained under 10%.
These results not only confirm the accuracy and reliability of the proposed analytical instrument in measuring key gases associated with coal spontaneous combustion but also demonstrate its potential for effective early warning and prediction in practical mining applications.