1. Introduction
Boilers are the primary heat supply equipment in thermal power plants and play a crucial role in power generation [
1,
2]. The combustion process inside the furnace is complex and highly dynamic [
3], and is considered one of the most intricate physical processes [
4]. The stability of fuel combustion directly affects the safe operation of the entire system. Instabilities in the combustion process can lead to increased pollutant emissions, reduced combustion efficiency, and, in severe cases, even boiler explosions [
5,
6]. In practical engineering applications, the temperature field within the furnace serves as a direct indicator of the combustion state. Therefore, accurately reconstructing the furnace temperature field is essential to optimize combustion control, reduce pollutant emissions, and improve the operational reliability of boiler systems.
During the normal operation of a boiler, the furnace interior is typically subjected to a high-temperature environment. Due to the implementation of cooling measures, such as water-cooled walls in most boilers, the thermal distribution varies significantly across different regions, influenced by factors such as burner arrangement, furnace geometry, and water-cooling design. In certain designs, the central region exhibits relatively uniform temperatures, while areas farther from the flame or subjected to enhanced cooling by water-cooled tubes experience a sharp temperature drop. However, for different types of boilers, the temperature distribution within the furnace does not necessarily follow the pattern of a “uniform center with a decreasing periphery”, but rather varies according to the specific design and operational requirements of the boiler. To monitor the complex thermal environment inside the furnace, non-contact temperature measurement techniques have become a research focus [
7,
8]. These methods include laser thermometry, infrared thermometry, and acoustic thermometry [
9,
10], all of which allow temperature measurements in high-temperature and harsh environments without disturbing the target temperature distribution. Among these, acoustic thermometry has garnered widespread attention in recent years for furnace temperature monitoring and reconstruction due to its wide measurement range and low cost [
11,
12,
13].
Acoustic-based temperature field reconstruction typically involves two key components: time-of-flight (ToF) measurement and temperature field reconstruction algorithms. ToF data are typically collected by various acoustic transceiver sensors and processed using time-domain, frequency-domain, or hybrid-domain methods [
14]. Once reliable ToF data are obtained, the next critical step is to address the ill-posed problem through reconstruction algorithms to achieve a stable and reliable solution. The prevailing temperature field reconstruction algorithms can be categorized into non-iterative and iterative methods. Non-iterative methods primarily include the truncated singular value decomposition (TSVD) [
15,
16,
17], Tikhonov regularization [
18,
19,
20], and the least squares method (LSM) [
21]. TSVD achieves a stable numerical solution by truncating singular values close to zero; however, as singular values continue to decrease, the stability of the solution deteriorates. The Tikhonov regularization method is well suited for solving inverse problems, but its reconstructed distributions tend to be overly smooth, leading to blurred edge details. LSM can only produce meaningful solutions when the number of grid points is smaller than the number of measurement paths, and its reconstruction results are often incomplete. Iterative methods mainly include the algebraic reconstruction technique (ART) [
2,
22,
23], the simultaneous iterative reconstruction technique (SIRT) [
24,
25], the simultaneous algebraic reconstruction technique (SART) [
26,
27,
28,
29], and the Landweber iteration method [
30,
31]. ART is computationally simple and easy to implement, but it suffers from low accuracy and robustness of the reconstruction. SIRT requires a longer iteration time, whereas SART offers greater robustness compared to ART and achieves faster convergence. The Landweber method is the steepest descent iterative algorithm; however, it is prone to the issue of semi-convergence.
Existing temperature field reconstruction algorithms have made significant progress in reducing computational errors and improving reconstruction accuracy. However, cumulative errors and amplification effects persist, making it challenging for traditional methods to completely overcome their inherent limitations. Conventional algorithms primarily enhance numerical stability by optimizing the condition number of matrices, yet the final solution still exhibits unavoidable biases. Furthermore, block numerical methods introduce errors at each computational step, further constraining overall accuracy. The accuracy and robustness of temperature field reconstruction largely depend on parameter selection, such as the shape parameter of radial basis functions and the regularization parameter used to address ill-posed problems. These parameters typically require prior knowledge and extensive numerical experiments for tuning, thereby increasing the complexity of the algorithm and limiting its generalizability across different application scenarios.
In recent years, machine learning techniques have been widely applied to temperature field reconstruction to overcome the inherent limitations of traditional numerical methods. Among these, convolutional neural networks (CNNs) have emerged as one of the most commonly adopted architectures, with numerous reconstruction approaches built upon CNN-based frameworks. The temperature field residual correction network (TRCN) effectively mitigates solution bias in conventional reconstruction algorithms by employing residual learning and nonlinear error prediction mechanisms, thereby enhancing reconstruction accuracy [
32]. Similarly, the fast response temperature field reconstruction network (FTRN) [
33] leverages a CNN architecture for rapid temperature field estimation. Despite their popularity, CNNs face notable challenges when applied to time-of-flight (ToF) data, which inherently exhibit sequential characteristics. To adapt ToF sequences to the fixed input format expected by CNNs, the FTRN model introduces a preprocessing step that reshapes the sequence into a matrix. This transformation can introduce computational redundancy and may disrupt the original temporal structure of the data [
34]. ToF data encode both temporal and spatial dependencies across multiple acoustic transmission paths. For example, in multipath acoustic propagation, certain paths may exhibit temporal correlations or thermoacoustic coupling, which require the modeling of long-range dependencies. However, conventional CNNs operate with fixed local receptive fields, making them inadequate for capturing such long-range contextual information—especially in long sequences—without significantly increasing network depth or employing dilated convolutions [
35,
36]. Moreover, CNNs generally assume structured two- or three-dimensional input formats; reshaping sequential ToF data to fit these formats may cause input–structure mismatches, resulting in feature redundancy or loss of essential sequential relationships [
34]. These limitations are particularly critical in furnace environments, where complex thermoacoustic propagation demands the preservation of original physical dependencies across paths. In addition, standard convolutional kernels remain static during training, limiting their adaptability to dynamic spatiotemporal patterns. As demonstrated by dynamic filter networks, failure to adapt convolutional kernels to varying input distributions may result in suboptimal modeling of temporal variations in highly dynamic sequences [
37].
Beyond CNN-based models, other machine learning techniques have also demonstrated effectiveness. Methods based on the Kernel Extreme Learning Machine (KELM) [
10] have been employed for high-resolution temperature field reconstruction, utilizing kernel-based learning to minimize reconstruction errors and preserve fine-grained structural details. Furthermore, Transformer-based architectures have shown considerable promise. The deep transfer operator learning method [
38], which incorporates Transformer blocks, has achieved impressive results in reconstructing temperature fields of lithium-ion battery packs. By integrating physics-informed neural networks and domain adaptation strategies, these models enhance reconstruction accuracy and generalizability while ensuring adherence to physical laws. Their ability to extract domain-invariant features enables robust and accurate reconstruction, even in previously unseen scenarios, thereby underscoring the potential of advanced neural architectures for complex temperature field modeling tasks.
However, the aforementioned traditional methods still exhibit the following limitations in global information modeling and edge temperature capture:
Cumulative errors and amplification effects can lead to distortions in reconstruction results under extreme conditions.
Parameter selection, such as regularization terms and basis function shape parameters, often relies on prior knowledge and extensive numerical experiments, resulting in limited generalization ability.
Given the dynamic and highly nonlinear characteristics of the furnace temperature field, balancing global semantic information and local detail representation remains a bottleneck that existing algorithms struggle to overcome.
Existing CNN-based models suffer from inherent limitations in capturing long-range dependencies across acoustic paths in ToF data due to their restricted local receptive fields. Although stacking multiple layers or employing dilated convolutions can extend the receptive field to some extent, these strategies significantly increase model complexity and still fall short of effectively modeling the global correlations among multiple propagation paths.
The inconsistency between the input acoustic ToF data and the output reconstructed temperature field distribution poses a bottleneck for machine learning models in data processing and feature distribution.
To address the limitations in existing temperature field reconstruction methods, including the difficulty of eliminating cumulative errors and amplification effects, high dependency on parameter selection, and insufficient capture of global information and edge temperature characteristics, this paper proposes a temperature reconstruction network (ATTRN) based on an acoustic information encoder (AIE) and a temperature field reconstruction decoder (TFRD). The acoustic information encoder adopts a Transformer architecture to model long-range dependencies, effectively encoding ToF data collected by acoustic transceivers to ensure a comprehensive representation of acoustic propagation information. Unlike RNNs or LSTMs, which are used primarily to predict the evolution of sequential data, our task aims to reconstruct image-like spatial temperature fields. RNN-based models process data sequentially and are limited in capturing global dependencies across multiple acoustic paths. Moreover, they are prone to vanishing gradients when dealing with long sequences. In contrast, the Transformer architecture handles the entire ToF sequence in parallel and uses self-attention to model global cross-path interactions effectively, which is essential for accurate spatial reconstruction. The temperature field reconstruction decoder utilizes a CNN structure, focusing on optimizing local details and enhancing spatial feature representation, thereby improving the accuracy of reconstructed temperature fields. This hybrid architecture explicitly matches the dual nature of the problem: the Transformer models the global structure and dependencies of ToF inputs, while the CNN preserves and enhances local gradient patterns essential for accurate edge temperature reconstruction. Moreover, by avoiding redundant matrix transformations and sequential bottlenecks, the Transformer achieves efficient information aggregation across distant time steps, while the CNN decoder ensures efficient local refinement with low computational overhead. Although some minor information loss may occur in convolutional preprocessing, the deep multi-scale structure of the CNN compensates for it through hierarchical feature recovery. By integrating these two architectures, the encoder precisely captures and characterizes acoustic propagation properties, while the decoder accurately reconstructs the temperature distribution. Their synergistic effect enables a more complete restoration of the temperature field. Ultimately, this structure enhances global feature extraction, improves the reconstruction accuracy of edge temperatures, and increases the robustness of the reconstruction process, allowing it to adapt to complex temperature field distributions and achieve more precise and stable temperature reconstruction.
The contributions of this paper are as follows:
The designed AIE module fully utilizes the global information of ToF data and works in conjunction with the TFRD module to achieve the transformation from ToF sequence input to temperature field distribution output.
The proposed ATTRN model, which integrates the AIE and TFRD architecture, significantly improves reconstruction accuracy and demonstrates excellent generalization ability, overcoming the limitations of traditional methods constrained by limited information availability, making it applicable to more complex scenarios.
The model enhances computational robustness and stability, making it suitable for various combustion conditions, thus facilitating efficient and stable temperature field reconstruction.
Further validation of the algorithm’s effectiveness was conducted using data from the boiler at Chongqing Qineng Power Plant, confirming the industrial practicality and applicability of the proposed approach.
In the implementation, the acoustic information encoder encodes the ToF data sequence and facilitates cross-path information interaction. A token encoding mechanism is adopted, where information blocks are first encoded and then processed by the Transformer to optimize feature representation. This structure allows the encoder to directly capture long-range dependencies between acoustic paths, without relying on recurrence or manual feature fusion, which improves both scalability and stability. The temperature field reconstruction decoder is responsible for feature upsampling, enhancing spatial details, and generating high-precision temperature field distributions. Its convolutional design is particularly effective for retaining spatial locality, capturing edge gradients, and restoring fine-scale structures critical for accurate temperature field reconstruction. By integrating machine learning with acoustic thermometry, the proposed method outperforms traditional LSM-based approaches and the CNN-based method in terms of reconstruction accuracy, computational stability, and robustness to temperature variations, offering a novel perspective for industrial furnace temperature field reconstruction.
3. Experiments and Results
3.1. Dataset Construction
Function-based temperature field simulation methods are widely adopted in the literature [
39,
40,
41]. The dataset used to train ATTRN and compare it with other methods is constructed based on the Acoustic Temperature Field Reconstruction Simulation Dataset (ATFRSD) from [
32]. In this study, ATFRSD-96 and ATFRSD-54 are generated according to different sensor layouts and boundary conditions. Specifically, ATFRSD-96 consists of data collected from 16 boundary sensors, forming 96 propagation paths, while ATFRSD-54 comprises data from 12 boundary sensors, forming 54 propagation paths, as illustrated in
Figure 4.
In practical engineering applications, furnace chambers typically incorporate water-cooled wall designs, leading to a rapid attenuation of the thermal gradient at the temperature field edges. To account for this characteristic, this study additionally constructs datasets that include the effects of water-cooled boundaries, named ATFRSD-96W and ATFRSD-54W, respectively. The scale and structure of these datasets remain consistent with their non-water-cooled counterparts. Each dataset comprises a total of 6400 samples, of which 5760 are allocated for training and 640 are reserved for testing, with all samples selected using stratified sampling. Each sample includes ToF measurement data along with the corresponding temperature field distribution, ensuring a comprehensive representation of temporal and spatial features to support model training and evaluation.
The generation of simulated temperature field data follows a controlled numerical modeling process. The initialization of each temperature field is given by:
where
is randomly sampled within a predefined range. To introduce spatial variability, a random number of elliptical heat sources and cold sources are superimposed on the background temperature field. Each heat or cold source
k contributes an additional Gaussian distribution term:
where the negative sign represents cold sources, determined by a specific probability range. The parameters
are determined through random sampling to ensure diversity in position, spatial extent, and amplitude.
For datasets incorporating water-cooled walls, a boundary cooling mechanism is adopted to simulate the rapid heat dissipation process at the furnace edges in a discretized manner:
where
includes the cooling coefficient
, the wall temperature
, and the boundary width parameter
. These adjustments ensure the formation of a steep thermal gradient near the boundary.
where
k is a constant with a value of 20.03. Therefore, the total ToF value for each path is obtained by accumulating the propagation time of all segments.
In each synthesized sample, the ToF is calculated along the propagation paths from boundary to boundary. A path from
to
is discretized into multiple segments, where the local acoustic velocity of each segment is determined by the following. The governing equations involved in the dataset generation process are detailed in Equations (
10)–(
13).
Table 1 and
Table 2 summarize the primary randomization ranges of these parameters. By incorporating randomized background temperatures, multiple superimposed heat/cold sources, and water-cooled wall effects, the dataset ultimately comprises 6400 training samples, covering a wide range of temperature field configurations. Each sample consists of the following two components:
The final temperature field distribution has a resolution of 64 × 64.
The corresponding ToF vector (with a dimension of 54 or 96, depending on the sensor layout).
It is important to note that the parameter ranges are deliberately set to be relatively broad. This design choice aims to enhance the information extraction capability and generalization ability of the neural network. Since the dataset is generated based on simple exponential functions, there is a potential risk of overfitting—where the neural network may learn specific parameters of the heat sources rather than capturing the overall variations in the temperature distribution. By expanding the parameter range, we ensure that the shapes of heat sources within the temperature fields are not highly uniform, thereby forcing the encoder to extract local temperature variations more effectively rather than merely memorizing the parameters of the generating function. This strategy further improves the network’s performance on real-world data.
3.2. Experimental Indicators
To better assess the precision of the reconstruction, we introduce the following performance metrics [
32]:
where
(unit: °C) represents the maximum absolute error in the measurement field, while
denotes the root mean square error (RMSE) of the measurement field. The error calculation is conducted over a region containing
n temperature sampling points, where
. In this formulation,
represents the temperature value of the
i-th sampling point in the temperature field model, with
, while
represents the reconstructed temperature value at the same location after applying a given algorithm.
is the mean temperature of the temperature field model. To compare the generalizability of different algorithms, the comparison involves evaluating the computational results of each algorithm on real datasets, using the mean error indicators
and
. Additionally, due to the potential loss of edge information in the LSM, we further introduce edge error, central error, and global error to provide a more comprehensive assessment of algorithm performance.
3.3. Robustness Evaluation of the ATTRN Framework
To assess the robustness of the ATTRN framework, we conducted experiments to analyze the impact of noise on ToF measurements. A detailed noise impact analysis indicates that the probability distribution of combustion noise within the furnace typically follows a Gaussian (GS) distribution. Based on the methods described in [
42,
43], we introduced additive Gaussian noise into the ToF data, with a mean value of 0 and standard deviations of 0.00005 s (low noise), 0.0001 s (medium noise), and 0.0002 s (high noise).
Subsequently, robustness tests were conducted under two different sensor configurations. In one configuration, 16 sensors were deployed along the boundary of the region, corresponding to 96 propagation paths. In the other configuration, 12 boundary sensors were used, corresponding to 54 propagation paths. Both sensor configurations included water-cooled walls to simulate the steep temperature gradient at the boundaries.
As illustrated in
Figure 5 and
Figure 6, the experimental results demonstrate the performance of ATTRN under varying noise levels. In the case of 54 propagation paths (
Figure 5), the central error of ATTRN increases from 1.7% to 3.9%, the edge error rises from 3.6% to 5.6%, and the global error increases from 2.6% to 4.6% as the noise level increases from 0 to 0.0002 s. This corresponds to an approximate error increase of 2.0%. In addition, the central error of the LSM-based approaches generally increases from 3.3% to 6.0%, with the LSM_RS rising from 3.7% to 6.9%. Regarding edge error, LSM_MK increases from 5.5% to 9.0%, LSM_IMQ and LSM_IQ increase to approximately 10.0%, while LSM_MQ and LSM_RS increase from around 9.0% to 12.5%. For global error, LSM_IQ rises from 4.1% to 6.8%, LSM_IMQ and LSM_IK from 4.9% to 7.2%, and LSM_MQ and LSM_RS from 6.5% to 9.5%. These results indicate that the LSM-based experiences an average increase in error of approximately 3–3.5% as noise increases to 0.0002 s. The CNN-based method shows a central error increase from 2.1% to 9.5%, edge error from 5.9% to 8.8%, and global error from 4.1% to 9.1%, reflecting an overall error increase of approximately 4–7.5%. For the 96-path configuration (
Figure 6), the central error of ATTRN increases from 1.1% to 4.4%, edge error from 3.4% to 6.1%, and global error from 2.2% to 4.1%, corresponding to an increase of approximately 2.0–3.0% as noise increases. Meanwhile, the central error of LSM and LSM_MK increases from 1.8% to 5.2%, while the other four LSM-based methods increase from 2.7% to 6.9%. In terms of edge error, LSM_MK increases from 4.5% to 7.7%, LSM_IMQ and LSM_IQ increase to approximately 8.1%, LSM_MQ from around 6.0% to 9.1%, and LSM_RS from 6.0% to 11.1%. For global error, except LSM_IQ which increases from 4.8% to 6.8%, other LSM-based approaches show an average increase of approximately 3.5%. These results suggest that the LSM-based approaches experience an error increase of about 3.4–5.1% as the noise level reaches 0.0002 s. For the CNN-based method, the central error increases from 2.6% to 9.7%, edge error from 5.0% to 9.8%, and global error from 3.7% to 9.6%, corresponding to an overall error increase of approximately 4.8–6.9%.
In summary, the results presented in
Figure 5 and
Figure 6 indicate that the ATTRN framework consistently outperforms both traditional LSM-based approaches and CNN-based methods in temperature field reconstruction under various levels of noise interference. This advantage primarily stems from the synergistic interaction between AIE and TFRD. Firstly, the Transformer architecture within AIE effectively models long-range dependencies, enabling it to extract global features even in the presence of noise perturbations in ToF data. This capability mitigates the impact of noise on the overall data trend. Additionally, the CNN-based TFRD enhances spatial detail recovery through multi-scale feature extraction, allowing it to efficiently filter small-scale noise interference. Moreover, the end-to-end learning framework enables ATTRN to adaptively adjust feature weights, ensuring strong robustness across varying noise levels. Compared to traditional methods, the ATTRN framework, when trained under noiseless conditions, achieves a significant reduction in temperature reconstruction error by approximately 2% to 6% under noiseless, low-noise, medium-noise, and high-noise conditions. Furthermore, as illustrated in
Figure 5 and
Figure 6, training the ATTRN framework under low-noise conditions enhances its noise resistance, demonstrating the inherent robustness of neural networks. This finding underscores ATTRN’s superior capability in handling noisy ToF measurement data, further highlighting its enhanced robustness and reliability in practical applications.
3.4. Comparison Experiment
To comprehensively compare the performance of different algorithms, we conducted experiments under the ATTRN framework, incorporating multiple LSM-based approaches and CNN-based methods. The coefficient matrix in the LSM formulation is of full rank, and therefore, the optimization problem can be solved directly without the need for iterative methods. These methods were evaluated on the ATFRSD-54, ATFRSD-54w, ATFRSD-96, and ATFRSD-96W datasets to assess their effectiveness in temperature field reconstruction. The experiments considered two error metrics:
(°C) and
. The
Table 3,
Table 4,
Table 5 and
Table 6 present the quantitative performance metrics for eight algorithms. It is important to note that we did not separately compare the single-peak, double-peak, and high-low-peak temperature fields. Instead, we directly compared the average
and
values across all three cases. This approach was adopted because the performance trends of traditional LSM-based approaches, CNN-based method, and ATTRN were consistent across these three temperature field distributions. In other words, each method exhibited a similar pattern of error metrics across different temperature fields. Therefore, using the average values for comparison not only improved evaluation efficiency but also ensured statistical robustness without compromising the validity of the conclusions. Experimental results indicate that compared to traditional methods, ATTRN achieves significant improvements in central region, edge region, and global error metrics. On the ATFRSD-96W test set, the global error of ATTRN is only 3.09%, significantly lower than LSM_RS (28.06%), LSM_MQ (22.64%), and CNN (20.93%). Furthermore, in the ATFRSD-54W test set, the edge error of ATTRN (4.78%) is notably lower than that of LSM_RS (14.09%), with a reduction of approximately 10%. This demonstrates that ATTRN achieves higher reconstruction accuracy in edge regions and is better suited to capturing the complex temperature gradient distribution within the furnace. Moreover, in test sets without water-cooled walls, ATTRN maintains an error of approximately 3% across central, edge, and global regions, which remains lower than the 3–12% error observed in LSM-based approaches and CNN-based method. In addition to average reconstruction accuracy, ATTRN also exhibits a remarkable reduction in
, which is critical for ensuring the stability and safety of thermal field monitoring. In the ATFRSD-96W test set, ATTRN achieves the lowest central
of only 68.72 °C, while all LSM-based approaches report significantly higher values ranging from 751.04 °C to 1082.34 °C, and the CNN-based method even reaches a peak error of 1227.39 °C. This trend is similarly evident in edge regions, where ATTRN’s
is merely 149.02 °C, in stark contrast to the 1170.17 °C of the CNN model and the extreme value of 1689.69 °C observed in LSM_RS. A similar advantage is observed in test sets without water-cooled walls. In the ATFRSD-54 test set, ATTRN achieves a central
of only 104.47 °C, whereas LSM-based approaches range from 120.05 °C to 252.51 °C, and the CNN-based model reaches a
of 1324.83 °C. For edge regions, ATTRN once again attains the lowest
at 132.28 °C, which is significantly lower than the values observed in LSM methods (ranging from 280.93 °C to 702.44 °C) and the CNN-based method (1336.45 °C). Thus, ATTRN consistently outperforms other comparative methods, exhibiting higher reconstruction accuracy under different sensor layouts and boundary conditions. Notably, as the number of heat sources increases and temperature variations become more complex, ATTRN demonstrates superior adaptability. Evaluations based on real-world datasets further validate this advantage, confirming the robustness and effectiveness of ATTRN in practical applications. Therefore, it can be concluded that ATTRN provides a comprehensive and superior solution for temperature field reconstruction. Compared to CNN-based methods, traditional LSM and its variants, ATTRN, maintain high accuracy, primarily due to its advanced AIE-TFRD architecture, which we designed. This structure possesses powerful global information-encoding capabilities and efficiently maps acoustic information to temperature field distributions. These findings further highlight the robustness and reliability of ATTRN in handling complex temperature distributions, enhancing its feasibility for real-world applications. Additionally, the test datasets were generated using previously proposed methods, ensuring consistency in evaluation. The superior performance of ATTRN over other methods suggests its capability to handle more challenging conditions, such as a higher number of heat sources and more complex temperature variations.
In summary, the extensive experimental results presented in
Table 3,
Table 4,
Table 5 and
Table 6 demonstrate that the proposed ATTRN outperforms traditional algorithms and the CNN-based method in terms of the accuracy of reconstruction. ATTRN not only improves reconstruction precision but also effectively reduces reconstruction bias compared to baseline algorithms, thereby achieving a lower global error, central error, and edge error. In addition, the number of heat sources significantly affects the accuracy of the prediction. Traditional methods perform well when the number of heat sources ranges from 1 to 3; however, as the number of heat sources exceeds 3, their accuracy drops sharply, leading to a rapid increase in error. This decline is particularly pronounced in the edge regions, where the accuracy of the reconstruction degrades the most. In contrast, ATTRN maintains consistently high performance even in scenarios with multiple heat sources, highlighting its strong generalization capability and robustness in complex environments.
Table 7 and
Table 8 provide the parameters of the ATTRN model for the 54-path and 96-path configurations, respectively.
3.5. Engineering Validation and Analysis
All engineering experiments conducted in this study follow the same test scenarios and data conditions as described in [
32]. The experimental subject of this study is a 330 MW subcritical natural circulation drum boiler at Chongqing Qineng Power Plant. The burner configuration, operating modes, evaporation rate, and typical working parameters of this boiler are detailed in
Table 9. To acquire furnace temperature field data, a comprehensive acoustic measurement system was employed. This system, manufactured by China Dongfang Boiler Control Co., Ltd., Zigong, China, consists of key components, including acoustic transceivers, preamplifiers, a signal processing controller, an Ethernet switch, and a host computer. The acoustic transceivers serve as the primary components for transmitting and receiving acoustic signals. The transmission unit utilizes a pneumatic sound source, where an electromagnetic valve regulates compressed air to generate acoustic signals. The transmission frequency is set at 3 kHz, with a signal intensity exceeding 130 dB. To withstand high-temperature environments, the receiving unit is equipped with an air-cooling mechanism, supporting a reception frequency range of 20 Hz–20 kHz, with a microphone sensitivity of approximately
dB. Additionally, the preamplifier is responsible for adjusting the gain of received acoustic signals, featuring an input impedance of 100 M ohm and a gain factor set to 0.5. The signal processing controller operates with a 12-bit synchronous sampling mode, using a sampling frequency of 20 kHz to ensure precise signal processing. The entire measurement system is connected to the host computer via a 100 M industrial Ethernet network, ensuring stable data transmission and real-time analysis.
To validate the effectiveness of the ATTRN framework in engineering applications, we selected three discrete temperature fields for testing. The evaluation metrics
and
are the same as those defined in Equations (
14) and (
15). The reconstruction errors are presented in
Table 10 and
Table 11, while the reconstructed temperature distributions are illustrated in
Figure 7,
Figure 8,
Figure 9,
Figure 10,
Figure 11 and
Figure 12.
The results demonstrate that, compared to the traditional LSM-based and CNN-based method, ATTRN achieves significant improvements in central, edge, and global error metrics. In real data sets of discrete fields in 54 and 96 paths, the global error of ATTRN is approximately 2.3%, significantly lower than the range of 4–7% observed in traditional methods and the CNN-based method. In addition, the edge error of ATTRN is approximately 3.4%, which remains markedly lower than the 5–9% range observed in traditional and CNN-based methods. Furthermore, with respect to central error, traditional LSM and CNN approaches exhibit values approximately 1% higher than ATTRN. Considering that ATTRN’s central error is only around 1.4%, this margin is relatively significant. In addition to lower average errors, ATTRN also demonstrates clear advantages in terms of . In the 54-path real data sets of discrete fields, ATTRN achieves the lowest central at 71.63 °C, which is significantly lower than those of LSM-based approaches, ranging from 82.82 °C to 120.36 °C, and the CNN-based method, which reaches 126.27 °C. Similarly, for edge regions, ATTRN’s is only 216.14 °C, whereas LSM-based approaches range from 226.15 °C to 427.65 °C, and the CNN-based method reaches 307.26 °C. A similar trend is observed in the 96-path configuration. ATTRN yields a central of just 62.49 °C, while traditional methods and CNN report significantly higher values ranging from 96.06 °C to 190.89 °C. For edge regions, ATTRN once again achieves the lowest at 192.23 °C, compared to 294.90 °C to 549.44 °C in the LSM-based and 209.87 °C in the CNN-based approach.
Thus, ATTRN consistently outperforms other comparison methods, exhibiting higher reconstruction accuracy under different sensor layouts and boundary conditions. This reinforces our belief that the acoustic information encoder-temperature field reconstruction decoder architecture that we designed effectively captures global temperature characteristics while suppressing noise interference, demonstrating strong temperature field reconstruction capabilities. By leveraging a Transformer-based AIE, the model enhances long-range dependency modeling, ensuring comprehensive encoding of acoustic path information. Meanwhile, the CNN-based TFRD further optimizes spatial details, resulting in more precise temperature distribution reconstruction. Consequently, this architecture significantly enhances the robustness and accuracy of temperature field reconstruction, making it well-suited for complex industrial measurement environments.
4. Conclusions
This study proposes a novel furnace temperature field reconstruction method based on an AIE-TFRD (ATTRN) model. By integrating Transformer and CNN architectures, the proposed method overcomes the limitations of traditional numerical approaches, such as cumulative error amplification, parameter dependency, and insufficient local detail capture. In the experimental phase, we constructed ATFRSD-54 and ATFRSD-96, which encompass various boundary conditions and sensor layouts, providing a comprehensive benchmark to evaluate the performance of ATTRN. The experimental results demonstrate that ATTRN achieves high reconstruction accuracy and robustness under different sensor configurations and boundary conditions. Notably, it maintains stable performance even in multi-heat-source and complex temperature field environments, exhibiting strong generalization capabilities. Compared to conventional LSM-based approaches, ATTRN significantly improves edge temperature feature recognition and global temperature distribution accuracy, offering a more reliable technological solution for furnace temperature monitoring and optimization control in power plants. Furthermore, validation using real-world data from the Chongqing Qineng Power Plant confirms that ATTRN effectively addresses the challenges of reconstructing highly dynamic and nonlinear temperature fields within boilers. This capability is critical for optimizing combustion control and ensuring operational safety. ATTRN demonstrates superior performance in terms of accuracy, noise resistance, and portability, making it adaptable to diverse operating conditions. It surpasses traditional iterative and non-iterative algorithms in temperature field reconstruction tasks. Given its feasibility and applicability in industrial scenarios, ATTRN can be further extended to more complex combustion systems and high-temperature environment monitoring.
In the future, it is worth noting that more rigorous physical modeling of wave propagation and refraction in non-uniform media remains a specialized and actively developing research domain [
39,
44,
45,
46,
47]. Nevertheless, the present work primarily focuses on establishing a feasible temperature-field reconstruction framework for industrial furnaces and, therefore, adopts a simplified path-discretization approach. Future efforts will incorporate additional physical factors—particularly those arising from strong temperature gradients and three-dimensional acoustic ray-bending—by leveraging physics-informed neural networks (PINNs). By refining the acoustic ray-tracing algorithm, we aim to enhance reconstruction fidelity in the presence of pronounced refraction effects and curved propagation paths, thereby paving the way for complete 3D tomography of combustion and flow fields. Moreover, in scenarios involving highly non-uniform temperature distributions or significant velocity gradients, sound wave refraction may become the dominant factor affecting acoustic travel times. This highlights the potential benefits of integrating fully nonlinear acoustic models or curved-ray tracing algorithms into the framework. Finally, our long-term plan includes further validation and parameter refinement (e.g., cooling coefficients and path discretization schemes) using proprietary industrial datasets. These extensions will improve the physical rigor of the ATTRN framework and expand its applicability to real-world multi-physics acoustic thermometry tasks under challenging high-temperature environments.