Next Article in Journal
Modelling Interval Data with Random Intercepts: A Beta Regression Approach for Clustered and Longitudinal Structures
Previous Article in Journal
A Hybrid Estimation Model for Graphite Nodularity of Ductile Cast Iron Based on Multi-Source Feature Extraction
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Model-Free Identification of Heat Exchanger Dynamics Using Convolutional Neural Networks

by
Mario C. Maya-Rodriguez
1,
Ignacio Carvajal-Mariscal
1,*,
Mario A. Lopez-Pacheco
1,*,
Raúl López-Muñoz
2 and
René Tolentino-Eslava
1
1
Escuela Superior de Ingeniería Mecánica y Eléctrica-Unidad Profesional Adolfo López Mateos, Instituto Politécnico Nacional, Mexico City 07738, Mexico
2
Unidad Profesional Interdisciplinaria en Ingeniería y Tecnologías Avanzadas, Instituto Politécnico Nacional, Mexico City 07340, Mexico
*
Authors to whom correspondence should be addressed.
Modelling 2025, 6(4), 127; https://doi.org/10.3390/modelling6040127
Submission received: 14 August 2025 / Revised: 15 September 2025 / Accepted: 9 October 2025 / Published: 14 October 2025
(This article belongs to the Special Issue Modelling of Nonlinear Dynamical Systems)

Abstract

Heat exchangers are widely used process equipment in industrial sectors, making the study of their temperature dynamics particularly appealing due to the nonlinearities involved. Model-free approaches enable the use of input and output data to generate specific and accurate estimations for each proposed system. In this work, a model-free identification strategy is proposed using a convolutional neural network to estimate the system’s behavior. Notably, the model does not rely on direct temperature measurements; instead, temperature is inferred from other system signals such as reference, flow, and control inputs. This data-driven approach offers greater specificity and adaptability, often outperforming manufacturer-provided coefficients whose performance may vary from design expectations. The results yielded an R 2 index of 0.9951 under nominal conditions and 0.9936 when the system was subjected to disturbances.

1. Introduction

In primary and transformation industries such as oil, gas, mining, power generation, and pharmaceuticals, among others, there are various subprocesses where heat transfer equipment is used to provide and remove heat from process streams. These devices are known as heat exchangers, with the shell and tube type being one of the most commonly used in process plants [1]. Their importance lies in the fact that heat exchangers are crucial for the transformation of raw materials through responsible and efficient energy use, as well as offering advantages in resource utilization and economic costs. However, thermal and mechanical design is often complex, despite the apparent simplicity of their physical structure. The actual operation of these devices often deviates from their design due to both internal and external uncertainty and variability [2]. Some key factors contributing to this deviation include the following: variations in flow rate and temperature of the streams, changes in stream composition, imprecise correlations for heat transfer, and pressure drop, among others. As a result of these deviations, the variability associated with the input variables and uncertainty in the internal parameters arises, causing the performance of the equipment to fluctuate, which can even become a risk factor for the process plant [3,4]. From the perspective of Automatic Control, this represents a significant challenge due to the large number of external and internal factors involved in capturing the dynamics of these systems. Although it is possible to mathematically model these devices to obtain a theoretical expression of their behavior, the literature clearly indicates that there are notable discrepancies between theoretical predictions and practical performance. Therefore, to contribute to the improvement of equipment performance from the field of Automatic Control, the following proposal can be considered: Parametric identification is ruled out because of the high number of parameters involved in a heat exchanger, the strong nonlinearity reported in the existing literature, and the presence of unmodeled dynamics. Instead, a recursive non-parametric identification approach appears to be more suitable for this type of system. Since it does not rely on a predefined mathematical structure that forces the system to resemble something known, it allows for greater flexibility and adaptability, which is a key advantage of this kind of methodology.
For Control Theory to have a significant impact on industrial production systems, it is important that some research efforts focus on the pursuit of knowledge in real-world problems. From this perspective, understanding the dynamics of a system becomes essential, since the main factor of interest concerning system modeling is its practical use and utility for the personnel in charge. Some advantages of this include the ability to perform plant testing during the design stage, the possibility of using the model for operator training, and the opportunity to experiment via simulation with hardware improvements or control system enhancements before final approval. The use of knowledge to construct a mathematical model requires making assumptions based on principles of physics and chemistry. From the implications derived from such studies, it is possible to obtain a set of ordinary or partial differential equations. This leads to the development of a theoretical mathematical model with a defined structure and parameters, which in most cases is difficult to solve. However, there are ways to avoid the complications of a theoretical model, and one of them is through experimental analysis using identification methods, which result in a mathematical model derived from the measurement of the input and output of the system [5]. It is important to emphasize that although theoretical mathematical models are elegant, they involve significant effort and often face practical challenges in industrial applications, such as: they tend to be complex even for simple systems and can be time-consuming to develop; theoretical coefficients are not exact, as system parameters tend to change over time due to usage; there are unmodeled dynamics that are omitted, making the mathematical model incomplete, among other issues. According to [6], Figure 1 shows a way to classify systems.
White box models allow for both the mathematical structure and parameters of the system to be known. Grey-box models have a known mathematical structure but unknown parameters. Meanwhile, black box models lack both a defined mathematical structure and known parameters. At first glance, white box models may seem preferable, as their mathematical formulation has a physical meaning and can be interpreted. However, in an industrial environment, this is often impractical because of the large number of coupled systems required to produce a product. Grey-box models offer more flexibility, since parameter identification can be performed by measuring the input and output of the system [7]. Nevertheless, this approach only works effectively when the assumed mathematical structure adequately reflects the production system. Otherwise, it forces the system to behave like something it is not, leading to discrepancies between the real-world behavior and the approximated model. Black box models and non-parametric identification, on the other hand, do not rely on predefined structures or parameters [8]. Although the resulting model lacks physical interpretation, it can provide a closer approximation to reality, as it is tailored specifically to each system based on its input–output behavior. It is important to note that these classifications are not absolute rules. There are cases where exact models and parameters can be determined, such as in industrial robotics. What truly matters is that individuals learn to identify the type of system they are dealing with and determine the most appropriate modeling approach to address the problem effectively.
Several efforts aimed at uncovering the dynamic behavior of heat exchanger systems are outlined in the following. In [9], a non-parametric modeling approach is presented for a heat exchanger, intended for testing a fuzzy logic-based control strategy. Meanwhile, ref. [10] describes a model-based, step-by-step methodology to predict dynamic pressure profiles during tube rupture events in liquid–liquid, vapor–liquid, and flashing liquid–liquid systems. In [11], system identification is performed using experimental data, followed by the proposal of a cascade-type control strategy in simulation. Furthermore, ref. [12] presents a system identification approach based on input–output measurements for a linear system. Similarly, ref. [13] employs nonlinear modeling techniques, including Nonlinear ARX, Box–Jenkins, and Output-Error models, derived from collected data. In [14], a parametric system identification procedure is developed to obtain a single-input single-output, linear time-invariant, time-delay model of a liquid–liquid heat exchanger system. These examples reflect the broad interest across disciplines in the challenge of accurately capturing the dynamic behavior of heat exchangers.
The use of different types of neural network algorithms has gained significant relevance in recent years due to advances in information processing [15]. Among the most commonly used algorithms is the multilayer neural network (MLP). In biodiesel production, in [16] a MLP is proposed to predict the biodiesel fuel ratio, engine load, air consumption, and fuel flow rate data, comparing it with other neural models such as the adaptive neuro-fuzzy interference system (ANFIS) and the radial basis function network (RBFN). In terms of heat exchangers, machine learning models have come to improve some activities related to them, such as in [17] where four machine learning models were used as a prediction method of the fouling factor to schedule cleaning cycles. Also in [18], an LSTM model was implemented to monitor the fouling layer in the heat exchanger plate. In the case of compact heat exchangers designs, an SVM and MLP were used in [19] to estimate some parameters of these designs such as outlet temperatures, heat transfer rate, etc. In [20], a similar estimation of the heat transfer rate was estimated by means of an MLP for helically coiled tube heat exchangers. Some other examples can be found in [21].
CNN is also a very common neural model algorithm, used in many areas, such as estimation of variables in industrial processes, as mentioned in [22], or in image classification for the detection of faulty parts in manufacturing processes as presented in [23]. This neural models, like many others, can be classified as a black box model that only depends on input and output data from the system; no more knowledge from the system is required. One of the main applications of CNN is still in image classification for various purposes, such as for the detection of anomalies in the manufacturing of cable adhesive as presented in [24]. One more application of CNNs is to use them in time series prediction by proposing a new architecture as in [25], where a CNN combined with an auto-encoder generates better performance using MSE over state-of-the-art methods for one-step prediction using data sets for this matter. Also, in [26], the use of CNN in the prediction of oil spill areas in submarine pipelines is considered, where different neural network models are compared using performance metrics to perform an adequate analysis of the results, along with various training techniques for the models used. In dynamic system identification CNN has some recent applications, such as the one presented in [27] where a CNN model and an LSTM are compared using three different examples. And in [28] a convolutional recurrent network is used to model nonlinear dynamics in a heat exchanger.
The main contribution of this work is the proposal of an estimation method for the temperature in a heat exchanger, using a convolutional neural network with a new architecture that only uses convolutional layers without the need of a fully connected layer; this is applied in the control of the temperature of the exchanger as an estimator of the variable temperature used in the closed loop of the system, also involving information of the flow variable. This methodology is used in two cases of study in which there are no output disturbances and in which there are output disturbances. In both cases, the CNN estimate is very accurate regarding the system dynamics of the heating section; this is verified using performance indexes for this matter. Moreover, a comparison with a MLP is shown, where both models estimate the temperature but using noisy data, which is a very common issue in industrial environments.

2. Materials and Methods

2.1. Case of Study

According to the experimental system [11], Figure 2 shows a control system based on the cascade control strategy, where the main function is to maintain the temperature in the shell and tube heat exchanger. To achieve this, the system is composed of two pumps responsible for recirculating a hot fluid and a cold fluid, respectively. Safety valves are included in case of overpressure in the pipeline. In addition, primary measuring instruments are installed, including temperature indicators, a temperature transmitter with an analog voltage signal, and a flow transmitter that also has an analog voltage signal. The system includes a control valve and a disturbance valve located at the pump discharge in the hot fluid pipeline. The cold fluid pipeline only includes a valve for system alignment.
The operation of the system focuses on a cascade-type control system, where two nested control loops are present. The outer control loop is known as the primary or leader control loop, and the inner control loop is known as the secondary or follower control loop. In this control strategy, it is necessary for the controlled variable (leader loop) to have slower dynamics compared to the manipulated variable (follower loop). For the case study considered in this work, this condition is met by having temperature as the controlled variable and flow as the manipulated variable. The control philosophy is based on the idea that once a Set-Point achievable by the system is selected and compared with the temperature acquired directly by the temperature transmitter installed in the shell and tube heat exchanger, the error signal of the leader control loop is obtained. This signal is then used by the primary controller, which in most cases according to the literature and successful cases reported in the industry, is a PID/PI controller. This signal functions as a Set-Point for the follower control loop, where it is compared with the signal acquired by the flow transmitter installed in the shell and tube heat exchanger. The error signal produced is used by a PI/P controller, and this control signal is sent to the final control element, which in this case is the control valve in the hot fluid pipeline. By reducing or increasing the flow of the hot fluid, it is possible to control the temperature of the fluid in a shell and tube heat exchanger. Figure 3 illustrates the control philosophy described.

2.2. Convolutional Neural Network

A convolutional neural network architecture was developed to act as an estimation method. Consider a system where the temperature signal is the output of it, which can be estimated by the following CNN:
T ^ k = CNN u k ,
where u k is the vector constructed from the signals from the leader controller, follower controller, reference, and flow signal; each of these variables are used in the same instant that the output signal would be estimated, and the previous one up to their own regression order, each with a specific one. These values are determined by trial and error.
The proposed CNN architecture consists of 4 convolutional layers, one after the other. Each convolutional layer consists of the convolutional operator between the filters in this layer, F 4 , and the input of the layer, y 1 ; considering a padding P = 0 and a stride S = 3 with this configuration, the output of this each layer is reduced. Mathematically, this operation is defined by the following:
x = conv y 1 , F , P , S ,
where x is an auxiliary vector. The size of the output is calculated by S I S F + 2 P / S + 1 , with S I being the input size and S F the filters size. After the convolution operator, an activation function is applied to the vector x ; in this case the ReLU operation has been used, described by (3):
y = ReLU x .
The CNN is designed to generate a single value output, so four convolutional layers are used in the architecture considering that eight filters in each layer are used for this matter, and in the output layer, = 4 , there is only one filter that uses as input the output of the previous layer, generating one single output signal, corresponding to the estimation of temperature of the system. This is shown in Figure 4, where each filter in all previous layers only considers one filter from the previous layers, generating in this architecture 8 different ways of acquiring features of the system. The hyper-parameters, number of layers, and filters per layer were chosen to ensure that the output size is 1, given that we estimate a single data point per iteration of the algorithm. By selecting this number of filters for each layer, in addition to the activation function, we can generate a size reduction sufficient for our objective. They were also selected in this way because several tests were performed to ensure that the CNN obtained adequate estimation results. This was achieved using an input with a significant amount of system data. To consider more features of the system, the initial value of the convolutional filters are set randomly which has been proven to have similar results than using a methodology especifically for the initialization of hyper-parameters.
As mentioned previously, the CNN input considers 4 signals with their respective previous values, all of them adding up to 192 input data. This vector is defined as follows:
u k   =   [ u F k u F k r F u R e f k u R e f k r R e f u P I D m k u P I D m k r P I D m u P I D s k u P I D s k r P I D s ]
where u F , r F , u R e f , r R e f , u P I D m , r P I D m , u P I D s , r P I D s are the signal values and the regression order of the flow, the reference, PID leader, and the PID follower signal, respectively.
One of the main contributions of this paper is that no fully connected layer is used as the output layer; instead, only convolutional layers are used. This is successfully performed by considering the correct amount of hyper-parameters of the CNN such as the number of filters, their size, the padding, and stride and size of the input vector. One more advantage is the inner noise reduction that CNN can do with the processed data. The training process is performed using the backpropagation algorithm; using offline data, this process is described in the next section. The estimation process is defined in Figure 5. Data from the real process are acquired and used as input to the CNN, generating the estimated temperature to be compared with the one obtained from the temperature sensor for the training process.

2.3. CNN Training

The training consists of updating the values of the filters in the CNN; this is achieved using the backpropagation algorithm (BP), which minimizes a cost function:
J = 1 2 e 2 ,
where e is defined as the estimation error between the real temperature measure of the system and the CNN output, e = T ^ T . The estimation error has to be propagated using the gradient calculated to each filter in the CNN; then, the update of the numerical values is determined by the delta rule:
F k + 1 = F ( k ) η J F
where η is the learning rate for the training, which is a numerical value between 0 and 1, defined by the user.
The gradient for the convolutional filters is the following:
J F = conv y 1 , rot χ , 90 , P , S
where the function rot · , 90 is equivalent to a 90° rotation of the first parameter of the function and χ is the gradient up to the filters in that layer, which is calculated as follows:
χ = J x = J y max y
with the max · function defined as the opposite of the ReLU operation, this operation passes the gradient in the specific position where the maximum value was obtained by the ReLU operation in the feedforward process, and passes zero values in the rest of the position otherwise. Finally, the propagated error through a convolutional layer can be computed as follows:
J y 1 = conv χ , rot F , 90 , P , S
Equations previously described in this section are used in each iteration of the training in order to obtain the best possible values for the convolutional filters; after this training is completed, the testing stage consists of verifying how well the estimation can be achieved with the trained CNN using similar data for this matter.

3. Experiments and Results

The experiments were performed in a M3 processor laptop with 16 GB of RAM using the MATLAB 2024b version. As described in Algorithm 1, first the hyper-parameters of the CNN must be defined, then it receives data from the system, such as control signals, references, and flow values, then the CNN computes the estimated temperature to calculate the estimation error for this iteration, and these steps are repeated until the total training data are used. In the training stage, 10,000 data points were used for this matter, which is a very similar signal to the one used for the testing stage, considering five training epochs. No extra processing such as scaling or normalization was required on the data to be used as input to the CNN.
Algorithm 1 Temperature estimation in heat exchanger.
Propose CNN hyper-parameters, u F , u R e f , u P I D m , u P I D s , P , S , F .
while  i N  do                                        ▹N -> total of data
       Use leader and follower control signal, reference and flow values into u k .
       Generate T ^ using CNN.
       Compute estimation error e = T ^ T .
       Update hyper-parameters F using gradient descendant method.
end while
Realize a testing stage to verify the training process.
Compare performance indexes such as MSE and R 2 .
if  M S E M m  then                        ▹Mm -> minimum value for acceptance estimation
     Repeat previous steps varying CNN hyper-parameters.
end if
For the simulation, the input values are as follows: u F = 5 , u R e f = 1 , u P I D m = 180 , and u P I D s = 6 . The number of filters in convolutional layers is defined as 8, and the size of filters is set at 4. The learning rate is set to 0.7 and five training epochs are needed for this. All these numerical values were obtained by trial and error after performing several simulations with different combinations of input numbers. It was observed that the leader PID signal was the most important for making the estimation; therefore, this signal has the highest number of inputs in the CNN. The first estimation was performed using the ideal system with no disturbances; the simulation results are shown in Figure 6, where the mainly high estimation error is presented when there is a drastic change in the reference.
It is important to note that the proposed reference signal is time-varying, as many processes involve thermal treatments of materials and substances to obtain a final product. Such is the case in biodiesel production, where it is necessary to raise the temperature at different operating points. The heat exchanger is responsible for supplying the hot fluid to the reactor through a coil or jacketed system that enables its heating.
Similar simulations were carried out but this time the system output presents disturbance at four different moments and with different amplitudes, one occurring at 2000 s of 4 °C, one more at 5000 s of 7 °C, at 6500 s of 12 °C, and one last one at 9000 s of 3 °C. Estimation results are presented in Figure 7. As in the previous case, the estimation error obtained a high value when the reference changed drastically, but in other moments the CNN behaves the same as the real system.
Disturbances in industrial production systems are inevitable, as processes generally do not form part of a fully coupled system. Instead, they interact with an environment where various conditions arise, often driven by external excitation sources, such as the nonlinear behavior of the surrounding medium. It is important to highlight that the identifier proposed in this work, based on the model presented in (4), is capable of delivering accurate estimations of the temperature variable of the heat exchanger, even in the presence of disturbances.
To evaluate the performance of the estimation method, five metrics were used for this matter, Mean Absolute error (MAE), Mean Square Error (MSE), Root Mean Square Error (RMSE), R 2 , and Root Mean Square Logarithm Error (RMSLE), which are defined by the following equations from (10) to (14).
MAE = 1 n i = 1 n T i T ^ i
MSE = 1 n i = 1 n T i T ^ i 2
RMSE = 1 n i = 1 n T i T ^ i 2
R 2 = 1 i = 1 n T i T ^ i 2 i = 1 n T i T ¯ 2
where T ¯ = 1 n i = 1 n T i is the mean of the real temperature values from the system.
RMSLE = 1 n i = 1 n log ( 1 + T ^ i ) log ( 1 + T i ) 2
From these metrics, Figure 8 shows the behavior of the system comparing the pairs of data, estimation with the real value; for perfect fitting, each pair should be above the identity line. As is shown, at the beginning several points are outliers, this is due to the initialization of the CNN inputs, which requires around 200 data to be completed; this is only if considering there are no data to work with. After this point, there are around 10 points out of the 20% error lines in this estimation, which represents 0.1% of data used.
For the simulation with disturbances, Figure 9 shows the behavior of the data. Similarly to the previous case, at the beginning of the estimation, outlier data are generated due to the CNN input; later, few data, of less than 10 points, are out of the 20% range of estimation error, occurring when the reference changes.
The comparison of all metrics employee in this work for both cases is shown in Table 1.

3.1. Noisy Data

One of the main challenges in applying control or system identification algorithms in industrial environment is dealing with noisy signals: sensor failures, connection cable failures, external agents, etc. To overcome this problem, it is important to perform adequate signal processing to ensure a better performance of the controller or the system in general. One advantage of CNN over some other estimation algorithms is that it can internally reduce signal noise. In this regard, a comparison was made between the proposed CNN and a multilayer perceptron, using the aforementioned metrics to assess the efficiency of the proposed algorithm. A two-layer MLP is designed, with 100 neurons in the hidden layer and the same amount of input data. This approach is implemented because a significant amount of system information is also required for system estimation. It is trained using the backpropagation algorithm, using a learning rate of 0.5. Five training epochs are used, each with 10,000 training data points, which are different for the testing stage. Random numbers representing temperature variations within a range of ± 2   ° C are used to generate noise in the simulation; this is performed for both training and testing purposes.
Table 2 shown the comparison between these two algorithms using the proposed metrics, where the numerical values are better for CNN in each of the metrics.
In Figure 10, an MLP and CNN comparison is shown. The MLP signal could not reduce the noise in the output; it has more noise in the output than the real system, while CNN reduces almost all the noise in the output, proving the efficiency of the proposed algorithm.

3.2. Discussion

As presented in Table 1, the methodology proposed in this paper obtained very similar results in each metric. Considering R 2 , the values obtained in both cases are very close to 1, which is what is sought when applying this metric. Considering metrics such as MSE or RMSE, both penalize large estimation errors generated through the simulation, seeking results close to zero, as in this case. The RMSLE metric is used to eliminate the effect of outliers that arises at the beginning of the estimation, for which results close to zero are also sought. Although it is possible to observe a noticeable deviation in the temperature estimation when a reference change occurs, even if only for a short period, this may be attributed to issues related to the hyper-parameters of the CNN. In general, learning-based approaches often face challenges in hyper-parameter selection. It would be valuable in future work to incorporate a heuristic-based search algorithm to mitigate these effects, especially considering that transients can significantly impact controllers when dealing with faster system variables.
The nonlinear behavior reported and observed in heat exchangers poses a challenge not only for design but also for the estimation of their dynamics and subsequent closed-loop control. The results obtained in this work will enable future studies to explore certain types of model-free controllers, which tend to be more flexible than model-based approaches. Noisy data can be reduced using a CNN in real-world environments, such as industrial. The comparison with an MLP shows that CNN can drastically reduce the output noise in the data without the need of an extra data processing.

4. Conclusions

This work addresses the non-parametric identification of the temperature variable in a heat exchanger using a convolutional neural network, which relies on reference, flow, and control signals as input data. This aspect is particularly important to highlight, as the model does not rely on direct temperature measurements. Instead, temperature is estimated from other available system signals, reinforcing the value of data driven approaches that infer key variables from indirect observations. The result is a black box model intended for use as a model-free approach. Among the advantages of this methodology are its flexibility and adaptability, which are particularly valuable in real-world applications. The results yielded an R 2 index of 0.9951 under nominal conditions and 0.9936 when the system was subjected to disturbances. One limitation of our proposal is that it cannot be generalized for all systems, since it requires some knowledge of the system to consider a proper input to the model. Also, the amount of input data for this system was somewhat high given that almost 200 iterations are required for CNN to adequately identify the system. Finally, the hyper-parameter selection remains a challenge for this type of algorithm, as the accuracy of the identification largely depends on choosing appropriate values and there is no previous defined methodology to select them. Nevertheless, this approach enables the development of more specific models by extracting information directly from the live process, which is often preferable to relying on manufacturer-provided coefficients especially considering that their stated performance may differ from actual behavior due to design variations. The study of complex and large-scale systems in the industrial sector can lead to more efficient resource utilization, with not only economic benefits but also positive environmental impacts due to improved process efficiency.

Author Contributions

M.C.M.-R. and M.A.L.-P. worked on all the tasks, I.C.-M. and R.L.-M. worked on the literature review, M.C.M.-R. and M.A.L.-P. conducted experimental studies, I.C.-M. and R.T.-E. performed the supervision, and all authors analyzed the results. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding authors.

Acknowledgments

We would like to thank SECIHTI, formerly known as CONAHCYT, for the M.C.M.R. postdoctoral fellowship and the IPN for the support provided through projects Nos. 20252370 and 20251381.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Kakač, S.; Liu, H.; Pramuanjaroenkij, A. Heat Exchangers: Selection, Rating, and Thermal Design, 4th ed.; CRC Press: Boca Raton, FL, USA, 2020. [Google Scholar] [CrossRef]
  2. Caputo, A.C.; Federici, A.; Pelagagge, P.M.; Salini, P. On the design of shell-and-tube heat exchangers under uncertain operating conditions. Appl. Therm. Eng. 2022, 212, 118541. [Google Scholar] [CrossRef]
  3. Li, J.; Xu, W.; Li, J.; Huang, S.; Li, Z.; Qiao, B.; Yang, C.; Sun, D.; Zhang, G. Heat extraction model and characteristics of coaxial deep borehole heat exchanger. Renew. Energy 2021, 169, 738–751. [Google Scholar] [CrossRef]
  4. Lambert, J.; Gosselin, L. Sensitivity analysis of heat exchanger design to uncertainties of correlations. Appl. Therm. Eng. 2018, 136, 531–540. [Google Scholar] [CrossRef]
  5. Ljung, L.; Glad, T.; Hansson, A. Modeling and Identification of Dynamic Systems; Studentlitteratur: Lund, Sweden, 2021. [Google Scholar]
  6. Isermann, R. Identification of Dynamic Systems: An Introduction with Applications; Springer: Berlin/Heidelberg, Germany, 2010. [Google Scholar]
  7. Rogers, T.; Holmes, G.; Cross, E.; Worden, K. On a Grey Box Modelling Framework for Nonlinear System Identification. In Special Topics in Structural Dynamics; Dervilis, N., Ed.; Conference Proceedings of the Society for Experimental Mechanics Series; Springer: Cham, Switzerland, 2017; Volume 6, pp. 167–178. [Google Scholar] [CrossRef]
  8. Nelles, O. Nonlinear System Identification: From Classical Approaches to Neural Networks, Fuzzy Models, and Gaussian Processes, 2nd ed.; Springer: Cham, Switzerland, 2020. [Google Scholar] [CrossRef]
  9. Rizk-Allah, R.M.; Hassanien, A.E.; Marafie, A. An improved equilibrium optimizer for numerical optimization: A case study on engineering design of the shell and tube heat exchanger. J. Eng. Res. 2024, 12, 240–255. [Google Scholar] [CrossRef]
  10. Harhara, A.; Hasan, M.F. Dynamic Modeling of Heat Exchanger Tube Rupture. BMC Chem. Eng. 2020, 2, 5. [Google Scholar] [CrossRef]
  11. Tridianto, E.; Ariwibowo, T.; Almasa, S.; Prasetya, H. Cascaded PID temperature controller for FOPDT model of shell-and-tube heat exchanger based on Matlab/Simulink. In Proceedings of the 2017 International Electronics Symposium on Engineering Technology and Applications (IES-ETA), Surabaya, Indonesia, 26–27 September 2017; pp. 185–191. [Google Scholar] [CrossRef]
  12. Olana, F.D.; Retta, B.N.; Abose, T.A.; Atnaw, S.M. Shell and Tube Heat Exchanger, Empirical Modeling Using System Identification. In Advances of Science and Technology. ICAST 2019; Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering; Habtu, N., Ayele, D., Fanta, S., Admasu, B., Bitew, M., Eds.; Springer: Cham, Switzerland, 2020; Volume 308, pp. 548–556. [Google Scholar] [CrossRef]
  13. Abirami, S.; Sivagamasundari, S. Modeling of Shell and Tube Heat Exchanger Using System Identification Techniques. Int. J. Eng. Manag. Res. 2020, 10, 36–42. [Google Scholar]
  14. Gupta, S.; Gupta, R.; Padhee, S. Parametric system identification and robust controller design for liquid–liquid heat exchanger system. IET Control Theory Appl. 2018, 12, 1474–1482. [Google Scholar] [CrossRef]
  15. Bakır, R.; Khail, A.A.; Bakır, H. Enhancing the prediction of flow characteristics in an inventive plate heat exchanger using deep learning techniques. Phys. Scr. 2025, 100, 035114. [Google Scholar] [CrossRef]
  16. Can, Ö.; Baklacioglu, T.; Özturk, E.; Turan, O. Artificial neural networks modeling of combustion parameters for a diesel engine fueled with biodiesel fuel. Energy 2022, 247, 123473. [Google Scholar] [CrossRef]
  17. Hosseini, S.; Khandakar, A.; Chowdhury, M.E.; Ayari, M.A.; Rahman, T.; Chowdhury, M.H.; Vaferi, B. Novel and robust machine learning approach for estimating the fouling factor in heat exchangers. Energy Rep. 2022, 8, 8767–8776. [Google Scholar] [CrossRef]
  18. Hou, G.; Zhang, D.; Yan, Q.; Wang, S.; Ma, L.; Jiang, M. Application of machine learning algorithms in real-time fouling monitoring of plate heat exchangers. Int. Commun. Heat Mass Transf. 2025, 164, 108809. [Google Scholar] [CrossRef]
  19. Uguz, S.; Ipek, O. Prediction of the parameters affecting the performance of compact heat exchangers with an innovative design using machine learning techniques. J. Intell. Manuf. 2022, 33, 1393–1417. [Google Scholar] [CrossRef]
  20. Colak, A.B.; Akgul, D.; Mercan, H.; Dalkılıç, A.S.; Wongwises, S. Estimation of heat transfer parameters of shell and helically coiled tube heat exchangers by machine learning. Case Stud. Therm. Eng. 2023, 42, 102713. [Google Scholar] [CrossRef]
  21. Zou, J.; Hirokawa, T.; An, J.; Huang, L.; Camm, J. Recent advances in the applications of machine learning methods for heat exchanger modeling—a review. Front. Energy Res. 2023, 11, 1294531. [Google Scholar] [CrossRef]
  22. Marei, M.; El Zaatari, S.; Li, W. Transfer learning enabled convolutional neural networks for estimating health state of cutting tools. Robot.-Comput.-Integr. Manuf. 2021, 71, 102145. [Google Scholar] [CrossRef]
  23. Wu, H.; Zhou, Z. Using convolution neural network for defective image classification of industrial components. Mob. Inf. Syst. 2021, 2021, 9092589. [Google Scholar] [CrossRef]
  24. Cho, H.W.; Shin, S.J.; Seo, G.J.; Kim, D.B.; Lee, D.H. Real-time anomaly detection using convolutional neural network in wire arc additive manufacturing: Molybdenum material. J. Mater. Process. Technol. 2022, 302, 117495. [Google Scholar] [CrossRef]
  25. Zhao, S.; Kong, M.; Li, R.; Hounye, A.H.; Su, R.; Hou, M.; Cao, C. SCARNet: Using convolution neural network to predict time series with time-varying variance. Multimed. Tools Appl. 2025, 84, 9809–9835. [Google Scholar] [CrossRef]
  26. Ji, H.; Zhang, X.; Wang, T.; Yang, K.; Jiang, J.; Xing, Z. Oil spill area prediction model of submarine pipeline based on bp neural network and convolutional neural network. Process Saf. Environ. Prot. 2025, 199, 107264. [Google Scholar] [CrossRef]
  27. Zhao, Y.; Jiang, C.; Vega, M.A.; Todd, M.D.; Hu, Z. Surrogate modeling of nonlinear dynamic systems: A comparative study. J. Comput. Inf. Sci. Eng. 2023, 23, 011001. [Google Scholar] [CrossRef]
  28. Bhattacharya, C.; Chakrabarty, A.; Laughman, C.; Qiao, H. Modeling nonlinear heat exchanger dynamics with convolutional recurrent networks. IFAC-Pap. 2022, 55, 99–106. [Google Scholar] [CrossRef]
Figure 1. Classification of mathematical model types according to their method of derivation [6].
Figure 1. Classification of mathematical model types according to their method of derivation [6].
Modelling 06 00127 g001
Figure 2. Heat exchanger taken as a case study [11].
Figure 2. Heat exchanger taken as a case study [11].
Modelling 06 00127 g002
Figure 3. Cascade control scheme for a heat exchanger.
Figure 3. Cascade control scheme for a heat exchanger.
Modelling 06 00127 g003
Figure 4. Proposed architecture for CNN.
Figure 4. Proposed architecture for CNN.
Modelling 06 00127 g004
Figure 5. Estimation process of temperature variable.
Figure 5. Estimation process of temperature variable.
Modelling 06 00127 g005
Figure 6. System identification with no disturbances.
Figure 6. System identification with no disturbances.
Modelling 06 00127 g006
Figure 7. System identification with output disturbances.
Figure 7. System identification with output disturbances.
Modelling 06 00127 g007
Figure 8. Comparison between temperature system and CNN with no disturbances.
Figure 8. Comparison between temperature system and CNN with no disturbances.
Modelling 06 00127 g008
Figure 9. Comparison between temperature system and CNN with output disturbances.
Figure 9. Comparison between temperature system and CNN with output disturbances.
Modelling 06 00127 g009
Figure 10. Comparison between MLP and CNN with noisy data.
Figure 10. Comparison between MLP and CNN with noisy data.
Modelling 06 00127 g010
Table 1. Performance indexes.
Table 1. Performance indexes.
MAE (°C)MSE (°C2)RMSE (°C) R 2 RMSLE
No disturbance0.01210.00680.08250.99510.0754
Disturbance0.02530.00880.09380.99360.0762
Table 2. Noisy data comparison.
Table 2. Noisy data comparison.
MAE (°C)MSE (°C2)RMSE (°C) R 2 RMSLE
CNN0.01590.00690.08290.99510.0750
MLP0.06950.01300.11410.99070.0931
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Maya-Rodriguez, M.C.; Carvajal-Mariscal, I.; Lopez-Pacheco, M.A.; López-Muñoz, R.; Tolentino-Eslava, R. Model-Free Identification of Heat Exchanger Dynamics Using Convolutional Neural Networks. Modelling 2025, 6, 127. https://doi.org/10.3390/modelling6040127

AMA Style

Maya-Rodriguez MC, Carvajal-Mariscal I, Lopez-Pacheco MA, López-Muñoz R, Tolentino-Eslava R. Model-Free Identification of Heat Exchanger Dynamics Using Convolutional Neural Networks. Modelling. 2025; 6(4):127. https://doi.org/10.3390/modelling6040127

Chicago/Turabian Style

Maya-Rodriguez, Mario C., Ignacio Carvajal-Mariscal, Mario A. Lopez-Pacheco, Raúl López-Muñoz, and René Tolentino-Eslava. 2025. "Model-Free Identification of Heat Exchanger Dynamics Using Convolutional Neural Networks" Modelling 6, no. 4: 127. https://doi.org/10.3390/modelling6040127

APA Style

Maya-Rodriguez, M. C., Carvajal-Mariscal, I., Lopez-Pacheco, M. A., López-Muñoz, R., & Tolentino-Eslava, R. (2025). Model-Free Identification of Heat Exchanger Dynamics Using Convolutional Neural Networks. Modelling, 6(4), 127. https://doi.org/10.3390/modelling6040127

Article Metrics

Back to TopTop