2.1. Overview of Plate Heat Exchangers
PHEs are compact, high-efficiency heat-transfer devices composed of a series of thin, corrugated metal plates arranged to form parallel flow channels for hot and cold fluids. These channels typically operate in a counter-flow or a cross-flow configuration, allowing for efficient thermal exchange with a high heat-transfer surface-to-volume ratio [
22]. The corrugation patterns, commonly chevron or herringbone, induce turbulence, even at low Reynolds numbers, thereby improving the heat-transfer coefficient. The modular structure of PHEs also facilitates scalability and maintenance, making them suitable for a wide range of applications, including HVAC systems, chemical processing, and renewable energy systems [
23].
PHEs are increasingly used in thermal energy systems due to their superior thermal performance, compactness, and adaptability to both liquid–liquid and liquid–solid heat-transfer scenarios [
6,
24]. Their small footprint and high effectiveness make them particularly attractive for applications where space and thermal efficiency are critical. In recent years, interest has grown in employing PHEs as a part of LHTES systems, where they can serve as both heat exchangers and encapsulation structures for PCMs [
25]. Their geometry allows for fast thermal response times and enhanced energy density when integrated with suitable PCMs [
26].
When PCMs are incorporated into PHE systems, the thermal performance benefits from the material’s ability to store and release latent heat during melting and solidification processes. However, a key limitation in such configurations is the inherently low thermal conductivity of most organic and inorganic PCMs, which can hinder heat-transfer rates [
27,
28]. Recent studies have demonstrated that integrating PCMs within the channels or adjacent cavities of PHEs allows for efficient coupling between sensible and latent heat storage, especially when supported by enhancements, like fins or nanocomposites [
17,
29,
30]. The structured geometry of PHEs promotes uniform distribution and rapid phase transition of PCMs, enabling a compact and high-efficiency TES system [
26].
The role of fins in PHE-PCM systems is critical to overcome the thermal conductivity barrier of PCMs [
31,
32]. Fins, often made from high-conductivity materials, like aluminum or copper, are inserted within or adjacent to PCM-filled cavities to enhance conductive heat transfer [
33]. Both longitudinal and transverse fin arrangements have been explored to maximize the contact area between the PCM and heat-transfer surface [
34]. Novel fin geometries, such as perforated, tree-shaped, or dendritic fins, have been shown to significantly accelerate the melting and solidification processes by enhancing thermal dispersion and reducing temperature gradients within the PCM domain [
35]. The integration of PHEs with PCMs and fins leads to substantial improvements in TES performance metrics, such as charging/discharging rates, energy density, and thermal efficiency [
36,
37].
Despite these developments, the accurate prediction and optimization of PHE performance remain challenging due to the complex interplay among the geometry, flow dynamics, and thermophysical properties of the working fluids. As the demand for energy efficiency and operational flexibility increases, there is a growing need for intelligent tools that can support rapid design evaluation, system diagnosis, and adaptive performance control [
10,
38]. In this context, the integration of ML techniques with PHE systems represents a significant opportunity for the development of next-generation, smart heat exchangers [
22,
39].
Figure 1 shows the schematic layout of a typical PHE.
Figure 2 summarizes the historical development timeline of plate-surface corrugations.
In conclusion, PHEs provide a versatile and high-performance platform for integrating PCMs in TES systems. Their modularity, high heat-transfer coefficient, and compatibility with structural enhancements, such as fins, make them ideal candidates for advanced LHTES systems. The combination of PCMs and fins within PHEs not only overcomes the limitations of low PCM thermal conductivity but also enables scalable, compact, and energy-dense storage solutions aligned with the growing demand for efficient thermal management in renewable and industrial energy systems.
2.2. Machine Learning for Modeling and Optimization in Thermal Systems
The rapid advancement of ML has introduced transformative capabilities across a wide spectrum of engineering disciplines, including the field of thermal sciences. In thermal systems, the inherently nonlinear and multivariable nature of heat and mass transfer processes often complicates the development of accurate predictive models using traditional physics-based methods [
40,
41]. This is particularly evident in systems involving complex geometries, variable material properties, multiphase interactions, or non-steady boundary conditions, where numerical simulations may be computationally expensive and experimental investigations limited by cost or feasibility constraints [
42]. In such contexts, ML offers a compelling alternative or complement, enabling data-driven modeling, real-time prediction, and optimization of thermal system performance [
43].
Figure 3 maps representative AI application domains across thermal energy storage.
ML algorithms can be broadly categorized into supervised, unsupervised, and reinforcement learning, with supervised learning being the most widely applied in thermal engineering. Among supervised models, artificial neural networks (ANNs), support vector machines (SVMs), decision trees, and ensemble methods, like random forests and gradient boosting, are commonly employed [
10]. These algorithms have shown notable success in approximating nonlinear input–output relationships in thermal systems, facilitating the prediction of key metrics, such as heat-transfer coefficients, pressure drops, temperature fields, and system efficiencies [
41,
45].
Figure 4 outlines the main categories of machine-learning algorithms used in thermal engineering.
Figure 5 presents the general AI modeling workflow adopted for thermal systems in this review.
The
Figure 6 illustrates representative machine-learning algorithm structures commonly applied in thermal and energy system research. Subfigure (a) shows a simple linear regression model used for predicting continuous variables. Subfigure (b) presents a typical artificial neural network (ANN) architecture comprising input, hidden, and output layers for complex nonlinear mapping. Subfigure (c) depicts a reservoir-computing framework, highlighting the recurrent connections within the reservoir layer and the linear output mapping. Subfigure (d) illustrates an ensemble-learning approach using decision tree structures, which combines multiple trees to generate a final prediction through aggregation. These diverse algorithmic frameworks demonstrate the breadth of machine-learning methods available for modeling and optimizing thermal processes.
Figure 6 illustrates common ML model structures employed for thermal prediction and control.
ANN models, in particular, have demonstrated strong predictive capabilities when trained on experimental or high-fidelity simulation datasets, owing to their universal approximation properties and robustness in handling noisy or incomplete data [
12,
47,
48]. Furthermore, AI integration allows for multiparameter design and control optimization, as shown in
Figure 7. Compared to traditional design workflows, AI-enhanced approaches enable faster iterations, improved handling of nonlinearity, and virtual testing that reduces physical prototyping costs and time.
Figure 7 compares a traditional design workflow against an AI-enhanced development process.
In the domain of heat exchanger analysis, several studies have leveraged ML to develop surrogate models that significantly reduce computational time while maintaining high accuracy. Kedem et al. [
49] highlight that accurate prediction of the dimensionless heat-transfer (
j factor) and friction factor (
f factor) is essential for optimizing plate–fin heat exchanger (PFHE) performance. Existing models, including prior ANN approaches, struggle to predict these factors for complex geometries, such as offset strips and wavy fins. To overcome this, their study introduces a novel, unified ANN model based on a multilayer perceptron (MLP) architecture. The ANN was trained and validated on an extensive dataset covering various fin configurations and Reynolds numbers from 300 to 10,000, encompassing both laminar and turbulent flows. Advanced techniques, like Bayesian regularization, were used to improve generalization.
Figure 8 depicts the shallow MLP architectures used by Kedem et al. for PFHE prediction.
Figure 9 shows the deeper MLP architectures evaluated for unified
j and
f prediction.
Kedem et al. [
49] compiled literature data for offset strips and wavy fins and trained a unified MLP to predict
j and
f over
–
(the mixed laminar/turbulent regime). Using
k-fold cross-validation
within this range, the model achieved an
RMSPE of ∼2% on wavy fins but notably higher errors for offset-strip fins (
RMSPEs of up to ∼15% for j and ∼17% for f). No extrapolation tests beyond the training domain were reported. Hence, the quoted accuracy applies to
in-domain interpolation and is geometry dependent.
Figure 10 reports the ANN model performance for friction factor across Reynolds number.
Similarly, Delfani et al. [
50] investigated the thermal performance of a nanofluid-based direct absorption solar collector using an ANN based on an MLP. In the experimental phase, nine collector prototypes with different geometries were tested to examine the effects of the collector depth and length under varying conditions, providing data for ANN training. The collector depth and length, working fluid flow rate and concentration, and reduced temperature difference were selected as input parameters to estimate the collector efficiency and Nusselt number. The ANN results showed that increasing the collector depth from 5 to 15 mm raised the efficiency by about 9%, while the collector length had minimal impact. The Nusselt number significantly increased with greater depth and nanofluid flow rate. The proposed ANN achieved the best accuracy for predicting collector efficiency at a nanofluid concentration of 1000 ppm, with an MAPE of 1.470%. For predicting the Nusselt number, the lowest MAPE of 2.576% was obtained with a collector length of 300 mm. Using forward stepwise regression, the optimal input combination for Nusselt number prediction included all the parameters. The close agreement between the experimental and predicted values confirms the ANN’s effectiveness in modeling the thermal performance of direct absorption solar collectors.
Figure 11 compares MLP-ANN predictions with experimental data for DASC efficiency and Nusselt number.
Beyond prediction, ML techniques have been successfully integrated with optimization frameworks to enhance thermal system design. Hybrid approaches combining ANNs with genetic algorithms (GAs), particle swarm optimization (PSO), or response surface methodology (RSM) are widely reported in the literature. Zhang et al. [
51] performed multiobjective optimization of elliptical-finned tubal heat exchangers using ANN-GA models, achieving optimal configurations for both heat transfer and pressure drop. These frameworks allow for rapid exploration of large design spaces, aiding engineers in identifying Pareto-optimal solutions without exhaustive simulation runs.
Another critical application of ML in thermal systems is in the modeling of nanofluids—base fluids enhanced with nanoparticles to increase thermal conductivity. Due to the complex interactions among the nanoparticle concentration, shape, base fluid, and temperature, classical correlations often fail to capture the dynamic thermophysical behaviors of nanofluids. ML models have been applied to predict properties such as thermal conductivity and viscosity with high precision. For instance, Ji et al. [
52] prepared TiO
2–Ag hybrid nanofluids by diluting high-concentration stock solutions and adding Span80 as a surfactant. The stability was evaluated through photographs taken over 0–30 days of storage. Thermal conductivity and dynamic viscosity were measured at varying volumetric fractions, temperatures, and ultrasonication times. The results indicated a maximum thermal conductivity enhancement of 9.98% and a peak dynamic viscosity ratio of 2.78 compared to those of the base fluid. Ultrasonication effectively reduced sedimentation. Two artificial neural networks (ANNs) were developed, and their architectures were optimized to minimize the prediction error. The ANNs predicted thermal conductivity and viscosity with maximum relative errors of 1.44% and 3.54%, respectively, demonstrating potential for reducing experimental time and cost.
Figure 12 and
Figure 13 compares experimental and ANN-predicted thermal conductivity and dynamic viscosity of the nanofluid.
Successfully training physics-informed neural networks (PINNs) for the solution of highly nonlinear partial differential equations (PDEs) on complex three-dimensional domains remains a formidable challenge in computational science and engineering. In the present study, PINNs are employed to address the three-dimensional incompressible Navier–Stokes equations at moderate-to-high Reynolds numbers within intricate geometries, where conventional numerical solvers often encounter substantial computational cost and convergence issues [
53,
54]. The proposed methodology strategically leverages very sparsely distributed solution data within the computational domain, thereby demonstrating the potential of PINNs to operate effectively under conditions of limited data availability [
53,
54].
A comprehensive investigation is conducted to elucidate the influences of the quantity of supplied data as well as the incorporation of PDE-based regularizers on the overall predictive performance of the network [
53]. Furthermore, a hybrid data–PINNs framework is developed to construct a surrogate model for a representative thermal management problem in electronic systems [
53]. This surrogate model facilitates near real-time sampling of the design space and is shown to outperform traditional purely data-driven neural networks (NNs) when evaluated on previously unseen query points [
53].
These findings underscore the capability of PINNs to deliver accurate solutions for complex three-dimensional nonlinear PDEs, even in scenarios characterized by data sparsity. In addition, they highlight the value of PINNs as an effective surrogate modeling strategy for design and optimization tasks governed by complex physical laws. Recent advances in this domain have further demonstrated that deep learning models, such as PINNs, integrate governing physical laws directly into the loss function during training, ensuring that the resulting predictions remain consistent with underlying physical principles [
55]. Such hybrid approaches are particularly promising for thermal systems and other applications where high-fidelity data are scarce or incomplete, offering improved generalizability and interpretability relative to those of conventional data-driven models [
56].
Furthermore, the integration of ML with real-time monitoring tools and Internet-of-Things (IoT) platforms enables the development of digital twins for thermal systems. These digital replicas can track system behavior in real time, detect anomalies, forecast failures, and guide adaptive control strategies for performance optimization. In the context of thermal energy storage (TES) systems—particularly those involving PCMs—ML models have been used to predict charging and discharging times, melting fractions, and storage efficiencies under variable boundary conditions [
3,
57]. This is of paramount importance for smart building applications, where demand-side management and thermal buffering are increasingly required for energy flexibility and sustainability.
Despite these successes, challenges remain. Many ML models require large, high-quality datasets for training, which may not always be available in thermal applications. Moreover, model interpretability and generalization beyond the training domain are ongoing concerns, particularly in safety-critical or regulation-bound applications. To address these issues, future research should focus on the development of hybrid models that integrate physical laws with data-driven learning, the use of transfer learning for knowledge reuse across domains, and the establishment of benchmark datasets and open-access platforms to accelerate collaborative research.
In summary, ML is rapidly reshaping the way thermal systems are modeled, optimized, and controlled. From surrogate modeling and nanofluid property prediction to PCM-based energy storage and digital twin implementation, ML offers robust tools to improve performance, reduce computational burdens, and enable intelligent thermal system design. Its integration with emerging technologies, such as IoT and advanced materials, positions it as a cornerstone of next-generation, sustainable thermal management solutions [
46].
Figure 14 sketches an AI-assisted materials design pipeline for thermal energy systems.
Despite the increasing adoption of ML in PHE research, there is a notable absence of standardized benchmark datasets. This hampers reproducibility and fair comparison across studies. For example, datasets used in ANN-based PHE prediction models (e.g., [
50,
58]) are not publicly accessible, and the experimental parameters are often underreported. As a result, model generalization remains difficult to assess. Initiatives toward open-access CFD or experimental datasets, such as those hosted on Zenodo or GitHub (e.g., [
59]), could significantly enhance transparency and enable robust cross-validation frameworks.
2.4. Comparative Benchmark Protocol for PHE–PCM ML Models
This work adopts a single, simple protocol so that different ML methods can be compared on an equal footing. We consider three routine prediction tasks that cover most use cases we found in the literature: (i) plate-heat-exchanger single-phase performance, where the inputs are the Reynolds number (), chevron angle (), pitch (p), plate spacing (b), hydraulic diameter (), and flow arrangement, and the targets are the Colburn factor (j) and friction factor (f) within , , and mm; (ii) printed-circuit HX heat-transfer prediction, where we map to the heat-transfer coefficient (h) for straight and hexagonal channels; and (iii) latent storage with PHE–PCM, where the geometry (the fin number, thickness, and width and the chevron/fin angle), operating conditions (HTF inlet temperature and mass flow), and PCM properties (k, , L, and ) are used to predict the charging time and/or liquid fraction at a chosen time.
To keep the results honest, splits are made by geometry family so that training and testing do not share the same plate or fin type. We use grouped fivefold cross-validation for model selection and then a held-out test set with disjoint geometries for the final numbers. We always report the MAE, RMSE, MAPE, and , together with a calibration slope/intercept (predicted vs. true). Practical details also matter, so we add the training time, per-sample inference time, and model size.
Each model is compared against the same baselines: the relevant published correlation for the geometry, a CFD reference (RANS with documented mesh/time-step checks), and a simple linear/ridge model. We also probe behavior outside the comfort zone with three stress tests: leaving out an entire geometric family (leave one geometry out), removing tail bins from training, and applying an unseen inlet-temperature bin for the storage case. The same metrics are reported for these tests.
Uncertainty is reported with 95% confidence intervals via bootstrap (1000 resamples) or using model ensembles; probabilistic models additionally report negative-log likelihood and show a calibration curve. For interpretability, we provide global feature importance (e.g., SHAP) and a few local explanations, and we state any physics-aligned monotonic constraints used during training. Finally, we list the computational environment (CPU/GPU, RAM, and software versions) and the random seeds so that others can reproduce the runs.
Assumptions are minimal and match typical practice: single-phase liquid flow in the HX channels, phase change only in the PCM domain (enthalpy–porosity), steady inflow boundaries with either wall heat flux or wall temperature, and validity ranges, as in
Table 1, with extrapolation results reported separately in
Table 2. Correlations used as baselines are geometry dependent (e.g., the chevron angle and pitch), so we flag any use outside their stated scope.