Next Article in Journal
Mamba-Based Macro–MicroSpatio-Temporal Model for Traffic Flow Prediction
Previous Article in Journal
Softsign-Based Nonlinear Control of Steam Condenser via Gbest-Guided Atom and Pattern Search Approach
Previous Article in Special Issue
Enhancing Security and Applicability of Local LLM-Based Document Retrieval Systems in Smart Grid Isolated Environments
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Advanced Machine Learning Approach for Fast Temperature Estimation in SiC-Based Power Electronics Converters †

by
Kalle Bundgaard Troldborg
,
Sigurd Illum Skov
,
Arman Fathollahi
* and
Jørgen Houe Pedersen
Department of Electrical and Computer Engineering, Aarhus University, 8200 Aarhus, Denmark
*
Author to whom correspondence should be addressed.
This paper is a substantially revised and extended version of our earlier conference paper “Troldborg, K.B.; Skov, S.I.; Fathollahi, A.; Pedersen, J.H. Intelligent Machine Learning–Based Scheme for Fast Temperature Estimation of SiC Power Modules in Electric Vehicle Power Converters. In Proceedings of the 2026 IEEE Intelligent Vehicles Symposium (IV), Detroit, MI, USA, 2026”.
Electronics 2026, 15(6), 1325; https://doi.org/10.3390/electronics15061325
Submission received: 27 February 2026 / Revised: 18 March 2026 / Accepted: 18 March 2026 / Published: 22 March 2026

Abstract

Accurate and fast junction-temperature estimation in Silicon Carbide (SiC) power modules is crucial for reliable operation, health monitoring and predictive control of power electronic converters in different applications. However, direct temperature measurement inside the module is difficult and high-fidelity thermal models are often very computationally expensive for real-time implementation. This paper proposes a digital twin development approach for fast and accurate temperature estimation in all three dimensions of a SiC MOSFET power module by a combination of finite element method (FEM) modelling and neural networks. The work is especially relevant in thermal monitoring and managing power electronics converters such as renewable energy systems, energy storage systems, Electric Vehicles (EV), etc. The model incorporates a neural network trained on data generated from an FEM model built in COMSOL Multiphysics. The developed digital twin can estimate the temperature distribution, including the ten junction temperatures of the Wolfspeed EAB450M12XM3 module, with an average estimation time of 0.063 s, enabling predictive control. In order to improve practical applicability and model synchronization with the physical system, NTC-based feedback techniques are discussed (single-Temperature Coefficient (NTC) and double-NTC approaches). The proposed framework is investigated in terms of prediction accuracy and computational performance related to the FEM-generated reference data. The approach improves model reliability by adjusting the parameters of the critical digital and physical modules. The combination of FEM-based modelling and machine learning can provide a foundation for accurate, real-time thermal monitoring in power electronic modules.

1. Introduction

Power electronic converters are key enablers in modern electrified systems and are increasingly deployed in a wide range of applications such as electric vehicles (EVs) [1], renewable energy conversion systems [2], communication networks [3] and small and large-scale energy storage systems [4,5]. In these applications, converters are needed to operate with high efficiency, high power density and high reliability under varying thermal and electrical loading conditions [6,7]. In EV inverters, renewable energy interfaces and storage converters, the thermal behaviour of the power module is directly linked to operating conditions, cooling performance and the condition of the thermal path inside the module [8,9]. Temperature variations in the SiC dies, solder layers, direct copper bonded (DCB) substrate and thermal interface material (TIM) may lead to performance degradation and accelerated ageing. Therefore, fast and accurate estimation of the internal temperature distribution, especially the junction temperatures, is crucial for real-time monitoring, health assessment and predictive control. Nevertheless, direct measurement of junction temperature inside commercial power modules is difficult in practice and only restricted sensing points are typically available [10,11].
Modern power converters utilizing wide band gap (WBG) materials possess the capacity to effect a significant transformation in energy efficiency compared to devices reliant on conventional silicon (Si). Gallium nitride (GaN) and silicon carbide (SiC) are regarded as very promising wide bandgap (WBG) materials that enable substantial surpassing of the performance thresholds of established silicon (Si) switching devices [12,13]. WBG-based power devices provide rapid switching with reduced power losses at elevated switching frequencies, hence enabling the creation of high-power density and high-efficiency power converters [14,15]. Developing effective and reliable advanced power electronics requires robust thermal monitoring strategies followed by predictive control engineering. Without appropriate monitoring, it is necessary to leave a substantial temperature margin, as overheating can cause severe damage to the module. Leaving a large margin, however, can lead to suboptimal operation strategies. Finding a good balance between the two can lead to more effective and reliable usage of power modules [15,16]. The main driver for temperature increases in power modules is the power loss due to switching and on-state losses, resulting in a high-power density [17,18].
Adequate cooling strategies will allow the power modules to operate at higher power levels. Cooling strategies may include both liquid cooling, natural and forced convection, etc. The materials used in the multilayer structure of modules are also very important for leading the heat away while ensuring electrical isolation [19,20]. Two specific layer types are of particular interest because they are especially susceptible to production variations and thermal degradation. These are the Thermal Interface Material (TIM) layer and solder layers, which can vary slightly from module to module. As discussed in [21,22], this has a significant influence on the thermal behaviour of the module. Furthermore, thermal degradation as a direct consequence of thermo-mechanical stress is thoroughly discussed in studies [21,23]. Measuring the junction temperature of a power module can be a difficult task since the junction is not readily available for physical sensors. High junction temperatures leave the power module susceptible to damage and can cause unreliable operation, as discussed in [21,24]. Therefore, this temperature is of high importance to system monitoring.
Digital twin technology is an important framework for bridging this gap between combining a digital representation of the physical system with data connections that enable synchronization between the physical and digital environments [25,26,27]. In thermal monitoring applications, a digital twin can be employed to estimate internal temperatures that are not directly measurable, while continuously adapting to the behaviour of the physical module with available sensor feedback [28,29]. The effectiveness of such a digital twin depends on both the fidelity of the thermal model and the computational time of the estimation technique. Therefore, there is a need for digital twin strategies that maintain the thermal accuracy of high-fidelity models while enabling fast inference suitable for real-time implementation [30,31]. Recent studies have shown that data-driven thermal models, reduced-order thermal networks and digital twin frameworks can significantly reduce the computational burden of detailed electro-thermal simulations while improving the practical applicability of thermal monitoring methods in power electronic systems [24,32,33]. However, many of the existing approaches are focused either on junction-temperature estimation at limited locations, reduced-order lumped thermal descriptions, or offline surrogate models without explicit synchronization with the physical module through available temperature measurements. In [34], a neural-network-based thermal model for heat sinks in power electronics applications is presented, and the potential of data-driven thermal modelling for computationally efficient temperature prediction is demonstrated. Reference [35] develops an N-layer Fourier approach for transient thermal modelling of a power module and provides a detailed description of multilayer thermal behaviour, although with limited suitability for fast online implementation. In [36], fractional diffusion equations of single and distributed order, which offer a theoretical basis for describing heat-transfer processes with memory effects, are discussed. Authors in [37] propose a fractional-order equivalent impedance model for the junction-to-case thermal characteristics of IGBT modules, showing that compact fractional-order thermal models can remain highly accurate. In addition, Ref. [38] presents a thermal-model-based junction-temperature estimation method for traction inverters and further underlines the practical relevance of reduced thermal models for converter thermal monitoring. In addition, the combined problem of preserving the spatial thermal information of a high-fidelity FEM model while achieving sufficiently low computation time for online implementation remains insufficiently addressed for multilayer SiC power modules. Therefore, there is still a need for a digital twin framework that can retain the thermal fidelity of detailed modelling while enabling fast inference, sensor-based synchronization and future integration into real-time monitoring and predictive control applications.
External temperature measuring strategies, such as placing sensors on the casing or other parts of the multilayer structure, might give an indication of the temperature. For more accurate temperature estimation, using tools like FEM simulators and digital twins can be helpful [39]. Another important characteristic to consider is the computation time because even short-term high temperature conditions might cause a system change or malfunction. The high complexity of FEM models comes at a cost; the large number of calculations results in long computation times, which makes such models too slow for real-time applications. This limitation makes it difficult to react to dynamic overheating. In other words, highly accurate FEM models may fail to keep up with the pace of temperature increase in applications where rapid thermal transients occur. A potential solution to this problem is the use of neural networks, which can significantly reduce computation time while maintaining acceptable accuracy [24,32].
Artificial intelligence and machine learning have shown promising results across a wide range of research fields and engineering applications [40,41,42]. These developments highlight their strong potential to further strengthen thermal modelling approaches for power electronic modules. In thermal modelling of power electronic modules, machine learning techniques have been applied to directly learn temperature-field distributions, to establish surrogate mappings between design parameters and thermal-performance indicators, and to enhance classical thermal-network models for monitoring purposes [43,44]. In particular, neural-network-based and physics-informed approaches have shown promising capability for reconstructing thermal fields, reducing the amount of required training data and improving computational efficiency compared with conventional high-fidelity modelling alone [45,46]. In addition, data-driven digital twin frameworks and machine-learning-assisted thermal models have been reported for junction-temperature estimation and online thermal monitoring of power modules [32,47,48]. However, the considerable number of existing approaches are focused on limited temperature locations, reduced-order descriptions or offline prediction without explicit synchronization between a high-fidelity three-dimensional thermal model and the physical module through available sensor feedback.
In this paper, which is an extension of [49], a neural network-based digital twin for fast and accurate temperature estimation in all three dimensions of a SiC MOSFET power module is presented. The target entity is the Wolfspeed EAB450M12XM3 power module, and the digital twin is developed for use in a power electronic converter. A FEM thermal model is developed in COMSOL Multiphysics® 6.1 to generate the dataset for neural-network training. The trained neural network is then used as a fast estimator of the temperature distribution, which includes the ten junction temperatures of the module with an average estimation time of 0.063 s. In order to improve practical applicability and synchronization between the digital twin and the physical module, NTC-based feedback techniques are incorporated. Two approaches are considered, featuring single-NTC and double-NTC measurements. These strategies are employed to improve synchronization, calibration and degradation tracking of the thermal behaviour in the multilayer structure of the module. By combining FEM-based modelling and machine learning with sensor-based synchronization, the proposed framework provides a foundation for accurate and computationally efficient thermal monitoring in power electronic modules. The main contributions of this paper can be summarized as follows:
Development of a neural network–based digital twin for fast three-dimensional temperature estimation in a SiC power module.
Generation of a physics-based thermal dataset using a FEM model in COMSOL Multiphysics.
Investigation of single-NTC and double-NTC synchronization approaches for improved model calibration and degradation tracking.
Investigation of the accuracy and the computational speed for real-time thermal monitoring and predictive control applications.
The remainder of this paper is organized as follows. Section 2 describes the development of the neural network-based digital twin, including the FEM model setup, dataset generation and temperature-estimation methods using NTC feedback. Section 3 presents the FEM-based thermal modelling of the SiC module, while Section 4 details the neural network framework and evaluation. Section 5 discusses time-dependent thermal modelling, and at the end, Section 6 concludes the paper and outlines future perspectives.

2. Developing the Neural Network–Based Digital Twin

2.1. Digital Twin Development

Several definitions of digital twins have been proposed throughout the years [50,51,52]. In this paper, the definition from the International Electrotechnical Commission (IEC) is particularly relevant: The digital representation of a target entity with data connections that enable convergence between the physical and digital states at an appropriate rate of synchronization [52,53]. A digital twin is thus a digital representation of a physical system designed to mirror its behaviour and characteristics as closely as possible via data connections that calibrate the physical and digital domains. The advantages of digital twins in monitoring and control applications depend on how well the digital twin captures physical behaviour and how fast the model responds to physical changes. The efficiency of a digital twin comes down to a fine balance of the trade-off between low computational time and high accuracy [33].

2.2. Target Entity

As a case study, the power module EAB450M12XM3 from Wolfspeed has been chosen as the target entity. In [32], the power module was dismantled and its internal multilayer structure presented. Figure 1 illustrates a schematic of this internal structure, showing the SiC dies, a DCB (Direct Copper Bonded) layer of ceramic substrate Si3N4 in between, solder layers and baseplate arrangement. In addition, thermal parameters and layer thicknesses are based on [32]. The module is mounted on a heatsink using a thermal interface material (TIM) to prevent trapped air pockets. A Negative Temperature Coefficient (NTC) thermistor is placed on the upper copper plate of the DCB layer to provide temperature measurements at that location. Note that the Layer thickness and thermal parameters can be found in [24].

2.3. Predictive Control Strategy and Working Principle

For a control strategy to be considered predictive, it must calculate the expected outcome within a system when a change occurs. The model should therefore predict a future temperature based on current system measurements. To be effective, the computation time must be significantly shorter than the time it takes for the temperature to reach its steady state, allowing the controller to adjust parameters such as power flow or cooling to prevent thermal damage. The core principle of the proposed methods is the interaction between the physical power module and the digital twin, as shown in Figure 2.
Data for training the neural network is generated using the FEM model, while the NTC thermistor provides real-time feedback for synchronization. The digital twin estimates the temperature distribution T ( X , Y , Z ) across the entire module, which can be used for health monitoring and ageing analysis. The digital twin provides the ability to incorporate a controller that should be able to take appropriate actions as mentioned. However, the particular controller design falls outside the scope of this paper.

2.4. Neural Network with NTC Adjustment Method

This method involves predicting the temperature using the trained neural network with the NTC as an anchor point to ensure synchronization (see Figure 3). If the system encounters a difference between the predicted and the measured NTC temperature, the digital twin’s predictions can be adjusted to better align with the physical readings. Furthermore, the ageing and degradation of certain elements within the module will naturally cause the digital twin to gradually perform worse over time, as the module is no longer working under the same conditions as initially. Here, the NTC serves as a way to coarsely adjust the predictions, though it is not able to detect and adapt to the thermal degradation of specific layers. This also assumes that the T(NTC) and T ( X , Y , Z ) changes proportionally and therefore can be adjusted by the same factor C . In order to allow for calibration in the digital twin and to account for thermal degradation in the multi-layer structure of the physical module, an alternative method is proposed.

2.5. Neural Network with Double NTC Calibration Method

When the materials in the multi-layer structure age from power cycles and cracks start to propagate, the thermal coupling between layers weakens, i.e., the corresponding thermal resistances increase. This method proposes the placement of an additional NTC thermistor on the copper baseplate, illustrated in Figure 4, to achieve a better insight into this relation.
The two NTC thermistors surround the DCB-layer and the solder layer below. The thermal behaviour from material degradation can be captured by the relation between the measurements from NTC1 and NTC2. If the measured temperatures move further apart over time, it can indicate that thermal degradation occurs in either the DCB or solder layer, or both. If the measured temperatures both increase over time, it can indicate that thermal degradation occurs in the TIM layer.
A calibration process of the digital model can be carried out to align with the actual thermal profile of the multi-layer structure while providing insight into the thermo-mechanical condition. As the NTCs are placed on top of the DCB and below the solder, this method does not distinguish between which of the two layers the degradation happens in. Therefore, only the conductivity of the solder layer is assumed to be variable in generating the dataset. However, the junction temperature can be more intelligently adjusted, as it is now known if the thermal resistances in specific parts of the system have changed. This serves as an addition to the single NTC method, for calibration purposes. The general idea of the calibration method is shown in Figure 5. As in the method using the single NTC thermistor, the scaled estimations are adjusted by the NTC error and then output a T ( X , Y , Z ) s. This is represented by the dotted line to the right.

3. FEM-Based Thermal Modelling of the Sic Module

In this section, the FEM-based thermal modelling of the SiC power module is presented, and the modelling assumptions, boundary conditions and dataset generation procedure are described.

3.1. Thermal Modelling Considerations

The temperature distribution in the power module greatly depends on the heat conductivity k of the materials and the heat transfer coefficient h of the heatsink. In general, the thermal conductivity of a material is temperature-dependent. This dependency varies greatly from one material to another, and the fact that k = k ( T ) adds another complexity to the problem [55]. Therefore, it can be helpful to consider how k varies in the temperature range of interest, but also to what degree it affects the final temperature. For SiC, Si3N4 and copper, reference [24] reveals how k changes with temperature. The thermal conductivity of the latter two is almost constant in the presented temperature range. Even though for SiC it is almost halved from 0 °C to 175 °C [24], the thermal resistance constitutes only 2% of the overall thermal resistance of the power module [22]. Therefore, k is considered constant for SiC, Si3N4 and copper. The thermal resistance of solder and TIM can be difficult to predict based on how the materials are applied.
In [4], Semikron discusses some of the difficulties of this subject and indicates how tricky this process is, especially for TIM. Although the TIM layer is by far the thinnest layer in the multi-layer structure, it accounts for about 50% of the total thermal resistance of the power module [22], due to its low thermal conductivity. Therefore, small variations in the thickness of the TIM have a large impact on the temperature distribution in the power module. The temperature distribution in the FEM model is calculated using different values of thermal conductivity for both the TIM layer and the solder layer. This enables the FEM model to capture the thermal behaviour of the system under gradually increasing thermal resistances from material degradation. Also, the result enables the digital model to be calibrated to its physical counterpart. Furthermore, the temperature distribution is also calculated by different values of the heat transfer coefficient, h, associated with the cooling system. This is based on the junction-to-case thermal resistance given by the manufacturer [54]. The required heat transfer coefficient to maintain a junction temperature below 175 °C is investigated for a range of power losses. Figure 6 shows the rated power loss as a function of h to demonstrate a sufficient cooling system.
Finally, a parametric sweep has been conducted in COMSOL 6.2, and the temperature distribution has been calculated for all combinations of the parameter values in Table 1.
The result sets the basis for training the neural network. Notably, the power loss Q and the ambient temperature T a m b are absent from the parametric sweep. Since the heat transfer coefficient is uniformly distributed and the remaining surfaces of the power module are thermally insulated, the temperature field can be adjusted linearly by these parameters. An example is demonstrated in Figure 7. The X , Y and Z coordinates are determined by the mesh presented in the next section.

3.2. The Dataset

The dataset produced by the parametric sweep consists of 12,467,268 unique data points, with changing values of X , Y , Z , k s o l d e r , k T I M , and h . Every data point corresponds to a temperature T at that given geometrical location.
This set is split into three different datasets called the training set, validation set and test set. The training set makes up 70% of the data and is used to train the neural network, while the validation and test sets make up 15% each. The validation set is used to calculate the running loss of a model and to tune the hyperparameters. This makes sure that the predictions do not overfit to the training set. The neural network is then finally evaluated on the test set, which has never been seen while training, to make sure the hyperparameters have not been overfitted to the validation set. This also ensures that the neural network is able to generalize well on new data points. Based on this dataset, the neural network inputs and outputs are as shown in Figure 8.

3.3. Model Description

In order to model the heat transfer in the power module, the software COMSOL Multiphysics 6.2 has been used. It enables one to take care of various phenomena affecting the overall heat transfer in the system, such as radiant heat transmitted from or to the power module, thermal-induced stress and changes in thermal resistance from expanding materials, volumetric heat release occurring from some chemical reaction or cooling effects from translational motion. These are important features to consider if one wishes to quantify the thermal behaviour of the physical system with extremely high detail. However, using such a complex and detailed model requires a large number of critical parameters, e.g., thermal expansion coefficients, stress/strain relation for each material and the emissivity of the inside and outside surface of the power module.
Therefore, to simplify the problem of heat transfer in the power module, the FEM model computes the three-dimensional temperature field within the power module from Laplace’s equation [56]:
2 T = 0
The geometry corresponds to the actual multilayer structure (SiC dies, solder, DCB, baseplate, and TIM). An alternative method for including the time-dependent term in the heat conduction equation is presented in a later section. Figure 9 shows a schematic of the multi-layer structure of the power module and illustrates the problem. It is assumed that heat is generated evenly in the 10 SiC dies and transferred to the corresponding solder layer beneath, expressed as:
d T d z z = i 0 = Q k s o l d A
z = i 0 corresponds to the interface between the SiC die and the solder. k s o l d is the thermal conductivity of that layer. At the bottom side of the power module, heat is conducted to a heatsink and transferred through convection, expressed as:
d T d z z = i 8 = h k h s ( T T )
Here z = i 8 denotes the interface between the heatsink and the coolant. k h s is the thermal conductivity from the heatsink to the coolant. T is the ambient temperature.
Even though the power module might be placed in the open, where surrounding elements like air can aid the cooling process, for the sake of keeping the junction temperature below 175 °C, it is assumed that all heat is transferred by the heatsink. Therefore, the power module is thermally insulated from the top and sides, so that:
d T d x x = 0 = d T d y y = 0 = d T d z z = 0
Regarding the initial condition of the problem, it is assumed that before operation, the temperature of the power module has settled to the surrounding temperature, so that:
T X , Y , Z , t 0 = T
As for the thermal parameters, all of the materials in the multi-layer structure are considered to be homogeneous. That is, the thermal conductivity k and pressure-specific heat capacity, T are assumed to be spatially independent.

3.4. Model Setup in COMSOL Multiphysics

The representation of the geometry of the power module in the FEM model can be seen in Figure 10. Important to mention is the bottom plate in Figure 10, which acts as the heatsink with a uniformly distributed effective heat transfer coefficient h . Figure 11 shows the corresponding mesh that has been built.
The shape and number of elements in the mesh were chosen to achieve a compromise between accuracy and computational speed. From every node in the mesh, a data point corresponding to that specific coordinate ( X , Y , Z ) is created.

3.5. Thermal Model Results

The results from the COMSOL model of the power module are computed from the solution of Equation (1). Figure 12 shows the calculated temperature distribution along the z-axis at the SiC chip with the highest temperature at a total power loss of 1000 W under a 25 °C ambient temperature with a heat transfer coefficient of 3400 W/m2K. Figure 13 shows the temperature field of the entire power module.

4. Neural Network Framework

The objective of the proposed Digital Twin is to estimate the ten junction temperatures of the SiC MOSFET module in real time, based on operating conditions, using a feed-forward neural network trained on Multiphysics simulation data. No optimization procedure is performed during inference, as the network acts as a direct surrogate model. Although the junction temperatures of the SiC MOSFET module are governed by physical laws, the neural network is trained in a data-driven manner using high-fidelity FEM simulations. Physical consistency is therefore captured implicitly through the training data, rather than via explicit physics constraints in the loss function or optimization procedure. This approach allows the digital twin to efficiently predict the full temperature field, including the ten junction temperatures. The process and method from data collection to model evaluation are illustrated in Figure 14. The data for training and testing the neural network is gathered from the aforementioned parametric sweep done on the FEM-model. This provides labelled data for training and testing the model. Hereby, input parameters are mapped to temperature results. First, the input features are chosen with respect to the controllable parameters that show non-linear behaviour within the model boundaries in the FEM model. The input features are then standardized using the Z-score normalization, where each input feature is transformed by subtracting its mean and dividing by its standard deviation, using the StandardScaler from scikit-Learn [57].
The neural network’s main function is to mimic the FEM simulations, but with a much shorter calculation time. In order to find the most appropriate neural network, a manual and custom version of a grid search [58] is designed. The models are trained for 10,000 epochs, but to minimize unproductive time consumption and overfitting, the Keras early stopping feature is implemented with a patience of 20. As summarized in Table 2, this trains a neural network with each of the combinations of the chosen set of parameters [34,59,60].
The neural networks are evaluated using MAE and MSE to compare the models. The best model is chosen and then confirmed through visual inspection to ensure accuracy in areas of great importance. Of these models in particular, the tanh activation function maintained a low MSE and MAE, but by visual inspection of the temperature field, it did not reproduce the stepwise temperature variations between the material layers with the same fidelity as the ReLU-based model. For the present thermal problem, this observation is physically meaningful. The temperature distribution in the power module is continuous, but due to the multilayer structure and the changes in thermal properties between SiC, solder, DCB, baseplate, and TIM, the spatial gradients vary from layer to layer. Therefore, the target mapping is not only nonlinear but also piecewise-varying over the geometry. In this regard, the piecewise-linear nature of the ReLU activation function makes it well-suited for preserving such local variations without introducing excessive smoothing. In contrast, smoother saturating activation functions, such as tanh, may smooth the transitions more than desired in regions where relatively sharp thermal-gradient changes occur. This practical observation is also consistent with studies showing that piecewise-linear or piecewise-convex neural-network structures can provide favourable optimization behaviour and reduce curvature-related difficulties compared with smoother nonlinear mappings [61,62,63]. In addition, ReLU is computationally simple and provides non-saturating gradients in its active region, which is beneficial for stable training of deeper neural-network structures. Based on these considerations and the visual as well as numerical results obtained in this work, the ReLU activation function was selected for the proposed digital twin.

4.1. The Hardware Used to Run the Neural Network Framework

The type of machine used to run a neural network significantly impacts how fast it is possible to run the calculations. Generally, both training and inference happen much faster on a GPU than on a CPU. This is due to the GPU’s ability to parallelize the computation to a much greater extent and thereby reduce the calculation time significantly. In this case, the neural network is run on an NVIDIA Quadro RTX 8000 GPU with 48 GB of VRAM, which provides great computational power and memory capacity. This makes it particularly well-suited for training large-scale models, working with high-resolution data, and handling large batch sizes. However, since the model is run on a shared GPU cluster, limitations may occur at times, potentially limiting the available GPU power and leading to variable performance depending on cluster load.
The model accuracy was evaluated on unseen FEM test data using the Mean Absolute Error (MAE) and Mean Squared Error (MSE):
M A E = 1 n i = 1 n y i y ^ i ,   M S E = 1 n i = 1 n y i y ^ i 2

4.2. The Neural Network Architecture

Based on the grid search, a neural network is built on the following configuration, summarized in Table 3. The neural network was implemented in PyTorch1.13.1 and trained using the Adam optimizer with mean squared error as the loss function. Because of the early stopping feature, this model has been trained for 454 epochs, and the model from epoch 434 has been chosen. This model is illustrated in Figure 15.

4.3. The Neural Network Results

The neural network is evaluated on the test set, with respect to model accuracy and calculation time. The following metrics, which are summarized in Table 4, have been calculated:
With the MSE being close to the MAE, we can expect that there are relatively few critical outliers since the MSE is more sensitive to larger relative errors due to the squaring of errors. This indicates that the model is consistently predicting close to the test data.
The distribution of the prediction error is seen in Figure 16. The errors are well distributed around the zero-error marker, indicating no significant bias. It also shows that almost all errors are within ±2.5 °C, confirming that larger errors are very rare.

4.4. The Neural Network Learning Curves

In order to further investigate eventual overfitting to the training data, a learning curve is computed for the neural network, as seen in Figure 17. Since the loss decreases very quickly in the early epochs, a zoom-in on the last 100 epochs is also shown. The learning curve generally shows that the test loss follows the trend of the training loss, though it is much more fluctuating. As the early stopping keeps the model with the lowest test loss, this suggests that the neural network is not at significant risk of overfitting. Even though the neural network is chosen based on the metrics above, a look at important details about the neural network predictions is necessary. This is done on data points that are not present in any of the datasets.
A benchmark used to evaluate the neural network’s performance is the temperature at the centre of the hottest chip. Looking at a vertical straight line from the centre of the hottest chip to the bottom of the heatsink, the temperature is predicted as shown in Figure 18 and Figure 19.
Comparing the result with the corresponding calculation in COMSOL gives a picture of how well the neural network estimates the calculated temperature throughout the entirety of the multi-layer structure of the power module, both before and after scaling by P l o s s and T a m b . It is clear from this that the neural network very well follows the FEM for this task with only minor deviations. This shows that the neural network is adequately designed for predicting the temperature in the hottest vertical line of the power module.
There is, however, a small deviation at the top of the graph, which corresponds to the SiC die, which is a problematic location to have even a small error, as this is one of the main objectives. A horizontal surface at the bottom of the baseplate is used to make sure that the thermal spreading is captured correctly. This is shown in Figure 20 and Figure 21. Here we see that the two heatmaps are very similar, which shows great coherence.

4.5. The Proposed Neural Network-Based Scheme Calculation Time

As the aim is to decrease calculation time, the model’s calculation time is evaluated. For this, the temperature is predicted for 1000 random data points within the power module’s geometry. As the junction temperature under rated conditions increases rapidly, it is necessary to make sure that the model calculates sufficiently fast. In Table 5, the speed of predictions, including scaling, is shown.
The predictions take on average 0.064 s, with the slowest prediction being 0.097 s. For predictive control, this is indeed fast enough, and therefore, the neural network is able to predict before the temperature changes. The distribution of prediction times can be seen in the histogram in Figure 22.
Here, it is seen that most of the calculation times are below 0.07 s. It is, however, still necessary to leave room for predictions to take up to 0.1 s, excluding an eventual buffer. For comparison, the transient junction temperature is calculated in COMSOL. Figure 23 shows a comparison between the transient junction temperature and the calculation speed of the FEM model and the neural network.

5. Discussion and Time-Dependent Thermal Modelling

The digital twin allows for accurate and fast temperature estimates. This means that the power module could be operated closer to the maximum allowable temperature and therefore potentially improve operating efficiency. Furthermore, the digital twin provides insight into temperatures in all 3 dimensions, allowing for analysis of thermo-mechanical stresses along the interface between the different layer materials. Here, the proposed Double NTC Calibration Method can serve as an aid to detect where the thermal degradation might start. This could potentially lead to improved lifetime estimates. The digital twin is, however, not able to detect and adapt to thermal degradation in the solder layer between the SiC die and the upper copper plate of the DCB layer. If crack propagation begins there, the temperature of the solder and SiC die will be higher than the prediction, due to a weaker thermal coupling with the NTC.
This event can occur without the digital twin detecting it. Evaluation of the digital twin is done solely compared to the FEM model, which means there is a lack of data to directly investigate the accuracy of the FEM model. Since the FEM model is the basis of the neural network’s training data, the digital twin’s accuracy is dependent on how well the FEM model captures the physical module. Rigorous physical testing would enhance the practicability of the digital twin. Here, alternative methods to measure junction temperature, e.g., using optical fibre, as discussed, could lead to better model validation. Adding to this, testing the control strategy on the modelled power module to compare the NTC temperatures with the estimates is a beneficial further step. Also, further work detailing the model could lead to closer proximity. Furthermore, it would be interesting to evaluate the performance under real-world conditions, such as fluctuating power loads, ambient conditions, and ageing effects.
The reported average inference time of 0.063 s for estimating the ten junction temperatures of the SiC MOSFET module is achieved because the trained neural network acts as a computationally efficient surrogate for the FEM simulations. During inference, no iterative optimization or trajectory-based computation is performed; the network produces predictions through a single forward pass from input operating conditions to the corresponding junction temperatures. This direct mapping ensures both accuracy and speed, enabling real-time monitoring without relying on optimizer behaviour. The neural network is able to reproduce the FEM model results with satisfactory accuracy while maintaining significantly reduced prediction time once trained. The selected architecture (7 hidden layers with 128 neurons per layer) was chosen to ensure sufficient representational capacity to approximate the nonlinear mapping between the input parameters and the resulting temperature field. Nevertheless, smaller neural network architectures could potentially achieve comparable performance with reduced computational cost. In particular, reducing the number of layers or neurons may shorten training time and lower hardware requirements, which would be beneficial in deployment scenarios with limited computational resources. In the present study, training and inference were performed on a high-capacity GPU, and therefore model accuracy and stability during training were prioritized over minimizing network size. Future work could investigate more compact architectures, model compression techniques such as pruning, or physics-informed neural network formulations that explicitly incorporate diffusion physics. Model compression approaches have been studied as a means to reduce neural network size and computational cost while maintaining predictive performance [64]. These approaches can further improve computational efficiency while maintaining the predictive accuracy demonstrated in this study. In addition to the spatial temperature estimation considered in this work, the dynamic thermal behaviour of the power module is very important for describing the response properties, and it is also highly relevant for reliability- and heat-ageing-related investigations. A very detailed description of the transient temperature distribution in the individual layers of the power module can be obtained by solving the time-dependent heat-transfer equation for each layer as:
k T = ρ C p T t
where ρ is the mass density and C p is the specific heat capacity. By a proper description of the boundary conditions at each interface, the temperature can, in principle, be estimated at any point in the multilayer structure. Using separation of variables leads to a Fourier-series solution [35], where the details of the transient solution can be found. It should be emphasized that such a solution provides a fully detailed description of the temperature distribution. However, for digital twin applications, this method is not suitable due to the long calculation time associated with the Fourier series evaluation.
When the main interest is the junction temperature, other analysis methods become more attractive. A formal time-domain solution to Equation (7) can be written as follows:
T r , t = T 0 r + 1 ρ C p τ = 0 t k T r , τ d τ
which is an integro-differential equation [36] with the initial condition T 0 r = T r , 0 + . Basically, Equation (8) is a variant of the so-called Cauchy problem. An appealing way to address this problem is to make use of fractional calculus [36,37], an outline of which is intended for future implementation. Let β denote a real number in the interval 0 < β < 1 . Then, the time evolution may be written as:
t T r , t = 1 ρ C p D t 1 β k T r , t
where the Riemann-Liouville (R-L) time fractional derivative is introduced:
D t β f t = d m d t m 1 Γ m β τ = 0 t f τ d τ t τ β + 1 m
where Γ m β is the Gamma function,
Γ z = 0 e t t z 1 d t
As mentioned above, a detailed 3D solution can be obtained by separation of variables, leading to a Fourier-series solution, which contains rich information, including heat spreading [35] in each individual layer as heat is transferred through the multilayer system. This phenomenon represents a heat (temporary) storage effect, or heat-capacity effect, with a delayed response. While this is physically informative, the Fourier-series solution is far too slow for an online digital twin approach. Therefore, a compact model that can account for at least some of the same thermal-memory and capacity phenomena should be considered.
A promising candidate is the Foster thermal network model [37] which has been used in power module thermal modelling for a long time. In the present work, the detailed 3D information is provided by the FEM model and its neural network surrogate model, where the latter provides very fast calculations. The preferred neural network surrogate model and the FEM model represent the same thermal information, since the neural network is trained on FEM-generated data. A relevant future direction is therefore to use the neural network surrogate model (and the subsidiary FEM model) to provide parameters for a Foster-type thermal network model.
The Foster thermal network model is able to represent the above-mentioned heat-capacity phenomena. The number of cells in such a model can be chosen arbitrarily [37,38]. In general, more cells lead to higher accuracy but also to a larger number of parameters that must be estimated. To keep the number of parameters at a minimum, a time-fractional-calculus-based model is of particular interest. In addition to reduced parameterization, it is well-suited for systems with capacity and memory effects [37,38], and it is also a strong candidate for integrating information from the double-thermistor system.
In order to compare a Foster thermal network model with a time fractional-order model, a thermal impedance expression for the Foster model is needed. In the Laplace s -domain, the thermal impedance of an n -cell Foster model is given by [37,38] as follows:
Z t h s = i = 1 n R i 1 s C i R i + 1 s C i = i = 1 n R i 1 + s τ i
where τ i = R i C i . By taking the inverse Laplace transform, the voltage response of the n -cell Foster-type model can be written as:
v t = L 1 V s = i = 1 n I R i 1 e t / τ i
The common definition of the thermal impedance across the multilayer power-module structure from junction to case is:
Z t h = T j T c P = Δ T P
where T j is the junction temperature, T c is the case temperature, and P is the power. For the case of constant applied power P , the time-domain temperature response becomes:
Δ T t = i = 1 n P R i 1 e t / τ i
and thus the thermal impedance in the time domain is:
Z t h t = i = 1 n R i 1 e t / τ i
For the time fractional-order model, the heat-conduction dissipation process can be described using a fractional-order thermal capacitor C β through the relationship between current i t and voltage v t as follows [37,38]:
i t = C β D t β v t
where β can be interpreted as the order of the thermal capacitor, and the fractional-order derivative is given by Equation (10). Taking the Laplace transform of Equation (17) yields:
I s = L i t = L C β D t β v t = C β s β V s
from which the fractional-order impedance expression is obtained as:
Z C β = V s I s = 1 s β C β
This makes it possible to establish a connection to the thermal impedance in Equation (12) while also providing a mathematically suitable framework for implementing information from the double-thermistor innovation. In [37,38], an interesting comparison is reported between a four-cell Foster thermal network model and a fractional-order equivalent thermal impedance model, where the latter requires only two parameters to be estimated. It is concluded in [37,38] that the fractional-order model is more concise than the Foster model while remaining highly accurate in describing the characteristic curves in the frequency domain.
Power modules are, in general, subject to electrical load variations, causing temperature fluctuations in the multi-layer structure. These create thermomechanical stresses, which again induce cracks and voids between the layers. Cracks and voids will alter the thermal impedance in the layers in question and thereby change the performance of the power module. In [65], a so-called frequency-domain model has been utilized to monitor degradation in the power modules, an interesting work which also made use of FEM analysis for acquiring detailed information about the heat flow in the multi-layer structure. Furthermore, Ref. [65] also presents detailed laboratory experiments with numerous temperature sensors, the results of which were combined with the FEM analysis and thereby degradation under various loading (frequency) profiles were obtained. The proposed two-sensor technique in the present work can give valuable information about the degradation process as well. Especially because of the Neural Network Model, a response in the temperature profile from the junction temperature to the case temperature can be monitored in situ. The frequency content in the two measured temperatures can be analyzed and combined very fast. Taking the inverse Laplace transform of Equation (19) leads to an expression for the time-dependent thermal impedance of the power module. As pointed out in [66], an estimation of this can be obtained from Equation (12), and hence, the time-dependent thermal impedance can be implemented in the two-sensor technique proposed in the present work. By continuous acquisition of data from the two temperature sensors, the uncertainties of the measurements can be found by statistical means. Hence, accurate and fast temperature measurements open the possibility to estimate the case temperature continuously. However, in case a discrepancy between the measured and estimated temperatures occurs, a first indication of degradation is obtained.
Considered as a new thermal modelling approach, the fractional-order equivalent thermal impedance model therefore appears to be highly promising. For the present work, it can be of particular interest in combination with the applied digital twin approach. Furthermore, the dual-thermistor innovation is expected to be of special interest for in situ adjustment of the few parameters in the fractional-order model. If such in situ adjustment reveals gradual parameter changes, this may indicate reliability-related degradation mechanisms. This topic is of high interest for future work.

6. Conclusions

In this paper, a fast and accurate method for estimating the temperature distribution within the entire multilayer structure of a Silicon Carbide (SiC) power module has been developed using neural networks combined with digital twin methodology. The model enables the prediction of all ten junction temperatures of the Wolfspeed EAB450M12XM3 module with high precision and short computation time. Feedback from single and double NTC thermistor deployment is suggested to compensate for estimation errors and to adjust the digital twin, which creates a more reliable and robust system. Especially, the proposed two-sensor technique can give valuable information about the thermal degradation process as well. The multilayer structure has been modelled in COMSOL Multiphysics to produce precise temperature fields under various operating conditions that form the basis for the neural network training. The proposed digital twin framework demonstrates that combining FEM-based modelling with neural networks enables speed and accuracy, which support the implementation of predictive thermal control strategies. The work allows for future investigation into controller integration, experimental validation, and monitoring strategies under real operating conditions. Some of the monitoring strategies can be related to fluctuating junction temperatures and how these can be utilized as failure indicators. It means that valuable information can be extracted from the time-dependent term in the heat equation, a term which has been omitted in the present work. Not only steady state oscillations but also transient behaviour can be taken into account to establish a framework for lifetime estimations. Hence, environmental and load-imposed cycling can be implemented in such a monitoring strategy.

Author Contributions

Conceptualization, J.H.P. and A.F.; Methodology, K.B.T. and S.I.S.; Software, K.B.T. and S.I.S.; Validation, K.B.T., S.I.S. and J.H.P.; Investigation, K.B.T., S.I.S., J.H.P. and A.F.; Writing—original draft, K.B.T. and S.I.S.; Writing—review & editing, A.F. and J.H.P.; Supervision, J.H.P. and A.F. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the European Union’s Horizon Europe research and innovation programme under grant agreement number 101056896, project RHODaS. Views and opinions expressed are, however, those of the authors only and do not necessarily reflect those of the European Union or CINEA. Neither the European Union nor the granting authority can be held responsible for them.

Data Availability Statement

The data presented in this study are available on request from the corresponding author due to privacy restriction.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Mirković, N.; Delgado, A.; Alou, P.; Vasić, M. 24 kW Single-Stage Three-Phase Inductive Power Transfer Charger for EV Applications Based on Matrix Converter and DD2Q Coil Topology. IEEE Trans. Power Electron. 2026, 41, 2991–3010. [Google Scholar] [CrossRef]
  2. Hossain, M.S.; Islam, M.R.; Biswas, S.P.; Sutanto, D.; Muttaqi, K.M. Advanced PWM Technique for Performance Improvement of Solid-State Transformer-Based Renewable Energy Systems. IEEE Trans. Ind. Electron. 2026, 73, 2196–2207. [Google Scholar] [CrossRef]
  3. Gheisarnejad, M.; Fathollahi, A.; Sharifzadeh, M.; Laurendeau, E.; Andresen, B.; Al-Haddad, K. Optimal Policy Gradient-Based Type-2 Fuzzy Control for Multi-DC Terminal PEC Converter in 5G-Based Commercial Buildings. IEEE J. Emerg. Sel. Top. Power Electron. 2025, 13, 5149–5161. [Google Scholar] [CrossRef]
  4. Li, X.; Lyu, L.; Geng, G.; Jiang, Q.; Zhao, Y.; Ma, F.; Jin, M. Power Allocation Strategy for Battery Energy Storage System Based on Cluster Switching. IEEE Trans. Ind. Electron. 2022, 69, 3700–3710. [Google Scholar] [CrossRef]
  5. Fathollahi, A.; Nielsen, T.W.; Andresen, B.; Dannesboe, C. Designing an IGBT-Based Active Front-End Rectifier with Voltage Oriented-based Control for Power-to-Hydrogen Electrolyzers. In Proceedings of the 2025 7th Global Power, Energy and Communication Conference (GPECOM), Bochum, Germany, 11–13 June 2025; pp. 69–74. [Google Scholar]
  6. Zhou, Y.; Wang, Z.; Tan, L.; Man, C.; Dou, J.; Shen, D.; Wu, Y.; Ma, K.; Liao, Y.; Yang, L.; et al. A High-Density 3.3 kV/2000 a SiC MOSFET Power Module: Design, Fabrication, and Experimental Evaluation. IEEE Trans. Power Electron. 2026, 41, 4696–4713. [Google Scholar] [CrossRef]
  7. Hietanen, J.M.; Gokhale, K.; Veik, B.; Kärkkäinen, T.J.; Silventoinen, P. Voltage Sharing Between MOSFETs in a SiC-Based Two-Switch Flyback. IEEE Trans. Power Electron. 2025, 41, 7563–7573. [Google Scholar] [CrossRef]
  8. Liang, Z.; Zhu, L.; Sun, Y.; Li, J.; Qin, R.; Arka, B.; Costinett, D.; Bai, H. Full Integration of On-Board Charger, Auxiliary Power Module, and Wireless Charger for Electric Vehicles Using Multipurpose Magnetic Couplers. IEEE Trans. Ind. Electron. 2024, 71, 9962–9967. [Google Scholar] [CrossRef]
  9. Gheisarnejad, M.; Fathollahi, A.; Sharifzadeh, M.; Laurendeau, E.; Al-Haddad, K. Data-Driven Switching Control Technique Based on Deep Reinforcement Learning for Packed E-Cell as Smart EV Charger. IEEE Trans. Transp. Electrif. 2025, 11, 3194–3203. [Google Scholar] [CrossRef]
  10. Tao, Y.; Li, J.; Gao, G.; Liu, Z.; Rinderknecht, S. Goal-Oriented Data-Driven Control for a Holistic Thermal Management System of an Electric Vehicle. IEEE Trans. Autom. Sci. Eng. 2024, 21, 4893–4904. [Google Scholar] [CrossRef]
  11. Amini, M.R.; Kolmanovsky, I.; Sun, J. Hierarchical MPC for Robust Eco-Cooling of Connected and Automated Vehicles and Its Application to Electric Vehicle Battery Thermal Management. IEEE Trans. Control Syst. Technol. 2021, 29, 316–328. [Google Scholar] [CrossRef]
  12. Bamati, S.; Chaoui, H.; Yahyaouy, A.; Gualous, H. State of Charge Estimation Across Dynamic and Sub-Zero Thermal Conditions for Electric Vehicles. IEEE Access 2025, 13, 176399–176409. [Google Scholar] [CrossRef]
  13. Mukherjee, S.; Barbosa, P. Design and Optimization of an Integrated Resonant Inductor with High-Frequency Transformer for Wide Gain Range DC–DC Resonant Converters in Electric Vehicle Charging Applications. IEEE Trans. Power Electron. 2023, 38, 6380–6394. [Google Scholar] [CrossRef]
  14. Lotfi, A.; Maalandish, M.; Aghaei, A.; Samadian, A.; Fathollahi, A.; Hosseini, S.H.; Khooban, M.-H. Novel multilevel inverter topology with low switch count. Int. J. Emerg. Electr. Power Syst. 2024, 25, 305–319. [Google Scholar] [CrossRef]
  15. Shu, J.; Wang, H.; Tao, M.; Ng, Y.H.; Feng, S.; Du, Y.; Zhou, Z.; Sun, J.; Lee, S.W.R.; Chen, K.J. Stacked Strongly Coupled GaN/SiC Cascode Device with Fast Switching and Reclaimed Strong dv/dt Control. IEEE Trans. Electron Devices 2025, 72, 2647–2653. [Google Scholar] [CrossRef]
  16. Ibrahim, A.; Salem, M.; Kamarol, M.; Delgado, M.T.; Desa, M.K.M. Review of active thermal control for power electronics: Potentials, limitations, and future trends. IEEE Open J. Power Electron. 2024, 5, 414–435. [Google Scholar] [CrossRef]
  17. Baliga, B.J. Fundamentals of Power Semiconductor Devices; Springer: Berlin/Heidelberg, Germany, 2018. [Google Scholar]
  18. Teimouri, A.; Fathollahi, A.; Raeiszadeh, M.; Rezaei, M.; Mosavi, A. High-Speed Fault Detection and Location Approach for Multilevel Inverters Using Deep Learning and Reliability Evaluation. IEEE Open J. Ind. Electron. Soc. 2026, 7, 291–312. [Google Scholar] [CrossRef]
  19. Leslie, S.G. Cooling options and challenges of high power semiconductor modules. Electron. Cool. 2006, 12, 20. [Google Scholar]
  20. Jones-Jackson, S.; Rodriguez, R.; Emadi, A. Jet Impingement Cooling in Power Electronics for Electrified Automotive Transportation: Current Status and Future Trends. IEEE Trans. Power Electron. 2021, 36, 10420–10435. [Google Scholar] [CrossRef]
  21. Semiconductors, P. Application Manual Power Semiconductors; SEMIKRON International GmbH: Nuremberg, Germany, 2010. [Google Scholar]
  22. Wolfspeed, Inc. PRD-08376, Thermal Characterization Methods and Applications. 2024. Available online: https://assets.wolfspeed.com/uploads/2024/02/Wolfspeed_PRD-08376_Thermal_Characterization_Methods_and_Applications_Application_Note.pdf (accessed on 13 April 2025).
  23. Zhang, X.; Cheng, Z.; Sun, X.; Wu, K.; Li, J. A direct bonding copper degradation monitoring method for insulated gate bipolar transistor modules: Boundary-dependent thermal network combined with feedback control. High Volt. 2023, 8, 1264–1274. [Google Scholar] [CrossRef]
  24. Chen, M.; Wang, H.; Pan, D.; Wang, X.; Blaabjerg, F. Thermal characterization of silicon carbide MOSFET module suitable for high-temperature computationally efficient thermal-profile prediction. IEEE J. Emerg. Sel. Top. Power Electron. 2020, 9, 3947–3958. [Google Scholar] [CrossRef]
  25. Deantoni, J.; Muñoz, P.; Gomes, C.; Verbrugge, C.; Mittal, R.; Heinrich, R.; Bellis, S.; Vallecillo, A. Quantifying and combining uncertainty for improving the behavior of Digital Twin Systems. Automatisierungstechnik 2025, 73, 81–99. [Google Scholar] [CrossRef]
  26. Böttjer, T.; Tola, D.; Kakavandi, F.; Wewer, C.R.; Ramanujan, D.; Gomes, C.; Larsen, P.G.; Iosifidis, A. A review of unit level digital twin applications in the manufacturing industry. CIRP J. Manuf. Sci. Technol. 2023, 45, 162–189. [Google Scholar] [CrossRef]
  27. Fathollahi, A.; Dannesboe, C.; Andresen, B. Modelling and experimental investigation of the power conversion system for solid oxide water electrolysis. IET Conf. Proc. 2025, 2025, 424–429. [Google Scholar] [CrossRef]
  28. Wu, Y.; Sun, J.; Yang, G.; Cui, L.; Wang, Z.; Wang, M. Research on digital twin based temperature field monitoring system for LNG storage tanks. Measurement 2023, 215, 112864. [Google Scholar] [CrossRef]
  29. Torchio, R.; Conte, F.; Scarpa, M.; Filippini, M.; Pase, F.; Toso, F.; Nasab, P.S.; Marson, E.; Viroli, A.; Posa, P.; et al. Digital Twins in Power Electronics: A Comprehensive Approach to Enhance Virtual Thermal Sensing. IEEE Trans. Power Electron. 2025, 40, 6977–6987. [Google Scholar] [CrossRef]
  30. Qi, X.; Ma, C.; Li, M.; Liu, J.; He, J.; Totis, G.; Hua, C.; Wang, L.; Cui, G.; Li, G.; et al. A novel thermal error compensation framework towards precision manufacturing: Integrating digital twin technology and edge server deployment. J. Intell. Manuf. 2026. [Google Scholar] [CrossRef]
  31. Wu, Z.; Ma, C.; Zhang, L.; Gui, H.; Liu, J.; Liu, Z. Predicting and compensating for small-sample thermal information data in precision machine tools: A spatial-temporal interactive integration network and digital twin system approach. Appl. Soft Comput. 2024, 161, 111760. [Google Scholar] [CrossRef]
  32. Shuai, Z.; He, S.; Xue, Y.; Zheng, Y.; Gai, J.; Li, Y.; Li, G.; Li, J. Junction temperature estimation of a SiC MOSFET module for 800V high-voltage application in electric vehicles. eTransportation 2023, 16, 100241. [Google Scholar] [CrossRef]
  33. Wu, C.; Cui, Z.; Xia, Q.; Yue, J.; Lyu, F. An Overview of Digital Twin Technology for Power Electronics: State-of-the-Art and Future Trends. IEEE Trans. Power Electron. 2025, 40, 13337–13362. [Google Scholar] [CrossRef]
  34. Molinero, D.; Santamargarita, D.; Bueno, E.; Vasic, M.; Marrón, M. Development of an Artificial Neural Network Based Thermal Model for Heat Sinks in Power Electronics Applications. IEEE Open J. Power Electron. 2024, 5, 1500–1509. [Google Scholar] [CrossRef]
  35. Choudhury, K.R.; Rogers, D.J. Transient Thermal Modeling of a Power Module: An N-Layer Fourier Approach. IEEE Trans. Power Electron. 2020, 35, 10580–10591. [Google Scholar] [CrossRef]
  36. Mainardi, F.; Pagnini, G.; Gorenflo, R. Some aspects of fractional diffusion equations of single and distributed order. Appl. Math. Comput. 2007, 187, 295–305. [Google Scholar] [CrossRef]
  37. Shang, H.; Deng, C.; Chen, F.; Zhang, N.; Feng, Z.; Chen, X. Fractional-order equivalent impedance model for junction-case thermal characteristics of insulated gate bipolar transistor modules. COMPEL-Int. J. Comput. Math. Electr. Electron. Eng. 2025, 44, 373–391. [Google Scholar] [CrossRef]
  38. Kong, K.; Choi, J.; Park, G.; Baek, S.; Ju, S.; Han, Y. Insulated gate bipolar transistor junction temperature estimation technology for traction inverters using a thermal model. Electronics 2025, 14, 999. [Google Scholar] [CrossRef]
  39. Fekete, T.; Antók, D.; Tatár, L.; Bereczki, P. Extending reliability of FEM simulations, based on optically assisted tensile tests and digital twins. Procedia Struct. Integr. 2022, 42, 1684–1691. [Google Scholar] [CrossRef]
  40. Fathollahi, A. Machine Learning and Artificial Intelligence Techniques in Smart Grids Stability Analysis: A Review. Energies 2025, 18, 3431. [Google Scholar] [CrossRef]
  41. Nti, I.K.; Adekoya, A.F.; Weyori, B.A.; Nyarko-Boateng, O. Applications of artificial intelligence in engineering and manufacturing: A systematic review. J. Intell. Manuf. 2022, 33, 1581–1601. [Google Scholar] [CrossRef]
  42. Peykarporsan, R.; Oshnoei, S.; Fthollahi, A.; Lie, T.T. A Novel Intelligent Fractional Order Cascade Control to Enhance Wind Energy Conversion in Wind Farms: A Practical Case Study. IEEE Trans. Energy Convers. 2025, 40, 1736–1749. [Google Scholar] [CrossRef]
  43. Hua, Y.; Wang, Z.-Q.; Yuan, X.-Y.; Bai Li, Y.; Wu, W.-T.; Aubry, N. Estimation of steady-state temperature field in Multichip Modules using deep convolutional neural network. Therm. Sci. Eng. Prog. 2023, 40, 101755. [Google Scholar] [CrossRef]
  44. Zhang, B.; Wu, G.; Gu, Y.; Wang, X.; Wng, F. Multi-domain physics-informed neural network for solving forward and inverse problems of steady-state heat conduction in multilayer media. Phys. Fluids 2022, 34, 116116. [Google Scholar] [CrossRef]
  45. Xu, Z.; Gao, Y.; Wang, X.; Tao, X.; Xu, Q. Surrogate Thermal Model for Power Electronic Modules using Artificial Neural Network. In Proceedings of the IECON 2019—45th Annual Conference of the IEEE Industrial Electronics Society, Lisbon, Portugal, 14–17 October 2019; pp. 3160–3165. [Google Scholar]
  46. Zhang, Y.; Wang, Z.; Wang, H.; Blaabjerg, F. Artificial Intelligence-Aided Thermal Model Considering Cross-Coupling Effects. IEEE Trans. Power Electron. 2020, 35, 9998–10002. [Google Scholar] [CrossRef]
  47. Bhoi, S.K.; Frikha, M.A.; Martin, G.E.; Hosseinabadi, F.; Chakraborty, S.; Baghdadi, M.E.; Hegazy, O. A Data-Driven Thermal Digital Twin of a 3-Phase Inverter Using Hi-Fidelity Multi-Physics Modelling. In Proceedings of the 2023 25th European Conference on Power Electronics and Applications (EPE’23 ECCE Europe), Aalborg, Denmark, 4–8 September 2023; pp. 1–8. [Google Scholar]
  48. Kim, M.K.; Yoon, Y.D.; Yoon, S.W. Actual Maximum Junction Temperature Estimation Process of Multichip SiC MOSFET Power Modules With New Calibration Method and Deep Learning. IEEE J. Emerg. Sel. Top. Power Electron. 2023, 11, 5602–5612. [Google Scholar] [CrossRef]
  49. Troldborg, K.B.; Skov, S.I.; Fathollahi, A.; Pedersen, J.H. Intelligent Machine Learning–Based Scheme for Fast Temperature Estimation of SiC Power Modules in Electric Vehicle Power Converters. In Proceedings of the 2026 IEEE Intelligent Vehicles Symposium (IV), Detroit, MI, USA, 22–25 June 2026. [Google Scholar]
  50. Fathollahi, A.; Andresen, B. Power Quality Analysis and Improvement of Power-to-X Plants Using Digital Twins: A Practical Application in Denmark. IEEE Trans. Energy Convers. 2025, 40, 1909–1921. [Google Scholar] [CrossRef]
  51. Abrazeh, S.; Mohseni, S.R.; Zeitouni, M.J.; Parvaresh, A.; Fathollahi, A.; Gheisarnejad, M.; Khooban, M.H. Virtual Hardware-in-the-Loop FMU Co-Simulation Based Digital Twins for Heating, Ventilation, and Air-Conditioning (HVAC) Systems. IEEE Trans. Emerg. Top. Comput. Intell. 2023, 7, 65–75. [Google Scholar] [CrossRef]
  52. International Electrotechnical Commission. Internationally Agreed Concepts and Terminology for Digital Twins. Available online: https://www.iec.ch/blog/internationally-agreed-concepts-and-terminology-digital-twins (accessed on 8 May 2025).
  53. Jeong, D.Y.; Baek, M.S.; Lim, T.B.; Kim, T.W.L.; Kim, Y.W.; Kim, S.H.; Lee, Y.T.; Jung, W.S.; Lee, I.B. Digital Twin: Technology Evolution Stages and Implementation Layers With Technology Elements. IEEE J. Mag. 2022, 10, 52609–52620. [Google Scholar] [CrossRef]
  54. Wolfspeed, Inc. EAB450M12XM3. 2019. Available online: https://www.wolfspeed.com/products/power/sic-power-modules/xm3-power-module-family/eab450m12xm3/ (accessed on 26 February 2026).
  55. Nellis, G.; Klein, S. Heat Transfer; Cambridge University Press: Cambridge, UK, 2008. [Google Scholar]
  56. Guo, S.; Li, X.; Zhang, J.; Bin, G.; Huang, W. A triple reciprocity method in Laplace transform boundary element method for three-dimensional transient heat conduction problems. Int. J. Heat Mass Transf. 2017, 114, 258–267. [Google Scholar] [CrossRef]
  57. StandardScaler. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.StandardScaler.html (accessed on 26 February 2026).
  58. scikit-learn. GridSearchCV. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.GridSearchCV.html (accessed on 8 May 2025).
  59. Fathollahi, A.; Gheisarnejad, M.; Sharifzadeh, M.; Laurendeau, E.; Andresen, B.; Al-Haddad, K. On-Policy Machine Learning Based-Disturbance Rejection Control for Grid-Tied PEC9 Inverter Under Parameters Mismatch and Distorted Grid Voltage. IEEE Trans. Emerg. Top. Comput. Intell. 2026, 10, 1025–1037. [Google Scholar] [CrossRef]
  60. Zou, J.; Han, Y.; So, S.-S. Overview of artificial neural networks. In Artificial Neural Networks: Methods and Applications; Springer: Berlin/Heidelberg, Germany, 2009; pp. 14–22. [Google Scholar]
  61. Krasopoulos, P.T.; Maratos, N.G. An Interior Point Recurrent Neural Network for Convex Optimization Problems. In Mathematics Without Boundaries: Surveys in Interdisciplinary Research; Pardalos, P.M., Rassias, T.M., Eds.; Springer: New York, NY, USA, 2014; pp. 409–427. [Google Scholar]
  62. Theodorakatos, N.P.; Babu, R.; Lytras, M.D. Avoiding the Maratos Effect in Non-Convex Optimization Through Piecewise Convexity: A Case Study in Optimal PMU Placement Problem. Algorithms 2026, 19, 11. [Google Scholar] [CrossRef]
  63. Maratos, N.G.; Moraitis, M.A. Some results on the Sign recurrent neural network for unconstrained minimization. Neurocomputing 2018, 287, 1–21. [Google Scholar] [CrossRef]
  64. Dantas, P.V.; Sabino da Silva, W.; Cordeiro, L.C.; Carvalho, C.B. A comprehensive review of model compression techniques in machine learning. Appl. Intell. 2024, 54, 11804–11844. [Google Scholar] [CrossRef]
  65. Ma, K.; Zhong, Q.; Xu, M.; Wang, D.; Wang, H.; Wang, S. Degradation Monitoring of Power Modules Based on Frequency-Domain Thermal Model. In Proceedings of the 2022 IEEE Energy Conversion Congress and Exposition (ECCE), Detroit, MI, USA, 9–13 October 2022; pp. 1–5. [Google Scholar]
  66. Yang, Z.; Chen, X.; Zhu, B.; Zhang, L.; Huang, Y. On Fractional-Order Thermal Network Modeling and Parameter Extraction of IGBT Modules. IEEE Trans. Power Electron. 2025, 41, 4756–4765. [Google Scholar] [CrossRef]
Figure 1. The schematic of the internal multilayer structure of the SiC Power Module [54].
Figure 1. The schematic of the internal multilayer structure of the SiC Power Module [54].
Electronics 15 01325 g001
Figure 2. The core concept of the proposed temperature estimation approach [54].
Figure 2. The core concept of the proposed temperature estimation approach [54].
Electronics 15 01325 g002
Figure 3. Estimation method using the neural network and the NTC thermistor for corrections.
Figure 3. Estimation method using the neural network and the NTC thermistor for corrections.
Electronics 15 01325 g003
Figure 4. The model calibration framework with additional NTC.
Figure 4. The model calibration framework with additional NTC.
Electronics 15 01325 g004
Figure 5. The calibration method utilizing neural network and two NTC sensors.
Figure 5. The calibration method utilizing neural network and two NTC sensors.
Electronics 15 01325 g005
Figure 6. The required heat transfer coefficient to maintain a junction—temperature below 175 °C.
Figure 6. The required heat transfer coefficient to maintain a junction—temperature below 175 °C.
Electronics 15 01325 g006
Figure 7. The temperature-field scaling with power loss Q and ambient temperature T a m b (top: Q = 100 W, T a m b = 25   °C, bottom: Q = 150 W, T a m b = 50   °C).
Figure 7. The temperature-field scaling with power loss Q and ambient temperature T a m b (top: Q = 100 W, T a m b = 25   °C, bottom: Q = 150 W, T a m b = 50   °C).
Electronics 15 01325 g007
Figure 8. The Neural Network Features.
Figure 8. The Neural Network Features.
Electronics 15 01325 g008
Figure 9. Schematic of the multilayer power-module thermal problem and boundary conditions.
Figure 9. Schematic of the multilayer power-module thermal problem and boundary conditions.
Electronics 15 01325 g009
Figure 10. The model geometry is built in COMSOL Multiphysics.
Figure 10. The model geometry is built in COMSOL Multiphysics.
Electronics 15 01325 g010
Figure 11. COMSOL FEM mesh of the SiC power-module thermal model.
Figure 11. COMSOL FEM mesh of the SiC power-module thermal model.
Electronics 15 01325 g011
Figure 12. The temperature distribution along the z-axis through the centre of the hottest SiC chip (upper half-bridge), with a sketch of the multilayer structure.
Figure 12. The temperature distribution along the z-axis through the centre of the hottest SiC chip (upper half-bridge), with a sketch of the multilayer structure.
Electronics 15 01325 g012
Figure 13. The temperature field of the SiC power module.
Figure 13. The temperature field of the SiC power module.
Electronics 15 01325 g013
Figure 14. Flowchart of the proposed neural network.
Figure 14. Flowchart of the proposed neural network.
Electronics 15 01325 g014
Figure 15. The final Neural Network architecture.
Figure 15. The final Neural Network architecture.
Electronics 15 01325 g015
Figure 16. Error distribution for predictions on the test set.
Figure 16. Error distribution for predictions on the test set.
Electronics 15 01325 g016
Figure 17. Learning curves for all 454 epochs.
Figure 17. Learning curves for all 454 epochs.
Electronics 15 01325 g017
Figure 18. FEM calculations on a vertical line from the hottest point on the module.
Figure 18. FEM calculations on a vertical line from the hottest point on the module.
Electronics 15 01325 g018
Figure 19. Predictions on a vertical line from the hottest point on the module.
Figure 19. Predictions on a vertical line from the hottest point on the module.
Electronics 15 01325 g019
Figure 20. FEM-horizontal surface at the bottom of the baseplate.
Figure 20. FEM-horizontal surface at the bottom of the baseplate.
Electronics 15 01325 g020
Figure 21. NN—horizontal surface at the bottom of the baseplate.
Figure 21. NN—horizontal surface at the bottom of the baseplate.
Electronics 15 01325 g021
Figure 22. Histogram of neural-network prediction times (including scaling) for 1000 random points within the power-module geometry.
Figure 22. Histogram of neural-network prediction times (including scaling) for 1000 random points within the power-module geometry.
Electronics 15 01325 g022
Figure 23. Comparison of the transient junction-temperature response and model computation time, illustrating the speed advantage of the neural network over the COMSOL FEM model for predictive thermal estimation.
Figure 23. Comparison of the transient junction-temperature response and model computation time, illustrating the speed advantage of the neural network over the COMSOL FEM model for predictive thermal estimation.
Electronics 15 01325 g023
Table 1. Values used for parametric sweep.
Table 1. Values used for parametric sweep.
ParameterValue
h (W/m2K)100, 1000, 2250, 2500, 2750, 3000, 3250, 3500, 3750, 5000, 7000
ksolder (W/mK)18, 24, 30, 33, 36, 42
kTIM (W/mK)1.5, 2.0, 2.5, 2.75, 3.0, 3.5
Table 2. Parameter combinations used for model grid search.
Table 2. Parameter combinations used for model grid search.
HyperparameterValues Tested
Neurons in layer32, 64, 128
Number of layers5, 6, 7
Activation function‘relu’, ‘tanh’, ‘sigmoid’
Optimizers‘sgd’, ‘adam’
Batch sizes2048, 4096, 8192
Dropout rate0, 0.01, 0.05
Table 3. Model hyperparameters used neural network architecture.
Table 3. Model hyperparameters used neural network architecture.
HyperparameterValues Tested
Neurons in layer128
Number of layers7
Activation function‘relu’
Optimizers‘adam’
Batch sizes8192
Dropout rate0, 0.01, 0.05
Table 4. Evaluation metrics for the proposed neural network.
Table 4. Evaluation metrics for the proposed neural network.
MetricValue
MAE0.3782
MSE0.2822
Table 5. Evaluation computational time for the proposed neural network.
Table 5. Evaluation computational time for the proposed neural network.
MetricTime (s)
Avg time0.0635
Median time0.0622
Fastest time0.0541
Slowest time0.0973
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Troldborg, K.B.; Skov, S.I.; Fathollahi, A.; Pedersen, J.H. Advanced Machine Learning Approach for Fast Temperature Estimation in SiC-Based Power Electronics Converters. Electronics 2026, 15, 1325. https://doi.org/10.3390/electronics15061325

AMA Style

Troldborg KB, Skov SI, Fathollahi A, Pedersen JH. Advanced Machine Learning Approach for Fast Temperature Estimation in SiC-Based Power Electronics Converters. Electronics. 2026; 15(6):1325. https://doi.org/10.3390/electronics15061325

Chicago/Turabian Style

Troldborg, Kalle Bundgaard, Sigurd Illum Skov, Arman Fathollahi, and Jørgen Houe Pedersen. 2026. "Advanced Machine Learning Approach for Fast Temperature Estimation in SiC-Based Power Electronics Converters" Electronics 15, no. 6: 1325. https://doi.org/10.3390/electronics15061325

APA Style

Troldborg, K. B., Skov, S. I., Fathollahi, A., & Pedersen, J. H. (2026). Advanced Machine Learning Approach for Fast Temperature Estimation in SiC-Based Power Electronics Converters. Electronics, 15(6), 1325. https://doi.org/10.3390/electronics15061325

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop