1. Introduction
As threats against the next generation of defense systems increase in scale and complexity, modernizing its existing components is imperative. To combat these novel threats, the Navy is shifting towards directed energy weapons such as air defense radar, Unmanned aerial systems jamming, and laser weapons [
1]. However, these new systems have more significant power demands than legacy weapon systems [
2]. To address this issue, the Navy has begun switching its fleet to fully electric ships. Integrating new electric propulsion systems will ensure that the loads of next-generation weapon systems are met and offer additional benefits, such as enhanced fuel efficiency, lower noise levels, and reduced emissions [
3]. Fully integrated electric propulsion systems are certainly viable, and have been utilized on commercial vessels. However, the implementation of this technology onto Navy vessels introduces a new set of engineering challenges. Naval weapon and mission systems require reliable power distribution, which, if not properly managed, could necessitate the shutdown of the systems for indeterminate periods. Moreover, a lack of stability in the power grid of a ship can lead to thermal, mechanical, and electrical stress on the vessel, jeopardizing the safety of its crew. Consequently, modernizing the power and energy systems on ships has become more imperative than ever to support new combat systems [
4].
To modernize naval fleets and support directed energy weapons, a revolution in control systems is necessary. Traditional naval power and thermal management systems are unsuitable for the large and complex load profiles of modern directed energy weapons [
5]. These systems operate reactively, adjusting to power fluctuations only after they occur, which is incompatible with the real-time requirements of modern vessels [
6]. Additionally, legacy control systems lack integrated fault detection, making it difficult to anticipate or address potential system failures needed to proactively manage rapid power fluctuations of next-generation weapon systems [
7]. Moreover, traditional naval systems are compartmentalized, leading to communication gaps between subsystems, requiring input from human operators [
8]. In contrast, a fully electric ship requires an interconnected autonomous control architecture capable of managing ship resources, optimizing performance based on desired operation, and adapting to faults, all in real time. To achieve this, significant advances in modeling and simulation are necessary to form a virtual representation of a ship that can predict system behavior under various conditions [
9].
Digital twins are a tool that can enable better control and performance of a system by continuously assimilating real-time sensor data into faithful system representations to link the virtual and physical spaces together [
10]. As such, a digital twin comprises two key elements: a virtual representation and a feedback loop between the virtual and physical counterparts [
10]. The digital twin is continuously updated using real-time sensor data to accurately replicate and adapt to the behavior of the physical counterpart during its operational state [
11]. While the concept of digital twins has been around since the early 2000s, their implementation was hindered by the technology of that time [
12]. Recently, digital twins have gained popularity with the advent of the fourth industrial revolution, as evidenced by the increased number of publications and patents in recent years [
13]. As engineering problems become more complex and improving the efficiency of existing systems becomes more challenging, digital twins have emerged as an attractive tool for performing functions such as prognostication, optimization, testing, and control [
14]. For instance, a digital twin updated using fuzzy logic and operational data from a power plant has demonstrated the feasibility of creating an accurate virtual representation of the plant [
15]. This approach allowed for safer training of new control room operators and studying different control schemes. Other digital twin applications include online parameter optimization to increase the efficiency of a system. The cooling water system of a district plant behavior was reproduced and helped in optimization efforts, achieving 2–3% in energy saving from the previous year [
16]. Another example of increased system performance boosted by digital twins, is using online measurements from a heat pump to calibrate a model to reduce fouling and unplanned downtime [
17]. Real-time health monitoring is also an area that significantly benefits from digital twin implementation. A digital twin of cooling fans achieved a 95% fault detection success rate, informing a user of a fault before it had occurred [
18]. Furthermore, a digital twin modeled on a system designed to emulate liquid-cooled power converters detected faults in the cooling system with a 100% success rate in under 1 min [
19]. A complete digital twin implementation enables real-time monitoring, improves system reliability, enhances risk management, and increases system efficiency [
20]. In naval applications, a digital twin can be coupled with other existing combat systems and electrical models to enhance the survivability of a ship. Digital twins allow operators to address dynamic changes in the system and support strategic decision-making. Most importantly, this tool can aid in the operational management of the next generation of naval electric ships and high-power systems.
Digital twins have become increasingly important for use in advancing the design, operation, and maintenance of complex engineering systems. While many publications suggest different frameworks for the development of digital twins, the National Academy of Engineering (NAE) has identified and the general characteristics and implementation procedures for a useful digital twin [
10]. Using the framework information provided by the NAE, digital twins can be developed and implemented for a variety of unique physical systems. The standardization provided by the NAE allows for consistency in the production of quality digital twins for a variety of physical systems. This consistency is especially critical for mission-driven systems such as naval vessels, where the digital twin framework must operate in real-time, be computationally efficient, and accurate within an acceptable margin of error. Meeting these conditions requires continuous digital twin tuning techniques utilizing reliable data streams and processing capabilities to update the virtual representation to reflect changes in its physical counterpart.
Population-based optimization algorithms have recently gained considerable traction as a framework for updating digital twin faithful representations. Data-driven algorithms can learn from a large amount of data to model the complex behavior of a system that other techniques fail to capture [
21]. These meta-heuristic algorithms seek an optimal solution to a cost function by utilizing a swarm of model instances to systematically explore a search-space [
22]. Specifically, particles in the swarm will each represent different variations of the same virtual representation. The swarm iteratively evaluates them based on real-time sensor data and seeks to minimize discrepancy between the simulation and actual system behavior. The manner in which these particles move throughout the search space is defined by an algorithm specific to its swarm-like behavior, discussed in more detail later in this paper. A comprehensive review of the most popular swarm optimization algorithms was done by testing them on 30 different benchmark functions [
23]. It was concluded that in an environment where computation cost and speed are necessary, Particle Swarm Optimization (PSO) performed the best. Population-based algorithms have shown successful implementation in updating multi-physics model parameters for heat management, such as optimizing cooling strategies in a data center [
24], enhancing prediction performance of CNC machining technology spindle thermal error [
25], digital twin controller of HVAC systems [
26]. These publications focus on the performance of the digital twin rather than the speed at which it is updated. In the context of updating a virtual representation of a cooling system to form a digital twin, speed is of the utmost importance [
27]. Among population-based methods, PSO was chosen as the model updating method selected for the present framework because bounded-time optimization on an edge computer requires low implementation complexity, direct handling of bounded parameters, and repeated restart between successive data windows.
This work makes three contributions. First, it formulates a parameter-tuning digital twin framework for a liquid-cooled power electronics thermal system using a coupled electrical, fluid, and thermal representation. Second, it implements an edge-resident particle swarm framework that executes on the same local computing environment as the data acquisition and simulation workflow, which removes network latency from the update loop and constrains calibration to fixed ten-minute data windows. Third, it introduces recovery, calibrated, and stall window logic so that the swarm can switch between rapid state recovery and fine parameter refinement after a physical discontinuity. Relative to prior digital twin studies that emphasize offline calibration or accuracy improvement under nominal conditions [
24,
25,
26,
27], the present framework emphasizes bounded update latency, computational feasibility at the edge, and recovery from abrupt system changes under experimentally relevant thermal loads. The framework is demonstrated on a liquid-cooled testbed representative of shipboard thermal management for six power converter modules, and the continuously updated representation improves root mean squared error by more than 90% at four measurement locations. The remainder of this paper is organized as follows.
Section 2 discusses the methodology behind the proposed digital twin, the PSO framework, the cooling system, and modeling methods. The results and discussion are presented in
Section 3. The final section provides a conclusion to this work.
2. Materials and Methods
This section provides an overview of the digital twin framework, the cooling system of the power and energy testbed, the faithful representation, data acquisition, and updating scheme that form the digital twin framework.
2.1. Digital Twin Framework
The digital twin framework aims to assimilate real-time sensor data to link a physical counterpart to its virtual representation. An overview of the digital twin framework is showcased in
Figure 1. During the operation of a system, operators control various mechanical and electrical components to meet certain objectives. Throughout the lifecycle of the system, these actions will cause the system to degrade, forever changing its behavior. These changes are captured by sensors instrumented throughout the system, and data is relayed to the digital twin via data acquisition equipment. Running alongside the physical system, the digital twin utilizes the acquired data to tailor the virtual representation to its physical counterpart. To accomplish this goal, a digital twin tuning using an updating technique within the digital twin autonomously tunes parameters in the faithful representation. The tuned faithful representation outputs simulation data and is evaluated against the real-world data. The process of assessing permutations of virtual representation parameters happens continuously until the optimal parameters are found. Once the optimum is found, the virtual representation is deemed calibrated, and the projected digital twin response is returned to the operator. The operator can then leverage the digital twin to perform strategic planning, system optimization, or increase operational efficiency.
2.2. Testbed Configuration Overview
In this study, a notional shipboard power system is used as an example to validate and study the proposed digital twin framework. The power system is replicated by a testbed composed of six power electronic converters that interface with various loads as illustrated in the electrical setup section of the diagram in
Figure 2. These converters are part of a microgrid designed to emulate the power and energy systems of a naval ship, effectively replicating the onboard power system [
28]. Each module is designed to manage specific voltage inputs and outputs while supporting a load. The converters in use typically produce power losses ranging from approximately 100–150 W, which is comparable to power losses generated by many Naval power electronics [
29]. Typically, these power losses would be handled by the cooling system. However, the power modules were not operational in this work and only act as heat sinks. Instead, the dissipated heat from the power modules was emulated by a water heater.
The testbed is outfitted with a cooling loop for heat management and designed to mimic the cooling system of a naval ship. The cooling loop is composed of a submersible pump, tank, water heater, a three-fan radiator, as shown in
Figure 3, and the six power modules shown in detail in
Figure 4. The testbed is also instrumented with nine temperature sensors that record temperature data during experiments. There are four points where these temperature sensors are located: six on the heat sinks of the power modules, one submerged in the tank, and one after both the heater and radiator. The direction in which the water flows through the cooling loop is shown in
Figure 2. Distilled water is circulated throughout the loop at 3.785 L/min by a submersible pump inside a 37.85-L tank. Water first flows through a water heater coupled with a power supply where the dissipated heat produced by ship systems is emulated based on the approximate power losses from the six labeled power converters, C1–C6, during standard operation. This heated water then flows through the coolant plates of six power modules. While the power modules can produce heat, they were not operational during testing and only act as thermal masses. After the power modules, water enters the fan radiator and the chilled water returns to the tank. These components help form the cooling system that deploys a 40-h load profile simulating a battle-time scenario.
2.3. Characterization of the Multi-Physics Representation
A multi-physics approach was used to capture the behavior of the testbed cooling system. The virtual representation couples three physical fields: electrical, fluid, and thermal. The electrical field maps the measured heater power input to the heat source applied to the loop. The fluid field represents coolant transport through the heater, power modules, radiator, and tank at the measured nominal flow rate of 3.785 L/min. The thermal field predicts the transient temperatures of the coolant and the effective solid thermal masses associated with these components. This decomposition was selected because the dominant experimental inputs are electrical power and coolant transport, whereas the dominant outputs are temperatures measured at four locations.
Each major cooling-loop element was represented as a lumped control volume governed by the energy balance
where
is the effective thermal capacitance of component
j,
is the coolant mass flow rate,
is the coolant specific heat,
is inlet coolant temperature,
is internal heat generation,
is the effective thermal resistance between adjacent control volumes, and
is the effective ambient heat rejection coefficient. For the heater,
is prescribed by the measured electrical power supplied to the water heater. For the power modules,
during the baseline experiment because the modules act as thermal masses rather than active heat sources. For the radiator and tank, the ambient heat rejection term dominates the sink behavior.
Before implementing the representation into the digital twin framework, it must be characterized to ensure that it faithfully represents the physical system. While the updating scheme can fit an inaccurate representation to a physical system, that approach reduces the accuracy of future predictions. To ensure the highest level of accuracy in the virtual representation, the cooling loop components were characterized and their parameters were determined using four categories: measurement, experiment, estimation, and online tuning. Measured parameters include geometric quantities and directly observed operating conditions, such as pipe lengths, pipe diameters, tank volume, coolant flow rate, and heater power input. Experimentally identified parameters were obtained by fitting the representation to recorded temperature data when direct measurements were insufficient to match the transient response. Estimated parameters include effective thermal resistance and ambient heat rejection terms when closed-form calculation would require assumptions not supported by the available hardware information. The subset selected for online updating consists of the parameters that are both weakly identifiable by direct measurement and strongly influential on transient temperature error.
Table 1 summarizes the parameters of the lumped multi-physics representation, the associated component, the determination method, and whether each quantity was fixed, a fixed boundary condition (BC) or updated online.
2.4. Digital Twin Updating Scheme
The objective of the updating scheme is to continuously recalibrate the bounded tunable parameter vector so that the multi-physics representation remains synchronized with the cooling-loop measurements. A new optimization cycle is triggered when a complete ten-minute window of sensor data becomes available. Let window k contain the temperature measurements from the heater, power modules, radiator, and tank sensors over that interval. The representation is not re-identified at every sample because each optimization cycle requires repeated evaluation of the full multi-physics model. Periodic updating was therefore adopted to provide a bounded optimization time, reduce sensitivity to instantaneous sensor noise, and prevent incomplete transients from dominating the search. Continuous updating could reduce latency only if the representation and optimizer were reformulated for streaming operation. That was not the objective of the present study.
Each particle position represents one candidate value of the tunable parameter vector, and the corresponding velocity governs how that candidate moves through the feasible set [
30]. The particle position update is written as
and the velocity update is
where
W is the inertia factor,
and
are the cognitive and social acceleration coefficients, and
and
are random scalars sampled from
. The feasible set is defined by lower and upper bounds on every entry of
together with the physical constraints
,
,
, and
. Consequently, the objective function is window-dependent cost defined below, and the constraints are the parameter bounds and positivity requirements that keep the search within physically admissible values.
The cost function is selected according to the mismatch between the measured and simulated temperatures at the beginning of each new data window. If the initial mismatch is large, a recovery window is triggered and the cost emphasizes state alignment at the start of the window,
where
and
are the four-sensor temperature vectors at the first sample of the current window. If the initial mismatch is small, a calibrated window is used and the cost is the root mean squared error over the full window,
where
is the number of samples in the current window. A stall window combines both objectives when the initial mismatch and the full-window error are simultaneously non-negligible,
The selected window type determines both the cost function and the level of particle exploration. Recovery windows emphasize rapid state correction, calibrated windows emphasize fine parameter refinement, and stall windows combine both behaviors to avoid premature stagnation near the previous optimum as seen in
Figure 5.
Choosing an appropriate window size is paramount for the performance of the updating scheme. If the window is too small, the updating scheme may not have enough time to converge before the next window becomes available. If the window is too large, the latency between the physical system and the virtual representation reduces the ability of the digital twin to adapt to dynamic changes in the physical system. To determine the appropriate update interval, the simulation displayed in
Figure 6 was run 10,000 times on the same five-minute window. The average runtime of the simulation was 4.636 s, with a standard deviation of 0.634 s. The maximum runtime was 16.925 s, but a runtime of 16 s or more occurred only twice out of 10,000 instances. As a result of this investigation, a ten-minute window was chosen.
Using the measured average runtime, one ten-minute window supports approximately serial model evaluations. For a swarm with particles, this corresponds to approximately full particle-swarm iterations per window on average. As a result, a 10-particle swarm allows about 12 full iterations, a 20-particle swarm about 6, and a 30-particle swarm about 4. This runtime budget bounds the practical number of configurable parameters that can be processed in real time. In the present implementation, the 12-parameter vector remained computationally feasible within the ten-minute deadline, whereas larger swarms would reduce the number of full iterations available before the next data window arrives. Small swarms reduce search diversity and can delay recovery after a discontinuity, while very large swarms increase computational cost without a proportional reduction in window error.
The function of the updating algorithm is shown in
Figure 7. The algorithm starts by initializing a swarm of particles with random positions and velocities. Once enough data is collected to fill a window, the main loop begins. The same dissipated heat load profile deployed onto the water heater, measured in watts, is also applied to the digital twin, linking the virtual representation to the physical system. After the load profile is configured, the PSO update loop begins. The values of the representation parameters are tuned using the position of each particle, and the configured virtual representation is run and evaluated against the current sensor window using the cost function associated with that window type. When the global-best calibrated-window root mean squared error
is less than 0.1 °C, the representation is flagged as calibrated for that window. When a new window is acquired, the global best solution from the previous window is retained, the acceleration factors are reset, and an additive perturbation sampled from
is applied to each particle position to prevent entrapment at the previous optimum before the next window begins.
The manually tuned model is the static baseline representation. It was identified once offline from the 40-h experimental data and then held fixed unless the updating scheme was activated. The online updating scheme therefore does not replace the baseline model. It starts from that manually tuned representation and recalibrates the tunable subset of parameters whenever a new sensor window becomes available.
The pertinent values used to initialize and govern the PSO algorithm are given in
Table 2, and have been included in this manuscript to allow for reproducibility. Particle positions were randomly initialized using a uniform distribution within the ranges listed in
Table 2. All parameters were constrained to the global bounds
, and the optimization objective was to minimize the temperature–pressure RMSE between simulated and measured system responses.
2.5. Test Scenario
An arbitrary ten-hour test scenario was deployed onto the testbed to gather data for characterization of the virtual representation. The scenario was created to replicate a battle-time scenario while keeping in mind the limitations of the testbed cooling system. Many ship systems were considered while designing a load profile. The electrical loads of the systems were calculated, and dissipated heat was dumped into the system via the power supply and water heater. However, not every system was modeled in the same way. Hotel and communication systems are air-cooled and are only modeled electrically. The water-cooled, propulsion, navigational, and directed energy systems are modeled electrically and thermally. The propulsion system is throttled throughout the test and represents the majority of the load for the duration of the test. The navigational, communication, and radar systems have a constant load throughout the test. Unlike these systems, hotel and directed energy systems are applied intermittently, depending on the time. The load of each system is combined to create a ten-hour load profile deployed onto the power supply
Figure 8.
The ten-hour load profile was repeated four times, creating a 40-h experiment; the results are shown in
Figure 9. During the experiment, temperature readings were taken at four points along the cooling system (heater, power modules, radiator, and tank). As previously mentioned in
Section 2.3, experimental data was used to fit the virtual representation. Results of this manually tuned representation are shown in
Figure 10 and its performance metrics in
Table 3. The simulation results show that the manually tuned virtual representation is relatively accurate.
One significant benefit of a calibrated representation is that it can be leveraged to conduct test scenarios without physically altering the testbed. In later sections, this manually tuned virtual representation will synthesize experimental data to assess the performance of the updating scheme when handling discontinuities.