1. Introduction
About 23% of carbon dioxide (CO
2) emissions are emitted from the transportation sector, so emission regulations for transportation systems are being strengthened [
1]. In particular, fossil fuels used for propulsion in transportation systems are known to be a major contributor to greenhouse gas emissions, as their combustion process releases a significant amount of CO
2. Excluding the COVID-19 period in 2020, CO
2 emissions from fossil fuel combustion in the transportation system have continued to increase, exceeding 8 Gt since 2020. The shipping sector accounts for about 3% of total CO
2 emissions, which is lower than the automotive sector, but it has been steadily increasing. To address this situation, the International Maritime Organization (IMO) has been progressively tightening emission regulations in the shipping sector, aiming for net-zero greenhouse gas emissions by 2050 [
2,
3].
Achieving compliance with these regulations ultimately requires the electrification of propulsion systems [
4]. However, depending on the type of ship or the operating profile, there may be limitations in securing total propulsion power using only electrified propulsion. For ships that need to travel long distances or require high propulsion power immediately, the capacity and cost of the energy storage system (ESS) limit their ability to meet the load demand of all-electric propulsion systems. In addition, all-electric propulsion still contains technical limitations, including insufficient charging infrastructure, energy management systems, and operational safety, which must be resolved before it can be applied in large mobility systems such as ships. For cases requiring immediate high propulsion power or power necessary for medium- to long-distance operations, a hybrid electric propulsion system that combines an internal combustion diesel engine with a battery-based electric propulsion system can be considered an appropriate solution [
5]. The combination of marine diesel engines and batteries is one of the most widely used hybrid configurations.
The authors of [
6] confirmed that an oceangoing bulk carrier with a diesel–battery hybrid propulsion system can reduce fuel consumption by 2–3% while lowering CO
2 and NOX emissions by 5–7% [
7]. The Viking Lady Offshore Supply Vessel, one of the hybrid electric propulsion ships currently in operation, is equipped with four Wärtsilä 6R32DF dual-fuel engines, each with a rated power of approximately 2100 kW, and two Alconza QD 560 M2-6W electric motors, each with a rated power of 2300 kW [
8]. Another hybrid electric propulsion vessel, the Color Hybrid Ferry, is a plug-in hybrid ferry built in Norway. It is equipped with a Rolls-Royce Bergen B33:45L diesel engine capable of generating approximately 3600 kW of power and a battery with a capacity of approximately 5 MWh. Such a hybrid electric propulsion system offers high efficiency and performance during high-load operations while maintaining optimal fuel consumption rates during low-load operations [
9,
10]. A hybrid electric propulsion ship integrates a diesel engine as the primary power source with an electric propulsion system for auxiliary power, designed to reduce fuel consumption and emissions. It is generally known that hybrid electric propulsion can reduce fuel consumption by about 15 to 25% compared to conventional diesel engine-based systems.
In a land vehicle system, where hybrid electric propulsion was first applied, the power flow of the hybrid electric propulsion system is illustrated in
Figure 1 [
11]. Hybrid electric propulsion systems using different power sources can provide the power for operations by combining engines and electric power (case 1+3) or can charge the battery using the engine (case 2). However, since regenerative braking is impossible for ships, recovering operational load as electric energy is impossible (case 4). Therefore, compared to land vehicles, the strategies for distributing and managing hybrid power may be different. The algorithm for distributing and controlling power between the internal combustion engine and electric power corresponds to sigma (Σ) in
Figure 1. The hybrid power control strategy controls the propulsion output at the center of different power sources and, in some cases, adjusts the energy flow accordingly.
In a hybrid propulsion system, the distribution of internal combustion engine power and electric propulsion power can be achieved through various control strategies. Representative power control strategies include thermostat control, load leveling, equivalent consumption minimization strategy (ECMS) [
12], and horizon optimization. The thermostat control strategy is established based on the battery state of charge (SOC). When the SOC drops below a certain threshold, the engine increases power to recharge the battery [
13]. In [
14], thermostat control strategy was applied to a series hybrid electric vehicle (SHEV) to enable the propulsion system to operate at its efficient points. The load leveling control strategy determines the operating mode based on demanded power, selecting between electric propulsion, assist mode (using both the engine and motor), and charge mode for battery recharging. A representative application of load leveling was proposed in the field of electric vehicles, where charging and discharging times were strategically controlled to balance peak demand and minimize grid stress, particularly by employing regional time-shifted charging and daytime discharging of private EVs [
15]. The ECMS control strategy, also known as the instantaneous optimization method, replaces the fuel consumption of the engine and the energy consumption of electric propulsion with an equivalent fuel consumption at each time step. By calculating the total fuel consumption, it determines the power distribution ratio that minimizes overall fuel usage. Although the control strategy does not have an optimal solution when considering the entire operation, it can be considered a pseudo-optimal solution because it contains an optimal solution, at least for each time interval. Huang et al. [
16] proposed MPC-ECMS for hybrid trucks, and simulation results showed that the proposed control strategy improved fuel consumption by about 4.2% compared to the existing control strategy. Horizon optimization is a method for finding a global solution and is not used for real-time power control [
17]. Instead, it is utilized to pre-evaluate the system’s optimal efficiency or to acquire prior knowledge for developing rule-based control strategies [
18,
19].
For hybrid power distribution control during actual ship operations, real-time computation must be feasible, and since future operational data is not precisely known, reliance on rule-based control strategies or the instantaneous optimization method is unavoidable. When establishing a control rule or an instantaneous optimization control strategy, it is generally necessary to optimize relevant control variables based on a representative operational profile [
20]. For example, in the case of the load leveling method, the demanded power threshold for switching to electric mode must be optimized, while in ECMS, the fuel–electric energy equivalent parameter needs to be adjusted. However, the pre-set deterministic control variables may not be appropriate when variations in maritime environmental conditions and operational profiles occur. Consequently, a deterministic control strategy based on a single operational profile may result in lower performance in actual operation conditions than in ideal scenarios.
In this study, to overcome the limitations of the deterministic ship hybrid power control strategy, a control strategy with stochastic characteristics was established to deal with the variability of the operating load. While stochastic control methods have been explored in the hybrid electric vehicle [
21], very few studies have attempted a probabilistic power control framework in the marine field. Most existing hybrid power control strategies rely on deterministic control variables optimized for specific reference operational profiles. However, under real-time operating conditions where power demands fluctuate, such deterministic strategies may result in degraded performance. This study introduces a novel approach by generating operational profiles based on a Markov chain and training a stochastic control map that accommodates variable load conditions. Through this, a real-time applicable probabilistic control strategy is proposed, representing a new contribution to marine hybrid propulsion control. A new operational profile is derived by adding variability to the reference operational profile, and a stochastic control strategy trained on this data is proposed. In order to derive a new operational profile, this study proposes a Markov chain-based operational profile generation algorithm. In addition, a memory factor is applied to solve the control fluctuation problem, which is a limitation of the stochastic control strategy, and a SOC correction factor is applied to sustain the battery SOC.
This paper is organized as follows:
Section 2 introduces a Markov chain-based operational profile generation algorithm that can generate multiple operational profiles, including variations from a reference operational profile.
Section 3 introduces a ship efficiency model for evaluating the algorithm and derives horizon-optimal solutions from various operational profiles. These solutions are then utilized for training the stochastic control strategy.
Section 4 introduces the stochastic control strategy, and
Section 5 evaluates its performance on new operational profiles.
The stochastic control strategy developed in this study demonstrates high efficiency even when the demanded power differs from the reference operational profile. Therefore, the results of this study can be utilized as a control strategy with real-time applicability to hybrid electric propulsion ships, demonstrating robust efficiency even in situations where operating fluctuations occur.
2. Markov Chain-Based Operational Cycle Augmentation
The demanded power required for ship operation can fluctuate due to variations in environmental operating conditions, such as wind loads, currents, and wave resistance, as well as changes in hotel loads and propulsion requirements. In the case of land vehicles, demanded power can be relatively categorized based on acceleration requirements, road incline, and aerodynamic load. If representative driving cycles for highway and urban driving conditions are established, variations in demanded power remain within a relatively small range. However, for ships, even when operating on a fixed route, the optimal control solution for a specific operational cycle may not be suitable in real-world conditions due to the large fluctuations in operational load. To overcome this issue, this study aims to generate a new operational cycle that considers fluctuations in operational load along a specific route to develop a control strategy trained on this data.
Based on the reference operational cycle, the Markov chain model was used as a method to generate a new operational cycle that incorporates variability. The Markov chain models state transitions over time based on the current state or the previous k states within a defined memory.
In this case, when the memory state
k is set to
k = 1, it is assumed that there is no memory of the previous state, which is defined as the Markov property. If the time of a specific state is
t and its probability distribution follows the Markov property, the joint probability distribution, including previous time steps, can be simplified into a product of sequential probabilities as follows:
The driving or operating state of mobility systems is known to follow the Markov property, where the current state is determined by the immediately previous state [
22]. In land vehicles, these Markov properties are used to generate standardized driving cycles [
23,
24]. In previous studies on these systems, actual vehicle driving data is first collected and segmented into smaller data sections with similar characteristics. After composing the segmented drive cycles into modal events of acceleration, deceleration, cruise, and idle modes, a new driving cycle is generated assuming that each modal event has a Markov property. Finally, among the generated driving cycles, the one with a speed–acceleration frequency distribution (SAFD) that is most similar to the actual driving data is selected as the standardized driving cycle.
The speed–acceleration frequency distribution (SAFD), used as a comparison standard, is a two-dimensional histogram representing the occurrence frequency of speed and acceleration combinations. It probabilistically characterizes the acceleration and deceleration patterns of the driving data and serves as a criterion for comparing the similarity of driving cycles in terms of demanded power.
Referring to the research cases of these land vehicles, this study applies a Markov chain-based operational data augmentation process, as shown in
Figure 2, to create multiple operational cycles with variations in demanded power from a representative ship operational cycle.
SAFD and Markov Transition Matrix of Operational Data
The target vessel in this study is a support vessel or cruise ship with large fluctuations in operating speed and acceleration. The power configuration of the vessel is illustrated in
Figure 3. A diesel engine for primary propulsion is mechanically connected to the propeller shaft, while an electric motor is arranged in parallel to assist propulsion. This hybrid architecture allows for the simultaneous use of the engine and motor during propulsion and also enables partial conversion of engine power into electrical energy via the motor. Detailed specifications and efficiency maps of the power sources are presented in
Section 3.1.
The representative operation cycle shown in
Figure 4 was derived from a hybrid electric recreational boat operating on the Kuala Terengganu (KT) river in Malaysia [
25]. The KT operation cycle consists of the acceleration phase, deceleration phase, and cruise phase.
The operational data shows that the vessel primarily operates at a speed of 3–4 m/s but accelerates to 8 m/s in the final segment to return to the port. The total operating time is 2000 s (approximately 33 min) per cycle. If the hybrid power control strategy is optimized based on the operational data in
Figure 4, the speed of 3–4 m/s would account for the largest proportion of operation. Thus, the control strategy is likely to be optimized for this operating point. Additionally, considering the high-speed return condition in the later phase, the system may reserve electrical energy in advance and use it at the final step for more efficient operation. However, if the maximum speed operation time changes or the proportion of operation at 3–4 m/s varies, a power distribution control strategy optimized for a specific operational dataset may show suboptimal performance. Therefore, this study generated an SAFD based on the representative operational data in
Figure 4 and used it to create multiple operational datasets with variations.
To generate the SAFD, the speed range was divided into 20 bins from the minimum speed (0 m/s) to the maximum speed (8.78 m/s). Similarly, the acceleration range was divided into 20 bins from the minimum acceleration (−0.6 m/s
2) to the maximum acceleration (0.43 m/s
2). The SAFD generated for a total of 400 bins is shown in
Figure 5.
The SAFD results show that, as observed in the time series operational data, the largest number of operation points are concentrated in the 2–3 m/s speed range. For acceleration, the distribution is relatively concentrated around 0 m/s2.
In the SAFD, each combination of speed and acceleration is defined as a state, and the transition frequency between states is counted based on the time series operational data. By normalizing the transition frequencies between states, a Markov transition matrix can be constructed.
Figure 6 presents a visualized graph of the Markov transition matrix generated.
The Markov transition matrix probabilistically determines the speed and acceleration at each time step. The speed and acceleration profiles determined in this way will be probabilistically similar to the reference operational data. However, as transitions are repeated, the generated operational data may deviate from the reference data, reducing similarity. Therefore, in this study, quantitative similarity was evaluated among the operational data generated based on the Markov transition matrix to determine the final operational data for use.
The similarity of operational data was evaluated based on maximum speed, average speed, root mean square (RMS) speed, maximum acceleration, and average acceleration. The comprehensive similarity evaluation index is defined as follows:
where
represents the similarity index and
denote the average speed of the newly generated operational data and the reference operational data, respectively.
,
,
,
,
,
,
,
represent the maximum speed of new operating data, the maximum speed of reference operating data, the speed of new operating data, the speed of reference operating data, the average acceleration of new operating data, the average acceleration of reference operating data, the maximum acceleration of new operating data, and the maximum acceleration of reference operating data, respectively. The operational data generated in this study was designed to probabilistically simulate various operation times while preserving the probabilistic characteristics and major operational patterns of the reference operational data (2000s). In addition,
, an index that can evaluate the similarity with the reference operational data, was added to quantitatively evaluate how well each generated operational data point reflects the key statistical characteristics of the original data. By using only data whose index is within the predefined threshold range in the study, it was ensured that the consistency of the periodic operational patterns of the reference operational data were statistically preserved in the newly generated operational data.
If the index (
) satisfies a predefined threshold range, it is saved as newly generated operational data.
Figure 7 is an example of newly generated operational data based on a Markov chain. Since the operational data is probabilistically generated based on the Markov chain matrix, the results vary with each execution. The speed values of the newly generated operational data form a discrete velocity profile as they follow the resolution of the Markov chain matrix bins. To obtain a more continuous speed profile, the number of bins in the Markov chain matrix can be increased.
In this study, 150 new operational datasets were generated, with 10 datasets for each of 15 different conditions. This study was conducted under quasi-static conditions, and, therefore, dynamic response factors related to passenger comfort were not considered. In future work, operational profiles will be developed based on actual ship operation scenarios by incorporating the frequency of rapid acceleration and deceleration, and subsequent analyses will be carried out under more realistic conditions. The data for each case in
Table 1 represents the aggregated values of the 10 newly generated operational datasets under each condition. The total duration of the operational data varies from 1000 to 8000 s, and the cycle similarity was set to range between a minimum of 0.5 and a maximum of 2.0, as shown in the table. Increasing the threshold limit of the similarity results in greater differences between the newly generated operational data and the reference operational data. Conversely, if the threshold limit is set too low, the two datasets become nearly identical, making it difficult to properly reflect variations in operational load.
The RMS speed of the operational profile is related to the propulsion power required for ship operation. As shown in
Table 1, it varies within the minimum and maximum range, reflecting fluctuations in operational demanded power. The RMS speed of the reference operational data is 4.6579. It was observed that cases with a smaller
range (when the difference between Max
and Min
is small, e.g., cases 3, 6, 9, 12, and 15) show an RMS speed range closer to that of the reference operational data compared to cases with a larger
range (e.g., cases 1, 4, 7, 10, and 13). This confirms that the operational similarity index in this study was appropriately set.
5. Control Strategy Validation
The stochastic power control strategy developed in this study can achieve consistently high performance even under variable load conditions. To validate this, performance was evaluated using a new operational dataset that was not included in the training data from the KT River. The new operational dataset follows the speed profile shown in
Figure 17. For relative comparison with the stochastic power control strategy, the commonly used load leveling technique was applied to the same operational profile. Based on the previous dynamic programming (DP) results, the load leveling method sets
to −0.5 for demanded powers exceeding 100 kW to increase the engine’s power rating. For demanded power below 100 kW, electric propulsion is considered more efficient, so
is set to 1, allowing operation in electric mode. Additionally, to maintain the SOC, the SOC correction factor is applied in the same way as the stochastic power control strategy.
Table 3 shows the global optimal performance results for new operational data obtained through dynamic programming, the stochastic power control strategy, and the load leveling method. Although the global optimal performance cannot be practically implemented as a real-time controller, it represents the best possible performance assuming full knowledge of the entire operational dataset, making it a suitable reference for relative comparison. From this perspective, both the stochastic power control strategy and the load leveling method show 14–20% higher total fuel consumption compared to the optimal solution.
The battery SOC at the final termination point is different depending on each control strategy and the SOC correction factor setting. When the final battery SOC differs, a direct comparison of fuel consumption becomes difficult. Therefore, as shown in
Figure 18, the trend of total fuel consumption was analyzed based on battery SOC. By comparing the total fuel consumption with respect to the final SOC, the performance difference between the stochastic control strategy and the load leveling method can be identified. The stochastic control strategy shows about a 3% reduction in fuel consumption compared to the load leveling method. In the case of large-scale systems such as hybrid electric propulsion ships operating over long durations and distances, even a 3% improvement in propulsion efficiency can lead to an annual reduction of tens to hundreds of tons of fuel consumption, accompanied by a corresponding decrease in CO
2 and NOx [
7]. These findings indicate that even a 3% efficiency improvement yields substantial operational and environmental benefits and serves as a quantitative basis supporting the commercial viability of eco-friendly ship technologies, in alignment with the IMO’s decarbonization objectives [
6].
The stochastic power control strategy results in approximately 14% higher fuel consumption compared to the maximum performance achieved by dynamic programming. However, it demonstrates better performance than the load leveling method. Additionally, even when variations occur in the operational data used for training, the strategy remains meaningful and applicable as a power control approach.
The performance results were analyzed from the perspective of power distribution between energy sources.
Figure 19 presents the comparison of engine power (
) and electric propulsion power (
) relative to the ship demanded power (
) under both the load leveling control strategy and the stochastic control strategy. Interestingly, in the stochastic control strategy, both engine power and electric charging power are used at a higher level compared to the load leveling control strategy. In contrast, in the case of the load leveling control strategy, electric propulsion is only utilized at powers below 100 kW, so the engine power does not change significantly. The reason why total fuel consumption can be reduced despite increased engine and motor power is shown in
Figure 20. Compared to the results of the load leveling control strategy represented by the red points, the operating points of the stochastic control strategy represented by the green points can be seen to be closer to the efficient operating ranges of the engine and motor. In the engine map, the optimal efficiency region is indicated in blue, while in the motor map, it is represented in yellow. As a result, the stochastic control strategy achieves higher efficiency than the load leveling control strategy. However, since the stochastic control strategy establishes control solutions based on a probabilistic model, it may show slight differences in each operational simulation.
6. Conclusions
This study proposed a stochastic model-based power control algorithm for hybrid electric propulsion ships, designed to maintain high efficiency under variable operational conditions. The proposed power control algorithm implements optimal power control solutions as a stochastic control map, enabling probabilistic control decisions based on operational requirements. To implement this stochastic control strategy, Markov chain-based models were used to generate diverse operational datasets with variations, derived from the reference operational data. By training on these varied operational datasets, the proposed method maintains consistently high performance, even when deviations occur from the reference operational conditions. The existing deterministic power control strategy can obtain the optimal control solution through the global optimization technique when the operating data is accurately identified. However, in reality, such prior information is often unavailable, and performance degradation can occur due to environmental fluctuations and load variations during operation. By comparing the stochastic control strategy proposed in this study with the deterministic load leveling method, it was observed that in non-reference operational conditions, the stochastic control strategy reduced total fuel consumption.
A potential drawback of the stochastic control strategy is its tendency to cause frequent power load fluctuations. To address this, a memory factor was added to prevent frequent power load fluctuations. In addition, the SOC correction factor was applied to satisfy the battery SOC recovery requirement at the end of the operation. The SOC correction factor can be selected by the user according to the requirements at the end of the operation, rather than proposing a fixed optimal value.
The results of this study are suitable for application to ships with basic reference operation plans that may experience some fluctuations in demanded power. Although this study generated 15 new operational datasets and trained the control algorithm on a total of 150 datasets, future research could incorporate a larger dataset to derive an optimal solution and further improve model training. This approach is expected to enhance probabilistic control performance even under larger variations in operational conditions. When the training dataset is expanded to thousands of operating cycles, the computation time required to obtain optimal control solutions for each cycle increases linearly. In this study, it took approximately 101.47 s to derive an optimal control solution for a single operating cycle using dynamic programming. Therefore, if the training dataset were expanded to several thousand cycles, the total offline computation time would proportionally increase to several hours. However, this process is performed only once during the control strategy development stage and does not affect real-time implementation. Once the stochastic control map is generated, the real-time controller only performs a simple table lookup and retrieval of control ratios, which requires minimal computational effort. Therefore, the proposed control strategy can be effectively implemented on embedded shipboard control systems without any real-time performance issues.
Future research will focus on applying the proposed stochastic control strategy to a real hybrid power system, introducing load variations to validate its real-time performance. Based on this study that assumed quasi-static conditions, a more realistic operating cycle that reflects the transient situation of the operating cycle will be constructed, and the stochastic control strategy based on this will be applied to a test-bed that simulates the propulsion structure of a hybrid electric ship. In this case, frequent switching between propulsion modes may lead to overloading or reduced reliability of powertrain components. Therefore, a control strategy that reflects such physical constraints should be established in parallel in the future. Through this, it is expected that the stability and effectiveness of the control strategy can be secured even in an actual operating environment. The stochastic control strategy development framework proposed in this study is expected to serve as a foundation for the advancement of next-generation power control algorithms in eco-friendly ship propulsion systems.