Robust Adaptive Control Strategy for a Bidirectional DC-DC Converter Based on Extremum Seeking and Sliding Mode Control

This paper presents a new control strategy that combines classical control and an optimization scheme to regulate the output voltage of the bidirectional converter under the presence of matched and mismatched disturbances. In detail, a control-oriented modeling method is presented first to capture the system dynamics in a common canonical form, allowing different disturbances to be considered. To estimate and compensate for unknown disturbances, an extended state observer (ESO)-based continuous sliding mode control is then proposed, which can guarantee high tracking precision, fast disturbance rejection, and chattering reduction. Next, an extremum seeking (ES)-based adaptive scheme is introduced to ensure system robustness as well as optimal control effort under different working scenarios. Finally, comparative simulations with classical proportional-integral-derivative (PID) control and constant switching gains are conducted to verify the effectiveness of the proposed adaptive control methodology through three case studies of load resistance variations, buck/boost mode switching, and input voltage variation.


Introduction
Bidirectional DC-DC converters play an important role as a regulator to achieve the desired output of voltage or current and stabilize the DC bus in numerous applications such as hybrid microgrid systems [1,2], hybrid electric vehicles [3], fuel cell hybrid systems [4], and photovoltaic or wind energy systems [5,6]. In the hybrid system, the bidirectional DC-DC converter (BDC) is used to transform power between energy storage sources (battery, supercapacitor, . . . ) and DC bus in two directions with a boost mode for delivering energy to load power demand and a buck mode for charging the regenerative energy back to storage sources. It can be said that the system qualification is significantly affected by this converter's characteristics. Despite being widely commercialized and utilized in a variety of systems, the BDC control has been receiving constant attention due to existing problems of voltage ripple, output tracking performance subject to time-varying system parametric uncertainties, and matched and mismatched disturbances. Consequently, developing a control strategy that can address these regards and enhance system performance still remains challenging.
Concerning the control of DC-DC converters, several exciting methodologies have been suggested for disturbances rejection, sustaining the output voltage regulation, and

•
The objectives of tracking accuracy, disturbance suppression, and chattering alleviation are overall achieved by the proposed ESO-based CSMC. • An ES-based adaptive scheme particularly designed for CSMC is proposed to adjust the switching gain, which can reduce the voltage ripple and the magnitude of the peaking phenomenon.

•
The stability of the proposed strategy for the whole system is theoretically proven by the Lyapunov approach in the influence of the matched and mismatched disturbances.

•
The validation is performed to verify the effectiveness and reliability of the proposed strategy under three case studies of load resistance variation, buck/boost mode switching, and input voltage variation.
The rest of this paper is organized as follows. Section 2 presents the nonlinear model of the bidirectional DC-DC converter. Section 3 introduces the proposed control algorithm including mismatched disturbance estimation by ESO, voltage tracking and disturbance rejection by CSMC, switching parameter adaptation by ES, and stability analysis. Furthermore, the system setup and simulation results are given in Section 4. Ultimately, a conclusion is given in Section 5.

Generalities of the Bidirectional Converter Circuit
Consider a typical bidirectional DC-DC converter and equivalent circuit of the two states whose schematic diagrams are shown in Figure 1. In this configuration, a voltage source S V and a DC bus are respectively located on the high-voltage side and low-voltage side of the converter. 1 v and 2 v are the voltage at the high and low sides of the converter, respectively. 1 R denotes the internal resistance of the voltage source, whereas, in the simplest case, 2 R is considered to be a load resistance. The inductor L is accompanied by a parasitic resistance, L R . dson R indicates In this configuration, a voltage source V S and a DC bus are respectively located on the high-voltage side and low-voltage side of the converter. v 1 and v 2 are the voltage at the high and low sides of the converter, respectively. R 1 denotes the internal resistance of the voltage source, whereas, in the simplest case, R 2 is considered to be a load resistance. The inductor L is accompanied by a parasitic resistance, R L . R dson indicates the MOSFET turn-on resistance. Capacitor C H and C L represent the input and output capacitor of the converter, respectively. Q 1 and Q 2 are two active switches that are directly controlled by two complementary gating signals Gate 1 and Gate 2 , separately. As a result, the inductor current i L , the input current i S , and the output current i 0 can fundamentally flow in both directions. Because of the complementary control signal, no matter what mode (buck or boost) the bidirectional converter is in, there are only two states, namely State 1 (Q 1 ON-Q 2 OFF) and State 2 (Q 1 OFF-Q 2 ON). These states are described by equivalent circuits as shown in Figure 1b,c. The governing equation sets for these states can be expressed as: State 2 : where R eq = R dson + R L . Since the two active switches Q 1 and Q 2 are controlled complementarily, there should be a single controller to operate the bidirectional converter. Hence, two governing equation sets (1) and (2) could be fused by introducing the duty cycle variable µ ∈ [0, 1]. This action results in the following average modeling equation set.
However, it is expected that in practice the target system might experience several disturbances, namely resistive load variations, unmodeled terms, or parameter uncertainties. Consequently, the average modeling equation set (3) can be modified as: where µ and R 2 denote the nominal duty cycle and nominal load resistance, respectively, ∆µ represents the duty cycle perturbation, and R 2 is the actual load resistance.

Control-Oriented Modeling
Let x 1 = e = LC L (v 2 − V r ) be the first system state, where V r denotes the desired output voltage. The system model can be rewritten as follows. .
where x 2 = Li L − L/ R 2 v 2 represents the second system state and d 1 = −L 1/R 2 − 1/ R 2 v 2 + ϕ 1 denotes the first disturbance of the system with ϕ 1 being the first unmodeled term.
where ϕ 2 denotes the second unmodeled term. In the above equation set, f could be interpreted as the system dynamics, d 2 represents the second disturbance of the system, and u represents the virtual control input, from which the actual command duty cycle can be calculated as: Combining (5)-(7), one can obtain the following canonical form, which is readily available for nonlinear control algorithm design: .
Consequently, the problem of voltage tracking control for a bidirectional DC-DC converter now is transformed into the problem of stabilizing a second-order nonlinear system. The canonical format (9) is more suitable for nonlinear control algorithm applications than the original governing Equation (4). Moreover, in this model, different disturbances have been considered and isolated from the system dynamics such that they form only two well-known types of disturbances. Specifically, the disturbance d 1 enters the system in a different channel with the control input u. As a result, d 1 could be interpreted as a mismatched disturbance, whereas d 2 represents a matched one. Remark 1. The control-oriented modeling presented in this section particularly addresses the problem of voltage regulation on the low-voltage side of the bidirectional DC-DC converter. For other tasks including voltage regulation on the high-voltage side or current control, the modeling must be tailored individually for each task. This fact, on one hand, represents a unique characteristic of the control-oriented modeling method, on the other hand, might be its limitation.

Control Algorithm Design
To regulate the low-voltage side v 2 of the bidirectional DC-DC converter whose dynamics model is captured in (4), a model-control scheme is proposed in Figure 2. Figure 2 illustrates the proposed control algorithm for the converter, where different control problems are addressed as follows: • Control-oriented modeling: transforming the average model (4)  must be tailored individually for each task. This fact, on one hand, represents a unique characteristic of the control-oriented modeling method, on the other hand, might be its limitation.

Control Algorithm Design
To regulate the low-voltage side 2 v of the bidirectional DC-DC converter whose dynamics model is captured in (4), a model-control scheme is proposed in Figure 2.  Consider the second-order nonlinear system (9), which is subject to both matched and mismatched disturbances. Assumption 1. Both mismatched and matched disturbances are differentiable and bounded satisfying: where D i > 0, D i > 0 are two unknown positive constants. To achieve the objective of stabilizing the nonlinear system (9) in the adverse influence of disturbances, an ESO is employed to precisely estimate the mismatched disturbance in real time. Subsequently, the estimation result is utilized in a CSMC design to suppress the negative effect of disturbances, thus enhancing the disturbance rejection capability and stabilizing the performance of the control system. Meanwhile, the matched disturbance is left for the robustness of the CSMC to handle.

Extended State Observer for Mismatched Disturbance Estimation
Consider a linear extended state observer addressing mismatched disturbance of the following form.
wherex 1 andd 1 denote the estimation of the first state x 1 and the mismatched disturbance d 1 , respectively; α 1 and α 2 are the observer parameters with positive constants; ρ is an arbitrarily small positive constant. Let the estimation error of the observer (11) be: The following results can be then obtained: Subsequently, combining (13) and (14) yields the following estimation error system. (15) where A = −α 1 1 −α 2 0 and B = 0 1 .
The characteristic equation of matrix A can be expressed as follows.
From (16), if the design parameters α 1 and α 2 are properly selected such that the matrix A is Hurwitz, then for any given symmetric positive definite matrix Q, there exists a unique symmetric positive definite matrix P that satisfies the following Lyapunov equation [29]: A Lyapunov candidate is considered as Define: and suppose that the mismatched disturbance satisfies Taking the derivative of the above Lyapunov function yields where the last equality is followed because the quantities ρ B Using (22) and the Cauchy-Schwarz inequality [30], the Lyapunov function derivative (21) becomes where the Rayleigh principle has been applied such that Therefore, if condition (20) is satisfied, the Lyapunov derivative (23) is always negative, and thus, the bounded stability of the observer is guaranteed. Consequently, the estimation errors will converge to a neighborhood of the origin.

Remark 2.
Concerning condition (20), it is advised to select Q such that it maximizes (19) [29]. More transparently, it will certainly be beneficial to design ρ 1 to be a sufficiently small number for higher robustness. However, a smaller ρ causes a larger peaking phenomenon if the initial condition of the first state and that of the first state estimation are different, or x 1 (0) =x 1 (0). In that case, ρ could be designed as follows.
where R, r, and t max are positive tuning numbers. The max( * ) function prevents dividing to zero.

Continuous Sliding Mode Control Design
Inspired by the work on matched disturbance rejection [30], this section introduces a continuous asymptotic sliding mode control to overcome the effects of both matched and mismatched disturbances and ultimately stabilize the system (9). The idea is to design a sliding mode surface so that as the sliding mode occurs, the system would follow the desired dynamic. Moreover, a sliding mode control law is designed in terms of the control function derivative. Hence, the actual control is continuous and chattering-reduced because of the integration of the high-frequency switching function. Consider the nonlinear system (9) and let v(t) be the derivative of the control input u(t) such that Denote respectively the prime and auxiliary sliding mode variables of the following forms where c, c > 0 are design positive constants, andd 1 is the estimated mismatched disturbance, which is purposedly added into the sliding mode surface to suppress the influence of mismatched disturbance. Taking the derivative of the auxiliary sliding mode variable (29) and using (9), (27), Based on (30), to compensate for the system dynamics and effects of disturbances, a high-frequency switching control v(t) can be designed as where k 0 > 0 is a design positive constant, η is designed later. By substituting (31) into (30), the derivative of the auxiliary sliding mode variable now becomes where e d In (34), Assumption 1 on matched disturbance and the results of the stability analysis on mismatched disturbance estimation errors in the previous section have been utilized. Specifically, the estimation error and its derivative are implied to be bounded satisfying Consider the case where the switching parameter η is sufficiently selected such that One could obtain the following result by substituting (36) into (34) and using (33).
Separating the variables and integrating both sides of (37) over the period 0 ≤ τ ≤ t yield As a result, V(t) would reach the equilibrium in a finite time t r satisfying Consequently, s(t) will converge to zero in a finite time. As the sliding mode, s(t) = 0, occurs, the following results could be obtained from (29) Thus, the prime sliding mode variable σ(t) will asymptotically converge to zero. As this happens, from (28), the system dynamics now becomes Here, the asymptotic convergence of the mismatched disturbance observer on the right-hand side of the equation leads to the asymptotic convergence of the first system state x 1 (t) on the left-hand side as well. Therefore, the overall control objective is achieved.

Adaptive Scheme by Extremum Seeking
The switching parameter η in (36) plays a crucial role in control performance. Specifically, a large value of η would not only shorten the convergence time but also strengthen the system's robustness against disturbances, thus ensuring tracking accuracy. Nonetheless, there is a causal relationship between the switching parameter magnitude and the chatter-ing phenomenon observed in the control signal. In detail, a small variation in disturbance could be easily suppressed by selecting a sufficient value for the switching parameter. However, if the disturbance varies in a much wider range than expected, a consistently large switching parameter would be inefficient due to excessive chattering phenomena. In such cases, it is therefore important to adjust the switching parameter accordingly. Thus, in this section, we introduce an ES-based adaptive scheme for the switching parameter to achieve system stability, tracking accuracy, and chattering reduction as well as avoid unnecessarily large control efforts. Figure 3 illustrates the proposed switching parameter optimization by ES, whose stability analysis was conclusively proved in the literature [31][32][33]. Four sequential stages include:

1.
Demodulation: acquiring the gradient information by multiplying the calculated objective function J by another sinusoidal signal M(t) = a sin ωt with the same frequency ω (rad/s) as the modulation signal and a denotes the tuning parameter. An optional high-pass filter could be added to this stage to remove bias from the responded objective function.

2.
Parameter update: updating the switching parameterη by integrating the demodulated signal. This stage consists of a learning rate k, which determines convergence speed and accuracy. An optional low-pass filter could be added to this stage to filter out high-frequency noise from the demodulated signal.

3.
Modulation: perturbing the being-optimized switching parameter with a low-amplitude sinusoidal signal S(t) = b sin ωt with b as the modulation amplitude.

Adaptive Scheme by Extremum Seeking
The switching parameter  in (36) plays a crucial role in control performance. Specifically, a large value of  would not only shorten the convergence time but also strengthen the system's robustness against disturbances, thus ensuring tracking accuracy. Nonetheless, there is a causal relationship between the switching parameter magnitude and the chattering phenomenon observed in the control signal. In detail, a small variation in disturbance could be easily suppressed by selecting a sufficient value for the switching parameter. However, if the disturbance varies in a much wider range than expected, a consistently large switching parameter would be inefficient due to excessive chattering phenomena. In such cases, it is therefore important to adjust the switching parameter accordingly. Thus, in this section, we introduce an ES-based adaptive scheme for the switching parameter to achieve system stability, tracking accuracy, and chattering reduction as well as avoid unnecessarily large control efforts. Figure 3 illustrates the proposed switching parameter optimization by ES, whose stability analysis was conclusively proved in the literature [31][32][33]. Four sequential stages include: 1. Demodulation: acquiring the gradient information by multiplying the calculated objective function J by another sinusoidal signal t with the same frequency  (rad/s) as the modulation signal and a denotes the tuning parameter. An optional high-pass filter could be added to this stage to remove bias from the responded objective function. 2. Parameter update: updating the switching parameter ˆ by integrating the demodulated signal. This stage consists of a learning rate k, which determines convergence speed and accuracy. An optional low-pass filter could be added to this stage to filter out high-frequency noise from the demodulated signal. 3. Modulation: perturbing the being-optimized switching parameter with a lowamplitude sinusoidal signal   ( ) sin S t b t with b as the modulation amplitude.  An adaptive scheme for the switching parameter  could be intuitively designed based on the working principle of the sliding mode control algorithm as follows. Ideally, as the disturbance becomes large enough to influence the system,  should be sufficiently large to drive the sliding surfaces to the origin in a short amount of time.
However, once the sliding mode occurs, it is unnecessary to keep  at that same level but more effective to reduce its amplitude to a level just sufficient enough to eliminate the influence of the disturbance. This action is expected to not only ensure system stability, tracking precision, and fast convergence but also reduce control effort and severity of the chattering phenomenon. From this perspective and based on the above ES scheme, the adaptive law of the switching gain is designed as An adaptive scheme for the switching parameter η could be intuitively designed based on the working principle of the sliding mode control algorithm as follows. Ideally, as the disturbance becomes large enough to influence the system, η should be sufficiently large to drive the sliding surfaces to the origin in a short amount of time. However, once the sliding mode occurs, it is unnecessary to keep η at that same level but more effective to reduce its amplitude to a level just sufficient enough to eliminate the influence of the disturbance. This action is expected to not only ensure system stability, tracking precision, and fast convergence but also reduce control effort and severity of the chattering phenomenon. From this perspective and based on the above ES scheme, the adaptive law of the switching gain is designed as .η where S(t) = b sin ωt and M(t) = a sin ωt are the period perturbation signals, k is the learning rate, J describes the cost function, k 1 denotes the objective function gain, k 2 is the objective error gain, k 3 is the objective function sliding variable gain, e = x 1 is the tracking error, and s is the auxiliary sliding mode variable defined at (29).

Remark 3.
Fundamentally, the ES control algorithm might have an optional high-pass and low-pass filter in its design. However, (44) indicates that the optimal value of the objective function is J * = 0, thus a high-pass filter is not required in the scheme. Also, a low-pass filter could lead to delays in response and therefore should be refrained from use.
Based on the adaptive law (42), the high-frequency switching control v(t) (31) can be rewritten as ES control is a model-free optimization algorithm. Thus, its precision does not depend on the knowledge of the target system but rather lies on the objective function design and parameter selection. Together with the working principle of the sliding mode control, in (44), s should be weighed more than e, or k 3 > k 2 .
The forcing frequency ω and the amplitudes a, b of the demodulation and modulation signals, are selected by the trial-and-error method. However, it should be ensured that the time scale of the system dynamics should be considerably smaller than that of the forcing frequency. In some cases, a time-frequency response plot of the objective function could offer a glimpse of how to select an appropriate forcing frequency. Besides, the amplitude of the demodulation signal should be much larger than that of the modulation signal, or a b.

Stability Analysis
Theorem 1. Considering the control law (31) implemented with the ESO function in (11), and the adaptive law of the switching gain (42) guarantee the stability of the closed-loop system. Then, the estimated disturbance, positive control gains, and tracking errors are bounded.
Proof of Theorem 1. The Lyapunov function can be defined as We define η η − η * is the adaptive gain error with the desired optimal value η * . The time derivative of the Lyapunov function (46) is expressed as follows: Substituting (23), (32), and (42) into (47), and applying Young's inequality, one yields By multiplying e −c 0 t into (48), it can be obtained as The result of integrating (50) is yielded Based on References [34][35][36], the proposed control strategy is ultimately uniformly bounded in the presence of matched and mismatched disturbances. The tracking errors and estimation errors of disturbances are bounded. As the Lyapunov function (51), theorem 1 is proved.

System Setup
To examine the practicality of the proposed controller in a bidirectional DC-DC converter, several aspects were considered. First, the bidirectional DC-DC converter was built in the environment of MATLAB/Simulink with components from the Simscape Specialized Power Systems library instead of the numerical dynamic equations (4) to closely reflect its nature. The simulation is conducted with the solver ode4 (Runge-Kutta), and the fundamental sample time is 10 −6 s. System parameters are listed in Table 1. Second, it is assumed in this study that only the inductor current i L , the high-side voltage v 1 , and the low-side voltage v 2 sensing are available to be collected. Meanwhile, other voltage and current sensors are unavailable and only implemented for observation and comparison.
In this section, the simulation results are presented to demonstrate the practicality of the proposed control scheme in voltage regulation of a bidirectional DC-DC converter. Our major objective is not to propose a novel algorithm that would surpass existing algorithms in the literature but rather introduce and study a blend of classical control (SMC) and optimization scheme (ES), thus adding more flexibility to the vast variety of control algorithms targeting disturbance rejection control. For this purpose, comparisons are drawn between the PID control, the constant switching gains, and the proposed adaptive scheme. For the PID control, there are two control loops are implemented including the outer voltage control loop (PI controller 1) and the inner current control loop (PI controller 2). Excepting for switching gains of the adaptive scheme, the parameters of ESO, PID controllers, and SMC are chosen to be the same for all cases, shown in Table 2. Besides, the ES parameters of the proposed adaptive case are recorded in Table 3.

Case Study Results
To verify the effectiveness of the proposed control algorithm on the bidirectional DC-DC converter, three typical study cases including load resistance variations, buck/boost mode switching, and input voltage variation are conducted. Table 2. Parameters of the observer and controllers for the comparative study. Two types of resistance variations including a subtle change and two drastic changes are considered to examine the control performance as follows:

Observer/Controllers Parameter
With the reference output voltage deliberately kept at V r = 12 V, a resistance step from 100 Ω to 50 Ω would create a subtle change in the load current, from i o = 0.12 A to i o = 0.24 A. Meanwhile, resistance steps from 50 Ω to 2.5 Ω and from 2.5 Ω to 75 Ω would create a drastic change, from i o = 0.24 A to i o = 4.8 A, and from i o = 4.8 A to i o = 0.16 A in the load current. The load resistance variation is shown in Figure 4.  A comparison is drawn between the PID control, the constant switching gains, and the proposed ES-based adaptive scheme. The two constant switching gain candidates were decided based on the highest and lowest switching gains observed in the proposed adaptive scheme case. These gains are selected with η = 5450 as the lowest gain and η = 9900 as the highest gain and are used under the same testing conditions for comparisons. Figure 5 illustrates the voltage-tracking performance of the proposed ES-based adaptive scheme compared to the PID control and the other two constant switching gains. In Figure 5a, the voltage-tracking performances of four strategies are presented, while the corresponding comparative error efforts are described in Figure 5b A comparison is drawn between the PID control, the constant switching gains, and the proposed ES-based adaptive scheme. The two constant switching gain candidates were decided based on the highest and lowest switching gains observed in the proposed adaptive scheme case. These gains are selected with η = 5450 as the lowest gain and η = 9900 as the highest gain and are used under the same testing conditions for comparisons. Figure 5 illustrates the voltage-tracking performance of the proposed ES-based adaptive scheme compared to the PID control and the other two constant switching gains. In Figure 5a, the voltage-tracking performances of four strategies are presented, while the corresponding comparative error efforts are described in Figure 5b. These figures are noticeable for all cases at t = 0.1 s, the subtle change in load resistance from 100 Ω to 50 Ω causes no abnormal phenomenon in the tracking performance. Meanwhile, at t = 0.2 s and t = 0.3 s, the effects caused by the drastic changes in load resistance, from 50 Ω to 2.5 Ω with the load current increases 20 times and from 2.5 Ω to 75 Ω with the load current reduces 30 times, are profound. Specifically, at t = 0.2 s, the PID control takes a magnitude of peaking phenomenon of approximately 2 V, while two constant switching gains have a smaller peak magnitude of 1 V, and the proposed scheme achieves the lowest voltage peaking with a magnitude of 0.9 V. Similarly, at the transient period of t = 0.3 s, the peaking voltage by using PID control reaches to 2.3 V compared to the desired voltage, while the smaller one is 1.2 V from using η = 5450 and the best voltage regulation obtains an overshoot voltage of 1 V by the proposed adaptive scheme and η = 9900. Such differences indicate that only drastic load resistance variations have a negative impact on the control robustness. As can be seen, different working conditions are likely to require different values of the switching gains that can maintain the system's performance. The inductor currents of four control strategies are presented in Figure 6. Overall, the proposed strategy improves inductor current performance while reducing current ripple under steady-state working conditions. Meanwhile, when using the PID control, the peaking current value at = 0.2 s t is 6 A, which is greater than 1 A in comparison to other strategies. When the load is drastically changed at = 0.3 s t , the inductor current adaptation of PID control also performs poorly because the magnitude increases to −2 A, whereas the inductor current of the proposed strategy and two constant switching gains achieves roughly −1 A. The inductor currents of four control strategies are presented in Figure 6. Overall, the proposed strategy improves inductor current performance while reducing current ripple under steady-state working conditions. Meanwhile, when using the PID control, the peaking current value at t = 0.2 s is 6 A, which is greater than 1 A in comparison to other strategies. When the load is drastically changed at t = 0.3 s, the inductor current adaptation of PID control also performs poorly because the magnitude increases to −2 A, whereas the inductor current of the proposed strategy and two constant switching gains achieves roughly −1 A.
High precision estimation of the ESO is presented in Figure 7 in which Figure 7a shows the estimated state, while the estimated mismatched disturbance is shown in Figure 7b. Specifically, it can be seen that the estimated statex 1 and the estimated mismatched disturbanced 1 can track the actual one in a steady-state period and the drastic load resistance changes at t = 0.2 s and t = 0.3 s. The proposed observer ensures that the asymptotic converges to the small estimation error approximately of ±5 × 10 −10 and ±2 × 10 −5 for state and mismatched disturbance, respectively.
proposed strategy improves inductor current performance while reducing current ripple under steady-state working conditions. Meanwhile, when using the PID control, the peaking current value at = 0.2 s t is 6 A, which is greater than 1 A in comparison to other strategies. When the load is drastically changed at = 0.3 s t , the inductor current adaptation of PID control also performs poorly because the magnitude increases to −2 A, whereas the inductor current of the proposed strategy and two constant switching gains achieves roughly −1 A. High precision estimation of the ESO is presented in Figure 7 in which Figure 7a shows the estimated state, while the estimated mismatched disturbance is shown in Figure 7b. Specifically, it can be seen that the estimated state 1 x and the estimated Trajectories of the prime and auxiliary sliding mode variables, σ and s, of the proposed controller are described in Figure 8a,b, respectively. The obtained results illustrated that the σ and s variables have asymptotic convergence to zero in finite time.
Since s is partly made of the derivative of σ and σ itself (29), any noise-corrupted signal from system states 1,2 x and mismatched disturbance estimation 1 d could affect σ , as in (28)  Trajectories of the prime and auxiliary sliding mode variables, σ and s, of the proposed controller are described in Figure 8a,b, respectively. The obtained results illustrated that the σ and s variables have asymptotic convergence to zero in finite time. Since s is partly made of the derivative of σ and σ itself (29), any noise-corrupted signal from system states x 1,2 and mismatched disturbance estimationd 1 could affect σ, as in (28), and eventually get magnified in s. The chattering in the trajectory of s in Figure 8b has demonstrated the problem and implied that the SMC algorithm is sensitive to noises.

Buck/Boost Mode Switching
In this case study, a controlled current source is used to replace the load resistor on the low-voltage side of a bidirectional DC/DC converter. The current generated by this current source varies between negative and positive values, indicating buck and boost mode, respectively. The main simulation results are presented in Figures 10-15, which show similar patterns to the load resistance variations case, especially in mode switching conditions at = 0.1 s, 0.2 s t and 0.3 s . The trajectory of the current source demand is described in Figure 10.

Buck/Boost Mode Switching
In this case study, a controlled current source is used to replace the load resistor on the low-voltage side of a bidirectional DC/DC converter. The current generated by this current source varies between negative and positive values, indicating buck and boost mode, respectively. The main simulation results are presented in Figures 10-15, which show similar patterns to the load resistance variations case, especially in mode switching conditions at t = 0.1 s, 0.2 s and 0.3 s. The trajectory of the current source demand is described in Figure 10.

Buck/Boost Mode Switching
In this case study, a controlled current source is used to replace the load resistor on the low-voltage side of a bidirectional DC/DC converter. The current generated by this current source varies between negative and positive values, indicating buck and boost mode, respectively. The main simulation results are presented in Figures 10-15, which show similar patterns to the load resistance variations case, especially in mode switching conditions at = 0.1 s, 0.2 s t and 0.3 s . The trajectory of the current source demand is described in Figure 10.    In this section, a comparison is conducted to confirm the efficacy of the proposed ESbased adaptive scheme for regulating the high-side and low-side voltages of the converter under the switching conditions of buck and boost mode. The PID control and two constant switching gains are also considered for this comparison. Similarly to the previous section, two constant switching gains are chosen for comparison, with η = 7280 serving the lowest value and η = 15150 serving the highest value. Figure 11 presents the comparison of the low-voltage side response under buck/boost mode switching between the proposed ES-based adaptive scheme compared to the PID control and the other two constant switching gains. The voltage-tracking performances of all strategies are described in Figure 11a, while Figure 11b depicts the corresponding comparative tracking error. It can be seen that the primary target at the low side is to maintain the desired voltage at 12 V. Thus, it is necessary to reduce the voltage chattering and peaking magnitude when switching modes occur to guarantee the voltage quality. Specifically, during the steady-state period of buck and boost mode, the low-voltage side of all control strategies achieves good tracking performance, allowing the obtained voltage to remain constant at 12 V. Meanwhile, the low-voltage side of PID control has the highest peaking magnitude of approximately 3 V when the current source switches from buck mode to boost mode at = 0. In this section, a comparison is conducted to confirm the efficacy of the proposed ESbased adaptive scheme for regulating the high-side and low-side voltages of the converter under the switching conditions of buck and boost mode. The PID control and two constant switching gains are also considered for this comparison. Similarly to the previous section, two constant switching gains are chosen for comparison, with η = 7280 serving the lowest value and η = 15, 150 serving the highest value. Figure 11 presents the comparison of the low-voltage side response under buck/boost mode switching between the proposed ES-based adaptive scheme compared to the PID control and the other two constant switching gains. The voltage-tracking performances of all strategies are described in Figure 11a, while Figure 11b depicts the corresponding comparative tracking error. It can be seen that the primary target at the low side is to maintain the desired voltage at 12 V. Thus, it is necessary to reduce the voltage chattering and peaking magnitude when switching modes occur to guarantee the voltage quality. Specifically, during the steady-state period of buck and boost mode, the low-voltage side of all control strategies achieves good tracking performance, allowing the obtained voltage to remain constant at 12 V. Meanwhile, the low-voltage side of PID control has the highest peaking magnitude of approximately 3 V when the current source switches from buck mode to boost mode at t = 0.1 s with the current change from −2 A to 4 A, while the constant switching gain η = 7280 takes a smaller peaking magnitude of 2 V, and the proposed scheme and other constant switching gain η = 15, 150 achieve the smallest peaking value of 1.8 V. For the working mode changes from boost to buck at t = 0.2 s, the proposed adaptive scheme achieves the greatest voltage regulation with the peaking value about −1.2 V, while these peaking magnitudes of other strategies are −1.3 V, −1.5 V, and −2.5 V for the PID control, constant switching gains η = 15, 150, and η = 7280, respectively. Similarly, when switching from buck to boost mode at t = 0.3 s with a current change from −1 A to 2 A, the peaking voltage is 1.5 V when using PID control, while the smaller values are 0.8 V and 0.7 V when using η = 7280 and proposed adaptive scheme, respectively. The use of adaptive gain η = 15, 150 achieves the highest voltage response with a peaking magnitude of 0.65 V. The obtained results demonstrate the necessity of adjusting the switching gain in order to decrease peaking phenomena and improve the tracking accuracy for bidirectional DC-DC converter voltage regulation.
For the high-voltage side response, the voltage variations under the buck/boost mode switching of four control strategies are shown in Figure 12 in which Figure 12a Figure 13 describes the inductor current response by using the proposed adaptive scheme and other strategies. It can be seen that the proposed adaptive strategy achieves a better current ripple than other strategies during the steady-state of buck or boost mode working conditions. Meanwhile, when the converter switches between buck and boost modes at t = 0.1 s, 0.2 s, and 0.3 s, the PID control demonstrates the highest peaking current values of −7 A, 3 A, and −3.5 A, respectively. On the other hand, the inductor current of the proposed strategy and two constant switching gains have roughly equal values of −5 A, 2 A, and −3 A at t = 0.1 s, 0.2 s, and 0.3 s, respectively. Figure 14a,b describe the estimated results of the state and mismatched disturbance using the ESO, respectively. In particular, the estimated statex 1 shows the high tracking accuracy of the real state x 1 during the steady-state period and when the converter switches between buck and boost modes at t = 0.1 s, 0.2 s, and 0.3 s as shown in Figure 14a. As revealed in Figure 14b, the proposed observer ensures the estimated error of the mismatched disturbance converges to zero under the steady-state working conditions. However, there is still a small estimation error of this mismatched disturbance when mode switching occurs at t = 0.1 s, 0.2 s, and 0.3 s as shown in Figure 14b.
The comparison of switching gain adaptation and the duty cycle command is presented in Figure 15a,b, respectively. As shown in Figure 15a, the proposed ES-based adaptation allows for the adjustment of switching gain η to ensure the tracking performance and chattering reduction of the high-side and low-side voltages based on the switching modes of the bidirectional DC/DC converter. As a result, the highest and lowest gains are obtained with η = 15, 150 and η = 7280, respectively. In Figure 15b, the duty cycle command of the PID control and two constant switching gains have the same level during the steady-state working condition. Meanwhile, the proposed strategy produces a smaller duty cycle magnitude than others. When the converter switches the operation modes at t = 0.1 s, 0.2 s, and 0.3 s, the duty cycle of four control strategies exhibits the same value of peaking magnitude. Thus, it can be seen that the flexibility of adaptive gains can make sure chattering reductions while also improving the voltage tracking precision.

Input Voltage Variation
For this test, a sinusoidal power supply of V S = 24 + 4 sin(20πt) is applied to the high-voltage side of the DC/DC converter. The trajectory of this input voltage is described in Figure 16.      Figure 17 presents the output voltage response of the proposed adaptive scheme, the PID control, and the other two constant switching gains for regulating the output voltage to achieve the desired value at 12 V. It is seen from Figure 17a, the proposed adaptive strategy maintains the output voltage with the least amount of ripple. Meanwhile, constant switching gains show a higher ripple magnitude than the adaptive scheme, and the PID control has the largest output voltage ripple, which affects the output voltage quality of the DC/DC converter. In Figure 17b, the comparative error efforts of four strategies are described. As can be seen, the proposed control strategy achieves the highest tracking accuracy of desired output voltage with the error approximated at ±0.1 V. The The inductor currents of the proposed strategy and others are described in Figure 18. It can be observed that the inductor current shows higher ripple amplitude than other strategies when using the PID controller and constant switching gain η = 13510 with the current fluctuating from 0.7 A to 2 A. Meanwhile, the proposed strategy and constant switching gain η = 18350 achieve the improvement of inductor current ripple in the range of (0.9-1.6) A under conditions of input voltage variation.
The estimation results of the 1 x state and the mismatched disturbance are shown in Figure 19. The actual values of state and disturbance can be estimated with high precision of estimated state 1 x and the estimated mismatched disturbance 1 d , see  Figure 17 presents the output voltage response of the proposed adaptive scheme, the PID control, and the other two constant switching gains for regulating the output voltage to achieve the desired value at 12 V. It is seen from Figure 17a, the proposed adaptive strategy maintains the output voltage with the least amount of ripple. Meanwhile, constant switching gains show a higher ripple magnitude than the adaptive scheme, and the PID control has the largest output voltage ripple, which affects the output voltage quality of the DC/DC converter. In Figure 17b, the comparative error efforts of four strategies are described. As can be seen, the proposed control strategy achieves the highest tracking accuracy of desired output voltage with the error approximated at ±0.1 V. The The inductor currents of the proposed strategy and others are described in Figure 18. It can be observed that the inductor current shows higher ripple amplitude than other strategies when using the PID controller and constant switching gain η = 13510 with the current fluctuating from 0.7 A to 2 A. Meanwhile, the proposed strategy and constant switching gain η = 18350 achieve the improvement of inductor current ripple in the range of (0.9-1.6) A under conditions of input voltage variation.
The estimation results of the 1 x state and the mismatched disturbance are shown in Figure 19. The actual values of state and disturbance can be estimated with high precision of estimated state 1 x and the estimated mismatched disturbance 1 d , see  Figure 17 presents the output voltage response of the proposed adaptive scheme, the PID control, and the other two constant switching gains for regulating the output voltage to achieve the desired value at 12 V. It is seen from Figure 17a, the proposed adaptive strategy maintains the output voltage with the least amount of ripple. Meanwhile, constant switching gains show a higher ripple magnitude than the adaptive scheme, and the PID control has the largest output voltage ripple, which affects the output voltage quality of the DC/DC converter. In Figure 17b, the comparative error efforts of four strategies are described. As can be seen, the proposed control strategy achieves the highest tracking accuracy of desired output voltage with the error approximated at ±0.1 V. The PID controller, on the other hand, takes the least tracking accuracy with the error in the range of ±0.5 V, while two constant switching gains obtain greater output voltage adaptation with the tracking error in the range of ±0.2 V and ±0.25 V for η = 18, 350 and η = 13, 510, respectively.
The inductor currents of the proposed strategy and others are described in Figure 18. It can be observed that the inductor current shows higher ripple amplitude than other strategies when using the PID controller and constant switching gain η = 13, 510 with the current fluctuating from 0.7 A to 2 A. Meanwhile, the proposed strategy and constant switching gain η = 18, 350 achieve the improvement of inductor current ripple in the range of (0.9-1.6) A under conditions of input voltage variation.
The estimation results of the x 1 state and the mismatched disturbance are shown in Figure 19. The actual values of state and disturbance can be estimated with high precision of estimated statex 1 and the estimated mismatched disturbanced 1 , see Figure 19a,b. The proposed observer can achieve the magnitude of state and mismatched disturbance in the range of (−1.4 → 0.75) × 10 −7 and (−4.4 → 3.7) × 10 −3 , respectively. It can also guarantee that the estimated error converges to zero under the conditions of input voltage variation. Figure 20a,b present, respectively, the switching gain adaptation for the proposed control strategy and the duty cycle command to control the DC/DC converter. It is seen from Figure 20a, the proposed adaptive scheme can adjust the switching gain η in the range of (13,350), which ensures the output voltage tracking performance and ripple reduction of the inductor current and the output voltage. For the duty cycle command as described in Figure 20b, the proposed strategy shows a smaller ripple value than others when the sinusoidal input voltage changes in the range of (24-28) V. Besides, it also achieves the narrowest duty cycle magnitude when the input voltage in the range of (20-24) V.

Conclusions
This paper presented a voltage regulation scheme that combined classical control and optimization for a bidirectional DC-DC converter and it was susceptible to both matched and mismatched disturbances. The theoretical analysis and simulation results explored several pros and cons of the proposed algorithm. First, the control-oriented modeling method captured the dynamics of the converter, allowing different disturbances to be considered and isolated into two common types of disturbances, namely matched and mismatched disturbances. Next, inspired by the work on matched disturbance rejection, an ESO-based CSMC algorithm addressing both mismatched and matched disturbances was proposed, backed by stability analyses, and verified in simulations. With proper parameter selection, the primary objective of tracking control in the presence of disturbance variations and noisy measurement was overall achieved. Nonetheless, the proposed controller was still sensitive to noises and could not completely eliminate the chattering effect. Then, an ES-based adaptive scheme for the switching parameter of SMC was introduced to adapt to different working conditions. The simulated results clearly demonstrated that the proposed strategy performed better than the other three strategies in stabilizing voltage tracking accuracy under the presence of load resistance variations, buck/boost mode switching, and input voltage variation of a bidirectional DC/DC converter. Even though the proposed scheme showed great potential achievements, it could not realize the no-transition performance in voltage controlling the converter, which possessed a fast dynamic response and noise-affected signals. Moreover, the application to voltage regulation of a bidirectional DC-DC converter has revealed the strength and weaknesses of the proposed control algorithm in practice. However, it could ensure tracking precision and chattering reduction in steady-state under different working conditions. Additionally, the presented strategy still has a limitation because verifying the proposed algorithm in actual experiments has not yet been performed. The aforementioned shortcomings open up various possibilities for improvement in both system modeling and control algorithm design in future research such as implementation in a real hardware system, improvement of the adaptive gains response, development of advanced control strategy to system performance, and so on.