Data-Driven Characteristic Prediction and Output Optimization for Wireless Power Transfer Systems

Yang, Shengtao; Lian, Jing

doi:10.3390/electronics15122586

Open AccessArticle

Data-Driven Characteristic Prediction and Output Optimization for Wireless Power Transfer Systems

by

Shengtao Yang

and

Jing Lian

^*

School of Automation, Nanjing University of Information Science and Technology, Nanjing 210044, China

^*

Author to whom correspondence should be addressed.

Electronics 2026, 15(12), 2586; https://doi.org/10.3390/electronics15122586

Submission received: 23 April 2026 / Revised: 9 June 2026 / Accepted: 10 June 2026 / Published: 11 June 2026

Download

Browse Figures

Versions Notes

Abstract

Constant current/voltage (CC/CV) output of wireless power transfer (WPT) systems deviates due to increased load resistance during charging and mutual inductance variations caused by misalignment. Dynamically regulating the DC input voltage can maintain a stable output at the preset value, and predicting the mutual inductance and load resistance can help monitor charging status. However, joint prediction of characteristics and regulation degree can be nonlinear and complicated. This work proposes a data-driven method for characteristic prediction and output optimization for WPT systems based on the current waveform from only the transmitter side. A Multi-Scale Parallel Convolutional (MSPC) neural network is applied to simultaneously predict the load resistance, mutual inductance, output deviation factor and regulation coefficient. By leveraging its multi-scale feature extraction capabilities, it can accurately estimate the aforementioned parameters based on only the AC current waveform at the transmitter side. To improve the model’s generalizability under practical conditions, transfer learning (TL) is utilized to minimize the discrepancy between simulated and physical data. Finally, a 140 W prototype of the series-series (SS)-compensated WPT system is built to validate the effectiveness of the proposed method.

Keywords:

wireless power transfer; characteristics predictions; output optimization; deep learning; transfer learning

1. Introduction

Wireless power transfer (WPT) technology has emerged as a transformative solution to eliminate the constraints of physical connectors, revolutionizing power delivery across industries by offering unparalleled convenience, safety, and flexibility. Its applications span electric transportation [1,2], portable electronics [3,4], medical implants [5,6,7], etc., where reliable and contactless power transfer is critical.

Constant-current (CC) and constant-voltage (CV) charging modes [8,9] of WPT systems are widely used in plenty of scenarios. Previous studies have achieved load-independent CC or CV output with input zero-phase angle (ZPA) by incorporating compensation networks operating at natural resonance frequencies, such as series–series (SS), parallel–parallel (PP), series–parallel (SP), parallel–series (PS) [10], and higher-order compensation networks, such as LCC-S, LCC–LCC, LCL-S, and S-LCC [11,12].

However, the properties of such topologies are derived from the fundamental harmonic approximation (FHA) method, which neglects the higher-order harmonic components of the square-wave voltage generated by DC/AC inverters. Thus, the output DC current in CC mode exhibits deviation under varying load conditions. The effective resistance of the battery increases as the charging process progresses. When the load resistance rises, the output current in CC mode will attenuate due to heavy load, as shown in Figure 1. This current drop entails consequences beyond prolonged charging time. Since the accuracy of the Coulomb Counting Method (

Δ Q = \int I (t) d t

) heavily relies on a precise current profile [13,14], this attenuation will introduce significant State of Charge (SOC) estimation errors in the Battery Management System (BMS) if not taken into account. And it triggers premature CC-to-CV mode transitions, preventing the batteries from being fully charged [15]. Prolonged exposure to suboptimal charging conditions accelerates battery degradation [16].

Moreover, the change in the mutual inductance caused by misalignment [17,18] also leads to the deviation of the output. Numerous effective methods have been proposed to tackle this problem by optimizing the coupler structure [19,20,21] and designing topologies with high tolerance to misalignment [22,23]. However, the methods mentioned above can only mitigate the effects of load variation and misalignment to a certain extent, owing to the lack of control.

Therefore, implementing basic closed-loop control is a widely adopted strategy to realize stable output for WPT systems. As shown in Figure 2, a typical WPT system is composed of a DC/AC inverter, a coupler, an AC/DC rectifier and a compensation network. To maintain the DC output at a preset value, a DC/DC converter is cascaded with the inverter, such as Boost [24] and Buck–Boost [25]. An analog-to-digital converter (ADC) sampling module is placed at the output stage to acquire the feedback signal for the closed-loop control, which adjusts the conversion ratio of the DC/DC module. Some classical and effective control methods, including Proportional-Integral (PI) control, Model Predictive Control (MPC) and Sliding Mode Control (SMC), are widely adopted.

However, these methods require additional detection devices and a communication channel to transfer data between the transmitter side (TX) and the receiver side (RX), which increases cost and complexity of both hardware and software. Moreover, the variation of load resistance and mutual inductance can hardly be measured by detection devices at the RX. Consequently, considerable research efforts have been directed towards WPT control strategies based on information only from the TX [26,27,28].

Deep learning (DL) is an emerging technology designed for nonlinear complex issues, which has achieved outstanding success in computer vision, natural language processing and advanced control theories, etc. Reference [29] proposed a machine learning method using random forest and AdaBoost to estimate load resistance and coupling coefficient from transmitter-side measurements. However, this approach relies on manually extracted harmonic features via fast Fourier transform (FFT), which introduces additional computational overhead. While DL can bypass manual FFT, existing network architectures remain insufficient for analyzing harmonic-rich WPT waveforms. Image-based CNN [30] relies on external hardware and inherently neglects electrical dynamics. Furthermore, although LSTM [31] and TCN [32] process temporal data, their structures are misaligned with WPT waveform analysis. LSTM focuses on sequential temporal dependencies. TCN prioritizes long-term dependencies. Both lack parallel multi-scale receptive fields. Thus, they fail to concurrently extract harmonic-rich features. Moreover, few prior works simultaneously predict the load resistance and mutual inductance using only TX information while achieving output regulation.

Moreover, the data used in the aforementioned studies to train and validate DL models is mostly generated by offline simulation tools such as MATLAB/Simulink, LTspice or PLECS, etc. Such data has an inherent discrepancy from physical scenarios due to component parameter tolerances, parasitic effects, and switching dynamics [33,34]. This limitation reduces the prediction accuracy under practical operating conditions. Transfer learning (TL) [35] is a machine learning paradigm where knowledge acquired from solving the source domain is leveraged to improve learning efficiency and performance on a different but related target domain. A growing number of studies have utilized this algorithm in power converters [36,37], while its application in WPT parameter prediction and control remains relatively scarce.

To tackle these problems, this work proposes a Multi-Scale Parallel Convolutional (MSPC) deep learning model to achieve joint prediction of load resistance and mutual inductance, as well as deviation factor of the DC output and regulation coefficient of the DC input voltage in WPT systems utilizing the current waveform from only the TX. The MSPC model adopts a parallel structure of convolutional kernels with various dilation rates, which can capture multi-scale features from AC waveforms. This model is tailored to the harmonic-rich waveforms produced by power semiconductor switching. And TL is applied to enhance the prediction accuracy by minimizing the discrepancy between simulation data and physical data. Specifically, the MSPC model is first pre-trained using simulation datasets. The subsequent connection layers of the model are then updated via fine-tuning. With characteristic prediction finished, the regulation coefficient is used to optimize the output of the WPT system by regulating the input DC voltage. An SS-compensated experimental prototype is built to demonstrate that our method shows advantages and potential in joint characteristic prediction and output optimization using information from only the TX of WPT systems.

2. Analysis of SS-Compensated WPT System

The proposed data-driven method is designed and tested for an example of an SS-compensated WPT system. The schematic of the whole system is given in Figure 3, where

L_{P}

and

L_{S}

are the self-inductances of the coupler and M is the mutual inductance between two coils.

C_{P}

and

C_{S}

represent the compensation capacitors of the TX and RX, respectively. The full-bridge inverter consists of

Q_{1}

–

Q_{4}

, converting a DC input voltage

U_{DC}

to an AC input voltage

u_{AB}

with an operation frequency of angular frequency

ω

. And

i_{AB}

refers to the current flowing through the capacitor

C_{P}

. A full-bridge rectifier consists of

D_{1}

–

D_{4}

and the filtering capacitor

C_{O}

. It generates the DC output current

I_{O}

flowing through the load resistance

R_{L}

.

The coupler can be modeled as a loosely coupled transformer. The equivalent circuit of the SS compensated WPT system is shown in Figure 4. To ensure input ZPA and load-independent ZPA, the values of

C_{P}

and

C_{S}

are subject to

ω = \frac{1}{\sqrt{L_{P} C_{P}}} = \frac{1}{\sqrt{L_{S} C_{S}}} = 2 π f_{sw}

(1)

where

f_{sw}

is the resonant frequency of the system. The SS compensation topology is able to achieve CC output operating at

f_{sw}

, which is the frequency of the fundamental wave of

u_{AB}

. Based on the FHA method,

u_{AB}

is given as

u_{AB} (t) = \frac{4 U_{DC}}{π} sin \frac{π D}{2} sin ω t

(2)

where D is the duty cycle of the PWM wave. And the output current

I_{O}

is a load-independent expression given as

I_{O} = \frac{8 U_{DC}}{π^{2}} sin \frac{π D}{2} \cdot \frac{1}{ω M}

(3)

However, Equation (3) is derived under the condition that the higher-order harmonics of

u_{A B}

are neglected, whose standard notation should be given as

u_{A B} (t) = \frac{4 U_{DC}}{π} sin \frac{π D}{2} (sin ω t + \frac{1}{3} sin 3 ω t + \frac{1}{5} sin 5 ω t + \dots)

(4)

Therefore, the AC input of the subsequent stage can be viewed as a parallel combination of a fundamental voltage source and higher-order harmonic voltage sources. As shown in Figure 4, the input current

i_{r}

of the rectifier can be expressed as

i_{r} = \frac{j ω M u_{AB}}{(j ω L_{P} + \frac{1}{j ω C_{P}}) (j ω L_{S} + \frac{1}{j ω C_{S}} + \frac{8}{π^{2}} R_{L}) + ω^{2} M^{2}}

(5)

The higher-order harmonic components are not at resonance. Therefore, the output current consists of superimposed fundamental and harmonic components following rectification, which exhibits attenuation under heavy load conditions. The content of harmonic components can be calculated by

H_{n} (100 %) = \frac{U_{n}}{U_{1}} \times 100 %

(6)

where

U_{n}

and

U_{1}

denote the amplitude of the n-th harmonic and fundamental components of

i_{AB}

, respectively. This work simulates the third and fifth harmonic contents of

i_{AB}

in the SS-compensated WPT system using MATLAB/Simulink 2024a. The simulation parameters are based on the design values listed in Table 1, with

C_{S}

adjusted to 45 nF to ensure zero-voltage switching (ZVS). The variations in

R_{L}

and M are set from 1 to 30 Ω and 3 to 7 μH, with steps of 1 Ω and 0.1 μH, respectively. The contents of the third and fifth harmonics are illustrated in Figure 5. The result indicates that the harmonic content of

i_{AB}

decreases as the load resistance and mutual inductance increase.

The majority of research treats the rectifier as an equivalent pure resistance, whose value equals

\frac{8}{π^{2}} R_{L}

. In practice, the input impedance of the rectifier is inductive due to the nonlinearity of the diodes [38,39]. This occurs because the conduction of diodes introduces a phase shift, known as the rectifier angle

γ

between the fundamental voltage and current. Therefore, the input impedance

Z_{e}

can be modeled as an effective resistance

R_{e}

and an effective inductance

L_{e}

in series, whose value can be expressed as

Z_{e} = R_{e} + j L_{e}

(7)

As shown in Figure 3,

u_{r}

and

i_{r}

represent the input voltage and current of the rectifier. Assuming the WPT system operates in steady state, the output voltage

U_{O}

remains constant. Therefore, assuming D equals 1,

u_{r}

is a square wave with an amplitude of

U_{r}

, which can be expressed as

u_{r} (t) = \{\begin{matrix} U_{r}, & 0 \leq t \leq 0.5 T_{S} \\ - U_{r}, & 0.5 T_{S} \leq t \leq T_{S} \end{matrix}, U_{r} = U_{O} + 2 U_{D}

(8)

where

T_{S}

is the switching period and

U_{D}

is the forward voltage drop of each diode. Based on the FHA method, the fundamental component

u_{r, 1}

of

u_{r}

can be expressed as

u_{r, 1} (t) = \frac{4 (U_{O} + 2 U_{D})}{π} sin ω t

(9)

As shown in Figure 6,

i_{o, 1}

is the rectified current of

i_{r, 1}

. And

I_{O, 1}

is the fundamental component of

I_{O}

, which can be calculated by

I_{O, 1} = \frac{1}{T_{S} / 2} \int_{0}^{\frac{T_{S}}{2}} i_{o, 1} (t) d t = \frac{2}{T_{S}} \int_{0}^{\frac{T_{S}}{2}} I_{r, 1} sin (ω t - γ) d t = \frac{2}{π} I_{r, 1} cos γ

(10)

where

I_{r, 1}

is the amplitude of

i_{o, 1}

, which can then be given as

I_{r, 1} = \frac{π I_{O, 1}}{2 cos γ}

(11)

Hence, by substituting Equations (9) and (11), the phasor

Z_{e}

can be expressed as

Z_{e} = \frac{U_{r, 1}}{I_{r, 1}} = \frac{U_{r, 1}}{I_{r, 1}} ∠ γ = \frac{8 (U_{O} + 2 U_{D})}{π^{2} I_{O, 1}} cos γ ∠ γ

(12)

By substituting Equation (12),

R_{e}

can be expressed as

R_{e} = | Z_{e} | cos γ = \frac{8 (U_{O} + 2 U_{D})}{π^{2} I_{O, 1}} {cos}^{2} γ

(13)

By substituting Equation (12),

L_{e}

can be expressed as

L_{e} = | Z_{e} | sin γ = \frac{4 (U_{O} + 2 U_{D})}{π^{2} I_{O, 1}} sin 2 γ

(14)

Hence, the effective circuit of the SS-compensated WPT system can be modeled as shown in Figure 7.

Under actual operating conditions, it is almost impossible for an SS-compensated WPT system to achieve strictly load-independent CC output. Both simulation and experimental studies confirm that the actual output current

I_{O}

decreases with increasing load resistance

R_{L}

.

3. MSPC Model and Transfer Learning Framework

To achieve characteristic prediction via information from the TX, as well as optimization of the decayed output current

I_{O}

, this work proposes an MSPC model assisted by TL. To precisely quantify the extent of output deviation and input regulation, this paper defines two scaling factors that serve as the prediction target of the deep learning model. Firstly, we define the output deviation factor of

I_{O}

as

α = \frac{I_{act}}{I_{th}}

(15)

where

I_{act}

and

I_{th}

represent the output current under actual and theoretical conditions derived from the FHA method, respectively. Then, we define the regulation coefficient of

U_{DC}

as

β = \frac{U_{reg}}{U_{th}}

(16)

where

U_{reg}

denotes the regulated DC input voltage required to maintain the CC output, and

U_{th}

denotes the initial DC input voltage. By scaling the input voltage by a factor of

β

, the output current can be calibrated to approach its theoretical value. To achieve the joint prediction of

R_{L}

, M,

α

and

β

, the data-driven method is discussed in the following sections.

3.1. Multi-Scale Parallel Convolutional Model

In this paper, the MSPC model is proposed to jointly predict load resistance

R_{L}

, mutual inductance M, deviation factor

α

and regulation coefficient

β

from the transmitter-side current waveform

i_{AB}

. As illustrated in Figure 8, the proposed network employs a parallel multi-branch architecture where each branch adopts distinct dilation rates to capture dynamic information across different temporal scales. The model consists of two core modules: a Multi-Scale Feature Extractor and a Global Regressor.

3.1.1. Multi-Scale Feature Extractor

The Multi-Scale Feature Extractor serves as the core module of the MSPC model, utilizing one-dimensional dilated convolutions to capture multi-scale temporal dynamics in power electronics waveforms. For a given input sequence

X \in R^{B \times 1 \times L}

, where B denotes the batch size and L denotes the sequence length, the dilated convolution operation at position t with dilation rate d is defined as

(X *_{d} K) (t) = \sum_{i = 0}^{k - 1} K (i) \cdot X (t - d \cdot i)

(17)

where

K \in R^{k}

is the convolution kernel of size k and d controls the spacing between elements in the input sequence. This formulation enables the network to expand its receptive field exponentially without increasing the number of parameters or sacrificing resolution.

As shown in Figure 8, the extractor adopts a parallel three-branch structure. Each branch comprises two stacked dilated convolutional layers with distinct dilation rates. As discussed in Section 2,

i_{AB}

contains superimposed fundamental and higher-order harmonic components, whose contents vary significantly with

R_{L}

and M. The dilation rates are designed to cover the temporal scales corresponding to these components. The high-frequency branch employs dilation rates

d = 1

and

d = 1

, capturing rapid transients associated with switching events. The mid-frequency branch utilizes dilation rates

d = 2

and

d = 4

, extracting intermediate-scale features related to envelope variations. The low-frequency branch applies dilation rates

d = 8

and

d = 16

, capturing long-term trends and gradual changes in the waveform.

For the j-th branch, the feature extraction process can be expressed as

F_{j} = σ (K_{j, 2} *_{d_{j, 2}} σ (K_{j, 1} *_{d_{j, 1}} X))

(18)

where

K_{j, 1}

and

K_{j, 2}

denote the convolution kernels of the first and second layers in branch j, respectively, while

d_{j, 1}

and

d_{j, 2}

are their corresponding dilation rates. The function

σ (\cdot)

represents the ReLU activation function. Each branch outputs a feature map

F_{j} \in R^{B \times 8 \times L}

, with 8 representing the number of output channels. WPT waveforms exhibit structured and sparse frequency-domain properties, making eight channels sufficient to encode essential harmonic information. Furthermore, this compact dimension acts as implicit regularization to prevent overfitting given the limited training data.

The multi-scale features from all three branches are then concatenated along the channel dimension:

F_{concat} = [F_{high}, F_{mid}, F_{low}] \in R^{B \times 24 \times L}

(19)

where

F_{concat}

represents the concatenated feature map, while

F_{high}

,

F_{mid}

, and

F_{low}

denote the outputs of the high-frequency, mid-frequency, and low-frequency branches, respectively. A pointwise convolution (kernel size

1 \times 1

) is subsequently applied to fuse the multi-scale features:

F_{fused} = σ (K_{fuse} *_{1} F_{concat}) \in R^{B \times 16 \times L}

(20)

where

K_{fuse} \in R^{1 \times 24 \times 16}

is the fusion kernel, and

* 1

denotes standard convolution with stride 1. This design enables the model to simultaneously capture high-frequency transients from switching events, mid-frequency variations from control dynamics, and low-frequency power trends.

3.1.2. Global Regressor

The Global Regressor transforms the multi-scale features into task-specific predictions. First, an adaptive global average pooling layer aggregates the temporal dimension:

f_{pool} = \frac{1}{L} \sum_{t = 1}^{L} F_{fused} [:, :, t] \in R^{B \times 16}

(21)

This operation compresses the temporal information into a fixed-size vector while preserving the channel-wise characteristics. The pooled features are then passed through a fully connected layer:

Y = (W_{fc} f_{pool} + b_{fc}) \in R^{B \times D}

(22)

where

W fc \in R^{D \times 16}

and

b fc \in R^{D}

are learnable parameters and

D = 4

denotes the number of prediction targets. The final output vector is given as

output = {[R_{L}, M, α, β]}^{T}

(23)

To ensure fair and balanced learning across all tasks without bias, the loss weights for the four output variables (

R_{L}

, M,

α

and

β

) are kept equal. The overall MSPC model thus achieves joint prediction by leveraging multi-scale temporal features extracted from the TX current waveform, enabling accurate characterization of system states under varying load and coupling conditions. The detailed architecture and parameters of the MSPC model are shown in Appendix A, Table A1.

3.2. Transfer Learning

Transfer learning is a machine learning paradigm that aims to transfer knowledge learned from one task (source domain) to a different but related task (target domain) to improve learning efficiency and performance. The core concept of TL can be formalized as

f_{T}^{*} = A (D_{T}; K_{S})

(24)

where

f_{T}^{*}

is the optimal model for the target task;

D_{T}

represents the target domain training data;

K_{S}

denotes the knowledge extracted from the source domain

D_{S}

;

A

represents the transfer learning algorithm that integrates target data with source knowledge. In this work, the source domain

D_{S}

consists of abundant simulation data. The target domain

D_{T}

contains limited experimentally detected waveforms of

i_{AB}

. A fine-tuning strategy is employed to bridge the gap between simulated and physical scenarios. As shown in Figure 9, the MSPC model is first pre-trained on

D_{S}

to extract general waveform features. The model parameters are divided into two parts based on their functions. The Multi-Scale Feature Extractor parameters

θ_{f}

include the multi-scale parallel convolutional layers. The Global Regressor parameters

θ_{r}

include the global averaging and fully connected layers. The pre-training stage on

D_{S}

solves

({\hat{θ}}_{f}, {\hat{θ}}_{r}) = arg min_{θ_{f}, θ_{r}} L_{S} (θ_{f}, θ_{r}; D_{S})

(25)

where

L_{S}

is the loss function on the source domain.

({\hat{θ}}_{f}, {\hat{θ}}_{r})

are the pre-trained parameters. In this work, the Mean Squared Error (MSE) is adopted as the loss function; thus, the above equation can be formulated in detail as

L_{S} (θ_{f}, θ_{r}; D_{S}) = \frac{1}{N_{S}} \sum_{i = 1}^{N_{S}} {∥y_{i} - {\hat{y}}_{i}∥}_{2}^{2}

(26)

where

N_{S}

is the number of samples in the source domain dataset

D_{S}

,

y_{i} = {[R_{L, i}, M_{i}, α_{i}, β_{i}]}^{T}

denotes the ground-truth vector for the i-th sample as defined in Equation (26),

{\hat{y}}_{i}

represents the corresponding predicted vector, and

{∥\cdot∥}_{2}^{2}

denotes the squared L2 norm.

The model then adapts to the target domain using the knowledge

K_{S} = {{\hat{θ}}_{f}, {\hat{θ}}_{r}}

. The Multi-Scale Feature Extractor

{\hat{θ}}_{f}

is frozen to preserve its general feature extraction capability. Only the Global Regressor parameters are fine-tuned with the limited target data. This fine-tuning process is formulated as

θ_{r}^{*} = arg min_{θ_{r}} L_{T} ({\hat{θ}}_{f}, θ_{r}; D_{T})

(27)

where

L_{T}

is the loss function on the target domain.

θ_{r}^{*}

represents the optimized regressor parameters adapted to the real-world data distribution. Similarly,

L_{T}

shares the same MSE formulation as

L_{S}

, calculated over

D_{T}

.

The final model for the target task is

f_{T}^{*} (x) = f (x; {\hat{θ}}_{f}, θ_{r}^{*})

. It combines a frozen feature extractor with a fine-tuned regressor. Pre-trained on abundant simulation data, the Multi-Scale Feature Extractor captures the parameter-dependent harmonic characteristics with respect to

R_{L}

and M. The discrepancy of

i_{AB}

between

D_{S}

and

D_{T}

mainly lies in amplitude and initial phases, while the variation laws of

i_{AB}

in distinct scenarios remain constant. Therefore, only the Global Regressor requires adjustment to compensate for these physical offsets. This approach mitigates the discrepancy between

D_{S}

and

D_{T}

without requiring extensive experimental data. The overall framework of the MSPC model is illustrated in Figure 9.

3.3. Data Augmentation

All fine-tuning data from

D_{T}

is derived from actual measurements taken on each WPT prototype. When the ranges of

R_{L}

and M variations are relatively wide, obtaining sufficiently valid

D_{T}

data for fine-tuning still requires a substantial amount of work. To reduce the workload of collecting experimental data, data augmentation is applied to increase the diversity of data from

D_{T}

. Although these methods slightly alter the original information in the target domain data, they simulate the inevitable errors encountered in real-world conditions. This enhances the practical applicability of this data-driven approach. The specific data augmentation methods deployed are phase shifting, frequency distortion and amplitude scaling. Figure 9 shows the framework of TL assisted by data augmentation. Details and mechanisms are as follows.

To begin with, since the input data is obtained using sliding windows, the initial phase of the current waveform is typically random. Phase shifting increases the amount of valid data by copying the waveform starting from different phase points in the range of

[0, 2 π]

. Moreover, the PWM signal driving inverter transistors is generated by a digital controller like DSP or FPGA. There is a slight, unavoidable discrepancy between the actual and preset resonance frequency. Therefore, data points are randomly added to or removed from the

D_{T}

arrays to simulate a deviation of approximately

1 %

in frequency. Finally, since the detection of the waveform has inevitable minor errors in amplitude, the amplitude of waveforms in

D_{T}

is randomly scaled up or down by 0.01–0.1 times.

4. Online Validation of the Data-Driven Method

4.1. Benchmark Models

In this work, we compare four benchmark DL models with the proposed MSPC model to evaluate its predictive performance. The benchmark models are: Convolutional Neural Network (CNN), Temporal Convolutional Network (TCN), Long Short-Term Memory (LSTM) and Bidirectional Long Short-Term Memory (BiLSTM). They serve as basic and commonly used DL models for sequential prediction.

CNN is a widely used model initially designed for computer vision, which is also highly effective for time series processing in industrial scenarios. In this work, the 1D CNN consists of two convolutional layers (kernel sizes 7 and 5, filters = 16) with batch normalization, ReLU, and dropout (0.1); a max pooling layer (kernel size = 2) for downsampling; and global average pooling followed by a fully connected layer for final prediction.

The basic TCN consists of causal convolution and dilated convolution modules, serving as an enhanced CNN variant specifically designed for time series processing. It uses causal convolutions with dilation rates of 1 and 2 (kernel size = 3, filters = 16) to expand receptive fields without pooling. The remaining structure (global average pooling and fully connected layer) is identical to CNN.

LSTM employs unique gating units to control memory retention and updates, allowing it to effectively learn long-term dependencies in sequences, which has established it as a classic solution for various time-series tasks. In this work, the LSTM model is configured with a hidden size of 64, 2 layers, and a dropout rate of 0.3 to serve as a recurrent baseline for comparison.

As an extension of the standard LSTM, BiLSTM processes input sequences in both forward and backward directions, enabling it to capture contextual information from past and future states simultaneously. In this work, the BiLSTM is configured with a hidden size of 64, 2 layers, and a dropout rate of 0.3, serving as a bidirectional recurrent baseline for comparison.

4.2. Generation of Training Data in $D_{S}$

Data from

D_{S}

is generated by MATLAB/Simulink 2024a, with

R_{L}

varied from 1 to 30

Ω

in steps of 1

Ω

and M varied from 3 to 7 μH in steps of 0.1 μH.

i_{A B}

in each scenario has a sequence length of 5000. A sliding window method is taken to segment the initial data, whose window size is 2000 and stride is 100. This method turns the shape of

D_{S}

into

R^{38130 \times 2000}

, and the shape of labels is

R^{38130 \times 4}

according to Equation (23).

To generate highly accurate ground-truth labels for the neural network, an automated Simulink-based simulation framework combined with Brent’s method is developed. Specifically, for a given combination of

R_{L}

and M, Brent’s method is employed as an iterative optimizer to dynamically adjust

U_{DC}

in the simulation model. The algorithm intelligently switches between inverse quadratic interpolation, the secant method, and bisection based on convergence conditions. For instance, when the interpolation fails or only two points are valid, the next candidate is approximated by the secant-based update rule as follows:

β_{k + 1} = β_{k} - f (β_{k}) \frac{β_{k} - β_{k - 1}}{f (β_{k}) - f (β_{k - 1})}

(28)

where

f (β_{k}) = I_{act} (β_{k}) - I_{th}

represents the output current error. This process of running a full simulation cycle and dynamically updating the input voltage repeats automatically until

| f (β_{k + 1}) |

converges to a predefined tolerance threshold. In this work, the iteration terminates when the voltage

U_{DC}

variation between consecutive steps falls below 0.01 V, corresponding to a current error of approximately 1 mA. Through this mechanism, the exact

β

required to counteract the non-ideal current droop is reliably obtained for every operating scenario.

4.3. Validation Results of Prediction Accuracy

To evaluate the performance of time series predictive models, the following metrics are utilized to quantify the prediction accuracy. Mean Absolute Error (MAE) quantifies the average absolute value of prediction error, which is expressed as

M A E = \frac{1}{n} \sum_{i = 1}^{n} | y_{i} - \hat{y_{i}} |

(29)

where n denotes the sample size and

\bar{y}

denotes the average value of the actual samples. Root Mean Squared Error (RMSE) is defined as the square root of the mean squared errors, which is expressed as

R M S E = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - \hat{y_{i}})}^{2}}

(30)

Coefficient of Determination (

R^{2}

) quantifies how well the data fit the regression model. A value closer to 1 indicates superior predictive performance. A low

R^{2}

is generally an unacceptable sign for predictive models. It is calculated by

R^{2} = 1 - \frac{\sum_{i = 1}^{n} {(y_{i} - \hat{y_{i}})}^{2}}{\sum_{i = 1}^{n} {(y_{i} - \bar{y_{i}})}^{2}}

(31)

Data from

D_{S}

is split into a training set and a validation set at a 7:3 ratio. To prevent data leakage caused by the overlapping sliding windows, this split is performed at the scenario level (combinations of

R_{L}

and M) rather than the window level, ensuring no overlap between training and validation scenarios. Both the training data and the labels were normalized. The hardware and software environment for training can be seen in Table 2. To ensure the fairness of the comparison, all baseline models have been tuned to their optimal configurations. The adaptive average pooling layer, flatten operation and final fully connected layer are kept identical across the MSPC model with CNN and TCN. All models are trained for 500 epochs with a learning rate of 0.001. The validation results are shown in Table 3. All three loss metrics perform best in the MSPC model. Moreover,

R^{2}

is extremely close to 1. The findings indicate that the MSPC model outperforms other models in prediction loss, due to its ability to extract dynamic features at multiple time scales of the WPT waveform, which contains high-frequency current components induced by power electronic switching devices.

4.4. Sensitivity Analysis on Dilation Rates

To justify the selection of dilation rates of

{(1, 1), (2, 4), (8, 16)}

mentioned in Section 3.1.1, a sensitivity analysis is conducted by comparing the proposed scheme with two alternative configurations. The architecture and other hyperparameters of the MSPC model are kept identical. The validation results are presented in Table 4.

Scheme 1 yields the highest RMSE (0.0332) due to its insufficient receptive field to capture the long-term fundamental envelope of

i_{A B}

. Scheme 2 also degrades performance since the overly sparse dilation skips crucial sampling points of the higher-order harmonic components. The proposed exponential scheme (

{(1, 1), (2, 4), (8, 16)}

) achieves the optimal balance, which best exploits the multi-scale feature extraction capability of the MSPC model.

4.5. Ablation Study on Multi-Scale Architecture

To verify the necessity of the multi-scale parallel design in the MSPC model, an ablation study is conducted. As given in Equation (19),

F_{concat}

is formed by concatenating

F_{high}

,

F_{mid}

and

F_{low}

in parallel. The performance of the MSPC model employing one or two branches is assessed. All variants were trained and validated under identical conditions. The results are shown in Table 5. The validation results indicate that relying on a single temporal scale yields suboptimal prediction accuracy. Specifically,

F_{low}

achieves the best single-branch performance (

R^{2} = 0.9492

). However, its prediction error remains relatively high (MAE = 0.0265). Conversely,

F_{high}

and

F_{mid}

branches alone struggle to capture the comprehensive waveform dynamics, resulting in lower

R^{2}

values. This confirms that a single receptive field is insufficient to characterize the harmonic-rich WPT waveforms. Combining any two branches leads to a substantial performance improvement. Notably, the combination of

[F_{mid}, F_{low}]

yields an

R^{2}

of 0.9901.

Ultimately, the proposed full architecture

[F_{high}, F_{mid}, F_{low}]

achieves the optimal performance across all schemes (MAE = 0.0105, RMSE = 0.0156,

R^{2}

= 0.9969). This indicates that the multi-frequency features associated with switching events play a necessary role in completing the waveform representation and minimizing prediction errors. Therefore, the parallel multi-scale design is strictly necessary to comprehensively extract the multi-scale harmonic information in WPT systems.

4.6. Computational Complexity and Inference Latency

To evaluate the feasibility of deploying the MSPC model on resource-constrained embedded systems, a comparison of computational complexity is conducted. Floating Point Operations (FLOPs) represent the total number of addition and multiplication operations required to perform a single model inference. A lower FLOPs count indicates reduced computational demand, resulting in faster processing speed on embedded platforms. This work compares the FLOPs and parameter counts of the MSPC network with four benchmark models and three widely used algorithms for edge computing (TinyRNN, MiniRocket and 1D Transformer). The results are shown in Table 6. The shapes of input and output data of validation models remain consistent.

As observed, LSTM, BiLSTM and 1D Transformer suffer from massive computational overhead, while the MSPC model maintains a lightweight profile with 2.096

M

FLOPs and 1.147

K

parameters. This demonstrates the potential of the MSPC model for future deployment on embedded devices. Furthermore, the inference latency of the proposed MSPC model was measured. The average single-inference time over 100 consecutive runs is merely 1.337 ms. This verifies the real-time feasibility of the MSPC model for embedded WPT controllers.

5. Experiments

5.1. Experimental Setup

To verify the above data-driven method for characteristic prediction and output optimization, an SS compensated WPT prototype is built, as shown in Figure 10. The parameters of the experimental setup are given in Table 1. To realize precise frequency hopping, Microcontroller TMS320F28335 is used to drive

Q_{1}

–

Q_{4}

. Operating at a kHz-level switching frequency,

Q_{1}

–

Q_{4}

use MOSFET IPW65R080CFD and

D_{1}

–

D_{4}

use diodes IDW15E65D2 from Infineon Inc. The initial duty cycle D of the PWM is set as 0.95. To ensure ZVS,

C_{S}

of 44.107 nF has a decrement compared to the calculated value.

A current sensor probe (CP1015) is used to detect the waveform of

i_{A B}

. During the experiment, the misalignment distance is varied within the range 0–55 mm, which results in a range of 2.9 μH to 6.7 μH for the mutual inductance variation. The current sensor probe is used to transmit the detected waveform to the host computer, which runs MATLAB/Simulink 2024a and Visual Studio Code for generating simulation data and operating DL models.

The initial 30 experimental scenarios for

D_{T}

data are selected to cover the primary operational ranges, with M varying from approximately 4.46 to 6.53 μH and

R_{L}

ranging from 1.2 to 30 Ω. Subsequently, data augmentation is applied to expand the scenario numbers from 30 to 300. To apply transfer learning, the Multi-Scale Feature Extractor of MSPC is frozen to preserve the feature extraction capability learned from

K_{S}

. The parameters of the Global Regressor are fine-tuned using the data from

D_{T}

. Additionally, the output linear layer of the comparative models is unfrozen. The fine-tuning epoch of TL is 500. The hardware and software environment remains the same as what is shown in Table 2. Figure 11 shows the experimental and simulated

i_{AB}

waveforms when

R_{L} = 14

Ω and

M = 5.2

μH. As depicted in the figure, the measured current waveform exhibits slight deviations in both amplitude and frequency compared to the simulated data, as mentioned in Section 3.3. The Total Harmonic Distortion (THD) also varies due to the non-ideal characteristics of practical circuit connections. Moreover, the inherent sensor noise from measurement equipment cannot be neglected.

5.2. Characteristic Prediction Results

To evaluate the prediction accuracy of the MSPC model assisted with transfer learning, a full factorial experiment was conducted with 15 levels of

R_{L}

and four levels of M, resulting in 60 distinct test conditions. Table 7 illustrates the prediction performance of tested models without and with TL, respectively. Models trained only on simulation data suffer from severe prediction errors in physical scenarios. For instance, the errors for

α

and

β

exceed

50 %

across all models. This clearly highlights the gap between simulation and real-world environments. However, assisted by TL, all models show significant improvement in accuracy. This indicates TL’s generalization capability in practical applications. The proposed MSPC model combined with TL achieves the lowest errors. Its average relative error remains below

2.55 %

across all four predicted parameters. This better performance illustrates its effective multi-scale feature extraction and generalization ability.

Figure 12 shows the prediction results of MSPC assisted by TL. Most predicted values closely match the measured ones, with the average error remaining below 3% and 1% for

R_{L}

and M, respectively. The results comparison also highlights the advantage of TL in reducing the discrepancy between simulated and actual conditions, which is crucial for characteristic prediction with information only from TX in WPT systems.

5.3. Output Optimization Results

To validate the output optimization performance of the data-driven method, the predicted

β

is used to compensate for

I_{O}

attenuation by multiplying

U_{DC}

by

β

. Regarding error propagation, the direct prediction of

β

is decoupled from the estimation of

R_{L}

and M. This ensures that characteristic prediction errors do not propagate to the output optimization process, guaranteeing the robustness of the CC regulation. Figure 13 shows the result of output optimization under different load resistances when M is 4.98 μH.

α_{real}

denotes the ratio of the actual output DC current to the theoretical one. By querying the MSPC model’s output results under each scenario,

U_{DC}

is manually multiplied by

β_{pred}

.

β_{sim}

is derived from simulation using Brent’s method mentioned in Section 4.2. And

β_{real}

is obtained by continuously adjusting the voltage of the DC power supply until

I_{O}

is fully compensated. The results illustrate that simply using data from

D_{S}

cannot reliably predict the deviation factor or optimize the CC output current attenuated by heavy load resistance. In contrast, the proposed MSPC model with TL effectively compensates for this nonlinear error.

Figure 14 and Figure 15 show the experimental waveforms of the prototype without and with output optimized under four operating conditions, respectively, where

u_{GS}

represents the voltage between the gate and source of MOSFET

Q_{1}

. As shown in Figure 14,

I_{O}

exhibits attenuation under increasing

R_{L}

. For instance, when M is

6.53 μ H

, increasing the load resistance from

15 Ω

to

30 Ω

causes a noticeable drop in

I_{O}

from 2.11 A to 1.98 A. A similar degradation trend is observed when M is

5.63 μ H

.

The steady-state waveforms obtained after applying the proposed data-driven optimization are depicted in Figure 15. By scaling the DC input voltage according to the predicted

β

,

I_{O}

is successfully restored and maintained at a stable level. Specifically, under the same

6.53 μ H

condition, the output at

15 Ω

and

30 Ω

are clamped at 2.38 A and 2.37 A, respectively, with a tiny deviation from the preset value (2.3707 A). These experimental results consistently validate that the proposed data-driven optimization method can effectively compensate for the nonlinear current attenuation. Figure 14 and Figure 15 show that

u_{AB}

and

i_{AB}

are free from ringing at the switching instants. This indicates that the system maintains clear ZVS across all scenarios. For instance, when

R_{L}

is

15 Ω

and M is 6.53

μ H

, the zero-crossing point of

i_{AB}

lags behind that of

u_{AB}

by approximately 400 ns (phase angle =

{28.8}^{\circ}

). The lagging extent of the current

i_{AB}

decreases with increasing

R_{L}

. When

R_{L}

reaches its maximum value of 30

Ω

, ZVS reaches its critical boundary. The soft switching performance is preserved after output optimization.

In future embedded deployments, the DSP will sample the TX current in real time. The lightweight MSPC model will then infer the value of

β

. The target DC input voltage required by the WPT system can be calculated as

U_{reg} = β \times U_{th}

. Subsequently, the PWM duty cycle of the front-end DC/DC converter (as shown in Figure 2) is regulated to track this target voltage. Through this continuous process, the output current of the WPT system is automatically stabilized at the preset value without manual intervention.

6. Conclusions

This work has proposed a data-driven method for joint characteristic prediction and output optimization for WPT systems relying solely on the TX AC current. A Multi-Scale Parallel Convolutional neural network is introduced to predict load resistance, mutual inductance, deviation factor, and regulation coefficient, while the output current is optimized by multiplying the input DC voltage by the regulation coefficient under heavy load conditions. Transfer learning and data augmentation are applied to minimize the discrepancy between simulation data and physical conditions. The MSPC model demonstrates excellent prediction performance, achieving an

R^{2}

of 0.9969. In experimental validation, the average relative error for characteristic prediction is below

2.55 %

, and the output optimization effectively maintains the CC output against load variations.

Author Contributions

Methodology, S.Y.; validation, J.L.; writing—original draft, S.Y.; supervision, J.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the project “Development of Deep-Learning-Based Wireless Power Transfer Equipment for Underwater Robots”, grant number 2023h406.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Detailed architecture and parameters of the MSPC model.

Layer	Kernel Size	Dilation Rate	Input Dim	Output Dim
High-frequency branch (Short-time scale)
Conv1d	3	1	$1 \times 2000$	$8 \times 2000$
ReLU	–	–	$8 \times 2000$	$8 \times 2000$
Conv1d	3	1	$8 \times 2000$	$8 \times 2000$
ReLU	–	–	$8 \times 2000$	$8 \times 2000$
Mid-frequency branch (Mid-time scale)
Conv1d	3	2	$1 \times 2000$	$8 \times 2000$
ReLU	–	–	$8 \times 2000$	$8 \times 2000$
Conv1d	3	4	$8 \times 2000$	$8 \times 2000$
ReLU	–	–	$8 \times 2000$	$8 \times 2000$
Low-frequency branch (Long-time scale)
Conv1d	3	8	$1 \times 2000$	$8 \times 2000$
ReLU	–	–	$8 \times 2000$	$8 \times 2000$
Conv1d	3	16	$8 \times 2000$	$8 \times 2000$
ReLU	–	–	$8 \times 2000$	$8 \times 2000$
Feature Concatenation and Fusion
Concatenate	–	–	$8 \times 2000$ ( $\times 3$ )	$24 \times 2000$
Conv1d ( $1 \times 1$ )	1	1	$24 \times 2000$	$16 \times 2000$
ReLU	–	–	$16 \times 2000$	$16 \times 2000$
Global Regressor
AdaptiveAvgPool1d	–	–	$16 \times 2000$	$16 \times 1$
Flatten	–	–	$16 \times 1$	16
Linear	–	–	16	4

The dimensions are expressed as (Channel × Sequence Length). The sequence length L is 2000. The output dimension is 4, corresponding to

{[R_{L}, M, α, β]}^{T}

.

References

Li, H.; Tan, L.; Xu, H.; Wu, Z.; Huang, X. A Wide-Range Global Optimal Control Strategy for Wireless Charging Systems in Electric Vehicles. IEEE Trans. Power Electron. 2024, 39, 16864–16876. [Google Scholar] [CrossRef]
Teeneti, C.R.; Truscott, T.T.; Beal, D.N.; Pantic, Z. Review of Wireless Charging Systems for Autonomous Underwater Vehicles. IEEE J. Ocean. Eng. 2021, 46, 68–87. [Google Scholar]
Wang, Y.; Sun, Z.; Guan, Y.; Xu, D. Overview of Megahertz Wireless Power Transfer. Proc. IEEE 2023, 111, 528–554. [Google Scholar] [CrossRef]
Fang, Z.; Han, S.; Huang, M.; Martins, R.P.; Lu, Y. Design and Analysis of Small-TX Large-RX Coupler in Wireless Charging System for Mobile Devices. IEEE Trans. Circuits Syst. I 2026, 73, 2170–2180. [Google Scholar]
Yao, S.; Lin, X.; Chen, P.; Guo, X.; Li, C.; Fu, M. Loss Analysis and Heat Optimization for a 700-mW Inductive Power Transfer System for Implanted Brain-Computer Interface. IEEE Trans. Power Electron. 2026. early access. [Google Scholar] [CrossRef]
Park, Y.; Dang Hung, P.; Youn, D.; Kwon, D.; Kim, C.; Je, M. A Wireless Power and Data Transfer System for Medical Implants Using a Miniaturized Inductive Link with Frequency-Splitting Enhancement. IEEE J. Solid-State Circuits 2025, 60, 3966–3984. [Google Scholar] [CrossRef]
Xia, F.; Mao, F.; Lu, Y.; Sawan, M. Optimizing Power Transfer Efficiency in Biomedical Implants: A Comparative Analysis of SS and SP Inductive Link Topologies. IEEE Trans. Power Electron. 2024, 39, 11770–11783. [Google Scholar] [CrossRef]
Cheng, C.; Li, W.; Zhou, Z.; Deng, Z.; Mi, C. A Load-Independent Wireless Power Transfer System with Multiple Constant Voltage Outputs. IEEE Trans. Power Electron. 2020, 35, 3328–3331. [Google Scholar]
Lian, J.; Qu, X. An LCLC-LC-Compensated Capacitive Power Transferred Battery Charger with Near-Unity Power Factor and Configurable Charging Profile. IEEE Trans. Ind. Appl. 2022, 58, 1053–1060. [Google Scholar]
Qu, X.; Han, H.; Wong, S.-C.; Tse, C.K.; Chen, W. Hybrid IPT Topologies with Constant Current or Constant Voltage Output for Battery Charging Applications. IEEE Trans. Power Electron. 2015, 30, 6329–6337. [Google Scholar] [CrossRef]
Li, S.; Li, W.; Deng, J.; Nguyen, T.D.; Mi, C.C. A Double-Sided LCC Compensation Network and Its Tuning Method for Wireless Power Transfer. IEEE Trans. Veh. Technol. 2015, 64, 2261–2273. [Google Scholar] [CrossRef]
Mai, R.; Chen, Y.; Zhang, Y.; Yang, N.; Cao, G.; He, Z. Optimization of the Passive Components for an S-LCC Topology-Based WPT System for Charging Massive Electric Bicycles. IEEE Trans. Ind. Electron. 2018, 65, 5497–5508. [Google Scholar] [CrossRef]
Harinarayanan, J.; Balamurugan, P. SOC Estimation for a Lithium-Ion Pouch Cell Using Machine Learning under Different Load Profiles. Sci. Rep. 2025, 15, 18091. [Google Scholar] [CrossRef]
Chaudhari, T.; Chakravorty, S. Analysis and Advancements of the State of Charge Estimation Methods in Smart Battery Management System Supported by Lithium-Ion Battery Operated Electric Vehicles. Next Energy 2025, 8, 100337. [Google Scholar] [CrossRef]
Bose, B.; Garg, A.; Panigrahi, B.K.; Kim, J. Determination of Constant Current to Constant Voltage Switch-over Point for Health-Aware Fast Charging Using Heuristic Algorithm. J. Energy Storage 2023, 67, 107543. [Google Scholar] [CrossRef]
Tahir, M.U.; Chakraborty, S.; Akboy, E.; Sangwongwanich, A.; Stroe, D.I.; Hegazy, O.; Blaabjerg, F. System-Level Performance Analysis of Li-Ion Batteries and DC–DC Converters Under Various Charging Strategies. IEEE Open J. Power Electron. 2025, 6, 1674–1684. [Google Scholar] [CrossRef]
Wang, M.; Song, G.; Yin, R.; Shi, Y. Design and Analysis of an Anti-Misalignment Wireless Power Transfer System. IEEE Microw. Wirel. Tech. Lett. 2023, 33, 228–231. [Google Scholar] [CrossRef]
Sun, Z.; Hu, G.; Li, G.; Wang, Y.; Guan, Y.; Xu, D. Analysis and Design of a 6.78 M Wireless Power Transfer System with Strong Misalignment Tolerance Based on Simplified Impedance Trajectory Tracing. IEEE Trans. Ind. Electron. 2024, 71, 13470–13475. [Google Scholar] [CrossRef]
Zhao, W.; Qu, X.; Lian, J.; Tse, C.K. A Family of Hybrid IPT Couplers with High Tolerance to Pad Misalignment. IEEE Trans. Power Electron. 2022, 37, 3617–3625. [Google Scholar] [CrossRef]
Xu, H.; Tan, L.; Huang, X. Design of a Magnetic Coupler for Wireless Power Transfer Systems with High Rotational Misalignment Tolerance Based on Rotating Magnetic Flux. IEEE J. Emerg. Sel. Top. Power Electron. 2025, 13, 6741–6752. [Google Scholar] [CrossRef]
Zhang, B.; Jiang, C.Q.; Ma, T.; Guo, H.; Yang, J.; Yang, F.; Wang, Y.; Chen, C. A Thermally Reliable Variable-Topology Magnetic Coupler with Closed-Loop Multiphysics Co-Design for Guided-Platform AUV Wireless Charging. IEEE Trans. Transp. Electrif. 2026. early access. [Google Scholar] [CrossRef]
Zhang, Y.; Wei, G.; Zhang, J.; Hao, L.; Cheng, L. A Hybrid Topology Relay Based Wireless Power Transfer System with Mutual Inductance Enhancement and High Misalignment Tolerance. IEEE Trans. Power Electron. 2025, 40, 7640–7645. [Google Scholar] [CrossRef]
Zhang, X.; Li, G.; Wang, F.; Wang, Y.; Chen, T.; Yang, Q. An Underwater Hybrid Wireless Power Transfer System with Constant Power Output Against Misalignment. IEEE Trans. Power Electron. 2026, 41, 1402–1416. [Google Scholar] [CrossRef]
Chen, J.; Xie, F.; Zhang, B.; Chen, Y.; Xiao, W. Transmission Range Extension Strategy of Parity–Time-symmetry-based Wireless Power Transfer System by a Boost Converter. Int. J. Circuit Theory Appl. 2023, 51, 510–524. [Google Scholar] [CrossRef]
Ma, C.; Qu, X.; Guo, Z.; Tan, L. Four-Switch Buck-Boost Integrated Bridge for Bidirectional Inductive Power Transfer with Hybrid Energy Storage System. IEEE Trans. Ind. Electron. 2025, 72, 9028–9038. [Google Scholar] [CrossRef]
Zeng, J.; Wu, J.; Li, K.; Yang, Y.; Hui, S.Y.R. Dynamic Monitoring of Battery Variables and Mutual Inductance for Primary-Side Control of a Wireless Charging System. IEEE Trans. Ind. Electron. 2024, 71, 7966–7974. [Google Scholar] [CrossRef]
Hong, W.; Lee, S.; Lee, S.-H. Sensorless Control of Series–Series Tuned Inductive Power Transfer System. IEEE Trans. Ind. Electron. 2023, 70, 10578–10587. [Google Scholar] [CrossRef]
Guo, Y.; Zhang, Y.; Li, S.; Tao, C.; Wang, L. Load Parameter Joint Identification of Wireless Power Transfer System Based on the DC Input Current and Phase-Shift Angle. IEEE Trans. Power Electron. 2020, 35, 10542–10553. [Google Scholar] [CrossRef]
Mahmud, S.A.A.; Jayathurathnage, P.; Tretyakov, S.A. Machine Learning Assisted Characteristics Prediction for Wireless Power Transfer Systems. IEEE Access 2022, 10, 40496–40505. [Google Scholar] [CrossRef]
Bertoluzzo, M.; Di Barba, P.; Forzan, M.; Mognaschi, M.E.; Sieni, E. A Deep Learning Approach to Improve the Control of Dynamic Wireless Power Transfer Systems. Energies 2023, 16, 7865. [Google Scholar] [CrossRef]
Dai, Z.; Yang, Y.; Luo, Y.; Chen, S.; Lin, Z. A Receiver Position Estimation Method Based on LSTM for Multi-Transmitter Single-Receiver Wireless Power Transfer Systems. Electronics 2024, 13, 4670. [Google Scholar] [CrossRef]
Park, J.J.; Moon, J.H.; Jang, H.H.; Kim, D.I. Unified Simultaneous Wireless Information and Power Transfer for IoT: Signaling and Architecture with Deep Learning Adaptive Control. IEEE Internet Things J. 2022, 9, 17551–17567. [Google Scholar] [CrossRef]
Bai, H.; Huang, G.; Liu, C.; Huangfu, Y.; Gao, F. A Controller HIL Testing Approach of High Switching Frequency Power Converter via Slower-Than-Real-Time Simulation. IEEE Trans. Ind. Electron. 2024, 71, 8690–8702. [Google Scholar] [CrossRef]
Diz, S.D.L.; López, R.M.; Sánchez, F.J.R.; Llerena, E.D.; Peña, E.J.B. A Real-Time Digital Twin Approach on Three-Phase Power Converters Applied to Condition Monitoring. Appl. Energy 2023, 334, 120606. [Google Scholar] [CrossRef]
Pan, S.J.; Yang, Q. A Survey on Transfer Learning. IEEE Trans. Knowl. Data Eng. 2010, 22, 1345–1359. [Google Scholar] [CrossRef]
Zeng, Y.; Rodriguez, E.; Liu, Q.; Liang, G.; Jie, H.; Pou, J.; Ruan, H.; Kotturu, J. Easy Transfer Learning-Based Model-Data-Hybrid-Driven Fault Detection for Battery Inverters. IEEE Trans. Ind. Electron. 2025, 72, 5481–5487. [Google Scholar] [CrossRef]
Xu, H.; Liu, Z.; Jiang, D.; Qu, R.; Tian, J. Deep Transfer Learning Technology-Based Condition Monitoring and Fault Diagnosis of Electric Vehicle Electric Powertrain Systems: A Review. IEEE Trans. Power Electron. 2026, 41, 823–848. [Google Scholar] [CrossRef]
Yang, Y.; Wu, J.; Zhou, J.; Tan, S.-C.; Hui, S.Y.R. Real-Time Parameter Estimation in SS Compensated Wireless Power Transfer Systems Considering Nonlinearity of the Diode Rectifier. IEEE Trans. Power Electron. 2026, 41, 8772–8785. [Google Scholar] [CrossRef]
Li, S.; Li, F.; Zhang, R.; Tao, C.; Wang, L. Accurate Modeling, Design, and Load Estimation of LCC -S Based WPT System with a Wide Range of Load. IEEE Trans. Power Electron. 2023, 38, 11763–11775. [Google Scholar] [CrossRef]

Figure 1. Charging process of batteries and current attenuation under heavy load conditions.

Figure 2. Schematic of the typical WPT system with closed-loop control.

Figure 3. Schematic of the SS-compensated WPT system.

Figure 4. Equivalent model of the loosely coupled transformer.

Figure 5. (a) The 3rd harmonic content. (b) The 5th harmonic content.

Figure 6. Steady waveforms of the rectifier in WPT systems.

Figure 7. The effective circuit of the SS compensated WPT system.

Figure 8. Structure of the MSPC network, where d represents the dilation rate.

Figure 9. Framework of the transfer learning and data augmentation.

Figure 10. The experimental prototype.

Figure 11. The experimental and simulated

i_{AB}

waveforms when

R_{L} = 14

Ω and

M = 5.2

μH.

Figure 11. The experimental and simulated

i_{AB}

waveforms when

R_{L} = 14

Ω and

M = 5.2

μH.

Figure 12. Identification results of M and

R_{L}

of the MSPC model with TL.

Figure 12. Identification results of M and

R_{L}

of the MSPC model with TL.

Figure 13. Attenuation and optimization results when M is 4.98 μH, where

α_{sim}

and

β_{sim}

represent the coefficients from simulation results,

α_{real}

and

β_{real}

represent the tested coefficients from experiments,

α_{pred}

and

β_{pred}

represent the results predicted by MSPC model with TL.

Figure 13. Attenuation and optimization results when M is 4.98 μH, where

α_{sim}

and

β_{sim}

represent the coefficients from simulation results,

α_{real}

and

β_{real}

represent the tested coefficients from experiments,

α_{pred}

and

β_{pred}

represent the results predicted by MSPC model with TL.

Figure 14. Experimental waveforms of the WPT circuit. (a)

R_{L} = 15 Ω

and

M = 6.53 μ H

. (b)

R_{L} = 30 Ω

and

M = 6.53 μ H

. (c)

R_{L} = 15 Ω

and

M = 5.63 μ H

. (d)

R_{L} = 30 Ω

and

M = 5.63 μ H

.

Figure 14. Experimental waveforms of the WPT circuit. (a)

R_{L} = 15 Ω

and

M = 6.53 μ H

. (b)

R_{L} = 30 Ω

and

M = 6.53 μ H

. (c)

R_{L} = 15 Ω

and

M = 5.63 μ H

. (d)

R_{L} = 30 Ω

and

M = 5.63 μ H

.

Figure 15. Experimental waveforms of the WPT circuit with output optimized. (a)

R_{L} = 15 Ω

,

M = 6.53 μ H

and

U_{reg} = 27.46 V

. (b)

R_{L} = 30 Ω

,

M = 6.53 μ H

and

U_{reg} = 31.62 V

. (c)

R_{L} = 15 Ω

,

M = 5.63 μ H

and

U_{reg} = 27.85 V

. (d)

R_{L} = 30 Ω

,

M = 5.63 μ H

and

U_{reg} = 32.03 V

.

Figure 15. Experimental waveforms of the WPT circuit with output optimized. (a)

R_{L} = 15 Ω

,

M = 6.53 μ H

and

U_{reg} = 27.46 V

. (b)

R_{L} = 30 Ω

,

M = 6.53 μ H

and

U_{reg} = 31.62 V

. (c)

R_{L} = 15 Ω

,

M = 5.63 μ H

and

U_{reg} = 27.85 V

. (d)

R_{L} = 30 Ω

,

M = 5.63 μ H

and

U_{reg} = 32.03 V

.

Table 1. Experimental parameter details.

Description	Design Value	Experiment Value
Input DC voltage ( $U_{DC}$ /V)	24.0	24.0
TX self-inductance ( $L_{P}$ /μH)	13.715	13.654
RX self-inductance ( $L_{S}$ /μH)	12.300	12.181
Mutual inductance (M/μH)	3.0 to 6.0	2.9 to 6.7
TX capacitor ( $C_{P}$ /nF)	46.173	45.051
RX capacitor ( $C_{S}$ /nF)	51.484	44.107
Resonance frequency ( $f_{0}$ /kHz)	200	198–200
Load resistance ( $R_{L}$ / $Ω$ )	1 to 30	1 to 30

Table 2. Training hardware and software environment.

Hardware	Intel Core i9-14900HX CPU
	NVIDIA GeForce RTX 4060 GPU
	32 GB RAM
Software	Python 3.13.5
	Pytorch 2.7.1
	Numpy 2.1.3
	Adam Optimizer

Table 3. Validation results among tested models.

Model	MAE	RMSE	$R^{2}$
CNN	0.0799	0.1188	0.8296
TCN	0.0597	0.1006	0.8793
LSTM	0.0472	0.0789	0.8803
BiLSTM	0.0321	0.0446	0.9246
MSPC	0.0105	0.0156	0.9969

Table 4. Prediction results of the MSPC model with distinct dilation rates.

Dilation Rates	Description	MAE	RMSE	$R^{2}$
${(1, 1), (2, 2), (4, 4)}$	Scheme 1	0.0214	0.0332	0.9812
${(1, 1), (4, 8), (16, 32)}$	Scheme 2	0.0178	0.0285	0.9885
${(1, 1), (2, 4), (8, 16)}$	Proposed Scheme	0.0105	0.0156	0.9969

The dilation rates in each scheme are assigned to

F_{high}

,

F_{mid}

,

F_{low}

, respectively.

Table 5. Validation results of multi-scale architecture.

$F_{concat}$	MAE	RMSE	$R^{2}$
$F_{high}$	0.0583	0.1042	0.8658
$F_{mid}$	0.0625	0.1030	0.8683
$F_{low}$	0.0265	0.0398	0.9492
$[F_{high}, F_{mid}]$	0.0186	0.0292	0.9892
$[F_{high}, F_{low}]$	0.0207	0.0327	0.9847
$[F_{mid}, F_{low}]$	0.0155	0.0208	0.9901
$[F_{high}, F_{mid}, F_{low}]$	0.0105	0.0156	0.9969

Table 6. Comparison results of computational complexity.

Model	FLOPs (M)	Parameters (K)
CNN	3.056	1.539
TCN	1.920	0.963
LSTM	102.912	50.627
BiLSTM	271.360	134.019
TinyRNN	1.760	0.899
MiniRocket	1.638	1.074
1D Transformer	133.248	67.011
MSPC	2.096	1.147

Table 7. Validation error of tested models with and without TL.

Model	$R_{L}$	M	$α$	$β$
CNN	32.88%	12.66%	60.17%	73.41%
TCN	33.47%	12.52%	53.40%	64.23%
LSTM	47.90%	19.11%	59.47%	65.97%
BiLSTM	39.65%	28.47%	53.21%	71.43%
MSPC	45.64%	12.14%	53.62%	63.84%
CNN+TL	30.76%	6.23%	18.64%	10.29%
TCN+TL	35.66%	9.98%	7.85%	15.94%
LSTM+TL	10.96%	8.63%	10.07%	8.36%
BiLSTM+TL	7.32%	3.18%	3.38%	6.29%
MSPC+TL	2.55%	0.91%	1.52%	2.64%

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yang, S.; Lian, J. Data-Driven Characteristic Prediction and Output Optimization for Wireless Power Transfer Systems. Electronics 2026, 15, 2586. https://doi.org/10.3390/electronics15122586

AMA Style

Yang S, Lian J. Data-Driven Characteristic Prediction and Output Optimization for Wireless Power Transfer Systems. Electronics. 2026; 15(12):2586. https://doi.org/10.3390/electronics15122586

Chicago/Turabian Style

Yang, Shengtao, and Jing Lian. 2026. "Data-Driven Characteristic Prediction and Output Optimization for Wireless Power Transfer Systems" Electronics 15, no. 12: 2586. https://doi.org/10.3390/electronics15122586

APA Style

Yang, S., & Lian, J. (2026). Data-Driven Characteristic Prediction and Output Optimization for Wireless Power Transfer Systems. Electronics, 15(12), 2586. https://doi.org/10.3390/electronics15122586

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Data-Driven Characteristic Prediction and Output Optimization for Wireless Power Transfer Systems

Abstract

1. Introduction

2. Analysis of SS-Compensated WPT System

3. MSPC Model and Transfer Learning Framework

3.1. Multi-Scale Parallel Convolutional Model

3.1.1. Multi-Scale Feature Extractor

3.1.2. Global Regressor

3.2. Transfer Learning

3.3. Data Augmentation

4. Online Validation of the Data-Driven Method

4.1. Benchmark Models

4.2. Generation of Training Data in $D_{S}$

4.3. Validation Results of Prediction Accuracy

4.4. Sensitivity Analysis on Dilation Rates

4.5. Ablation Study on Multi-Scale Architecture

4.6. Computational Complexity and Inference Latency

5. Experiments

5.1. Experimental Setup

5.2. Characteristic Prediction Results

5.3. Output Optimization Results

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Data-Driven Characteristic Prediction and Output Optimization for Wireless Power Transfer Systems

Abstract

1. Introduction

2. Analysis of SS-Compensated WPT System

3. MSPC Model and Transfer Learning Framework

3.1. Multi-Scale Parallel Convolutional Model

3.1.1. Multi-Scale Feature Extractor

3.1.2. Global Regressor

3.2. Transfer Learning

3.3. Data Augmentation

4. Online Validation of the Data-Driven Method

4.1. Benchmark Models

4.2. Generation of Training Data in D S

4.3. Validation Results of Prediction Accuracy

4.4. Sensitivity Analysis on Dilation Rates

4.5. Ablation Study on Multi-Scale Architecture

4.6. Computational Complexity and Inference Latency

5. Experiments

5.1. Experimental Setup

5.2. Characteristic Prediction Results

5.3. Output Optimization Results

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

4.2. Generation of Training Data in $D_{S}$