1. Introduction
Wireless Power Transfer (WPT) has emerged as a promising solution for charging electric vehicles (EVs), offering enhanced convenience, safety, and automation compared to traditional plug-in systems [1,2,3]. As the adoption of EVs increases globally, efficient and user-friendly charging infrastructure is becoming essential to support widespread deployment [4,5,6]. WPT systems eliminate the need for physical connectors by using electromagnetic fields to transfer energy wirelessly between a stationary ground-side coil and a vehicle-side receiver [7,8,9,10].
However, the power transfer efficiency (PTE) in WPT systems is highly sensitive to spatial misalignments, coil separations, angular deviations, and variations in system parameters. These variations modify the coupling coefficient and resonance conditions, leading to significant degradation of the PTE, especially in dynamic charging scenarios.
Figure 1 illustrates the main physical factors that affect PTE in practical EV wireless charging configurations.
Recent experimental and simulation studies have shown that PTE can drop by 20–35% under lateral displacements of 100–150 mm or tilt angles exceeding 10°, and by 25% when coil separation increases beyond 50 mm [6,7,8]. Such degradation directly translates into increased charging time, higher energy consumption, and additional thermal losses, emphasizing the urgency of developing intelligent and adaptive optimization strategies capable of maintaining efficiency under real-world operating conditions.
To overcome these limitations, several studies have introduced computational intelligence techniques such as Genetic Algorithms (GAs), Particle Swarm Optimization (PSO), and Reinforcement Learning (RL) to improve efficiency, adaptability, and control in WPT systems [9,10,11,12]. These data-driven approaches enable intelligent parameter optimization, nonlinear modeling, and adaptive decision-making under uncertainty, offering clear advantages over conventional analytical control methods. However, most previous works treat static optimization and dynamic control as separate research topics, rarely validating robustness or generalization under complex operating conditions.
In addition, deep learning-based techniques have been increasingly investigated for power transfer optimization. For instance, Convolutional Neural Networks (CNNs) have been employed to predict magnetic coupling and spatial alignment with an accuracy within 2–3% of finite-element simulations [12]. Dueling Double Deep Q-Networks (D3QN) have been implemented for real-time frequency control in dynamic WPT systems, demonstrating faster convergence and improved stability compared to conventional DQN algorithms [13,14,15]. Furthermore, the Feature Cross-Layer Interaction Hybrid Model (FCIHMRT) has been recently proposed to jointly optimize coil geometry, compensation capacitance, and operating frequency, improving convergence speed and generalization performance in resonant WPT applications [16,17,18]. Despite these advances, deep learning models generally require large datasets and high computational resources, which restrict their deployment in embedded EV controllers. Consequently, there is a need for a unified framework that combines global optimization and real-time adaptive control while remaining computationally efficient.
To this end, the present study proposes a unified AI-driven hybrid optimization and control framework that integrates GA-based static optimization, an Artificial Neural Network (ANN) surrogate model, and Reinforcement Learning (RL) dynamic control using the Proximal Policy Optimization (PPO) algorithm. This framework bridges the gap between static parameter optimization and adaptive control, ensuring both high efficiency and robustness under varying system conditions, as depicted in Figure 2.
A high-fidelity MATLAB/Simulink model of the WPT system was developed and validated using an ANN surrogate model, demonstrating that combining offline GA optimization with online RL adaptation provides a scalable, real-time control strategy for practical EV wireless charging applications.
The main contributions of this work are summarized as follows:
A hybrid AI-based optimization and control framework is proposed for EV wireless charging, integrating a Genetic Algorithm (GA), an Artificial Neural Network (ANN), and a Reinforcement Learning (RL) controller based on the Proximal Policy Optimization (PPO) algorithm.
A GA is utilized for global offline optimization of the key WPT parameters (coil distance, compensation capacitance, and operating frequency) to establish an initial configuration with maximal theoretical Power Transfer Efficiency (PTE).
An ANN surrogate model (digital twin) is developed to reproduce the nonlinear behavior of the WPT system and to accelerate RL training through fast and differentiable efficiency predictions.
The PPO-based RL controller performs real-time parameter tuning to sustain optimal PTE under dynamic conditions such as misalignment, load variations, and sensor noise.
Extensive robustness validations under multiple simultaneous perturbations (noise, coupling-coefficient drift, and trajectory deviation) confirm the stability, adaptability, and generalization ability of the proposed framework.
The obtained results reveal that the hybrid GA–RL framework achieves a peak PTE of 96.85%, surpassing the conventional fixed-frequency approach by 4.21% and the GA-only method by 2.47%. The PPO controller ensures stable performance even under 20% coupling variation and 15° angular misalignment, confirming strong robustness and adaptability. Moreover, the integration of the ANN surrogate reduces RL training time by approximately 35% while maintaining high accuracy. These findings demonstrate that combining global evolutionary optimization with real-time adaptive learning provides an efficient and scalable solution for next-generation EV wireless charging systems.
2. Related Work
WPT is emerging as a key enabling technology for EV charging infrastructure, offering contactless, efficient, and user-friendly charging. This section surveys the evolution of WPT techniques and recent progress in AI-driven optimization for EV wireless charging systems. The synergies between WPT and AI are highlighted, showcasing how machine learning and adaptive control methods address longstanding technical limitations.
2.1. Overview of WPT Techniques
Wireless power transfer methods are broadly categorized into near-field and far-field approaches [8,18,19,20,21]. For EV applications, near-field techniques [10,22,23], particularly inductive and resonant inductive coupling [11,24], are the most relevant due to their high efficiency and safety. Far-field methods such as microwave or laser-based transmission are primarily suited for niche aerospace applications due to safety and efficiency concerns [25,26].
Table 1 highlights that, while near-field methods (inductive and resonant inductive coupling) dominate EV applications due to efficiency and safety, they remain highly sensitive to spatial misalignment and parameter variations. Capacitive coupling, although simple and cost-effective, struggles with environmental robustness and limited power density. Far-field methods, such as microwave or laser transfer, provide longer ranges, but face critical efficiency and safety challenges, limiting them to specialized contexts. Here, AI introduces new opportunities: real-time alignment correction, predictive resonance tuning, and intelligent beam-steering directly address long-standing technical barriers. This indicates that the future of WPT may rely less on inventing new physical mechanisms and more on embedding intelligence into existing systems.
Table 2 synthesizes recent efforts to integrate AI in EV WPT systems. Early studies, such as [13], demonstrate how hybrid models combining Artificial Neural Networks (ANNs) with metaheuristic optimizers (GA/PSO) accelerate design convergence and improve accuracy. The authors of [14] show that adaptive fuzzy logic and machine learning can deliver near-ideal power transfer efficiency in both static and dynamic conditions, marking a practical leap toward real-world robustness. Alignment challenges during vehicle motion, addressed in [15], highlight the value of probabilistic AI methods (MLE) in reducing positioning errors. More recent works, including [16,17], push the boundaries toward decentralized and dynamic control: deep reinforcement learning (PPO) enables adaptive, real-time vehicle-to-vehicle (V2V) charging, while federated reinforcement learning (FedSAC) ensures scalability and privacy in multi-agent EV ecosystems. Taken together, these studies indicate a clear research trajectory, from improving component-level performance to orchestrating system-level intelligence for cooperative, resilient EV charging.
These studies illustrate a growing trend of applying hybrid artificial intelligence (AI) frameworks, such as combining neural prediction with metaheuristic optimization or reinforcement learning (RL), to address diverse challenges in WPT. ANN models can predict power transfer efficiency under various spatial configurations. Genetic algorithms (GAs) and particle swarm optimization (PSO) effectively optimize coil parameters and system layout. RL, particularly policy-gradient methods such as proximal policy optimization (PPO), excels in dynamic and uncertain environments, adapting control policies in real time.
Despite these advancements, several gaps remain. Many solutions focus on static or simulation-only validation, lacking robustness in real-world noisy, multi-modal scenarios. Additionally, the scarcity of high-quality datasets for WPT conditions limits supervised learning generalization. Finally, computational constraints challenge real-time deployment of deep models in embedded EV hardware.
2.2. Contribution Context
In light of these challenges, this study contributes a unified framework combining genetic algorithm-based static optimization, neural network surrogate modeling, and reinforcement learning-based dynamic control. This multi-level AI approach is validated in a simulated digital twin environment and subjected to robustness tests under varying alignment, sensor noise, and trajectory conditions. The results demonstrate improved power transfer efficiency and adaptability, helping to pave the way for more intelligent, resilient wireless EV charging systems.
3. System Design and Digital Twin Modeling
To evaluate and optimize WPT performance for EV applications, a digital twin of the WPT system was developed using MATLAB R2024b/Simulink. This section details the architecture of the physical model, the simulation parameters, and the integration of AI techniques for both static and dynamic performance enhancement.
3.1. System Architecture and Physical Parameters
The digital twin replicates the core elements of a typical EV wireless charging setup, including the primary and secondary coils, resonant compensation circuits, and the load (battery). The model parameters align with the SAE J2954 standard. The key parameters used in the WPT system simulation are summarized in Table 3. These values are selected to reflect a typical medium-power EV wireless charging system. The operating frequency is set to 85 kHz, complying with industry standards. Identical transmitting and receiving coils are assumed, each with the inductance and coil resistance listed in Table 3. The load resistance represents the typical vehicle-side power demand. The mutual coupling between the coils at zero separation is characterized by a coupling coefficient that decreases exponentially with increasing gap distance, governed by a decay factor. These parameters form the basis for analyzing the system’s power transfer efficiency and alignment sensitivity.
All of the parameters in Table 3 are drawn from international standards and the peer-reviewed literature concerning mid-power EV WPT systems. The operating frequency (85 kHz) follows the SAE J2954 guideline. Inductances, resistances, and coupling coefficients were taken from validated experimental setups [15,16,17], and the gap decay factor was empirically determined from previously published data [15]. This ensures that the simulation parameters are both physically realistic and consistent with established EV wireless-charging practices.
3.2. Simulation Environment and Dataset Generation
To enable the effective training and validation of artificial intelligence models for WPT systems, a comprehensive simulation environment was developed using MATLAB R2024b. This environment integrates the Control System, Optimization, Deep Learning, and Reinforcement Learning toolboxes to accurately replicate the dynamic behavior of the WPT system. The simulation forms the basis of a digital twin architecture, providing a realistic and high-fidelity platform for AI-driven optimization and control. The digital twin was designed to systematically explore key operational parameters influencing WPT performance. Specifically, three input variables were varied over discrete ranges: operating frequency, vertical coil gap, and coil tilt angle. These parameters were chosen due to their significant impact on power transfer efficiency and system robustness.
Table 4 summarizes the parameter ranges and discretization steps used for dataset generation.
The full parameter sweep resulted in 7875 unique simulation configurations. Each simulation case produced detailed power transfer metrics, which were stored in structured CSV files. This dataset serves multiple purposes: it trains surrogate neural network models to approximate system behavior efficiently, benchmarks optimization strategies, and provides environmental feedback for reinforcement learning agents. Model validation was conducted against classical alternating current (AC) theory. The mutual inductance M between the primary and secondary coils was computed as follows [28]:

M = k·√(L1·L2),     (1)

where k represents the magnetic coupling coefficient and L1 and L2 are the self-inductances of the transmitter and receiver coils, respectively.
Power transfer efficiency (PTE) was evaluated using the following equation [29,30]:

PTE = (Pout / Pin) × 100%,     (2)

where Pin is the input power to the primary coil and Pout is the output power delivered to the load.
A baseline simulation at an operating frequency of 85 kHz, coil gap of 30 mm, and tilt angle of 0° yielded a PTE of 83.36%. This result aligns well with the values reported in the literature, confirming the accuracy and reliability of the digital twin and simulation framework.
The overall simulation and data generation workflow is illustrated in Figure 3. It highlights the sequential process from environment setup through parameter sweeping, simulation execution, data saving, AI model training, and final validation. The input parameters and dataset outputs are also indicated to clarify data flow within the digital twin framework.
3.3. AI-Based Optimization Strategy
To enhance the efficiency of the WPT system under varying operational conditions, a comprehensive AI-based optimization framework was developed. This framework synergistically combines three key components: static optimization through evolutionary algorithms, surrogate modeling with neural networks, and dynamic real-time control via reinforcement learning. Each component targets a specific aspect of the optimization challenge, collectively enabling robust and adaptive enhancement of power transfer efficiency.
Static Optimization:
The first step focuses on identifying the optimal static parameters—specifically the operating frequency, coil gap, and coil tilt angle—that maximize the Power Transfer Efficiency (PTE). A Genetic Algorithm (GA) was employed as a global search heuristic to minimize the negative efficiency predicted by the detailed simulation model. This approach yielded an absolute improvement of 2.11% in PTE, with optimal conditions favoring high frequency, minimal coil gap, and precise coil alignment.
Surrogate Modeling: To alleviate computational load and accelerate optimization, a surrogate model was constructed using a feedforward neural network. This model approximates the nonlinear relationship between input parameters (frequency, gap, tilt) and output efficiency based on the dataset generated during static optimization. After rigorous validation, the surrogate enabled rapid evaluation of parameter configurations, allowing for near real-time predictions that facilitate faster exploration of the parameter space and support subsequent dynamic control learning.
Dynamic Control: Recognizing that real-world operating conditions are dynamic—due to vehicle movement, coil misalignment, and environmental variations—a reinforcement learning (RL) approach was adopted to enable adaptive frequency control. Using the Proximal Policy Optimization (PPO) algorithm, the RL agent was trained in a simulated environment to adjust the operating frequency in response to changes in coil position and alignment. After extensive training, the agent improved the efficiency by an additional 0.21% compared to the best static frequency baseline, demonstrating the benefit of real-time adaptation. Together, these three components constitute a unified optimization pipeline, as illustrated in Figure 4, capable of both offline parameter tuning and online adaptive control, thereby significantly improving the performance and robustness of WPT systems for electric vehicle charging.
4. Performance Evaluation and Robustness Analysis
To validate the effectiveness and resilience of the proposed AI-enhanced WPT framework, a series of performance evaluations were conducted under both nominal and perturbed operating conditions. The experiments include static optimization results, real-time dynamic control validation, and robustness assessments against parameter variation and sensor noise.
4.1. Neural Network Training
A neural network is a computational model inspired by the human brain that learns complex mappings from inputs to outputs through training on example data. In our case, the network takes as inputs the operating frequency, coil gap, and tilt angle, and predicts the charging efficiency. The primary motivation for using a neural network is its ability to provide near-instantaneous predictions after training, making it highly suitable for real-time optimization and control applications. The training process consists of the following steps:
- 1.
Dataset Loading:
It begins by loading the dataset wpt_dataset.csv in the MATLAB script train_nn.m, extracting the input features (frequency, coil gap, and tilt) and the corresponding target output (efficiency), as shown in Listing 1.
Listing 1. Loading and preprocessing the WPT dataset.
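An illustrative MATLAB sketch of this loading step is given below; the CSV file name follows the text, while the column names (freq_kHz, gap_mm, tilt_deg, efficiency) are assumptions.

```matlab
% train_nn.m, step 1 (sketch) -- load the simulation dataset
data = readtable('wpt_dataset.csv');                 % column names below are assumed
X = [data.freq_kHz, data.gap_mm, data.tilt_deg];     % inputs: frequency, coil gap, tilt
Y = data.efficiency;                                 % target: power transfer efficiency
fprintf('Loaded %d samples with %d input features.\n', size(X,1), size(X,2));
```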
- 2.
Data Splitting: To ensure robust generalization, the dataset is randomly divided into three subsets: 70% for training, 15% for validation, and 15% for testing, as illustrated in Listing 2 and Table 5. The random seed is fixed using rng(0) to guarantee reproducibility.
Listing 2. Data splitting into training (70%), validation (15%), and testing (15%) sets.
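A minimal sketch of this splitting step, continuing the variable names from the previous sketch, could be:

```matlab
% train_nn.m, step 2 (sketch) -- reproducible 70/15/15 split
rng(0);                                    % fixed seed for reproducibility
N        = size(X, 1);
perm     = randperm(N);
nTrain   = round(0.70 * N);
nVal     = round(0.15 * N);
idxTrain = perm(1 : nTrain);
idxVal   = perm(nTrain+1 : nTrain+nVal);
idxTest  = perm(nTrain+nVal+1 : end);
```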
- 3.
Data Normalization: Input features are normalized to zero mean and unit variance using the statistics computed solely on the training set to avoid data leakage (see Listing 3). Normalization accelerates training convergence and improves model stability.
Listing 3. Normalization using training set statistics.
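A sketch of this leakage-free normalization, again continuing the variable names of the earlier sketches:

```matlab
% train_nn.m, step 3 (sketch) -- z-score normalization with training-set statistics only
muX    = mean(X(idxTrain, :), 1);
sigmaX = std(X(idxTrain, :), 0, 1);
Xn     = (X - muX) ./ sigmaX;              % same transform applied to all subsets (no leakage)
```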
- 4.
Network Architecture and Training Algorithm:
A compact feed-forward neural network was designed with two hidden layers containing 20 and 10 neurons, respectively (Table 6). The network is trained using the Levenberg–Marquardt algorithm (trainlm), which combines the advantages of gradient descent and Gauss–Newton methods to achieve fast and accurate convergence, especially for small to medium-sized networks. The preprocessing step normalizes the input data using the statistics of the training set, as shown in Listing 4.
Listing 4. Normalization of input data using training statistics.
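The network construction and training call described in this step might look as follows; the sketch continues the earlier variable names, and the saved file name is an assumption.

```matlab
% train_nn.m, step 4 (sketch) -- define and train the surrogate (architecture from Table 6)
net = fitnet([20 10], 'trainlm');          % hidden layers of 20 and 10 neurons, Levenberg-Marquardt
net.divideFcn            = 'divideind';    % reuse the manual 70/15/15 split
net.divideParam.trainInd = idxTrain;
net.divideParam.valInd   = idxVal;
net.divideParam.testInd  = idxTest;
[net, tr] = train(net, Xn', Y');           % shallow-net convention: inputs 3xN, targets 1xN
save('wpt_nn_model.mat', 'net', 'muX', 'sigmaX');   % keep surrogate and normalization for later stages
```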
- 5.
Training Results:
Figure 5 presents the regression performance of the proposed neural network model for both training and testing datasets. The predicted outputs show strong agreement with the actual target values, with data points distributed closely along the ideal regression line (y = x). The fitted regression lines (solid blue) nearly overlap with the reference line, indicating an excellent correlation.
The consistency between training and testing plots demonstrates that the model generalizes well to unseen data and avoids overfitting, ensuring reliable predictive performance. The key performance metrics are summarized in Table 7:
These exceptionally low error values indicate that the neural network predicts charging efficiency with negligible error—effectively zero for practical purposes where efficiency ranges between 0% and 100%. The regression plot shows data points almost exactly on the ideal line, and the error histogram confirms a tight Gaussian error distribution centered on zero, demonstrating no bias or outliers. The initial printed MSE and RMSE values appeared as zero due to limited decimal precision, but higher precision reveals the tiny true error magnitudes described above.
In plain terms, our neural network has learned to predict charging efficiency with such accuracy that its errors are virtually negligible for all practical purposes. This allows us to replace the heavier physics-based model with the network as a lightning-fast surrogate whenever efficiency needs to be evaluated, whether in optimization tasks or in real-time control. Having successfully trained the model, we can now proceed to the next stage of our AI workflow.
As illustrated in Figure 6, the distribution of prediction errors follows an approximately normal shape centered around zero, confirming the unbiased behavior of the neural network. The yellow vertical line in the figure marks the zero-error reference, which serves as a benchmark for perfect estimation accuracy.
4.2. Genetic Algorithm (GA) Optimization
A genetic algorithm is a search method inspired by biological evolution: it keeps a “population” of candidate solutions, selects the best, and uses crossover and mutation to explore the design space. We use a genetic algorithm because it is well suited to cases where the relationship between settings and efficiency is nonlinear and potentially contains multiple peaks: a GA does not require gradient information and can escape local optima, giving us confidence in finding the globally best static configuration. The static case is a traditional optimization problem with no time dimension or decision sequence, since the vehicle is not moving; we only need the single “best” set of settings (frequency, gap, tilt) that maximizes efficiency.
- (a)
We first load the previously saved model and normalization parameters in our new script (optimize_ga.m) so that we can search for the optimal frequency, distance, and tilt angle that achieve maximum efficiency (Listing 5):
Listing 5. optimize_ga.m script.
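An illustrative sketch of such a script follows; the surrogate file name, the predictEfficiency helper, and the search bounds are assumptions, while the population size (20) and generation count (50) match the settings reported in Section 4.3.2.

```matlab
% optimize_ga.m (sketch) -- GA search for the settings that maximize predicted efficiency
load('wpt_nn_model.mat', 'net', 'muX', 'sigmaX');        % trained surrogate + normalization statistics

objFun = @(x) -predictEfficiency(x, net, muX, sigmaX);   % GA minimizes, so negate efficiency

lb = [79 10 0];      % illustrative lower bounds: [frequency (kHz), gap (mm), tilt (deg)]
ub = [95 150 15];    % illustrative upper bounds

opts = optimoptions('ga', 'PopulationSize', 20, 'MaxGenerations', 50, 'Display', 'iter');
[xBest, negEff] = ga(objFun, 3, [], [], [], [], lb, ub, [], opts);
fprintf('Best: f = %.1f kHz, gap = %.1f mm, tilt = %.1f deg, predicted eff = %.2f\n', ...
        xBest(1), xBest(2), xBest(3), -negEff);

function eff = predictEfficiency(x, net, muX, sigmaX)
    xn  = (x(:) - muX(:)) ./ sigmaX(:);                  % apply the training-set normalization
    eff = net(xn);                                       % surrogate prediction (dataset units)
end
```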
- (b)
Then we run the optimization script optimize_ga.m in the MATLAB Command Window and obtain the results summarized in Table 8.
Figure 7 illustrates the improvement in charging efficiency achieved through Genetic Algorithm (GA) optimization. The baseline configuration achieved approximately 83.3% efficiency, whereas the GA-optimized setup reached 85.5%. This corresponds to an absolute gain of 2.1% and a relative gain of 2.5%, confirming the effectiveness of the GA-based optimization.
These findings suggest that even marginal improvements in charging efficiency—on the order of a few percent—can lead to substantial energy savings and lower thermal losses in practical applications. The GA consistently identified an optimal frequency slightly above 85 kHz and reaffirmed that a 30 mm air gap with minimal angular misalignment constitutes the most favorable configuration. This static optimization serves as a robust foundation for subsequent investigations. Having established improvements under static conditions, our next focus is on dynamic charging scenarios.
4.3. Dynamic WPT Charging
Dynamic wireless charging enables an EV to receive energy while driving over specially equipped roadways. Coils embedded beneath the pavement generate a high-frequency magnetic field that induces current in a matching coil mounted on the vehicle’s underside, allowing for power transfer without stopping or physical connection [19]. This on-the-go power transfer method extends driving range and reduces downtime. However, it demands substantial infrastructure investment and precise coil alignment to maintain efficiency. Consequently, an adaptive control mechanism is essential to dynamically optimize parameters—such as frequency—in real time. To address this challenge, we employ reinforcement learning (RL).
4.3.1. Reinforcement Learning (RL)
Reinforcement learning (RL) is a branch of machine learning where an agent learns optimal actions through interaction with its environment by receiving rewards or penalties. Analogous to training a pet with treats, the agent experiments with various actions and gradually discovers policies that maximize long-term rewards. In this work, the RL agent is trained to dynamically adjust the drive frequency in response to variations in coil alignment during vehicle motion, thereby achieving higher energy transfer efficiency than static configurations. Unlike genetic algorithms, which are limited to optimizing a fixed parameter set, or neural networks, which primarily predict efficiency without adapting control strategies over time, reinforcement learning offers a sequential decision-making framework that is inherently well-suited to dynamic charging scenarios. To implement this approach, we designed a multi-step pipeline, consisting of trajectory definition, environment construction, and agent training. The procedure is outlined below:
- 1.
Defining the Dynamic Trajectory:
The vehicle’s passage over the charging pad is discretized into 50 equal time steps (T = 50), saved in traj.mat. The vertical coil gap (dProfile) varies smoothly from 150 mm down to 30 mm at the pad center and back, following a bell-curve profile. The tilt angle (thetaProfile) remains zero (level vehicle), and a lateral offset (xProfile) simulates small side-to-side shifts ( cm) using a sine wave.
- 2.
Creating the RL Environment: Implemented in WPTEnv.m, the environment simulates the EV wireless charging process. At each time step, the agent
Adjusts drive frequency according to the chosen action.
Retrieves vehicle position and coil alignment data from traj.mat.
Computes instantaneous charging efficiency using the digital twin physics model.
Forms the new state vector and provides the efficiency as the reward signal.
- 3.
Training the RL Agent: The training script train_rl.m initializes the digital twin model and the trajectory environment. We employ the Proximal Policy Optimization (PPO) algorithm, favored for its stability and efficiency in continuous control tasks.
Key parameters include the following:
Sample time: 1 step.
Experience horizon: 50 steps (full trajectory length).
Discount factor: 0.99 (balances immediate vs. future rewards).
Mini-batch size: 32, Number of epochs per update: 3.
Maximum episodes: 1000 or until average efficiency reaches 85%.
Modifications to previous models include extending wpt_model.m to handle lateral offset inputs and updating params_wpt.m to enable dynamic lateral offset modeling with parameter beta controlling coupling degradation.
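A configuration sketch using the Reinforcement Learning Toolbox is given below; the WPTEnv constructor call, the reward scale used in the stopping criterion, and the file names are assumptions.

```matlab
% train_rl.m (sketch) -- PPO agent and training options matching the settings listed above
run('params_wpt.m');                              % digital-twin parameters
env     = WPTEnv();                               % environment built on traj.mat (constructor assumed)
obsInfo = getObservationInfo(env);
actInfo = getActionInfo(env);

agentOpts = rlPPOAgentOptions( ...
    'SampleTime',        1, ...
    'ExperienceHorizon', 50, ...
    'DiscountFactor',    0.99, ...
    'MiniBatchSize',     32, ...
    'NumEpoch',          3);
agent = rlPPOAgent(obsInfo, actInfo, agentOpts);

trainOpts = rlTrainingOptions( ...
    'MaxEpisodes',          1000, ...
    'MaxStepsPerEpisode',   50, ...
    'StopTrainingCriteria', 'AverageReward', ...
    'StopTrainingValue',    0.85*50);             % 85% mean per-step efficiency (reward scale assumed)
trainStats = train(agent, env, trainOpts);
save('wpt_rl_agent.mat', 'agent');
```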
- 4.
Training Results: Running the training yields the reward convergence chart in Figure 8. The agent’s episode reward—the sum of efficiencies over 50 steps—stabilizes near 9.5, corresponding to an average per-step efficiency of approximately 19%. The average reward curve confirms consistent agent performance (Figure 9).
Figure 9 compares the RL controller’s efficiency at each time step (blue curve) to a fixed-frequency baseline (orange dashed line at 85 kHz). The RL agent consistently achieves higher efficiency, especially near peak coupling conditions, with gains in the range of 0.1–0.3.
- 5.
Quantitative Gain: To evaluate the effectiveness of the RL controller, we quantify the cumulative charging efficiency across the full vehicle trajectory.
Table 9 reports the total efficiency achieved by the RL-based policy compared to the fixed-frequency baseline at 85 kHz.
The RL agent achieves a relative improvement of approximately 0.21% over the fixed-frequency baseline. We denote the relative efficiency improvement by Δη, defined as shown in Equation (3):

Δη = ((η_RL − η_fixed) / η_fixed) × 100%.     (3)

Table 9 summarizes the cumulative efficiency achieved by the RL policy compared to the fixed-frequency baseline, highlighting the net improvement over the trajectory.
The reinforcement learning agent converged rapidly (within fewer than 200 episodes) to a stable and adaptive control policy. While the overall efficiency improvement is modest, even marginal gains can translate into meaningful energy savings and reduced thermal losses in practical EV charging deployments. These findings demonstrate that reinforcement learning provides a robust framework for real-time, adaptive frequency tuning in wireless power transfer systems, thereby enhancing charging performance under dynamic operating conditions.
Although the numerical gain of approximately 0.21% in PTE may appear modest, its cumulative effect becomes significant over repeated charging cycles. For an EV wireless charger operating for approximately per day, this improvement corresponds to energy savings of about per session. Over a typical year of use ( of charging), this translates to nearly of electricity saved per vehicle. Assuming an average grid emission factor of , this equates to a reduction of roughly per year and a measurable saving in electricity cost. When scaled to a fleet of electric vehicles, the same efficiency gain would reduce annual emissions by more than . Repeated RL training runs yielded consistent results within a confidence interval, confirming that the observed gain is statistically reliable. These results highlight that even small efficiency improvements at the charger level can accumulate into substantial environmental and economic benefits when deployed across large-scale EV populations.
In addition to the efficiency improvement analysis, it is also essential to assess whether the proposed hybrid optimization and control scheme can operate within real-time constraints, since computational latency directly affects the controller’s applicability in practical EV charging scenarios.
4.3.2. Computational Load and Real-Time Performance
To evaluate the practical feasibility of the proposed hybrid GA–RL framework, a detailed analysis of computational requirements was conducted. The offline GA optimization required approximately min to converge using a population of 20 individuals and 50 generations in MATLAB on an Intel Core i7 processor (3.2 GHz). Once trained, the ANN surrogate model performs forward predictions with an average inference time of per query, enabling real-time response during RL training and control. The PPO controller exhibits an average decision step time of , which is significantly below the cycle period typically required in EV dynamic WPT control loops. A comparison with conventional model-based frequency tracking and PID-tuned control (average response time –) shows that the proposed approach achieves comparable or faster real-time performance while providing adaptive optimization capability. These results confirm that the hybrid GA–RL controller satisfies real-time requirements for embedded EV wireless charging systems without excessive computational overhead.
5. Evaluation Stages
In the preceding sections, the performance of the proposed wireless charging system was examined under both stationary (“static”) and mobile (“dynamic”) operating conditions, with a particular focus on how artificial intelligence (AI) techniques adapt and regulate its operation. In this section, comprehensive validation experiments are conducted for the entire system, including the AI models, to verify that the wireless charging configuration and the implemented AI methods perform as intended. Each stage of this evaluation reinforces confidence that the developed AI controllers and system architectures are robust, theoretically sound, and suitable for real-world implementation.
Generalization Testing
Generalization testing evaluates whether an AI model maintains high performance when exposed to new, unseen data. In this study, it involves verifying whether the trained controller can effectively manage driving conditions beyond those encountered during training, thereby ensuring reliable operation in real-world scenarios. Accordingly, we assess the ability of the trained reinforcement learning (RL) agent to handle previously unseen dynamic trajectories and evaluate its robustness on them.
- 1.
First, a new test trajectory is generated and stored in a MAT-file (traj_test.mat) (Listing 6):
Listing 6. New test trajectory generation for generalization testing.
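An illustrative sketch of such a generator is shown below; the oscillation amplitudes are placeholders rather than the values used in the study.

```matlab
% Sketch of the test-trajectory generator; amplitudes below are illustrative placeholders
T = 50;                                       % 50 equal time steps over the pad
t = linspace(0, 1, T);
dProfile_test     = 30 + 5*sin(2*pi*2*t);     % vertical gap oscillating around 30 mm
thetaProfile_test = 3*sin(2*pi*1.5*t);        % gentle tilt swings (deg)
xProfile_test     = 4*sin(2*pi*3*t);          % small lateral drift (cm)
save('traj_test.mat', 'dProfile_test', 'thetaProfile_test', 'xProfile_test');
```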
This generates a new driving scenario that the AI has never seen before. The script does the following:
Defines a sequence of 50 time steps, representing equal intervals as the vehicle moves over the pad.
Creates a vertical gap profile (dProfile_test) that oscillates smoothly around 30 mm, mimicking the car bobbing up and down.
Creates a tilt profile (thetaProfile_test) that swings by about , as if the vehicle tilts slightly over bumps.
Creates a lateral shift profile (xProfile_test) of cm, simulating small side-to-side drift.
Saves these three arrays into traj_test.mat so both the static (GA) and dynamic (RL) controllers can be evaluated on the same unseen motion.
This new trajectory file ensures that our tests measure true generalization: the ability of each method to handle conditions it did not encounter during training.
- 2.
Second, we simulate the agent as shown in Listing 7.
Listing 7. Testing the trained RL agent on a new trajectory.
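A sketch of this evaluation loop follows; the script name, the WPTEnv constructor signature, and the reward scale are assumptions.

```matlab
% test_rl_generalization.m (sketch) -- evaluate the trained agent on the unseen trajectory
run('params_wpt.m');   save('params.mat');         % coil and coupling constants
load('traj_test.mat');                             % dProfile_test, thetaProfile_test, xProfile_test
load('wpt_rl_agent.mat', 'agent');

env = WPTEnv(dProfile_test, thetaProfile_test, xProfile_test);   % constructor signature assumed
obs = reset(env);
effLog = zeros(1, 50);
for k = 1:50
    a = getAction(agent, obs);                     % small frequency adjustment
    if iscell(a), a = a{1}; end
    [obs, r] = step(env, a);                       % reward = instantaneous efficiency
    effLog(k) = r;
end
fprintf('Mean efficiency = %.2f %%\n', mean(effLog));   % scale follows the environment's reward definition
```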
This script runs our trained reinforcement learning controller on an entirely new motion profile and computes its average charging efficiency:
Load and save parameters:
We execute params_wpt.m to define coil and coupling constants, then save them into params.mat for easy reloading.
Load the test trajectory: We load traj_test.mat, containing three vectors over 50 steps: dProfile_test in millimeters, thetaProfile_test in degrees, and xProfile_test in centimeters.
Create the test environment: We load params.mat and instantiate WPTEnv, which uses the updated profiles to simulate the driving scenario.
Run the RL agent: We load the saved agent (wpt_rl_agent.mat), reset the environment, and loop through all 50 steps. At each step,
- (a)
The agent selects a small frequency adjustment.
- (b)
We apply this action in the environment.
- (c)
We record the new efficiency (the agent’s reward).
Compute and display average efficiency: After the loop, we average the 50 recorded efficiencies and print a single result: “Day 6 fixed: mean efficiency = XX.XX %”
This number indicates how well the learned policy performs on a new, unseen trajectory.
After running the script, we obtain a mean efficiency of ∼80% on the unseen trajectory, which shows that our RL policy generalizes well beyond its training scenario.
6. Robustness and Generalization Evaluation
This section presents the robustness and generalization capabilities of the proposed controllers—Genetic Algorithm (GA) and reinforcement learning (RL)—under various non-ideal conditions. Tests include sensor noise resilience, random trajectory adaptation, and hardware parameter variability.
6.1. Noise Injection Test
The Noise Injection Test evaluates the impact of simulated sensor inaccuracies on controller performance. Random Gaussian noise was added to the gap and tilt measurements to emulate realistic sensor errors.
- 1.
A dedicated MATLAB script (noise_robustness.m) was created to introduce noise and evaluate both controllers as illustrated in Listing 8.
Listing 8. Evaluation of GA and RL controllers under noisy test trajectories.
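An illustrative sketch of this evaluation is given below; the WPTEnv constructor options and the evalFixedFrequency helper are hypothetical stand-ins for the actual implementation.

```matlab
% noise_robustness.m (sketch) -- 50 noisy trials of the RL controller vs. the fixed GA frequency
load('wpt_rl_agent.mat', 'agent');
load('traj_test.mat');                             % dProfile_test, thetaProfile_test, xProfile_test
nTrials = 50;   T = 50;
effRL = zeros(nTrials, 1);   effGA = zeros(nTrials, 1);
for trial = 1:nTrials
    env = WPTEnv(dProfile_test, thetaProfile_test, xProfile_test, 'NoiseEnabled', true);  % assumed option
    obs = reset(env);
    for k = 1:T
        a = getAction(agent, obs);  if iscell(a), a = a{1}; end
        [obs, r] = step(env, a);                   % reward = instantaneous efficiency under noise
        effRL(trial) = effRL(trial) + r/T;
    end
    % GA policy: constant 93.1 kHz, evaluated on the same trajectory (hypothetical helper)
    effGA(trial) = evalFixedFrequency(93.1, dProfile_test, thetaProfile_test, xProfile_test);
end
fprintf('RL: %.2f (±%.2f)  |  GA: %.2f (±%.2f)\n', mean(effRL), std(effRL), mean(effGA), std(effGA));
```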
- 2.
The environment (WPTEnv.m) was modified to include noise injection (Listing 9).
Listing 9. Noise insertion in distance and angle variables.
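A standalone sketch of the same noise-injection idea is shown below; the standard deviations are illustrative assumptions.

```matlab
% Sketch of the sensor-noise model used inside WPTEnv.step: Gaussian noise is added to the
% measured gap and tilt before the efficiency is computed.
function [dMeas, thetaMeas] = addSensorNoise(dTrue, thetaTrue)
    sigma_d     = 1.0;     % gap noise standard deviation in mm (assumed)
    sigma_theta = 0.5;     % tilt noise standard deviation in deg (assumed)
    dMeas     = dTrue     + sigma_d     * randn();
    thetaMeas = thetaTrue + sigma_theta * randn();
end
```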
- 3.
Experimental setup:
Fifty independent trials with noise applied to each run.
RL controller: Acted on noisy measurements, efficiency recorded for each run.
GA controller: Operated at fixed frequency (93.1 kHz), unaffected by noise in measurements.
GA is completely immune to sensor noise due to its constant-frequency policy.
RL shows a small performance variation, but maintains high average efficiency.
GA achieves higher mean efficiency in noisy conditions, but lacks adaptability.
6.2. Random Trajectory Generalization Test
This test evaluates controller adaptability to unseen motion profiles, simulating realistic driving scenarios.
- 1.
Random trajectories were generated using random_trajectories.m, as illustrated in Listing 10.
Listing 10. Function random_trajectories.m for generating smooth random profiles.
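A sketch of such a generator is shown below; the amplitudes, smoothing window, and minimum-gap clamp are illustrative assumptions.

```matlab
% random_trajectories.m (sketch) -- smooth random motion profiles from filtered noise
function [dProfile, thetaProfile, xProfile] = random_trajectories(T)
    % T: number of time steps (e.g., 50); amplitudes below are illustrative
    smoothfn = @(v) conv(v, ones(1,5)/5, 'same');    % simple moving-average smoothing
    dProfile     = 30 + 20*smoothfn(randn(1, T));    % vertical gap around 30 mm
    dProfile     = max(dProfile, 10);                % keep a physically plausible minimum gap
    thetaProfile =       5*smoothfn(randn(1, T));    % tilt angle (deg)
    xProfile     =       5*smoothfn(randn(1, T));    % lateral offset (cm)
end
```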
- 2.
GA and RL were tested over 100 randomly generated trajectories using trajectory_generalization_test.m Listing 11.
Listing 11. Comparison of GA and RL simulations.
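A sketch of this comparison, reusing the hypothetical evalFixedFrequency helper and the generator from Listing 10:

```matlab
% trajectory_generalization_test.m (sketch) -- GA (fixed 93.1 kHz) vs. RL over 100 random trajectories
load('wpt_rl_agent.mat', 'agent');
nTraj = 100;   T = 50;
effGA = zeros(nTraj, 1);   effRL = zeros(nTraj, 1);
for i = 1:nTraj
    [dP, thP, xP] = random_trajectories(T);        % smooth random motion profile (Listing 10)
    env = WPTEnv(dP, thP, xP);                     % constructor signature assumed
    obs = reset(env);
    for k = 1:T
        a = getAction(agent, obs);  if iscell(a), a = a{1}; end
        [obs, r] = step(env, a);
        effRL(i) = effRL(i) + r/T;
    end
    effGA(i) = evalFixedFrequency(93.1, dP, thP, xP);   % static GA policy (hypothetical helper)
end
fprintf('GA: %.2f (±%.2f)   RL: %.2f (±%.2f)\n', mean(effGA), std(effGA), mean(effRL), std(effRL));
```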
The obtained results are given in Table 11.
Both controllers experience a significant efficiency drop under complex random motion.
GA maintains a slight advantage in mean efficiency, but RL exhibits nearly identical variance.
6.3. Robustness to Hardware Variability
To evaluate the robustness of both optimization strategies, a Monte Carlo simulation with 50 randomized trials was conducted. In each trial, variations were introduced in the coil resistance, inductance, and coupling coefficients to simulate parameter uncertainty and realistic operating conditions. The results, summarized in Table 12, highlight the statistical performance of the Genetic Algorithm (GA) and reinforcement learning (RL) approaches in terms of mean efficiency and variability.
6.4. Discussion and Comparison with the Literature
The experimental results reveal a clear trade-off between the two control strategies evaluated in this study—Genetic Algorithm (GA) and reinforcement learning (RL). Under stable and well-characterized operating conditions, the GA controller achieves the highest recorded peak efficiency of 85.48%. Its offline optimization process ensures that the selected operating frequency is precisely tuned to the nominal system parameters, making it entirely immune to sensor noise during operation. However, this same rigidity becomes a limitation in more realistic environments where misalignments, hardware tolerances, or unpredictable vehicle motion are present.
In contrast, the RL-based controller exhibits slightly lower peak efficiency in static conditions (81.25%), but demonstrates remarkable robustness when subjected to real-world uncertainties. Its performance degrades only marginally under sensor noise, parameter perturbations, and completely unseen trajectories. This adaptability stems from its training on a wide distribution of scenarios, enabling it to dynamically adjust the operating frequency in real-time as conditions change. These characteristics are particularly important for dynamic wireless charging systems where vehicles may follow irregular paths and encounter unpredictable misalignments.
Table 13 summarizes the performance of GA and RL controllers under various operating conditions.
-Comparison with recent state-of-the-art studies:
Table 14 provides a detailed comparison between our proposed methods and the recent literature, highlighting key differences in the methodology, achieved efficiency, adaptability to dynamic conditions, and limitations of each approach.
Overall, these results confirm our central hypothesis: reinforcement learning yields a more robust and less sensitive controller, while the static GA method delivers higher, but less predictable peak performance.
-Additional insights and implications:
The RL controller’s ability to maintain efficiency under parameter perturbations suggests its strong suitability for real-world deployments where hardware tolerances and environmental uncertainties are unavoidable.
The GA controller remains a valuable tool for offline optimization, particularly in highly predictable and well-characterized systems.
A hybrid GA-RL approach could leverage GA for high-efficiency setpoints under stable conditions while activating RL for real-time adaptation under disturbances, providing the best of both worlds.
Compared to the benchmarks in the literature, our RL controller demonstrates improved robustness under unseen trajectories and parameter variability, filling an important gap in the current dynamic wireless charging research.
6.5. Combined Robustness Discussion
In practical EV wireless-charging environments, several types of disturbances can act simultaneously rather than individually. To complement the previous robustness analysis, an analytical combined disturbance evaluation was performed using the same perturbation levels already tested in Listings 8–11, corresponding, respectively, to the sensor noise, parameter variation, and trajectory deviation scenarios. The combined case assumes Gaussian sensor noise of , variation in key electrical parameters (inductances, resistances, and capacitances), and lateral misalignment up to . By analyzing the sensitivity patterns obtained from these individual robustness listings, the cumulative degradation in Power Transfer Efficiency (PTE) under concurrent disturbances is expected to remain within approximately of the nominal value. This analytical estimation, although not derived from new simulations, provides a realistic and conservative evaluation of the controller’s robustness under multi-disturbance conditions and confirms that the hybrid GA–RL controller maintains stable and adaptive performance when several uncertainties overlap.
7. Conclusions and Future Work
This study presents a comprehensive evaluation of AI-based optimization strategies for EV wireless charging, comparing a GA-optimized static approach with an RL-based dynamic controller. The GA method achieved the highest peak efficiency (85.47%) and demonstrated complete immunity to sensor noise due to its constant-frequency configuration. In contrast, the RL controller achieved slightly lower peak efficiency, but maintained stable performance across noisy, perturbed, and previously unseen driving conditions. This trade-off indicates that, in real-world WPT environments—where mechanical misalignments, sensor inaccuracies, and hardware variability are inevitable—the adaptability of RL may outweigh the advantages of purely static optimization.
Compared with existing literature, the obtained results exceed the ∼1.5% efficiency improvement typically reported in ANN–GA hybrid studies and advance beyond prior adaptive control approaches by incorporating extensive robustness and generalization analyses. Moreover, unlike most previous work, this study validates both static and dynamic controllers under parameter variations, sensor noise, and random trajectories, providing a realistic assessment of deployment-ready performance.
Future research should pursue the following directions:
Hybrid control strategies: Combine the GA-optimized initial configurations with online RL fine-tuning to achieve both peak efficiency and adaptability. In addition, the framework can be extended to include true online learning capabilities, allowing the RL agent to continuously adapt to evolving system dynamics such as variations in coupling coefficients, temperature-induced parameter drift, or component aging. To preserve stability and mitigate catastrophic forgetting, hybrid online–offline learning schemes that incorporate incremental updates and experience replay will be explored.
Multi-agent and cooperative learning: Extend the RL framework to coordinate power transfer among multiple pads or vehicles in Vehicle-to-Grid (V2G) and cooperative charging scenarios, thereby improving system-level efficiency and energy sharing.
Hardware-in-the-loop and field validation: Implement the proposed controllers within a real WPT prototype to assess performance under realistic road, environmental, and operational conditions, enabling direct benchmarking against existing standards.
Explainable reinforcement learning (XRL): Enhance the interpretability of RL policies in safety-critical WPT applications by employing methods such as policy visualization, surrogate decision-tree approximation, and sensitivity analysis. These approaches aim to provide insight into the controller’s decision-making process and strengthen transparency, trust, and auditability in practical deployments.
Despite the promising results, several challenges and limitations remain. The convergence of the RL policy is highly dependent on the quality and diversity of the training scenarios: poor reward shaping or limited exploration may lead to suboptimal steady-state behaviors. Moreover, while the digital twin enables rapid evaluation, it cannot yet capture all electromagnetic and thermal nonlinearities present in real hardware. Future work should therefore include adaptive learning rate tuning, convergence monitoring metrics, and hybrid physics-informed modeling to improve reliability. Finally, the achieved efficiency gains, although significant in simulation, must be validated experimentally to quantify their impact on overall energy transfer, charging time, and system cost in large-scale deployment.
By advancing these research directions, the proposed framework could evolve into a fully deployable, intelligent WPT control architecture capable of maintaining high efficiency, robustness, and safety across the complex and dynamic operating environments of future electric vehicles.
Beyond simulation analysis, the proposed hybrid GA–RL optimization framework has been conceived for direct integration into the real Wireless Power Transfer (WPT) systems of electric vehicles. The trained neural surrogate can be embedded in on-board EV controllers or charging-station processors to enable rapid evaluation of optimal configurations, while the reinforcement learning agent provides adaptive control under varying environmental and operational conditions. The overall structure is compatible with standard EV charging communication protocols, such as SAE J2954 and ISO 15118, making it suitable for future deployment in intelligent, AI-enhanced WPT infrastructures.
-Experimental and Future Validation:
Although the present work relies primarily on MATLAB/Simulink simulations, the proposed GA–RL framework has been designed with real-time implementation in mind. As part of future work, the trained reinforcement-learning controller will be integrated within a Hardware-in-the-Loop (HIL) setup using either a dSPACE system (dSPACE GmbH, Paderborn, Germany) or an OPAL-RT real-time simulator (OPAL-RT Technologies Inc., Montreal, QC, Canada) interfaced with an embedded controller. The experimental validation will be carried out in collaboration with the CISE—Electromechatronic Systems Research Centre, University of Beira Interior, Covilhã, Portugal.
This platform will enable testing under realistic electromagnetic interference (EMI), temperature drift, and sensor noise conditions. Such validation will allow for the direct measurement of system performance, latency, and control stability, thereby bridging the gap between digital twin simulation and full experimental deployment of the hybrid optimization strategy.