This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
Double Deep Q-Network-Based Solution for the Dynamic Electric Vehicle Routing Problem
by
Mehmet Bilge Han Taş
Mehmet Bilge Han Taş 1,2
,
Kemal Özkan
Kemal Özkan 1,3,*,
İnci Sarıçiçek
İnci Sarıçiçek 3,4
and
Ahmet Yazıcı
Ahmet Yazıcı 1,3
1
Department of Computer Engineering, Eskisehir Osmangazi University, Eskisehir 26040, Türkiye
2
Department of Computer Engineering, Erzincan Binali Yildirim University, Erzincan 24100, Türkiye
3
The Center for Intelligent Systems Applications Research (CISAR), Eskisehir 26040, Türkiye
4
Department of Industrial Engineering, Eskisehir Osmangazi University, Eskisehir 26040, Türkiye
*
Author to whom correspondence should be addressed.
Appl. Sci. 2026, 16(1), 278; https://doi.org/10.3390/app16010278 (registering DOI)
Submission received: 1 December 2025
/
Revised: 21 December 2025
/
Accepted: 24 December 2025
/
Published: 26 December 2025
Abstract
The Dynamic Electric Vehicle Routing Problem (D-EVRP) presents a framework that requires electric vehicles to meet demand with limited energy capacity. When dynamic demand flows and charging requirements are considered together, traditional methods cannot provide sufficient adaptation for real-time decision-making. Therefore, a learning-based approach was chosen to ensure that decision-making processes respond quickly to changing conditions. The solution utilizes a model with a Double Deep Q-Network (DDQN) architecture and a discrete valuation structure. Prioritized Experience Replay (PER) was implemented to increase model stability, allowing infrequent but effective experiments to contribute more to the learning process. The state representation is constructed using the vehicle’s location, battery level, load status, and current customer demands. Scalability is ensured by dividing customer locations into clusters using the K-means method, with each cluster handled by an independent representative. The approach was tested with real-world road data obtained from the Meşelik Campus of Osmangazi University in Eskişehir. Experiments conducted under different demand levels and data sizes have shown that the PER-assisted DDQN structure produces more stable and shorter route lengths in dynamic scenarios, but random selection, greedy method and genetic algorithm experience significant performance losses as dynamicity increases.
Share and Cite
MDPI and ACS Style
Taş, M.B.H.; Özkan, K.; Sarıçiçek, İ.; Yazıcı, A.
Double Deep Q-Network-Based Solution for the Dynamic Electric Vehicle Routing Problem. Appl. Sci. 2026, 16, 278.
https://doi.org/10.3390/app16010278
AMA Style
Taş MBH, Özkan K, Sarıçiçek İ, Yazıcı A.
Double Deep Q-Network-Based Solution for the Dynamic Electric Vehicle Routing Problem. Applied Sciences. 2026; 16(1):278.
https://doi.org/10.3390/app16010278
Chicago/Turabian Style
Taş, Mehmet Bilge Han, Kemal Özkan, İnci Sarıçiçek, and Ahmet Yazıcı.
2026. "Double Deep Q-Network-Based Solution for the Dynamic Electric Vehicle Routing Problem" Applied Sciences 16, no. 1: 278.
https://doi.org/10.3390/app16010278
APA Style
Taş, M. B. H., Özkan, K., Sarıçiçek, İ., & Yazıcı, A.
(2026). Double Deep Q-Network-Based Solution for the Dynamic Electric Vehicle Routing Problem. Applied Sciences, 16(1), 278.
https://doi.org/10.3390/app16010278
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article Access Statistics
For more information on the journal statistics, click
here.
Multiple requests from the same IP address are counted as one view.