Double Deep Q-Network-Based Solution for the Dynamic Electric Vehicle Routing Problem

Taş, Mehmet Bilge Han; Özkan, Kemal; Sarıçiçek, İnci; Yazıcı, Ahmet

doi:10.3390/app16010278

This is an early access version, the complete PDF, HTML, and XML versions will be available soon.

Open AccessArticle

Double Deep Q-Network-Based Solution for the Dynamic Electric Vehicle Routing Problem

¹

Department of Computer Engineering, Eskisehir Osmangazi University, Eskisehir 26040, Türkiye

²

Department of Computer Engineering, Erzincan Binali Yildirim University, Erzincan 24100, Türkiye

³

The Center for Intelligent Systems Applications Research (CISAR), Eskisehir 26040, Türkiye

⁴

Department of Industrial Engineering, Eskisehir Osmangazi University, Eskisehir 26040, Türkiye

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2026, 16(1), 278; https://doi.org/10.3390/app16010278 (registering DOI)

Submission received: 1 December 2025 / Revised: 21 December 2025 / Accepted: 24 December 2025 / Published: 26 December 2025

(This article belongs to the Section Computing and Artificial Intelligence)

Download Versions Notes

Abstract

The Dynamic Electric Vehicle Routing Problem (D-EVRP) presents a framework that requires electric vehicles to meet demand with limited energy capacity. When dynamic demand flows and charging requirements are considered together, traditional methods cannot provide sufficient adaptation for real-time decision-making. Therefore, a learning-based approach was chosen to ensure that decision-making processes respond quickly to changing conditions. The solution utilizes a model with a Double Deep Q-Network (DDQN) architecture and a discrete valuation structure. Prioritized Experience Replay (PER) was implemented to increase model stability, allowing infrequent but effective experiments to contribute more to the learning process. The state representation is constructed using the vehicle’s location, battery level, load status, and current customer demands. Scalability is ensured by dividing customer locations into clusters using the K-means method, with each cluster handled by an independent representative. The approach was tested with real-world road data obtained from the Meşelik Campus of Osmangazi University in Eskişehir. Experiments conducted under different demand levels and data sizes have shown that the PER-assisted DDQN structure produces more stable and shorter route lengths in dynamic scenarios, but random selection, greedy method and genetic algorithm experience significant performance losses as dynamicity increases.

Keywords: electric vehicle routing problem; deep reinforcement learning; Double Deep Q-Network; dynamic requests; energy-aware routing

Share and Cite

MDPI and ACS Style

Taş, M.B.H.; Özkan, K.; Sarıçiçek, İ.; Yazıcı, A. Double Deep Q-Network-Based Solution for the Dynamic Electric Vehicle Routing Problem. Appl. Sci. 2026, 16, 278. https://doi.org/10.3390/app16010278

AMA Style

Taş MBH, Özkan K, Sarıçiçek İ, Yazıcı A. Double Deep Q-Network-Based Solution for the Dynamic Electric Vehicle Routing Problem. Applied Sciences. 2026; 16(1):278. https://doi.org/10.3390/app16010278

Chicago/Turabian Style

Taş, Mehmet Bilge Han, Kemal Özkan, İnci Sarıçiçek, and Ahmet Yazıcı. 2026. "Double Deep Q-Network-Based Solution for the Dynamic Electric Vehicle Routing Problem" Applied Sciences 16, no. 1: 278. https://doi.org/10.3390/app16010278

APA Style

Taş, M. B. H., Özkan, K., Sarıçiçek, İ., & Yazıcı, A. (2026). Double Deep Q-Network-Based Solution for the Dynamic Electric Vehicle Routing Problem. Applied Sciences, 16(1), 278. https://doi.org/10.3390/app16010278

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Double Deep Q-Network-Based Solution for the Dynamic Electric Vehicle Routing Problem

Abstract

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI