Energy-Aware Spatio-Temporal Multi-Agent Route Planning for AGVs
Abstract
1. Introduction
2. Related Works
- 1.
- Classical heuristic and graph-search methods, including A*, Dijkstra, D*, D* Lite, Theta*, Lifelong Planning A*, etc. They guarantee the existence of an optimal or approximate route. However, these methods suffer from high computational complexity when scaled to large or dynamic environments and generally do not account for the remaining AGV battery charge.
- 2.
- Local optimization methods, including the DWA, Artificial Potential Field (APF), Velocity Obstacle (VO), and their improved variants. They may implicitly account for energy consumption through constraints on maximum speed or acceleration.
- 3.
- Evolutionary and swarm-intelligence methods, such as Genetic Algorithms (GA), ACO, Particle Swarm Optimization (PSO), and GWO, etc. These approaches perform well for multi-criteria tasks but often require many iterations and are unstable in dynamic environments. Therefore, they are effective for real-time tasks but are prone to local minima and do not guarantee a global optimum. They can incorporate energy as an optimization criterion, but in a simplified manner, without dynamic discharge modeling or constraints on returning to the charging station.
- 4.
- Machine Learning and Reinforcement Learning methods, such as Deep Q-Learning, DDPG, Proximal Policy Optimization (PPO), MADDPG, and Graph Neural Networks (GNNs), for multi-agent planning. These methods enable AGVs to adapt their behavior to environmental changes but require large amounts of training data and substantial computational resources. They can incorporate SoC into the state space. However, literature analysis shows that these methods typically model only a minimum SoC constraint after which the AGV must return to charging and do not address battery degradation or inter-cell charge variation.
- 5.
- Hybrid methods integrate global planners (e.g., A* and D*) with local obstacle-avoidance techniques (e.g., DWA and APF) or combine heuristic methods with reinforcement learning or evolutionary algorithms. They strive to balance speed, adaptability, and accuracy, yet most studies do not jointly consider energy constraints, time efficiency, and the presence of live dynamic obstacles. In such methods, battery considerations are typically limited to pre-planned charging-point routing or are completely ignored in agent-coordination scenarios.
3. Materials and Methods
- the number of grid cells into which the map of the premises, along with the route that the AGV must travel, is divided;
- stationary obstacles where AGV transportation is prohibited;
- dynamic obstacles, which are categorized into human (personnel) and inanimate, such as boxes and pallets, etc., that accidentally appear on the AGV’s path and can be rearranged;
- the remaining charge of the AGV’s battery;
- the time spent traveling through the cells.
- —dynamic transit cost of moving from node u to v at time step t;
- —estimated energy consumption for the transition from u to v;
- —traversal time for the transition from u to v, including base time, turns, and delays due to dynamic obstacles;
- —static obstacle indicator at target cell v;
- —inanimate dynamic obstacle indicator at cell v at time t;
- —human dynamic obstacle indicator at cell v at time t;
- —weighting factors;
- —sufficiently large penalty constant (e.g., ).
3.1. Algorithms for Finding the Shortest AGV Route
3.1.1. A* Algorithm
- —estimated total path cost through node u;
- —accumulated path cost from the start node to node u;
- —heuristic estimate of the cost from node u to the target node.
- 1.
- Place the starting vertex in the open list; while the open list is not empty, select the vertex u with the smallest . If u is the target vertex, terminate the search and move it to the closed list.
- 2.
- For each neighbor v of node u, we calculate the accumulated cost to reach the neighbor node v via node u as (3):When the time index is clear from context or the environment state is fixed (e.g., during initial A* planning on a static snapshot), we write for brevity.
- 3.
- Update the priority of affected nodes in the open list and continue by expanding the node with the smallest evaluation function (2).
3.1.2. D* and D* Lite Algorithms
- 1.
- Initialization: , . Add goal to the priority queue.
- 2.
- While start is inconsistent or start-key < queue-key, remove nodes with a minimum key; update and neighbors, maintaining consistency.
- 3.
- During agent movement: if new obstacles appear or weights change, update only for affected nodes; recalculate path only for inconsistent nodes.
3.1.3. M* Algorithm
- 1.
- Compute individual shortest paths using .
- 2.
- Detect conflicts along these paths.
- 3.
- If a conflict is found, temporarily couple only the conflicting agents and replan locally; otherwise, execute the next step.
- 4.
- Repeat until all agents reach their goals.
3.2. Setting Constraints to Find the Shortest AGV Route
3.2.1. Grid Discretization
3.2.2. Stationary Obstacles
3.2.3. The Remaining Charge of the AGV Battery
- —set of indices for which the matrix B has defined values greater than or equal to ;
- —minimum threshold value to include the entry in ( is the minimum technologically permissible discharge threshold for AGV Formica 1, below which the cell is inaccessible).
3.2.4. Dynamic Obstacles
- —traversal cost of moving from cell u to cell v at time t;
- —penalty coefficient for inanimate dynamic obstacles;
- —occupancy indicator of inanimate obstacles at cell v at time t.
3.2.5. The Time Spent Traveling Through Cells
3.3. Proposed Methodology for Finding the Shortest AGV Route
- s—the starting node;
- q—the goal node;
- —accumulated cost to reach node u from the start node;
- —heuristic estimate from s to the goal u (e.g., Manhattan distance, );
- —the evaluation function for the priority queue;
- —cost of moving from node to neighbor from u to v.
3.4. Multi-Agent Path Planning with Dynamic Obstacles
- —transition cost from u to v at time t;
- —feasibility indicator of cell v, see (7);
- —inanimate-obstacle indicator at cell v at time t;
- —penalty coefficient for inanimate obstacles (in this research, we use ).
- —set of agents in vertex or edge conflict;
- —current agent position;
- —other agent position;
- —adjacency condition.
3.5. The Submodule Architecture
4. Experimental Setup
- Empirically determined parameters:
- the coefficient of influence of the turn, which takes into account the delay of the AGV when changing direction;
- the coefficient of influence of dynamic obstacles, which estimates the additional time due to the proximity of moving objects;
- the basic time of passage of a free cell, determined based on the average speed of the AGV.
- Conditionally accepted parameters:
- cell side length, chosen for the unification of calculations;
- the minimum allowable battery charge level, taken as a safety threshold;
- penalties for passing through cells with moving objects, determined as conditional values for modeling the difficulty of passage.
5. Results
6. Discussion
7. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A

References
- Banik, S.; Banik, S.C.; Mahmud, S.S. Path Planning Approaches in Multi-robot System: A Review. Eng. Rep. 2025, 7, e13035. [Google Scholar] [CrossRef]
- Fragapane, G.; de Koster, R.; Sgarbossa, F.; Strandhagen, J.O. Planning and control of autonomous mobile robots for intralogistics: Literature review and research agenda. Eur. J. Oper. Res. 2021, 294, 405–426. [Google Scholar] [CrossRef]
- AbuJabal, N.; Baziyad, M.; Fareh, R.; Brahmi, B.; Rabie, T.; Bettayeb, M. A Comprehensive Study of Recent Path-Planning Techniques in Dynamic Environments for Autonomous Robots. Sensors 2024, 24, 8089. [Google Scholar] [CrossRef] [PubMed]
- Jayalakshmi, K.P.; Nair, V.G.; Sathish, D. A Comprehensive Survey on Coverage Path Planning for Mobile Robots in Dynamic Environments. IEEE Access 2025, 13, 60158–60185. [Google Scholar] [CrossRef]
- Sang, W.; Yue, Y.; Zhai, K.; Lin, M. Research on AGV Path Planning Integrating an Improved A* Algorithm and DWA Algorithm. Appl. Sci. 2024, 14, 7551. [Google Scholar] [CrossRef]
- Wu, B.; Chi, X.; Zhao, C.; Zhang, W.; Lu, Y.; Jiang, D. Dynamic Path Planning for Forklift AGV Based on Smoothing A* and Improved DWA Hybrid Algorithm. Sensors 2022, 22, 7079. [Google Scholar] [CrossRef]
- Yin, X.; Cai, P.; Zhao, K.; Zhang, Y.; Zhou, Q.; Yao, D. Dynamic Path Planning of AGV Based on Kinematical Constraint A* Algorithm and Following DWA Fusion Algorithms. Sensors 2023, 23, 4102. [Google Scholar] [CrossRef] [PubMed]
- Gong, X.; Gao, Y.; Wang, F.; Zhu, D.; Zhao, W.; Wang, F.; Liu, Y. A Local Path Planning Algorithm for Robots Based on Improved DWA. Electronics 2024, 13, 2965. [Google Scholar] [CrossRef]
- Sun, Y.; Yuan, Q.; Gao, Q.; Xu, L. A Multiple Environment Available Path Planning Based on an Improved A* Algorithm. Int. J. Comput. Intell. Syst. 2024, 17, 172. [Google Scholar] [CrossRef]
- Xiao, J.; Yu, X.; Sun, K.; Zhou, Z.; Zhou, G. Multiobjective path optimization of an indoor AGV based on an improved ACO-DWA. Math. Biosci. Eng. 2022, 19, 12532–12557. [Google Scholar] [CrossRef]
- Pratama, A.Y.; Ariyadi, M.R.; Tamara, M.N.; Purnomo, D.S.; Ramadhan, N.A.; Pramujati, B. Design of Path Planning System for Multi-Agent AGV Using A* Algorithm. In Proceedings of the 2023 International Electronics Symposium (IES), Denpasar, Indonesia, 8–10 August 2023; pp. 335–341. [Google Scholar] [CrossRef]
- Sun, M.; Lu, L.; Ni, H.; Wang, Y.; Gao, J. Research on dynamic path planning method of moving single target based on visual AGV. SN Appl. Sci. 2022, 4, 86. [Google Scholar] [CrossRef]
- Daniel, K.; Nash, A.; Koenig, S.; Felner, A. Theta*: Any-Angle Path Planning on Grids. Artif. Intell. 2010, 175, 87–117. [Google Scholar] [CrossRef]
- Koenig, S.; Likhachev, M. Lifelong Planning A*. Artif. Intell. 2004, 155, 93–146. [Google Scholar] [CrossRef]
- Karaman, S.; Frazzoli, E. Sampling-Based Algorithms for Optimal Motion Planning. Int. J. Robot. Res. 2011, 30, 846–894. [Google Scholar] [CrossRef]
- Jin, J.; Zhang, Y.; Zhou, Z.; Jin, M.; Yang, X.; Hu, F. Conflict-based search with D* lite algorithm for robot path planning in unknown dynamic environments. Comput. Electr. Eng. 2023, 105, 108473. [Google Scholar] [CrossRef]
- Zhu, X.; Yan, B.; Yue, Y. Path Planning and Collision Avoidance in Unknown Environments for USVs Based on an Improved D* Lite. Appl. Sci. 2021, 11, 7863. [Google Scholar] [CrossRef]
- Liu, N.; Shen, S.; Kong, X.; Zhang, H.; Bräunl, T. Cooperative Hybrid Multi-Agent Pathfinding Based on Shared Exploration Maps. J. Intell. Robot. Syst. 2025, 111, 97. [Google Scholar] [CrossRef]
- Feng, J.; Yang, Y.; Zhang, H.; Sun, S.; Xu, B. Path Planning and Trajectory Tracking for Autonomous Obstacle Avoidance in Automated Guided Vehicles at Automated Terminals. Axioms 2024, 13, 27. [Google Scholar] [CrossRef]
- Ren, Z.; Rathinam, S.; Likhachev, M.; Choset, H. Multi-Objective Path-Based D* Lite. IEEE Robot. Autom. Lett. 2022, 7, 3318–3325. [Google Scholar] [CrossRef]
- Chen, X.; Chen, C.; Wu, H.; Postolache, O.; Wu, Y. An improved artificial potential field method for multi-AGV path planning in ports. Intell. Robot. 2025, 5, 19–33. [Google Scholar] [CrossRef]
- Kasaura, K.; Yonetani, R.; Nishimura, S. Elevating Priorities for an Efficient and Complete Lifelong Multi-AGV Pathfinding on Roadmaps. Robot. Auton. Syst. 2026, 197, 105295. [Google Scholar] [CrossRef]
- Liu, R. Research on Optimization of the AGV Shortest-Path Model and Obstacle Avoidance Planning in Dynamic Environments. Math. Probl. Eng. 2022, 2022, 2239342. [Google Scholar] [CrossRef]
- Tao, M.; Li, Q.; Yu, J. Multi-Objective Dynamic Path Planning with Multi-Agent Deep Reinforcement Learning. J. Mar. Sci. Eng. 2025, 13, 20. [Google Scholar] [CrossRef]
- Guo, Z.; Xia, Y.; Li, J.; Liu, J.; Xu, K. Hybrid Optimization Path Planning Method for AGV Based on KGWO. Sensors 2024, 24, 5898. [Google Scholar] [CrossRef] [PubMed]
- Bai, Y.; Ding, X.; Hu, D.; Jiang, Y. Research on Dynamic Path Planning of Multi-AGVs Based on Reinforcement Learning. Appl. Sci. 2022, 12, 8166. [Google Scholar] [CrossRef]
- Chen, P.; Pei, J.; Lu, W.; Li, M. A deep reinforcement learning based method for real-time path planning and dynamic obstacle avoidance. Neurocomputing 2022, 497, 64–75. [Google Scholar] [CrossRef]
- Tsmots, I.; Teslyuk, V.; Opotiak, Y.; Parcei, R.; Zinko, R. The basic architecture of mobile robotic platform with intelligent motion control system and data transmission protection. Ukr. J. Inf. Technol. 2021, 3, 74–80. [Google Scholar] [CrossRef]
- Lin, S.; Liu, A.; Wang, J.; Kong, X. A Review of Path-Planning Approaches for Multiple Mobile Robots. Machines 2022, 10, 773. [Google Scholar] [CrossRef]
- Pavliuk, O.; Cupek, R.; Steclik, T.; Medykovskyy, M.; Drewniak, M. A Novel Methodology Based on a Deep Neural Network and Data Mining for Predicting the Segmental Voltage Drop in Automated Guided Vehicle Battery Cells. Electronics 2023, 12, 4636. [Google Scholar] [CrossRef]
- Pavliuk, O.; Medykovskyy, M.; Steclik, T. Predicting AGV Battery Cell Voltage Using a Neural Network Approach with Preliminary Data Analysis and Processing. In Proceedings of the 2023 IEEE International Conference on Big Data (BigData), Sorrento, Italy, 15–18 December 2023; pp. 5087–5096. [Google Scholar] [CrossRef]
- Yemets, K.; Izonin, I.; Dronyuk, I. Time Series Forecasting Model Based on the Adapted Transformer Neural Network and FFT-Based Features Extraction. Sensors 2025, 25, 652. [Google Scholar] [CrossRef]
- Pavliuk, O.; Mishchuk, M.; Strauss, C. Transfer Learning Approach for Human Activity Recognition Based on Continuous Wavelet Transform. Algorithms 2023, 16, 77. [Google Scholar] [CrossRef]
- Zhang, Y.; Wang, F.; Fu, F.; Su, Z. Multi-AGV Path Planning for Indoor Factory by Using Prioritized Planning and Improved Ant Algorithm. J. Eng. Technol. Sci. 2018, 50, 534–547. [Google Scholar] [CrossRef]
- Singh, N.; Akcay, A.; Dang, Q.V.; Martagan, T.; Adan, I. Dispatching AGVs with battery constraints using deep reinforcement learning. Comput. Ind. Eng. 2024, 187, 109678. [Google Scholar] [CrossRef]
- Singh, N.; Dang, Q.V.; Akcay, A.; Adan, I.; Martagan, T. A matheuristic for AGV scheduling with battery constraints. Eur. J. Oper. Res. 2022, 298, 855–873. [Google Scholar] [CrossRef]
- Guo, W.; Hu, H.; Sha, M.; Lian, J.; Yang, X. Battery-Powered AGV Scheduling and Routing Optimization with Flexible Dual-Threshold Charging Strategy in Automated Container Terminals. J. Mar. Sci. Eng. 2025, 13, 1526. [Google Scholar] [CrossRef]




| Agent | Found Path | Stop Reason | Number of Visited Cells | Remaining SoC | Travel time (Conventional Units) |
|---|---|---|---|---|---|
| 1 | (0, 4), (1, 4), (2, 4), (3, 4), (4, 4), (4, 3), (4, 2), (4, 1) | None | 8 | 0.400 | 7.20 |
| 2 | (0, 6), (0, 7), (1, 7), (2, 7), (3, 7), (4, 7), (5, 7), (6, 7), (7, 7), (8, 7), (9, 7) | None | 11 | 0.500 | 10.70 |
| 3 | (0, 3) | battery_low | 1 | 0.870 | 0 |
| Method | Agent | Number of Cells Passed | Number of Cells Considered in Order |
|---|---|---|---|
| Proposed | 1 | 8 | 61 |
| Proposed | 2 | 11 | 84 |
| Proposed | 3 | 12 | 97 |
| A* | 1 | 8 | 118 |
| A* | 2 | 11 | 74 |
| A* | 3 | 14 | 194 |
| Cooperative A* | 1 | 8 | 126 |
| Cooperative A* | 2 | 11 | 74 |
| Cooperative A* | 3 | 14 | 194 |
| M* | 1 | 8 | 25 |
| M* | 2 | 2 | None |
| M* | 3 | 12 | 34 |
| D* Lite | 1 | 10 | 67 |
| D* Lite | 2 | 11 | 19 |
| D* Lite | 3 | 14 | 33 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Pavliuk, O.; Mishchuk, M. Energy-Aware Spatio-Temporal Multi-Agent Route Planning for AGVs. Appl. Sci. 2026, 16, 3060. https://doi.org/10.3390/app16063060
Pavliuk O, Mishchuk M. Energy-Aware Spatio-Temporal Multi-Agent Route Planning for AGVs. Applied Sciences. 2026; 16(6):3060. https://doi.org/10.3390/app16063060
Chicago/Turabian StylePavliuk, Olena, and Myroslav Mishchuk. 2026. "Energy-Aware Spatio-Temporal Multi-Agent Route Planning for AGVs" Applied Sciences 16, no. 6: 3060. https://doi.org/10.3390/app16063060
APA StylePavliuk, O., & Mishchuk, M. (2026). Energy-Aware Spatio-Temporal Multi-Agent Route Planning for AGVs. Applied Sciences, 16(6), 3060. https://doi.org/10.3390/app16063060

