This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
Global Path Planning for Land–Air Amphibious Biomimetic Robot Based on Improved PPO
by
Weilai Jiang
Weilai Jiang 1,2,
Jingwei Liu
Jingwei Liu 1,2,*,
Wei Wang
Wei Wang 3 and
Yaonan Wang
Yaonan Wang 1,2
1
The College of Artificial Intelligence and Robotics, Hunan University, Changsha 410082, China
2
Greater Bay Area Institute for Innovation, Hunan University, Guangzhou 511300, China
3
The College of Architecture and Urban Planning, Hunan University, Changsha 410082, China
*
Author to whom correspondence should be addressed.
Biomimetics 2026, 11(1), 25; https://doi.org/10.3390/biomimetics11010025 (registering DOI)
Submission received: 27 November 2025
/
Revised: 21 December 2025
/
Accepted: 30 December 2025
/
Published: 1 January 2026
Abstract
To address the path planning challenges for land–air amphibious biomimetic robots in unstructured environments, this study proposes a global path planning algorithm based on an Improved Proximal Policy Optimization (IPPO) framework. Unlike traditional single-domain navigation, amphibious robots face significant kinematic discontinuities when switching between terrestrial and aerial modes. To mitigate this, we integrate a Gated Recurrent Unit (GRU) module into the policy network, enabling the agent to capture temporal dependencies and make smoother decisions during mode transitions. Furthermore, to enhance exploration efficiency and stability, we replace the standard Gaussian noise with Ornstein–Uhlenbeck (OU) noise, which generates temporally correlated actions aligned with the robot’s physical inertia. Additionally, a Multi-Head Self-Attention mechanism is introduced to the value network, allowing the agent to dynamically prioritize critical environmental features—such as narrow obstacles—over irrelevant background noise. The simulation results demonstrate that the proposed IPPO algorithm significantly outperforms standard PPO baselines, achieving higher convergence speed, improved path smoothness, and greater success rates in complex amphibious scenarios.
Share and Cite
MDPI and ACS Style
Jiang, W.; Liu, J.; Wang, W.; Wang, Y.
Global Path Planning for Land–Air Amphibious Biomimetic Robot Based on Improved PPO. Biomimetics 2026, 11, 25.
https://doi.org/10.3390/biomimetics11010025
AMA Style
Jiang W, Liu J, Wang W, Wang Y.
Global Path Planning for Land–Air Amphibious Biomimetic Robot Based on Improved PPO. Biomimetics. 2026; 11(1):25.
https://doi.org/10.3390/biomimetics11010025
Chicago/Turabian Style
Jiang, Weilai, Jingwei Liu, Wei Wang, and Yaonan Wang.
2026. "Global Path Planning for Land–Air Amphibious Biomimetic Robot Based on Improved PPO" Biomimetics 11, no. 1: 25.
https://doi.org/10.3390/biomimetics11010025
APA Style
Jiang, W., Liu, J., Wang, W., & Wang, Y.
(2026). Global Path Planning for Land–Air Amphibious Biomimetic Robot Based on Improved PPO. Biomimetics, 11(1), 25.
https://doi.org/10.3390/biomimetics11010025
Article Metrics
Article metric data becomes available approximately 24 hours after publication online.