Article

Global Path Planning for Land–Air Amphibious Biomimetic Robot Based on Improved PPO

1 The College of Artificial Intelligence and Robotics, Hunan University, Changsha 410082, China
2 Greater Bay Area Institute for Innovation, Hunan University, Guangzhou 511300, China
3 The College of Architecture and Urban Planning, Hunan University, Changsha 410082, China
* Author to whom correspondence should be addressed.
Biomimetics 2026, 11(1), 25; https://doi.org/10.3390/biomimetics11010025
Submission received: 27 November 2025 / Revised: 21 December 2025 / Accepted: 30 December 2025 / Published: 1 January 2026

Abstract

To address the path planning challenges for land–air amphibious biomimetic robots in unstructured environments, this study proposes a global path planning algorithm based on an Improved Proximal Policy Optimization (IPPO) framework. Unlike traditional single-domain navigation, amphibious robots face significant kinematic discontinuities when switching between terrestrial and aerial modes. To mitigate this, we integrate a Gated Recurrent Unit (GRU) module into the policy network, enabling the agent to capture temporal dependencies and make smoother decisions during mode transitions. Furthermore, to enhance exploration efficiency and stability, we replace the standard Gaussian noise with Ornstein–Uhlenbeck (OU) noise, which generates temporally correlated actions aligned with the robot’s physical inertia. Additionally, a Multi-Head Self-Attention mechanism is introduced to the value network, allowing the agent to dynamically prioritize critical environmental features—such as narrow obstacles—over irrelevant background noise. The simulation results demonstrate that the proposed IPPO algorithm significantly outperforms standard PPO baselines, achieving higher convergence speed, improved path smoothness, and greater success rates in complex amphibious scenarios.

1. Introduction

Land–air amphibious biomimetic robots have garnered significant attention in recent years due to their unique ability to navigate complex, unstructured environments. By combining the high-speed mobility of aerial vehicles with the long-endurance capability of terrestrial robots, these platforms show immense potential in disaster rescue [1], military reconnaissance [2], and environmental monitoring [3]. Mimicking the versatile locomotion of biological organisms like birds [4] or insects, these robots can seamlessly transition between crawling on uneven terrain and aerial maneuvering to overcome obstacles. However, the dual-mode nature of these robots introduces severe challenges in motion control and path planning [5]. Unlike single-domain agents, amphibious robots must deal with distinct kinematic dynamics in different media and, more critically, ensure stability during the impulsive transition between ground and air modes. To provide a comprehensive context for our proposed solution, we review existing literature from three perspectives: classical path planning methods, reinforcement learning for single-domain robots, and emerging research on multi-modal platforms.
In the domain of classical path planning, methods are generally categorized by the degree of environmental information available: global path planning, which utilizes complete known environmental data [6], and local path planning, which relies on real-time sensor data [7]. This paper focuses on planning an energy-efficient and path-optimal route within global maps, an area where traditional algorithms have established a strong foundation. For global search tasks, the A* algorithm remains a dominant baseline. Recent studies have optimized A* by improving heuristic functions and neighborhood search strategies (e.g., 24-neighborhood search) to significantly enhance search efficiency and path smoothness in static environments [8,9]. In high-dimensional complex environments, sampling-based methods have shown superiority; for instance, He et al. [10] developed an improved RRT*-Connect algorithm to successfully navigate high-DOF manipulators through multi-obstacle narrow passages. Regarding local planning and dynamic obstacle avoidance, Yan et al. [11] proposed improved Artificial Potential Field (APF) methods to overcome the inherent “local minima” problem in real-time scenarios. Notably, specific to the domain of amphibious robots, Li et al. [12] recently adapted the Dynamic Window Approach (DWA) to handle the kinematic variations between water and land environments. Despite these advancements, however, such traditional methods typically depend heavily on pre-built environmental models or deterministic kinematic constraints. They often struggle to adapt to the highly nonlinear dynamics and cross-domain mode switching (e.g., land-to-air) required by biomimetic robots, leaving them poorly equipped for such complex, unstructured scenarios.
To address the limitations of classical methods, Reinforcement Learning (RL)—particularly Deep Reinforcement Learning (DRL)—has been introduced to optimize behavior strategies through real-time trial-and-error interaction without relying on prior environmental knowledge [13]. In the ground domain, DRL has demonstrated robust performance in handling unstructured environments. For instance, Bai et al. [14] utilized DRL to enable autonomous navigation of mobile robots in comprehensive unknown environments, significantly improving exploration efficiency. To address dynamic uncertainties and temporal dependencies, Zhang et al. [15] proposed an SAC-LSTM algorithm that integrates Long Short-Term Memory networks, allowing the agent to utilize historical information for rapid decision-making. Furthermore, Nie et al. [16] developed an enhanced Proximal Policy Optimization (PPO) algorithm with sample regularization and adaptive learning rates, which effectively improved path planning stability for Automated Guided Vehicles (AGVs). In the aerial domain, research has focused on 3D navigation and energy optimization. Fei et al. [17] applied DRL to achieve autonomous navigation and collision avoidance for UAVs in unknown environments. Addressing the complexity of three-dimensional spaces, Qi et al. [18] proposed an improved PPO algorithm for 3D path planning, enhancing convergence speed in complex scenarios. Additionally, considering the limited battery life of aerial platforms, Chen et al. [19] introduced an energy-efficient path planning method based on DRL. While recent advancements have begun to explore cross-domain applications—such as Cao et al. [20] applying SAC to amphibious UAVs (water–air) and Mondal et al. [21] investigating cooperative routing and planning for heterogeneous UAV-UGV systems using an attention-aware DRL framework—these works primarily focus on either fluid–air transitions or multi-agent collaboration. There remains a notable gap in unified global path planning for single-agent land–air biomimetic robots, specifically regarding the challenge of kinematic discontinuity during ground–air mode switching.
Emerging research on multi-modal platforms presents a different landscape. While recent hardware innovations have demonstrated the feasibility of such systems—as evidenced by Zhou et al.’s [22] comprehensive review, Mandralis et al.’s [23] mid-air morphing “ATMO” robot, and Zhang et al.’s [24] multimodal soft amphibious robot—motion planning strategies have largely lagged behind. Much of the existing literature focuses on the collaborative control of heterogeneous systems (e.g., UAV-UGV formations) [25], rather than single-agent autonomy. Research on single-agent global path planning is far less developed due to severe cross-domain kinematic constraints during media transitions, a challenge highlighted by Liang et al. [26]. Consequently, applying standard Deep Reinforcement Learning (DRL) frameworks to these complex environments often proves inadequate, manifesting as three critical operational failures specifically affecting amphibious robots. First, regarding temporal blindness, standard algorithms like PPO treat state-action pairs independently. This prevents the agent from retaining momentum information during ‘Kinematic Discontinuities’ (e.g., taking off from a slope), leading to abrupt thrust loss and instability. Second, regarding sparse rewards, unlike 2D rovers, amphibious robots operate in a vast 3D volumetric space where goal-directed signals are rare. This often traps the agent in ‘aimless loitering,’ causing it to miss narrow spatial corridors necessary for flight traversal. Finally, concerning exploration efficiency, generic strategies such as Gaussian noise generate uncorrelated, jerky control signals. For biomimetic robots with complex wheel-leg linkages, these high-frequency oscillations not only hinder convergence but also inflict mechanical stress, risking physical damage to actuators. Therefore, introducing a memory-capable mechanism and enhanced global perception into the decision-making process is crucial for maintaining trajectory smoothness and stability in hybrid environments.
To fill this gap, this paper proposes a global path planning algorithm based on an Improved Proximal Policy Optimization (IPPO) framework. Specifically designed for land–air amphibious biomimetic robots, this method ensures smooth and stable navigation across different media. The main contributions of this paper are summarized as follows:
(1) Amphibious Framework Design: We establish a global path planning framework specifically for land–air amphibious biomimetic robots, incorporating a comprehensive reward function that balances energy efficiency, safety, and mode-switching stability.
(2) Temporal Feature Extraction: A Gated Recurrent Unit (GRU) is integrated into the policy network. This module captures the temporal dependencies of the robot’s motion, effectively eliminating trajectory oscillations during the critical take-off and landing phases.
(3) Safe Exploration Mechanism: We replace the standard Gaussian noise with Ornstein-Uhlenbeck (OU) noise. This strategy generates temporally correlated exploration actions that align with the physical inertia of biomimetic joints, preventing mechanical damage from high-frequency control jitter.
(4) Enhanced Global Perception: A Multi-Head Self-Attention mechanism is introduced to the value network. This allows the agent to dynamically prioritize critical environmental features (e.g., narrow obstacles) over irrelevant background noise, significantly improving the success rate in cluttered environments.

2. Land–Air Amphibious Biomimetic Robot Platform

2.1. Land–Air Amphibious Biomimetic Robot Hardware Platform

The land–air amphibious biomimetic robot operates in two modes: ground and flight. Its design must balance the endurance of the drone form against the terrain passability of the unmanned-vehicle form. For this reason, this study adopts a deformable combination of a multi-rotor drone and a tracked unmanned vehicle, as shown in Figure 1. The hardware includes an onboard computer, a Pixhawk 4 flight controller, a binocular vision camera, and LiDAR sensors.
Figure 2 shows the mode diagram of the land–air amphibious biomimetic robot. In ground mode, the robot uses a tracked drive with strong environmental adaptability; in flight mode, it operates as a quadrotor aircraft. The multi-rotor design allows the robot to take off and land almost anywhere and provides good maneuverability. When transforming from the unmanned-vehicle form to the drone form, the tracks on both sides fold upward via the foldable track arms to become a protective frame for the propellers; the folding takes 2 s. In this study, the flight speed of the robot is set to 2 m/s and the ground speed to 1 m/s. Because flight consumes far more energy than ground travel, the energy model assigns 46.4 J/m to ground motion and 270.3 J/m to flight.

2.2. Kinematic Consistency and Energy Analysis

To facilitate efficient reinforcement learning training, we adopt a simplified average energy model based on the Cost of Transport (CoT). While flight energy physically depends on complex aerodynamics (e.g., banking angle, acceleration) and ground energy on terrain interaction, modeling these micro-dynamics at the planning level introduces excessive computational overhead and sparse reward noise.
Therefore, we linearize the energy consumption as:
$E_{total} \approx C_{air} D_{air} + C_{ground} D_{ground}$  (1)
where $D_{air}$ and $D_{ground}$ represent the path lengths flown and driven, respectively. The coefficients $C_{ground} = 46.4$ J/m and $C_{air} = 270.3$ J/m are derived from the empirical average power consumption of the robot at its nominal cruise velocities.
The justification for this simplification is threefold. First, regarding the macro-decision focus, the primary objective of the IPPO agent is to optimize high-level mode-switching strategies, where the significant magnitude disparity (≈1:6) between ground and air costs serves as the critical decision driver rather than minute fluctuations caused by instantaneous friction or drag. Second, in terms of computational efficiency, adopting a linear energy model significantly accelerates the reward calculation process throughout the millions of training steps required, thereby ensuring the overall feasibility of the algorithm. Finally, this approach enhances robustness; by employing conservative average values, we prevent the agent from exploiting simulation loopholes—such as unrealistic ‘gliding’ without energy cost—and guarantee that the planned paths remain valid and transferable under real-world disturbances.
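To make the model concrete, the following minimal sketch computes Equation (1) for a waypoint path, classifying each segment as ground or flight by the 0.2 m altitude threshold introduced in Section 5.3; the function and its interface are illustrative assumptions rather than the authors' implementation:

```python
import numpy as np

C_GROUND = 46.4         # J/m, empirical average ground cost at 1 m/s
C_AIR = 270.3           # J/m, empirical average flight cost at 2 m/s
FLIGHT_THRESHOLD = 0.2  # m, altitude separating ground and flight modes

def path_energy(waypoints: np.ndarray) -> float:
    """Approximate total energy (J) consumed along an (N, 3) waypoint path."""
    diffs = np.diff(waypoints, axis=0)                    # per-segment displacement
    lengths = np.linalg.norm(diffs, axis=1)               # per-segment length (m)
    mid_z = 0.5 * (waypoints[:-1, 2] + waypoints[1:, 2])  # mean segment altitude
    cost = np.where(mid_z > FLIGHT_THRESHOLD, C_AIR, C_GROUND)
    return float(np.sum(cost * lengths))

# Example: a 10 m ground run followed by a 5 m climb to cross an obstacle.
path = np.array([[0, 0, 0], [10, 0, 0], [10, 0, 5]], dtype=float)
print(path_energy(path))  # 10 * 46.4 + 5 * 270.3 = 1815.5 J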

3. The PPO Algorithm

Given the high-dimensional continuous action space defined by the dual-mode operation (flight and ground driving) described in Section 2, discrete control algorithms are insufficient. Therefore, we adopt the Proximal Policy Optimization (PPO) algorithm as our foundational framework. PPO is selected for its stability in continuous control tasks. However, as noted in our contributions, the standard “Vanilla PPO” lacks the temporal memory and noise filtration capabilities required for amphibious navigation. The mathematical formulation of the standard PPO is presented below, followed by our specific improvements.
The PPO algorithm is an Actor-Critic based method, with its structure illustrated in Figure 3. The core architecture of this algorithm consists of two key components: the policy network (Actor) and the value network (Critic). Specifically, the policy network generates state-to-action mappings by outputting probability distributions over the action space, while the value network focuses on estimating the state-value function to compute advantage functions, thereby guiding policy optimization and enhancing training stability.
The PPO algorithm enhances the policy network’s update process through the introduction of importance sampling and advantage functions, thereby improving sample utilization efficiency. Importance sampling is a method that estimates the expected value of one distribution by sampling data from a known distribution and weighting the samples accordingly. Given a random variable x following probability distribution p(x), the expected value of function f(x) is calculated as follows:
$E_{x \sim p(x)}[f(x)] = \int f(x)\, p(x)\, dx$  (2)
If sampling directly from p(x) becomes difficult, we may instead sample from an alternative distribution q(x). In this case, the expected value calculation formula becomes Equation (3):
$E_{x \sim p(x)}[f(x)] = \int f(x)\, \frac{p(x)}{q(x)}\, q(x)\, dx = E_{x \sim q(x)}\!\left[ f(x)\, \frac{p(x)}{q(x)} \right]$  (3)
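As a quick numerical sanity check of Equation (3), the toy snippet below estimates an expectation under a target distribution p using samples drawn from a different proposal q; the particular distributions and test function are arbitrary illustrative choices:

```python
import numpy as np

rng = np.random.default_rng(0)

def normal_pdf(x, mu, sigma):
    return np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))

# Estimate E_{x~p}[x^2] for p = N(1, 1) using samples from q = N(0, 2),
# reweighting each sample by p(x)/q(x) as in Equation (3).
f = lambda x: x ** 2
x_q = rng.normal(0.0, 2.0, size=200_000)                  # samples from q(x)
weights = normal_pdf(x_q, 1.0, 1.0) / normal_pdf(x_q, 0.0, 2.0)
estimate = np.mean(f(x_q) * weights)                      # importance-weighted mean
print(estimate)  # ≈ 2, since E_{x~p}[x^2] = mu^2 + sigma^2 = 1 + 1
```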
In PPO, the goal of the policy network is to maximize the expected return of the new policy. However, since the new policy has not yet interacted with the environment and cannot directly sample data, it must rely on samples generated by the old policy; PPO therefore uses importance sampling to update the new policy from data collected under the old one. Direct use of importance sampling, however, may cause the policy update to be too large, destabilizing training. To this end, PPO designs an objective function with a clipping term that constrains the policy update by limiting the range of variation in the ratio between the new and old policies. This avoids the costly optimization of a KL divergence constraint, significantly improving computational efficiency and ease of implementation while preserving performance. The main calculation formulas are shown in Equations (4) and (5):
$r_t(\theta) = \dfrac{\pi_\theta(a_t \mid s_t)}{\pi_{\theta_{old}}(a_t \mid s_t)}$  (4)
$L_t^{CLIP}(\theta) = \hat{E}_t\!\left[ \min\!\left( r_t(\theta)\hat{A}_t,\ \mathrm{clip}(r_t(\theta),\, 1-\varepsilon,\, 1+\varepsilon)\hat{A}_t \right) \right]$  (5)
where $r_t(\theta)$ is the probability ratio between the new and old policies for selecting action $a_t$ in state $s_t$, $\hat{A}_t$ is the advantage function, usually computed with Generalized Advantage Estimation (GAE), $\theta$ denotes the parameters of the policy network, and $\varepsilon$ is the clipping-range hyperparameter that limits the range of $r_t(\theta)$, typically set to 0.1 or 0.2. When the advantage function is greater than 0, the current action is better than average and its selection probability should be increased, while the upper limit on the policy ratio prevents excessive updates. Conversely, when the advantage function is less than 0, the current action is poor and its selection probability should be reduced, with the lower limit on the ratio again preventing excessive updates. In this way, PPO restricts the probability ratio $r_t(\theta)$ to the range $[1-\varepsilon,\, 1+\varepsilon]$, constraining the update amplitude of the policy network, preventing drastic fluctuations in policy performance, and ensuring training stability. The constraints of the objective function $L_t^{CLIP}$ are shown in Figure 4.
In PPO, the value network is updated by minimizing the mean squared error between the estimated state value $V_\theta(s_t)$ and the target value $V_t^{target}$, as shown in Equation (6):
$L_t^{Value}(\theta) = E_t\!\left[ \left( V_\theta(s_t) - V_t^{target} \right)^2 \right]$  (6)
where $\theta$ here denotes the parameters of the value network.
To encourage exploration and prevent the policy from converging to local optima, PPO introduces an entropy bonus while employing a neural network architecture with shared parameters between the policy and value networks to improve training efficiency. The final loss function is given by Equation (7):
$L_t(\theta) = \hat{E}_t\!\left[ L_t^{CLIP}(\theta) - c_1 L_t^{Value}(\theta) + c_2 S[\pi_\theta](s_t) \right]$  (7)
where $c_1$ and $c_2$ are hyperparameters controlling the weights of the value loss term and the entropy bonus term, respectively, and $S[\pi_\theta](s_t)$ denotes the entropy of policy $\pi_\theta$ in state $s_t$.
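For concreteness, the following sketch assembles Equations (4)-(7) into a single loss, assuming the caller provides log-probabilities from the current and old policies, GAE advantages, value predictions, value targets, and entropies as 1-D tensors; it is a minimal illustration, not the paper's implementation:

```python
import torch

def ppo_loss(log_probs, old_log_probs, advantages, values, value_targets,
             entropy, eps=0.2, c1=0.5, c2=0.01):
    ratio = torch.exp(log_probs - old_log_probs)               # r_t(theta), Eq. (4)
    clipped = torch.clamp(ratio, 1.0 - eps, 1.0 + eps)
    policy_loss = -torch.min(ratio * advantages,
                             clipped * advantages).mean()      # -L_t^CLIP, Eq. (5)
    value_loss = (values - value_targets).pow(2).mean()        # L_t^Value, Eq. (6)
    # Negative of Eq. (7), suitable for minimization by gradient descent.
    return policy_loss + c1 * value_loss - c2 * entropy.mean()
```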

4. Improved PPO Algorithm for Land–Air Environments

Although standard PPO provides a baseline for policy learning, its reliance on independent state processing and uncorrelated exploration noise proves inadequate for land–air amphibious robots. Specifically, this limitation exacerbates the ‘kinematic discontinuity’ problem highlighted in the Introduction, resulting in a loss of momentum information during mode transitions and generating mechanically harmful high-frequency control signals. To overcome these challenges, we present a novel IPPO algorithm that offers a holistic solution beyond generic hybrid methods. Our framework synergistically integrates three key enhancements: (1) a Gated Recurrent Unit (GRU) module to capture temporal dependencies for smooth trajectory generation; (2) Ornstein-Uhlenbeck (OU) noise to ensure temporally correlated, mechanically safe exploration; and (3) a Self-Attention mechanism to enhance global perception. These components work together to ensure seamless mode switching and robust path planning. The overall schematic of the proposed IPPO algorithm, detailing the interaction between these modules, is depicted in Figure 5 below.

4.1. Markov Decision Process of Land–Air Robot

4.1.1. State Space

To comprehensively describe the real-time state of the land–air amphibious biomimetic robot, this subsection designs its state space, which includes the robot's position, velocity, target information, and environment-sensing data. The real-time state is first described by the position coordinates $[x, y, z]$ and velocity $[v_x, v_y, v_z]$, where the position coordinates give the robot's current location in 3D space and the velocity reflects its motion along each coordinate axis. To guide the robot toward the target point, the relative position of the target with respect to the robot, $[\Delta x, \Delta y, \Delta z]$, is introduced and calculated as:
$\Delta x = x_p - x_c, \quad \Delta y = y_p - y_c, \quad \Delta z = z_p - z_c$  (8)
where $(x_p, y_p, z_p)$ are the coordinates of the target point and $(x_c, y_c, z_c)$ are the coordinates of the current position. In addition, the Euclidean distance $D$ from the robot to the target point is introduced, calculated as:
$D = \sqrt{\Delta x^2 + \Delta y^2 + \Delta z^2}$  (9)
This distance provides the robot with global positional information about the target point, facilitating path planning and navigation in complex environments. To enhance the robot’s environmental perception capabilities and enable autonomous obstacle avoidance and safe navigation, this study employs a 16-line 3D LiDAR as the primary sensor. Through feature extraction, the system obtains the nearest obstacle distances in 11 directions from each LiDAR data frame: front, left, right, front-left, front-right, front-up, front-down, front-upper-left, front-upper-right, front-lower-left, and front-lower-right, denoted, respectively, as:
$[s_1, s_2, s_3, s_4, s_5, s_6, s_7, s_8, s_9, s_{10}, s_{11}]$  (10)
In summary, the state space O t of the land–air amphibious biomimetic robot at time t can be expressed as:
$O_t = \{x, y, z, v_x, v_y, v_z, \Delta x, \Delta y, \Delta z, D, s_1, \ldots, s_{11}\}$  (11)
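A minimal sketch of assembling this 21-dimensional observation from raw quantities is shown below; the variable names are illustrative assumptions:

```python
import numpy as np

def build_observation(position, velocity, target, lidar_distances):
    """Assemble the state O_t of Eq. (11) from raw sensor quantities."""
    position = np.asarray(position, dtype=np.float32)        # [x, y, z]
    velocity = np.asarray(velocity, dtype=np.float32)        # [vx, vy, vz]
    delta = np.asarray(target, dtype=np.float32) - position  # [dx, dy, dz], Eq. (8)
    distance = np.linalg.norm(delta)                         # D, Eq. (9)
    lidar = np.asarray(lidar_distances, dtype=np.float32)    # [s1, ..., s11], Eq. (10)
    assert lidar.shape == (11,)
    return np.concatenate([position, velocity, delta,
                           [distance], lidar]).astype(np.float32)  # 21 dims
```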

4.1.2. Action Space

In this paper, the action space of the land–air amphibious biomimetic robot is designed to be continuous: it is described by the velocities $v_x$, $v_y$, and $v_z$ parallel to the x-, y-, and z-axes of the global coordinate system. The action space is therefore expressed as:
$A = \{v_x, v_y, v_z\}$  (12)
where the values of $v_x$, $v_y$, and $v_z$ lie in the range $[-1, 1]$.
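Because the environments are later wrapped as standard Gym interfaces (Section 5.1), this action space maps directly onto a continuous Box space. A minimal sketch, assuming the gymnasium package (the original Gym API is analogous):

```python
import numpy as np
from gymnasium import spaces

# Continuous 3-D velocity command of Eq. (12), bounded in [-1, 1] per axis.
action_space = spaces.Box(low=-1.0, high=1.0, shape=(3,), dtype=np.float32)
sample_action = action_space.sample()  # random [vx, vy, vz] within bounds
```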

4.1.3. Reward Function

In reinforcement learning, the agent explores the environment by performing different actions, obtains corresponding reward values, and updates its strategy based on the reward, thereby gradually optimizing its behavior. The reward function is therefore the core element of the path planning task, and the rationality of its design significantly affects the quality of the planned paths. However, traditional reward designs usually give a positive reward only when the agent reaches the target point, a negative reward on collision or boundary violation, and zero reward otherwise. Such designs suffer from reward sparsity, which delays feedback to the agent's decisions, impairs strategy learning, and may even prevent the algorithm from converging. This subsection therefore designs the reward function for path planning of land–air amphibious biomimetic robots, with each component described below.
1. Distance reward. The distance reward encourages the robot to approach its target by measuring the change in Euclidean distance between the robot and the target point from the previous timestep to the current one, thereby rewarding progress during path planning. Its mathematical expression is:
$r_{distance} = \eta (D_{t-1} - D_t)$  (13)
where $D_t$ and $D_{t-1}$ are the Euclidean distances between the robot and the target point at the current and previous timesteps, respectively, and $\eta$ is the distance reward coefficient.
2. Altitude penalty. The land–air amphibious biomimetic robot has two modes, flight and ground travel, with the energy consumption of flight being significantly higher than that of ground travel. To reduce the robot's energy consumption during flight and encourage ground travel, this paper introduces an altitude penalty. Its mathematical expression is:
$r_{height} = -\lambda \max(0,\, z_t - \delta)$  (14)
where $z_t$ is the robot's current altitude, $\lambda$ is the altitude penalty coefficient, and $\delta$ is the altitude threshold. When the robot's altitude is below the threshold $\delta$, no additional penalty is applied; if the altitude exceeds $\delta$, a penalty proportional to the excess is imposed.
3. Collision penalty. To ensure the safety of path planning in complex environments, this paper introduces a collision penalty to prevent collisions between the robot and obstacles. Its mathematical expression is:
$r_{collision} = -\mu L_{collision}$  (15)
where $L_{collision}$ is the collision flag ($L_{collision} = 1$ when a collision occurs, otherwise $L_{collision} = 0$) and $\mu$ is the collision penalty coefficient. This penalty mechanism effectively guides the robot to proactively avoid obstacle zones during path planning, enhancing both the safety and feasibility of the planned trajectory.
4. Time penalty. To encourage the robot to complete path planning efficiently, this paper introduces a time penalty designed to prompt the robot to reach the target point quickly while minimizing unnecessary movements, thereby reducing mission execution time. Its mathematical expression is:
$r_{time} = -k$  (16)
where $k$ is the time penalty coefficient. This penalty applies a small negative reward at each time step, guiding the robot to optimize path planning, improve execution efficiency, and reduce unnecessary movements.
5. Smooth reward. In continuous control tasks, excessively abrupt action variations may cause system instability, increased energy consumption, and trajectory oscillations. To ensure smooth robotic motion, this paper introduces a smooth reward. Its mathematical expression is:
$r_{smooth} = -\xi \left\| a_t - a_{t-1} \right\|^2$  (17)
where $\left\| a_t - a_{t-1} \right\|$ is the Euclidean norm of the change between consecutive actions and $\xi$ is the smooth reward coefficient. This term penalizes abrupt action changes, guiding the robot to generate smooth trajectories; it ensures motion stability, reduces unnecessary energy expenditure, and enhances control precision.
6. Obstacle traversal reward. To address scenarios where the robot encounters obstacles that are impassable in ground mode within complex environments, this paper introduces an obstacle traversal reward to incentivize the robot to employ flight mode when necessary for obstacle clearance. Its mathematical expression is:
$r_{over} = \beta L_{obstacle}\, e^{-\frac{(z_t - h_{obs})^2}{\sigma}}$  (18)
where $L_{obstacle}$ is the cross-map obstacle detection flag, $h_{obs}$ is the obstacle height, $z_t$ is the robot's current altitude, $\beta$ is the reward coefficient, and $\sigma$ is a tuning parameter. This reward reaches its maximum when the robot's altitude approaches the obstacle height, effectively compensating for the altitude penalty during flight.
7. Terminal reward. When the robot successfully reaches the target point, a significant positive reward is provided to incentivize task completion. Its mathematical expression is:
$r_{terminal} = R_{goal}$  (19)
where $R_{goal}$ is the terminal reward value, set to 800. This reward guides the robot toward optimal path planning by discouraging detours and ensuring timely arrival at the target, thereby enhancing planning efficiency.
In summary, the integrated reward function designed in this paper is:
$R_t = r_{distance} + r_{height} + r_{collision} + r_{time} + r_{smooth} + r_{over} + r_{terminal}$  (20)
However, simply aggregating these terms is insufficient; the efficacy of the algorithm hinges on the appropriate balancing of their respective coefficients. In terms of parameter configuration, rather than relying on fixed scalar values that limit generalization, we adopted a hierarchical tuning strategy to resolve potential reward interference and ensure robust convergence. Specifically, to prioritize safety, the collision penalty is set significantly higher than the maximum possible accumulated time costs, preventing the agent from intentionally crashing to terminate the episode early. Furthermore, to address the conflict between energy saving and obstacle traversal, the obstacle crossing reward is calibrated to outweigh the altitude penalty, ensuring the agent is motivated to switch to flight mode for overcoming barriers rather than being trapped in a ground-level local optimum. This relative scaling principle ensures that the global objective of safe and efficient navigation is maintained across varying environmental scales.
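The sketch below illustrates how the terms of Equation (20) combine into a per-step reward. The coefficient values are illustrative placeholders rather than the tuned values of Table 2; only their relative ordering (dominant collision penalty, traversal bonus outweighing the altitude penalty near obstacles) reflects the tuning strategy described above:

```python
import numpy as np

# Placeholder coefficients; only the relative scaling mirrors the text.
ETA, LAM, MU, K, XI = 1.0, 0.5, 400.0, 0.1, 0.2
BETA, SIGMA, DELTA, R_GOAL = 2.0, 1.0, 0.2, 800.0

def step_reward(d_prev, d_curr, z, collided, a, a_prev,
                near_obstacle, h_obs, reached):
    r = ETA * (d_prev - d_curr)                  # distance progress, Eq. (13)
    r -= LAM * max(0.0, z - DELTA)               # altitude penalty, Eq. (14)
    r -= MU * float(collided)                    # collision penalty, Eq. (15)
    r -= K                                       # time penalty, Eq. (16)
    r -= XI * float(np.sum((np.asarray(a) - np.asarray(a_prev)) ** 2))  # Eq. (17)
    if near_obstacle:                            # traversal bonus, Eq. (18)
        r += BETA * np.exp(-((z - h_obs) ** 2) / SIGMA)
    if reached:
        r += R_GOAL                              # terminal reward, Eq. (19)
    return r
```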

4.2. Improved Strategy Network with GRU

Land–air amphibious robots differ from standard UAVs or UGVs in that their current state is heavily influenced by previous momentum, especially during takeoff and landing phases. Feed-forward neural networks (as used in vanilla PPO) lack memory of these past states, often resulting in jerky control signals that can destabilize the robot. To address this, we embed a Gated Recurrent Unit (GRU) layer into the policy network. Unlike LSTM which involves complex gating mechanisms, the GRU offers a streamlined architecture that efficiently retains a “memory” of the robot’s kinematic history. This ensures smooth transitions between terrestrial and aerial modes and allows the agent to make decisions based on a trajectory of states rather than a single snapshot.
The Gated Recurrent Unit (GRU) [27], an improved variant of Recurrent Neural Networks (RNN), effectively resolves the long-term dependency issues encountered by traditional RNNs in processing lengthy sequences through its innovative introduction of reset and update gates to regulate information flow. As shown in Figure 6, the GRU architecture primarily consists of these two gating mechanisms, where the reset gate controls the retention proportion of historical state information while the update gate determines how much historical information should be preserved versus new information incorporated in the current state.
As shown in Figure 6, the outputs of the reset gate and update gate are denoted as r t and z t , respectively, with their computational formulas given by Equations (21) and (22):
$r_t = \sigma\!\left( W_r [h_{t-1}, x_t] + b_r \right)$  (21)
$z_t = \sigma\!\left( W_z [h_{t-1}, x_t] + b_z \right)$  (22)
where $W_r$ and $W_z$ are weight matrices and $b_r$ and $b_z$ are bias terms. Additionally, the GRU introduces a candidate hidden state $\tilde{h}_t$ at the current timestep to capture short-term dependencies, which is combined with the update gate $z_t$ to determine the final hidden state $h_t$ at the same timestep. The computational formulas for $\tilde{h}_t$ and $h_t$ are given by Equations (23) and (24):
$\tilde{h}_t = \tanh\!\left( W_{\tilde{h}} [r_t \odot h_{t-1}, x_t] + b_{\tilde{h}} \right)$  (23)
$h_t = z_t \odot \tilde{h}_t + (1 - z_t) \odot h_{t-1}$  (24)
By leveraging the selective memory and state update mechanisms of GRU, incorporating it into the policy network of the PPO algorithm can effectively enhance the land–air amphibious biomimetic robot’s ability to extract historical state information, thereby optimizing the policy update process and significantly improving the convergence speed of the algorithm. The architecture of the proposed IPPO algorithm’s policy network is illustrated in Figure 7. The network consists of three layers. The first layer serves as a feature extraction module, where the input state information is passed through a fully connected layer with 128 hidden neurons for linear transformation, followed by a ReLU activation function to introduce nonlinearity and facilitate preliminary feature extraction. The second layer is a GRU, which performs sequential modeling on the extracted features, dynamically updating the hidden state to preserve historical dependencies. The third layer is the decision-making layer, in which the GRU output is fed into two separate fully connected layers, each containing 64 hidden neurons, to produce the mean and standard deviation of a Gaussian distribution that represents the continuous action space. Finally, the robot samples actions from this distribution and calculates the corresponding log-probability, which is then used for policy optimization and gradient-based updates.
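A minimal PyTorch sketch of this three-layer architecture is given below; the layer widths follow the text, while everything else (bounding the mean with tanh, clamping the log standard deviation) is an illustrative assumption:

```python
import torch
import torch.nn as nn

class GRUPolicy(nn.Module):
    def __init__(self, obs_dim=21, act_dim=3, hidden=128, head=64):
        super().__init__()
        # Layer 1: feature extraction (128 hidden neurons + ReLU).
        self.feature = nn.Sequential(nn.Linear(obs_dim, hidden), nn.ReLU())
        # Layer 2: GRU for sequential modelling of the kinematic history.
        self.gru = nn.GRU(hidden, hidden, batch_first=True)
        # Layer 3: twin 64-neuron heads for the Gaussian mean and log-std.
        self.mu_head = nn.Sequential(nn.Linear(hidden, head), nn.ReLU(),
                                     nn.Linear(head, act_dim))
        self.log_std_head = nn.Sequential(nn.Linear(hidden, head), nn.ReLU(),
                                          nn.Linear(head, act_dim))

    def forward(self, obs_seq, h0=None):
        # obs_seq: (batch, seq_len, obs_dim); h_n carries memory across calls.
        feats = self.feature(obs_seq)
        out, h_n = self.gru(feats, h0)
        mu = torch.tanh(self.mu_head(out))                  # bounded in [-1, 1]
        std = torch.exp(self.log_std_head(out).clamp(-5, 2))
        return torch.distributions.Normal(mu, std), h_n
```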

4.3. OU Random Noise

Efficient exploration is critical for finding global optimal paths in complex environments. However, the independent Gaussian noise typically used in standard PPO generates uncorrelated, jittery action signals. For a biomimetic robot, such high-frequency oscillations can damage mechanical structures and cause unstable flight attitudes. To mitigate this, we employ the Ornstein-Uhlenbeck (OU) process to generate temporally correlated noise. OU noise models the velocity of a massive Brownian particle under friction, producing smoother exploration trajectories that align better with the physical inertia of the robot, thereby allowing for safe exploration without abrupt control commands. The differential equation form of OU noise is:
$dx_t = -\theta (x_t - \mu)\, dt + \sigma\, dW_t$  (25)
where $x_t$ is the noise state and $W_t$ is a Wiener process; $\mu$ is the long-term mean toward which the process reverts, $\theta$ is the mean-reversion rate, and $\sigma$ scales the random diffusion. Through discretization, the update rule for OU noise is formulated as:
$x_{t+\Delta t} = x_t - \theta (x_t - \mu)\, \Delta t + \sigma \sqrt{\Delta t}\, \varepsilon_t$  (26)
where $\varepsilon_t$ is a random variable following the standard normal distribution.
The policy network architecture incorporating OU noise is illustrated in Figure 8. In this network, OU noise is additively applied to the action outputs, enabling the robot to generate smooth and coherent action sequences during exploration. This implementation not only enhances exploration efficiency but also significantly improves the stability and safety of path planning for the land–air amphibious biomimetic robot.
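A compact implementation of the discrete-time OU process of Equation (26), applied additively to the policy output as in Figure 8, might look as follows; the parameter values are illustrative:

```python
import numpy as np

class OUNoise:
    """Temporally correlated exploration noise, Eq. (26)."""
    def __init__(self, dim=3, mu=0.0, theta=0.15, sigma=0.2, dt=1.0):
        self.mu, self.theta, self.sigma, self.dt = mu, theta, sigma, dt
        self.x = np.full(dim, mu, dtype=np.float64)

    def sample(self) -> np.ndarray:
        eps = np.random.standard_normal(self.x.shape)
        self.x += -self.theta * (self.x - self.mu) * self.dt \
                  + self.sigma * np.sqrt(self.dt) * eps
        return self.x.copy()

# Usage: perturb the sampled policy action with correlated noise, then
# clip back to the action bounds of Equation (12).
noise = OUNoise()
policy_action = np.zeros(3)              # placeholder for the policy's output
action = np.clip(policy_action + noise.sample(), -1.0, 1.0)
```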

4.4. Improved Value Network with Self-Attention Mechanism

In complex environments filled with obstacles, traditional value networks often treat all sensory inputs equally, failing to distinguish between critical threats and irrelevant background noise. This limitation can lead to decision biases, especially when the robot needs to identify narrow passageways. To enhance global perception, we incorporate a Multi-Head Self-Attention mechanism into the value network. This mechanism dynamically computes correlations between different state features and adaptively assigns higher weights to critical environmental information. By focusing on global structural dependencies rather than just local sensor readings, the network significantly improves the accuracy of state-value estimation and the robustness of the planned path.
The multi-head self-attention mechanism [28], an advanced extension of self-attention, employs multiple parallel attention heads to enable simultaneous extraction of diverse representations from heterogeneous feature subspaces. This architecture significantly strengthens the network’s feature extraction capability and flexibility, demonstrating superior performance when processing complex sequential data. The computational process is illustrated in Figure 9.
Assume that there are h attention heads, each with an independent query, key, and value. Each attention head can extract features from the input sequence from different perspectives and learn the association information of the data in different subspaces. Since the parameters of each attention head are independent, they can capture diverse features, thereby enhancing the representation ability of the model. The specific calculation steps of the multi-head self-attention mechanism are as follows:
  • Linear transformation: Assume the input sequence is $X = [x_1, \ldots, x_N] \in \mathbb{R}^{N \times D}$, where $N$ is the length of the input sequence and $D$ is the input feature dimension. For each attention head $i$, distinct linear transformation matrices $W_i^Q$, $W_i^K$, and $W_i^V$ produce the query matrix $Q_i$, key matrix $K_i$, and value matrix $V_i$. The calculation formula is shown in Formula (27):
$Q_i = X W_i^Q, \quad K_i = X W_i^K, \quad V_i = X W_i^V$  (27)
where $W_i^Q \in \mathbb{R}^{D \times D_q}$, $W_i^K \in \mathbb{R}^{D \times D_k}$, and $W_i^V \in \mathbb{R}^{D \times D_v}$.
  • Calculate the attention weights: For each attention head, the dot product between the query matrix $Q_i$ and the transpose of the key matrix $K_i$ is computed, scaled by the factor $\sqrt{D_k}$, and normalized by the softmax function to obtain the attention weight matrix $A_i$. The calculation formula is shown in Formula (28):
$A_i = \mathrm{softmax}\!\left( \dfrac{Q_i K_i^T}{\sqrt{D_k}} \right)$  (28)
  • Weighted summation: The attention weight matrix $A_i$ is multiplied by the value matrix $V_i$ to obtain the output $output_i$ of each head. The calculation formula is shown in Formula (29):
$output_i = A_i V_i$  (29)
  • Concatenation and fusion: The outputs of all $h$ attention heads are concatenated and then transformed through a linear projection matrix $W^O$ to produce the final output. The calculation formula is shown in Formula (30):
$output = \mathrm{Concat}(output_1, output_2, \ldots, output_h)\, W^O$  (30)
The value network structure of this paper, combined with the self-attention mechanism, is shown in Figure 10. First, the input state is subjected to preliminary feature extraction through a multilayer perceptron (MLP), and then nonlinearity is introduced through the ReLU activation function to enhance the model’s expressiveness. Next, the extracted features enter the multi-head self-attention mechanism layer, and the dependencies between different features are calculated in parallel through multiple attention heads to improve the network’s perception and robustness of global information. Finally, the features enhanced by the self-attention mechanism are input into the fully connected layer to output the state value. The improved value network can more accurately evaluate the value of the current state, accelerate convergence, and improve the overall performance and reliability of the algorithm.
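One plausible realization of this value network is sketched below, treating each scalar state feature as a token so that the attention heads can weigh dependencies between features (e.g., LiDAR returns versus the target offset); the embedding width and head count are illustrative assumptions:

```python
import torch
import torch.nn as nn

class AttentionValueNet(nn.Module):
    def __init__(self, obs_dim=21, embed_dim=32, n_heads=4):
        super().__init__()
        # MLP feature extraction: each scalar feature becomes one token.
        self.embed = nn.Sequential(nn.Linear(1, embed_dim), nn.ReLU())
        # Multi-head self-attention implements Formulas (27)-(30) internally.
        self.attn = nn.MultiheadAttention(embed_dim, n_heads, batch_first=True)
        # Fully connected output layer producing the scalar state value.
        self.value_head = nn.Linear(obs_dim * embed_dim, 1)

    def forward(self, obs):                        # obs: (batch, obs_dim)
        tokens = self.embed(obs.unsqueeze(-1))     # (batch, obs_dim, embed_dim)
        attn_out, _ = self.attn(tokens, tokens, tokens)
        return self.value_head(attn_out.flatten(1))  # V(s_t) per state
```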
To present the overall method framework more clearly, the algorithm pseudocode is shown in Algorithm 1:
Algorithm 1: Improved Proximal Policy Optimization (IPPO) for Amphibious Path Planning

5. Experimental Results and Analysis

5.1. Environment and Parameter Configuration

The simulation platform is implemented using the Simulation Open Framework Architecture (SOFA) framework. We selected SOFA over standard rigid-body simulators (e.g., Gazebo) due to its superior capability in handling multi-physics interactions. This feature is particularly crucial for our biomimetic robot, as it allows for accurate modeling of the complex contact mechanics between the robot’s wheel-leg structure and deformable terrains, as well as the aerodynamic disturbances encountered during mode switching. Based on this high-fidelity platform, we developed three simulation environments with varying levels of complexity (Env 1 to Env 3) and encapsulated them as standard Gym interfaces to facilitate autonomous algorithm training. Detailed configurations of the specific geometric features, obstacle distribution, and computational resources of the simulation environments are summarized in Table 1. As illustrated in Figure 11, the spatial dimensions of all maps are standardized to 50 m × 50 m × 20 m, where the red dot and the red five-pointed star represent the starting point and the target destination, respectively. The environmental complexity increases progressively from Env 1 to Env 3, providing a rigorous testbed to verify the effectiveness and generalization capabilities of the proposed algorithm across diverse scenarios. The algorithm-specific hyperparameters were initialized based on established literature [16,18,27,28] and further fine-tuned through preliminary trials in the simulated amphibious environment. The specific parameter values are detailed in Table 2. To ensure fair comparison, all baseline algorithms were trained under identical environmental conditions.
To quantitatively evaluate the reliability of the proposed algorithm, we established clear termination criteria for each experimental episode. An episode is recorded as a Success when the robot successfully navigates from the starting point to the vicinity of the target destination without any collisions. Conversely, an episode is marked as a Failure if the robot collides with an obstacle or fails to reach the target within a reasonable operational timeframe. Based on these criteria, the Success Rate, which serves as a primary performance metric in our comparative analysis, is defined as the percentage of successful trials out of the total number of evaluation episodes.

5.2. Ablation Experiment

In order to verify the impact of the GRU network, OU random noise and self-attention mechanism of the IPPO improvements on the algorithm performance, this section conducts an ablation experiment in env 3. Env 3 has high complexity and intricate obstacle distribution, which can fully test the performance of the algorithm model. The algorithm settings for experimental comparison are shown in Table 3. By comparing the combinations of different improvements (“✓” means included, and “✗” means not included), the contribution of each improved part to the algorithm performance is analyzed.
The results of the ablation experiment are shown in Figure 12 and Figure 13: Figure 12 presents the reward curves, and Figure 13 plots the success rate computed every 100 episodes. As the figures show, the traditional PPO algorithm converges most slowly, its final reward value stabilizes at about 110, and its success rate is only 70%, indicating insufficient strategy optimization efficiency in complex environments.
The PPO_GRU algorithm effectively captures long-term dependencies in the state sequence by introducing the GRU network; its final reward value stabilizes at about 130 and its success rate reaches 75%, a clear improvement over PPO. The PPO_OU algorithm improves exploration by introducing OU noise; its final reward value is about 120 and its success rate 72%, slightly below PPO_GRU but better than PPO, showing that OU noise plays a positive yet comparatively limited role in exploration. The PPO_ATTEN algorithm achieves a reasonable distribution of global state weights through the self-attention mechanism, significantly improving the directionality of strategy optimization; its final reward value reaches about 130, and its success rate reaches 70% within 1650 episodes before stabilizing at 80%, demonstrating high task completion efficiency. The IPPO algorithm performs best: its reward grows rapidly within the first 500 episodes, its final reward value stabilizes at 150, and its success rate reaches 85%. In summary, the GRU network, OU noise, and self-attention mechanism each improve algorithm performance to varying degrees, verifying the effectiveness of the proposed improvements.

5.3. Controlled Experiment

In the previous section, the effectiveness of the IPPO improvement strategies was verified through ablation experiments. To further verify the advantages of the proposed IPPO algorithm for path planning of land–air amphibious biomimetic robots, this section presents comparative experiments against two established deep reinforcement learning algorithms, PPO and DDPG. DDPG is an off-policy algorithm for continuous control that simultaneously learns a Q-function and a policy. These baselines represent different categories of RL algorithms: PPO is a classic on-policy method, while DDPG is a representative off-policy method. The experiments were conducted in env 1, env 2, and env 3, respectively. Each algorithm was run independently 25 times in each environment, and the averages of its performance indicators were computed to ensure stable and reliable results. By comparing the performance of the three algorithms across environments, the effectiveness, generalization, and robustness of the IPPO algorithm in path planning tasks are comprehensively evaluated. The path planning results in the different environments are shown in Figure 14, Figure 15 and Figure 16, respectively. In these visualizations, the ground mode corresponds to low-altitude segments for energy saving, while the flight mode is characterized by vertical ascents triggered specifically for obstacle avoidance.
In this experiment, to simplify the complexity of the model, the amphibious robot is regarded as a point mass. Since the robot itself has a certain height, a height threshold is set in the experiment: when the flight height is higher than 0.2 m, it is considered to be in flight state, and when it is lower than 0.2 m, it is considered to be in ground driving state. At the same time, the flight energy consumption and ground driving energy consumption are set to 270.3 J/m and 46.4 J/m, respectively.
Table 4 records the performance indicators of each algorithm in the three environments, including average path length, average flight path length, average ground path length, and average energy consumption. As Figure 14, Figure 15 and Figure 16 and Table 4 show, the IPPO algorithm has significant advantages in the path planning task of land–air amphibious biomimetic robots, especially in energy consumption optimization and environmental adaptability. As environmental complexity increases, IPPO better balances path length against energy consumption, showing good robustness and generalization ability. In env 1 and env 2, the path lengths of the PPO algorithm are 76.041 m and 72.087 m, respectively, lower than those of IPPO, but its energy consumption is much higher. This indicates that IPPO generates the most energy-efficient trajectories, effectively minimizing redundant maneuvers and loitering in complex terrain. This efficiency is largely attributable to the temporal feature extraction of the GRU module, which smooths the control signals during mode transitions and reduces oscillatory behavior. The path length of DDPG is lower than IPPO's in env 1 but higher in env 2, indicating that DDPG's stability degrades as environmental complexity increases; its energy consumption in both environments also exceeds IPPO's. In contrast, although the paths of IPPO are slightly longer, it effectively reduces energy consumption by shortening the flight segments and increasing ground travel, consuming 9.204 kJ and 12.661 kJ in env 1 and env 2, respectively, significantly better than the other two algorithms. In env 3, as complexity increases further, IPPO remains robust and adaptive: its path length is 75.181 m, close to PPO and DDPG, but its energy consumption is only 9.881 kJ, far below DDPG's 18.778 kJ and PPO's 17.416 kJ. In summary, the IPPO algorithm performs well in the path planning task of land–air amphibious biomimetic robots: it effectively reduces energy consumption without significantly increasing path length, fully exploiting the advantages of the amphibious platform, and it remains stable as environmental complexity grows, further demonstrating its generalization and reliability.
To further evaluate the performance of the IPPO algorithm, this section statistically analyzes the average rewards and success rates during the experiment. The results are shown in Table 5 and visualized as shown in Figure 17.
As can be seen from Table 5 and Figure 17, the average rewards and success rates of the IPPO algorithm in the different environments are significantly better than those of DDPG and PPO, further verifying its superior performance in the path planning task of land–air amphibious biomimetic robots. In terms of average reward, IPPO attains 162.2, 154.1, and 152.3 in env 1, env 2, and env 3, respectively, significantly higher than DDPG and PPO; across the three environments, IPPO's average reward is about 25% higher, indicating that the algorithm effectively improves the quality of task completion. In terms of success rate, IPPO also performs outstandingly, achieving 93.0%, 89.0%, and 85.7% in env 1, env 2, and env 3, respectively, higher than both DDPG and PPO. In addition, as Figure 17 shows, the average reward and success rate of DDPG and PPO both decline as environmental complexity increases, whereas IPPO remains relatively stable, further verifying its environmental adaptability and generalization ability. In summary, the IPPO algorithm shows significant advantages in the path planning task of land–air amphibious biomimetic robots, effectively improving the quality and success rate of task completion while maintaining strong adaptability and stability in complex environments.
It is important to note that the quantitative results presented in Table 5 are based on average performance metrics. While rigorous statistical tests (e.g., t-tests or ANOVA) were not conducted due to the computational constraints of large-scale reinforcement learning training, the magnitude of improvement observed in the proposed IPPO is substantial. For instance, the Success Rate surpasses the baseline DDPG by approximately 18%, and Energy Consumption is reduced by a significant margin. These large performance gaps suggest that the superiority of the IPPO framework is driven by the structural advantages of the GRU and Attention mechanisms rather than stochastic variance.

6. Conclusions

In this paper, we proposed a global path planning framework based on Improved Proximal Policy Optimization (IPPO), specifically tailored for land–air amphibious biomimetic robots to address the challenges of kinematic discontinuities and complex environmental interference. The proposed approach integrates a Gated Recurrent Unit (GRU), Ornstein–Uhlenbeck (OU) noise, and a Self-Attention mechanism into the standard PPO architecture to enhance robustness and adaptability.
The experimental results indicate that the integration of the GRU module successfully enables the agent to capture temporal dependencies, thereby reducing trajectory oscillations during mode switching and ensuring smoother transitions compared to memory-less baselines. Furthermore, the employment of OU noise facilitates temporally correlated exploration; unlike standard Gaussian noise, this strategy generates physically feasible control signals that mitigate the risk of mechanical damage associated with high-frequency jitter while improving convergence speed. Crucially, the Self-Attention mechanism effectively enhances global perception, allowing the agent to optimize the trade-off between trajectory length and energy efficiency. Quantitative comparisons based on average performance metrics demonstrate that the IPPO algorithm achieves a higher success rate and significantly lower energy consumption than standard PPO and DDPG baselines.
Despite these promising results, this study is subject to certain limitations, particularly the reliance on average metrics due to computational constraints and the exclusive use of a simulation environment. To address these challenges and advance the field, future research will focus on bridging the ‘Sim-to-Real’ gap by employing Domain Randomization techniques. This approach aims to train policies robust to real-world uncertainties, such as sensor noise and variable terrain friction, thereby facilitating the transfer of the IPPO algorithm to physical amphibious prototypes. Furthermore, we intend to extend the framework’s adaptability to dynamic environments by incorporating force-sensing feedback to mitigate unpredictable aerodynamic disturbances, such as wind gusts. Finally, integrating multi-modal sensor fusion, specifically by combining visual data from depth cameras with current state inputs, will be explored to enhance local perception fidelity, enabling more precise obstacle avoidance and autonomous landing site selection in complex, unstructured terrains.

Author Contributions

W.J. and J.L. designed the research methods and wrote the main body of the paper, while W.W. and Y.W. assisted in writing the entire paper. All authors contributed to implementing the software and conceiving the experiments. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by Guangdong S&T Programme with grant No. 2024B0101030002, the National Natural Science Foundation of China under Grant 62473138, 62173132 and 62573186, the project of Yuelushan Center for Industrial Innovation 2025YCII0107, the Project of Natural Science Foundation Youth Enhancement Program of Guangdong Province under Grant 2024A1515030184, the science and technology innovation Program of Hunan Province 2025RC3075, and the Project of Guangzhou City Zengcheng District Key Research and Development under Grant 2024ZCKJ01.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding author.

Acknowledgments

We sincerely appreciate the constructive comments and suggestions of the anonymous reviewers, which have greatly helped to improve this paper.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
PPO  Proximal Policy Optimization
IPPO  Improved Proximal Policy Optimization
GRU  Gated Recurrent Unit

References

  1. Kim, C.; Lee, K.; Ryu, S.; Seo, T. Amphibious robot with self-rotating paddle-wheel mechanism. IEEE/ASME Trans. Mechatron. 2023, 28, 1836–1843. [Google Scholar] [CrossRef]
  2. Zhang, H.; Zhu, Y.; Yang, J.; Zhao, J. A bioinspired paddle-wheeled leg amphibious robot with environment-adaptive autonomously. IEEE/ASME Trans. Mechatron. 2024, 30, 15–26. [Google Scholar] [CrossRef]
  3. Chen, L.; Cui, R.; Yan, W.; Xu, H.; Zhang, S.; Yu, H. Terrain-adaptive locomotion control for an underwater hexapod robot: Sensing leg–terrain interaction with proprioceptive sensors. IEEE Robot. Autom. Mag. 2023, 31, 41–52. [Google Scholar] [CrossRef]
  4. Speciale, C.; Milana, S.; Carcaterra, A.; Concilio, A. A Review of Bio-Inspired Perching Mechanisms for Flapping-Wing Robots. Biomimetics 2025, 10, 666. [Google Scholar] [CrossRef] [PubMed]
  5. Zhang, D.; Xu, M.; Zhu, P.; Guo, C.; Zhong, Z.; Lu, H.; Zheng, Z. The development of a novel terrestrial/aerial robot: Autonomous quadrotor tilting hybrid robot. Robotica 2024, 42, 118–138. [Google Scholar] [CrossRef]
  6. Alexander, A.; Venkatesan, K.; Mounsef, J.; Ramanujam, K. A Comprehensive Survey of Path Planning Algorithms for Autonomous Systems and Mobile Robots: Traditional and Modern Approaches. IEEE Access 2025, 13, 176287–176326. [Google Scholar] [CrossRef]
  7. Qin, H.; Shao, S.; Wang, T.; Yu, X.; Jiang, Y.; Cao, Z. Review of autonomous path planning algorithms for mobile robots. Drones 2023, 7, 211. [Google Scholar] [CrossRef]
  8. Fu, X.; Huang, Z.; Zhang, G.; Wang, W.; Wang, J. Research on path planning of mobile robots based on improved A* algorithm. PeerJ Comput. Sci. 2025, 11, e2691. [Google Scholar] [CrossRef] [PubMed]
  9. Wang, X.; Xu, H.; Miao, P.; Petrovic, B.; Rodic, A.; Wang, Z. Path Planning for Mobile Robots Based on Improved A* Algorithm. In Proceedings of the International Conference on Robotics, Automation and Intelligent Control (ICRAIC), Zhangjiajie, China, 22–24 December 2023; pp. 382–387. [Google Scholar]
  10. He, X.; Zhou, Y.; Liu, H.; Shang, W. Improved RRT*-Connect Manipulator Path Planning in a Multi-Obstacle Narrow Environment. Sensors 2025, 25, 2364. [Google Scholar] [CrossRef] [PubMed]
  11. Yan, P.; Yan, Z.; Zheng, H.; Guo, J. Real Time Robot Path Planning Method Based on Improved Artificial Potential Field Method. In Proceedings of the Chinese Control Conference (CCC), Wuhan, China, 25–27 July 2018; pp. 4814–4820. [Google Scholar]
  12. Li, Y.; Zhu, Q. Local path planning based on improved Dynamic window approach. In Proceedings of the Chinese Control Conference (CCC), Shanghai, China, 26–28 July 2021; pp. 4291–4295. [Google Scholar]
  13. Xu, X.; Zeng, J.; Zhao, Y.; Lü, X. Research on global path planning algorithm for mobile robots based on improved A*. Expert Syst. Appl. 2024, 243, 122922. [Google Scholar] [CrossRef]
  14. Bai, Z.; Pang, H.; He, Z.; Zhao, B.; Wang, T. Path Planning of Autonomous Mobile Robot in Comprehensive Unknown Environment Using Deep Reinforcement Learning. IEEE Internet Things J. 2024, 11, 22153–22166. [Google Scholar] [CrossRef]
  15. Zhang, Y.; Chen, P. Path Planning of a Mobile Robot for a Dynamic Indoor Environment Based on an SAC-LSTM Algorithm. Sensors 2023, 23, 9802. [Google Scholar] [CrossRef] [PubMed]
  16. Nie, J.; Zhang, G.; Lu, X.; Wang, H.; Sheng, C.; Sun, L. Reinforcement learning method based on sample regularization and adaptive learning rate for AGV path planning. Neurocomputing 2025, 614, 128820. [Google Scholar] [CrossRef]
  17. Fei, W.; Xiaoping, Z.; Zhou, Z.; Yang, T. Deep-reinforcement-learning-based UAV autonomous navigation and collision avoidance in unknown environments. Chin. J. Aeronaut. 2024, 37, 237–257. [Google Scholar]
  18. Qi, C.; Wu, C.; Lei, L.; Li, X.; Cong, P. UAV path planning based on the improved PPO algorithm. In Proceedings of the Asia Conference on Advanced Robotics, Automation, and Control Engineering (ARACE), Qingdao, China, 26–28 August 2022. [Google Scholar]
  19. Chen, S.; Mo, Y.; Wu, X.; Xiao, J.; Liu, Q. Reinforcement Learning-Based Energy-Saving Path Planning for UAVs in Turbulent Wind. Electronics 2024, 13, 3190. [Google Scholar] [CrossRef]
  20. Cao, X.; Zhang, J.; Xiang, Y.; Yan, Z. SAC-based path planning for amphibious UAVs: A maximum entropy deep reinforcement learning approach. In Proceedings of the International Conference on Image Processing, Intelligent Control and Computer Engineering, Hangzhou, China, 19–25 October 2025. [Google Scholar]
  21. Mondal, M.S.; Ramasamy, S.; Pranav, A. An Attention-Aware Deep Reinforcement Learning Framework for Cooperative UAV-UGV Routing. In Proceedings of the 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Abu Dhabi, United Arab Emirates, 14–18 October 2024; pp. 13687–13693. [Google Scholar]
  22. Zhou, X.; Zhong, H.; Zhang, H.; He, W.; Hua, H.; Wang, Y. Current Status, Challenges, and Prospects for New Types of Aerial Robots (NTARs). Engineering 2024, 41, 19–34. [Google Scholar] [CrossRef]
  23. Mandralis, I.; Nemovi, R.; Ramezani, A.; Murray, R.M.; Gharib, M. ATMO: An aerially transforming morphobot for dynamic ground-aerial transition. Commun. Eng. 2025, 4, 74. [Google Scholar] [CrossRef] [PubMed]
  24. Fang, F.; Zhou, J.; Zhang, Y.; Yi, Y.; Huang, Z.; Feng, Y.; Tao, K.; Li, W.; Zhang, W. A Multimodal Amphibious Robot Driven by Soft Electrohydraulic Flippers. Cyborg Bionic Syst. 2025, 6, 0253. [Google Scholar] [CrossRef] [PubMed]
  25. Chen, L.; Xiao, J.; Teo, C.W.R.; Li, J.; Feroskhan, M. Air–Ground Collaborative Control for Angle-Specified Heterogeneous Formations. IEEE Trans. Intell. Veh. 2025, 10, 1483–1497. [Google Scholar] [CrossRef]
  26. Liang, D.; Huang, X.; Xue, Z.; Li, P. Path planning for amphibious unmanned ground vehicles under cross-domain constraints. Intel. Serv. Robot. 2025, 18, 1381–1416. [Google Scholar] [CrossRef]
  27. Singh, B.; Patel, S.; Vijayvargiya, A.; Kumar, R. Data-driven gait model for bipedal locomotion over continuous changing speeds and inclines. Auton. Robot. 2023, 47, 753–769. [Google Scholar] [CrossRef]
  28. Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. Adv. Neural Inf. Process. Syst. 2017, 30. [Google Scholar]
Figure 1. Appearance and hardware composition of the land–air amphibious biomimetic robot.
Figure 2. Modes of the land–air amphibious biomimetic robot.
Figure 3. PPO algorithm framework diagram.
Figure 4. Objective function constraint range. The red dots represent examples where the probability ratio rt(θ) falls within the unclipped interval (i.e., the update is not clipped by ε), indicating that the standard gradient update is active.
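For reference, the quantity Figure 4 visualizes is the standard PPO clipped surrogate objective, where rt(θ) is the policy probability ratio and ε the clipping range (0.2 in Table 2):

```latex
L^{\mathrm{CLIP}}(\theta) = \hat{\mathbb{E}}_t\!\left[\min\!\left(r_t(\theta)\,\hat{A}_t,\;
\operatorname{clip}\!\left(r_t(\theta),\, 1-\varepsilon,\, 1+\varepsilon\right)\hat{A}_t\right)\right],
\qquad r_t(\theta) = \frac{\pi_\theta(a_t \mid s_t)}{\pi_{\theta_{\mathrm{old}}}(a_t \mid s_t)}
```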
Figure 5. The overall framework of the proposed IPPO algorithm. The system comprises Environment Interaction, the Agent, and Learning modules. The Agent integrates a GRU for kinematic stability, Multi-Head Self-Attention for global perception, and OU noise for smooth exploration. Interactions are stored in the Replay Buffer to drive iterative PPO updates.
Figure 6. GRU network architecture.
Figure 7. Policy Network Architecture of IPPO. This shows a three-layer network: feature extraction with ReLU, GRU for sequential modeling, and outputs for action mean and deviation. It highlights GRU’s role in optimizing continuous actions, backing the enhanced convergence speed in complex environments.
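To make the Figure 7 architecture concrete, the following is a minimal PyTorch sketch, assuming illustrative dimensions (state_dim, hidden_dim, action_dim) and a learnable log-standard-deviation head; it is not the paper's exact implementation.

```python
import torch
import torch.nn as nn

class GRUPolicy(nn.Module):
    def __init__(self, state_dim=16, hidden_dim=128, action_dim=4):
        super().__init__()
        # MLP feature extractor with ReLU, as in the caption.
        self.features = nn.Sequential(nn.Linear(state_dim, hidden_dim), nn.ReLU())
        # GRU captures temporal dependencies across the state sequence.
        self.gru = nn.GRU(hidden_dim, hidden_dim, batch_first=True)
        self.mean_head = nn.Linear(hidden_dim, action_dim)
        # One common choice: log standard deviation as a learnable parameter.
        self.log_std = nn.Parameter(torch.zeros(action_dim))

    def forward(self, states, hidden=None):
        # states: (batch, seq_len, state_dim); hidden carries memory forward.
        x = self.features(states)
        out, hidden = self.gru(x, hidden)
        mean = self.mean_head(out)
        std = self.log_std.exp().expand_as(mean)
        return mean, std, hidden

policy = GRUPolicy()
mean, std, h = policy(torch.randn(2, 5, 16))  # a 5-step state sequence
```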
Figure 8. OU Noise Exploration Strategy.
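The OU exploration strategy of Figure 8 can be sketched with the standard Euler–Maruyama discretization of the Ornstein–Uhlenbeck process; the parameters theta, sigma, and dt below are common defaults, not the tuned values from this work.

```python
import numpy as np

class OUNoise:
    def __init__(self, dim, mu=0.0, theta=0.15, sigma=0.2, dt=1e-2, seed=0):
        self.mu, self.theta, self.sigma, self.dt = mu, theta, sigma, dt
        self.rng = np.random.default_rng(seed)
        self.x = np.full(dim, mu)

    def sample(self):
        # dx = theta*(mu - x)*dt + sigma*sqrt(dt)*N(0, I): mean-reverting,
        # temporally correlated noise suited to inertial robot dynamics.
        dx = self.theta * (self.mu - self.x) * self.dt \
             + self.sigma * np.sqrt(self.dt) * self.rng.standard_normal(self.x.shape)
        self.x = self.x + dx
        return self.x

noise = OUNoise(dim=4)
perturbation = noise.sample()  # added to the policy's action mean at each step
```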
Figure 9. Computation process of multi-head self-attention. Diagram outlines linear transformations, attention weight calculation, weighted summation, and concatenation for multiple heads. It explains dynamic feature correlation, supporting better global perception in cluttered environments.
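The computation steps in Figure 9 correspond to the following minimal PyTorch sketch (embed_dim and num_heads are illustrative assumptions, not the paper's settings): linear Q/K/V projections, scaled dot-product attention weights, weighted summation, and concatenation of heads.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiHeadSelfAttention(nn.Module):
    def __init__(self, embed_dim=64, num_heads=4):
        super().__init__()
        assert embed_dim % num_heads == 0
        self.h, self.d = num_heads, embed_dim // num_heads
        self.qkv = nn.Linear(embed_dim, 3 * embed_dim)  # linear transformations
        self.out = nn.Linear(embed_dim, embed_dim)      # projection after concat

    def forward(self, x):
        b, n, e = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        # Split into heads: (batch, heads, tokens, head_dim).
        q, k, v = (t.view(b, n, self.h, self.d).transpose(1, 2) for t in (q, k, v))
        weights = F.softmax(q @ k.transpose(-2, -1) / self.d ** 0.5, dim=-1)
        context = weights @ v                                # weighted summation
        context = context.transpose(1, 2).reshape(b, n, e)   # concatenate heads
        return self.out(context)

attn = MultiHeadSelfAttention()
y = attn(torch.randn(2, 10, 64))  # ten state-feature tokens per sample
```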
Figure 10. Value Network Architecture of IPPO. This depicts input through MLP with ReLU, multi-head self-attention, and final state value output. It shows how attention enhances state evaluation, supporting improved robustness and accuracy in path planning.
Figure 11. 3D simulation environment. The image displays three maps (Env 1–3) of 50 m × 50 m × 20 m with increasing obstacle complexity, with the start (red dot) and target (red star) marked. It visualizes the testbed, supporting the algorithm’s generalization across varying environmental challenges.
Figure 12. Average cumulative reward curve during training. The x-axis represents the number of training episodes, and the y-axis indicates the total dimensionless reward score. The curve is smoothed using a sliding window to highlight the overall convergence trend.
Figure 13. Success rate evaluation curve. The y-axis displays the success rate (ranging from 0 to 1), calculated as the ratio of successful arrivals to total attempts. The x-axis denotes the training progression in episodes.
Figure 14. Path planning results in Env 1. Trajectories of IPPO, PPO, and DDPG in a simple environment, with mode switches indicated. IPPO’s path balances length and energy, supporting lower consumption via more ground travel.
Figure 15. Path planning results in Env 2. Paths in the medium-complexity environment, showing IPPO’s efficient mode use. It achieves shorter flight segments, backing claims of energy optimization and adaptability as complexity increases.
Figure 16. Path planning results in Env 3. Trajectories in the high-complexity environment, with IPPO maintaining low energy consumption despite dense obstacles. This illustrates the algorithm’s robustness as environmental complexity increases.
Figure 17. Visual comparison of average rewards and success rates. Bar charts of rewards and success rates across the three environments. IPPO leads in all cases (e.g., 93% success in Env 1), confirming stable high performance and environmental adaptability as complexity rises.
Table 1. Comprehensive summary of simulation environment configuration.
Category | Parameters | Value/Description
Simulation Platform | Physics Engine | SOFA
Global Environment | Map Dimensions | 50 m × 50 m × 20 m
Global Environment | Obstacle Distribution | Spatially dispersed, locally clustered
Global Environment | Obstacle Density | 3–5%
Global Environment | Obstacle Geometry | Cylinders: radius 0.5 m–2.0 m; cuboids: edge length 1.0 m–3.0 m; walls and irregular shapes
Computational Setup | Hardware | CPU: Intel Core i7-14700KF; GPU: NVIDIA GeForce RTX 4060 Ti; RAM: 32 GB
Computational Setup | Software Stack | System: Ubuntu 20.04; Frameworks: Python 3.8, PyTorch 1.7.0
Table 2. Hyperparameter settings of the IPPO algorithm.
Parameters | Value
Training rounds | 4100
Reward discount factor | 0.995
Learning rate | 0.00025
Clipping range | 0.2
GAE factor | 0.95
Batch size | 256
Maximum steps per episode | 800
Optimizer | Adam
(η, λ, δ, μ) | (1.0, 5.0, 0.2, 100)
(k, ξ, β, σ) | (0.1, 0.1, 50, 0.5)
Table 3. Ablation experiment configuration (✓ = module enabled).
Methods | GRU | OU | Self-Attention
PPO | – | – | –
PPO_ATTEN | – | – | ✓
PPO_OU | – | ✓ | –
PPO_GRU | ✓ | – | –
IPPO (Ours) | ✓ | ✓ | ✓
Table 4. Comparative experimental results under three environments.
Env | Algorithm | Average Path Length/m | Average Flight Path Length/m | Average Ground Path Length/m | Average Energy Consumption/kJ
Env 1 | DDPG | 74.049 | 69.748 | 4.301 | 19.052
Env 1 | PPO | 76.041 | 72.536 | 3.505 | 19.769
Env 1 | IPPO (Ours) | 79.164 | 24.702 | 54.462 | 9.204
Env 2 | DDPG | 83.927 | 80.974 | 2.953 | 22.024
Env 2 | PPO | 72.087 | 69.675 | 2.412 | 18.945
Env 2 | IPPO (Ours) | 81.280 | 39.705 | 41.575 | 12.661
Env 3 | DDPG | 71.816 | 68.985 | 2.831 | 18.778
Env 3 | PPO | 73.086 | 62.639 | 10.447 | 17.416
Env 3 | IPPO (Ours) | 75.181 | 28.550 | 46.631 | 9.881
Table 4 details the quantitative performance of the three algorithms across the mixed environments. While several kinematic metrics are reported for completeness, the most critical differentiator is average energy consumption: by shifting much of the route from flight to ground locomotion, IPPO reduces energy use by roughly 33–53% relative to the PPO and DDPG baselines, at the cost of somewhat longer total paths.
Table 5. Average rewards and success rates in different environments.
Env | Algorithm | Average Rewards | Average Success Rates/%
Env 1 | DDPG | 122.4 | 82.6
Env 1 | PPO | 120.6 | 84.0
Env 1 | IPPO (Ours) | 162.2 | 93.0
Env 2 | DDPG | 112.6 | 70.5
Env 2 | PPO | 117.5 | 73.3
Env 2 | IPPO (Ours) | 154.1 | 89.0
Env 3 | DDPG | 103.6 | 66.0
Env 3 | PPO | 113.2 | 70.8
Env 3 | IPPO (Ours) | 152.3 | 85.7
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
