Previous Article in Journal
Gait Planning and Load-Bearing Capacity Analysis of Bionic Quadrupedal Robot Actuated by Water Hydraulic Artificial Muscles
Previous Article in Special Issue
Bioinspired Simultaneous Learning and Motion–Force Hybrid Control for Robotic Manipulators Under Multiple Constraints
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Global Path Planning for Land–Air Amphibious Biomimetic Robot Based on Improved PPO

1
The College of Artificial Intelligence and Robotics, Hunan University, Changsha 410082, China
2
Greater Bay Area Institute for Innovation, Hunan University, Guangzhou 511300, China
3
The College of Architecture and Urban Planning, Hunan University, Changsha 410082, China
*
Author to whom correspondence should be addressed.
Biomimetics 2026, 11(1), 25; https://doi.org/10.3390/biomimetics11010025 (registering DOI)
Submission received: 27 November 2025 / Revised: 21 December 2025 / Accepted: 30 December 2025 / Published: 1 January 2026

Abstract

To address the path planning challenges for land–air amphibious biomimetic robots in unstructured environments, this study proposes a global path planning algorithm based on an Improved Proximal Policy Optimization (IPPO) framework. Unlike traditional single-domain navigation, amphibious robots face significant kinematic discontinuities when switching between terrestrial and aerial modes. To mitigate this, we integrate a Gated Recurrent Unit (GRU) module into the policy network, enabling the agent to capture temporal dependencies and make smoother decisions during mode transitions. Furthermore, to enhance exploration efficiency and stability, we replace the standard Gaussian noise with Ornstein–Uhlenbeck (OU) noise, which generates temporally correlated actions aligned with the robot’s physical inertia. Additionally, a Multi-Head Self-Attention mechanism is introduced to the value network, allowing the agent to dynamically prioritize critical environmental features—such as narrow obstacles—over irrelevant background noise. The simulation results demonstrate that the proposed IPPO algorithm significantly outperforms standard PPO baselines, achieving higher convergence speed, improved path smoothness, and greater success rates in complex amphibious scenarios.
Keywords: land–air amphibious biomimetic robot; path planning; reinforcement learning; PPO; GRU land–air amphibious biomimetic robot; path planning; reinforcement learning; PPO; GRU

Share and Cite

MDPI and ACS Style

Jiang, W.; Liu, J.; Wang, W.; Wang, Y. Global Path Planning for Land–Air Amphibious Biomimetic Robot Based on Improved PPO. Biomimetics 2026, 11, 25. https://doi.org/10.3390/biomimetics11010025

AMA Style

Jiang W, Liu J, Wang W, Wang Y. Global Path Planning for Land–Air Amphibious Biomimetic Robot Based on Improved PPO. Biomimetics. 2026; 11(1):25. https://doi.org/10.3390/biomimetics11010025

Chicago/Turabian Style

Jiang, Weilai, Jingwei Liu, Wei Wang, and Yaonan Wang. 2026. "Global Path Planning for Land–Air Amphibious Biomimetic Robot Based on Improved PPO" Biomimetics 11, no. 1: 25. https://doi.org/10.3390/biomimetics11010025

APA Style

Jiang, W., Liu, J., Wang, W., & Wang, Y. (2026). Global Path Planning for Land–Air Amphibious Biomimetic Robot Based on Improved PPO. Biomimetics, 11(1), 25. https://doi.org/10.3390/biomimetics11010025

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop