Next Article in Journal
Artificial Neural Network-Based Conveying Object Measurement Automation System Using Distance Sensor
Previous Article in Journal
A Dual-Purpose Biomedical Measurement System for the Evaluation of Real-Time Correlations Between Blood Pressure and Breathing Parameters
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Pose-Perceptive Convolution: Learning Geometry-Adaptive Receptive Fields for Robust 6D Pose Estimation

1
College of Information, Mechanical and Electrical Engineering, Shanghai Normal University, Shanghai 201418, China
2
School of Computer Science and Technology, Tongji University, Shanghai 201804, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Sensors 2026, 26(2), 453; https://doi.org/10.3390/s26020453
Submission received: 10 November 2025 / Revised: 28 December 2025 / Accepted: 7 January 2026 / Published: 9 January 2026

Abstract

6D object pose estimation is crucial for applications such as robotic manipulation and augmented reality, yet it remains highly challenging when dealing with objects of significantly different aspect ratios or the drastic appearance variations of a single object caused by pose changes. Most existing methods focus on designing more complex backend fusion modules, while largely overlooking a fundamental problem at the feature extraction frontend: the geometric mismatch between the fixed, square receptive fields of standard convolutions and the varied projected morphologies of objects. This mismatch, along with noise in fused features and ambiguity in regression, limits the performance ceiling of current methods. To this end, this paper proposes a novel Pose-Perceptive Convolution (PPC) and constructs a new Pose-Perceptive Fusion Network (PPF-Net). Its core component, the Pose-Perceptive Convolution, fundamentally resolves the aforementioned geometric mismatch by dynamically adapting the shape and sampling density of its receptive field. Experiments on four benchmarks show that PPF-Net improves the VSD score by 19.4% over FFB6D on MP6D, and achieves 96.7% ADD-S on YCB-Video, approaching state-of-the-art accuracy. Crucially, these gains are realized with minimal computational overhead, avoiding the heavy latency of backend-intensive approaches. This validates that frontend feature extraction is an efficient strategy for robust 6D pose estimation.
Keywords: 6D pose estimation; RGB-depth fusion; geometric mismatch; receptive-field adaptation 6D pose estimation; RGB-depth fusion; geometric mismatch; receptive-field adaptation

Share and Cite

MDPI and ACS Style

Lai, Y.; Song, Y.; Zhang, Q.; Wang, Y.; An, K.; Zhang, H. Pose-Perceptive Convolution: Learning Geometry-Adaptive Receptive Fields for Robust 6D Pose Estimation. Sensors 2026, 26, 453. https://doi.org/10.3390/s26020453

AMA Style

Lai Y, Song Y, Zhang Q, Wang Y, An K, Zhang H. Pose-Perceptive Convolution: Learning Geometry-Adaptive Receptive Fields for Robust 6D Pose Estimation. Sensors. 2026; 26(2):453. https://doi.org/10.3390/s26020453

Chicago/Turabian Style

Lai, Yi, Yaqing Song, Qixian Zhang, Yue Wang, Kang An, and Hui Zhang. 2026. "Pose-Perceptive Convolution: Learning Geometry-Adaptive Receptive Fields for Robust 6D Pose Estimation" Sensors 26, no. 2: 453. https://doi.org/10.3390/s26020453

APA Style

Lai, Y., Song, Y., Zhang, Q., Wang, Y., An, K., & Zhang, H. (2026). Pose-Perceptive Convolution: Learning Geometry-Adaptive Receptive Fields for Robust 6D Pose Estimation. Sensors, 26(2), 453. https://doi.org/10.3390/s26020453

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop