Dynamic Pose Estimation Using Multiple RGB-D Cameras
AbstractHuman poses are difficult to estimate due to the complicated body structure and the self-occlusion problem. In this paper, we introduce a marker-less system for human pose estimation by detecting and tracking key body parts, namely the head, hands, and feet. Given color and depth images captured by multiple red, green, blue, and depth (RGB-D) cameras, our system constructs a graph model with segmented regions from each camera and detects the key body parts as a set of extreme points based on accumulative geodesic distances in the graph. During the search process, local detection using a supervised learning model is utilized to match local body features. A final set of extreme points is selected with a voting scheme and tracked with physical constraints from the unified data received from the multiple cameras. During the tracking process, a Kalman filter-based method is introduced to reduce positional noises and to recover from a failure of tracking extremes. Our system shows an average of 87% accuracy against the commercial system, which outperforms the previous multi-Kinects system, and can be applied to recognize a human action or to synthesize a motion sequence from a few key poses using a small set of extremes as input data. View Full-Text
Externally hosted supplementary file 1
Description: Experimental Results Video
Share & Cite This Article
Hong, S.; Kim, Y. Dynamic Pose Estimation Using Multiple RGB-D Cameras. Sensors 2018, 18, 3865.
Hong S, Kim Y. Dynamic Pose Estimation Using Multiple RGB-D Cameras. Sensors. 2018; 18(11):3865.Chicago/Turabian Style
Hong, Sungjin; Kim, Yejin. 2018. "Dynamic Pose Estimation Using Multiple RGB-D Cameras." Sensors 18, no. 11: 3865.
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.