Authors:
Thomas Schnürer
1
;
Stefan Fuchs
2
;
Markus Eisenbach
3
and
Horst-Michael Groß
3
Affiliations:
1
Neuroinformatics and Cognitive Robotics Lab, Ilmenau University of Technology, 98684 Ilmenau, Germany, Honda Research Institute Europe GmbH, 63073 Offenbach/Main and Germany
;
2
Honda Research Institute Europe GmbH, 63073 Offenbach/Main and Germany
;
3
Neuroinformatics and Cognitive Robotics Lab, Ilmenau University of Technology, 98684 Ilmenau and Germany
Keyword(s):
Real-time 3D Joint Estimation, Human-Robot-Interaction, Deep Learning.
Related
Ontology
Subjects/Areas/Topics:
Applications
;
Computer Vision, Visualization and Computer Graphics
;
Geometry and Modeling
;
Image-Based Modeling
;
Pattern Recognition
;
Robotics
;
Software Engineering
Abstract:
To allow for safe Human-Robot-Interaction in industrial scenarios like manufacturing plants, it is essential to always be aware of the location and pose of humans in the shared workspace. We introduce a real-time 3D pose estimation system using single depth images that is aimed to run on limited hardware, such as a mobile robot. For this, we optimized a CNN-based 2D pose estimation architecture to achieve high frame rates while simultaneously requiring fewer resources. Building upon this architecture, we extended the system for 3D estimation to directly predict Cartesian body joint coordinates. We evaluated our system on a newly created dataset by applying it to a specific industrial workbench scenario. The results show that our system’s performance is competitive to the state of the art at more than five times the speed for single person pose estimation.