Regression-based 3D Hand Pose Estimation using Heatmaps

Chaitanya Bandi, Ulrike Thomas

2020

Abstract

3D hand pose estimation is a challenging problem in human-machine interaction applications. We introduce a simple and effective approach for 3D hand pose estimation in grasping scenarios taking advantage of a low-cost RGB-D camera. 3D hand pose estimation plays a major role in an environment where objects are handed over between the human and robot hand to avoid collisions and to collaborate in shared workspaces. We consider Convolutional Neural Networks (CNNs) to determine a solution to our challenge. The idea of cascaded CNNs is very appropriate for real-time applications. In the paper, we introduce an architecture for direct 3D normalized coordinates regression and a small-scale dataset for human-machine interaction applications. In a cascaded network, the first network minimizes the search space, then the second network is trained within the confined region to detect more accurate 2D heatmaps of joint’s locations. Finally, 3D normalized joints are regressed directly on RGB images and depth maps can lift normalized coordinates to camera coordinates.

Download


Paper Citation


in Harvard Style

Bandi C. and Thomas U. (2020). Regression-based 3D Hand Pose Estimation using Heatmaps. In Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2020) - Volume 5: VISAPP; ISBN 978-989-758-402-2, SciTePress, pages 636-643. DOI: 10.5220/0008973206360643


in Bibtex Style

@conference{visapp20,
author={Chaitanya Bandi and Ulrike Thomas},
title={Regression-based 3D Hand Pose Estimation using Heatmaps},
booktitle={Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2020) - Volume 5: VISAPP},
year={2020},
pages={636-643},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008973206360643},
isbn={978-989-758-402-2},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2020) - Volume 5: VISAPP
TI - Regression-based 3D Hand Pose Estimation using Heatmaps
SN - 978-989-758-402-2
AU - Bandi C.
AU - Thomas U.
PY - 2020
SP - 636
EP - 643
DO - 10.5220/0008973206360643
PB - SciTePress