Multitask Metamodel for Keypoint Visibility Prediction in Human Pose Estimation

Romain Guesdon; Carlos Crispim-Junior; Laure Tougne

doi:10.5220/0010831200003124

2022

Abstract

Human pose estimation (HPE) aims to predict the coordinates of body keypoints in images. Although current methods achieve high performance on HPE, some difficulties remain. For instance, strong occlusion can deceive these methods and lead them to predict false-positive keypoints with high confidence. This is problematic in applications that require reliable detection, such as posture analysis in car-safety applications. Despite this difficulty, current HPE solutions are designed to always predict coordinates for every keypoint. To address this problem, we propose a new metamodel that predicts both keypoint coordinates and their visibility. Visibility is an attribute that indicates whether a keypoint is visible, non-visible, or not labeled. Our model is composed of three modules: feature extraction, coordinate estimation, and visibility prediction. In this paper, we study the performance of the visibility predictions and the impact of this task on coordinate estimation. Baseline results are provided on the COCO dataset. Moreover, to measure the performance of this method in a more heavily occluded context, we also use the driver dataset DriPE. Finally, we implement the proposed metamodel on several base models to demonstrate its generality.
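
The three-module structure described in the abstract can be illustrated with a minimal, dependency-free Python sketch. It is not the authors' implementation: the class `MultitaskMetamodel`, the helper `keep_visible`, and the toy lambdas standing in for the networks are all hypothetical names introduced here. The only external convention assumed is the COCO visibility flag (0 = not labeled, 1 = labeled but not visible, 2 = visible). Discarding non-visible keypoints is shown as one possible downstream use of the visibility output, not as a step the paper prescribes.

```python
from dataclasses import dataclass
from enum import IntEnum
from typing import Callable, List, Optional, Sequence, Tuple


class Visibility(IntEnum):
    """Keypoint visibility flag, following the COCO annotation convention."""
    NOT_LABELED = 0
    NOT_VISIBLE = 1
    VISIBLE = 2


Features = Sequence[float]
Coords = List[Tuple[float, float]]


@dataclass
class MultitaskMetamodel:
    """Hypothetical wrapper: one shared feature extractor feeding two task heads."""
    extract_features: Callable[[Sequence[float]], Features]
    estimate_coords: Callable[[Features], Coords]
    predict_visibility: Callable[[Features], List[Visibility]]

    def __call__(self, image: Sequence[float]) -> Tuple[Coords, List[Visibility]]:
        # Both heads consume the same features, which is what makes this multitask.
        features = self.extract_features(image)
        return self.estimate_coords(features), self.predict_visibility(features)


def keep_visible(
    coords: Coords, visibility: List[Visibility]
) -> List[Optional[Tuple[float, float]]]:
    """Replace every keypoint not predicted as visible with None."""
    return [xy if v == Visibility.VISIBLE else None
            for xy, v in zip(coords, visibility)]


# Toy stand-ins for the three modules (assumptions, not real networks).
model = MultitaskMetamodel(
    extract_features=lambda image: [sum(image) / len(image)],
    estimate_coords=lambda f: [(f[0], f[0]), (2 * f[0], 0.0)],
    predict_visibility=lambda f: [Visibility.VISIBLE, Visibility.NOT_VISIBLE],
)

coords, visibility = model([1.0, 2.0, 3.0])
filtered = keep_visible(coords, visibility)
```

Because the wrapper only requires three callables, any base model that exposes a feature extractor and a coordinate head could in principle be plugged in, which mirrors the "metamodel" idea of applying the same scheme to several base models.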

Paper Citation

in Harvard Style

Guesdon R., Crispim-Junior C. and Tougne L. (2022). Multitask Metamodel for Keypoint Visibility Prediction in Human Pose Estimation. In Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2022) - Volume 5: VISAPP; ISBN 978-989-758-555-5, SciTePress, pages 428-436. DOI: 10.5220/0010831200003124

in Bibtex Style

@conference{visapp22,
author={Romain Guesdon and Carlos Crispim-Junior and Laure Tougne},
title={Multitask Metamodel for Keypoint Visibility Prediction in Human Pose Estimation},
booktitle={Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2022) - Volume 5: VISAPP},
year={2022},
pages={428-436},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010831200003124},
isbn={978-989-758-555-5},
}

in EndNote Style

TY - CONF
JO - Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2022) - Volume 5: VISAPP
TI - Multitask Metamodel for Keypoint Visibility Prediction in Human Pose Estimation
SN - 978-989-758-555-5
AU - Guesdon R.
AU - Crispim-Junior C.
AU - Tougne L.
PY - 2022
SP - 428
EP - 436
DO - 10.5220/0010831200003124
PB - SciTePress