loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock
Seg2Pose: Pose Estimations from Instance Segmentation Masks in One or Multiple Views for Traffic Applications

Topics: 3D Deep Learning; Camera Networks and Vision; Deep Learning for Tracking; Deep Learning for Visual Understanding ; Machine Learning Technologies for Vision; Object Detection and Localization; Stereo Vision and Structure from Motion; Tracking and Visual Navigation; Video Surveillance and Event Detection

Authors: Martin Ahrnbom ; Ivar Persson and Mikael Nilsson

Affiliation: Centre for Mathematical Sciences, Lund University, Sweden

Keyword(s): Pose Estimation, Instance Segmentation, Convolutional Neural Network, Traffic Safety, Road Users, Tracking, Stereo Camera, Trinocular Camera Array, Traffic Surveillance.

Abstract: A system we denote Seg2Pose is presented which converts pixel coordinate tracks, represented by instance segmentation masks across multiple video frames, into world coordinate pose tracks, for road users seen by static surveillance cameras. The road users are bound to a ground surface represented by a number of 3D points and does not necessarily have to be perfectly flat. The system works with one or more views, by using a late fusion scheme. An approximate position, denoted the normal position, is computed from the camera calibration, per-class default heights and the ground surface model. The position is then refined a novel Convolutional Neural Network we denote Seg2PoseNet, taking instance segmentations and cropping positioning as its input. We evaluate this system quantitatively both on synthetic data from CARLA Simulator and on a real recording from a trinocular camera. The system outperforms the baseline method of only using the normal positions, which is roughly equivalent of a typical 2D to 3D conversion system, in both datasets. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.145.91.108

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Ahrnbom, M., Persson, I. and Nilsson, M. (2022). Seg2Pose: Pose Estimations from Instance Segmentation Masks in One or Multiple Views for Traffic Applications. In Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2022) - Volume 5: VISAPP; ISBN 978-989-758-555-5; ISSN 2184-4321, SciTePress, pages 777-784. DOI: 10.5220/0010777700003124

@conference{visapp22,
author={Martin Ahrnbom and Ivar Persson and Mikael Nilsson},
title={Seg2Pose: Pose Estimations from Instance Segmentation Masks in One or Multiple Views for Traffic Applications},
booktitle={Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2022) - Volume 5: VISAPP},
year={2022},
pages={777-784},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010777700003124},
isbn={978-989-758-555-5},
issn={2184-4321},
}

TY - CONF

JO - Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2022) - Volume 5: VISAPP
TI - Seg2Pose: Pose Estimations from Instance Segmentation Masks in One or Multiple Views for Traffic Applications
SN - 978-989-758-555-5
IS - 2184-4321
AU - Ahrnbom, M.
AU - Persson, I.
AU - Nilsson, M.
PY - 2022
SP - 777
EP - 784
DO - 10.5220/0010777700003124
PB - SciTePress