Combined Depth and Semantic Segmentation from Synthetic Data and a W-Net Architecture

Kevin Swingler, Teri Rumble, Ross Goutcher, Paul Hibbard, Mark Donoghue, Dan Harvey

2024

Abstract

Monocular pixel level depth estimation requires an algorithm to label every pixel in an image with its estimated distance from the camera. The task is more challenging than binocular depth estimation, where two cameras fixed a small distance apart are used. Algorithms that combine depth estimation with pixel level semantic segmentation show improved performance but present the practical challenge of requiring a dataset that is annotated at pixel level with both class labels and depth values. This paper presents a new convolutional neural network architecture capable of simultaneous monocular depth estimation and semantic segmentation and shows how synthetic data generated using computer games technology can be used to train such models. The algorithm performs at over 98% accuracy on the segmentation task and 88% on the depth estimation task.

Download


Paper Citation


in Harvard Style

Swingler K., Rumble T., Goutcher R., Hibbard P., Donoghue M. and Harvey D. (2024). Combined Depth and Semantic Segmentation from Synthetic Data and a W-Net Architecture. In Proceedings of the 16th International Joint Conference on Computational Intelligence - Volume 1: NCTA; ISBN 978-989-758-721-4, SciTePress, pages 413-422. DOI: 10.5220/0012877500003837


in Bibtex Style

@conference{ncta24,
author={Kevin Swingler and Teri Rumble and Ross Goutcher and Paul Hibbard and Mark Donoghue and Dan Harvey},
title={Combined Depth and Semantic Segmentation from Synthetic Data and a W-Net Architecture},
booktitle={Proceedings of the 16th International Joint Conference on Computational Intelligence - Volume 1: NCTA},
year={2024},
pages={413-422},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012877500003837},
isbn={978-989-758-721-4},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 16th International Joint Conference on Computational Intelligence - Volume 1: NCTA
TI - Combined Depth and Semantic Segmentation from Synthetic Data and a W-Net Architecture
SN - 978-989-758-721-4
AU - Swingler K.
AU - Rumble T.
AU - Goutcher R.
AU - Hibbard P.
AU - Donoghue M.
AU - Harvey D.
PY - 2024
SP - 413
EP - 422
DO - 10.5220/0012877500003837
PB - SciTePress