Authors:
Benedikt Ortelt
1
;
Christian Herrmann
2
;
Dieter Willersinn
3
and
Jürgen Beyerer
2
Affiliations:
1
Robert Bosch GmbH, Germany
;
2
Fraunhofer IOSB and Karlsruhe Institute of Technology KIT, Germany
;
3
Fraunhofer IOSB, Germany
Keyword(s):
Instance Segmentation, Multi-scale Analysis, Foveated Imaging, Cityscapes.
Related
Ontology
Subjects/Areas/Topics:
Computer Vision, Visualization and Computer Graphics
;
Image and Video Analysis
;
Segmentation and Grouping
Abstract:
Instance-based semantic labeling is an important task for the interpretation of images in the area of autonomous or assisted driving applications. Not only indicating the semantic class for each pixel of an image, but also separating different instances from the same class, even if neighboring in the image, it can replace a multi-class object detector. In addition, it offers a better localization of objects in the image by replacing the object detector bounding box with a fine-grained object shape.
The recently presented Cityscapes dataset promoted this topic by offering a large set of data labeled on pixel level.
Building on the previous work of \cite{uhrig2016b}, this work proposes two improvements compared to this baseline strategy leading to significant performance improvements. First, a better distance measure for angular differences, which is unaffected by the $-\pi/\pi$ discontinuity, is proposed. This leads to improved object center localization. Second, the imagery from veh
icle perspective includes a fixed vanishing point. A foveal concept counteracts the fact that objects get smaller in the image towards this point. This strategy especially improves the results for small objects in large distances from the vehicle.
(More)