Data for Image Recognition Tasks: An Efficient Tool for Fine-Grained Annotations
Marco Filax, Tim Gonschorek, Frank Ortmeier
2019
Abstract
Using large datasets is essential for machine learning. In practice, training a machine learning algorithm requires hundreds of samples. Multiple off-the-shelf datasets from the scientific domain exist to benchmark new approaches. However, when machine learning algorithms transit to industry, e.g., for a particular image classification problem, hundreds of specific purpose images are collected and annotated in laborious manual work. In this paper, we present a novel system to decrease the effort of annotating those large image sets. Therefore, we generate 2D bounding boxes from minimal 3D annotations using the known location and orientation of the camera. We annotate a particular object of interest in 3D once and project these annotations on to every frame of a video stream. The proposed approach is designed to work with off-the-shelf hardware. We demonstrate its applicability with an example from the real world. We generated a more extensive dataset than available in other works for a particular industrial use case: fine-grained recognition of items within grocery stores. Further, we make our dataset available to the interested vision community consisting of over 60,000 images. Some images were taken under ideal conditions for training while others were taken with the proposed approach in the wild.
DownloadPaper Citation
in Harvard Style
Filax M., Gonschorek T. and Ortmeier F. (2019). Data for Image Recognition Tasks: An Efficient Tool for Fine-Grained Annotations.In Proceedings of the 8th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-758-351-3, pages 900-907. DOI: 10.5220/0007688709000907
in Bibtex Style
@conference{icpram19,
author={Marco Filax and Tim Gonschorek and Frank Ortmeier},
title={Data for Image Recognition Tasks: An Efficient Tool for Fine-Grained Annotations},
booktitle={Proceedings of the 8th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,},
year={2019},
pages={900-907},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007688709000907},
isbn={978-989-758-351-3},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 8th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,
TI - Data for Image Recognition Tasks: An Efficient Tool for Fine-Grained Annotations
SN - 978-989-758-351-3
AU - Filax M.
AU - Gonschorek T.
AU - Ortmeier F.
PY - 2019
SP - 900
EP - 907
DO - 10.5220/0007688709000907