Learning Transformation Invariant Representations with Weak Supervision

Benjamin Coors, Alexandru Condurache, Alfred Mertins, Andreas Geiger

2018

Abstract

Deep convolutional neural networks are the current state-of-the-art solution to many computer vision tasks. However, their ability to handle large global and local image transformations is limited. Consequently, extensive data augmentation is often utilized to incorporate prior knowledge about desired invariances to geometric transformations such as rotations or scale changes. In this work, we combine data augmentation with an unsupervised loss which enforces similarity between the predictions of augmented copies of an input sample. Our loss acts as an effective regularizer which facilitates the learning of transformation invariant representations. We investigate the effectiveness of the proposed similarity loss on rotated MNIST and the German Traffic Sign Recognition Benchmark (GTSRB) in the context of different classification models including ladder networks. Our experiments demonstrate improvements with respect to the standard data augmentation approach for supervised and semi-supervised learning tasks, in particular in the presence of little annotated data. In addition, we analyze the performance of the proposed approach with respect to its hyperparameters, including the strength of the regularization as well as the layer where representation similarity is enforced.

Download


Paper Citation


in Harvard Style

Coors B., Condurache A., Mertins A. and Geiger A. (2018). Learning Transformation Invariant Representations with Weak Supervision. In Proceedings of the 13th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2018) - Volume 5: VISAPP; ISBN 978-989-758-290-5, SciTePress, pages 64-72. DOI: 10.5220/0006549000640072


in Bibtex Style

@conference{visapp18,
author={Benjamin Coors and Alexandru Condurache and Alfred Mertins and Andreas Geiger},
title={Learning Transformation Invariant Representations with Weak Supervision},
booktitle={Proceedings of the 13th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2018) - Volume 5: VISAPP},
year={2018},
pages={64-72},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006549000640072},
isbn={978-989-758-290-5},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 13th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2018) - Volume 5: VISAPP
TI - Learning Transformation Invariant Representations with Weak Supervision
SN - 978-989-758-290-5
AU - Coors B.
AU - Condurache A.
AU - Mertins A.
AU - Geiger A.
PY - 2018
SP - 64
EP - 72
DO - 10.5220/0006549000640072
PB - SciTePress