Paper: Weakly-supervised Localization of Multiple Objects in Images using Cosine Loss

Authors: Björn Barz and Joachim Denzler

Affiliation: Computer Vision Group, Friedrich Schiller University Jena, Jena, Germany

Keyword(s): Weakly-supervised Localization, Class Activation Maps, Dense Class Maps, Cosine Loss, Object Detection.

Abstract: Can we learn to localize objects in images from just image-level class labels? Previous research has shown that this ability can be added to convolutional neural networks (CNNs) trained for image classification post hoc without additional cost or effort using so-called class activation maps (CAMs). However, while CAMs can localize a particular known class in the image quite accurately, they cannot detect and localize instances of multiple different classes in a single image. This limitation is a consequence of the missing comparability of prediction scores between classes, which results from training with the cross-entropy loss after a softmax activation. We find that CNNs trained with the cosine loss instead of cross-entropy do not exhibit this limitation and propose a variation of CAMs termed Dense Class Maps (DCMs) that fuse predictions for multiple classes into a coarse semantic segmentation of the scene. Even though the network has only been trained for single-label classification at the image level, DCMs allow for detecting the presence of multiple objects in an image and locating them. Our approach outperforms CAMs on the MS COCO object detection dataset by a relative increase of 27% in mean average precision.
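
To make the ideas in the abstract concrete, here is a minimal sketch in plain Python. It assumes the common formulation of the cosine loss against a one-hot target (one minus the cosine similarity) and the standard CAM construction (classifier weights applied to the final feature maps at every location). The fusion step `fuse_maps` and its `threshold` parameter are hypothetical illustrations of how per-class maps could be merged into a coarse label map; the abstract does not spell out the actual DCM procedure, so this should not be read as the authors' method.

```python
import math


def cosine_loss(scores, target):
    """One minus the cosine similarity between `scores` and a one-hot
    vector for class index `target` (assumed formulation)."""
    norm = math.sqrt(sum(s * s for s in scores))
    if norm == 0.0:
        return 1.0  # zero vector: treat similarity as 0
    return 1.0 - scores[target] / norm


def class_activation_maps(features, weights):
    """Standard CAM: features holds K channels of size H x W, weights is a
    C x K classifier matrix. Returns C maps of size H x W where
    map[c][i][j] = sum_k weights[c][k] * features[k][i][j]."""
    height, width = len(features[0]), len(features[0][0])
    return [
        [
            [sum(w * features[k][i][j] for k, w in enumerate(class_weights))
             for j in range(width)]
            for i in range(height)
        ]
        for class_weights in weights
    ]


def fuse_maps(maps, threshold):
    """Hypothetical fusion: at each location pick the class with the highest
    score, or -1 (background) if that score is below `threshold`."""
    height, width = len(maps[0]), len(maps[0][0])
    fused = []
    for i in range(height):
        row = []
        for j in range(width):
            scores = [m[i][j] for m in maps]
            best = max(range(len(scores)), key=scores.__getitem__)
            row.append(best if scores[best] >= threshold else -1)
        fused.append(row)
    return fused


# Toy example: 2 feature channels on a 2 x 2 grid, 2 classes.
features = [
    [[1.0, 0.0], [0.0, 0.0]],  # channel 0 fires top-left
    [[0.0, 0.0], [0.0, 1.0]],  # channel 1 fires bottom-right
]
weights = [[1.0, 0.0], [0.0, 1.0]]  # class c reads channel c only
maps = class_activation_maps(features, weights)
fused = fuse_maps(maps, threshold=0.5)
```

Taking a per-location maximum across classes only makes sense when the scores of different classes are comparable, which is the property the abstract attributes to training with the cosine loss rather than softmax cross-entropy.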

CC BY-NC-ND 4.0

Paper citation in several formats:
Barz, B. and Denzler, J. (2022). Weakly-supervised Localization of Multiple Objects in Images using Cosine Loss. In Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2022) - Volume 5: VISAPP; ISBN 978-989-758-555-5; ISSN 2184-4321, SciTePress, pages 287-296. DOI: 10.5220/0010760800003124

@conference{visapp22,
author={Björn Barz and Joachim Denzler},
title={Weakly-supervised Localization of Multiple Objects in Images using Cosine Loss},
booktitle={Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2022) - Volume 5: VISAPP},
year={2022},
pages={287-296},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010760800003124},
isbn={978-989-758-555-5},
issn={2184-4321},
}

TY - CONF

JO - Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2022) - Volume 5: VISAPP
TI - Weakly-supervised Localization of Multiple Objects in Images using Cosine Loss
SN - 978-989-758-555-5
IS - 2184-4321
AU - Barz, B.
AU - Denzler, J.
PY - 2022
SP - 287
EP - 296
DO - 10.5220/0010760800003124
PB - SciTePress