On the Influence of Superpixel Methods for Image Parsing

Johann Strassburg; Rene Grzeszick; Leonard Rothacker; Gernot A. Fink

doi:10.5220/0005355705180527

On the Influence of Superpixel Methods for Image Parsing

Johann Strassburg, Rene Grzeszick, Leonard Rothacker, Gernot A. Fink

2015

Abstract

Image parsing describes a very fine grained analysis of natural scene images, where each pixel is assigned a label describing the object or part of the scene it belongs to. This analysis is a keystone to a wide range of applications that could benefit from detailed scene understanding, such as keyword based image search, sentence based image or video descriptions and even autonomous cars or robots. State-of-the art approaches in image parsing are data-driven and allow for recognizing arbitrary categories based on a knowledge transfer from similar images. As transferring labels on pixel level is tedious and noisy, more recent approaches build on the idea of segmenting a scene and transferring the information based on regions. For creating these regions the most popular approaches rely on over-segmenting the scene into superpixels. In this paper the influence of different superpixel methods will be evaluated within the well known Superparsing framework. Furthermore, a new method that computes a superpixel-like over-segmentation of an image is presented that computes regions based on edge-avoiding wavelets. The evaluation on the SIFT Flow and Barcelona dataset will show that the choice of the superpixel method is crucial for the performance of image parsing.

References

Achanta, R., Shaji, A., Smith, K., Lucchi, A., Fua, P., and Süsstrunk, S. (2012). SLIC superpixels compared to state-of-the-art superpixel methods. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(11):2274-282.
Badino, H., Franke, U., and Pfeiffer, D. (2009). The stixel world-a compact medium level representation of the 3d-world. In Pattern Recognition, pages 51- 60. Springer.
Comaniciu, D. and Meer, P. (2002). Mean Shift: A Robust Approach Toward Feature Space Analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(5):603-619.
Farabet, C., Couprie, C., Najman, L., and LeCun, Y. (2013). Learning hierarchical features for scene labeling. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(8):1915-1929.
Fattal, R. (2009). Edge-avoiding wavelets and their applications. ACM Transactions on Graphics (TOG), 28(3):22.
Felzenszwalb, P. F. and Huttenlocher, D. P. (2004). Efficient graph-based image segmentation. International Journal of Computer Vision, 59(2):167-181.
Fredman, M. L. and Willard, D. E. (1994). Transdichotomous algorithms for minimum spanning trees and shortest paths. Journal of Computer and System Sciences, 48(3):533-551.
Gonzalez, R. C. and Woods, R. E. (2002). Digital image processing. Prentice Hall.
Liu, C., Yuen, J., and Torralba, A. (2011). Nonparametric scene parsing via label transfer. IEEE Transactions on Pattern Analysis and Machine Intelligence, 33(12):2368-2382.
Neubert, P. and Protzel, P. (2012). Superpixel benchmark and comparison. In Proc. Forum Bildverarbeitung.
Sweldens, W. (1998). The lifting scheme: A construction of second generation wavelets. SIAM Journal on Mathematical Analysis, 29(2):511-546.
Tighe, J. and Lazebnik, S. (2010). Superparsing: Scalable nonparametric image parsing with superpixels. In Proc. European Conference on Computer Vision (ECCV), pages 352-365. Springer.
Tighe, J. and Lazebnik, S. (2013a). Finding things: Image parsing with regions and per-exemplar detectors. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), pages 3001-3008. IEEE.
Tighe, J. and Lazebnik, S. (2013b). Superparsing. International Journal of Computer Vision (IJCV), 101(2):329-349.
Uytterhoeven, G. and Bultheel, A. (1997). The red-black wavelet transform. TW Reports.
Vedaldi, A. and Soatto, S. (2008). Quick shift and kernel methods for mode seeking. In Computer VisionECCV 2008, pages 705-718. Springer.

Download

Paper Citation

in Harvard Style

Strassburg J., Grzeszick R., Rothacker L. and Fink G. (2015). On the Influence of Superpixel Methods for Image Parsing . In Proceedings of the 10th International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2015) ISBN 978-989-758-090-1, pages 518-527. DOI: 10.5220/0005355705180527

in Bibtex Style

@conference{visapp15,
author={Johann Strassburg and Rene Grzeszick and Leonard Rothacker and Gernot A. Fink},
title={On the Influence of Superpixel Methods for Image Parsing},
booktitle={Proceedings of the 10th International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2015)},
year={2015},
pages={518-527},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005355705180527},
isbn={978-989-758-090-1},
}

in EndNote Style

TY - CONF
JO - Proceedings of the 10th International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2015)
TI - On the Influence of Superpixel Methods for Image Parsing
SN - 978-989-758-090-1
AU - Strassburg J.
AU - Grzeszick R.
AU - Rothacker L.
AU - Fink G.
PY - 2015
SP - 518
EP - 527
DO - 10.5220/0005355705180527