Benchmarking Binarisation Techniques for 2D Fiducial Marker Tracking

Yves Rangoni, Eric Ras

Abstract

This paper presents a comparative study of binarisation techniques for 2D fiducial marker tracking. The application domain is object recognition for Tangible User Interfaces (TUIs) built on a tabletop. In this setting, the common approach is to attach markers to the objects and identify them with camera-based pattern recognition. Among the operations that lead to good recognition of these markers, binarisation of the greyscale image is the most critical. We investigate how this step can be improved not only in terms of quality but also in terms of computational efficiency. State-of-the-art thresholding techniques are benchmarked on this challenging task. A real-world tabletop TUI is used to perform an objective, goal-oriented evaluation through the ReacTIVision framework. A computationally efficient implementation of one of the best window-based thresholders is proposed to meet the real-time constraints of video stream processing. The experimental results show that selecting the right thresholder instead of the embedded method can improve the fiducial tracking recognition rate by up to 10 points, while being more robust and remaining time-efficient.
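
To make the idea of a window-based thresholder concrete, here is a minimal sketch in Python. The abstract does not name the thresholder it implements, so the Sauvola rule used here, the integral-image speed-up, the function names, and the default values of `radius`, `k` and `r` are all assumptions chosen for illustration, not the authors' implementation. The threshold at each pixel is `mean * (1 + k * (std / r - 1))`, where `mean` and `std` are computed over a square window around the pixel. Two summed-area tables give each window's sum and sum of squares in constant time, so the per-pixel cost does not depend on the window size.

```python
import math


def integral_images(img):
    """Return summed-area tables of pixel values and squared values.

    Both tables have one extra row and column of zeros so that window
    sums need no special case at the image border.
    """
    h, w = len(img), len(img[0])
    s1 = [[0] * (w + 1) for _ in range(h + 1)]
    s2 = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row1 = row2 = 0
        for x in range(w):
            v = img[y][x]
            row1 += v
            row2 += v * v
            s1[y + 1][x + 1] = s1[y][x + 1] + row1
            s2[y + 1][x + 1] = s2[y][x + 1] + row2
    return s1, s2


def window_sum(table, y0, x0, y1, x1):
    """Sum over rows y0..y1-1 and columns x0..x1-1 in constant time."""
    return table[y1][x1] - table[y0][x1] - table[y1][x0] + table[y0][x0]


def sauvola_binarise(img, radius=7, k=0.5, r=128.0):
    """Binarise a greyscale image given as rows of ints in 0..255.

    Returns rows of 0 (dark pixel) and 1 (light pixel).
    radius, k and r are illustrative defaults, not tuned values.
    """
    h, w = len(img), len(img[0])
    s1, s2 = integral_images(img)
    out = [[1] * w for _ in range(h)]
    for y in range(h):
        # Clip the window to the image so border pixels use fewer samples.
        y0, y1 = max(0, y - radius), min(h, y + radius + 1)
        for x in range(w):
            x0, x1 = max(0, x - radius), min(w, x + radius + 1)
            n = (y1 - y0) * (x1 - x0)
            mean = window_sum(s1, y0, x0, y1, x1) / n
            var = window_sum(s2, y0, x0, y1, x1) / n - mean * mean
            std = math.sqrt(max(var, 0.0))  # guard against rounding below 0
            threshold = mean * (1.0 + k * (std / r - 1.0))
            if img[y][x] <= threshold:
                out[y][x] = 0
    return out
```

Calling `sauvola_binarise(frame, radius=2)` on a small frame with a dark block on a light background marks the block's pixels as 0 and the rest as 1. In a real-time setting the same tables would be rebuilt for every video frame, which costs one pass over the image.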


Paper Citation


in Harvard Style

Rangoni Y. and Ras E. (2014). Benchmarking Binarisation Techniques for 2D Fiducial Marker Tracking. In Proceedings of the 3rd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-758-018-5, pages 616-623. DOI: 10.5220/0004820706160623


in Bibtex Style

@conference{icpram14,
author={Yves Rangoni and Eric Ras},
title={Benchmarking Binarisation Techniques for 2D Fiducial Marker Tracking},
booktitle={Proceedings of the 3rd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM},
year={2014},
pages={616-623},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004820706160623},
isbn={978-989-758-018-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 3rd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM
TI - Benchmarking Binarisation Techniques for 2D Fiducial Marker Tracking
SN - 978-989-758-018-5
AU - Rangoni Y.
AU - Ras E.
PY - 2014
SP - 616
EP - 623
DO - 10.5220/0004820706160623