Anatomical Landmark Tracking by One-shot Learned Priors for Augmented Active Appearance Models
Oliver Mothes, Joachim Denzler
2017
Abstract
For animal bipedal locomotion analysis, an immense amount of recorded image data has to be evaluated by biological experts. During this time-consuming evaluation single anatomical landmarks have to be annotated in each image. In this paper we reduce this effort by automating the annotation with a minimum level of user interaction. Recent approaches, based on Active Appearance Models, are improved by priors based on anatomical knowledge and an online tracking method, requiring only a single labeled frame. However, the limited search space of the online tracker can lead to a template drift in case of severe self-occlusions. In contrast, we propose a one-shot learned tracking-by-detection prior which overcomes the shortcomings of template drifts without increasing the number of training data. We evaluate our approach based on a variety of real-world X-ray locomotion datasets and show that our method outperforms recent state-of-the-art concepts for the task at hand.
References
- Amthor, M., Haase, D., and Denzler, J. (2012). Fast and robust landmark tracking in x-ray locomotion sequences containing severe occlusions. In International Workshop on Vision, Modelling, and Visualization (VMV). Eurographics Association.
- Amthor, M., Haase, D., and Denzler, J. (2014). Robust pictorial structures for x-ray animal skeleton tracking. In International Conference on Computer Vision Theory and Applications (VISAPP). SCITEPRESS.
- Andrada, E., Nyakatura, J. A., Bergmann, F., and Blickhan, R. (2013). Adjustments of global and local hindlimb properties during terrestrial locomotion of the common quail (coturnix coturnix). Journal of Experimental Biology.
- Andriluka, M., Roth, S., and Schiele, B. (2010). Monocular 3d pose estimation and tracking by detection. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on. IEEE.
- Berclaz, J., Fleuret, F., Türetken, E., and Fua, P. (2011). Multiple object tracking using k-shortest paths optimization. Pattern Analysis and Machine Intelligence, IEEE Transactions on.
- Cootes, T., Edwards, G., and Taylor, C. (1998). Active appearance models. In Computer Vision ECCV98. Springer Berlin Heidelberg.
- Cootes, T. F., Edwards, G. J., and Taylor, C. J. (2001). Active appearance models. IEEE Trans. Pattern Anal. Mach. Intell.
- Dalal, N. and Triggs, B. (2005). Histograms of oriented gradients for human detection. In Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on. IEEE.
- Dehghan, A., Modiri Assari, S., and Shah, M. (2015). Gmmcp tracker: Globally optimal generalized maximum multi clique problem for multiple object tracking. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE.
- Felzenszwalb, P. F., Girshick, R. B., McAllester, D., and Ramanan, D. (2010). Object detection with discriminatively trained part-based models. IEEE Transactions on Pattern Analysis and Machine Intelligence.
- Freytag, A., Schadt, A., and Denzler, J. (2015). Interactive image retrieval for biodiversity research. In German Conference on Pattern Recognition (GCPR). Springer.
- Haase, D., Andrada, E., Nyakatura, J. A., Kilbourne, B. M., and Denzler, J. (2013). Automated approximation of center of mass position in x-ray sequences of animal locomotion. Journal of Biomechanics.
- Haase, D. and Denzler, J. (2011). Anatomical landmark tracking for the analysis of animal locomotion in x-ray videos using active appearance models. In Scandinavian Conference on Image Analysis (SCIA). Springer.
- Haase, D. and Denzler, J. (2013). 2d and 3d analysis of animal locomotion from biplanar x-ray videos using augmented active appearance models. EURASIP Journal on Image and Video Processing.
- Haase, D., Nyakatura, J. A., and Denzler, J. (2011). Multiview active appearance models for the x-ray based analysis of avian bipedal locomotion. In Annual Symposium of the German Association for Pattern Recognition (DAGM). Springer.
- Hariharan, B., Malik, J., and Ramanan, D. (2012). Discriminative decorrelation for clustering and classification. In Computer Vision-ECCV 2012. Springer.
- Jiang, X., Haase, D., Körner, M., Bothe, W., and Denzler, J. (2013). Accurate 3d multi-marker tracking in x-ray cardiac sequences using a two-stage graph modeling approach. In Computer Analysis of Images and Patterns, pages 117-125. Springer.
- Kendall, D. G. (1984). Shape manifolds, procrustean metrics, and complex projective spaces. Bulletin of the London Mathematical Society.
- Lelieveldt, B., zmc, M., van der Geest, R., Reiber, J., and Sonka, M. (2003). Multi-view active appearance models for consistent segmentation of multiple standard views: application to long- and short-axis cardiac {MR} images. International Congress Series.
- Li, L., Nawaz, T., and Ferryman, J. (2015). Pets 2015: Datasets and challenge. In Advanced Video and Signal Based Surveillance (AVSS), 2015 12th IEEE International Conference on. IEEE.
- Lowe, D. G. (2004). Distinctive image features from scaleinvariant keypoints. International Journal of Computer Vision.
- Nyakatura, J. A., Andrada, E., Blickhan, R., and Fischer, M. S. (2011). Avian bipedal locomotion. In 5th International Symposium on Adaptive Motion of Animals and Machines (AMAM). Elsevier.
- Sun, Y., Wang, X., and Tang, X. (2013). Deep convolutional network cascade for facial point detection. In Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on. IEEE.
- Zhang, L., Li, Y., and Nevatia, R. (2008). Global data association for multi-object tracking using network flows. In Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on. IEEE.
- Zhou, E., Fan, H., Cao, Z., Jiang, Y., and Yin, Q. (2013). Extensive facial landmark localization with coarse-tofine convolutional network cascade. In Computer Vision Workshops (ICCVW), 2013 IEEE International Conference on. IEEE.
Paper Citation
in Harvard Style
Mothes O. and Denzler J. (2017). Anatomical Landmark Tracking by One-shot Learned Priors for Augmented Active Appearance Models . In Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 6: VISAPP, (VISIGRAPP 2017) ISBN 978-989-758-227-1, pages 246-254. DOI: 10.5220/0006133302460254
in Bibtex Style
@conference{visapp17,
author={Oliver Mothes and Joachim Denzler},
title={Anatomical Landmark Tracking by One-shot Learned Priors for Augmented Active Appearance Models},
booktitle={Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 6: VISAPP, (VISIGRAPP 2017)},
year={2017},
pages={246-254},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006133302460254},
isbn={978-989-758-227-1},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 6: VISAPP, (VISIGRAPP 2017)
TI - Anatomical Landmark Tracking by One-shot Learned Priors for Augmented Active Appearance Models
SN - 978-989-758-227-1
AU - Mothes O.
AU - Denzler J.
PY - 2017
SP - 246
EP - 254
DO - 10.5220/0006133302460254