Deep Manifold Embedding for 3D Object Pose Estimation

Hiroshi Ninomiya, Yasutomo Kawanishi, Daisuke Deguchi, Ichiro Ide, Hiroshi Murase, Norimasa Kobori, Yusuke Nakano

2017

Abstract

Recently, 3D object pose estimation is being focused. The Parametric Eigenspace method is known as one of the fundamental methods for this. It represents the appearance change of an object caused by pose change with a manifold embedded in a low-dimensional subspace. It obtains features by Principal Component Analysis (PCA), which maximizes the appearance variation. However, there is a problem that it cannot handle a pose change with slight appearance change since there is not always a correlation between pose change and appearance change. In this paper, we propose a method that introduces “Deep Manifold Embedding” which maximizes the pose variation directly. We construct a manifold from features extracted from Deep Convolutional Neural Networks (DCNNs) trained with pose information. Pose estimation with the proposed method achieved the best accuracy in experiments using a public dataset.

References

  1. Broekens, J., Heerink, M., and Rosendal, H. (2009). Assistive social robots in elderly care: A review. Gerontechnology, 8(2):94-103.
  2. Chin, R. T. and Dyer, C. R. (1986). Model-based recognition in robot vision. ACM Computing Surveys, 18(1):67-108.
  3. Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., and Fei-Fei, L. (2009). Imagenet: A large-scale hierarchical image database. In Proc. 22nd IEEE Computer Society Conf. on Computer Vision and Pattern Recognition, pages 248-255.
  4. Donahue, J., Jia, Y., Vinyals, O., Hoffman, J., Zhang, N., Tzeng, E., and Darrell, T. (2013). DeCAF: A deep convolutional activation feature for generic visual recognition. arXiv preprint arXiv:1310.1531.
  5. Hinton, G. E., Srivastava, N., Krizhevsky, A., Sutskever, I., and Salakhutdinov, R. R. (2012). Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580.
  6. Krizhevsky, A., Sutskever, I., and Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. In Pereira, F., Burges, C. J. C., Bottou, L., and Weinberger, K. Q., editors, Advances in Neural Information Processing Systems 25, pages 1097- 1105. Curran Associates, Inc.
  7. Murase, H. and Nayar, S. K. (1995). Visual learning and recognition of 3-d objects from appearance. Int. J. Comput. Vision, 14(1):5-24.
  8. Nair, V. and Hinton, G. E. (2010). Rectified linear units improve restricted Boltzmann machines. In Furnkranz, J. and Joachims, T., editors, Proc. 27th Int. Conf. on Machine Learning, pages 807-814. Omnipress.
  9. Nene, S. A., Nayar, S. K., and Murase, H. (1996). Columbia object image library (COIL-20). Technical report, CUCS-005-96, Department of Computer Science, Columbia University.
  10. Razavian, A. S., Azizpour, H., Sullivan, J., and Carlsson, S. (2014). CNN features off-the-shelf: An astounding baseline for recognition. In Proc. 27th IEEE Conf. on Computer Vision and Pattern Recognition Workshops, pages 512-519.
  11. Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus, R., and LeCun, Y. (2013). OverFeat: Integrated recognition, localization and detection using convolutional networks. arXiv preprint arXiv:1312.6229.
Download


Paper Citation


in Harvard Style

Ninomiya H., Kawanishi Y., Deguchi D., Ide I., Murase H., Kobori N. and Nakano Y. (2017). Deep Manifold Embedding for 3D Object Pose Estimation . In Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: VISAPP, (VISIGRAPP 2017) ISBN 978-989-758-226-4, pages 173-178. DOI: 10.5220/0006101201730178


in Bibtex Style

@conference{visapp17,
author={Hiroshi Ninomiya and Yasutomo Kawanishi and Daisuke Deguchi and Ichiro Ide and Hiroshi Murase and Norimasa Kobori and Yusuke Nakano},
title={Deep Manifold Embedding for 3D Object Pose Estimation},
booktitle={Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: VISAPP, (VISIGRAPP 2017)},
year={2017},
pages={173-178},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006101201730178},
isbn={978-989-758-226-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: VISAPP, (VISIGRAPP 2017)
TI - Deep Manifold Embedding for 3D Object Pose Estimation
SN - 978-989-758-226-4
AU - Ninomiya H.
AU - Kawanishi Y.
AU - Deguchi D.
AU - Ide I.
AU - Murase H.
AU - Kobori N.
AU - Nakano Y.
PY - 2017
SP - 173
EP - 178
DO - 10.5220/0006101201730178