Estimating Coarse 3D Shape and Pose from the Bounding Contour
Paria Mehrani, James H. Elder
2017
Abstract
Single-view reconstruction of a smooth 3D object is an ill-posed problem. Surface cues such as shading and texture provide local constraints on shape, but these cues can be weak, making it a challenge to recover globally correct models. The bounding contour can play an important role in constraining this global integration. Here we focus in particular on information afforded by the overall elongation (aspect ratio) of the bounding contour. We hypothesize that the tendency of objects to be relatively compact and the generic view assumption together induce a statistical dependency between the observed elongation of the object boundary and the coarse 3D shape of the solid object, a dependency that could potentially be exploited by single-view methods. To test this hypothesis, we assemble a new dataset of solid 3D shapes and study the joint statistics of ellipsoidal approximations to these shapes and elliptical approximations of their orthographically projected boundaries. Optimal estimators derived from these statistics confirm our hypothesis, and we show that these estimators can be used to generate coarse 3D shape-pose estimates from the bounding contour that are significantly and substantially superior to competing methods.
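The geometric relationship behind the elongation cue can be illustrated with a short sketch. It is not the estimator described in the paper, only the forward model under stated assumptions: the solid is an ellipsoid with semi-axes (a, b, c), the view is an orthographic projection along the z axis, and the pose is a uniformly random rotation. The function names (`random_rotation`, `projected_semi_axes`, `elongation`) are hypothetical and chosen for illustration. Under these assumptions the projected boundary is an ellipse whose squared semi-axes are the eigenvalues of the top-left 2x2 block of R diag(a², b², c²) Rᵀ.

```python
import math
import random


def random_rotation(rng):
    """Uniformly random 3D rotation matrix built from a normalised Gaussian quaternion."""
    w, x, y, z = (rng.gauss(0.0, 1.0) for _ in range(4))
    n = math.sqrt(w * w + x * x + y * y + z * z)
    w, x, y, z = w / n, x / n, y / n, z / n
    return [
        [1 - 2 * (y * y + z * z), 2 * (x * y - w * z), 2 * (x * z + w * y)],
        [2 * (x * y + w * z), 1 - 2 * (x * x + z * z), 2 * (y * z - w * x)],
        [2 * (x * z - w * y), 2 * (y * z + w * x), 1 - 2 * (x * x + y * y)],
    ]


def projected_semi_axes(semi_axes, rotation):
    """(major, minor) semi-axes of the ellipse seen under orthographic projection along z."""
    sq = [s * s for s in semi_axes]
    # Entries of the top-left 2x2 block of R diag(a^2, b^2, c^2) R^T.
    s11 = sum(rotation[0][k] ** 2 * sq[k] for k in range(3))
    s22 = sum(rotation[1][k] ** 2 * sq[k] for k in range(3))
    s12 = sum(rotation[0][k] * rotation[1][k] * sq[k] for k in range(3))
    # Closed-form eigenvalues of a symmetric 2x2 matrix.
    mean = 0.5 * (s11 + s22)
    radius = math.hypot(0.5 * (s11 - s22), s12)
    return math.sqrt(mean + radius), math.sqrt(max(mean - radius, 0.0))


def elongation(semi_axes, rotation):
    """Aspect ratio (major / minor) of the projected bounding ellipse."""
    major, minor = projected_semi_axes(semi_axes, rotation)
    return major / minor


if __name__ == "__main__":
    rng = random.Random(0)
    solid = (3.0, 2.0, 1.0)  # illustrative semi-axes, not taken from the paper's dataset
    samples = [elongation(solid, random_rotation(rng)) for _ in range(10000)]
    print("mean observed elongation:", sum(samples) / len(samples))
    print("min / max observed elongation:", min(samples), max(samples))
```

Sampling many poses of a fixed solid in this way gives the distribution of observed elongations for that solid; repeating it over a population of solids yields the kind of joint statistics between 3D shape and 2D elongation that the abstract refers to.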
Paper Citation
in Harvard Style
Mehrani P. and Elder J. (2017). Estimating Coarse 3D Shape and Pose from the Bounding Contour. In Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, (VISIGRAPP 2017), ISBN 978-989-758-225-7, pages 603-610. DOI: 10.5220/0006190306030610
in Bibtex Style
@conference{visapp17,
author={Paria Mehrani and James H. Elder},
title={Estimating Coarse 3D Shape and Pose from the Bounding Contour},
booktitle={Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, (VISIGRAPP 2017)},
year={2017},
pages={603-610},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006190306030610},
isbn={978-989-758-225-7},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, (VISIGRAPP 2017)
TI - Estimating Coarse 3D Shape and Pose from the Bounding Contour
SN - 978-989-758-225-7
AU - Mehrani P.
AU - Elder J.
PY - 2017
SP - 603
EP - 610
DO - 10.5220/0006190306030610