Cui, Y., Chang, W., N
¨
oll, T., and Stricker, D. (2012). Kinec-
tavatar: Fully automatic body capture using a single
kinect. In Computer Vision - ACCV 2012 Workshops,
pages 133–147, Berlin, Heidelberg. Springer Berlin
Heidelberg.
Dou, M., Khamis, S., Degtyarev, Y., Davidson, P., Fanello,
S. R., Kowdle, A., Escolano, S. O., Rhemann, C.,
Kim, D., Taylor, J., Kohli, P., Tankovich, V., and
Izadi, S. (2016). Fusion4d: Real-time performance
capture of challenging scenes. ACM Trans. Graph.,
35(4):114:1–114:13.
Geman, S. and McClure, D. (1987). Statistical methods
for tomographic image reconstruction. Bulletin of the
International Statistical Institute, 52:5–21.
Guo, K., Xu, F., Wang, Y., Liu, Y., and Dai, Q. (2015). Ro-
bust non-rigid motion tracking and surface reconstruc-
tion using l0 regularization. In ICCV, pages 3083–
3091.
Huang, Y., Bogo, F., Lassner, C., Kanazawa, A., Gehler,
P. V., Romero, J., Akhter, I., and Black, M. J. (2017).
Towards accurate marker-less human shape and pose
estimation over time. In 3DV.
Innmann, M., Zollh
¨
ofer, M., Nießner, M., Theobalt, C., and
Stamminger, M. (2016). Volumedeform: Real-time
volumetric non-rigid reconstruction. In ECCV.
Ionescu, C., Papava, D., Olaru, V., and Sminchisescu, C.
(2014). Human3.6m: Large scale datasets and pre-
dictive methods for 3d human sensing in natural envi-
ronments. IEEE Trans. Pattern Analysis and Machine
Intelligence, 36(7):1325–1339.
Izadi, S., Kim, D., Hilliges, O., Molyneaux, D., Newcombe,
R., Kohli, P., Shotton, J., Hodges, S., Freeman, D.,
Davison, A., and Fitzgibbon, A. (2011). Kinectfu-
sion: Real-time 3d reconstruction and interaction us-
ing a moving depth camera. In Proceedings of the 24th
Annual ACM Symposium on User Interface Software
and Technology, pages 559–568.
Kanazawa, A., Black, M. J., Jacobs, D. W., and Malik,
J. (2018). End-to-end recovery of human shape and
pose. In CVPR.
Leroy, V., Franco, J.-S., and Boyer, E. (2017). Multi-View
Dynamic Shape Refinement Using Local Temporal
Integration. In ICCV.
Li, H., Adams, B., Guibas, L. J., and Pauly, M. (2009). Ro-
bust single-view geometry and motion reconstruction.
ACM Trans. Graph., pages 175:1–175:10.
Loper, M., Mahmood, N., and Black, M. J. (2014). Mosh:
Motion and shape capture from sparse markers. ACM
Trans. Graph., 33(6):220:1–220:13.
Loper, M., Mahmood, N., Romero, J., Pons-Moll, G., and
Black, M. J. (2015). Smpl: A skinned multi-person
linear model. ACM Trans. Graph., 34(6):248:1–
248:16.
Loper, M. M. and Black, M. J. (2014). OpenDR: An ap-
proximate differentiable renderer. In ECCV.
Mehta, D., Sridhar, S., Sotnychenko, O., Rhodin, H.,
Shafiei, M., Seidel, H.-P., Xu, W., Casas, D., and
Theobalt, C. (2017). Vnect: Real-time 3d human pose
estimation with a single rgb camera. ACM Transac-
tions on Graphics, 36(4).
Newcombe, R. A., Fox, D., and Seitz, S. M. (2015). Dy-
namicfusion: Reconstruction and tracking of non-
rigid scenes in real-time. In CVPR.
Pons-Moll, G., Romero, J., Mahmood, N., and Black, M. J.
(2015). Dyna: A model of dynamic human shape in
motion. ACM Trans. Graph., 34(4):120:1–120:14.
Slavcheva, M., Baust, M., Cremers, D., and Ilic, S. (2017).
KillingFusion: Non-rigid 3D Reconstruction without
Correspondences. In CVPR.
Varol, G., Romero, J., Martin, X., Mahmood, N., Black,
M. J., Laptev, I., and Schmid, C. (2017). Learning
from Synthetic Humans. In CVPR.
Wei, S.-E., Ramakrishna, V., Kanade, T., and Sheikh, Y.
(2016). Convolutional pose machines. In CVPR.
Weiss, A., Hirshberg, D., and Black, M. J. (2011). Home
3D body scans from noisy image and range data. In
ICCV.
Xu, W., Chatterjee, A., Zollh
¨
ofer, M., Rhodin, H., Mehta,
D., Seidel, H., and Theobalt, C. (2017). Monoperfcap:
Human performance capture from monocular video.
CoRR, abs/1708.02136.
Yu, T., Guo, K., Xu, F., Dong, Y., Su, Z., Zhao, J., Li, J.,
Dai, Q., and Liu, Y. (2017). Bodyfusion: Real-time
capture of human motion and surface geometry using
a single depth camera. In ICCV.
Zhang, Q., Fu, B., Ye, M., and Yang, R. (2014). Quality
dynamic human body modeling using a single low-
cost depth camera. In CVPR, pages 676–683.
Zollhofer, M., Niessner, M., Izadi, S., Rehmann, C., Zach,
C., Fisher, M., Wu, C., Fitzgibbon, A., Loop, C.,
Theobalt, C., and Stamminger, M. (2014). Real-time
non-rigid reconstruction using an rgb-d camera. ACM
Trans. Graph., 33:156:1–156:12.
Template based Human Pose and Shape Estimation from a Single RGB-D Image
581