various scenes. In particular our approach produces
quite clear edges with no noisy small segments in
their proximity, an issue happening with other ap-
proaches on some scenes. Foreground objects are also
clearly extracted and the background region is cor-
rectly handled on most scenes. However some small
issues are present in the corridor and bed scenes (rows
2 and 3). In particular the blanket of the bed scene
(row 3) is quite critical for our approach since the
color data is very noisy and the orientation of the nor-
mals on the rough surface is very unstable.
8 CONCLUSIONS
In this paper we have introduced a novel scheme for
the joint segmentation of color and depth informa-
tion. The proposed approach exploits together spatial
constraints, surface orientation information and color
data to improve the segmentation performances. The
regions of the initial over-segmentation are merged
by exploiting a surface fitting scheme that allows to
determine if the regions candidate for merging cor-
respond to the same 3D surface. Experimental re-
sults demonstrate the effectiveness of this scheme
and its ability to recognize the objects in the scene.
Performances on real data acquired with the Kinect
show that the proposed method is able to outperform
state-of-the-art approaches in most situations. Fur-
ther research will be devoted to the combination of
the proposed approach with a recursive region split-
ting scheme. Furthermore, an advanced scheme for
the automatic balancing of the various clues relevance
will be developed. Finally, since our region merging
algorithm is highly vectorizable, parallel computing
implementations will be considered.
REFERENCES
Arbelaez, P., Maire, M., Fowlkes, C., and Malik, J. (2011).
Contour detection and hierarchical image segmenta-
tion. Pattern Analysis and Machine Intelligence, IEEE
Transactions on, 33(5):898–916.
Bleiweiss, A. and Werman, M. (2009). Fusing time-of-
flight depth and color for real-time segmentation and
tracking. In Proc. of DAGM Workshop, pages 58–69.
Dal Mutto, C., Zanuttigh, P., and Cortelazzo, G. (2011).
Scene segmentation assisted by stereo vision. In Pro-
ceedings of 3DIMPVT 2011, Hangzhou, China.
Dal Mutto, C., Zanuttigh, P., and Cortelazzo, G. (2012a).
Fusion of geometry and color information for scene
segmentation. IEEE Journal of Selected Topics in Sig-
nal Processing, 6(5):505–521.
Dal Mutto, C., Zanuttigh, P., and Cortelazzo, G. M.
(2012b). Time-of-Flight Cameras and Microsoft
Kinect. SpringerBriefs. Springer.
Erdogan, C., Paluri, M., and Dellaert, F. (2012). Planar
segmentation of rgbd images using fast linear fitting
and markov chain monte carlo. In Proc. of CRV.
Felzenszwalb, P. and Huttenlocher, D. (2004). Efficient
graph-based image segmentation. International Jour-
nal of Computer Vision, 59(2):167–181.
Fowlkes, C., Belongie, S., Chung, F., and Malik, J. (2004).
Spectral grouping using the nystr
¨
om method. IEEE
Transactions on Pattern Analysis and Machine Intel-
ligence, 26(2):214–225.
Gupta, S., Arbel
´
aez, P., Girshick, R., and Malik, J.
(2014). Indoor scene understanding with rgb-d im-
ages: Bottom-up segmentation, object detection and
semantic segmentation. International Journal of Com-
puter Vision, pages 1–17.
Gupta, S., Arbelaez, P., and Malik, J. (2013). Perceptual
organization and recognition of indoor scenes from
RGB-D images. In Proceedings of CVPR.
Hasnat, M. A., Alata, O., and Trmeau, A. (2014). Unsuper-
vised rgb-d image segmentation using joint clustering
and region merging. In Proceedings of BMVC.
Pagnutti, G. and Zanuttigh, P. (2014). Scene segmentation
from depth and color data driven by surface fitting. In
IEEE International Conference on Image Processing
(ICIP), pages 4407–4411. IEEE.
Pagnutti, G. and Zanuttigh, P. (2015). Scene segmentation
based on nurbs surface fitting metrics. In In proc. of
STAG Workshop.
Piegl, L. and Tiller, W. (1997). The NURBS Book (2Nd Ed.).
Springer-Verlag, Inc., New York, USA.
Ren, X., Bo, L., and Fox, D. (2012). Rgb-(d) scene labeling:
Features and algorithms. In Proc. of CVPR.
Shi, J. and Malik, J. (2000). Normalized cuts and image
segmentation. IEEE Transactions on Pattern Analysis
and Machine Intelligence, 22(8):888–905.
Silberman, N., Hoiem, D., Kohli, P., and Fergus, R. (2012).
Indoor segmentation and support inference from rgbd
images. In Proceedings of ECCV.
Srinivasan, N. and Dellaert, F. (2014). A rao-blackwellized
mcmc algorithm for recovering piecewise planar 3d
model from multiple view rgbd images. In IEEE In-
ternational Conference on Image Processing (ICIP).
Taylor, C. J. and Cowley, A. (2013). Parsing indoor scenes
using rgb-d imagery. In Robotics: Science and Sys-
tems, volume 8, pages 401–408.
Wallenberg, M., Felsberg, M., Forss
´
en, P.-E., and Dellen,
B. (2011). Channel coding for joint colour and depth
segmentation. In Proc. of DAGM, volume 6835, pages
306–315.
VISAPP 2016 - International Conference on Computer Vision Theory and Applications
100