Figure 4: Recognition success rate for each item.
brary (Rusu and Cousins, 2011). In order to avoid
influence of performance of the method to generate
object hypothesis, we applied common model match-
ing method, the VPM.
In order to decide whether the recognition is suc-
cess or not, we evaluated the F measure calculated
by comparing the recognized object region and the
ground truth. If the F measure exceeding 0.5, then
we decided recognition is success.
Figure 5 shows the recognition performance of
each method. Both method are used the VPM for gen-
erating object hypotheses, so the results depended on
the algorithm of the verification. Average recognition
rate of the proposed HV method and the previous HV
method are 52.8% and 25.8 %, respectively. It have
been confirmed that the reliability of recognition is
higher than the previous HV method.
Recognition rate of the ID 2, 12, 19, and 25 are
relatively lower than the other method. These items
are thin compared with others, so the area of appear-
ance in the input scene was small. As a result, these
items are not recognized by the VPM.
5 CONCLUSION
In this research, we have proposed the method to en-
hance the reliability of the Hypothesis Verification
(HV) method that simultaneously recognizes layout
of multiple objects. The proposed method have em-
ployed not only the RGB-D consistency between the
input scene and the scene hypothesis but also the
physical consistency. By considering the physical
consistency of the scene hypothesis, the proposed HV
method can efficiently reject false one. In addition,
the method have applied a reliable model matching
method, the VPM. As for future work, we will de-
velop the method to recognize thin objects.
ACKNOWLEDGEMENTS
This work was partially supported by Grant-in-Aid
for Scientific Research (C) 26420398.
REFERENCES
Akizuki, S. and Hashimoto, M. (2015a). A proposal
of the global reference frame for surface flatness-
independent 3d object detection. In Proc. Joint Con-
ference of IWAIT and IFMIA.
Akizuki, S. and Hashimoto, M. (2015b). Stable position and
pose estimation of industrial parts using evaluation of
observability of 3d vector pairs. 27(2):174–181.
Aldoma, A., Tombari, F., di Stefano, L., and Vincze, M.
(2012a). A global hypotheses verification method for
3d object recognition. In Computer Vision - ECCV
2012 - 12th European Conference on Computer Vi-
sion, pages 511–524.
Aldoma, A., Tombari, F., Prankl, J., Richtsfeld, A., di Ste-
fano, L., and Vincze, M. (2013). Multimodal cue in-
tegration through hypotheses verification for RGB-D
object recognition and 6dof pose estimation. In IEEE
International Conference on Robotics and Automa-
tion, pages 2104–2111.
Aldoma, A., Tombari, F., Rusu, R. B., and Vincze, M.
(2012b). OUR-CVFH - oriented, unique and repeat-
able clustered viewpoint feature histogram for ob-
ject recognition and 6dof pose estimation. In Pattern
Recognition - Joint 34th DAGM and 36th OAGM Sym-
posium, pages 113–122.
Chen, H. and Bhanu, B. (2007). 3d free-form object recog-
nition in range images using local surface patches.
28(10):1252–1262.
Drost, B., Ulrich, M., Navab, N., and Ilic, S. (2010). Model
globally, match locally: Efficient and robust 3d object
recognition. In The Twenty-Third IEEE Conference
on Computer Vision and Pattern Recognition CVPR,
pages 998–1005.
Fuji, T., Kimura, N., and Ito, K. (2015). Architecture for
recognizing stacked box objects for automated ware-
housing robot system. In Proceedings of the 17th
Irish Machine Vision and Image Processing confer-
ence, pages 50–56.
Gottschalk, S., Lin, M. C., and Manocha, D. (1996). Obb-
tree: A hierarchical structure for rapid interference de-
tection. In SIGGRAPH, pages 171–180.
Hashimoto, M., Sumi, K., and Usami, T. (1999). Recogni-
tion of multiple objects based on global image consis-
tency. In Proceedings of the British Machine Vision
Conference, pages 1–10.
Rusu, R. B. and Cousins, S. (2011). 3d is here: Point cloud
library (PCL). In IEEE International Conference on
Robotics and Automation, ICRA. IEEE.
Tombari, F., Salti, S., and di Stefano, L. (2010). Unique
signatures of histograms for local surface description.
In European Conference on Computer Vision ECCV,
pages 356–369.
Tombari, F. and Stefano, L. D. (2010). Object recognition in
3d scenes with occlusions and clutter by hough voting.
In Proc. Fourth Pacific-Rim Symposium on Image and
Video Technology, pages 349–355.