5 CONCLUSIONS
We have presented a 3D model based algorithm for
6D object pose tracking in sequence of RGB images.
The object has been segmented using U-Net neural
network. The 6D object pose estimation has been
performed by particle filter combined with particle
swarm optimization algorithm. Owing to clustering
particles representing the probability distribution in
the PF, the extracted modes were processed by PSO
to represent them by a few representative particles in
the refined probability distribution. In future work we
are going to apply this algorithm for object manipula-
tion by Franka Emika. The initialization of the track-
ing will be done on the basis of pose regression neural
networks.
ACKNOWLEDGEMENTS
This work was supported by Polish National
Science Center (NCN) under a research grant
2017/27/B/ST6/01743.
REFERENCES
Arthur, D. and Vassilvitskii, S. (2007). K-means++: The
Advantages of Careful Seeding. In Proc. ACM-SIAM
Symp. on Discrete Algorithms, pages 1027–1035.
Brachmann, E., Krull, A., Michel, F., Gumhold, S., Shotton,
J., and Rother, C. (2014). Learning 6D object pose es-
timation using 3D object coordinates. In ECCV, pages
536–551. Springer.
Brachmann, E., Michel, F., Krull, A., Yang, M., Gumhold,
S., and Rother, C. (2016). Uncertainty-Driven 6D
Pose Estimation of Objects and Scenes from a Single
RGB Image. In IEEE Conf. on Computer Vision and
Pattern Recognition (CVPR), pages 3364–3372.
Bradski, G. and Kaehler, A. (2013). Learning OpenCV:
Computer Vision in C++ with the OpenCV Library.
O’Reilly Media, Inc., 2nd edition.
Deng, X., Mousavian, A., Xiang, Y., Xia, F., Bretl, T., and
Fox, D. (2019). PoseRBPF: A Rao-Blackwellized
Particle Filter for 6D Object Pose Tracking. In
Robotics: Science and Systems (RSS).
Doucet, A., Godsill, S., and Andrieu, C. (2000). On Se-
quential Monte Carlo Sampling Methods for Bayesian
Filtering. Statistics and Computing, 10(3):197–208.
Fischler, M. A. and Bolles, R. C. (1981). Random Sample
Consensus: A Paradigm for Model Fitting with Ap-
plications to Image Analysis and Automated Cartog-
raphy. Commun. ACM, 24(6):381–395.
Hinterstoisser, S., Holzer, S., Cagniart, C., Ilic, S., Kono-
lige, K., Navab, N., and Lepetit, V. (2011). Multi-
modal templates for real-time detection of texture-less
objects in heavily cluttered scenes. In Int. Conf. on
Computer Vision, pages 858–865.
Hinterstoisser, S., Lepetit, V., Ilic, S., Holzer, S., Bradski,
G., Konolige, K., and Navab, N. (2013). Model based
training, detection and pose estimation of texture-less
3D objects in heavily cluttered scenes. In Computer
Vision – ACCV 2012, pages 548–562. Springer.
Ioffe, S. and Szegedy, C. (2015). Batch normalization: Ac-
celerating deep network training by reducing internal
covariate shift. In ICML - vol. 37, pages 448–456.
Izadi, S., Kim, D., Hilliges, O., Molyneaux, D., Newcombe,
R., Kohli, P., Shotton, J., Hodges, S., Freeman, D.,
Davison, A., and Fitzgibbon, A. (2011). KinectFu-
sion: Real-time 3D Reconstruction and Interaction
Using a Moving Depth Camera. In Proc. ACM Symp.
on User Interface Soft. and Techn., pages 559–568.
Kennedy, J. and Eberhart, R. (1995). Particle Swarm Opti-
mization. In Proc. of IEEE Int. Conf. on Neural Net-
works, pages 1942–1948. IEEE Press, Piscataway, NJ.
Lepetit, V., Pilet, J., and Fua, P. (2004). Point matching as a
classification problem for fast and robust object pose
estimation. In Proc. IEEE Conf. on Computer Vision
and Pattern Recognition, volume 2, pages II–II.
Marchand, E., Uchiyama, H., and Spindler, F. (2016).
Pose estimation for augmented reality: A hands-on
survey. IEEE Trans. on Vis. and Comp. Graphics,
22(12):2633–2651.
Rad, M. and Lepetit, V. (2017). BB8: A scalable, accurate,
robust to partial occlusion method for predicting the
3D poses of challenging objects without using depth.
In IEEE Int. Conf. on Comp. Vision, pages 3848–3856.
Ronneberger, O., Fischer, P., and Brox, T. (2015). U-
Net: Convolutional networks for biomedical image
segmentation. In MICCAI, pages 234–241. Springer.
Rymut, B., Kwolek, B., and Krzeszowski, T. (2013). GPU-
accelerated human motion tracking using particle fil-
ter combined with PSO. In Adv. Concepts Intell. Vi-
sion Syst., LNCS, vol. 8192, pages 426–437. Springer.
Sengupta, S., Basak, S., and Peters, R. A. (2019). Parti-
cle Swarm Optimization: A survey of historical and
recent developments with hybridization perspectives.
Machine Learning and Knowl. Extr., 1(1):157–191.
Ugolotti, R., Nashed, Y. S. G., Mesejo, P., Ivekovi
ˇ
c,
v., Mussi, L., and Cagnoni, S. (2013). Particle
Swarm Optimization and Differential Evolution for
Model-based Object Detection. Appl. Soft Comput.,
13(6):3092–3105.
Vidal, J., Lin, C., and Mart
´
ı, R. (2018). 6d pose estima-
tion using an improved method based on point pair
features. In Int. Conf. on Control, Automation and
Robotics (ICCAR), pages 405–409.
Xiang, Y., Schmidt, T., Narayanan, V., and Fox, D. (2018).
PoseCNN: A Convolutional Neural Network for 6D
Object Pose Estimation in Cluttered Scenes. In
Robotics: Science and Systems XIV (RSS).
Zabulis, X., Lourakis, M. I., and Stefanou, S. S. (2014).
3D pose refinement using rendering and texture-based
matching. In Computer Vision and Graphics, pages
672–679. Springer.
3D Model-based 6D Object Pose Tracking on RGB Images using Particle Filtering and Heuristic Optimization
697