Figure 5: Identity switches due to exiting the frame.
5 CONCLUSIONS
This paper deals with the tracking of young handball
players during handball practice. The goal was to
detect and track all players on the handball court so
that the performance of a particular athlete and the
adoption of a particular technique can be analyzed.
This is a very demanding task of multiple object
tracking since players move fast, often change
direction, and are very often occluded and out of the
camera field view. For detection of player the
YOLOv3 detector was used and DeepSORT
algorithm for player tracking. The results were
evaluated on custom dataset that contains handball
videos with marked player ID-s. The performances of
the algorithm were tested according to common
multiple object tracking measures: IDF1, IDP, IDR,
MOTA, MOTP. The results of MOTA and MOTP are
excellent but not relevant because the same detector
was used for ground truth detections and in tracking.
Due to the relatively large number of players on the
field that are often occluded, and the demanding
scenario, players were correctly identified 24.7% of
the time, according to the IDF1 measure.
A detailed analysis of the results showed that the
scale of an object, occlusion, swapping IDs, and the
similar color of the players’ clothes with the
background, many times appear as problems. Those
issues are challenging even for people familiar with
players and the rule of the game, so in the future, we
will consider different methods to focus monitoring
only on players who are active, perform a given
action, or are carriers of the game.
Also, we will consider defining an appropriate
multiplayer tracking metric that would appropriately
evaluate those elements of athlete tracking that are
relevant to the task of monitoring and analyzing
athlete activity, and performing a particular action.
ACKNOWLEDGMENTS
This research was fully supported by the Croatian
Science Foundation under the project IP-2016-06-
8345 “Automatic recognition of actions and activities
in multimedia content from the sports domain”
(RAASS) and by the University of Rijeka under the
project number uniri-drustv-18-222.
REFERENCES
Bernardin, K., & Stiefelhagen, R. (2008). Evaluating
multiple object tracking performance: the CLEAR
MOT metrics. Journal on Image and Video Processing,
2008, 1.
Bewley, A., Ge, Z., Ott, L., Ramos, F., & Upcroft, B. (2016,
September). Simple online and realtime tracking. In
2016 IEEE International Conference on Image
Processing (ICIP) (pp. 3464-3468). IEEE.
Burić, M., Ivašić-Kos, M. & Pobar, M. (2019) (in press)
Player Tracking in Sports Videos. In 19th IEEE
International Conference on Computer and
Information Technology (CIT 2019)
Burić, M., Pobar, M., & Ivašić-Kos, M. (2019, January).
Adapting YOLO network for ball and player detection.
In Proceedings of the 8th International Conference on
Pattern Recognition Applications and Methods
(ICPRAM 2019) (pp. 845-851).
Burić, M., Pobar, M., & Ivašić-Kos, M. (2018, January).
Ball detection using YOLO and Mask R-CNN. In 2018
International Conference on Computational Science
and Computational Intelligence (CSCI).
Ciaparrone, G., Sánchez, F. L., Tabik, S., Troiano, L.,
Tagliaferri, R., & Herrera, F. (2019). Deep Learning in
Video Multi-Object Tracking: A Survey. arXiv preprint
arXiv:1907.12740.
Dicle, C., Camps, O. I., & Sznaier, M. (2013). The way they
move: Tracking multiple targets with similar
appearance. In Proceedings of the IEEE international
conference on computer vision (pp. 2304-2311).
Girshick, R. (2015). Fast r-CNN. In Proceedings of the
IEEE international conference on computer vision (pp.
1440-1448).
He, K., Gkioxari, G., Dollár, P., & Girshick, R. (2017).
Mask r-CNN. In Proceedings of the IEEE international
conference on computer vision (pp. 2961-2969).
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S.,
Fu, C. Y., & Berg, A. C. (2016, October). SSD: Single
shot multibox detector. In European conference on
computer vision (pp. 21-37). Springer, Cham.
Ivasic-Kos, M., Pobar, M. (2018). “Building a labeled
dataset for recognition of handball actions using mask
R-CNN and STIPS,” in 7th IEEE European Workshop