Authors:
Murilo Couceiro
1
;
2
;
Inês Rito Lima
2
;
Alexandre Ulisses
2
;
Tiago Mendes-Neves
1
;
3
and
João Mendes-Moreira
3
Affiliations:
1
Faculdade de Engenharia, Universidade do Porto, Porto, Portugal
;
2
MOG Technologies, Department of Innovation, Maia, Portugal
;
3
LIAAD - INESC TEC, Porto, Portugal
Keyword(s):
Metadata Enhancement, Player Tracking, Computer Vision, Sports Broadcasting.
Abstract:
The broadcast of audio-video sports content is a field with increasingly larger audiences demanding higher quality content and involvement. This growth creates the necessity to develop more content to engage the users and keep this trend. Otherwise, it may stall or even diminish. Therefore, enhancing the user experience, engagement, and involvement during live sports event broadcasts is of utmost importance. This paper proposes a solution to extract event’s information from video, resorting to Computer Vision techniques and Deep Learning algorithms. More specifically, the project encompassed the definition and implementation of field registration, object detection and tracking tasks. Focusing on football sports events, a novel dataset combining several video sources was created and used for analysis and metadata extraction. In particular, the proposed solution can detect and track players with acceptable precision using state-of-the-art methods, like YOLOv5 and DeepSORT. Furthermore,
resorting to unsupervised learning techniques, the system provides team segmentation based on the colour of the players’ kits. A series of visual representations regarding the players’ movements on the field enables broadcast enrichment and increased user experience. The presented solution is framed in the H2020 DataCloud project and will be deployed in a cloud environment simplifying its access and utilisation.
(More)