Detection of Players on a Soccer Team
based on Informed Filters using Only Color Features
Takuro Oki¹ and Ryusuke Miyamoto²
¹ Department of Fundamental Science and Technology, Graduate School of Science and Technology, Meiji University, Kanagawa 214-8571, Japan
² Department of Computer Science, School of Science and Technology, Meiji University, Kanagawa 214-8571, Japan
1 INTRODUCTION
Semantic analysis of sports videos has become an active
research topic. Player detection on the field is a
particularly important technique because it underlies
applications that are essential for semantic event detection
and tactical analysis, such as calculating the distance
covered by a player during a soccer match. Tracab
(ChyronHego, 2003) is one of the best-known systems for
visualizing statistics of players' performance during a
match. However, the detection and tracking systems currently
used for Tracab are very large and expensive, so they are
found only at large stadiums. Many major teams require this
system when they play a match.
To solve this problem, we use a simple monocular camera and
have developed a highly accurate soccer player detection
method that uses only color features (Miyamoto and Oki,
2016). The method is based on a simple sliding window
algorithm and uses neither background subtraction nor
inter-frame differences, because these techniques are not
appropriate for moving cameras and our system must operate
properly on aerial photographs taken by drones.
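As a rough illustration of the sliding window idea, the following minimal Python sketch scans a single frame and keeps the windows whose classifier score exceeds a threshold. The window size, stride, threshold, and the `score_fn` callback are all hypothetical placeholders and are not taken from the original method; the point is only that each window is scored from the current frame alone, with no background model and no previous frame.

```python
from typing import Callable, Iterator, List, Tuple

Window = Tuple[int, int, int, int]  # (x, y, width, height)


def sliding_windows(img_w: int, img_h: int,
                    win_w: int, win_h: int,
                    stride: int) -> Iterator[Window]:
    """Yield every window position that fits fully inside the image."""
    for y in range(0, img_h - win_h + 1, stride):
        for x in range(0, img_w - win_w + 1, stride):
            yield (x, y, win_w, win_h)


def detect(img_w: int, img_h: int,
           score_fn: Callable[[Window], float],
           win_w: int = 16, win_h: int = 32,
           stride: int = 4, threshold: float = 0.0) -> List[Window]:
    """Score each window independently of any other frame.

    score_fn is a hypothetical stand-in for a trained classifier
    that maps a window to a confidence score.
    """
    return [w for w in sliding_windows(img_w, img_h, win_w, win_h, stride)
            if score_fn(w) > threshold]
```

A practical detector would typically also scan several scales and merge overlapping detections; both steps are omitted here for brevity.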
In our previous work (Miyamoto and Oki, 2016), we tried to
find all humans on the field, including coaches and
referees. However, for team tactics and player activity
analysis, it is more useful to detect only the players who
belong to a certain team. Therefore, in this paper, we
improve the previous method (Miyamoto and Oki, 2016) so that
soccer players can be accurately detected according to
their teams.
2 INFORMED HAAR-LIKE FEATURES
Informed Haar-like features (Zhang et al., 2014) enable
accurate human detection by representing the object boundary
properly. They have two distinctive points: a well-designed
feature pool for constructing a classifier, and computation
of features using binary and ternary template models. A
binary model computes feature values using two types of
rectangles, with coefficients −1 and +1. Its basic idea is
the same as that of the Haar-like features proposed by Viola
and Jones (2001). A ternary model is used to represent more
complex geometric configurations than the binary model and
has three types of rectangles, with weights −1, +1, and 0.
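To make the binary and ternary template models concrete, the sketch below computes a template response as a weighted sum of rectangle sums over one feature channel. It assumes an integral image for the rectangle sums, which is a common implementation choice and is not stated in the text above; the function names and the `(x, y, w, h, weight)` rectangle layout are hypothetical, and any normalization used in the original features is omitted.

```python
from typing import List, Sequence, Tuple

Rect = Tuple[int, int, int, int, int]  # (x, y, width, height, weight)


def integral_image(channel: Sequence[Sequence[float]]) -> List[List[float]]:
    """Return a summed-area table with one extra row and column of zeros."""
    h, w = len(channel), len(channel[0])
    ii = [[0.0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0.0
        for x in range(w):
            row_sum += channel[y][x]
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii


def rect_sum(ii: List[List[float]], x: int, y: int, w: int, h: int) -> float:
    """Sum of the channel values inside the rectangle, via four lookups."""
    return ii[y + h][x + w] - ii[y][x + w] - ii[y + h][x] + ii[y][x]


def template_response(ii: List[List[float]], rects: Sequence[Rect]) -> float:
    """Weighted sum of rectangle sums; weights are -1, 0, or +1.

    A binary template uses only -1 and +1; a ternary template may
    also contain 0-weighted rectangles, which contribute nothing.
    """
    return sum(wt * rect_sum(ii, x, y, w, h)
               for x, y, w, h, wt in rects if wt != 0)


# Binary template: left half weighted -1, right half weighted +1.
binary_ii = integral_image([[1, 1, 5, 5],
                            [1, 1, 5, 5]])
binary_value = template_response(binary_ii, [(0, 0, 2, 2, -1), (2, 0, 2, 2, +1)])

# Ternary template: the middle column carries weight 0 and is ignored.
ternary_ii = integral_image([[1, 9, 5],
                             [1, 9, 5]])
ternary_value = template_response(
    ternary_ii, [(0, 0, 1, 2, -1), (1, 0, 1, 2, 0), (2, 0, 1, 2, +1)])
```

In the binary example the response is 20 − 4 = 16. In the ternary example the bright middle column is ignored and the response is 10 − 2 = 8, which shows how a 0-weighted region lets a template skip part of the window.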
Filtered Channel Features (Zhang et al., 2015), which use a
feature pool that includes more complex templates, have
achieved state-of-the-art accuracy for human detection and
outperformed recent schemes based on deep learning.
3 DETECTION OF PLAYERS ON A SOCCER TEAM BASED ON INFORMED FILTERS USING ONLY COLOR FEATURES
Our previous proposal (Miyamoto and Oki, 2016) can
accurately detect people using only color features, provided
that they are appropriately selected, and does not use
histograms of oriented gradients. This method targets all
people shown in the image without considering which team
they belong to. However, to obtain more useful information
for tactical analysis, players belonging to a certain team
need to be detected. Thus, we construct a classifier that
detects the players on a certain team. To construct the
classifier, we generate training samples and define samples
that include a player of the target team as positive and all
others as negative. In addition, goalkeepers are excluded
from the detection targets. To calculate feature values, we
generate templates and use only color features, as in our
previous work (Miyamoto and Oki, 2016).
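The labeling rule for training samples can be sketched as follows. The `Annotation` record, its field names, and the team labels are hypothetical and serve only as an illustration. The sketch also assumes that a goalkeeper of the target team is treated as a negative sample; the text only says that goalkeepers are excluded from the detection targets, so ignoring such samples altogether would be another possible reading.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height)


@dataclass(frozen=True)
class Annotation:
    """Hypothetical ground-truth record for one person in a frame."""
    box: Box
    team: str                   # e.g. "A", "B", or "referee" (illustrative)
    is_goalkeeper: bool = False


def split_samples(annotations: Sequence[Annotation],
                  target_team: str) -> Tuple[List[Box], List[Box]]:
    """Split annotated boxes into positive and negative training samples.

    Positive: outfield players of the target team.
    Negative: everyone else, including (by assumption) goalkeepers.
    """
    positives: List[Box] = []
    negatives: List[Box] = []
    for a in annotations:
        if a.team == target_team and not a.is_goalkeeper:
            positives.append(a.box)
        else:
            negatives.append(a.box)
    return positives, negatives


example = [
    Annotation((10, 20, 16, 32), "A"),
    Annotation((50, 22, 16, 32), "B"),
    Annotation((5, 40, 16, 32), "A", is_goalkeeper=True),
    Annotation((80, 30, 16, 32), "referee"),
]
pos, neg = split_samples(example, target_team="A")
```

With this example, only the first box is a positive sample for team "A"; the opposing player, the goalkeeper, and the referee are all negative. Training one classifier per team then amounts to calling `split_samples` once for each target team.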