Estimation of Overlapped Tactical Actions from Soccer Match Video

Kento Kuroda

, Ikuma Uchida

, Keisuke Fujii

2 a

and Yoshinari Kameda

3 b

Graduate School of Science and Technology, University of Tsukuba, Japan

Graduate School of Informatics, Nagoya University, Japan

Center for Computational Sciences, University of Tsukuba, Japan

Keywords:

Tactical Action, Soccer, Action Estimation.

Abstract:

In a soccer match, there may be overlaps of tactical actions performed at a certain point in time, because two

teams take different tactical actions and each tactical action has no clear temporal boundary. In this study, we

propose a new method for the estimation of overlapped tactical actions from soccer match videos. We enable

the estimation of overlapped tactical actions considering exclusive relationships. We achieve this by having

the deep learning model learn all tactical actions simultaneously. We validate this method using data from 10

matches. We succeeded in expressing the tactical actions performed at a given time in terms of the strength of

several tactical actions.

1 INTRODUCTION

According to Federation Internationale de Foot-

ball Association (FIFA) coaching manual, soccer is

broadly classiﬁed into offensive and defensive ac-

tions (Barnerat et al., 2000). They are further clas-

siﬁed into several tactical actions. In soccer, tactical

actions are actions that are performed by the uniﬁ-

cation of several players to score a goal or defend a

goal. People involved in soccer, such as the play-by-

play announcers and the coaches, can subjectively es-

timate tactical actions that are performed in a speciﬁc

match situation. If we can estimate tactical actions

for a speciﬁc match situation without having to rely

on people involved in soccer, we can treat tactical ac-

tions as objective indicators. This would make it pos-

sible to represent match situations using the strength

of each tactical action performed by the two teams.

It is expected that automatic editing of match videos,

tactical analysis with more information, and describ-

ing match situations will become possible.

Each tactical action can be expressed by the tem-

poral changes of the positional relationships among

22 players and between the ball and the players. There

may be overlaps to tactical actions performed at a cer-

tain point in time. There are two reasons for this. The

ﬁrst reason is that the two teams will each have their

https://orcid.org/0000-0001-5487-4297

https://orcid.org/0000-0001-6776-1267

Figure 1: Overview. We estimate tactical actions of the two

teams. We use the temporal changes of the positional rela-

tionships of the ball and players in tracking data.

tactical actions. The second reason is that the tem-

poral boundaries of tactical actions performed by the

same team are unclear. In addition, overlapped tac-

tical actions have exclusive relationships. In soccer,

some tactical actions are performed simultaneously,

while some are not.

In previous studies, speciﬁc tactical actions, such

as counter-attacks, were recognized (Sigari et al.,

2015; Kobayashi et al., 2012; Fassmeyer et al., 2021;

Bauer and Anzer, 2021). In addition, match situations

were categorized into ﬁve tactical actions (Suzuki

et al., 2020). There are two problems with the pre-

vious studies. The ﬁrst problem is that it does not

take into account the overlapping of tactical actions.

Therefore, the tactical actions that a team performs at

a given time are discretely represented by only one

Kuroda, K., Uchida, I., Fujii, K. and Kameda, Y.

Estimation of Overlapped Tactical Actions from Soccer Match Video.

DOI: 10.5220/0013066300003828

In Proceedings of the 12th International Conference on Sport Sciences Research and Technology Support (icSPORTS 2024), pages 257-264

ISBN: 978-989-758-719-1; ISSN: 2184-3201

257

in the conventional approaches (Suzuki et al., 2020;

Wang et al., 2015). The second problem is that it is

difﬁcult to collect a large amount of data when using

deep learning to recognize tactical actions.

We propose a new method for the estimation of

overlapped tactical actions from soccer match videos.

We enable the estimation of overlapped tactical ac-

tions considering exclusive relationships. We achieve

this by having the deep learning model learn all tac-

tical actions simultaneously. We handcraft a large

number of tactical actions based on simple deﬁni-

tions to estimate tactical actions using deep learning.

We estimate tactical actions in deep learning by fo-

cusing on the temporal changes of the positional re-

lationships among 22 players and between the ball

and the players. We estimate eight types of tacti-

cal actions, including offensive and defensive tacti-

cal actions. The four offensive tactical actions are

long counter, short counter, opposition half posses-

sion, and own half possession. The four defensive

tactical actions are counter-press, high press, middle

press, and block. We validated the estimation of over-

lapped tactical actions considering exclusive relation-

ships by using 10 matches. We succeeded in express-

ing the tactical actions performed at a given time in

terms of the strength of several tactical actions. Fig-

ure 1 shows an overview of this study.

2 RELATED WORKS

2.1 Research on Recognition of Tactical

Actions

This study aims to estimate overlapped tactical ac-

tions. Similar to this study, there are studies that have

investigated tactical action.

There are studies that recognize only speciﬁc tac-

tical actions. First, several methods have been pro-

posed for counter-attack recognition, which is also

the estimated target of this study. Sigari et al. pro-

posed a method to recognize counter-attacks based on

camera motion (Sigari et al., 2015). Kobayashi et al.

proposed a method to recognize counter-attacks us-

ing machine learning based on the position of agents

in the RoboCup (Kobayashi et al., 2012). They pro-

posed a model to recognize counter-attacks based on

the following three characteristics: the collapse of

the opponent’s defense, the attack by a small num-

ber of players, and the movement of the ball into

the opposition half. Dennis et al. used a Varia-

tional Auto-Encoder (VAE) to extract counter-attack

features from unlabeled data. The authors extracted

the characteristics of counter-attacks from unlabeled

data using VAE (Fassmeyer et al., 2021). In addi-

tion, Bauer et al. proposed a method for recogniz-

ing and quantitatively evaluating the tactical action

of counter-press (Bauer and Anzer, 2021). Forcher

et al. proposed a quantitative evaluation method for

defensive tactical actions (Forcher et al., 2022). Al-

though there have been studies on the recognition of

various tactical actions, recognition methods have not

yet been established.

Suzuki et al. proposed a method to classify several

tactical actions of a speciﬁc match situation through

whole match (Suzuki et al., 2020). The classiﬁcation

was performed using deep learning with the opinions

of experts as a teacher signal. However, it is believed

that there are match situations that cannot be repre-

sented because they did not recognize tactical actions

in overlapping. They considered exclusive relation-

ships between the tactical actions performed by the

two teams. They corrected the recognition results to

account for exclusive relationships. Since we focus

on estimating tactical actions in overlapping, we use a

method that takes exclusive relationships into account

when training the deep learning models.

2.2 Research Using the Temporal

Changes of the Ball and Players

Position

With the development and spread of tracking devices,

tracking data has become readily available. This has

led to research using the temporal changes of the ball

and players’ positions.

First, the temporal changes of the ball and players’

position are used to analyze match situations. Lucey

et al. examined a method for quantifying the possibil-

ity of chances by considering defensive positions and

formations obtained from tracking data (Lucey et al.,

2015). Kamiya et al. estimated the changes in match

situations based on variables such as the ball position,

front line position, and compactness (Kamiya et al.,

2017).

In addition, indicators of team characteristics have

been generated using tracking data. Lucey et al. pro-

posed a method for capturing team characteristics

and identifying teams from tracking data alone us-

ing multi-agent plan recognition (Lucey et al., 2012).

Bialkowski et al. proposed a method for analyzing

roles within a team by assigning each player soccer

position (Bialkowski et al., 2014).

In this study, we also use the positions of the ball

and players obtained from the broadcast videos for the

estimation of tactical actions.

icSPORTS 2024 - 12th International Conference on Sport Sciences Research and Technology Support

258

3 ESTIMATION OF

OVERLAPPED TACTICAL

ACTIONS

In this section, we describe a method for estimating

overlapped tactical actions. We describe the details of

the tactical actions we estimate in Section 3.1. We de-

scribe the estimation of each tactical action in Section

3.2. We describe the estimation of overlapped tactical

actions in Section 3.3.

3.1 Description of Tactical Actions

This section describes tactical actions. There are eight

types of tactical actions that we estimate. Four types

of offensive tactical actions are long counter (lc),

short counter (sc), opposition half possession (opp),

and own half possession (own). Four types of defen-

sive tactical actions are counter-press (cp), high press

(hp), middle press (mp), and block (bl). FIFA states

that there are 7 phases of play when the ball is in pos-

session, attacking, and 9 phases of play when the ball

is not in possession, defending (FIFA Training Cen-

tre, 2022). Among the 16 phases of play, 8 represen-

tative ones were selected as the tactical actions to be

estimated.

The overlapped tactical actions have exclusive re-

lationships. There are two types of exclusive rela-

tionships. First, there are tactical actions that can-

not be performed by the two teams at the same time.

Second, there are tactical actions that cannot be per-

formed by the same team at the same time. In soccer,

two teams play with one ball. It is impossible for the

two teams to simultaneously perform a tactical action

when it is in ball possession. It is also impossible

for the same team to perform a tactical action when

it is in ball possession and a tactical action when it is

not in ball possession at the same time. In this study,

the six tactical actions, excluding counter-attacks, are

the tactical actions that cannot be performed by the

two teams at the same time, as shown in the Table 1.

Counter-attacks can overlap the execution times of the

two teams. Long and short counters are tactical ac-

tions that cannot be performed by the same team at

the same time, as shown in the Table 2. Other than

the combination of long and short counters, the exe-

cution times of the two tactical actions can overlap in

the transition.

3.2 Estimation of Each Tactical Action

In this section, we describe the estimation method for

each tactical action. First, we organize the state vari-

ables used in this study. We describe the position of

Table 1: Tactical actions that can be performed simultane-

ously by the two teams. Y represents tactical actions that

can be performed by two teams simultaneously; N repre-

sents tactical actions that cannot be performed by two teams

simultaneously.

A Team

lc sc opp own cp hp mp bl

B Team

lc Y Y Y Y Y Y Y Y

sc Y Y Y Y Y Y Y Y

opp Y Y N Y Y Y Y Y

own Y Y Y N Y Y Y Y

cp Y Y Y Y N Y Y Y

hp Y Y Y Y Y N Y Y

mp Y Y Y Y Y Y N Y

bl Y Y Y Y Y Y Y N

Table 2: Tactical actions that can be performed simultane-

ously by the same team. Y represents tactical actions that a

team can perform simultaneously; N represents tactical ac-

tions that a team cannot perform simultaneously.

X Team

lc sc opp own cp hp mp bl

X Team

lc N Y Y Y Y Y Y

sc N Y Y Y Y Y Y

opp Y Y Y Y Y Y Y

own Y Y Y Y Y Y Y

cp Y Y Y Y Y Y Y

hp Y Y Y Y Y Y Y

mp Y Y Y Y Y Y Y

bl Y Y Y Y Y Y Y

the ball and players and the sequence consisting of

them. Next, we describe the teacher signal given to

the sequence.

The state variables used in this study are the posi-

tions of the ball and players on the ﬁeld. The position

of the ball is given in three-dimensional coordinates

as shown in Fig. 2. The state variable is represented

by the following equation. The position of the player

is given by xy two-dimensional coordinates in Fig. 2.

O = (O

, O

)

⊤

(1)

A[n] = (A[n]

, A[n]

)

⊤

(2)

B[n] = (B[n]

, B[n]

)

⊤

(3)

P = {O

O, A

A[0], . . . , A

A[10], B

B[0], . . . , B

B[10]} (4)

G = {P

, . . . , P

}

⊤

0 ≤ t ≤ T (5)

= {P

t−l

, . . . , P

}

⊤

(6)

The position of the ball at a given time is repre-

sented by Eq. (1). The position of the n-th player of

A team is represented by Eq. (2). The position of the

n-th player of B team is represented by Eq. (3). As

Estimation of Overlapped Tactical Actions from Soccer Match Video

259

Figure 2: Direction of the 3D coordinates of the ﬁeld.

Figure 3: Sorting the positions of the ball and players in the

tracking data by ball, A team keeper to forward, and B team

keeper to forward.

in Eq. (4), P

P contains the positions of the ball, the

players of A team, and the players of B team, in that

order. The positions of the players of each team are

stored in the order of keeper, defender, midﬁelder, and

forward, as shown in Fig. 3. The position P

P at time t

is denoted as P

. A match can be represented by the

Eq. (5) using P

. To represent a speciﬁc match situ-

ation, a match is divided into sequences consisting of

position at each time and past time. We denote by l

the length of past time included in the sequence. The

sequence X

at a given time t can be expressed by Eq.

(6).

Next, we describe the teacher signal. To train a

deep learning model, it is necessary to provide the

teacher signal Y

along with the sequence X

con-

taining the positions of the ball and players. The

teacher signal for a speciﬁc tactical action is denoted

as Y [tacaction]

. The teacher signal indicates whether

the tactical action is being performed at time t. We as-

sign a value of 1 to the teacher signal Y [tac action]

when the tactical action is being performed and 0

when it is not. This is expressed by Eq. (7). We

recognize tactical actions with simple deﬁnitions. We

denote the start time of a tactical action as start, the

end time as end, and the focus time as t. If the focus

time t is between the start time start and the end time

end, the tactical action is considered to be ongoing at

time t.

Y [tac action]



0 (t < start, end < t)

1 (start ≤ t ≤ end)

(7)

Figure 4: Teacher signal. We have the deep learning model

to learn all tactical actions simultaneously.

3.3 Estimation of Overlapped Tactical

Actions Considering Exclusive

Relationships

We discuss the method to estimate overlapped tactical

actions considering exclusive relationships. We have

the deep learning model to learn all tactical actions

simultaneously, as shown in Fig. 4. This allows for

the estimation of overlapped tactical actions. When

we train the deep learning model, we make the teacher

signals of the tactical actions exclusive, which cannot

be performed at the same time. This allows for the

estimation considering exclusive relationships. The

tactical actions that cannot be performed at the same

time are based on Table 1 and 2.

4 IMPLEMENTATION

In this section, we describe how to implement the pro-

posed method in Section 3. We describe the data used

in this study in Section 4.1. We describe how to divide

the data into sequences in Section 4.2. We describe

the teacher signals for each sequence in Section 4.3.

We describe the processes of training the model and

the estimation in Section 4.4.

4.1 Data Format

In this study, two sets of data are used. The ﬁrst is

tracking data provided by SkillCorner. The second is

event data provided by StatsBomb. We describe the

tracking data. The three-dimensional coordinates of

the ball and 22 players on the ﬁeld are obtained at

10 fps. SkillCorner detects the ball and the players

from the broadcast video and estimates their positions

on the ﬁeld. The positions of players not captured in

the broadcast video are predicted using deep learning.

Next, we describe the event data. Event data contains

events such as passes, drives, shots, and goals, along

with the time of occurrence and the position of the

ball at that time. It also includes information such as

whether the ball carrier was under pressure for each

icSPORTS 2024 - 12th International Conference on Sport Sciences Research and Technology Support

260

Figure 5: Sequence segmentation.

event. This information can be used to provide the

teacher signals for tactical actions.

4.2 Sequence Segmentation

We describe the method for sequence segmentation.

As shown in Eq. (6), the data is divided into se-

quences. The length of past time l is set to l = 10.0.

As shown in Fig. 5, the sequence X

at focus time t

includes the position data from t − 10.0 to t. Since

the fps is 10, the sequence contains 100 frames. The

focus time t is slid 0.1 seconds at a time for segmenta-

tion. This allows the estimation results for each frame

to be outputted.

Each frame contains a three-dimensional coordi-

nate indicating the ball position, two-dimensional co-

ordinates indicating the positions of 11 players from

A team, and two-dimensional coordinates indicating

the positions of 11 players from B team, as described

by Eq. (4) and shown in Fig. 3. The players of

each team are arranged in the order of goalkeeper, de-

fender, midﬁelder, and forward. There is no speciﬁc

rule regarding the order of players within the same po-

sition. Additionally, in Fig. 2, A team is set to attack

in the positive direction, while B team is set to attack

in the negative direction.

4.3 Acquiring Teacher Signals

We describe the method for obtaining the teacher sig-

nal. We handcraft a large number of tactical actions

based on simple deﬁnitions. As shown in Fig. 6, the

start and end times of each team’s eight types of tac-

tical actions are recognized from the event data using

simple deﬁnitions. Next, the teacher signal at focus

time t is obtained based on the start and end times of

each tactical action. Finally, we reﬂect exclusive rela-

tionships of the tactical actions in the teacher signal.

First, we describe the recognition of tactical ac-

tions using simple deﬁnitions. We adapt the deﬁni-

tions of team style indicators published by the anal-

ysis company DataStadium. The deﬁnitions of each

Figure 6: Assignment a teacher signal. We recognize start

and end times of each tactical action from event data. We

assign a teacher signal for the tactical action based on the

start and end times.

tactical action are shown in Table 3. We have modi-

ﬁed some of the deﬁnitions to simplify the recognition

from event data. Based on these deﬁnitions, the start

and end times of each tactical action are recognized

from the event data through whole matches.

Next, we provide the teacher signal Y

for each

sequence X

. The teacher signal for A team’s tactical

actions is denoted as Y

Y A

. The teacher signal for B

team’s tactical actions is denoted as Y

Y B

. Since there

are eight types of tactical actions, the teacher signal

for one team consists of 8 values, as shown in Eqs.

(8) and (9). To estimate the tactical actions of both

teams simultaneously, the teacher signal Y

is struc-

tured as shown in Eq. (10). As explained in Section

3.2, the teacher signal is either 0 or 1, as expressed

by Eq. (7). The start and end are the start and end

times recognized from the event data, and t refers to

the focus time.

Finally, to consider exclusive relationships of tac-

tical actions, we make the teacher signals of the tac-

tical actions exclusive, which cannot be performed at

the same time.

Y A

= (YA[lc]

,YA[sc]

,YA[opp]

,YA[own]

YA[cp]

,YA[hp]

,YA[mp]

,YA[bl]

)

(8)

Y B

= (Y B[lc]

,Y B[sc]

,Y B[opp]

,Y B[own]

Y B[cp]

,Y B[hp]

,Y B[mp]

,Y B[bl]

)

(9)

= (Y

Y A

, Y

Y B

) (10)

4.4 Training Model and Estimation

First, we describe the deep learning model used.

Next, we describe the processes of training the model

and the estimation.

The deep learning model used in this study is

Long Short-Term Memory (LSTM). We use LSTM

because it excels in supervised learning of time series

data.

Estimation of Overlapped Tactical Actions from Soccer Match Video

261

Table 3: Deﬁnition of tactical actions.

Tactical Action Deﬁnition

long counter Attacks that penetrate into the attacking third within 15 seconds after winning the ball in the defensive third

short counter Attacks that penetrate into the attacking third within 10 seconds after winning the ball in the middle third or behind the attacking third

opposition half possession Attacks with the ball possession for more than 20 seconds in opposition half

own half possession Attacks with the ball possession for more than 20 seconds in own half

counter-press Defenses that presses an opponent’s ball carrier less than 5 seconds after the ball lost.

high press Defenses that presses continuously against the play of the opponent’s keeper or defenders

middle press Defenses that presses continuously against the play of an opponent’s midﬁelders or forwards

block Defenses that do not press in own half when the opponent do the ball possession

Figure 7: The process of training. We input sequences and

teacher signals into a deep learning model.

Figure 8: The process of estimation. We input a sequence

into a trained model. The trained model outputs a estima-

tion result for each tactical action of the two teams.

As shown in Fig. 7, the combinations of sequence

and teacher signal Y

are input for training the

LSTM model. Next, as shown in Fig. 8, when a se-

quence X

is input to the trained LSTM model, the

estimation result Z

is output. The estimation results

in Z

, similar to the teacher signal, consists of eight

values for each team, as shown in Eqs. (11) and (12).

The estimation result Z

is composed of the estima-

tion results of both teams as shown in Eq. (13). In

certain tactical actions, the closer the estimation re-

sult is to 1, the higher the likelihood that the tactical

action is being performed.

= (ZA[lc]

, ZA[sc]

, ZA[opp]

, ZA[own]

ZA[cp]

, ZA[hp]

, ZA[mp]

, ZA[bl]

)

(11)

= (ZB[lc]

, ZB[sc]

, ZB[opp]

, ZB[own]

ZB[cp]

, ZB[hp]

, ZB[mp]

, ZB[bl]

)

(12)

= (Z

, Z

) (13)

5 EXPERIMENT

In this section, we describe the estimation results of

tactical actions using a trained model. First, we de-

scribe the dataset used for the experiments in Sec-

tion 5.1. In Section 5.2, we present the estimation

results for a particular scene. Finally, in Section 5.3,

we describe the estimation results of tactical actions

through whole matches as an ablation study.

5.1 Experimental Dataset

We describe the dataset used in this experiment. We

use both a training dataset and a test dataset. First, we

describe the training dataset. For the training dataset,

we use tracking data from SkillCorner and event data

from StatsBomb for 40 matches. All 40 matches are

from the La Liga 2023-2024 season. From these 40

matches, we prepare 2500 sequences for each team’s

tactical actions, except for counter-pressing, which

has 2000 sequences. A total of 39,000 sequences and

their corresponding teacher signals are used to train

the LSTM model. Next, we describe the test dataset.

We prepare a test dataset of 10 matches, all from the

La Liga 2023-2024 season, but not included in the

training dataset. After training the LSTM model with

the training dataset, we input the test dataset into the

trained LSTM model to verify the estimation results.

Finally, we describe a particular scene that we evalu-

ate qualitatively. We use the scene with overlapped

tactical actions. The scene is a one-minute period

from 43:30 to 44:30 in the second half of the Real

Madrid vs. Barcelona match. The scene is illustrated

using images that reﬂect the positions of the ball and

the players on the ﬁeld of the CG environment based

on the tracking data, as shown in Figure 9. In this

scene, the players are switching between offense and

defense, and overlapped tactical actions are occurring

in between.

5.2 Estimation Results of Overlapped

Tactical Actions

In this section, we describe the estimation results of

the situation with overlapped tactical actions using

the trained LSTM model. From this experiment, we

conﬁrmed that the trained model can estimate over-

lapped tactical actions considering exclusive relation-

ships. As the results show, we succeeded in express-

ing the tactical actions performed at a given time in

icSPORTS 2024 - 12th International Conference on Sport Sciences Research and Technology Support

262

Figure 9: The scene with overlapped tactical actions. The

scene is a one-minute period from 43:30 to 44:30 in the

second half of the Real Madrid vs. Barcelona match

Figure 10: Estimation results of tactical actions of Real

Madrid.

Figure 11: Estimation results of tactical actions of

Barcelona.

terms of the strength of several tactical actions. Cur-

rently, the dataset we handcrafted does not represent

the strength of tactical actions as time-varying. To

verify the performance for continuous estimation, we

need to expand the dataset for this purpose in the fu-

ture.

Figure 10 shows the estimation results for Real

Madrid’s tactical actions. Figure 11 shows the esti-

mation results for Barcelona’s tactical actions. The

trained model estimated that Real Madrid was attack-

ing with opposition half possession from 43:30 to

43:50. At the same time, it estimated a high pos-

sibility that Barcelona was performing long counter

and middle press. The estimation results changed as

the attack and defense roles reversed at around 43:50.

The model estimated a high possibility that Real

Madrid was performing block and long counter while

Barcelona was performing short counter and oppo-

sition half possession. In the actual match, around

44:30, Real Madrid regained the ball and attempted

a counter-attack, but it was stopped by a foul. The

trained model estimated this as a long counter. The

model did not estimate the likely to be performed two

tactical actions that cannot be performed at the same

Figure 12: Estimation results for the ﬁrst 45 minutes of the

10 games. This graph shows the tactical actions estimated

as most likely to be performed by each team at each time.

time based on Table 1 and 2. We believe that the esti-

mation of overlapped tactical actions considering ex-

clusive relationships was achieved.

5.3 Ablation Study

In this section, we describe the estimation results of

tactical actions through whole matches using a test

dataset of 10 matches. From this experiment, we con-

ﬁrmed the validity of the estimation results. Addition-

ally, we demonstrated the potential to represent match

situations through the tactical actions performed by

the two teams. Figure 12 shows the tactical actions

that were estimated as most likely to be performed by

each team at each time. We present the estimation

results for the ﬁrst 45 minutes of each match. Tac-

tical actions are estimated only in situations where

play continues for more than 10 seconds, as each se-

quence requires 10 seconds of position data. Look-

ing at Fig. 12 as a whole, the team with the superior

score often performs offensive tactical actions most

of the time. Conversely, inferior teams often perform

defensive tactical actions most of the time. By com-

paring the last three matches, we can conﬁrm the cor-

relation between estimation results and the objective

match content using Barcelona as an example. In the

match against Real Betis, Barcelona had an advantage

in terms of score. The estimation results show a domi-

Estimation of Overlapped Tactical Actions from Soccer Match Video

263

nance of possession attacks. In the match against Real

Madrid, the team performed defensive tactical actions

most of the time, indicating a struggle against a strong

opponent. In the match against Getafe, although of-

fensive tactical actions were predominant, the score

was 0-0. The estimation results predict that Barcelona

struggled to convert their attacks into goals. In fact,

Barcelona’s expected goals (xG) in this match were

signiﬁcantly higher than Getafe’s, but they failed to

score. From these observations, we conﬁrmed that

the estimation results align with the objective match

content. Furthermore, by representing match situa-

tions using the combination of tactical actions of the

two teams, we demonstrated the potential to predict

match situations.

6 CONCLUSION

In this study, we propose a new method for estimating

overlapped tactical actions from soccer match videos.

We enable the estimation of overlapped tactical ac-

tions considering exclusive relationships. We achieve

this by having the deep learning model learn all tacti-

cal actions simultaneously. We handcraft a large num-

ber of tactical actions based on simple deﬁnitions to

estimate tactical actions using deep learning. We esti-

mate tactical actions in deep learning by focusing on

the temporal changes of the positional relationships

among 22 players and between the ball and the play-

ers.

We validated the estimation of overlapped tactical

actions considering exclusive relationships by using

10 matches. We succeeded in expressing the tacti-

cal actions performed at a given time in terms of the

strength of several tactical actions. In the ablation

study, we conﬁrmed the validity of the estimation re-

sults.

ACKNOWLEDGEMENTS

Part of this research is supported by JSPS KAKENHI

Grand Number 21H03476 and 21H05300.

REFERENCES

Barnerat, T., Crevoisier, J., Hoek, F., Redon, P., and

Ritschard, M. (2000). FIFA Coaching. Druckerei

Feldegg AG.

Bauer, P. and Anzer, G. (2021). Data-driven detection of

counterpressing in professional football: A supervised

machine learning task based on synchronized posi-

tional and event data with expert-based feature extrac-

tion. Data Mining and Knowledge Discovery, 35.

Bialkowski, A., Lucey, P., Carr, P., Yue, Y., Sridharan, S.,

and Matthews, I. (2014). Large-scale analysis of soc-

cer matches using spatiotemporal tracking data. In

2014 IEEE International Conference on Data Mining,

pages 725–730.

Fassmeyer, D., Anzer, G., Bauer, P., and Brefeld, U. (2021).

Toward automatically labeling situations in soccer.

Frontiers in Sports and Active Living, 3.

FIFA Training Centre (2022). EFI Explanation

Documents (PDF) — ﬁfatrainingcentre.com.

https://www.ﬁfatrainingcentre.com/en/fwc2022/

eﬁ-metrics/eﬁ-metrics-pdfs.php. [accessed on 13th

July 2024].

Forcher, L., Altmann, S., Forcher, L., Jekauc, D., and

Kempe, M. (2022). The use of player tracking data to

analyze defensive play in professional soccer - a scop-

ing review. International journal of Sports Science &

Coaching, 17:174795412210757.

Kamiya, K., Nakanishi, W., and Izumi, Y. (2017). Track-

ing data to extract changes in football game situation.

Proceedings of the Institute of Statistical Mathemat-

ics, 65.

Kobayashi, Y., Kawamura, H., and Suzuki, K. (2012).

Counter attack detection with machine learning from

log ﬁles of robocup simulation. In The 6th Interna-

tional Conference on Soft Computing and Intelligent

Systems, and The 13th International Symposium on

Advanced Intelligence Systems, pages 1821–1826.

Lucey, P., Bialkowski, A., Carr, P., Foote, E., and Matthews,

I. (2012). Characterizing multi-agent team behavior

from partial team tracings: Evidence from the english

premier league. In The AAAI Conference on Artiﬁcial

Intelligence, pages 1387–1393.

Lucey, P., Bialkowski, A., Monfort, M., Carr, P., and

Matthews, I. (2015). ”quality vs quantity”: Improved

shot prediction in soccer using strategic features from

spatiotemporal data. In The 8th Annual MIT Sloan

Sports Analytics Conference, pages 1–9.

Sigari, M.-H., Soltanian-Zadeh, H., Kiani, V., and Pour-

reza, A.-R. (2015). Counterattack detection in broad-

cast soccer videos using camera motion estimation. In

2015 The International Symposium on Artiﬁcial Intel-

ligence and Signal Processing (AISP), pages 101–106.

Suzuki, G., Takahashi, S., Ogawa, T., and Haseyama, M.

(2020). An estimation method of candidate region

for superimposing information based on gaze track-

ing data in soccer videos. In 2020 IEEE Interna-

tional Conference on Consumer Electronics - Taiwan

(ICCE-Taiwan), pages 1–2.

Wang, Q., Zhu, H., Hu, W., Shen, Z., and Yao, Y. (2015).

Discerning tactical patterns for professional soccer

teams: an enhanced topic model with applications. In

The 21th ACM SIGKDD International Conference on

Knowledge Discovery and Data Mining, pages 2197–

2206.

icSPORTS 2024 - 12th International Conference on Sport Sciences Research and Technology Support

264