advanced decision-making capabilities, thereby
improving the realism and practicality of military
simulations. This work seeks to combine intent
recognition with behavior modeling in military
simulation platform, and thus allow a decision-
making agent.
Figure 1 shows the framework of this paper. The
circle consisting of four modules formulates a
cognition loop(Xu et al., 2019). Specifically, the
TankSim V1.20 firstly feed the observation into EBT
Models, which is a behavior model that we generate
using the Evolutionary Algorithm. The EBT models
guides blue tank actions, making them more closely
resemble real-world military forces(Jie Yang, 2021).
Then the blue tank makes real-time decisions. Finally,
the red tank makes decisions based on actions of blue
tank using the SBR algorithm(Avrahami-Zilberbrand
& Kaminka, 2005).
The rest of this paper is arranged as follows:
Section 2 introduces related work. Section 3
introduces the algorithms we used. Section 4
introduces the experiments and results of this work.
Section 5 introduces conclusions and future work.
2 RELATED WORK
2.1 Intent Recognition Algorithms
Since the formal definition of planning recognition
process was proposed in 1978(Schmidt et al., 1978),
scholars have started to study methods for plan
recognition. A method of generalized planning
recognition was proposed in 1986(Kautz & Allen,
1986), which describes task plan recognition with a
planning graph, represents the decomposition of the
task with the vertices of the graph, and proposes to
use the graph overlay for the solution of the problem,
which to some extent laid the foundation for
subsequent research. This method, although efficient,
assumes that the top-level goal of plan is unique and
does not consider the different priori probabilities of
different goals.
The Symbolic Plan Recognition (SBR) method
was proposed in 2005 (Avrahami-Zilberbrand &
Kaminka, 2005), which efficiently implements plan
recognition using tagging and back propagating, and
can quickly give partial solutions thus applying to
multiple aspects, but the efficiency decreases when
multiple plans are run concurrently. Further, the
authors proposed Utility-based Plan Recognition
(UPR)(Avrahami-Zilberbrand & Kaminka, 2007),
which can recognize multiple plans in overlapping
and interleaved contexts. The SBR family of
algorithms runs efficiently and can produce results in
each time and is usually used as a frequent choice of
recognition method by researchers.
A probabilistic planning recognition algorithm
based on planning tree grammar was proposed in
2009(Geib, 2009), which regarded plan recognition
as the parsing of the grammar tree, which effectively
solved the plan recognition in the case of multiple
concurrent plans, but it needs to construct a complete
parsing set. In the same period, a Planning
Recognition as Planning (PRaP) approach was
proposed in 2009(Ramírez & Geffner, 2009), which
used planning techniques to solve the goal
recognition problem by comparing the marginal cost,
which is the difference between the consistency of a
given observation and the best plan, between different
plans for the same goal through multiple invocations
of the AI planning system. This method is limited by
the fact that it can only reason about one plan at a time
and is computationally expensive.
Since then, attention has been paid to improving
the performance of the recognizer. A cost-based goal
recognition method that improves the speed of the
recognizer compared to PRaP, but is limited to the
path planning domain(Masters & Sardina, 2017). A
method of sampling the parse space using Monte-
Carlo tree search can significantly improves the speed
of solution compared to full parse (Kantharaju et al.,
2019).
2.2 Behavior Modelling Algorithms
Currently, commonly used behavior modeling
methods include, but are not limited to, the behavior
tree (BT), the finite state machines (FSM), and the
dynamic script (DS).
A rule-based behavior decision-making algorithm
for unidirectional two-channels by combining fuzzy
inference with a finite state machine was proposed in
2023(WANG Liang et al., 2023). A vehicle-level
expected functional safety hazard recognition method
based on a model of finite state machine was
proposed in 2023(XIONG Lu et al., 2023). Due to the
special structural characteristics, finite state machines
can only save a finite number of steps of state transfer,
so it is difficult for the FSM system to monitor the
historical execution flow of the transfer from the
initial state to the final state, and vice versa.
Dynamic Scripts (DS) is a reinforcement learning
technique based on rule scripts proposed in
2006(Spronck et al., 2006). In 2015, an Evolution
Dynamic Script (EDS), which embeds an evolution
approach to discovering new rules during DS learning