p = p = p
q
1
1
L
3
2
1
6
5
4
9
8
7
q
3
2
3
p
q
q 0=
2
Figure 1: Formation described by curvilinear coordinates p
and q.
direction of q is defined in the left half plane in direc-
tion of the forward movement.
The shape of the formation is then uniquely deter-
mined by the parameters p
i
(t) and q
i
(t), defined for
each follower i, which can vary during the mission.
To convert the state of the followers in curvilinear co-
ordinates to the state in rectangular coordinates, the
simple equations in (Barfoot and Clark, 2004) can be
applied.
The main idea of the receding horizon control is to
solve a moving finite horizon optimal control problem
for a system starting from current states or configura-
tion ψ(t
0
) over the time interval [t
0
,t
f
] under a set of
constraints on the system states and control inputs. In
this framework, the length t
f
− t
0
of the time interval
[t
0
,t
f
] is known as the control horizon. After a so-
lution from the optimization problem is obtained on
a control horizon, a portion of the computed control
actions is applied on the interval [t
0
,δt
n
+ t
0
], known
as the receding step, where δt
n
:= ∆tn.
3
This pro-
cess is then repeated on the interval [t
0
+ δt
n
,t
f
+ δt
n
]
as the finite horizon moves by time steps defined by
the sampling time δt
n
, yielding a state feedback con-
trol scheme strategy. Advantages of the receding time
horizon control scheme become evident in terms of
adaptation to unknown events and change of strategy
depending on new goals or new events such as appear-
ing obstacles in the environment.
3 METHOD DESCRIPTION
In this section, a concept of complementary virtual
leaders, which enables backward driving of forma-
tions, will be proposed. The basic idea of the classical
leader-follower concept (see Figure 1) is based on the
fact that the followers continue with forward move-
ment until the place where the leader changed the po-
larity of its velocity. Such a behavior could cause col-
lisions or unacceptable disordered motion leading to
3
Number of applied constant control inputs n is chosen
according to computational demands as was explained in
(Saska et al., 2009).
a breakage of the shape of formation. Natural con-
ception of the formation movement supposes that the
entire group keeps compact shape and so all mem-
bers should change the polarity of their velocity in the
same moment.
Such an approach requires an extension of the
standard method employing one leader to an approach
with two virtual leaders, one for the forward move-
ment and one for the backward movement. Their
leading role is switched always when the sign of the
leader’s velocity is changed. The suspended virtual
leader becomes temporarily a virtual follower. The
virtual follower traces the virtual leader similarly as
the other followers to be able to undertake its leading
duties at time of the next switching. Therefore, all
robots in our system will be considered as followers
and there is no physical leader. The virtual leaders
will be positioned at the axis of the formation, one in
front of the formation and one behind the formation.
In the presented approach, we propose to solve
collision free trajectory planning and optimal control
together for both virtual leaders in one optimization
step. This ensures integrity of the solution for the
separated plants. Beyond this, we extend the stan-
dard RHC method with one control horizon into an
approach utilizing two finite time intervals T
N
and T
M
.
The first time interval T
N
should provide immediate
control inputs for the formation regarding the local
environment. By applying this portion of the control
sequence, the group is be able to respond to changes
in workspace that can be dynamic or newly detected
static obstacles. The difference ∆t(k+1) = t
k+1
−t
k
is
kept constant (later denoted only ∆t) in this time inter-
valand should satisfy the requirementsof the classical
receding horizon control scheme.
The second interval T
M
takes into account infor-
mation about the global characteristics of the environ-
ment to navigate the formation to the goal and to au-
tomatically compose the entire maneuver containing
usually multiple switching between the virtual lead-
ers. Here, we should highlight that also the number
of switchings between the virtual leaders is designed
automatically via the optimization process. The tran-
sition points in this part can be distributed irregularly
to effectively cover the environment. During the op-
timization process, more points should be automat-
ically allocated in the regions where a complicated
maneuver of the formation is needed. This is enabled
due to the varying values of ∆t(k+ 1) = t
k+1
− t
k
that
will be for the compact description collected into the
vector T
∆
L,M
= {∆t(N + 1), ...,∆t(N + M)}.
To define the trajectory planning problem with
two time intervals in a compact form we need to
gather states ψ
L
(k),k ∈ {1,...,N} and ψ
L
(k),k ∈