− We propose a method to cluster the road sections
based upon the network density statistics. Unlike
some existing work, this clustering takes into
account the orientation of the trajectory. Besides,
this method utilizes the network topology to create
relevant clusters.
− We propose a model to assess the evolution for
dense route pairs at two consecutive time intervals.
− We propose a graph conveying the evolution as a
mean to describe the information in a synthetic
manner and to question the evolution of the density
through the whole network.
The rest of the paper is structured as follows. We
describe a few preliminary concepts in section 2. In
section 3, we present the first step concerning the
clustering of road sections. The second step related
to the evolution graph construction is presented in
section 4. In section 5, we present the result of our
experimental study. Finally, section 6 concludes this
paper and sketches some future orientations.
2 PRELIMINARIES
The representation of the network is given by the set
of road sections. The road section is represented
through a graph NG (N, S). S is the set of directed
segments, where each one represents the smallest
unit of road section. N is the set of nodes, where
each one represents a road junction.
Besides, knowing the set of trajectories, we
compute a matrix of transitions for the road network
at each time interval. This tells how many times the
junction have been taken for each turning movement
(i.e. between each pair of adjacent sections), by
reporting the number of moving objects going from
one section to another at each time interval. This
matrix is denoted M and M(i,j) represents the
number of moving objects passing through S
i
to
section S
j
within the interval It
n
(n ∈ {1,…,k}, k
stands for the number of time intervals). We also
denote S
ij
the transition (or turning movement) from
S
i
to S
j
.
We adopt a symbolic representation of the
trajectories as in (Du Mouza C. and P. Rigaux,
2004), (Wan T., K. Zeitouni, 2005). In this model, a
moving object trajectory tr is described by an
identifier (tid) and a sequence of symbols where
each one refers to a road section (S
i
), followed by a
temporal identifier (t
i
) referring the time of entry of
the trajectory tid to S
i
:
tr = (tid , <(S
i1
t
j1
), (S
i2
t
j2
), …, (S
ik
t
jk
)>) with S
in
∈ S
The order of symbols in the sequence above shows
the movement direction.
Concerning the similarity measure adopted in
this work, we define the similarity (Trans_sim) at
the level of the network for two adjacent transitions
S
ij
and S
jk
as the difference of their density values:
While the similarity between nonadjacent
transitions is null:
Trans_sim (S
ij
, S
uv
) = 0 if i≠v and j≠u
(2)
We define another similarity measure between dense
routes (Route_sim). It allows comparing the dense
route. Two routes are considered similar (with a
similarity equal to 1) if they share at least one road
section that corresponds to two successive time
intervals. Otherwise, their similarity is null.
3 SECTION CLUSTERING
We call our proposed algorithm NETSCAN. It
carries out the clustering of dense sections and
incorporates them by forming dense routes. It is
inspired from the density based clustering principle
introduced with DBSCAN algorithm (Ester et al.,
1996), while applying it to road sections. It takes as
input the set of sections that constitute the road
network, the spatiotemporal transitions matrix
associated with each time interval, a density
threshold α and a similarity threshold ε between the
transition densities. NETSCAN finds firstly the
dense transitions, i.e. those having maximum value.
Afterwards, for each dense transition, it groups the
connected segments and transitions that have similar
densities, thus creating dense routes.
The process begins with the transition having the
maximal density. Then, it begins searching the
connected transitions in both ways in order to find
those with a density ε near to the maximal one. To
insure the non reuse of transitions that are included
in dense routes, they are marked at the first
assignment.
The extension of a dense route is done in both
ways if the constraints are verified, i.e., the
candidate transition is only marked if it respects the
α and ε thresholds. The obtained segment clusters
correspond to the densest routes in the network. This
procedure is performed again for each time interval.
The dense routes are represented as a sequence of
segments, the same as with the trajectories. Each
segment is identified by an associated symbol.
Trans_sim (S
ij
, S
uv
)= |M(i,j) – M(j,k)| (1)
KDIR 2009 - International Conference on Knowledge Discovery and Information Retrieval
320