one, i.e., lowering the clustering threshold with help
of a specific auxiliary algorithm which uses
precomputed distances defined on the current
clusters.) As it was mentioned above, there can be
found some interesting problems focused on
efficient realization of this algorithm variant.
However we will not study these problems in the
article either. Instead of considering such
algorithmic problems we are going to approve
empirically the expediency of using this algorithm in
the considered DOPs.
Thus we have shown a formal description of one
specific clustering algorithm. As it was mentioned in
Introduction the authors suppose that one of its
applications could be the problem of forming the
initial screen when searching in the Internet. In
(Melnikova and Radionov, 2005), two different
possible variants of using this algorithm in such a
problem are given and a preliminary version of
search engine was developed basing on this
approach. However here we are more concerned
about another possible application area of the
algorithm and this area is discrete optimization
problems.
The following question may arise: why do we
need this very algorithm, what makes it better than
the other ones described above? (Another question
may arise why in our case we need to solve
problems “by analogy” though the goal of each
anytime-algorithm is in achieving one pseudo-
optimal solution. A possible answer to this question
is given in Section 5 of this paper.) It is impossible
to give the exact answer, as it always seems in case
of heuristics. But practical programming and the
solution of several DOPs has shown that this
algorithm is the best to illustrate possible locations
of situations. Here we mean the situations both for a
certain DOP and for game-playing programs. We
attempt to focus on situations not only within one
cluster, but also in an order which corresponds to the
possible sequence (M
1
,…M
n
) in the formulae above
and what is more important to the selection of the
situations which are included more than once in into
this sequence (let us call them key situations).
Situations are sorted in the given way for ordered
processing of the situations, for solving the
corresponding subtasks in a similar order, and we
certainly start with the key situations. (We mark in
advance that our vision of the term subtask will bee
defined more exactly in the next section.) By such
situations sorting we achieve successful solving of
problems “by analogy” because the situations which
are close in a “good” metric most likely have similar
solutions. Here we first have in mind the possibility
to make the same step in TB&B. And not just
possibility to make the same step, but heuristic
choice and receiving the same separating element for
such step in two subtasks close in the metric.
So the authors hope that they have given the answer
to the possible question about necessity of using the
described algorithm of clustering in different DOPs.
And as it was mentioned above to do clustering with
help of this algorithm we need to find a metric on
the situations set (subtasks set). The following two
Sections of the paper are dedicated to description of
such metrics for two different DOPs.
3 METRICS FOR SITUATIONS IN
ONE PROBLEM
In this section we give a metric for the problem of
vertex minimization in nondeterministic finite
automata of Rabin-Scott. However the authors leave
this title of the section because in principle similar
heuristics can be applied to quite different discrete
optimization problems.
Let us explain the task setting (for more details
see (Melnikov, 2000) and others). Some matrix is
filled with numbers 0 and 1 (in terms of (Melnikov,
2000), numbers 1 correspond to elements of binary
relation #). Let us use the term Grid for any couple
of subsets of columns and rows of the matrix if those
subsets meet the following two conditions:
• at the intersection of each column with each row
of those subsets 1 is placed;
• we can not add any row or column to those
subsets without breaking the first condition.
In the set of all possible grids we need to find a
subset which includes all 1s of the given matrix.
And to reach the requirement of vertex minimization
in nondeterministic finite automata we should find
that one of the subsets which contains the minimal
number of grids in it. (One description of the
sufficient requirement is given in (Polák, 2004).
Besides in present time the authors of the paper are
preparing a publication with a description of such
algorithm based on description of the set of all
possible arcs of an automation, which is given in
(Melnikov, and Sciarini-Guryanova, 2002). To be
more specific it is based on presence of analogues of
all these arcs in the given automation. However here
we consider only the requirement of vertex
minimization so there is no need to consider arcs and
each grid is a special description of an automation
state.)
We can demonstrate that the problem is not trivial
by showing the following simple example. As usual
ICSOFT 2006 - INTERNATIONAL CONFERENCE ON SOFTWARE AND DATA TECHNOLOGIES
274