Principal Component Analysis (PCA) in maintaining
neighborhoods of high-dimensional individuals.
A tool called ELICIT was developed by Cruz and
Machado (Cruz et al., 2015) to enable the visual
exploration of evolutionary computation algorithms.
Two levels of view is provided by ELICIT, namely
General View to cover the whole population, and In-
dividual View to cover a particular individual. For an
individual, both genotype and phenotype can be visu-
alized. However, this tool lacks in providing enough
statistical information beside the visualizations.
In a recent effort, He and Yen (He and Yen, 2016)
proposed a new method to visualize the population of
Many-Objective Evolutionary Algorithms (MaOEAs)
in high-dimensional objective space. They claimed
that their proposed method maps individuals from a
high-dimensional objective space into a 2D polar co-
ordinate graph while preserving Pareto dominance re-
lationship, retaining shape and location of the Pareto
front, and maintaining distribution of individuals. Al-
though effective in visualizing the high-dimensional
objective space, this tool only provides a single view
into the EA population which might not be sufficient
for gaining a comprehensive insight.
A screenshot from each of the reviewed works is
presented in Fig. 1. Although all these works have
their own strengths and weaknesses which some are
already mentioned, each of them has at least one of
the following limitations:
• Poor level of user-interactivity.
• Expert knowledge required on the context in or-
der to digest the visualization result, which makes
it unsuitable for users with less knowledge in evo-
lutionary computation.
In contrast, by taking into consideration the above
limitations, the tool proposed in this paper provides
the user with a high level of interactivity in a 3-D en-
vironment to move inside and in between views with
different levels of granularity. However, it is notewor-
thy that the 3-D environment is merely used to orga-
nize the information space, and the third dimension
itself contains no information.
3 VISUALIZATION APPROACH
3.1 Overview
The data produced by the evolutionary process of GA
including all the individuals genotypes and their ob-
jective values will be given to the visualizer as in-
put. The process pipeline includes a clustering algo-
rithm to perform a (global) clustering across all gen-
erations of a GA run based on the distribution (simi-
larity) of individuals in parameter space. Then, clus-
ters will pass through a symbol mapping process, to
be described in subsections to follow. Finally, all the
clusters and their mapped symbols will be passed to
an interactive visualization interface. Since the infor-
mation to be visualized is over multiple generations,
clusters and sub-clusters, the visualization interface
contains square-walls to ease the organization, parti-
tioning and positioning of this information. Symbols
to be used in the elaboration are listed below:
• N: total number of individuals in all generations
of a GA run
• M: number of generations
• K: number of clusters
• I = {i
n
| 1 < n <= N } is the set of all individuals
• C
k
, 1 < k <= K,C
k
⊆ I is a cluster of individuals
across all generations
• C = {C
1
,C
2
,C
3
, ...,C
K
} is the set of all clusters
• c
km
, 1 < k <= K, 1 < m <= M, c
km
⊆ C
k
is a part
of cluster C
k
in generation m (sub-cluster)
Fig. 2 gives a bird’s-eye view of the visualization
interface where the whole population of N individuals
are placed on a tower, with each column representing
a cluster, each row representing a generation and each
square-wall representing a sub-cluster.
There is a control panel at right side of the inter-
face (Fig. 2), which not only provides some statistics
of clustering and information about the individuals,
but also enables the user to choose between three op-
tions from different families of clustering algorithms,
set their associated parameters and re-run the cluster-
ing (i.e. restart the whole process pipeline). The op-
tions for clustering algorithm are as follows:
• Centroid-based: k-means
• Density-based: DBSCAN
• Connectivity-based: Hierarchical agglomerative
The user is able to move the camera in six direc-
tions to have a look from an arbitrary angle. More-
over, different walls, floors and towers (multiple tow-
ers in case of multi-population visualization) can be
chosen by a mouse click to get the related statistics
in the control panel. As shown in Fig. 2, the wall on
down-left corner of the tower is currently being ac-
tivated. In fact, blue color indicates the active wall
(sub-cluster) and red border indicates the active floor
(generation). The ”Next” and ”Previous” buttons in
the control panel can be used to navigate through in-
dividuals of the active wall to see their gene values,
A Clustering-based Visual Analysis Tool for Genetic Algorithm
235