alyze the range of outcomes (uncertainty) and find
anomalies and outliers within the samples for qual-
ity control purposes. Some of the expected results
are the following. Defects floating in the liquid steel
may have an ascending force, like that of bubbles in
sparkling water. As a result, since the border of the
steel slab solidifies first when it meets a lower ambi-
ent temperature, defects should be trapped in the so
called inclusion band, which is located in the upper
part of the slab. The larger a defect is, the greater
its ascending force and therefore the higher its posi-
tion in the inclusion band. Other properties that may
influence the position of defects are sphericity (form
descriptor), type of defect, and properties inherited
from the melting charge, such as material ingredients
or the duration of the oxygen blowing process. With
certain types of defects, the defects size may corre-
late with its sphericity, again like bubbles in sparkling
water. The bigger the defect is, the more spherical it
may be. A more complex relation may be the melt-
ing temperature in combination with the defects po-
sition. Because initial temperature determines how
long steel remains molten, the degree of the influence
of the defects size on its position may vary. Several of
these effects and relationships can be analyzed with
my system.
4 STATE OF THE ART
4.1 Visualizing the Complex Data
Hierarchy
One possibility is to visualize a selected level of
traversal t with some kind of visualization. For in-
stance, a histogram is shown which summarizes a sin-
gle dimension of all the nodes on level t, e.g., the
cleanliness of all samples. This actually means, that
each node on level t is simplified so that it can be vi-
sualized. This of course can be very beneficial, as
some of the ”unimportant” dimensions aren’t shown
and thus won’t distract the end user. I call this kind of
visualization a level overview visualization as it gives
a brief summary of the whole level t. Another exam-
ple is the visualization of a two dimensional graph to
reveal the influence of an input parameter to an out-
come dimension (trend analysis), e.g., the steel clean-
liness over the smelting temperature. While my sys-
tems allows the visualization of a level overview, it is
not the focus of my research.
Small multiples (Tufte, 1990), on the other hand,
are well known to visualize multiple nodes using the
data of lower hierarchical levels, e.g., each sample is
represented by a histogram of the volumes of the de-
fects found in it. This is actually a more detailed way
to visualize the complete level t. I applied that idea
and extended the small multiples to ”small multiples
of multiple views”.
Finally, there exist many visualization systems to
analyze such data sets. A single node is a com-
plex data structure, because it has context information
available and also multiple child-nodes consisting of
various dimensions and modalities. Hierarchical visu-
alizations, like treemaps, could reveal the hierarchical
structure but that would not be of interest here as there
is a fixed hierarchy known by the end users. State of
the art are visualization systems, which allow the se-
lection of hierarchical levels and data dimensions in
combination to a visualization type (histogram, graph,
etc.) to get a visualization, or view, on a selected slice
of the data. Multiple of those visualizations and views
(Multiple Views) can be arranged side by side to sup-
port the data analysis in more detail and are today’s
state of the art. Further more, linking and brushing
abroad single views are used to help analyzing the
data further more.
4.2 First Contribution
I retain and support the state of the art of general
purpose multiple view systems to get an insight of
a complex node, like one sample. Users can build
their own MultiView-layouts using various visual-
ization types to visualize different dimensions and
hierarchical levels at once (Wang Baldonado et al.,
2000). My added contribution is, that the user-created
MultiView-layout is reused multiple times to visual-
ize multiple nodes side by side. Therefore, the user is
encouraged and supported to configure the layout in
such a way, that the specific data from a node of level
t is visualized, as shown in fig. 2 and 4.
There is some text-based information from par-
ent nodes (melting charges), some statistical visual-
izations summarizing lower nodes (piechart of defect
types, etc.) and scientific volume visualizations from
especially dangerous defects found through simple
data mining techniques.
The expected outcome is a huge enhancement in
data selection, because the typical node selection of
state of the art systems uses some text-based explorers
or smaller simplified visualizations. My system visu-
alizes the nodes fully so that the users can search vi-
sually by scrolling through the nodes, comparable to
small multiples. It is able to seamlessly adapt to dif-
ferent roles, whether the analysis of a single ensemble
member (typically a large coordinated multiple view)
or the trend analysis of multiple ensemble members
DoctoralThesis-AVisualAnalysisSystemforHierarchicalEnsembleData
53