borhood relationships between data items. All con-
struction stages are optimized for this task: 1) similar-
ity of feature axes is evaluated by similarity of neigh-
borhood relationships shown in each axis; 2) axis
placement is optimized so that similar axes (showing
similar neighbor relationships) are placed nearby in
the PCP, allowing the user to retrieve similar axes eas-
ily from looking at the PCP; 3) coloring of data lines
is optimized to show overall neighbor relationships
of data items across all features, allowing the user to
track relationships of similar data items over all axes.
We do not claim neighbor retrieval is the only task
PCPs should be optimized for—relating data items
(to neighbors) is one of the component tasks in ex-
ploratory data analysis, and methods can be created
to optimize PCPs for other component tasks. Fu-
ture work could find theoretical connections describ-
ing earlier PCP works (such as axis ordering meth-
ods) as approximate optimization of other component
tasks of exploratory data analysis. In this sense, our
work is the first in a research direction of optimizing
PCPs for subtasks of exploratory data analysis.
Our construction method is general and applicable
both to 2D and 3D PCPs. Resulting PCPs have a sim-
ilar form as traditional 2D and 3D PCPs, but the new
PCPs are optimized for an analysis task; the PCPs are
directly pluggable into visualization systems featur-
ing PCPs, potentially improving their ability to serve
data analysts. For reasonably-sized data construction
of the plots is already fast; for very large data sets
recent work in speedup of scatter plot optimizations
(Yang et al., 2013) may be adapted to PCP optimiza-
tion. As other further work, it is easy to add prefer-
ences about layouts as penalties to Eq. (6), such as a
repulsion term (Peltonen and Lin, 2015) keeping axes
a desired minimum apart if needed for readability.
ACKNOWLEDGEMENTS
We acknowledge the computational resources pro-
vided by the Aalto Science-IT project. Authors be-
long to the Finnish CoE in Computational Inference
Research COIN. The work was supported in part by
TEKES (Re:Know project). The work was also sup-
ported in part by the Academy of Finland, decision
numbers 252845, 256233, and 295694.
REFERENCES
Achtert, E., Kriegel, H.-P., Schubert, E., and Zimek, A.
(2013). Interactive data mining with 3d-parallel-
coordinate-trees. In SIGMOD, pages 1009–1012,
New York, NY, USA. ACM.
Ankerst, M., Berchtold, S., and Keim, D. A. (1998). Simi-
larity clustering of dimensions for an enhanced visual-
ization of multidimensional data. In INFOVIS, pages
52–60.
Caldas, J., Gehlenborg, N., Faisal, A., Brazma, A., and
Kaski, S. (2009). Probabilistic retrieval and visualiza-
tion of biologically relevant microarray experiments.
Bioinformatics, 25:i145–i153.
Claessen, J. and van Wijk, J. (2011). Flexible Linked Axes
for Multivariate Data Visualization. IEEE T. Vis. Com-
put. Gr., 17:2310–2316.
Fanea, E., Carpendale, S., and Isenberg, T. (2005). An in-
teractive 3d integration of parallel coordinates and star
glyphs. In INFOVIS, pages 149–156. IEEE.
Fua, Y.-H., Ward, M. O., and Rundensteiner, E. A. (1999).
Hierarchical parallel coordinates for exploration of
large datasets. In VIS, pages 43–50. IEEE Computer
Society Press.
Guo, D. (2003). Coordinating computational and visual ap-
proaches for interactive feature selection and multi-
variate clustering. Inform. Vis., 2:232–246.
Heinrich, J., Stasko, J., and Weiskoph, D. (2012). The par-
allel coordinates matrix. In Eurovis, pages 37–41.
Heinrich, J. and Weiskopf, D. (2013). State of the Art of
Parallel Coordinates. In EG2013 - STARs. The Euro-
graphics Association.
Herman, I., , Melanc¸on, G., and Marshall, M. S. (2000).
Graph visualization and navigation in information vi-
sualization: a survey. IEEE T. Vis. Comput. Gr., 6:24–
43.
Hinton, G. E. and Roweis, S. T. (2002). Stochastic neighbor
embedding. In NIPS, pages 833–840.
Inselberg, A. (2009). Parallel Coordinates: Visual Multidi-
mensional Geometry and Its Applications. Springer.
Johansson, J., Ljung, P., Jern, M., and Cooper, M. (2006).
Revealing structure in visualizations of dense 2d and
3d parallel coordinates. Inform. Vis., 5:125–136.
Koropatkin, N. M., Cameron, E. A., and Martens, E. C.
(2012). How glycan metabolism shapes the human
gut microbiota. Nat. Rev. Microbiol., 10:323–335.
Laplante, M. and Sabatini, D. M. (2012). mTOR signaling
in growth control and disease. Cell, 149:274–293.
Lichman, M. (2013). UCI machine learning repository.
Makwana, H., Tanwani, S., and Jain, S. (2012). Axes re-
ordering in parallel coordinate for pattern optimiza-
tion. Int. J. Comput. Appl., 40:43–48.
Parkinson, H. E. et al. (2009). Arrayexpress update - from
an archive of functional genomics experiments to the
atlas of gene expression. Nucleic Acids Res., 37:868–
872.
Peltonen, J. and Kaski, S. (2011). Generative modeling for
maximizing precision and recall in information visual-
ization. In AISTATS 2011, volume 15, pages 597–587.
JMLR W&CP.
Peltonen, J. and Lin, Z. (2015). Information retrieval ap-
proach to meta-visualization. Mach. Learn., 99:189–
229.
IVAPP 2017 - International Conference on Information Visualization Theory and Applications
50