scholars: (1) Showing the POS-combination pattern
of the hierarchical form, (2) exploring results
according to the POS pattern, and (3) searching the
source corpus for the analysis-result verification.
Therefore, PreechVis, an interactive-visualization
tool that covers the necessary functions for the
exploration of larger amounts of multiword corpus
information, has been created for this study. This
work provides the following contributions: (1) The
two types of corpus-data words for the exploration of
more information and accurate results is presented,
(2) an interactive visualization and multifunctional
tool for the visual analysis is presented utilizing the
multiword type, and (3) the visualization tool is
assessed via case studies for which the U.S.
Presidential Address is used to verify the utility of
PreechVis.
2 RELATED WORKS
For this work, it is assumed that with a textual
multiword analysis, the information is easier to
discover and to analyze through an interactive
visualization. Supporting this hypothesis, Carlos
Ramisch summarized a textual-analysis technique for
multiword expressions in a recent book (Ramisch,
2015). Further, many studies on the visual tools for
textual analyses have been conducted (Koch et al.,
2014), (Sun et al.,2014).
Numerous visualizations have been created to
extract and explore more information in large corpus
data. For a number of visualizations such as EvoRiver
(Sun et al., 2014) and OpinionFlow (Wu et al., 2014),
a word-based analysis is employed regardless of the
multiword combinations. The focus here is the word-
based multiword analysis and the defining of how the
multiword result can be presented in an interactive
visualization. Much research has been conducted on
corpus-data visualization.
2.1 Word-based Corpus Visualization
Word-based corpus visualization, which aims to
understand and explore words-based text corpora, has
received considerable attention in recent years (Sun
et al., 2014), (Cui et al., 2014a), (Wu et al., 2014).
EvoRiver(Sun et al., 2014) is a time-based
visualization that allows users to explore
competition-related interactions and to detect
dynamically evolving patterns, as well as their major
causes. Cui et al. (Cui et al., 2014a) presented an
interactive visual textual-analysis approach that
allows users to progressively explore and analyze the
complex evolutionary patterns of hierarchical topics.
Wu et al. (Wu et al., 2014) introduced a visual-
analysis system called OpinionFlow to empower
analysts to detect opinion-propagation patterns and
glean insights. Further, for OpinionFlow, a Sankey
graph is combined with a tailored density map in one
view to visually convey the diffusion of opinions
among many users.
These related works focus on visual explorations
of words without a considering multiword
expressions. Whereas, our present work includes
multiword expressions in its word-based illustrations.
2.2 Visual Graph Comparison
A visual-graph comparison aims to analyze the
similarities and differences between variables. A
number of visual-graph- comparison methods have
been proposed by many studies (Andrews et al.,
2009), (Cui et al., 2014b), (Collins and Carpendale,
2007).
Andrews et al. (Andrews et al., 2009) presented a
technique and a prototype tool to support the visual
comparison of graphs and the interactive
reconciliation of candidate graphs into a single
reference graph. Cui et al. (Cui et al., 2014b)
introduced a novel flow-based visualization design
for the summarization of high-level evolution
patterns in a dynamic graph. Collins et al. (Collins
and Carpendale,2007) described VisLink, a
visualization environment in which one can display
multiple two-dimensional (2D) visualizations,
reposition and reorganize them in a three-dimensional
(3D) form, and display the relationships between
them by propagating the edges from one visualization
to another.
These works assume that visual graph
comparisons can help users understand the
similarities and differences between variables. In this
paper, the PreechVis allows for wider comparison,
across analysis results extracted from multiple
corpora based on user selections.
2.3 Verification of the Visual Findings
A visual analysis of the corpus data can help uses
understand the corpus without reading it. However, to
assess the utility of a visual-analysis tool, derived
insights must be verified through comparison with the
real corpus (Koch et al., 2014), (Stasko et al., 2007).
Koch et al. (Koch et al., 2014) presented a method
that supports visual-analytics tasks on large text
documents that is particularly useful in situations
where scrutiny is required and the textual source must
IVAPP 2018 - International Conference on Information Visualization Theory and Applications
98