PreechVis: Visual Profiling using Multiple-word Combinations

Seongmin Mun, Gyeongcheol Choi, Guillaume Desagulier, Kyungwon Lee

2018

Abstract

Words in the corpus include features and information, and the visualizing of such words can improve the user’s understanding of them. Words in text corpus may be consist of one-word or they may be a combination of words that together, constitute a word. The latter is referred to as a multiword expression. And if we analyze both single word and multiword with visualization, we can get more accurate results and more information than we analyze only single word from corpus. An interactive visualization can be useful for analyzing multiword expressions, because the following features are of interest to linguistics scholars: (1) Showing the combinatory POS pattern of a hierarchical form, (2) exploring results according to the POS pattern, and (3) searching the source corpus for the analysis-result verification. Therefore, we propose PreechVis, an interactive-visualization tool that includes all of the requisite functions for an analysis for which multiple words (http://202.30.24.167:3010/PreechVisMWE) are utilized. For the present study, we used a total of 957 speeches of 43 U.S. Presidents from George Washington to Barack Obama as the corpus data. PreechVis is divided into two views. In the first view, the system consists of a combination of Sunburst and RadVis. Through the circular Sunburst, we present the POS and its combination patterns for each gram. In RadVis, the Presidents were positioned according to their frequency value. In addition, when the President was selected, the frequency value was displayed on Sunburst to improve the user’s understanding. In the second view, the user can simultaneously confirm and verify the details of the result using the Wordcloud. The two different views are synchronized with each other and are changed by the selected grams, issues, and Presidents. In the experiments and case studies on the U.S.-President speech data, we verified the effectiveness and usability of PreechVis.

Download


Paper Citation


in Harvard Style

Mun S., Choi G., Desagulier G. and Lee K. (2018). PreechVis: Visual Profiling using Multiple-word Combinations. In Proceedings of the 13th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2018) - Volume 2: IVAPP; ISBN 978-989-758-289-9, SciTePress, pages 97-107. DOI: 10.5220/0006615500970107


in Bibtex Style

@conference{ivapp18,
author={Seongmin Mun and Gyeongcheol Choi and Guillaume Desagulier and Kyungwon Lee},
title={PreechVis: Visual Profiling using Multiple-word Combinations},
booktitle={Proceedings of the 13th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2018) - Volume 2: IVAPP},
year={2018},
pages={97-107},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006615500970107},
isbn={978-989-758-289-9},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 13th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2018) - Volume 2: IVAPP
TI - PreechVis: Visual Profiling using Multiple-word Combinations
SN - 978-989-758-289-9
AU - Mun S.
AU - Choi G.
AU - Desagulier G.
AU - Lee K.
PY - 2018
SP - 97
EP - 107
DO - 10.5220/0006615500970107
PB - SciTePress