A Study on the Role of Similarity Measures in Visual Text Analytics

F. San Roman S.; R. D. de Pinho; R. Minghim; M. C. F. de Oliveira

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

A Study on the Role of Similarity Measures in Visual Text Analytics

Topics: High-Dimensional Data and Dimensionality Reduction; Information and Scientific Visualization; Text and Document Visualization

In Proceedings of the International Conference on Computer Graphics Theory and Applications and International Conference on Information Visualization Theory and Applications - Volume 1: IVAPP, 429-438, 2013 , Barcelona, Spain

Authors: F. San Roman S. ¹ ; R. D. de Pinho ² ; R. Minghim ¹ and M. C. F. de Oliveira ¹

Affiliations: ¹ Universidade de São Paulo, Brazil ; ² Ministério da Ciência and Tecnologia e Inovação, Brazil

Keyword(s): Visual Text Analytics, Visual Text Mining, Vector Space Model, High-dimensional Data Visualization and Multidimensional Projections.

Related Ontology Subjects/Areas/Topics: Abstract Data Visualization ; Computer Vision, Visualization and Computer Graphics ; General Data Visualization ; High-Dimensional Data and Dimensionality Reduction ; Information and Scientific Visualization ; Text and Document Visualization

Abstract: Text Analytics is essential for a large number of applications and good approaches to obtain visual mappings of text are paramount. Many visualization techniques, such as similarity based point placement layouts, have proved useful to support visual analysis of documents. However, they are sensitive to data quality, which, in turn, relies on a critical preprocessing step that involves text cleaning and in some cases term detecting and weighting, as well as the definition of a similarity function. Not much has been discussed on the effect of these important similarity calculations in the quality of visual representations. This paper presents a study on the role of different text similarity measurements on the generation of visual text mappings. We focus mainly on two types of distance functions, those based on the well-known text vector representation and on direct string comparison measurements, comparing their effect on visual mappings obtained with point placement techniques. We f ind that both have their value but, in many circumstances, the vector space model (VSM) is the best solution when discrimination is important. However, the VSM is not incremental, that is, new additions to a collection force a recalculation of the whole feature space and similarities. In this work we also propose a new incremental model based on the VSM, which is shown to present the best visualization results in many configurations tested. We show the evaluation results and offer recommendations on the application of different text similarity measurements for Visual Text Analytics tasks. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 216.73.216.108

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

San Roman S., F., D. de Pinho, R., Minghim, R. and C. F. de Oliveira, M. (2013). A Study on the Role of Similarity Measures in Visual Text Analytics. In Proceedings of the International Conference on Computer Graphics Theory and Applications and International Conference on Information Visualization Theory and Applications (VISIGRAPP 2013) - IVAPP; ISBN 978-989-8565-46-4; ISSN 2184-4321, SciTePress, pages 429-438. DOI: 10.5220/0004214004290438

@conference{ivapp13,
author={F. {San Roman S.} and R. {D. de Pinho} and R. Minghim and M. {C. F. de Oliveira}},
title={A Study on the Role of Similarity Measures in Visual Text Analytics},
booktitle={Proceedings of the International Conference on Computer Graphics Theory and Applications and International Conference on Information Visualization Theory and Applications (VISIGRAPP 2013) - IVAPP},
year={2013},
pages={429-438},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004214004290438},
isbn={978-989-8565-46-4},
issn={2184-4321},
}

TY - CONF

JO - Proceedings of the International Conference on Computer Graphics Theory and Applications and International Conference on Information Visualization Theory and Applications (VISIGRAPP 2013) - IVAPP
TI - A Study on the Role of Similarity Measures in Visual Text Analytics
SN - 978-989-8565-46-4
IS - 2184-4321
AU - San Roman S., F.
AU - D. de Pinho, R.
AU - Minghim, R.
AU - C. F. de Oliveira, M.
PY - 2013
SP - 429
EP - 438
DO - 10.5220/0004214004290438
PB - SciTePress