Dismantling Composite Visualizations in the Scientific Literature

Po-Shen Lee; Bill Howe

Topics: Image Understanding

In Proceedings of the International Conference on Pattern Recognition Applications and Methods - Volume 2: ICPRAM, 79-91, 2015, Lisbon, Portugal

Authors: Po-Shen Lee and Bill Howe

Affiliation: University of Washington, United States

Keyword(s): Visualization, Multi-chart Figure, Chart Segmentation, Chart Recognition and Understanding, Scientific Literature Retrieval, Content-based Image Retrieval.

Related Ontology Subjects/Areas/Topics: Applications; Computer Vision, Visualization and Computer Graphics; Image Understanding; Pattern Recognition

Abstract: We are analyzing the visualizations in the scientific literature to enhance search services, detect plagiarism, and study bibliometrics. An immediate problem is the ubiquitous use of multi-part figures: single images with multiple embedded sub-visualizations. Such figures account for approximately 35% of the figures in the scientific literature. Conventional image segmentation techniques and other existing approaches have been shown to be ineffective for parsing visualizations. We propose an algorithm to automatically segment multi-chart visualizations into a set of single-chart visualizations, thereby enabling downstream analysis. Our approach first splits an image into fragments based on background color and layout patterns. An SVM-based binary classifier then distinguishes complete charts from auxiliary fragments such as labels, ticks, and legends, achieving an average 98.1% accuracy. Next, we recursively merge fragments to reconstruct complete visualizations, choosing between alternative merge trees using a novel scoring function. To evaluate our approach, we used 261 scientific multi-chart figures randomly selected from the PubMed database. Our algorithm achieves 80% recall and 85% precision of perfect extractions for the common case of eight or fewer sub-figures per figure. Further, even imperfect extractions are shown to be sufficient for most chart classification and reasoning tasks associated with bibliometrics and academic search applications.

CC BY-NC-ND 4.0

Paper citation in several formats:

Lee, P.-S. and Howe, B. (2015). Dismantling Composite Visualizations in the Scientific Literature. In Proceedings of the International Conference on Pattern Recognition Applications and Methods - Volume 2: ICPRAM; ISBN 978-989-758-077-2; ISSN 2184-4313, SciTePress, pages 79-91. DOI: 10.5220/0005213100790091

@conference{icpram15,
author={Po{-}Shen Lee and Bill Howe},
title={Dismantling Composite Visualizations in the Scientific Literature},
booktitle={Proceedings of the International Conference on Pattern Recognition Applications and Methods - Volume 2: ICPRAM},
year={2015},
pages={79-91},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005213100790091},
isbn={978-989-758-077-2},
issn={2184-4313},
}

TY - CONF
JO - Proceedings of the International Conference on Pattern Recognition Applications and Methods - Volume 2: ICPRAM
TI - Dismantling Composite Visualizations in the Scientific Literature
SN - 978-989-758-077-2
IS - 2184-4313
AU - Lee, P.
AU - Howe, B.
PY - 2015
SP - 79
EP - 91
DO - 10.5220/0005213100790091
PB - SciTePress