VISUALIZATION OF AND RETRIEVAL OF BACKGROUND INFORMATION RELATING TO WORDS IN WEB DOCUMENTS - A Visualization Interface based on Vector Representation

Kouji Shimatsuka, Tatsuhiro Yonekura

2009

Abstract

When people encounter unfamiliar words, they often use tools such as search engines to obtain background information on these words. However, the semantic content of words can be complex, and it is not always possible to understand the meaning of words from textual information alone. In this paper we quantify the semantic content of words by means of a simple and convenient text-based method whereby the semantic content is constructed from linguistic, visual and auditory characteristic values. Using characteristic vectors generated in this way, users are able to visually check and search for background information on unfamiliar terms in a web document.

References

  1. Gerard Salton, Michael J. MeGill, 1983. Introduction to Modern Information Retrieval. MeGraw-Hill.
  2. E. Chisholm, T. Kolda, 1999. New term weighting formulas for the vector space method in information retrieval. Technical Memorandum ORNL-13756.
  3. K. Kita, K. Tsuda, and M. Shishibori, 2002. Information Retieval Algorithms. Kyoritsu Shuppan Press.
  4. Kanada, Y., 1999. A Method of Geographical Name Extraction from Japanese Text for Thematic Geographical Search. 18th International Conference on Information and Knowledge Management (CIKM'99), pp. 46-54.
  5. Matsuo, Y., Ishizuka, M., 2002. Keyword Extraction from a Document using Word Co-occurrence Statistical Information.Journal of the Japanese Society for Artificial Intelligence(17-3D), pp.217-223.
Download


Paper Citation


in Harvard Style

Shimatsuka K. and Yonekura T. (2009). VISUALIZATION OF AND RETRIEVAL OF BACKGROUND INFORMATION RELATING TO WORDS IN WEB DOCUMENTS - A Visualization Interface based on Vector Representation . In Proceedings of the Fifth International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-989-8111-81-4, pages 419-422. DOI: 10.5220/0001839204190422


in Bibtex Style

@conference{webist09,
author={Kouji Shimatsuka and Tatsuhiro Yonekura},
title={VISUALIZATION OF AND RETRIEVAL OF BACKGROUND INFORMATION RELATING TO WORDS IN WEB DOCUMENTS - A Visualization Interface based on Vector Representation},
booktitle={Proceedings of the Fifth International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,},
year={2009},
pages={419-422},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001839204190422},
isbn={978-989-8111-81-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Fifth International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,
TI - VISUALIZATION OF AND RETRIEVAL OF BACKGROUND INFORMATION RELATING TO WORDS IN WEB DOCUMENTS - A Visualization Interface based on Vector Representation
SN - 978-989-8111-81-4
AU - Shimatsuka K.
AU - Yonekura T.
PY - 2009
SP - 419
EP - 422
DO - 10.5220/0001839204190422