loading
Documents

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Hermine Njike Fotzo and Patrick Gallinari

Affiliation: Université de Paris 6 – LIP6, France

ISBN: 972-8865-00-7

ISSN: 2184-4992

Keyword(s): Concept hierarchies, typed hyperlinks generation, thematic annotations, text segmentation.

Related Ontology Subjects/Areas/Topics: Biomedical Engineering ; Data Engineering ; Enterprise Information Systems ; Health Information Systems ; Information Systems Analysis and Specification ; Knowledge Management ; Ontologies and the Semantic Web ; Society, e-Business and e-Government ; Web Information Systems and Technologies

Abstract: With the development and the availability of large textual corpora, there is a need for enriching and organizing these corpora so as to make easier the research and navigation among the documents. The Semantic Web research focuses on augmenting ordinary Web pages with semantics. Indeed, wealth of information exists today in electronic form, they cannot be easily processed by computers due to lack of external semantics. Furthermore, the semantic addition is an help for user to locate, process information and compare documents contents. For now, Semantic Web research has been focused on the standardization, internal structuring of pages, and sharing of ontologies in a variety of domains. Concerning external structuring, hypertext and information retrieval communities propose to indicate relations between documents via hyperlinks or by organizing documents into concepts hierarchies, both being manually developed. We consider here the problem of automatically structuring and organizing co rpora in a way that reflects semantic relations between documents. We propose an algorithm for automatically inferring concepts hierarchies from a corpus. We then show how this method may be used to create specialization/generalization links between documents leading to document hierarchies. As a byproduct, documents are annotated with keywords giving the main concepts present in the documents. We also introduce numerical criteria for measuring the relevance of the automatically generated hierarchies and describe some experiments performed on data from the LookSmart and New Scientist web sites. (More)

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.209.10.183

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Njike Fotzo H.; Gallinari P. and (2004). INFORMATION ACCESS VIA TOPIC HIERARCHIES AND THEMATIC ANNOTATIONS FROM DOCUMENT COLLECTIONS.In Proceedings of the Sixth International Conference on Enterprise Information Systems - Volume 2: ICEIS, ISBN 972-8865-00-7, pages 69-76. DOI: 10.5220/0002634000690076

@conference{iceis04,
author={Hermine {Njike Fotzo} and Patrick Gallinari},
title={INFORMATION ACCESS VIA TOPIC HIERARCHIES AND THEMATIC ANNOTATIONS FROM DOCUMENT COLLECTIONS},
booktitle={Proceedings of the Sixth International Conference on Enterprise Information Systems - Volume 2: ICEIS,},
year={2004},
pages={69-76},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002634000690076},
isbn={972-8865-00-7},
}

TY - CONF

JO - Proceedings of the Sixth International Conference on Enterprise Information Systems - Volume 2: ICEIS,
TI - INFORMATION ACCESS VIA TOPIC HIERARCHIES AND THEMATIC ANNOTATIONS FROM DOCUMENT COLLECTIONS
SN - 972-8865-00-7
AU - Njike Fotzo, H.
AU - Gallinari, P.
PY - 2004
SP - 69
EP - 76
DO - 10.5220/0002634000690076

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.