
Authors: Arnaud Renard ; Sylvie Calabretto and Béatrice Rumpler

Affiliation: Université de Lyon, CNRS, INSA-Lyon, LIRIS, UMR5205, France

ISBN: 978-989-674-025-2

Keyword(s): Information retrieval, (semi-)Structured documents, XML, Fuzzy semantic matching, Semantic resource, Thesaurus, Ontology, Error correction, OCR.

Related Ontology Subjects/Areas/Topics: Accessibility Issues and Technology ; Internet Technology ; Ontology and the Semantic Web ; Web Information Systems and Technologies ; Web Interfaces and Applications ; XML and Data Management

Abstract: Semantics is currently one of the greatest challenges in the evolution of IR systems, including the (semi-)structured IR systems considered here. Addressing this challenge usually requires an additional external semantic resource related to the document collection. To compare concepts and, more broadly, to work with semantic resources, semantic similarity measures are necessary. These measures assume that the concepts related to the terms have been identified without ambiguity; misspelled terms therefore interfere with the term-to-concept matching process. Existing semantic-aware (semi-)structured IR systems rely on basic concept identification and do not account for uncertainty in term spelling. We choose to address this last aspect and suggest a way to detect and correct misspelled terms through a fuzzy semantic weighting formula that can be integrated into an IR system. To evaluate the expected gains, we have developed a prototype whose first results on small datasets seem interesting.
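The abstract does not give the weighting formula itself, so the following is only a minimal sketch of the general idea of fuzzy term-to-concept matching, not the authors' method. It assumes a toy thesaurus, a plain string-similarity ratio from Python's standard `difflib` module as the weight, and an arbitrary threshold of 0.8; the names `THESAURUS` and `fuzzy_concept_weights` are hypothetical.

```python
from difflib import SequenceMatcher

# Hypothetical toy thesaurus mapping terms to concept identifiers.
# These entries are illustrative only and do not come from the paper.
THESAURUS = {
    "retrieval": "C_RETRIEVAL",
    "document": "C_DOCUMENT",
    "ontology": "C_ONTOLOGY",
}


def fuzzy_concept_weights(term, thesaurus=THESAURUS, threshold=0.8):
    """Return {concept: weight} for thesaurus terms that resemble `term`.

    The weight is difflib's similarity ratio in [0, 1], so an exact match
    gets 1.0 and a near miss (for example an OCR error) gets a lower weight.
    This stands in for the paper's fuzzy semantic weighting formula, which
    the abstract does not specify.
    """
    term = term.lower()
    weights = {}
    for known_term, concept in thesaurus.items():
        score = SequenceMatcher(None, term, known_term).ratio()
        if score >= threshold:
            # Keep the best score if several terms map to the same concept.
            weights[concept] = max(score, weights.get(concept, 0.0))
    return weights


if __name__ == "__main__":
    # "docurnent" imitates a typical OCR confusion of "m" with "rn".
    print(fuzzy_concept_weights("document"))
    print(fuzzy_concept_weights("docurnent"))
    print(fuzzy_concept_weights("xyz"))
```

In this sketch a correctly spelled term maps to its concept with full weight, a noisy variant still reaches the same concept with a reduced weight, and an unrelated string matches nothing. An indexer could multiply such a weight into its usual term weighting instead of discarding the misspelled term.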

CC BY-NC-ND 4.0

Paper citation in several formats:
Renard A.; Calabretto S. and Rumpler B. (2010). FUZZY SEMANTIC MATCHING IN (SEMI-)STRUCTURED XML DOCUMENTS - Indexation of Noisy Documents. In Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 1: WEBIST, ISBN 978-989-674-025-2, pages 253-260. DOI: 10.5220/0002807502530260

@conference{webist10,
author={Arnaud Renard and Sylvie Calabretto and Béatrice Rumpler},
title={FUZZY SEMANTIC MATCHING IN (SEMI-)STRUCTURED XML DOCUMENTS - Indexation of Noisy Documents},
booktitle={Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 1: WEBIST},
year={2010},
pages={253-260},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002807502530260},
isbn={978-989-674-025-2},
}

TY - CONF

JO - Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 1: WEBIST
TI - FUZZY SEMANTIC MATCHING IN (SEMI-)STRUCTURED XML DOCUMENTS - Indexation of Noisy Documents
SN - 978-989-674-025-2
AU - Renard, A.
AU - Calabretto, S.
AU - Rumpler, B.
PY - 2010
SP - 253
EP - 260
DO - 10.5220/0002807502530260
