Author: Pawel Chrzaszcz

Affiliation: Jagiellonian University and AGH University of Science and Technology, Poland

Keyword(s): Semantic Labels, Wikipedia, Inflection Dictionary.

Abstract: Inflection dictionaries are widely used in many natural language processing tasks, especially for inflecting languages. However, they lack semantic information, which could increase the accuracy of such processing. This paper describes a method to extract semantic labels from encyclopedic entries. Adding such labels to an inflection dictionary could eliminate the need to use ontologies and similar complex semantic structures for many typical tasks. A semantic label is either a single word or a sequence of words that describes the meaning of a headword; hence it is similar to a semantic category. However, no taxonomy of such categories is known prior to the extraction. Encyclopedic articles consist of headwords and their definitions, so the definitions are used as sources for semantic labels. The described algorithm has been implemented for extracting data from the Polish Wikipedia. It is based on definition structure analysis, heuristic methods, and word form recognition and processing using the Polish Inflection Dictionary. This paper contains a description of the method and test results, as well as a discussion of possible further development.
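To make the idea of a semantic label concrete, the sketch below shows one very simple way a label could be pulled from the first sentence of a definition. It is an illustration only and not the algorithm from the paper: it assumes English definitions of the form "X is a Y ...", whereas the paper works on the Polish Wikipedia, and it uses no inflection dictionary. The names `COPULA`, `STOP_WORDS`, and `extract_label` are hypothetical.

```python
import re
from typing import Optional

# Hypothetical copula pattern separating a headword from its definition.
# This is an illustrative assumption, not a rule taken from the paper.
COPULA = re.compile(r"\s+(?:is|are|was|were)\s+(?:an?|the)\s+", re.IGNORECASE)

# Words at which this toy heuristic stops collecting the label (assumed list).
STOP_WORDS = {"that", "which", "who", "in", "of", "on", "from", "for", "with", "and"}


def extract_label(first_sentence: str, max_words: int = 3) -> Optional[str]:
    """Return a candidate label from an English 'X is a Y ...' definition."""
    parts = COPULA.split(first_sentence, maxsplit=1)
    if len(parts) != 2:
        return None  # no recognizable definition structure

    label_words = []
    # Tokens are runs of letters, optionally joined by hyphens.
    for token in re.findall(r"[^\W\d_]+(?:-[^\W\d_]+)*", parts[1]):
        if token.lower() in STOP_WORDS or len(label_words) == max_words:
            break
        label_words.append(token.lower())
    return " ".join(label_words) or None


print(extract_label("Kraków is a city in southern Poland."))
print(extract_label("A violin is a string instrument that is played with a bow."))
```

The label may be a single word or a short sequence of words, matching the definition of a semantic label given in the abstract. A real implementation for an inflecting language would also need to recognize word forms and reduce them to base forms, which this sketch leaves out.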

CC BY-NC-ND 4.0

Paper citation in several formats:
Chrzaszcz, P. (2012). Enrichment of Inflection Dictionaries: Automatic Extraction of Semantic Labels from Encyclopedic Definitions. In Proceedings of the 9th International Workshop on Natural Language Processing and Cognitive Science (ICEIS 2012) - NLPCS; ISBN 978-989-8565-16-7, SciTePress, pages 106-119. DOI: 10.5220/0004100501060119

@conference{nlpcs12,
author={Pawel Chrzaszcz},
title={Enrichment of Inflection Dictionaries: Automatic Extraction of Semantic Labels from Encyclopedic Definitions},
booktitle={Proceedings of the 9th International Workshop on Natural Language Processing and Cognitive Science (ICEIS 2012) - NLPCS},
year={2012},
pages={106-119},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004100501060119},
isbn={978-989-8565-16-7},
}

TY - CONF
JO - Proceedings of the 9th International Workshop on Natural Language Processing and Cognitive Science (ICEIS 2012) - NLPCS
TI - Enrichment of Inflection Dictionaries: Automatic Extraction of Semantic Labels from Encyclopedic Definitions
SN - 978-989-8565-16-7
AU - Chrzaszcz, P.
PY - 2012
SP - 106
EP - 119
DO - 10.5220/0004100501060119
PB - SciTePress