AUTOMATIC MULTILINGUAL LEXICON GENERATION USING WIKIPEDIA AS A RESOURCE

Ahmad R. Shahid; Dimitar Kazakov

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

AUTOMATIC MULTILINGUAL LEXICON GENERATION USING WIKIPEDIA AS A RESOURCE

Topics: Data Mining; Machine Learning; Natural Language Processing

In Proceedings of the International Conference on Agents and Artificial Intelligence ICAART - Volume 1, 357-360, 2009 , Porto, Portugal

Authors: Ahmad R. Shahid and Dimitar Kazakov

Affiliation: The University of York, United Kingdom

Keyword(s): Multilingual Lexicons, Web Crawler,Wikipedia, Natural Language Processing, Web mining, Data mining.

Related Ontology Subjects/Areas/Topics: Applications ; Artificial Intelligence ; Computational Intelligence ; Data Mining ; Databases and Information Systems Integration ; Enterprise Information Systems ; Evolutionary Computing ; Knowledge Discovery and Information Retrieval ; Knowledge Engineering and Ontology Development ; Knowledge-Based Systems ; Machine Learning ; Natural Language Processing ; Pattern Recognition ; Sensor Networks ; Signal Processing ; Soft Computing ; Symbolic Systems

Abstract: This paper proposes a method for creating a multilingual dictionary by taking the titles of Wikipedia pages in English and then finding the titles of the corresponding articles in other languages. The creation of such multilingual dictionaries has become possible as a result of exponential increase in the size of multilingual information on the web. Wikipedia is a prime example of such multilingual source of information on any conceivable topic in the world, which is edited by the readers. Here, a web crawler has been used to traverse Wikipedia following the links on a given page. The crawler takes out the title along with the titles of the corresponding pages in other targeted languages. The result is a set of words and phrases that are translations of each other. For efficiency, the URLs are organized using hash tables. A lexicon has been constructed which contains 7-tuples corresponding to 7 different languages, namely: English, German, French, Polish, Bulgarian, Greek and Chinese.

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 216.73.216.119

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

R. Shahid, A., Kazakov and D. (2009). AUTOMATIC MULTILINGUAL LEXICON GENERATION USING WIKIPEDIA AS A RESOURCE. In Proceedings of the International Conference on Agents and Artificial Intelligence - ICAART; ISBN 978-989-8111-66-1; ISSN 2184-433X, SciTePress, pages 357-360. DOI: 10.5220/0001783003570360

@conference{icaart09,
author={Ahmad {R. Shahid} and Dimitar Kazakov},
title={AUTOMATIC MULTILINGUAL LEXICON GENERATION USING WIKIPEDIA AS A RESOURCE},
booktitle={Proceedings of the International Conference on Agents and Artificial Intelligence - ICAART},
year={2009},
pages={357-360},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001783003570360},
isbn={978-989-8111-66-1},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the International Conference on Agents and Artificial Intelligence - ICAART
TI - AUTOMATIC MULTILINGUAL LEXICON GENERATION USING WIKIPEDIA AS A RESOURCE
SN - 978-989-8111-66-1
IS - 2184-433X
AU - R. Shahid, A.
AU - Kazakov, D.
PY - 2009
SP - 357
EP - 360
DO - 10.5220/0001783003570360
PB - SciTePress