AN EXTENSIVE COMPARISON OF METRICS FOR AUTOMATIC EXTRACTION OF KEY TERMS

Luis F. S. Teixeira; Gabriel P. Lopes; Rita A. Ribeiro

Topics: Natural Language Processing

In Proceedings of the 4th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART, 55-63, 2012, Vilamoura, Algarve, Portugal

Authors: Luis F. S. Teixeira ¹; Gabriel P. Lopes ¹ and Rita A. Ribeiro ²

Affiliations: ¹ FCT/UNL, Portugal; ² CA3-Uninova, Portugal

Keyword(s): Document keywords, Document topics, Words, Multi-words, Prefixes, Automatic extraction, Suffix arrays.

Related Ontology Subjects/Areas/Topics: Applications; Artificial Intelligence; Knowledge Engineering and Ontology Development; Knowledge-Based Systems; Natural Language Processing; Pattern Recognition; Symbolic Systems

Abstract: In this paper we compare twenty language-independent, statistics-based metrics for extracting key terms from any document collection. Some of these metrics are widely used; others were created recently. Our experiments consider two document representations. One is based on words and multi-words; the other is based on fixed-length word prefixes (5 characters in these experiments), intended to handle morphologically rich languages, namely Portuguese and Czech. English is also included in the experiments as a language that is not morphologically rich. Results are evaluated manually, and agreement between evaluators is assessed using k-Statistics. The metrics based on Tf-Idf and Phi-square achieved higher precision and recall than the other metrics. The prefix-based document representation yielded a significant improvement for documents written in Portuguese.
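As an illustration of the prefix-based representation mentioned in the abstract, the following is a minimal sketch that truncates each word to its first 5 characters and ranks the resulting prefixes with a textbook Tf-Idf score, tf * log(N / df). This is an assumption-laden sketch, not the authors' implementation: the whitespace tokenization, the exact Tf-Idf variant, and the function names `to_prefixes`, `tf_idf` and `top_terms` are all hypothetical, and multi-words and the Phi-square metric are not covered.

```python
import math
from collections import Counter

PREFIX_LEN = 5  # fixed prefix length reported in the abstract


def to_prefixes(text, n=PREFIX_LEN):
    """Lowercase, split on whitespace, and keep the first n characters of each token.

    Whitespace tokenization is an assumption made for this sketch.
    """
    return [tok[:n] for tok in text.lower().split()]


def tf_idf(docs):
    """Return one {prefix: score} dict per document.

    Uses the textbook form tf * log(N / df); the variant evaluated in the
    paper may differ.
    """
    tokenized = [to_prefixes(d) for d in docs]
    n_docs = len(tokenized)

    # Document frequency: number of documents in which each prefix occurs.
    df = Counter()
    for toks in tokenized:
        df.update(set(toks))

    scores = []
    for toks in tokenized:
        counts = Counter(toks)
        total = len(toks)
        scores.append(
            {t: (c / total) * math.log(n_docs / df[t]) for t, c in counts.items()}
        )
    return scores


def top_terms(docs, k=3):
    """Return the k highest-scoring prefixes per document (ties broken alphabetically)."""
    return [sorted(s, key=lambda t: (-s[t], t))[:k] for s in tf_idf(docs)]
```

With this truncation, inflected forms such as "documento", "documentos" and "documentação" all map to the prefix "docum", so their counts are pooled before scoring. That pooling is the intuition behind using prefixes for morphologically rich languages.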

CC BY-NC-ND 4.0

Paper citation in several formats:

Teixeira, L. F. S., Lopes, G. P. and Ribeiro, R. A. (2012). AN EXTENSIVE COMPARISON OF METRICS FOR AUTOMATIC EXTRACTION OF KEY TERMS. In Proceedings of the 4th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART; ISBN 978-989-8425-95-9; ISSN 2184-433X, SciTePress, pages 55-63. DOI: 10.5220/0003720400550063

@conference{icaart12,
author={Luis F. S. Teixeira and Gabriel P. Lopes and Rita A. Ribeiro},
title={AN EXTENSIVE COMPARISON OF METRICS FOR AUTOMATIC EXTRACTION OF KEY TERMS},
booktitle={Proceedings of the 4th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART},
year={2012},
pages={55-63},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003720400550063},
isbn={978-989-8425-95-9},
issn={2184-433X},
}

TY - CONF
JO - Proceedings of the 4th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART
TI - AN EXTENSIVE COMPARISON OF METRICS FOR AUTOMATIC EXTRACTION OF KEY TERMS
SN - 978-989-8425-95-9
IS - 2184-433X
AU - Teixeira, L. F. S.
AU - Lopes, G. P.
AU - Ribeiro, R. A.
PY - 2012
SP - 55
EP - 63
DO - 10.5220/0003720400550063
PB - SciTePress