UNSUPERVISED ORGANISATION OF SCIENTIFIC DOCUMENTS

André Lourenço; Liliana Medina; Ana Fred; Joaquim Filipe

In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: SSTM, 549-560, 2011, Paris, France

Authors: André Lourenço ¹ ; Liliana Medina ² ; Ana Fred ³ and Joaquim Filipe ⁴

Affiliations: ¹ Instituto Superior de Engenharia de Lisboa and a, Portugal ; ² Institute for Systems and Technologies of Information, Control and Communication, Portugal ; ³ Instituto Superior Técnico, Portugal ; ⁴ Institute for Systems and Technologies of Information, Control and Communication and Polytechnic Institute of Setúbal, Portugal

Keyword(s): Unsupervised learning, Clustering, Clustering combination, Clustering ensembles, Text mining, Feature selection, Concept induction, Metaterm.

Abstract: Unsupervised organisation of documents, and in particular of research papers, into meaningful groups is a difficult problem. With the typical vector-space-model representation (the bag-of-words paradigm), difficulties arise from its intrinsically high dimensionality, the high redundancy of its features, and its lack of semantic information. In this work we propose a document representation that relies on a statistical feature reduction step and on an enrichment phase that introduces higher-abstraction terms, designated metaterms, derived from the text using paper topics and keywords as prior knowledge. The proposed representation, combined with a clustering ensemble approach, leads to a novel document organisation strategy. We evaluate the proposed approach on conference papers as the application domain, with topic information extracted from conference topics or areas. Performance evaluation on data sets from the NIPS and INSTICC conferences shows that the proposed approach leads to interesting and encouraging results.
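The pipeline named in the abstract (bag-of-words, feature reduction, metaterm enrichment, clustering ensemble) can be illustrated with a minimal Python sketch. It is not the authors' implementation, and every concrete choice in it is an assumption: a document-frequency threshold stands in for the statistical feature reduction step, the metaterm groupings and toy documents are hypothetical, and the ensemble is a co-association (evidence-accumulation-style) combination of repeated k-means runs, a technique the abstract does not specify.

```python
import random
from collections import Counter

def bag_of_words(docs):
    """Term-count rows over a shared, sorted vocabulary."""
    counts = [Counter(d.lower().split()) for d in docs]
    vocab = sorted(set().union(*counts))
    return vocab, [[c[t] for t in vocab] for c in counts]

def reduce_features(vocab, rows, min_df=2):
    """Keep terms found in at least min_df documents (assumed stand-in
    for the paper's statistical feature reduction)."""
    keep = [j for j in range(len(vocab))
            if sum(1 for r in rows if r[j] > 0) >= min_df]
    return [vocab[j] for j in keep], [[r[j] for j in keep] for r in rows]

def add_metaterms(vocab, rows, metaterms):
    """Append one column per metaterm: the summed counts of its member terms."""
    index = {t: j for j, t in enumerate(vocab)}
    extra = [[sum(r[index[t]] for t in terms if t in index)
              for terms in metaterms.values()] for r in rows]
    return vocab + list(metaterms), [r + e for r, e in zip(rows, extra)]

def kmeans(rows, k, rng, iters=20):
    """Plain k-means with random initial centres; returns one label per row."""
    centers = [list(r) for r in rng.sample(rows, k)]
    labels = [0] * len(rows)
    for _ in range(iters):
        labels = [min(range(k), key=lambda c: sum((a - b) ** 2
                      for a, b in zip(r, centers[c]))) for r in rows]
        for c in range(k):
            members = [r for r, l in zip(rows, labels) if l == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels

def co_association(rows, k, runs=30, seed=0):
    """Fraction of k-means runs in which each pair of documents is co-clustered."""
    rng = random.Random(seed)
    n = len(rows)
    co = [[0.0] * n for _ in range(n)]
    for _ in range(runs):
        labels = kmeans(rows, k, rng)
        for i in range(n):
            for j in range(n):
                co[i][j] += (labels[i] == labels[j]) / runs
    return co

def consensus(co, threshold=0.5):
    """Connected components of the graph linking pairs with co[i][j] > threshold."""
    labels, current = [-1] * len(co), 0
    for start in range(len(co)):
        if labels[start] != -1:
            continue
        labels[start], stack = current, [start]
        while stack:
            i = stack.pop()
            for j in range(len(co)):
                if labels[j] == -1 and co[i][j] > threshold:
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels

# Toy corpus and hypothetical metaterm definitions, for illustration only.
docs = [
    "clustering ensembles combine clustering partitions",
    "clustering documents with ensembles",
    "kernel methods for regression",
    "regression with kernel machines",
]
metaterms = {
    "META_ensemble_clustering": ["clustering", "ensembles"],
    "META_kernel_regression": ["kernel", "regression"],
}
vocab, rows = bag_of_words(docs)
vocab, rows = reduce_features(vocab, rows, min_df=2)
vocab, rows = add_metaterms(vocab, rows, metaterms)
co = co_association(rows, k=2)
labels = consensus(co)
print(vocab)
print(labels)
```

In this sketch the metaterm columns add a coarse, topic-level signal on top of the reduced term counts, and the co-association matrix lets the final grouping depend on agreement across many partitions rather than on a single k-means initialisation.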

CC BY-NC-ND 4.0

Paper citation in several formats:

Lourenço, A.; Medina, L.; Fred, A. and Filipe, J. (2011). UNSUPERVISED ORGANISATION OF SCIENTIFIC DOCUMENTS. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2011) - SSTM; ISBN 978-989-8425-79-9; ISSN 2184-3228, SciTePress, pages 549-560. DOI: 10.5220/0003722905570568

@conference{sstm11,
author={Andr\'{e} Louren\c{c}o and Liliana Medina and Ana Fred and Joaquim Filipe},
title={UNSUPERVISED ORGANISATION OF SCIENTIFIC DOCUMENTS},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2011) - SSTM},
year={2011},
pages={549--560},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003722905570568},
isbn={978-989-8425-79-9},
issn={2184-3228},
}

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2011) - SSTM
TI - UNSUPERVISED ORGANISATION OF SCIENTIFIC DOCUMENTS
SN - 978-989-8425-79-9
IS - 2184-3228
AU - Lourenço, A.
AU - Medina, L.
AU - Fred, A.
AU - Filipe, J.
PY - 2011
SP - 549
EP - 560
DO - 10.5220/0003722905570568
PB - SciTePress