A Novel Framework to Represent Documents using a Semantically-grounded Graph Model

Antonio M. Rinaldi; Cristiano Russo

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

A Novel Framework to Represent Documents using a Semantically-grounded Graph Model

Topics: Concept Mining; Mining Text and Semi-Structured Data; Visual Data Mining and Data Visualization

In Proceedings of the 10th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 0IC3K, 203-211, 2018 , Seville, Spain

Authors: Antonio M. Rinaldi ¹ and Cristiano Russo ²

Affiliations: ¹ Dipartimento di Ingegneria Elettrica e delle Tecnologie dell’Informazione, IKNOS-LAB-Intelligent and Knowledge Systems-LUPT, University of Naples Federico II and Italy ; ² LISSI Laboratory, University of Paris-Est Creteil (UPEC) and France

Keyword(s): Document Representation, Semantic and Linguistic Analysis, WordNet, Lexical Chains, NoSQL, Neo4J.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Concept Mining ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Mining Text and Semi-Structured Data ; Symbolic Systems ; Visual Data Mining and Data Visualization

Abstract: As an increasing number of text-based documents, whose complexity increases in turn, are available over the Internet, it becomes obvious that handling such documents as they are, i.e. in their original natural-language based format, represents a daunting task to face up for computers. Thus, some methods and techniques have been used and refined, throughout the last decades, in order to transform the digital documents from the full text version to another suitable representation, making them easier to handle and thus helping users in getting the right information with a reduced algorithmic complexity. One of the most spread solution in document representation and retrieval has consisted in transforming the full text version into a vector, which describes the contents of the document in terms of occurrences patterns of words. Although the wide adoption of this technique, some remarkable drawbacks have been soon pointed out from the researchers’ community, mainly focused on the lack of semantics for the associated terms. In this work, we use WordNet as a generalist linguistic database in order to enrich, at a semantic level, the document representation by exploiting a label and properties based graph model, implemented in Neo4J. This work demonstrates how such representation allows users to quickly recognize the document topics and lays the foundations for cross-document relatedness measures that go beyond the mere word-centric approach. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 18.219.79.34

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Rinaldi, A. M. and Russo, C. (2018). A Novel Framework to Represent Documents using a Semantically-grounded Graph Model. In Proceedings of the 10th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2018) - KDIR; ISBN 978-989-758-330-8; ISSN 2184-3228, SciTePress, pages 203-211. DOI: 10.5220/0006932502030211

@conference{kdir18,
author={Antonio M. Rinaldi and Cristiano Russo},
title={A Novel Framework to Represent Documents using a Semantically-grounded Graph Model},
booktitle={Proceedings of the 10th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2018) - KDIR},
year={2018},
pages={203-211},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006932502030211},
isbn={978-989-758-330-8},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the 10th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2018) - KDIR
TI - A Novel Framework to Represent Documents using a Semantically-grounded Graph Model
SN - 978-989-758-330-8
IS - 2184-3228
AU - Rinaldi, A.
AU - Russo, C.
PY - 2018
SP - 203
EP - 211
DO - 10.5220/0006932502030211
PB - SciTePress