Paper

Authors: Ngoc Phuoc An Vo 1 and Octavian Popescu 2

Affiliations: 1 Xerox Research Centre Europe, France ; 2 IBM T.J.Watson Research, United States

ISBN: 978-989-758-203-5

Keyword(s): Machine Learning, Natural Language Processing (NLP), Semantic Textual Similarity (STS).

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Computational Intelligence ; Evolutionary Computing ; Information Extraction ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Machine Learning ; Soft Computing ; Symbolic Systems

Abstract: Building a system that can cope with the various phenomena that fall under the umbrella of semantic similarity is far from trivial. It is almost always the case that a system's performance does not vary consistently or predictably from corpus to corpus. We analyzed the source of this variance and found that it is related to the distribution of word-pair similarity among the topics in the various corpora. We then used this insight to construct a 4-module system that takes into consideration not only string and semantic word similarity, but also word alignment and sentence structure. The system consistently achieves an accuracy that either comes very close to the state of the art or sets a new state of the art. It is based on a multi-layer architecture and can deal with heterogeneous corpora that may not have been generated by the same distribution.
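The abstract names the kinds of evidence the system combines but gives no implementation details. As a rough illustration only, and not the authors' system, the Python sketch below computes three toy features (character-level string similarity, token overlap, and a greedy word alignment) and mixes them with fixed weights. All function names and the equal weights are hypothetical. The semantic word similarity and sentence structure modules are left out because they would need external lexical resources and a parser.

```python
from difflib import SequenceMatcher


def string_similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1], using difflib's ratio."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def token_overlap(a: str, b: str) -> float:
    """Jaccard overlap between the lowercased token sets of two sentences."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta and not tb:
        return 1.0
    return len(ta & tb) / len(ta | tb)


def alignment_score(a: str, b: str) -> float:
    """Greedy one-to-one word alignment (a toy stand-in for a real aligner).

    Each word of `a` is paired with its most similar still-unused word of `b`;
    the summed pair similarity is divided by the longer sentence length.
    """
    wa, wb = a.lower().split(), b.lower().split()
    if not wa or not wb:
        return 1.0 if wa == wb else 0.0
    unused = list(wb)
    total = 0.0
    for word in wa:
        if not unused:
            break
        best = max(unused, key=lambda v: SequenceMatcher(None, word, v).ratio())
        total += SequenceMatcher(None, word, best).ratio()
        unused.remove(best)
    return total / max(len(wa), len(wb))


def combined_similarity(a: str, b: str, weights=(1 / 3, 1 / 3, 1 / 3)) -> float:
    """Weighted mix of the three features; the weights are arbitrary assumptions."""
    features = (
        string_similarity(a, b),
        token_overlap(a, b),
        alignment_score(a, b),
    )
    return sum(w * f for w, f in zip(weights, features))
```

Calling `combined_similarity("A man is playing a guitar", "A man plays the guitar")` returns a score between 0 and 1. In a trained system the fixed weights would normally be replaced by a model fitted to annotated sentence pairs.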


Paper citation in several formats:
Vo N. and Popescu O. (2016). A Multi-Layer System for Semantic Textual Similarity. In Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, ISBN 978-989-758-203-5, pages 56-67. DOI: 10.5220/0006045800560067

@conference{kdir16,
author={Ngoc Phuoc An Vo and Octavian Popescu},
title={A Multi-Layer System for Semantic Textual Similarity},
booktitle={Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR},
year={2016},
pages={56-67},
doi={10.5220/0006045800560067},
isbn={978-989-758-203-5},
}

TY - CONF

JO - Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR
TI - A Multi-Layer System for Semantic Textual Similarity
SN - 978-989-758-203-5
AU - Vo N.
AU - Popescu O.
PY - 2016
SP - 56
EP - 67
DO - 10.5220/0006045800560067
