Novel Semantics-based Distributed Representations for Message Polarity Classification using Deep Convolutional Neural Networks

Abhinay Pandya, Mourad Oussalah

Abstract

Unsupervised learning of distributed representations (word embeddings) obviates the need for task-specific feature engineering for various NLP applications. However, such representations learned from massive text datasets do not faithfully represent finer semantic information in the feature space required by specific applications. This is owing to the fact that (a) models learning such representations ignore the linguistic structure of the sentences, (b) they fail to capture \textit{polysemous} usages of the words, and (c) they ignore pre-existing semantic information from manually-created ontologies. In this paper, we propose three semantics-based distributed representations of words and phrases as features for message polarity classification: Sentiment-Specific Multi-Word Expressions Embeddings(SSMWE) are sentiment encoded distributed representations of \textit{multi-word expressions (MWEs)}; Sense-Disambiguated Word Embeddings(SDWE) are sense-specific distributed representations of words; and WordNet embeddings(WNE) are distributed representations of hypernym and hyponym of the correct sense of a given word. We examine the effects of these features incorporated in a convolutional neural network(CNN) model for evaluation on the SemEval benchmarked dataset. Our approach of using these novel features yields 14.24\% improvement in the macro-averaged F1 score on SemEval datasets over existing methods. While we have shown promising results in twitter sentiment classification, we believe that the method is general enough to be applied to many NLP applications where finer semantic analysis is required.

Download


Paper Citation


in Harvard Style

Pandya A. and Oussalah M. (2017). Novel Semantics-based Distributed Representations for Message Polarity Classification using Deep Convolutional Neural Networks.In Proceedings of the 9th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, ISBN 978-989-758-271-4, pages 71-82. DOI: 10.5220/0006500800710082


in Bibtex Style

@conference{kdir17,
author={Abhinay Pandya and Mourad Oussalah},
title={Novel Semantics-based Distributed Representations for Message Polarity Classification using Deep Convolutional Neural Networks},
booktitle={Proceedings of the 9th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR,},
year={2017},
pages={71-82},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006500800710082},
isbn={978-989-758-271-4},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 9th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR,
TI - Novel Semantics-based Distributed Representations for Message Polarity Classification using Deep Convolutional Neural Networks
SN - 978-989-758-271-4
AU - Pandya A.
AU - Oussalah M.
PY - 2017
SP - 71
EP - 82
DO - 10.5220/0006500800710082