Concept Extraction with Convolutional Neural Networks

Andreas Waldis; Luca Mazzola; Michael Kaufmann

doi:10.5220/0006901201180129

2018

Abstract

For knowledge management purposes, it would be useful to classify and tag documents automatically based on their content. Concept extraction is one way of achieving this, using statistical or semantic methods. Whereas index-based keyphrase extraction can extract relevant concepts for documents, the inverse document index grows exponentially with the number of words that candidate concepts can have. To address this issue, the present work trains convolutional neural networks (CNNs) containing vertical and horizontal filters to decide, from a training set of labeled examples, whether an N-gram (i.e., a consecutive sequence of N characters or words) is a concept. The classification training signal is derived from the Wikipedia corpus, knowing that an N-gram certainly represents a concept if a corresponding Wikipedia page title exists. The CNN input feature is the vector representation of each word, derived from a word embedding model; the output is the probability that an N-gram represents a concept. Multiple configurations for vertical and horizontal filters were analyzed and tuned through a hyper-parameterization process. The results showed an average precision of between 60 and 80 percent, which decreased drastically as N increased. However, combined with a TF-IDF based relevance ranking, the top five N-gram concepts calculated for Wikipedia articles showed a high precision of 94%, similar to part-of-speech (POS) tagging for concept recognition combined with TF-IDF, but with a much better recall for higher N. The CNN seems to prefer longer sequences of N-grams as identified concepts, and can also correctly identify sequences of words normally ignored by other methods. Furthermore, in contrast to POS filtering, the CNN method does not rely on predefined rules, and could thus provide language-independent concept extraction.
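
The way the training signal is derived from Wikipedia page titles can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `word_ngrams` and `label_ngrams`, the lowercase matching, the whitespace tokenization, and the tiny title set are all assumptions made for illustration. In particular, treating every N-gram without a matching title as a negative example is a simplification, since the abstract only states that a matching title certainly indicates a concept.

```python
from typing import Iterable, List, Tuple


def word_ngrams(tokens: List[str], max_n: int) -> List[Tuple[str, ...]]:
    """Return every consecutive word sequence of length 1..max_n."""
    return [
        tuple(tokens[i:i + n])
        for n in range(1, max_n + 1)
        for i in range(len(tokens) - n + 1)
    ]


def label_ngrams(
    tokens: List[str], titles: Iterable[str], max_n: int = 3
) -> List[Tuple[str, int]]:
    """Label each N-gram 1 if it matches a page title, else 0.

    Assumption: case-insensitive exact matching against the title set.
    """
    normalized = {title.lower() for title in titles}
    labeled = []
    for gram in word_ngrams(tokens, max_n):
        text = " ".join(gram)
        labeled.append((text, int(text.lower() in normalized)))
    return labeled


# Hypothetical stand-in for the set of Wikipedia page titles.
titles = {"Convolutional neural network", "Neural network"}
tokens = "the convolutional neural network learns".split()

for text, label in label_ngrams(tokens, titles):
    print(label, text)
```

In a full pipeline along the lines the abstract describes, each labeled N-gram would then be mapped to a sequence of word embedding vectors and passed to the CNN, which outputs the probability that the N-gram is a concept.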

Paper Citation

in Harvard Style

Waldis A., Mazzola L. and Kaufmann M. (2018). Concept Extraction with Convolutional Neural Networks. In Proceedings of the 7th International Conference on Data Science, Technology and Applications - Volume 1: DATA, ISBN 978-989-758-318-6, pages 118-129. DOI: 10.5220/0006901201180129

in Bibtex Style

@conference{data18,
author={Andreas Waldis and Luca Mazzola and Michael Kaufmann},
title={Concept Extraction with Convolutional Neural Networks},
booktitle={Proceedings of the 7th International Conference on Data Science, Technology and Applications - Volume 1: DATA},
year={2018},
pages={118-129},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006901201180129},
isbn={978-989-758-318-6},
}

in EndNote Style

TY - CONF
JO - Proceedings of the 7th International Conference on Data Science, Technology and Applications - Volume 1: DATA
TI - Concept Extraction with Convolutional Neural Networks
SN - 978-989-758-318-6
AU - Waldis A.
AU - Mazzola L.
AU - Kaufmann M.
PY - 2018
SP - 118
EP - 129
DO - 10.5220/0006901201180129