Simple App Review Classification with Only Lexical Features

Faiz Ali Shah, Kairit Sirts, Dietmar Pfahl

Abstract

User reviews submitted to app marketplaces contain information that falls into different categories, e.g., feature evaluation, feature request, and bug report. This information is valuable for developers seeking to improve the quality of mobile applications. However, due to the large volume of reviews received every day, manual classification of user reviews into these categories is not feasible. Therefore, developing automatic classification methods using machine learning approaches is desirable. In this study, we compare the simplest textual machine learning classifier, the so-called Bag-of-Words (BoW) approach that uses only lexical features, with the more complex models used in previous work that adopt rich linguistic features. We find that the performance of the simple BoW model is very competitive and that it has the advantage of not requiring any external linguistic tools to extract the features. Moreover, we experiment with deep-learning-based Convolutional Neural Network (CNN) models, which have recently achieved state-of-the-art results in many classification tasks. We find that, on average, the CNN models do not perform better than the simple BoW model; it is possible that a larger training set would have been necessary for the CNN model to gain an advantage.
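To illustrate what "only lexical features" means in practice, the following is a minimal sketch of a BoW review classifier. It is not the implementation used in the paper: the multinomial naive Bayes classifier, the tokenizer, the class name `NaiveBayesBoW`, and the toy reviews are all assumptions chosen to keep the example short and dependency-free. Only the three category names come from the abstract.

```python
import math
import re
from collections import Counter, defaultdict


def bag_of_words(text):
    """Lowercase the text and count word tokens; word order is discarded."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


class NaiveBayesBoW:
    """Multinomial naive Bayes over BoW counts (an illustrative choice,
    not necessarily the classifier used in the paper)."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)      # documents per class
        self.word_counts = defaultdict(Counter)  # word frequencies per class
        for text, label in zip(texts, labels):
            self.word_counts[label].update(bag_of_words(text))
        self.vocab = {w for c in self.word_counts.values() for w in c}
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        features = bag_of_words(text)
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            counts = self.word_counts[label]
            # Add-one (Laplace) smoothing over the training vocabulary.
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n_docs / total_docs)
            for word, freq in features.items():
                if word in self.vocab:  # ignore words never seen in training
                    score += freq * math.log((counts[word] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy, made-up reviews; a real experiment would use a labeled review corpus.
train_texts = [
    "the app crashes when I open the camera",
    "crashes and freezes after the latest update",
    "please add a dark mode option",
    "would love an option to export notes, please add it",
    "the search feature works great",
    "love the new widget, works really well",
]
train_labels = [
    "bug report", "bug report",
    "feature request", "feature request",
    "feature evaluation", "feature evaluation",
]

model = NaiveBayesBoW().fit(train_texts, train_labels)
prediction = model.predict("app freezes and crashes on startup")
print(prediction)
```

The only features are word counts, so no part-of-speech tagger, parser, or other external linguistic tool is needed, which is the practical advantage the abstract attributes to the BoW model.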

Paper Citation


in Harvard Style

Shah F., Sirts K. and Pfahl D. (2018). Simple App Review Classification with Only Lexical Features. In Proceedings of the 13th International Conference on Software Technologies - Volume 1: ICSOFT, ISBN 978-989-758-320-9, pages 112-119. DOI: 10.5220/0006855901120119


in Bibtex Style

@conference{icsoft18,
author={Faiz Ali Shah and Kairit Sirts and Dietmar Pfahl},
title={Simple App Review Classification with Only Lexical Features},
booktitle={Proceedings of the 13th International Conference on Software Technologies - Volume 1: ICSOFT},
year={2018},
pages={112-119},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006855901120119},
isbn={978-989-758-320-9},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 13th International Conference on Software Technologies - Volume 1: ICSOFT
TI - Simple App Review Classification with Only Lexical Features
SN - 978-989-758-320-9
AU - Shah F.
AU - Sirts K.
AU - Pfahl D.
PY - 2018
SP - 112
EP - 119
DO - 10.5220/0006855901120119