Authors: Shahab Mokarizadeh; Mohammad Tafiqur Rahman and Mihhail Matskin

Affiliation: Royal Institute of Technology (KTH), Sweden

ISBN: 978-989-8565-54-9

ISSN: 2184-3252

Keyword(s): Android Apps, Software Repository, Correlation Analysis, Topic Modeling.

Abstract: In this paper, we analyze Google Play, the largest Android app store, which provides a wide collection of data on app features (ratings, price and number of downloads) and descriptions of application functionality. The overall objective of this analysis is to provide in-depth insight into the intrinsic properties of app repositories in general. This allows us to draw a comprehensive picture of the current state of the app market, helping application developers understand customers’ desires and attitudes as well as market trends. To this end, we propose an analysis approach that examines a given collection of apps in two directions. In the first direction, we measure the correlation between app features; in the second, we construct clusters of similar applications and then examine their characteristics in association with the features of interest. The examined datasets were collected from Google Play (in 2012) and Android Market (in 2011). In our analysis results, we identified a strong correlation between price and number of downloads, and similarly between price and participation. Moreover, by employing a probabilistic topic modeling technique and the K-means clustering method, we find that the categorization system of Google Play does not properly reflect the similarity of applications. We also determined that there is strong competition between app providers producing similar applications.

