Product Feature Taxonomy Learning based on User Reviews

Nan Tian, Yue Xu, Yuefeng Li, Ahmad Abdel-Hafez, Audun Josang


In recent years, the Web 2.0 has provided considerable facilities for people to create, share and exchange information and ideas. Upon this, the user generated content, such as reviews, has exploded. Such data provide a rich source to exploit in order to identify the information associated with specific reviewed items. Opinion mining has been widely used to identify the significant features of items (e.g., cameras) based upon user reviews. Feature extraction is the most critical step to identify useful information from texts. Most existing approaches only find individual features about a product without revealing the structural relationships between the features which usually exist. In this paper, we propose an approach to extract features and feature relationships, represented as a tree structure called feature taxonomy, based on frequent patterns and associations between patterns derived from user reviews. The generated feature taxonomy profiles the product at multiple levels and provides more detailed information about the product. Our experiment results based on some popularly used review datasets show that our proposed approach is able to capture the product features and relations effectively.


  1. Abbasi, A., Chen, H., and Salem, A. (2008). Sentiment analysis in multiple languages: Feature selection for opinion classification in web forum. ACM Transactions on Information Systems, 26(3).
  2. Blei, D. M., Ng, A. Y., and Jordan, M. I. (2003). Latent dirichlet allocation. Journal of Machine Learning Research, 3:993 - 1022.
  3. Ding, X., Liu, B., and Yu, P. S. (2008). A holistic lexiconbased approach to opinion mining. In Proceedings of the 2008 International Conference on Web Search and Data Mining, pages 231 - 240.
  4. Hai, Z., Chang, K., Kim, J., and Yang, C. (2013). Identifying features in opinion mining via intrinsic and extrinsic domain relevance. IEEE Transactions on Knowledge and Data Engineering, pages 1 - 1.
  5. Hofmann, T. (2001). Unsupervised learning by probabilistic latent semantic analysis. Machine Learning, 42(1 - 2):177 - 196.
  6. Hu, M. and Liu, B. (2004a). Mining and summarizing customer reviews. In 10th ACM SIGKDD international conference on Knowledge discovery and data mining.
  7. Hu, M. and Liu, B. (2004b). Mining opinion features in customer reviews. In Proceedings of the 19th national conference on Artifical intelligence.
  8. Hu, W., Gong, Z., and Guo, J. (2010). Mining product features from online reviews. In IEEE International Conference on E-Business Engineering, pages 24 - 29.
  9. Lau, R. Y., Lai, C. C., Ma, J., and Li, Y. (2009). Automatic domain ontology extraction for context-sensitive opinion mining. In Proceedings of the Thirtieth International Conference on Information Systems.
  10. Lewis, D. D. (1992). An evaluation of phrasal and clustered representations on a text categorization task. In Proceedings of the 15th ACM International Conference on Research and Development in Information Retrieval, pages 177 - 196.
  11. Pasquier, N., Bastide, Y., Taouil, R., and Lakhal, L. (1999). Efficient mining of association rules using closed itemset lattices. Information Systems, 24(1):25 - 46.
  12. Popescu, A.-M. and Etzioni, O. (2005). Extracting product features and opinions from reviews. In Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, pages 339-346.
  13. Scaffidi, C., Bierhoff, K., Chang, E., Felker, M., Ng, H., and Jin, C. (2007). Red opal: Product-feature scoring from reviews. In Proceedings of the 8th ACM conference on Electronic commerce, number 182 - 191.
  14. Subrahmanian, V. S. and Reforgiato, D. (2008). Ava: Adjective-verb-adverb combinations for sentiment analysis. IEEE Intelligent Systems, pages 43 - 50.
  15. Tang, J., Leung, H.-f., Luo, Q., Chen, D., and Gong, J. (2009). Towards ontology learning from folksonomies. In Proceedings of the 21st international jont conference on Artifical intelligence, pages 2089 - 2094.
  16. Wright, A. (2009). Our sentiments, exactly. Communications of the ACM, 52(4):14 - 15.
  17. Xu, Y., Li, Y., and Shaw, G. (2011). Representations for association rules. Data and Knowledge Engineering, 70(6):237 - 256.
  18. Zhang, Y. and Zhu, W. (2013). Extracting implicit features in online customer reviews for opinion mining. In Proceedings of the 22nd international conference on World Wide Web companion, pages 103 - 104.

Paper Citation

in Harvard Style

Tian N., Xu Y., Li Y., Abdel-Hafez A. and Josang A. (2014). Product Feature Taxonomy Learning based on User Reviews . In Proceedings of the 10th International Conference on Web Information Systems and Technologies - Volume 2: WEBIST, ISBN 978-989-758-024-6, pages 184-192. DOI: 10.5220/0004850201840192

in Bibtex Style

author={Nan Tian and Yue Xu and Yuefeng Li and Ahmad Abdel-Hafez and Audun Josang},
title={Product Feature Taxonomy Learning based on User Reviews},
booktitle={Proceedings of the 10th International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,},

in EndNote Style

JO - Proceedings of the 10th International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,
TI - Product Feature Taxonomy Learning based on User Reviews
SN - 978-989-758-024-6
AU - Tian N.
AU - Xu Y.
AU - Li Y.
AU - Abdel-Hafez A.
AU - Josang A.
PY - 2014
SP - 184
EP - 192
DO - 10.5220/0004850201840192