precision and recall are 72.77% and 76.12%. The
macro precision and recall are 74.23% and 81.87%.
4 CONCLUSIONS
As B2C electronic commerce is so popular
nowadays, it is convenience to shop online.
However, consumers are expecting more than just
convenience, choice, and lower price as offered by
B2C electronic commerce. Consumers desire to
have an intelligent Web information system to
support their purchasing decisions. In such system,
consumer reviews can be compared in terms of their
product features so that consumers are able to
identify the best consumer products that they want.
In this work, we propose a Web content mining
approach using class association rules mining to
overcome the problems that exist in the traditional
linguistic and natural language processing approach
for sentiment analysis. The writing on consumer
reviews is usually informal and contains a lot of
grammatical errors. As a result, many sentences
cannot be correctly parsed and further processed to
determine the product features that they are
describing. In our class association rules mining
approach, sentences are not required to be parsed by
natural language processor. Based on the usage of
words and their frequency, we determine the
relationships between words (or phrases) and
product features classes. Using the learned rules, we
determine the product features described in an
opinion sentence.
Our experimental result shows that it achieves a
promising performance. The performance is
especially good for the product features that are
specific and clear. However, it still suffers in
relatively lower precision and recall when the
product features are not as well-defined.
In the future, we shall investigate other statistical
techniques to support the selection of words so that
words with higher discriminating power can be
identified to produce better performance in
sentiment analysis.
REFERENCES
Bhargava, H. K., Choudhary, V., Krishnan, R., 2000.
Pricing and Product Design: Intermediary Strategies in
an Electronic Market. In International Journal of
Electronic Commerce, Vol. 5, No. 1, pp.37-56.
Das, S., Chen, M., 2001. Yahoo! for Amazon: Extracting
Market Sentiment from Stock Message Boards. In
Proceedings of the 8th Asia Pacific Finance
Association (APFA) Annual Meeting.
Hatzivassiloglou, V., Wiebe, J., 2000. Effects of Adjective
Orientation and Gradability on Sentence Subjectivity.
In Proceedings of 18th International Conference on
Computational Linguistics (COLING), Saarbrücken,
Germany, pp.299-305.
Hu, M., Liu, B., 2004. Mining and Summarizing Customer
Reviews. In Proceedings of the 10th ACM Conference
on Knowledge Discovery and Data Mining (KDD),
Seattle, WA, pp.168-177.
Liu, B., Hu, M., Cheng, J., 2005. Opinion Observer:
Analyzing and Comparing Opinions on the Web. In
Proceedings of the 14th International Conference on
World Wide Web (WWW’05), Chiba, Japan, pp.342-
351
Menczer, F., Street, W.N., Monge, A. E., Adaptive
Assistants for Customized E-shopping. In IEEE
Intelligent Systems, Vol. 17, No. 6, pp. 12-19.
Pang, B., Lee, L., Vaithyanathan, S., 2002. Thumbs Up?
Sentiment Classification Using Machine Learning
Techniques. In Proceedings of 2002 Conference on
Empirical Methods in Natural Language Processing
(EMNLP 2002), pp.79-86.
Turney, P., 2002. Thumbs Up or Thumbs Down?
Semantic Orientation Applied to Unsupervised
Classification of Reviews. In Proceedings of the 40th
Conference on Association for Computational
Linguistics (ACL), Philadelphia, PA, pp.417-424.
Wiebe, J., Bruce, R., O’Hara, T., 1999. Development and
Use of a Gold Standard Data Set for Subjectivity
Classifications. In Proceedings of the 37th Conference
on Association for Computational Linguistics (ACL),
College Park, MD, pp.246-253
Wei, C., Yang, C. S., Huang, C. N., 2006. Turning Online
Product Reviews to Customer Knowledge: A
Semantic-based Sentiment Classification Approach. In
Proceedings of 10th Pacific Asia Conference on
Information Systems (PACIS), Kuala Lumpur,
Malaysia.
Wong, R., Yang, C. C., 2005. Collaborative Infomediary
for Financial News. In Proceedings of the Fourth
Workshop on e-Business (WEB 2005), Las Vegas, NV,
December 2005.
Yang, C. C., Wong, R., 2006. Measuring Success Factors
of E-Commerce Infomediary. In Proceedings of the
Pacific Asia Conference on Information Systems
(PACIS), Kuala Lumpur, Malaysia, July 2006.
WEBIST 2008 - International Conference on Web Information Systems and Technologies
192