commercial systems and applications based on
sentiment analysis have emerged. For example,
Google Shopping, a representative platform based
on product reviews, provides users with product
search and price comparison services for online
purchasing platforms. OpinionE Q allows
commercial organizations and individuals to
customize product analytics services on demand (
Li
2019
). However, due to the increasingly swelling
problem of the diversity of sentiment research
objects and the complexity of sentiment tasks.
Sentiment objects range from positive and negative
tendency comments on products and services to
categorizing user and topic sentiments in social
media categories, with more diverse expressions and
a greater variety of emotions, and a corresponding
shift in research content, including more attention to
the user's information as well as the change of the
emotion for the user (
Li 2019, Ziang, Wang, Li,
Lévy, Nie, Jurafsky, Ng 2017, Lan, Chen,
Goodman, Gimpel, Sharma, ALBERT 2020,
Young, Hazarika, Poria, Cambria 2018, Rush
2018, Vaswani, Shazeer, Parmar, Uszkoreit, Jones,
Gomez, Kaiser, Polosukhin 2017
). On the other
hand, consumers' comments on short texts are
diverse in their expression of emotions, either in a
straightforward manner or through rhetorical devices
or even irony. The multiple forms of expression of
emotions such as irony make the analysis inaccurate.
This project applies a CNN model with fewer
parameters but better performance to text data for
sentiment analysis, and uses the analysis results to
evaluate the attributes of products. This will help
merchants to improve their products and consumers
to choose more suitable products for themselves.
2 METHODS
2.1 Data Collection and Pre-processing
At present, online shopping had become one of the
important ways for people to trade and consume. By
the end of 2020, Taobao and Jingdong two major
e-commerce platforms had reached 1.2 billion active
purchasing users in the past year. Due to consumer
preference, digital enthusiasts tended to purchase
electronic products from official channels. As of
July 2021, as the P40 model phone's main and
official sales channel, more than 30,000 after-sales
reviews had been accumulated in the HUAWEI
Mall. If the entire model phone's review data were
crawled and studied, the process would be very
tedious, resulting in unnecessary human, financial,
and material waste. Therefore, this paper randomly
selected 500 real reviews of HUAWEI P40 model
mobile phones as the basis of the research. The
study first selected 10,000 reviews data of other
mobile phone products with known labels to train
the CNN model. The consumers' attitudes of
"positive", "negative" or "other" are determined
based on the content of the reviews of different
labels.
For text data, text data usually contained a lot of
noise that is not relevant to the task, so it should be
Pre-processed such as removing URL links,
removing emoji emoji, converting English upper
case to lower case and converting Chinese
traditional to simplified, etc.
Simultaneously, 492 valid questionnaires were
collected from online questionnaire survey of
HUAWEI mobile phone users. The results showed
that the users' concern about HUAWEI p40 mobile
phone are mainly focused on six aspects, such as
running speed, photography, battery life,
communication, appearance and display effect.
According to the above six tags, the dataset is
classified by Python in six labels to get six review
sets.
2.2 CNN Model Building
In the field of computer deep learning, CNN
convolutional neural network is one of the most
successful application of deep learning algorithm.
Consisting of neurons with learnable weights and
bias constants, each neuron receives some input and
does some dot product computation to output a score
for each classification.
Input OutputConv
Pool
Conv Pool
Figure 1: CNN convolutional neural network model.