context of the BERT model fine-tuning, so that it can
realize the emotional polarity of the evaluation text
data of the logistics enterprise. After the model
validation test, the relevant indexes show that the
BERT model effectively completes the sentiment
classification prediction of logistics enterprise
evaluation, and then the Fine-tuning test with
different data imbalance is conducted around the
model tuning, and the results show that the model can
achieve the best performance and complete the task
of predicting the sentiment of the text of the logistics
enterprise when the data imbalance is 30-40%.
2 RELEVANT LITERATURE
Text data, as a typical unstructured data, has also
gradually started to be studied in the era of big
data.Feldman R et al. first proposed the concept of
text mining in their work (Feldman,
Sanger, 2007)
and introduced several methods to perform mining
analysis. Since Chinese text does not come with its
own disambiguation characters, many scholars have
tried various methods in solving features, recognition
and representation,. For example, scholars such as Xu
G made a study on several different algorithms for
Chinese participles and proposed an automatic
Chinese participle system based on the forward
maximal matching method
(Xu, Hu, Wang, 2007)
while Hu Yan et al. proposed a method for text feature
extraction from the point of view of Chinese lexical
properties (Hu, Wu, Zhong, 2007).
The analysis of text sentiment is also an important
task and branching direction in text mining. The main
methods of text sentiment analysis as Hu R
mentioned in their review study (Hu, Rui, Zeng, 2018)
from the earliest sentiment lexicon matching as Li J
et al. used (Li, Xu, Xiong, 2010), to the traditional
machine learning methods such as Bayes, SVM and
so on as Hasan A and other scholars used (Hasan,
Moin, Karim, 2018) and then evolved to the current
deep learning methods based on various types of
neural networks as Dang N C mentioned in his review
study mentioned new methods such as CNN, RNN,
LSTM models currently used for sentiment analysis
(Dang, Moreno-GarcΓa, De la Prieta, 2018) .
Nowadays, the mining of sentiment in text has
also become more delicate and in-depth, and the term
"fine-grained sentiment analysis" has been proposed,
for example, Lai Y and other scholars have used CNN
to realize fine-grained sentiment classification of
microblog text (Lai, Zhang, Han, 2020) .
Treiblmaier and other scholars systematically
reviewed the use of word cloud graphs, topic models
and other methods for text data extraction and
sentiment analysis in logistics and supply chain
management (Treiblmaier, Mair, 2021), constructing
a method system for logistics text mining analysis.
While Singh et al. incorporated sentiment analysis
indicators into the overall performance evaluation
system of 3PL logistics enterprises (Singh et al.,
2022). Hong and Lim, both groups of researchers
based on the perspective of user satisfaction,
respectively, utilized CNN (Hong et al., 2019) and Bi-
LSTM models for text sentiment analysis to parse the
elements of users logistics satisfaction in the specific
scenarios of fresh e-commerce logistics and cold chain
logistics, and put forward suggestions for improvement
(Lim, Li, Song, 2021).
3 BERT-BASED SENTIMENT
ANALYSIS MODEL
3.1 Structure of BERT
The BERT model was first proposed by Devlin J et
al. of Google in 2018 (Devlin, Chang, Lee, et al.,
2018), BERT is Bidirectional Encoder Repre-
sentations from Transformers, and its basic structure
mainly consists of the Encoder part of the
Transformer model (Vaswani, Shazeer, Parmar, et al.,
2017). It consists of a fully-connected stack of
multiple Encoder units and uses two pre-training
tasks, MLM and NSP, which have excellent
performance in processing short and medium texts
and consider the contextual bi-directional interaction.
The basic structure of the model is shown in Fig.1.
Figure 1: The structure of BERT.
DMEIS 2024 - The International Conference on Data Mining, E-Learning, and Information Systems