learning model, and a set of existing NLP methods
such as text pre-processing and word embedding
leads to successful multilingual sentiment analysis.
We compared five deep learning models by
training them on 14,400 (80%) tweets from our
dataset and testing them on 3,600 (20%) tweets. Our
CNN model achieves the best accuracy of 85.91%
and an F1-score of 84.61%.
A second experiment was carried out by applying
the winning CNN model on a balanced set of tweets
with emojis collected from Twitter. The CNN model
achieved satisfying accuracy (higher than 90%) for
the European languages, while its accuracy for Arabic
tweets is 86%. This difference can be explained by
the translation software's inability to translate non-
standard Arabic to English and the wide use of non-
standard Arabic on social media.
Future work may explore the ability of
transformer-based models to successfully tackle the
sentiment analysis problem for corpora containing
documents in multiple Arabic dialects.
REFERENCES
Banea, C., Mihalcea, R., & Wiebe, J. (2010). Multilingual
subjectivity: Are more languages better? Proceedings
of the 23rd International Conference on Computational
Linguistics (Coling 2010),
Bautin, M., Vijayarenu, L., & Skiena, S. (2008).
International sentiment analysis for news and blogs.
ICWSM,
Go, A., Bhayani, R., & Huang, L. (2009). Twitter sentiment
classification using distant supervision. CS224N
project report, Stanford, 1(12), 2009.
Google. Explore the world in over 100 languages. Google
Translate API
Kim, S.-M., & Hovy, E. (2006). Identifying and analyzing
judgment opinions. Proceedings of the human language
technology conference of the NAACL, main
conference,
Liu, B., & Zhang, L. (2012). A survey of opinion mining
and sentiment analysis. In Mining text data (pp. 415-
463). Springer.
Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013).
Efficient estimation of word representations in vector
space. arXiv preprint arXiv:1301.3781.
Porter, M. F. (1980). An algorithm for suffix stripping.
Program, 14(3), 130-137.
Portuguese Tweets for Sentiment Analysis. (2019).
Portuguese Tweets
Ravi, K., & Ravi, V. (2015). A survey on opinion mining
and sentiment analysis: tasks, approaches and
applications. Knowledge-Based Systems, 89, 14-46.
Saad, M. (2020). Arabic Sentiment Twitter Corpus. Arabic
Tweets
Shin, B., Lee, T., & Choi, J. D. (2016). Lexicon integrated
CNN models with attention for sentiment analysis.
arXiv preprint arXiv:1610.06272.
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., &
Salakhutdinov, R. (2014). Dropout: a simple way to
prevent neural networks from overfitting. The journal
of machine learning research, 15(1), 1929-1958.
Statista. (2019). Internet: most common languages online
2020 | Statista. Statista 2019
Taboada, M., Brooke, J., Tofiloski, M., Voll, K., & Stede,
M. (2011). Lexicon-based methods for sentiment
analysis. Computational linguistics, 37(2), 267-307.
Wang, J., Yu, L.-C., Lai, K. R., & Zhang, X. (2016).
Dimensional sentiment analysis using a regional CNN-
LSTM model. Proceedings of the 54th annual meeting
of the association for computational linguistics (volume
2: Short papers),
Wu, Y., Schuster, M., Chen, Z., Le, Q. V., Norouzi, M.,
Macherey, W., Krikun, M., Cao, Y., Gao, Q., &
Macherey, K. (2016). Google's neural machine
translation system: Bridging the gap between human
and machine translation. arXiv preprint
arXiv:1609.08144.
Y.Kim. (2014). Convolutional neural networks for sentence
classification. Arxiv (2014), pp. 23-31.
https://doi.org/10.1145/1599272.1599278