processing algorithms and models, this model is more
suitable for the financial industry, especially for retail
investors.
Moreover, this paper also plays a positive role in
studying the relationship between investor sentiment
and stock index. The relationship between investor
sentiment and stock index is not a simple linear
relationship, but a very complex nonlinear
relationship. The deep learning model can well depict
this complex relationship. This study also proves that
we can predict the rise and fall of stock index with
high accuracy after considering the sentiment of
investors, especially retail investors. In the future, the
study can be improved in the following aspects.
Firstly, exclude the survivor bias in investor
comments, since profitable investors tend to comment
while others not. Secondly, improve the size of data
sample, especially the data in tag set, which can make
the results more accurate and close to reality.
REFERENCES
Chiu C.C., Sainath T.N., Wu Y., et al. (2018). Stateof-the-
art Speech Recognition with Sequence-to-sequence
Models, 2018 IEEE International Conference on
Acoustics, Speech and Signal Processing.
Cowles A. (1933). Can Stock Market Forecasters Forecast?
Econometrica, 1(3).
Deng, L., Yang, L. (2018). Deep Learning in Natural
Language Processing, Singapore: Springer.
Devlin, J. , Chang, M.W., Lee, K., Toutanova, K. (2019).
BERT: Pre-training of Deep Bidirec-tional Trans
formers for Language Understanding, Proceedings of
NAACL-HLT.
He, K., Zhang, X., Ren, S., Sun, J. (2016). Deep Residual
Learning for Image Recognition, 2016 IEEE
Conference on Computer Vision and Pattern
Recognition.
Lee Y.W., Jiang X.C., Indro C.D. (2002). Stock Market
Volatility, Excess Return, and the Role of Investor
Sentiment. Journal of Banking & Finance, 26(12),
2277 -2299.
Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S.,Dean
J. (2013). Distributed Re presenltationsof Words and
Phrases and 'Their Com positionality. Proceedings of
Advances in Neural Information Pro-cessing Systems.
Oprea, D.S., Brad, L. (2014). Investor Sentiment and Stock
Returns: Evidence from Romania. International
Journal of Academic Research in Accounting Finance
and Management Sciences, 42, 19-25.
Pennington, J., Socher, R., Manning, C.D., (2014). Glove:
Global Vectors for Word Representation, Proceedings
of the 2014 Conference on Empirical Methods in
Natural Language Processing.
S. Gopal Krishna Patro, Kishore Kumar Sahu. (2015).
Normalization: A Preprocessing Stage. CoRR, abs,
1503.06462.
Xu, X.C., Tian, K. (2021). A Novel Financial Text
Sentiment Analysis-Based Approach for Stock Index
Prediction. The Journal of Quantitative & Technical
Economics, 38(12), 124-145.
Y, S.X., L, Z.C., Wang, X.J. (2020). News sentiment credit
spreads and information asymmetry. North American
Journal of Economics and Finance, 52.