Predicting Stock Market Movement: An Evolutionary Approach

Salah Bouktif, Mamoun Adel Awad

2015

Abstract

Social Networks are becoming very popular sources of all kind of data. They allow a wide range of users to interact, socialize and express spontaneous opinions. The overwhelming amount of exchanged data on businesses, companies and governments make it possible to perform predictions and discover trends in many domains. In this paper we propose a new prediction model for the stock market movement problem based on collective classification. The model is using a number of public mood states as inputs to predict Up and Down movement of stock market. The proposed approach to build such a model is simultaneously promoting performance and interpretability. By interpretability, we mean the ability of a model to explain its predictions. A particular implementation of our approach is based on Ant Colony Optimization algorithm and customized for individual Bayesian classifiers. Our approach is validated with data collected from social media on the stock of a prestigious company. Promising results of our approach are compared with four alternative prediction methods namely, bagging, Adaboost, best expert, and expert trained on all the available data.

References

  1. Ahmed, F., Bouktif, S., Serhani, A., and Khalil, I. (2008). Integrating function point project information for improving the accuracy of effort estimation. In Advanced Engineering Computing and Applications in Sciences, pages 193-198. IEEE.
  2. Asur, S. and Huberman, B. A. (2010). Predicting the future with social media. In International Conference on Web Intelligence and Intelligent Agent Technology, pages 492-499.
  3. Bollen, J., Mao, H., and Zeng, X. (2011). Twitter mood predicts the stock market. Journal of Computational Science, 2(1):1-8.
  4. Bouktif, S., Ahmed, F., Khalil, I., Antoniol, G., and Sahraoui, H. (2010). A novel composite model approach to improve software quality prediction. Information and Software Technology, 52(12):1298-1311.
  5. Bouktif, S. and Awad, M. A. (2013). Ant colony based approach to predict stock market movement from mood collected on twitter. In Proceedings of IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, pages 837-845.
  6. Bouktif, S., Kégl, B., and Sahraoui, S. (2002). Combining software quality predictive models: An evolutionary approach. In Proceeding of the International Conference on Software Maintenance, pages 385-392.
  7. Bouktif, S., Sahraoui, H. A., and Antoniol, G. (2006). Simulated annealing for improving software quality prediction. In Genetic and Evolutionary Computation Conference proceeding, Seattle, USA, pages 1893- 1900.
  8. Chen, J., Huang, H., Tian, S., and Qu, Y. (2009). Feature selection for text classification with n aïve bayes. Expert Systems with Applications, 36(3):5432-5435.
  9. Deneubourg, J., Aron, S., Goss, S., and Pasteels, J. (1990). The self-organizing exploratory pattern of the argentine ant. Journal of insect behavior, 3(2):159-168.
  10. Dorigo, M., Birattari, M., and Stutzle, T. (2006). Ant colony optimization. Computational Intelligence Magazine, IEEE, 1(4):28-39.
  11. Fenton, N. and Neil, M. (1999). A critique of software defect prediction models. IEEE Transactions on Software Engineering, 25(5):675-689.
  12. Freund, Y. and Schapire, R. (1997). A Decision-theoretic Generalization of On-line Learning and an Application to Boosting. Journal of Computer and System Sciences, 55(1):119-139.
  13. Galar, M., Fernández, A., Barrenechea, E., Bustince, H., and Herrera, F. (2012). A review on ensembles for the class imbalance problem: bagging-, boosting-, and hybrid-based approaches. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, 42(4):463-484.
  14. Gayo-Avello, D. (2012). I wanted to predict elections with twitter and all i got was this lousy paper a balanced survey on election prediction using twitter data. arXiv preprint arXiv:1204.6441.
  15. Ginsberg, J., Mohebbi, M. H., Patel, R. S., Brammer, L., Smolinski, M. S., and Brilliant, L. (2008). Detecting influenza epidemics using search engine query data. Nature, 457(7232):1012-1014.
  16. Goebel, M. and Gruenwald, L. (1999). A survey of data mining and knowledge discovery software tools. ACM SIGKDD Explorations Newsletter, 1(1):20-33.
  17. Kim, H., Loh, W.-Y., Shih, Y.-S., and Chaudhuri, P. (2007). Visualizable and interpretable regression models with good prediction power. IIE Transactions, 39(6):565- 579.
  18. Lavrac?, N. (1999). Selected techniques for data mining in medicine. Artificial intelligence in medicine , 16(1):3- 23.
  19. Liu, Y. (2006). Word of mouth for movies: Its dynamics and impact on box office revenue. Journal of marketing, pages 74-89.
  20. Merz, C. (1998). Classification and Regression by Combining Models. PhD thesis, university of California Irvine.
  21. Michalski, R. S., Carbonell, J. G., and Mitchell, T. M. (1986). Machine learning: An artificial intelligence approach, volume 2. Morgan Kaufmann.
  22. Moerland, P. and Mayoraz, E. (1999). DynaBoost: Combining boosted hypotheses in a dynamic way. Technical report, IDIAP, Switzerland.
  23. Oza, N. and Tumer, K. (2008). Classifier ensembles: Select real-world applications. Information Fusion, 9(1):4- 20.
  24. Ramoni, R. and Sebastiani, P. (1999). Robust bayesian classification. Technical report, Knowledge Media Institute, the Open University.
  25. Trusov, M., Bucklin, R. E., and Pauwels, K. (2009). Effects of word-of-mouth versus traditional marketing: Findings from an internet social networking site. Journal of marketing, 73(5):90-102.
  26. Van Gerven, M., Jurgelenaite, R., Taal, B., Heskes, T., and Lucas, P. (2007). Predicting carcinoid heart disease with the noisy-threshold classifier. Artificial Intelligence in Medicine, 40(1):45-55.
  27. Youden, W. J. (1961). How to evaluate accuracy. Materials Research and Standards, ASTM.
  28. Yu, S. and Kak, S. (2012). A survey of prediction using social media. arXiv preprint arXiv:1203.1647.
Download


Paper Citation


in Harvard Style

Bouktif S. and Adel Awad M. (2015). Predicting Stock Market Movement: An Evolutionary Approach . In Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2015) ISBN 978-989-758-158-8, pages 159-167. DOI: 10.5220/0005578401590167


in Bibtex Style

@conference{kdir15,
author={Salah Bouktif and Mamoun Adel Awad},
title={Predicting Stock Market Movement: An Evolutionary Approach},
booktitle={Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2015)},
year={2015},
pages={159-167},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005578401590167},
isbn={978-989-758-158-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2015)
TI - Predicting Stock Market Movement: An Evolutionary Approach
SN - 978-989-758-158-8
AU - Bouktif S.
AU - Adel Awad M.
PY - 2015
SP - 159
EP - 167
DO - 10.5220/0005578401590167