Predicting stock price movement direction is a challenging
problem influenced by different factors and capricious events. The
conventional stock price prediction machine learning
models heavily rely on the internal financial features, especially the stock
price history. However, there are many outside-of-company features that deeply interact with the companies’
stock price performance, especially during
the COVID period. In this study, we selected 9 COVID vaccine companies and collected their relevant features
over the past 20 months. We added handcrafted external information, including
COVID-related statistics and company-specific vaccine progress information. We implemented,
evaluated, and compared several machine learning
models, including Multilayer Perceptron Neural Networks with logistic regression
and decision trees with boosting and bagging algorithms. The results suggest that
the application of feature engineering and data mining techniques can effectively
enhance the performance of models predicting stock price movement during the COVID period. The results show that COVID-related
handcrafted features help to increase the model prediction accuracy by 7.3%
and AUROC by 6.5% on average. Further exploration
showed that with data selection the decision tree model withgradient,
boosting algorithm achieved 70% in AUROC and 66% in the accuracy.
References
[1]
Hong, H., Bian, Z.C. and Lee, C.-C. (2021) COVID-19 and Instability of Stock Market Performance: Evidence from the US. Financial Innovation, 7, 1-18. https://doi.org/10.1186/s40854-021-00229-1
[2]
Vijh, M., Chandola, D., Tikkiwal, V.A. and Kumar, A. (2020) Stock Closing Price Prediction Using Machine Learning Techniques. Procedia Computer Science, 167, 599-606. https://doi.org/10.1016/j.procs.2020.03.326
[3]
Usmani, S. and Shamsi, J.A. (2021) News Sensitive Stock Market Prediction: Literature Review and Suggestions. PeerJ Computer Science, 7, e490. https://doi.org/10.7717/peerj-cs.490
[4]
Aravind, M. and Manojkrishnan, C.G. (2020) COVID 19: Effect on Leading Pharmaceutical Stocks Listed with NSE. International Journal of Research in Pharmaceutical Sciences, 1, 31-36. https://doi.org/10.26452/ijrps.v11iSPL1.2014
[5]
Vierlboeck, M. and Nilchiani, R.R. (2021) Effects of COVID-19 Vaccine Developments and Rollout on the Capital Market—A Case Study. https://arxiv.org/abs/2105.12267
[6]
Adekoya, A.F. and Nti, I.K. (2020) The COVID-19 Outbreak and Effects on Major Stock Market Indices across the Globe: A Machine Learning Approach. Indian Journal of Science and Technology, 13, 3695-3706. https://doi.org/10.17485/IJST/v13i35.1180
[7]
Rouf, N., Malik, M.B. and Arif, T. (2020) A Machine Learning Based Approach to Unleash the Impact of COVID-19 on Indian Stock Market. https://doi.org/10.21203/rs.3.rs-54882/v1
[8]
Kara, Y., Boyacioglu, M.A. and Baykan, M.K. (2011) Predicting Direction of Stock Price Index Movement Using Artificial Neural Networks and Support Vector Machines: The Sample of the Istanbul Stock Exchange. Expert Systems with Applications, 38, 5311-5319. https://doi.org/10.1016/j.eswa.2010.10.027
[9]
Nelson, D.M.Q., Pereira, A.C.M. and de Oliveira, R.A. (2017) Stock Market’s Price Movement Prediction with LSTM Neural Networks. 2017 International Joint Conference on Neural Networks (IJCNN), Anchorage, 14-19 May 2017, 1419. https://doi.org/10.1109/IJCNN.2017.7966019
[10]
McNally, S., Roche, J. and Caton, S. (2018) Predicting the Price of Bitcoin Using Machine Learning. 2018 26th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP), Cambridge, 21-23 March 2018, 339. https://doi.org/10.1109/PDP2018.2018.00060
[11]
Persio, L.D. and Honchar, O. (2016) Artificial Neural Networks Architectures for Stock Price Prediction: Comparisons and Applications. International Journal of Circuits, Systems and Signal Processing, 10, 403-413.
[12]
Ramos-Prez, E., Alonso-Gonzlez, P.J. and Nez-Velzquez, J.J. (2019) Forecasting Volatility with a Stacked Model Based on a Hybridized Artificial Neural Network. Expert Systems with Applications, 129, 1-9. https://doi.org/10.1016/j.eswa.2019.03.046
[13]
Zhang, X., Zhang, Y., Wang, S., Yao, Y., Fang, B. and Yu, P.S. (2018) Improving Stock Market Prediction via Heterogeneous Information Fusion. Knowledge-Based Systems, 143, 236-247. https://doi.org/10.1016/j.knosys.2017.12.025
[14]
Nam, K. and Seong, N. (2019) Financial News-Based Stock Movement Prediction Using Causality Analysis of Influence in the Korean Stock Market. Decision Support Systems, 117, 100-112. https://doi.org/10.1016/j.dss.2018.11.004
[15]
Zhang, X., Liu, S. and Zheng, X. (2021) Stock Price Movement Prediction Based on a Deep Factorization Machine and the Attention Mechanism. Mathematics, 9, 800. https://doi.org/10.3390/math9080800
[16]
Kotsiantis, S.B., Kanellopoulos, D. and Pintelas, P.E. (2007) Data Preprocessing for Supervised Leaning.
[17]
Fawcett, T. (2004) ROC Graphs: Notes and Practical Considerations for Researchers. Machine Learning, 31, 1-38.
[18]
Shang, Y. (2016) On the Likelihood of Forests. Physica A: Statistical Mechanics and Its Applications, 456, 157-166. https://doi.org/10.1016/j.physa.2016.03.021