Stock Return Prediction Using Voting Regressor Ensemble Learning

Ramadhan Ridho Arrohman; Riza Arifudin

doi:10.15294/rji.v1i2.68048

Ramadhan Ridho Arrohman Universitas Negeri Semarang
Riza Arifudin Universitas Negeri Semarang

DOI: https://doi.org/10.15294/rji.v1i2.68048

Keywords: Stock Return, Ensemble Learning, Regression, Stock Market

Abstract

Abstract. The value of return on stock prices is often used in predicting profits in the process of buying and selling shares based on the calculation of the return on investment. The calculation of the value of return on stock prices can be predicted automatically at certain periods, both weekly and daily

Purpose: The problem faced is determining a good algorithm for making predictions due to fluctuating data on stock prices making it difficult to predict.

Methods: The stages carried out by the researcher include the data preprocessing stage and then proceed to the Exploratory Data Analysis (EDA) stage to get a pattern from the data, followed by the modeling stage on the data. This research was developed using the Python programming language where the models used to make predictions can be obtained in real-time.

Result: The results obtained in this study show that the Voting Regressor has the best model with an error rate of 0.032523 using Root Mean Square Error (RMSE). The results of this study can be further developed to automatically predict stock return values in the future.

References

[1] P. Chhajer, M. Shah, and A. Kshirsagar, “The applications of artificial neural networks, support vector machines, and long–short term memory for stock market prediction,” Decis . Anal. J. , vol. 2, no. November 2021, p. 100015, 2022, doi: 10.1016/j.dajour.2021.100015.
[2] AM More, PU Rathod, RH Patil, DR Sarode, and B. Student, “Stock Market Prediction System using Hadoop,” Int. J.Eng. sci. Comput. , vol. 8, no. 3, pp. 16138–16140, 2018, [Online]. Available: http://ijesc.org/.
[3] DP Gandhmal and K. Kumar, “Systematic analysis and review of stock market prediction techniques,” Comput. sci. Rev. , vol. 34, p. 100190, 2019, doi: 10.1016/j.cosrev.2019.08.001.
[4] Y. Freund and RE Schapire, "A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting," J. Comput. syst. sci. , vol. 55, no. 1, pp. 119–139, 1997, doi: 10.1006/jcss.1997.1504.
[5] L. Breiman, “A Data Mining Based System for Transaction Fraud Detection,” 2021 IEEE Int. Conf. Consum. electrons. Comput. Eng. ICCECE 2021 , pp. 542–545, 2021, doi: 10.1109/ICCECE51280.2021.9342376.
[6] T. Chen and C. Guestrin, “XGBoost: A scalable tree boosting system,” Proc. ACM SIGKDD Int. Conf. Knowl. Discov. Data Min. , vol. 13-17-August, pp. 785–794, 2016, doi: 10.1145/2939672.2939785.
[7] R. Gholami and N. Fakhari, Support Vector Machine: Principles, Parameters, and Applications , 1st ed. Elsevier Inc., 2017.
[8] Z. -H. Zhou, Ensemble methods: foundations and algorithms . CRC Press, 2012.
[9] TG Dietterich, “Ensemble methods in machine learning,” in Multiple Classifier Systems: First International Workshop, MCS 2000 Cagliari, Italy, June 21--23, 2000 Proceedings 1 , 2000, pp. 1–15.
[10] A. Megaritis, N. Vlastakis, and A. Triantafyllou, “Stock market volatility and jumps in times of uncertainty,” J. Int. Money Financec. , vol. 113, p. 102355, 2021, doi: 10.1016/j.jimonfin.2021.102355.
[11] RS Hudson and A. Gregoriou, “Calculating and comparing security returns is harder than you think: A comparison between logarithmic and simple returns,” Int. Rev. financec. Anal. , vol. 38, pp. 151–162, 2015, doi: 10.1016/j.irfa.2014.10.008.
[12] P. Miskolczi, “Note on simple and logarithmic returns,” Appl. Studs. Agribus. Commer. , vol. 11, no. 1–2, pp. 127–136, 2017, doi: 10.19041/apstract/2017/1-2/16.
[13] A. Meucci, “Quant Nugget 2: Linear vs. Compounded Returns,” Common pitfalls in Portfolio Management , GARP risk professional, pp. 1–5, 2010.
[14] VN Vapnik, The Nature of Statistical Learning Theory , Second. New York: Springer US, 1995.
[15] M. Ouahilal, M. El Mohajir, M. Chahhou, and BE El Mohajir, “A novel hybrid model based on Hodrick–Prescott filter and support vector regression algorithm for optimizing stock market price prediction,” J. Big Data , vol. 4, no. 1, pp. 1–22, 2017, doi: 10.1186/s40537-017-0092-5.
[16] Y. CAO, Q. -G. MIAO, J. -C. LIU, and L. GAO, “Advance and Prospects of AdaBoost Algorithm,” Acta Autom. Sin. , vol. 39, no. 6, pp. 745–758, 2013, doi: 10.1016/s1874-1029(13)60052-x.
[17] JH Friedman, “Greedy function approximation: a gradient boosting machine,” Ann. Stats. , pp. 1189–1232, 2001.
[18] A. Cutler, DR Cutler, and JR Stevens, “Random forests,” Ensemble Mach. Learn. Methods Appl. , pp. 157–175, 2012.
[19] Z. Ali, I. ur Rehman, and Z. Jaan, "An Empirical Analysis on Software Development Efforts Estimation in Machine Learning Perspective," ADCAIJ Adv. District. Comput. Artif. Intell. J. , vol. 10, no. 3, pp. 227–240, 2021.

Stock Return Prediction Using Voting Regressor Ensemble Learning

Abstract

References

Most read articles by the same author(s)