Next Article in Journal
Exploring Technology Influencers from Patent Data Using Association Rule Mining and Social Network Analysis
Previous Article in Journal
How Managers Use Information Systems for Strategy Implementation in Agritourism SMEs
Previous Article in Special Issue
Adversarial Hard Attention Adaptation
Open AccessArticle

Evaluation of Tree-Based Ensemble Machine Learning Models in Predicting Stock Price Direction of Movement

School of Information and Software Engineering, University of Electronic Science and Technology of China (UESTC), Chengdu 610051, China
Author to whom correspondence should be addressed.
Information 2020, 11(6), 332;
Received: 24 May 2020 / Revised: 14 June 2020 / Accepted: 17 June 2020 / Published: 20 June 2020
(This article belongs to the Special Issue Machine Learning on Scientific Data and Information)
Forecasting the direction and trend of stock price is an important task which helps investors to make prudent financial decisions in the stock market. Investment in the stock market has a big risk associated with it. Minimizing prediction error reduces the investment risk. Machine learning (ML) models typically perform better than statistical and econometric models. Also, ensemble ML models have been shown in the literature to be able to produce superior performance than single ML models. In this work, we compare the effectiveness of tree-based ensemble ML models (Random Forest (RF), XGBoost Classifier (XG), Bagging Classifier (BC), AdaBoost Classifier (Ada), Extra Trees Classifier (ET), and Voting Classifier (VC)) in forecasting the direction of stock price movement. Eight different stock data from three stock exchanges (NYSE, NASDAQ, and NSE) are randomly collected and used for the study. Each data set is split into training and test set. Ten-fold cross validation accuracy is used to evaluate the ML models on the training set. In addition, the ML models are evaluated on the test set using accuracy, precision, recall, F1-score, specificity, and area under receiver operating characteristics curve (AUC-ROC). Kendall W test of concordance is used to rank the performance of the tree-based ML algorithms. For the training set, the AdaBoost model performed better than the rest of the models. For the test set, accuracy, precision, F1-score, and AUC metrics generated results significant to rank the models, and the Extra Trees classifier outperformed the other models in all the rankings. View Full-Text
Keywords: stock price; machine learning; technical indicators; feature extraction stock price; machine learning; technical indicators; feature extraction
Show Figures

Figure 1

MDPI and ACS Style

Ampomah, E.K.; Qin, Z.; Nyame, G. Evaluation of Tree-Based Ensemble Machine Learning Models in Predicting Stock Price Direction of Movement. Information 2020, 11, 332.

Show more citation formats Show less citations formats
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

Search more from Scilit
Back to TopTop