Predicting the Direction of NEPSE Index Movement with News Headlines Using Machine Learning

Author:

Dahal Keshab Raj1ORCID,Gupta Ankrit2,Pokhrel Nawa Raj3

Affiliation:

1. Department of Mathematics, State University of New York Cortland, Cortland, NY 13045, USA

2. Department of Computer Science, Central Michigan University, Mt Pleasant, MI 48859, USA

3. Department of Physics and Computer Science, Xavier University of Louisiana, New Orleans, LA 70125, USA

Abstract

Predicting stock market movement direction is a challenging task due to its fuzzy, chaotic, volatile, nonlinear, and complex nature. However, with advancements in artificial intelligence, abundant data availability, and improved computational capabilities, creating robust models capable of accurately predicting stock market movement is now feasible. This study aims to construct a predictive model using news headlines to predict stock market movement direction. It conducts a comparative analysis of five supervised classification machine learning algorithms—logistic regression (LR), support vector machine (SVM), random forest (RF), extreme gradient boosting (XGBoost), and artificial neural network (ANN)—to predict the next day’s movement direction of the close price of the Nepal Stock Exchange (NEPSE) index. Sentiment scores from news headlines are computed using the Valence Aware Dictionary for Sentiment Reasoning (VADER) and TextBlob sentiment analyzer. The models’ performance is evaluated based on sensitivity, specificity, accuracy, and the area under the receiver operating characteristic (ROC) curve (AUC). Experimental results reveal that all five models perform equally well when using sentiment scores from the TextBlob analyzer. Similarly, all models exhibit almost identical performance when using sentiment scores from the VADER analyzer, except for minor variations in AUC in SVM vs. LR and SVM vs. ANN. Moreover, models perform relatively better when using sentiment scores from the TextBlob analyzer compared to the VADER analyzer. These findings are further validated through statistical tests.

Publisher

MDPI AG

Reference134 articles.

1. Sentiment analysis of covid-19 tweets from selected hashtags in nigeria using vader and text blob analyser;Abiola;Journal of Electrical Systems and Information Technology,2023

2. Ordinal logistic regression in epidemiological studies;Abreu;Revista de Saude Publica,2009

3. Simple and effective confidence intervals for proportions and differences of proportions result from adding two successes and two failures;Agresti;The American Statistician,2000

4. Ahangar, Reza Gharoie, Yahyazadehfar, Mahmood, and Pournaghshband, Hassan (2010). The comparison of methods artificial neural network with linear regression using specific variables for prediction stock price in tehran stock exchange. arXiv.

5. Trees vs neurons: Comparison between random forest and ann for high-resolution prediction of building energy consumption;Ahmad;Energy and Buildings,2017

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3