Utilizing Text Mining for Labeling Training Models from Futures Corpus in Generative AI

Author:

Chou Hsien-Ming1ORCID,Cho Tsai-Lun123

Affiliation:

1. Department of Information Management, Chung Yuan Christian University, Taoyuan City 320314, Taiwan

2. Department of Information Management, Chien Hsin University of Science and Technology, Taoyuan City 320678, Taiwan

3. Department of Mathematics, National Tsing Hua University, Hsinchu 300044, Taiwan

Abstract

For highly time-constrained, very short-term investors, reading and extracting valuable information from financial news poses significant challenges. The wide range of topics covered in these news articles further compounds the difficulties for investors. The diverse content adds complexity and uncertainty to the text, making it arduous for very short-term investors to swiftly and accurately extract valuable insights. Variations between authors, media sources, and cultural backgrounds also introduce additional complexities. Hence, performing a bull–bear semantic analysis of financial news using text mining technologies can alleviate the volume, time, and energy pressures on very short-term investors, while enhancing the efficiency and accuracy of their investment decisions. This study proposes labeling bull–bear words using a futures corpus detection method that extracts valuable information from financial news, allowing investors to quickly understand market trends. Generative AI models are trained to provide real-time bull–bear advice, aiding investors in adapting to market changes and devising effective trading strategies. Experimental results show the effectiveness of various models, with random forest and SVMs achieving an impressive 80% accuracy rate. MLP and deep learning models also perform well. By leveraging these models, the study reduces the time spent reading financial articles, enabling faster decision making and increasing the likelihood of investment success. Future research can explore the application of this method in other domains and enhance model design for improved predictive capabilities and practicality.

Funder

National Science and Technology Council of Taiwan

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Reference25 articles.

1. Fake news detection methods: A survey and new perspectives;Refoufi;Adv. Intell. Syst. Sustain. Dev. (AI2SD’2020),2022

2. Longoni, C., Fradkin, A., Cian, L., and Pennycook, G. (2022, January 21–24). News from generative artificial intelligence is believed less. Proceedings of the ACM Conference on Fairness, Accountability, and Transparency, Seoul, Republic of Korea.

3. Stock price prediction by deep neural generative model of news articles;Matsubara;IEICE Trans. Inf. Syst.,2018

4. Galaxy: A generative pre-trained model for task-oriented dialog with semi-supervised learning and explicit policy injection;He;Proc. AAAI Conf. Artif. Intell.,2022

5. Ahnve, F., Fantenberg, K., Svensson, G., and Hardt, D. (2022, January 10–13). Predicting stock price movements with text data using labeling based on financial theory. Proceedings of the IEEE International Conference on Big Data (Big Data), Atlanta, GA, USA.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3