
The Application of Text Analysis in Financial Research: A Literature Review and Paradigm of Use

In the current era of big data and artificial intelligence, unstructured data, of which text is the most representative form, has attracted widespread attention from scholars, and textual data is playing an increasingly important role in the social sciences, especially in financial research. By studying non-traditional themes such as sentiment and emotion, policy uncertainty, and semantic similarity, text analysis is creating new paradigms for financial research. This paper focuses on the application of text analysis in financial research. Drawing on a review of the domestic and international literature, it first summarizes the workflow and paradigm for using text data in financial research, describing in detail four key steps: acquiring financial texts, processing financial texts, representing financial texts with models, and constructing indicators from financial texts. Next, against the background of the rise of large language models (LLMs), the paper analyzes their potential applications in financial research and discusses the risks and challenges they may bring, such as data privacy and model bias. Finally, the paper summarizes the application of text analysis in financial research, reflects on current academic criticisms of text analysis, and looks ahead to the use of LLMs in financial research.

Text Analysis; Financial Research; Unstructured Big Data; Deep Learning; Large Language Model

Yin Zhentao (尹振涛), Wang Zhen (王振)


School of Applied Economics, University of Chinese Academy of Social Sciences

Institute of Finance and Banking, Chinese Academy of Social Sciences


2024

Rural Finance Research (农村金融研究)
China Rural Finance Society (中国农村金融学会)


Peking University Core Journal (北大核心)
Impact factor: 0.477
ISSN: 1003-1812
Year, volume (issue): 2024, (11)