现代计算机2024,Vol.30Issue(23) :108-112.DOI:10.3969/j.issn.1007-1423.2024.23.021

基于TF-IDF算法的舆情分析研究——以日本排放核废水事件为例

Research on public opinion analysis based on TF-IDF algorithm—Taking Japan's nuclear wastewater discharge incident as an example

杜宇灏 李环宇 林晓霞
现代计算机2024,Vol.30Issue(23) :108-112.DOI:10.3969/j.issn.1007-1423.2024.23.021

基于TF-IDF算法的舆情分析研究——以日本排放核废水事件为例

Research on public opinion analysis based on TF-IDF algorithm—Taking Japan's nuclear wastewater discharge incident as an example

杜宇灏 1李环宇 2林晓霞2
扫码查看

作者信息

  • 1. 山东科技大学智能装备学院,泰安 271000;泰安市广播电视台,泰安 271000
  • 2. 山东科技大学智能装备学院,泰安 271000
  • 折叠

摘要

日本核废水排海事件在互联网引起了极大的反响,迅速放大扩散到社会多个方面形成了一次舆情事件,在一定程度上影响到了社会管理甚至社会的安定.由此可见及时捕捉网络舆情,分析其特点,相关职能部分据此采取化解防范措施,已经成为当前亟待解决的问题.针对这一需求,开发了一个基于TF-IDF和Word2Vec算法的舆情监测程序.首先对微博内容文本进行清洗和分词处理,后利用TF-IDF算法提取微博文本关键词;其次按照关键词权重排序并生成词云图;最后将单词转换为高维向量并可视化在二维平面上,为舆情监测提供决策依据.

Abstract

The incident of Japan's nuclear wastewater discharge to the sea has caused great repercussions on the Internet.It has rapidly expanded and spread to many aspects of society,forming a public opinion event,which has affected social management and even social stability to a certain extent.It can be seen that timely capture of online public opinion,analysis of its characteris-tics,and relevant functional departments taking preventive measures based on this have become urgent problems to be solved.A public opinion monitoring program based on TF-IDF and Word2Vec algorithm has been developed to meet this demand.Firstly,the Weibo content text is cleaned and segmented,and then the TF-IDF algorithm is used to extract Weibo text keywords.Secondly,the keywords are sorted by weight and a word cloud map is generated.Finally,the words are converted into high-dimensional vec-tors and visualized on a two-dimensional plane,providing decision-making basis for public opinion monitoring.

关键词

核废水/舆情监测/TF-IDF/可视化

Key words

nuclear waste water/public opinion monitoring/TF-IDF/visualization

引用本文复制引用

出版年

2024
现代计算机
中大控股

现代计算机

影响因子:0.292
ISSN:1007-1423
段落导航相关论文