图书情报工作2024,Vol.68Issue(24) :104-113.DOI:10.13266/j.issn.0252-3116.2024.24.009

基于BERTopic的大科学装置科学研究联合基金资助主题挖掘

Theme Mining of the Projects by Joint Research Fund of Large-scale Scientific Facility Based on BERTopic

王志强 李宜展 李云龙 李泽霞
图书情报工作2024,Vol.68Issue(24) :104-113.DOI:10.13266/j.issn.0252-3116.2024.24.009

基于BERTopic的大科学装置科学研究联合基金资助主题挖掘

Theme Mining of the Projects by Joint Research Fund of Large-scale Scientific Facility Based on BERTopic

王志强 1李宜展 1李云龙 2李泽霞1
扫码查看

作者信息

  • 1. 中国科学院文献情报中心 北京 100190;中国科学院大学经济与管理学院信息资源管理系 北京 100190
  • 2. 中国科学院前沿科学与教育局 北京 100864
  • 折叠

摘要

[目的/意义]明晰大科学装置基金资助项目和资助论文的主题信息,为未来国家围绕重大设施的资助布局提供决策参考.[方法/过程]利用BERTopic在语义层面对大科学装置科学研究联合基金资助项目与资助论文进行主题提取.[结果/结论]研究发现,不同类型的大科学装置研究主题有其特定重点,研究主题分布和优势研究主题具有鲜明特征.依托大科学装置的研究型项目居多,与新原理、新方法与关键技术等相关研究项目较少.通过对比大科学装置科学研究联合基金资助项目主题和论文主题分析的异同,对基金的管理工作和未来大科学装置的资助布局提供一定的参考借鉴.

Abstract

[Purpose/Significance]To clarify the theme information of the projects and published papers fund-ed by the large-scale scientific facility fund can provide a reference for the future research layout of large-scale sci-entific facility.[Method/Process]The BERTopic deep learning model was used to extract the topic of the projects and published papers funded by the joint research fund of large-scale scientific facility at the semantic level.[Result/Conclusion]The research finds that different types of large-scale scientific facility have their specific focus,and their distribution and advantages have distinct characteristics.Most of the research projects rely on large-scale sci-entific facility,and the research related on new principles,new methods and key technologies are less.The analysis of the similarities and differences of the project and the paper funded by the fund provides certain reference for the fund management and the layout of large-scale scientific facility in the future.

关键词

主题模型/BERTopic/主题识别/大科学装置/联合基金

Key words

topic model/large-scale scientific facility/joint research fund/BERTopic/topic identification

引用本文复制引用

出版年

2024
图书情报工作
中国科学院文献情报中心

图书情报工作

CSTPCDCSSCICHSSCD北大核心
影响因子:2.203
ISSN:0252-3116
段落导航相关论文