首页|国家基础学科公共科学数据中心建设与发展实践

国家基础学科公共科学数据中心建设与发展实践

扫码查看
数智时代,科学数据作为国家重要战略性资源和生产要素之一,将对国家科技水平提升和经济发展提供重要动能。《科学数据管理办法》发布后,国家基础学科公共科学数据中心正式成立,旨在汇集管理我国基础学科领域及典型区域长期科研活动积累的科学数据,以及相关基础领域政府预算资金支持项目汇交的科学数据,具有基础性、跨学科性和前沿性特征。经过近五年的建设实践,目前已形成完善的基础学科数据资源体系,以及支持分布式科学数据资源统一管理、集成融合、分析挖掘和应用服务的标准体系、服务体系和技术体系,持续深化数据国际交流合作。同时,创新科学数据出版新模式,引领科学数据的高效汇聚、开放共享、多学科交叉融合分析和创新应用。未来将在基础学科高质量数据集构建、数据治理服务模式创新、多学科交叉融合应用等方面进一步开展工作。本文综述国家基础学科公共科学数据中心建设模式与实践成效,以期为科学数据管理机构建设运行提供通用参考。
Construction and practice of National Basic Science Data Center
In the era of digital intelligence,scientific data,as one of the country's important strategic resources and production factors,will provide significant momentum for the enhancement of national technological level and economic development.European and American countries have earlier accumulated,effectively managed,and preserved scientific data,and formulated a series of policies related to scientific data management and open sharing,which have contributed to the development of technological innovation.In 2018,the General Office of the State Council issued the"Scientific Data Management Measures",and the Ministry of Science and Technology and the Ministry of Finance supported the establishment of 20 national scientific data centers.Among them,the only National Basic Science Data Center(referred to as"NBSDC")led by a team with computer backgrounds was officially established,aiming to collect and manage scientific data accumulated through long-term scientific research activities in China's basic disciplines and typical regions,as well as scientific data submitted by government budget-funded projects in related basic fields.It possesses the characteristics of being fundamental,multidisciplinary,and cutting-edge.Positive results have been achieved based on nearly five years of NBSDC's construction and practice.Firstly,a rich basic discipline data resource system has been established,enhancing the disciplinary support capability,with a total data volume exceeding 2.95 PB.Secondly,a data policy and standard system has been built,releasing multiple national and group standards,promoting the continuous accumulation,integration,orderly preservation,and open sharing of scientific data.Thirdly,the G-FAIR principle that conforms to China's national conditions has been proposed,and a diversified and multi-dimensional service system has been constructed,ensuring that international FAIR principles are followed while ensuring data security and controllability.Fourthly,continuous in-depth thinking and active exploration have been conducted in the areas of aggregating quality resources,automating acquisition,and embedding into research workflows,and building a technological system and integrated data infrastructure to promote data fusion applications.Fifthly,international cooperation and exchanges in scientific data have been deepened,with professional academic platforms built through international organizations and academic conferences,accelerating the construction of cross-border data sharing ecosystems.Overall,a comprehensive basic discipline data resource system has been formed,as well as a standard system,service system,and technological system that support the unified management,integration,analysis,mining,and application services of distributed scientific data resources.At the same time,new modes of scientific data publication have been innovated,leading to efficient aggregation,open sharing,cross-disciplinary integration analysis,and innovative applications of scientific data.In the future,related work will be carried out such as the construction of high-quality datasets in basic disciplines,innovation in data governance service models,and cross-disciplinary integration applications.This article summarizes the construction model and practical achievements of the national public scientific data center for basic disciplines,aiming to provide general references for the construction and operation of scientific data management institutions.

scientific data centeropen sharinginterdisciplinarityopen science

高瑜蔚、胡良霖、朱艳华、李坤、赵欢、马晓萌、王璐

展开 >

中国科学院计算机网络信息中心,北京 100190

国家基础学科公共科学数据中心,北京 100190

首都师范大学中国语言智能研究中心,北京 100089

科学数据中心 开放共享 多学科交叉 开放科学

中国科学院信息化专项中国科学院网络安全和信息化专项咨询研究项目

WX145XQ07-03CAS-WX2023ZX01-11

2024

科学通报
中国科学院国家自然科学基金委员会

科学通报

CSTPCD北大核心
影响因子:1.269
ISSN:0023-074X
年,卷(期):2024.69(24)
  • 1
  • 16