Scientia Sinica Vitae, 2024, Vol. 54, Issue (6): 1088-1100. DOI: 10.1360/SSV-2023-0150

Current status and prospect of the National Genomics Data Center

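Meili Chen 1, Yanqing Wang 1, Rujiao Li 2, Yingke Ma 1, Sisi Zhang 1, Xin Zhang 1, Shuhui Song 2, Jingfa Xiao 2, Wenming Zhao 2, Zhang Zhang 2, Yiming Bao 2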

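Author information

  • 1. National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences (China National Center for Bioinformation), Beijing 100101, China
  • 2. National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences (China National Center for Bioinformation), Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China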

Abstract

Since its founding in 2019, the National Genomics Data Center (NGDC) has served China's major strategic needs in population health and sustainable social development. It has preliminarily established a system, built on independent intellectual property and designed to be secure and controllable, for the submission, storage, management and sharing of multi-dimensional omics data. The system covers fundamental omics data resources, national human genetic resources, strategically important biological resources, biosecurity resources, and bioinformatics analysis tools and platforms, and it provides important resources and reference information for research on population health, public security, breeding improvement, biodiversity and related fields. To date, NGDC stores and manages 27.6 PB of data, and its data accession numbers are recommended or recognized by major global publishers, including Springer Nature, Elsevier, Wiley, and Taylor & Francis. Nucleic Acids Research, an authoritative international journal in the field, has for six consecutive years described NGDC as one of the major international biological data centers alongside NCBI in the United States and EBI in Europe; nevertheless, a gap remains between NGDC and world-class data centers. Looking ahead, NGDC will focus on intelligent data curation, integrated data retrieval, a cloud platform for biological big data, and cutting-edge algorithms and tools, while stepping up its efforts in securing funding, developing talent and pursuing international collaboration. The goal is to build a world-leading genomics data center that supports innovation and self-reliance in China's life and health sciences.

Key words

genome / bioinformatics / big data / multi-omics / human genetic resources / repository management / National Genomics Data Center / China National Center for Bioinformation

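Funding

Strategic Priority Research Program of the Chinese Academy of Sciences (Category B) (XDB38030200)

National Key Research and Development Program of China (2021YFF0703704)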

Publication year

2024
Scientia Sinica Vitae
Chinese Academy of Sciences

CSTPCD / CSCD / Peking University Core Journals
Impact factor: 0.725
ISSN: 1674-7232
References: 9