中文医学知识大模型问答语料数据集构建研究

Study on the Construction of a Question-Answer Corpus Dataset for Chinese Medical Knowledge Large Language Models

吕婷钰 ¹李晓瑛 ¹张颖 ¹刘宇炀 ¹杜晋华 ²李心怡 ²罗妍 ¹唐小利 ¹任慧玲 ¹刘辉 ¹尹浩²

扫码查看

作者信息

1. 中国医学科学院/北京协和医学院医学信息研究所/图书馆北京 100005
2. 清华大学网络大数据研究中心北京 100084
折叠

摘要

目的/意义构建中文医学知识问答语料数据集,为医学垂域大模型提供标准化的评测基准,进而提升大模型处理中文医学问答任务的准确率和效率.方法/过程构建中文医学论文知识问答数据集、医学名词解释问答数据集和以中国执业医师资格考试真题为基础的问答数据集,整理相关开源数据集.结果/结论自主构建的中文医学知识问答语料数据集丰富了中文医学问答语料来源,能够作为一项标准化的评测基准,推动医学领域大模型实现客观全面的定量评估,今后将利用电子病历、在线健康社区等数据,为健康中国战略的实施提供更坚实的人工智能支持.

Abstract

Purpose/Significance To construct a Chinese medical knowledge Q&A corpus dataset as a standardized evaluation bench-mark for large language models(LLMs)in the medical domain,so as to improve the accuracy and efficiency of LLMs in handling Chinese medical questions.Method/Process Chinese medical paper knowledge,medical terminology explanations and supplementary questions are acquired from the Chinese medical licensing examination,and open-source Chinese medical Q&A datasets are encompassed in the developed Q&A datasets.Result/Conclusion The Chinese medical knowledge Q&A corpus datasets enrich the sources of existing datasets and promote the objective and comprehensive quantitative evaluation of large models in the medical field.In the near future,additional data such as electronic medical records and those from online health communities will be used to strengthen the support of artificial intelli-gence for the Healthy China strategy.

关键词

大语言模型/语料数据集/模型评测/医学

Key words

large language models/corpus dataset/model evaluation/medicine

引用本文复制引用

基金项目

国家社会科学基金(20BTQ062)

中央高校基本科研业务费资助项目(3332023163)

出版年

2024

医学信息学杂志

中国医学科学院

医学信息学杂志

CSTPCD

影响因子：1.348

ISSN：1673-6036

参考文献量19

段落导航