基于大语言预训练模型的中医个性化处方推荐研究
Research on Personalized Prescription Recommendation of Traditional Chinese Medicine Based on Large Language Pre-Training Model
王欣宇 1杨涛 2胡孔法3
作者信息
- 1. 南京中医药大学人工智能与信息技术学院,江苏南京 210023
- 2. 南京中医药大学人工智能与信息技术学院,江苏南京 210023;南京大学信息管理学院,江苏南京 210023;江苏省中医药防治肿瘤协同创新中心,江苏南京 210023
- 3. 南京中医药大学人工智能与信息技术学院,江苏南京 210023;江苏省中医外用药开发与应用工程研究中心,江苏南京 210023;南京中医药大学唐仲英中医疫病研究中心,江苏南京 210023
- 折叠
摘要
目的 针对中医个性化处方推荐问题,研究自动化处方推荐任务,为中医临床辅助决策提供参考.方法 基于大语言预训练文本生成模型设计一种中医个性化处方推荐算法.将中医处方推荐任务转化为端到端(seq2seq)的文本生成任务,即将临床症状描述文本通过模型转化为处方文本,以实现处方推荐任务的需求,并利用基于大语言预训练的BART(Bidirectional and Auto-Regressive Transformers)模型的预训练参数来提升模型对通用语义信息的理解,通过对训练集处方内中药排序提升模型的处方推荐性能.结果 实验证明通过大语言预训练模型以及端到端的文本生成架构可有效提升模型的生成性能,同时对处方内中药依次排序可以获取更高准确率,并且通过中药的排列获取更多值得参考的有价值信息.中医个性化处方推荐模型在处方排序后分别在前5、10、15味生成的处方分别取得了 58.60、53.79和49.67的准确率.结论 中医个性化处方推荐模型取得了更优的处方推荐效果,表明其可为中医临床治疗疾病进行参考,达到辅助临床决策支持的效果.
Abstract
Objective Aiming at the problem of personalized prescription recommendation of TCM,the automatic prescription recommendation task is studied to provide reference for TCM clinical decision-making.Methods Based on the large language pre-trained text generation model,a personalized prescription recommendation algorithm of traditional Chinese medicine is de-signed.The TCM prescription recommendation task is transformed into an end-to-end(seq2seq)text generation task,that is,the clinical symptom description text is converted into prescription text through the model to realize the requirements of the pre-scription recommendation task,and the pre-training parameters of the Bidirectional and Auto-Regressive Transformers(BART)model based on large language pre-training are used to improve the model's understanding of general semantic information,and the prescription recommendation performance of the model is improved by ordering TCM medicines in the prescription of the train-ing set.Results Experiments show that the model generation performance can be effectively improved through the large language pre-training model and the end-to-end text generation architecture,and the sequential ordering of Chinese drug in the pre-scription can obtain more valuable information worthy of reference in the arrangement of Chinese drugs at the same time as higher accuracy.In this paper,the TCM personalized prescription recommendation model has achieved an accuracy rate of 58.60,53.79 and 49.67 in the top 5,10 and 15 herbs respectively after prescription ordering.Conclusion In this paper,the personalized pre-scription recommendation model of TCM has achieved better prescription recommendation effect,indicating that it can be used as a reference for the clinical treatment of diseases in TCM,and meet the needs of clinical auxiliary decision support.
关键词
处方推荐/大语言模型/中医/文本生成Key words
prescription recommendation/large language model/TCM/text generation引用本文复制引用
基金项目
国家自然科学基金项目(82074580)
国家重点研发计划项目(2022YFC3500201)
中国博士后科学基金面上项目(2021M701674)
江苏省重点研发计划项目(BE2022712)
江苏省博士后科研资助计划项目(2021K457C)
江苏高校"青蓝工程"项目(2021)()
出版年
2024