现代信息科技2024,Vol.8Issue(4) :153-161,166.DOI:10.19850/j.cnki.2096-4706.2024.04.032

两阶段多标签分类探索中医证素辨证规律

Exploration of the Rules of Traditional Chinese Medicine Syndrome Elements Differentiation by Two-stage Multi-label Classification

蓝勇 程春雷 叶青 胡杭乐 沈友志
现代信息科技2024,Vol.8Issue(4) :153-161,166.DOI:10.19850/j.cnki.2096-4706.2024.04.032

两阶段多标签分类探索中医证素辨证规律

Exploration of the Rules of Traditional Chinese Medicine Syndrome Elements Differentiation by Two-stage Multi-label Classification

蓝勇 1程春雷 2叶青 1胡杭乐 1沈友志1
扫码查看

作者信息

  • 1. 江西中医药大学 计算机学院,江西 南昌 330004
  • 2. 江西中医药大学 计算机学院,江西 南昌 330004;江西中医药大学 中医人工智能重点研究室,江西 南昌 330004
  • 折叠

摘要

探索证素辨证规律能更好地辅助临床决策和促进中医辨证理论的传承.中医文本句式结构复杂、表述标准不一,难以匹配符号规则,且神经网络黑盒特性又难以直接解释其辨证过程.为探索中医证素辨证规律,第一阶段使用神经网络模型对证素进行多标签分类,通过稀疏注意力捕获与证素相关的关键词及其权重生成证素表征;第二阶段使用随机森林对融入相关证素标签的证素表征进行分类训练,后对随机森林规则提取以探索辨证规律,提高证素辨证的可解释性.实验结果表明,该方法提升了证素辨识的准确率,同时F1保持较高水平,有利于探索证素辨证规律.

Abstract

Exploring the rules of syndrome element differentiation can better assist clinical decision-making and promote the inheritance of TCM syndrome differentiation theory.The sentence structure of TCM texts is complex,with varying expression standards,making it difficult to match symbol rules,and the black box characteristics of neural networks are difficult to directly explain their differentiation process.In order to explore the rules of syndrome element differentiation in TCM,it uses a neural network model to classify syndrome elements with multiple labels in the first stage,and generates syndrome element representation by capturing keywords and their weights related to syndrome elements through sparse attention.In the second stage,the random forest is used to conduct classified training on syndrome element representation incorporating relevant syndrome element labels,and then random forest rules are extracted to explore the syndrome differentiation rules and improve the interpretability of syndrome element differentiation.The experimental results show that this method improves the accuracy of syndrome element identification,while maintaining a high level of F1,which is conductive to exploring the rules of syndrome element differentiation.

关键词

证素辨证规律/稀疏注意力/多标签分类/随机森林模型/可解释性

Key words

rules of syndrome element differentiation/sparse attention/multi-label classification/random forest model/interpretability

引用本文复制引用

基金项目

江西省自然科学基金资助项目(20224BAB206102)

国家自然科学基金(82260988)

江西省教育厅科学技术研究项目(GJJ2200923)

江西省卫生和计划生育委员会-科技计划项目(202211404)

江西中医药大学博士启动基金(2018WBZR021)

出版年

2024
现代信息科技
广东省电子学会

现代信息科技

ISSN:2096-4706
参考文献量12
段落导航相关论文