基于双阶段元学习的小样本中医舌色域自适应分类方法

扫码查看

原文链接

万方数据
维普

中文摘要：舌色是中医(TCM)望诊最关注的诊察特征之一.在实际应用中,通过一台设备采集到的舌象数据训练得到的舌色分类模型应用于另一台设备时,由于舌象数据分布特性不一致,分类性能往往急剧下降.为此,该文提出一种基于双阶段元学习的小样本中医舌色域自适应分类方法.首先,设计了一种双阶段元学习训练策略,从源域有标注样本中提取域不变特征,并利用目标域的少量有标注数据对网络模型进行微调,使得模型可以快速适应目标域的新样本特性,提高舌色分类模型的泛化能力并克服过拟合.接下来,提出了一种渐进高质量伪标签生成方法,利用训练好的模型对目标域的未标注样本进行预测,从中挑选出置信度高的预测结果作为伪标签,逐步生成高质量的伪标签.最后,利用这些高质量的伪标签,结合目标域的有标注数据对模型进行训练,得到舌色分类模型.考虑到伪标签中含有噪声问题,采用了对比正则化函数,可以有效抑制噪声样本在训练过程中产生的负面影响,提升目标域舌色分类准确率.在两个自建中医舌色分类数据集上的实验结果表明,在目标域仅提供20张有标注样本的情况下,舌色分类准确率达到了91.3％,与目标域有监督的分类性能仅差2.05％.

外文标题：Few Shot Domain Adaptation Tongue Color Classification in Traditional Chinese Medicine via Two-stage Meta-learning

外文摘要：Tongue color is one of the most concerning diagnostic features of tongue diagnosis in Traditional Chinese Medicine (TCM). In practical applications, the performance of the model trained from the tongue data acquired by one device is dramatically degraded when applied to other devices due to the data distribution discrepancy. Therefore, in this paper, a few shot domain adaptation tongue color classification method with two-stage meta-learning is proposed. Firstly, a two-stage meta-learning training strategy is proposed to extract domain invariant features from labeled samples in the source domain, and then, the meta-trained network model is fine-tuned using a few labeled data in the target domain, so that the model can quickly adapt to the new sample characteristics in the target domain, improving the generalization ability of the tongue color classification model and avoid overfitting problem. Next, a progressive pseudo label generation strategy is proposed, which uses the meta-trained model to predict the unlabeled samples in the target domain. The prediction results with high confidence are selected and treated as pseudo labels. So high-quality pseudo labels can be gradually generated. Finally, these high-quality pseudo labels are used to train the model, together with the labeled data in the target domain. The tongue color classification model can be obtained. Considering the noisy pseudo labels, the contrast regularization function is adopted, which can effectively suppress the negative impact of noisy samples in the training process and improve the tongue color classification accuracy in the target domain. The experimental results on two self-established TCM tongue color classification datasets show that the classification accuracy of tongue color in the target domain reaches 91.3％ when only 20 labeled samples are given in the target domain, which is only 2.05％ lower than that of the supervised classification model in the target domain.

外文关键词：

Tongue color classification in Traditional Chinese Medicine(TCM)Few shotDomain adaptationTwo-stage meta-learning

作者：

卓力、张雷、贾童瑶、李晓光、张辉

展开 >

作者单位：

北京工业大学计算智能与智能系统北京重点实验室北京 100124

北京工业大学信息学部北京 100124

关键词：

中医舌色分类小样本域自适应双阶段元学习

基金：

国家自然科学基金国家中医药局中医药创新团队及人才支持计划

项目编号：

61871006ZYYCXTD-C-202210

出版年：

2024

DOI：

10.11999/JEIT230249

电子与信息学报

中国科学院电子学研究所国家自然科学基金委员会信息科学部

电子与信息学报

CSTPCD北大核心

影响因子：1.302

ISSN：1009-5896

年,卷(期)：2024.46(3)

参考文献量28