首页|基于卷积神经网络和迁移学习的肿瘤舌象识别研究

基于卷积神经网络和迁移学习的肿瘤舌象识别研究

扫码查看
目的 探究卷积神经网络ResNet152模型识别肿瘤舌象图片的性能.方法 选取2019年1月至2021年12月于北京中医药大学东直门医院、北京中医药大学东方医院和北京中医药大学第三附属医院采集的5 943张舌图,其中包括肿瘤舌图1433张,非肿瘤舌图4 510张.经过图像预处理后,随机选取1 000张舌图作为测试集,其余的4 943张作为训练集.通过图像扩增技术将4 943张训练集舌图扩增为54 535张,并输入在ImageNet-2012数据集上经过预训练的卷积神经网络ResNet152模型,以建立舌象自动识别系统.然后将1 000张测试集舌图输入模型,记录识别结果.最后,运用GRAD-CAM技术对测试集中模型正确识别为肿瘤的舌图进行可视化分析,统计和分析模型识别肿瘤舌图重点关注的舌象特征.结果 在测试集中,卷积神经网络ResNet152模型识别肿瘤舌象的分类正确率为85.7%,召回率为84.9%,准确率为85.5%,F1值为85.2%,曲线下面积为91.3%.对正确识别为肿瘤的舌图进行可视化分析,结果显示对模型正确识别肿瘤贡献度最高的舌象特征是瘀斑和裂纹.结论 卷积神经网络ResNet152模型提供了一种非侵入性且高效的肿瘤检测方法,能够辅助肿瘤的诊断,瘀斑和裂纹可能是模型预测肿瘤最关注的两个舌象特征.
Study of tumor tongue image recognition via convolutional neural network and transfer learning
Objective To explore the performance of the convolutional neural network-ResNet152 model in identifying tumor tongue image.Methods A dataset consist of 5 943 tongue images,including 1 433 tumor tongue image and 4 510 non-tumor tongue image were collected from Dongzhimen Hos pital,Beijing University of Chinese Medicine,Dongfang Hospital Beijing University of Chinese Medicine,and Beijing University of Chinese Medicine Third Affiliated Hospital from January 2019 to December 2021.After image preprocessing,1 000 tongue image were randomly selected as the test set and the remaining 4 943 as the training set.The tongue images of 4 943 training sets were amplified to 54 535 by image amplification technique,and then input into ResNet152 model of convolutional neural network pre-trained on ImageNet-2012 data set to establish an automatic tongue image recognition system.Then 1 000 tongues of test sets were fed into the model and the recognition results were recorded.Finally,the GRAD-CAM technology was used to visually analyze the tongue image correctly identified as tumor by the test set model,and the tongue image features focused on tumor tongue image were identified statistically and analytically by the model.Results In identifying tongue images of tumor,the accuracy of conv-olutional neural network-ResNet152 model was 85.7%,recall rate was 84.9%,precision was 85.5%,F1 score was 85.2%,and area under curve was 91.3%.A visual analysis was conducted on tongue image correctly identified as tumor,revealing that the tongue features with the highest contribution to the model's correct tumor recognition were ecchymosis and fissures.Conclusion The convolutional neural network-ResNet152 model provides a non-invasive and efficient approach that contributes to tumor detection and diagnosis.The features of ecchymosis and fissures might be the primary focus of the model in predicting tumors based on tongue images.

Convolutional neural networkTransfer learningTumorTongue image recognition

曾孟霞、关静、李子健、张新峰、沈洋、刘传波、赵瑞珍、姜琳

展开 >

北京中医药大学中医学院,北京 102488

北京工业大学信息学部,北京 100124

北京中医药大学东直门医院肿瘤科,北京 100700

北京中医药大学东方医院肿瘤科,北京 100078

北京中医药大学第三附属医院体检科,北京 100029

展开 >

卷积神经网络 迁移学习 肿瘤 舌象识别

国家重点研发计划"中医智能舌诊系统研发"子课题

2017YFC1703302

2024

中国医药导报
中国医学科学院

中国医药导报

CSTPCD
影响因子:1.759
ISSN:1673-7210
年,卷(期):2024.21(12)
  • 32