Relapse Prediction for Diffuse Large B-Cell Lymphoma Based on Multiple Random Empirical Kernels
Precise Prediction of Diffuse Large B-Cell Lymphoma based on Multiple Random Empirical Kernel Learning Machine
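Li Xueling (李雪玲) 1, Zhao Yanlin (赵艳琳) 1, Zhang Yanbo (张岩波) 2, Yu Hongmei (余红梅) 2, Zhou Jie (周洁) 3, Li Qiong (李琼) 1, Wang Junxia (王俊霞) 1, Qiao Yu (乔宇) 1, Zhang Gaoyuan (张高源) 1, Zhao Zhiqiang (赵志强) 4, Luo Yanhong (罗艳虹) 2
Author information
- 1. Department of Health Statistics, School of Public Health, Shanxi Medical University (030001); Shanxi Provincial Key Laboratory of Major Disease Risk Assessment
- 2. Department of Health Statistics, School of Public Health, Shanxi Medical University (030001); Shanxi Provincial Key Laboratory of Major Disease Risk Assessment; Key Laboratory of Coal Environmental Pathogenicity and Prevention, Ministry of Education
- 3. Nuclear Medicine PET/CT Center, Shanxi Provincial Cancer Hospital
- 4. Department of Hematology, Shanxi Provincial Cancer Hospital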
Abstract
Objective To construct a model that predicts relapse of diffuse large B-cell lymphoma (DLBCL) within two years after complete remission, based on a multiple random empirical kernel learning machine, and thereby provide a basis for treatment decisions. Methods Data on 445 patients who met the requirements of this study were drawn from the electronic medical record database (2010 to 2020) of a tertiary grade-A hospital in Shanxi Province. Relapse prediction models were built by combining five common class-imbalance handling methods with the multiple random empirical kernel learning machine, and were compared with five classifiers. Results The relapse prediction model based on SMOTE Tomek Links + multiple random empirical kernel learning machine achieved the best classification performance (accuracy=0.89, precision=0.87, recall=0.92, F1-score=0.89, Brier score=0.11). Conclusion On this real-world DLBCL dataset, handling the class imbalance with SMOTE Tomek Links and then building a multiple random empirical kernel model gave the best model performance at modest computational complexity, and can serve as a useful reference for DLBCL relapse prediction.
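The five reported metrics can be illustrated with a minimal Python sketch. It assumes a binary task with relapse coded as 1 and a 0.5 decision threshold; the function names and the toy labels and probabilities are hypothetical and are not data from the study. The last helper only checks that the reported precision and recall are arithmetically consistent with the reported F1-score.

```python
from typing import Dict, Sequence


def binary_metrics(y_true: Sequence[int], y_prob: Sequence[float],
                   threshold: float = 0.5) -> Dict[str, float]:
    """Accuracy, precision, recall, F1 and Brier score for 0/1 labels.

    y_true: true labels (1 = relapse, 0 = no relapse; an assumed coding).
    y_prob: predicted probability of the positive class.
    """
    n = len(y_true)
    if n == 0 or n != len(y_prob):
        raise ValueError("inputs must be non-empty and of equal length")

    # Hard predictions from probabilities (threshold is an assumption).
    y_pred = [1 if p >= threshold else 0 for p in y_prob]

    # Confusion-matrix counts.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = f1_from_precision_recall(precision, recall)

    # Brier score: mean squared error of the predicted probabilities.
    brier = sum((p - t) ** 2 for t, p in zip(y_true, y_prob)) / n

    return {
        "accuracy": (tp + tn) / n,
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "brier": brier,
    }


def f1_from_precision_recall(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 if both are zero)."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


# Toy example only: six made-up patients, not study data.
toy_true = [1, 1, 1, 0, 0, 0]
toy_prob = [0.9, 0.8, 0.4, 0.3, 0.6, 0.1]
toy_result = binary_metrics(toy_true, toy_prob)

# Consistency check on the values reported in the abstract.
reported_f1 = f1_from_precision_recall(0.87, 0.92)
```

With the reported precision of 0.87 and recall of 0.92, the harmonic mean is about 0.894, which agrees with the reported F1-score of 0.89 after rounding. For the Brier score, lower is better, so 0.11 indicates that the predicted probabilities are fairly close to the observed outcomes.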
Key words
Diffuse large B-cell lymphoma/Recurrence prediction/Empirical kernel mapping/Class imbalance
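Funding
Applied Basic Research Program of the Shanxi Provincial Department of Science and Technology, General Project (202103021224245)
National Natural Science Foundation of China, Young Scientists Fund (81502897)
National Natural Science Foundation of China, Young Scientists Fund (82273742)
Shanxi Medical University Doctoral Start-up Fund (BS2017029)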
Publication year
2024