首页|基于随机生存森林的食管癌生存预测模型研究

基于随机生存森林的食管癌生存预测模型研究

扫码查看
目的 基于随机生存森林算法构建食管癌患者生存预后的预测模型,探讨食管癌患者的预后影响因素并评估模型价值.方法 收集新疆医科大学附属肿瘤医院信息系统中2010年1月1日-2019年12月1日确诊的食管癌患者的临床病历资料进行随访研究,随访时间为2010年1月1日—2020年12月31日或患者死亡.建立随机生存森林模型,并与传统的Cox回归模型进行比较.预测结局为1、3、5、10年生存率,使用预测错误率和Brier分数评价模型预测精度,寻找影响食管癌预后的重要影响因素,发现因素间的交互作用,评价模型效果.采用R3.6、SPSS 21.0软件进行数据分析,使用survival、rms、randomForestSRC等程序包进行数据筛选与模型的建立和评价.结果 3 018例食管癌患者中出现结局事件623例(20.64%);截尾数据中,死于其他原因1 309例(54.66%);观察期结束仍存活1 086例(45.34%).中位生存期为1.41(1.30,1.52)年,1、3、5、10 年生存率分别为 60.30%、33.34%、27.40%、19.08%.临床分期、年龄、远处转移、淋巴结转移、浸润深度是RSF模型的重要影响因素.随机生存森林模型1、3、5、10年的预测错误率为38.4%、38.5%、34.7%、32.0%,均低于Cox回归模型的预测错误率,具有更高的准确度.结论 随机生存森林模型可以有效预测食管癌患者死亡率,且优于传统模型,可以为食管癌患者预后的改善和生命质量的提高提供科学依据.
Study on survival prediction model of esophageal cancer based on random survival forests
Objective This study aimed to construct a survival prediction model for esophageal cancer using the random survival forests and traditional Cox regression to explore prognostic factors and evaluate the model's performance.Methods Clinical records of esophageal cancer patients diagnosed between January 1,2010,to December 1,2019,at the affiliated cancer hospital of Xinjiang Medical University were collected for a follow-up study,with the follow-up period being from January 1,2010,to December 31,2020,or patient death.A random survival forests model was established and compared with the traditional Cox regression model.The model predicted 1-year,3-year,5-year,and 10-year survival rates,and prediction accuracy was evaluated using error rates and Brier scores.To find the important factors influencing the prognosis of esophageal cancer,discover the interaction between factors and evaluate the model effect.R 3.6 and SPSS 21.0 software were used for data analysis.Data screening and model building and evaluation were performed using the program packages survival,rms,and randomForestSRC.Results Out of 3 018 esophageal cancer patients,623(20.64%)experienced an outcome event.Among censored data,1 309(54.66%)died from other causes,while 1 086(45.34%)remained alive at the end of the observation period.The median survival time was 1.41(1.30,1.52)years,with corresponding survival rates of 60.30%,33.34%,27.40%,and 19.08%at 1,3,5,and 10 years respectively.Clinical stage,age,distant metastasis,lymph node metastasis,and depth of infiltration were identified as important influencing factors in the random survival forests model.The prediction error rates for the model at 1,3,5,and 10 years were 0.384,0.385,0.347,and 0.320,which were lower than the Cox regression model and had higher accuracy.Conclusions The random survival forest model effectively predicted mortality rate for esophageal cancer patients outperforming the traditional model,which can provide a scientific basis for the improvement of prognosis and quality of life of esophageal cancer patients.

Esophageal cancerRandom survival forestsCox proportional hazards modelSurvival analysisPrediction model

夏文俊、刘新亚、张燕、毛宏凯、曹明芹

展开 >

新疆医科大学,新疆维吾尔自治区 乌鲁木齐 830000

食管癌 随机生存森林 Cox回归 生存分析 预测模型

2024

中国预防医学杂志
中华预防医学会

中国预防医学杂志

CSTPCD
影响因子:1.004
ISSN:1009-6639
年,卷(期):2024.25(6)
  • 4