首页|机器学习对H.pylori感染患者的特征变量及预测模型研究

机器学习对H.pylori感染患者的特征变量及预测模型研究

扫码查看
目的 分析H.pylori感染患者感染的危险因素,建立H.pylori感染患者预测模型,为防治H.pylori感染提供参考.方法 选取2021 年 7 至 2022 年 5 月在中山市中医院、中山市东凤人民医院、中山市南区医院共 1 477 例接受 H.pylori检测者为研究对象,依据胃镜和14 C、13 C呼气试验的检测结果,将 H.pylori受检人群分为感染组和无感染组,分别进行问卷调查,调查内容包括受检者基本情况、临床表征、慢性基础病、生活和饮食习惯等,共计 63 个变量.采用单因素及机器学习中的 Logistic回归、决策树分析及添加交互项的 Logistic回归对 H.pylori感染进行多因素分析,并比较 3 个模型的 ROC 曲线下的面积、灵敏度、特异度,验证模型的准确性,建立 H.pylori感染预测模型,将特征和危险因素建立森林图.结果 Logistic回归分析的 AUC 为 0.7361,灵敏度为 0.7615,特异度为0.6034.决策树分析的 AUC为 0.6528,灵敏度为 0.6801,特异度为 0.5773.添加交互项后的 Logistic 回归分析的 AUC 为 0.7388,灵敏度为 0.7588,特异度为 0.6034.添加交互项的多因素Logistic回归结果显示,有胃胀,口气、口臭,在家煮食午餐,在家无而外出有使用公筷习惯,同居家人有感染,疫情后才使用公筷,居住 4~10 层楼,同时有胃胀及口气、口臭为模型的显著性变量.结论 胃胀,有口气、口臭,同时有胃胀及口气、口臭,在家煮食午餐,居住的楼层数,外出居家是否使用公筷,是否有使用公筷习惯,家人是否感染 H.pylori是感染 H.pylori的特征因素,用 Logistic回归模型作为主模型进行变量筛选,添加交互后的模型,AUC 有所提升,交互项的预测模型对 H.pylori感染者预判能力好,运算容易,使用经济、便利,适合区域性推广.
Machine learning study of characteristic variables and predictive models for patients with H.pylori infection
Objective To analyze the risk factors of infection in patients with H.pylori infection,establish a predic-tion model for patients with H.pylori infection,and provide reference for the prevention and treatment of H.pylori infec-tion.Methods A total of 1 477 patients tested for H.pylori in Zhongshan Hospital of Traditional Chinese Medicine,Dongfeng People's Hospital of Zhongshan and Zhongshan South District Hospital from Jul.2021 to May 2022 were se-lected as study subjects.The results of gastroscopy,14 C and 13 C breath tests were used to divide the H.pylori tested pop-ulation into infected and non-infected groups,and questionnaires were administered to investigate the contents of the sur-vey,including the basic a total of 63 variables were included in the survey,including the basic conditions,clinical fea-tures,chronic underlying diseases,life and dietary habits.The results were subjected to multifactorial analysis of H.pylori infection using single factor and Logistic regression in machine learning,decision tree analysis,and Logistic regression with added interaction terms,and the area under the ROC curve,sensitivity and specificity of the 3 models were com-pared to verify the accuracy of the models,and a prediction model of H.pylori infection was established,and the charac-teristics and risk factors were established as a forest plot.Results The AUC of Logistic regression was 0.7361,sensi-tivity was 0.7615,and specificity was 0.6034.The AUC of decision tree was 0.6528,sensitivity was 0.6801,and specificity was 0.5773.The AUC of Logistic with the addition of interaction term was 0.7388,sensitivity was 0.7588,and specificity was 0.6034.The multifactorial Logistic regression with the addition of interaction terms showed that hav-ing stomach bloating,bad breath and halitasis,cooking lunch at home,not having it at home but having the habit of using communal chopsticks when going out,having infected family members living together use public chopsticks only after the epidemic,lived in 4 to 10 floors,and also had stomach bloating and both bad breath and halitosis.Conclusion Stomach bloating,having bad breath and halitasis,having both stomach bloating and bad breath and halitasis,cooking lunch at home,number of floors lived in,whether to use public chopsticks at home,whether to have the habit of using public chopsticks,whether to have family members infected with H.pylori are the characteristic factors of H.pylori infec-tion,and Logistic regression model was used as the main model for variable screening.The AUC area of the model after adding the interaction is improved,and the prediction model with the interaction term has good predictive ability for H.pylori infected patients,easily to calculate,economical and convenient to use,and suitable for regional extension.

Helicobacter pyloriBinary Logistic regression modelDecision treeForest plotInteraction term

袁一鸣、杜结玲、洪慧斯、韦翠花、卢苑香

展开 >

中山市中医院药学部,广东 中山 528401

中山市中医院内镜中心

中山市中医院科教科

中山市南区医院内镜室

中山市东凤人民医院内镜室

展开 >

幽门螺杆菌 二元Logistic回归模型 决策树 森林图 交互项

中山市科学技术局第一批社会公益(医疗卫生一般项目)

2021SYF01

2024

胃肠病学和肝病学杂志
郑州大学

胃肠病学和肝病学杂志

CSTPCD
影响因子:1.029
ISSN:1006-5709
年,卷(期):2024.33(8)