首页|对胆道闭锁诊断生物标志物及其与免疫细胞浸润关系的基于机器学习的分析

对胆道闭锁诊断生物标志物及其与免疫细胞浸润关系的基于机器学习的分析

扫码查看
目的:基于生物信息学技术筛选胆道闭锁的诊断生物标志物,分析其与免疫细胞浸润的相关性.方法:从基因表达综合(GEO)数据库中获取与胆道闭锁相关的基因表达谱,利用R语言筛选与胆道闭锁相关的差异基因并进行基因的富集分析.将差异基因与加权基因共表达网络选择的主要模块基因集合,通过使用LASSO回归和SVM-RFE分析两种机器学习算法鉴定核心基因.采用受试者工作特征(ROC)曲线验证核心基因对胆道闭锁的诊断准确性.通过ssGSEA检测 28种免疫细胞的浸润水平及分析其与核心基因的联系.结果:共筛选出 76 个差异基因.富集分析显示,差异基因主要富集在细胞外基质受体相互作用和白细胞介素-17(IL-17)信号通路及磷脂酰肌醇 3-激酶/蛋白激酶B(PI3K/Akt)信号通路等.关键模块和差异基因重叠后,筛选到 26个重叠基因.通过SVM-RFE和Lasso这两种机器学习算法,确定 3个核心基因(CXCL8、LAMC2、KRT19)作为胆道闭锁的可能诊断生物标志物.差异分析显示,CXCL8、LAMC2、KRT19 在胆道闭锁患儿中表达量显著高于正常对照者(P<0.001).ROC曲线分析显示,CXCL8、LAMC2 和KRT19 具有较高的诊断价值,三者联合诊断的ROC曲线下面积为 0.969,并且其与免疫细胞浸润呈正相关.结论:CXCL8、LAMC2 和KRT19 在胆道闭锁患儿的肝组织中均上调,联合诊断的准确性高,可能是胆道闭锁诊断的候选基因,这为后续治疗提供了新的思路.
Machine learning-based analysis of diagnostic biomarkers for biliary atresia and their relationship with immune cell infiltration
Objective:To screen diagnostic biomarkers for biliary atresia and their correlation with immune cell infiltration based on bioinformatics technology.Methods:The gene expression profiles related to biliary atresia were obtained from gene expression omnibus(GEO)database,and the differential genes related to biliary atresia were screened by R language,and gene enrichment analysis was conducted.The main module genes selected by differential gene and weighted gene co-expression network were collected,core genes were identified by using two machine learning algorithms:LASSO regression and SVM-RFE analysis.Receiver operating characteristic(ROC)curve was used to verify the diagnostic accuracy of core genes for biliary atresia.The infiltration levels of 28 kinds of immune cells were detected by ssGSEA and their association with core genes was analyzed.Results:A total of 76 differential genes were screened.Enrichment analysis showed that differential genes were mainly enriched in extracellular matrix receptor interaction and interleukin-17(IL-7)signaling pathway and phosphatidylinositol 3-kinase/protein kinase B(PI3K/Akt)signaling pathway.After overlapping of key modules and differential genes,26 overlapping genes were screened.Two machine learning algorithms,SVM-RFE and Lasso,were used to identify three core genes(CXCL8,LAMC2,KRT19)as potential diagnostic biomarkers for biliary atresia.Differential analysis showed that the expression levels of CXCL8,LAMC2 and KRT19 in children with biliary atresia were significantly higher than those in normal controls(P<0.001).ROC curve analysis showed that CXCL8,LAMC2,and KRT19 had high diagnostic value,and the area under ROC curve of the combined diagnosis was 0.969.It had been proven to be positively correlated with concentrated immune cell infiltration.Conclusions:CXCL8,LAMC2,and KRT19 are all up-regulated in the liver tissue of children with biliary atresia,and their combined diagnosis has high accuracy.They may be candidate genes for the diagnosis of biliary atresia,and provide new ideas for subsequent treatment.

Biliary atresiaMachine learningGEO databaseBioinformaticsDiagnostic marker

姜一敏、蔡子涵、吴琼

展开 >

泗阳医院检验科,江苏 泗阳 223700

胆道闭锁 机器学习 GEO数据库 生物信息学 诊断标志物

2024

感染、炎症、修复
解放军总医院第一附属医院

感染、炎症、修复

影响因子:0.404
ISSN:1672-8521
年,卷(期):2024.25(4)