首页|基于加权基因共表达网络分析和机器学习筛选脓毒症免疫抑制特征基因及其靶向中药活性成分预测

基于加权基因共表达网络分析和机器学习筛选脓毒症免疫抑制特征基因及其靶向中药活性成分预测

扫码查看
目的 本研究通过加权基因共表达网络分析(weighted gene co-expression network analysis,WGCNA)和多种机器学习方法识别脓毒症免疫抑制特征基因,并预测其靶向中药活性成分.方法 从美国国家生物技术信息中心基因表达综合数据库中下载GSE182522 数据集,使用Bioconductor的"limma"软件包提取出差异表达基因(differentially expressed genes,DEGs),对DEGs进行京都基因与基因组百科全书(Kyoto encyclopedia of genes and genomes,KEGG)通路和基因本体论(gene ontology,GO)富集分析,同时采用WGCNA,以及最小绝对收缩和选择算子(least absolute shrinkage and selection operator,LASSO)回归、支持向量机递归特征消除(support vector machine-recursive feature elimination,SVM-RFE)、弹性网络回归分析 3 种机器学习法筛选出脓毒症免疫抑制特征基因.随后,构建受试者操作特征曲线(receiver operating characteristic curve,ROC curve,简称ROC曲线)验证特征基因的诊断效能,并绘制箱线图展示特征基因的表达模式.选取SPF级BALB/c小鼠 39 只,采用随机数字表法将小鼠随机分为脓毒症免疫正常组(n=20)和脓毒症免疫抑制组(n=19).盲肠结扎穿孔术构建脓毒症小鼠模型,在术后12h取脓毒症免疫正常组小鼠的外周血,术后24 h取脓毒症免疫抑制组小鼠的外周血.采用逆转录定量聚合酶链反应(reverse transcription quantitative polymerase chain reaction,RT-qPCR)检测两组小鼠TRBV7-2 的表达情况,验证TRBV7-2 的诊断效能.最后,通过分子对接技术预测特征基因潜在靶点中药活性成分.结果 共筛选出 445 个DEGs,其中上调基因 173 个,下调基因 272 个.DEGs主要富集于氮代谢、胆汁分泌、胃酸分泌 3 条信号通路,分子功能的正调控、细胞对含氧化合物的反应、细胞对含氮化合物的反应、对肽的响应、葡萄糖输入的调控等生物过程,质膜蛋白复合体、T细胞受体复合物、细胞侧膜等细胞组分,核苷三磷酸酶调节活性、GTPase激活剂活性、外源蛋白结合等分子功能.通过WGCNA共筛选出 56 个枢纽基因,3 种机器学习交集得到 1 个特征基因:TRBV7-2.ROC曲线分析得出TRBV7-2的AUG=0.72,具有较高的临床诊断价值,其表达量在脓毒症免疫抑制患者样本中显著下调.RT-qPCR结果显示,脓毒症免疫抑制组小鼠血液中TRBV7-2 基因表达低于脓毒症免疫正常组.通过分子对接技术得出小檗碱、黄芩苷、苍术素等10个潜在靶向中药活性成分.结论 TRBV7-2 可能作为脓毒症免疫抑制的诊断生物标志物,小檗碱等 10 味靶向中药活性成分可能成为降低脓毒症致死率的立足点.
Exploring signature gene and predicting the target active ingredients of TCM for immunosuppression in sepsis based on weighted gene co-expression network analysis and machine learning
Objective To identify immunosuppression in sepsis signature genes and predict the active ingredient of traditional Chinese medicine(TCM)using weighted gene co-expression network analysis(WGCNA)and variable machine learning methods.Method The GSE182522 dataset was downloaded from the gene expression omnibus,and differentially expressed genes(DEGs)were extracted using the limma software package of Bioconductor.Kyoto Encyclopedia of Genes and Genomes(KEGG)pathway and gene ontology(GO)enrichment analysis were performed on DEGs.Additionally,WGCNA and the least absolute shrinkage and selection operator(LASSO)regression,support vector machine-recursive feature elimination(SVM-RFE),and elastic network regression were used to screen immunosuppression in sepsis signature genes.Then,the receiver operating characteristic curve(ROC curve)was used to evaluate the diagnostic performance,and draw a box plot to show the expression patterns of the feature genes.A total of 39 SPF BALB/c mice were randomly assigned to 2 groups:immunocompetent-sepsis group(n=20)and immunosuppressed-sepsis group(n=19).The animal model of sepsis was established by cecal ligation and perforation.Peripheral blood of mice in immunocompetent-sepsis group or immunosuppressed-sepsis group were collected at 12 h or 24 h,respectively.Reverse transcription quantitative polymerase chain reaction(RT-qPCR)was used to detect the expression of TRBV7-2 in two groups of mice,and to verify the diagnostic efficacy of TRBV7-2.Finally,the potential active ingredient of TCM was predicted by molecular docking technology.Result A total of 445 DEGs were screened out,among which 173 genes were upregulated and 272 genes were downregulated.Functional enrichment analysis revealed that these DEGs were mainly involved in three signaling pathways:nitrogen metabolism,bile secretion and gastric acid secretion.Additionally,these DEGs were significantly enriched in various biological processes,including the positive regulation of molecular function,cellular response to oxygen-containing compounds,cellular response to nitrogen-containing compounds,cellular response to peptide hormone stimuli,and the regulation of glucose import.In terms of cellular components,the DEGs were notably linked to the plasma membrane protein complex,T-cell receptor complex,and lateral plasma membrane.At the molecular function level,these genes were particularly involved in nucleoside triphosphatase regulator activity,GTPase activator activity,and exogenous protein binding.A total of 56 key genes were screened by WGCNA,and one signature gene was obtained by the intersection of 3 kinds of machine learning:TRBV7-2.ROC curve analysis showed that TRBV7-2 had high clinical diagnostic value and was significantly down-regulated in patients with immunosuppression in sepsis samples.The results of RT-qPCR showed that the expression of TRBV7-2 gene in blood of immunosuppressed-sepsis group was lower than that of immunocompetent-sepsis group.By molecular docking technique,10 potential active ingredients of TCM,such as berberine,baicalin and atractylodin were predicted.Conclusion TRBV7-2 may be used as a diagnostic biomarker of immunosuppression in sepsis,and berberine and the 10 other active ingredients found in TCM may be provide a foundation for reducing the mortality of sepsis.

SepsisImmunosuppressionWeighted gene co-expression network analysisMachine learningSignature geneTargeted traditional Chinese medicine

林文杰、吴慧萍

展开 >

福州市仓山区上渡街道社区卫生服务中心,福建 福州 350007

福建师范大学 数学与统计学院,福建 福州 350007

脓毒症 免疫抑制 加权基因共表达网络分析 机器学习 特征基因 靶向中药

2024

创伤与急诊电子杂志

创伤与急诊电子杂志

影响因子:0.305
ISSN:
年,卷(期):2024.12(1)