首页|Hierarchical learning of gastric cancer molecular subtypes by integrating multi-modal DNA-level omics data and clinical stratification

Hierarchical learning of gastric cancer molecular subtypes by integrating multi-modal DNA-level omics data and clinical stratification

扫码查看
Molecular subtyping of gastric cancer(GC)aims to comprehend its genetic landscape.However,the efficacy of current subtyping methods is hampered by their mixed use of molecular features,a lack of strategy optimization,and the limited availability of public GC datasets.There is a pressing need for a precise and easily adoptable subtyping approach for early DNA-based screening and treatment.Based on TCGA subtypes,we developed a novel DNA-based hierarchical classifier for gastric cancer molecular subtyping(HCG),which employs gene mutations,copy number aberrations,and methylation patterns as predictors.By incorporating the closely related esophageal adenocarcinomas dataset,we expanded the TCGA GC dataset for the training and testing of HCG(n=453).The optimization of HCG was achieved through three hierarchical strategies using Lasso-Logistic regres-sion,evaluated by their overall the area under receiver operating character-istic curve(auROC),accuracy,F1 score,the area under precision-recall curve(auPRC)and their capability for clinical stratification using multivariate survival analysis.Subtype-specific DNA alteration biomarkers were dis-cerned through difference tests based on HCG defined subtypes.Our HCG classifier demonstrated superior performance in terms of overall auROC(0.95),accuracy(0.88),F1 score(0.87)and auPRC(0.86),significantly improving the clinical stratification of patients(overall p-value=0.032).Dif-ference tests identified 25 subtype-specific DNA alterations,including a high mutation rate in the SYNE1,ITGB4,and COL22A1 genes forthe MSI subtype,and hypermethylation of ALS2CL,KIAA0406,and RPRD1B genes for the EBV subtype.HCG is an accurate and robust classifier for DNA-based GC molecular subtyping with highly predictive clinical stratification performance.The training and test datasets,along with the analysis programs of HCG,are accessible on the GitHub website(github.com/LabxSCUT).

DNA alterationsgastric cancerhierarchical classificationmolecular subtypingmulti-omics

Binyu Yang、Siying Liu、Jiemin Xie、Xi Tang、Pan Guan、Yifan Zhu、Xuemei Liu、Yunhui Xiong、Zuli Yang、Weiyao Li、Yonghua Wang、Wen Chen、Qingjiao Li、Li C.Xia

展开 >

Department of Statistics and Financial Mathematics,School of Mathematics,South China University of Technology,Guangzhou,China

School of Physics and Optoelectronics,South China University of Technology,Guangzhou,China

Department of Pathology,The Sixth Affiliated Hospital,Sun Yat-sen University,Guangzhou,China

School of Food Science and Engineering,South China University of Technology,Guangzhou,China

Department of Laboratory Medicine,the Eighth Affiliated Hospital,Sun Yat-sen University,Shenzhen,China

展开 >

2024

定量生物学(英文版)

定量生物学(英文版)

ISSN:
年,卷(期):2024.12(2)