
Self-optimizing Single-cell Clustering with Contrastive Learning and Graph Neural Network

Single-cell RNA sequencing (scRNA-seq) performs high-throughput sequencing of the transcriptome at the level of individual cells. Its core application is identifying cell subpopulations with distinct functions, which is usually done by cell clustering. However, the high dimensionality, high noise, and high sparsity of scRNA-seq data make clustering challenging. Conventional clustering methods perform poorly, and most existing single-cell clustering methods consider only gene expression patterns while ignoring the relationships between cells. To address these problems, a self-optimizing single-cell clustering method with contrastive learning and graph neural network (scCLG) is proposed. The method uses an autoencoder to learn the feature distribution of cells. It first constructs a cell-gene graph and encodes it with a graph neural network so that information about the relationships between cells is used effectively. Augmented views obtained by subgraph sampling and feature masking are then used for contrastive learning to further optimize the feature representation. Finally, a self-optimizing strategy jointly trains the clustering module and the feature module, continually refining the feature representation and the cluster centers to achieve more accurate clustering. Experiments on 10 real scRNA-seq datasets show that scCLG learns good representations of cell features and outperforms the other compared methods in clustering accuracy.
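The abstract names the building blocks but not their formulas, so the following is only a minimal sketch under stated assumptions, not the scCLG implementation. It assumes an InfoNCE-style contrastive loss over two feature-masked views and a DEC-style self-training objective (Student's t soft assignments sharpened into a target distribution and compared by KL divergence). The graph neural network encoder and subgraph sampling are omitted, and all function names, the masking rate, and the temperature are hypothetical. It uses only the Python standard library so that it runs without dependencies.

```python
import math
import random


def mask_features(x, mask_rate, rng):
    """Feature masking: zero each entry with probability mask_rate."""
    return [0.0 if rng.random() < mask_rate else v for v in x]


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    if nu == 0.0 or nv == 0.0:
        return 0.0
    return sum(a * b for a, b in zip(u, v)) / (nu * nv)


def info_nce(view_a, view_b, temperature=0.5):
    """Assumed InfoNCE loss: cell i in view_a is the positive of cell i in
    view_b; every other cell in view_b acts as a negative."""
    n = len(view_a)
    total = 0.0
    for i in range(n):
        logits = [cosine(view_a[i], view_b[j]) / temperature for j in range(n)]
        m = max(logits)  # subtract the max for a stable log-sum-exp
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits))
        total += log_denom - logits[i]
    return total / n


def soft_assign(z, centers):
    """Assumed DEC-style soft assignment q_ij (Student's t kernel)."""
    q = []
    for zi in z:
        w = [1.0 / (1.0 + sum((a - b) ** 2 for a, b in zip(zi, c)))
             for c in centers]
        s = sum(w)
        q.append([wk / s for wk in w])
    return q


def target_distribution(q):
    """Sharpened targets: p_ij proportional to q_ij^2 / sum_i q_ij."""
    k = len(q[0])
    f = [sum(row[j] for row in q) for j in range(k)]
    p = []
    for row in q:
        w = [row[j] ** 2 / f[j] for j in range(k)]
        s = sum(w)
        p.append([wj / s for wj in w])
    return p


def kl_divergence(p, q):
    """KL(P || Q) summed over all cells and clusters."""
    return sum(pij * math.log(pij / qij)
               for pr, qr in zip(p, q)
               for pij, qij in zip(pr, qr) if pij > 0.0)


# Toy usage: 6 hypothetical cell embeddings with 8 features each.
rng = random.Random(0)
cells = [[rng.random() for _ in range(8)] for _ in range(6)]
view_a = [mask_features(c, 0.2, rng) for c in cells]
view_b = [mask_features(c, 0.2, rng) for c in cells]
loss_c = info_nce(view_a, view_b)   # contrastive term
centers = [cells[0], cells[3]]      # two arbitrary initial cluster centers
q = soft_assign(cells, centers)
p = target_distribution(q)
loss_k = kl_divergence(p, q)        # self-training clustering term
```

In a joint training setup of the kind the abstract describes, terms like `loss_c` and `loss_k` would typically be combined with a reconstruction loss and minimized together, with `p` recomputed periodically from the current `q`; how scCLG weights and schedules these terms is not stated here.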

Keywords: single-cell RNA sequencing (scRNA-seq); clustering; contrastive learning; graph neural network (GNN); autoencoder

蒋维康、王劲贤


School of Computer Science, Fudan University, Shanghai 200438, China


Funding: National Natural Science Foundation of China (61972100)

2024

Computer Systems & Applications (计算机系统应用)
Institute of Software, Chinese Academy of Sciences

CSTPCD
Impact factor: 0.449
ISSN: 1003-3254
Year, volume (issue): 2024, 33(9)