Citation: LIAO Minghui, LUO Fulin, DU Bo. Self-supervised Low-pass Filted Graph Clustering Networks for Single Cell RNA Sequencing Data[J]. Geomatics and Information Science of Wuhan University. DOI: 10.13203/j.whugis20220108

Self-supervised Low-pass Filted Graph Clustering Networks for Single Cell RNA Sequencing Data

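  • Abstract (Chinese version): Single-cell RNA sequencing (scRNA-seq), a technology that has emerged in recent years, can measure the transcriptome expression of each individual cell. With scRNA-seq data, single cells that share similar biological states or functions can be clustered into the same cell population, which guides downstream biological analysis. To address the complexity, high dimensionality, and heavy noise of scRNA-seq data, a Self-supervised Low-pass Filted Graph Clustering Network (SLFGCN) algorithm is proposed for clustering scRNA-seq data. The method first constructs a low-pass filtered graph convolutional network, building the graph structure with cells as nodes. After the graph information in the spectral domain passes through the low-pass filtering graph convolution, a smoother graph signal is obtained; that is, cells in the same cluster acquire more similar node features, which benefits the clustering of scRNA-seq data. A self-supervised module is then built on a graph auto-encoder model to optimize the model and further improve the clustering results. Comparative experiments against related algorithms on scRNA-seq data show that the proposed method better captures the intrinsic features of single-cell RNA expression data and improves clustering performance.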

     

    Abstract: Single-cell RNA sequencing (scRNA-seq) provides a high-resolution, cell-level view of biological processes such as embryonic development, cancer evolution, and cell differentiation. A key step in using scRNA-seq data is to cluster cells with similar biological functions into one group. However, current clustering methods perform poorly on large volumes of high-dimensional, complex scRNA-seq data, and they do not use the structural relationships between samples. Here, we propose a GCN-based deep clustering framework named Self-supervised Low-pass Filted Graph Clustering Networks (SLFGCN). First, we propose a new propagation method for graph convolutional networks (GCNs): the graph information in the spectral domain passes through the frequency response function of a low-pass filter, yielding a smoother node feature representation that is more conducive to clustering. Second, a self-supervised module optimizes the network using both the representation learned by the low-pass filtered GCN module and the representation learned by the graph auto-encoder module, which improves the clustering results. Experiments on real datasets indicate that our model outperforms state-of-the-art methods on various evaluation metrics. Furthermore, visualization results show that the representations learned by our model yield better intra-cluster compactness and inter-cluster separability.
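
    The abstract does not give the filter itself, so the following is only a minimal sketch of the general idea of low-pass filtering on a cell graph, not the SLFGCN propagation rule. It assumes a k-nearest-neighbour graph over cells, the symmetric normalized Laplacian L, and the filter (I - L/2)^order, whose frequency response (1 - λ/2)^order decreases over the spectrum λ ∈ [0, 2]. All function names, the synthetic data, and the parameter values are hypothetical.

```python
import numpy as np

def knn_adjacency(X, k):
    """Symmetric k-nearest-neighbour adjacency (Euclidean) with self-loops."""
    n = X.shape[0]
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T   # squared distances
    np.fill_diagonal(d2, np.inf)                     # exclude self from neighbours
    idx = np.argsort(d2, axis=1)[:, :k]              # k closest cells per row
    A = np.zeros((n, n))
    A[np.repeat(np.arange(n), k), idx.ravel()] = 1.0
    A = np.maximum(A, A.T)                           # make the graph undirected
    np.fill_diagonal(A, 1.0)                         # self-loops keep degrees > 0
    return A

def normalized_laplacian(A):
    """L = I - D^{-1/2} A D^{-1/2}; its eigenvalues lie in [0, 2]."""
    d_inv_sqrt = 1.0 / np.sqrt(A.sum(axis=1))
    return np.eye(A.shape[0]) - d_inv_sqrt[:, None] * A * d_inv_sqrt[None, :]

def low_pass_filter(X, L, order=2):
    """Apply (I - L/2)^order to the node features (assumed filter, see lead-in)."""
    G = np.eye(L.shape[0]) - 0.5 * L
    H = X.copy()
    for _ in range(order):
        H = G @ H
    return H

def dirichlet_energy(H, L):
    """trace(H^T L H): smaller values mean a smoother signal on the graph."""
    return float(np.trace(H.T @ L @ H))

# Synthetic stand-in for an expression matrix: 40 "cells" x 10 "genes",
# drawn from two shifted Gaussian groups (illustrative data only).
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 1.0, (20, 10)), rng.normal(3.0, 1.0, (20, 10))])

A = knn_adjacency(X, k=5)
L = normalized_laplacian(A)
H = low_pass_filter(X, L, order=2)

print("energy before:", dirichlet_energy(X, L))
print("energy after: ", dirichlet_energy(H, L))
```

    Under these assumptions the filtered features have lower Dirichlet energy than the input, which is one way to quantify the "smoother node feature representation" described above. A clustering step such as k-means could then be run on `H`.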

     
