Visualization and Analysis of Single Cell RNA-Seq Data by Maximizing Correntropy Based Non-Negative Low Rank Representation

  • Cui Na Jiao
  • , Jin Xing Liu
  • , Juan Wang
  • , Junliang Shang
  • , Chun Hou Zheng

Research output: Contribution to journalArticlepeer-review

10 Scopus citations

Abstract

The exploration of single cell RNA-sequencing (scRNA-seq) technology generates a new perspective to analyze biological problems. One of the major applications of scRNA-seq data is to discover subtypes of cells by cell clustering. Nevertheless, it is challengeable for traditional methods to handle scRNA-seq data with high level of technical noise and notorious dropouts. To better analyze single cell data, a novel scRNA-seq data analysis model called Maximum correntropy criterion based Non-negative and Low Rank Representation (MccNLRR) is introduced. Specifically, the maximum correntropy criterion, as an effective loss function, is more robust to the high noise and large outliers existed in the data. Moreover, the low rank representation is proven to be a powerful tool for capturing the global and local structures of data. Therefore, some important information, such as the similarity of cells in the subspace, is also extracted by it. Then, an iterative algorithm on the basis of the half-quadratic optimization and alternating direction method is developed to settle the complex optimization problem. Before the experiment, we also analyze the convergence and robustness of MccNLRR. At last, the results of cell clustering, visualization analysis, and gene markers selection on scRNA-seq data reveal that MccNLRR method can distinguish cell subtypes accurately and robustly.

Original languageEnglish
Pages (from-to)1872-1882
Number of pages11
JournalIEEE Journal of Biomedical and Health Informatics
Volume26
Issue number4
DOIs
StatePublished - 1 Apr 2022
Externally publishedYes

Keywords

  • Low rank representation
  • clustering
  • correntropy
  • gene markers
  • single cell RNA-sequencing

Fingerprint

Dive into the research topics of 'Visualization and Analysis of Single Cell RNA-Seq Data by Maximizing Correntropy Based Non-Negative Low Rank Representation'. Together they form a unique fingerprint.

Cite this