Skip to main navigation Skip to search Skip to main content

scNMF-Impute: imputation for single-cell RNA-seq data based on nonnegative matrix factorization

  • Qufu Normal University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Single-cell RNA sequencing (scRNA-seq) data are collected at an unheard-of rate thanks to the advancement of high-throughput sequencing technologies. However, due to the limitations of current technology, scRNA-seq is sometimes unable to capture the expressed genes, resulting in a large number of zero counts (also known as dropout events) in the data. These dropout events can cause data loss in the gene expression matrix and severely hampers the accuracy of downstream analysis. To address this problem, in this paper, we propose a new imputation method called scNMF-impute. The scNMF-impute method imputes the dropout events and performs dimensionality reduction under the framework of nonnegative matrix factorization (NMF). To effectively identify the location of the dropout and recover the value of the dropout, we explicitly model the dropout events as a matrix. Therefore, the gene expression matrix without dropout is represented as the sum of the original data matrix and the dropout matrix. In addition, to reduce the influence of dropout on factorization, we introduce the similarity information between genes into the NMF model. The introduction of gene similarity information can ensure the accurate recovery of data structures obscured by dropout events in the gene expression matrix. We conducted extensive experiments on simulated datasets and real scRNA-seq datasets to verify the effectiveness of scNMF-impute and other state-of-the-art methods. The results show that scNMF-impute can accurately calculate missing data and restore true gene expression, thus improving the accuracy of existing clustering methods and obtaining more accurate cell clustering results.

Original languageEnglish
Title of host publicationProceedings - 2023 2023 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2023
EditorsXingpeng Jiang, Haiying Wang, Reda Alhajj, Xiaohua Hu, Felix Engel, Mufti Mahmud, Nadia Pisanti, Xuefeng Cui, Hong Song
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages3200-3207
Number of pages8
ISBN (Electronic)9798350337488
DOIs
StatePublished - 2023
Externally publishedYes
Event2023 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2023 - Istanbul, Turkey
Duration: 5 Dec 20238 Dec 2023

Publication series

NameProceedings - 2023 2023 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2023

Conference

Conference2023 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2023
Country/TerritoryTurkey
CityIstanbul
Period5/12/238/12/23

Keywords

  • Dropout event
  • Gene expression
  • Imputation
  • Nonnegative Matrix Factorization
  • scRNA-seq datasets

Fingerprint

Dive into the research topics of 'scNMF-Impute: imputation for single-cell RNA-seq data based on nonnegative matrix factorization'. Together they form a unique fingerprint.

Cite this