TY - GEN
T1 - A New Method for Processing scRNA-seq Data by Coupling Low-Rank Representation and Concept Factorization
AU - Zhang, Zhenduo
AU - Liu, Jinxing
AU - Li, Shengjun
AU - Wang, Juan
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - The advent and development of single-cell RNA sequencing (scRNA-seq) have provided new avenues for exploring cellular heterogeneity. Although many researchers have designed and developed efficient models to address cell heterogeneity and diversity by clustering cells into several groups, the performance of these methods may need improvement due to the characteristics of scRNA-seq data, such as high dimensionality, sparsity, and high dropout rates. In this paper, we propose a new method that couples low-rank representation (LRR) and concept factorization (CF) to learn a better clustering assignment matrix from both global and local perspectives, named SLRRGCF. Specifically, the LRR with similarity constraints based on tired random walk (TRW) can reduce the dimensionality of high-dimensional data while capturing more comprehensive global structure. At the same time, hypergraph regularization and CF are utilized to capture the local structure of the data further and directly obtain the clustering assignment matrix. We evaluated the performance of SLRRGCF on several real datasets, and comparisons with other competitive methods validated the effectiveness of our approach.
AB - The advent and development of single-cell RNA sequencing (scRNA-seq) have provided new avenues for exploring cellular heterogeneity. Although many researchers have designed and developed efficient models to address cell heterogeneity and diversity by clustering cells into several groups, the performance of these methods may need improvement due to the characteristics of scRNA-seq data, such as high dimensionality, sparsity, and high dropout rates. In this paper, we propose a new method that couples low-rank representation (LRR) and concept factorization (CF) to learn a better clustering assignment matrix from both global and local perspectives, named SLRRGCF. Specifically, the LRR with similarity constraints based on tired random walk (TRW) can reduce the dimensionality of high-dimensional data while capturing more comprehensive global structure. At the same time, hypergraph regularization and CF are utilized to capture the local structure of the data further and directly obtain the clustering assignment matrix. We evaluated the performance of SLRRGCF on several real datasets, and comparisons with other competitive methods validated the effectiveness of our approach.
KW - clustering
KW - concept factorization
KW - low-rank representation
KW - single-cell RNA sequencing
KW - tired random walk
UR - https://www.scopus.com/pages/publications/85217276771
U2 - 10.1109/BIBM62325.2024.10822518
DO - 10.1109/BIBM62325.2024.10822518
M3 - 会议稿件
AN - SCOPUS:85217276771
T3 - Proceedings - 2024 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2024
SP - 5574
EP - 5581
BT - Proceedings - 2024 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2024
A2 - Cannataro, Mario
A2 - Zheng, Huiru
A2 - Gao, Lin
A2 - Cheng, Jianlin
A2 - de Miranda, Joao Luis
A2 - Zumpano, Ester
A2 - Hu, Xiaohua
A2 - Cho, Young-Rae
A2 - Park, Taesung
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2024
Y2 - 3 December 2024 through 6 December 2024
ER -