RPCA-Based Tumor Classification Using Gene Expression Data

  • Jin Xing Liu
  • , Yong Xu
  • , Chun Hou Zheng
  • , Heng Kong
  • , Zhi Hui Lai

Research output: Contribution to journalArticlepeer-review

85 Scopus citations

Abstract

Microarray techniques have been used to delineate cancer groups or to identify candidate genes for cancer prognosis. As such problems can be viewed as classification ones, various classification methods have been applied to analyze or interpret gene expression data. In this paper, we propose a novel method based on robust principal component analysis (RPCA) to classify tumor samples of gene expression data. Firstly, RPCA is utilized to highlight the characteristic genes associated with a special biological process. Then, RPCA and RPCA+LDA (robust principal component analysis and linear discriminant analysis) are used to identify the features. Finally, support vector machine (SVM) is applied to classify the tumor samples of gene expression data based on the identified features. Experiments on seven data sets demonstrate that our methods are effective and feasible for tumor classification.

Original languageEnglish
Article number6998825
Pages (from-to)964-970
Number of pages7
JournalIEEE/ACM Transactions on Computational Biology and Bioinformatics
Volume12
Issue number4
DOIs
StatePublished - 1 Jul 2015
Externally publishedYes

Keywords

  • Classification
  • data mining
  • feature selection
  • principal component analysis
  • sparse method

Fingerprint

Dive into the research topics of 'RPCA-Based Tumor Classification Using Gene Expression Data'. Together they form a unique fingerprint.

Cite this