Abstract
Microarray techniques have been used to delineate cancer groups or to identify candidate genes for cancer prognosis. As such problems can be viewed as classification ones, various classification methods have been applied to analyze or interpret gene expression data. In this paper, we propose a novel method based on robust principal component analysis (RPCA) to classify tumor samples of gene expression data. Firstly, RPCA is utilized to highlight the characteristic genes associated with a special biological process. Then, RPCA and RPCA+LDA (robust principal component analysis and linear discriminant analysis) are used to identify the features. Finally, support vector machine (SVM) is applied to classify the tumor samples of gene expression data based on the identified features. Experiments on seven data sets demonstrate that our methods are effective and feasible for tumor classification.
| Original language | English |
|---|---|
| Article number | 6998825 |
| Pages (from-to) | 964-970 |
| Number of pages | 7 |
| Journal | IEEE/ACM Transactions on Computational Biology and Bioinformatics |
| Volume | 12 |
| Issue number | 4 |
| DOIs | |
| State | Published - 1 Jul 2015 |
| Externally published | Yes |
Keywords
- Classification
- data mining
- feature selection
- principal component analysis
- sparse method