Characteristic gene selection via L2,1-norm Sparse Principal Component Analysis

  • Yao Lu
  • , Ying Lian Gao
  • , Jin Xing Liu
  • , Chang Gang Wen
  • , Ya Xuan Wang
  • , Jiguo Yu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

Sparse Principal Component Analysis (SPCA) is a method that can get the sparse loadings of the principal components (PCs), and it may formulate PCA as a regression-type optimization problem by using the elastic net. But the selected features are different with each PC and generally independent. A new method named SPCA has been proposed for removing these detect, which replaces the elastic net with L2,1-norm penalty. The results of the method on gene expression data are still unknown. Therefore, we will take a test to prove this point in this paper. Firstly, this method is applied to the simulated data for obtaining an optimal parameter. Secondly, the L2,1SPCA method is applied to the gene expression data, that is the head and neck squamous carcinoma data (HNSC). Thirdly, the characteristic genes are selected according the PCs. The results consist of very lower P-value and very higher hit count, which shows the method of L2,1SPCA can obtain higher recognition accuracy and higher relevancy to the genes. Finally, the experimental results demonstrate that the L2,1SPCA works well and has good performances in the gene expression data.

Original languageEnglish
Title of host publicationProceedings - 2016 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2016
EditorsKevin Burrage, Qian Zhu, Yunlong Liu, Tianhai Tian, Yadong Wang, Xiaohua Tony Hu, Qinghua Jiang, Jiangning Song, Shinichi Morishita, Kevin Burrage, Guohua Wang
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1828-1833
Number of pages6
ISBN (Electronic)9781509016105
DOIs
StatePublished - 17 Jan 2017
Externally publishedYes
Event2016 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2016 - Shenzhen, China
Duration: 15 Dec 201618 Dec 2016

Publication series

NameProceedings - 2016 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2016

Conference

Conference2016 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2016
Country/TerritoryChina
CityShenzhen
Period15/12/1618/12/16

Keywords

  • Gene expression data
  • L-norm
  • Row-sparse
  • Sparse principal component analysis

Fingerprint

Dive into the research topics of 'Characteristic gene selection via L2,1-norm Sparse Principal Component Analysis'. Together they form a unique fingerprint.

Cite this