DSTPCA: Double-Sparse Constrained Tensor Principal Component Analysis Method for Feature Selection

Research output: Contribution to journalArticlepeer-review

17 Scopus citations

Abstract

The identification of differentially expressed genes plays an increasingly important role biologically. Therefore, the feature selection approach has attracted much attention in the field of bioinformatics. The most popular method of principal component analysis studies two-dimensional data without considering the spatial geometric structure of the data. The recently proposed tensor robust principal component analysis method performs sparse and low-rank decomposition on three-dimensional tensors and effectively preserves the spatial structure. Based on this approach, the L_{2,1}L2,1- norm regularization term is introduced into the DSTPCA (Double-Sparse Constrained Tensor Principal Component Analysis) method. The DSTPCA method removes the redundant noise by double sparse constraints on the objective function to obtain sufficiently sparse results. After the regularization norm is introduced into the model, the ADMM (alternating direction method of multipliers) algorithm is used to solve the optimal problem. In the experiment of feature selection, while the more redundant genes were filtered out, the more genes closely associated with disease were screened. Experimental results using different datasets indicate that our method outperforms other methods.

Original languageEnglish
Article number8847348
Pages (from-to)1481-1491
Number of pages11
JournalIEEE/ACM Transactions on Computational Biology and Bioinformatics
Volume18
Issue number4
DOIs
StatePublished - 1 Jul 2021
Externally publishedYes

Keywords

  • Differentially expressed genes
  • feature selection
  • sparse constraint
  • tensor principal component analysis

Fingerprint

Dive into the research topics of 'DSTPCA: Double-Sparse Constrained Tensor Principal Component Analysis Method for Feature Selection'. Together they form a unique fingerprint.

Cite this