Abstract
The identification of differentially expressed genes plays an increasingly important role biologically. Therefore, the feature selection approach has attracted much attention in the field of bioinformatics. The most popular method of principal component analysis studies two-dimensional data without considering the spatial geometric structure of the data. The recently proposed tensor robust principal component analysis method performs sparse and low-rank decomposition on three-dimensional tensors and effectively preserves the spatial structure. Based on this approach, the L_{2,1}L2,1- norm regularization term is introduced into the DSTPCA (Double-Sparse Constrained Tensor Principal Component Analysis) method. The DSTPCA method removes the redundant noise by double sparse constraints on the objective function to obtain sufficiently sparse results. After the regularization norm is introduced into the model, the ADMM (alternating direction method of multipliers) algorithm is used to solve the optimal problem. In the experiment of feature selection, while the more redundant genes were filtered out, the more genes closely associated with disease were screened. Experimental results using different datasets indicate that our method outperforms other methods.
| Original language | English |
|---|---|
| Article number | 8847348 |
| Pages (from-to) | 1481-1491 |
| Number of pages | 11 |
| Journal | IEEE/ACM Transactions on Computational Biology and Bioinformatics |
| Volume | 18 |
| Issue number | 4 |
| DOIs | |
| State | Published - 1 Jul 2021 |
| Externally published | Yes |
Keywords
- Differentially expressed genes
- feature selection
- sparse constraint
- tensor principal component analysis
Fingerprint
Dive into the research topics of 'DSTPCA: Double-Sparse Constrained Tensor Principal Component Analysis Method for Feature Selection'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver