Abstract
Non-negative Matrix Factorization (NMF) is recognized as one of fundamentally important and highly popular methods for clustering and feature selection, and many related methods have been proposed so far. Nevertheless, their performances, especially on real data, are still unclear due to few studies focusing on their comparison. This study aims at a assessment study of several representative methods from clustering and feature selection, including NMF, GNMF, MD-NMF, L2,1NMF, LNMF, Convex-NMF and Semi-NMF, on the data of the Cancer Genome Atlas (TCGA), which is one of current research hotspot of bioinformatics. Specifically, three data types of four cancers are either separately or integratedly decomposed as the coefficient matrices and the basis matrices by these NMF methods. The coefficient matrices are evaluated by accuracies of clustered samples and the basis matrices are assessed by p-values of selected genes. Experiment results not only show merits and limitations of compared NMF methods, which may provide guidelines for applying them and proposing novel NMF methods, but also reveal several clues for the exploration of related cancers.
| Original language | English |
|---|---|
| Title of host publication | Intelligent Computing Theories and Application - 14th International Conference, ICIC 2018, Proceedings |
| Editors | Kang-Hyun Jo, De-Shuang Huang, Xiao-Long Zhang |
| Publisher | Springer Verlag |
| Pages | 407-418 |
| Number of pages | 12 |
| ISBN (Print) | 9783319959320 |
| DOIs | |
| State | Published - 2018 |
| Externally published | Yes |
| Event | 14th International Conference on Intelligent Computing, ICIC 2018 - Wuhan, China Duration: 15 Aug 2018 → 18 Aug 2018 |
Publication series
| Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
|---|---|
| Volume | 10955 LNCS |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | 14th International Conference on Intelligent Computing, ICIC 2018 |
|---|---|
| Country/Territory | China |
| City | Wuhan |
| Period | 15/08/18 → 18/08/18 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Clustering
- Dimensionality reduction
- Genomic data
- Non-negative Matrix Factorization
Fingerprint
Dive into the research topics of 'Performance Analysis of Non-negative Matrix Factorization Methods on TCGA Data'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver