Performance Analysis of Non-negative Matrix Factorization Methods on TCGA Data

  • Mi Xiao Hou
  • , Jin Xing Liu
  • , Junliang Shang
  • , Ying Lian Gao
  • , Xiang Zhen Kong
  • , Ling Yun Dai

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Non-negative Matrix Factorization (NMF) is recognized as one of fundamentally important and highly popular methods for clustering and feature selection, and many related methods have been proposed so far. Nevertheless, their performances, especially on real data, are still unclear due to few studies focusing on their comparison. This study aims at a assessment study of several representative methods from clustering and feature selection, including NMF, GNMF, MD-NMF, L2,1NMF, LNMF, Convex-NMF and Semi-NMF, on the data of the Cancer Genome Atlas (TCGA), which is one of current research hotspot of bioinformatics. Specifically, three data types of four cancers are either separately or integratedly decomposed as the coefficient matrices and the basis matrices by these NMF methods. The coefficient matrices are evaluated by accuracies of clustered samples and the basis matrices are assessed by p-values of selected genes. Experiment results not only show merits and limitations of compared NMF methods, which may provide guidelines for applying them and proposing novel NMF methods, but also reveal several clues for the exploration of related cancers.

Original languageEnglish
Title of host publicationIntelligent Computing Theories and Application - 14th International Conference, ICIC 2018, Proceedings
EditorsKang-Hyun Jo, De-Shuang Huang, Xiao-Long Zhang
PublisherSpringer Verlag
Pages407-418
Number of pages12
ISBN (Print)9783319959320
DOIs
StatePublished - 2018
Externally publishedYes
Event14th International Conference on Intelligent Computing, ICIC 2018 - Wuhan, China
Duration: 15 Aug 201818 Aug 2018

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10955 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference14th International Conference on Intelligent Computing, ICIC 2018
Country/TerritoryChina
CityWuhan
Period15/08/1818/08/18

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Clustering
  • Dimensionality reduction
  • Genomic data
  • Non-negative Matrix Factorization

Fingerprint

Dive into the research topics of 'Performance Analysis of Non-negative Matrix Factorization Methods on TCGA Data'. Together they form a unique fingerprint.

Cite this