SwePub
Sök i LIBRIS databas

  Extended search

onr:"swepub:oai:gup.ub.gu.se/253133"
 

Search: onr:"swepub:oai:gup.ub.gu.se/253133" > Clustering cancer g...

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Clustering cancer gene expression data: a comparative study.

de Souto, Marcilio C P (author)
Costa, Ivan G (author)
de Araujo, Daniel S A (author)
show more...
Ludermir, Teresa B (author)
Schliep, Alexander, 1967 (author)
Gothenburg University,Göteborgs universitet,Institutionen för data- och informationsteknik, datavetenskap (GU),Department of Computer Science and Engineering, Computing Science (GU)
show less...
 (creator_code:org_t)
2008-11-27
2008
English.
In: BMC bioinformatics. - : Springer Science and Business Media LLC. - 1471-2105. ; 9
  • Journal article (peer-reviewed)
Abstract Subject headings
Close  
  • The use of clustering methods for the discovery of cancer subtypes has drawn a great deal of attention in the scientific community. While bioinformaticians have proposed new clustering methods that take advantage of characteristics of the gene expression data, the medical community has a preference for using "classic" clustering methods. There have been no studies thus far performing a large-scale evaluation of different clustering methods in this context.We present the first large-scale analysis of seven different clustering methods and four proximity measures for the analysis of 35 cancer gene expression data sets. Our results reveal that the finite mixture of Gaussians, followed closely by k-means, exhibited the best performance in terms of recovering the true structure of the data sets. These methods also exhibited, on average, the smallest difference between the actual number of classes in the data sets and the best number of clusters as indicated by our validation criteria. Furthermore, hierarchical methods, which have been widely used by the medical community, exhibited a poorer recovery performance than that of the other methods evaluated. Moreover, as a stable basis for the assessment and comparison of different clustering methods for cancer gene expression data, this study provides a common group of data sets (benchmark data sets) to be shared among researchers and used for comparisons with new methods. The data sets analyzed in this study are available at http://algorithmics.molgen.mpg.de/Supplements/CompCancer/.

Subject headings

NATURVETENSKAP  -- Data- och informationsvetenskap -- Bioinformatik (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences -- Bioinformatics (hsv//eng)

Keyword

Algorithms
Cluster Analysis
Computational Biology
methods
DNA
Complementary
metabolism
Gene Expression Profiling
Gene Expression Regulation
Neoplastic
Genes
Neoplasm
Humans
Models
Biological
Models
Statistical
Multigene Family
Neoplasms
diagnosis
genetics
Normal Distribution
Oligonucleotide Array Sequence Analysis
Pattern Recognition
Automated
methods

Publication and Content Type

ref (subject category)
art (subject category)

Find in a library

To the university's database

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Search outside SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Close

Copy and save the link in order to return to this view