Identification of sample annotation errors in gene expression datasets


Lohr, Miriam (author): TU Dortmund Univ, Dept Stat, D-44227 Dortmund, Germany.

Hellwig, Birte (author): TU Dortmund Univ, Dept Stat, D-44227 Dortmund, Germany.

Edlund, Karolina (author): Dortmund TU, Leibniz Res Ctr Working Environm & Human Factors, Dortmund, Germany.

Mattsson, Johanna S. M. (author): Uppsala University, Clinical and Experimental Pathology

Botling, Johan (author): Uppsala University, Clinical and Experimental Pathology

Schmidt, Marcus (author): Univ Hosp, Dept Obstet & Gynecol, Mainz, Germany.

Hengstler, Jan G. (author): Dortmund TU, Leibniz Res Ctr Working Environm & Human Factors, Dortmund, Germany.

Micke, Patrick (author): Uppsala University, Clinical and Experimental Pathology

Rahnenfuehrer, Joerg (author): TU Dortmund Univ, Dept Stat, D-44227 Dortmund, Germany.



2015-11-25
2015
English.
In: Archives of Toxicology. Springer Science and Business Media LLC. ISSN 0340-5761, 1432-0738. 89:12, pp. 2265-2272

Related links: https://doi.org/10.1...; https://uu.diva-port... (primary) (Raw object); https://link.springe...; https://urn.kb.se/re...

Journal article (peer-reviewed)

Abstract

The comprehensive transcriptomic analysis of clinically annotated human tissue has found widespread use in oncology, cell biology, immunology, and toxicology. In cancer research, microarray-based gene expression profiling has successfully been applied to subclassify disease entities, predict therapy response, and identify cellular mechanisms. Public accessibility of raw data, together with corresponding information on clinicopathological parameters, offers the opportunity to reuse previously analyzed data and to gain statistical power by combining multiple datasets. However, results and conclusions obviously depend on the reliability of the available information. Here, we propose gene expression-based methods for identifying sample misannotations in public transcriptomic datasets. Sample mix-up can be detected by a classifier that differentiates between samples from male and female patients. Correlation analysis identifies multiple measurements of material from the same sample. The analysis of 45 datasets (including 4913 patients) revealed that erroneous sample annotation, affecting 40 % of the analyzed datasets, may be a more widespread phenomenon than previously thought. Removal of erroneously labelled samples may influence the results of the statistical evaluation in some datasets. Our methods may help to identify individual datasets that contain numerous discrepancies and could be routinely included into the statistical analysis of clinical gene expression data.
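The two checks named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which genes or classifier they used, so the marker genes (XIST, expressed in female cells, and the Y-linked RPS4Y1), the simple comparison rule, the 0.99 correlation threshold, and all expression values below are assumptions made up for illustration.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def predict_sex(xist, rps4y1):
    """Toy rule (assumption): the Y-linked marker dominates in male samples."""
    return "male" if rps4y1 > xist else "female"


def flag_sex_mismatches(samples):
    """Return IDs whose annotated sex disagrees with the expression-based call."""
    return [
        sample_id
        for sample_id, s in samples.items()
        if predict_sex(s["XIST"], s["RPS4Y1"]) != s["annotated_sex"]
    ]


def find_duplicate_pairs(profiles, threshold=0.99):
    """Return sample pairs whose profiles correlate at or above the threshold."""
    return [
        (a, b)
        for a, b in combinations(sorted(profiles), 2)
        if pearson(profiles[a], profiles[b]) >= threshold
    ]


# Hypothetical log-scale expression values, invented for this example.
samples = {
    "S1": {"annotated_sex": "female", "XIST": 9.8, "RPS4Y1": 3.1},
    "S2": {"annotated_sex": "male", "XIST": 3.4, "RPS4Y1": 10.2},
    "S3": {"annotated_sex": "male", "XIST": 9.5, "RPS4Y1": 3.0},  # annotation conflicts with expression
    "S4": {"annotated_sex": "female", "XIST": 10.1, "RPS4Y1": 2.9},
}

# Hypothetical expression profiles over six genes; S4 nearly repeats S1.
profiles = {
    "S1": [5.1, 7.3, 2.2, 9.0, 4.4, 6.8],
    "S2": [6.0, 3.1, 8.2, 4.5, 7.7, 2.9],
    "S3": [4.0, 8.8, 5.5, 3.3, 9.1, 6.2],
    "S4": [5.2, 7.2, 2.3, 9.1, 4.3, 6.9],
}

print(flag_sex_mismatches(samples))
print(find_duplicate_pairs(profiles))
```

In this toy data, S3 is annotated as male but shows a female-like marker pattern, and S1 and S4 have nearly identical profiles, so both checks flag something. On real data, a flagged sample is a candidate for manual review rather than proof of a mix-up, and the threshold would need to be chosen per platform and dataset.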

