SwePub
Tyck till om SwePub Sök här!
Sök i LIBRIS databas

  Extended search

WFRF:(Grabherr Manfred G.)
 

Search: WFRF:(Grabherr Manfred G.) > RNA-sequence data n...

RNA-sequence data normalization through in silico prediction of reference genes : the bacterial response to DNA damage as case study

Berghoff, Bork A. (author)
Justus Liebig Univ, Inst Mikrobiol & Mol Biol, Giessen, Germany.
Karlsson, Torgny (author)
Uppsala universitet,Medicinsk genetik och genomik
Kallman, Thomas (author)
Uppsala universitet,Institutionen för medicinsk biokemi och mikrobiologi,Science for Life Laboratory, SciLifeLab
show more...
Wagner, Gerhart E. H. (author)
Uppsala universitet,Mikrobiologi
Grabherr, Manfred G. (author)
Uppsala universitet,Science for Life Laboratory, SciLifeLab,Institutionen för medicinsk biokemi och mikrobiologi
show less...
Justus Liebig Univ, Inst Mikrobiol & Mol Biol, Giessen, Germany Medicinsk genetik och genomik (creator_code:org_t)
2017-09-05
2017
English.
In: BioData Mining. - : Springer Science and Business Media LLC. - 1756-0381. ; 10
  • Journal article (peer-reviewed)
Abstract Subject headings
Close  
  • Background: Measuring how gene expression changes in the course of an experiment assesses how an organism responds on a molecular level. Sequencing of RNA molecules, and their subsequent quantification, aims to assess global gene expression changes on the RNA level (transcriptome). While advances in high-throughput RNA-sequencing (RNA-seq) technologies allow for inexpensive data generation, accurate post-processing and normalization across samples is required to eliminate any systematic noise introduced by the biochemical and/or technical processes. Existing methods thus either normalize on selected known reference genes that are invariant in expression across the experiment, assume that the majority of genes are invariant, or that the effects of up-and down-regulated genes cancel each other out during the normalization.Results: Here, we present a novel method, moose(2), which predicts invariant genes in silico through a dynamic programming (DP) scheme and applies a quadratic normalization based on this subset. The method allows for specifying a set of known or experimentally validated invariant genes, which guides the DP. We experimentally verified the predictions of this method in the bacterium Escherichia coli, and show how moose(2) is able to (i) estimate the expression value distances between RNA-seq samples, (ii) reduce the variation of expression values across all samples, and (iii) to subsequently reveal new functional groups of genes during the late stages of DNA damage. We further applied the method to three eukaryotic data sets, on which its performance compares favourably to other methods. The software is implemented in C++ and is publicly available from http://grabherr.github.io/moose2/.Conclusions: The proposed RNA-seq normalization method, moose(2), is a valuable alternative to existing methods, with two major advantages: (i) in silico prediction of invariant genes provides a list of potential reference genes for downstream analyses, and (ii) non-linear artefacts in RNA-seq data are handled adequately to minimize variations between replicates.

Subject headings

NATURVETENSKAP  -- Biologi -- Mikrobiologi (hsv//swe)
NATURAL SCIENCES  -- Biological Sciences -- Microbiology (hsv//eng)

Keyword

RNA-seq
Transcriptomics
Normalization
Gene expression
DNA damage
Stress response

Publication and Content Type

ref (subject category)
art (subject category)

Find in a library

To the university's database

Search outside SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Close

Copy and save the link in order to return to this view