Sökning: WFRF:(Schliep Alexander 1967)
> (2010-2014) >
SLIQ: simple linear...
SLIQ: simple linear inequalities for efficient contig scaffolding.
-
Roy, Rajat S (författare)
-
Chen, Kevin C (författare)
-
Sengupta, Anirvan M (författare)
-
visa fler...
-
- Schliep, Alexander, 1967 (författare)
- Gothenburg University,Göteborgs universitet,Institutionen för data- och informationsteknik, datavetenskap (GU),Department of Computer Science and Engineering, Computing Science (GU)
-
visa färre...
-
(creator_code:org_t)
- Mary Ann Liebert Inc, 2012
- 2012
- Engelska.
-
Ingår i: Journal of computational biology : a journal of computational molecular cell biology. - : Mary Ann Liebert Inc. - 1557-8666. ; 19:10, s. 1162-75
- Relaterad länk:
-
http://arxiv.org/pdf...
-
visa fler...
-
https://gup.ub.gu.se...
-
https://doi.org/10.1...
-
visa färre...
Abstract
Ämnesord
Stäng
- Scaffolding is an important subproblem in de novo genome assembly, in which mate pair data are used to construct a linear sequence of contigs separated by gaps. Here we present SLIQ, a set of simple linear inequalities derived from the geometry of contigs on the line that can be used to predict the relative positions and orientations of contigs from individual mate pair reads and thus produce a contig digraph. The SLIQ inequalities can also filter out unreliable mate pairs and can be used as a preprocessing step for any scaffolding algorithm. We tested the SLIQ inequalities on five real data sets ranging in complexity from simple bacterial genomes to complex mammalian genomes and compared the results to the majority voting procedure used by many other scaffolding algorithms. SLIQ predicted the relative positions and orientations of the contigs with high accuracy in all cases and gave more accurate position predictions than majority voting for complex genomes, in particular the human genome. Finally, we present a simple scaffolding algorithm that produces linear scaffolds given a contig digraph. We show that our algorithm is very efficient compared to other scaffolding algorithms while maintaining high accuracy in predicting both contig positions and orientations for real data sets.
Ämnesord
- NATURVETENSKAP -- Data- och informationsvetenskap -- Bioinformatik (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Bioinformatics (hsv//eng)
Nyckelord
- Algorithms
- Contig Mapping
- methods
- Genome
- Human
- Humans
- Sequence Analysis
- DNA
- methods
Publikations- och innehållstyp
- ref (ämneskategori)
- art (ämneskategori)
Hitta via bibliotek
Till lärosätets databas