Sökning: onr:"swepub:oai:gup.ub.gu.se/253104" >
CLEVER: clique-enum...
CLEVER: clique-enumerating variant finder.
-
Marschall, Tobias (författare)
-
Costa, Ivan G (författare)
-
Canzar, Stefan (författare)
-
visa fler...
-
Bauer, Markus (författare)
-
Klau, Gunnar W (författare)
-
- Schliep, Alexander, 1967 (författare)
- Gothenburg University,Göteborgs universitet,Institutionen för data- och informationsteknik, datavetenskap (GU),Department of Computer Science and Engineering, Computing Science (GU)
-
Schönhuth, Alexander (författare)
-
visa färre...
-
(creator_code:org_t)
- 2012-10-11
- 2012
- Engelska.
-
Ingår i: Bioinformatics (Oxford, England). - : Oxford University Press (OUP). - 1367-4811 .- 1367-4803. ; 28:22, s. 2875-82
- Relaterad länk:
-
https://academic.oup...
-
visa fler...
-
https://gup.ub.gu.se...
-
https://doi.org/10.1...
-
visa färre...
Abstract
Ämnesord
Stäng
- Next-generation sequencing techniques have facilitated a large-scale analysis of human genetic variation. Despite the advances in sequencing speed, the computational discovery of structural variants is not yet standard. It is likely that many variants have remained undiscovered in most sequenced individuals.Here, we present a novel internal segment size based approach, which organizes all, including concordant, reads into a read alignment graph, where max-cliques represent maximal contradiction-free groups of alignments. A novel algorithm then enumerates all max-cliques and statistically evaluates them for their potential to reflect insertions or deletions. For the first time in the literature, we compare a large range of state-of-the-art approaches using simulated Illumina reads from a fully annotated genome and present relevant performance statistics. We achieve superior performance, in particular, for deletions or insertions (indels) of length 20-100 nt. This has been previously identified as a remaining major challenge in structural variation discovery, in particular, for insert size based approaches. In this size range, we even outperform split-read aligners. We achieve competitive results also on biological data, where our method is the only one to make a substantial amount of correct predictions, which, additionally, are disjoint from those by split-read aligners.CLEVER is open source (GPL) and available from http://clever-sv.googlecode.com.as@cwi.nl or tm@cwi.nl.Supplementary data are available at Bioinformatics online.
Ämnesord
- NATURVETENSKAP -- Data- och informationsvetenskap -- Bioinformatik (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Bioinformatics (hsv//eng)
Nyckelord
- Algorithms
- Computer Simulation
- Genetic Variation
- Genome
- Human
- Humans
- INDEL Mutation
Publikations- och innehållstyp
- ref (ämneskategori)
- art (ämneskategori)
Hitta via bibliotek
Till lärosätets databas