SwePub
Sök i LIBRIS databas

  Utökad sökning

L773:1545 5963 OR L773:1557 9964
 

Sökning: L773:1545 5963 OR L773:1557 9964 > Safety in Multi-Ass...

Safety in Multi-Assembly via Paths Appearing in All Path Covers of a DAG

Cáceres, Manuel (författare)
Mumey, Brendan (författare)
Husić, Edin (författare)
visa fler...
Rizzi, Romeo (författare)
Cairo, Massimo (författare)
Sahlin, Kristoffer (författare)
Stockholms universitet,Matematiska institutionen,Science for Life Laboratory (SciLifeLab)
Tomescu, Alexandru I. (författare)
visa färre...
 (creator_code:org_t)
Institute of Electrical and Electronics Engineers (IEEE), 2022
2022
Engelska.
Ingår i: IEEE/ACM Transactions on Computational Biology & Bioinformatics. - : Institute of Electrical and Electronics Engineers (IEEE). - 1545-5963 .- 1557-9964 .- 2374-0043. ; 19:6, s. 3673-3684
  • Tidskriftsartikel (refereegranskat)
Abstract Ämnesord
Stäng  
  • A multi-assembly problem asks to reconstruct multiple genomic sequences from mixed reads sequenced from all of them. Standard formulations of such problems model a solution as a path cover in a directed acyclic graph, namely a set of paths that together cover all vertices of the graph. Since multi-assembly problems admit multiple solutions in practice, we consider an approach commonly used in standard genome assembly: output only partial solutions ( contigs , or safe paths ), that appear in all path cover solutions. We study constrained path covers, a restriction on the path cover solution that incorporate practical constraints arising in multi-assembly problems. We give efficient algorithms finding all maximal safe paths for constrained path covers. We compute the safe paths of splicing graphs constructed from transcript annotations of different species. Our algorithms run in less than 15 seconds per species and report RNA contigs that are over 99% precise and are up to 8 times longer than unitigs. Moreover, RNA contigs cover over 70% of the transcripts and their coding sequences in most cases. With their increased length to unitigs, high precision, and fast construction time, maximal safe paths can provide a better base set of sequences for transcript assembly programs.

Ämnesord

NATURVETENSKAP  -- Data- och informationsvetenskap -- Bioinformatik (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences -- Bioinformatics (hsv//eng)

Nyckelord

Graph algorithms
Network problems
Analysis of Algorithms and Problem Complexity
Biology and genetics
datalogi
Computer Science

Publikations- och innehållstyp

ref (ämneskategori)
art (ämneskategori)

Hitta via bibliotek

Till lärosätets databas

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy