SwePub
Sök i LIBRIS databas

  Utökad sökning

WFRF:(Torres Maria Fernanda)
 

Sökning: WFRF:(Torres Maria Fernanda) > (2021) > A bioinformatic pla...

A bioinformatic platform to integrate target capture and whole genome sequences of various read depths for phylogenomics

Ribeiro, P. G. (författare)
Torres Jimenez, Maria Fernanda (författare)
Gothenburg University,Göteborgs universitet,Institutionen för biologi och miljövetenskap,Department of Biological and Environmental Sciences
Andermann, Tobias (författare)
Gothenburg University,Göteborgs universitet,Institutionen för biologi och miljövetenskap,Department of Biological and Environmental Sciences
visa fler...
Antonelli, Alexandre, 1978 (författare)
Gothenburg University,Göteborgs universitet,Institutionen för biologi och miljövetenskap,Department of Biological and Environmental Sciences
Bacon, Christine D. (författare)
Gothenburg University,Göteborgs universitet,Institutionen för biologi och miljövetenskap,Department of Biological and Environmental Sciences
Matos-Maravi, P. (författare)
visa färre...
 (creator_code:org_t)
2021-10-31
2021
Engelska.
Ingår i: Molecular Ecology. - : Wiley. - 0962-1083 .- 1365-294X. ; 30:23, s. 6021-6035
  • Tidskriftsartikel (refereegranskat)
Abstract Ämnesord
Stäng  
  • The increasing availability of short-read whole genome sequencing (WGS) provides unprecedented opportunities to study ecological and evolutionary processes. Although loci of interest can be extracted from WGS data and combined with target sequence data, this requires suitable bioinformatic workflows. Here, we test different assembly and locus extraction strategies and implement them into secapr, a pipeline that processes short-read data into multilocus alignments for phylogenetics and molecular ecology analyses. We integrate the processing of data from low-coverage WGS (<30x) and target sequence capture into a flexible framework, while optimizing de novo contig assembly and loci extraction. Specifically, we test different assembly strategies by contrasting their ability to recover loci from targeted butterfly protein-coding genes, using four data sets: a WGS data set across different average coverages (10x, 5x and 2x) and a data set for which these loci were enriched prior to sequencing via target sequence capture. Using the resulting de novo contigs, we account for potential errors within contigs and infer phylogenetic trees to evaluate the ability of each assembly strategy to recover species relationships. We demonstrate that choosing multiple sizes of kmer simultaneously for assembly results in the highest yield of extracted loci from de novo assembled contigs, while data sets derived from sequencing read depths as low as 5x recovers the expected species relationships in phylogenetic trees. By making the tested assembly approaches available in the secapr pipeline, we hope to inspire future studies to incorporate complementary data and make an informed choice on the optimal assembly strategy.

Ämnesord

NATURVETENSKAP  -- Geovetenskap och miljövetenskap (hsv//swe)
NATURAL SCIENCES  -- Earth and Related Environmental Sciences (hsv//eng)

Nyckelord

de novo assembly
loci extraction
low-coverage whole genome sequencing
secapr
target sequence capture
selection
abyss
Biochemistry & Molecular Biology
Environmental Sciences & Ecology
Evolutionary Biology

Publikations- och innehållstyp

ref (ämneskategori)
art (ämneskategori)

Hitta via bibliotek

Till lärosätets databas

Sök utanför SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy