Sökning: id:"swepub:oai:DiVA.org:su-147151" >
Bioinformatic proce...
Bioinformatic processing of RAD-seq data dramatically impacts downstream population genetic inference
-
- Shafer, Aaron B. A. (författare)
- Uppsala universitet,Evolutionsbiologi,Trent Univ, Forens Sci & Environm & Life Sci, 2014 East Bank Dr, Peterborough, ON K9J 7B8, Canada.,Univ Lausanne, Dept Ecol & Evolut, CH-1015 Lausanne, Switzerland.
-
- Peart, Claire R. (författare)
- Uppsala universitet,Evolutionsbiologi
-
- Tusso, Sergio (författare)
- Uppsala universitet,Evolutionsbiologi
-
visa fler...
-
- Maayan, Inbar (författare)
- Uppsala universitet,Evolutionsbiologi
-
Brelsford, Alan (författare)
-
- Wheat, Christopher W. (författare)
- Stockholms universitet,Zoologiska institutionen,Stockholm Univ, Dept Zool, S-10691 Stockholm, Sweden.
-
- Wolf, Jochen B. W. (författare)
- Uppsala universitet,Evolutionsbiologi,Ludwig Maximilians Univ Munchen, Div Evolutionary Biol, Fac Biol, Grosshaderner Str 2, D-82152 Planegg Martinsried, Germany.
-
visa färre...
-
(creator_code:org_t)
- 2017
- 2017
- Engelska.
-
Ingår i: Methods in Ecology and Evolution. - 2041-210X. ; 8:8, s. 907-917
- Relaterad länk:
-
https://urn.kb.se/re...
-
visa fler...
-
https://doi.org/10.1...
-
https://urn.kb.se/re...
-
visa färre...
Abstract
Ämnesord
Stäng
- 1. Restriction site-associated DNA sequencing (RAD-seq) provides high-resolution population genomic data at low cost, and has become an important component in ecological and evolutionary studies. As with all high-throughput technologies, analytic strategies require critical validation to ensure precise and unbiased interpretation. 2. To test the impact of bioinformatic data processing on downstream population genetic inferences, we analysed mammalian RAD-seq data (>100 individuals) with 312 combinations of methodology (de novo vs. mapping to references of increasing divergence) and filtering criteria (missing data, HWE, F-IS, coverage, mapping and genotype quality). In an effort to identify commonalities and biases in all pipelines, we computed summary statistics (nr. loci, nr. SNP, pi, Het(obs), F-IS, F-ST, N-e and m) and compared the results to independent null expectations (isolation-by-distance correlation, expected transition-to-transversion ratio T-s/T-v and Mendelian mismatch rates of known parent-offspring trios). 3. We observed large differences between reference-based and de novo approaches, the former generally calling more SNPs and reducing F-IS and T-s/T-v. Data completion levels showed little impact on most summary statistics, and FST estimates were robust across all pipelines. The site frequency spectrum was highly sensitive to the chosen approach as reflected in large variance of parameter estimates across demographic scenarios (single-population bottlenecks and isolation-with-migration model). Null expectations were best met by reference-based approaches, although contingent on the specific criteria. 4. We recommend that RAD-seq studies employ reference-based approaches to a closely related genome, and due to the high stochasticity associated with the pipeline advocate the use of multiple pipelines to ensure robust population genetic and demographic inferences.
Ämnesord
- NATURVETENSKAP -- Biologi (hsv//swe)
- NATURAL SCIENCES -- Biological Sciences (hsv//eng)
- NATURVETENSKAP -- Biologi -- Evolutionsbiologi (hsv//swe)
- NATURAL SCIENCES -- Biological Sciences -- Evolutionary Biology (hsv//eng)
Nyckelord
- bioinformatics
- GBS
- genotyping by sequencing
- population genetics
- RAD-seq
Publikations- och innehållstyp
- ref (ämneskategori)
- art (ämneskategori)
Hitta via bibliotek
Till lärosätets databas