SwePub
Sök i LIBRIS databas

  Utökad sökning

WFRF:(Nettelblad Carl 1985 )
 

Sökning: WFRF:(Nettelblad Carl 1985 ) > A joint use of pool...

A joint use of pooling and imputation for genotyping SNPs

Clouard, Camille (författare)
Uppsala universitet,Avdelningen för beräkningsvetenskap,Tillämpad beräkningsvetenskap
Ausmees, Kristiina (författare)
Uppsala universitet,Avdelningen för beräkningsvetenskap,Tillämpad beräkningsvetenskap
Nettelblad, Carl, 1985- (författare)
Uppsala universitet,Avdelningen för beräkningsvetenskap,Tillämpad beräkningsvetenskap
 (creator_code:org_t)
2022-10-13
2022
Engelska.
Ingår i: BMC Bioinformatics. - : Springer Nature. - 1471-2105. ; 23
  • Tidskriftsartikel (refereegranskat)
Abstract Ämnesord
Stäng  
  • BackgroundDespite continuing technological advances, the cost for large-scale genotyping of a high number of samples can be prohibitive. The purpose of this study is to design a cost-saving strategy for SNP genotyping. We suggest making use of pooling, a group testing technique, to drop the amount of SNP arrays needed. We believe that this will be of the greatest importance for non-model organisms with more limited resources in terms of cost-efficient large-scale chips and high-quality reference genomes, such as application in wildlife monitoring, plant and animal breeding, but it is in essence species-agnostic. The proposed approach consists in grouping and mixing individual DNA samples into pools before testing these pools on bead-chips, such that the number of pools is less than the number of individual samples. We present a statistical estimation algorithm, based on the pooling outcomes, for inferring marker-wise the most likely genotype of every sample in each pool. Finally, we input these estimated genotypes into existing imputation algorithms. We compare the imputation performance from pooled data with the Beagle algorithm, and a local likelihood-aware phasing algorithm closely modeled on MaCH that we implemented.ResultsWe conduct simulations based on human data from the 1000 Genomes Project, to aid comparison with other imputation studies. Based on the simulated data, we find that pooling impacts the genotype frequencies of the directly identifiable markers, without imputation. We also demonstrate how a combinatorial estimation of the genotype probabilities from the pooling design can improve the prediction performance of imputation models. Our algorithm achieves 93% concordance in predicting unassayed markers from pooled data, thus it outperforms the Beagle imputation model which reaches 80% concordance. We observe that the pooling design gives higher concordance for the rare variants than traditional low-density to high-density imputation commonly used for cost-effective genotyping of large cohorts.ConclusionsWe present promising results for combining a pooling scheme for SNP genotyping with computational genotype imputation on human data. These results could find potential applications in any context where the genotyping costs form a limiting factor on the study size, such as in marker-assisted selection in plant breeding.

Ämnesord

NATURVETENSKAP  -- Data- och informationsvetenskap -- Bioinformatik (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences -- Bioinformatics (hsv//eng)

Nyckelord

Pooling
Imputation
Genotyping
Beräkningsvetenskap
Scientific Computing

Publikations- och innehållstyp

ref (ämneskategori)
art (ämneskategori)

Hitta via bibliotek

Till lärosätets databas

Hitta mer i SwePub

Av författaren/redakt...
Clouard, Camille
Ausmees, Kristii ...
Nettelblad, Carl ...
Om ämnet
NATURVETENSKAP
NATURVETENSKAP
och Data och informa ...
och Bioinformatik
Artiklar i publikationen
BMC Bioinformati ...
Av lärosätet
Uppsala universitet

Sök utanför SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy