A deep learning framework for characterization of genotype data

↓ Direkt till sidans innehåll
↓ Direkt till sidans sekundära innehåll (sidomenyn)

Search: onr:"swepub:oai:DiVA.org:uu-470290" > A deep learning fra...

1 of 1
Previous record
Next record
To hitlist

A deep learning framework for characterization of genotype data

Ausmees, Kristiina (author): Uppsala universitet,Avdelningen för beräkningsvetenskap,Tillämpad beräkningsvetenskap,Science for Life Laboratory, SciLifeLab

Nettelblad, Carl, 1985- (author): Uppsala universitet,Avdelningen för beräkningsvetenskap,Tillämpad beräkningsvetenskap,Science for Life Laboratory, SciLifeLab

(creator_code:org_t)

2022-01-25
2022
English.
In: G3. - : Oxford University Press (OUP). - 2160-1836. ; 12:3

Related links:: https://doi.org/10.1...; show more...; https://uu.diva-port... (primary) (Raw object); https://academic.oup...; https://urn.kb.se/re...; https://doi.org/10.1...; show less...

Journal article (peer-reviewed)

Abstract Subject headings

Dimensionality reduction is a data transformation technique widely used in various fields of genomics research. The application of dimensionality reduction to genotype data is known to capture genetic similarity between individuals, and is used for visualization of genetic variation, identification of population structure as well as ancestry mapping. Among frequently used methods are principal component analysis, which is a linear transform that often misses more fine-scale structures, and neighbor-graph based methods which focus on local relationships rather than large-scale patterns. Deep learning models are a type of nonlinear machine learning method in which the features used in data transformation are decided by the model in a data-driven manner, rather than by the researcher, and have been shown to present a promising alternative to traditional statistical methods for various applications in omics research. In this study, we propose a deep learning model based on a convolutional autoencoder architecture for dimensionality reduction of genotype data. Using a highly diverse cohort of human samples, we demonstrate that the model can identify population clusters and provide richer visual information in comparison to principal component analysis, while preserving global geometry to a higher extent than t-SNE and UMAP, yielding results that are comparable to an alternative deep learning approach based on variational autoencoders. We also discuss the use of the methodology for more general characterization of genotype data, showing that it preserves spatial properties in the form of decay of linkage disequilibrium with distance along the genome and demonstrating its use as a genetic clustering method, comparing results to the ADMIXTURE software frequently used in population genetic studies.

Subject headings

NATURVETENSKAP -- Data- och informationsvetenskap -- Bioinformatik (hsv//swe)
NATURAL SCIENCES -- Computer and Information Sciences -- Bioinformatics (hsv//eng)
NATURVETENSKAP -- Matematik -- Beräkningsmatematik (hsv//swe)
NATURAL SCIENCES -- Mathematics -- Computational Mathematics (hsv//eng)
NATURVETENSKAP -- Biologi -- Genetik (hsv//swe)
NATURAL SCIENCES -- Biological Sciences -- Genetics (hsv//eng)

Keyword

deep learning
convolutional autoencoder
dimensionality reduction
genetic clustering
population genetics
Beräkningsvetenskap
Scientific Computing

Publication and Content Type

ref (subject category)
art (subject category)

Find in a library

G3 (Search for host publication in LIBRIS)

To the university's database

1 of 1
Previous record
Next record
To hitlist

Find more in SwePub

By the author/editor: Ausmees, Kristii ...; Nettelblad, Carl ...

About the subject

NATURAL SCIENCES: NATURAL SCIENCES; and Computer and Inf ...; and Bioinformatics

NATURAL SCIENCES: NATURAL SCIENCES; and Mathematics; and Computational Ma ...

NATURAL SCIENCES: NATURAL SCIENCES; and Biological Scien ...; and Genetics

Articles in the publication: G3

By the university: Uppsala University

Search outside SwePub

Extend your search to:: Google; Google Book Search; Google Scholar

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

LIBRIS.kb.se