Sökning: onr:"swepub:oai:research.chalmers.se:b08e55e7-e74b-4341-a2f8-c092aea1f10c" >
NCAE: data-driven r...
NCAE: data-driven representations using a deep network-coherent DNA methylation autoencoder identify robust disease and risk factor signatures
-
- Martinez, David (författare)
- Linköpings universitet,Bioinformatik,Tekniska fakulteten,Linköping University
-
- Dwivedi, Sanjiv (författare)
- Linköpings universitet,Bioinformatik,Tekniska fakulteten,Linköping University
-
- Jörnsten, Rebecka, 1971 (författare)
- Chalmers tekniska högskola,Chalmers University of Technology,Göteborgs universitet,University of Gothenburg,Chalmers Univ Technol, Sweden
-
visa fler...
-
- Gustafsson, Mika (författare)
- Linköpings universitet,Bioinformatik,Tekniska fakulteten,Linköping University
-
visa färre...
-
(creator_code:org_t)
- OXFORD UNIV PRESS, 2023
- 2023
- Engelska.
-
Ingår i: Briefings in Bioinformatics. - : OXFORD UNIV PRESS. - 1467-5463 .- 1477-4054. ; 24:5
- Relaterad länk:
-
https://research.cha... (primary) (free)
-
visa fler...
-
https://liu.diva-por... (primary) (Raw object)
-
https://doi.org/10.1...
-
https://research.cha...
-
https://urn.kb.se/re...
-
visa färre...
Abstract
Ämnesord
Stäng
- Precision medicine relies on the identification of robust disease and risk factor signatures from omics data. However, current knowledge-driven approaches may overlook novel or unexpected phenomena due to the inherent biases in biological knowledge. In this study, we present a data-driven signature discovery workflow for DNA methylation analysis utilizing network-coherent autoencoders (NCAEs) with biologically relevant latent embeddings. First, we explored the architecture space of autoencoders trained on a large-scale pan-tissue compendium (n = 75 272) of human epigenome-wide association studies. We observed the emergence of co-localized patterns in the deep autoencoder latent space representations that corresponded to biological network modules. We determined the NCAE configuration with the strongest co-localization and centrality signals in the human protein interactome. Leveraging the NCAE embeddings, we then trained interpretable deep neural networks for risk factor (aging, smoking) and disease (systemic lupus erythematosus) prediction and classification tasks. Remarkably, our NCAE embedding-based models outperformed existing predictors, revealing novel DNA methylation signatures enriched in gene sets and pathways associated with the studied condition in each case. Our data-driven biomarker discovery workflow provides a generally applicable pipeline to capture relevant risk factor and disease information. By surpassing the limitations of knowledge-driven methods, our approach enhances the understanding of complex epigenetic processes, facilitating the development of more effective diagnostic and therapeutic strategies.
Ämnesord
- NATURVETENSKAP -- Data- och informationsvetenskap -- Bioinformatik (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Bioinformatics (hsv//eng)
- NATURVETENSKAP -- Biologi -- Bioinformatik och systembiologi (hsv//swe)
- NATURAL SCIENCES -- Biological Sciences -- Bioinformatics and Systems Biology (hsv//eng)
Nyckelord
- DNA methylation
- deep learning
- autoencoders
- biomarkers
- systems medicine
- transfer learning
Publikations- och innehållstyp
- art (ämneskategori)
- ref (ämneskategori)
Hitta via bibliotek
Till lärosätets databas