SwePub
Sök i LIBRIS databas

  Extended search

onr:"swepub:oai:research.chalmers.se:4cef4e85-a85e-4849-b036-2fe14a92e7c1"
 

Search: onr:"swepub:oai:research.chalmers.se:4cef4e85-a85e-4849-b036-2fe14a92e7c1" > Adjusting for covar...

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Adjusting for covariates and assessing modeling fitness in machine learning using MUVR2

Yan, Yingxiao, 1997 (author)
Chalmers tekniska högskola,Chalmers University of Technology
Schillemans, Tessa (author)
Karolinska Institutet
Skantze, Viktor (author)
Stiftelsen Fraunhofer-Chalmers Centrum för Industrimatematik (FCC),Fraunhofer-Chalmers Research Centre for Industrial Mathematics (FCC)
show more...
Brunius, Carl, 1974 (author)
Chalmers tekniska högskola,Chalmers University of Technology
show less...
 (creator_code:org_t)
2024
2024
English.
In: Bioinformatics Advances. - 2635-0041. ; 4:1
  • Journal article (peer-reviewed)
Abstract Subject headings
Close  
  • Motivation: Machine learning (ML) methods are frequently used in Omics research to examine associations between molecular data and for example exposures and health conditions. ML is also used for feature selection to facilitate biological interpretation. Our previous MUVR algorithm was shown to generate predictions and variable selections at state-of-the-art performance. However, a general framework for assessing modeling fitness is still lacking. In addition, enabling to adjust for covariates is a highly desired, but largely lacking trait in ML. We aimed to address these issues in the new MUVR2 framework. Results: The MUVR2 algorithm was developed to include the regularized regression framework elastic net in addition to partial least squares and random forest modeling. Compared with other cross-validation strategies, MUVR2 consistently showed state-of-the-art performance, including variable selection, while minimizing overfitting. Testing on simulated and real-world data, we also showed that MUVR2 allows for the adjustment for covariates using elastic net modeling, but not using partial least squares or random forest.

Subject headings

NATURVETENSKAP  -- Data- och informationsvetenskap -- Bioinformatik (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences -- Bioinformatics (hsv//eng)
NATURVETENSKAP  -- Matematik -- Sannolikhetsteori och statistik (hsv//swe)
NATURAL SCIENCES  -- Mathematics -- Probability Theory and Statistics (hsv//eng)

Publication and Content Type

art (subject category)
ref (subject category)

Find in a library

To the university's database

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Search outside SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Close

Copy and save the link in order to return to this view