SwePub
Sök i LIBRIS databas

  Utökad sökning

id:"swepub:oai:lup.lub.lu.se:ad1a2291-e9e2-4a7c-aa10-00ffa6e0b75b"
 

Sökning: id:"swepub:oai:lup.lub.lu.se:ad1a2291-e9e2-4a7c-aa10-00ffa6e0b75b" > Adaptive Bayesian S...

Adaptive Bayesian SLOPE : Model Selection With Incomplete Data

Jiang, Wei (författare)
École Polytechnique
Bogdan, Małgorzata (författare)
Lund University,Lunds universitet,Statistiska institutionen,Ekonomihögskolan,Department of Statistics,Lund University School of Economics and Management, LUSEM,Wroclaw University
Josse, Julie (författare)
École Polytechnique
visa fler...
Majewski, Szymon (författare)
École Polytechnique
Miasojedow, Błażej (författare)
University of Warsaw
Ročková, Veronika (författare)
University of Chicago
visa färre...
 (creator_code:org_t)
 
2021-10-14
2022
Engelska.
Ingår i: Journal of Computational and Graphical Statistics. - : Informa UK Limited. - 1061-8600 .- 1537-2715. ; 31:1, s. 113-137
  • Tidskriftsartikel (refereegranskat)
Abstract Ämnesord
Stäng  
  • We consider the problem of variable selection in high-dimensional settings with missing observations among the covariates. To address this relatively understudied problem, we propose a new synergistic procedure—adaptive Bayesian SLOPE with missing values—which effectively combines SLOPE (sorted l 1 regularization) with the spike-and-slab LASSO (SSL) and is accompanied by an efficient stochastic approximation of expected maximization (SAEM) algorithm to handle missing data. Similarly as in SSL, the regression coefficients are regarded as arising from a hierarchical model consisting of two groups: the spike for the inactive and the slab for the active. However, instead of assigning independent spike and slab Laplace priors for each covariate, here we deploy a joint SLOPE “spike-and-slab” prior which takes into account the ordering of coefficient magnitudes in order to control for false discoveries. We position our approach within a Bayesian framework which allows for simultaneous variable selection and parameter estimation while handling missing data. Through extensive simulations, we demonstrate satisfactory performance in terms of power, false discovery rate (FDR) and estimation bias under a wide range of scenarios including complete data and existence of missingness. Finally, we analyze a real dataset consisting of patients from Paris hospitals who underwent severe trauma, where we show competitive performance in predicting platelet levels. Our methodology has been implemented in C++ and wrapped into open source R programs for public use. Supplemental files for this article are available online.

Ämnesord

NATURVETENSKAP  -- Matematik -- Sannolikhetsteori och statistik (hsv//swe)
NATURAL SCIENCES  -- Mathematics -- Probability Theory and Statistics (hsv//eng)

Nyckelord

FDR control
Health data
Incomplete data
Penalized regression
Spike and slab prior
Stochastic approximation EM

Publikations- och innehållstyp

art (ämneskategori)
ref (ämneskategori)

Hitta via bibliotek

Till lärosätets databas

Sök utanför SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy