SwePub
Sök i LIBRIS databas

  Utökad sökning

onr:"swepub:oai:DiVA.org:kth-88088"
 

Sökning: onr:"swepub:oai:DiVA.org:kth-88088" > Have I seen you bef...

Have I seen you before? : Principles of Bayesian predictive classification revisited

Corander, Jukka, 1965- (författare)
University of Helsinki,Department of mathematics and statistics
Cui, Yao (författare)
University of Helsinki,Department of mathematics and statistics
Koski, Timo, 1952- (författare)
KTH,Matematisk statistik,computational biostatistics
visa fler...
Sirén, Jukka (författare)
University of Helsinki,Department of mathematics and statistics
visa färre...
 (creator_code:org_t)
2011-10-04
2013
Engelska.
Ingår i: Statistics and computing. - : Springer Berlin/Heidelberg. - 0960-3174 .- 1573-1375. ; 23:1, s. 59-73
  • Tidskriftsartikel (refereegranskat)
Abstract Ämnesord
Stäng  
  • A general inductive Bayesian classification framework is considered using a simultaneous predictive distribution for test items. We introduce a principle of generative supervised and semi-supervised classification based on marginalizing the joint posterior distribution of labels for all test items. The simultaneous and marginalized classifiers arise under different loss functions, while both acknowledge jointly all uncertainty about the labels of test items and the generating probability measures of the classes. We illustrate for data from multiple finite alphabets that such classifiers achieve higher correct classification rates than a standard marginal predictive classifier which labels all test items independently, when training data are sparse. In the supervised case for multiple finite alphabets the simultaneous and the marginal classifiers are proven to become equal under generalized exchangeability when the amount of training data increases. Hence, the marginal classifier can be interpreted as an asymptotic approximation to the simultaneous classifier for finite sets of training data. It is also shown that such convergence is not guaranteed in the semi-supervised setting, where the marginal classifier does not provide a consistent approximation.

Ämnesord

NATURVETENSKAP  -- Data- och informationsvetenskap (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences (hsv//eng)

Nyckelord

Classification
Exchangeability
Inductive learning
Predictive inference

Publikations- och innehållstyp

ref (ämneskategori)
art (ämneskategori)

Hitta via bibliotek

Till lärosätets databas

Sök utanför SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy