Sökning: onr:"swepub:oai:DiVA.org:liu-133783" >
Implicit readabilit...
Implicit readability ranking using the latent variable of a Bayesian Probit model
-
- Falkenjack, Johan (författare)
- SICS East Swedish ICT AB,NLPLAB
-
- Jönsson, Arne, 1955- (författare)
- SICS East Swedish ICT AB
-
(creator_code:org_t)
- Uppsala universitet Humanistisk-samhällsvetenskapliga vetenskapsområdet, 2016
- 2016
- Engelska.
-
Ingår i: CL4LC 2016 - Computational Linguistics for Linguistic Complexity. - : Uppsala universitet Humanistisk-samhällsvetenskapliga vetenskapsområdet. - 9784879747099 ; , s. 104-112
- Relaterad länk:
-
http://aclweb.org/an...
-
visa fler...
-
https://liu.diva-por... (primary) (Raw object)
-
https://urn.kb.se/re...
-
visa färre...
Abstract
Ämnesord
Stäng
- Data driven approaches to readability analysis for languages other than English has been plagued by a scarcity of suitable corpora. Often, relevant corpora consist only of easy-to-read texts with no rank information or empirical readability scores, making only binary approaches, such as classification, applicable. We propose a Bayesian, latent variable, approach to get the most out of these kinds of corpora. In this paper we present results on using such a model for readability ranking. The model is evaluated on a preliminary corpus of ranked student texts with encourag- ing results. We also assess the model by showing that it performs readability classification on par with a state of the art classifier while at the same being transparent enough to allow more sophisticated interpretations.
Ämnesord
- NATURVETENSKAP -- Data- och informationsvetenskap -- Språkteknologi (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Language Technology (hsv//eng)
Publikations- och innehållstyp
- ref (ämneskategori)
- kon (ämneskategori)
Hitta via bibliotek
Till lärosätets databas