Sökning: onr:"swepub:oai:gup.ub.gu.se/178259" >
Using the probabili...
Using the probability of readability to order Swedish texts
-
- Falkenjack, Johan, 1986- (författare)
- Linköpings universitet,Interaktiva och kognitiva system,Tekniska högskolan,Santa Anna IT Research Institute AB, Linköping, Sweden
-
- Heimann Mühlenbock, Katarina, 1952 (författare)
- Gothenburg University,Göteborgs universitet,Institutionen för svenska språket,Department of Swedish,Språkbanken, University of Gothenburg, Gothenburg
-
(creator_code:org_t)
- 2012
- 2012
- Engelska.
-
Ingår i: Proceedings of the Fourth Swedish Language Technology Conference. ; , s. 27-28
- Relaterad länk:
-
https://liu.diva-por... (primary) (Raw object)
-
visa fler...
-
https://gup.ub.gu.se...
-
https://urn.kb.se/re...
-
visa färre...
Abstract
Ämnesord
Stäng
- In this study we present a new approach to rank readability in Swedish texts based on lexical, morpho-syntactic and syntactic analysis of text as well as machine learning. The basic premise and theory is presented as well as a small experiment testing the feasibility, but not actual performance, of the approach. The experiment shows that it is possible to implement a system based on the approach, however, the actual performance of such a system has not been evaluated as the necessary resources for such an evaluation does not yet exist for Swedish. The experiment also shows that a classifier based on the aforementioned linguistic analysis, on our limited test set, outperforms classifiers based on established metrics used to assess readability such as LIX, OVIX and Nominal Ratio.
Ämnesord
- HUMANIORA -- Språk och litteratur (hsv//swe)
- HUMANITIES -- Languages and Literature (hsv//eng)
- TEKNIK OCH TEKNOLOGIER -- Annan teknik (hsv//swe)
- ENGINEERING AND TECHNOLOGY -- Other Engineering and Technologies (hsv//eng)
- NATURVETENSKAP -- Data- och informationsvetenskap -- Språkteknologi (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Language Technology (hsv//eng)
Publikations- och innehållstyp
- ref (ämneskategori)
- kon (ämneskategori)