Sökning: onr:"swepub:oai:DiVA.org:su-192115" >
A Multi-Word Expres...
A Multi-Word Expression Dataset for Swedish
-
- Kurfali, Murathan, 1990- (författare)
- Stockholms universitet,Institutionen för lingvistik
-
- Östling, Robert (författare)
- Stockholms universitet,Institutionen för lingvistik
-
- Sjons, Johan (författare)
- Stockholms universitet,Institutionen för lingvistik
-
visa fler...
-
- Wirén, Mats (författare)
- Stockholms universitet,Institutionen för lingvistik
-
visa färre...
-
(creator_code:org_t)
- Marseille : European Language Resources Association (ELRA), 2020
- 2020
- Engelska.
-
Ingår i: Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020). - Marseille : European Language Resources Association (ELRA). ; , s. 4402-4409
- Relaterad länk:
-
http://www.lrec-conf...
-
visa fler...
-
https://urn.kb.se/re...
-
visa färre...
Abstract
Ämnesord
Stäng
- We present a new set of 96 Swedish multi-word expressions annotated with degree of (non-)compositionality. In contrast to most previous compositionality datasets we also consider syntactically complex constructions and publish a formal specification of each expression. This allows evaluation of computational models beyond word bigrams, which have so far been the norm. Finally, we use the annotations to evaluate a system for automatic compositionality estimation based on distributional semantics. Our analysis of the disagreements between human annotators and the distributional model reveal interesting questions related to the perception of compositionality, and should be informative to future work in the area.
Ämnesord
- NATURVETENSKAP -- Data- och informationsvetenskap -- Språkteknologi (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Language Technology (hsv//eng)
Nyckelord
- multi-word expressions
- compositionality
- distributional semantic
- datorlingvistik
- Computational Linguistics
Publikations- och innehållstyp
- ref (ämneskategori)
- kon (ämneskategori)