Sökning: WFRF:(Kågebäck Mikael 1981) >
Neural context embe...
Neural context embeddings for automatic discovery of word senses
-
- Kågebäck, Mikael, 1981 (författare)
- Chalmers tekniska högskola,Chalmers University of Technology
-
- Johansson, Fredrik, 1988 (författare)
- Chalmers tekniska högskola,Chalmers University of Technology
-
- Johansson, Richard, 1975 (författare)
- Gothenburg University,Göteborgs universitet,Institutionen för svenska språket,Department of Swedish,University of Gothenburg
-
visa fler...
-
- Dubhashi, Devdatt, 1965 (författare)
- Chalmers tekniska högskola,Chalmers University of Technology
-
visa färre...
-
(creator_code:org_t)
- ISBN 9781941643464
- 2015
- 2015
- Engelska.
-
Ingår i: Proceedings of the 1st Workshop on Vector Space Modeling for Natural Language Processing. Denver, United States. - 9781941643464 ; , s. 25-32
- Relaterad länk:
-
https://gup.ub.gu.se...
-
visa fler...
-
https://research.cha...
-
visa färre...
Abstract
Ämnesord
Stäng
- Word sense induction (WSI) is the problem of automatically building an inventory of senses for a set of target words using only a text corpus. We introduce a new method for embedding word instances and their context, for use in WSI. The method, Instance-context embedding (ICE), leverages neural word embeddings, and the correlation statistics they capture, to compute high quality embeddings of word contexts. In WSI, these context embeddings are clustered to find the word senses present in the text. ICE is based on a novel method for combining word embeddings using continuous Skip-gram, based on both se- mantic and a temporal aspects of context words. ICE is evaluated both in a new system, and in an extension to a previous system for WSI. In both cases, we surpass previous state-of-the-art, on the WSI task of SemEval-2013, which highlights the generality of ICE. Our proposed system achieves a 33% relative improvement.
Ämnesord
- NATURVETENSKAP -- Data- och informationsvetenskap -- Språkteknologi (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Language Technology (hsv//eng)
- NATURVETENSKAP -- Data- och informationsvetenskap (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences (hsv//eng)
Nyckelord
- språkteknologi
- lexikal semantik
- ordbetydelser
- korpusar
- distributionella metoder
- språkteknologi
Publikations- och innehållstyp
- ref (ämneskategori)
- kon (ämneskategori)
Hitta via bibliotek
Till lärosätets databas