Search: onr:"swepub:oai:DiVA.org:uu-392095" >
Real-valued syntact...
Real-valued syntactic word vectors
-
- Basirat, Ali, 1982- (author)
- Uppsala universitet,Institutionen för lingvistik och filologi,Computational linguistics
-
- Nivre, Joakim, 1962- (author)
- Uppsala universitet,Institutionen för lingvistik och filologi,Computational linguistics
-
(creator_code:org_t)
- 2020
- 2020
- English.
-
In: Journal of experimental and theoretical artificial intelligence (Print). - 0952-813X .- 1362-3079. ; 32:4, s. 557-579
- Related links:
-
https://doi.org/10.1...
-
show more...
-
https://uu.diva-port... (primary) (Raw object)
-
https://urn.kb.se/re...
-
https://doi.org/10.1...
-
show less...
Abstract
Subject headings
Close
- We introduce a word embedding method that generates a set of real-valued word vectors from a distributional semantic space. The semantic space is built with a set of context units (words) which are selected by an entropy-based feature selection approach with respect to the certainty involved in their contextual environments. We show that the most predictive context of a target word is its preceding word. An adaptive transformation function is also introduced that reshapes the data distribution to make it suitable for dimensionality reduction techniques. The final low-dimensional word vectors are formed by the singular vectors of a matrix of transformed data. We show that the resulting word vectors are as good as other sets of word vectors generated with popular word embedding methods.
Subject headings
- HUMANIORA -- Språk och litteratur (hsv//swe)
- HUMANITIES -- Languages and Literature (hsv//eng)
- HUMANIORA -- Språk och litteratur -- Jämförande språkvetenskap och allmän lingvistik (hsv//swe)
- HUMANITIES -- Languages and Literature -- General Language Studies and Linguistics (hsv//eng)
- TEKNIK OCH TEKNOLOGIER -- Elektroteknik och elektronik -- Datorsystem (hsv//swe)
- ENGINEERING AND TECHNOLOGY -- Electrical Engineering, Electronic Engineering, Information Engineering -- Computer Systems (hsv//eng)
- NATURVETENSKAP -- Data- och informationsvetenskap -- Språkteknologi (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Language Technology (hsv//eng)
Keyword
- Word embeddings
- context selection
- transformation
- dependency parsing
- singular value decomposition
- entropy
Publication and Content Type
- ref (subject category)
- art (subject category)
Find in a library
To the university's database