SwePub
Sök i LIBRIS databas

  Extended search

onr:"swepub:oai:DiVA.org:ltu-90107"
 

Search: onr:"swepub:oai:DiVA.org:ltu-90107" > Word2Vec: Optimal h...

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Word2Vec: Optimal hyperparameters and their impact on natural language processing downstream tasks

Adewumi, Oluwatosin, 1978- (author)
Luleå tekniska universitet,EISLAB
Liwicki, Foteini (author)
Luleå tekniska universitet,EISLAB
Liwicki, Marcus (author)
Luleå tekniska universitet,EISLAB
 (creator_code:org_t)
2022-03-24
2022
English.
In: Open Computer Science. - : Walter de Gruyter. - 2299-1093. ; 12:1, s. 134-141
  • Journal article (peer-reviewed)
Abstract Subject headings
Close  
  • Word2Vec is a prominent model for natural language processing tasks. Similar inspiration is found in distributed embeddings (word-vectors) in recent state-of-the-art deep neural networks. However, wrong combination of hyperparameters can produce embeddings with poor quality. The objective of this work is to empirically show that Word2Vec optimal combination of hyper-parameters exists and evaluate various combinations. We compare them with the publicly released, original Word2Vec embedding. Both intrinsic and extrinsic (downstream) evaluations are carried out, including named entity recognition and sentiment analysis. Our main contributions include showing that the best model is usually task-specific, high analogy scores do not necessarily correlate positively with F1 scores, and performance is not dependent on data size alone. If ethical considerations to save time, energy, and the environment are made, then relatively smaller corpora may do just as well or even better in some cases. Increasing the dimension size of embeddings after a point leads to poor quality or performance. In addition, using a relatively small corpus, we obtain better WordSim scores, corresponding Spearman correlation, and better downstream performances (with significance tests) compared to the original model, which is trained on a 100 billion-word corpus.

Subject headings

NATURVETENSKAP  -- Data- och informationsvetenskap -- Språkteknologi (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences -- Language Technology (hsv//eng)

Keyword

Word2Vec
hyperparameters
embeddings
named entity recognition
sentiment analysis
Maskininlärning
Machine Learning

Publication and Content Type

ref (subject category)
art (subject category)

Find in a library

To the university's database

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Find more in SwePub

By the author/editor
Adewumi, Oluwato ...
Liwicki, Foteini
Liwicki, Marcus
About the subject
NATURAL SCIENCES
NATURAL SCIENCES
and Computer and Inf ...
and Language Technol ...
Articles in the publication
Open Computer Sc ...
By the university
Luleå University of Technology

Search outside SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Close

Copy and save the link in order to return to this view