SwePub

onr:"swepub:oai:DiVA.org:uu-518286"
 


Using Wikidata for Enhancing Compositionality in Pre-trained Language Models

Beloucif, Meriem (author)
Uppsala universitet,Institutionen för lingvistik och filologi,Computational Linguistics
Bansal, Mihir (author)
Carnegie Mellon University
Biemann, Chris (author)
Hamburg University,Language Technology
INCOMA, 2023
English.
In: Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing. INCOMA. ISBN 9789544520922, pp. 170-178
  • Conference paper (peer-reviewed)
Abstract
  • One of the many advantages of pre-trained language models (PLMs) such as BERT and RoBERTa is their flexibility and contextual nature. These features give PLMs strong capabilities for representing lexical semantics. However, PLMs seem incapable of capturing high-level semantics in terms of compositionality. We show that when augmented with the relevant semantic knowledge, PLMs learn to capture a higher degree of lexical compositionality. We annotate a large dataset from Wikidata highlighting a type of semantic inference that is easy for humans to understand but difficult for PLMs, such as the correlation between age and date of birth. We use this resource for fine-tuning DistilBERT, BERT large and RoBERTa. Our results show that the performance of PLMs on the test data continuously improves when they are augmented with such a rich resource. Our results are corroborated by a consistent improvement over most GLUE benchmark natural language understanding tasks.

Subject headings

NATURAL SCIENCES  -- Computer and Information Sciences -- Language Technology (hsv//eng)

Keyword

BERT
Pretrained Language Models
Computational Linguistics
Semantics in LLM

Publication and Content Type

ref (subject category)
kon (subject category)
