SwePub

id:"swepub:oai:DiVA.org:uu-518284"
 

BERTie Bott's Every Flavor Labels : A Tasty Introduction to Semantic Role Labeling for Galician

Bruton, Micaella (author)
Uppsala universitet,Institutionen för lingvistik och filologi,Computational Linguistics
Beloucif, Meriem (author)
Uppsala universitet,Institutionen för lingvistik och filologi,Computational Linguistics
Association for Computational Linguistics, 2023
English.
In: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics. ISBN 9798891760608, pp. 10892-10902
  • Conference paper (peer-reviewed)
Abstract
  • In this paper, we leverage existing corpora, WordNet, and dependency parsing to build the first Galician dataset for training semantic role labeling systems in an effort to expand available NLP resources. Additionally, we introduce verb indexing, a new pre-processing method, which helps increase performance when semantically parsing highly complex sentences. We use transfer learning to test both the resource and the verb indexing method. Our results show that the effects of verb indexing were amplified in scenarios where the model was both pre-trained and fine-tuned on datasets utilizing the method, but improvements are also noticeable when it is used only during fine-tuning. The best-performing Galician SRL model achieved an F1 score of 0.74, introducing a baseline for future Galician SRL systems. We also tested our method on Spanish, where we achieved an F1 score of 0.83, outperforming the baseline set by the 2009 CoNLL Shared Task by 0.025 and showing the merits of our verb indexing method for pre-processing.

Subject headings

NATURAL SCIENCES -- Computer and Information Sciences -- Language Technology (hsv//eng)

Keywords

BERT
Pretrained Language Models
Computational Linguistics
Galician

Publication and Content Type

ref (peer-reviewed)
kon (conference paper)
