SwePub
Sök i LIBRIS databas

  Extended search

WFRF:(de Lhoneux Miryam 1990 )
 

Search: WFRF:(de Lhoneux Miryam 1990 ) > Linguistically Info...

  • de Lhoneux, Miryam,1990-Uppsala universitet,Institutionen för lingvistik och filologi,Computational Linguistics (author)

Linguistically Informed Neural Dependency Parsing for Typologically Diverse Languages

  • BookEnglish2019

Publisher, publication year, extent ...

  • Uppsala :Acta Universitatis Upsaliensis,2019
  • 178 s.
  • electronicrdacarrier

Numbers

  • LIBRIS-ID:oai:DiVA.org:uu-394133
  • ISBN:9789151307671
  • https://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-394133URI

Supplementary language notes

  • Language:English
  • Summary in:English

Part of subdatabase

Classification

  • Subject category:vet swepub-contenttype
  • Subject category:dok swepub-publicationtype

Series

  • Studia Linguistica Upsaliensia,1652-1366 ;24

Notes

  • This thesis presents several studies in neural dependency parsing for typologically diverse languages, using treebanks from Universal Dependencies (UD). The focus is on informing models with linguistic knowledge. We first extend a parser to work well on typologically diverse languages, including morphologically complex languages and languages whose treebanks have a high ratio of non-projective sentences, a notorious difficulty in dependency parsing. We propose a general methodology where we sample a representative subset of UD treebanks for parser development and evaluation. Our parser uses recurrent neural networks which construct information sequentially, and we study the incorporation of a recursive neural network layer in our parser. This follows the intuition that language is hierarchical. This layer turns out to be superfluous in our parser and we study its interaction with other parts of the network. We subsequently study transitivity and agreement information learned by our parser for auxiliary verb constructions (AVCs). We suggest that a parser should learn similar information about AVCs as it learns for finite main verbs. This is motivated by work in theoretical dependency grammar. Our parser learns different information about these two if we do not augment it with a recursive layer, but similar information if we do, indicating that there may be benefits from using that layer and we may not yet have found the best way to incorporate it in our parser. We finally investigate polyglot parsing. Training one model for multiple related languages leads to substantial improvements in parsing accuracy over a monolingual baseline. We also study different parameter sharing strategies for related and unrelated languages. Sharing parameters that partially abstract away from word order appears to be beneficial in both cases but sharing parameters that represent words and characters is more beneficial for related than unrelated languages.

Subject headings and genre

Added entries (persons, corporate bodies, meetings, titles ...)

  • Nivre, JoakimUppsala universitet,Institutionen för lingvistik och filologi(Swepub:uu)joani384 (thesis advisor)
  • Stymne, SaraUppsala universitet,Institutionen för lingvistik och filologi (thesis advisor)
  • Bender, Emily,ProfessorUniversity of Washington, Department of Linguistics (opponent)
  • Uppsala universitetInstitutionen för lingvistik och filologi (creator_code:org_t)

Internet link

Find in a library

To the university's database

Search outside SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Close

Copy and save the link in order to return to this view