SwePub

  • Megyesi, Beata (author), Uppsala universitet, Institutionen för lingvistik och filologi

Swedish-Turkish Parallel Treebank

  • Article/chapter, English, 2008

Publisher, publication year, extent ...

  • Paris: European Language Resources Association (ELRA), 2008
  • print (rdacarrier)

Numbers

  • LIBRIS-ID: oai:DiVA.org:uu-87592
  • URI: https://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-87592

Supplementary language notes

  • Language: English
  • Summary in: English

Classification

  • Subject category: ref (swepub-contenttype)
  • Subject category: kon (swepub-publicationtype)

Notes

  • In this paper, we describe our work on building a parallel treebank for a less studied and typologically dissimilar language pair, namely Swedish and Turkish. The treebank is a balanced, syntactically annotated corpus containing both fiction and technical documents. In total, it consists of approximately 160,000 tokens in Swedish and 145,000 in Turkish. The texts are linguistically annotated in several layers, from part-of-speech tags and morphological features to dependency annotation. Each layer is processed automatically using basic language resources for the languages involved. The sentences and words are aligned, and the alignments are partly manually corrected. We create the treebank by reusing and adjusting existing tools for automatic annotation and alignment, and for their correction and visualization. The treebank was developed within the project Supporting research environment for minor languages, which aims to create representative language resources for language pairs that are dissimilar in language structure. Effort is therefore put into developing a general method for formatting and annotation, and into using tools that can easily be applied to other language pairs.

Added entries (persons, corporate bodies, meetings, titles ...)

  • Dahlqvist, Bengt (author), Uppsala universitet, Institutionen för lingvistik och filologi (Swepub:uu) bengtdq
  • Pettersson, Eva (author), Uppsala universitet, Institutionen för lingvistik och filologi (Swepub:uu) evpet102
  • Nivre, Joakim (author), Uppsala universitet, Institutionen för lingvistik och filologi (Swepub:uu) joani384
  • Uppsala universitet, Institutionen för lingvistik och filologi (creator_code:org_t)

Related titles

  • In: Proceedings of the Sixth International Language Resources and Evaluation (LREC'08). Paris: European Language Resources Association (ELRA)
