SwePub
A pseudonymization method for language documentation corpora: an experiment with spoken Komi

Partanen, Niko, 1986- (author)
University of Helsinki, Finland
Blokland, Rogier, 1971- (author)
Uppsala universitet,Institutionen för moderna språk
Rießler, Michael, 1971- (author)
University of Joensuu, Finland
Vienna, Austria, 2020
English.
In: Proceedings of the 6th International Workshop on Computational Linguistics of Uralic Languages. - Vienna, Austria. - ISBN 9781952148002 ; pp. 1-8
  • Conference paper (peer-reviewed)
Abstract
  • This article introduces a novel and creative application of the Constraint Grammar formalism, by presenting an automated method for pseudonymising a Zyrian Komi spoken language corpus in an effective, reliable and scalable manner. The method is intended to be used to minimize various kinds of personal information found in the corpus in order to make spoken language data available while preventing the spread of sensitive personal data about the recorded informants or other persons mentioned in the texts. In our implementation, a Constraint Grammar based pseudonymisation tool is used as an automatically applied shallow layer that derives from the original corpus data a version which can be shared for open research use.
  • (Translated from the Võro abstract) This article introduces a new and creative use of the Constraint Grammar (CG) formalism. It is a method in which CG is used to conceal, by means of pseudonyms, the recordings of a Zyrian Komi spoken language corpus effectively, reliably and in a verifiable way. The method is designed to reduce the speakers' personal data in the corpus as far as possible, and to make the results easy to check manually. We intend it to be used as an automatic layer that turns the corpus into a form that can easily be shared for research.

Subject headings

HUMANIORA  -- Språk och litteratur -- Studier av enskilda språk (hsv//swe)
HUMANITIES  -- Languages and Literature -- Specific Languages (hsv//eng)

Keyword

Komi
language documentation
pseudonymisation
constraint grammar
Finsk-ugriska språk
Finno-Ugric Languages

Publication and Content Type

ref (peer-reviewed)
kon (conference paper)
