Search: WFRF:(Blokland Rogier 1971 ) >
A pseudonymization ...
A pseudonymization method for language documentation corpora: an experiment with spoken Komi
-
- Partanen, Niko, 1986- (author)
- University of Helsinki, Finland
-
- Blokland, Rogier, 1971- (author)
- Uppsala universitet,Institutionen för moderna språk
-
- Rießler, Michael, 1971- (author)
- University of Joensuu, Finland
-
(creator_code:org_t)
- Vienna, Austria, 2020
- 2020
- English.
-
In: Proceedings of the 6th International Workshop on Computational Linguistics of Uralic Languages. - Vienna, Austria. - 9781952148002 ; , s. 1-8
- Related links:
-
https://www.aclweb.o...
-
show more...
-
https://uu.diva-port... (primary) (Raw object)
-
https://urn.kb.se/re...
-
show less...
Abstract
Subject headings
Close
- This article introduces a novel and creative application of the Constraint Grammar formalism, by presenting an automated method for pseudonymising a Zyrian Komi spoken language corpus in an effective, reliable and scalable manner. The method is intended to be used to minimize various kinds of personal information found in the corpus in order to make spoken language data available while preventing the spread of sensitive personal data about the recorded informants or other persons mentioned in the texts. In our implementation, a Constraint Grammar based pseudonymisation tool is used as an automatically applied shallow layer that derives from the original corpus data a version which can be shared for open research use.
- Seo artikli tutvustas vahtsõt ja loovat piirdmiisi grammatiga (PG) formalismõ pruuk ́mist. Taas om metod ́, kon PG pruugitas tuusjaos, et süräkomi kõnõldu keele korpusõ lindistuisi saassiq tegüsähe, kimmähe ja kontrol ́misõvõimalusõga vaŕonimmiga käkkiq. Seo metod ́ om tett, et korpusõn saassiq kõnõlõjidõ andmit nii pall ́o vähembäs võttaq, ku või, ja et tulõmit saassiq kergehe käsilde kontrolliq. Mi plaani perrä pruugitas taad ku automaatsõt kihti, miä tege korpusõ säändses, et taad või kergehe uuŕmisõ jaos jakaq.
Subject headings
- HUMANIORA -- Språk och litteratur -- Studier av enskilda språk (hsv//swe)
- HUMANITIES -- Languages and Literature -- Specific Languages (hsv//eng)
Keyword
- Komi
- language documentation
- pseudonymisation
- constraint grammar
- Finsk-ugriska språk
- Finno-Ugric Languages
Publication and Content Type
- ref (subject category)
- kon (subject category)
Find in a library
To the university's database