Sökning: WFRF:(Blokland Rogier 1971 ) >
Transforming Archiv...
Transforming Archived Resources with Language Technology : From Manuscripts to Language Documentation
-
- Partanen, Niko, 1986- (författare)
- University of Helsinki
-
- Blokland, Rogier, 1971- (författare)
- Uppsala universitet,Finsk-ugriska språk
-
- Rießler, Michael, 1971- (författare)
- University of Eastern Finland
-
visa fler...
-
- Rueter, Jack (författare)
- University of Helsinki
-
visa färre...
-
(creator_code:org_t)
- CEUR-WS, 2022
- 2022
- Engelska.
-
Ingår i: Proceedings of the 6th Digital Humanities in the Nordic and Baltic Countries Conference (DHNB 2022). - : CEUR-WS. ; , s. 370-380
- Relaterad länk:
-
http://ceur-ws.org/V...
-
visa fler...
-
https://uu.diva-port... (primary) (Raw object)
-
https://urn.kb.se/re...
-
visa färre...
Abstract
Ämnesord
Stäng
- Transcriptions in different languages are a ubiquitous data format in linguistics and in many other fields in the humanities. However, the majority of these resources remain both under-used and under-studied. This may be the case even when the materials have been published in print, but is certainly the case for the majority of unpublished transcriptions. Our paper presents a workflow adapted in the research project Language Documentation Meets Language Technology, which combines text recognition, automatic transliteration and forced alignment into a process which allows us to convert earlier transcribed documents to a structure that is comparable with contemporary language documentation corpora. This has complex practical and methodological considerations.
Ämnesord
- HUMANIORA -- Språk och litteratur -- Studier av enskilda språk (hsv//swe)
- HUMANITIES -- Languages and Literature -- Specific Languages (hsv//eng)
Nyckelord
- documentary linguistics
- language technology
- text recognition
- forced alignment
- Zyrian Komi
- Finsk-ugriska språk
- Finno-Ugric Languages
Publikations- och innehållstyp
- vet (ämneskategori)
- kon (ämneskategori)