Using computational approaches to integrate endangered language legacy data into documentation corpora : Past experiences and challenges ahead
- Blokland, Rogier, 1971- (author)
- Uppsala universitet, Institutionen för moderna språk
- Partanen, Niko, 1986- (author)
- Institute for the Languages of Finland
- Rießler, Michael, 1971- (author)
- University of Freiburg
- Wilbur, Joshua (author)
- University of Freiburg
- Boulder, Colorado : University of Colorado, 2019
- 2019
- English.
In: Proceedings of the Workshop on Computational Methods for Endangered Languages. - Boulder, Colorado : University of Colorado, pp. 24-30
- Related link:
https://scholar.colo...
https://uu.diva-port... (primary) (Raw object)
https://urn.kb.se/re...
Abstract
- The systematic integration of pre-digital published transcriptions of legacy language materials offers many possibilities to enrich documentary corpora with data that is often closely comparable to contemporary collections and often originates from the same speech communities researchers currently work with. Recent advances in text recognition technologies, in particular, make the reuse of old materials an attractive and accessible task. However, the output of text recognition still needs to be connected to further parts of the pipeline, namely forced alignment and speech recognition. The workflows discussed here aim at a maximally useful situation in which legacy data is transformed into a usable and comparable format, but not yet into a time-aligned corpus.
Subject headings
- HUMANIORA -- Språk och litteratur -- Studier av enskilda språk (hsv//swe)
- HUMANITIES -- Languages and Literature -- Specific Languages (hsv//eng)
Keywords
- Zyrian Komi
- endangered languages
- computational linguistics
- documentary linguistics
- Datorlingvistik
- Computational Linguistics
Publication and content type
- ref (subject category)
- kon (subject category)