SwePub
Sök i LIBRIS databas

  Extended search

onr:"swepub:oai:DiVA.org:uu-126666"
 

Search: onr:"swepub:oai:DiVA.org:uu-126666" > Bootstrapping Langu...

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Bootstrapping Language Description : The case of Mpiemo (Bantu A, Central African Republic)

Hammarström, Harald (author)
Gothenburg University,Göteborgs universitet,Institutionen för data- och informationsteknik, datavetenskap (GU),Department of Computer Science and Engineering, Computing Science (GU),Department of Computing Science, Chalmers University, Gothenburg
Thornell, Christina, 1948 (author)
Gothenburg University,Göteborgs universitet,Institutionen för orientaliska och afrikanska språk,Department of Oriental and African Languages,Department of African Languages, Gothenburg University, Gothenburg
Petzell, Malin, 1972 (author)
Gothenburg University,Göteborgs universitet,Institutionen för orientaliska och afrikanska språk,Department of Oriental and African Languages,Department of African Languages, Gothenburg University, Gothenburg
show more...
Westerlund, Torbjörn, 1971- (author)
Uppsala universitet,Institutionen för lingvistik och filologi
show less...
 (creator_code:org_t)
2008
2008
English.
In: Proceedings of the 6th edition of the Language Resources and Evaluation Conference (LREC 2008), 28-30 may 2008, Marrakech, Morocco.
  • Conference paper (peer-reviewed)
Abstract Subject headings
Close  
  • Linguists have long been producing grammatical decriptions of yet undescribed languages. This is a time-consuming process, which has already adapted to improved technology for recording and storage. We present here a novel application of NLP techniques to bootstrap analysis of collected data and speed-up manual selection work. To be more precise, we argue that unsupervised induction of morphology and part-of-speech analysis from raw text data is mature enough to produce useful results. Experiments with Latent Semantic Analysis were less fruitful. We exemplify this on Mpiemo, a so-far essentially undescribed Bantu language of the Central African Republic, for which raw text data was available.

Subject headings

HUMANIORA  -- Språk och litteratur -- Studier av enskilda språk (hsv//swe)
HUMANITIES  -- Languages and Literature -- Specific Languages (hsv//eng)
NATURVETENSKAP  -- Data- och informationsvetenskap -- Språkteknologi (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences -- Language Technology (hsv//eng)
NATURVETENSKAP  -- Data- och informationsvetenskap -- Datavetenskap (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences -- Computer Sciences (hsv//eng)

Keyword

Mpiemo
Bantu A
Central African Republic
NLP
Latent Semantic Analysis
bootstrapping
African languages
Afrikanska språk
Computational linguistics
Datorlingvistik
Acquisition
Machine Learning
Endangered languages
Language modelling

Publication and Content Type

ref (subject category)
kon (subject category)

To the university's database

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Search outside SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Close

Copy and save the link in order to return to this view