SwePub

Results for search "WFRF:(Alemu Argaw Atelach) srt2:(2009)"

  • Results 1-4 of 4
1.
  • Asker, Lars, et al. (authors)
  • Classifying Amharic Webnews
  • 2009
  • In: Information retrieval (Boston). - : Springer Science and Business Media LLC. - 1386-4564 .- 1573-7659. ; 12:3, pp. 416-435
  • Journal article (peer-reviewed). Abstract:
    • We present work aimed at compiling an Amharic corpus from the Web and automatically categorizing the texts. Amharic is the second most spoken Semitic language in the world (after Arabic) and is used for countrywide communication in Ethiopia. It is highly inflectional and quite dialectally diversified. We discuss the issues of compiling and annotating a corpus of Amharic news articles from the Web. This corpus was then used in three sets of text classification experiments. Working with a less-researched language highlights a number of practical issues that might otherwise receive less attention or go unnoticed. The purpose of the experiments has not primarily been to develop a cutting-edge text classification system for Amharic, but rather to put the spotlight on some of these issues. The first two sets of experiments investigated the use of Self-Organizing Maps (SOMs) for document classification. Testing on small datasets, we first looked at classifying unseen data into 10 predefined categories of news items, and then at clustering it around query content, when taking 16 queries as class labels. The third set of experiments investigated the effect of operations such as stemming and part-of-speech tagging on text classification performance. We compared three representations while constructing classification models based on bagging of decision trees for the 10 predefined news categories. The best accuracy was achieved using the full text as representation. A representation using only the nouns performed almost equally well, confirming the assumption that most of the information required for distinguishing between various categories actually is contained in the nouns, while stemming did not have much effect on the performance of the classifier.
2.
  • Gambäck, Björn, et al. (authors)
  • An Amharic Corpus for Machine Learning
  • 2009
  • In: Proceedings of the 6th World Congress on African Linguistics. - : Matthias Brenzinger.
  • Conference paper (peer-reviewed)
3.
  • Gambäck, Björn, et al. (authors)
  • Methods for Amharic part-of-speech tagging
  • 2009. - 1
  • In: Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics.
  • Conference paper (peer-reviewed). Abstract:
    • The paper describes a set of experiments involving the application of three state-of-the-art part-of-speech taggers to Ethiopian Amharic, using three different tagsets. The taggers showed worse performance than previously reported results for English, in particular having problems with unknown words. The best results were obtained using a Maximum Entropy approach, while HMM-based and SVM-based taggers got comparable results.
4.
  • Gambäck, Björn, et al. (authors)
  • Methods for Amharic Part-of-Speech Tagging
  • 2009
  • In: EACL 2009 WS on Language Technology for African Languages. - : Guy De Pauw, Gilles-Maurice de Schryver, Lori Levin. - 1932432256
  • Conference paper (peer-reviewed)
Publication type
conference paper (3)
journal article (1)
Content type
peer-reviewed (4)
Author/editor
Asker, Lars (4)
Gambäck, Björn (4)
Olsson, Fredrik (3)
Alemu Argaw, Atelach (3)
Argaw, Atelach Alemu (1)
Eyassu, Samuel (1)
Nigussie, Lemma (1)
Institution
Stockholm University (3)
RISE (1)
Language
English (4)
Research subject (UKÄ/SCB)
Natural sciences (2)