Resource Lenient Approaches to Cross Language Information Retrieval: Using Amharic

↓ Direkt till sidans innehåll
↓ Direkt till sidans sekundära innehåll (sidomenyn)

Sökning: id:"swepub:oai:DiVA.org:su-57267" > Resource Lenient Ap...

1 av 1
Föregående post
Nästa post
Till träfflistan

Resource Lenient Approaches to Cross Language Information Retrieval : Using Amharic

Argaw, Atelach Alemu, 1974- (författare): Stockholms universitet,Institutionen för data- och systemvetenskap

Asker, Lars, Docent (preses): Stockholms universitet,Institutionen för data- och systemvetenskap

Karlgren, Jussi, Docent (preses): Swedish Institute of Computer Science

visa fler...

Oard, Douglas, Professor (opponent): University of Maryland, College of Information Studies

visa färre...

(creator_code:org_t)

ISBN 9789174472363
Stockholm : Department of Computer and Systems Sciences, Stockholm University, 2011
Engelska 152 s.
Serie: Report Series / Department of Computer & Systems Sciences, 1101-8526 ; 11-003

Relaterad länk:: https://urn.kb.se/re...

Doktorsavhandling (övrigt vetenskapligt/konstnärligt)

Abstract Ämnesord

Stäng

Information Retrieval (IR) deals with finding and presenting information from a collection of documents/data that are relevant to an information need (a query) expressed by a user. Cross Language Information Retrieval (CLIR) is a subfield of IR where queries are posed in a different language than that of the document collection. Computational linguistic tools and resources are essential to accomplish the tasks in CLIR and to date, CLIR research is dominated by a very limited number of languages for which such tools and resources are available. In order to facilitate global information sharing, it is important to enable access to information using as many languages as possible. This requires an investigation into the feasibility of CLIR for languages with a limited set of computational linguistic resources. Amharic is a well-studied language with a rich history and culture, but has very limited computational linguistic tools and resources. This dissertation provides an in depth investigation into a CLIR system for Amharic (against English and French document collections). Scalable techniques were developed to accomplish Amharic CLIR tasks and each task was evaluated individually as a stand alone experiment. Large scale IR experiments were then conducted in order to evaluate the effect of three parameters, namely, transliteration, word sense discrimination, and term selection based on part of speech tags, on the overall IR performance. The effects were measured by individually tuning each of these parameters through a series of benchmarking experiments, geared towards optimizing retrieval precision as well as recall. The results give an insight into the performance of the chosen approaches, the challenges, and their impact on the overall IR performance.

Ämnesord

NATURVETENSKAP -- Data- och informationsvetenskap -- Systemvetenskap, informationssystem och informatik (hsv//swe)
NATURAL SCIENCES -- Computer and Information Sciences -- Information Systems (hsv//eng)

Nyckelord

Cross language information retrieval
Amharic
stemming
MRD based query translation
transliteration
named entity detection
translation term selection
sense discrimination
POS tagging
Computer and systems science
Data- och systemvetenskap
Computer and Systems Sciences
data- och systemvetenskap

Publikations- och innehållstyp

vet (ämneskategori)
dok (ämneskategori)

Hitta via bibliotek

Resource Lenient Approaches to Cross Language Information Retrieval Using Amhari... (Sök publikationen i LIBRIS)

Till lärosätets databas

1 av 1
Föregående post
Nästa post
Till träfflistan

Hitta mer i SwePub

Av författaren/redakt...: Argaw, Atelach A ...; Asker, Lars, Doc ...; Karlgren, Jussi, ...; Oard, Douglas, P ...

Om ämnet

NATURVETENSKAP: NATURVETENSKAP; och Data och informa ...; och Systemvetenskap ...

Delar i serien: Report Series / ...

Av lärosätet: Stockholms universitet

Sök utanför SwePub

Sök vidare i:: Google; Google Book Search; Google Scholar

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

LIBRIS.kb.se