SwePub

id:"swepub:oai:DiVA.org:kth-171392"
 


Swedish full text retrieval : Effectiveness of different combinations of indexing strategies with query terms

Ahlgren, Per (author)
University College of Borås
Kekäläinen, Jaana (author)
2006-09-01
2006
English.
In: Information Retrieval (Boston). Springer Science and Business Media LLC. ISSN 1386-4564, 1573-7659. Vol. 9, no. 6, pp. 681-697
  • Journal article (peer-reviewed)
Abstract
  • This paper treats Swedish full text retrieval and studies the problem of morphological variation of query terms in the document database. Using the Swedish CLEF 2003 test collection, it examines how combinations of indexing strategies with query terms affect retrieval effectiveness. Four of the seven tested combinations involved indexing strategies that used normalization, a form of conflation. All four employed compound splitting, both during indexing and at the query phase. SWETWOL, a morphological analyzer for the Swedish language, was used for normalization and compound splitting. A fifth combination used stemming, while a sixth attempted to group related terms by right-hand truncation of query terms, performed by a search expert. These six combinations were compared to each other and to a baseline combination in which no attempt was made to counteract morphological variation. The truncation combination, the four normalization-based combinations, and the stemming combination all outperformed the baseline, with truncation performing best. The main conclusion is that truncation, normalization, and stemming enhanced retrieval effectiveness relative to the baseline, and that normalization and stemming were not far below truncation.
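
As an illustration of the right-hand truncation idea mentioned in the abstract, the following is a minimal sketch and not the authors' procedure. It assumes a trailing `*` marks the truncation point and that a truncated query term matches every index term sharing its prefix. The function name `expand_truncated` and the small vocabulary are hypothetical.

```python
def expand_truncated(term: str, vocabulary: set[str]) -> set[str]:
    """Return the index terms matched by a query term.

    A trailing '*' marks right-hand truncation (an assumed convention):
    every vocabulary word starting with the prefix matches.
    Without '*', only an exact match counts.
    """
    if term.endswith("*"):
        prefix = term[:-1]
        return {word for word in vocabulary if word.startswith(prefix)}
    return {term} & vocabulary


# Hypothetical index vocabulary: inflected Swedish forms and one compound.
vocabulary = {"bil", "bilen", "bilar", "bilarna", "bilindustri", "bok", "böcker"}

exact = expand_truncated("bil", vocabulary)       # no conflation, as in a baseline
truncated = expand_truncated("bil*", vocabulary)  # groups forms sharing the prefix
```

In this sketch the truncated term also matches the compound `bilindustri`, and it would equally match an unrelated word such as `bild` if the vocabulary contained it, which suggests why the choice of truncation point may call for human judgment.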

Subject headings

SAMHÄLLSVETENSKAP  -- Medie- och kommunikationsvetenskap -- Biblioteks- och informationsvetenskap (hsv//swe)
SOCIAL SCIENCES  -- Media and Communications -- Information Studies (hsv//eng)

Publication and Content Type

ref (subject category)
art (subject category)
