SwePub
Sök i LIBRIS databas

  Utökad sökning

id:"swepub:oai:DiVA.org:hj-9771"
 

Sökning: id:"swepub:oai:DiVA.org:hj-9771" > Textual content, ci...

Textual content, cited references, similarity order, and clustering : an experimental study in the context of science mapping

Ahlgren, Per (författare)
Department of e-Resources, University Library, Stockholm University
Colliander, Cristian, 1980- (författare)
Jönköping University,Högskolebiblioteket,Högskolan i Jönköping, Högskolebiblioteket
 (creator_code:org_t)
2009
2009
Engelska.
Ingår i: Proceedings of the 12th International Conference on Scientometrics and Informetrics. ; , s. 862-873
  • Konferensbidrag (refereegranskat)
Abstract Ämnesord
Stäng  
  • This paper deals with document-document similarity approaches, the issue of similarity order, and clustering methods, in the context of science mapping. Using two data sets of bibliographic records, associated with the fields of information retrieval and scientometrics, we investigate how well two document-document similarity approaches, a text-based approach and bibliographic coupling, agree with ground truth classifications (obtained by subject experts), under first-order and second-order similarities, and under four different clustering methods. The clustering methods are average linkage, complete linkage, Ward’s method and consensus clustering. The performance of first-order and second-order similarities is compared within the two document-document similarity approaches, and under each clustering method. We also compare the performance of the clustering methods. The results show that the text-based approach consistently outperformed bibliographic coupling with regard to the information retrieval data set, but performed consistently worse than the latter approach regarding the scientometrics data set. For the similarity order issue, second-order similarities performed better than first-order in 12 out of 16 cases. Average linkage had the best overall performance among the clustering methods, followed by consensus clustering. The main conclusion of the study is that second-order similarities seem to be a better choice than first-order in the science mapping context.

Ämnesord

SAMHÄLLSVETENSKAP  -- Sociologi (hsv//swe)
SOCIAL SCIENCES  -- Sociology (hsv//eng)

Nyckelord

Bibliometrics
Citation data
Text mining
Similarity order
Consensus clustering
Sociology
Sociologi

Publikations- och innehållstyp

ref (ämneskategori)
kon (ämneskategori)

Till lärosätets databas

Hitta mer i SwePub

Av författaren/redakt...
Ahlgren, Per
Colliander, Cris ...
Om ämnet
SAMHÄLLSVETENSKAP
SAMHÄLLSVETENSKA ...
och Sociologi
Artiklar i publikationen
Av lärosätet
Jönköping University
Umeå universitet

Sök utanför SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy