Automatic subject indexing of text

↓ Direkt till sidans innehåll
↓ Direkt till sidans sekundära innehåll (sidomenyn)

Sökning: id:"swepub:oai:DiVA.org:lnu-68349" > Automatic subject i...

1 av 1
Föregående post
Nästa post
Till träfflistan

Automatic subject indexing of text

Golub, Koraljka (författare): Linnéuniversitetet,Institutionen för kulturvetenskaper (KV),Library and Information Science

(creator_code:org_t)

International Society for Knowledge Organization, 2017
2017
Engelska.
Ingår i: ISKO. - : International Society for Knowledge Organization.

Relaterad länk:: http://www.isko.org/...; visa fler...; https://urn.kb.se/re...; visa färre...

Bokkapitel (refereegranskat)

Abstract Ämnesord

Stäng

Automatic subject indexing addresses problems of scale and sustainability and can be at the same time used to enrich existing metadata records, establish more connections across and between resources from various metadata and resource collections, and enhance consistency of the metadata. In this entry automatic subject indexing focuses on assigning index terms or classes from established knowledge organization systems (KOS) for subject indexing like thesauri, subject headings systems and classification systems. The following major approaches are discussed, in terms of their similarities and differences, advantages and disadvantages for automatic assigned indexing from KOSs: “text categorization”, “document clustering”, and “document classification”. Text categorization is perhaps the most widespread, machine-learning approach with what seems generally good reported performance. This, however, is dependent on availability of training corpora with documents already categorized which are in many cases not there. Document clustering automatically both creates groups of related documents and extracts names of subjects depicting the group at hand. It does not require training documents, but the reported automatically extracted terms and structures are not always of good quality, reflecting the underlying problems of the natural language; also, they both change when new documents are added to the collection and this mutability may not be user-friendly. Document classification re-uses the intellectual effort invested into creating KOSs for subject indexing and even simple string-matching algorithms have been reported to achieve good results because one concept can be described using a number of different terms, including equivalent, related, narrower and broader terms. Finally, applicability of automatic subject indexing to operative information systems and challenges of evaluation are outlined, suggesting the need for more research.

Hitta via bibliotek

Automatic subject indexing of text (Sök värdpublikationen i LIBRIS)

Till lärosätets databas

1 av 1
Föregående post
Nästa post
Till träfflistan

Hitta mer i SwePub

Av författaren/redakt...: Golub, Koraljka

Om ämnet

SAMHÄLLSVETENSKAP: SAMHÄLLSVETENSKA ...; och Medie och kommun ...; och Biblioteks och i ...

Artiklar i publikationen: ISKO

Av lärosätet: Linnéuniversitetet

Sök utanför SwePub

Sök vidare i:: Google; Google Book Search; Google Scholar

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

LIBRIS.kb.se

Automatic subject indexing of text

Ämnesord

Nyckelord

Publikations- och innehållstyp

Hitta via bibliotek

Till lärosätets databas

Hitta mer i SwePub

Sök utanför SwePub