GDTM: Graph-based Dynamic Topic Models

↓ Direkt till sidans innehåll
↓ Direkt till sidans sekundära innehåll (sidomenyn)

Search: onr:"swepub:oai:DiVA.org:kth-263828" > GDTM: Graph-based D...

1 of 1
Previous record
Next record
To hitlist

GDTM: Graph-based Dynamic Topic Models

Ghoorchian, Kambiz, 1981- (author): KTH,Programvaruteknik och datorsystem, SCS,KTH Royal Institute of Technology, Sweden

Sahlgren, Magnus (author): RISE,Datavetenskap

(creator_code:org_t)

2020-05-15
2020
English.
In: Progress in Artificial Intelligence. - : Springer Nature. - 2192-6352 .- 2192-6360. ; 9, s. 195-207

Related links:: https://doi.org/10.1...; show more...; https://link.springe...; https://urn.kb.se/re...; https://doi.org/10.1...; https://urn.kb.se/re...; show less...

Journal article (peer-reviewed)

Abstract Subject headings

Dynamic Topic Modeling (DTM) is the ultimate solution for extracting topics from short texts generated in Online Social Networks (OSNs) like Twitter. A DTM solution is required to be scalable and to be able to account for sparsity in short texts and dynamicity of topics. Current solutions combine probabilistic mixture models like Dirichlet Multinomial or PitmanYor Process with approximate inference approaches like Gibbs Sampling and Stochastic Variational Inference to, respectively, account for dynamicity and scalability in DTM. However, these solutions rely on weak probabilistic language models, which do not account for sparsity in short texts. In addition, their inference is based on iterative optimization algorithms, which have scalability issues when it comes to DTM. We present GDTM, a single-pass graph-based DTM algorithm, to solve the problem. GDTM combines a context-rich and incremental feature representation model, called Random Indexing (RI), with a novel online graph partitioning algorithm to address scalability and dynamicity. In addition, GDTM uses a rich language modeling approach based on the Skip-gram technique to account for sparsity. We run multiple experiments over a large-scale Twitter dataset to analyze the accuracy and scalability of GDTM and compare the results with four state-of-the-art approaches. The results show that GDTM outperforms the best approach by 11% on accuracy and performs by an order of magnitude faster while creating 4 times better topic quality over standard evaluation metrics.

Find in a library

Progress in Artificial Intelligence (Search for host publication in LIBRIS)

To the university's database

1 of 1
Previous record
Next record
To hitlist

Find more in SwePub

By the author/editor: Ghoorchian, Kamb ...; Sahlgren, Magnus

About the subject

NATURAL SCIENCES: NATURAL SCIENCES; and Computer and Inf ...; and Computer Science ...

Articles in the publication: Progress in Arti ...

By the university: Royal Institute of Technology; RISE

Search outside SwePub

Extend your search to:: Google; Google Book Search; Google Scholar

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

LIBRIS.kb.se

GDTM: Graph-based Dynamic Topic Models

Subject headings

Keyword

Publication and Content Type

Find in a library

To the university's database

Find more in SwePub

Search outside SwePub