SwePub
Sök i LIBRIS databas

  Utökad sökning

WFRF:(Li Chun Biu)
 

Sökning: WFRF:(Li Chun Biu) > Shape-aware stochas...

Shape-aware stochastic neighbor embedding for robust data visualisations

Wängberg, Tobias (författare)
Stockholms universitet,Matematiska institutionen
Tyrcha, Joanna, 1956- (författare)
Stockholms universitet,Matematiska institutionen
Li, Chun-Biu (författare)
Stockholms universitet,Matematiska institutionen
 (creator_code:org_t)
2022-11-14
2022
Engelska.
Ingår i: BMC Bioinformatics. - : Springer Science and Business Media LLC. - 1471-2105. ; 23:1
  • Tidskriftsartikel (refereegranskat)
Abstract Ämnesord
Stäng  
  • Background: The t-distributed Stochastic Neighbor Embedding (t-SNE) algorithm has emerged as one of the leading methods for visualising high-dimensional (HD) data in a wide variety of fields, especially for revealing cluster structure in HD single-cell transcriptomics data. However, t-SNE often fails to correctly represent hierarchical relationships between clusters and creates spurious patterns in the embedding. In this work we generalised t-SNE using shape-aware graph distances to mitigate some of the limitations of the t-SNE. Although many methods have been recently proposed to circumvent the shortcomings of t-SNE, notably Uniform manifold approximation (UMAP) and Potential of heat diffusion for affinity-based transition embedding (PHATE), we see a clear advantage of the proposed graph-based method.Results: The superior performance of the proposed method is first demonstrated on simulated data, where a significant improvement compared to t-SNE, UMAP and PHATE, based on quantitative validation indices, is observed when visualising imbalanced, nonlinear, continuous and hierarchically structured data. Thereafter the ability of the proposed method compared to the competing methods to create faithfully low-dimensional embeddings is shown on two real-world data sets, the single-cell transcriptomics data and the MNIST image data. In addition, the only hyper-parameter of the method can be automatically chosen in a data-driven way, which is consistently optimal across all test cases in this study.Conclusions: In this work we show that the proposed shape-aware stochastic neighbor embedding method creates low-dimensional visualisations that robustly and accurately reveal key structures of high-dimensional data.

Ämnesord

NATURVETENSKAP  -- Data- och informationsvetenskap (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences (hsv//eng)
NATURVETENSKAP  -- Matematik -- Sannolikhetsteori och statistik (hsv//swe)
NATURAL SCIENCES  -- Mathematics -- Probability Theory and Statistics (hsv//eng)

Nyckelord

Data visualisation
Dimensionality reduction
Graph distance
Dimensionality reduction validation

Publikations- och innehållstyp

ref (ämneskategori)
art (ämneskategori)

Hitta via bibliotek

Till lärosätets databas

Hitta mer i SwePub

Av författaren/redakt...
Wängberg, Tobias
Tyrcha, Joanna, ...
Li, Chun-Biu
Om ämnet
NATURVETENSKAP
NATURVETENSKAP
och Data och informa ...
NATURVETENSKAP
NATURVETENSKAP
och Matematik
och Sannolikhetsteor ...
Artiklar i publikationen
BMC Bioinformati ...
Av lärosätet
Stockholms universitet

Sök utanför SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy