SwePub
Sök i LIBRIS databas

  Extended search

onr:"swepub:oai:DiVA.org:kth-176121"
 

Search: onr:"swepub:oai:DiVA.org:kth-176121" > The power of both c...

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

The power of both choices : Practical load balancing for distributed stream processing engines

Nasir, M. Anis U. (author)
KTH,Kommunikationsnät
De Francisci Morales, G. (author)
García-Soriano, D. (author)
show more...
Kourtellis, N. (author)
Serafini, M. (author)
show less...
 (creator_code:org_t)
IEEE conference proceedings, 2015
2015
English.
In: Proceedings - International Conference on Data Engineering. - : IEEE conference proceedings. - 9781479979639 ; , s. 137-148
  • Conference paper (peer-reviewed)
Abstract Subject headings
Close  
  • We study the problem of load balancing in distributed stream processing engines, which is exacerbated in the presence of skew. We introduce Partial Key Grouping (PKG), a new stream partitioning scheme that adapts the classical 'power of two choices' to a distributed streaming setting by leveraging two novel techniques: key splitting and local load estimation. In so doing, it achieves better load balancing than key grouping while being more scalable than shuffle grouping. We test PKG on several large datasets, both real-world and synthetic. Compared to standard hashing, PKG reduces the load imbalance by up to several orders of magnitude, and often achieves nearly-perfect load balance. This result translates into an improvement of up to 60% in throughput and up to 45% in latency when deployed on a real Storm cluster.

Subject headings

TEKNIK OCH TEKNOLOGIER  -- Elektroteknik och elektronik (hsv//swe)
ENGINEERING AND TECHNOLOGY  -- Electrical Engineering, Electronic Engineering, Information Engineering (hsv//eng)

Keyword

Balancing
Distributed parameter control systems
Engines
Distributed stream processing
Distributed streaming
Large datasets
Load balance
Load imbalance
Novel techniques
Orders of magnitude
Power-of-two
Network management

Publication and Content Type

ref (subject category)
kon (subject category)

Find in a library

To the university's database

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Search outside SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Close

Copy and save the link in order to return to this view