Sökning: onr:"swepub:oai:DiVA.org:kth-170185" >
Lightweight Asynchr...
Lightweight Asynchronous Snapshots for Distributed Dataflows
-
- Carbone, Paris, 1986- (författare)
- KTH,Programvaruteknik och Datorsystem, SCS
-
- Fóra, Gyula (författare)
- CSL Computer Systems Laboratory, SICS Swedish Institute of Compute Science
-
- Ewen, Stephan (författare)
- Data Artisans GmbH
-
visa fler...
-
- Haridi, Seif (författare)
- KTH,Programvaruteknik och Datorsystem, SCS
-
- Tzoumas, Kostas (författare)
- Data Artisans GmbH
-
visa färre...
-
(creator_code:org_t)
- ISBN 9789175956510
- 2015
- Engelska 8 s.
-
Serie: TRITA-ICT ; 2015:08
- Relaterad länk:
-
https://kth.diva-por... (primary) (Raw object)
-
visa fler...
-
https://urn.kb.se/re...
-
visa färre...
Abstract
Ämnesord
Stäng
- Distributed stateful stream processing enables the deployment and execution of large scale continuous computations in the cloud, targeting both low latency and high throughput. One of the most fundamental challenges of this paradigm is providing processing guarantees under potential failures. Existing approaches rely on periodic global state snapshots that can be used for failure recovery. Those approaches suffer from two main drawbacks. First, they often stall the overall computation which impacts ingestion. Second, they eagerly persist all records in transit along with the operation states which results in larger snapshots than required. In this work we propose Asynchronous Barrier Snapshotting (ABS), a lightweight algorithm suited for modern dataflow execution engines that minimises space requirements. ABS persists only operator states on acyclic execution topologies while keeping a minimal record log on cyclic dataflows. We implemented ABS on Apache Flink, a distributed analytics engine that supports stateful stream processing. Our evaluation shows that our algorithm does not have a heavy impact on the execution, maintaining linear scalability and performing well with frequent snapshots.
Ämnesord
- TEKNIK OCH TEKNOLOGIER -- Elektroteknik och elektronik -- Datorsystem (hsv//swe)
- ENGINEERING AND TECHNOLOGY -- Electrical Engineering, Electronic Engineering, Information Engineering -- Computer Systems (hsv//eng)
Nyckelord
- fault tolerance
- distributed computing
- stream processing
- dataflow
- cloud computing
- state management
- Informations- och kommunikationsteknik
- Information and Communication Technology
- Computer Science
- Datalogi
Publikations- och innehållstyp
- vet (ämneskategori)
- rap (ämneskategori)
Hitta via bibliotek
Till lärosätets databas