Tyck till om SwePub Sök
här!
Sökning: onr:"swepub:oai:DiVA.org:uu-152251" >
Massive scale-out o...
Massive scale-out of expensive continuous queries
-
- Zeitler, Erik (författare)
- Uppsala universitet,Datalogi,UDBL
-
- Risch, Tore (författare)
- Uppsala universitet,Datalogi,UDBL
-
(creator_code:org_t)
- 2011
- 2011
- Engelska.
-
Ingår i: 36th International Conference on Very Large Data Bases.
- Relaterad länk:
-
https://urn.kb.se/re...
Abstract
Ämnesord
Stäng
- Scalable execution of expensive continuous queries over massive data streams requires input streams to be split into parallel sub-streams. The query operators are continuously executed in parallel over these sub-streams. Stream splitting involves both partitioning and replication of incoming tuples, depending on how the continuous query is parallelized. We provide a stream splitting operator that enables such customized stream splitting. However, it is critical that the stream splitting itself keeps up with input streams of high volume. This is a problem when the stream splitting predicates have some costs. Therefore, to enable customized splitting of high-volume streams, we introduce a parallelized stream splitting operator, called parasplit. We investigate the performance of parasplit using a cost model and experimentally. Based on these results, a heuristic is devised to automatically parallelize the execution of parasplit. We show that the maximum stream rate of parasplit approaches network speed, and that the parallelization is resource efficient. Finally, the scalability of our approach is experimentally demonstrated on the Linear Road Benchmark, showing an order of magnitude higher stream processing rate over previously published results, allowing at least 512 expressways.
Ämnesord
- NATURVETENSKAP -- Data- och informationsvetenskap -- Datavetenskap (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Computer Sciences (hsv//eng)
Nyckelord
- Computer science
- Datavetenskap
- Databases
- Databaser
- Datavetenskap med inriktning mot databasteknik
- Computer Science with specialization in Database Technology
Publikations- och innehållstyp
- ref (ämneskategori)
- kon (ämneskategori)