Performance Optimization Techniques and Tools for Distributed Graph Processing
- Kalavri, Vasiliki, 1986- (author)
- KTH, Programvaruteknik och Datorsystem, SCS
- Vlassov, Vladimir (thesis advisor)
- KTH, Programvaruteknik och Datorsystem, SCS
- Schulte, Christian (thesis advisor)
- KTH, Programvaruteknik och Datorsystem, SCS
- Haridi, Seif (thesis advisor)
- KTH, Programvaruteknik och Datorsystem, SCS
- Van Roy, Peter (thesis advisor)
- Université catholique de Louvain
- Boncz, Peter (opponent)
- CWI Amsterdam
- ISBN 9789177291015
- KTH Royal Institute of Technology, 2016
- English, 158 pp.
- Series: TRITA-ICT ; 2016:25
- Related links:
- https://kth.diva-por... (primary) (Raw object)
- https://urn.kb.se/re...
Abstract
- In this thesis, we propose optimization techniques for distributed graph processing. First, we describe a data processing pipeline that leverages an iterative graph algorithm for automatic classification of web trackers. Using this application as a motivating example, we examine how asymmetrical convergence of iterative graph algorithms can be used to reduce the amount of computation and communication in large-scale graph analysis. We propose an optimization framework for fixpoint algorithms and a declarative API for writing fixpoint applications. Our framework uses a cost model to automatically exploit asymmetrical convergence and evaluate execution strategies at runtime. We show that our cost model achieves a speedup of up to 1.7x and communication savings of up to 54%. Next, we propose to use the concepts of semi-metricity and the metric backbone to reduce the amount of data that needs to be processed in large-scale graph analysis. We provide a distributed algorithm for computing the metric backbone using the vertex-centric programming model. Using the backbone, we can reduce graph sizes by up to 88% and achieve a speedup of up to 6.7x.
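To make the notion of semi-metricity in the abstract concrete, here is a minimal single-machine sketch in Python. It assumes the common definition: an edge is semi-metric when some indirect path between its endpoints is strictly shorter than the edge itself, and the metric backbone is the subgraph that keeps only the remaining (metric) edges. This is not the distributed vertex-centric algorithm from the thesis; the function names and the toy graph are hypothetical and chosen only for illustration.

```python
import heapq


def shortest_distances(adj, source):
    """Dijkstra's algorithm: shortest path lengths from source to all reachable vertices."""
    dist = {source: 0.0}
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist.get(u, float("inf")):
            continue  # stale heap entry
        for v, w in adj[u].items():
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return dist


def metric_backbone(edges):
    """Keep only metric edges of an undirected, positively weighted graph.

    An edge (u, v, w) is dropped as semi-metric when the shortest
    path from u to v is strictly shorter than w.
    """
    adj = {}
    for u, v, w in edges:
        adj.setdefault(u, {})[v] = w
        adj.setdefault(v, {})[u] = w

    # Compute shortest distances once per vertex that starts an edge.
    dist_from = {u: shortest_distances(adj, u) for u, _, _ in edges}
    return [(u, v, w) for u, v, w in edges if dist_from[u][v] >= w]


# Hypothetical toy graph: the direct edge a-c (3.0) is longer than a-b-c (2.0).
edges = [("a", "b", 1.0), ("b", "c", 1.0), ("a", "c", 3.0), ("c", "d", 2.0)]
backbone = metric_backbone(edges)
print(backbone)
```

In this toy graph only the edge between `a` and `c` is removed, and all shortest path lengths are unchanged, which is the property that lets later analysis run on the smaller graph.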
Subject headings
- TEKNIK OCH TEKNOLOGIER -- Elektroteknik och elektronik -- Datorsystem (hsv//swe)
- ENGINEERING AND TECHNOLOGY -- Electrical Engineering, Electronic Engineering, Information Engineering -- Computer Systems (hsv//eng)
Keyword
- Graph processing
- distributed systems
- big data
- Informations- och kommunikationsteknik
- Information and Communication Technology
Publication and Content Type
- vet (subject category)
- dok (subject category)