Understanding the behavior of in-memory computing workloads

Jiang, Tao (author)
Chinese Academy of Sciences
Zhang, Q. (author)
Chinese Academy of Sciences
Hou, Rui (author)
Chinese Academy of Sciences
Chai, L. (author)
Chinese Academy of Sciences
McKee, Sally A, 1963 (author)
Chalmers tekniska högskola (Chalmers University of Technology)
Jia, Z. (author)
Chinese Academy of Sciences
Sun, N. (author)
Chinese Academy of Sciences
ISBN 9781479964536
2014
English.
In: IISWC 2014 - IEEE International Symposium on Workload Characterization. - ISBN 9781479964536, pp. 22-30
  • Conference paper (peer-reviewed)
Abstract
  • © 2014 IEEE. The increasing demands of big data applications have led researchers and practitioners to turn to in-memory computing to speed up processing. For instance, the Apache Spark framework stores intermediate results in memory to deliver good performance on iterative machine learning and interactive data analysis tasks. To the best of our knowledge, though, little work has been done to understand Spark's architectural and microarchitectural behaviors. Furthermore, although conventional commodity processors have been well optimized for traditional desktops and HPC, their effectiveness for Spark workloads remains to be studied. To shed some light on the effectiveness of conventional general-purpose processors on Spark workloads, we study the behavior of Spark workloads in comparison to that of Hadoop, CloudSuite, SPEC CPU2006, TPC-C, and DesktopCloud. We evaluate the benchmarks on a 17-node Xeon cluster. Our performance results reveal that Spark workloads have significantly different characteristics from Hadoop and traditional HPC benchmarks. At the system level, Spark workloads have good memory bandwidth utilization (up to 50%), stable memory accesses, and high disk IO request frequency (200 per second). At the microarchitectural level, the cache and TLB are effective for Spark workloads, but the L2 cache miss rate is high. We hope this work yields insights for chip and datacenter system designers.

Subject headings

NATURAL SCIENCES  -- Computer and Information Sciences (hsv//eng)

Publication and content type

kon (subject category)
ref (subject category)
