SwePub

Search results for "WFRF:(Nrasimhamurthy Sai)"

  • Results 1-1 of 1
1.
  • Chien, Steven W. D., et al. (author)
  • Characterizing Deep-Learning I/O Workloads in TensorFlow
  • 2018
  • In: Proceedings of PDSW-DISCS 2018: 3rd Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems, held in conjunction with SC 2018: The International Conference for High Performance Computing, Networking, Storage and Analysis. Institute of Electrical and Electronics Engineers (IEEE), pp. 54-63
  • Conference paper (peer-reviewed). Abstract:
    • The performance of Deep-Learning (DL) computing frameworks relies on the performance of data ingestion and checkpointing. In fact, during training, a considerably high number of relatively small files are first loaded and pre-processed on CPUs and then moved to the accelerator for computation. In addition, checkpointing and restart operations are carried out to allow DL computing frameworks to restart quickly from a checkpoint. Because of this, I/O affects the performance of DL applications. In this work, we characterize the I/O performance and scaling of TensorFlow, an open-source programming framework developed by Google and specifically designed for solving DL problems. To measure TensorFlow I/O performance, we first design a micro-benchmark to measure TensorFlow reads, and then use a TensorFlow mini-application based on AlexNet to measure the performance cost of I/O and checkpointing in TensorFlow. To improve the checkpointing performance, we design and implement a burst buffer. We find that increasing the number of threads increases TensorFlow bandwidth by a maximum of 2.3x and 7.8x on our benchmark environments. The use of the TensorFlow prefetcher results in a complete overlap of computation on the accelerator and the input pipeline on the CPU, eliminating the effective cost of I/O on the overall performance. The use of a burst buffer to checkpoint to a fast, small-capacity storage and asynchronously copy the checkpoints to a slower, large-capacity storage resulted in a performance improvement of 2.6x with respect to checkpointing directly to the slower storage on our benchmark environment.
Publication type
conference paper (1)
Content type
peer-reviewed (1)
Author/editor
Markidis, Stefano (1)
Laure, Erwin (1)
Herman, Pawel (1)
Sishtla, Chaitanya P ... (1)
Chien, Steven W. D. (1)
Santos, Luis (1)
Nrasimhamurthy, Sai (1)
University
Kungliga Tekniska Högskolan (1)
Language
English (1)
Research subject (UKÄ/SCB)
Natural sciences (1)