SwePub
Sök i LIBRIS databas

  Utökad sökning

onr:"swepub:oai:research.chalmers.se:0ebda652-0de7-48d1-afd5-a981cd8d3da5"
 

Sökning: onr:"swepub:oai:research.chalmers.se:0ebda652-0de7-48d1-afd5-a981cd8d3da5" > ROSE::FTTransform -...

ROSE::FTTransform - A source-to-source translation framework for exascale fault-tolerance research

Lidman, Jacob, 1985 (författare)
Chalmers tekniska högskola,Chalmers University of Technology
Quinlan, D. J. (författare)
Lawrence Livermore National Laboratory
Liao, C. (författare)
Lawrence Livermore National Laboratory
visa fler...
McKee, Sally A, 1963 (författare)
Chalmers tekniska högskola,Chalmers University of Technology
visa färre...
 (creator_code:org_t)
ISBN 9781467322645
2012
2012
Engelska.
Ingår i: IEEE/IFIP 42nd International Conference on Dependable Systems and Networks Workshops, DSN-W 2012; Boston, MA; United States; 25 June 2012 through 28 June 2012. - 9781467322645
  • Konferensbidrag (refereegranskat)
Abstract Ämnesord
Stäng  
  • Exascale computing systems will require sufficient resilience to tolerate numerous types of hardware faults while still assuring correct program execution. Such extreme-scale machines are expected to be dominated by processors driven at lower voltages (near the minimum 0.5 volts for current transistors). At these voltage levels, the rate of transient errors increases dramatically due to the sensitivity to transient and geographically localized voltage drops on parts of the processor chip. To achieve power efficiency, these processors are likely to be streamlined and minimal, and thus they cannot be expected to handle transient errors entirely in hardware. Here we present an open, compiler-based framework to automate the armoring of High Performance Computing (HPC) software to protect it from these types of transient processor errors. We develop an open infrastructure to support research work in this area, and we define tools that, in the future, may provide more complete automated and/or semi-automated solutions to support software resiliency on future exascale architectures. Results demonstrate that our approach is feasible, pragmatic in how it can be separated from the software development process, and reasonably efficient (0% to 30% overhead for the Jacobi iteration on common hardware; and 20%, 40%, 26%, and 2% overhead for a randomly selected subset of benchmarks from the Livermore Loops [1]).

Ämnesord

NATURVETENSKAP  -- Data- och informationsvetenskap -- Datavetenskap (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences -- Computer Sciences (hsv//eng)

Nyckelord

Fault Tolerance
High Performance Computing
Exascale
Source-to-Source Compiler
Redundancy

Publikations- och innehållstyp

kon (ämneskategori)
ref (ämneskategori)

Hitta via bibliotek

Till lärosätets databas

Sök utanför SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy