SwePub
Sök i LIBRIS databas

  Utökad sökning

id:"swepub:oai:DiVA.org:uu-315176"
 

Sökning: id:"swepub:oai:DiVA.org:uu-315176" > Concise Loads and S...

Concise Loads and Stores : The Case for an Asymmetric Compute-Memory Architecture for Approximation

Jain, Animesh (författare)
Univ Michigan, Ann Arbor, MI 48109 USA.
Hill, Parker (författare)
Univ Michigan, Ann Arbor, MI 48109 USA.
Lin, Shih-Chieh (författare)
Univ Michigan, Ann Arbor, MI 48109 USA.
visa fler...
Khan, Muneeb (författare)
Uppsala universitet,Datorarkitektur och datorkommunikation
Haque, Md E. (författare)
Univ Michigan, Ann Arbor, MI 48109 USA.
Laurenzano, Michael A. (författare)
Univ Michigan, Ann Arbor, MI 48109 USA.
Mahlke, Scott (författare)
Univ Michigan, Ann Arbor, MI 48109 USA.
Tang, Lingjia (författare)
Univ Michigan, Ann Arbor, MI 48109 USA.
Mars, Jason (författare)
Univ Michigan, Ann Arbor, MI 48109 USA.
visa färre...
Univ Michigan, Ann Arbor, MI 48109 USA Datorarkitektur och datorkommunikation (creator_code:org_t)
2016
2016
Engelska.
Ingår i: 2016 49Th Annual IEEE/ACM International Symposium On Microarchitecture (MICRO). - 9781509035083
  • Konferensbidrag (refereegranskat)
Abstract Ämnesord
Stäng  
  • Cache capacity and memory bandwidth play critical roles in application performance, particularly for data-intensive applications from domains that include machine learning, numerical analysis, and data mining. Many of these applications are also tolerant to imprecise inputs and have loose constraints on the quality of output, making them ideal candidates for approximate computing. This paper introduces a novel approximate computing technique that decouples the format of data in the memory hierarchy from the format of data in the compute subsystem to significantly reduce the cost of storing and moving bits throughout the memory hierarchy and improve application performance. This asymmetric compute-memory extension to conventional architectures, ACME, adds two new instruction classes to the ISA - load-concise and store-concise - along with three small functional units to the micro-architecture to support these instructions. ACME does not affect exact execution of applications and comes into play only when concise memory operations are used. Through detailed experimentation we find that ACME is very effective at trading result accuracy for improved application performance. Our results show that ACME achieves a 1.3x speedup ( up to 1.8x) while maintaining 99% accuracy, or a 1.1x speedup while maintaining 99.999% accuracy. Moreover, our approach incurs negligible area and power overheads, adding just 0.005% area and 0.1% power to a conventional modern architecture.

Ämnesord

NATURVETENSKAP  -- Data- och informationsvetenskap (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences (hsv//eng)

Publikations- och innehållstyp

ref (ämneskategori)
kon (ämneskategori)

Hitta via bibliotek

Till lärosätets databas

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy