Sökning: id:"swepub:oai:DiVA.org:uu-315176" >
Concise Loads and S...
Concise Loads and Stores : The Case for an Asymmetric Compute-Memory Architecture for Approximation
-
- Jain, Animesh (författare)
- Univ Michigan, Ann Arbor, MI 48109 USA.
-
- Hill, Parker (författare)
- Univ Michigan, Ann Arbor, MI 48109 USA.
-
- Lin, Shih-Chieh (författare)
- Univ Michigan, Ann Arbor, MI 48109 USA.
-
visa fler...
-
- Khan, Muneeb (författare)
- Uppsala universitet,Datorarkitektur och datorkommunikation
-
- Haque, Md E. (författare)
- Univ Michigan, Ann Arbor, MI 48109 USA.
-
- Laurenzano, Michael A. (författare)
- Univ Michigan, Ann Arbor, MI 48109 USA.
-
- Mahlke, Scott (författare)
- Univ Michigan, Ann Arbor, MI 48109 USA.
-
- Tang, Lingjia (författare)
- Univ Michigan, Ann Arbor, MI 48109 USA.
-
- Mars, Jason (författare)
- Univ Michigan, Ann Arbor, MI 48109 USA.
-
visa färre...
-
Univ Michigan, Ann Arbor, MI 48109 USA Datorarkitektur och datorkommunikation (creator_code:org_t)
- 2016
- 2016
- Engelska.
-
Ingår i: 2016 49Th Annual IEEE/ACM International Symposium On Microarchitecture (MICRO). - 9781509035083
- Relaterad länk:
-
https://urn.kb.se/re...
Abstract
Ämnesord
Stäng
- Cache capacity and memory bandwidth play critical roles in application performance, particularly for data-intensive applications from domains that include machine learning, numerical analysis, and data mining. Many of these applications are also tolerant to imprecise inputs and have loose constraints on the quality of output, making them ideal candidates for approximate computing. This paper introduces a novel approximate computing technique that decouples the format of data in the memory hierarchy from the format of data in the compute subsystem to significantly reduce the cost of storing and moving bits throughout the memory hierarchy and improve application performance. This asymmetric compute-memory extension to conventional architectures, ACME, adds two new instruction classes to the ISA - load-concise and store-concise - along with three small functional units to the micro-architecture to support these instructions. ACME does not affect exact execution of applications and comes into play only when concise memory operations are used. Through detailed experimentation we find that ACME is very effective at trading result accuracy for improved application performance. Our results show that ACME achieves a 1.3x speedup ( up to 1.8x) while maintaining 99% accuracy, or a 1.1x speedup while maintaining 99.999% accuracy. Moreover, our approach incurs negligible area and power overheads, adding just 0.005% area and 0.1% power to a conventional modern architecture.
Ämnesord
- NATURVETENSKAP -- Data- och informationsvetenskap (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences (hsv//eng)
Publikations- och innehållstyp
- ref (ämneskategori)
- kon (ämneskategori)
Hitta via bibliotek
Till lärosätets databas