SwePub
Sök i LIBRIS databas

  Utökad sökning

WFRF:(McKee Sally A 1963)
 

Sökning: WFRF:(McKee Sally A 1963) > Active memory contr...

Active memory controller

Fang, Z. (författare)
Zhang, L. (författare)
Chinese Academy of Sciences
Carter, J. B. (författare)
IBM Austin Research Laboratory
visa fler...
McKee, Sally A, 1963 (författare)
Chalmers tekniska högskola,Chalmers University of Technology
Ibrahim, A. (författare)
Advanced Micro Devices, Inc.
Parker, M. A. (författare)
Jiang, X. W. (författare)
Intel Corporation
visa färre...
 (creator_code:org_t)
2012-01-17
2012
Engelska.
Ingår i: Journal of Supercomputing. - : Springer Science and Business Media LLC. - 1573-0484 .- 0920-8542. ; 62:1, s. 510-549
  • Tidskriftsartikel (refereegranskat)
Abstract Ämnesord
Stäng  
  • Inability to hide main memory latency has been increasingly limiting the performance of modern processors. The problem is worse in large-scale shared memory systems, where remote memory latencies are hundreds, and soon thousands, of processor cycles. To mitigate this problem, we propose an intelligent memory and cache coherence controller (AMC) that can execute Active Memory Operations (AMOs). AMOs are select operations sent to and executed on the home memory controller of data. AMOs can eliminate a significant number of coherence messages, minimize intranode and internode memory traffic, and create opportunities for parallelism. Our implementation of AMOs is cache-coherent and requires no changes to the processor core or DRAM chips. In this paper, we present the microarchitecture design of AMC, and the programming model of AMOs. We compare AMOs' performance to that of several other memory architectures on a variety of scientific and commercial benchmarks. Through simulation, we show that AMOs offer dramatic performance improvements for an important set of data-intensive operations, e.g., up to 50x faster barriers, 12x faster spinlocks, 8.5x-15x faster stream/array operations, and 3x faster database queries. We also present an analytical model that can predict the performance benefits of using AMOs with decent accuracy. The silicon cost required to support AMOs is less than 1% of the die area of a typical high performance processor, based on a standard cell implementation.

Ämnesord

NATURVETENSKAP  -- Data- och informationsvetenskap (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences (hsv//eng)

Nyckelord

Memory architecture
Distributed shared memory
support
Interprocessor synchronization
architectures
Cache coherence
synchronization
chip
multiprocessors

Publikations- och innehållstyp

art (ämneskategori)
ref (ämneskategori)

Hitta via bibliotek

Till lärosätets databas

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy