SwePub
Sök i LIBRIS databas

  Utökad sökning

WFRF:(McKee Sally A 1963)
 

Sökning: WFRF:(McKee Sally A 1963) > Active memory contr...

  • Fang, Z. (författare)

Active memory controller

  • Artikel/kapitelEngelska2012

Förlag, utgivningsår, omfång ...

  • 2012-01-17
  • Springer Science and Business Media LLC,2012

Nummerbeteckningar

  • LIBRIS-ID:oai:research.chalmers.se:500818bf-1553-48b5-ad5b-c87b80a95a28
  • https://research.chalmers.se/publication/165058URI
  • https://doi.org/10.1007/s11227-011-0735-9DOI

Kompletterande språkuppgifter

  • Språk:engelska
  • Sammanfattning på:engelska

Ingår i deldatabas

Klassifikation

  • Ämneskategori:art swepub-publicationtype
  • Ämneskategori:ref swepub-contenttype

Anmärkningar

  • Inability to hide main memory latency has been increasingly limiting the performance of modern processors. The problem is worse in large-scale shared memory systems, where remote memory latencies are hundreds, and soon thousands, of processor cycles. To mitigate this problem, we propose an intelligent memory and cache coherence controller (AMC) that can execute Active Memory Operations (AMOs). AMOs are select operations sent to and executed on the home memory controller of data. AMOs can eliminate a significant number of coherence messages, minimize intranode and internode memory traffic, and create opportunities for parallelism. Our implementation of AMOs is cache-coherent and requires no changes to the processor core or DRAM chips. In this paper, we present the microarchitecture design of AMC, and the programming model of AMOs. We compare AMOs' performance to that of several other memory architectures on a variety of scientific and commercial benchmarks. Through simulation, we show that AMOs offer dramatic performance improvements for an important set of data-intensive operations, e.g., up to 50x faster barriers, 12x faster spinlocks, 8.5x-15x faster stream/array operations, and 3x faster database queries. We also present an analytical model that can predict the performance benefits of using AMOs with decent accuracy. The silicon cost required to support AMOs is less than 1% of the die area of a typical high performance processor, based on a standard cell implementation.

Ämnesord och genrebeteckningar

Biuppslag (personer, institutioner, konferenser, titlar ...)

  • Zhang, L.Chinese Academy of Sciences (författare)
  • Carter, J. B.IBM Austin Research Laboratory (författare)
  • McKee, Sally A,1963Chalmers tekniska högskola,Chalmers University of Technology(Swepub:cth)mckee (författare)
  • Ibrahim, A.Advanced Micro Devices, Inc. (författare)
  • Parker, M. A. (författare)
  • Jiang, X. W.Intel Corporation (författare)
  • Chinese Academy of SciencesIBM Austin Research Laboratory (creator_code:org_t)

Sammanhörande titlar

  • Ingår i:Journal of Supercomputing: Springer Science and Business Media LLC62:1, s. 510-5491573-04840920-8542

Internetlänk

Hitta via bibliotek

Till lärosätets databas

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy