Free Atomics : Hardware Atomic Operations without Fences
- Asgharzadeh, Ashkan (author)
  Univ Murcia, Comp Engn Dept, Murcia 30100, Spain.
- Cebrian, Juan M. (author)
  Univ Murcia, Comp Engn Dept, Murcia 30100, Spain.
- Perais, Arthur (author)
  Univ Grenoble Alpes, CNRS, Grenoble INP, Inst Engn, TIMA, Grenoble, France.
- Kaxiras, Stefanos (author)
  Uppsala universitet, Datorarkitektur och datorkommunikation, Avdelningen för datorteknik
- Ros, Alberto (author)
  Univ Murcia, Comp Engn Dept, Murcia 30100, Spain.
- 2022-06-11
- 2022
- English.
Part of: PROCEEDINGS OF THE 2022 THE 49TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA '22). - New York, NY, USA : Association for Computing Machinery (ACM). - ISBN 9781450386104 ; pp. 14-26
- Related links:
- https://urn.kb.se/re...
- https://doi.org/10.1...
Abstract
- Atomic Read-Modify-Write (RMW) instructions are primitive synchronization operations implemented in hardware that provide the building blocks for higher-abstraction synchronization mechanisms to programmers. According to publicly available documentation, current x86 implementations serialize atomic RMW operations, i.e., the store buffer is drained before issuing atomic RMWs and subsequent memory operations are stalled until the atomic RMW commits. This serialization, carried out by memory fences, incurs a performance cost which is expected to increase with deeper pipelines. This work proposes Free atomics, a lightweight, speculative, deadlock-free implementation of atomic operations that removes the need for memory fences, thus improving performance, while preserving atomicity and consistency. Free atomics is, to the best of our knowledge, the first proposal to enable store-to-load forwarding for atomic RMWs. Free atomics only requires simple modifications and incurs a small area overhead (15 bytes). Our evaluation using gem5-20 shows that, for a 32-core configuration, Free atomics improves performance by 12.5%, on average, for a large range of parallel workloads and 25.2%, on average, for atomic-intensive parallel workloads over a fenced atomic RMW implementation.
Subject headings
- NATURAL SCIENCES -- Computer and Information Sciences -- Computer Engineering (hsv//eng)
- NATURAL SCIENCES -- Computer and Information Sciences -- Computer Sciences (hsv//eng)
Keywords
- Multi-core architectures
- microarchitecture
- atomic Read-Modify-Write instructions
- Total-Store-Order (TSO)
- store-to-load forwarding
Publication and content type
- ref (subject category)
- kon (subject category)