SwePub

  • Feliu, Josue (Univ Murcia, Comp Engn Dept, Murcia 30100, Spain) (author)

Speculative inter-thread store-to-load forwarding in SMT architectures

  • Article/chapter, English, 2023

Publisher, publication year, extent ...

  • Elsevier, 2023
  • print (rdacarrier)

Identifiers

  • LIBRIS-ID: oai:DiVA.org:uu-492103
  • URI: https://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-492103
  • DOI: https://doi.org/10.1016/j.jpdc.2022.11.007

Supplementary language information

  • Language: English
  • Summary in: English

Classification

  • Subject category: ref (swepub-contenttype)
  • Subject category: art (swepub-publicationtype)

Notes

  • Applications running on out-of-order cores have benefited for decades from store-to-load forwarding, which accelerates the communication of store values to loads of the same thread. Although threads running on a simultaneous multithreading (SMT) core could also access the load queues (LQ) and store queues (SQ) / store buffers (SB) of other threads to allow inter-thread store-to-load forwarding, this opportunity has gone unexploited: if different SMT threads communicate via their LQs and SQs/SBs, write atomicity may be violated with respect to the outside world, beyond the acceptable model of read-own-write-early multiple-copy atomicity (rMCA). In our prior work, we leveraged this idea to propose inter-thread store-to-load forwarding (ITSLF). ITSLF accelerates synchronization and communication of threads running on a simultaneous multithreading processor by allowing stores in the store queue of one thread to forward data to loads of another thread running on the same core without violating rMCA. In this work, we extend the original ITSLF mechanism to allow inter-thread forwarding from speculative stores (Spec-ITSLF). Spec-ITSLF allows forwarding store values to other threads earlier, which further accelerates synchronization. Spec-ITSLF outperforms a baseline SMT core by 15%, which is 2% better on average (and up to 5% for the TATP workload) than the original ITSLF mechanism. More importantly, Spec-ITSLF is on par with the original ITSLF mechanism regarding storage overhead but does not need to keep track of the speculative state of stores, which was an important source of overhead and complexity in the original mechanism. © 2022 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
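
The basic idea of store-to-load forwarding described in the abstract can be illustrated with a toy software model. The sketch below is not the authors' mechanism: the names `HwThread`, `search_sq`, and `load` are hypothetical, and the model deliberately omits speculation and whatever conditions the actual design uses to preserve rMCA, since the abstract does not describe them. It only shows where a load may find its value: the issuing thread's own store queue, optionally a sibling thread's store queue, or memory.

```python
from dataclasses import dataclass, field


@dataclass
class Store:
    addr: int
    value: int


@dataclass
class HwThread:
    """Hypothetical SMT hardware thread with a private store queue (oldest first)."""
    sq: list = field(default_factory=list)

    def store(self, addr, value):
        # Buffer the store; it is not yet visible in memory.
        self.sq.append(Store(addr, value))

    def drain_oldest(self, memory):
        # Write the oldest buffered store to memory and remove it from the queue.
        s = self.sq.pop(0)
        memory[s.addr] = s.value


def search_sq(thread, addr):
    """Return the youngest buffered store to `addr` in `thread`, or None."""
    for s in reversed(thread.sq):
        if s.addr == addr:
            return s
    return None


def load(thread, siblings, memory, addr, inter_thread=False):
    """Return (value, source) for a load issued by `thread`.

    Assumption: sibling store queues are searched only when `inter_thread`
    is True, and with no ordering or atomicity checks at all.
    """
    hit = search_sq(thread, addr)
    if hit is not None:
        return hit.value, "own-sq"
    if inter_thread:
        for other in siblings:
            hit = search_sq(other, addr)
            if hit is not None:
                return hit.value, "sibling-sq"
    return memory.get(addr, 0), "memory"


memory = {0x40: 0}
t0, t1 = HwThread(), HwThread()
t0.store(0x40, 1)  # buffered in t0's store queue only

same_thread = load(t0, [t1], memory, 0x40)
no_forwarding = load(t1, [t0], memory, 0x40)
with_forwarding = load(t1, [t0], memory, 0x40, inter_thread=True)
```

In this toy model, the load from `t1` sees the old value in memory until `t0`'s store drains, unless inter-thread forwarding is enabled, in which case it reads the buffered value directly from the sibling's store queue.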

Added entries (persons, institutions, conferences, titles ...)

  • Ros, Alberto (Univ Murcia, Comp Engn Dept, Murcia 30100, Spain) (author)
  • Acacio, Manuel E. (Univ Murcia, Comp Engn Dept, Murcia 30100, Spain) (author)
  • Kaxiras, Stefanos (Uppsala universitet, Datorarkitektur och datorkommunikation, Avdelningen för datorteknik; Uppsala Univ, Dept Informat Technol, S-75236 Uppsala, Sweden) (Swepub:uu) steka984 (author)
  • Univ Murcia, Comp Engn Dept, Murcia 30100, Spain; Datorarkitektur och datorkommunikation (creator_code:org_t)

Related titles

  • Part of: Journal of Parallel and Distributed Computing. Elsevier. 173, pp. 94-106. ISSN 0743-7315, 1096-0848
