Sökning: onr:"swepub:oai:DiVA.org:kth-63630" >
Area and Performanc...
-
Chen, XiaowenKTH,Elektroniksystem
(författare)
Area and Performance Optimization of Barrier Synchronization on Multi-core Network-on-Chips
- Artikel/kapitelEngelska2010
Förlag, utgivningsår, omfång ...
Nummerbeteckningar
-
LIBRIS-ID:oai:DiVA.org:kth-63630
-
https://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-63630URI
Kompletterande språkuppgifter
-
Språk:engelska
-
Sammanfattning på:engelska
Ingår i deldatabas
Klassifikation
-
Ämneskategori:ref swepub-contenttype
-
Ämneskategori:kon swepub-publicationtype
Anmärkningar
-
Key: Nostrum. QC 20120425
-
Barrier synchronization is commonly and widelyused to synchronize the execution of parallel processor coreson multi-core Network-on-Chips (NoCs). Since its globalnature may cause heavy serialization resulting in largeperformance penalty, barrier synchronization should becarefully designed to have low latency communication and tominimize overall completion time. Therefore, in the paper, wepropose a fast barrier synchronization mechanism, targetingMulti-core NoCs. The fast barrier synchronization mechanismincludes a dedicated hardware module, named Fast BarrierSynchronizer (FBS), integrated with each processor node. Itoffers a set of barrier counters and can concurrently processsynchronization requests issued by the local node and remotenodes via the on-chip network. The salient feature of our fastbarrier synchronization mechanism is that, once the barriercondition is reached, the “barrier release” acknowledgement isrouted to all processor nodes in a broadcast way in order tosave chip area by avoiding storing source node informationand to minimize completion time by avoiding serialization ofbarrier releasing. Synthesis results suggest that the FBS canrun over 1 GHz in SMIC® 130nm technology with small areaoverhead. We implemented a FBS-enhanced multi-core NoCarchitecture on our FPGA platform using the Xilinx® Virtex 5as the FPGA chip. FPGA utilization and simulation resultsshow that our fast barrier synchronization demonstrates botharea and performance advantages over the barriersynchronization counterpart with unicast barrier releasing.
Ämnesord och genrebeteckningar
Biuppslag (personer, institutioner, konferenser, titlar ...)
-
Chen, Shuming
(författare)
-
Lu, ZhonghaiKTH,Elektroniksystem(Swepub:kth)u1wqy6h0
(författare)
-
Jantsch, AxelKTH,Elektroniksystem(Swepub:kth)u18wycaf
(författare)
-
KTHElektroniksystem
(creator_code:org_t)
Sammanhörande titlar
-
Ingår i:3rd IEEE International Conference on Computer and Electrical Engineering (ICCEE)
Internetlänk