Result list for search "L773:9781450321303"

Search: L773:9781450321303

Result 1-2 of 2

Enumeration	Reference	Cover	Find
1.	Ciobanu, Catalin, 1983, et al. (author)
	FASTER run-time reconfiguration management
	2013. In: Proceedings of the International Conference on Supercomputing. - New York, NY, USA : ACM. - 9781450321303 ; p. 463-
	Conference paper (peer-reviewed)
	Abstract: The FASTER project Run-Time System Manager relieves programmers of low-level operations by performing task placement, scheduling, and dynamic FPGA reconfiguration. It also manages device fragmentation, configuration caching, pre-fetching and reuse, and bitstream compression, and optimizes the system's thermal and power footprints. We propose a micro-reconfiguration aware, configuration content agnostic ISA interface and a technology independent Task Configuration Microcode format targeting Maxeler Data Flow computers and Xilinx XUPV5 platforms. We achieve improved resource utilization with negligible performance overhead. Up to 4 Gbps for DMA transfers and up to 3 Gbps for FPGA reconfiguration are achieved on Xilinx Virtex-5/6 devices.
2.	Koukos, Konstantinos, et al. (author)
	Towards more efficient execution : a decoupled access-execute approach
	2013. In: Proc. 27th ACM International Conference on Supercomputing. - New York : ACM Press. - 9781450321303 ; p. 253-262
	Conference paper (peer-reviewed)
	Abstract: The end of Dennard scaling is expected to shrink the range of DVFS in future nodes, limiting the energy savings of this technique. This paper evaluates how much we can increase the effectiveness of DVFS by using a software decoupled access-execute approach. Decoupling the data access from execution allows us to apply optimal voltage-frequency selection for each phase and therefore improve energy efficiency over standard coupled execution. The underlying insight of our work is that by decoupling access and execute we can take advantage of the memory-bound nature of the access phase and the compute-bound nature of the execute phase to optimize power efficiency, while maintaining good performance. To demonstrate this, we built a task-based parallel execution infrastructure consisting of: (1) a runtime system to orchestrate the execution, (2) power models to predict optimal voltage-frequency selection at runtime, (3) a modeling infrastructure based on hardware measurements to simulate zero-latency, per-core DVFS, and (4) a hardware measurement infrastructure to verify our model's accuracy. Based on real hardware measurements, we project that the combination of decoupled access-execute and DVFS has the potential to improve EDP by 25% without hurting performance. On memory-bound applications we significantly improve performance due to increased MLP in the access phase and ILP in the execute phase. Furthermore, we demonstrate that our method can achieve high performance both in the presence and in the absence of a hardware prefetcher.

Refine your search

Type of publication: conference paper (2)

Type of content: peer-reviewed (2)

Author/Editor: Kaxiras, Stefanos (1); Black-Schaffer, Davi ... (1); Koukos, Konstantinos (1); Gaydadjiev, Georgi, ... (1); Ciobanu, Catalin, 19 ... (1); Pnevmatikatos, Dioni ... (1); Papadimitriou, Kypri ... (1); Spiliopoulos, Vasile ... (1)

University: Uppsala University (1); Chalmers University of Technology (1)

Language: English (2)

Research subject (UKÄ/SCB): Engineering and Technology (2); Natural sciences (1)

Year: 2013 (2)

Copyright © LIBRIS - National Library Systems
LIBRIS.kb.se
