Sökning: id:"swepub:oai:DiVA.org:umu-132981" >
ScSF :
ScSF : a scheduling simulation framework
-
- Gonzalo P., Rodrigo, 1980- (författare)
- Umeå universitet,Institutionen för datavetenskap,Distributed Systems
-
- Elmroth, Erik (författare)
- Umeå universitet,Institutionen för datavetenskap,Distributed Systems
-
- Östberg, Per-Olov (författare)
- Umeå universitet,Institutionen för datavetenskap,Distributed Systems
-
visa fler...
-
- Ramakrishnan, Lavanya (författare)
- Lawrence Berkeley National Lab, Berkeley, California, USA
-
visa färre...
-
(creator_code:org_t)
- 2018-02-28
- 2018
- Engelska.
-
Ingår i: Job Scheduling Strategies for Parallel Processing. - Cham : Springer. - 9783319773971 - 9783319773988 ; , s. 152-173
- Relaterad länk:
-
https://umu.diva-por... (primary) (Raw object)
-
visa fler...
-
http://umu.diva-port...
-
https://urn.kb.se/re...
-
https://doi.org/10.1...
-
visa färre...
Abstract
Ämnesord
Stäng
- High-throughput and data-intensive applications are increasingly present, often composed as workflows, in the workloads of current HPC systems. At the same time, trends for future HPC systems point towards more heterogeneous systems with deeper I/O and memory hierarchies. However, current HPC schedulers are designed to support classical large tightly coupled parallel jobs over homogeneous systems. Therefore, There is an urgent need to investigate new scheduling algorithms that can manage the future workloads on HPC systems. However, there is a lack of appropriate models and frameworks to enable development, testing, and validation of new scheduling ideas.In this paper, we present an open-source scheduler simulation framework (ScSF) that covers all the steps of scheduling research through simulation. ScSF provides capabilities for workload modeling, workload generation, system simulation, comparative workload analysis, and experiment orchestration. The simulator is designed to be run over a distributed computing infrastructure enabling to test at scale. We describe in detail a use case of ScSF to develop new techniques to manage scientific workflows in a batch scheduler. In the use case, such technique was implemented in the framework scheduler. For evaluation purposes, 1728 experiments, equivalent to 33 years of simulated time, were run in a deployment of ScSF over a distributed infrastructure of 17 compute nodes during two months. Finally, the experimental results were analyzed in the framework to judge that the technique minimizes workflows’ turnaround time without over-allocating resources. Finally, we discuss lessons learned from our experiences that will help future researchers.
Ämnesord
- NATURVETENSKAP -- Data- och informationsvetenskap -- Datavetenskap (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Computer Sciences (hsv//eng)
Nyckelord
- slurm
- simulation
- scheduling
- HPC
- High Performance Computing
- workload
- generation
- analysis
- datalogi
- Computer Science
Publikations- och innehållstyp
- ref (ämneskategori)
- kon (ämneskategori)
Hitta via bibliotek
Till lärosätets databas