Sökning: onr:"swepub:oai:DiVA.org:kth-339917" >
Kub :
Kub : Enabling Elastic HPC Workloads on Containerized Environments
-
- Araújo De Medeiros, Daniel (författare)
- KTH,Beräkningsvetenskap och beräkningsteknik (CST)
-
- Wahlgren, Jacob (författare)
- KTH,Beräkningsvetenskap och beräkningsteknik (CST)
-
- Schieffer, Gabin (författare)
- KTH,Beräkningsvetenskap och beräkningsteknik (CST)
-
visa fler...
-
- Peng, Ivy Bo (författare)
- KTH,Beräkningsvetenskap och beräkningsteknik (CST)
-
visa färre...
-
(creator_code:org_t)
- Institute of Electrical and Electronics Engineers (IEEE), 2023
- 2023
- Engelska.
-
Ingår i: Proceedings of the 35th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD). - : Institute of Electrical and Electronics Engineers (IEEE).
- Relaterad länk:
-
https://urn.kb.se/re...
-
visa fler...
-
https://doi.org/10.1...
-
visa färre...
Abstract
Ämnesord
Stäng
- The conventional model of resource allocation in HPC systems is static. Thus, a job cannot leverage newly available resources in the system or release underutilized resources during the execution. In this paper, we present Kub, a methodology that enables elastic execution of HPC workloads on Kubernetes so that the resources allocated to a job can be dynamically scaled during the execution. One main optimization of our method is to maximize the reuse of the originally allocated resources so that the disruption to the running job can be minimized. The scaling procedure is coordinated among nodes through remote procedure calls on Kubernetes for deploying workloads in the cloud. We evaluate our approach using one synthetic benchmark and two production-level MPI-based HPC applications - GRO-MACS and CM1. Our results demonstrate that the benefits of adapting the allocated resources depend on the workload characteristics. In the tested cases, a properly chosen scaling point for increasing resources during execution achieved up to 2x speedup. Also, the overhead of checkpointing and data reshuffling significantly influences the selection of optimal scaling points and requires application-specific knowledge.
Ämnesord
- NATURVETENSKAP -- Data- och informationsvetenskap -- Datavetenskap (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Computer Sciences (hsv//eng)
Nyckelord
- HPC
- Cloud
- scaling
- Kubernetes
- Elasticity
- Malleability
- Datalogi
- Computer Science
Publikations- och innehållstyp
- ref (ämneskategori)
- kon (ämneskategori)