SwePub
Sök i LIBRIS databas

  Extended search

onr:"swepub:oai:DiVA.org:umu-132983"
 

Search: onr:"swepub:oai:DiVA.org:umu-132983" > HPC scheduling in a...

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

HPC scheduling in a brave new world

Gonzalo P., Rodrigo, 1980- (author)
Umeå universitet,Institutionen för datavetenskap,Distributed Systems
Elmroth, Erik, Professor (thesis advisor)
Umeå universitet,Institutionen för datavetenskap
Ramakrishnan, Lavanya, PHd (thesis advisor)
Lawrence Berkeley National Lab, USA
show more...
Östberg, Per-Olov (thesis advisor)
Umeå universitet,Institutionen för datavetenskap
Deelman, Ewa, Professor (opponent)
USC Information Sciences Institute, Los Angeles, USA
show less...
 (creator_code:org_t)
ISBN 9789176016930
Umeå : Umeå universitet, 2017
English 122 s.
  • Doctoral thesis (other academic/artistic)
Abstract Subject headings
Close  
  • Many breakthroughs in scientific and industrial research are supported by simulations and calculations performed on high performance computing (HPC) systems. These systems typically consist of uniform, largely parallel compute resources and high bandwidth concurrent file systems interconnected by low latency synchronous networks. HPC systems are managed by batch schedulers that order the execution of application jobs to maximize utilization while steering turnaround time. In the past, demands for greater capacity were met by building more powerful systems with more compute nodes, greater transistor densities, and higher processor operating frequencies. Unfortunately, the scope for further increases in processor frequency is restricted by the limitations of semiconductor technology. Instead, parallelism within processors and in numbers of compute nodes is increasing, while the capacity of single processing units remains unchanged. In addition, HPC systems’ memory and I/O hierarchies are becoming deeper and more complex to keep up with the systems’ processing power. HPC applications are also changing: the need to analyze large data sets and simulation results is increasing the importance of data processing and data-intensive applications. Moreover, composition of applications through workflows within HPC centers is becoming increasingly important. This thesis addresses the HPC scheduling challenges created by such new systems and applications. It begins with a detailed analysis of the evolution of the workloads of three reference HPC systems at the National Energy Research Supercomputing Center (NERSC), with a focus on job heterogeneity and scheduler performance. This is followed by an analysis and improvement of a fairshare prioritization mechanism for HPC schedulers. The thesis then surveys the current state of the art and expected near-future developments in HPC hardware and applications, and identifies unaddressed scheduling challenges that they will introduce. These challenges include application diversity and issues with workflow scheduling or the scheduling of I/O resources to support applications. Next, a cloud-inspired HPC scheduling model is presented that can accommodate application diversity, takes advantage of malleable applications, and enables short wait times for applications. Finally, to support ongoing scheduling research, an open source scheduling simulation framework is proposed that allows new scheduling algorithms to be implemented and evaluated in a production scheduler using workloads modeled on those of a real system. The thesis concludes with the presentation of a workflow scheduling algorithm to minimize workflows’ turnaround time without over-allocating resources.

Subject headings

NATURVETENSKAP  -- Data- och informationsvetenskap -- Datavetenskap (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences -- Computer Sciences (hsv//eng)

Keyword

High Performance Computing
HPC
supercomputing
scheduling
workflows
workloads
exascale
business data processing
administrativ databehandling

Publication and Content Type

vet (subject category)
dok (subject category)

Find in a library

To the university's database

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Search outside SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Close

Copy and save the link in order to return to this view