A Hybrid MPI+PGAS Approach to Improve Strong Scalability Limits of Finite Element Solvers

Jansson, Niclas, 1983- (KTH, Parallelldatorcentrum, PDC) (author)

Article/chapter, English, 2020

Publisher, publication year, extent ...

Institute of Electrical and Electronics Engineers (IEEE), 2020
print (rdacarrier)

Identifiers

LIBRIS ID: oai:DiVA.org:kth-278710
URI: https://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-278710
DOI: https://doi.org/10.1109/CLUSTER49012.2020.00041

Additional language information

Language: English
Summary in: English

Included in sub-database

SwePub

Classification

Subject category: ref (swepub-contenttype)
Subject category: kon (swepub-publicationtype)

Notes

QC 20200928
Current finite element codes scale reasonably well as long as each core has a sufficient amount of local work to balance communication costs. However, achieving efficient performance at exascale will require unreasonably large problem sizes, in particular for low-order methods, where the small amount of work per element is already a limiting factor on current post-petascale machines. Key bottlenecks for these methods are sparse matrix assembly, where communication latency starts to limit performance as the number of cores increases, and linear solvers, where efficient overlapping is necessary to amortize the communication and synchronization costs of sparse matrix-vector multiplication and dot products. We present our work on improving the strong scalability limits of general, message-passing-based low-order finite element solvers. Using the lightweight one-sided communication offered by partitioned global address space (PGAS) languages, we demonstrate that performance-critical, latency-sensitive sparse matrix assembly can achieve almost an order of magnitude better scalability. Linear solvers are also addressed via a signaling put algorithm for low-cost point-to-point synchronization, achieving performance similar to that of message-passing-based linear solvers. We introduce a new hybrid MPI+PGAS implementation of the open source general finite element framework FEniCS, replacing the linear algebra backend with a new library written in Unified Parallel C (UPC). A detailed description of the implementation and the hybrid interface to FEniCS is given, and the feasibility of the approach is demonstrated via a performance study of the hybrid implementation on Cray XC40 machines.

Subject headings and genre terms

NATURVETENSKAP Matematik Beräkningsmatematik hsv//swe
NATURAL SCIENCES Mathematics Computational Mathematics hsv//eng
NATURVETENSKAP Data- och informationsvetenskap Datavetenskap hsv//swe
NATURAL SCIENCES Computer and Information Sciences Computer Sciences hsv//eng

Added entries (persons, institutions, conferences, titles ...)

KTH, Parallelldatorcentrum, PDC (creator_code:org_t)

Related titles

In: Proceedings - IEEE International Conference on Cluster Computing, ICCC: Institute of Electrical and Electronics Engineers (IEEE), pp. 303-313
