SwePub
Sök i LIBRIS databas

  Extended search

onr:"swepub:oai:DiVA.org:umu-145462"
 

Search: onr:"swepub:oai:DiVA.org:umu-145462" > On solving separabl...

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

On solving separable block tridiagonal linear systems using a GPU implementation of radix-4 PSCR method

Myllykoski, Mirko, 1988- (author)
Umeå universitet,Institutionen för datavetenskap,Department of Mathematical Information Technology, University of Jyväskylä
Rossi, Tuomo (author)
Department of Mathematical Information Technology, University of Jyväskylä
Toivanen, Jari (author)
Department of Mathematical Information Technology, University of Jyväskylä; Department of Aeronautics & Astronautics, Stanford University
 (creator_code:org_t)
Elsevier, 2018
2018
English.
In: Journal of Parallel and Distributed Computing. - : Elsevier. - 0743-7315 .- 1096-0848. ; 115, s. 56-66
  • Journal article (peer-reviewed)
Abstract Subject headings
Close  
  • Partial solution variant of the cyclic reduction (PSCR) method is a direct solver that can be applied to certain types of separable block tridiagonal linear systems. Such linear systems arise, e.g., from the Poisson and the Helmholtz equations discretized with bilinear finite-elements. Furthermore, the separability of the linear system entails that the discretization domain has to be rectangular and the discretization mesh orthogonal. A generalized graphics processing unit (GPU) implementation of the PSCR method is presented. The numerical results indicate up to 24-fold speedups when compared to an equivalent CPU implementation that utilizes a single CPU core. Attained floating point performance is analyzed using roofline performance analysis model and the resulting models show that the attained floating point performance is mainly limited by the off-chip memory bandwidth and the effectiveness of a tridiagonal solver used to solve arising tridiagonal subproblems. The performance is accelerated using off-line autotuning techniques.

Subject headings

NATURVETENSKAP  -- Data- och informationsvetenskap -- Datavetenskap (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences -- Computer Sciences (hsv//eng)
NATURVETENSKAP  -- Data- och informationsvetenskap -- Programvaruteknik (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences -- Software Engineering (hsv//eng)

Keyword

Fast direct solver
GPU computing
Partial solution technique
PSCR method
Roofline model
Separable block tridiagonal linear system
business data processing
administrativ databehandling

Publication and Content Type

ref (subject category)
art (subject category)

Find in a library

To the university's database

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Find more in SwePub

By the author/editor
Myllykoski, Mirk ...
Rossi, Tuomo
Toivanen, Jari
About the subject
NATURAL SCIENCES
NATURAL SCIENCES
and Computer and Inf ...
and Computer Science ...
NATURAL SCIENCES
NATURAL SCIENCES
and Computer and Inf ...
and Software Enginee ...
Articles in the publication
Journal of Paral ...
By the university
Umeå University

Search outside SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Close

Copy and save the link in order to return to this view