On the Efficiency of Supernodal Factorization in Interior-Point Method Using CPU-GPU Collaboration

Ahmad, Muhammad Ovais, Senior Lecturer (author): Karlstads universitet, Institutionen för matematik och datavetenskap (from 2013), SQUAD - SOFTWARE QUALITY AND DIGITAL MODERNISATION

IEEE Computer Society Digital Library, 2020
English.
In: IEEE Access. IEEE Computer Society Digital Library. ISSN 2169-3536. Vol. 8, pp. 120892-120904

Related links: https://doi.org/10.1...; https://ieeexplore.i...; https://urn.kb.se/re...; https://doi.org/10.1...

Journal article (peer-reviewed)

Abstract

The primal-dual interior-point method (PDIPM) is the most efficient technique for solving sparse linear programming (LP) problems. Despite its efficiency, PDIPM remains a compute-intensive algorithm. Fortunately, graphics processing units (GPUs) have the potential to meet this computational demand. However, their peculiar architecture entails a positive relationship between problem density and speedup, which conversely implies a limited affinity of GPUs for sparse problems. To overcome this difficulty, the state-of-the-art hybrid (CPU-GPU) implementation of PDIPM exploits the presence of supernodes in sparse matrices during factorization. Supernodes are groups of similar columns that can be treated as dense submatrices. The factorization method used in the state-of-the-art solver performs only selected operations related to large supernodes on the GPU. This method is known to underutilize the GPU’s computational power while increasing CPU-GPU communication overhead. These shortcomings encouraged us to adapt another factorization method, which processes sets of related supernodes on the GPU, and to introduce it into the PDIPM implementation of a popular open-source solver. Our adaptation enabled the factorization method to better mitigate the effects of round-off errors accumulated over multiple iterations of PDIPM. To augment the performance gains, we also used an efficient CPU-based matrix multiplication method. When tested on a set of well-known sparse problems, the adapted solver showed average speedups of approximately 55X, 1.14X and 1.05X over the open-source solver’s original version, the state-of-the-art solver, and a highly optimized proprietary solver known as CPLEX, respectively. These results strongly indicate that our proposed hybrid approach can lead to significant performance gains for solving large sparse problems.
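
As an illustration of the supernode concept mentioned in the abstract, the following minimal Python sketch groups consecutive columns of a lower-triangular factor whose below-diagonal sparsity patterns match, so that each group could be stored and processed as a dense block. The function name `find_supernodes`, the `col_struct` input format, and the example pattern are assumptions made for illustration only; this is not the factorization method of the paper or of any solver it names.

```python
def find_supernodes(col_struct):
    """Partition columns 0..n-1 into runs of columns with nested structure.

    col_struct[j] is the sorted list of row indices of the nonzeros in
    column j of a lower-triangular factor, including the diagonal entry j.
    Column j joins the supernode of column j-1 when the pattern of column
    j-1 below its diagonal equals the full pattern of column j.
    Returns a list of (first_column, last_column) pairs.
    """
    n = len(col_struct)
    supernodes = []
    start = 0
    for j in range(1, n + 1):
        # col_struct[j - 1][1:] drops the diagonal entry of column j-1.
        if j < n and col_struct[j - 1][1:] == col_struct[j]:
            continue  # column j extends the current supernode
        supernodes.append((start, j - 1))
        start = j
    return supernodes


# Hypothetical 5x5 factor pattern: columns 0-2 share one nested structure,
# columns 3-4 share another.
example = [
    [0, 1, 2, 4],
    [1, 2, 4],
    [2, 4],
    [3, 4],
    [4],
]
supernodes = find_supernodes(example)
```

In this sketch, columns 0 to 2 form one supernode and columns 3 to 4 form another; within each group the nonzero rows coincide, which is what allows dense matrix kernels to be applied to them.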

Subject: Natural Sciences; Computer and Informa ...; Computer Science
