E-Thesis 405 views 1173 downloads
Solving Hard Problems at Scale using Massively Parallel Manycore Processors: An Investigation of GPGPU Acceleration Techniques / FILIPPOS PANTEKIS
Swansea University Author: FILIPPOS PANTEKIS
-
PDF | E-Thesis – open access
Copyright: The Author, Filippos Pantekis, 2025 Distributed under the terms of a Creative Commons Attribution 4.0 License (CC BY 4.0)
Download (9.62MB)
DOI (Published version): 10.23889/SUThesis.69053
Abstract
Serial (i.e., non-parallel) algorithms have historically been superseded by parallel equivalents which keep up with the evolution of CPUs, specifically, from single-core to the modern-day multi-core. This is a non-trivial transition, typically involving complex analysis and adjustment of code to expl...
| Published: |
Swansea University, Wales, UK
2025
|
|---|---|
| Institution: | Swansea University |
| Degree level: | Doctoral |
| Degree name: | Ph.D |
| Supervisor: | O’Reilly, L.; James, P.; and Moller, F. |
| URI: | https://cronfa.swan.ac.uk/Record/cronfa69053 |
| first_indexed |
2025-03-06T16:48:50Z |
|---|---|
| last_indexed |
2025-03-07T05:49:40Z |
| id |
cronfa69053 |
| recordtype |
RisThesis |
| fullrecord |
<?xml version="1.0"?><rfc1807><datestamp>2025-03-06T15:06:59.4484763</datestamp><bib-version>v2</bib-version><id>69053</id><entry>2025-03-06</entry><title>Solving Hard Problems at Scale using Massively Parallel Manycore Processors: An Investigation of GPGPU Acceleration Techniques</title><swanseaauthors><author><sid>e34468d15f5e269a81ea8c7f3fa99c2d</sid><firstname>FILIPPOS</firstname><surname>PANTEKIS</surname><name>FILIPPOS PANTEKIS</name><active>true</active><ethesisStudent>false</ethesisStudent></author></swanseaauthors><date>2025-03-06</date><abstract>Serial (i.e., non-parallel) algorithms have historically been superseded by parallel equivalents which keep up with the evolution of CPUs, specifically, from single-core to the modern-day multi-core. This is a non-trivial transition, typically involving complex analysis and adjustment of code to exploit the architecture. Furthermore, the need for such development has increased due to the emergence and widespread availability of massively parallel manycore (co)processors over recent years that have offered access to increased computational performance relative to conventional processors (CPUs). However, this has come with the cost of developing bespoke algorithms which exploit the specialities and requirements of such hardware.Graphics Processing Units (GPUs) are a prime example of commercially available massively parallel co-processors that have been shown to offer significant performance gains when approached in an appropriate manner. At the same time, a plethora of ‘hard’ constraint satisfaction problems exist which, when approached carefully, benefit from the computational power of these devices, such as the Boolean Satisfiability (SAT) and the N-Queens problems.In this work, we explore the applicability of current GPU technology to the SAT andN-Queens problems. We present our design of a hybrid solver for SAT, which utilises a fast implementation of a scalable, loosely coupled GPU-based checker. Furthermore, we present a fully GPU-based, and fully scalable N-Queens solver that is state-of- the-art, built around our DoubleSweep-Light algorithm, which surpasses results of other solvers. Beyond algorithms and approaches for these specific problems, our exploration yields lessons and identifies general optimisations that can be applied to a broad range of problems, both for memory- and compute-bound kernels.</abstract><type>E-Thesis</type><journal/><volume/><journalNumber/><paginationStart/><paginationEnd/><publisher/><placeOfPublication>Swansea University, Wales, UK</placeOfPublication><isbnPrint/><isbnElectronic/><issnPrint/><issnElectronic/><keywords>High Performance Computing, GPGPUs, Optimisation</keywords><publishedDay>7</publishedDay><publishedMonth>2</publishedMonth><publishedYear>2025</publishedYear><publishedDate>2025-02-07</publishedDate><doi>10.23889/SUThesis.69053</doi><url/><notes>A selection of content is redacted or is partially redacted from this thesis to protect sensitive and personal information.</notes><college>COLLEGE NANME</college><CollegeCode>COLLEGE CODE</CollegeCode><institution>Swansea University</institution><supervisor>O’Reilly, L.; James, P.; and Moller, F.</supervisor><degreelevel>Doctoral</degreelevel><degreename>Ph.D</degreename><degreesponsorsfunders>EPSRC</degreesponsorsfunders><apcterm/><funders>EPSRC</funders><projectreference/><lastEdited>2025-03-06T15:06:59.4484763</lastEdited><Created>2025-03-06T14:50:03.3219260</Created><path><level id="1">Faculty of Science and Engineering</level><level id="2">School of Mathematics and Computer Science - Computer Science</level></path><authors><author><firstname>FILIPPOS</firstname><surname>PANTEKIS</surname><order>1</order></author></authors><documents><document><filename>69053__33749__f2af770661fe42b6839343e61e1082ae.pdf</filename><originalFilename>2025_Pantekis_F.final.69053.pdf</originalFilename><uploaded>2025-03-06T15:02:04.2669522</uploaded><type>Output</type><contentLength>10086807</contentLength><contentType>application/pdf</contentType><version>E-Thesis – open access</version><cronfaStatus>true</cronfaStatus><documentNotes>Copyright: The Author, Filippos Pantekis, 2025
Distributed under the terms of a Creative Commons Attribution 4.0 License (CC BY 4.0)</documentNotes><copyrightCorrect>true</copyrightCorrect><language>eng</language><licence>https://creativecommons.org/licenses/by/4.0/</licence></document></documents><OutputDurs/></rfc1807> |
| spelling |
2025-03-06T15:06:59.4484763 v2 69053 2025-03-06 Solving Hard Problems at Scale using Massively Parallel Manycore Processors: An Investigation of GPGPU Acceleration Techniques e34468d15f5e269a81ea8c7f3fa99c2d FILIPPOS PANTEKIS FILIPPOS PANTEKIS true false 2025-03-06 Serial (i.e., non-parallel) algorithms have historically been superseded by parallel equivalents which keep up with the evolution of CPUs, specifically, from single-core to the modern-day multi-core. This is a non-trivial transition, typically involving complex analysis and adjustment of code to exploit the architecture. Furthermore, the need for such development has increased due to the emergence and widespread availability of massively parallel manycore (co)processors over recent years that have offered access to increased computational performance relative to conventional processors (CPUs). However, this has come with the cost of developing bespoke algorithms which exploit the specialities and requirements of such hardware.Graphics Processing Units (GPUs) are a prime example of commercially available massively parallel co-processors that have been shown to offer significant performance gains when approached in an appropriate manner. At the same time, a plethora of ‘hard’ constraint satisfaction problems exist which, when approached carefully, benefit from the computational power of these devices, such as the Boolean Satisfiability (SAT) and the N-Queens problems.In this work, we explore the applicability of current GPU technology to the SAT andN-Queens problems. We present our design of a hybrid solver for SAT, which utilises a fast implementation of a scalable, loosely coupled GPU-based checker. Furthermore, we present a fully GPU-based, and fully scalable N-Queens solver that is state-of- the-art, built around our DoubleSweep-Light algorithm, which surpasses results of other solvers. Beyond algorithms and approaches for these specific problems, our exploration yields lessons and identifies general optimisations that can be applied to a broad range of problems, both for memory- and compute-bound kernels. E-Thesis Swansea University, Wales, UK High Performance Computing, GPGPUs, Optimisation 7 2 2025 2025-02-07 10.23889/SUThesis.69053 A selection of content is redacted or is partially redacted from this thesis to protect sensitive and personal information. COLLEGE NANME COLLEGE CODE Swansea University O’Reilly, L.; James, P.; and Moller, F. Doctoral Ph.D EPSRC EPSRC 2025-03-06T15:06:59.4484763 2025-03-06T14:50:03.3219260 Faculty of Science and Engineering School of Mathematics and Computer Science - Computer Science FILIPPOS PANTEKIS 1 69053__33749__f2af770661fe42b6839343e61e1082ae.pdf 2025_Pantekis_F.final.69053.pdf 2025-03-06T15:02:04.2669522 Output 10086807 application/pdf E-Thesis – open access true Copyright: The Author, Filippos Pantekis, 2025 Distributed under the terms of a Creative Commons Attribution 4.0 License (CC BY 4.0) true eng https://creativecommons.org/licenses/by/4.0/ |
| title |
Solving Hard Problems at Scale using Massively Parallel Manycore Processors: An Investigation of GPGPU Acceleration Techniques |
| spellingShingle |
Solving Hard Problems at Scale using Massively Parallel Manycore Processors: An Investigation of GPGPU Acceleration Techniques FILIPPOS PANTEKIS |
| title_short |
Solving Hard Problems at Scale using Massively Parallel Manycore Processors: An Investigation of GPGPU Acceleration Techniques |
| title_full |
Solving Hard Problems at Scale using Massively Parallel Manycore Processors: An Investigation of GPGPU Acceleration Techniques |
| title_fullStr |
Solving Hard Problems at Scale using Massively Parallel Manycore Processors: An Investigation of GPGPU Acceleration Techniques |
| title_full_unstemmed |
Solving Hard Problems at Scale using Massively Parallel Manycore Processors: An Investigation of GPGPU Acceleration Techniques |
| title_sort |
Solving Hard Problems at Scale using Massively Parallel Manycore Processors: An Investigation of GPGPU Acceleration Techniques |
| author_id_str_mv |
e34468d15f5e269a81ea8c7f3fa99c2d |
| author_id_fullname_str_mv |
e34468d15f5e269a81ea8c7f3fa99c2d_***_FILIPPOS PANTEKIS |
| author |
FILIPPOS PANTEKIS |
| author2 |
FILIPPOS PANTEKIS |
| format |
E-Thesis |
| publishDate |
2025 |
| institution |
Swansea University |
| doi_str_mv |
10.23889/SUThesis.69053 |
| college_str |
Faculty of Science and Engineering |
| hierarchytype |
|
| hierarchy_top_id |
facultyofscienceandengineering |
| hierarchy_top_title |
Faculty of Science and Engineering |
| hierarchy_parent_id |
facultyofscienceandengineering |
| hierarchy_parent_title |
Faculty of Science and Engineering |
| department_str |
School of Mathematics and Computer Science - Computer Science{{{_:::_}}}Faculty of Science and Engineering{{{_:::_}}}School of Mathematics and Computer Science - Computer Science |
| document_store_str |
1 |
| active_str |
0 |
| description |
Serial (i.e., non-parallel) algorithms have historically been superseded by parallel equivalents which keep up with the evolution of CPUs, specifically, from single-core to the modern-day multi-core. This is a non-trivial transition, typically involving complex analysis and adjustment of code to exploit the architecture. Furthermore, the need for such development has increased due to the emergence and widespread availability of massively parallel manycore (co)processors over recent years that have offered access to increased computational performance relative to conventional processors (CPUs). However, this has come with the cost of developing bespoke algorithms which exploit the specialities and requirements of such hardware.Graphics Processing Units (GPUs) are a prime example of commercially available massively parallel co-processors that have been shown to offer significant performance gains when approached in an appropriate manner. At the same time, a plethora of ‘hard’ constraint satisfaction problems exist which, when approached carefully, benefit from the computational power of these devices, such as the Boolean Satisfiability (SAT) and the N-Queens problems.In this work, we explore the applicability of current GPU technology to the SAT andN-Queens problems. We present our design of a hybrid solver for SAT, which utilises a fast implementation of a scalable, loosely coupled GPU-based checker. Furthermore, we present a fully GPU-based, and fully scalable N-Queens solver that is state-of- the-art, built around our DoubleSweep-Light algorithm, which surpasses results of other solvers. Beyond algorithms and approaches for these specific problems, our exploration yields lessons and identifies general optimisations that can be applied to a broad range of problems, both for memory- and compute-bound kernels. |
| published_date |
2025-02-07T05:27:10Z |
| _version_ |
1851097801478373376 |
| score |
11.089386 |

