Fast finite difference Poisson solvers on heterogeneous architectures

Valero-Lara, P., Pinelli, A. & Prieto-Matias, M. (2014). Fast finite difference Poisson solvers on heterogeneous architectures. Computer Physics Communications Package, 185(4), doi: 10.1016/j.cpc.2013.12.026

[img]
Preview
Text - Accepted Version
Available under License : See the attached licence file.

Download (211kB) | Preview
[img]
Preview
Text (Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International Licence) - Other
Download (201kB) | Preview

Abstract

In this paper we propose and evaluate a set of new strategies for the solution of three dimensional separable elliptic problems on CPU–GPU platforms. The numerical solution of the system of linear equations arising when discretizing those operators often represents the most time consuming part of larger simulation codes tackling a variety of physical situations. Incompressible fluid flows, electromagnetic problems, heat transfer and solid mechanic simulations are just a few examples of application areas that require efficient solution strategies for this class of problems. GPU computing has emerged as an attractive alternative to conventional CPUs for many scientific applications. High speedups over CPU implementations have been reported and this trend is expected to continue in the future with improved programming support and tighter CPU–GPU integration. These speedups by no means imply that CPU performance is no longer critical. The conventional CPU-control–GPU-compute pattern used in many applications wastes much of CPU’s computational power. Our proposed parallel implementation of a classical cyclic reduction algorithm to tackle the large linear systems arising from the discretized form of the elliptic problem at hand, schedules computing on both the GPU and the CPUs in a cooperative way. The experimental result demonstrates the effectiveness of this approach.

Item Type: Article
Additional Information: © 2014, Elsevier. Licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International http://creativecommons.org/licenses/by-nc-nd/4.0/
Uncontrolled Keywords: Fast finite difference Poisson solvers; Parallel computing; CPU–GPU heterogeneous architectures
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Q Science > QC Physics
Divisions: School of Engineering & Mathematical Sciences > Engineering
URI: http://openaccess.city.ac.uk/id/eprint/6900

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year

View more statistics