Optimal kernel design for finite element numerical integration on GPUs
Authors:
- Krzysztof Banaś,
- Filip Krużel,
- Jan Bielański
Abstract
The article presents the design and optimization of GPU kernels for numerical integration, as it is applied in the standard form in finite element codes. The optimization process employs auto-tuning, with the main emphasis on the placement of variables in shared memory or registers. OpenCL and first order FEM approximation are selected for code design, but the techniques are also applicable to CUDA programming model and other types of finite element discretizations (including discontinuous Galerkin and isogeometric). The auto-tuning optimization is performed for four example graphics processors and the obtained results are discussed.
- Record ID
- CUTd6bf3f368ad2481abf4e86005dbdd5b8
- Publication categories
- ;
- Author
- Journal series
- Computing in Science & Engineering, ISSN 1521-9615, e-ISSN 1558-366X
- Issue year
- 2020
- Vol
- 22
- No
- 6
- Pages
- 61-74
- Other elements of collation
- tab.; wykr.; Bibliografia (na s.) - 73-74; Bibliografia (liczba pozycji) - 21; Oznaczenie streszczenia - Abstr.; Data udostępnienia on-line - 2019-09-18; Numeracja w czasopiśmie - Vol. 22, Iss. 6
- Keywords in English
- finite element analysis, graphics processing units, jacobian matrices, shape, approximation algorithms, optimization, computational modeling
- DOI
- DOI:10.1109/MCSE.2019.2940656 Opening in a new tab
- URL
- https://ieeexplore.ieee.org/document/8843948 Opening in a new tab
- Language
- eng (en) English
- Score (nominal)
- 100
- Additional fields
- Indeksowana w: Web of Science, Scopus
- Uniform Resource Identifier
- https://cris.pk.edu.pl/info/article/CUTd6bf3f368ad2481abf4e86005dbdd5b8/
- URN
urn:pkr-prod:CUTd6bf3f368ad2481abf4e86005dbdd5b8
* presented citation count is obtained through Internet information analysis, and it is close to the number calculated by the Publish or PerishOpening in a new tab system.