Vectorized implementation of the FEM numerical integration algorithm on a modern CPU
Authors:
- Filip Krużel
Abstract
The main aim of this study is to answer the question: how to effectively implement the creation of the finite element stiffness matrix in parallel simulations of Finite Element Method using the full advantages of modern multiprocessors such as parallelization combined with vectorization. In this work, an efficient method for implementation of a Finite Element Method numerical integration algorithm on a modern Intel Haswell CPU architecture was developed. This algorithm was chosen, due to its non-trivial structure and the fact that its optimization is often omitted in research in favour of accelerating the other phases of FEM. Tests included two types of tasks to solve, with the use of two types of approximation and two types of finite elements. During this study, several methods for the implementation of the chosen algorithm was investigated, including Intel Cilk Plus, Intel Intrinsics and other computing techniques. Results were compared with an older Sandy Bridge architecture, showing a significant impact of vectorization and large cache on the performance of the modern CPUs. Our research gives suggestions for choosing the optimal design of algorithms and effectively using all of the features of the modern CPUs.
- Record ID
- CUT30e3914a064f4a2da7088e4ffc65adf2
- Publication categories
- ; ;
- Author
- Pages
- 414-420
- Other elements of collation
- tab.; Bibliografia (na s.) - 420; Oznaczenie streszczenia - Abstr.
- Substantive notes
- Punktacja MNiSW/MEiN (rozdział) - 5
- Book
- Iacono Mauro, Mauro Iacono Palmieri Francesco, Francesco Palmieri Gribaudo Marco Marco Gribaudo [et al.] (eds.): Proceedings of the 33rd International ECMS Conference on Modelling and Simulation : ECMS 2019, June 11th - June 14th, 2019, Caserta, Italy, European Conference for Modelling and Simulation, no. Vol. 33, Iss. 1, 2019, [S.l.], European Council for Modelling and Simulation, ISBN 978-3-937436-65-4
- Keywords in English
- CPU, optimization, parallelization, vectorization
- DOI
- DOI:10.7148/2019-0414 Opening in a new tab
- URL
- http://www.scs-europe.net/dlib/2019/2019-0414.htm Opening in a new tab
- Language
- eng (en) English
- License
- Score (nominal)
- 70
- Additional fields
- Indeksowana w: Web of Science, Scopus, CORE
- Uniform Resource Identifier
- https://cris.pk.edu.pl/info/article/CUT30e3914a064f4a2da7088e4ffc65adf2/
- URN
urn:pkr-prod:CUT30e3914a064f4a2da7088e4ffc65adf2
* presented citation count is obtained through Internet information analysis, and it is close to the number calculated by the Publish or PerishOpening in a new tab system.