Efficient High-Order Discontinuous Galerkin Finite Elements with Matrix-Free Implementations
Martin Kronbichler () and
Momme Allalen
Additional contact information
Martin Kronbichler: Technical University of Munich
Momme Allalen: Leibniz-Rechenzentrum der Bayerischen Akademie der Wissenschaften
A chapter in Advances and New Trends in Environmental Informatics, 2018, pp 89-110 from Springer
Abstract:
Abstract This work presents high-order discontinuous Galerkin finite element kernels optimized for node-level performance on a series of Intel architectures ranging from Sandy Bridge to Skylake. The kernels implement matrix-free evaluation of integrals with sum factorization techniques. In order to increase performance and thus to help to achieve higher energy efficiency, this work proposes an element-based shared-memory parallelization option and compares it to a well-established shared-memory parallelization with global face data. The new algorithm is supported by the relevant metrics in terms of arithmetics and memory transfer. On a single node with $$2\times 24$$ 2 × 24 cores of Intel Skylake Scalable, we report more than 1,200 GFLOPs/s in double precision for the full operator evaluation and up to 175 GB/s of memory throughput. Finally, we also show that merging the more arithmetically heavy operator evaluation with vector operations in application code allows to more than double efficiency on the latest hardware both with respect to energy as well as regarding time to solution.
Keywords: High-order discontinuous Galerkin method; Sum factorization; Matrix-free method; Shared-memory parallelization; Energy efficiency by software optimization; Merged vector operations (search for similar items in EconPapers)
Date: 2018
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:prochp:978-3-319-99654-7_7
Ordering information: This item can be ordered from
http://www.springer.com/9783319996547
DOI: 10.1007/978-3-319-99654-7_7
Access Statistics for this chapter
More chapters in Progress in IS from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().