Multithreaded Reproducible Banded Matrix-Vector Multiplication
Tao Tang,
Haijun Qi (),
Qingfeng Lu and
Hao Jiang
Additional contact information
Tao Tang: College of Computer Science and Technology, National University of Defense Technology, Changsha 410073, China
Haijun Qi: College of Computer Science and Technology, National University of Defense Technology, Changsha 410073, China
Qingfeng Lu: College of Computer Science and Technology, National University of Defense Technology, Changsha 410073, China
Hao Jiang: College of Computer Science and Technology, National University of Defense Technology, Changsha 410073, China
Mathematics, 2024, vol. 12, issue 3, 1-12
Abstract:
Reproducibility refers to getting bitwise identical floating point results from multiple runs of the same program, is an important basis for debugging or correctness checking in many codes. However the round-off error and non-associativity of floating point makes attaining reproducibility a challenge in large-scale, long-term parallel computing or solving ill conditioned problems. The dgbmv performs general banded matrix-vector multiplication for double precision, is the most basic Level-2 operation in BLAS. First, we designed a reproducible algorithm for banded matrix-vector multiplication repro _ dgbmv based on the technique of error-free transformation. Then the error of the algorithm is analyzed. Second, the algorithm is parallelized into repro _ dgbmv _ thread on ARM and x86 platforms. The numerical test results verify that repro _ dgbmv _ thread is reproducible and has higher accuracy than ordinary dgbmv . In numerical experiments on ARM platform, as the number of threads increases from 1 to 8, the run time of this algorithm is reduced by 5.2–7 times, while the run time of multithreaded dgbmv is only reduced by 2.2–3.8 times. In numerical experiments on x86 platform, as the number of threads increases from 1 to 15, the run time of this algorithm is reduced by 7.7–10.6 times, while the run time of multithreaded dgbmv is only reduced by 4.2–6.8 times.
Keywords: reproducibility; multithreading; banded matrix; matrix-vector multiplication (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2024
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2227-7390/12/3/422/pdf (application/pdf)
https://www.mdpi.com/2227-7390/12/3/422/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:12:y:2024:i:3:p:422-:d:1328019
Access Statistics for this article
Mathematics is currently edited by Ms. Emma He
More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().