EconPapers    
Economics at your fingertips  
 

A DSMC program that supports different heterogeneous systems to simulate near-continuum gases

Chenchen Zhang (), MinHua Wen (), Bin Zhang, James Lin () and Hong Liu ()
Additional contact information
Chenchen Zhang: School of Aeronautics and Astronautics, Shanghai Jiao Tong University, 800 Dong Chuan Road, Shanghai 200240, P. R. China
MinHua Wen: Center for High Performance Computing, Shanghai Jiao Tong University, 800 Dong Chuan Road, Shanghai 200240, P. R. China
Bin Zhang: School of Aeronautics and Astronautics, Shanghai Jiao Tong University, 800 Dong Chuan Road, Shanghai 200240, P. R. China3Sichuan Research Institute, Shanghai Jiao Tong University, Chengdu 610213, P. R. China
James Lin: Center for High Performance Computing, Shanghai Jiao Tong University, 800 Dong Chuan Road, Shanghai 200240, P. R. China
Hong Liu: School of Aeronautics and Astronautics, Shanghai Jiao Tong University, 800 Dong Chuan Road, Shanghai 200240, P. R. China

International Journal of Modern Physics C (IJMPC), 2022, vol. 33, issue 11, 1-18

Abstract: It has become a trend for Direct Simulation Monte Carlo (DSMC) to simulate near-continuum gas, which makes the problem of a large amount of computation more prominent in turn. Moreover, heterogeneous architecture is becoming mainstream for modern supercomputers. Speeding DSMC on heterogeneous systems has been a must consideration for DSMC researchers. However, little research on the DSMC program that supports different heterogeneous systems is available. In this paper, we developed a DSMC program supporting heterogeneous systems through OpenMP4.5, and our codes can execute on many accelerators in theory such as Nvidia GPUs, AMD GPUs and Intel Many Integrated Core (MIC) products. The implementation of our heterogeneous DSMC is elaborated first, and then we explain the algorithm proposed to generate random numbers on accelerators. In addition, the optimization methods, including load-balance and granularity refinement strategy, branch elimination, and particles’ removal and entering, are described in detail, respectively. A shock-bubble interaction (SBI) case under Knudsen number of 1.66789×10−3 is utilized to validate the accuracy of our DSMC and do the performance analysis. An Nvidia RTX 2080 serves as a device, and the results show that our program can not only gain accurate flowfield information, but also perform well under different computing loads. More specifically, the whole speedup ratio of 52∼63 can be obtained relative to a single CPU core, and for the subroutine Collision, the ratio can even reach approximately 90.

Keywords: DSMC; heterogeneous; OpenMP4.5; optimization (search for similar items in EconPapers)
Date: 2022
References: Add references at CitEc
Citations:

Downloads: (external link)
http://www.worldscientific.com/doi/abs/10.1142/S0129183122501479
Access to full text is restricted to subscribers

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:wsi:ijmpcx:v:33:y:2022:i:11:n:s0129183122501479

Ordering information: This journal article can be ordered from

DOI: 10.1142/S0129183122501479

Access Statistics for this article

International Journal of Modern Physics C (IJMPC) is currently edited by H. J. Herrmann

More articles in International Journal of Modern Physics C (IJMPC) from World Scientific Publishing Co. Pte. Ltd.
Bibliographic data for series maintained by Tai Tone Lim ().

 
Page updated 2025-03-20
Handle: RePEc:wsi:ijmpcx:v:33:y:2022:i:11:n:s0129183122501479