Heterogeneous LBM Simulation Code with LRnLA Algorithms

Vadim Levchenko; Anastasia Perepelkina

doi:10.4208/cicp.OA-2022-0055

Heterogeneous LBM Simulation Code with LRnLA Algorithms

Preview

Add to basket

Year: 2023

Author: Vadim Levchenko, Anastasia Perepelkina

Communications in Computational Physics, Vol. 33 (2023), Iss. 1 : pp. 214–244

Abstract

A design of a new heterogeneous code for LBM simulations is proposed. By heterogeneous computing we mean a collaborative computation on CPU and GPU, which is characterized by the following features: the data is distributed between CPU and GPU memory spaces taking advantage of both parallel hierarchies; the capabilities of both SIMT GPU and SIMD GPU parallelization are used for calculations; the algorithms in use efficiently conceal the CPU-GPU data exchange; the subdivision of the computing task is performed with an account for the strong points of both processing units: high performance of GPU, low latency, and advanced memory hierarchy of CPU. This code is a continuation of our work in the development of LRnLA codes for LBM. Previous LRnLA codes had good efficiency both for CPU and GPU computing, and allowed GPU simulation performed on data stored in CPU RAM without performance loss on CPU-GPU data transfer. In the new code, we use methods and instruments that can be flexibly adapted to GPU and CPU instruction sets. We present the theoretical study of the performance of the proposed code and suggest implementation techniques. The bottlenecks are identified. As a result, we conclude that larger problems can be simulated with higher efficiency in the heterogeneous system.

Submit Article

You do not have full access to this article.

Already a Subscriber? Sign in as an individual or via your institution

Journal Article Details

Publisher Name: Global Science Press

Language: English

DOI: https://doi.org/10.4208/cicp.OA-2022-0055

Communications in Computational Physics, Vol. 33 (2023), Iss. 1 : pp. 214–244

Published online: 2023-01

AMS Subject Headings: Global Science Press

Pages: 31

Keywords: LBM Roofline memory-bound GPU LRnLA.

Author Details

Vadim Levchenko

Anastasia Perepelkina

Parallel Computational Technologies

An Efficient LRnLA Algorithm and Data Structure for Manycore and Multicore Computers with Hierarchical Cache
Levchenko, Vadim | Perepelkina, Anastasia
2023
https://doi.org/10.1007/978-3-031-38864-4_3 [Citations: 0]
Fast-QSGS: A GPU accelerated program for structure generation of granular disordered media
Yang, Guang | Liu, Tong | Lu, Xukang | Wang, Moran
Computer Physics Communications, Vol. 302 (2024), Iss. P.109241
https://doi.org/10.1016/j.cpc.2024.109241 [Citations: 0]
Supercomputing

Construction of Locality-Aware Algorithms to Optimize Performance of Stencil Codes on Heterogeneous Hardware
Levchenko, Vadim | Perepelkina, Anastasia
2023
https://doi.org/10.1007/978-3-031-49435-2_11 [Citations: 0]
Performance evaluation of the LBM simulations in fluid dynamics on SX-Aurora TSUBASA vector engine
Sun, Xiangcheng | Takahashi, Keichi | Shimomura, Yoichi | Takizawa, Hiroyuki | Wang, Xian
Computer Physics Communications, Vol. 307 (2025), Iss. P.109411
https://doi.org/10.1016/j.cpc.2024.109411 [Citations: 0]
Recalibration of LBM Populations for Construction of Grid Refinement with No Interpolation
Berezin, Arseniy | Perepelkina, Anastasia | Ivanov, Anton | Levchenko, Vadim
Fluids, Vol. 8 (2023), Iss. 6 P.179
https://doi.org/10.3390/fluids8060179 [Citations: 1]
An Efficient GPU Algorithm for Lattice Boltzmann Method on Sparse Complex Geometries
Qin, Zhangrong | Lu, Xusheng | Lv, Long | Tang, Zhongxiang | Wen, Binghai
IEEE Transactions on Parallel and Distributed Systems, Vol. 36 (2025), Iss. 2 P.239
https://doi.org/10.1109/TPDS.2024.3510810 [Citations: 0]

Journals

Resources

About Us

Open Access

Journals

Resources

About Us

Open Access

Heterogeneous LBM Simulation Code with LRnLA Algorithms

Abstract

Journal Article Details

Author Details

Parallel Computational Technologies

An Efficient LRnLA Algorithm and Data Structure for Manycore and Multicore Computers with Hierarchical Cache

Fast-QSGS: A GPU accelerated program for structure generation of granular disordered media

Supercomputing

Construction of Locality-Aware Algorithms to Optimize Performance of Stencil Codes on Heterogeneous Hardware

Performance evaluation of the LBM simulations in fluid dynamics on SX-Aurora TSUBASA vector engine

Recalibration of LBM Populations for Construction of Grid Refinement with No Interpolation

An Efficient GPU Algorithm for Lattice Boltzmann Method on Sparse Complex Geometries

Heterogeneous LBM Simulation Code with LRnLA Algorithms

Abstract

Full Text

Additional Information

Journal Article Details

Author Details

Cited By

Parallel Computational Technologies

An Efficient LRnLA Algorithm and Data Structure for Manycore and Multicore Computers with Hierarchical Cache

Fast-QSGS: A GPU accelerated program for structure generation of granular disordered media

Supercomputing

Construction of Locality-Aware Algorithms to Optimize Performance of Stencil Codes on Heterogeneous Hardware

Performance evaluation of the LBM simulations in fluid dynamics on SX-Aurora TSUBASA vector engine

Recalibration of LBM Populations for Construction of Grid Refinement with No Interpolation

An Efficient GPU Algorithm for Lattice Boltzmann Method on Sparse Complex Geometries