Skip to content
Snippets Groups Projects
Commit 319909f0 authored by Martin Bauer's avatar Martin Bauer
Browse files

New GPU communication scheme with GPU kernels for packing

Features:
   - uses generated pack infos for packing & unpacking directly on GPU
   - can directly send GPU buffers if cuda-enabled MPI is available,
     otherwise the packed buffers are transfered to CPU first
   - communication hiding with cuda streams: communication can be run
     asynchronously - especially useful when compute kernel is also
     split up into inner and outer part

- added RAII classes for CUDA streams and events
- equivalence test that checks if generated CPU and GPU (overlapped)
  versions are computing same result as normal waLBerla LBM kernel
parent b3213d8a
Showing with 999 additions and 18 deletions
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment