New GPU communication scheme with GPU kernels for packing
Features:
- uses generated pack infos for packing and unpacking directly on the GPU
- can send GPU buffers directly if CUDA-enabled MPI is available; otherwise the packed buffers are transferred to the CPU first
- communication hiding with CUDA streams: communication can run asynchronously, which is especially useful when the compute kernel is also split into an inner and an outer part (see the sketch below)
- added RAII classes for CUDA streams and events
- equivalence test that checks whether the generated CPU and GPU (overlapped) versions compute the same result as the normal waLBerla LBM kernel
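A rough usage sketch of the overlap pattern described above, based on the files added in this commit. It assumes `cuda::communication::UniformGPUScheme` is templated on the stencil and exposes `startCommunication()` / `wait()` taking a CUDA stream; `lbKernelInner` / `lbKernelOuter` are placeholders for the generated LBM kernel split into an interior and a boundary part. These names and signatures are assumptions for illustration and may not match the generated code exactly.

```cpp
#include "blockforest/StructuredBlockForest.h"
#include "cuda/communication/UniformGPUScheme.h"
#include "stencil/D3Q19.h"

#include <cuda_runtime.h>

namespace walberla {

// Placeholders (assumed names) for the generated LBM kernel split into an
// interior part that touches no ghost layers and an outer part that needs
// fresh ghost-layer data.
void lbKernelInner( IBlock * block, cudaStream_t stream );
void lbKernelOuter( IBlock * block, cudaStream_t stream );

void timeStepOverlapped( StructuredBlockForest & blocks,
                         cuda::communication::UniformGPUScheme< stencil::D3Q19 > & comm,
                         cudaStream_t innerStream, cudaStream_t outerStream )
{
   // start packing and the (possibly GPU-direct) MPI exchange asynchronously
   comm.startCommunication( outerStream );

   // overlap: interior cells do not depend on ghost-layer data
   for( auto & block : blocks )
      lbKernelInner( &block, innerStream );

   // block until the received buffers have been unpacked on outerStream
   comm.wait( outerStream );

   // outer cells can now safely read the updated ghost layers
   for( auto & block : blocks )
      lbKernelOuter( &block, outerStream );

   cudaStreamSynchronize( innerStream );
   cudaStreamSynchronize( outerStream );
}

} // namespace walberla
```

Running the inner kernel on its own stream while packing, exchange and unpacking proceed on the communication stream is what hides the MPI latency; only the outer kernel has to wait for the exchange to finish.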
Showing 14 changed files with 999 additions and 18 deletions
- src/cuda/CudaRAII.h: 85 additions, 0 deletions (see the RAII sketch after this list)
- src/cuda/ErrorChecking.h: 1 addition, 1 deletion
- src/cuda/GPUField.h: 3 additions, 0 deletions
- src/cuda/GPUField.impl.h: 20 additions, 0 deletions
- src/cuda/communication/CustomMemoryBuffer.h: 143 additions, 0 deletions
- src/cuda/communication/CustomMemoryBuffer.impl.h: 120 additions, 0 deletions
- src/cuda/communication/GPUPackInfo.h: 12 additions, 13 deletions
- src/cuda/communication/GeneratedGPUPackInfo.h: 44 additions, 0 deletions
- src/cuda/communication/UniformGPUScheme.h: 94 additions, 0 deletions
- src/cuda/communication/UniformGPUScheme.impl.h: 248 additions, 0 deletions
- tests/cuda/CMakeLists.txt: 4 additions, 0 deletions
- tests/cuda/codegen/CudaJacobiKernel.py: 7 additions, 4 deletions
- tests/cuda/codegen/EquivalenceTest.cpp: 178 additions, 0 deletions
- tests/cuda/codegen/EquivalenceTest.gen.py: 40 additions, 0 deletions
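The RAII classes added in src/cuda/CudaRAII.h tie the lifetime of a `cudaStream_t` or `cudaEvent_t` to a C++ object, so streams and events are released automatically even on early returns or exceptions. A minimal sketch of the idea follows; the class and member names are illustrative and may differ from the actual header.

```cpp
// Minimal RAII wrappers for CUDA streams and events (illustrative names).
#include <cuda_runtime.h>
#include <stdexcept>

class StreamRAII
{
public:
   StreamRAII()  { if( cudaStreamCreate( &stream_ ) != cudaSuccess ) throw std::runtime_error( "cudaStreamCreate failed" ); }
   ~StreamRAII() { cudaStreamDestroy( stream_ ); }

   // non-copyable: the wrapper owns the stream exclusively
   StreamRAII( const StreamRAII & ) = delete;
   StreamRAII & operator=( const StreamRAII & ) = delete;

   // implicit conversion so the wrapper can be passed wherever a cudaStream_t is expected
   operator cudaStream_t() const { return stream_; }

private:
   cudaStream_t stream_;
};

class EventRAII
{
public:
   EventRAII()  { if( cudaEventCreate( &event_ ) != cudaSuccess ) throw std::runtime_error( "cudaEventCreate failed" ); }
   ~EventRAII() { cudaEventDestroy( event_ ); }

   EventRAII( const EventRAII & ) = delete;
   EventRAII & operator=( const EventRAII & ) = delete;

   operator cudaEvent_t() const { return event_; }

private:
   cudaEvent_t event_;
};
```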