src/cuda/GPUField.h · 319909f0bdf9d38296d194eaa52ccd92c825a29f · itischler / waLBerla

Failed to fetch fork details. Try again later.

New GPU communication scheme with GPU kernels for packing · 319909f0

Martin Bauer authored 6 years ago

Features:
   - uses generated pack infos for packing & unpacking directly on GPU
   - can directly send GPU buffers if cuda-enabled MPI is available,
     otherwise the packed buffers are transfered to CPU first
   - communication hiding with cuda streams: communication can be run
     asynchronously - especially useful when compute kernel is also
     split up into inner and outer part

- added RAII classes for CUDA streams and events
- equivalence test that checks if generated CPU and GPU (overlapped)
  versions are computing same result as normal waLBerla LBM kernel

319909f0

Forked from waLBerla / waLBerla

Source project has a limited visibility.

Admin message