Commits · 020743a1a0387db068fb87b2660129133f4d960b · itischler / waLBerla

Jan 22, 2019

Precompute x and f allocation size of GPUField · 020743a1
Martin Bauer authored 6 years ago

020743a1

New GPU communication scheme with GPU kernels for packing · 319909f0

Martin Bauer authored 6 years ago

Features:
   - uses generated pack infos for packing & unpacking directly on GPU
   - can directly send GPU buffers if cuda-enabled MPI is available,
     otherwise the packed buffers are transfered to CPU first
   - communication hiding with cuda streams: communication can be run
     asynchronously - especially useful when compute kernel is also
     split up into inner and outer part

- added RAII classes for CUDA streams and events
- equivalence test that checks if generated CPU and GPU (overlapped)
  versions are computing same result as normal waLBerla LBM kernel

319909f0

Sep 26, 2017
- Field data getters used by code generation · 733e2ad1
  Martin Bauer authored 7 years ago
  
  733e2ad1
Aug 02, 2017
- Python export for GPUFields and interface to pycuda · ba5733cc
  Martin Bauer authored 7 years ago
  
  ba5733cc
- CUDA communication that does not rely on cuda aware MPI · dd28a536
  Paulo Carvalho authored 7 years ago and Martin Bauer committed 7 years ago
  
  dd28a536
- CUDA support · 6fc7b559
  Martin Bauer authored 7 years ago
  
  6fc7b559

Admin message