Skip to content
Snippets Groups Projects
  1. Dec 08, 2017
    • João Victor Tozatti Risso's avatar
      GPUPackInfo: add asynchronous (un)packing capabilities · 6bfe8c59
      João Victor Tozatti Risso authored
      
      Changes introduced in this commit are the following:
      
      - CUDA streams: Add support for asynchronous (un)packing operations using CUDA
        streams in cuda::communication::GPUPackInfo. Through asynchronous operations
        it is possible to overlap GPU computation and MPI communication in simulations
        (e.g. LBM simulations). Asynchronous copies in CUDA require pinned memory on
        the host, and for that purpose a staging buffer is introduced (i.e.
        cuda::communication::PinnedMemoryBuffer) in the cuda module, which is used to
        stage data between the GPU and the MPI buffers.
      
      - zyxf layout: Add zyxf field layout support in GPUPackInfo through extensions
        of the functions in cuda::GPUCopy.
      
      - Extended GPUPackInfo test: Add stream and zyxf layout tests to the
        GPUPackInfoTest to test the proposed implementation.
      
      - Extended Kernel: add CUDA stream and shared memory configuration support in
        cuda::Kernel class.
      
      Signed-off-by: default avatarJoão Victor Tozatti Risso <joaovictortr@protonmail.com>
      6bfe8c59
  2. Nov 29, 2017
  3. Nov 27, 2017
  4. Nov 24, 2017
  5. Nov 17, 2017
  6. Nov 14, 2017
  7. Nov 10, 2017
  8. Nov 09, 2017
  9. Nov 07, 2017
  10. Oct 24, 2017
  11. Oct 20, 2017
  12. Oct 05, 2017
  13. Oct 04, 2017
  14. Oct 02, 2017
  15. Sep 29, 2017
  16. Sep 20, 2017
  17. Aug 23, 2017
  18. Aug 16, 2017
  19. Aug 07, 2017
  20. Aug 03, 2017
  21. Aug 02, 2017
  22. Aug 01, 2017
  23. Jul 25, 2017
  24. Jul 24, 2017