Extend Support for CUDA and HIP kernel invocations
This MR extends the support for CUDA and HIP kernel invocations and API reflection in pystencils-sfg, reflecting the concepts introduced in pystencils!449 (merged).
- Move GPU-related stuff into new composer mix-in
SfgGpuComposer
- Replace
cuda_invoke()
bygpu_invoke()
- Implement three separate signatures of
gpu_invoke()
, depending on the kernel indexing scheme selected by the user - Replace
extensions.gpu
bylang.gpu
- Add explanations on GPU kernels to user manual
- Fix some errors and clarify some interface descriptions in the user manual
This MR will already be using Target.HIP
and therefore will have to wait for pystencils!458 (merged).
Edited by Frederik Hennig
Merge request reports
Activity
Filter activity
changed milestone to %Version 0.1
added 1 commit
- 6c120a84 - Add CUDA and HIP API provider protocols. Factor out GPU stuff into separate Gpu Composer.
added 2 commits
requested review from @ob28imeq
assigned to @da15siwa
Please register or sign in to reply