Various GPU-related and some general fixes.
- Recombine KernelWrapper APIs of CPU and GPU JIT
- Clean up JIT module
- Fix kernel constraints analysis
- Fix handling of slices with negative start index
- Fix sparse iteration spaces on GPUs
- Reintroduce GPU periodicity module
- Extend GPU test cases
- Reintroduce kwargs for create_kernel
- Restrict parsing of structured data types to aligned types
- Expose GPU block size selection through
CreateKernelConfig
Edited by Frederik Hennig