- 08 Jul, 2019 1 commit
-
-
Stephan Seitz authored
This enables astnodes.Nodes to have a member required_global_declarations by which they can specify a global declaration required for their usage.
-
- 26 Apr, 2019 2 commits
-
-
- 24 Apr, 2019 1 commit
-
-
Martin Bauer authored
- turned on restrict keyword by default (makes large difference on GPUs) - smarter block indexing: changing block size depending on domain size Example: previously there where (1,1,1) blocks when requested block size was (64, 1, 1) and domain size (1, 512, 512), now the block size is changed automatically to (1, 64, 1) in this case - added __lauch_bounds__ to kernels to allow better optimizations from the CUDA compiler
-
- 14 Apr, 2019 1 commit
-
-
Martin Bauer authored
- style changes marked by flake - using newest kerncraft version
-
- 05 Apr, 2019 1 commit
-
-
Martin Bauer authored
- don't use for production runs!
-
- 03 Apr, 2019 1 commit
-
-
Martin Bauer authored
-
- 01 Apr, 2019 2 commits
-
-
Martin Bauer authored
-
Martin Bauer authored
-
- 21 Mar, 2019 1 commit
-
-
Martin Bauer authored
This restructuring allows for easier separation of modules into separate repositories later. Also, now pip install with repo url can be used. The setup.py files have also been updated to correctly reference each other. Module versions are not extracted from git state
-
- 18 Mar, 2019 1 commit
-
-
Martin Bauer authored
-
- 15 Mar, 2019 2 commits
-
-
Martin Bauer authored
-
Martin Bauer authored
-
- 07 Mar, 2019 2 commits
-
-
Martin Bauer authored
-
Martin Bauer authored
-
- 26 Feb, 2019 1 commit
-
-
Martin Bauer authored
- counter-based philox RNG: counter/key is filled with cell coordinate and optional external parameters like block position and time step - works on CPU and GPU - on CPU only for non-vectorized versions - introduced more flexible "CustomCodeNode" that can inject backend-specific hand-written code
-
- 18 Feb, 2019 1 commit
-
-
Martin Bauer authored
-
- 16 Nov, 2018 1 commit
-
-
Martin Bauer authored
-> by default sympy generates "fmin" C-function for sp.Min which is wrong for integer parameters
-
- 14 Nov, 2018 1 commit
-
-
Martin Bauer authored
- small (length < 5) arrays with shape and stride information had to be memcpy'd to the GPU before every kernel call - instead of passing the information as arrays, the single elements are passed - leads to more function arguments, but simplifies GPU kernel calls -> changes in all backends required
-
- 13 Nov, 2018 1 commit
-
-
Martin Bauer authored
-
- 26 Oct, 2018 1 commit
-
-
Martin Bauer authored
-
- 25 Oct, 2018 1 commit
-
-
Martin Bauer authored
-
- 19 Oct, 2018 1 commit
-
-
Martin Bauer authored
-
- 05 Sep, 2018 2 commits
-
-
Martin Bauer authored
-
Martin Bauer authored
- constants are typed - no need any more for this parameter
-
- 06 Jul, 2018 1 commit
-
-
Martin Bauer authored
-
- 25 Jun, 2018 1 commit
-
-
Martin Bauer authored
-
- 18 May, 2018 1 commit
-
-
Martin Bauer authored
-
- 14 May, 2018 3 commits
-
-
Martin Bauer authored
-
Martin Bauer authored
-
Martin Bauer authored
-
- 13 May, 2018 1 commit
-
-
Martin Bauer authored
- support aligned load/stores - nontemporal stores - aligned memory allocation for arrays and temporary buffers
-
- 11 May, 2018 1 commit
-
-
Martin Bauer authored
- vectorization for loops with ranges that are not a multiple of vector width - vectorization for variable sized loops if special transformation replace_inner_stride_with_one is run
-
- 27 Apr, 2018 1 commit
-
-
Martin Bauer authored
-
- 13 Apr, 2018 1 commit
-
-
Martin Bauer authored
- removed warnings - added flake8 as CI target
-
- 10 Apr, 2018 4 commits
-
-
Martin Bauer authored
-
Martin Bauer authored
-
Martin Bauer authored
- test run again - notebooks not yet
-
Martin Bauer authored
-
- 05 Mar, 2018 1 commit
-
-
Martin Bauer authored
- in-kernel Neumann boundaries - flag-interface for boundary handling makes one flag field multiple boundary handlings possible - generator: support for bitwise logical operators
-