Commits · 6373c03aeba2af10d15ee259677ae5f1d7d9a76f · pycodegen / pystencils

Jul 11, 2019
- Import sorting using isort · 6373c03a
  Martin Bauer authored 5 years ago
  
  6373c03a
- Restructured KernelFunction node to get rid of monkey-patching · b9e53581
  Martin Bauer authored 5 years ago
```
- backend, target and compile are now normal members of the
  KernelFunction node and populated in constructor
```
  b9e53581
Jul 10, 2019

Add DestructuringBindingsForFieldClass to use pystencils kernels in a more C++-ish way · 8e63c9ff

Stephan Seitz authored 5 years ago

DestructuringBindingsForFieldClass defines all field-related variables
in its subordinated block.
However, it leaves a TypedSymbol of type 'Field' for each field
undefined.
By that trick we can generate kernels that accept structs as
kernelparameters.
Either to include a pystencils specific Field struct of the following
definition:

```cpp
template<DTYPE_T, DIMENSION>
struct Field
{
    DTYPE_T* data;
    std::array<DTYPE_T, DIMENSION> shape;
    std::array<DTYPE_T, DIMENSION> stride;
}

or to be able to destructure user defined types like `pybind11::array`,
`at::Tensor`, `tensorflow::Tensor`

```

8e63c9ff

Jul 08, 2019

Add global_declarations to cbackend · 3463ff54

Stephan Seitz authored 5 years ago

This enables astnodes.Nodes to have a member required_global_declarations
by which they can specify a global declaration required for their usage.

3463ff54

Jun 27, 2019
- Add test for `address_of` · 9f79445e
  Stephan Seitz authored 5 years ago
  
  9f79445e
Jun 18, 2019

CUDA indexing: clip to maximum cuda block size · 1754ef27

Martin Bauer authored 5 years ago

- previous method did not work with kernels generated for walberla where
  block size changes are made at runtime
- device query does not always work, since the compile system may have
  no GPU or not the same GPU
-> max block size is passed as parameter and only optionally determined
   by a device query

release/0.2.3

1754ef27

Jun 12, 2019

Bugfix - Vectorization of in-place LB update wrong · 754c7767

Martin Bauer authored 5 years ago

Block.subs method tried to be too smart:
a = field[..]
b = a + b

was "simplified" incorrectly to
b = field[...] + b

754c7767

May 05, 2019
- Refactoring of plotting and stencil plotting · 7b4c3f2d
  Martin Bauer authored 5 years ago
```
- stencil plotting & transformation now in ps.stencil
- additional documentation & notebooks
```
  release/0.2.2
  
  7b4c3f2d
May 03, 2019
- Kerncraft interface: update to work with kerncraft 0.8.0 · 0998f2e1
  Martin Bauer authored 5 years ago
  
  0998f2e1
Apr 28, 2019
- Tests and documentation for derivative module · 61d1bae6
  Martin Bauer authored 5 years ago
  
  61d1bae6
- Added CI and test files · bc82de86
  Martin Bauer authored 5 years ago
  
  bc82de86
Apr 26, 2019

Enhancement in move_constants_before loop · eec4dc4b

Martin Bauer authored 5 years ago

When two loops have assignments to the same symbol with different
rhs and both are pulled before the loops, one of them is now renamed.
Previously one of them was left inside the loop.

Fixes #27

eec4dc4b

Apr 24, 2019

Improvements for GPU code generation · f504b40f

Martin Bauer authored 5 years ago

- turned on restrict keyword by default (makes large difference on GPUs)
- smarter block indexing: changing block size depending on domain size
  Example: previously there where (1,1,1) blocks when requested
  block size was (64, 1, 1) and domain size (1, 512, 512), now the
  block size is changed automatically to (1, 64, 1) in this case
- added __lauch_bounds__ to kernels to allow better optimizations from
  the CUDA compiler

f504b40f

Apr 03, 2019
- Bugfix in vectorization, in case conditionals are pulled before loop · a3cb1634
  Martin Bauer authored 5 years ago
  
  a3cb1634
Apr 01, 2019
- Support for conditionals in vectorized loops (by specifying all/any) · 696eb2d5
  Martin Bauer authored 5 years ago
  
  696eb2d5
Mar 28, 2019
- pystencils.fds: staggered spatial finite differences · 42c9e289
  Martin Bauer authored 5 years ago
  
  42c9e289
Mar 22, 2019
- Benchmark fixes · 5fdda96c
  Martin Bauer authored 6 years ago
  
  5fdda96c
- Additional tests for packinfo generation & fast approximation for div and sqrt · 0df63c2d
  Martin Bauer authored 6 years ago
  
  0df63c2d
Mar 21, 2019

Separated modules into subfolders with own setup.py · 1e02cdc7

Martin Bauer authored 6 years ago

This restructuring allows for easier separation of modules into
separate repositories later. Also, now pip install with repo url can be
used.

The setup.py files have also been updated to correctly reference each
other. Module versions are not extracted from git state

1e02cdc7

Admin message