Commits · a274d66280484b1fcfaa850fc8fea65d95deff5b · pycodegen / pystencils

Mar 15, 2019
- Staggered Kernel: different option for GPU (one block for each code path) · a274d662
  Martin Bauer authored 6 years ago
  
  a274d662
Mar 12, 2019
- CUDA backend: use fastmath by default · 8e4aae93
  Martin Bauer authored 6 years ago
  
  8e4aae93
Feb 26, 2019

Random number generation support for pystencils · 6a01f3e2

Martin Bauer authored 6 years ago

- counter-based philox RNG: counter/key is filled with cell coordinate
  and optional external parameters like block position and time step
- works on CPU and GPU - on CPU only for non-vectorized versions

- introduced more flexible "CustomCodeNode" that can inject
  backend-specific hand-written code

6a01f3e2

Nov 14, 2018

Removed 'symbol_name_to_variable_name' · b72ef215

Martin Bauer authored 6 years ago

- was not used consistently before
- symbol names are expected to be valid C identifiers
- for complicated field names, the latex_name of field should be used

b72ef215

PEP8 compliance · b98decd5
Martin Bauer authored 6 years ago

b98decd5

Pass field information (shape,stride) as single elements instead of arr · 7a94740d

Martin Bauer authored 6 years ago

- small (length < 5) arrays with shape and stride information had to be
  memcpy'd to the GPU before every kernel call
- instead of passing the information as arrays, the single elements are
  passed
- leads to more function arguments, but simplifies GPU kernel calls

-> changes in all backends required

7a94740d

Jun 07, 2018

pystencils field · 8ca5e2fb

Martin Bauer authored 6 years ago

- better latex display for indirect accesses
- new field type 'custom': only custom fields can be accessed indirectly
  no static bounds check possible for custom fields

8ca5e2fb

Apr 30, 2018
- Additional tests · ca31d3cc
  Martin Bauer authored 6 years ago
  
  ca31d3cc
Apr 13, 2018
- flake8 linter · e31f1062
  Martin Bauer authored 6 years ago
```
- removed warnings
- added flake8 as CI target
```
  e31f1062
- pystencils.plot2d refactored: more documentation & tests · 1fcc24b8
  Martin Bauer authored 6 years ago
  
  1fcc24b8
Apr 10, 2018
- Rest of PEP8 renaming · 4a7299f1
  Martin Bauer authored 6 years ago
  
  4a7299f1
- PEP8 name refactoring · d72cd721
  Martin Bauer authored 6 years ago
```
- test run again
- notebooks not yet
```
  d72cd721
- PEP8 naming · 3bcfac93
  Martin Bauer authored 6 years ago
  
  3bcfac93
Feb 06, 2018

Changed parameter bind caching for CPU and GPU kernels · d30c5f73

Martin Bauer authored 7 years ago

- previously all objects where cached by id()
- for waLBerla simulations in each time step a new np.array view
  is created from the waLBerla field. Each of these views has a
  different id -> caching did not work for waLBerla setups
- changed hash for numpy arrays: instead of id, a tuple of
  (dataPtr, strides, shapes) is used as hash input

d30c5f73

New Boundary step system / fixes in lbmpy phasefield · b148d508

Martin Bauer authored 7 years ago

- scaling interface width eta instead of surface tensions tau to correct
  interface profile & surface tensions

b148d508

Jan 31, 2018
- KernelFunction node stores now backend string - compiled functions have ast attribute · 04784686
  Martin Bauer authored 7 years ago
  
  04784686
Jan 19, 2018

Code generation for field serialization into buffers · 979ee93b

João Victor Tozatti Risso authored 7 years ago and

Martin Bauer committed 7 years ago

Concept: Generate code involving the (un)packing of fields (from)to linear
(1D) arrays, i.e. (de)serialization of the field values for buffered
communication.

A linear index is generated for the buffer, by inferring the strides and
variables of the loops over fields in the AST. In the CPU, this information is
obtained through the makeLoopOverDomain function, in
pystencils/transformations/transformations.py. On CUDA, the strides of
the fields (excluding buffers) are combined with the indexing variables to infer
the indexing of the buffer.

What is supported:
    - code generation for both CPU and GPU
    - (un)packing of fields with all the memory layouts supported by
    pystencils
    - (un)packing slices of fields (from)into the buffer
    - (un)packing subsets of cell values from the fields (from)into the buffer

Limitations:

- assumes that only one buffer and one field are being operated within
each kernel, however multiple equations involving the buffer and the
field are supported.

- (un)packing multiple cell values (from)into the buffer is supported,
however it is limited to the fields with indexDimensions=1. The same
applies to (un)packing subset of cell values of each cell.

Changes in this commit:

- add the FieldType enumeration to pystencils/field.py, to mark fields
of various types. This is replaces and is a generalization of the
isIndexedField boolean flag of the Field class. For now, the types
supported are: generic, indexed and buffer fields.

- add the fieldType property to the Field class, which indicates the
type of the field. Modifications were also performed to the member
functions of the Field class to add this property.

- add resolveBufferAccesses function, which replaces the fields marked
as buffers with the actual field access in the AST traversal.

Miscelaneous changes:

- add blockDim and gridDim variables as CUDA indexing variables.

979ee93b

Dec 11, 2017
- LB boundary generation for waLBerla · 6860c7d2
  Martin Bauer authored 7 years ago
  
  6860c7d2
Dec 02, 2017
- Bugfix in CUDA Jit · 74b69826
  Martin Bauer authored 7 years ago
  
  74b69826
Oct 10, 2017
- Renamed types.py to data_types.py · 26cac6b4
  Martin Bauer authored 7 years ago
```
- renaming because of clashes with types.py from other packages
```
  26cac6b4
Jul 21, 2017
- Module to create waLBerla sweeps from pystencils · bab19cf4
  Martin Bauer authored 7 years ago
  
  bab19cf4
Apr 11, 2017

Bugfix in JIT cacheing · 93b1d694

Martin Bauer authored 7 years ago

- cache relied on uniqueness of  python id()
- id may be reused if object is freed
-> object must be held alive
-> kernel keeps all it arguments it was ever called with, alive (problematic in terms of memory consumption)

93b1d694

Fixed CUDA resource problems in GPU test (too many registers used) · eee767f9
Martin Bauer authored 7 years ago
```
-> smaller block
```
eee767f9

Mar 30, 2017
- more CUDA bugfixes & periodicity kernels for CUDA · 117f8b73
  Martin Bauer authored 7 years ago
  
  117f8b73
Mar 24, 2017
- Caching for jitted cpu and gpu kernels (big speedup for small work sizes) · 771a9b22
  Martin Bauer authored 8 years ago
  
  771a9b22
- Conditional AST Node & advanced CUDA indexing · ff641ec9
  Martin Bauer authored 8 years ago
```
- abstraction layer for selecting CUDA block and grid sizes
  - line based (was implemented before)
  - block based (new, more flexible)
-  new conditional (if/else) ast node, which is necessary for indexing schemes (guarding if)
```
  ff641ec9
- GPU bugfixes and lbmpy GPU support · 3f45aed6
  Martin Bauer authored 8 years ago
```
- bugfix for CUDA kernels with variable field sizes
- extended tests for pystencils gpu kernels
```
  3f45aed6
- pystencils: indexed kernels for GPUs · cb511afe
  Martin Bauer authored 8 years ago
  
  cb511afe
Mar 01, 2017

pystencils: cpujit · dd17cd30

Martin Bauer authored 8 years ago

- windows support
- automatic caching and creation of shared library with all generated kernels
- restrict keyword and function prefixes are preprocessor macros now -> easier to generate one code for linux, cuda, windows

dd17cd30

Feb 21, 2017
- lbmpy: cuda & square channel scenario · 235b9062
  Martin Bauer authored 8 years ago
  
  235b9062
Dec 08, 2016
- Support for symbol names which are not legal C++ variable identifiers · 96566fce
  Martin Bauer authored 8 years ago
  
  96566fce
Nov 06, 2016
- CUDA jit, first test working · 0735ea85
  Martin Bauer authored 8 years ago
  
  0735ea85
- Worked on CUDA code generation · 8c693cd1
  Martin Bauer authored 8 years ago
  
  8c693cd1
Nov 03, 2016

Documentation & Restructuring · aea202a7

Martin Bauer authored 8 years ago

- added sphinx files for documentation generation
- collected kernel creation functions in new "cpu" and "cudagpu" modules

aea202a7

Nov 02, 2016
- Restructuring: moved to pystencils · 61541046
  Martin Bauer authored 8 years ago
  
  61541046

Admin message