Commits · 047846863ed71c7a13633b90aaaaf972dc83397d · pycodegen / pystencils

Jan 31, 2018
- KernelFunction node now stores the backend string - compiled functions have an ast attribute · 04784686
  Martin Bauer authored 7 years ago
  
  04784686
Jan 19, 2018

Code generation for field serialization into buffers · 979ee93b

João Victor Tozatti Risso authored 7 years ago and

Martin Bauer committed 7 years ago

Concept: Generate code involving the (un)packing of fields (from)to linear
(1D) arrays, i.e. (de)serialization of the field values for buffered
communication.

A linear index is generated for the buffer by inferring the strides and
variables of the loops over fields in the AST. On the CPU, this information is
obtained through the makeLoopOverDomain function, in
pystencils/transformations/transformations.py. On CUDA, the strides of
the fields (excluding buffers) are combined with the indexing variables to infer
the indexing of the buffer.
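
The linear buffer index described above can be illustrated with a minimal sketch in plain Python. It does not use pystencils; the function names (linear_buffer_index, pack, unpack) and the nested-list field are assumptions made for illustration only. The loop counters are combined with strides derived from the loop extents, which is the idea the commit describes for the generated code.

```python
def linear_buffer_index(counters, extents):
    """Combine loop counters into a row-major linear index.

    The stride of each loop is the product of the extents of all loops
    nested inside it; Horner's scheme applies those strides implicitly.
    """
    index = 0
    for counter, extent in zip(counters, extents):
        index = index * extent + counter
    return index


def pack(field, x_range, y_range, values_per_cell):
    """Serialize a slice of a field (nested lists, field[x][y][f]) into a 1D buffer."""
    extents = (len(x_range), len(y_range), values_per_cell)
    buffer = [None] * (extents[0] * extents[1] * extents[2])
    for i, x in enumerate(x_range):
        for j, y in enumerate(y_range):
            for f in range(values_per_cell):
                buffer[linear_buffer_index((i, j, f), extents)] = field[x][y][f]
    return buffer


def unpack(buffer, field, x_range, y_range, values_per_cell):
    """Write the buffer contents back into the same slice of a field."""
    extents = (len(x_range), len(y_range), values_per_cell)
    for i, x in enumerate(x_range):
        for j, y in enumerate(y_range):
            for f in range(values_per_cell):
                field[x][y][f] = buffer[linear_buffer_index((i, j, f), extents)]
```

Packing and unpacking share the same index function, so a value written at a given (cell, value) position is read back from the same buffer slot.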

What is supported:
    - code generation for both CPU and GPU
    - (un)packing of fields with all the memory layouts supported by
    pystencils
    - (un)packing slices of fields (from)into the buffer
    - (un)packing subsets of cell values from the fields (from)into the buffer

Limitations:

- assumes that only one buffer and one field are operated on within
each kernel; however, multiple equations involving the buffer and the
field are supported.

- (un)packing multiple cell values (from)into the buffer is supported;
however, it is limited to fields with indexDimensions=1. The same
applies to (un)packing a subset of the values of each cell.

Changes in this commit:

- add the FieldType enumeration to pystencils/field.py, to mark fields
of various types. This replaces and generalizes the
isIndexedField boolean flag of the Field class. For now, the types
supported are: generic, indexed and buffer fields.

- add the fieldType property to the Field class, which indicates the
type of the field. Modifications were also performed to the member
functions of the Field class to add this property.

- add resolveBufferAccesses function, which replaces the fields marked
as buffers with the actual field access in the AST traversal.
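
As a rough illustration of the first two changes, the sketch below shows how an enumeration can generalize a boolean flag. The member names follow the three types listed above; the SketchField class, its constructor and the enum values are hypothetical and are not taken from pystencils/field.py.

```python
from enum import Enum


class FieldType(Enum):
    # The three kinds of field named in the commit message;
    # the numeric values are arbitrary.
    GENERIC = 0
    INDEXED = 1
    BUFFER = 2


class SketchField:
    """Hypothetical stand-in for the Field class, reduced to the type tag."""

    def __init__(self, name, fieldType=FieldType.GENERIC):
        self.name = name
        self._fieldType = fieldType

    @property
    def fieldType(self):
        return self._fieldType

    @property
    def isIndexedField(self):
        # What the old boolean flag expressed is now one case of the enum.
        return self._fieldType == FieldType.INDEXED
```

With an enumeration, adding a further kind of field means adding one member rather than another boolean flag.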

Miscellaneous changes:

- add blockDim and gridDim variables as CUDA indexing variables.

979ee93b

Dec 11, 2017
- LB boundary generation for waLBerla · 6860c7d2
  Martin Bauer authored 7 years ago
  
  6860c7d2
Dec 02, 2017
- Bugfix in CUDA Jit · 74b69826
  Martin Bauer authored 7 years ago
  
  74b69826
Oct 10, 2017
- Renamed types.py to data_types.py · 26cac6b4
  Martin Bauer authored 7 years ago
```
- renaming because of clashes with types.py from other packages
```
  26cac6b4
Jul 21, 2017
- Module to create waLBerla sweeps from pystencils · bab19cf4
  Martin Bauer authored 7 years ago
  
  bab19cf4
Apr 11, 2017

Bugfix in JIT caching · 93b1d694

Martin Bauer authored 7 years ago

- cache relied on uniqueness of python id()
- id may be reused if object is freed
-> object must be kept alive
-> kernel keeps alive all arguments it was ever called with (problematic in terms of memory consumption)
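
The pattern behind this fix can be sketched in plain Python. The class below is a hypothetical illustration, not the pystencils JIT cache: it keys entries on id(obj) and stores a reference to the object next to the cached result, so the id cannot be recycled for a different object while the entry exists.

```python
class IdKeyedCache:
    """Cache results per argument object, keyed on id(obj)."""

    def __init__(self, compute):
        self._compute = compute
        self._entries = {}  # id(obj) -> (obj, result)
        self.misses = 0

    def __call__(self, obj):
        key = id(obj)
        entry = self._entries.get(key)
        if entry is None:
            self.misses += 1
            # Storing obj itself keeps it alive; without this reference the
            # object could be freed and its id handed to a new object,
            # which would then hit a stale cache entry.
            entry = (obj, self._compute(obj))
            self._entries[key] = entry
        return entry[1]
```

The trade-off is the one noted above: every argument ever passed stays referenced by the cache, so memory grows with the number of distinct arguments.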

93b1d694

Fixed CUDA resource problems in GPU test (too many registers used) · eee767f9
Martin Bauer authored 7 years ago
```
-> smaller block
```
eee767f9

Mar 30, 2017
- more CUDA bugfixes & periodicity kernels for CUDA · 117f8b73
  Martin Bauer authored 8 years ago
  
  117f8b73
Mar 24, 2017
- Caching for jitted cpu and gpu kernels (big speedup for small work sizes) · 771a9b22
  Martin Bauer authored 8 years ago
  
  771a9b22
- Conditional AST Node & advanced CUDA indexing · ff641ec9
  Martin Bauer authored 8 years ago
```
- abstraction layer for selecting CUDA block and grid sizes
  - line based (was implemented before)
  - block based (new, more flexible)
- new conditional (if/else) AST node, which is necessary for indexing schemes (guarding if)
```
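
Why block-based indexing needs a guarding if can be shown with a small simulation in plain Python. The function names are hypothetical and the loops only imitate CUDA block and thread indices. When the domain size is not a multiple of the block size, the grid is rounded up, so some threads fall outside the domain and must be masked out.

```python
def grid_size(domain_shape, block_shape):
    """Number of blocks per dimension: ceil(domain / block)."""
    return tuple((n + b - 1) // b for n, b in zip(domain_shape, block_shape))


def visited_cells(domain_shape, block_shape):
    """Enumerate the cells a 2D block-based launch would actually process."""
    grid = grid_size(domain_shape, block_shape)
    cells = []
    for bx in range(grid[0]):                  # imitates blockIdx.x
        for by in range(grid[1]):              # imitates blockIdx.y
            for tx in range(block_shape[0]):   # imitates threadIdx.x
                for ty in range(block_shape[1]):  # imitates threadIdx.y
                    x = bx * block_shape[0] + tx
                    y = by * block_shape[1] + ty
                    # Guarding if: skip threads that lie outside the domain.
                    if x < domain_shape[0] and y < domain_shape[1]:
                        cells.append((x, y))
    return cells
```

For a 10 x 7 domain with 4 x 4 blocks, the launch covers 12 x 8 threads, and the guard reduces them to the 70 cells of the domain.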
  ff641ec9
- GPU bugfixes and lbmpy GPU support · 3f45aed6
  Martin Bauer authored 8 years ago
```
- bugfix for CUDA kernels with variable field sizes
- extended tests for pystencils gpu kernels
```
  3f45aed6
- pystencils: indexed kernels for GPUs · cb511afe
  Martin Bauer authored 8 years ago
  
  cb511afe
Mar 01, 2017

pystencils: cpujit · dd17cd30

Martin Bauer authored 8 years ago

- windows support
- automatic caching and creation of shared library with all generated kernels
- restrict keyword and function prefixes are now preprocessor macros -> easier to generate a single code base for Linux, CUDA and Windows
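
The macro approach in the last point might look roughly like the sketch below. The macro names RESTRICT and FUNC_PREFIX, the helper functions and the example kernel are assumptions made for illustration; only the compiler keywords themselves (`__restrict__`, `__restrict`, `__declspec(dllexport)`, `__global__`) are standard. The kernel source stays identical for every target, and only the generated header differs.

```python
# Per-target definitions of two hypothetical macros.
PLATFORM_MACROS = {
    "linux": {"RESTRICT": "__restrict__", "FUNC_PREFIX": ""},
    "windows": {"RESTRICT": "__restrict", "FUNC_PREFIX": "__declspec(dllexport)"},
    "cuda": {"RESTRICT": "__restrict__", "FUNC_PREFIX": "__global__"},
}

# Kernel text written once, in terms of the macros only.
KERNEL_SOURCE = (
    "FUNC_PREFIX void kernel(double * RESTRICT dst, const double * RESTRICT src)\n"
    "{\n"
    "    dst[0] = src[0];\n"
    "}\n"
)


def macro_header(target):
    """Return the #define lines for one target platform."""
    macros = PLATFORM_MACROS[target]
    return "".join(
        "#define {} {}\n".format(name, value).replace(" \n", "\n")
        for name, value in macros.items()
    )


def full_source(target):
    """Prepend the target-specific header to the shared kernel text."""
    return macro_header(target) + "\n" + KERNEL_SOURCE
```

Calling full_source("windows") and full_source("cuda") yields the same kernel body with different definitions in front of it.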

dd17cd30

Feb 21, 2017
- lbmpy: cuda & square channel scenario · 235b9062
  Martin Bauer authored 8 years ago
  
  235b9062
Dec 08, 2016
- Support for symbol names which are not legal C++ variable identifiers · 96566fce
  Martin Bauer authored 8 years ago
  
  96566fce
Nov 06, 2016
- CUDA jit, first test working · 0735ea85
  Martin Bauer authored 8 years ago
  
  0735ea85
- Worked on CUDA code generation · 8c693cd1
  Martin Bauer authored 8 years ago
  
  8c693cd1
Nov 03, 2016

Documentation & Restructuring · aea202a7

Martin Bauer authored 8 years ago

- added sphinx files for documentation generation
- collected kernel creation functions in new "cpu" and "cudagpu" modules

aea202a7

Nov 02, 2016
- Restructuring: moved to pystencils · 61541046
  Martin Bauer authored 8 years ago
  
  61541046
