pystencils merge requests
https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests

Add pystencils-autodiff
https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/26 (Stephan Seitz, 2019-08-09)

This adds pystencils_autodiff (https://pypi.org/project/pystencils-autodiff/0.1.3/) to pystencils.
After installing the extension, you can access all its classes in the submodule `pystencils.autodiff`.
If it is not installed but you try to import it, you get an error with installation instructions.
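The import-guard behaviour described here is a common pattern; a minimal generic sketch (the helper name `load_extension` is hypothetical, not the actual pystencils code):

```python
import importlib


def load_extension(name, install_hint):
    """Import an optional extension module, raising a helpful
    error with installation instructions if it is missing."""
    try:
        return importlib.import_module(name)
    except ImportError as err:
        raise ImportError(
            f"{name} is not installed. Install it with: {install_hint}"
        ) from err
```
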
The internal code of pystencils_autodiff is still very ugly.
I hope I can clean it up in the coming days/weeks.

Seeding of RNG
https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/23 (Michael Kuron <mkuron@icp.uni-stuttgart.de>, 2019-08-09; assignee: Martin Bauer)

For https://i10git.cs.fau.de/pycodegen/lbmpy/merge_requests/2

Add AssignmentCollection.has_exclusive_writes
https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/22 (Stephan Seitz, 2019-08-07)

An assumption of pystencils is that output stencil writes never overlap.
This allows massive parallelization without race conditions or atomics.
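A minimal sketch of what such an exclusive-writes check could look like (hypothetical data model, not the actual pystencils API; assignments are modelled here as `(lhs, rhs)` pairs where `lhs` is a `(field, offset)` tuple):

```python
def has_exclusive_writes(assignments):
    """Return True if no two assignments write to the same
    output location, i.e. writes never overlap."""
    seen = set()
    for lhs, _rhs in assignments:  # lhs: (field_name, offset) tuple
        if lhs in seen:
            return False  # two assignments write the same location
        seen.add(lhs)
    return True
```
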
When I use my autodiff transformations, I use this condition to check
whether the assumption still holds for the backward assignments.

WIP: Astnodes for interpolation
https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/20 (Stephan Seitz, 2019-09-24)

This PR may still need some clean-up.
However, it would be good to already receive some feedback.
What works:
- Using CUDA textures
- Using HW accelerated interpolation for float32 textures
- Implementing linear interpolation either in software (CPU, GPU) or via texture accesses without HW interpolation but with HW boundary handling
- Adding transformed coordinate systems to fields
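As a rough illustration of the software path for linear interpolation with clamped boundary handling (a generic 1D sketch, not the pystencils implementation):

```python
def lerp_clamped(data, x):
    """1D linear interpolation at real-valued position x,
    clamping out-of-range coordinates to the array edges
    (analogous to CUDA's clamp boundary mode)."""
    x = min(max(x, 0.0), len(data) - 1.0)  # clamp to the valid range
    i = int(x)                              # lower neighbour index
    j = min(i + 1, len(data) - 1)           # upper neighbour, clamped
    w = x - i                               # fractional weight
    return (1.0 - w) * data[i] + w * data[j]
```
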
What does not work:
- HW boundary handling for CUDA textures with the boundary modes `mirror` and `wrap` (apparently they have been removed from CUDA's API but are still present in pycuda). The only remaining modes are:
```
cudaBoundaryModeZero  = 0   // zero boundary mode
cudaBoundaryModeClamp = 1   // clamp boundary mode
cudaBoundaryModeTrap  = 2   // trap boundary mode
```
Wtf is trap boundary mode? Nothing is documented so we can only experiment.
What kind of works:
- B-spline interpolation on the GPU using this repo as a submodule (http://www.dannyruijters.nl/cubicinterpolation/); too lazy for tests so far, and I don't know how to prove correctness
- Textures for dtypes with itemsize > 4. PyCUDA has a helper header (https://github.com/inducer/pycuda/blob/master/pycuda/cuda/pycuda-helpers.hpp) that loads doubles via two int fetches. However, this hack only seems to work if we add a 0.5 offset and make all functions in this header accept float.

Run flynt (https://pypi.org/project/flynt/) on pystencils
https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/19 (Stephan Seitz, 2019-08-06)

This replaces usages of `"" % s` formatting with Python's f-strings.
Is this a good thing? I don't know. :shrug:
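For illustration, the kind of rewrite flynt performs (hypothetical example; flynt itself edits the source files):

```python
name, errors = "rng.py", 3

# before: printf-style formatting
old = "Checked %s and found %d problems" % (name, errors)

# after: the equivalent f-string flynt would produce
new = f"Checked {name} and found {errors} problems"
```
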
Fix #10: Add jinja2 to pystencils's dependencies
https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/17 (Stephan Seitz, 2019-08-06)

The alternative would be to remove jinja2 (see the other PR).
However, I think the dependency on jinja2 is not too heavy.
This could make some implementations more elegant.

implemented derivation of gradient weights via rotation
https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/13 (Markus Holzer, 2019-08-03)

Derive gradient weights of the other direction with
the already calculated weights of one direction
via rotation and apply them to a field.
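A generic illustration of the idea (not the actual pystencils code): rotating a 2D finite-difference stencil by 90 degrees turns the weights for one direction into the weights for another; the sign convention depends on the index orientation.

```python
def rotate90(stencil):
    """Rotate a 2D stencil (list of rows) by 90 degrees
    (transpose, then reverse the rows)."""
    return [list(row) for row in zip(*stencil)][::-1]


# central-difference weights for the derivative in one direction
ddx = [[0, 0, 0],
       [-0.5, 0, 0.5],
       [0, 0, 0]]

# weights for the other direction, derived purely by rotation
ddy = rotate90(ddx)
```
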
Fixup for DestructuringBindingsForFieldClass
https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/11 (Stephan Seitz, 2019-07-18)

- rename the header: Field.h is not a unique name in the waLBerla context
- add PyStencilsField.h
- bindings were lacking the data type

Add CustomSympyPrinter._print_Sum
https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/10 (Stephan Seitz, 2019-08-05)

This makes sympy.Sum printable as an immediately invoked lambda (attention: C++-only, but works in CUDA).

Auto-format pystencils/rng.py (trailing whitespace)
https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/8 (Stephan Seitz, 2019-07-18)

My editor feels better if that whitespace is not there.

Add `pystencils.make_python_function` used for KernelFunction.compile
https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/7 (Stephan Seitz, 2019-07-18)

`KernelFunction.compile = None` is currently set by the
`create_kernel` function of each respective backend, as a partial function
of `<backend>.make_python_function`.
The code would be clearer with a unified `make_python_function`.
`KernelFunction.compile` can then be implemented as a call to this
function with the respective backend.

Fix deprecation warning: `collections.abc` instead of `abc`
https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/6 (Stephan Seitz, 2019-07-10)

DeprecationWarning: Using or importing the ABCs from 'collections'
instead of from 'collections.abc' is deprecated, and in 3.8 it will stop working
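The fix looks like this (a generic sketch of the pattern; `Iterable` is just one of the affected ABCs):

```python
# deprecated since Python 3.3, alias removed in Python 3.10:
# from collections import Iterable

# correct import location for the abstract base classes:
from collections.abc import Iterable

assert isinstance([1, 2, 3], Iterable)
```
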
Add autodiff
https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/2 (Stephan Seitz, 2019-07-11)

Draft for a minimal integration of automatic differentiation. The Tensorflow and Torch backends were removed (apart from AutoDiffOp.create_tensorflow()).
Only tests without LBM or tf/torch dependencies have been added. This also implies that numeric gradient checking is missing (it depends on either tf or torch). Should we really move the backends into separate modules?
Apart from adding the auto-differentiation functionality, I added two more changes. I would be happy to remove AutoDiffOp.create_tensorflow() after feedback.
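As background on what the backward pass of a stencil looks like (a generic sketch, not the pystencils_autodiff implementation): for a linear stencil, the gradient with respect to the input is obtained by applying the same stencil with flipped weights.

```python
def forward(x, w):
    """Apply a 3-point stencil w to x with zero padding."""
    n = len(x)
    pad = [0.0] + list(x) + [0.0]
    return [sum(w[k] * pad[i + k] for k in range(3)) for i in range(n)]


def backward(grad_y, w):
    """Backward pass for loss = sum over outputs: the gradient w.r.t.
    the input is the flipped stencil applied to the output gradient."""
    return forward(grad_y, w[::-1])
```

For `loss = sum(forward(x, w))`, the gradient at interior points is the sum of the weights, while boundary points miss the contributions that fell into the zero padding.
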