pystencils merge requests

pystencils merge requests https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests 2019-08-20T16:27:21+02:00 https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/37 Remove main methods from tests (sorry for adding them) 2019-08-20T16:27:21+02:00 Stephan Seitz

Remove main methods from tests (sorry for adding them)

... or code will be executed when pytest is collecting the tests. I found out that I can use "-s" to convince vim-test to show me test output. ... or code will be executed when pytest is collecting the tests. I found out that I can use "-s" to convince vim-test to show me test output. https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/36 Pre-push hook 2019-08-20T16:28:40+02:00 Stephan Seitz

Pre-push hook

This prevents me from pushing stuff that either fails in quicktest or flake8. Has to be copied manually to `.git/hooks` and `python3` has to be adapted to your Python executable. ~~Is there an update in flake8 that `.flake8` is not... This prevents me from pushing stuff that either fails in quicktest or flake8. Has to be copied manually to `.git/hooks` and `python3` has to be adapted to your Python executable. ~~Is there an update in flake8 that `.flake8` is not recognized automatically anymore and that we need to append C901?~~ Probably, I installed just different linter on my PC at home. flake8 can use different linters. https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/35 Fix get_type_of_expression for constants like sympy.pi 2019-08-22T08:31:17+02:00 Stephan Seitz

Fix get_type_of_expression for constants like sympy.pi

Problem: some constant expressions are neither Float,Integer,Rational and don't have arguments. ```python >>> from sympy import * >>> isinstance(pi, Integer) False >>> isinstance(pi, Float) False >>> isinstance(pi, Rational) F... Problem: some constant expressions are neither Float,Integer,Rational and don't have arguments. ```python >>> from sympy import * >>> isinstance(pi, Integer) False >>> isinstance(pi, Float) False >>> isinstance(pi, Rational) False >>> pi.args () ``` https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/34 Address #13: Use sympy.codegen.rewriting.optimize 2019-09-23T10:55:13+02:00 Stephan Seitz

Address #13: Use sympy.codegen.rewriting.optimize

It's really comfortable to write optimizations in terms of `sympy.codegen.rewrite.RewriteOptim`: ```python # Evaluates all constant terms evaluate_constant_terms = ReplaceOptim( lambda e: hasattr(e, 'is_constant') a... It's really comfortable to write optimizations in terms of `sympy.codegen.rewrite.RewriteOptim`: ```python # Evaluates all constant terms evaluate_constant_terms = ReplaceOptim( lambda e: hasattr(e, 'is_constant') and e.is_constant, lambda p: p.evalf() ) ``` This PR adds a parameter `sympy_optimizations` to the `create_*_kernel` functions that applies the list of optimizations to the assignments before creating the AST. `sympy.codegen.rewrite` already has some optimizations. Some similar to the optimizations of pystencils. For example `create_expand_pow_optimization(limit)` is really similar to the logic in `CustomSympyPrinter._print_Pow`. See #13 Problem: old versions of sympy (e.g. from ubuntu CI) don't have `sympy.codegen.rewrite`. The optimizations are skipped in that case. `test_and_coverage` applies all optimizations. We could also try to implement a fma-optimization (fused-multipy add) with that and `sympy.Wild`. https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/33 Add KernelFunction.fields_written 2019-08-16T08:59:16+02:00 Stephan Seitz

Add KernelFunction.fields_written

I found myself needing this convenience wrapper in various places. I found myself needing this convenience wrapper in various places. https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/32 Bugfix: Readd __launch_bounds__ for dialect 'cuda' 2019-08-15T09:14:26+02:00 Stephan Seitz

Bugfix: Readd __launch_bounds__ for dialect 'cuda'

__launch_bounds__ was deactivated when introducing `CudaBackend` __launch_bounds__ was deactivated when introducing `CudaBackend` https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/31 Bugfix: TypedSymbol.is_negative should not be implemented in terms of super().is_positive 2019-08-14T17:03:02+02:00 Stephan Seitz

Bugfix: TypedSymbol.is_negative should not be implemented in terms of super().is_positive

This can lead to surprising simplifications This can lead to surprising simplifications https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/30 AES-NI Random Number Generator 2019-09-02T10:21:21+02:00 Michael Kuron mkuron@icp.uni-stuttgart.de

AES-NI Random Number Generator

I was looking at how to vectorize the Philox RNG yesterday. Before I knew it, I had implemented a working RNG using AES-NI instructions :nerd: ... Not entirely what I had intended to do, but it might still be useful to someone and should... I was looking at how to vectorize the Philox RNG yesterday. Before I knew it, I had implemented a working RNG using AES-NI instructions :nerd: ... Not entirely what I had intended to do, but it might still be useful to someone and should be similarly fast as a vectorized Philox. There is one place that could be optimized because I fall back to scalar instructions: I failed to reimplement `_mm_cvtepu64_pd` (the solution from https://stackoverflow.com/a/41148578 produces incorrect results in the least-significant half of the mantissa). Perhaps someone else can try to fix that. I did not integrate this with the `vector_instruction_set` parameter of the code generation. Perhaps you can do that, @bauer. It needs support for SSE2 and AES instructions (which look like SSE2 instructions, but their availability is determined by a separate CPUID flag). It will also make use of `_mm_cvtepu32_ps` and `_mm_cvtepu64_pd` from AVX512 if available (these are 128-bit instructions that actually look like SSE2 instructions). Martin Bauer Martin Bauer https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/29 Basic support for OpenCL (experimental) 2019-08-22T08:37:37+02:00 Stephan Seitz

Basic support for OpenCL (experimental)

Basic support for OpenCL Problem: OpenCL cannot import `stdint.h`. Temporary fix: define custom `opencl_stdint.h` (~~defines currently only `int64_t`~~ `) TODO: - ~~implement `opencl_stdint.h`~~ - implement shard_mem, textures,... Basic support for OpenCL Problem: OpenCL cannot import `stdint.h`. Temporary fix: define custom `opencl_stdint.h` (~~defines currently only `int64_t`~~ `) TODO: - ~~implement `opencl_stdint.h`~~ - implement shard_mem, textures, built-in functions - ~~avoid CUDA intrinsics (`fast_div`)~~ https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/28 Philox tests and clean up 2019-08-13T14:17:37+02:00 Michael Kuron mkuron@icp.uni-stuttgart.de

Philox tests and clean up

Test the Philox against reference data and clean up duplicated code in the code generation. The latter will make it easier to later add a vectorized Philox. Test the Philox against reference data and clean up duplicated code in the code generation. The latter will make it easier to later add a vectorized Philox. Martin Bauer Martin Bauer https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/27 Fix error message of CBackend for unsupported nodes 2019-08-15T09:15:02+02:00 Stephan Seitz

Fix error message of CBackend for unsupported nodes

Concatenating `__class__` and `str` is not supported. Should be `str(type(self))` (full type path) or `self.__class__.__name__` (just class name) Concatenating `__class__` and `str` is not supported. Should be `str(type(self))` (full type path) or `self.__class__.__name__` (just class name) https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/25 Make generate_c also work if astnode does not have member `instruction_set` 2019-08-06T22:07:34+02:00 Stephan Seitz

Make generate_c also work if astnode does not have member `instruction_set`

generate_c currently only works for KernelFunctions, since member `instruction_set` is required. generate_c can generate code for any astnode if this requirement is dropped. generate_c currently only works for KernelFunctions, since member `instruction_set` is required. generate_c can generate code for any astnode if this requirement is dropped. https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/24 Remove deprecation warning ('cachedir' parameter has been deprecated) 2019-08-08T08:57:45+02:00 Stephan Seitz

Remove deprecation warning ('cachedir' parameter has been deprecated)

Warning was: ``` /localhome/seitz_local/projects/pystencils/pystencils/cache.py:15: DeprecationWarning: The 'cachedir' parameter has been deprecated in version 0.12 and will be removed in version 0.14. You provided "cachedir='/local... Warning was: ``` /localhome/seitz_local/projects/pystencils/pystencils/cache.py:15: DeprecationWarning: The 'cachedir' parameter has been deprecated in version 0.12 and will be removed in version 0.14. You provided "cachedir='/localhome/seitz_local/.cache/pystencils'", use "location='/localhome/seitz_local/.cache/pystencils'" instead. disk_cache = Memory(cachedir=cache_dir, verbose=False).cache ``` https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/21 Add RELEASE-VERSION to .gitignore 2019-08-06T22:04:11+02:00 Stephan Seitz

Add RELEASE-VERSION to .gitignore

https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/18 Fix #10: Avoid jinja2 dependency 2019-08-06T08:05:02+02:00 Stephan Seitz

Fix #10: Avoid jinja2 dependency

This commit avoid dependency of core pystencils on jinja2. However this could make the printing of some AST-nodes less elegant (see https://i10git.cs.fau.de/pycodegen/pystencils/merge_requests/17). This commit avoid dependency of core pystencils on jinja2. However this could make the printing of some AST-nodes less elegant (see https://i10git.cs.fau.de/pycodegen/pystencils/merge_requests/17). https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/16 Declare FieldShapeSymbol and FieldStrideSymbol as strictly positive 2019-08-06T08:06:27+02:00 Stephan Seitz

Declare FieldShapeSymbol and FieldStrideSymbol as strictly positive

We can assume that FieldShapeSymbol and FieldStrideSymbol are always positive. `TypedSymbol` should forward kwargs to `sympy.Symbol`. We can assume that FieldShapeSymbol and FieldStrideSymbol are always positive. `TypedSymbol` should forward kwargs to `sympy.Symbol`. https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/15 implemented derivation of gradient weights via rotation 2020-11-25T13:23:50+01:00 Markus Holzer

implemented derivation of gradient weights via rotation

derive gradient weights of other direction with already calculated weights of one direction via rotation and apply them to a field. derive gradient weights of other direction with already calculated weights of one direction via rotation and apply them to a field. https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/14 Remove floor, ceiling for integer symbols 2019-08-02T22:26:37+02:00 Stephan Seitz

Remove floor, ceiling for integer symbols

# Original Intent Allow optimizations by SymPy when we know that a `TypedSymbol` `is_integer` or `is_real` (e.g. drop rounding functions). We can deduce some of those properties with Numpy's type system (https://docs.scipy.org/doc... # Original Intent Allow optimizations by SymPy when we know that a `TypedSymbol` `is_integer` or `is_real` (e.g. drop rounding functions). We can deduce some of those properties with Numpy's type system (https://docs.scipy.org/doc/numpy-1.13.0/reference/arrays.scalars.html). We have to be careful since all the `is_*` methods have ternary logic (`True`, `False`, `None`== we don't know). Field.Access can take advantage of those optimizations by making it a subclass of `TypedSymbol`. # Extended Changes By writing a test I realized that it would be handy to compare `AssignmentCollection`s and use the functions `find`, `match`, `subs`, `replace` of SymPy. https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/12 fix compiler options for macOS 2019-07-31T09:14:52+02:00 Michael Kuron mkuron@icp.uni-stuttgart.de

fix compiler options for macOS

Martin Bauer Martin Bauer https://i10git.cs.fau.de/pycodegen/pystencils/-/merge_requests/9 Add CudaBackend, CudaSympyPrinter 2019-07-18T10:04:27+02:00 Stephan Seitz

Add CudaBackend, CudaSympyPrinter

Add CudaBackend, CudaSympyPrinter to extract CUDA-specific code from CBackend, CustomSympyPrinter Cuda built-ins are added to `CudaSympyPrinter.known_functions` to use them as sympy.Function Add CudaBackend, CudaSympyPrinter to extract CUDA-specific code from CBackend, CustomSympyPrinter Cuda built-ins are added to `CudaSympyPrinter.known_functions` to use them as sympy.Function