- Nov 16, 2018
-
-
Martin Bauer authored
-> by default sympy generates "fmin" C-function for sp.Min which is wrong for integer parameters
-
- Nov 14, 2018
-
-
Martin Bauer authored
- small (length < 5) arrays with shape and stride information had to be memcpy'd to the GPU before every kernel call - instead of passing the information as arrays, the single elements are passed - leads to more function arguments, but simplifies GPU kernel calls -> changes in all backends required
-
- Nov 13, 2018
-
-
Martin Bauer authored
-
- Oct 26, 2018
-
-
Martin Bauer authored
-
- Oct 25, 2018
-
-
Martin Bauer authored
-
- Oct 19, 2018
-
-
Martin Bauer authored
-
- Sep 05, 2018
-
-
Martin Bauer authored
-
Martin Bauer authored
- constants are typed - no need any more for this parameter
-
- Jul 06, 2018
-
-
Martin Bauer authored
-
- Jun 25, 2018
-
-
Martin Bauer authored
-
- May 18, 2018
-
-
Martin Bauer authored
-
- May 14, 2018
-
-
Martin Bauer authored
-
Martin Bauer authored
-
Martin Bauer authored
-
- May 13, 2018
-
-
Martin Bauer authored
- support aligned load/stores - nontemporal stores - aligned memory allocation for arrays and temporary buffers
-
- May 11, 2018
-
-
Martin Bauer authored
- vectorization for loops with ranges that are not a multiple of vector width - vectorization for variable sized loops if special transformation replace_inner_stride_with_one is run
-
- Apr 27, 2018
-
-
Martin Bauer authored
-
- Apr 13, 2018
-
-
Martin Bauer authored
- removed warnings - added flake8 as CI target
-
- Apr 10, 2018
-
-
Martin Bauer authored
-
Martin Bauer authored
-
Martin Bauer authored
- test run again - notebooks not yet
-
Martin Bauer authored
-
- Mar 05, 2018
-
-
Martin Bauer authored
- in-kernel Neumann boundaries - flag-interface for boundary handling makes one flag field multiple boundary handlings possible - generator: support for bitwise logical operators
-
- Feb 08, 2018
-
-
Martin Bauer authored
- step class for LB phasefield generic enough to work with 3-phase and N-phase models - cahn hilliard can either be solved by LBM or by finite differences - 3 phase model can be solved with rho phase or without
-
- Dec 03, 2017
-
-
Martin Bauer authored
- loop cutting - simplification of conditionals inside loop
-
- Oct 17, 2017
-
-
Martin Bauer authored
- support for logical operators, and/or - both phase field kernels can be vectorized now
-
Martin Bauer authored
- phasefield phi sweep vectorizes successfully
-
- Oct 10, 2017
-
-
Martin Bauer authored
- activated vectorization for LBM kernels
-
Martin Bauer authored
- renaming because of clashes with types.py from other packages
-
Martin Bauer authored
- sympys piecewise defined functions are mapped to blend instructions - cast function is now a class - several bugfixes
-
- Oct 09, 2017
-
-
Martin Bauer authored
- first vectorization tests are running - type system: use memoized getTypeOfExpression - casts are done using sp.Function('cast') - C backend adapted for vectorization support - AST nodes can required optional headers
-
- Sep 20, 2017
-
-
Martin Bauer authored
-
- Jul 26, 2017
-
-
Martin Bauer authored
-
- Jul 21, 2017
-
-
Martin Bauer authored
-
- Jul 07, 2017
-
-
Martin Bauer authored
-
- Mar 24, 2017
-
-
Martin Bauer authored
- abstraction layer for selecting CUDA block and grid sizes - line based (was implemented before) - block based (new, more flexible) - new conditional (if/else) ast node, which is necessary for indexing schemes (guarding if)
-
- Mar 16, 2017
-
-
Martin Bauer authored
-
- Mar 14, 2017
-
-
Martin Bauer authored
- this extension is necessary for more generic boundary treatment - cells can now be structs, i.e. contain different data types - instead of having numeric index dimensions, one can use the index per cell to adress struct elements
-
- Mar 13, 2017
-
-
Martin Bauer authored
- use data type class consistently instead of strings (in TypedSymbol, Field and jit module) - new datatype class is based on numpy types with additional specifier information (const and restrict) - translation between data type class and other modules (numpy, ctypes)
-
- Mar 01, 2017
-
-
Martin Bauer authored
- windows support - automatic caching and creation of shared library with all generated kernels - restrict keyword and function prefixes are preprocessor macros now -> easier to generate one code for linux, cuda, windows
-