- Mar 15, 2019
-
-
Martin Bauer authored
-
- Mar 07, 2019
-
-
Martin Bauer authored
-
- Feb 18, 2019
-
-
Martin Bauer authored
-
- Jan 23, 2019
-
-
Martin Bauer authored
- removed warnings from generated code - made generated code string deterministic, generating the same twice gives binary equally files now
-
- Dec 10, 2018
-
-
Martin Bauer authored
- bug was caused by change in stride parameter passing
-
- Dec 03, 2018
-
-
Martin Bauer authored
- substitute_array_accesses_with_constants - get_loop_hierarchy
-
- Nov 14, 2018
-
-
Martin Bauer authored
- was not used consistently before - symbol names are expected to be valid C identifiers - for complicated field names, the latex_name of field should be used
-
Martin Bauer authored
- small (length < 5) arrays with shape and stride information had to be memcpy'd to the GPU before every kernel call - instead of passing the information as arrays, the single elements are passed - leads to more function arguments, but simplifies GPU kernel calls -> changes in all backends required
-
Martin Bauer authored
- put all buffer related stuff into separate functions - should be functionally equivalent
-
- Oct 10, 2018
-
-
Martin Bauer authored
- bugfix in vector-typing of piecewise functions - cast_function is now a sympy atom - fixes problems with sympy > 1.1 - replace_inner_stride_with_one is a bit faster now
-
- Oct 02, 2018
-
-
Martin Bauer authored
-
- Jul 30, 2018
-
-
Martin Bauer authored
-
- Jun 20, 2018
-
-
Martin Bauer authored
-
- Jun 07, 2018
-
-
Martin Bauer authored
-
- May 28, 2018
-
-
Martin Bauer authored
- different approach in pystencils: absolute indexing
-
Martin Bauer authored
- extended pystencils - Visualization for list LBM - not yet working
-
- May 14, 2018
-
-
Martin Bauer authored
-
Martin Bauer authored
-
Martin Bauer authored
-
- May 13, 2018
-
-
Martin Bauer authored
- support aligned load/stores - nontemporal stores - aligned memory allocation for arrays and temporary buffers
-
- May 11, 2018
-
-
Martin Bauer authored
- vectorization for loops with ranges that are not a multiple of vector width - vectorization for variable sized loops if special transformation replace_inner_stride_with_one is run
-
- May 07, 2018
-
-
Martin Bauer authored
- move_constants_before_loop: in the rare case that a symbol with same name exists in move target - the assignment is not moved previously an exception was raised in this case - consistent naming of intermediate base pointers
-
- Apr 30, 2018
-
-
Martin Bauer authored
-
- Apr 28, 2018
-
-
Martin Bauer authored
-
- Apr 27, 2018
-
-
Martin Bauer authored
-
Martin Bauer authored
- previously moved only SympyAssignments - relevant for custom boundary nodes, which are now pulled before loops
-
Martin Bauer authored
- SSA form is checked - loop independence condition is checked - bug fix in Field.create_generic when using index_shape
-
- Apr 18, 2018
-
-
Martin Bauer authored
- new kernel creation function for kernels that write to staggered fields - simplification of conditionals with integers uses ISL library now via islpy bindings
-
Martin Bauer authored
-
- Apr 13, 2018
-
-
Martin Bauer authored
- removed warnings - added flake8 as CI target
-
- Apr 10, 2018
-
-
Martin Bauer authored
-
Martin Bauer authored
-
Martin Bauer authored
-
Martin Bauer authored
- test run again - notebooks not yet
-
Martin Bauer authored
-
Martin Bauer authored
- assignment collection - sympyextensions
-
Martin Bauer authored
- Previously sympy.Eq was used to represent assignments. However Eq represents equality not assignment. This means that sometimes sympy "simplified" an equation like a = a to True, -> replaced sp.Eq by pystencils.Assignment everywhere - renamed EquationCollection to AssignmentCollection
-
- Mar 06, 2018
-
-
Martin Bauer authored
-
- Jan 31, 2018
-
-
Martin Bauer authored
-
- Jan 19, 2018
-
-
Concept: Generate code involving the (un)packing of fields (from)to linear (1D) arrays, i.e. (de)serialization of the field values for buffered communication. A linear index is generated for the buffer, by inferring the strides and variables of the loops over fields in the AST. In the CPU, this information is obtained through the makeLoopOverDomain function, in pystencils/transformations/transformations.py. On CUDA, the strides of the fields (excluding buffers) are combined with the indexing variables to infer the indexing of the buffer. What is supported: - code generation for both CPU and GPU - (un)packing of fields with all the memory layouts supported by pystencils - (un)packing slices of fields (from)into the buffer - (un)packing subsets of cell values from the fields (from)into the buffer Limitations: - assumes that only one buffer and one field are being operated within each kernel, however multiple equations involving the buffer and the field are supported. - (un)packing multiple cell values (from)into the buffer is supported, however it is limited to the fields with indexDimensions=1. The same applies to (un)packing subset of cell values of each cell. Changes in this commit: - add the FieldType enumeration to pystencils/field.py, to mark fields of various types. This is replaces and is a generalization of the isIndexedField boolean flag of the Field class. For now, the types supported are: generic, indexed and buffer fields. - add the fieldType property to the Field class, which indicates the type of the field. Modifications were also performed to the member functions of the Field class to add this property. - add resolveBufferAccesses function, which replaces the fields marked as buffers with the actual field access in the AST traversal. Miscelaneous changes: - add blockDim and gridDim variables as CUDA indexing variables.
-