  1. Mar 15, 2019
  2. Mar 07, 2019
  3. Feb 18, 2019
  4. Jan 23, 2019
    • waLBerla codegeneration improved · 21fcd7fc
      Martin Bauer authored
      - removed warnings from generated code
      - made the generated code string deterministic: generating the same code
        twice now gives binary-identical files
      21fcd7fc
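The commit message does not say what caused the non-determinism. As a minimal sketch, assuming the cause was iteration over unordered collections, one common remedy is to sort every set before emitting text. The names `generate_header` and `digest` are hypothetical and are not part of pystencils or waLBerla.

```python
import hashlib


def generate_header(includes, parameters):
    """Emit a tiny C header; hypothetical stand-in for a code generator.

    Both inputs may be sets, whose iteration order is not guaranteed to be
    stable between runs, so they are sorted before any text is produced.
    """
    lines = [f'#include "{name}"' for name in sorted(includes)]
    lines.append("")
    args = ", ".join(f"double {p}" for p in sorted(parameters))
    lines.append(f"void kernel({args});")
    return "\n".join(lines) + "\n"


def digest(text):
    """SHA-256 of the generated text, to compare two generator runs."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


first = generate_header({"b.h", "a.h"}, {"omega", "dt"})
second = generate_header({"a.h", "b.h"}, {"dt", "omega"})
print(digest(first) == digest(second))
```

Comparing digests of two runs is a cheap way to check that regenerated files stay byte-identical, which keeps build systems from recompiling unchanged kernels.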
  5. Dec 10, 2018
  6. Dec 03, 2018
  7. Nov 14, 2018
  8. Oct 10, 2018
  9. Oct 02, 2018
  10. Jul 30, 2018
  11. Jun 20, 2018
  12. Jun 07, 2018
  13. May 28, 2018
  14. May 14, 2018
  15. May 13, 2018
    • Improved Vectorization · 501b2d7e
      Martin Bauer authored
      - support for aligned loads/stores
      - nontemporal stores
      - aligned memory allocation for arrays and temporary buffers
      501b2d7e
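Aligned loads and stores require the data to start at an address that is a multiple of the vector register size. A minimal sketch of one common way to get such an address is to over-allocate and skip a few leading bytes. The helper names `allocate_aligned` and `is_aligned` are hypothetical, and this is not necessarily how the commit implements aligned allocation.

```python
import ctypes


def is_aligned(address, alignment):
    """True if address is a multiple of alignment (both in bytes)."""
    return address % alignment == 0


def allocate_aligned(num_bytes, alignment):
    """Return (raw_buffer, offset) so that raw_buffer's address plus offset
    is aligned and at least num_bytes usable bytes follow it.

    Hypothetical helper: over-allocates by `alignment` bytes and computes
    how many leading bytes must be skipped.
    """
    if alignment <= 0:
        raise ValueError("alignment must be positive")
    raw = ctypes.create_string_buffer(num_bytes + alignment)
    offset = (-ctypes.addressof(raw)) % alignment
    return raw, offset


raw, offset = allocate_aligned(256, 32)  # 32 bytes matches an AVX register
print(is_aligned(ctypes.addressof(raw) + offset, 32))
```

The raw buffer has to stay alive for as long as the aligned region is in use, which is why the sketch returns both the buffer and the offset.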
  16. May 11, 2018
    • Generalized vectorization · 57a3c27e
      Martin Bauer authored
      - vectorization for loops with ranges that are not a multiple of vector width
      - vectorization for variable-sized loops if the special transformation
        replace_inner_stride_with_one is run
      57a3c27e
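A loop whose range is not a multiple of the vector width is commonly handled by splitting it into a vectorized main part and a scalar remainder loop. The sketch below illustrates that idea in plain Python, assuming this is the strategy meant here; `split_loop_range` and `axpy` are hypothetical names, and the list slices only stand in for SIMD instructions.

```python
def split_loop_range(start, stop, vector_width):
    """Split [start, stop) into a vectorizable part and a scalar tail.

    The first range steps by vector_width and covers the largest prefix
    whose length is a multiple of vector_width; the second range covers
    the remaining iterations one by one.
    """
    length = max(stop - start, 0)
    main_stop = start + (length // vector_width) * vector_width
    return range(start, main_stop, vector_width), range(main_stop, stop)


def axpy(a, x, y, vector_width=4):
    """Compute a*x + y element-wise using the split loop."""
    out = list(y)
    vector_part, tail = split_loop_range(0, len(x), vector_width)
    for i in vector_part:
        # One iteration processes vector_width elements at once.
        out[i:i + vector_width] = [
            a * xv + yv
            for xv, yv in zip(x[i:i + vector_width], y[i:i + vector_width])
        ]
    for i in tail:
        # Scalar remainder loop for the leftover iterations.
        out[i] = a * x[i] + y[i]
    return out


print(axpy(2, list(range(10)), [1] * 10))
```

With ten elements and a vector width of four, the main part covers indices 0 to 7 in two steps and the tail handles indices 8 and 9.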
  17. May 07, 2018
    • Fixes in field index resolve · 755f168c
      Martin Bauer authored
      - move_constants_before_loop: in the rare case that a symbol with the
        same name already exists at the move target, the assignment is not
        moved; previously an exception was raised in this case
      - consistent naming of intermediate base pointers
      755f168c
  18. Apr 30, 2018
  19. Apr 28, 2018
  20. Apr 27, 2018
  21. Apr 18, 2018
  22. Apr 13, 2018
  23. Apr 10, 2018
  24. Mar 06, 2018
  25. Jan 31, 2018
  26. Jan 19, 2018
    • Code generation for field serialization into buffers · 979ee93b
      João Victor Tozatti Risso authored and Martin Bauer committed
      Concept: Generate code for packing fields into, and unpacking them from,
      linear (1D) arrays, i.e. (de)serialization of the field values for buffered
      communication.
      
      A linear index is generated for the buffer by inferring the strides and
      variables of the loops over fields in the AST. On the CPU, this information is
      obtained through the makeLoopOverDomain function in
      pystencils/transformations/transformations.py. On CUDA, the strides of
      the fields (excluding buffers) are combined with the indexing variables to infer
      the indexing of the buffer.
      
      What is supported:
          - code generation for both CPU and GPU
          - (un)packing of fields with all the memory layouts supported by
            pystencils
          - (un)packing slices of fields into/from the buffer
          - (un)packing subsets of cell values of the fields into/from the buffer
      
      Limitations:
      
      - assumes that only one buffer and one field are operated on within
      each kernel; however, multiple equations involving the buffer and the
      field are supported.
      
      - (un)packing multiple cell values into/from the buffer is supported;
      however, it is limited to fields with indexDimensions=1. The same
      applies to (un)packing a subset of the values of each cell.
      
      Changes in this commit:
      
      - add the FieldType enumeration to pystencils/field.py, to mark fields
      of various types. This replaces and generalizes the
      isIndexedField boolean flag of the Field class. For now, the types
      supported are: generic, indexed and buffer fields.
      
      - add the fieldType property to the Field class, which indicates the
      type of the field. The member functions of the Field class were also
      modified to account for this property.
      
      - add the resolveBufferAccesses function, which replaces the fields marked
      as buffers with the actual field access during the AST traversal.
      
      Miscellaneous changes:
      
      - add blockDim and gridDim variables as CUDA indexing variables.
      979ee93b
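The buffer indexing described in this commit can be illustrated with a small, library-free sketch. It assumes the buffer index is the row-major linearization of the loop counters over the packed region, while the field is addressed through its own strides. All function names below are hypothetical and are not the pystencils API.

```python
from itertools import product
from math import prod


def linear_buffer_index(counters, extents):
    """Row-major linear index of the loop counters within the loop nest."""
    index = 0
    for counter, extent in zip(counters, extents):
        index = index * extent + counter
    return index


def field_offset(counters, strides, region):
    """Offset into the flat field storage, using the field's own strides."""
    return sum((start + c) * s
               for (start, _stop), c, s in zip(region, counters, strides))


def pack(field_data, strides, region):
    """Copy the cells of region (list of (start, stop) per axis) into a buffer."""
    extents = [stop - start for start, stop in region]
    buffer = [None] * prod(extents)
    for counters in product(*(range(e) for e in extents)):
        buffer[linear_buffer_index(counters, extents)] = \
            field_data[field_offset(counters, strides, region)]
    return buffer


def unpack(buffer, field_data, strides, region):
    """Inverse of pack: write buffer values back into the field region."""
    extents = [stop - start for start, stop in region]
    for counters in product(*(range(e) for e in extents)):
        field_data[field_offset(counters, strides, region)] = \
            buffer[linear_buffer_index(counters, extents)]


# A 4x3 field stored row-major (strides 3 and 1); pack rows 1..2, columns 0..1.
row_major = list(range(12))
print(pack(row_major, (3, 1), [(1, 3), (0, 2)]))
```

Because only the field side uses strides, the same slice yields the same buffer contents regardless of the field's memory layout, which is what lets a packed buffer be unpacked into a field with a different layout.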