  1. Feb 06, 2018
  2. Jan 31, 2018
  3. Jan 19, 2018
    • Code generation for field serialization into buffers · 979ee93b
      João Victor Tozatti Risso authored and Martin Bauer committed
      Concept: generate code for packing fields into, and unpacking them from,
      linear (1D) arrays, i.e. (de)serialization of the field values for buffered
      communication.
      
      A linear index into the buffer is generated by inferring the strides and
      loop variables of the loops over fields in the AST. On the CPU, this
      information is obtained through the makeLoopOverDomain function in
      pystencils/transformations/transformations.py. On CUDA, the strides of
      the fields (excluding buffers) are combined with the indexing variables to infer
      the buffer index.
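      The linear buffer index described above can be illustrated with a small
      pure-Python sketch. It is not the generated code and not the pystencils
      API: the names buffer_index, pack and unpack are hypothetical, and the
      sketch assumes a 2D field stored as nested lists, with the buffer strides
      derived from the loop extents so that the innermost loop runs fastest.

```python
def buffer_index(loop_counters, loop_extents):
    """Linear index into the 1D buffer for one iteration of a loop nest.

    loop_counters: zero-based counters of the loops, outermost first.
    loop_extents:  number of iterations of each loop, outermost first.
    (Hypothetical helper; the innermost loop is assumed to have stride 1.)
    """
    linear = 0
    for counter, extent in zip(loop_counters, loop_extents):
        linear = linear * extent + counter
    return linear


def pack(field, y_range, x_range):
    """Copy the slice field[y_range][x_range] into a new linear buffer."""
    extents = (len(y_range), len(x_range))
    buffer = [None] * (extents[0] * extents[1])
    for cy, y in enumerate(y_range):
        for cx, x in enumerate(x_range):
            buffer[buffer_index((cy, cx), extents)] = field[y][x]
    return buffer


def unpack(buffer, field, y_range, x_range):
    """Write the buffer back into the slice field[y_range][x_range]."""
    extents = (len(y_range), len(x_range))
    for cy, y in enumerate(y_range):
        for cx, x in enumerate(x_range):
            field[y][x] = buffer[buffer_index((cy, cx), extents)]
```

      Because pack and unpack share the same index function, a slice packed on
      one side can be unpacked into a slice of the same shape on the other side
      without either side knowing the memory layout of the other field.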
      
      What is supported:
      
      - code generation for both CPU and GPU
      - (un)packing of fields with all the memory layouts supported by
      pystencils
      - (un)packing slices of fields (from)into the buffer
      - (un)packing subsets of cell values from the fields (from)into the buffer
      
      Limitations:
      
      - assumes that only one buffer and one field are operated on within
      each kernel; multiple equations involving the buffer and the
      field are supported, however.
      
      - (un)packing multiple cell values (from)into the buffer is supported,
      but only for fields with indexDimensions=1. The same
      applies to (un)packing a subset of the values of each cell.
      
      Changes in this commit:
      
      - add the FieldType enumeration to pystencils/field.py, to mark fields
      of various types. This replaces and generalizes the
      isIndexedField boolean flag of the Field class. For now, the types
      supported are: generic, indexed and buffer fields.
      
      - add the fieldType property to the Field class, which indicates the
      type of the field. The member functions of the Field class were also
      modified to take this property into account.
      
      - add the resolveBufferAccesses function, which replaces the fields marked
      as buffers with the actual field access during the AST traversal.
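      How an enumeration can generalize a boolean flag such as isIndexedField
      is sketched below. This is an illustration only, not the code in
      pystencils/field.py: the member names, the SketchField class and its
      constructor signature are assumptions made for the example.

```python
from enum import Enum


class FieldType(Enum):
    # The three kinds named in the commit message; the member names and
    # values are assumptions for this sketch.
    GENERIC = 0
    INDEXED = 1
    BUFFER = 2


class SketchField:
    """Hypothetical stand-in for the Field class, reduced to the type tag."""

    def __init__(self, name, fieldType=FieldType.GENERIC):
        self.name = name
        self._fieldType = fieldType

    @property
    def fieldType(self):
        return self._fieldType

    @property
    def isIndexedField(self):
        # The old boolean flag becomes one special case of the enumeration.
        return self._fieldType is FieldType.INDEXED


def buffer_fields(fields):
    """Select the fields a buffer-resolving pass would have to treat."""
    return [f for f in fields if f.fieldType is FieldType.BUFFER]
```

      A boolean can only distinguish two kinds of field; the enumeration leaves
      room for further kinds without adding one flag per kind.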
      
      Miscellaneous changes:
      
      - add blockDim and gridDim variables as CUDA indexing variables.
  4. Jan 11, 2018
  5. Dec 11, 2017
  6. Dec 02, 2017
  7. Oct 10, 2017
  8. Oct 09, 2017
    • Vectorization & Type system overhaul · ea847bc5
      Martin Bauer authored
      - first vectorization tests are running
      - type system: use memoized getTypeOfExpression
      - casts are done using sp.Function('cast')
      - C backend adapted for vectorization support
      - AST nodes can require optional headers
  9. Sep 26, 2017
  10. Jul 21, 2017
  11. Jul 01, 2017
    • Kerncraft coupling · 3b4deebe
      Martin Bauer authored
      - pystencils can now create a non-compilable kernel that can be
        analyzed by kerncraft
  12. May 05, 2017
  13. Apr 11, 2017
  14. Apr 06, 2017
  15. Mar 30, 2017
  16. Mar 29, 2017
  17. Mar 24, 2017
  18. Mar 01, 2017
    • pystencils: cpujit · dd17cd30
      Martin Bauer authored
      - Windows support
      - automatic caching and creation of a shared library with all generated kernels
      - the restrict keyword and function prefixes are now preprocessor macros, which makes it easier to generate one code for Linux, CUDA and Windows
  19. Feb 21, 2017
  20. Feb 09, 2017
  21. Dec 08, 2016
  22. Nov 21, 2016
  23. Nov 17, 2016
  24. Nov 11, 2016
    • Changed symbolsRead/symbolsDefined semantics · b207d071
      Martin Bauer authored
      Problem in the moveConstantBeforeLoops transformation:
      
      --> a should end up here
      {
      for() {
        const int a = 5;
      }
      for() {
        const int a = 5;
      }
      }
      
      The "a" of the lower loop was not moved up, because it could not be moved across the first loop (which is wrong).
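      The intended behaviour can be sketched with a toy model. This is not the
      actual moveConstantBeforeLoops implementation: the function
      hoist_constants and its data model (each loop body as a list of
      (symbol, value) constant definitions) are assumptions made for
      illustration. The point is that an identical definition in an earlier
      loop should not block hoisting the same definition from a later loop.

```python
def hoist_constants(loops):
    """Move constant definitions out of a sequence of sibling loops.

    loops: list of loop bodies, each a list of (symbol, value) pairs that
           stand for 'const int symbol = value;' (hypothetical model).
    Returns (hoisted, remaining): the definitions placed before the loops
    and the loop bodies that are left afterwards.
    """
    hoisted = {}
    remaining = []
    for body in loops:
        kept = []
        for symbol, value in body:
            if symbol not in hoisted:
                # First definition of this symbol: move it before the loops.
                hoisted[symbol] = value
            elif hoisted[symbol] != value:
                # A different value for the same name cannot be merged,
                # so this definition stays inside its loop.
                kept.append((symbol, value))
            # An identical definition is already covered by the hoisted one.
        remaining.append(kept)
    return hoisted, remaining
```

      For the example in the commit message, both loops define a = 5, so a
      single hoisted definition serves both and neither loop keeps its own
      copy.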
  25. Nov 09, 2016
  26. Nov 06, 2016
  27. Nov 03, 2016