Skip to content
Snippets Groups Projects
  1. Jan 09, 2019
  2. Nov 14, 2018
    • Martin Bauer's avatar
      Pass field information (shape,stride) as single elements instead of arr · 7a94740d
      Martin Bauer authored
      - small (length < 5) arrays with shape and stride information had to be
        memcpy'd to the GPU before every kernel call
      - instead of passing the information as arrays, the single elements are
        passed
      - leads to more function arguments, but simplifies GPU kernel calls
      
      -> changes in all backends required
      7a94740d
  3. Oct 26, 2018
  4. Oct 19, 2018
  5. Oct 10, 2018
  6. Oct 02, 2018
  7. May 14, 2018
  8. May 13, 2018
    • Martin Bauer's avatar
      Improved Vectorization · 501b2d7e
      Martin Bauer authored
      - support aligned load/stores
      - nontemporal stores
      - aligned memory allocation for arrays and temporary buffers
      501b2d7e
  9. May 11, 2018
    • Martin Bauer's avatar
      Generalized vectorization · 57a3c27e
      Martin Bauer authored
      - vectorization for loops with ranges that are not a multiple of vector width
      - vectorization for variable sized loops if special transformation
        replace_inner_stride_with_one is run
      57a3c27e
  10. Apr 27, 2018
  11. Apr 18, 2018
  12. Apr 10, 2018
  13. Jan 11, 2018
    • Martin Bauer's avatar
      pystencils cleanup · c598dc78
      Martin Bauer authored
      - single function to create kernel for specified target
      - data type creation from string - reuse numpy functionality
      - bugfixes in dot display
      c598dc78
  14. Nov 02, 2017
  15. Oct 23, 2017
  16. Oct 17, 2017
  17. Oct 11, 2017
  18. Oct 10, 2017
  19. Oct 09, 2017
    • Martin Bauer's avatar
      Vectorization & Type system overhaul · ea847bc5
      Martin Bauer authored
      - first vectorization tests are running
      - type system: use memoized getTypeOfExpression
      - casts are done using sp.Function('cast')
      - C backend adapted for vectorization support
      - AST nodes can required optional headers
      ea847bc5
  20. Sep 26, 2017
  21. Aug 11, 2017
  22. Jul 21, 2017
  23. Jul 01, 2017
    • Martin Bauer's avatar
      Kerncraft coupling · 3b4deebe
      Martin Bauer authored
      - pystencils can create now a non-compilable kernel that can be
        analyzed by kerncraft
      3b4deebe
  24. Jun 09, 2017
  25. Mar 20, 2017
  26. Mar 19, 2017
  27. Mar 16, 2017
  28. Mar 14, 2017
    • Martin Bauer's avatar
      pystencils: fields can now contain structs · ec3faf51
      Martin Bauer authored
      - this extension is necessary for more generic boundary treatment
      - cells can now be structs, i.e. contain different data types
      - instead of having numeric index dimensions, one can use the index per cell to adress struct elements
      ec3faf51
  29. Mar 13, 2017
    • Martin Bauer's avatar
      pystencils: Cleaned up type system · c8b455fe
      Martin Bauer authored
      - use data type class consistently instead of strings (in TypedSymbol, Field and jit module)
      - new datatype class is based on numpy types with additional specifier information (const and restrict)
      - translation between data type class and other modules (numpy, ctypes)
      c8b455fe
  30. Mar 10, 2017
  31. Mar 02, 2017
  32. Mar 01, 2017
    • Martin Bauer's avatar
      pystencils: cpujit · dd17cd30
      Martin Bauer authored
      - windows support
      - automatic caching and creation of shared library with all generated kernels
      - restrict keyword and function prefixes are preprocessor macros now -> easier to generate one code for linux, cuda, windows
      dd17cd30
  33. Feb 23, 2017