Concept: Generate code for the (un)packing of fields (from)to linear (1D) arrays, i.e. (de)serialization of the field values for buffered communication. A linear index into the buffer is generated by inferring the strides and variables of the loops over fields in the AST. On the CPU, this information is obtained through the makeLoopOverDomain function in pystencils/transformations/transformations.py. On CUDA, the strides of the fields (excluding buffers) are combined with the indexing variables to infer the indexing of the buffer.

What is supported:
- code generation for both CPU and GPU
- (un)packing of fields with all the memory layouts supported by pystencils
- (un)packing slices of fields (from)into the buffer
- (un)packing subsets of cell values from the fields (from)into the buffer

Limitations:
- assumes that only one buffer and one field are operated on within each kernel, however multiple equations involving the...
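The linear buffer indexing described in the concept can be illustrated with a minimal NumPy sketch. This is not the generated pystencils code and does not use the pystencils API; the helper names `buffer_strides`, `pack` and `unpack` are hypothetical, and the sketch assumes a 2D field with the cell values stored in the last axis. The buffer strides are derived only from the loop extents, so the result does not depend on the memory layout of the field.

```python
import numpy as np


def buffer_strides(loop_extents, values_per_cell):
    """Strides of the linear buffer, derived from the loop extents.

    Hypothetical helper: the innermost loop advances by the number of
    values stored per cell, each outer loop by the size of the loops
    nested inside it.
    """
    strides = []
    stride = values_per_cell
    for extent in reversed(loop_extents):
        strides.append(stride)
        stride *= extent
    return tuple(reversed(strides))


def pack(field, slices, buffer):
    """Copy the selected slice of `field` into the 1D `buffer`."""
    sub = field[slices]  # a view, so any memory layout is accepted
    nx, ny, nv = sub.shape
    sx, sy = buffer_strides((nx, ny), nv)
    for x in range(nx):
        for y in range(ny):
            base = x * sx + y * sy  # linear index of the cell in the buffer
            for v in range(nv):
                buffer[base + v] = sub[x, y, v]


def unpack(buffer, field, slices):
    """Copy the 1D `buffer` back into the selected slice of `field`."""
    sub = field[slices]  # a view, so writes land in `field`
    nx, ny, nv = sub.shape
    sx, sy = buffer_strides((nx, ny), nv)
    for x in range(nx):
        for y in range(ny):
            base = x * sx + y * sy
            for v in range(nv):
                sub[x, y, v] = buffer[base + v]


# Usage: pack a strided slice of a Fortran-ordered field, then unpack it.
src = np.asfortranarray(np.arange(4 * 5 * 2, dtype=float).reshape(4, 5, 2))
dst = np.zeros_like(src)
region = (slice(1, 3), slice(0, 5, 2))
buf = np.empty(src[region].size)
pack(src, region, buf)
unpack(buf, dst, region)
```

Packing only a subset of the cell values would correspond to looping over a subset of the last axis and using the size of that subset as `values_per_cell`.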
979ee93b