pystencils.codegen.config.GpuIndexingScheme.Blockwise4D

pystencils.codegen.config.GpuIndexingScheme.Blockwise4D#

GpuIndexingScheme.Blockwise4D = 2#

On a 3D grid of 1D blocks, map the fastest coordinate onto the intra-block thread index, and slower coordinates onto the block index.

Supports up to four-dimensional iteration spaces. Using this indexing scheme, the iteration counters of up to four dimensions are assigned like follows, from slowest to fastest:

ctr_3 = blockIdx.z;
ctr_2 = blockIdx.y;
ctr_1 = blockIdx.x;
ctr_0 = threadIDx.x;