pystencils.codegen.config.GpuOptions.assume_warp_aligned_block_size

pystencils.codegen.config.GpuOptions.assume_warp_aligned_block_size#

GpuOptions.assume_warp_aligned_block_size: BasicOption[bool] = None#

Specifies whether block sizes are divisible by the hardware’s warp size.

If set to True, the code generator can employ optimizations that require this assumption, e.g. warp-level reductions. The pystencils Cupy runtime also checks if user-provided block sizes fulfill this criterion.