parallelLoopPatterns fails on the Intel FPGA emulator SYCL backend #2450
The test tries to run kernels with 128 blocks and 64×2²⁰ (67108864) threads per block.
This results in 8×2³⁰ (8589934592) threads in total, which causes at least two problems:
- the SYCL compiler assumes that the indices fit in an `int32_t` variable, and so are limited to `INT_MAX`;
- the test uses `uint32_t` indices, which overflow for values over 2³²-1.

The assumption of the compiler can be relaxed with the `-fno-sycl-id-queries-fit-in-int` flag, but I don't think we should do that by default in alpaka.

The examples could be fixed to use `uint64_t` (or plain `int64_t`) indices.

Or, the example could clamp the number of blocks or the number of threads per block so the product fits in `INT_MAX`.
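To make the numbers concrete, here is a minimal sketch of the arithmetic and of the clamping option. It is plain C++ and does not use alpaka or SYCL; the names `issueBlocks`, `issueThreadsPerBlock`, `issueTotalThreads` and `clampBlocks` are hypothetical and only serve the illustration. It assumes that clamping the number of blocks (rather than the threads per block) is acceptable for the test.

```cpp
#include <algorithm>
#include <cstdint>
#include <limits>

// Values quoted in this issue.
constexpr std::uint64_t issueBlocks = 128;
constexpr std::uint64_t issueThreadsPerBlock = std::uint64_t{64} << 20; // 64×2²⁰ = 67108864

// Computed with 64-bit indices, the product is exact: 2³³.
constexpr std::uint64_t issueTotalThreads = issueBlocks * issueThreadsPerBlock;
static_assert(issueTotalThreads == 8589934592ull, "128 × 64×2²⁰ = 8×2³⁰");

// The total does not fit in an int32_t ...
static_assert(
    issueTotalThreads > static_cast<std::uint64_t>(std::numeric_limits<std::int32_t>::max()),
    "total exceeds INT_MAX");

// ... and truncating it to a uint32_t wraps around (2³³ mod 2³² = 0).
static_assert(static_cast<std::uint32_t>(issueTotalThreads) == 0u, "uint32_t index wraps to 0");

// Hypothetical helper, not an alpaka API: returns the largest block count,
// not exceeding requestedBlocks, such that blocks * threadsPerBlock <= INT_MAX.
// Precondition: threadsPerBlock > 0.
constexpr std::uint64_t clampBlocks(std::uint64_t requestedBlocks, std::uint64_t threadsPerBlock)
{
    constexpr std::uint64_t limit = std::numeric_limits<std::int32_t>::max();
    return std::min(requestedBlocks, limit / threadsPerBlock);
}

static_assert(clampBlocks(issueBlocks, issueThreadsPerBlock) == 31, "128 blocks are clamped to 31");
```

With the values above, the block count would drop from 128 to 31, since 32 blocks of 64×2²⁰ threads already give 2³¹, one more than `INT_MAX`.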