PTX: Add cuda::ptx:barrier_cluster_{arrive,wait}
#1366
Merged
cuda::ptx:barrier_cluster_{arrive,wait}
#1366