-
Notifications
You must be signed in to change notification settings - Fork 188
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Specialize cuda::std::numeric_limits
for FP8 types
#3478
Conversation
/ok to test |
Also ensure that we actually can enable FP8 due to FP16 and BF16 requirements Co-authored-by: Michael Schellenberger Costa <[email protected]>
* moves emptykernel to detail ns * second batch * third batch * fourth batch * fixes cuda parallel * concatenates nested namespaces
I would like to backport this PR to CCCL 2.8, so I can also backport the deprecation of |
/ok to test |
/ok to test |
🟩 CI finished in 2h 23m: Pass: 100%/135 | Total: 3d 05h | Avg: 34m 30s | Max: 1h 15m | Hits: 277%/23291
|
Project | |
---|---|
CCCL Infrastructure | |
+/- | libcu++ |
CUB | |
Thrust | |
CUDA Experimental | |
python | |
CCCL C Parallel Library | |
Catch2Helper |
Modifications in project or dependencies?
Project | |
---|---|
CCCL Infrastructure | |
+/- | libcu++ |
+/- | CUB |
+/- | Thrust |
+/- | CUDA Experimental |
+/- | python |
+/- | CCCL C Parallel Library |
+/- | Catch2Helper |
🏃 Runner counts (total jobs: 135)
# | Runner |
---|---|
92 | linux-amd64-cpu16 |
17 | linux-amd64-gpu-v100-latest-1 |
15 | windows-amd64-cpu16 |
10 | linux-arm64-cpu16 |
1 | linux-amd64-gpu-h100-latest-1-testing |
Thanks a lot for making that happen! |
Backport failed for Please cherry-pick the changes locally. git fetch origin branch/2.8.x
git worktree add -d .worktree/backport-3478-to-branch/2.8.x origin/branch/2.8.x
cd .worktree/backport-3478-to-branch/2.8.x
git checkout -b backport-3478-to-branch/2.8.x
ancref=$(git merge-base d47c1c1681546c67d3788a80d6458e2d910f2ba1 74d86dc7862543f7881af0905c138a097b2ba506)
git cherry-pick -x $ancref..74d86dc7862543f7881af0905c138a097b2ba506 |
Co-authored-by: David Bayer <[email protected]>
Partially implements #525.