Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Deprecate cub::Trait::CATEGORY|PRIMITIVE|NULL_TYPE #3689

Merged
merged 3 commits into from
Feb 5, 2025

Conversation

bernhardmgruber
Copy link
Contributor

@bernhardmgruber bernhardmgruber commented Feb 5, 2025

This PR is pulled out of #3384 and deprecates those parts of cub::Traits which have replacements in libcu++ traits and limits.

cub::Traits<T>::PRIMITIVE does not have a direct replacement in libcu++, but a code search indicates that it is not widely used outside CUB. I added an internal trait, that we can expose again if users need it.

  • No SASS changes for the radix sort benchmarks

@bernhardmgruber bernhardmgruber marked this pull request as ready for review February 5, 2025 10:34
@bernhardmgruber bernhardmgruber requested a review from a team as a code owner February 5, 2025 10:34
@NVIDIA NVIDIA deleted a comment from copy-pr-bot bot Feb 5, 2025
@bernhardmgruber bernhardmgruber requested a review from a team as a code owner February 5, 2025 12:00
Copy link
Contributor

@elstehle elstehle left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good. Would you mind confirming that nothing slipped through the cracks by comparing sass for the radix sort benchmarks?
I can also take this on, just let me know.

Copy link
Contributor

github-actions bot commented Feb 5, 2025

🟨 CI finished in 1h 45m: Pass: 98%/90 | Total: 2d 18h | Avg: 44m 14s | Max: 1h 18m | Hits: 174%/13398
  • 🟨 cub: Pass: 97%/44 | Total: 1d 18h | Avg: 57m 23s | Max: 1h 18m | Hits: 88%/4168

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/42  | Total:  1d 15h | Avg: 57m 05s | Max:  1h 18m | Hits:  88%/4168  
      🟩 arm64              Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 04m
    🔍 ctk: 12.8 🔍
      🟩 12.0               Pass: 100%/5   | Total:  5h 22m | Avg:  1h 04m | Max:  1h 05m | Hits:  90%/1042  
      🟩 12.5               Pass: 100%/2   | Total:  2h 34m | Avg:  1h 17m | Max:  1h 18m
      🔍 12.8               Pass:  97%/37  | Total:  1d 10h | Avg: 55m 21s | Max:  1h 17m | Hits:  87%/3126  
    🔍 cudacxx: nvcc12.8 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 09m
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 22m | Avg:  1h 04m | Max:  1h 05m | Hits:  90%/1042  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 34m | Avg:  1h 17m | Max:  1h 18m
      🔍 nvcc12.8           Pass:  97%/35  | Total:  1d 07h | Avg: 54m 46s | Max:  1h 17m | Hits:  87%/3126  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 09m
      🔍 nvcc               Pass:  97%/42  | Total:  1d 15h | Avg: 56m 59s | Max:  1h 18m | Hits:  88%/4168  
    🔍 cxx: GCC13 🔍
      🟩 Clang14            Pass: 100%/4   | Total:  4h 09m | Avg:  1h 02m | Max:  1h 05m
      🟩 Clang15            Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 04m
      🟩 Clang16            Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 06m
      🟩 Clang17            Pass: 100%/2   | Total:  1h 59m | Avg: 59m 44s | Max:  1h 01m
      🟩 Clang18            Pass: 100%/7   | Total:  6h 00m | Avg: 51m 26s | Max:  1h 09m
      🟩 GCC7               Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 04m
      🟩 GCC8               Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
      🟩 GCC9               Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 05m
      🟩 GCC10              Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 06m
      🟩 GCC11              Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 03m
      🟩 GCC12              Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 07m
      🔍 GCC13              Pass:  90%/10  | Total:  6h 44m | Avg: 40m 24s | Max:  1h 09m
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 22m | Avg:  1h 11m | Max:  1h 17m | Hits:  90%/2084  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 15m | Hits:  86%/2084  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 34m | Avg:  1h 17m | Max:  1h 18m
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/17  | Total: 16h 25m | Avg: 57m 57s | Max:  1h 09m
      🔍 GCC                Pass:  95%/21  | Total: 18h 13m | Avg: 52m 03s | Max:  1h 09m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 51m | Avg:  1h 12m | Max:  1h 17m | Hits:  88%/4168  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 34m | Avg:  1h 17m | Max:  1h 18m
    🔍 gpu: rtxa6000 🔍
      🟩 h100               Pass: 100%/2   | Total: 51m 47s | Avg: 25m 53s | Max: 26m 59s
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 12h | Avg:  1h 05m | Max:  1h 18m | Hits:  88%/4168  
      🔍 rtxa6000           Pass:  87%/8   | Total:  4h 16m | Avg: 32m 01s | Max:  1h 08m
    🚨 jobs: GraphCapture 🚨
      🟩 Build              Pass: 100%/37  | Total:  1d 15h | Avg:  1h 04m | Max:  1h 18m | Hits:  88%/4168  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 23m 48s | Avg: 23m 48s | Max: 23m 48s
      🔥 GraphCapture       Pass:   0%/1   | Total: 12m 13s | Avg: 12m 13s | Max: 12m 13s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 12m | Avg: 24m 05s | Max: 24m 48s
      🟩 TestGPU            Pass: 100%/2   | Total: 43m 49s | Avg: 21m 54s | Max: 23m 40s
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/20  | Total: 21h 24m | Avg:  1h 04m | Max:  1h 17m | Hits:  90%/3126  
      🔍 20                 Pass:  95%/24  | Total: 20h 39m | Avg: 51m 39s | Max:  1h 18m | Hits:  82%/1042  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 51m 47s | Avg: 25m 53s | Max: 26m 59s
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 09m | Avg:  1h 09m | Max:  1h 09m
    
  • 🟩 thrust: Pass: 100%/43 | Total: 23h 40m | Avg: 33m 02s | Max: 1h 02m | Hits: 212%/9230

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 39m 31s | Avg: 19m 45s | Max: 28m 34s
    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total: 22h 35m | Avg: 33m 04s | Max:  1h 02m | Hits: 212%/9230  
      🟩 arm64              Pass: 100%/2   | Total:  1h 04m | Avg: 32m 26s | Max: 33m 47s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 07m | Avg: 37m 32s | Max: 56m 16s | Hits: 174%/1846  
      🟩 12.5               Pass: 100%/2   | Total:  1h 43m | Avg: 51m 53s | Max: 52m 11s
      🟩 12.8               Pass: 100%/36  | Total: 18h 49m | Avg: 31m 22s | Max:  1h 02m | Hits: 222%/7384  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 52m 31s | Avg: 26m 15s | Max: 26m 55s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 07m | Avg: 37m 32s | Max: 56m 16s | Hits: 174%/1846  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 43m | Avg: 51m 53s | Max: 52m 11s
      🟩 nvcc12.8           Pass: 100%/34  | Total: 17h 56m | Avg: 31m 40s | Max:  1h 02m | Hits: 222%/7384  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 52m 31s | Avg: 26m 15s | Max: 26m 55s
      🟩 nvcc               Pass: 100%/41  | Total: 22h 48m | Avg: 33m 22s | Max:  1h 02m | Hits: 212%/9230  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 03m | Avg: 30m 51s | Max: 32m 58s
      🟩 Clang15            Pass: 100%/2   | Total:  1h 07m | Avg: 33m 45s | Max: 33m 59s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 03m | Avg: 31m 45s | Max: 32m 17s
      🟩 Clang17            Pass: 100%/2   | Total:  1h 02m | Avg: 31m 08s | Max: 32m 01s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 47m | Avg: 23m 58s | Max: 33m 47s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 10m | Avg: 35m 05s | Max: 35m 11s
      🟩 GCC8               Pass: 100%/1   | Total: 32m 31s | Avg: 32m 31s | Max: 32m 31s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 06m | Avg: 33m 14s | Max: 33m 20s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 02m | Avg: 31m 24s | Max: 31m 28s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 08m | Avg: 34m 17s | Max: 36m 04s
      🟩 GCC12              Pass: 100%/2   | Total:  1h 09m | Avg: 34m 49s | Max: 35m 03s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 16m | Avg: 24m 31s | Max: 37m 44s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 49m | Avg: 54m 50s | Max: 56m 16s | Hits: 174%/3692  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 36m | Avg: 52m 06s | Max:  1h 02m | Hits: 238%/5538  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 43m | Avg: 51m 53s | Max: 52m 11s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  8h 04m | Avg: 28m 30s | Max: 33m 59s
      🟩 GCC                Pass: 100%/19  | Total:  9h 26m | Avg: 29m 48s | Max: 37m 44s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 26m | Avg: 53m 12s | Max:  1h 02m | Hits: 212%/9230  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 43m | Avg: 51m 53s | Max: 52m 11s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/33  | Total: 19h 39m | Avg: 35m 45s | Max: 58m 33s | Hits: 174%/5538  
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 00m | Avg: 24m 04s | Max:  1h 02m | Hits: 270%/3692  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 22h 17m | Avg: 36m 08s | Max:  1h 02m | Hits: 174%/7384  
      🟩 TestCPU            Pass: 100%/3   | Total: 50m 06s | Avg: 16m 42s | Max: 34m 55s | Hits: 365%/1846  
      🟩 TestGPU            Pass: 100%/3   | Total: 33m 12s | Avg: 11m 04s | Max: 11m 47s
    🟩 sm
      🟩 90;90a;100         Pass: 100%/1   | Total: 37m 44s | Avg: 37m 44s | Max: 37m 44s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 12h 15m | Avg: 36m 47s | Max: 58m 33s | Hits: 174%/5538  
      🟩 20                 Pass: 100%/21  | Total: 10h 45m | Avg: 30m 43s | Max:  1h 02m | Hits: 270%/3692  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 7m 39s | Avg: 3m 49s | Max: 5m 18s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  7m 39s | Avg:  3m 49s | Max:  5m 18s
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total:  7m 39s | Avg:  3m 49s | Max:  5m 18s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total:  7m 39s | Avg:  3m 49s | Max:  5m 18s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  7m 39s | Avg:  3m 49s | Max:  5m 18s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  7m 39s | Avg:  3m 49s | Max:  5m 18s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  7m 39s | Avg:  3m 49s | Max:  5m 18s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total:  7m 39s | Avg:  3m 49s | Max:  5m 18s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 21s | Avg:  2m 21s | Max:  2m 21s
      🟩 Test               Pass: 100%/1   | Total:  5m 18s | Avg:  5m 18s | Max:  5m 18s
    
  • 🟩 python: Pass: 100%/1 | Total: 27m 53s | Avg: 27m 53s | Max: 27m 53s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 27m 53s | Avg: 27m 53s | Max: 27m 53s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 27m 53s | Avg: 27m 53s | Max: 27m 53s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 27m 53s | Avg: 27m 53s | Max: 27m 53s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 27m 53s | Avg: 27m 53s | Max: 27m 53s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 27m 53s | Avg: 27m 53s | Max: 27m 53s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 27m 53s | Avg: 27m 53s | Max: 27m 53s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 27m 53s | Avg: 27m 53s | Max: 27m 53s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 27m 53s | Avg: 27m 53s | Max: 27m 53s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
+/- Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 90)

# Runner
65 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1
1 linux-amd64-gpu-h100-latest-1

Copy link
Contributor

github-actions bot commented Feb 5, 2025

🟨 CI finished in 1h 42m: Pass: 97%/90 | Total: 2d 17h | Avg: 43m 25s | Max: 1h 18m | Hits: 174%/13398
  • 🟨 cub: Pass: 95%/44 | Total: 1d 16h | Avg: 55m 41s | Max: 1h 18m | Hits: 88%/4168

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  95%/42  | Total:  1d 14h | Avg: 55m 16s | Max:  1h 18m | Hits:  88%/4168  
      🟩 arm64              Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 04m
    🔍 ctk: 12.8 🔍
      🟩 12.0               Pass: 100%/5   | Total:  5h 09m | Avg:  1h 01m | Max:  1h 06m | Hits:  90%/1042  
      🟩 12.5               Pass: 100%/2   | Total:  2h 34m | Avg:  1h 17m | Max:  1h 18m
      🔍 12.8               Pass:  94%/37  | Total:  1d 09h | Avg: 53m 43s | Max:  1h 18m | Hits:  87%/3126  
    🔍 cudacxx: nvcc12.8 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 09m
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 09m | Avg:  1h 01m | Max:  1h 06m | Hits:  90%/1042  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 34m | Avg:  1h 17m | Max:  1h 18m
      🔍 nvcc12.8           Pass:  94%/35  | Total:  1d 06h | Avg: 53m 02s | Max:  1h 18m | Hits:  87%/3126  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 09m
      🔍 nvcc               Pass:  95%/42  | Total:  1d 14h | Avg: 55m 13s | Max:  1h 18m | Hits:  88%/4168  
    🔍 gpu: rtxa6000 🔍
      🟩 h100               Pass: 100%/2   | Total: 53m 30s | Avg: 26m 45s | Max: 27m 51s
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 12h | Avg:  1h 04m | Max:  1h 18m | Hits:  88%/4168  
      🔍 rtxa6000           Pass:  75%/8   | Total:  3h 39m | Avg: 27m 27s | Max:  1h 05m
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/20  | Total: 21h 04m | Avg:  1h 03m | Max:  1h 18m | Hits:  90%/3126  
      🔍 20                 Pass:  91%/24  | Total: 19h 46m | Avg: 49m 26s | Max:  1h 18m | Hits:  82%/1042  
    🟨 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 04m | Avg:  1h 01m | Max:  1h 02m
      🟩 Clang15            Pass: 100%/2   | Total:  1h 57m | Avg: 58m 38s | Max: 59m 25s
      🟩 Clang16            Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m
      🟩 Clang17            Pass: 100%/2   | Total:  1h 58m | Avg: 59m 16s | Max:  1h 00m
      🟨 Clang18            Pass:  85%/7   | Total:  5h 43m | Avg: 49m 02s | Max:  1h 09m
      🟩 GCC7               Pass: 100%/2   | Total:  1h 59m | Avg: 59m 52s | Max: 59m 59s
      🟩 GCC8               Pass: 100%/1   | Total: 58m 36s | Avg: 58m 36s | Max: 58m 36s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 56m | Avg: 58m 15s | Max: 58m 49s
      🟩 GCC10              Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m
      🟩 GCC11              Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m
      🟩 GCC12              Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 06m
      🟨 GCC13              Pass:  90%/10  | Total:  6h 24m | Avg: 38m 29s | Max:  1h 09m
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 22m | Avg:  1h 11m | Max:  1h 15m | Hits:  90%/2084  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 33m | Avg:  1h 16m | Max:  1h 18m | Hits:  86%/2084  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 34m | Avg:  1h 17m | Max:  1h 18m
    🟨 cxx_family
      🟨 Clang              Pass:  94%/17  | Total: 15h 44m | Avg: 55m 34s | Max:  1h 09m
      🟨 GCC                Pass:  95%/21  | Total: 17h 36m | Avg: 50m 18s | Max:  1h 09m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 55m | Avg:  1h 13m | Max:  1h 18m | Hits:  88%/4168  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 34m | Avg:  1h 17m | Max:  1h 18m
    🟨 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 14h | Avg:  1h 03m | Max:  1h 18m | Hits:  88%/4168  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 23m 02s | Avg: 23m 02s | Max: 23m 02s
      🟩 GraphCapture       Pass: 100%/1   | Total: 15m 36s | Avg: 15m 36s | Max: 15m 36s
      🟨 HostLaunch         Pass:  66%/3   | Total: 52m 46s | Avg: 17m 35s | Max: 25m 39s
      🟨 TestGPU            Pass:  50%/2   | Total: 26m 45s | Avg: 13m 22s | Max: 23m 48s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 53m 30s | Avg: 26m 45s | Max: 27m 51s
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 09m | Avg:  1h 09m | Max:  1h 09m
    
  • 🟩 thrust: Pass: 100%/43 | Total: 23h 36m | Avg: 32m 57s | Max: 1h 04m | Hits: 212%/9230

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 39m 29s | Avg: 19m 44s | Max: 28m 16s
    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total: 22h 36m | Avg: 33m 04s | Max:  1h 04m | Hits: 212%/9230  
      🟩 arm64              Pass: 100%/2   | Total:  1h 00m | Avg: 30m 22s | Max: 32m 14s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 05m | Avg: 37m 03s | Max: 53m 20s | Hits: 174%/1846  
      🟩 12.5               Pass: 100%/2   | Total:  1h 46m | Avg: 53m 02s | Max: 53m 49s
      🟩 12.8               Pass: 100%/36  | Total: 18h 45m | Avg: 31m 15s | Max:  1h 04m | Hits: 222%/7384  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 53m 23s | Avg: 26m 41s | Max: 28m 08s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 05m | Avg: 37m 03s | Max: 53m 20s | Hits: 174%/1846  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 46m | Avg: 53m 02s | Max: 53m 49s
      🟩 nvcc12.8           Pass: 100%/34  | Total: 17h 52m | Avg: 31m 32s | Max:  1h 04m | Hits: 222%/7384  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 53m 23s | Avg: 26m 41s | Max: 28m 08s
      🟩 nvcc               Pass: 100%/41  | Total: 22h 43m | Avg: 33m 15s | Max:  1h 04m | Hits: 212%/9230  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 07m | Avg: 31m 49s | Max: 33m 50s
      🟩 Clang15            Pass: 100%/2   | Total:  1h 04m | Avg: 32m 04s | Max: 33m 29s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 04m | Avg: 32m 23s | Max: 33m 22s
      🟩 Clang17            Pass: 100%/2   | Total:  1h 03m | Avg: 31m 49s | Max: 33m 28s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 41m | Avg: 23m 07s | Max: 31m 00s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 02m | Avg: 31m 26s | Max: 32m 31s
      🟩 GCC8               Pass: 100%/1   | Total: 31m 02s | Avg: 31m 02s | Max: 31m 02s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 09m | Avg: 34m 39s | Max: 34m 54s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 05m | Avg: 32m 40s | Max: 34m 22s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 06m | Avg: 33m 02s | Max: 33m 16s
      🟩 GCC12              Pass: 100%/2   | Total:  1h 11m | Avg: 35m 35s | Max: 36m 37s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 18m | Avg: 24m 49s | Max: 37m 18s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 52m | Avg: 56m 29s | Max: 59m 39s | Hits: 174%/3692  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 31m | Avg: 50m 35s | Max:  1h 04m | Hits: 238%/5538  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 46m | Avg: 53m 02s | Max: 53m 49s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  8h 01m | Avg: 28m 20s | Max: 33m 50s
      🟩 GCC                Pass: 100%/19  | Total:  9h 24m | Avg: 29m 42s | Max: 37m 18s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 24m | Avg: 52m 57s | Max:  1h 04m | Hits: 212%/9230  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 46m | Avg: 53m 02s | Max: 53m 49s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/33  | Total: 19h 32m | Avg: 35m 31s | Max: 59m 39s | Hits: 174%/5538  
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 04m | Avg: 24m 27s | Max:  1h 04m | Hits: 270%/3692  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 22h 12m | Avg: 36m 01s | Max:  1h 04m | Hits: 174%/7384  
      🟩 TestCPU            Pass: 100%/3   | Total: 50m 32s | Avg: 16m 50s | Max: 33m 51s | Hits: 365%/1846  
      🟩 TestGPU            Pass: 100%/3   | Total: 33m 25s | Avg: 11m 08s | Max: 11m 47s
    🟩 sm
      🟩 90;90a;100         Pass: 100%/1   | Total: 35m 27s | Avg: 35m 27s | Max: 35m 27s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 12h 10m | Avg: 36m 31s | Max: 59m 39s | Hits: 174%/5538  
      🟩 20                 Pass: 100%/21  | Total: 10h 46m | Avg: 30m 48s | Max:  1h 04m | Hits: 270%/3692  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 7m 35s | Avg: 3m 47s | Max: 5m 17s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  7m 35s | Avg:  3m 47s | Max:  5m 17s
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total:  7m 35s | Avg:  3m 47s | Max:  5m 17s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total:  7m 35s | Avg:  3m 47s | Max:  5m 17s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  7m 35s | Avg:  3m 47s | Max:  5m 17s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  7m 35s | Avg:  3m 47s | Max:  5m 17s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  7m 35s | Avg:  3m 47s | Max:  5m 17s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total:  7m 35s | Avg:  3m 47s | Max:  5m 17s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 18s | Avg:  2m 18s | Max:  2m 18s
      🟩 Test               Pass: 100%/1   | Total:  5m 17s | Avg:  5m 17s | Max:  5m 17s
    
  • 🟩 python: Pass: 100%/1 | Total: 32m 44s | Avg: 32m 44s | Max: 32m 44s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 32m 44s | Avg: 32m 44s | Max: 32m 44s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 32m 44s | Avg: 32m 44s | Max: 32m 44s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 32m 44s | Avg: 32m 44s | Max: 32m 44s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 32m 44s | Avg: 32m 44s | Max: 32m 44s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 32m 44s | Avg: 32m 44s | Max: 32m 44s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 32m 44s | Avg: 32m 44s | Max: 32m 44s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 32m 44s | Avg: 32m 44s | Max: 32m 44s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 32m 44s | Avg: 32m 44s | Max: 32m 44s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
+/- Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 90)

# Runner
65 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1
1 linux-amd64-gpu-h100-latest-1

@elstehle
Copy link
Contributor

elstehle commented Feb 5, 2025

I've compared the sass for cub.bench.radix_sort.keys.base and cub.cpp17.test.device_radix_sort_custom for this PR against main and there were no changes. So, we're good to go 😉

@bernhardmgruber bernhardmgruber enabled auto-merge (squash) February 5, 2025 18:08
Copy link
Contributor

github-actions bot commented Feb 5, 2025

🟩 CI finished in 2h 17m: Pass: 100%/90 | Total: 2d 17h | Avg: 43m 58s | Max: 1h 18m | Hits: 174%/13398
  • 🟩 cub: Pass: 100%/44 | Total: 1d 17h | Avg: 56m 50s | Max: 1h 18m | Hits: 88%/4168

    🟩 cpu
      🟩 amd64              Pass: 100%/42  | Total:  1d 15h | Avg: 56m 28s | Max:  1h 18m | Hits:  88%/4168  
      🟩 arm64              Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 04m
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 09m | Avg:  1h 01m | Max:  1h 06m | Hits:  90%/1042  
      🟩 12.5               Pass: 100%/2   | Total:  2h 34m | Avg:  1h 17m | Max:  1h 18m
      🟩 12.8               Pass: 100%/37  | Total:  1d 09h | Avg: 55m 04s | Max:  1h 18m | Hits:  87%/3126  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 09m
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 09m | Avg:  1h 01m | Max:  1h 06m | Hits:  90%/1042  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 34m | Avg:  1h 17m | Max:  1h 18m
      🟩 nvcc12.8           Pass: 100%/35  | Total:  1d 07h | Avg: 54m 28s | Max:  1h 18m | Hits:  87%/3126  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 09m
      🟩 nvcc               Pass: 100%/42  | Total:  1d 15h | Avg: 56m 25s | Max:  1h 18m | Hits:  88%/4168  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 04m | Avg:  1h 01m | Max:  1h 02m
      🟩 Clang15            Pass: 100%/2   | Total:  1h 57m | Avg: 58m 38s | Max: 59m 25s
      🟩 Clang16            Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m
      🟩 Clang17            Pass: 100%/2   | Total:  1h 58m | Avg: 59m 16s | Max:  1h 00m
      🟩 Clang18            Pass: 100%/7   | Total:  6h 11m | Avg: 53m 03s | Max:  1h 09m
      🟩 GCC7               Pass: 100%/2   | Total:  1h 59m | Avg: 59m 52s | Max: 59m 59s
      🟩 GCC8               Pass: 100%/1   | Total: 58m 36s | Avg: 58m 36s | Max: 58m 36s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 56m | Avg: 58m 15s | Max: 58m 49s
      🟩 GCC10              Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m
      🟩 GCC11              Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m
      🟩 GCC12              Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 06m
      🟩 GCC13              Pass: 100%/10  | Total:  6h 47m | Avg: 40m 43s | Max:  1h 09m
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 22m | Avg:  1h 11m | Max:  1h 15m | Hits:  90%/2084  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 33m | Avg:  1h 16m | Max:  1h 18m | Hits:  86%/2084  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 34m | Avg:  1h 17m | Max:  1h 18m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 16h 12m | Avg: 57m 13s | Max:  1h 09m
      🟩 GCC                Pass: 100%/21  | Total: 17h 58m | Avg: 51m 22s | Max:  1h 09m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 55m | Avg:  1h 13m | Max:  1h 18m | Hits:  88%/4168  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 34m | Avg:  1h 17m | Max:  1h 18m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 53m 30s | Avg: 26m 45s | Max: 27m 51s
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 12h | Avg:  1h 04m | Max:  1h 18m | Hits:  88%/4168  
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 30m | Avg: 33m 45s | Max:  1h 05m
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 14h | Avg:  1h 03m | Max:  1h 18m | Hits:  88%/4168  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 23m 02s | Avg: 23m 02s | Max: 23m 02s
      🟩 GraphCapture       Pass: 100%/1   | Total: 15m 36s | Avg: 15m 36s | Max: 15m 36s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 15m | Avg: 25m 00s | Max: 25m 39s
      🟩 TestGPU            Pass: 100%/2   | Total: 54m 54s | Avg: 27m 27s | Max: 31m 06s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 53m 30s | Avg: 26m 45s | Max: 27m 51s
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 09m | Avg:  1h 09m | Max:  1h 09m
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 21h 04m | Avg:  1h 03m | Max:  1h 18m | Hits:  90%/3126  
      🟩 20                 Pass: 100%/24  | Total: 20h 37m | Avg: 51m 32s | Max:  1h 18m | Hits:  82%/1042  
    
  • 🟩 thrust: Pass: 100%/43 | Total: 23h 36m | Avg: 32m 57s | Max: 1h 04m | Hits: 212%/9230

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 39m 29s | Avg: 19m 44s | Max: 28m 16s
    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total: 22h 36m | Avg: 33m 04s | Max:  1h 04m | Hits: 212%/9230  
      🟩 arm64              Pass: 100%/2   | Total:  1h 00m | Avg: 30m 22s | Max: 32m 14s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 05m | Avg: 37m 03s | Max: 53m 20s | Hits: 174%/1846  
      🟩 12.5               Pass: 100%/2   | Total:  1h 46m | Avg: 53m 02s | Max: 53m 49s
      🟩 12.8               Pass: 100%/36  | Total: 18h 45m | Avg: 31m 15s | Max:  1h 04m | Hits: 222%/7384  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 53m 23s | Avg: 26m 41s | Max: 28m 08s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 05m | Avg: 37m 03s | Max: 53m 20s | Hits: 174%/1846  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 46m | Avg: 53m 02s | Max: 53m 49s
      🟩 nvcc12.8           Pass: 100%/34  | Total: 17h 52m | Avg: 31m 32s | Max:  1h 04m | Hits: 222%/7384  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 53m 23s | Avg: 26m 41s | Max: 28m 08s
      🟩 nvcc               Pass: 100%/41  | Total: 22h 43m | Avg: 33m 15s | Max:  1h 04m | Hits: 212%/9230  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 07m | Avg: 31m 49s | Max: 33m 50s
      🟩 Clang15            Pass: 100%/2   | Total:  1h 04m | Avg: 32m 04s | Max: 33m 29s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 04m | Avg: 32m 23s | Max: 33m 22s
      🟩 Clang17            Pass: 100%/2   | Total:  1h 03m | Avg: 31m 49s | Max: 33m 28s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 41m | Avg: 23m 07s | Max: 31m 00s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 02m | Avg: 31m 26s | Max: 32m 31s
      🟩 GCC8               Pass: 100%/1   | Total: 31m 02s | Avg: 31m 02s | Max: 31m 02s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 09m | Avg: 34m 39s | Max: 34m 54s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 05m | Avg: 32m 40s | Max: 34m 22s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 06m | Avg: 33m 02s | Max: 33m 16s
      🟩 GCC12              Pass: 100%/2   | Total:  1h 11m | Avg: 35m 35s | Max: 36m 37s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 18m | Avg: 24m 49s | Max: 37m 18s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 52m | Avg: 56m 29s | Max: 59m 39s | Hits: 174%/3692  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 31m | Avg: 50m 35s | Max:  1h 04m | Hits: 238%/5538  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 46m | Avg: 53m 02s | Max: 53m 49s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  8h 01m | Avg: 28m 20s | Max: 33m 50s
      🟩 GCC                Pass: 100%/19  | Total:  9h 24m | Avg: 29m 42s | Max: 37m 18s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 24m | Avg: 52m 57s | Max:  1h 04m | Hits: 212%/9230  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 46m | Avg: 53m 02s | Max: 53m 49s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/33  | Total: 19h 32m | Avg: 35m 31s | Max: 59m 39s | Hits: 174%/5538  
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 04m | Avg: 24m 27s | Max:  1h 04m | Hits: 270%/3692  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 22h 12m | Avg: 36m 01s | Max:  1h 04m | Hits: 174%/7384  
      🟩 TestCPU            Pass: 100%/3   | Total: 50m 32s | Avg: 16m 50s | Max: 33m 51s | Hits: 365%/1846  
      🟩 TestGPU            Pass: 100%/3   | Total: 33m 25s | Avg: 11m 08s | Max: 11m 47s
    🟩 sm
      🟩 90;90a;100         Pass: 100%/1   | Total: 35m 27s | Avg: 35m 27s | Max: 35m 27s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 12h 10m | Avg: 36m 31s | Max: 59m 39s | Hits: 174%/5538  
      🟩 20                 Pass: 100%/21  | Total: 10h 46m | Avg: 30m 48s | Max:  1h 04m | Hits: 270%/3692  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 7m 35s | Avg: 3m 47s | Max: 5m 17s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  7m 35s | Avg:  3m 47s | Max:  5m 17s
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total:  7m 35s | Avg:  3m 47s | Max:  5m 17s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total:  7m 35s | Avg:  3m 47s | Max:  5m 17s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  7m 35s | Avg:  3m 47s | Max:  5m 17s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  7m 35s | Avg:  3m 47s | Max:  5m 17s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  7m 35s | Avg:  3m 47s | Max:  5m 17s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total:  7m 35s | Avg:  3m 47s | Max:  5m 17s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 18s | Avg:  2m 18s | Max:  2m 18s
      🟩 Test               Pass: 100%/1   | Total:  5m 17s | Avg:  5m 17s | Max:  5m 17s
    
  • 🟩 python: Pass: 100%/1 | Total: 32m 44s | Avg: 32m 44s | Max: 32m 44s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 32m 44s | Avg: 32m 44s | Max: 32m 44s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 32m 44s | Avg: 32m 44s | Max: 32m 44s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 32m 44s | Avg: 32m 44s | Max: 32m 44s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 32m 44s | Avg: 32m 44s | Max: 32m 44s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 32m 44s | Avg: 32m 44s | Max: 32m 44s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 32m 44s | Avg: 32m 44s | Max: 32m 44s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 32m 44s | Avg: 32m 44s | Max: 32m 44s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 32m 44s | Avg: 32m 44s | Max: 32m 44s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
+/- Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 90)

# Runner
65 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1
1 linux-amd64-gpu-h100-latest-1

@bernhardmgruber bernhardmgruber merged commit c35e567 into NVIDIA:main Feb 5, 2025
103 of 106 checks passed
@bernhardmgruber bernhardmgruber deleted the depr_basetraits branch February 5, 2025 18:19
bernhardmgruber added a commit to bernhardmgruber/cccl that referenced this pull request Feb 5, 2025
bernhardmgruber added a commit to bernhardmgruber/cccl that referenced this pull request Feb 5, 2025
bernhardmgruber added a commit to bernhardmgruber/cccl that referenced this pull request Feb 6, 2025
miscco pushed a commit to bernhardmgruber/cccl that referenced this pull request Feb 6, 2025
bernhardmgruber added a commit to bernhardmgruber/cccl that referenced this pull request Feb 6, 2025
miscco added a commit that referenced this pull request Feb 6, 2025
…3689) (#3703)

* Deprecate cub::Trait::CATEGORY|PRIMITIVE|NULL_TYPE (#3689)

* Fix FP type detection

---------

Co-authored-by: Michael Schellenberger Costa <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

3 participants