Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Replace CUB macros in tunings and benchmarks #3931

Merged
merged 3 commits into from
Feb 26, 2025

Conversation

bernhardmgruber
Copy link
Contributor

Split out of #3821

Copy link
Contributor

🟩 CI finished in 1h 51m: Pass: 100%/93 | Total: 2d 13h | Avg: 39m 26s | Max: 1h 18m | Hits: 74%/133929
  • 🟩 cub: Pass: 100%/45 | Total: 1d 15h | Avg: 52m 30s | Max: 1h 18m | Hits: 65%/53485

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 13h | Avg: 52m 09s | Max:  1h 18m | Hits:  65%/51055 
      🟩 arm64              Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m | Hits:  62%/2430  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  4h 49m | Avg: 57m 54s | Max:  1h 03m | Hits:  54%/5908  
      🟩 12.5               Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 07m | Hits:  62%/2248  
      🟩 12.8               Pass: 100%/38  | Total:  1d 08h | Avg: 51m 01s | Max:  1h 18m | Hits:  67%/45329 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 56m | Avg: 58m 21s | Max: 58m 49s | Hits:  68%/2100  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 49m | Avg: 57m 54s | Max:  1h 03m | Hits:  54%/5908  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 07m | Hits:  62%/2248  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  1d 06h | Avg: 50m 36s | Max:  1h 18m | Hits:  67%/43229 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 56m | Avg: 58m 21s | Max: 58m 49s | Hits:  68%/2100  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 13h | Avg: 52m 13s | Max:  1h 18m | Hits:  65%/51385 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 50m | Avg: 57m 39s | Max:  1h 01m | Hits:  63%/4868  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 53m | Avg: 56m 45s | Max: 58m 36s | Hits:  62%/2430  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 54m | Avg: 57m 22s | Max: 58m 30s | Hits:  62%/2430  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 49m | Avg: 54m 30s | Max: 54m 58s | Hits:  62%/2430  
      🟩 Clang18            Pass: 100%/7   | Total:  5h 37m | Avg: 48m 13s | Max:  1h 01m | Hits:  75%/8175  
      🟩 GCC7               Pass: 100%/2   | Total:  1h 52m | Avg: 56m 21s | Max: 56m 42s | Hits:  62%/2434  
      🟩 GCC8               Pass: 100%/1   | Total: 54m 00s | Avg: 54m 00s | Max: 54m 00s | Hits:  62%/1217  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 54m | Avg: 57m 18s | Max: 57m 27s | Hits:  62%/2434  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 56m | Avg: 58m 12s | Max: 58m 16s | Hits:  62%/2434  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 57m | Avg: 58m 52s | Max:  1h 02m | Hits:  62%/2430  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 03m | Hits:  62%/2430  
      🟩 GCC13              Pass: 100%/11  | Total:  6h 42m | Avg: 36m 36s | Max:  1h 11m | Hits:  82%/13365 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 09m | Hits:  13%/2080  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 32m | Avg:  1h 16m | Max:  1h 18m | Hits:  13%/2080  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 07m | Hits:  62%/2248  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 15h 05m | Avg: 53m 15s | Max:  1h 01m | Hits:  67%/20333 
      🟩 GCC                Pass: 100%/22  | Total: 17h 18m | Avg: 47m 12s | Max:  1h 11m | Hits:  72%/26744 
      🟩 MSVC               Pass: 100%/4   | Total:  4h 44m | Avg:  1h 11m | Max:  1h 18m | Hits:  13%/4160  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 07m | Hits:  62%/2248  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 09m | Avg: 23m 06s | Max: 24m 35s | Hits:  87%/3645  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 10h | Avg:  1h 00m | Max:  1h 18m | Hits:  57%/40120 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 06m | Avg: 30m 50s | Max: 58m 34s | Hits:  90%/9720  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 12h | Avg: 59m 04s | Max:  1h 18m | Hits:  58%/43765 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 22m 18s | Avg: 22m 18s | Max: 22m 18s | Hits:  99%/1215  
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 27s | Avg: 17m 27s | Max: 17m 27s | Hits:  99%/1215  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 13m | Avg: 24m 20s | Max: 25m 02s | Hits:  99%/3645  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 04m | Avg: 21m 20s | Max: 23m 27s | Hits:  99%/3645  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 09m | Avg: 23m 06s | Max: 24m 35s | Hits:  87%/3645  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m | Hits:  62%/1215  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 19h 39m | Avg: 58m 58s | Max:  1h 18m | Hits:  56%/23535 
      🟩 20                 Pass: 100%/25  | Total: 19h 43m | Avg: 47m 19s | Max:  1h 13m | Hits:  73%/29950 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 20h 49m | Avg: 27m 46s | Max: 56m 51s | Hits: 79%/80136

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 35m 17s | Avg: 17m 38s | Max: 25m 04s | Hits:  89%/3564  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 19h 59m | Avg: 27m 53s | Max: 56m 51s | Hits:  79%/76573 
      🟩 arm64              Pass: 100%/2   | Total: 50m 50s | Avg: 25m 25s | Max: 26m 31s | Hits:  79%/3563  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  2h 40m | Avg: 32m 02s | Max: 47m 57s | Hits:  74%/8901  
      🟩 12.5               Pass: 100%/2   | Total:  1h 34m | Avg: 47m 06s | Max: 48m 51s | Hits:  73%/3562  
      🟩 12.8               Pass: 100%/38  | Total: 16h 35m | Avg: 26m 11s | Max: 56m 51s | Hits:  80%/67673 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 48m 23s | Avg: 24m 11s | Max: 24m 28s | Hits:  79%/3562  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  2h 40m | Avg: 32m 02s | Max: 47m 57s | Hits:  74%/8901  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 34m | Avg: 47m 06s | Max: 48m 51s | Hits:  73%/3562  
      🟩 nvcc12.8           Pass: 100%/36  | Total: 15h 47m | Avg: 26m 18s | Max: 56m 51s | Hits:  80%/64111 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 48m 23s | Avg: 24m 11s | Max: 24m 28s | Hits:  79%/3562  
      🟩 nvcc               Pass: 100%/43  | Total: 20h 01m | Avg: 27m 56s | Max: 56m 51s | Hits:  79%/76574 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 46m | Avg: 26m 33s | Max: 27m 16s | Hits:  79%/7124  
      🟩 Clang15            Pass: 100%/2   | Total: 55m 31s | Avg: 27m 45s | Max: 29m 18s | Hits:  79%/3562  
      🟩 Clang16            Pass: 100%/2   | Total: 51m 56s | Avg: 25m 58s | Max: 26m 13s | Hits:  79%/3562  
      🟩 Clang17            Pass: 100%/2   | Total: 55m 13s | Avg: 27m 36s | Max: 28m 42s | Hits:  79%/3562  
      🟩 Clang18            Pass: 100%/7   | Total:  2h 20m | Avg: 20m 06s | Max: 25m 49s | Hits:  85%/12467 
      🟩 GCC7               Pass: 100%/2   | Total: 54m 28s | Avg: 27m 14s | Max: 28m 37s | Hits:  79%/3564  
      🟩 GCC8               Pass: 100%/1   | Total: 26m 36s | Avg: 26m 36s | Max: 26m 36s | Hits:  79%/1782  
      🟩 GCC9               Pass: 100%/2   | Total: 59m 21s | Avg: 29m 40s | Max: 29m 41s | Hits:  79%/3564  
      🟩 GCC10              Pass: 100%/2   | Total: 55m 59s | Avg: 27m 59s | Max: 29m 01s | Hits:  79%/3564  
      🟩 GCC11              Pass: 100%/2   | Total: 59m 11s | Avg: 29m 35s | Max: 29m 57s | Hits:  79%/3564  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 00m | Avg: 30m 01s | Max: 31m 41s | Hits:  79%/3564  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 13m | Avg: 19m 18s | Max: 29m 31s | Hits:  87%/17820 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 36m | Avg: 48m 15s | Max: 48m 34s | Hits:  55%/3550  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 20m | Avg: 46m 55s | Max: 56m 51s | Hits:  60%/5325  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 34m | Avg: 47m 06s | Max: 48m 51s | Hits:  73%/3562  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  6h 49m | Avg: 24m 05s | Max: 29m 18s | Hits:  81%/30277 
      🟩 GCC                Pass: 100%/21  | Total:  8h 28m | Avg: 24m 13s | Max: 31m 41s | Hits:  83%/37422 
      🟩 MSVC               Pass: 100%/5   | Total:  3h 57m | Avg: 47m 27s | Max: 56m 51s | Hits:  58%/8875  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 34m | Avg: 47m 06s | Max: 48m 51s | Hits:  73%/3562  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 27m 05s | Avg: 13m 32s | Max: 16m 25s | Hits:  89%/3564  
      🟩 rtx2080            Pass: 100%/33  | Total: 16h 47m | Avg: 30m 31s | Max: 50m 20s | Hits:  76%/58769 
      🟩 rtx4090            Pass: 100%/10  | Total:  3h 35m | Avg: 21m 31s | Max: 56m 51s | Hits:  86%/17803 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total: 19h 19m | Avg: 30m 30s | Max: 56m 51s | Hits:  76%/67671 
      🟩 TestCPU            Pass: 100%/3   | Total: 49m 07s | Avg: 16m 22s | Max: 33m 35s | Hits:  90%/5338  
      🟩 TestGPU            Pass: 100%/4   | Total: 41m 16s | Avg: 10m 19s | Max: 10m 40s | Hits:  99%/7127  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 27m 05s | Avg: 13m 32s | Max: 16m 25s | Hits:  89%/3564  
      🟩 90;90a;100         Pass: 100%/1   | Total: 29m 31s | Avg: 29m 31s | Max: 29m 31s | Hits:  79%/1782  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 10h 34m | Avg: 31m 42s | Max: 50m 20s | Hits:  75%/35611 
      🟩 20                 Pass: 100%/23  | Total:  9h 40m | Avg: 25m 14s | Max: 56m 51s | Hits:  82%/40961 
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 16m 09s | Avg: 8m 04s | Max: 13m 39s | Hits: 98%/308

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 16m 09s | Avg:  8m 04s | Max: 13m 39s | Hits:  98%/308   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 16m 09s | Avg:  8m 04s | Max: 13m 39s | Hits:  98%/308   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 16m 09s | Avg:  8m 04s | Max: 13m 39s | Hits:  98%/308   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 16m 09s | Avg:  8m 04s | Max: 13m 39s | Hits:  98%/308   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 16m 09s | Avg:  8m 04s | Max: 13m 39s | Hits:  98%/308   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 16m 09s | Avg:  8m 04s | Max: 13m 39s | Hits:  98%/308   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 16m 09s | Avg:  8m 04s | Max: 13m 39s | Hits:  98%/308   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 30s | Avg:  2m 30s | Max:  2m 30s | Hits:  98%/154   
      🟩 Test               Pass: 100%/1   | Total: 13m 39s | Avg: 13m 39s | Max: 13m 39s | Hits:  98%/154   
    
  • 🟩 python: Pass: 100%/1 | Total: 39m 43s | Avg: 39m 43s | Max: 39m 43s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 39m 43s | Avg: 39m 43s | Max: 39m 43s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 39m 43s | Avg: 39m 43s | Max: 39m 43s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 39m 43s | Avg: 39m 43s | Max: 39m 43s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 39m 43s | Avg: 39m 43s | Max: 39m 43s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 39m 43s | Avg: 39m 43s | Max: 39m 43s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 39m 43s | Avg: 39m 43s | Max: 39m 43s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 39m 43s | Avg: 39m 43s | Max: 39m 43s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 39m 43s | Avg: 39m 43s | Max: 39m 43s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
+/- CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 93)

# Runner
66 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

@bernhardmgruber bernhardmgruber enabled auto-merge (squash) February 25, 2025 08:53
@bernhardmgruber bernhardmgruber requested a review from a team as a code owner February 25, 2025 09:33
Copy link
Contributor

🟨 CI finished in 1h 06m: Pass: 53%/93 | Total: 1d 11h | Avg: 23m 03s | Max: 1h 01m | Hits: 91%/57355
  • 🟨 thrust: Pass: 4%/45 | Total: 6h 43m | Avg: 8m 57s | Max: 40m 46s | Hits: 99%/3562

    🚨 cudacxx_family: nvcc 🚨
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 51s | Avg:  5m 25s | Max:  5m 36s | Hits:  99%/3562  
      🔥 nvcc               Pass:   0%/43  | Total:  6h 32m | Avg:  9m 07s | Max: 40m 46s
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 51s | Avg:  5m 25s | Max:  5m 36s | Hits:  99%/3562  
      🟥 nvcc12.0           Pass:   0%/5   | Total: 59m 48s | Avg: 11m 57s | Max: 30m 06s
      🟥 nvcc12.5           Pass:   0%/2   | Total: 27m 22s | Avg: 13m 41s | Max: 13m 49s
      🟥 nvcc12.8           Pass:   0%/36  | Total:  5h 05m | Avg:  8m 29s | Max: 40m 46s
    🟥 cmake_options
      🟥 -DTHRUST_DISPATCH_TYPE=Force32bit Pass:   0%/2   | Total:  6m 48s | Avg:  3m 24s | Max:  6m 48s
    🟨 cpu
      🟨 amd64              Pass:   4%/43  | Total:  6h 27m | Avg:  9m 00s | Max: 40m 46s | Hits:  99%/3562  
      🟥 arm64              Pass:   0%/2   | Total: 16m 27s | Avg:  8m 13s | Max:  8m 41s
    🟨 ctk
      🟥 12.0               Pass:   0%/5   | Total: 59m 48s | Avg: 11m 57s | Max: 30m 06s
      🟥 12.5               Pass:   0%/2   | Total: 27m 22s | Avg: 13m 41s | Max: 13m 49s
      🟨 12.8               Pass:   5%/38  | Total:  5h 16m | Avg:  8m 19s | Max: 40m 46s | Hits:  99%/3562  
    🟨 cxx
      🟥 Clang14            Pass:   0%/4   | Total: 29m 28s | Avg:  7m 22s | Max:  7m 42s
      🟥 Clang15            Pass:   0%/2   | Total: 15m 38s | Avg:  7m 49s | Max:  7m 51s
      🟥 Clang16            Pass:   0%/2   | Total: 14m 53s | Avg:  7m 26s | Max:  7m 32s
      🟥 Clang17            Pass:   0%/2   | Total: 15m 40s | Avg:  7m 50s | Max:  7m 56s
      🟨 Clang18            Pass:  28%/7   | Total: 33m 30s | Avg:  4m 47s | Max:  7m 46s | Hits:  99%/3562  
      🟥 GCC7               Pass:   0%/2   | Total: 14m 43s | Avg:  7m 21s | Max:  7m 28s
      🟥 GCC8               Pass:   0%/1   | Total:  7m 40s | Avg:  7m 40s | Max:  7m 40s
      🟥 GCC9               Pass:   0%/2   | Total: 16m 39s | Avg:  8m 19s | Max:  8m 38s
      🟥 GCC10              Pass:   0%/2   | Total: 15m 46s | Avg:  7m 53s | Max:  7m 59s
      🟥 GCC11              Pass:   0%/2   | Total: 15m 49s | Avg:  7m 54s | Max:  8m 01s
      🟥 GCC12              Pass:   0%/2   | Total: 17m 38s | Avg:  8m 49s | Max:  9m 01s
      🟥 GCC13              Pass:   0%/10  | Total: 46m 58s | Avg:  4m 41s | Max:  8m 51s
      🟥 MSVC14.29          Pass:   0%/2   | Total:  1h 00m | Avg: 30m 05s | Max: 30m 06s
      🟥 MSVC14.42          Pass:   0%/3   | Total:  1h 11m | Avg: 23m 51s | Max: 40m 46s
      🟥 NVHPC24.7          Pass:   0%/2   | Total: 27m 22s | Avg: 13m 41s | Max: 13m 49s
    🟨 cxx_family
      🟨 Clang              Pass:  11%/17  | Total:  1h 49m | Avg:  6m 25s | Max:  7m 56s | Hits:  99%/3562  
      🟥 GCC                Pass:   0%/21  | Total:  2h 15m | Avg:  6m 26s | Max:  9m 01s
      🟥 MSVC               Pass:   0%/5   | Total:  2h 11m | Avg: 26m 21s | Max: 40m 46s
      🟥 NVHPC              Pass:   0%/2   | Total: 27m 22s | Avg: 13m 41s | Max: 13m 49s
    🟨 gpu
      🟥 h100               Pass:   0%/2   | Total:  5m 25s | Avg:  2m 42s | Max:  5m 25s
      🟨 rtx2080            Pass:   6%/33  | Total:  5h 33m | Avg: 10m 07s | Max: 30m 48s | Hits:  99%/3562  
      🟥 rtx4090            Pass:   0%/10  | Total:  1h 04m | Avg:  6m 24s | Max: 40m 46s
    🟨 jobs
      🟨 Build              Pass:   5%/38  | Total:  6h 43m | Avg: 10m 37s | Max: 40m 46s | Hits:  99%/3562  
      🟥 TestCPU            Pass:   0%/3  
      🟥 TestGPU            Pass:   0%/4  
    🟥 sm
      🟥 90                 Pass:   0%/2   | Total:  5m 25s | Avg:  2m 42s | Max:  5m 25s
      🟥 90;90a;100         Pass:   0%/1   | Total:  8m 51s | Avg:  8m 51s | Max:  8m 51s
    🟨 std
      🟨 17                 Pass:   5%/20  | Total:  3h 46m | Avg: 11m 20s | Max: 30m 48s | Hits:  99%/1781  
      🟨 20                 Pass:   4%/23  | Total:  2h 49m | Avg:  7m 23s | Max: 40m 46s | Hits:  99%/1781  
    
  • 🟩 cub: Pass: 100%/45 | Total: 1d 04h | Avg: 37m 28s | Max: 1h 01m | Hits: 90%/53485

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 02h | Avg: 37m 11s | Max:  1h 01m | Hits:  90%/51055 
      🟩 arm64              Pass: 100%/2   | Total:  1h 27m | Avg: 43m 42s | Max: 43m 53s | Hits:  99%/2430  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 16m | Avg: 39m 22s | Max: 43m 20s | Hits:  82%/5908  
      🟩 12.5               Pass: 100%/2   | Total:  1h 30m | Avg: 45m 22s | Max: 46m 38s | Hits:  89%/2248  
      🟩 12.8               Pass: 100%/38  | Total: 23h 18m | Avg: 36m 48s | Max:  1h 01m | Hits:  92%/45329 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 44m | Avg: 52m 21s | Max: 52m 33s | Hits:  99%/2100  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 16m | Avg: 39m 22s | Max: 43m 20s | Hits:  82%/5908  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 30m | Avg: 45m 22s | Max: 46m 38s | Hits:  89%/2248  
      🟩 nvcc12.8           Pass: 100%/36  | Total: 21h 34m | Avg: 35m 56s | Max:  1h 01m | Hits:  91%/43229 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 44m | Avg: 52m 21s | Max: 52m 33s | Hits:  99%/2100  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 02h | Avg: 36m 47s | Max:  1h 01m | Hits:  90%/51385 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 29m | Avg: 37m 25s | Max: 39m 52s | Hits:  98%/4868  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 16m | Avg: 38m 18s | Max: 40m 24s | Hits:  98%/2430  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 12m | Avg: 36m 20s | Max: 36m 21s | Hits:  99%/2430  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 17m | Avg: 38m 35s | Max: 40m 50s | Hits:  96%/2430  
      🟩 Clang18            Pass: 100%/7   | Total:  4h 27m | Avg: 38m 15s | Max: 52m 33s | Hits:  99%/8175  
      🟩 GCC7               Pass: 100%/2   | Total:  1h 15m | Avg: 37m 44s | Max: 39m 49s | Hits:  97%/2434  
      🟩 GCC8               Pass: 100%/1   | Total: 36m 11s | Avg: 36m 11s | Max: 36m 11s | Hits:  97%/1217  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 13m | Avg: 36m 45s | Max: 36m 58s | Hits:  97%/2434  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 20m | Avg: 40m 21s | Max: 41m 20s | Hits:  95%/2434  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 16m | Avg: 38m 03s | Max: 39m 15s | Hits:  94%/2430  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 21m | Avg: 40m 49s | Max: 41m 24s | Hits:  94%/2430  
      🟩 GCC13              Pass: 100%/11  | Total:  5h 05m | Avg: 27m 47s | Max: 49m 13s | Hits:  98%/13365 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 40m | Avg: 50m 15s | Max: 57m 10s | Hits:  14%/2080  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m | Hits:  14%/2080  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 30m | Avg: 45m 22s | Max: 46m 38s | Hits:  89%/2248  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 10h 43m | Avg: 37m 52s | Max: 52m 33s | Hits:  98%/20333 
      🟩 GCC                Pass: 100%/22  | Total: 12h 09m | Avg: 33m 08s | Max: 49m 13s | Hits:  97%/26744 
      🟩 MSVC               Pass: 100%/4   | Total:  3h 42m | Avg: 55m 37s | Max:  1h 01m | Hits:  14%/4160  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 30m | Avg: 45m 22s | Max: 46m 38s | Hits:  89%/2248  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total: 50m 26s | Avg: 16m 48s | Max: 24m 24s | Hits:  99%/3645  
      🟩 rtx2080            Pass: 100%/34  | Total: 23h 52m | Avg: 42m 07s | Max:  1h 01m | Hits:  88%/40120 
      🟩 rtxa6000           Pass: 100%/8   | Total:  3h 23m | Avg: 25m 26s | Max: 42m 32s | Hits:  98%/9720  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 01h | Avg: 40m 58s | Max:  1h 01m | Hits:  88%/43765 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 32s | Avg: 21m 32s | Max: 21m 32s | Hits:  99%/1215  
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 34s | Avg: 16m 34s | Max: 16m 34s | Hits:  99%/1215  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 10m | Avg: 23m 39s | Max: 24m 24s | Hits:  99%/3645  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 01m | Avg: 20m 24s | Max: 21m 01s | Hits:  99%/3645  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 50m 26s | Avg: 16m 48s | Max: 24m 24s | Hits:  99%/3645  
      🟩 90;90a;100         Pass: 100%/1   | Total: 49m 13s | Avg: 49m 13s | Max: 49m 13s | Hits:  92%/1215  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 13h 46m | Avg: 41m 19s | Max:  1h 01m | Hits:  85%/23535 
      🟩 20                 Pass: 100%/25  | Total: 14h 20m | Avg: 34m 24s | Max:  1h 00m | Hits:  94%/29950 
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 15m 10s | Avg: 7m 35s | Max: 12m 50s | Hits: 98%/308

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 15m 10s | Avg:  7m 35s | Max: 12m 50s | Hits:  98%/308   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 15m 10s | Avg:  7m 35s | Max: 12m 50s | Hits:  98%/308   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 15m 10s | Avg:  7m 35s | Max: 12m 50s | Hits:  98%/308   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 15m 10s | Avg:  7m 35s | Max: 12m 50s | Hits:  98%/308   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 15m 10s | Avg:  7m 35s | Max: 12m 50s | Hits:  98%/308   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 15m 10s | Avg:  7m 35s | Max: 12m 50s | Hits:  98%/308   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 15m 10s | Avg:  7m 35s | Max: 12m 50s | Hits:  98%/308   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 20s | Avg:  2m 20s | Max:  2m 20s | Hits:  98%/154   
      🟩 Test               Pass: 100%/1   | Total: 12m 50s | Avg: 12m 50s | Max: 12m 50s | Hits:  98%/154   
    
  • 🟩 python: Pass: 100%/1 | Total: 39m 56s | Avg: 39m 56s | Max: 39m 56s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 39m 56s | Avg: 39m 56s | Max: 39m 56s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 39m 56s | Avg: 39m 56s | Max: 39m 56s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 39m 56s | Avg: 39m 56s | Max: 39m 56s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 39m 56s | Avg: 39m 56s | Max: 39m 56s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 39m 56s | Avg: 39m 56s | Max: 39m 56s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 39m 56s | Avg: 39m 56s | Max: 39m 56s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 39m 56s | Avg: 39m 56s | Max: 39m 56s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 39m 56s | Avg: 39m 56s | Max: 39m 56s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
python
+/- CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 93)

# Runner
66 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

Copy link
Contributor

🟨 CI finished in 1h 27m: Pass: 53%/93 | Total: 2d 10h | Avg: 37m 48s | Max: 1h 22m | Hits: 69%/57355
  • 🟨 thrust: Pass: 4%/45 | Total: 17h 38m | Avg: 23m 30s | Max: 57m 24s | Hits: 79%/3562

    🚨 cudacxx_family: nvcc 🚨
      🟩 ClangCUDA          Pass: 100%/2   | Total: 46m 10s | Avg: 23m 05s | Max: 24m 11s | Hits:  79%/3562  
      🔥 nvcc               Pass:   0%/43  | Total: 16h 52m | Avg: 23m 32s | Max: 57m 24s
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 46m 10s | Avg: 23m 05s | Max: 24m 11s | Hits:  79%/3562  
      🟥 nvcc12.0           Pass:   0%/5   | Total:  2h 27m | Avg: 29m 24s | Max: 45m 29s
      🟥 nvcc12.5           Pass:   0%/2   | Total:  1h 16m | Avg: 38m 18s | Max: 39m 52s
      🟥 nvcc12.8           Pass:   0%/36  | Total: 13h 08m | Avg: 21m 53s | Max: 57m 24s
    🟥 cmake_options
      🟥 -DTHRUST_DISPATCH_TYPE=Force32bit Pass:   0%/2   | Total: 19m 26s | Avg:  9m 43s | Max: 19m 26s
    🟨 cpu
      🟨 amd64              Pass:   4%/43  | Total: 16h 51m | Avg: 23m 31s | Max: 57m 24s | Hits:  79%/3562  
      🟥 arm64              Pass:   0%/2   | Total: 46m 44s | Avg: 23m 22s | Max: 24m 41s
    🟨 ctk
      🟥 12.0               Pass:   0%/5   | Total:  2h 27m | Avg: 29m 24s | Max: 45m 29s
      🟥 12.5               Pass:   0%/2   | Total:  1h 16m | Avg: 38m 18s | Max: 39m 52s
      🟨 12.8               Pass:   5%/38  | Total: 13h 54m | Avg: 21m 57s | Max: 57m 24s | Hits:  79%/3562  
    🟨 cxx
      🟥 Clang14            Pass:   0%/4   | Total:  1h 38m | Avg: 24m 33s | Max: 25m 11s
      🟥 Clang15            Pass:   0%/2   | Total: 48m 35s | Avg: 24m 17s | Max: 25m 12s
      🟥 Clang16            Pass:   0%/2   | Total: 49m 55s | Avg: 24m 57s | Max: 26m 07s
      🟥 Clang17            Pass:   0%/2   | Total: 47m 09s | Avg: 23m 34s | Max: 23m 38s
      🟨 Clang18            Pass:  28%/7   | Total:  1h 55m | Avg: 16m 28s | Max: 24m 11s | Hits:  79%/3562  
      🟥 GCC7               Pass:   0%/2   | Total: 51m 27s | Avg: 25m 43s | Max: 26m 17s
      🟥 GCC8               Pass:   0%/1   | Total: 22m 39s | Avg: 22m 39s | Max: 22m 39s
      🟥 GCC9               Pass:   0%/2   | Total: 51m 56s | Avg: 25m 58s | Max: 26m 31s
      🟥 GCC10              Pass:   0%/2   | Total: 50m 26s | Avg: 25m 13s | Max: 25m 35s
      🟥 GCC11              Pass:   0%/2   | Total: 54m 56s | Avg: 27m 28s | Max: 29m 27s
      🟥 GCC12              Pass:   0%/2   | Total: 52m 28s | Avg: 26m 14s | Max: 27m 59s
      🟥 GCC13              Pass:   0%/10  | Total:  2h 15m | Avg: 13m 34s | Max: 26m 59s
      🟥 MSVC14.29          Pass:   0%/2   | Total:  1h 37m | Avg: 48m 32s | Max: 51m 35s
      🟥 MSVC14.42          Pass:   0%/3   | Total:  1h 45m | Avg: 35m 13s | Max: 57m 24s
      🟥 NVHPC24.7          Pass:   0%/2   | Total:  1h 16m | Avg: 38m 18s | Max: 39m 52s
    🟨 cxx_family
      🟨 Clang              Pass:  11%/17  | Total:  5h 59m | Avg: 21m 07s | Max: 26m 07s | Hits:  79%/3562  
      🟥 GCC                Pass:   0%/21  | Total:  6h 59m | Avg: 19m 58s | Max: 29m 27s
      🟥 MSVC               Pass:   0%/5   | Total:  3h 22m | Avg: 40m 32s | Max: 57m 24s
      🟥 NVHPC              Pass:   0%/2   | Total:  1h 16m | Avg: 38m 18s | Max: 39m 52s
    🟨 gpu
      🟥 h100               Pass:   0%/2   | Total: 13m 27s | Avg:  6m 43s | Max: 13m 27s
      🟨 rtx2080            Pass:   6%/33  | Total: 15h 19m | Avg: 27m 52s | Max: 51m 35s | Hits:  79%/3562  
      🟥 rtx4090            Pass:   0%/10  | Total:  2h 05m | Avg: 12m 30s | Max: 57m 24s
    🟨 jobs
      🟨 Build              Pass:   5%/38  | Total: 17h 38m | Avg: 27m 50s | Max: 57m 24s | Hits:  79%/3562  
      🟥 TestCPU            Pass:   0%/3  
      🟥 TestGPU            Pass:   0%/4  
    🟥 sm
      🟥 90                 Pass:   0%/2   | Total: 13m 27s | Avg:  6m 43s | Max: 13m 27s
      🟥 90;90a;100         Pass:   0%/1   | Total: 26m 40s | Avg: 26m 40s | Max: 26m 40s
    🟨 std
      🟨 17                 Pass:   5%/20  | Total:  9h 46m | Avg: 29m 19s | Max: 51m 35s | Hits:  79%/1781  
      🟨 20                 Pass:   4%/23  | Total:  7h 32m | Avg: 19m 40s | Max: 57m 24s | Hits:  79%/1781  
    
  • 🟩 cub: Pass: 100%/45 | Total: 1d 15h | Avg: 53m 09s | Max: 1h 22m | Hits: 68%/53485

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 13h | Avg: 52m 48s | Max:  1h 22m | Hits:  68%/51055 
      🟩 arm64              Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m | Hits:  65%/2430  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 04m | Avg:  1h 00m | Max:  1h 06m | Hits:  56%/5908  
      🟩 12.5               Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 05m | Hits:  65%/2248  
      🟩 12.8               Pass: 100%/38  | Total:  1d 08h | Avg: 51m 32s | Max:  1h 22m | Hits:  69%/45329 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 06m | Hits:  71%/2100  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 04m | Avg:  1h 00m | Max:  1h 06m | Hits:  56%/5908  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 05m | Hits:  65%/2248  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  1d 06h | Avg: 50m 58s | Max:  1h 22m | Hits:  69%/43229 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 06m | Hits:  71%/2100  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 13h | Avg: 52m 46s | Max:  1h 22m | Hits:  68%/51385 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 55m | Avg: 58m 48s | Max:  1h 03m | Hits:  66%/4868  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 55m | Avg: 57m 52s | Max:  1h 01m | Hits:  66%/2430  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 53m | Avg: 56m 41s | Max: 57m 09s | Hits:  66%/2430  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 53m | Avg: 56m 54s | Max: 59m 47s | Hits:  66%/2430  
      🟩 Clang18            Pass: 100%/7   | Total:  5h 43m | Avg: 49m 01s | Max:  1h 06m | Hits:  77%/8175  
      🟩 GCC7               Pass: 100%/2   | Total:  1h 56m | Avg: 58m 10s | Max:  1h 00m | Hits:  65%/2434  
      🟩 GCC8               Pass: 100%/1   | Total: 54m 07s | Avg: 54m 07s | Max: 54m 07s | Hits:  65%/1217  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 54m | Avg: 57m 16s | Max: 57m 51s | Hits:  65%/2434  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m | Hits:  65%/2434  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 56m | Avg: 58m 04s | Max:  1h 00m | Hits:  65%/2430  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m | Hits:  65%/2430  
      🟩 GCC13              Pass: 100%/11  | Total:  6h 36m | Avg: 36m 03s | Max:  1h 06m | Hits:  84%/13365 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 22m | Avg:  1h 11m | Max:  1h 16m | Hits:  13%/2080  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 33m | Avg:  1h 16m | Max:  1h 22m | Hits:  13%/2080  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 05m | Hits:  65%/2248  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 15h 21m | Avg: 54m 11s | Max:  1h 06m | Hits:  70%/20333 
      🟩 GCC                Pass: 100%/22  | Total: 17h 25m | Avg: 47m 31s | Max:  1h 06m | Hits:  74%/26744 
      🟩 MSVC               Pass: 100%/4   | Total:  4h 56m | Avg:  1h 14m | Max:  1h 22m | Hits:  13%/4160  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 05m | Hits:  65%/2248  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 11m | Avg: 23m 46s | Max: 25m 30s | Hits:  88%/3645  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 10h | Avg:  1h 01m | Max:  1h 22m | Hits:  60%/40120 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 06m | Avg: 30m 48s | Max:  1h 02m | Hits:  91%/9720  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 13h | Avg:  1h 00m | Max:  1h 22m | Hits:  61%/43765 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 43s | Avg: 21m 43s | Max: 21m 43s | Hits:  99%/1215  
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 04s | Avg: 17m 04s | Max: 17m 04s | Hits:  99%/1215  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 09m | Avg: 23m 09s | Max: 24m 09s | Hits:  99%/3645  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 03m | Avg: 21m 10s | Max: 22m 19s | Hits:  99%/3645  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 11m | Avg: 23m 46s | Max: 25m 30s | Hits:  88%/3645  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 06m | Avg:  1h 06m | Max:  1h 06m | Hits:  65%/1215  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 20h 03m | Avg:  1h 00m | Max:  1h 16m | Hits:  59%/23535 
      🟩 20                 Pass: 100%/25  | Total: 19h 49m | Avg: 47m 34s | Max:  1h 22m | Hits:  75%/29950 
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 15m 09s | Avg: 7m 34s | Max: 12m 42s | Hits: 98%/308

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 15m 09s | Avg:  7m 34s | Max: 12m 42s | Hits:  98%/308   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 15m 09s | Avg:  7m 34s | Max: 12m 42s | Hits:  98%/308   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 15m 09s | Avg:  7m 34s | Max: 12m 42s | Hits:  98%/308   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 15m 09s | Avg:  7m 34s | Max: 12m 42s | Hits:  98%/308   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 15m 09s | Avg:  7m 34s | Max: 12m 42s | Hits:  98%/308   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 15m 09s | Avg:  7m 34s | Max: 12m 42s | Hits:  98%/308   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 15m 09s | Avg:  7m 34s | Max: 12m 42s | Hits:  98%/308   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 27s | Avg:  2m 27s | Max:  2m 27s | Hits:  98%/154   
      🟩 Test               Pass: 100%/1   | Total: 12m 42s | Avg: 12m 42s | Max: 12m 42s | Hits:  98%/154   
    
  • 🟩 python: Pass: 100%/1 | Total: 49m 48s | Avg: 49m 48s | Max: 49m 48s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 49m 48s | Avg: 49m 48s | Max: 49m 48s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 49m 48s | Avg: 49m 48s | Max: 49m 48s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 49m 48s | Avg: 49m 48s | Max: 49m 48s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 49m 48s | Avg: 49m 48s | Max: 49m 48s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 49m 48s | Avg: 49m 48s | Max: 49m 48s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 49m 48s | Avg: 49m 48s | Max: 49m 48s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 49m 48s | Avg: 49m 48s | Max: 49m 48s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 49m 48s | Avg: 49m 48s | Max: 49m 48s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
python
+/- CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 93)

# Runner
66 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

Copy link
Contributor

🟩 CI finished in 1h 01m: Pass: 100%/93 | Total: 18h 08m | Avg: 11m 42s | Max: 49m 26s | Hits: 92%/133929
  • 🟩 cub: Pass: 100%/45 | Total: 8h 23m | Avg: 11m 11s | Max: 34m 15s | Hits: 93%/53485

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  8h 12m | Avg: 11m 27s | Max: 34m 15s | Hits:  92%/51055 
      🟩 arm64              Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 50s | Hits:  99%/2430  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 50m 57s | Avg: 10m 11s | Max: 28m 29s | Hits:  85%/5908  
      🟩 12.5               Pass: 100%/2   | Total: 20m 18s | Avg: 10m 09s | Max: 10m 37s | Hits:  98%/2248  
      🟩 12.8               Pass: 100%/38  | Total:  7h 12m | Avg: 11m 22s | Max: 34m 15s | Hits:  94%/45329 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 34s | Avg:  4m 47s | Max:  4m 55s | Hits: 100%/2100  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 50m 57s | Avg: 10m 11s | Max: 28m 29s | Hits:  85%/5908  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 20m 18s | Avg: 10m 09s | Max: 10m 37s | Hits:  98%/2248  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  7h 02m | Avg: 11m 44s | Max: 34m 15s | Hits:  93%/43229 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 34s | Avg:  4m 47s | Max:  4m 55s | Hits: 100%/2100  
      🟩 nvcc               Pass: 100%/43  | Total:  8h 14m | Avg: 11m 29s | Max: 34m 15s | Hits:  92%/51385 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 23m 08s | Avg:  5m 47s | Max:  6m 16s | Hits: 100%/4868  
      🟩 Clang15            Pass: 100%/2   | Total: 11m 56s | Avg:  5m 58s | Max:  6m 01s | Hits: 100%/2430  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 08s | Avg:  6m 04s | Max:  6m 09s | Hits: 100%/2430  
      🟩 Clang17            Pass: 100%/2   | Total: 12m 22s | Avg:  6m 11s | Max:  6m 23s | Hits: 100%/2430  
      🟩 Clang18            Pass: 100%/7   | Total:  1h 09m | Avg:  9m 54s | Max: 23m 25s | Hits: 100%/8175  
      🟩 GCC7               Pass: 100%/2   | Total: 11m 59s | Avg:  5m 59s | Max:  6m 12s | Hits:  99%/2434  
      🟩 GCC8               Pass: 100%/1   | Total:  6m 18s | Avg:  6m 18s | Max:  6m 18s | Hits:  99%/1217  
      🟩 GCC9               Pass: 100%/2   | Total: 11m 50s | Avg:  5m 55s | Max:  6m 11s | Hits:  99%/2434  
      🟩 GCC10              Pass: 100%/2   | Total: 12m 48s | Avg:  6m 24s | Max:  6m 41s | Hits:  99%/2434  
      🟩 GCC11              Pass: 100%/2   | Total: 12m 53s | Avg:  6m 26s | Max:  6m 36s | Hits:  99%/2430  
      🟩 GCC12              Pass: 100%/2   | Total: 13m 48s | Avg:  6m 54s | Max:  7m 03s | Hits:  99%/2430  
      🟩 GCC13              Pass: 100%/11  | Total:  2h 42m | Avg: 14m 44s | Max: 24m 21s | Hits:  99%/13365 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 59m 10s | Avg: 29m 35s | Max: 30m 41s | Hits:  15%/2080  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  1h 03m | Avg: 31m 52s | Max: 34m 15s | Hits:  15%/2080  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 20m 18s | Avg: 10m 09s | Max: 10m 37s | Hits:  98%/2248  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 08m | Avg:  7m 34s | Max: 23m 25s | Hits: 100%/20333 
      🟩 GCC                Pass: 100%/22  | Total:  3h 51m | Avg: 10m 31s | Max: 24m 21s | Hits:  99%/26744 
      🟩 MSVC               Pass: 100%/4   | Total:  2h 02m | Avg: 30m 43s | Max: 34m 15s | Hits:  15%/4160  
      🟩 NVHPC              Pass: 100%/2   | Total: 20m 18s | Avg: 10m 09s | Max: 10m 37s | Hits:  98%/2248  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total: 50m 43s | Avg: 16m 54s | Max: 24m 13s | Hits:  99%/3645  
      🟩 rtx2080            Pass: 100%/34  | Total:  5h 12m | Avg:  9m 11s | Max: 34m 15s | Hits:  91%/40120 
      🟩 rtxa6000           Pass: 100%/8   | Total:  2h 20m | Avg: 17m 34s | Max: 24m 21s | Hits:  99%/9720  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  5h 30m | Avg:  8m 55s | Max: 34m 15s | Hits:  91%/43765 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 22m 22s | Avg: 22m 22s | Max: 22m 22s | Hits:  99%/1215  
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 18s | Avg: 17m 18s | Max: 17m 18s | Hits:  99%/1215  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 11m | Avg: 23m 59s | Max: 24m 21s | Hits:  99%/3645  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 01m | Avg: 20m 36s | Max: 21m 41s | Hits:  99%/3645  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 50m 43s | Avg: 16m 54s | Max: 24m 13s | Hits:  99%/3645  
      🟩 90;90a;100         Pass: 100%/1   | Total:  7m 05s | Avg:  7m 05s | Max:  7m 05s | Hits:  99%/1215  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  3h 16m | Avg:  9m 49s | Max: 30m 41s | Hits:  88%/23535 
      🟩 20                 Pass: 100%/25  | Total:  5h 07m | Avg: 12m 17s | Max: 34m 15s | Hits:  96%/29950 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 8h 40m | Avg: 11m 33s | Max: 31m 36s | Hits: 91%/80136

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 20m 33s | Avg: 10m 16s | Max: 11m 11s | Hits:  96%/3564  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  8h 21m | Avg: 11m 40s | Max: 31m 36s | Hits:  91%/76573 
      🟩 arm64              Pass: 100%/2   | Total: 18m 28s | Avg:  9m 14s | Max:  9m 52s | Hits:  92%/3563  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 01m | Avg: 12m 19s | Max: 26m 02s | Hits:  88%/8901  
      🟩 12.5               Pass: 100%/2   | Total: 39m 40s | Avg: 19m 50s | Max: 20m 32s | Hits:  92%/3562  
      🟩 12.8               Pass: 100%/38  | Total:  6h 59m | Avg: 11m 01s | Max: 31m 36s | Hits:  92%/67673 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 30s | Avg:  5m 15s | Max:  5m 17s | Hits:  99%/3562  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 01m | Avg: 12m 19s | Max: 26m 02s | Hits:  88%/8901  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 39m 40s | Avg: 19m 50s | Max: 20m 32s | Hits:  92%/3562  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  6h 48m | Avg: 11m 21s | Max: 31m 36s | Hits:  91%/64111 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 30s | Avg:  5m 15s | Max:  5m 17s | Hits:  99%/3562  
      🟩 nvcc               Pass: 100%/43  | Total:  8h 29m | Avg: 11m 51s | Max: 31m 36s | Hits:  91%/76574 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 35m 03s | Avg:  8m 45s | Max:  9m 11s | Hits:  93%/7124  
      🟩 Clang15            Pass: 100%/2   | Total: 18m 10s | Avg:  9m 05s | Max:  9m 10s | Hits:  93%/3562  
      🟩 Clang16            Pass: 100%/2   | Total: 19m 03s | Avg:  9m 31s | Max:  9m 32s | Hits:  93%/3562  
      🟩 Clang17            Pass: 100%/2   | Total: 17m 36s | Avg:  8m 48s | Max:  8m 55s | Hits:  93%/3562  
      🟩 Clang18            Pass: 100%/7   | Total: 55m 37s | Avg:  7m 56s | Max: 10m 18s | Hits:  96%/12467 
      🟩 GCC7               Pass: 100%/2   | Total: 18m 34s | Avg:  9m 17s | Max:  9m 50s | Hits:  93%/3564  
      🟩 GCC8               Pass: 100%/1   | Total:  9m 36s | Avg:  9m 36s | Max:  9m 36s | Hits:  92%/1782  
      🟩 GCC9               Pass: 100%/2   | Total: 20m 10s | Avg: 10m 05s | Max: 10m 30s | Hits:  93%/3564  
      🟩 GCC10              Pass: 100%/2   | Total: 18m 56s | Avg:  9m 28s | Max:  9m 50s | Hits:  93%/3564  
      🟩 GCC11              Pass: 100%/2   | Total: 19m 08s | Avg:  9m 34s | Max:  9m 37s | Hits:  92%/3564  
      🟩 GCC12              Pass: 100%/2   | Total: 20m 23s | Avg: 10m 11s | Max: 10m 17s | Hits:  92%/3564  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 38m | Avg:  9m 48s | Max: 11m 25s | Hits:  95%/17820 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 49m 24s | Avg: 24m 42s | Max: 26m 02s | Hits:  70%/3550  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  1h 20m | Avg: 26m 59s | Max: 31m 36s | Hits:  70%/5325  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 39m 40s | Avg: 19m 50s | Max: 20m 32s | Hits:  92%/3562  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 25m | Avg:  8m 33s | Max: 10m 18s | Hits:  94%/30277 
      🟩 GCC                Pass: 100%/21  | Total:  3h 24m | Avg:  9m 45s | Max: 11m 25s | Hits:  94%/37422 
      🟩 MSVC               Pass: 100%/5   | Total:  2h 10m | Avg: 26m 04s | Max: 31m 36s | Hits:  70%/8875  
      🟩 NVHPC              Pass: 100%/2   | Total: 39m 40s | Avg: 19m 50s | Max: 20m 32s | Hits:  92%/3562  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 17m 35s | Avg:  8m 47s | Max: 10m 46s | Hits:  96%/3564  
      🟩 rtx2080            Pass: 100%/33  | Total:  6h 07m | Avg: 11m 08s | Max: 26m 02s | Hits:  91%/58769 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 14m | Avg: 13m 29s | Max: 31m 36s | Hits:  92%/17803 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total:  7h 10m | Avg: 11m 19s | Max: 26m 02s | Hits:  91%/67671 
      🟩 TestCPU            Pass: 100%/3   | Total: 46m 43s | Avg: 15m 34s | Max: 31m 36s | Hits:  90%/5338  
      🟩 TestGPU            Pass: 100%/4   | Total: 43m 40s | Avg: 10m 55s | Max: 11m 25s | Hits:  99%/7127  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 17m 35s | Avg:  8m 47s | Max: 10m 46s | Hits:  96%/3564  
      🟩 90;90a;100         Pass: 100%/1   | Total: 10m 34s | Avg: 10m 34s | Max: 10m 34s | Hits:  93%/1782  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  3h 58m | Avg: 11m 56s | Max: 26m 02s | Hits:  89%/35611 
      🟩 20                 Pass: 100%/23  | Total:  4h 20m | Avg: 11m 20s | Max: 31m 36s | Hits:  92%/40961 
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 14m 48s | Avg: 7m 24s | Max: 12m 31s | Hits: 98%/308

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 14m 48s | Avg:  7m 24s | Max: 12m 31s | Hits:  98%/308   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 14m 48s | Avg:  7m 24s | Max: 12m 31s | Hits:  98%/308   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 14m 48s | Avg:  7m 24s | Max: 12m 31s | Hits:  98%/308   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 14m 48s | Avg:  7m 24s | Max: 12m 31s | Hits:  98%/308   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 14m 48s | Avg:  7m 24s | Max: 12m 31s | Hits:  98%/308   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 14m 48s | Avg:  7m 24s | Max: 12m 31s | Hits:  98%/308   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 14m 48s | Avg:  7m 24s | Max: 12m 31s | Hits:  98%/308   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 17s | Avg:  2m 17s | Max:  2m 17s | Hits:  98%/154   
      🟩 Test               Pass: 100%/1   | Total: 12m 31s | Avg: 12m 31s | Max: 12m 31s | Hits:  98%/154   
    
  • 🟩 python: Pass: 100%/1 | Total: 49m 26s | Avg: 49m 26s | Max: 49m 26s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 49m 26s | Avg: 49m 26s | Max: 49m 26s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 49m 26s | Avg: 49m 26s | Max: 49m 26s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 49m 26s | Avg: 49m 26s | Max: 49m 26s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 49m 26s | Avg: 49m 26s | Max: 49m 26s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 49m 26s | Avg: 49m 26s | Max: 49m 26s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 49m 26s | Avg: 49m 26s | Max: 49m 26s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 49m 26s | Avg: 49m 26s | Max: 49m 26s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 49m 26s | Avg: 49m 26s | Max: 49m 26s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
python
+/- CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 93)

# Runner
66 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

@bernhardmgruber bernhardmgruber merged commit a52e626 into NVIDIA:main Feb 26, 2025
106 of 109 checks passed
@bernhardmgruber bernhardmgruber deleted the ref_bench_min_max branch February 26, 2025 14:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

5 participants