Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Refactor cub::RegBoundScaling and cub::MemBoundScaling #3664

Closed

Conversation

fbusato
Copy link
Contributor

@fbusato fbusato commented Feb 4, 2025

Fixes #3663

Description

Note: First two points already covered by PR #3685

  1. * Deprecate cub::RegBoundScaling and cub::MemBoundScaling
  2. * Forward them to cub::detail namespace implementation
  3. Refactor internal code

Could be backported to 2.8

@fbusato fbusato added 2.8.0 target for 2.8.0 release 3.0 Targeted for 3.0 release labels Feb 4, 2025
@fbusato fbusato self-assigned this Feb 4, 2025
@fbusato fbusato requested review from a team as code owners February 4, 2025 01:23
@fbusato fbusato changed the title Deprecate cub::RegBoundScaling and cub::MemBoundScaling Deprecate cub::RegBoundScaling and cub::MemBoundScaling Feb 4, 2025
Copy link
Contributor

github-actions bot commented Feb 4, 2025

🟨 CI finished in 1h 46m: Pass: 91%/90 | Total: 2d 16h | Avg: 43m 09s | Max: 1h 18m | Hits: 186%/9230
  • 🟨 cub: Pass: 86%/44 | Total: 1d 16h | Avg: 55m 17s | Max: 1h 18m

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  85%/42  | Total:  1d 14h | Avg: 54m 54s | Max:  1h 18m
      🟩 arm64              Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 04m
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 07m
      🔍 nvcc               Pass:  85%/42  | Total:  1d 14h | Avg: 54m 44s | Max:  1h 18m
    🟨 ctk
      🟨 12.0               Pass:  80%/5   | Total:  5h 05m | Avg:  1h 01m | Max:  1h 04m
      🟩 12.5               Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 12m
      🟨 12.8               Pass:  86%/37  | Total:  1d 09h | Avg: 53m 37s | Max:  1h 18m
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 07m
      🟨 nvcc12.0           Pass:  80%/5   | Total:  5h 05m | Avg:  1h 01m | Max:  1h 04m
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 12m
      🟨 nvcc12.8           Pass:  85%/35  | Total:  1d 06h | Avg: 52m 51s | Max:  1h 18m
    🟨 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 56m | Avg: 59m 12s | Max:  1h 02m
      🟩 Clang15            Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 05m
      🟩 Clang16            Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 02m
      🟩 Clang17            Pass: 100%/2   | Total:  1h 54m | Avg: 57m 22s | Max: 58m 01s
      🟨 Clang18            Pass:  85%/7   | Total:  6h 06m | Avg: 52m 17s | Max:  1h 07m
      🟩 GCC7               Pass: 100%/2   | Total:  1h 56m | Avg: 58m 06s | Max: 58m 30s
      🟩 GCC8               Pass: 100%/1   | Total: 58m 03s | Avg: 58m 03s | Max: 58m 03s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 59m | Avg: 59m 49s | Max: 59m 57s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 55m | Avg: 57m 36s | Max: 58m 32s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 58m | Avg: 59m 07s | Max:  1h 00m
      🟩 GCC12              Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 03m
      🟨 GCC13              Pass:  90%/10  | Total:  6h 39m | Avg: 39m 58s | Max:  1h 18m
      🟥 MSVC14.29          Pass:   0%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 04m
      🟥 MSVC14.39          Pass:   0%/2   | Total:  2h 26m | Avg:  1h 13m | Max:  1h 15m
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 12m
    🟨 cxx_family
      🟨 Clang              Pass:  94%/17  | Total: 16h 04m | Avg: 56m 42s | Max:  1h 07m
      🟨 GCC                Pass:  95%/21  | Total: 17h 30m | Avg: 50m 02s | Max:  1h 18m
      🟥 MSVC               Pass:   0%/4   | Total:  4h 34m | Avg:  1h 08m | Max:  1h 15m
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 12m
    🟨 gpu
      🟩 h100               Pass: 100%/2   | Total: 51m 39s | Avg: 25m 49s | Max: 27m 59s
      🟨 rtx2080            Pass:  88%/34  | Total:  1d 11h | Avg:  1h 02m | Max:  1h 18m
      🟨 rtxa6000           Pass:  75%/8   | Total:  4h 20m | Avg: 32m 32s | Max:  1h 07m
    🟨 jobs
      🟨 Build              Pass:  89%/37  | Total:  1d 14h | Avg:  1h 01m | Max:  1h 18m
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 02s | Avg: 21m 02s | Max: 21m 02s
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 22s | Avg: 16m 22s | Max: 16m 22s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 13m | Avg: 24m 25s | Max: 25m 19s
      🟥 TestGPU            Pass:   0%/2   | Total: 40m 38s | Avg: 20m 19s | Max: 20m 19s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 51m 39s | Avg: 25m 49s | Max: 27m 59s
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 18m | Avg:  1h 18m | Max:  1h 18m
    🟨 std
      🟨 17                 Pass:  85%/20  | Total: 20h 18m | Avg:  1h 00m | Max:  1h 12m
      🟨 20                 Pass:  87%/24  | Total: 20h 14m | Avg: 50m 35s | Max:  1h 18m
    
  • 🟨 cccl_c_parallel: Pass: 50%/2 | Total: 6m 22s | Avg: 3m 11s | Max: 4m 06s

    🚨 jobs: Test 🚨
      🟩 Build              Pass: 100%/1   | Total:  2m 16s | Avg:  2m 16s | Max:  2m 16s
      🔥 Test               Pass:   0%/1   | Total:  4m 06s | Avg:  4m 06s | Max:  4m 06s
    🟨 cpu
      🟨 amd64              Pass:  50%/2   | Total:  6m 22s | Avg:  3m 11s | Max:  4m 06s
    🟨 ctk
      🟨 12.8               Pass:  50%/2   | Total:  6m 22s | Avg:  3m 11s | Max:  4m 06s
    🟨 cudacxx
      🟨 nvcc12.8           Pass:  50%/2   | Total:  6m 22s | Avg:  3m 11s | Max:  4m 06s
    🟨 cudacxx_family
      🟨 nvcc               Pass:  50%/2   | Total:  6m 22s | Avg:  3m 11s | Max:  4m 06s
    🟨 cxx
      🟨 GCC13              Pass:  50%/2   | Total:  6m 22s | Avg:  3m 11s | Max:  4m 06s
    🟨 cxx_family
      🟨 GCC                Pass:  50%/2   | Total:  6m 22s | Avg:  3m 11s | Max:  4m 06s
    🟨 gpu
      🟨 rtx2080            Pass:  50%/2   | Total:  6m 22s | Avg:  3m 11s | Max:  4m 06s
    
  • 🟥 python: Pass: 0%/1 | Total: 7m 39s | Avg: 7m 39s | Max: 7m 39s

    🟥 cpu
      🟥 amd64              Pass:   0%/1   | Total:  7m 39s | Avg:  7m 39s | Max:  7m 39s
    🟥 ctk
      🟥 12.8               Pass:   0%/1   | Total:  7m 39s | Avg:  7m 39s | Max:  7m 39s
    🟥 cudacxx
      🟥 nvcc12.8           Pass:   0%/1   | Total:  7m 39s | Avg:  7m 39s | Max:  7m 39s
    🟥 cudacxx_family
      🟥 nvcc               Pass:   0%/1   | Total:  7m 39s | Avg:  7m 39s | Max:  7m 39s
    🟥 cxx
      🟥 GCC13              Pass:   0%/1   | Total:  7m 39s | Avg:  7m 39s | Max:  7m 39s
    🟥 cxx_family
      🟥 GCC                Pass:   0%/1   | Total:  7m 39s | Avg:  7m 39s | Max:  7m 39s
    🟥 gpu
      🟥 rtx2080            Pass:   0%/1   | Total:  7m 39s | Avg:  7m 39s | Max:  7m 39s
    🟥 jobs
      🟥 Test               Pass:   0%/1   | Total:  7m 39s | Avg:  7m 39s | Max:  7m 39s
    
  • 🟩 thrust: Pass: 100%/43 | Total: 23h 57m | Avg: 33m 26s | Max: 1h 02m | Hits: 186%/9230

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 37m 53s | Avg: 18m 56s | Max: 26m 42s
    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total: 22h 58m | Avg: 33m 36s | Max:  1h 02m | Hits: 186%/9230  
      🟩 arm64              Pass: 100%/2   | Total: 59m 36s | Avg: 29m 48s | Max: 31m 07s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 10m | Avg: 38m 04s | Max:  1h 00m | Hits: 141%/1846  
      🟩 12.5               Pass: 100%/2   | Total:  1h 50m | Avg: 55m 03s | Max: 55m 17s
      🟩 12.8               Pass: 100%/36  | Total: 18h 57m | Avg: 31m 35s | Max:  1h 02m | Hits: 197%/7384  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 56m 34s | Avg: 28m 17s | Max: 28m 40s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 10m | Avg: 38m 04s | Max:  1h 00m | Hits: 141%/1846  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 50m | Avg: 55m 03s | Max: 55m 17s
      🟩 nvcc12.8           Pass: 100%/34  | Total: 18h 00m | Avg: 31m 47s | Max:  1h 02m | Hits: 197%/7384  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 56m 34s | Avg: 28m 17s | Max: 28m 40s
      🟩 nvcc               Pass: 100%/41  | Total: 23h 01m | Avg: 33m 41s | Max:  1h 02m | Hits: 186%/9230  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 12m | Avg: 33m 14s | Max: 34m 13s
      🟩 Clang15            Pass: 100%/2   | Total:  1h 00m | Avg: 30m 21s | Max: 30m 57s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 03m | Avg: 31m 30s | Max: 32m 01s
      🟩 Clang17            Pass: 100%/2   | Total:  1h 05m | Avg: 32m 52s | Max: 33m 34s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 49m | Avg: 24m 11s | Max: 33m 46s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 03m | Avg: 31m 55s | Max: 33m 07s
      🟩 GCC8               Pass: 100%/1   | Total: 32m 51s | Avg: 32m 51s | Max: 32m 51s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 08m | Avg: 34m 25s | Max: 36m 41s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 05m | Avg: 32m 51s | Max: 33m 52s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 04m | Avg: 32m 22s | Max: 32m 33s
      🟩 GCC12              Pass: 100%/2   | Total:  1h 15m | Avg: 37m 32s | Max: 38m 03s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 13m | Avg: 24m 11s | Max: 37m 16s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 58m | Avg: 59m 25s | Max:  1h 00m | Hits: 141%/3692  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 32m | Avg: 50m 49s | Max:  1h 02m | Hits: 215%/5538  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 50m | Avg: 55m 03s | Max: 55m 17s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  8h 11m | Avg: 28m 55s | Max: 34m 13s
      🟩 GCC                Pass: 100%/19  | Total:  9h 24m | Avg: 29m 42s | Max: 38m 03s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 31m | Avg: 54m 15s | Max:  1h 02m | Hits: 186%/9230  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 50m | Avg: 55m 03s | Max: 55m 17s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/33  | Total: 19h 55m | Avg: 36m 12s | Max:  1h 00m | Hits: 141%/5538  
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 02m | Avg: 24m 15s | Max:  1h 02m | Hits: 253%/3692  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 22h 35m | Avg: 36m 38s | Max:  1h 02m | Hits: 141%/7384  
      🟩 TestCPU            Pass: 100%/3   | Total: 49m 11s | Avg: 16m 23s | Max: 32m 43s | Hits: 365%/1846  
      🟩 TestGPU            Pass: 100%/3   | Total: 33m 06s | Avg: 11m 02s | Max: 11m 27s
    🟩 sm
      🟩 90;90a;100         Pass: 100%/1   | Total: 33m 50s | Avg: 33m 50s | Max: 33m 50s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 12h 37m | Avg: 37m 51s | Max:  1h 00m | Hits: 141%/5538  
      🟩 20                 Pass: 100%/21  | Total: 10h 42m | Avg: 30m 36s | Max:  1h 02m | Hits: 253%/3692  
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 90)

# Runner
65 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1
1 linux-amd64-gpu-h100-latest-1

Copy link
Contributor

github-actions bot commented Feb 4, 2025

🟨 CI finished in 1h 37m: Pass: 91%/90 | Total: 21h 54m | Avg: 14m 36s | Max: 1h 13m | Hits: 199%/9230
  • 🟨 cub: Pass: 86%/44 | Total: 12h 04m | Avg: 16m 27s | Max: 1h 13m

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  85%/42  | Total: 11h 54m | Avg: 17m 00s | Max:  1h 13m
      🟩 arm64              Pass: 100%/2   | Total:  9m 52s | Avg:  4m 56s | Max:  5m 07s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 59s | Avg:  4m 29s | Max:  4m 31s
      🔍 nvcc               Pass:  85%/42  | Total: 11h 55m | Avg: 17m 01s | Max:  1h 13m
    🟨 ctk
      🟨 12.0               Pass:  80%/5   | Total:  1h 17m | Avg: 15m 34s | Max: 57m 13s
      🟩 12.5               Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 13m
      🟨 12.8               Pass:  86%/37  | Total:  8h 22m | Avg: 13m 35s | Max:  1h 08m
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 59s | Avg:  4m 29s | Max:  4m 31s
      🟨 nvcc12.0           Pass:  80%/5   | Total:  1h 17m | Avg: 15m 34s | Max: 57m 13s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 13m
      🟨 nvcc12.8           Pass:  85%/35  | Total:  8h 13m | Avg: 14m 06s | Max:  1h 08m
    🟨 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 23s | Avg:  5m 20s | Max:  5m 50s
      🟩 Clang15            Pass: 100%/2   | Total: 12m 01s | Avg:  6m 00s | Max:  6m 07s
      🟩 Clang16            Pass: 100%/2   | Total: 11m 18s | Avg:  5m 39s | Max:  5m 45s
      🟩 Clang17            Pass: 100%/2   | Total: 11m 08s | Avg:  5m 34s | Max:  5m 46s
      🟨 Clang18            Pass:  85%/7   | Total:  1h 07m | Avg:  9m 36s | Max: 23m 51s
      🟩 GCC7               Pass: 100%/2   | Total: 11m 00s | Avg:  5m 30s | Max:  5m 54s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 52s | Avg:  5m 52s | Max:  5m 52s
      🟩 GCC9               Pass: 100%/2   | Total: 10m 57s | Avg:  5m 28s | Max:  5m 41s
      🟩 GCC10              Pass: 100%/2   | Total: 11m 30s | Avg:  5m 45s | Max:  5m 50s
      🟩 GCC11              Pass: 100%/2   | Total: 11m 59s | Avg:  5m 59s | Max:  6m 01s
      🟩 GCC12              Pass: 100%/2   | Total: 11m 55s | Avg:  5m 57s | Max:  6m 00s
      🟨 GCC13              Pass:  90%/10  | Total:  2h 14m | Avg: 13m 28s | Max: 24m 23s
      🟥 MSVC14.29          Pass:   0%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 05m
      🟥 MSVC14.39          Pass:   0%/2   | Total:  2h 16m | Avg:  1h 08m | Max:  1h 08m
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 13m
    🟨 cxx_family
      🟨 Clang              Pass:  94%/17  | Total:  2h 03m | Avg:  7m 14s | Max: 23m 51s
      🟨 GCC                Pass:  95%/21  | Total:  3h 17m | Avg:  9m 25s | Max: 24m 23s
      🟥 MSVC               Pass:   0%/4   | Total:  4h 19m | Avg:  1h 04m | Max:  1h 08m
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 13m
    🟨 gpu
      🟩 h100               Pass: 100%/2   | Total: 29m 07s | Avg: 14m 33s | Max: 24m 23s
      🟨 rtx2080            Pass:  88%/34  | Total:  9h 18m | Avg: 16m 26s | Max:  1h 13m
      🟨 rtxa6000           Pass:  75%/8   | Total:  2h 16m | Avg: 17m 03s | Max: 23m 51s
    🟨 jobs
      🟨 Build              Pass:  89%/37  | Total:  9h 35m | Avg: 15m 33s | Max:  1h 13m
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 19m 27s | Avg: 19m 27s | Max: 19m 27s
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 18s | Avg: 17m 18s | Max: 17m 18s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 11m | Avg: 23m 52s | Max: 24m 23s
      🟥 TestGPU            Pass:   0%/2   | Total: 40m 26s | Avg: 20m 13s | Max: 22m 04s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 29m 07s | Avg: 14m 33s | Max: 24m 23s
      🟩 90;90a;100         Pass: 100%/1   | Total:  6m 20s | Avg:  6m 20s | Max:  6m 20s
    🟨 std
      🟨 17                 Pass:  85%/20  | Total:  5h 53m | Avg: 17m 41s | Max:  1h 13m
      🟨 20                 Pass:  87%/24  | Total:  6h 10m | Avg: 15m 25s | Max:  1h 10m
    
  • 🟨 cccl_c_parallel: Pass: 50%/2 | Total: 9m 57s | Avg: 4m 58s | Max: 7m 59s

    🚨 jobs: Test 🚨
      🟩 Build              Pass: 100%/1   | Total:  1m 58s | Avg:  1m 58s | Max:  1m 58s
      🔥 Test               Pass:   0%/1   | Total:  7m 59s | Avg:  7m 59s | Max:  7m 59s
    🟨 cpu
      🟨 amd64              Pass:  50%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 59s
    🟨 ctk
      🟨 12.8               Pass:  50%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 59s
    🟨 cudacxx
      🟨 nvcc12.8           Pass:  50%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 59s
    🟨 cudacxx_family
      🟨 nvcc               Pass:  50%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 59s
    🟨 cxx
      🟨 GCC13              Pass:  50%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 59s
    🟨 cxx_family
      🟨 GCC                Pass:  50%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 59s
    🟨 gpu
      🟨 rtx2080            Pass:  50%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 59s
    
  • 🟥 python: Pass: 0%/1 | Total: 4m 52s | Avg: 4m 52s | Max: 4m 52s

    🟥 cpu
      🟥 amd64              Pass:   0%/1   | Total:  4m 52s | Avg:  4m 52s | Max:  4m 52s
    🟥 ctk
      🟥 12.8               Pass:   0%/1   | Total:  4m 52s | Avg:  4m 52s | Max:  4m 52s
    🟥 cudacxx
      🟥 nvcc12.8           Pass:   0%/1   | Total:  4m 52s | Avg:  4m 52s | Max:  4m 52s
    🟥 cudacxx_family
      🟥 nvcc               Pass:   0%/1   | Total:  4m 52s | Avg:  4m 52s | Max:  4m 52s
    🟥 cxx
      🟥 GCC13              Pass:   0%/1   | Total:  4m 52s | Avg:  4m 52s | Max:  4m 52s
    🟥 cxx_family
      🟥 GCC                Pass:   0%/1   | Total:  4m 52s | Avg:  4m 52s | Max:  4m 52s
    🟥 gpu
      🟥 rtx2080            Pass:   0%/1   | Total:  4m 52s | Avg:  4m 52s | Max:  4m 52s
    🟥 jobs
      🟥 Test               Pass:   0%/1   | Total:  4m 52s | Avg:  4m 52s | Max:  4m 52s
    
  • 🟩 thrust: Pass: 100%/43 | Total: 9h 35m | Avg: 13m 23s | Max: 1h 01m | Hits: 199%/9230

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 17m 10s | Avg:  8m 35s | Max: 10m 49s
    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total:  9h 25m | Avg: 13m 47s | Max:  1h 01m | Hits: 199%/9230  
      🟩 arm64              Pass: 100%/2   | Total: 10m 02s | Avg:  5m 01s | Max:  5m 14s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 08m | Avg: 13m 43s | Max: 47m 24s | Hits: 157%/1846  
      🟩 12.5               Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m
      🟩 12.8               Pass: 100%/36  | Total:  6h 25m | Avg: 10m 41s | Max: 55m 47s | Hits: 209%/7384  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 49s | Avg:  5m 24s | Max:  5m 36s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 08m | Avg: 13m 43s | Max: 47m 24s | Hits: 157%/1846  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m
      🟩 nvcc12.8           Pass: 100%/34  | Total:  6h 14m | Avg: 11m 00s | Max: 55m 47s | Hits: 209%/7384  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 49s | Avg:  5m 24s | Max:  5m 36s
      🟩 nvcc               Pass: 100%/41  | Total:  9h 24m | Avg: 13m 46s | Max:  1h 01m | Hits: 199%/9230  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 07s | Avg:  5m 16s | Max:  5m 40s
      🟩 Clang15            Pass: 100%/2   | Total: 11m 33s | Avg:  5m 46s | Max:  5m 49s
      🟩 Clang16            Pass: 100%/2   | Total: 12m 02s | Avg:  6m 01s | Max:  6m 05s
      🟩 Clang17            Pass: 100%/2   | Total: 11m 19s | Avg:  5m 39s | Max:  5m 55s
      🟩 Clang18            Pass: 100%/7   | Total: 45m 02s | Avg:  6m 26s | Max: 10m 33s
      🟩 GCC7               Pass: 100%/2   | Total: 11m 31s | Avg:  5m 45s | Max:  5m 55s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 37s | Avg:  5m 37s | Max:  5m 37s
      🟩 GCC9               Pass: 100%/2   | Total: 11m 21s | Avg:  5m 40s | Max:  5m 51s
      🟩 GCC10              Pass: 100%/2   | Total: 11m 44s | Avg:  5m 52s | Max:  6m 00s
      🟩 GCC11              Pass: 100%/2   | Total: 11m 51s | Avg:  5m 55s | Max:  6m 15s
      🟩 GCC12              Pass: 100%/2   | Total: 12m 43s | Avg:  6m 21s | Max:  6m 35s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 00m | Avg:  7m 36s | Max: 10m 49s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 31m | Avg: 45m 58s | Max: 47m 24s | Hits: 157%/3692  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 15m | Avg: 45m 02s | Max: 55m 47s | Hits: 226%/5538  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  1h 41m | Avg:  5m 56s | Max: 10m 33s
      🟩 GCC                Pass: 100%/19  | Total:  2h 05m | Avg:  6m 36s | Max: 10m 49s
      🟩 MSVC               Pass: 100%/5   | Total:  3h 47m | Avg: 45m 25s | Max: 55m 47s | Hits: 199%/9230  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/33  | Total:  7h 01m | Avg: 12m 46s | Max:  1h 01m | Hits: 157%/5538  
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 33m | Avg: 15m 23s | Max: 55m 47s | Hits: 261%/3692  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  8h 15m | Avg: 13m 23s | Max:  1h 01m | Hits: 157%/7384  
      🟩 TestCPU            Pass: 100%/3   | Total: 48m 16s | Avg: 16m 05s | Max: 31m 43s | Hits: 365%/1846  
      🟩 TestGPU            Pass: 100%/3   | Total: 32m 05s | Avg: 10m 41s | Max: 10m 49s
    🟩 sm
      🟩 90;90a;100         Pass: 100%/1   | Total:  6m 33s | Avg:  6m 33s | Max:  6m 33s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  4h 51m | Avg: 14m 34s | Max:  1h 00m | Hits: 157%/5538  
      🟩 20                 Pass: 100%/21  | Total:  4h 27m | Avg: 12m 43s | Max:  1h 01m | Hits: 261%/3692  
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 90)

# Runner
65 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1
1 linux-amd64-gpu-h100-latest-1

Copy link
Contributor

github-actions bot commented Feb 4, 2025

🟨 CI finished in 1h 45m: Pass: 95%/90 | Total: 2d 17h | Avg: 43m 50s | Max: 1h 14m | Hits: 186%/9230
  • 🟨 cub: Pass: 90%/44 | Total: 1d 17h | Avg: 55m 59s | Max: 1h 14m

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  90%/42  | Total:  1d 15h | Avg: 55m 43s | Max:  1h 14m
      🟩 arm64              Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 02m
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 06m
      🔍 nvcc               Pass:  90%/42  | Total:  1d 14h | Avg: 55m 37s | Max:  1h 14m
    🚨 cxx_family: MSVC 🚨
      🟩 Clang              Pass: 100%/17  | Total: 16h 05m | Avg: 56m 49s | Max:  1h 07m
      🟩 GCC                Pass: 100%/21  | Total: 18h 00m | Avg: 51m 27s | Max:  1h 12m
      🔥 MSVC               Pass:   0%/4   | Total:  4h 28m | Avg:  1h 07m | Max:  1h 13m
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 14m
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/2   | Total: 51m 10s | Avg: 25m 35s | Max: 27m 23s
      🔍 rtx2080            Pass:  88%/34  | Total:  1d 12h | Avg:  1h 03m | Max:  1h 14m
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 07m | Avg: 30m 54s | Max:  1h 02m
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  89%/37  | Total:  1d 14h | Avg:  1h 02m | Max:  1h 14m
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 20m 52s | Avg: 20m 52s | Max: 20m 52s
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 20s | Avg: 16m 20s | Max: 16m 20s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 10m | Avg: 23m 39s | Max: 24m 01s
      🟩 TestGPU            Pass: 100%/2   | Total: 39m 08s | Avg: 19m 34s | Max: 20m 35s
    🟨 ctk
      🟨 12.0               Pass:  80%/5   | Total:  5h 10m | Avg:  1h 02m | Max:  1h 07m
      🟩 12.5               Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 14m
      🟨 12.8               Pass:  91%/37  | Total:  1d 09h | Avg: 54m 11s | Max:  1h 13m
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 06m
      🟨 nvcc12.0           Pass:  80%/5   | Total:  5h 10m | Avg:  1h 02m | Max:  1h 07m
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 14m
      🟨 nvcc12.8           Pass:  91%/35  | Total:  1d 07h | Avg: 53m 38s | Max:  1h 13m
    🟨 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 03m | Avg:  1h 00m | Max:  1h 07m
      🟩 Clang15            Pass: 100%/2   | Total:  1h 56m | Avg: 58m 14s | Max: 59m 36s
      🟩 Clang16            Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 03m
      🟩 Clang17            Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 05m
      🟩 Clang18            Pass: 100%/7   | Total:  5h 58m | Avg: 51m 10s | Max:  1h 06m
      🟩 GCC7               Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 02m
      🟩 GCC8               Pass: 100%/1   | Total: 58m 12s | Avg: 58m 12s | Max: 58m 12s
      🟩 GCC9               Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 04m
      🟩 GCC10              Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 06m
      🟩 GCC11              Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 03m
      🟩 GCC12              Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 05m
      🟩 GCC13              Pass: 100%/10  | Total:  6h 34m | Avg: 39m 26s | Max:  1h 12m
      🟥 MSVC14.29          Pass:   0%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 05m
      🟥 MSVC14.39          Pass:   0%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 13m
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 14m
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 51m 10s | Avg: 25m 35s | Max: 27m 23s
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 12m | Avg:  1h 12m | Max:  1h 12m
    🟨 std
      🟨 17                 Pass:  85%/20  | Total: 20h 59m | Avg:  1h 02m | Max:  1h 14m
      🟨 20                 Pass:  95%/24  | Total: 20h 03m | Avg: 50m 09s | Max:  1h 14m
    
  • 🟩 thrust: Pass: 100%/43 | Total: 1d 00h | Avg: 33m 39s | Max: 1h 03m | Hits: 186%/9230

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 36m 58s | Avg: 18m 29s | Max: 25m 48s
    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total: 23h 07m | Avg: 33m 49s | Max:  1h 03m | Hits: 186%/9230  
      🟩 arm64              Pass: 100%/2   | Total:  1h 00m | Avg: 30m 03s | Max: 31m 27s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 04m | Avg: 36m 55s | Max: 53m 12s | Hits: 141%/1846  
      🟩 12.5               Pass: 100%/2   | Total:  1h 53m | Avg: 56m 56s | Max: 58m 25s
      🟩 12.8               Pass: 100%/36  | Total: 19h 08m | Avg: 31m 54s | Max:  1h 03m | Hits: 197%/7384  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 54m 17s | Avg: 27m 08s | Max: 28m 03s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 04m | Avg: 36m 55s | Max: 53m 12s | Hits: 141%/1846  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 53m | Avg: 56m 56s | Max: 58m 25s
      🟩 nvcc12.8           Pass: 100%/34  | Total: 18h 14m | Avg: 32m 11s | Max:  1h 03m | Hits: 197%/7384  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 54m 17s | Avg: 27m 08s | Max: 28m 03s
      🟩 nvcc               Pass: 100%/41  | Total: 23h 12m | Avg: 33m 58s | Max:  1h 03m | Hits: 186%/9230  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 06m | Avg: 31m 39s | Max: 32m 13s
      🟩 Clang15            Pass: 100%/2   | Total:  1h 05m | Avg: 32m 52s | Max: 33m 44s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 07m | Avg: 33m 59s | Max: 34m 28s
      🟩 Clang17            Pass: 100%/2   | Total:  1h 03m | Avg: 31m 44s | Max: 33m 33s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 48m | Avg: 24m 05s | Max: 35m 01s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 06m | Avg: 33m 04s | Max: 33m 55s
      🟩 GCC8               Pass: 100%/1   | Total: 31m 57s | Avg: 31m 57s | Max: 31m 57s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 13m | Avg: 36m 58s | Max: 38m 11s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 09m | Avg: 34m 30s | Max: 34m 38s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 07m | Avg: 33m 43s | Max: 35m 28s
      🟩 GCC12              Pass: 100%/2   | Total:  1h 14m | Avg: 37m 11s | Max: 38m 06s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 15m | Avg: 24m 23s | Max: 36m 39s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 49m | Avg: 54m 43s | Max: 56m 14s | Hits: 141%/3692  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 33m | Avg: 51m 07s | Max:  1h 03m | Hits: 215%/5538  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 53m | Avg: 56m 56s | Max: 58m 25s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  8h 12m | Avg: 28m 58s | Max: 35m 01s
      🟩 GCC                Pass: 100%/19  | Total:  9h 37m | Avg: 30m 25s | Max: 38m 11s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 22m | Avg: 52m 33s | Max:  1h 03m | Hits: 186%/9230  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 53m | Avg: 56m 56s | Max: 58m 25s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/33  | Total: 20h 04m | Avg: 36m 29s | Max: 58m 25s | Hits: 141%/5538  
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 02m | Avg: 24m 16s | Max:  1h 03m | Hits: 253%/3692  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 22h 45m | Avg: 36m 54s | Max:  1h 03m | Hits: 141%/7384  
      🟩 TestCPU            Pass: 100%/3   | Total: 48m 12s | Avg: 16m 04s | Max: 32m 30s | Hits: 365%/1846  
      🟩 TestGPU            Pass: 100%/3   | Total: 33m 25s | Avg: 11m 08s | Max: 11m 31s
    🟩 sm
      🟩 90;90a;100         Pass: 100%/1   | Total: 36m 39s | Avg: 36m 39s | Max: 36m 39s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 12h 34m | Avg: 37m 44s | Max: 57m 03s | Hits: 141%/5538  
      🟩 20                 Pass: 100%/21  | Total: 10h 55m | Avg: 31m 12s | Max:  1h 03m | Hits: 253%/3692  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 7m 43s | Avg: 3m 51s | Max: 5m 26s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  7m 43s | Avg:  3m 51s | Max:  5m 26s
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total:  7m 43s | Avg:  3m 51s | Max:  5m 26s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total:  7m 43s | Avg:  3m 51s | Max:  5m 26s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  7m 43s | Avg:  3m 51s | Max:  5m 26s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  7m 43s | Avg:  3m 51s | Max:  5m 26s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  7m 43s | Avg:  3m 51s | Max:  5m 26s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total:  7m 43s | Avg:  3m 51s | Max:  5m 26s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 17s | Avg:  2m 17s | Max:  2m 17s
      🟩 Test               Pass: 100%/1   | Total:  5m 26s | Avg:  5m 26s | Max:  5m 26s
    
  • 🟩 python: Pass: 100%/1 | Total: 27m 16s | Avg: 27m 16s | Max: 27m 16s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 27m 16s | Avg: 27m 16s | Max: 27m 16s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 27m 16s | Avg: 27m 16s | Max: 27m 16s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 27m 16s | Avg: 27m 16s | Max: 27m 16s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 27m 16s | Avg: 27m 16s | Max: 27m 16s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 27m 16s | Avg: 27m 16s | Max: 27m 16s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 27m 16s | Avg: 27m 16s | Max: 27m 16s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 27m 16s | Avg: 27m 16s | Max: 27m 16s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 27m 16s | Avg: 27m 16s | Max: 27m 16s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 90)

# Runner
65 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1
1 linux-amd64-gpu-h100-latest-1

Copy link
Contributor

github-actions bot commented Feb 4, 2025

🟨 CI finished in 1h 51m: Pass: 95%/90 | Total: 2d 18h | Avg: 44m 31s | Max: 1h 20m | Hits: 171%/9230
  • 🟨 cub: Pass: 90%/44 | Total: 1d 17h | Avg: 56m 18s | Max: 1h 20m

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  90%/42  | Total:  1d 15h | Avg: 56m 04s | Max:  1h 20m
      🟩 arm64              Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 01m
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 06m
      🔍 nvcc               Pass:  90%/42  | Total:  1d 15h | Avg: 55m 58s | Max:  1h 20m
    🚨 cxx_family: MSVC 🚨
      🟩 Clang              Pass: 100%/17  | Total: 16h 08m | Avg: 56m 59s | Max:  1h 06m
      🟩 GCC                Pass: 100%/21  | Total: 17h 48m | Avg: 50m 53s | Max:  1h 10m
      🔥 MSVC               Pass:   0%/4   | Total:  4h 50m | Avg:  1h 12m | Max:  1h 20m
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 30m | Avg:  1h 15m | Max:  1h 15m
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/2   | Total: 53m 34s | Avg: 26m 47s | Max: 29m 46s
      🔍 rtx2080            Pass:  88%/34  | Total:  1d 12h | Avg:  1h 03m | Max:  1h 20m
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 09m | Avg: 31m 11s | Max:  1h 01m
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  89%/37  | Total:  1d 14h | Avg:  1h 02m | Max:  1h 20m
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 20m 43s | Avg: 20m 43s | Max: 20m 43s
      🟩 GraphCapture       Pass: 100%/1   | Total: 15m 52s | Avg: 15m 52s | Max: 15m 52s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 11m | Avg: 23m 51s | Max: 24m 41s
      🟩 TestGPU            Pass: 100%/2   | Total: 42m 56s | Avg: 21m 28s | Max: 21m 30s
    🟨 ctk
      🟨 12.0               Pass:  80%/5   | Total:  5h 11m | Avg:  1h 02m | Max:  1h 06m
      🟩 12.5               Pass: 100%/2   | Total:  2h 30m | Avg:  1h 15m | Max:  1h 15m
      🟨 12.8               Pass:  91%/37  | Total:  1d 09h | Avg: 54m 29s | Max:  1h 20m
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 06m
      🟨 nvcc12.0           Pass:  80%/5   | Total:  5h 11m | Avg:  1h 02m | Max:  1h 06m
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 30m | Avg:  1h 15m | Max:  1h 15m
      🟨 nvcc12.8           Pass:  91%/35  | Total:  1d 07h | Avg: 53m 58s | Max:  1h 20m
    🟨 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 09m | Avg:  1h 02m | Max:  1h 06m
      🟩 Clang15            Pass: 100%/2   | Total:  1h 58m | Avg: 59m 06s | Max: 59m 57s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 57m | Avg: 58m 39s | Max:  1h 00m
      🟩 Clang17            Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 04m
      🟩 Clang18            Pass: 100%/7   | Total:  5h 55m | Avg: 50m 46s | Max:  1h 06m
      🟩 GCC7               Pass: 100%/2   | Total:  1h 59m | Avg: 59m 36s | Max:  1h 00m
      🟩 GCC8               Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
      🟩 GCC9               Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 02m
      🟩 GCC10              Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 05m
      🟩 GCC11              Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 10m
      🟩 GCC12              Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 05m
      🟩 GCC13              Pass: 100%/10  | Total:  6h 26m | Avg: 38m 36s | Max:  1h 06m
      🟥 MSVC14.29          Pass:   0%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 11m
      🟥 MSVC14.39          Pass:   0%/2   | Total:  2h 36m | Avg:  1h 18m | Max:  1h 20m
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 30m | Avg:  1h 15m | Max:  1h 15m
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 53m 34s | Avg: 26m 47s | Max: 29m 46s
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 06m | Avg:  1h 06m | Max:  1h 06m
    🟨 std
      🟨 17                 Pass:  85%/20  | Total: 21h 19m | Avg:  1h 03m | Max:  1h 15m
      🟨 20                 Pass:  95%/24  | Total: 19h 58m | Avg: 49m 56s | Max:  1h 20m
    
  • 🟩 thrust: Pass: 100%/43 | Total: 1d 00h | Avg: 34m 47s | Max: 1h 12m | Hits: 171%/9230

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 39m 54s | Avg: 19m 57s | Max: 29m 07s
    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total: 23h 55m | Avg: 35m 00s | Max:  1h 12m | Hits: 171%/9230  
      🟩 arm64              Pass: 100%/2   | Total:  1h 00m | Avg: 30m 14s | Max: 31m 17s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 19m | Avg: 39m 48s | Max:  1h 04m | Hits: 137%/1846  
      🟩 12.5               Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 04m
      🟩 12.8               Pass: 100%/36  | Total: 19h 28m | Avg: 32m 27s | Max:  1h 12m | Hits: 179%/7384  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 57m 49s | Avg: 28m 54s | Max: 29m 48s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 19m | Avg: 39m 48s | Max:  1h 04m | Hits: 137%/1846  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 04m
      🟩 nvcc12.8           Pass: 100%/34  | Total: 18h 30m | Avg: 32m 40s | Max:  1h 12m | Hits: 179%/7384  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 57m 49s | Avg: 28m 54s | Max: 29m 48s
      🟩 nvcc               Pass: 100%/41  | Total: 23h 58m | Avg: 35m 04s | Max:  1h 12m | Hits: 171%/9230  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 11m | Avg: 32m 46s | Max: 34m 03s
      🟩 Clang15            Pass: 100%/2   | Total:  1h 03m | Avg: 31m 53s | Max: 33m 20s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 09m | Avg: 34m 54s | Max: 35m 41s
      🟩 Clang17            Pass: 100%/2   | Total:  1h 02m | Avg: 31m 14s | Max: 32m 27s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 47m | Avg: 23m 56s | Max: 31m 20s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 05m | Avg: 32m 54s | Max: 33m 25s
      🟩 GCC8               Pass: 100%/1   | Total: 31m 44s | Avg: 31m 44s | Max: 31m 44s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 08m | Avg: 34m 19s | Max: 34m 27s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 08m | Avg: 34m 28s | Max: 35m 00s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 12m | Avg: 36m 06s | Max: 36m 24s
      🟩 GCC12              Pass: 100%/2   | Total:  1h 09m | Avg: 34m 34s | Max: 35m 24s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 16m | Avg: 24m 32s | Max: 35m 43s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 04m | Hits: 135%/3692  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 51m | Avg: 57m 17s | Max:  1h 12m | Hits: 194%/5538  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 04m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  8h 14m | Avg: 29m 05s | Max: 35m 41s
      🟩 GCC                Pass: 100%/19  | Total:  9h 32m | Avg: 30m 08s | Max: 36m 24s
      🟩 MSVC               Pass: 100%/5   | Total:  5h 00m | Avg:  1h 00m | Max:  1h 12m | Hits: 171%/9230  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 04m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/33  | Total: 20h 47m | Avg: 37m 48s | Max:  1h 07m | Hits: 127%/5538  
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 08m | Avg: 24m 51s | Max:  1h 12m | Hits: 235%/3692  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 23h 34m | Avg: 38m 14s | Max:  1h 12m | Hits: 122%/7384  
      🟩 TestCPU            Pass: 100%/3   | Total: 48m 52s | Avg: 16m 17s | Max: 32m 31s | Hits: 365%/1846  
      🟩 TestGPU            Pass: 100%/3   | Total: 32m 21s | Avg: 10m 47s | Max: 11m 14s
    🟩 sm
      🟩 90;90a;100         Pass: 100%/1   | Total: 34m 36s | Avg: 34m 36s | Max: 34m 36s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 13h 12m | Avg: 39m 36s | Max:  1h 07m | Hits: 127%/5538  
      🟩 20                 Pass: 100%/21  | Total: 11h 03m | Avg: 31m 36s | Max:  1h 12m | Hits: 235%/3692  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 7m 30s | Avg: 3m 45s | Max: 5m 19s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  7m 30s | Avg:  3m 45s | Max:  5m 19s
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total:  7m 30s | Avg:  3m 45s | Max:  5m 19s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total:  7m 30s | Avg:  3m 45s | Max:  5m 19s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  7m 30s | Avg:  3m 45s | Max:  5m 19s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  7m 30s | Avg:  3m 45s | Max:  5m 19s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  7m 30s | Avg:  3m 45s | Max:  5m 19s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total:  7m 30s | Avg:  3m 45s | Max:  5m 19s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 11s | Avg:  2m 11s | Max:  2m 11s
      🟩 Test               Pass: 100%/1   | Total:  5m 19s | Avg:  5m 19s | Max:  5m 19s
    
  • 🟩 python: Pass: 100%/1 | Total: 25m 51s | Avg: 25m 51s | Max: 25m 51s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 25m 51s | Avg: 25m 51s | Max: 25m 51s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 25m 51s | Avg: 25m 51s | Max: 25m 51s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 25m 51s | Avg: 25m 51s | Max: 25m 51s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 25m 51s | Avg: 25m 51s | Max: 25m 51s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 25m 51s | Avg: 25m 51s | Max: 25m 51s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 25m 51s | Avg: 25m 51s | Max: 25m 51s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 25m 51s | Avg: 25m 51s | Max: 25m 51s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 25m 51s | Avg: 25m 51s | Max: 25m 51s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 90)

# Runner
65 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1
1 linux-amd64-gpu-h100-latest-1

@fbusato fbusato removed 2.8.0 target for 2.8.0 release backport branch/2.8.x labels Feb 4, 2025
@fbusato fbusato changed the title Deprecate cub::RegBoundScaling and cub::MemBoundScaling Refactor cub::RegBoundScaling and cub::MemBoundScaling Feb 4, 2025
@fbusato
Copy link
Contributor Author

fbusato commented Feb 10, 2025

main task covered by #3685. Dropping this PR

@fbusato fbusato closed this Feb 10, 2025
@fbusato fbusato deleted the reg-mem-bound-scaling-deprecation branch February 11, 2025 18:25
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
3.0 Targeted for 3.0 release
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

Deprecate cub::RegBoundScaling and cub::MemBoundScaling
3 participants