Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Refactor RLE tuning #3127

Merged
merged 10 commits into from
Dec 12, 2024
Merged

Conversation

bernhardmgruber
Copy link
Contributor

@bernhardmgruber bernhardmgruber commented Dec 11, 2024

  • No SASS changes except kernel symbol names

Sorry, something went wrong.

Copy link
Contributor

🟨 CI finished in 2h 18m: Pass: 95%/94 | Total: 2d 01h | Avg: 31m 26s | Max: 56m 43s | Hits: 92%/9260
  • 🟨 cub: Pass: 91%/45 | Total: 1d 07h | Avg: 41m 35s | Max: 56m 43s

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  90%/43  | Total:  1d 05h | Avg: 41m 22s | Max: 56m 43s
      🟩 arm64              Pass: 100%/2   | Total:  1h 32m | Avg: 46m 17s | Max: 46m 25s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 46m | Avg: 53m 09s | Max: 55m 44s
      🔍 nvcc               Pass:  90%/43  | Total:  1d 05h | Avg: 41m 03s | Max: 56m 43s
    🚨 cxx_family: MSVC 🚨
      🟩 Clang              Pass: 100%/19  | Total: 13h 30m | Avg: 42m 39s | Max: 55m 44s
      🟩 GCC                Pass: 100%/19  | Total: 11h 27m | Avg: 36m 10s | Max: 48m 03s
      🟩 Intel              Pass: 100%/1   | Total: 50m 10s | Avg: 50m 10s | Max: 50m 10s
      🔥 MSVC               Pass:   0%/4   | Total:  3h 36m | Avg: 54m 06s | Max: 56m 43s
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 47m | Avg: 53m 35s | Max: 54m 05s
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  89%/39  | Total:  1d 05h | Avg: 44m 39s | Max: 56m 43s
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 23m 19s | Avg: 23m 19s | Max: 23m 19s
      🟩 GraphCapture       Pass: 100%/1   | Total: 19m 55s | Avg: 19m 55s | Max: 19m 55s
      🟩 HostLaunch         Pass: 100%/2   | Total: 40m 04s | Avg: 20m 02s | Max: 22m 07s
      🟩 TestGPU            Pass: 100%/2   | Total: 46m 39s | Avg: 23m 19s | Max: 24m 19s
    🟨 ctk
      🟨 11.1               Pass:  85%/7   | Total:  4h 23m | Avg: 37m 37s | Max: 48m 48s
      🟩 12.5               Pass: 100%/2   | Total:  1h 47m | Avg: 53m 35s | Max: 54m 05s
      🟨 12.6               Pass:  91%/36  | Total:  1d 01h | Avg: 41m 41s | Max: 56m 43s
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 46m | Avg: 53m 09s | Max: 55m 44s
      🟨 nvcc11.1           Pass:  85%/7   | Total:  4h 23m | Avg: 37m 37s | Max: 48m 48s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 47m | Avg: 53m 35s | Max: 54m 05s
      🟨 nvcc12.6           Pass:  91%/34  | Total: 23h 14m | Avg: 41m 01s | Max: 56m 43s
    🟨 cxx
      🟩 Clang9             Pass: 100%/4   | Total:  2h 44m | Avg: 41m 05s | Max: 46m 06s
      🟩 Clang10            Pass: 100%/1   | Total: 45m 13s | Avg: 45m 13s | Max: 45m 13s
      🟩 Clang11            Pass: 100%/1   | Total: 44m 52s | Avg: 44m 52s | Max: 44m 52s
      🟩 Clang12            Pass: 100%/1   | Total: 44m 13s | Avg: 44m 13s | Max: 44m 13s
      🟩 Clang13            Pass: 100%/1   | Total: 43m 00s | Avg: 43m 00s | Max: 43m 00s
      🟩 Clang14            Pass: 100%/1   | Total: 44m 42s | Avg: 44m 42s | Max: 44m 42s
      🟩 Clang15            Pass: 100%/1   | Total: 43m 45s | Avg: 43m 45s | Max: 43m 45s
      🟩 Clang16            Pass: 100%/1   | Total: 43m 54s | Avg: 43m 54s | Max: 43m 54s
      🟩 Clang17            Pass: 100%/1   | Total: 44m 43s | Avg: 44m 43s | Max: 44m 43s
      🟩 Clang18            Pass: 100%/7   | Total:  4h 51m | Avg: 41m 40s | Max: 55m 44s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 11m | Avg: 35m 37s | Max: 36m 48s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 27m | Avg: 43m 50s | Max: 45m 12s
      🟩 GCC8               Pass: 100%/1   | Total: 41m 59s | Avg: 41m 59s | Max: 41m 59s
      🟩 GCC9               Pass: 100%/3   | Total:  1h 53m | Avg: 37m 44s | Max: 42m 47s
      🟩 GCC10              Pass: 100%/1   | Total: 44m 42s | Avg: 44m 42s | Max: 44m 42s
      🟩 GCC11              Pass: 100%/1   | Total: 46m 50s | Avg: 46m 50s | Max: 46m 50s
      🟩 GCC12              Pass: 100%/1   | Total: 48m 03s | Avg: 48m 03s | Max: 48m 03s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 53m | Avg: 29m 11s | Max: 46m 10s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 50m 10s | Avg: 50m 10s | Max: 50m 10s
      🟥 MSVC14.16          Pass:   0%/1   | Total: 48m 48s | Avg: 48m 48s | Max: 48m 48s
      🟥 MSVC14.29          Pass:   0%/1   | Total: 55m 29s | Avg: 55m 29s | Max: 55m 29s
      🟥 MSVC14.39          Pass:   0%/2   | Total:  1h 52m | Avg: 56m 04s | Max: 56m 43s
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 47m | Avg: 53m 35s | Max: 54m 05s
    🟨 std
      🟩 11                 Pass: 100%/5   | Total:  3h 14m | Avg: 38m 51s | Max: 45m 21s
      🟨 14                 Pass:  75%/4   | Total:  2h 54m | Avg: 43m 32s | Max: 48m 48s
      🟨 17                 Pass:  83%/12  | Total:  9h 22m | Avg: 46m 50s | Max: 55m 44s
      🟨 20                 Pass:  95%/24  | Total: 15h 40m | Avg: 39m 12s | Max: 56m 43s
    🟨 gpu
      🟨 v100               Pass:  91%/45  | Total:  1d 07h | Avg: 41m 35s | Max: 56m 43s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 16m 19s | Avg: 16m 19s | Max: 16m 19s
    
  • 🟩 thrust: Pass: 100%/46 | Total: 17h 22m | Avg: 22m 40s | Max: 52m 44s | Hits: 92%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 32m 57s | Avg: 16m 28s | Max: 21m 05s
    🟩 cpu
      🟩 amd64              Pass: 100%/44  | Total: 16h 35m | Avg: 22m 37s | Max: 52m 44s | Hits:  92%/9260  
      🟩 arm64              Pass: 100%/2   | Total: 47m 27s | Avg: 23m 43s | Max: 27m 28s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  2h 13m | Avg: 19m 02s | Max: 42m 16s | Hits:  90%/1852  
      🟩 12.5               Pass: 100%/2   | Total:  1h 25m | Avg: 42m 36s | Max: 43m 48s
      🟩 12.6               Pass: 100%/37  | Total: 13h 44m | Avg: 22m 16s | Max: 52m 44s | Hits:  93%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 38m 36s | Avg: 19m 18s | Max: 19m 50s
      🟩 nvcc11.1           Pass: 100%/7   | Total:  2h 13m | Avg: 19m 02s | Max: 42m 16s | Hits:  90%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 25m | Avg: 42m 36s | Max: 43m 48s
      🟩 nvcc12.6           Pass: 100%/35  | Total: 13h 05m | Avg: 22m 27s | Max: 52m 44s | Hits:  93%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 38m 36s | Avg: 19m 18s | Max: 19m 50s
      🟩 nvcc               Pass: 100%/44  | Total: 16h 44m | Avg: 22m 49s | Max: 52m 44s | Hits:  92%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total:  1h 08m | Avg: 17m 00s | Max: 20m 11s
      🟩 Clang10            Pass: 100%/1   | Total: 22m 12s | Avg: 22m 12s | Max: 22m 12s
      🟩 Clang11            Pass: 100%/1   | Total: 20m 33s | Avg: 20m 33s | Max: 20m 33s
      🟩 Clang12            Pass: 100%/1   | Total: 22m 06s | Avg: 22m 06s | Max: 22m 06s
      🟩 Clang13            Pass: 100%/1   | Total: 21m 56s | Avg: 21m 56s | Max: 21m 56s
      🟩 Clang14            Pass: 100%/1   | Total: 19m 27s | Avg: 19m 27s | Max: 19m 27s
      🟩 Clang15            Pass: 100%/1   | Total: 20m 46s | Avg: 20m 46s | Max: 20m 46s
      🟩 Clang16            Pass: 100%/1   | Total: 21m 02s | Avg: 21m 02s | Max: 21m 02s
      🟩 Clang17            Pass: 100%/1   | Total: 23m 42s | Avg: 23m 42s | Max: 23m 42s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 01m | Avg: 17m 20s | Max: 24m 11s
      🟩 GCC6               Pass: 100%/2   | Total: 28m 43s | Avg: 14m 21s | Max: 16m 47s
      🟩 GCC7               Pass: 100%/2   | Total: 33m 55s | Avg: 16m 57s | Max: 20m 10s
      🟩 GCC8               Pass: 100%/1   | Total: 20m 26s | Avg: 20m 26s | Max: 20m 26s
      🟩 GCC9               Pass: 100%/3   | Total: 56m 18s | Avg: 18m 46s | Max: 23m 50s
      🟩 GCC10              Pass: 100%/1   | Total: 23m 19s | Avg: 23m 19s | Max: 23m 19s
      🟩 GCC11              Pass: 100%/1   | Total: 25m 25s | Avg: 25m 25s | Max: 25m 25s
      🟩 GCC12              Pass: 100%/1   | Total: 26m 22s | Avg: 26m 22s | Max: 26m 22s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 24m | Avg: 18m 02s | Max: 28m 57s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 31m 59s | Avg: 31m 59s | Max: 31m 59s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 42m 16s | Avg: 42m 16s | Max: 42m 16s | Hits:  90%/1852  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 43m 43s | Avg: 43m 43s | Max: 43m 43s | Hits:  89%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  1h 59m | Avg: 39m 57s | Max: 52m 44s | Hits:  94%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 25m | Avg: 42m 36s | Max: 43m 48s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  6h 01m | Avg: 19m 00s | Max: 24m 11s
      🟩 GCC                Pass: 100%/19  | Total:  5h 58m | Avg: 18m 53s | Max: 28m 57s
      🟩 Intel              Pass: 100%/1   | Total: 31m 59s | Avg: 31m 59s | Max: 31m 59s
      🟩 MSVC               Pass: 100%/5   | Total:  3h 25m | Avg: 41m 10s | Max: 52m 44s | Hits:  92%/9260  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 25m | Avg: 42m 36s | Max: 43m 48s
    🟩 gpu
      🟩 v100               Pass: 100%/46  | Total: 17h 22m | Avg: 22m 40s | Max: 52m 44s | Hits:  92%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total: 16h 09m | Avg: 24m 14s | Max: 52m 44s | Hits:  90%/7408  
      🟩 TestCPU            Pass: 100%/3   | Total: 39m 16s | Avg: 13m 05s | Max: 24m 15s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total: 34m 15s | Avg: 11m 25s | Max: 11m 52s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  9m 48s | Avg:  9m 48s | Max:  9m 48s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total:  1h 08m | Avg: 13m 47s | Max: 17m 58s
      🟩 14                 Pass: 100%/4   | Total:  1h 39m | Avg: 24m 51s | Max: 42m 16s | Hits:  90%/1852  
      🟩 17                 Pass: 100%/12  | Total:  5h 28m | Avg: 27m 22s | Max: 43m 43s | Hits:  90%/3704  
      🟩 20                 Pass: 100%/23  | Total:  8h 33m | Avg: 22m 18s | Max: 52m 44s | Hits:  95%/3704  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 11m 36s | Avg: 5m 48s | Max: 6m 44s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 11m 36s | Avg:  5m 48s | Max:  6m 44s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 11m 36s | Avg:  5m 48s | Max:  6m 44s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 11m 36s | Avg:  5m 48s | Max:  6m 44s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 11m 36s | Avg:  5m 48s | Max:  6m 44s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 11m 36s | Avg:  5m 48s | Max:  6m 44s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 11m 36s | Avg:  5m 48s | Max:  6m 44s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 11m 36s | Avg:  5m 48s | Max:  6m 44s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  4m 52s | Avg:  4m 52s | Max:  4m 52s
      🟩 Test               Pass: 100%/1   | Total:  6m 44s | Avg:  6m 44s | Max:  6m 44s
    
  • 🟩 python: Pass: 100%/1 | Total: 29m 15s | Avg: 29m 15s | Max: 29m 15s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 29m 15s | Avg: 29m 15s | Max: 29m 15s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 29m 15s | Avg: 29m 15s | Max: 29m 15s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 29m 15s | Avg: 29m 15s | Max: 29m 15s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 29m 15s | Avg: 29m 15s | Max: 29m 15s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 29m 15s | Avg: 29m 15s | Max: 29m 15s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 29m 15s | Avg: 29m 15s | Max: 29m 15s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 29m 15s | Avg: 29m 15s | Max: 29m 15s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 29m 15s | Avg: 29m 15s | Max: 29m 15s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 94)

# Runner
70 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16
4 linux-arm64-cpu16

Copy link
Contributor

🟨 CI finished in 46m 47s: Pass: 98%/94 | Total: 13h 00m | Avg: 8m 18s | Max: 27m 42s | Hits: 98%/12324
  • 🟨 cub: Pass: 97%/45 | Total: 6h 15m | Avg: 8m 20s | Max: 21m 18s | Hits: 96%/3064

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/43  | Total:  6h 02m | Avg:  8m 25s | Max: 21m 18s | Hits:  96%/3064  
      🟩 arm64              Pass: 100%/2   | Total: 12m 32s | Avg:  6m 16s | Max:  6m 41s
    🔍 ctk: 12.6 🔍
      🟩 11.1               Pass: 100%/7   | Total: 43m 29s | Avg:  6m 12s | Max: 15m 19s | Hits:  96%/766   
      🟩 12.5               Pass: 100%/2   | Total: 20m 15s | Avg: 10m 07s | Max: 10m 47s
      🔍 12.6               Pass:  97%/36  | Total:  5h 11m | Avg:  8m 38s | Max: 21m 18s | Hits:  96%/2298  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 29s | Avg:  4m 44s | Max:  4m 47s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 43m 29s | Avg:  6m 12s | Max: 15m 19s | Hits:  96%/766   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 20m 15s | Avg: 10m 07s | Max: 10m 47s
      🔍 nvcc12.6           Pass:  97%/34  | Total:  5h 01m | Avg:  8m 52s | Max: 21m 18s | Hits:  96%/2298  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 29s | Avg:  4m 44s | Max:  4m 47s
      🔍 nvcc               Pass:  97%/43  | Total:  6h 05m | Avg:  8m 30s | Max: 21m 18s | Hits:  96%/3064  
    🔍 cxx: GCC13 🔍
      🟩 Clang9             Pass: 100%/4   | Total: 23m 17s | Avg:  5m 49s | Max:  6m 51s
      🟩 Clang10            Pass: 100%/1   | Total:  7m 16s | Avg:  7m 16s | Max:  7m 16s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 51s | Avg:  5m 51s | Max:  5m 51s
      🟩 Clang12            Pass: 100%/1   | Total:  6m 06s | Avg:  6m 06s | Max:  6m 06s
      🟩 Clang13            Pass: 100%/1   | Total:  6m 18s | Avg:  6m 18s | Max:  6m 18s
      🟩 Clang14            Pass: 100%/1   | Total:  6m 33s | Avg:  6m 33s | Max:  6m 33s
      🟩 Clang15            Pass: 100%/1   | Total:  6m 16s | Avg:  6m 16s | Max:  6m 16s
      🟩 Clang16            Pass: 100%/1   | Total:  6m 04s | Avg:  6m 04s | Max:  6m 04s
      🟩 Clang17            Pass: 100%/1   | Total:  6m 33s | Avg:  6m 33s | Max:  6m 33s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 10m | Avg: 10m 06s | Max: 21m 12s
      🟩 GCC6               Pass: 100%/2   | Total:  9m 02s | Avg:  4m 31s | Max:  4m 38s
      🟩 GCC7               Pass: 100%/2   | Total: 11m 54s | Avg:  5m 57s | Max:  6m 03s
      🟩 GCC8               Pass: 100%/1   | Total:  6m 37s | Avg:  6m 37s | Max:  6m 37s
      🟩 GCC9               Pass: 100%/3   | Total: 16m 45s | Avg:  5m 35s | Max:  7m 20s
      🟩 GCC10              Pass: 100%/1   | Total:  6m 16s | Avg:  6m 16s | Max:  6m 16s
      🟩 GCC11              Pass: 100%/1   | Total:  6m 22s | Avg:  6m 22s | Max:  6m 22s
      🟩 GCC12              Pass: 100%/1   | Total:  6m 22s | Avg:  6m 22s | Max:  6m 22s
      🔍 GCC13              Pass:  87%/8   | Total:  1h 21m | Avg: 10m 09s | Max: 21m 18s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  6m 51s | Avg:  6m 51s | Max:  6m 51s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 19s | Avg: 15m 19s | Max: 15m 19s | Hits:  96%/766   
      🟩 MSVC14.29          Pass: 100%/1   | Total: 13m 22s | Avg: 13m 22s | Max: 13m 22s | Hits:  96%/766   
      🟩 MSVC14.39          Pass: 100%/2   | Total: 29m 48s | Avg: 14m 54s | Max: 16m 04s | Hits:  96%/1532  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 20m 15s | Avg: 10m 07s | Max: 10m 47s
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/19  | Total:  2h 24m | Avg:  7m 37s | Max: 21m 12s
      🔍 GCC                Pass:  94%/19  | Total:  2h 24m | Avg:  7m 36s | Max: 21m 18s
      🟩 Intel              Pass: 100%/1   | Total:  6m 51s | Avg:  6m 51s | Max:  6m 51s
      🟩 MSVC               Pass: 100%/4   | Total: 58m 29s | Avg: 14m 37s | Max: 16m 04s | Hits:  96%/3064  
      🟩 NVHPC              Pass: 100%/2   | Total: 20m 15s | Avg: 10m 07s | Max: 10m 47s
    🔍 jobs: HostLaunch 🔍
      🟩 Build              Pass: 100%/39  | Total:  4h 36m | Avg:  7m 05s | Max: 16m 04s | Hits:  96%/3064  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 18s | Avg: 21m 18s | Max: 21m 18s
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 39s | Avg: 16m 39s | Max: 16m 39s
      🔍 HostLaunch         Pass:  50%/2   | Total: 21m 12s | Avg: 10m 36s | Max: 21m 12s
      🟩 TestGPU            Pass: 100%/2   | Total: 39m 30s | Avg: 19m 45s | Max: 20m 44s
    🔍 std: 20 🔍
      🟩 11                 Pass: 100%/5   | Total: 26m 13s | Avg:  5m 14s | Max:  6m 51s
      🟩 14                 Pass: 100%/4   | Total: 32m 31s | Avg:  8m 07s | Max: 15m 19s | Hits:  96%/766   
      🟩 17                 Pass: 100%/12  | Total:  1h 33m | Avg:  7m 49s | Max: 13m 44s | Hits:  96%/1532  
      🔍 20                 Pass:  95%/24  | Total:  3h 42m | Avg:  9m 16s | Max: 21m 18s | Hits:  96%/766   
    🟨 gpu
      🟨 v100               Pass:  97%/45  | Total:  6h 15m | Avg:  8m 20s | Max: 21m 18s | Hits:  96%/3064  
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 46s | Avg:  4m 46s | Max:  4m 46s
    
  • 🟩 thrust: Pass: 100%/46 | Total: 6h 07m | Avg: 7m 59s | Max: 23m 50s | Hits: 99%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 20m 45s | Avg: 10m 22s | Max: 14m 26s
    🟩 cpu
      🟩 amd64              Pass: 100%/44  | Total:  5h 58m | Avg:  8m 08s | Max: 23m 50s | Hits:  99%/9260  
      🟩 arm64              Pass: 100%/2   | Total:  9m 43s | Avg:  4m 51s | Max:  5m 07s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 46m 00s | Avg:  6m 34s | Max: 19m 01s | Hits:  99%/1852  
      🟩 12.5               Pass: 100%/2   | Total: 30m 43s | Avg: 15m 21s | Max: 15m 28s
      🟩 12.6               Pass: 100%/37  | Total:  4h 51m | Avg:  7m 52s | Max: 23m 50s | Hits:  99%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  5m 04s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 46m 00s | Avg:  6m 34s | Max: 19m 01s | Hits:  99%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 30m 43s | Avg: 15m 21s | Max: 15m 28s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  4h 41m | Avg:  8m 01s | Max: 23m 50s | Hits:  99%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  5m 04s
      🟩 nvcc               Pass: 100%/44  | Total:  5h 57m | Avg:  8m 07s | Max: 23m 50s | Hits:  99%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 21m 28s | Avg:  5m 22s | Max:  6m 23s
      🟩 Clang10            Pass: 100%/1   | Total:  8m 27s | Avg:  8m 27s | Max:  8m 27s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 40s | Avg:  5m 40s | Max:  5m 40s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 27s | Avg:  5m 27s | Max:  5m 27s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 55s | Avg:  5m 55s | Max:  5m 55s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 42s | Avg:  5m 42s | Max:  5m 42s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 17s | Avg:  5m 17s | Max:  5m 17s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 22s | Avg:  5m 22s | Max:  5m 22s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 21s | Avg:  5m 21s | Max:  5m 21s
      🟩 Clang18            Pass: 100%/7   | Total: 44m 03s | Avg:  6m 17s | Max: 10m 56s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 32s | Avg:  4m 16s | Max:  4m 46s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 16s | Avg:  5m 08s | Max:  5m 09s
      🟩 GCC8               Pass: 100%/1   | Total:  6m 01s | Avg:  6m 01s | Max:  6m 01s
      🟩 GCC9               Pass: 100%/3   | Total: 15m 27s | Avg:  5m 09s | Max:  6m 14s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 27s | Avg:  5m 27s | Max:  5m 27s
      🟩 GCC11              Pass: 100%/1   | Total:  6m 47s | Avg:  6m 47s | Max:  6m 47s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 59s | Avg:  5m 59s | Max:  5m 59s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 02m | Avg:  7m 50s | Max: 14m 26s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  7m 31s | Avg:  7m 31s | Max:  7m 31s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 19m 01s | Avg: 19m 01s | Max: 19m 01s | Hits:  99%/1852  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 17m 36s | Avg: 17m 36s | Max: 17m 36s | Hits:  99%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total: 59m 13s | Avg: 19m 44s | Max: 23m 50s | Hits:  99%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 30m 43s | Avg: 15m 21s | Max: 15m 28s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  1h 52m | Avg:  5m 55s | Max: 10m 56s
      🟩 GCC                Pass: 100%/19  | Total:  2h 01m | Avg:  6m 22s | Max: 14m 26s
      🟩 Intel              Pass: 100%/1   | Total:  7m 31s | Avg:  7m 31s | Max:  7m 31s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 35m | Avg: 19m 10s | Max: 23m 50s | Hits:  99%/9260  
      🟩 NVHPC              Pass: 100%/2   | Total: 30m 43s | Avg: 15m 21s | Max: 15m 28s
    🟩 gpu
      🟩 v100               Pass: 100%/46  | Total:  6h 07m | Avg:  7m 59s | Max: 23m 50s | Hits:  99%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  4h 51m | Avg:  7m 17s | Max: 19m 01s | Hits:  99%/7408  
      🟩 TestCPU            Pass: 100%/3   | Total: 39m 04s | Avg: 13m 01s | Max: 23m 50s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total: 37m 24s | Avg: 12m 28s | Max: 14m 26s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 58s | Avg:  4m 58s | Max:  4m 58s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 23m 15s | Avg:  4m 39s | Max:  5m 51s
      🟩 14                 Pass: 100%/4   | Total: 35m 17s | Avg:  8m 49s | Max: 19m 01s | Hits:  99%/1852  
      🟩 17                 Pass: 100%/12  | Total:  1h 45m | Avg:  8m 45s | Max: 17m 36s | Hits:  99%/3704  
      🟩 20                 Pass: 100%/23  | Total:  3h 03m | Avg:  7m 58s | Max: 23m 50s | Hits:  99%/3704  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 01s | Avg: 5m 00s | Max: 7m 35s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 35s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 35s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 35s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 35s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 35s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 35s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 35s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 26s | Avg:  2m 26s | Max:  2m 26s
      🟩 Test               Pass: 100%/1   | Total:  7m 35s | Avg:  7m 35s | Max:  7m 35s
    
  • 🟩 python: Pass: 100%/1 | Total: 27m 42s | Avg: 27m 42s | Max: 27m 42s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 27m 42s | Avg: 27m 42s | Max: 27m 42s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 27m 42s | Avg: 27m 42s | Max: 27m 42s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 27m 42s | Avg: 27m 42s | Max: 27m 42s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 27m 42s | Avg: 27m 42s | Max: 27m 42s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 27m 42s | Avg: 27m 42s | Max: 27m 42s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 27m 42s | Avg: 27m 42s | Max: 27m 42s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 27m 42s | Avg: 27m 42s | Max: 27m 42s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 27m 42s | Avg: 27m 42s | Max: 27m 42s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 94)

# Runner
70 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16
4 linux-arm64-cpu16

@bernhardmgruber bernhardmgruber enabled auto-merge (squash) December 12, 2024 12:22
Copy link
Contributor

🟩 CI finished in 1h 29m: Pass: 100%/94 | Total: 13h 25m | Avg: 8m 33s | Max: 27m 42s | Hits: 98%/12324
  • 🟩 thrust: Pass: 100%/46 | Total: 6h 07m | Avg: 7m 59s | Max: 23m 50s | Hits: 99%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 20m 45s | Avg: 10m 22s | Max: 14m 26s
    🟩 cpu
      🟩 amd64              Pass: 100%/44  | Total:  5h 58m | Avg:  8m 08s | Max: 23m 50s | Hits:  99%/9260  
      🟩 arm64              Pass: 100%/2   | Total:  9m 43s | Avg:  4m 51s | Max:  5m 07s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 46m 00s | Avg:  6m 34s | Max: 19m 01s | Hits:  99%/1852  
      🟩 12.5               Pass: 100%/2   | Total: 30m 43s | Avg: 15m 21s | Max: 15m 28s
      🟩 12.6               Pass: 100%/37  | Total:  4h 51m | Avg:  7m 52s | Max: 23m 50s | Hits:  99%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  5m 04s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 46m 00s | Avg:  6m 34s | Max: 19m 01s | Hits:  99%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 30m 43s | Avg: 15m 21s | Max: 15m 28s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  4h 41m | Avg:  8m 01s | Max: 23m 50s | Hits:  99%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  5m 04s
      🟩 nvcc               Pass: 100%/44  | Total:  5h 57m | Avg:  8m 07s | Max: 23m 50s | Hits:  99%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 21m 28s | Avg:  5m 22s | Max:  6m 23s
      🟩 Clang10            Pass: 100%/1   | Total:  8m 27s | Avg:  8m 27s | Max:  8m 27s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 40s | Avg:  5m 40s | Max:  5m 40s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 27s | Avg:  5m 27s | Max:  5m 27s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 55s | Avg:  5m 55s | Max:  5m 55s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 42s | Avg:  5m 42s | Max:  5m 42s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 17s | Avg:  5m 17s | Max:  5m 17s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 22s | Avg:  5m 22s | Max:  5m 22s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 21s | Avg:  5m 21s | Max:  5m 21s
      🟩 Clang18            Pass: 100%/7   | Total: 44m 03s | Avg:  6m 17s | Max: 10m 56s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 32s | Avg:  4m 16s | Max:  4m 46s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 16s | Avg:  5m 08s | Max:  5m 09s
      🟩 GCC8               Pass: 100%/1   | Total:  6m 01s | Avg:  6m 01s | Max:  6m 01s
      🟩 GCC9               Pass: 100%/3   | Total: 15m 27s | Avg:  5m 09s | Max:  6m 14s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 27s | Avg:  5m 27s | Max:  5m 27s
      🟩 GCC11              Pass: 100%/1   | Total:  6m 47s | Avg:  6m 47s | Max:  6m 47s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 59s | Avg:  5m 59s | Max:  5m 59s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 02m | Avg:  7m 50s | Max: 14m 26s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  7m 31s | Avg:  7m 31s | Max:  7m 31s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 19m 01s | Avg: 19m 01s | Max: 19m 01s | Hits:  99%/1852  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 17m 36s | Avg: 17m 36s | Max: 17m 36s | Hits:  99%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total: 59m 13s | Avg: 19m 44s | Max: 23m 50s | Hits:  99%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 30m 43s | Avg: 15m 21s | Max: 15m 28s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  1h 52m | Avg:  5m 55s | Max: 10m 56s
      🟩 GCC                Pass: 100%/19  | Total:  2h 01m | Avg:  6m 22s | Max: 14m 26s
      🟩 Intel              Pass: 100%/1   | Total:  7m 31s | Avg:  7m 31s | Max:  7m 31s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 35m | Avg: 19m 10s | Max: 23m 50s | Hits:  99%/9260  
      🟩 NVHPC              Pass: 100%/2   | Total: 30m 43s | Avg: 15m 21s | Max: 15m 28s
    🟩 gpu
      🟩 v100               Pass: 100%/46  | Total:  6h 07m | Avg:  7m 59s | Max: 23m 50s | Hits:  99%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  4h 51m | Avg:  7m 17s | Max: 19m 01s | Hits:  99%/7408  
      🟩 TestCPU            Pass: 100%/3   | Total: 39m 04s | Avg: 13m 01s | Max: 23m 50s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total: 37m 24s | Avg: 12m 28s | Max: 14m 26s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 58s | Avg:  4m 58s | Max:  4m 58s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 23m 15s | Avg:  4m 39s | Max:  5m 51s
      🟩 14                 Pass: 100%/4   | Total: 35m 17s | Avg:  8m 49s | Max: 19m 01s | Hits:  99%/1852  
      🟩 17                 Pass: 100%/12  | Total:  1h 45m | Avg:  8m 45s | Max: 17m 36s | Hits:  99%/3704  
      🟩 20                 Pass: 100%/23  | Total:  3h 03m | Avg:  7m 58s | Max: 23m 50s | Hits:  99%/3704  
    
  • 🟩 cub: Pass: 100%/45 | Total: 6h 39m | Avg: 8m 52s | Max: 24m 22s | Hits: 96%/3064

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  6h 26m | Avg:  8m 59s | Max: 24m 22s | Hits:  96%/3064  
      🟩 arm64              Pass: 100%/2   | Total: 12m 32s | Avg:  6m 16s | Max:  6m 41s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 43m 29s | Avg:  6m 12s | Max: 15m 19s | Hits:  96%/766   
      🟩 12.5               Pass: 100%/2   | Total: 20m 15s | Avg: 10m 07s | Max: 10m 47s
      🟩 12.6               Pass: 100%/36  | Total:  5h 35m | Avg:  9m 19s | Max: 24m 22s | Hits:  96%/2298  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 29s | Avg:  4m 44s | Max:  4m 47s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 43m 29s | Avg:  6m 12s | Max: 15m 19s | Hits:  96%/766   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 20m 15s | Avg: 10m 07s | Max: 10m 47s
      🟩 nvcc12.6           Pass: 100%/34  | Total:  5h 26m | Avg:  9m 35s | Max: 24m 22s | Hits:  96%/2298  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 29s | Avg:  4m 44s | Max:  4m 47s
      🟩 nvcc               Pass: 100%/43  | Total:  6h 29m | Avg:  9m 04s | Max: 24m 22s | Hits:  96%/3064  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 23m 17s | Avg:  5m 49s | Max:  6m 51s
      🟩 Clang10            Pass: 100%/1   | Total:  7m 16s | Avg:  7m 16s | Max:  7m 16s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 51s | Avg:  5m 51s | Max:  5m 51s
      🟩 Clang12            Pass: 100%/1   | Total:  6m 06s | Avg:  6m 06s | Max:  6m 06s
      🟩 Clang13            Pass: 100%/1   | Total:  6m 18s | Avg:  6m 18s | Max:  6m 18s
      🟩 Clang14            Pass: 100%/1   | Total:  6m 33s | Avg:  6m 33s | Max:  6m 33s
      🟩 Clang15            Pass: 100%/1   | Total:  6m 16s | Avg:  6m 16s | Max:  6m 16s
      🟩 Clang16            Pass: 100%/1   | Total:  6m 04s | Avg:  6m 04s | Max:  6m 04s
      🟩 Clang17            Pass: 100%/1   | Total:  6m 33s | Avg:  6m 33s | Max:  6m 33s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 10m | Avg: 10m 06s | Max: 21m 12s
      🟩 GCC6               Pass: 100%/2   | Total:  9m 02s | Avg:  4m 31s | Max:  4m 38s
      🟩 GCC7               Pass: 100%/2   | Total: 11m 54s | Avg:  5m 57s | Max:  6m 03s
      🟩 GCC8               Pass: 100%/1   | Total:  6m 37s | Avg:  6m 37s | Max:  6m 37s
      🟩 GCC9               Pass: 100%/3   | Total: 16m 45s | Avg:  5m 35s | Max:  7m 20s
      🟩 GCC10              Pass: 100%/1   | Total:  6m 16s | Avg:  6m 16s | Max:  6m 16s
      🟩 GCC11              Pass: 100%/1   | Total:  6m 22s | Avg:  6m 22s | Max:  6m 22s
      🟩 GCC12              Pass: 100%/1   | Total:  6m 22s | Avg:  6m 22s | Max:  6m 22s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 45m | Avg: 13m 11s | Max: 24m 22s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  6m 51s | Avg:  6m 51s | Max:  6m 51s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 19s | Avg: 15m 19s | Max: 15m 19s | Hits:  96%/766   
      🟩 MSVC14.29          Pass: 100%/1   | Total: 13m 22s | Avg: 13m 22s | Max: 13m 22s | Hits:  96%/766   
      🟩 MSVC14.39          Pass: 100%/2   | Total: 29m 48s | Avg: 14m 54s | Max: 16m 04s | Hits:  96%/1532  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 20m 15s | Avg: 10m 07s | Max: 10m 47s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 24m | Avg:  7m 37s | Max: 21m 12s
      🟩 GCC                Pass: 100%/19  | Total:  2h 48m | Avg:  8m 53s | Max: 24m 22s
      🟩 Intel              Pass: 100%/1   | Total:  6m 51s | Avg:  6m 51s | Max:  6m 51s
      🟩 MSVC               Pass: 100%/4   | Total: 58m 29s | Avg: 14m 37s | Max: 16m 04s | Hits:  96%/3064  
      🟩 NVHPC              Pass: 100%/2   | Total: 20m 15s | Avg: 10m 07s | Max: 10m 47s
    🟩 gpu
      🟩 v100               Pass: 100%/45  | Total:  6h 39m | Avg:  8m 52s | Max: 24m 22s | Hits:  96%/3064  
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  4h 36m | Avg:  7m 05s | Max: 16m 04s | Hits:  96%/3064  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 18s | Avg: 21m 18s | Max: 21m 18s
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 39s | Avg: 16m 39s | Max: 16m 39s
      🟩 HostLaunch         Pass: 100%/2   | Total: 45m 34s | Avg: 22m 47s | Max: 24m 22s
      🟩 TestGPU            Pass: 100%/2   | Total: 39m 30s | Avg: 19m 45s | Max: 20m 44s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 46s | Avg:  4m 46s | Max:  4m 46s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 26m 13s | Avg:  5m 14s | Max:  6m 51s
      🟩 14                 Pass: 100%/4   | Total: 32m 31s | Avg:  8m 07s | Max: 15m 19s | Hits:  96%/766   
      🟩 17                 Pass: 100%/12  | Total:  1h 33m | Avg:  7m 49s | Max: 13m 44s | Hits:  96%/1532  
      🟩 20                 Pass: 100%/24  | Total:  4h 06m | Avg: 10m 16s | Max: 24m 22s | Hits:  96%/766   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 01s | Avg: 5m 00s | Max: 7m 35s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 35s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 35s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 35s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 35s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 35s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 35s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 35s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 26s | Avg:  2m 26s | Max:  2m 26s
      🟩 Test               Pass: 100%/1   | Total:  7m 35s | Avg:  7m 35s | Max:  7m 35s
    
  • 🟩 python: Pass: 100%/1 | Total: 27m 42s | Avg: 27m 42s | Max: 27m 42s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 27m 42s | Avg: 27m 42s | Max: 27m 42s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 27m 42s | Avg: 27m 42s | Max: 27m 42s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 27m 42s | Avg: 27m 42s | Max: 27m 42s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 27m 42s | Avg: 27m 42s | Max: 27m 42s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 27m 42s | Avg: 27m 42s | Max: 27m 42s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 27m 42s | Avg: 27m 42s | Max: 27m 42s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 27m 42s | Avg: 27m 42s | Max: 27m 42s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 27m 42s | Avg: 27m 42s | Max: 27m 42s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 94)

# Runner
70 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16
4 linux-arm64-cpu16

@bernhardmgruber bernhardmgruber merged commit 012a5bf into NVIDIA:main Dec 12, 2024
110 checks passed
@bernhardmgruber bernhardmgruber deleted the ref_rle_tuning branch December 12, 2024 12:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

None yet

2 participants