Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optimize, Cleanup, and Expose CUB Thread-Level Reduction #2390

Closed
wants to merge 0 commits into from

Conversation

fbusato
Copy link
Contributor

@fbusato fbusato commented Sep 6, 2024

Address #2287

Optimize, Cleanup, and Expose CUB Thread-Level Reduction

  • ILP: Binary-tree reduction
  • ILP: Ternary-tree reduction
  • Generic SIMD reduction
  • SIMD binding for narrow precision floating point types
  • Implement integer promotion optimization
  • Vectorize narrow precision data types half and bfloat16
  • Expose ThreadReduce in cub:: namespace
  • Document the API
  • Create catch2 test cases for all new cases

@fbusato fbusato requested review from a team as code owners September 6, 2024 21:25
@fbusato fbusato requested review from elstehle and griwes September 6, 2024 21:25
@fbusato fbusato marked this pull request as draft September 6, 2024 21:25
@fbusato fbusato changed the title Optimize, Cleanup, and Expose CUB Thread-Level Reduction [DRAFT] [DRAFT] Optimize, Cleanup, and Expose CUB Thread-Level Reduction Sep 6, 2024
@miscco
Copy link
Collaborator

miscco commented Sep 9, 2024

Please stay with tthe current optimizations, and create individual PRs for follow up changes.

Especially with performance improvements it is important to be able to bisect small commits rather than one large "optimizes the world" one

@fbusato fbusato marked this pull request as ready for review September 17, 2024 22:36
@fbusato fbusato requested a review from a team as a code owner September 17, 2024 22:36
Copy link
Contributor

🟨 CI finished in 5h 09m: Pass: 81%/259 | Total: 5d 21h | Avg: 32m 39s | Max: 1h 16m | Hits: 62%/21490
  • 🟨 cub: Pass: 78%/136 | Total: 3d 14h | Avg: 38m 05s | Max: 1h 14m | Hits: 5%/3650

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  77%/128 | Total:  3d 07h | Avg: 37m 03s | Max:  1h 14m | Hits:   5%/3650  
      🟩 arm64              Pass: 100%/8   | Total:  7h 16m | Avg: 54m 36s | Max:  1h 01m
    🚨 sm: 60;70;80;90 🚨
      🔥 60;70;80;90        Pass:   0%/3   | Total: 17m 17s | Avg:  5m 45s | Max:  6m 06s
      🟩 90a                Pass: 100%/4   | Total:  1h 39m | Avg: 24m 57s | Max: 26m 24s
    🟨 cxx
      🟨 Clang9             Pass:  50%/6   | Total:  2h 44m | Avg: 27m 25s | Max: 51m 02s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 49m | Avg: 56m 39s | Max: 58m 50s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 39m | Avg: 54m 48s | Max: 57m 48s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 29m | Avg: 52m 17s | Max: 56m 12s
      🟩 Clang13            Pass: 100%/4   | Total:  3h 30m | Avg: 52m 34s | Max: 57m 21s
      🟩 Clang14            Pass: 100%/4   | Total:  3h 37m | Avg: 54m 20s | Max: 56m 39s
      🟩 Clang15            Pass: 100%/4   | Total:  3h 28m | Avg: 52m 02s | Max: 56m 57s
      🟩 Clang16            Pass: 100%/4   | Total:  3h 34m | Avg: 53m 35s | Max: 55m 56s
      🟩 Clang17            Pass: 100%/4   | Total:  3h 26m | Avg: 51m 41s | Max: 53m 57s
      🟨 Clang18            Pass:  76%/26  | Total: 12h 27m | Avg: 28m 45s | Max:  1h 00m
      🟥 GCC6               Pass:   0%/2   | Total:  8m 25s | Avg:  4m 12s | Max:  4m 20s
      🟨 GCC7               Pass:  50%/6   | Total:  2h 53m | Avg: 28m 55s | Max: 56m 26s
      🟨 GCC8               Pass:  50%/6   | Total:  2h 49m | Avg: 28m 14s | Max: 57m 07s
      🟨 GCC9               Pass:  50%/6   | Total:  2h 54m | Avg: 29m 08s | Max: 57m 49s
      🟩 GCC10              Pass: 100%/4   | Total:  3h 29m | Avg: 52m 27s | Max: 53m 59s
      🟨 GCC11              Pass:  57%/7   | Total:  3h 47m | Avg: 32m 27s | Max: 53m 30s
      🟩 GCC12              Pass: 100%/4   | Total:  3h 37m | Avg: 54m 20s | Max: 57m 39s
      🟨 GCC13              Pass:  82%/29  | Total: 15h 03m | Avg: 31m 09s | Max:  1h 02m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 44m | Avg: 54m 42s | Max: 55m 18s
      🟥 MSVC14.16          Pass:   0%/1   | Total: 13m 08s | Avg: 13m 08s | Max: 13m 08s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 16m | Avg:  1h 08m | Max:  1h 11m | Hits:   5%/1460  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  3h 34m | Avg:  1h 11m | Max:  1h 14m | Hits:   5%/2190  
    🟨 cxx_family
      🟨 Clang              Pass:  85%/63  | Total:  1d 18h | Avg: 40m 45s | Max:  1h 00m
      🟨 GCC                Pass:  70%/64  | Total:  1d 10h | Avg: 32m 33s | Max:  1h 02m
      🟩 Intel              Pass: 100%/3   | Total:  2h 44m | Avg: 54m 42s | Max: 55m 18s
      🟨 MSVC               Pass:  83%/6   | Total:  6h 03m | Avg:  1h 00m | Max:  1h 14m | Hits:   5%/3650  
    🟨 jobs
      🟨 Build              Pass:  80%/103 | Total:  3d 03h | Avg: 43m 56s | Max:  1h 14m | Hits:   5%/3650  
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  2h 25m | Avg: 18m 13s | Max: 19m 54s
      🟩 GraphCapture       Pass: 100%/8   | Total:  2h 07m | Avg: 15m 57s | Max: 19m 15s
      🟩 HostLaunch         Pass: 100%/8   | Total:  2h 26m | Avg: 18m 22s | Max: 22m 39s
      🟥 SmallGMem          Pass:   0%/1   | Total: 34m 01s | Avg: 34m 01s | Max: 34m 01s
      🟥 TestGPU            Pass:   0%/8   | Total:  3h 18m | Avg: 24m 51s | Max: 31m 34s
    🟨 gpu
      🟨 v100               Pass:  78%/136 | Total:  3d 14h | Avg: 38m 05s | Max:  1h 14m | Hits:   5%/3650  
    🟨 ctk
      🟥 11.1               Pass:   0%/15  | Total:  1h 10m | Avg:  4m 41s | Max: 13m 08s
      🟥 11.8               Pass:   0%/3   | Total: 17m 17s | Avg:  5m 45s | Max:  6m 06s
      🟨 12.6               Pass:  90%/118 | Total:  3d 12h | Avg: 43m 09s | Max:  1h 14m | Hits:   5%/3650  
    🟨 cudacxx
      🟥 ClangCUDA18        Pass:   0%/2   | Total:  7m 43s | Avg:  3m 51s | Max:  4m 16s
      🟥 nvcc11.1           Pass:   0%/15  | Total:  1h 10m | Avg:  4m 41s | Max: 13m 08s
      🟥 nvcc11.8           Pass:   0%/3   | Total: 17m 17s | Avg:  5m 45s | Max:  6m 06s
      🟨 nvcc12.6           Pass:  92%/116 | Total:  3d 12h | Avg: 43m 49s | Max:  1h 14m | Hits:   5%/3650  
    🟨 cudacxx_family
      🟥 ClangCUDA          Pass:   0%/2   | Total:  7m 43s | Avg:  3m 51s | Max:  4m 16s
      🟨 nvcc               Pass:  79%/134 | Total:  3d 14h | Avg: 38m 35s | Max:  1h 14m | Hits:   5%/3650  
    🟨 std
      🟨 11                 Pass:  77%/35  | Total: 21h 06m | Avg: 36m 10s | Max:  1h 00m
      🟨 14                 Pass:  76%/38  | Total: 23h 59m | Avg: 37m 52s | Max:  1h 06m | Hits:   7%/1460  
      🟨 17                 Pass:  76%/38  | Total:  1d 00h | Avg: 37m 55s | Max:  1h 14m | Hits:   3%/1460  
      🟨 20                 Pass:  88%/25  | Total: 17h 12m | Avg: 41m 18s | Max:  1h 12m | Hits:   4%/730   
    
  • 🟨 thrust: Pass: 83%/122 | Total: 2d 06h | Avg: 26m 43s | Max: 1h 16m | Hits: 74%/17840

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  82%/114 | Total:  2d 02h | Avg: 26m 24s | Max:  1h 16m | Hits:  74%/17840 
      🟩 arm64              Pass: 100%/8   | Total:  4h 10m | Avg: 31m 18s | Max: 35m 27s
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  80%/103 | Total:  2d 02h | Avg: 29m 10s | Max:  1h 16m | Hits:  59%/11150 
      🟩 TestCPU            Pass: 100%/11  | Total:  2h 09m | Avg: 11m 45s | Max: 25m 05s | Hits:  99%/6690  
      🟩 TestGPU            Pass: 100%/8   | Total:  2h 06m | Avg: 15m 45s | Max: 19m 49s
    🚨 sm: 60;70;80;90 🚨
      🔥 60;70;80;90        Pass:   0%/3   | Total:  7m 00s | Avg:  2m 20s | Max:  2m 26s
      🟩 90a                Pass: 100%/4   | Total:  1h 18m | Avg: 19m 35s | Max: 22m 37s
    🟨 cudacxx
      🟥 ClangCUDA18        Pass:   0%/2   | Total:  5m 18s | Avg:  2m 39s | Max:  2m 40s
      🟥 nvcc11.1           Pass:   0%/15  | Total: 42m 29s | Avg:  2m 49s | Max: 12m 02s
      🟥 nvcc11.8           Pass:   0%/3   | Total:  7m 00s | Avg:  2m 20s | Max:  2m 26s
      🟩 nvcc12.6           Pass: 100%/102 | Total:  2d 05h | Avg: 31m 26s | Max:  1h 16m | Hits:  74%/17840 
    🟨 cxx
      🟨 Clang9             Pass:  50%/6   | Total:  1h 57m | Avg: 19m 30s | Max: 40m 10s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 45m | Avg: 35m 03s | Max: 42m 56s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 11m | Avg: 32m 49s | Max: 38m 10s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 16m | Avg: 34m 05s | Max: 38m 55s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 13m | Avg: 33m 25s | Max: 37m 35s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 10m | Avg: 32m 43s | Max: 38m 18s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 14m | Avg: 33m 43s | Max: 35m 21s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 11m | Avg: 32m 55s | Max: 37m 38s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 13m | Avg: 33m 23s | Max: 36m 08s
      🟨 Clang18            Pass:  88%/18  | Total:  5h 56m | Avg: 19m 47s | Max: 39m 59s
      🟥 GCC6               Pass:   0%/2   | Total:  4m 24s | Avg:  2m 12s | Max:  2m 23s
      🟨 GCC7               Pass:  50%/6   | Total:  1h 46m | Avg: 17m 40s | Max: 38m 04s
      🟨 GCC8               Pass:  50%/6   | Total:  1h 44m | Avg: 17m 28s | Max: 34m 35s
      🟨 GCC9               Pass:  50%/6   | Total:  1h 44m | Avg: 17m 28s | Max: 36m 35s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 28m | Avg: 37m 06s | Max: 42m 16s
      🟨 GCC11              Pass:  57%/7   | Total:  2h 31m | Avg: 21m 36s | Max: 39m 50s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 18m | Avg: 34m 38s | Max: 38m 29s
      🟩 GCC13              Pass: 100%/20  | Total:  7h 30m | Avg: 22m 30s | Max: 39m 32s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 01m | Avg: 40m 27s | Max: 45m 11s
      🟥 MSVC14.16          Pass:   0%/1   | Total: 12m 02s | Avg: 12m 02s | Max: 12m 02s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 07m | Hits:  60%/4460  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  4h 39m | Avg: 46m 34s | Max:  1h 16m | Hits:  79%/13380 
    🟨 cxx_family
      🟨 Clang              Pass:  90%/55  | Total:  1d 01h | Avg: 27m 28s | Max: 42m 56s
      🟨 GCC                Pass:  74%/55  | Total: 20h 08m | Avg: 21m 58s | Max: 42m 16s
      🟩 Intel              Pass: 100%/3   | Total:  2h 01m | Avg: 40m 27s | Max: 45m 11s
      🟨 MSVC               Pass:  88%/9   | Total:  7h 00m | Avg: 46m 44s | Max:  1h 16m | Hits:  74%/17840 
    🟨 gpu
      🟨 v100               Pass:  83%/122 | Total:  2d 06h | Avg: 26m 43s | Max:  1h 16m | Hits:  74%/17840 
    🟨 ctk
      🟥 11.1               Pass:   0%/15  | Total: 42m 29s | Avg:  2m 49s | Max: 12m 02s
      🟥 11.8               Pass:   0%/3   | Total:  7m 00s | Avg:  2m 20s | Max:  2m 26s
      🟨 12.6               Pass:  98%/104 | Total:  2d 05h | Avg: 30m 52s | Max:  1h 16m | Hits:  74%/17840 
    🟨 cudacxx_family
      🟥 ClangCUDA          Pass:   0%/2   | Total:  5m 18s | Avg:  2m 39s | Max:  2m 40s
      🟨 nvcc               Pass:  85%/120 | Total:  2d 06h | Avg: 27m 07s | Max:  1h 16m | Hits:  74%/17840 
    🟨 std
      🟨 11                 Pass:  80%/31  | Total: 10h 55m | Avg: 21m 07s | Max: 33m 08s
      🟨 14                 Pass:  80%/35  | Total: 15h 42m | Avg: 26m 55s | Max:  1h 01m | Hits:  73%/6690  
      🟨 17                 Pass:  82%/34  | Total: 16h 19m | Avg: 28m 47s | Max:  1h 11m | Hits:  72%/6690  
      🟨 20                 Pass:  95%/22  | Total: 11h 24m | Avg: 31m 06s | Max:  1h 16m | Hits:  79%/4460  
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 19m 33s | Avg: 19m 33s | Max: 19m 33s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 19m 33s | Avg: 19m 33s | Max: 19m 33s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 19m 33s | Avg: 19m 33s | Max: 19m 33s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 19m 33s | Avg: 19m 33s | Max: 19m 33s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 19m 33s | Avg: 19m 33s | Max: 19m 33s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 19m 33s | Avg: 19m 33s | Max: 19m 33s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 19m 33s | Avg: 19m 33s | Max: 19m 33s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 19m 33s | Avg: 19m 33s | Max: 19m 33s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 19m 33s | Avg: 19m 33s | Max: 19m 33s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 259)

# Runner
186 linux-amd64-cpu16
42 linux-amd64-gpu-v100-latest-1
16 linux-arm64-cpu16
15 windows-amd64-cpu16

Copy link
Contributor

🟩 CI finished in 2h 16m: Pass: 100%/208 | Total: 6d 01h | Avg: 41m 50s | Max: 1h 24m | Hits: 43%/14070
  • 🟩 cub: Pass: 100%/104 | Total: 3d 16h | Avg: 51m 00s | Max: 1h 19m | Hits: 24%/2920

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total:  3d 09h | Avg: 50m 40s | Max:  1h 19m | Hits:  24%/2920  
      🟩 arm64              Pass: 100%/8   | Total:  7h 20m | Avg: 55m 01s | Max: 59m 23s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total: 11h 35m | Avg: 46m 23s | Max: 58m 18s | Hits:   3%/730   
      🟩 11.8               Pass: 100%/3   | Total:  3h 40m | Avg:  1h 13m | Max:  1h 16m
      🟩 12.6               Pass: 100%/86  | Total:  3d 01h | Avg: 51m 01s | Max:  1h 19m | Hits:  31%/2190  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 54m | Avg: 57m 24s | Max: 59m 37s
      🟩 nvcc11.1           Pass: 100%/15  | Total: 11h 35m | Avg: 46m 23s | Max: 58m 18s | Hits:   3%/730   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  3h 40m | Avg:  1h 13m | Max:  1h 16m
      🟩 nvcc12.6           Pass: 100%/84  | Total:  2d 23h | Avg: 50m 52s | Max:  1h 19m | Hits:  31%/2190  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 54m | Avg: 57m 24s | Max: 59m 37s
      🟩 nvcc               Pass: 100%/102 | Total:  3d 14h | Avg: 50m 52s | Max:  1h 19m | Hits:  24%/2920  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  4h 52m | Avg: 48m 47s | Max: 55m 33s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 38m | Avg: 52m 47s | Max: 57m 10s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 35m | Avg: 53m 58s | Max: 56m 48s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 26m | Avg: 51m 37s | Max: 57m 27s
      🟩 Clang13            Pass: 100%/4   | Total:  3h 33m | Avg: 53m 19s | Max: 54m 55s
      🟩 Clang14            Pass: 100%/4   | Total:  3h 34m | Avg: 53m 32s | Max: 59m 39s
      🟩 Clang15            Pass: 100%/4   | Total:  3h 33m | Avg: 53m 21s | Max: 56m 45s
      🟩 Clang16            Pass: 100%/4   | Total:  3h 37m | Avg: 54m 21s | Max: 56m 34s
      🟩 Clang17            Pass: 100%/4   | Total:  3h 28m | Avg: 52m 12s | Max: 57m 21s
      🟩 Clang18            Pass: 100%/9   | Total:  7h 07m | Avg: 47m 29s | Max: 59m 37s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 31m | Avg: 45m 30s | Max: 46m 53s
      🟩 GCC7               Pass: 100%/6   | Total:  5h 06m | Avg: 51m 03s | Max: 57m 42s
      🟩 GCC8               Pass: 100%/6   | Total:  5h 06m | Avg: 51m 09s | Max: 57m 22s
      🟩 GCC9               Pass: 100%/6   | Total:  5h 03m | Avg: 50m 32s | Max: 58m 42s
      🟩 GCC10              Pass: 100%/4   | Total:  3h 39m | Avg: 54m 46s | Max: 56m 39s
      🟩 GCC11              Pass: 100%/7   | Total:  7h 12m | Avg:  1h 01m | Max:  1h 16m
      🟩 GCC12              Pass: 100%/4   | Total:  3h 38m | Avg: 54m 37s | Max:  1h 00m
      🟩 GCC13              Pass: 100%/16  | Total: 10h 00m | Avg: 37m 30s | Max:  1h 19m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  3h 02m | Avg:  1h 00m | Max:  1h 02m
      🟩 MSVC14.16          Pass: 100%/1   | Total: 58m 18s | Avg: 58m 18s | Max: 58m 18s | Hits:   3%/730   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 33m | Avg:  1h 16m | Max:  1h 18m | Hits:  31%/1460  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  1h 05m | Avg:  1h 05m | Max:  1h 05m | Hits:  31%/730   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  1d 15h | Avg: 51m 28s | Max: 59m 39s
      🟩 GCC                Pass: 100%/51  | Total:  1d 17h | Avg: 48m 34s | Max:  1h 19m
      🟩 Intel              Pass: 100%/3   | Total:  3h 02m | Avg:  1h 00m | Max:  1h 02m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 37m | Avg:  1h 09m | Max:  1h 18m | Hits:  24%/2920  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total:  3d 16h | Avg: 51m 00s | Max:  1h 19m | Hits:  24%/2920  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  3d 12h | Avg: 53m 00s | Max:  1h 18m | Hits:  24%/2920  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 18m 00s | Avg: 18m 00s | Max: 18m 00s
      🟩 GraphCapture       Pass: 100%/1   | Total: 15m 52s | Avg: 15m 52s | Max: 15m 52s
      🟩 HostLaunch         Pass: 100%/3   | Total: 51m 37s | Avg: 17m 12s | Max: 21m 09s
      🟩 TestGPU            Pass: 100%/3   | Total:  2h 10m | Avg: 43m 33s | Max:  1h 19m
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  3h 40m | Avg:  1h 13m | Max:  1h 16m
      🟩 90a                Pass: 100%/4   | Total:  1h 36m | Avg: 24m 01s | Max: 25m 44s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  1d 00h | Avg: 51m 32s | Max:  1h 19m
      🟩 14                 Pass: 100%/27  | Total: 23h 51m | Avg: 53m 01s | Max:  1h 18m | Hits:  17%/1460  
      🟩 17                 Pass: 100%/26  | Total: 23h 09m | Avg: 53m 25s | Max:  1h 14m | Hits:  31%/730   
      🟩 20                 Pass: 100%/23  | Total: 17h 21m | Avg: 45m 15s | Max:  1h 05m | Hits:  31%/730   
    
  • 🟩 thrust: Pass: 100%/103 | Total: 2d 08h | Avg: 32m 51s | Max: 1h 24m | Hits: 49%/11150

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total:  2d 04h | Avg: 33m 03s | Max:  1h 24m | Hits:  49%/11150 
      🟩 arm64              Pass: 100%/8   | Total:  4h 03m | Avg: 30m 24s | Max: 34m 09s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  8h 21m | Avg: 33m 26s | Max:  1h 21m | Hits:   4%/2230  
      🟩 11.8               Pass: 100%/3   | Total:  2h 12m | Avg: 44m 18s | Max: 50m 39s
      🟩 12.6               Pass: 100%/85  | Total:  1d 21h | Avg: 32m 20s | Max:  1h 24m | Hits:  60%/8920  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 56m 30s | Avg: 28m 15s | Max: 29m 06s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  8h 21m | Avg: 33m 26s | Max:  1h 21m | Hits:   4%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 12m | Avg: 44m 18s | Max: 50m 39s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  1d 20h | Avg: 32m 26s | Max:  1h 24m | Hits:  60%/8920  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 56m 30s | Avg: 28m 15s | Max: 29m 06s
      🟩 nvcc               Pass: 100%/101 | Total:  2d 07h | Avg: 32m 56s | Max:  1h 24m | Hits:  49%/11150 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 00m | Avg: 30m 07s | Max: 35m 51s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 39m | Avg: 33m 02s | Max: 37m 12s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 08m | Avg: 32m 03s | Max: 38m 53s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 14m | Avg: 33m 37s | Max: 36m 11s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 11m | Avg: 32m 51s | Max: 35m 54s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 00m | Avg: 30m 11s | Max: 32m 07s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 13m | Avg: 33m 23s | Max: 37m 31s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 17m | Avg: 34m 17s | Max: 39m 22s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 14m | Avg: 33m 33s | Max: 37m 08s
      🟩 Clang18            Pass: 100%/9   | Total:  3h 51m | Avg: 25m 40s | Max: 35m 24s
      🟩 GCC6               Pass: 100%/2   | Total: 56m 58s | Avg: 28m 29s | Max: 32m 57s
      🟩 GCC7               Pass: 100%/6   | Total:  3h 04m | Avg: 30m 40s | Max: 35m 15s
      🟩 GCC8               Pass: 100%/6   | Total:  3h 10m | Avg: 31m 40s | Max: 37m 52s
      🟩 GCC9               Pass: 100%/6   | Total:  3h 09m | Avg: 31m 36s | Max: 37m 08s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 17m | Avg: 34m 24s | Max: 36m 59s
      🟩 GCC11              Pass: 100%/7   | Total:  4h 35m | Avg: 39m 21s | Max: 50m 39s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 31m | Avg: 37m 49s | Max: 41m 23s
      🟩 GCC13              Pass: 100%/14  | Total:  5h 20m | Avg: 22m 53s | Max: 40m 11s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 00m | Avg: 40m 05s | Max: 44m 21s
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 21m | Avg:  1h 21m | Max:  1h 21m | Hits:   4%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 16m | Avg:  1h 08m | Max:  1h 09m | Hits:  47%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 49m | Avg: 54m 35s | Max:  1h 24m | Hits:  73%/4460  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total: 23h 50m | Avg: 31m 06s | Max: 39m 22s
      🟩 GCC                Pass: 100%/49  | Total:  1d 01h | Avg: 30m 43s | Max: 50m 39s
      🟩 Intel              Pass: 100%/3   | Total:  2h 00m | Avg: 40m 05s | Max: 44m 21s
      🟩 MSVC               Pass: 100%/5   | Total:  5h 27m | Avg:  1h 05m | Max:  1h 24m | Hits:  49%/11150 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total:  2d 08h | Avg: 32m 51s | Max:  1h 24m | Hits:  49%/11150 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  2d 06h | Avg: 34m 21s | Max:  1h 24m | Hits:  36%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 46m 56s | Avg: 11m 44s | Max: 24m 54s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total: 38m 21s | Avg: 12m 47s | Max: 14m 04s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 12m | Avg: 44m 18s | Max: 50m 39s
      🟩 90a                Pass: 100%/4   | Total:  1h 24m | Avg: 21m 06s | Max: 23m 11s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 12h 17m | Avg: 26m 20s | Max: 35m 48s
      🟩 14                 Pass: 100%/27  | Total: 16h 36m | Avg: 36m 54s | Max:  1h 21m | Hits:  25%/4460  
      🟩 17                 Pass: 100%/26  | Total: 15h 42m | Avg: 36m 15s | Max:  1h 09m | Hits:  47%/2230  
      🟩 20                 Pass: 100%/22  | Total: 11h 46m | Avg: 32m 07s | Max:  1h 24m | Hits:  73%/4460  
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 15m 17s | Avg: 15m 17s | Max: 15m 17s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 15m 17s | Avg: 15m 17s | Max: 15m 17s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 15m 17s | Avg: 15m 17s | Max: 15m 17s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 15m 17s | Avg: 15m 17s | Max: 15m 17s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 15m 17s | Avg: 15m 17s | Max: 15m 17s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 15m 17s | Avg: 15m 17s | Max: 15m 17s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 15m 17s | Avg: 15m 17s | Max: 15m 17s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 15m 17s | Avg: 15m 17s | Max: 15m 17s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 15m 17s | Avg: 15m 17s | Max: 15m 17s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 208)

# Runner
171 linux-amd64-cpu16
16 linux-arm64-cpu16
12 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16

Copy link
Contributor

🟩 CI finished in 2h 17m: Pass: 100%/208 | Total: 5d 22h | Avg: 41m 05s | Max: 1h 23m | Hits: 54%/14070
  • 🟩 cub: Pass: 100%/104 | Total: 3d 14h | Avg: 49m 57s | Max: 1h 23m | Hits: 3%/2920

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total:  3d 07h | Avg: 49m 32s | Max:  1h 23m | Hits:   3%/2920  
      🟩 arm64              Pass: 100%/8   | Total:  7h 18m | Avg: 54m 51s | Max:  1h 01m
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total: 11h 35m | Avg: 46m 23s | Max: 58m 55s | Hits:   3%/730   
      🟩 11.8               Pass: 100%/3   | Total:  3h 32m | Avg:  1h 10m | Max:  1h 14m
      🟩 12.6               Pass: 100%/86  | Total:  2d 23h | Avg: 49m 51s | Max:  1h 23m | Hits:   3%/2190  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 50m | Avg: 55m 20s | Max: 55m 34s
      🟩 nvcc11.1           Pass: 100%/15  | Total: 11h 35m | Avg: 46m 23s | Max: 58m 55s | Hits:   3%/730   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  3h 32m | Avg:  1h 10m | Max:  1h 14m
      🟩 nvcc12.6           Pass: 100%/84  | Total:  2d 21h | Avg: 49m 43s | Max:  1h 23m | Hits:   3%/2190  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 50m | Avg: 55m 20s | Max: 55m 34s
      🟩 nvcc               Pass: 100%/102 | Total:  3d 12h | Avg: 49m 51s | Max:  1h 23m | Hits:   3%/2920  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  5h 00m | Avg: 50m 06s | Max: 57m 29s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 33m | Avg: 51m 10s | Max: 51m 37s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 25m | Avg: 51m 17s | Max: 53m 37s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 23m | Avg: 50m 51s | Max: 52m 35s
      🟩 Clang13            Pass: 100%/4   | Total:  3h 20m | Avg: 50m 14s | Max: 50m 28s
      🟩 Clang14            Pass: 100%/4   | Total:  3h 29m | Avg: 52m 20s | Max: 58m 05s
      🟩 Clang15            Pass: 100%/4   | Total:  3h 28m | Avg: 52m 08s | Max: 56m 26s
      🟩 Clang16            Pass: 100%/4   | Total:  3h 22m | Avg: 50m 33s | Max: 51m 31s
      🟩 Clang17            Pass: 100%/4   | Total:  3h 28m | Avg: 52m 13s | Max: 57m 02s
      🟩 Clang18            Pass: 100%/9   | Total:  6h 54m | Avg: 46m 05s | Max: 55m 34s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 28m | Avg: 44m 12s | Max: 45m 53s
      🟩 GCC7               Pass: 100%/6   | Total:  4h 50m | Avg: 48m 22s | Max: 57m 28s
      🟩 GCC8               Pass: 100%/6   | Total:  4h 59m | Avg: 49m 54s | Max: 57m 38s
      🟩 GCC9               Pass: 100%/6   | Total:  4h 58m | Avg: 49m 48s | Max:  1h 00m
      🟩 GCC10              Pass: 100%/4   | Total:  3h 38m | Avg: 54m 36s | Max: 57m 38s
      🟩 GCC11              Pass: 100%/7   | Total:  7h 09m | Avg:  1h 01m | Max:  1h 14m
      🟩 GCC12              Pass: 100%/4   | Total:  3h 26m | Avg: 51m 42s | Max: 52m 33s
      🟩 GCC13              Pass: 100%/16  | Total: 10h 09m | Avg: 38m 04s | Max:  1h 23m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 55m | Avg: 58m 38s | Max:  1h 01m
      🟩 MSVC14.16          Pass: 100%/1   | Total: 58m 55s | Avg: 58m 55s | Max: 58m 55s | Hits:   3%/730   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 06m | Hits:   3%/1460  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  1h 19m | Avg:  1h 19m | Max:  1h 19m | Hits:   3%/730   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  1d 14h | Avg: 50m 09s | Max: 58m 05s
      🟩 GCC                Pass: 100%/51  | Total:  1d 16h | Avg: 47m 51s | Max:  1h 23m
      🟩 Intel              Pass: 100%/3   | Total:  2h 55m | Avg: 58m 38s | Max:  1h 01m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 31m | Avg:  1h 07m | Max:  1h 19m | Hits:   3%/2920  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total:  3d 14h | Avg: 49m 57s | Max:  1h 23m | Hits:   3%/2920  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  3d 10h | Avg: 51m 48s | Max:  1h 19m | Hits:   3%/2920  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 16m 40s | Avg: 16m 40s | Max: 16m 40s
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 40s | Avg: 17m 40s | Max: 17m 40s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 59m | Avg: 39m 54s | Max:  1h 23m
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 08m | Avg: 22m 41s | Max: 26m 54s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  3h 32m | Avg:  1h 10m | Max:  1h 14m
      🟩 90a                Pass: 100%/4   | Total:  1h 36m | Avg: 24m 09s | Max: 25m 56s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 23h 29m | Avg: 50m 19s | Max:  1h 23m
      🟩 14                 Pass: 100%/27  | Total: 23h 16m | Avg: 51m 44s | Max:  1h 10m | Hits:   3%/1460  
      🟩 17                 Pass: 100%/26  | Total: 22h 58m | Avg: 53m 00s | Max:  1h 07m | Hits:   3%/730   
      🟩 20                 Pass: 100%/23  | Total: 16h 51m | Avg: 43m 58s | Max:  1h 19m | Hits:   3%/730   
    
  • 🟩 thrust: Pass: 100%/103 | Total: 2d 07h | Avg: 32m 23s | Max: 1h 15m | Hits: 67%/11150

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total:  2d 03h | Avg: 32m 27s | Max:  1h 15m | Hits:  67%/11150 
      🟩 arm64              Pass: 100%/8   | Total:  4h 12m | Avg: 31m 35s | Max: 38m 29s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  7h 48m | Avg: 31m 15s | Max: 58m 14s | Hits:  59%/2230  
      🟩 11.8               Pass: 100%/3   | Total:  2h 03m | Avg: 41m 08s | Max: 44m 01s
      🟩 12.6               Pass: 100%/85  | Total:  1d 21h | Avg: 32m 16s | Max:  1h 15m | Hits:  69%/8920  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 50m 08s | Avg: 25m 04s | Max: 25m 34s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  7h 48m | Avg: 31m 15s | Max: 58m 14s | Hits:  59%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 03m | Avg: 41m 08s | Max: 44m 01s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  1d 20h | Avg: 32m 27s | Max:  1h 15m | Hits:  69%/8920  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 50m 08s | Avg: 25m 04s | Max: 25m 34s
      🟩 nvcc               Pass: 100%/101 | Total:  2d 06h | Avg: 32m 32s | Max:  1h 15m | Hits:  67%/11150 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 05m | Avg: 30m 53s | Max: 34m 32s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 41m | Avg: 33m 49s | Max: 39m 55s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 11m | Avg: 32m 46s | Max: 36m 38s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 09m | Avg: 32m 24s | Max: 37m 15s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 08m | Avg: 32m 11s | Max: 35m 14s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 11m | Avg: 32m 45s | Max: 37m 09s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 06m | Avg: 31m 39s | Max: 34m 38s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 07m | Avg: 31m 46s | Max: 34m 51s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 12m | Avg: 33m 00s | Max: 38m 21s
      🟩 Clang18            Pass: 100%/9   | Total:  3h 47m | Avg: 25m 18s | Max: 39m 13s
      🟩 GCC6               Pass: 100%/2   | Total: 54m 00s | Avg: 27m 00s | Max: 29m 19s
      🟩 GCC7               Pass: 100%/6   | Total:  3h 13m | Avg: 32m 15s | Max: 40m 31s
      🟩 GCC8               Pass: 100%/6   | Total:  3h 09m | Avg: 31m 38s | Max: 37m 11s
      🟩 GCC9               Pass: 100%/6   | Total:  3h 13m | Avg: 32m 10s | Max: 40m 13s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 21m | Avg: 35m 20s | Max: 37m 10s
      🟩 GCC11              Pass: 100%/7   | Total:  4h 29m | Avg: 38m 28s | Max: 44m 01s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 16m | Avg: 34m 03s | Max: 37m 21s
      🟩 GCC13              Pass: 100%/14  | Total:  5h 24m | Avg: 23m 09s | Max: 41m 59s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 05m | Avg: 41m 40s | Max: 50m 10s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 58m 14s | Avg: 58m 14s | Max: 58m 14s | Hits:  59%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 07m | Hits:  59%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 38m | Avg: 49m 08s | Max:  1h 15m | Hits:  79%/4460  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total: 23h 40m | Avg: 30m 53s | Max: 39m 55s
      🟩 GCC                Pass: 100%/49  | Total:  1d 01h | Avg: 30m 38s | Max: 44m 01s
      🟩 Intel              Pass: 100%/3   | Total:  2h 05m | Avg: 41m 40s | Max: 50m 10s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 48m | Avg: 57m 42s | Max:  1h 15m | Hits:  67%/11150 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total:  2d 07h | Avg: 32m 23s | Max:  1h 15m | Hits:  67%/11150 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  2d 06h | Avg: 33m 53s | Max:  1h 15m | Hits:  59%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 44m 48s | Avg: 11m 12s | Max: 22m 46s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total: 38m 05s | Avg: 12m 41s | Max: 12m 57s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 03m | Avg: 41m 08s | Max: 44m 01s
      🟩 90a                Pass: 100%/4   | Total:  1h 19m | Avg: 19m 55s | Max: 23m 02s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 12h 17m | Avg: 26m 21s | Max: 38m 01s
      🟩 14                 Pass: 100%/27  | Total: 15h 45m | Avg: 35m 01s | Max:  1h 07m | Hits:  59%/4460  
      🟩 17                 Pass: 100%/26  | Total: 15h 56m | Avg: 36m 47s | Max:  1h 04m | Hits:  59%/2230  
      🟩 20                 Pass: 100%/22  | Total: 11h 35m | Avg: 31m 38s | Max:  1h 15m | Hits:  79%/4460  
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 16m 35s | Avg: 16m 35s | Max: 16m 35s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 16m 35s | Avg: 16m 35s | Max: 16m 35s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 16m 35s | Avg: 16m 35s | Max: 16m 35s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 16m 35s | Avg: 16m 35s | Max: 16m 35s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 16m 35s | Avg: 16m 35s | Max: 16m 35s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 16m 35s | Avg: 16m 35s | Max: 16m 35s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 16m 35s | Avg: 16m 35s | Max: 16m 35s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 16m 35s | Avg: 16m 35s | Max: 16m 35s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 16m 35s | Avg: 16m 35s | Max: 16m 35s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 208)

# Runner
171 linux-amd64-cpu16
16 linux-arm64-cpu16
12 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16

Copy link
Contributor

🟩 CI finished in 1h 51m: Pass: 100%/208 | Total: 5d 22h | Avg: 41m 08s | Max: 1h 16m | Hits: 54%/14070
  • 🟩 cub: Pass: 100%/104 | Total: 3d 14h | Avg: 50m 03s | Max: 1h 16m | Hits: 3%/2920

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total:  3d 07h | Avg: 49m 42s | Max:  1h 16m | Hits:   3%/2920  
      🟩 arm64              Pass: 100%/8   | Total:  7h 13m | Avg: 54m 07s | Max: 56m 50s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total: 11h 50m | Avg: 47m 23s | Max: 57m 53s | Hits:   3%/730   
      🟩 11.8               Pass: 100%/3   | Total:  3h 30m | Avg:  1h 10m | Max:  1h 14m
      🟩 12.6               Pass: 100%/86  | Total:  2d 23h | Avg: 49m 48s | Max:  1h 16m | Hits:   3%/2190  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 52m | Avg: 56m 27s | Max: 59m 30s
      🟩 nvcc11.1           Pass: 100%/15  | Total: 11h 50m | Avg: 47m 23s | Max: 57m 53s | Hits:   3%/730   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  3h 30m | Avg:  1h 10m | Max:  1h 14m
      🟩 nvcc12.6           Pass: 100%/84  | Total:  2d 21h | Avg: 49m 39s | Max:  1h 16m | Hits:   3%/2190  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 52m | Avg: 56m 27s | Max: 59m 30s
      🟩 nvcc               Pass: 100%/102 | Total:  3d 12h | Avg: 49m 55s | Max:  1h 16m | Hits:   3%/2920  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  5h 01m | Avg: 50m 15s | Max: 57m 05s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 42m | Avg: 54m 01s | Max: 58m 57s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 30m | Avg: 52m 30s | Max: 57m 50s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 29m | Avg: 52m 18s | Max: 56m 07s
      🟩 Clang13            Pass: 100%/4   | Total:  3h 26m | Avg: 51m 42s | Max: 55m 49s
      🟩 Clang14            Pass: 100%/4   | Total:  3h 37m | Avg: 54m 25s | Max: 57m 34s
      🟩 Clang15            Pass: 100%/4   | Total:  3h 35m | Avg: 53m 53s | Max: 58m 39s
      🟩 Clang16            Pass: 100%/4   | Total:  3h 24m | Avg: 51m 05s | Max: 52m 30s
      🟩 Clang17            Pass: 100%/4   | Total:  3h 21m | Avg: 50m 27s | Max: 51m 24s
      🟩 Clang18            Pass: 100%/9   | Total:  7h 01m | Avg: 46m 53s | Max: 59m 30s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 32m | Avg: 46m 17s | Max: 50m 10s
      🟩 GCC7               Pass: 100%/6   | Total:  4h 54m | Avg: 49m 09s | Max: 52m 40s
      🟩 GCC8               Pass: 100%/6   | Total:  4h 59m | Avg: 49m 59s | Max: 56m 11s
      🟩 GCC9               Pass: 100%/6   | Total:  4h 56m | Avg: 49m 20s | Max: 54m 11s
      🟩 GCC10              Pass: 100%/4   | Total:  3h 35m | Avg: 53m 52s | Max: 58m 39s
      🟩 GCC11              Pass: 100%/7   | Total:  7h 03m | Avg:  1h 00m | Max:  1h 14m
      🟩 GCC12              Pass: 100%/4   | Total:  3h 27m | Avg: 51m 57s | Max: 52m 44s
      🟩 GCC13              Pass: 100%/16  | Total:  9h 32m | Avg: 35m 45s | Max: 56m 19s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 57m | Avg: 59m 14s | Max:  1h 03m
      🟩 MSVC14.16          Pass: 100%/1   | Total: 57m 53s | Avg: 57m 53s | Max: 57m 53s | Hits:   3%/730   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 14m | Hits:   3%/1460  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  1h 16m | Avg:  1h 16m | Max:  1h 16m | Hits:   3%/730   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  1d 15h | Avg: 51m 06s | Max: 59m 30s
      🟩 GCC                Pass: 100%/51  | Total:  1d 16h | Avg: 47m 06s | Max:  1h 14m
      🟩 Intel              Pass: 100%/3   | Total:  2h 57m | Avg: 59m 14s | Max:  1h 03m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 34m | Avg:  1h 08m | Max:  1h 16m | Hits:   3%/2920  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total:  3d 14h | Avg: 50m 03s | Max:  1h 16m | Hits:   3%/2920  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  3d 11h | Avg: 52m 11s | Max:  1h 16m | Hits:   3%/2920  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 25m 44s | Avg: 25m 44s | Max: 25m 44s
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 25s | Avg: 16m 25s | Max: 16m 25s
      🟩 HostLaunch         Pass: 100%/3   | Total: 57m 28s | Avg: 19m 09s | Max: 20m 10s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 35m | Avg: 31m 40s | Max: 37m 53s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  3h 30m | Avg:  1h 10m | Max:  1h 14m
      🟩 90a                Pass: 100%/4   | Total:  1h 36m | Avg: 24m 01s | Max: 25m 37s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 23h 10m | Avg: 49m 40s | Max:  1h 07m
      🟩 14                 Pass: 100%/27  | Total: 23h 12m | Avg: 51m 34s | Max:  1h 08m | Hits:   3%/1460  
      🟩 17                 Pass: 100%/26  | Total: 23h 05m | Avg: 53m 17s | Max:  1h 14m | Hits:   3%/730   
      🟩 20                 Pass: 100%/23  | Total: 17h 16m | Avg: 45m 03s | Max:  1h 16m | Hits:   3%/730   
    
  • 🟩 thrust: Pass: 100%/103 | Total: 2d 07h | Avg: 32m 24s | Max: 1h 16m | Hits: 67%/11150

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total:  2d 03h | Avg: 32m 28s | Max:  1h 16m | Hits:  67%/11150 
      🟩 arm64              Pass: 100%/8   | Total:  4h 11m | Avg: 31m 28s | Max: 36m 01s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  8h 02m | Avg: 32m 11s | Max:  1h 05m | Hits:  59%/2230  
      🟩 11.8               Pass: 100%/3   | Total:  2h 00m | Avg: 40m 18s | Max: 44m 01s
      🟩 12.6               Pass: 100%/85  | Total:  1d 21h | Avg: 32m 09s | Max:  1h 16m | Hits:  69%/8920  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 50m 09s | Avg: 25m 04s | Max: 25m 48s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  8h 02m | Avg: 32m 11s | Max:  1h 05m | Hits:  59%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 00m | Avg: 40m 18s | Max: 44m 01s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  1d 20h | Avg: 32m 19s | Max:  1h 16m | Hits:  69%/8920  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 50m 09s | Avg: 25m 04s | Max: 25m 48s
      🟩 nvcc               Pass: 100%/101 | Total:  2d 06h | Avg: 32m 32s | Max:  1h 16m | Hits:  67%/11150 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 05m | Avg: 30m 54s | Max: 36m 48s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 39m | Avg: 33m 08s | Max: 37m 38s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 04m | Avg: 31m 11s | Max: 33m 19s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 05m | Avg: 31m 27s | Max: 33m 57s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 08m | Avg: 32m 13s | Max: 37m 32s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 17m | Avg: 34m 21s | Max: 40m 24s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 11m | Avg: 32m 52s | Max: 36m 24s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 13m | Avg: 33m 18s | Max: 40m 59s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 12m | Avg: 33m 03s | Max: 39m 01s
      🟩 Clang18            Pass: 100%/9   | Total:  3h 44m | Avg: 24m 53s | Max: 34m 45s
      🟩 GCC6               Pass: 100%/2   | Total: 56m 54s | Avg: 28m 27s | Max: 29m 15s
      🟩 GCC7               Pass: 100%/6   | Total:  3h 06m | Avg: 31m 08s | Max: 39m 07s
      🟩 GCC8               Pass: 100%/6   | Total:  3h 09m | Avg: 31m 36s | Max: 35m 30s
      🟩 GCC9               Pass: 100%/6   | Total:  3h 14m | Avg: 32m 27s | Max: 38m 06s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 15m | Avg: 33m 59s | Max: 37m 15s
      🟩 GCC11              Pass: 100%/7   | Total:  4h 18m | Avg: 36m 56s | Max: 44m 01s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 16m | Avg: 34m 07s | Max: 37m 36s
      🟩 GCC13              Pass: 100%/14  | Total:  5h 24m | Avg: 23m 11s | Max: 37m 44s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 04m | Avg: 41m 27s | Max: 47m 10s
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 05m | Avg:  1h 05m | Max:  1h 05m | Hits:  59%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 16m | Avg:  1h 08m | Max:  1h 16m | Hits:  59%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 43m | Avg: 51m 59s | Max:  1h 16m | Hits:  79%/4460  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total: 23h 42m | Avg: 30m 55s | Max: 40m 59s
      🟩 GCC                Pass: 100%/49  | Total:  1d 00h | Avg: 30m 16s | Max: 44m 01s
      🟩 Intel              Pass: 100%/3   | Total:  2h 04m | Avg: 41m 27s | Max: 47m 10s
      🟩 MSVC               Pass: 100%/5   | Total:  5h 06m | Avg:  1h 01m | Max:  1h 16m | Hits:  67%/11150 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total:  2d 07h | Avg: 32m 24s | Max:  1h 16m | Hits:  67%/11150 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  2d 06h | Avg: 33m 50s | Max:  1h 16m | Hits:  59%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 48m 30s | Avg: 12m 07s | Max: 27m 16s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total: 39m 21s | Avg: 13m 07s | Max: 14m 24s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 00m | Avg: 40m 18s | Max: 44m 01s
      🟩 90a                Pass: 100%/4   | Total:  1h 25m | Avg: 21m 21s | Max: 25m 04s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 12h 14m | Avg: 26m 13s | Max: 35m 19s
      🟩 14                 Pass: 100%/27  | Total: 15h 53m | Avg: 35m 19s | Max:  1h 05m | Hits:  59%/4460  
      🟩 17                 Pass: 100%/26  | Total: 15h 57m | Avg: 36m 49s | Max:  1h 16m | Hits:  59%/2230  
      🟩 20                 Pass: 100%/22  | Total: 11h 31m | Avg: 31m 26s | Max:  1h 16m | Hits:  79%/4460  
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 15m 39s | Avg: 15m 39s | Max: 15m 39s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 15m 39s | Avg: 15m 39s | Max: 15m 39s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 15m 39s | Avg: 15m 39s | Max: 15m 39s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 15m 39s | Avg: 15m 39s | Max: 15m 39s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 15m 39s | Avg: 15m 39s | Max: 15m 39s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 15m 39s | Avg: 15m 39s | Max: 15m 39s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 15m 39s | Avg: 15m 39s | Max: 15m 39s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 15m 39s | Avg: 15m 39s | Max: 15m 39s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 15m 39s | Avg: 15m 39s | Max: 15m 39s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 208)

# Runner
171 linux-amd64-cpu16
16 linux-arm64-cpu16
12 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16

cub/cub/detail/type_traits.cuh Outdated Show resolved Hide resolved
cub/cub/thread/thread_operators.cuh Outdated Show resolved Hide resolved
cub/cub/thread/thread_operators.cuh Outdated Show resolved Hide resolved
cub/cub/thread/thread_operators.cuh Outdated Show resolved Hide resolved
@fbusato fbusato requested a review from a team as a code owner September 23, 2024 22:22
cub/cub/detail/type_traits.cuh Outdated Show resolved Hide resolved
cub/cub/detail/type_traits.cuh Outdated Show resolved Hide resolved
cub/cub/detail/type_traits.cuh Outdated Show resolved Hide resolved
cub/cub/detail/type_traits.cuh Outdated Show resolved Hide resolved
cub/cub/detail/type_traits.cuh Outdated Show resolved Hide resolved
cub/cub/detail/type_traits.cuh Outdated Show resolved Hide resolved
cub/cub/detail/type_traits.cuh Outdated Show resolved Hide resolved
cub/cub/detail/type_traits.cuh Outdated Show resolved Hide resolved
cub/cub/detail/type_traits.cuh Outdated Show resolved Hide resolved
@fbusato fbusato requested a review from a team as a code owner September 24, 2024 23:56
@fbusato fbusato requested a review from alliepiper September 24, 2024 23:56
@fbusato fbusato changed the title [DRAFT] Optimize, Cleanup, and Expose CUB Thread-Level Reduction Optimize, Cleanup, and Expose CUB Thread-Level Reduction Sep 25, 2024
@fbusato fbusato self-assigned this Sep 25, 2024
Copy link
Contributor

🟨 CI finished in 1h 50m: Pass: 93%/364 | Total: 6d 04h | Avg: 24m 28s | Max: 1h 13m | Hits: 77%/22755
  • 🟨 cub: Pass: 79%/104 | Total: 3d 06h | Avg: 45m 24s | Max: 1h 07m

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  78%/96  | Total:  2d 23h | Avg: 44m 37s | Max:  1h 07m
      🟩 arm64              Pass: 100%/8   | Total:  7h 17m | Avg: 54m 43s | Max: 57m 04s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 47m | Avg: 53m 49s | Max: 54m 21s
      🔍 nvcc               Pass:  79%/102 | Total:  3d 04h | Avg: 45m 14s | Max:  1h 07m
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  78%/96  | Total:  3d 03h | Avg: 47m 28s | Max:  1h 07m
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 20m 51s | Avg: 20m 51s | Max: 20m 51s
      🟩 GraphCapture       Pass: 100%/1   | Total: 19m 07s | Avg: 19m 07s | Max: 19m 07s
      🟩 HostLaunch         Pass: 100%/3   | Total: 52m 59s | Avg: 17m 39s | Max: 19m 44s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 11m | Avg: 23m 41s | Max: 25m 30s
    🚨 sm: 60;70;80;90 🚨
      🔥 60;70;80;90        Pass:   0%/3   | Total:  2h 07m | Avg: 42m 33s | Max: 45m 57s
      🟩 90a                Pass: 100%/4   | Total:  1h 39m | Avg: 24m 55s | Max: 27m 33s
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 47m | Avg: 53m 49s | Max: 54m 21s
      🟥 nvcc11.1           Pass:   0%/15  | Total:  5h 18m | Avg: 21m 13s | Max: 30m 32s
      🟥 nvcc11.8           Pass:   0%/3   | Total:  2h 07m | Avg: 42m 33s | Max: 45m 57s
      🟨 nvcc12.6           Pass:  96%/84  | Total:  2d 21h | Avg: 49m 37s | Max:  1h 07m
    🟨 cxx
      🟨 Clang9             Pass:  50%/6   | Total:  3h 51m | Avg: 38m 34s | Max:  1h 02m
      🟩 Clang10            Pass: 100%/3   | Total:  2h 36m | Avg: 52m 08s | Max: 52m 18s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 31m | Avg: 52m 49s | Max: 57m 27s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 41m | Avg: 55m 16s | Max:  1h 01m
      🟩 Clang13            Pass: 100%/4   | Total:  3h 25m | Avg: 51m 15s | Max: 51m 27s
      🟩 Clang14            Pass: 100%/4   | Total:  3h 24m | Avg: 51m 06s | Max: 51m 59s
      🟩 Clang15            Pass: 100%/4   | Total:  3h 27m | Avg: 51m 46s | Max: 53m 12s
      🟩 Clang16            Pass: 100%/4   | Total:  3h 36m | Avg: 54m 03s | Max: 58m 25s
      🟩 Clang17            Pass: 100%/4   | Total:  3h 39m | Avg: 54m 50s | Max: 58m 44s
      🟩 Clang18            Pass: 100%/9   | Total:  6h 55m | Avg: 46m 09s | Max: 54m 48s
      🟥 GCC6               Pass:   0%/2   | Total: 54m 56s | Avg: 27m 28s | Max: 28m 39s
      🟨 GCC7               Pass:  50%/6   | Total:  3h 43m | Avg: 37m 16s | Max: 55m 17s
      🟨 GCC8               Pass:  50%/6   | Total:  3h 38m | Avg: 36m 20s | Max: 56m 41s
      🟨 GCC9               Pass:  50%/6   | Total:  3h 39m | Avg: 36m 30s | Max: 53m 59s
      🟩 GCC10              Pass: 100%/4   | Total:  3h 33m | Avg: 53m 16s | Max: 57m 31s
      🟨 GCC11              Pass:  57%/7   | Total:  5h 46m | Avg: 49m 32s | Max: 59m 14s
      🟩 GCC12              Pass: 100%/4   | Total:  3h 46m | Avg: 56m 37s | Max:  1h 01m
      🟩 GCC13              Pass: 100%/16  | Total:  9h 08m | Avg: 34m 18s | Max: 57m 04s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 46m | Avg: 55m 33s | Max: 56m 15s
      🟥 MSVC14.16          Pass:   0%/1   | Total: 18m 12s | Avg: 18m 12s | Max: 18m 12s
      🟥 MSVC14.29          Pass:   0%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 07m
      🟥 MSVC14.39          Pass:   0%/1   | Total:  1h 06m | Avg:  1h 06m | Max:  1h 06m
    🟨 cxx_family
      🟨 Clang              Pass:  93%/46  | Total:  1d 14h | Avg: 49m 44s | Max:  1h 02m
      🟨 GCC                Pass:  72%/51  | Total:  1d 10h | Avg: 40m 12s | Max:  1h 01m
      🟩 Intel              Pass: 100%/3   | Total:  2h 46m | Avg: 55m 33s | Max: 56m 15s
      🟥 MSVC               Pass:   0%/4   | Total:  3h 36m | Avg: 54m 09s | Max:  1h 07m
    🟨 gpu
      🟨 v100               Pass:  79%/104 | Total:  3d 06h | Avg: 45m 24s | Max:  1h 07m
    🟨 ctk
      🟥 11.1               Pass:   0%/15  | Total:  5h 18m | Avg: 21m 13s | Max: 30m 32s
      🟥 11.8               Pass:   0%/3   | Total:  2h 07m | Avg: 42m 33s | Max: 45m 57s
      🟨 12.6               Pass:  96%/86  | Total:  2d 23h | Avg: 49m 43s | Max:  1h 07m
    🟨 std
      🟨 11                 Pass:  78%/28  | Total: 21h 04m | Avg: 45m 08s | Max: 58m 25s
      🟨 14                 Pass:  70%/27  | Total: 20h 44m | Avg: 46m 06s | Max:  1h 04m
      🟨 17                 Pass:  76%/26  | Total: 19h 41m | Avg: 45m 25s | Max:  1h 07m
      🟨 20                 Pass:  95%/23  | Total: 17h 12m | Avg: 44m 52s | Max:  1h 06m
    
  • 🟨 thrust: Pass: 96%/103 | Total: 2d 05h | Avg: 31m 25s | Max: 1h 13m | Hits: 61%/11150

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  95%/95  | Total:  2d 01h | Avg: 31m 26s | Max:  1h 13m | Hits:  61%/11150 
      🟩 arm64              Pass: 100%/8   | Total:  4h 10m | Avg: 31m 18s | Max: 35m 36s
    🔍 ctk: 11.1 🔍
      🔍 11.1               Pass:  73%/15  | Total:  6h 00m | Avg: 24m 00s | Max:  1h 05m | Hits:  51%/2230  
      🟩 11.8               Pass: 100%/3   | Total:  2h 08m | Avg: 42m 53s | Max: 51m 57s
      🟩 12.6               Pass: 100%/85  | Total:  1d 21h | Avg: 32m 20s | Max:  1h 13m | Hits:  63%/8920  
    🔍 cudacxx: nvcc11.1 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 51m 04s | Avg: 25m 32s | Max: 25m 57s
      🔍 nvcc11.1           Pass:  73%/15  | Total:  6h 00m | Avg: 24m 00s | Max:  1h 05m | Hits:  51%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 08m | Avg: 42m 53s | Max: 51m 57s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  1d 20h | Avg: 32m 29s | Max:  1h 13m | Hits:  63%/8920  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 51m 04s | Avg: 25m 32s | Max: 25m 57s
      🔍 nvcc               Pass:  96%/101 | Total:  2d 05h | Avg: 31m 32s | Max:  1h 13m | Hits:  61%/11150 
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  95%/96  | Total:  2d 04h | Avg: 32m 34s | Max:  1h 13m | Hits:  51%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 48m 47s | Avg: 12m 11s | Max: 25m 08s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 01m | Avg: 20m 30s | Max: 29m 49s
    🔍 std: 17 🔍
      🟩 11                 Pass: 100%/28  | Total: 12h 34m | Avg: 26m 56s | Max: 38m 11s
      🟩 14                 Pass: 100%/27  | Total: 16h 04m | Avg: 35m 43s | Max:  1h 05m | Hits:  55%/4460  
      🔍 17                 Pass:  84%/26  | Total: 13h 52m | Avg: 32m 01s | Max:  1h 09m | Hits:  49%/2230  
      🟩 20                 Pass: 100%/22  | Total: 11h 26m | Avg: 31m 10s | Max:  1h 13m | Hits:  73%/4460  
    🟨 cxx
      🟨 Clang9             Pass:  83%/6   | Total:  2h 30m | Avg: 25m 09s | Max: 34m 05s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 40m | Avg: 33m 33s | Max: 34m 46s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 10m | Avg: 32m 39s | Max: 39m 44s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 04m | Avg: 31m 11s | Max: 33m 17s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 03m | Avg: 30m 51s | Max: 32m 48s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 09m | Avg: 32m 27s | Max: 33m 02s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 14m | Avg: 33m 34s | Max: 38m 22s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 06m | Avg: 31m 36s | Max: 34m 26s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 09m | Avg: 32m 28s | Max: 38m 06s
      🟩 Clang18            Pass: 100%/9   | Total:  3h 49m | Avg: 25m 30s | Max: 33m 34s
      🟩 GCC6               Pass: 100%/2   | Total: 53m 52s | Avg: 26m 56s | Max: 29m 38s
      🟨 GCC7               Pass:  83%/6   | Total:  2h 39m | Avg: 26m 37s | Max: 36m 50s
      🟨 GCC8               Pass:  83%/6   | Total:  2h 42m | Avg: 27m 02s | Max: 37m 42s
      🟨 GCC9               Pass:  83%/6   | Total:  2h 41m | Avg: 26m 54s | Max: 41m 34s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 11m | Avg: 32m 57s | Max: 36m 05s
      🟩 GCC11              Pass: 100%/7   | Total:  4h 32m | Avg: 38m 53s | Max: 51m 57s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 27m | Avg: 36m 49s | Max: 42m 28s
      🟩 GCC13              Pass: 100%/14  | Total:  5h 45m | Avg: 24m 42s | Max: 36m 18s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 03m | Avg: 41m 17s | Max: 43m 58s
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 05m | Avg:  1h 05m | Max:  1h 05m | Hits:  51%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 09m | Hits:  54%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 38m | Avg: 49m 14s | Max:  1h 13m | Hits:  73%/4460  
    🟨 cxx_family
      🟨 Clang              Pass:  97%/46  | Total: 23h 00m | Avg: 30m 00s | Max: 39m 44s
      🟨 GCC                Pass:  93%/49  | Total: 23h 54m | Avg: 29m 16s | Max: 51m 57s
      🟩 Intel              Pass: 100%/3   | Total:  2h 03m | Avg: 41m 17s | Max: 43m 58s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 58m | Avg: 59m 39s | Max:  1h 13m | Hits:  61%/11150 
    🟨 gpu
      🟨 v100               Pass:  96%/103 | Total:  2d 05h | Avg: 31m 25s | Max:  1h 13m | Hits:  61%/11150 
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 08m | Avg: 42m 53s | Max: 51m 57s
      🟩 90a                Pass: 100%/4   | Total:  1h 26m | Avg: 21m 40s | Max: 25m 37s
    
  • 🟩 libcudacxx: Pass: 100%/104 | Total: 12h 22m | Avg: 7m 08s | Max: 32m 42s | Hits: 93%/11383

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 11h 48m | Avg:  7m 23s | Max: 32m 42s | Hits:  93%/11383 
      🟩 arm64              Pass: 100%/8   | Total: 33m 36s | Avg:  4m 12s | Max:  4m 33s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 07m | Avg:  4m 31s | Max: 20m 28s | Hits:  99%/2648  
      🟩 11.8               Pass: 100%/3   | Total: 53m 53s | Avg: 17m 57s | Max: 26m 41s
      🟩 12.6               Pass: 100%/86  | Total: 10h 20m | Avg:  7m 13s | Max: 32m 42s | Hits:  91%/8735  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 37m 42s | Avg: 18m 51s | Max: 19m 58s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 07m | Avg:  4m 31s | Max: 20m 28s | Hits:  99%/2648  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 53m 53s | Avg: 17m 57s | Max: 26m 41s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  9h 43m | Avg:  6m 56s | Max: 32m 42s | Hits:  91%/8735  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 37m 42s | Avg: 18m 51s | Max: 19m 58s
      🟩 nvcc               Pass: 100%/102 | Total: 11h 44m | Avg:  6m 54s | Max: 32m 42s | Hits:  93%/11383 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 26m 59s | Avg:  4m 29s | Max:  5m 51s
      🟩 Clang10            Pass: 100%/3   | Total: 16m 31s | Avg:  5m 30s | Max:  5m 51s
      🟩 Clang11            Pass: 100%/4   | Total: 19m 22s | Avg:  4m 50s | Max:  5m 26s
      🟩 Clang12            Pass: 100%/4   | Total: 19m 06s | Avg:  4m 46s | Max:  5m 12s
      🟩 Clang13            Pass: 100%/4   | Total: 19m 02s | Avg:  4m 45s | Max:  5m 03s
      🟩 Clang14            Pass: 100%/4   | Total: 18m 17s | Avg:  4m 34s | Max:  4m 50s
      🟩 Clang15            Pass: 100%/4   | Total: 19m 28s | Avg:  4m 52s | Max:  5m 00s
      🟩 Clang16            Pass: 100%/4   | Total: 19m 02s | Avg:  4m 45s | Max:  5m 16s
      🟩 Clang17            Pass: 100%/4   | Total: 17m 43s | Avg:  4m 25s | Max:  4m 55s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 17m | Avg:  9m 37s | Max: 19m 58s
      🟩 GCC6               Pass: 100%/2   | Total:  6m 17s | Avg:  3m 08s | Max:  3m 20s
      🟩 GCC7               Pass: 100%/6   | Total: 21m 34s | Avg:  3m 35s | Max:  4m 25s
      🟩 GCC8               Pass: 100%/6   | Total: 23m 26s | Avg:  3m 54s | Max:  4m 43s
      🟩 GCC9               Pass: 100%/6   | Total: 22m 37s | Avg:  3m 46s | Max:  4m 11s
      🟩 GCC10              Pass: 100%/4   | Total: 17m 45s | Avg:  4m 26s | Max:  4m 52s
      🟩 GCC11              Pass: 100%/7   | Total:  1h 11m | Avg: 10m 09s | Max: 26m 41s
      🟩 GCC12              Pass: 100%/4   | Total: 17m 42s | Avg:  4m 25s | Max:  4m 58s
      🟩 GCC13              Pass: 100%/17  | Total:  3h 33m | Avg: 12m 33s | Max: 32m 42s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 18m 27s | Avg:  6m 09s | Max:  6m 25s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 20m 28s | Avg: 20m 28s | Max: 20m 28s | Hits:  99%/2648  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 30m 25s | Avg: 15m 12s | Max: 15m 14s | Hits:  99%/5658  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 26m 41s | Avg: 26m 41s | Max: 26m 41s | Hits:  76%/3077  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/45  | Total:  4h 12m | Avg:  5m 36s | Max: 19m 58s
      🟩 GCC                Pass: 100%/52  | Total:  6h 34m | Avg:  7m 34s | Max: 32m 42s
      🟩 Intel              Pass: 100%/3   | Total: 18m 27s | Avg:  6m 09s | Max:  6m 25s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 17m | Avg: 19m 23s | Max: 26m 41s | Hits:  93%/11383 
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 12h 22m | Avg:  7m 08s | Max: 32m 42s | Hits:  93%/11383 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  9h 11m | Avg:  5m 44s | Max: 26m 41s | Hits:  93%/11383 
      🟩 NVRTC              Pass: 100%/4   | Total:  2h 02m | Avg: 30m 31s | Max: 32m 42s
      🟩 Test               Pass: 100%/3   | Total:  1h 06m | Avg: 22m 03s | Max: 27m 28s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 38s | Avg:  2m 38s | Max:  2m 38s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 53m 53s | Avg: 17m 57s | Max: 26m 41s
      🟩 90a                Pass: 100%/4   | Total: 16m 36s | Avg:  4m 09s | Max:  4m 31s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 54m | Avg:  6m 12s | Max: 26m 52s
      🟩 14                 Pass: 100%/28  | Total:  3h 15m | Avg:  6m 59s | Max: 30m 38s | Hits:  99%/5397  
      🟩 17                 Pass: 100%/27  | Total:  2h 56m | Avg:  6m 32s | Max: 32m 42s | Hits:  99%/2909  
      🟩 20                 Pass: 100%/20  | Total:  3h 13m | Avg:  9m 40s | Max: 31m 55s | Hits:  76%/3077  
    
  • 🟩 cudax: Pass: 100%/52 | Total: 3h 08m | Avg: 3m 38s | Max: 11m 39s | Hits: 89%/222

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  2h 56m | Avg:  3m 40s | Max: 11m 39s | Hits:  89%/222   
      🟩 arm64              Pass: 100%/4   | Total: 12m 16s | Avg:  3m 04s | Max:  3m 24s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  1h 08m | Avg:  3m 36s | Max: 10m 15s | Hits:  89%/111   
      🟩 12.6               Pass: 100%/33  | Total:  2h 00m | Avg:  3m 39s | Max: 11m 39s | Hits:  90%/111   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  1h 08m | Avg:  3m 36s | Max: 10m 15s | Hits:  89%/111   
      🟩 nvcc12.6           Pass: 100%/33  | Total:  2h 00m | Avg:  3m 39s | Max: 11m 39s | Hits:  90%/111   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/52  | Total:  3h 08m | Avg:  3m 38s | Max: 11m 39s | Hits:  89%/222   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  7m 05s | Avg:  3m 32s | Max:  3m 35s
      🟩 Clang10            Pass: 100%/2   | Total:  6m 59s | Avg:  3m 29s | Max:  3m 34s
      🟩 Clang11            Pass: 100%/4   | Total: 12m 43s | Avg:  3m 10s | Max:  3m 38s
      🟩 Clang12            Pass: 100%/4   | Total: 12m 59s | Avg:  3m 14s | Max:  3m 36s
      🟩 Clang13            Pass: 100%/4   | Total: 13m 52s | Avg:  3m 28s | Max:  3m 42s
      🟩 Clang14            Pass: 100%/4   | Total: 12m 56s | Avg:  3m 14s | Max:  4m 00s
      🟩 Clang15            Pass: 100%/2   | Total:  6m 42s | Avg:  3m 21s | Max:  3m 35s
      🟩 Clang16            Pass: 100%/4   | Total: 13m 16s | Avg:  3m 19s | Max:  3m 29s
      🟩 Clang17            Pass: 100%/2   | Total:  7m 12s | Avg:  3m 36s | Max:  3m 43s
      🟩 Clang18            Pass: 100%/2   | Total:  7m 35s | Avg:  3m 47s | Max:  4m 32s
      🟩 GCC9               Pass: 100%/2   | Total:  6m 55s | Avg:  3m 27s | Max:  3m 38s
      🟩 GCC10              Pass: 100%/4   | Total: 12m 46s | Avg:  3m 11s | Max:  3m 30s
      🟩 GCC11              Pass: 100%/4   | Total: 13m 43s | Avg:  3m 25s | Max:  3m 41s
      🟩 GCC12              Pass: 100%/7   | Total: 23m 41s | Avg:  3m 23s | Max:  4m 25s
      🟩 GCC13              Pass: 100%/3   | Total:  8m 41s | Avg:  2m 53s | Max:  3m 02s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 15s | Avg: 10m 15s | Max: 10m 15s | Hits:  89%/111   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 39s | Avg: 11m 39s | Max: 11m 39s | Hits:  90%/111   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 41m | Avg:  3m 22s | Max:  4m 32s
      🟩 GCC                Pass: 100%/20  | Total:  1h 05m | Avg:  3m 17s | Max:  4m 25s
      🟩 MSVC               Pass: 100%/2   | Total: 21m 54s | Avg: 10m 57s | Max: 11m 39s | Hits:  89%/222   
    🟩 gpu
      🟩 v100               Pass: 100%/52  | Total:  3h 08m | Avg:  3m 38s | Max: 11m 39s | Hits:  89%/222   
    🟩 jobs
      🟩 Build              Pass: 100%/47  | Total:  2h 48m | Avg:  3m 34s | Max: 11m 39s | Hits:  89%/222   
      🟩 Test               Pass: 100%/5   | Total: 20m 59s | Avg:  4m 11s | Max:  4m 32s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 16s | Avg:  3m 16s | Max:  3m 16s
      🟩 90a                Pass: 100%/1   | Total:  3m 02s | Avg:  3m 02s | Max:  3m 02s
    🟩 std
      🟩 17                 Pass: 100%/28  | Total:  1h 33m | Avg:  3m 20s | Max:  4m 25s
      🟩 20                 Pass: 100%/24  | Total:  1h 35m | Avg:  3m 58s | Max: 11m 39s | Hits:  89%/222   
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 16m 58s | Avg: 16m 58s | Max: 16m 58s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 16m 58s | Avg: 16m 58s | Max: 16m 58s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 16m 58s | Avg: 16m 58s | Max: 16m 58s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 16m 58s | Avg: 16m 58s | Max: 16m 58s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 16m 58s | Avg: 16m 58s | Max: 16m 58s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 16m 58s | Avg: 16m 58s | Max: 16m 58s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 16m 58s | Avg: 16m 58s | Max: 16m 58s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 16m 58s | Avg: 16m 58s | Max: 16m 58s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 16m 58s | Avg: 16m 58s | Max: 16m 58s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 364)

# Runner
297 linux-amd64-cpu16
28 linux-arm64-cpu16
24 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

Copy link
Contributor

🟨 CI finished in 2h 18m: Pass: 93%/364 | Total: 5d 17h | Avg: 22m 40s | Max: 1h 03m | Hits: 85%/22755
  • 🟨 cub: Pass: 79%/104 | Total: 3d 02h | Avg: 43m 03s | Max: 1h 03m

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  78%/96  | Total:  2d 19h | Avg: 41m 58s | Max:  1h 03m
      🟩 arm64              Pass: 100%/8   | Total:  7h 28m | Avg: 56m 01s | Max:  1h 01m
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 47m | Avg: 53m 57s | Max: 54m 37s
      🔍 nvcc               Pass:  79%/102 | Total:  3d 00h | Avg: 42m 50s | Max:  1h 03m
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  78%/96  | Total:  2d 23h | Avg: 44m 58s | Max:  1h 03m
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 18m 14s | Avg: 18m 14s | Max: 18m 14s
      🟩 GraphCapture       Pass: 100%/1   | Total: 13m 42s | Avg: 13m 42s | Max: 13m 42s
      🟩 HostLaunch         Pass: 100%/3   | Total: 53m 22s | Avg: 17m 47s | Max: 21m 34s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 15m | Avg: 25m 05s | Max: 26m 31s
    🚨 sm: 60;70;80;90 🚨
      🔥 60;70;80;90        Pass:   0%/3   | Total:  1h 59m | Avg: 39m 55s | Max: 40m 33s
      🟩 90a                Pass: 100%/4   | Total:  1h 28m | Avg: 22m 02s | Max: 24m 36s
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 47m | Avg: 53m 57s | Max: 54m 37s
      🟥 nvcc11.1           Pass:   0%/15  | Total:  5h 11m | Avg: 20m 47s | Max: 28m 57s
      🟥 nvcc11.8           Pass:   0%/3   | Total:  1h 59m | Avg: 39m 55s | Max: 40m 33s
      🟨 nvcc12.6           Pass:  96%/84  | Total:  2d 17h | Avg: 46m 53s | Max:  1h 03m
    🟨 cxx
      🟨 Clang9             Pass:  50%/6   | Total:  3h 29m | Avg: 34m 51s | Max: 50m 00s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 35m | Avg: 51m 57s | Max: 56m 44s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 24m | Avg: 51m 02s | Max: 57m 13s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 18m | Avg: 49m 35s | Max: 50m 37s
      🟩 Clang13            Pass: 100%/4   | Total:  3h 16m | Avg: 49m 04s | Max: 49m 18s
      🟩 Clang14            Pass: 100%/4   | Total:  3h 15m | Avg: 48m 58s | Max: 49m 27s
      🟩 Clang15            Pass: 100%/4   | Total:  3h 16m | Avg: 49m 02s | Max: 49m 44s
      🟩 Clang16            Pass: 100%/4   | Total:  3h 20m | Avg: 50m 01s | Max: 53m 21s
      🟩 Clang17            Pass: 100%/4   | Total:  3h 16m | Avg: 49m 05s | Max: 49m 29s
      🟩 Clang18            Pass: 100%/9   | Total:  7h 11m | Avg: 47m 53s | Max:  1h 00m
      🟥 GCC6               Pass:   0%/2   | Total: 54m 31s | Avg: 27m 15s | Max: 28m 08s
      🟨 GCC7               Pass:  50%/6   | Total:  3h 26m | Avg: 34m 24s | Max: 49m 16s
      🟨 GCC8               Pass:  50%/6   | Total:  3h 28m | Avg: 34m 41s | Max: 49m 16s
      🟨 GCC9               Pass:  50%/6   | Total:  3h 30m | Avg: 35m 07s | Max: 51m 57s
      🟩 GCC10              Pass: 100%/4   | Total:  3h 24m | Avg: 51m 12s | Max: 54m 57s
      🟨 GCC11              Pass:  57%/7   | Total:  5h 17m | Avg: 45m 21s | Max: 49m 56s
      🟩 GCC12              Pass: 100%/4   | Total:  3h 20m | Avg: 50m 10s | Max: 50m 32s
      🟩 GCC13              Pass: 100%/16  | Total:  8h 51m | Avg: 33m 13s | Max:  1h 01m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 38m | Avg: 52m 55s | Max: 53m 34s
      🟥 MSVC14.16          Pass:   0%/1   | Total: 17m 34s | Avg: 17m 34s | Max: 17m 34s
      🟥 MSVC14.29          Pass:   0%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 00m
      🟥 MSVC14.39          Pass:   0%/1   | Total:  1h 03m | Avg:  1h 03m | Max:  1h 03m
    🟨 cxx_family
      🟨 Clang              Pass:  93%/46  | Total:  1d 12h | Avg: 47m 27s | Max:  1h 00m
      🟨 GCC                Pass:  72%/51  | Total:  1d 08h | Avg: 37m 55s | Max:  1h 01m
      🟩 Intel              Pass: 100%/3   | Total:  2h 38m | Avg: 52m 55s | Max: 53m 34s
      🟥 MSVC               Pass:   0%/4   | Total:  3h 21m | Avg: 50m 25s | Max:  1h 03m
    🟨 gpu
      🟨 v100               Pass:  79%/104 | Total:  3d 02h | Avg: 43m 03s | Max:  1h 03m
    🟨 ctk
      🟥 11.1               Pass:   0%/15  | Total:  5h 11m | Avg: 20m 47s | Max: 28m 57s
      🟥 11.8               Pass:   0%/3   | Total:  1h 59m | Avg: 39m 55s | Max: 40m 33s
      🟨 12.6               Pass:  96%/86  | Total:  2d 19h | Avg: 47m 03s | Max:  1h 03m
    🟨 std
      🟨 11                 Pass:  78%/28  | Total: 20h 02m | Avg: 42m 56s | Max:  1h 00m
      🟨 14                 Pass:  70%/27  | Total: 19h 41m | Avg: 43m 45s | Max:  1h 00m
      🟨 17                 Pass:  76%/26  | Total: 18h 34m | Avg: 42m 52s | Max:  1h 00m
      🟨 20                 Pass:  95%/23  | Total: 16h 20m | Avg: 42m 37s | Max:  1h 03m
    
  • 🟨 thrust: Pass: 96%/103 | Total: 2d 01h | Avg: 28m 59s | Max: 1h 02m | Hits: 71%/11150

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  95%/95  | Total:  1d 21h | Avg: 28m 53s | Max:  1h 02m | Hits:  71%/11150 
      🟩 arm64              Pass: 100%/8   | Total:  4h 02m | Avg: 30m 15s | Max: 33m 31s
    🔍 ctk: 11.1 🔍
      🔍 11.1               Pass:  73%/15  | Total:  5h 44m | Avg: 22m 56s | Max: 55m 49s | Hits:  65%/2230  
      🟩 11.8               Pass: 100%/3   | Total:  1h 56m | Avg: 38m 43s | Max: 42m 36s
      🟩 12.6               Pass: 100%/85  | Total:  1d 18h | Avg: 29m 43s | Max:  1h 02m | Hits:  73%/8920  
    🔍 cudacxx: nvcc11.1 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 50m 00s | Avg: 25m 00s | Max: 25m 34s
      🔍 nvcc11.1           Pass:  73%/15  | Total:  5h 44m | Avg: 22m 56s | Max: 55m 49s | Hits:  65%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 56m | Avg: 38m 43s | Max: 42m 36s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  1d 17h | Avg: 29m 49s | Max:  1h 02m | Hits:  73%/8920  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 50m 00s | Avg: 25m 00s | Max: 25m 34s
      🔍 nvcc               Pass:  96%/101 | Total:  2d 00h | Avg: 29m 04s | Max:  1h 02m | Hits:  71%/11150 
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  95%/96  | Total:  2d 00h | Avg: 30m 13s | Max:  1h 02m | Hits:  65%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 46m 01s | Avg: 11m 30s | Max: 24m 23s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total: 39m 14s | Avg: 13m 04s | Max: 14m 59s
    🔍 std: 17 🔍
      🟩 11                 Pass: 100%/28  | Total: 11h 44m | Avg: 25m 09s | Max: 33m 40s
      🟩 14                 Pass: 100%/27  | Total: 14h 52m | Avg: 33m 03s | Max:  1h 02m | Hits:  65%/4460  
      🔍 17                 Pass:  84%/26  | Total: 12h 37m | Avg: 29m 09s | Max: 57m 14s | Hits:  65%/2230  
      🟩 20                 Pass: 100%/22  | Total: 10h 30m | Avg: 28m 40s | Max: 59m 47s | Hits:  82%/4460  
    🟨 cxx
      🟨 Clang9             Pass:  83%/6   | Total:  2h 26m | Avg: 24m 23s | Max: 32m 55s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 32m | Avg: 30m 43s | Max: 33m 38s
      🟩 Clang11            Pass: 100%/4   | Total:  1h 58m | Avg: 29m 41s | Max: 31m 49s
      🟩 Clang12            Pass: 100%/4   | Total:  1h 59m | Avg: 29m 51s | Max: 31m 49s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 58m | Avg: 29m 42s | Max: 31m 41s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 58m | Avg: 29m 37s | Max: 31m 36s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 03m | Avg: 30m 59s | Max: 35m 44s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 01m | Avg: 30m 17s | Max: 32m 54s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 00m | Avg: 30m 06s | Max: 32m 53s
      🟩 Clang18            Pass: 100%/9   | Total:  3h 35m | Avg: 23m 57s | Max: 32m 15s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 01m | Avg: 30m 35s | Max: 33m 38s
      🟨 GCC7               Pass:  83%/6   | Total:  2h 28m | Avg: 24m 49s | Max: 32m 41s
      🟨 GCC8               Pass:  83%/6   | Total:  2h 27m | Avg: 24m 33s | Max: 32m 21s
      🟨 GCC9               Pass:  83%/6   | Total:  2h 29m | Avg: 24m 52s | Max: 33m 39s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 08m | Avg: 32m 14s | Max: 34m 30s
      🟩 GCC11              Pass: 100%/7   | Total:  4h 07m | Avg: 35m 24s | Max: 42m 36s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 09m | Avg: 32m 18s | Max: 35m 09s
      🟩 GCC13              Pass: 100%/14  | Total:  5h 07m | Avg: 21m 55s | Max: 37m 37s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 51m | Avg: 37m 16s | Max: 40m 59s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 55m 49s | Avg: 55m 49s | Max: 55m 49s | Hits:  65%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 59m | Avg: 59m 48s | Max:  1h 02m | Hits:  65%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 24m | Avg: 42m 05s | Max: 59m 47s | Hits:  82%/4460  
    🟨 cxx_family
      🟨 Clang              Pass:  97%/46  | Total: 21h 35m | Avg: 28m 09s | Max: 35m 44s
      🟨 GCC                Pass:  93%/49  | Total: 21h 59m | Avg: 26m 56s | Max: 42m 36s
      🟩 Intel              Pass: 100%/3   | Total:  1h 51m | Avg: 37m 16s | Max: 40m 59s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 19m | Avg: 51m 55s | Max:  1h 02m | Hits:  71%/11150 
    🟨 gpu
      🟨 v100               Pass:  96%/103 | Total:  2d 01h | Avg: 28m 59s | Max:  1h 02m | Hits:  71%/11150 
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 56m | Avg: 38m 43s | Max: 42m 36s
      🟩 90a                Pass: 100%/4   | Total:  1h 12m | Avg: 18m 11s | Max: 20m 11s
    
  • 🟩 libcudacxx: Pass: 100%/104 | Total: 10h 10m | Avg: 5m 52s | Max: 32m 22s | Hits: 99%/11383

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total:  9h 41m | Avg:  6m 03s | Max: 32m 22s | Hits:  99%/11383 
      🟩 arm64              Pass: 100%/8   | Total: 28m 56s | Avg:  3m 37s | Max:  4m 18s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 01m | Avg:  4m 04s | Max: 19m 51s | Hits:  99%/2648  
      🟩 11.8               Pass: 100%/3   | Total:  9m 44s | Avg:  3m 14s | Max:  3m 30s
      🟩 12.6               Pass: 100%/86  | Total:  8h 59m | Avg:  6m 16s | Max: 32m 22s | Hits:  99%/8735  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 37m 30s | Avg: 18m 45s | Max: 19m 24s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 01m | Avg:  4m 04s | Max: 19m 51s | Hits:  99%/2648  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  9m 44s | Avg:  3m 14s | Max:  3m 30s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  8h 22m | Avg:  5m 58s | Max: 32m 22s | Hits:  99%/8735  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 37m 30s | Avg: 18m 45s | Max: 19m 24s
      🟩 nvcc               Pass: 100%/102 | Total:  9h 33m | Avg:  5m 37s | Max: 32m 22s | Hits:  99%/11383 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 25m 05s | Avg:  4m 10s | Max:  6m 04s
      🟩 Clang10            Pass: 100%/3   | Total: 15m 19s | Avg:  5m 06s | Max:  5m 38s
      🟩 Clang11            Pass: 100%/4   | Total: 15m 53s | Avg:  3m 58s | Max:  4m 14s
      🟩 Clang12            Pass: 100%/4   | Total: 16m 01s | Avg:  4m 00s | Max:  4m 16s
      🟩 Clang13            Pass: 100%/4   | Total: 16m 35s | Avg:  4m 08s | Max:  4m 29s
      🟩 Clang14            Pass: 100%/4   | Total: 16m 35s | Avg:  4m 08s | Max:  4m 22s
      🟩 Clang15            Pass: 100%/4   | Total: 16m 29s | Avg:  4m 07s | Max:  4m 23s
      🟩 Clang16            Pass: 100%/4   | Total: 17m 04s | Avg:  4m 16s | Max:  5m 01s
      🟩 Clang17            Pass: 100%/4   | Total: 16m 10s | Avg:  4m 02s | Max:  4m 26s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 17m | Avg:  9m 38s | Max: 20m 16s
      🟩 GCC6               Pass: 100%/2   | Total:  6m 10s | Avg:  3m 05s | Max:  3m 08s
      🟩 GCC7               Pass: 100%/6   | Total: 19m 03s | Avg:  3m 10s | Max:  3m 43s
      🟩 GCC8               Pass: 100%/6   | Total: 19m 49s | Avg:  3m 18s | Max:  3m 51s
      🟩 GCC9               Pass: 100%/6   | Total: 18m 50s | Avg:  3m 08s | Max:  3m 47s
      🟩 GCC10              Pass: 100%/4   | Total: 14m 33s | Avg:  3m 38s | Max:  3m 54s
      🟩 GCC11              Pass: 100%/7   | Total: 24m 11s | Avg:  3m 27s | Max:  3m 48s
      🟩 GCC12              Pass: 100%/4   | Total: 14m 41s | Avg:  3m 40s | Max:  3m 56s
      🟩 GCC13              Pass: 100%/17  | Total:  3h 03m | Avg: 10m 46s | Max: 32m 22s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 15m 45s | Avg:  5m 15s | Max:  5m 36s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 19m 51s | Avg: 19m 51s | Max: 19m 51s | Hits:  99%/2648  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 26m 56s | Avg: 13m 28s | Max: 13m 42s | Hits:  99%/5658  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 15m 20s | Avg: 15m 20s | Max: 15m 20s | Hits:  99%/3077  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/45  | Total:  3h 52m | Avg:  5m 09s | Max: 20m 16s
      🟩 GCC                Pass: 100%/52  | Total:  5h 00m | Avg:  5m 46s | Max: 32m 22s
      🟩 Intel              Pass: 100%/3   | Total: 15m 45s | Avg:  5m 15s | Max:  5m 36s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 02m | Avg: 15m 31s | Max: 19m 51s | Hits:  99%/11383 
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 10h 10m | Avg:  5m 52s | Max: 32m 22s | Hits:  99%/11383 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  7h 22m | Avg:  4m 36s | Max: 19m 51s | Hits:  99%/11383 
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 45m | Avg: 26m 17s | Max: 32m 22s
      🟩 Test               Pass: 100%/3   | Total:  1h 00m | Avg: 20m 12s | Max: 23m 18s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 07s | Avg:  2m 07s | Max:  2m 07s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  9m 44s | Avg:  3m 14s | Max:  3m 30s
      🟩 90a                Pass: 100%/4   | Total: 14m 28s | Avg:  3m 37s | Max:  3m 53s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 05m | Avg:  4m 29s | Max: 19m 40s
      🟩 14                 Pass: 100%/28  | Total:  2h 35m | Avg:  5m 33s | Max: 28m 56s | Hits:  99%/5397  
      🟩 17                 Pass: 100%/27  | Total:  2h 41m | Avg:  5m 59s | Max: 32m 22s | Hits:  99%/2909  
      🟩 20                 Pass: 100%/20  | Total:  2h 45m | Avg:  8m 16s | Max: 24m 13s | Hits:  99%/3077  
    
  • 🟩 cudax: Pass: 100%/52 | Total: 2h 42m | Avg: 3m 07s | Max: 10m 32s | Hits: 89%/222

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  2h 31m | Avg:  3m 09s | Max: 10m 32s | Hits:  89%/222   
      🟩 arm64              Pass: 100%/4   | Total: 10m 56s | Avg:  2m 44s | Max:  3m 02s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  1h 00m | Avg:  3m 12s | Max: 10m 08s | Hits:  89%/111   
      🟩 12.6               Pass: 100%/33  | Total:  1h 41m | Avg:  3m 05s | Max: 10m 32s | Hits:  89%/111   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  1h 00m | Avg:  3m 12s | Max: 10m 08s | Hits:  89%/111   
      🟩 nvcc12.6           Pass: 100%/33  | Total:  1h 41m | Avg:  3m 05s | Max: 10m 32s | Hits:  89%/111   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/52  | Total:  2h 42m | Avg:  3m 07s | Max: 10m 32s | Hits:  89%/222   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  5m 23s | Avg:  2m 41s | Max:  2m 45s
      🟩 Clang10            Pass: 100%/2   | Total:  5m 29s | Avg:  2m 44s | Max:  2m 53s
      🟩 Clang11            Pass: 100%/4   | Total: 10m 23s | Avg:  2m 35s | Max:  2m 40s
      🟩 Clang12            Pass: 100%/4   | Total: 10m 41s | Avg:  2m 40s | Max:  2m 56s
      🟩 Clang13            Pass: 100%/4   | Total: 10m 43s | Avg:  2m 40s | Max:  2m 50s
      🟩 Clang14            Pass: 100%/4   | Total: 11m 39s | Avg:  2m 54s | Max:  3m 44s
      🟩 Clang15            Pass: 100%/2   | Total:  5m 26s | Avg:  2m 43s | Max:  2m 45s
      🟩 Clang16            Pass: 100%/4   | Total: 10m 19s | Avg:  2m 34s | Max:  2m 45s
      🟩 Clang17            Pass: 100%/2   | Total:  5m 20s | Avg:  2m 40s | Max:  2m 41s
      🟩 Clang18            Pass: 100%/2   | Total:  7m 30s | Avg:  3m 45s | Max:  4m 42s
      🟩 GCC9               Pass: 100%/2   | Total:  5m 23s | Avg:  2m 41s | Max:  2m 54s
      🟩 GCC10              Pass: 100%/4   | Total: 10m 21s | Avg:  2m 35s | Max:  2m 45s
      🟩 GCC11              Pass: 100%/4   | Total: 10m 32s | Avg:  2m 38s | Max:  2m 43s
      🟩 GCC12              Pass: 100%/7   | Total: 24m 44s | Avg:  3m 32s | Max:  5m 10s
      🟩 GCC13              Pass: 100%/3   | Total:  8m 19s | Avg:  2m 46s | Max:  3m 02s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 08s | Avg: 10m 08s | Max: 10m 08s | Hits:  89%/111   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 32s | Avg: 10m 32s | Max: 10m 32s | Hits:  89%/111   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 22m | Avg:  2m 45s | Max:  4m 42s
      🟩 GCC                Pass: 100%/20  | Total: 59m 19s | Avg:  2m 57s | Max:  5m 10s
      🟩 MSVC               Pass: 100%/2   | Total: 20m 40s | Avg: 10m 20s | Max: 10m 32s | Hits:  89%/222   
    🟩 gpu
      🟩 v100               Pass: 100%/52  | Total:  2h 42m | Avg:  3m 07s | Max: 10m 32s | Hits:  89%/222   
    🟩 jobs
      🟩 Build              Pass: 100%/47  | Total:  2h 19m | Avg:  2m 58s | Max: 10m 32s | Hits:  89%/222   
      🟩 Test               Pass: 100%/5   | Total: 22m 59s | Avg:  4m 35s | Max:  5m 10s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 16s | Avg:  2m 16s | Max:  2m 16s
      🟩 90a                Pass: 100%/1   | Total:  2m 19s | Avg:  2m 19s | Max:  2m 19s
    🟩 std
      🟩 17                 Pass: 100%/28  | Total:  1h 18m | Avg:  2m 48s | Max:  5m 10s
      🟩 20                 Pass: 100%/24  | Total:  1h 24m | Avg:  3m 30s | Max: 10m 32s | Hits:  89%/222   
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 13m 59s | Avg: 13m 59s | Max: 13m 59s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 13m 59s | Avg: 13m 59s | Max: 13m 59s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 13m 59s | Avg: 13m 59s | Max: 13m 59s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 13m 59s | Avg: 13m 59s | Max: 13m 59s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 13m 59s | Avg: 13m 59s | Max: 13m 59s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 13m 59s | Avg: 13m 59s | Max: 13m 59s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 13m 59s | Avg: 13m 59s | Max: 13m 59s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 13m 59s | Avg: 13m 59s | Max: 13m 59s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 13m 59s | Avg: 13m 59s | Max: 13m 59s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 364)

# Runner
297 linux-amd64-cpu16
28 linux-arm64-cpu16
24 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

Copy link
Contributor

🟨 CI finished in 1h 29m: Pass: 98%/364 | Total: 6d 07h | Avg: 24m 54s | Max: 1h 12m | Hits: 85%/22755
  • 🟨 cub: Pass: 96%/104 | Total: 3d 12h | Avg: 48m 47s | Max: 1h 12m

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  95%/96  | Total:  3d 05h | Avg: 48m 24s | Max:  1h 12m
      🟩 arm64              Pass: 100%/8   | Total:  7h 07m | Avg: 53m 24s | Max: 55m 10s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m
      🔍 nvcc               Pass:  96%/102 | Total:  3d 10h | Avg: 48m 34s | Max:  1h 12m
    🚨 cxx_family: MSVC 🚨
      🟩 Clang              Pass: 100%/46  | Total:  1d 14h | Avg: 50m 16s | Max:  1h 01m
      🟩 GCC                Pass: 100%/51  | Total:  1d 15h | Avg: 45m 54s | Max:  1h 12m
      🟩 Intel              Pass: 100%/3   | Total:  2h 51m | Avg: 57m 19s | Max:  1h 00m
      🔥 MSVC               Pass:   0%/4   | Total:  4h 09m | Avg:  1h 02m | Max:  1h 03m
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  95%/96  | Total:  3d 10h | Avg: 51m 18s | Max:  1h 12m
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 15m 28s | Avg: 15m 28s | Max: 15m 28s
      🟩 GraphCapture       Pass: 100%/1   | Total: 13m 17s | Avg: 13m 17s | Max: 13m 17s
      🟩 HostLaunch         Pass: 100%/3   | Total: 52m 52s | Avg: 17m 37s | Max: 19m 09s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 08m | Avg: 22m 51s | Max: 25m 40s
    🟨 ctk
      🟨 11.1               Pass:  93%/15  | Total: 12h 05m | Avg: 48m 21s | Max:  1h 03m
      🟩 11.8               Pass: 100%/3   | Total:  3h 29m | Avg:  1h 09m | Max:  1h 12m
      🟨 12.6               Pass:  96%/86  | Total:  2d 21h | Avg: 48m 08s | Max:  1h 03m
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m
      🟨 nvcc11.1           Pass:  93%/15  | Total: 12h 05m | Avg: 48m 21s | Max:  1h 03m
      🟩 nvcc11.8           Pass: 100%/3   | Total:  3h 29m | Avg:  1h 09m | Max:  1h 12m
      🟨 nvcc12.6           Pass:  96%/84  | Total:  2d 19h | Avg: 47m 51s | Max:  1h 03m
    🟨 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  5h 00m | Avg: 50m 09s | Max: 57m 05s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 32m | Avg: 50m 58s | Max: 53m 09s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 29m | Avg: 52m 22s | Max: 58m 42s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 17m | Avg: 49m 24s | Max: 49m 41s
      🟩 Clang13            Pass: 100%/4   | Total:  3h 26m | Avg: 51m 36s | Max: 54m 53s
      🟩 Clang14            Pass: 100%/4   | Total:  3h 17m | Avg: 49m 20s | Max: 49m 28s
      🟩 Clang15            Pass: 100%/4   | Total:  3h 17m | Avg: 49m 20s | Max: 49m 54s
      🟩 Clang16            Pass: 100%/4   | Total:  3h 38m | Avg: 54m 40s | Max: 59m 33s
      🟩 Clang17            Pass: 100%/4   | Total:  3h 29m | Avg: 52m 21s | Max: 56m 42s
      🟩 Clang18            Pass: 100%/9   | Total:  7h 02m | Avg: 46m 55s | Max:  1h 01m
      🟩 GCC6               Pass: 100%/2   | Total:  1h 32m | Avg: 46m 02s | Max: 49m 06s
      🟩 GCC7               Pass: 100%/6   | Total:  5h 01m | Avg: 50m 18s | Max: 56m 46s
      🟩 GCC8               Pass: 100%/6   | Total:  4h 50m | Avg: 48m 23s | Max: 53m 30s
      🟩 GCC9               Pass: 100%/6   | Total:  5h 04m | Avg: 50m 43s | Max: 57m 13s
      🟩 GCC10              Pass: 100%/4   | Total:  3h 19m | Avg: 49m 57s | Max: 50m 23s
      🟩 GCC11              Pass: 100%/7   | Total:  6h 48m | Avg: 58m 17s | Max:  1h 12m
      🟩 GCC12              Pass: 100%/4   | Total:  3h 39m | Avg: 54m 49s | Max: 57m 59s
      🟩 GCC13              Pass: 100%/16  | Total:  8h 45m | Avg: 32m 50s | Max: 56m 43s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 51m | Avg: 57m 19s | Max:  1h 00m
      🟥 MSVC14.16          Pass:   0%/1   | Total:  1h 03m | Avg:  1h 03m | Max:  1h 03m
      🟥 MSVC14.29          Pass:   0%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 01m
      🟥 MSVC14.39          Pass:   0%/1   | Total:  1h 03m | Avg:  1h 03m | Max:  1h 03m
    🟨 std
      🟩 11                 Pass: 100%/28  | Total: 22h 33m | Avg: 48m 19s | Max:  1h 05m
      🟨 14                 Pass:  92%/27  | Total: 23h 27m | Avg: 52m 07s | Max:  1h 12m
      🟨 17                 Pass:  96%/26  | Total: 22h 17m | Avg: 51m 25s | Max:  1h 11m
      🟨 20                 Pass:  95%/23  | Total: 16h 17m | Avg: 42m 29s | Max:  1h 03m
    🟨 gpu
      🟨 v100               Pass:  96%/104 | Total:  3d 12h | Avg: 48m 47s | Max:  1h 12m
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  3h 29m | Avg:  1h 09m | Max:  1h 12m
      🟩 90a                Pass: 100%/4   | Total:  1h 31m | Avg: 22m 51s | Max: 24m 34s
    
  • 🟩 libcudacxx: Pass: 100%/104 | Total: 10h 51m | Avg: 6m 15s | Max: 27m 08s | Hits: 99%/11383

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 10h 20m | Avg:  6m 27s | Max: 27m 08s | Hits:  99%/11383 
      🟩 arm64              Pass: 100%/8   | Total: 30m 54s | Avg:  3m 51s | Max:  4m 13s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 00m | Avg:  4m 00s | Max: 19m 12s | Hits:  99%/2648  
      🟩 11.8               Pass: 100%/3   | Total: 32m 20s | Avg: 10m 46s | Max: 25m 52s
      🟩 12.6               Pass: 100%/86  | Total:  9h 18m | Avg:  6m 29s | Max: 27m 08s | Hits:  99%/8735  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 40m 41s | Avg: 20m 20s | Max: 20m 34s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 00m | Avg:  4m 00s | Max: 19m 12s | Hits:  99%/2648  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 32m 20s | Avg: 10m 46s | Max: 25m 52s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  8h 37m | Avg:  6m 09s | Max: 27m 08s | Hits:  99%/8735  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 40m 41s | Avg: 20m 20s | Max: 20m 34s
      🟩 nvcc               Pass: 100%/102 | Total: 10h 10m | Avg:  5m 59s | Max: 27m 08s | Hits:  99%/11383 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 25m 06s | Avg:  4m 11s | Max:  5m 29s
      🟩 Clang10            Pass: 100%/3   | Total: 16m 59s | Avg:  5m 39s | Max:  6m 16s
      🟩 Clang11            Pass: 100%/4   | Total: 17m 42s | Avg:  4m 25s | Max:  5m 02s
      🟩 Clang12            Pass: 100%/4   | Total: 17m 32s | Avg:  4m 23s | Max:  4m 42s
      🟩 Clang13            Pass: 100%/4   | Total: 18m 09s | Avg:  4m 32s | Max:  4m 48s
      🟩 Clang14            Pass: 100%/4   | Total: 17m 22s | Avg:  4m 20s | Max:  5m 04s
      🟩 Clang15            Pass: 100%/4   | Total: 17m 31s | Avg:  4m 22s | Max:  4m 33s
      🟩 Clang16            Pass: 100%/4   | Total: 18m 13s | Avg:  4m 33s | Max:  5m 10s
      🟩 Clang17            Pass: 100%/4   | Total: 16m 55s | Avg:  4m 13s | Max:  4m 35s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 27m | Avg: 10m 57s | Max: 27m 08s
      🟩 GCC6               Pass: 100%/2   | Total:  6m 00s | Avg:  3m 00s | Max:  3m 15s
      🟩 GCC7               Pass: 100%/6   | Total: 20m 43s | Avg:  3m 27s | Max:  4m 24s
      🟩 GCC8               Pass: 100%/6   | Total: 19m 23s | Avg:  3m 13s | Max:  3m 55s
      🟩 GCC9               Pass: 100%/6   | Total: 20m 59s | Avg:  3m 29s | Max:  4m 24s
      🟩 GCC10              Pass: 100%/4   | Total: 15m 48s | Avg:  3m 57s | Max:  4m 10s
      🟩 GCC11              Pass: 100%/7   | Total: 49m 18s | Avg:  7m 02s | Max: 25m 52s
      🟩 GCC12              Pass: 100%/4   | Total: 15m 56s | Avg:  3m 59s | Max:  4m 53s
      🟩 GCC13              Pass: 100%/17  | Total:  2h 49m | Avg:  9m 56s | Max: 26m 55s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 17m 22s | Avg:  5m 47s | Max:  5m 57s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 19m 12s | Avg: 19m 12s | Max: 19m 12s | Hits:  99%/2648  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 28m 43s | Avg: 14m 21s | Max: 15m 04s | Hits:  99%/5658  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 15m 18s | Avg: 15m 18s | Max: 15m 18s | Hits:  99%/3077  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/45  | Total:  4h 13m | Avg:  5m 37s | Max: 27m 08s
      🟩 GCC                Pass: 100%/52  | Total:  5h 17m | Avg:  6m 06s | Max: 26m 55s
      🟩 Intel              Pass: 100%/3   | Total: 17m 22s | Avg:  5m 47s | Max:  5m 57s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 03m | Avg: 15m 48s | Max: 19m 12s | Hits:  99%/11383 
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 10h 51m | Avg:  6m 15s | Max: 27m 08s | Hits:  99%/11383 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  8h 14m | Avg:  5m 08s | Max: 25m 52s | Hits:  99%/11383 
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 23m | Avg: 20m 51s | Max: 26m 55s
      🟩 Test               Pass: 100%/3   | Total:  1h 11m | Avg: 23m 40s | Max: 27m 08s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 35s | Avg:  2m 35s | Max:  2m 35s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 32m 20s | Avg: 10m 46s | Max: 25m 52s
      🟩 90a                Pass: 100%/4   | Total: 15m 32s | Avg:  3m 53s | Max:  4m 24s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 08m | Avg:  4m 36s | Max: 17m 02s
      🟩 14                 Pass: 100%/28  | Total:  2h 56m | Avg:  6m 18s | Max: 25m 52s | Hits:  99%/5397  
      🟩 17                 Pass: 100%/27  | Total:  2h 37m | Avg:  5m 50s | Max: 21m 12s | Hits:  99%/2909  
      🟩 20                 Pass: 100%/20  | Total:  3h 05m | Avg:  9m 16s | Max: 27m 08s | Hits:  99%/3077  
    
  • 🟩 thrust: Pass: 100%/103 | Total: 2d 04h | Avg: 30m 42s | Max: 1h 03m | Hits: 72%/11150

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total:  2d 00h | Avg: 30m 44s | Max:  1h 03m | Hits:  72%/11150 
      🟩 arm64              Pass: 100%/8   | Total:  4h 01m | Avg: 30m 12s | Max: 33m 35s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  7h 47m | Avg: 31m 09s | Max: 56m 23s | Hits:  65%/2230  
      🟩 11.8               Pass: 100%/3   | Total:  1h 56m | Avg: 38m 50s | Max: 42m 43s
      🟩 12.6               Pass: 100%/85  | Total:  1d 18h | Avg: 30m 20s | Max:  1h 03m | Hits:  74%/8920  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 50m 18s | Avg: 25m 09s | Max: 25m 30s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  7h 47m | Avg: 31m 09s | Max: 56m 23s | Hits:  65%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 56m | Avg: 38m 50s | Max: 42m 43s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  1d 18h | Avg: 30m 27s | Max:  1h 03m | Hits:  74%/8920  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 50m 18s | Avg: 25m 09s | Max: 25m 30s
      🟩 nvcc               Pass: 100%/101 | Total:  2d 03h | Avg: 30m 48s | Max:  1h 03m | Hits:  72%/11150 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  2h 56m | Avg: 29m 25s | Max: 34m 08s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 38m | Avg: 32m 56s | Max: 38m 01s
      🟩 Clang11            Pass: 100%/4   | Total:  1h 59m | Avg: 29m 49s | Max: 31m 55s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 03m | Avg: 30m 57s | Max: 34m 39s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 59m | Avg: 29m 46s | Max: 31m 48s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 58m | Avg: 29m 41s | Max: 31m 44s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 05m | Avg: 31m 16s | Max: 35m 04s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 00m | Avg: 30m 01s | Max: 32m 43s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 02m | Avg: 30m 41s | Max: 35m 18s
      🟩 Clang18            Pass: 100%/9   | Total:  3h 37m | Avg: 24m 11s | Max: 32m 59s
      🟩 GCC6               Pass: 100%/2   | Total: 57m 18s | Avg: 28m 39s | Max: 29m 54s
      🟩 GCC7               Pass: 100%/6   | Total:  3h 00m | Avg: 30m 08s | Max: 33m 04s
      🟩 GCC8               Pass: 100%/6   | Total:  3h 03m | Avg: 30m 39s | Max: 35m 25s
      🟩 GCC9               Pass: 100%/6   | Total:  3h 04m | Avg: 30m 48s | Max: 37m 16s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 09m | Avg: 32m 19s | Max: 35m 30s
      🟩 GCC11              Pass: 100%/7   | Total:  4h 13m | Avg: 36m 11s | Max: 42m 43s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 26m | Avg: 36m 43s | Max: 41m 31s
      🟩 GCC13              Pass: 100%/14  | Total:  5h 06m | Avg: 21m 52s | Max: 40m 19s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 00m | Avg: 40m 03s | Max: 42m 44s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 56m 23s | Avg: 56m 23s | Max: 56m 23s | Hits:  65%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 58m | Avg: 59m 03s | Max:  1h 03m | Hits:  65%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 23m | Avg: 41m 31s | Max:  1h 00m | Hits:  82%/4460  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total: 22h 21m | Avg: 29m 10s | Max: 38m 01s
      🟩 GCC                Pass: 100%/49  | Total:  1d 00h | Avg: 29m 26s | Max: 42m 43s
      🟩 Intel              Pass: 100%/3   | Total:  2h 00m | Avg: 40m 03s | Max: 42m 44s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 17m | Avg: 51m 30s | Max:  1h 03m | Hits:  72%/11150 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total:  2d 04h | Avg: 30m 42s | Max:  1h 03m | Hits:  72%/11150 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  2d 03h | Avg: 32m 06s | Max:  1h 03m | Hits:  65%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 44m 49s | Avg: 11m 12s | Max: 23m 01s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total: 34m 51s | Avg: 11m 37s | Max: 12m 29s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 56m | Avg: 38m 50s | Max: 42m 43s
      🟩 90a                Pass: 100%/4   | Total:  1h 14m | Avg: 18m 41s | Max: 19m 21s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 12h 00m | Avg: 25m 44s | Max: 35m 34s
      🟩 14                 Pass: 100%/27  | Total: 15h 14m | Avg: 33m 53s | Max: 56m 23s | Hits:  65%/4460  
      🟩 17                 Pass: 100%/26  | Total: 14h 40m | Avg: 33m 51s | Max:  1h 03m | Hits:  65%/2230  
      🟩 20                 Pass: 100%/22  | Total: 10h 46m | Avg: 29m 23s | Max:  1h 00m | Hits:  82%/4460  
    
  • 🟩 cudax: Pass: 100%/52 | Total: 2h 42m | Avg: 3m 07s | Max: 10m 40s | Hits: 89%/222

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  2h 32m | Avg:  3m 10s | Max: 10m 40s | Hits:  89%/222   
      🟩 arm64              Pass: 100%/4   | Total: 10m 15s | Avg:  2m 33s | Max:  2m 41s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  1h 00m | Avg:  3m 12s | Max: 10m 40s | Hits:  89%/111   
      🟩 12.6               Pass: 100%/33  | Total:  1h 41m | Avg:  3m 04s | Max: 10m 12s | Hits:  89%/111   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  1h 00m | Avg:  3m 12s | Max: 10m 40s | Hits:  89%/111   
      🟩 nvcc12.6           Pass: 100%/33  | Total:  1h 41m | Avg:  3m 04s | Max: 10m 12s | Hits:  89%/111   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/52  | Total:  2h 42m | Avg:  3m 07s | Max: 10m 40s | Hits:  89%/222   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  5m 29s | Avg:  2m 44s | Max:  2m 45s
      🟩 Clang10            Pass: 100%/2   | Total:  5m 26s | Avg:  2m 43s | Max:  2m 45s
      🟩 Clang11            Pass: 100%/4   | Total: 10m 43s | Avg:  2m 40s | Max:  2m 49s
      🟩 Clang12            Pass: 100%/4   | Total: 11m 04s | Avg:  2m 46s | Max:  2m 53s
      🟩 Clang13            Pass: 100%/4   | Total: 10m 41s | Avg:  2m 40s | Max:  2m 44s
      🟩 Clang14            Pass: 100%/4   | Total: 12m 07s | Avg:  3m 01s | Max:  4m 09s
      🟩 Clang15            Pass: 100%/2   | Total:  5m 48s | Avg:  2m 54s | Max:  2m 57s
      🟩 Clang16            Pass: 100%/4   | Total: 11m 08s | Avg:  2m 47s | Max:  3m 21s
      🟩 Clang17            Pass: 100%/2   | Total:  5m 50s | Avg:  2m 55s | Max:  3m 04s
      🟩 Clang18            Pass: 100%/2   | Total:  6m 50s | Avg:  3m 25s | Max:  4m 11s
      🟩 GCC9               Pass: 100%/2   | Total:  5m 26s | Avg:  2m 43s | Max:  2m 52s
      🟩 GCC10              Pass: 100%/4   | Total: 10m 16s | Avg:  2m 34s | Max:  2m 40s
      🟩 GCC11              Pass: 100%/4   | Total: 10m 08s | Avg:  2m 32s | Max:  2m 45s
      🟩 GCC12              Pass: 100%/7   | Total: 22m 50s | Avg:  3m 15s | Max:  4m 08s
      🟩 GCC13              Pass: 100%/3   | Total:  7m 45s | Avg:  2m 35s | Max:  2m 41s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 40s | Avg: 10m 40s | Max: 10m 40s | Hits:  89%/111   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 12s | Avg: 10m 12s | Max: 10m 12s | Hits:  89%/111   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 25m | Avg:  2m 50s | Max:  4m 11s
      🟩 GCC                Pass: 100%/20  | Total: 56m 25s | Avg:  2m 49s | Max:  4m 08s
      🟩 MSVC               Pass: 100%/2   | Total: 20m 52s | Avg: 10m 26s | Max: 10m 40s | Hits:  89%/222   
    🟩 gpu
      🟩 v100               Pass: 100%/52  | Total:  2h 42m | Avg:  3m 07s | Max: 10m 40s | Hits:  89%/222   
    🟩 jobs
      🟩 Build              Pass: 100%/47  | Total:  2h 22m | Avg:  3m 01s | Max: 10m 40s | Hits:  89%/222   
      🟩 Test               Pass: 100%/5   | Total: 19m 53s | Avg:  3m 58s | Max:  4m 11s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 25s | Avg:  2m 25s | Max:  2m 25s
      🟩 90a                Pass: 100%/1   | Total:  2m 35s | Avg:  2m 35s | Max:  2m 35s
    🟩 std
      🟩 17                 Pass: 100%/28  | Total:  1h 18m | Avg:  2m 48s | Max:  4m 08s
      🟩 20                 Pass: 100%/24  | Total:  1h 23m | Avg:  3m 29s | Max: 10m 40s | Hits:  89%/222   
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 15m 09s | Avg: 15m 09s | Max: 15m 09s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 15m 09s | Avg: 15m 09s | Max: 15m 09s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 15m 09s | Avg: 15m 09s | Max: 15m 09s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 15m 09s | Avg: 15m 09s | Max: 15m 09s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 15m 09s | Avg: 15m 09s | Max: 15m 09s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 15m 09s | Avg: 15m 09s | Max: 15m 09s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 15m 09s | Avg: 15m 09s | Max: 15m 09s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 15m 09s | Avg: 15m 09s | Max: 15m 09s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 15m 09s | Avg: 15m 09s | Max: 15m 09s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 364)

# Runner
297 linux-amd64-cpu16
28 linux-arm64-cpu16
24 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

Copy link
Contributor

🟩 CI finished in 50m 43s: Pass: 100%/364 | Total: 1d 12h | Avg: 5m 57s | Max: 37m 56s | Hits: 99%/25679
  • 🟩 cub: Pass: 100%/104 | Total: 11h 42m | Avg: 6m 45s | Max: 37m 56s | Hits: 98%/2924

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 11h 06m | Avg:  6m 56s | Max: 37m 56s | Hits:  98%/2924  
      🟩 arm64              Pass: 100%/8   | Total: 35m 58s | Avg:  4m 29s | Max:  4m 52s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 14m | Avg:  4m 59s | Max: 16m 20s | Hits:  98%/731   
      🟩 11.8               Pass: 100%/3   | Total: 15m 38s | Avg:  5m 12s | Max:  5m 25s
      🟩 12.6               Pass: 100%/86  | Total: 10h 12m | Avg:  7m 07s | Max: 37m 56s | Hits:  98%/2193  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  7m 54s | Avg:  3m 57s | Max:  4m 01s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 14m | Avg:  4m 59s | Max: 16m 20s | Hits:  98%/731   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 15m 38s | Avg:  5m 12s | Max:  5m 25s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 10h 04m | Avg:  7m 11s | Max: 37m 56s | Hits:  98%/2193  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  7m 54s | Avg:  3m 57s | Max:  4m 01s
      🟩 nvcc               Pass: 100%/102 | Total: 11h 34m | Avg:  6m 48s | Max: 37m 56s | Hits:  98%/2924  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 30m 06s | Avg:  5m 01s | Max:  5m 56s
      🟩 Clang10            Pass: 100%/3   | Total: 16m 33s | Avg:  5m 31s | Max:  5m 48s
      🟩 Clang11            Pass: 100%/4   | Total: 19m 42s | Avg:  4m 55s | Max:  5m 21s
      🟩 Clang12            Pass: 100%/4   | Total: 19m 32s | Avg:  4m 53s | Max:  5m 11s
      🟩 Clang13            Pass: 100%/4   | Total: 19m 48s | Avg:  4m 57s | Max:  5m 21s
      🟩 Clang14            Pass: 100%/4   | Total: 19m 10s | Avg:  4m 47s | Max:  4m 55s
      🟩 Clang15            Pass: 100%/4   | Total: 20m 18s | Avg:  5m 04s | Max:  5m 36s
      🟩 Clang16            Pass: 100%/4   | Total: 19m 37s | Avg:  4m 54s | Max:  5m 13s
      🟩 Clang17            Pass: 100%/4   | Total: 19m 36s | Avg:  4m 54s | Max:  5m 08s
      🟩 Clang18            Pass: 100%/9   | Total:  1h 34m | Avg: 10m 27s | Max: 37m 56s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 14s | Avg:  4m 07s | Max:  4m 10s
      🟩 GCC7               Pass: 100%/6   | Total: 26m 51s | Avg:  4m 28s | Max:  4m 41s
      🟩 GCC8               Pass: 100%/6   | Total: 26m 09s | Avg:  4m 21s | Max:  4m 47s
      🟩 GCC9               Pass: 100%/6   | Total: 25m 38s | Avg:  4m 16s | Max:  4m 42s
      🟩 GCC10              Pass: 100%/4   | Total: 19m 15s | Avg:  4m 48s | Max:  4m 59s
      🟩 GCC11              Pass: 100%/7   | Total: 34m 47s | Avg:  4m 58s | Max:  5m 25s
      🟩 GCC12              Pass: 100%/4   | Total: 19m 32s | Avg:  4m 53s | Max:  5m 00s
      🟩 GCC13              Pass: 100%/16  | Total:  3h 09m | Avg: 11m 49s | Max: 33m 29s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 17m 29s | Avg:  5m 49s | Max:  6m 11s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 16m 20s | Avg: 16m 20s | Max: 16m 20s | Hits:  98%/731   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 26m 18s | Avg: 13m 09s | Max: 13m 09s | Hits:  98%/1462  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 14m 26s | Avg: 14m 26s | Max: 14m 26s | Hits:  98%/731   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  4h 38m | Avg:  6m 03s | Max: 37m 56s
      🟩 GCC                Pass: 100%/51  | Total:  5h 49m | Avg:  6m 51s | Max: 33m 29s
      🟩 Intel              Pass: 100%/3   | Total: 17m 29s | Avg:  5m 49s | Max:  6m 11s
      🟩 MSVC               Pass: 100%/4   | Total: 57m 04s | Avg: 14m 16s | Max: 16m 20s | Hits:  98%/2924  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 11h 42m | Avg:  6m 45s | Max: 37m 56s | Hits:  98%/2924  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  8h 12m | Avg:  5m 08s | Max: 16m 20s | Hits:  98%/2924  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 24m 34s | Avg: 24m 34s | Max: 24m 34s
      🟩 GraphCapture       Pass: 100%/1   | Total: 24m 10s | Avg: 24m 10s | Max: 24m 10s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 01m | Avg: 20m 36s | Max: 25m 26s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 39m | Avg: 33m 05s | Max: 37m 56s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 15m 38s | Avg:  5m 12s | Max:  5m 25s
      🟩 90a                Pass: 100%/4   | Total: 14m 36s | Avg:  3m 39s | Max:  3m 48s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 51m | Avg:  6m 07s | Max: 33m 29s
      🟩 14                 Pass: 100%/27  | Total:  2h 28m | Avg:  5m 29s | Max: 16m 20s | Hits:  98%/1462  
      🟩 17                 Pass: 100%/26  | Total:  2h 12m | Avg:  5m 05s | Max: 13m 09s | Hits:  98%/731   
      🟩 20                 Pass: 100%/23  | Total:  4h 10m | Avg: 10m 53s | Max: 37m 56s | Hits:  98%/731   
    
  • 🟩 libcudacxx: Pass: 100%/104 | Total: 10h 48m | Avg: 6m 13s | Max: 28m 12s | Hits: 99%/11383

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 10h 18m | Avg:  6m 26s | Max: 28m 12s | Hits:  99%/11383 
      🟩 arm64              Pass: 100%/8   | Total: 29m 44s | Avg:  3m 43s | Max:  4m 12s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 02m | Avg:  4m 11s | Max: 21m 14s | Hits:  99%/2648  
      🟩 11.8               Pass: 100%/3   | Total: 25m 28s | Avg:  8m 29s | Max: 18m 24s
      🟩 12.6               Pass: 100%/86  | Total:  9h 19m | Avg:  6m 30s | Max: 28m 12s | Hits:  99%/8735  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 36m 21s | Avg: 18m 10s | Max: 19m 01s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 02m | Avg:  4m 11s | Max: 21m 14s | Hits:  99%/2648  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 25m 28s | Avg:  8m 29s | Max: 18m 24s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  8h 43m | Avg:  6m 13s | Max: 28m 12s | Hits:  99%/8735  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 36m 21s | Avg: 18m 10s | Max: 19m 01s
      🟩 nvcc               Pass: 100%/102 | Total: 10h 11m | Avg:  5m 59s | Max: 28m 12s | Hits:  99%/11383 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 26m 47s | Avg:  4m 27s | Max:  6m 19s
      🟩 Clang10            Pass: 100%/3   | Total: 16m 18s | Avg:  5m 26s | Max:  5m 42s
      🟩 Clang11            Pass: 100%/4   | Total: 17m 52s | Avg:  4m 28s | Max:  4m 48s
      🟩 Clang12            Pass: 100%/4   | Total: 17m 04s | Avg:  4m 16s | Max:  4m 24s
      🟩 Clang13            Pass: 100%/4   | Total: 17m 24s | Avg:  4m 21s | Max:  4m 46s
      🟩 Clang14            Pass: 100%/4   | Total: 16m 48s | Avg:  4m 12s | Max:  4m 28s
      🟩 Clang15            Pass: 100%/4   | Total: 17m 49s | Avg:  4m 27s | Max:  4m 37s
      🟩 Clang16            Pass: 100%/4   | Total: 17m 22s | Avg:  4m 20s | Max:  4m 28s
      🟩 Clang17            Pass: 100%/4   | Total: 17m 12s | Avg:  4m 18s | Max:  4m 28s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 17m | Avg:  9m 41s | Max: 21m 18s
      🟩 GCC6               Pass: 100%/2   | Total:  5m 25s | Avg:  2m 42s | Max:  2m 48s
      🟩 GCC7               Pass: 100%/6   | Total: 20m 35s | Avg:  3m 25s | Max:  4m 32s
      🟩 GCC8               Pass: 100%/6   | Total: 20m 22s | Avg:  3m 23s | Max:  4m 08s
      🟩 GCC9               Pass: 100%/6   | Total: 20m 39s | Avg:  3m 26s | Max:  4m 13s
      🟩 GCC10              Pass: 100%/4   | Total: 15m 59s | Avg:  3m 59s | Max:  4m 24s
      🟩 GCC11              Pass: 100%/7   | Total: 42m 02s | Avg:  6m 00s | Max: 18m 24s
      🟩 GCC12              Pass: 100%/4   | Total: 16m 48s | Avg:  4m 12s | Max:  4m 36s
      🟩 GCC13              Pass: 100%/17  | Total:  2h 59m | Avg: 10m 33s | Max: 28m 12s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 18m 32s | Avg:  6m 10s | Max:  6m 16s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 21m 14s | Avg: 21m 14s | Max: 21m 14s | Hits:  99%/2648  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 30m 03s | Avg: 15m 01s | Max: 15m 59s | Hits:  99%/5658  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 14m 53s | Avg: 14m 53s | Max: 14m 53s | Hits:  99%/3077  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/45  | Total:  4h 02m | Avg:  5m 22s | Max: 21m 18s
      🟩 GCC                Pass: 100%/52  | Total:  5h 21m | Avg:  6m 10s | Max: 28m 12s
      🟩 Intel              Pass: 100%/3   | Total: 18m 32s | Avg:  6m 10s | Max:  6m 16s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 06m | Avg: 16m 32s | Max: 21m 14s | Hits:  99%/11383 
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 10h 48m | Avg:  6m 13s | Max: 28m 12s | Hits:  99%/11383 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  8h 05m | Avg:  5m 03s | Max: 21m 14s | Hits:  99%/11383 
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 40m | Avg: 25m 11s | Max: 28m 12s
      🟩 Test               Pass: 100%/3   | Total: 59m 11s | Avg: 19m 43s | Max: 22m 06s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 28s | Avg:  2m 28s | Max:  2m 28s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 25m 28s | Avg:  8m 29s | Max: 18m 24s
      🟩 90a                Pass: 100%/4   | Total: 16m 19s | Avg:  4m 04s | Max:  4m 25s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 13m | Avg:  4m 47s | Max: 21m 03s
      🟩 14                 Pass: 100%/28  | Total:  2h 41m | Avg:  5m 46s | Max: 23m 36s | Hits:  99%/5397  
      🟩 17                 Pass: 100%/27  | Total:  2h 58m | Avg:  6m 36s | Max: 28m 12s | Hits:  99%/2909  
      🟩 20                 Pass: 100%/20  | Total:  2h 51m | Avg:  8m 33s | Max: 27m 56s | Hits:  99%/3077  
    
  • 🟩 thrust: Pass: 100%/103 | Total: 10h 44m | Avg: 6m 15s | Max: 26m 22s | Hits: 99%/11150

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total: 10h 07m | Avg:  6m 23s | Max: 26m 22s | Hits:  99%/11150 
      🟩 arm64              Pass: 100%/8   | Total: 37m 07s | Avg:  4m 38s | Max:  5m 13s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 20m | Avg:  5m 22s | Max: 19m 05s | Hits:  99%/2230  
      🟩 11.8               Pass: 100%/3   | Total: 15m 29s | Avg:  5m 09s | Max:  5m 32s
      🟩 12.6               Pass: 100%/85  | Total:  9h 08m | Avg:  6m 27s | Max: 26m 22s | Hits:  99%/8920  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  5m 05s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 20m | Avg:  5m 22s | Max: 19m 05s | Hits:  99%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 15m 29s | Avg:  5m 09s | Max:  5m 32s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  8h 59m | Avg:  6m 29s | Max: 26m 22s | Hits:  99%/8920  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  5m 05s
      🟩 nvcc               Pass: 100%/101 | Total: 10h 35m | Avg:  6m 17s | Max: 26m 22s | Hits:  99%/11150 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 32m 25s | Avg:  5m 24s | Max:  6m 26s
      🟩 Clang10            Pass: 100%/3   | Total: 18m 28s | Avg:  6m 09s | Max:  6m 36s
      🟩 Clang11            Pass: 100%/4   | Total: 19m 47s | Avg:  4m 56s | Max:  5m 24s
      🟩 Clang12            Pass: 100%/4   | Total: 20m 24s | Avg:  5m 06s | Max:  5m 17s
      🟩 Clang13            Pass: 100%/4   | Total: 20m 33s | Avg:  5m 08s | Max:  5m 33s
      🟩 Clang14            Pass: 100%/4   | Total: 20m 01s | Avg:  5m 00s | Max:  5m 08s
      🟩 Clang15            Pass: 100%/4   | Total: 20m 27s | Avg:  5m 06s | Max:  5m 51s
      🟩 Clang16            Pass: 100%/4   | Total: 20m 33s | Avg:  5m 08s | Max:  5m 25s
      🟩 Clang17            Pass: 100%/4   | Total: 20m 11s | Avg:  5m 02s | Max:  5m 43s
      🟩 Clang18            Pass: 100%/9   | Total: 54m 06s | Avg:  6m 00s | Max: 15m 17s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 38s | Avg:  4m 19s | Max:  4m 28s
      🟩 GCC7               Pass: 100%/6   | Total: 27m 19s | Avg:  4m 33s | Max:  5m 02s
      🟩 GCC8               Pass: 100%/6   | Total: 27m 59s | Avg:  4m 39s | Max:  5m 22s
      🟩 GCC9               Pass: 100%/6   | Total: 27m 25s | Avg:  4m 34s | Max:  5m 18s
      🟩 GCC10              Pass: 100%/4   | Total: 20m 51s | Avg:  5m 12s | Max:  5m 27s
      🟩 GCC11              Pass: 100%/7   | Total: 36m 07s | Avg:  5m 09s | Max:  5m 33s
      🟩 GCC12              Pass: 100%/4   | Total: 22m 19s | Avg:  5m 34s | Max:  6m 18s
      🟩 GCC13              Pass: 100%/14  | Total:  1h 49m | Avg:  7m 51s | Max: 25m 28s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 19m 47s | Avg:  6m 35s | Max:  6m 45s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 19m 05s | Avg: 19m 05s | Max: 19m 05s | Hits:  99%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 34m 21s | Avg: 17m 10s | Max: 17m 33s | Hits:  99%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 44m 16s | Avg: 22m 08s | Max: 26m 22s | Hits:  99%/4460  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  4h 06m | Avg:  5m 22s | Max: 15m 17s
      🟩 GCC                Pass: 100%/49  | Total:  4h 40m | Avg:  5m 43s | Max: 25m 28s
      🟩 Intel              Pass: 100%/3   | Total: 19m 47s | Avg:  6m 35s | Max:  6m 45s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 37m | Avg: 19m 32s | Max: 26m 22s | Hits:  99%/11150 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total: 10h 44m | Avg:  6m 15s | Max: 26m 22s | Hits:  99%/11150 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  8h 54m | Avg:  5m 33s | Max: 19m 05s | Hits:  99%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 48m 49s | Avg: 12m 12s | Max: 26m 22s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 01m | Avg: 20m 39s | Max: 25m 28s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 15m 29s | Avg:  5m 09s | Max:  5m 32s
      🟩 90a                Pass: 100%/4   | Total: 17m 29s | Avg:  4m 22s | Max:  4m 56s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 32m | Avg:  5m 26s | Max: 21m 13s
      🟩 14                 Pass: 100%/27  | Total:  2h 41m | Avg:  5m 58s | Max: 19m 05s | Hits:  99%/4460  
      🟩 17                 Pass: 100%/26  | Total:  2h 28m | Avg:  5m 41s | Max: 16m 48s | Hits:  99%/2230  
      🟩 20                 Pass: 100%/22  | Total:  3h 02m | Avg:  8m 18s | Max: 26m 22s | Hits:  99%/4460  
    
  • 🟩 cudax: Pass: 100%/52 | Total: 2h 38m | Avg: 3m 03s | Max: 10m 11s | Hits: 90%/222

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  2h 30m | Avg:  3m 08s | Max: 10m 11s | Hits:  90%/222   
      🟩 arm64              Pass: 100%/4   | Total:  8m 11s | Avg:  2m 02s | Max:  2m 20s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total: 55m 38s | Avg:  2m 55s | Max: 10m 11s | Hits:  90%/111   
      🟩 12.6               Pass: 100%/33  | Total:  1h 43m | Avg:  3m 07s | Max:  9m 56s | Hits:  90%/111   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total: 55m 38s | Avg:  2m 55s | Max: 10m 11s | Hits:  90%/111   
      🟩 nvcc12.6           Pass: 100%/33  | Total:  1h 43m | Avg:  3m 07s | Max:  9m 56s | Hits:  90%/111   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/52  | Total:  2h 38m | Avg:  3m 03s | Max: 10m 11s | Hits:  90%/222   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  4m 38s | Avg:  2m 19s | Max:  2m 26s
      🟩 Clang10            Pass: 100%/2   | Total:  4m 32s | Avg:  2m 16s | Max:  2m 22s
      🟩 Clang11            Pass: 100%/4   | Total: 10m 21s | Avg:  2m 35s | Max:  2m 50s
      🟩 Clang12            Pass: 100%/4   | Total:  9m 06s | Avg:  2m 16s | Max:  2m 36s
      🟩 Clang13            Pass: 100%/4   | Total:  9m 14s | Avg:  2m 18s | Max:  2m 29s
      🟩 Clang14            Pass: 100%/4   | Total: 12m 39s | Avg:  3m 09s | Max:  5m 05s
      🟩 Clang15            Pass: 100%/2   | Total:  5m 34s | Avg:  2m 47s | Max:  3m 02s
      🟩 Clang16            Pass: 100%/4   | Total: 10m 10s | Avg:  2m 32s | Max:  3m 00s
      🟩 Clang17            Pass: 100%/2   | Total:  5m 06s | Avg:  2m 33s | Max:  2m 50s
      🟩 Clang18            Pass: 100%/2   | Total: 12m 11s | Avg:  6m 05s | Max:  9m 56s
      🟩 GCC9               Pass: 100%/2   | Total:  4m 17s | Avg:  2m 08s | Max:  2m 11s
      🟩 GCC10              Pass: 100%/4   | Total:  8m 45s | Avg:  2m 11s | Max:  2m 23s
      🟩 GCC11              Pass: 100%/4   | Total:  8m 59s | Avg:  2m 14s | Max:  2m 49s
      🟩 GCC12              Pass: 100%/7   | Total: 26m 34s | Avg:  3m 47s | Max:  6m 09s
      🟩 GCC13              Pass: 100%/3   | Total:  6m 30s | Avg:  2m 10s | Max:  2m 49s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 11s | Avg: 10m 11s | Max: 10m 11s | Hits:  90%/111   
      🟩 MSVC14.39          Pass: 100%/1   | Total:  9m 56s | Avg:  9m 56s | Max:  9m 56s | Hits:  90%/111   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 23m | Avg:  2m 47s | Max:  9m 56s
      🟩 GCC                Pass: 100%/20  | Total: 55m 05s | Avg:  2m 45s | Max:  6m 09s
      🟩 MSVC               Pass: 100%/2   | Total: 20m 07s | Avg: 10m 03s | Max: 10m 11s | Hits:  90%/222   
    🟩 gpu
      🟩 v100               Pass: 100%/52  | Total:  2h 38m | Avg:  3m 03s | Max: 10m 11s | Hits:  90%/222   
    🟩 jobs
      🟩 Build              Pass: 100%/47  | Total:  2h 05m | Avg:  2m 40s | Max: 10m 11s | Hits:  90%/222   
      🟩 Test               Pass: 100%/5   | Total: 33m 23s | Avg:  6m 40s | Max:  9m 56s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  1m 59s | Avg:  1m 59s | Max:  1m 59s
      🟩 90a                Pass: 100%/1   | Total:  2m 49s | Avg:  2m 49s | Max:  2m 49s
    🟩 std
      🟩 17                 Pass: 100%/28  | Total:  1h 12m | Avg:  2m 36s | Max:  6m 09s
      🟩 20                 Pass: 100%/24  | Total:  1h 25m | Avg:  3m 34s | Max: 10m 11s | Hits:  90%/222   
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 16m 59s | Avg: 16m 59s | Max: 16m 59s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 16m 59s | Avg: 16m 59s | Max: 16m 59s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 16m 59s | Avg: 16m 59s | Max: 16m 59s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 16m 59s | Avg: 16m 59s | Max: 16m 59s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 16m 59s | Avg: 16m 59s | Max: 16m 59s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 16m 59s | Avg: 16m 59s | Max: 16m 59s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 16m 59s | Avg: 16m 59s | Max: 16m 59s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 16m 59s | Avg: 16m 59s | Max: 16m 59s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 16m 59s | Avg: 16m 59s | Max: 16m 59s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 364)

# Runner
297 linux-amd64-cpu16
28 linux-arm64-cpu16
24 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

@fbusato fbusato closed this Oct 3, 2024
@fbusato fbusato deleted the thread-reduce-optimization branch October 3, 2024 22:22
@fbusato fbusato restored the thread-reduce-optimization branch October 14, 2024 22:27
@fbusato fbusato reopened this Oct 14, 2024
Copy link
Contributor

🟩 CI finished in 1h 29m: Pass: 100%/364 | Total: 1d 12h | Avg: 6m 04s | Max: 54m 37s | Hits: 94%/25679
  • 🟩 cub: Pass: 100%/104 | Total: 12h 12m | Avg: 7m 02s | Max: 54m 37s | Hits: 99%/2924

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 11h 37m | Avg:  7m 15s | Max: 54m 37s | Hits:  99%/2924  
      🟩 arm64              Pass: 100%/8   | Total: 35m 27s | Avg:  4m 25s | Max:  4m 49s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 10m | Avg:  4m 42s | Max: 16m 00s | Hits:  99%/731   
      🟩 11.8               Pass: 100%/3   | Total: 14m 55s | Avg:  4m 58s | Max:  5m 23s
      🟩 12.6               Pass: 100%/86  | Total: 10h 47m | Avg:  7m 31s | Max: 54m 37s | Hits:  99%/2193  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  7m 57s | Avg:  3m 58s | Max:  4m 11s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 10m | Avg:  4m 42s | Max: 16m 00s | Hits:  99%/731   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 14m 55s | Avg:  4m 58s | Max:  5m 23s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 10h 39m | Avg:  7m 36s | Max: 54m 37s | Hits:  99%/2193  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  7m 57s | Avg:  3m 58s | Max:  4m 11s
      🟩 nvcc               Pass: 100%/102 | Total: 12h 04m | Avg:  7m 06s | Max: 54m 37s | Hits:  99%/2924  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 29m 15s | Avg:  4m 52s | Max:  6m 06s
      🟩 Clang10            Pass: 100%/3   | Total: 16m 54s | Avg:  5m 38s | Max:  5m 44s
      🟩 Clang11            Pass: 100%/4   | Total: 19m 14s | Avg:  4m 48s | Max:  5m 23s
      🟩 Clang12            Pass: 100%/4   | Total: 19m 20s | Avg:  4m 50s | Max:  5m 13s
      🟩 Clang13            Pass: 100%/4   | Total: 19m 39s | Avg:  4m 54s | Max:  5m 13s
      🟩 Clang14            Pass: 100%/4   | Total: 19m 49s | Avg:  4m 57s | Max:  5m 02s
      🟩 Clang15            Pass: 100%/4   | Total: 19m 09s | Avg:  4m 47s | Max:  5m 12s
      🟩 Clang16            Pass: 100%/4   | Total: 20m 06s | Avg:  5m 01s | Max:  5m 21s
      🟩 Clang17            Pass: 100%/4   | Total: 20m 33s | Avg:  5m 08s | Max:  5m 23s
      🟩 Clang18            Pass: 100%/9   | Total:  1h 41m | Avg: 11m 16s | Max: 54m 37s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 00s | Avg:  4m 00s | Max:  4m 03s
      🟩 GCC7               Pass: 100%/6   | Total: 24m 45s | Avg:  4m 07s | Max:  5m 07s
      🟩 GCC8               Pass: 100%/6   | Total: 25m 11s | Avg:  4m 11s | Max:  4m 47s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 11m | Avg: 11m 52s | Max: 49m 38s
      🟩 GCC10              Pass: 100%/4   | Total: 18m 06s | Avg:  4m 31s | Max:  4m 46s
      🟩 GCC11              Pass: 100%/7   | Total: 33m 23s | Avg:  4m 46s | Max:  5m 23s
      🟩 GCC12              Pass: 100%/4   | Total: 19m 45s | Avg:  4m 56s | Max:  5m 15s
      🟩 GCC13              Pass: 100%/16  | Total:  2h 50m | Avg: 10m 40s | Max: 31m 52s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 16m 10s | Avg:  5m 23s | Max:  5m 32s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 16m 00s | Avg: 16m 00s | Max: 16m 00s | Hits:  99%/731   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 29m 30s | Avg: 14m 45s | Max: 16m 11s | Hits:  99%/1462  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 14m 21s | Avg: 14m 21s | Max: 14m 21s | Hits:  99%/731   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  4h 45m | Avg:  6m 12s | Max: 54m 37s
      🟩 GCC                Pass: 100%/51  | Total:  6h 11m | Avg:  7m 16s | Max: 49m 38s
      🟩 Intel              Pass: 100%/3   | Total: 16m 10s | Avg:  5m 23s | Max:  5m 32s
      🟩 MSVC               Pass: 100%/4   | Total: 59m 51s | Avg: 14m 57s | Max: 16m 11s | Hits:  99%/2924  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 12h 12m | Avg:  7m 02s | Max: 54m 37s | Hits:  99%/2924  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  8h 53m | Avg:  5m 33s | Max: 49m 38s | Hits:  99%/2924  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 16m 54s | Avg: 16m 54s | Max: 16m 54s
      🟩 GraphCapture       Pass: 100%/1   | Total: 15m 11s | Avg: 15m 11s | Max: 15m 11s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 03m | Avg: 21m 19s | Max: 31m 52s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 43m | Avg: 34m 21s | Max: 54m 37s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 14m 55s | Avg:  4m 58s | Max:  5m 23s
      🟩 90a                Pass: 100%/4   | Total: 14m 39s | Avg:  3m 39s | Max:  3m 54s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 38m | Avg:  5m 40s | Max: 25m 54s
      🟩 14                 Pass: 100%/27  | Total:  3h 11m | Avg:  7m 05s | Max: 49m 38s | Hits:  99%/1462  
      🟩 17                 Pass: 100%/26  | Total:  2h 13m | Avg:  5m 08s | Max: 13m 19s | Hits:  99%/731   
      🟩 20                 Pass: 100%/23  | Total:  4h 08m | Avg: 10m 49s | Max: 54m 37s | Hits:  99%/731   
    
  • 🟩 libcudacxx: Pass: 100%/104 | Total: 11h 20m | Avg: 6m 32s | Max: 38m 42s | Hits: 88%/11383

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 10h 50m | Avg:  6m 46s | Max: 38m 42s | Hits:  88%/11383 
      🟩 arm64              Pass: 100%/8   | Total: 30m 04s | Avg:  3m 45s | Max:  4m 13s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 01m | Avg:  4m 06s | Max: 18m 50s | Hits:  99%/2648  
      🟩 11.8               Pass: 100%/3   | Total: 30m 04s | Avg: 10m 01s | Max: 22m 40s
      🟩 12.6               Pass: 100%/86  | Total:  9h 49m | Avg:  6m 51s | Max: 38m 42s | Hits:  85%/8735  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 36m 45s | Avg: 18m 22s | Max: 19m 06s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 01m | Avg:  4m 06s | Max: 18m 50s | Hits:  99%/2648  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 30m 04s | Avg: 10m 01s | Max: 22m 40s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  9h 12m | Avg:  6m 34s | Max: 38m 42s | Hits:  85%/8735  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 36m 45s | Avg: 18m 22s | Max: 19m 06s
      🟩 nvcc               Pass: 100%/102 | Total: 10h 44m | Avg:  6m 18s | Max: 38m 42s | Hits:  88%/11383 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 25m 23s | Avg:  4m 13s | Max:  5m 23s
      🟩 Clang10            Pass: 100%/3   | Total: 16m 02s | Avg:  5m 20s | Max:  5m 39s
      🟩 Clang11            Pass: 100%/4   | Total: 17m 34s | Avg:  4m 23s | Max:  4m 40s
      🟩 Clang12            Pass: 100%/4   | Total: 19m 15s | Avg:  4m 48s | Max:  5m 59s
      🟩 Clang13            Pass: 100%/4   | Total: 17m 23s | Avg:  4m 20s | Max:  4m 43s
      🟩 Clang14            Pass: 100%/4   | Total: 17m 41s | Avg:  4m 25s | Max:  4m 41s
      🟩 Clang15            Pass: 100%/4   | Total: 17m 58s | Avg:  4m 29s | Max:  4m 48s
      🟩 Clang16            Pass: 100%/4   | Total: 17m 53s | Avg:  4m 28s | Max:  4m 59s
      🟩 Clang17            Pass: 100%/4   | Total: 17m 13s | Avg:  4m 18s | Max:  4m 26s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 16m | Avg:  9m 34s | Max: 19m 46s
      🟩 GCC6               Pass: 100%/2   | Total:  6m 00s | Avg:  3m 00s | Max:  3m 27s
      🟩 GCC7               Pass: 100%/6   | Total: 20m 55s | Avg:  3m 29s | Max:  4m 12s
      🟩 GCC8               Pass: 100%/6   | Total: 19m 42s | Avg:  3m 17s | Max:  4m 05s
      🟩 GCC9               Pass: 100%/6   | Total: 22m 19s | Avg:  3m 43s | Max:  4m 23s
      🟩 GCC10              Pass: 100%/4   | Total: 16m 35s | Avg:  4m 08s | Max:  4m 29s
      🟩 GCC11              Pass: 100%/7   | Total: 46m 56s | Avg:  6m 42s | Max: 22m 40s
      🟩 GCC12              Pass: 100%/4   | Total: 16m 26s | Avg:  4m 06s | Max:  4m 15s
      🟩 GCC13              Pass: 100%/17  | Total:  3h 03m | Avg: 10m 49s | Max: 32m 33s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 18m 22s | Avg:  6m 07s | Max:  6m 13s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 18m 50s | Avg: 18m 50s | Max: 18m 50s | Hits:  99%/2648  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 29m 19s | Avg: 14m 39s | Max: 15m 33s | Hits:  99%/5658  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 38m 42s | Avg: 38m 42s | Max: 38m 42s | Hits:  58%/3077  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/45  | Total:  4h 02m | Avg:  5m 23s | Max: 19m 46s
      🟩 GCC                Pass: 100%/52  | Total:  5h 32m | Avg:  6m 23s | Max: 32m 33s
      🟩 Intel              Pass: 100%/3   | Total: 18m 22s | Avg:  6m 07s | Max:  6m 13s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 26m | Avg: 21m 42s | Max: 38m 42s | Hits:  88%/11383 
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 11h 20m | Avg:  6m 32s | Max: 38m 42s | Hits:  88%/11383 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  8h 35m | Avg:  5m 22s | Max: 38m 42s | Hits:  88%/11383 
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 52m | Avg: 28m 10s | Max: 32m 33s
      🟩 Test               Pass: 100%/3   | Total: 50m 30s | Avg: 16m 50s | Max: 19m 46s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 15s | Avg:  2m 15s | Max:  2m 15s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 30m 04s | Avg: 10m 01s | Max: 22m 40s
      🟩 90a                Pass: 100%/4   | Total: 15m 24s | Avg:  3m 51s | Max:  4m 14s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 11m | Avg:  4m 40s | Max: 22m 21s
      🟩 14                 Pass: 100%/28  | Total:  3h 00m | Avg:  6m 26s | Max: 25m 14s | Hits:  99%/5397  
      🟩 17                 Pass: 100%/27  | Total:  2h 50m | Avg:  6m 18s | Max: 32m 33s | Hits:  99%/2909  
      🟩 20                 Pass: 100%/20  | Total:  3h 16m | Avg:  9m 50s | Max: 38m 42s | Hits:  58%/3077  
    
  • 🟩 thrust: Pass: 100%/103 | Total: 10h 27m | Avg: 6m 05s | Max: 25m 04s | Hits: 99%/11150

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total:  9h 51m | Avg:  6m 13s | Max: 25m 04s | Hits:  99%/11150 
      🟩 arm64              Pass: 100%/8   | Total: 36m 02s | Avg:  4m 30s | Max:  5m 12s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 18m | Avg:  5m 12s | Max: 20m 06s | Hits:  99%/2230  
      🟩 11.8               Pass: 100%/3   | Total: 15m 54s | Avg:  5m 18s | Max:  5m 43s
      🟩 12.6               Pass: 100%/85  | Total:  8h 53m | Avg:  6m 16s | Max: 25m 04s | Hits:  99%/8920  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 53s | Avg:  4m 56s | Max:  5m 02s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 18m | Avg:  5m 12s | Max: 20m 06s | Hits:  99%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 15m 54s | Avg:  5m 18s | Max:  5m 43s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  8h 43m | Avg:  6m 18s | Max: 25m 04s | Hits:  99%/8920  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 53s | Avg:  4m 56s | Max:  5m 02s
      🟩 nvcc               Pass: 100%/101 | Total: 10h 17m | Avg:  6m 06s | Max: 25m 04s | Hits:  99%/11150 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 31m 36s | Avg:  5m 16s | Max:  6m 40s
      🟩 Clang10            Pass: 100%/3   | Total: 18m 32s | Avg:  6m 10s | Max:  6m 30s
      🟩 Clang11            Pass: 100%/4   | Total: 19m 14s | Avg:  4m 48s | Max:  5m 01s
      🟩 Clang12            Pass: 100%/4   | Total: 20m 11s | Avg:  5m 02s | Max:  5m 12s
      🟩 Clang13            Pass: 100%/4   | Total: 20m 09s | Avg:  5m 02s | Max:  5m 13s
      🟩 Clang14            Pass: 100%/4   | Total: 19m 23s | Avg:  4m 50s | Max:  4m 56s
      🟩 Clang15            Pass: 100%/4   | Total: 19m 33s | Avg:  4m 53s | Max:  5m 08s
      🟩 Clang16            Pass: 100%/4   | Total: 20m 01s | Avg:  5m 00s | Max:  5m 42s
      🟩 Clang17            Pass: 100%/4   | Total: 20m 36s | Avg:  5m 09s | Max:  5m 33s
      🟩 Clang18            Pass: 100%/9   | Total: 52m 19s | Avg:  5m 48s | Max: 12m 44s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 56s | Avg:  3m 58s | Max:  4m 10s
      🟩 GCC7               Pass: 100%/6   | Total: 26m 27s | Avg:  4m 24s | Max:  5m 21s
      🟩 GCC8               Pass: 100%/6   | Total: 27m 20s | Avg:  4m 33s | Max:  5m 28s
      🟩 GCC9               Pass: 100%/6   | Total: 27m 58s | Avg:  4m 39s | Max:  5m 22s
      🟩 GCC10              Pass: 100%/4   | Total: 20m 37s | Avg:  5m 09s | Max:  5m 45s
      🟩 GCC11              Pass: 100%/7   | Total: 36m 33s | Avg:  5m 13s | Max:  5m 43s
      🟩 GCC12              Pass: 100%/4   | Total: 22m 14s | Avg:  5m 33s | Max:  5m 42s
      🟩 GCC13              Pass: 100%/14  | Total:  1h 39m | Avg:  7m 07s | Max: 25m 04s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 19m 32s | Avg:  6m 30s | Max:  6m 51s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 20m 06s | Avg: 20m 06s | Max: 20m 06s | Hits:  99%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 36m 01s | Avg: 18m 00s | Max: 19m 02s | Hits:  99%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 41m 21s | Avg: 20m 40s | Max: 23m 06s | Hits:  99%/4460  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  4h 01m | Avg:  5m 15s | Max: 12m 44s
      🟩 GCC                Pass: 100%/49  | Total:  4h 28m | Avg:  5m 29s | Max: 25m 04s
      🟩 Intel              Pass: 100%/3   | Total: 19m 32s | Avg:  6m 30s | Max:  6m 51s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 37m | Avg: 19m 29s | Max: 23m 06s | Hits:  99%/11150 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total: 10h 27m | Avg:  6m 05s | Max: 25m 04s | Hits:  99%/11150 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  8h 51m | Avg:  5m 32s | Max: 20m 06s | Hits:  99%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 46m 09s | Avg: 11m 32s | Max: 23m 06s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total: 49m 21s | Avg: 16m 27s | Max: 25m 04s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 15m 54s | Avg:  5m 18s | Max:  5m 43s
      🟩 90a                Pass: 100%/4   | Total: 17m 15s | Avg:  4m 18s | Max:  4m 37s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 20m | Avg:  5m 00s | Max: 11m 33s
      🟩 14                 Pass: 100%/27  | Total:  2h 39m | Avg:  5m 53s | Max: 20m 06s | Hits:  99%/4460  
      🟩 17                 Pass: 100%/26  | Total:  2h 30m | Avg:  5m 48s | Max: 19m 02s | Hits:  99%/2230  
      🟩 20                 Pass: 100%/22  | Total:  2h 57m | Avg:  8m 02s | Max: 25m 04s | Hits:  99%/4460  
    
  • 🟩 cudax: Pass: 100%/52 | Total: 2h 34m | Avg: 2m 58s | Max: 10m 36s | Hits: 90%/222

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  2h 24m | Avg:  3m 00s | Max: 10m 36s | Hits:  90%/222   
      🟩 arm64              Pass: 100%/4   | Total: 10m 03s | Avg:  2m 30s | Max:  2m 43s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total: 55m 13s | Avg:  2m 54s | Max: 10m 12s | Hits:  90%/111   
      🟩 12.6               Pass: 100%/33  | Total:  1h 39m | Avg:  3m 00s | Max: 10m 36s | Hits:  90%/111   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total: 55m 13s | Avg:  2m 54s | Max: 10m 12s | Hits:  90%/111   
      🟩 nvcc12.6           Pass: 100%/33  | Total:  1h 39m | Avg:  3m 00s | Max: 10m 36s | Hits:  90%/111   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/52  | Total:  2h 34m | Avg:  2m 58s | Max: 10m 36s | Hits:  90%/222   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  5m 27s | Avg:  2m 43s | Max:  3m 08s
      🟩 Clang10            Pass: 100%/2   | Total:  5m 14s | Avg:  2m 37s | Max:  3m 06s
      🟩 Clang11            Pass: 100%/4   | Total:  9m 52s | Avg:  2m 28s | Max:  2m 41s
      🟩 Clang12            Pass: 100%/4   | Total:  9m 24s | Avg:  2m 21s | Max:  2m 43s
      🟩 Clang13            Pass: 100%/4   | Total:  9m 55s | Avg:  2m 28s | Max:  2m 56s
      🟩 Clang14            Pass: 100%/4   | Total: 13m 29s | Avg:  3m 22s | Max:  5m 52s
      🟩 Clang15            Pass: 100%/2   | Total:  5m 11s | Avg:  2m 35s | Max:  2m 39s
      🟩 Clang16            Pass: 100%/4   | Total:  9m 56s | Avg:  2m 29s | Max:  2m 41s
      🟩 Clang17            Pass: 100%/2   | Total:  5m 43s | Avg:  2m 51s | Max:  2m 56s
      🟩 Clang18            Pass: 100%/2   | Total:  6m 32s | Avg:  3m 16s | Max:  4m 01s
      🟩 GCC9               Pass: 100%/2   | Total:  5m 00s | Avg:  2m 30s | Max:  2m 47s
      🟩 GCC10              Pass: 100%/4   | Total: 10m 01s | Avg:  2m 30s | Max:  2m 55s
      🟩 GCC11              Pass: 100%/4   | Total:  9m 01s | Avg:  2m 15s | Max:  2m 40s
      🟩 GCC12              Pass: 100%/7   | Total: 20m 44s | Avg:  2m 57s | Max:  4m 09s
      🟩 GCC13              Pass: 100%/3   | Total:  8m 08s | Avg:  2m 42s | Max:  2m 52s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 12s | Avg: 10m 12s | Max: 10m 12s | Hits:  90%/111   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 36s | Avg: 10m 36s | Max: 10m 36s | Hits:  90%/111   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 20m | Avg:  2m 41s | Max:  5m 52s
      🟩 GCC                Pass: 100%/20  | Total: 52m 54s | Avg:  2m 38s | Max:  4m 09s
      🟩 MSVC               Pass: 100%/2   | Total: 20m 48s | Avg: 10m 24s | Max: 10m 36s | Hits:  90%/222   
    🟩 gpu
      🟩 v100               Pass: 100%/52  | Total:  2h 34m | Avg:  2m 58s | Max: 10m 36s | Hits:  90%/222   
    🟩 jobs
      🟩 Build              Pass: 100%/47  | Total:  2h 12m | Avg:  2m 49s | Max: 10m 36s | Hits:  90%/222   
      🟩 Test               Pass: 100%/5   | Total: 21m 34s | Avg:  4m 18s | Max:  5m 52s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 19s | Avg:  2m 19s | Max:  2m 19s
      🟩 90a                Pass: 100%/1   | Total:  2m 52s | Avg:  2m 52s | Max:  2m 52s
    🟩 std
      🟩 17                 Pass: 100%/28  | Total:  1h 13m | Avg:  2m 37s | Max:  4m 09s
      🟩 20                 Pass: 100%/24  | Total:  1h 21m | Avg:  3m 22s | Max: 10m 36s | Hits:  90%/222   
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 18m 47s | Avg: 18m 47s | Max: 18m 47s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 18m 47s | Avg: 18m 47s | Max: 18m 47s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 18m 47s | Avg: 18m 47s | Max: 18m 47s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 18m 47s | Avg: 18m 47s | Max: 18m 47s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 18m 47s | Avg: 18m 47s | Max: 18m 47s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 18m 47s | Avg: 18m 47s | Max: 18m 47s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 18m 47s | Avg: 18m 47s | Max: 18m 47s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 18m 47s | Avg: 18m 47s | Max: 18m 47s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 18m 47s | Avg: 18m 47s | Max: 18m 47s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 364)

# Runner
297 linux-amd64-cpu16
28 linux-arm64-cpu16
24 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

Copy link
Contributor

🟩 CI finished in 1h 50m: Pass: 100%/366 | Total: 6d 16h | Avg: 26m 17s | Max: 1h 16m | Hits: 11%/27881
  • 🟩 cub: Pass: 100%/104 | Total: 3d 13h | Avg: 49m 36s | Max: 1h 12m | Hits: 2%/2932

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total:  3d 06h | Avg: 49m 16s | Max:  1h 12m | Hits:   2%/2932  
      🟩 arm64              Pass: 100%/8   | Total:  7h 08m | Avg: 53m 34s | Max: 54m 49s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total: 11h 51m | Avg: 47m 27s | Max:  1h 02m | Hits:   2%/733   
      🟩 11.8               Pass: 100%/3   | Total:  3h 32m | Avg:  1h 10m | Max:  1h 12m
      🟩 12.6               Pass: 100%/86  | Total:  2d 22h | Avg: 49m 14s | Max:  1h 12m | Hits:   2%/2199  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 55m | Avg: 57m 32s | Max:  1h 01m
      🟩 nvcc11.1           Pass: 100%/15  | Total: 11h 51m | Avg: 47m 27s | Max:  1h 02m | Hits:   2%/733   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  3h 32m | Avg:  1h 10m | Max:  1h 12m
      🟩 nvcc12.6           Pass: 100%/84  | Total:  2d 20h | Avg: 49m 03s | Max:  1h 12m | Hits:   2%/2199  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 55m | Avg: 57m 32s | Max:  1h 01m
      🟩 nvcc               Pass: 100%/102 | Total:  3d 12h | Avg: 49m 27s | Max:  1h 12m | Hits:   2%/2932  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  4h 51m | Avg: 48m 39s | Max: 53m 09s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 40m | Avg: 53m 32s | Max: 59m 08s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 28m | Avg: 52m 13s | Max: 56m 20s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 28m | Avg: 52m 03s | Max: 54m 25s
      🟩 Clang13            Pass: 100%/4   | Total:  3h 22m | Avg: 50m 32s | Max: 50m 58s
      🟩 Clang14            Pass: 100%/4   | Total:  3h 22m | Avg: 50m 38s | Max: 53m 16s
      🟩 Clang15            Pass: 100%/4   | Total:  3h 43m | Avg: 55m 57s | Max:  1h 09m
      🟩 Clang16            Pass: 100%/4   | Total:  3h 23m | Avg: 50m 45s | Max: 51m 04s
      🟩 Clang17            Pass: 100%/4   | Total:  3h 24m | Avg: 51m 13s | Max: 52m 24s
      🟩 Clang18            Pass: 100%/9   | Total:  7h 17m | Avg: 48m 37s | Max:  1h 01m
      🟩 GCC6               Pass: 100%/2   | Total:  1h 35m | Avg: 47m 36s | Max: 50m 15s
      🟩 GCC7               Pass: 100%/6   | Total:  4h 58m | Avg: 49m 43s | Max: 55m 25s
      🟩 GCC8               Pass: 100%/6   | Total:  4h 51m | Avg: 48m 36s | Max: 52m 52s
      🟩 GCC9               Pass: 100%/6   | Total:  4h 54m | Avg: 49m 06s | Max: 53m 52s
      🟩 GCC10              Pass: 100%/4   | Total:  3h 35m | Avg: 53m 50s | Max: 56m 04s
      🟩 GCC11              Pass: 100%/7   | Total:  6h 58m | Avg: 59m 46s | Max:  1h 12m
      🟩 GCC12              Pass: 100%/4   | Total:  3h 31m | Avg: 52m 56s | Max: 54m 30s
      🟩 GCC13              Pass: 100%/16  | Total:  9h 04m | Avg: 34m 00s | Max: 56m 10s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 52m | Avg: 57m 32s | Max:  1h 01m
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 02m | Avg:  1h 02m | Max:  1h 02m | Hits:   2%/733   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 11m | Hits:   2%/1466  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  1h 12m | Avg:  1h 12m | Max:  1h 12m | Hits:   2%/733   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  1d 15h | Avg: 50m 57s | Max:  1h 09m
      🟩 GCC                Pass: 100%/51  | Total:  1d 15h | Avg: 46m 27s | Max:  1h 12m
      🟩 Intel              Pass: 100%/3   | Total:  2h 52m | Avg: 57m 32s | Max:  1h 01m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 33m | Avg:  1h 08m | Max:  1h 12m | Hits:   2%/2932  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total:  3d 13h | Avg: 49m 36s | Max:  1h 12m | Hits:   2%/2932  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  3d 10h | Avg: 51m 51s | Max:  1h 12m | Hits:   2%/2932  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 20m 44s | Avg: 20m 44s | Max: 20m 44s
      🟩 GraphCapture       Pass: 100%/1   | Total: 23m 24s | Avg: 23m 24s | Max: 23m 24s
      🟩 HostLaunch         Pass: 100%/3   | Total: 54m 02s | Avg: 18m 00s | Max: 19m 49s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 22m | Avg: 27m 28s | Max: 34m 36s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  3h 32m | Avg:  1h 10m | Max:  1h 12m
      🟩 90a                Pass: 100%/4   | Total:  1h 30m | Avg: 22m 43s | Max: 23m 55s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 23h 00m | Avg: 49m 19s | Max:  1h 12m
      🟩 14                 Pass: 100%/27  | Total: 23h 08m | Avg: 51m 24s | Max:  1h 07m | Hits:   2%/1466  
      🟩 17                 Pass: 100%/26  | Total: 22h 36m | Avg: 52m 11s | Max:  1h 12m | Hits:   2%/733   
      🟩 20                 Pass: 100%/23  | Total: 17h 13m | Avg: 44m 56s | Max:  1h 12m | Hits:   2%/733   
    
  • 🟩 libcudacxx: Pass: 100%/104 | Total: 13h 28m | Avg: 7m 46s | Max: 52m 33s | Hits: 3%/11638

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 12h 55m | Avg:  8m 04s | Max: 52m 33s | Hits:   3%/11638 
      🟩 arm64              Pass: 100%/8   | Total: 33m 14s | Avg:  4m 09s | Max:  4m 53s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 31m | Avg:  6m 04s | Max: 43m 37s | Hits:   3%/2711  
      🟩 11.8               Pass: 100%/3   | Total: 11m 55s | Avg:  3m 58s | Max:  4m 11s
      🟩 12.6               Pass: 100%/86  | Total: 11h 45m | Avg:  8m 12s | Max: 52m 33s | Hits:   3%/8927  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 38m 09s | Avg: 19m 04s | Max: 19m 21s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 31m | Avg:  6m 04s | Max: 43m 37s | Hits:   3%/2711  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 11m 55s | Avg:  3m 58s | Max:  4m 11s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 11h 07m | Avg:  7m 56s | Max: 52m 33s | Hits:   3%/8927  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 38m 09s | Avg: 19m 04s | Max: 19m 21s
      🟩 nvcc               Pass: 100%/102 | Total: 12h 50m | Avg:  7m 33s | Max: 52m 33s | Hits:   3%/11638 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 27m 53s | Avg:  4m 38s | Max:  6m 10s
      🟩 Clang10            Pass: 100%/3   | Total: 17m 38s | Avg:  5m 52s | Max:  6m 40s
      🟩 Clang11            Pass: 100%/4   | Total: 19m 25s | Avg:  4m 51s | Max:  5m 03s
      🟩 Clang12            Pass: 100%/4   | Total: 18m 37s | Avg:  4m 39s | Max:  4m 55s
      🟩 Clang13            Pass: 100%/4   | Total: 17m 43s | Avg:  4m 25s | Max:  4m 34s
      🟩 Clang14            Pass: 100%/4   | Total: 18m 04s | Avg:  4m 31s | Max:  4m 58s
      🟩 Clang15            Pass: 100%/4   | Total: 18m 32s | Avg:  4m 38s | Max:  4m 59s
      🟩 Clang16            Pass: 100%/4   | Total: 18m 31s | Avg:  4m 37s | Max:  5m 03s
      🟩 Clang17            Pass: 100%/4   | Total: 18m 29s | Avg:  4m 37s | Max:  4m 56s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 32m | Avg: 11m 30s | Max: 31m 51s
      🟩 GCC6               Pass: 100%/2   | Total:  6m 05s | Avg:  3m 02s | Max:  3m 20s
      🟩 GCC7               Pass: 100%/6   | Total: 23m 02s | Avg:  3m 50s | Max:  4m 36s
      🟩 GCC8               Pass: 100%/6   | Total: 21m 51s | Avg:  3m 38s | Max:  4m 19s
      🟩 GCC9               Pass: 100%/6   | Total: 23m 18s | Avg:  3m 53s | Max:  4m 45s
      🟩 GCC10              Pass: 100%/4   | Total: 17m 30s | Avg:  4m 22s | Max:  4m 40s
      🟩 GCC11              Pass: 100%/7   | Total: 29m 19s | Avg:  4m 11s | Max:  4m 59s
      🟩 GCC12              Pass: 100%/4   | Total: 17m 24s | Avg:  4m 21s | Max:  4m 58s
      🟩 GCC13              Pass: 100%/17  | Total:  3h 15m | Avg: 11m 31s | Max: 31m 56s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 18m 44s | Avg:  6m 14s | Max:  6m 34s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 43m 37s | Avg: 43m 37s | Max: 43m 37s | Hits:   3%/2711  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 32m | Avg: 46m 04s | Max: 49m 01s | Hits:   3%/5786  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 52m 33s | Avg: 52m 33s | Max: 52m 33s | Hits:   3%/3141  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/45  | Total:  4h 26m | Avg:  5m 55s | Max: 31m 51s
      🟩 GCC                Pass: 100%/52  | Total:  5h 34m | Avg:  6m 25s | Max: 31m 56s
      🟩 Intel              Pass: 100%/3   | Total: 18m 44s | Avg:  6m 14s | Max:  6m 34s
      🟩 MSVC               Pass: 100%/4   | Total:  3h 08m | Avg: 47m 04s | Max: 52m 33s | Hits:   3%/11638 
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 13h 28m | Avg:  7m 46s | Max: 52m 33s | Hits:   3%/11638 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total: 10h 22m | Avg:  6m 29s | Max: 52m 33s | Hits:   3%/11638 
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 38m | Avg: 24m 43s | Max: 31m 20s
      🟩 Test               Pass: 100%/3   | Total:  1h 24m | Avg: 28m 07s | Max: 31m 56s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 44s | Avg:  2m 44s | Max:  2m 44s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 11m 55s | Avg:  3m 58s | Max:  4m 11s
      🟩 90a                Pass: 100%/4   | Total: 17m 40s | Avg:  4m 25s | Max:  5m 03s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 24m | Avg:  5m 08s | Max: 21m 01s
      🟩 14                 Pass: 100%/28  | Total:  3h 34m | Avg:  7m 38s | Max: 43m 37s | Hits:   3%/5524  
      🟩 17                 Pass: 100%/27  | Total:  3h 28m | Avg:  7m 43s | Max: 49m 01s | Hits:   3%/2973  
      🟩 20                 Pass: 100%/20  | Total:  3h 58m | Avg: 11m 56s | Max: 52m 33s | Hits:   3%/3141  
    
  • 🟩 thrust: Pass: 100%/103 | Total: 2d 09h | Avg: 33m 24s | Max: 1h 16m | Hits: 19%/13095

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total:  2d 05h | Avg: 33m 29s | Max:  1h 16m | Hits:  19%/13095 
      🟩 arm64              Pass: 100%/8   | Total:  4h 19m | Avg: 32m 28s | Max: 38m 43s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  8h 29m | Avg: 33m 56s | Max:  1h 16m | Hits:   0%/2619  
      🟩 11.8               Pass: 100%/3   | Total:  2h 12m | Avg: 44m 01s | Max: 50m 14s
      🟩 12.6               Pass: 100%/85  | Total:  1d 22h | Avg: 32m 56s | Max:  1h 16m | Hits:  24%/10476 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 55m 13s | Avg: 27m 36s | Max: 27m 45s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  8h 29m | Avg: 33m 56s | Max:  1h 16m | Hits:   0%/2619  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 12m | Avg: 44m 01s | Max: 50m 14s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  1d 21h | Avg: 33m 04s | Max:  1h 16m | Hits:  24%/10476 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 55m 13s | Avg: 27m 36s | Max: 27m 45s
      🟩 nvcc               Pass: 100%/101 | Total:  2d 08h | Avg: 33m 31s | Max:  1h 16m | Hits:  19%/13095 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 15m | Avg: 32m 37s | Max: 37m 04s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 41m | Avg: 33m 56s | Max: 36m 58s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 12m | Avg: 33m 14s | Max: 35m 45s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 10m | Avg: 32m 40s | Max: 35m 54s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 11m | Avg: 32m 48s | Max: 35m 41s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 09m | Avg: 32m 17s | Max: 34m 32s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 12m | Avg: 33m 01s | Max: 35m 52s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 15m | Avg: 33m 53s | Max: 36m 41s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 16m | Avg: 34m 04s | Max: 36m 06s
      🟩 Clang18            Pass: 100%/9   | Total:  3h 54m | Avg: 26m 05s | Max: 35m 33s
      🟩 GCC6               Pass: 100%/2   | Total: 57m 45s | Avg: 28m 52s | Max: 31m 23s
      🟩 GCC7               Pass: 100%/6   | Total:  3h 07m | Avg: 31m 10s | Max: 35m 32s
      🟩 GCC8               Pass: 100%/6   | Total:  3h 14m | Avg: 32m 27s | Max: 35m 38s
      🟩 GCC9               Pass: 100%/6   | Total:  3h 26m | Avg: 34m 20s | Max: 39m 02s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 17m | Avg: 34m 18s | Max: 37m 27s
      🟩 GCC11              Pass: 100%/7   | Total:  4h 35m | Avg: 39m 19s | Max: 50m 14s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 21m | Avg: 35m 25s | Max: 38m 13s
      🟩 GCC13              Pass: 100%/14  | Total:  5h 26m | Avg: 23m 21s | Max: 40m 01s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 08m | Avg: 42m 44s | Max: 47m 44s
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 16m | Avg:  1h 16m | Max:  1h 16m | Hits:   0%/2619  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 26m | Avg:  1h 13m | Max:  1h 14m | Hits:   0%/5238  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 42m | Avg: 51m 26s | Max:  1h 16m | Hits:  49%/5238  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  1d 00h | Avg: 31m 44s | Max: 37m 04s
      🟩 GCC                Pass: 100%/49  | Total:  1d 01h | Avg: 31m 09s | Max: 50m 14s
      🟩 Intel              Pass: 100%/3   | Total:  2h 08m | Avg: 42m 44s | Max: 47m 44s
      🟩 MSVC               Pass: 100%/5   | Total:  5h 26m | Avg:  1h 05m | Max:  1h 16m | Hits:  19%/13095 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total:  2d 09h | Avg: 33m 24s | Max:  1h 16m | Hits:  19%/13095 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  2d 07h | Avg: 34m 55s | Max:  1h 16m | Hits:   0%/10476 
      🟩 TestCPU            Pass: 100%/4   | Total: 49m 19s | Avg: 12m 19s | Max: 26m 07s | Hits:  99%/2619  
      🟩 TestGPU            Pass: 100%/3   | Total: 38m 53s | Avg: 12m 57s | Max: 13m 56s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 12m | Avg: 44m 01s | Max: 50m 14s
      🟩 90a                Pass: 100%/4   | Total:  1h 18m | Avg: 19m 36s | Max: 21m 16s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 12h 46m | Avg: 27m 22s | Max: 36m 19s
      🟩 14                 Pass: 100%/27  | Total: 16h 53m | Avg: 37m 33s | Max:  1h 16m | Hits:   0%/5238  
      🟩 17                 Pass: 100%/26  | Total: 15h 58m | Avg: 36m 52s | Max:  1h 14m | Hits:   0%/2619  
      🟩 20                 Pass: 100%/22  | Total: 11h 42m | Avg: 31m 55s | Max:  1h 16m | Hits:  49%/5238  
    
  • 🟩 cudax: Pass: 100%/52 | Total: 3h 09m | Avg: 3m 38s | Max: 13m 16s | Hits: 12%/216

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  2h 58m | Avg:  3m 42s | Max: 13m 16s | Hits:  12%/216   
      🟩 arm64              Pass: 100%/4   | Total: 11m 23s | Avg:  2m 50s | Max:  3m 04s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  1h 12m | Avg:  3m 47s | Max: 13m 16s | Hits:  12%/108   
      🟩 12.6               Pass: 100%/33  | Total:  1h 57m | Avg:  3m 33s | Max: 13m 05s | Hits:  12%/108   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  1h 12m | Avg:  3m 47s | Max: 13m 16s | Hits:  12%/108   
      🟩 nvcc12.6           Pass: 100%/33  | Total:  1h 57m | Avg:  3m 33s | Max: 13m 05s | Hits:  12%/108   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/52  | Total:  3h 09m | Avg:  3m 38s | Max: 13m 16s | Hits:  12%/216   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  7m 06s | Avg:  3m 33s | Max:  3m 42s
      🟩 Clang10            Pass: 100%/2   | Total:  5m 42s | Avg:  2m 51s | Max:  3m 08s
      🟩 Clang11            Pass: 100%/4   | Total: 12m 47s | Avg:  3m 11s | Max:  3m 24s
      🟩 Clang12            Pass: 100%/4   | Total: 13m 05s | Avg:  3m 16s | Max:  3m 28s
      🟩 Clang13            Pass: 100%/4   | Total: 13m 14s | Avg:  3m 18s | Max:  3m 28s
      🟩 Clang14            Pass: 100%/4   | Total: 13m 42s | Avg:  3m 25s | Max:  4m 00s
      🟩 Clang15            Pass: 100%/2   | Total:  6m 14s | Avg:  3m 07s | Max:  3m 22s
      🟩 Clang16            Pass: 100%/4   | Total: 12m 00s | Avg:  3m 00s | Max:  3m 30s
      🟩 Clang17            Pass: 100%/2   | Total:  6m 33s | Avg:  3m 16s | Max:  3m 25s
      🟩 Clang18            Pass: 100%/2   | Total:  7m 53s | Avg:  3m 56s | Max:  4m 49s
      🟩 GCC9               Pass: 100%/2   | Total:  6m 25s | Avg:  3m 12s | Max:  3m 15s
      🟩 GCC10              Pass: 100%/4   | Total: 12m 55s | Avg:  3m 13s | Max:  3m 36s
      🟩 GCC11              Pass: 100%/4   | Total: 12m 55s | Avg:  3m 13s | Max:  3m 32s
      🟩 GCC12              Pass: 100%/7   | Total: 24m 15s | Avg:  3m 27s | Max:  3m 53s
      🟩 GCC13              Pass: 100%/3   | Total:  8m 20s | Avg:  2m 46s | Max:  3m 04s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 13m 16s | Avg: 13m 16s | Max: 13m 16s | Hits:  12%/108   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 13m 05s | Avg: 13m 05s | Max: 13m 05s | Hits:  12%/108   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 38m | Avg:  3m 16s | Max:  4m 49s
      🟩 GCC                Pass: 100%/20  | Total:  1h 04m | Avg:  3m 14s | Max:  3m 53s
      🟩 MSVC               Pass: 100%/2   | Total: 26m 21s | Avg: 13m 10s | Max: 13m 16s | Hits:  12%/216   
    🟩 gpu
      🟩 v100               Pass: 100%/52  | Total:  3h 09m | Avg:  3m 38s | Max: 13m 16s | Hits:  12%/216   
    🟩 jobs
      🟩 Build              Pass: 100%/47  | Total:  2h 49m | Avg:  3m 36s | Max: 13m 16s | Hits:  12%/216   
      🟩 Test               Pass: 100%/5   | Total: 20m 09s | Avg:  4m 01s | Max:  4m 49s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 06s | Avg:  3m 06s | Max:  3m 06s
      🟩 90a                Pass: 100%/1   | Total:  2m 30s | Avg:  2m 30s | Max:  2m 30s
    🟩 std
      🟩 17                 Pass: 100%/28  | Total:  1h 30m | Avg:  3m 12s | Max:  3m 53s
      🟩 20                 Pass: 100%/24  | Total:  1h 39m | Avg:  4m 08s | Max: 13m 16s | Hits:  12%/216   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 54s | Avg: 5m 27s | Max: 8m 26s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 54s | Avg:  5m 27s | Max:  8m 26s
    🟩 ctk
      🟩 12.5               Pass: 100%/2   | Total: 10m 54s | Avg:  5m 27s | Max:  8m 26s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/2   | Total: 10m 54s | Avg:  5m 27s | Max:  8m 26s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 54s | Avg:  5m 27s | Max:  8m 26s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 54s | Avg:  5m 27s | Max:  8m 26s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 54s | Avg:  5m 27s | Max:  8m 26s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 54s | Avg:  5m 27s | Max:  8m 26s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 28s | Avg:  2m 28s | Max:  2m 28s
      🟩 Test               Pass: 100%/1   | Total:  8m 26s | Avg:  8m 26s | Max:  8m 26s
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
CUDA Experimental
pycuda
CCCL C Parallel Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CCCL C Parallel Library

🏃‍ Runner counts (total jobs: 366)

# Runner
298 linux-amd64-cpu16
28 linux-arm64-cpu16
25 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

cub/cub/detail/type_traits.cuh Outdated Show resolved Hide resolved
cub/cub/detail/type_traits.cuh Outdated Show resolved Hide resolved
cub/cub/detail/type_traits.cuh Outdated Show resolved Hide resolved
cub/cub/detail/type_traits.cuh Outdated Show resolved Hide resolved
cub/cub/detail/type_traits.cuh Outdated Show resolved Hide resolved
docs/cub/developer_overview.rst Outdated Show resolved Hide resolved
cub/cub/thread/thread_reduce.cuh Outdated Show resolved Hide resolved
Comment on lines 319 to 320
(return (is_one_of<T, ::cuda::std::int32_t, ::cuda::std::uint32_t, ::cuda::std::int64_t, ::cuda::std::uint64_t>
&& is_one_of<ReductionOp, cub::Min, cub::Max, cub::Sum, cub::BitAnd, cub::BitOr, cub::BitXor>())
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We should move that into a proper variable tempalte or struct

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

uhmm, not sure what you are referring to. It looks pretty readable to me

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

does the following solution make sense?

template <typename T, typename ReductionOp>
struct enable_ternary_reduction_sm90
{
  static constexpr bool value =
    cub::detail::is_one_of<T, ::cuda::std::int32_t, ::cuda::std::uint32_t, ::cuda::std::int64_t, ::cuda::std::uint64_t>
    && cub::detail::is_one_of<ReductionOp, cub::Min, cub::Max, cub::Sum, cub::BitAnd, cub::BitOr, cub::BitXor>();
};

#  if defined(_CCCL_HAS_NVFP16)

template <typename ReductionOp>
struct enable_ternary_reduction_sm90<__half2, ReductionOp>
{
  static constexpr bool value =
    cub::detail::is_one_of<ReductionOp, cub::Min, cub::Max, SimdMin<__half>, SimdMax<__half>>();
};

#  endif // defined(_CCCL_HAS_NVFP16)


#  if defined(_CCCL_HAS_NVBF16)

template <typename ReductionOp>
struct enable_ternary_reduction_sm90<__nv_bfloat162, ReductionOp>
{
  static constexpr bool value =
    cub::detail::is_one_of<ReductionOp, cub::Min, cub::Max, SimdMin<__nv_bfloat16>, SimdMax<__nv_bfloat16>>();
};

#  endif // defined(_CCCL_HAS_NVBF16)

Comment on lines 220 to 227
return ((is_one_of<T, ::cuda::std::int16_t, ::cuda::std::uint16_t>() && is_one_of<ReductionOp, cub::Min, cub::Max>())
# if defined(_CCCL_HAS_NVFP16)
|| (::cuda::std::is_same<T, __half>::value && is_one_of<ReductionOp, cub::Min, cub::Max, cub::Sum, cub::Mul>())
# endif
# if defined(_CCCL_HAS_NVBF16)
|| (::cuda::std::is_same<T, __nv_bfloat16>::value &&
is_one_of<ReductionOp, cub::Min, cub::Max, cub::Sum, cub::Mul>())
# endif
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I find this technique really hard to read, why are we not using a struct that we specialize for half and friends?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

sorry I don't exactly understand what you are proposing. Are you concerned by the #if defines? Do you suggest to break this function into multiple template structs with specializations for different types?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what do you think?

template <typename T, typename ReductionOp>
struct enable_generic_simd_reduction_traits
{
  static constexpr bool value = cub::detail::is_one_of<T, ::cuda::std::int16_t, ::cuda::std::uint16_t>()
                             && cub::detail::is_one_of<ReductionOp, cub::Min, cub::Max>();
};

#  if defined(_CCCL_HAS_NVFP16)

template <typename ReductionOp>
struct enable_generic_simd_reduction_traits<__half, ReductionOp>
{
  static constexpr bool value = cub::detail::is_one_of<ReductionOp, cub::Min, cub::Max, cub::Sum, cub::Mul>();
};
#  endif // defined(_CCCL_HAS_NVFP16)

#  if defined(_CCCL_HAS_NVBF16)

template <typename ReductionOp>
struct enable_generic_simd_reduction_traits<__nv_bfloat16, ReductionOp>
{
  static constexpr bool value = cub::detail::is_one_of<ReductionOp, cub::Min, cub::Max, cub::Sum, cub::Mul>();
};

#  endif // defined(_CCCL_HAS_NVBF16)

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah

cub/cub/thread/thread_reduce.cuh Outdated Show resolved Hide resolved
@fbusato fbusato requested review from a team as code owners November 5, 2024 00:14
@fbusato fbusato closed this Nov 9, 2024
@fbusato fbusato force-pushed the thread-reduce-optimization branch from 39ba5f5 to 9616009 Compare November 9, 2024 01:05
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

2 participants