[PTX] Add st, ld instructions #3974

Merged: 1 commit, Mar 3, 2025

@fbusato fbusato commented Mar 1, 2025

Description

Following #3939

This PR applies the generated files from libcudaptx for the st and ld instructions and calls add_ptx_instruction.py for each of them. The availability table in the docs has been updated as well.

@fbusato fbusato added the 3.0 Targeted for 3.0 release label Mar 1, 2025
@fbusato fbusato self-assigned this Mar 1, 2025
@fbusato fbusato requested review from a team as code owners March 1, 2025 00:59
@fbusato fbusato requested a review from alliepiper March 1, 2025 00:59

github-actions bot commented Mar 1, 2025

🟨 CI finished in 1h 43m: Pass: 99%/158 | Total: 3d 11h | Avg: 31m 51s | Max: 1h 19m | Hits: 56%/249527
  • 🟨 libcudacxx: Pass: 97%/43 | Total: 14h 45m | Avg: 20m 35s | Max: 48m 00s | Hits: 47%/103876

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/41  | Total: 14h 01m | Avg: 20m 31s | Max: 48m 00s | Hits:  47%/98177 
      🟩 arm64              Pass: 100%/2   | Total: 43m 57s | Avg: 21m 58s | Max: 22m 21s | Hits:  33%/5699  
    🔍 ctk: 12.8 🔍
      🟩 12.0               Pass: 100%/5   | Total:  1h 12m | Avg: 14m 29s | Max: 25m 12s | Hits:  59%/13784 
      🟩 12.5               Pass: 100%/2   | Total:  1h 03m | Avg: 31m 53s | Max: 33m 01s | Hits:  31%/5644  
      🔍 12.8               Pass:  97%/36  | Total: 12h 29m | Avg: 20m 49s | Max: 48m 00s | Hits:  45%/84448 
    🔍 cudacxx: nvcc12.8 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 42m 35s | Avg: 21m 17s | Max: 22m 42s | Hits:  26%/5660  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 12m | Avg: 14m 29s | Max: 25m 12s | Hits:  59%/13784 
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 03m | Avg: 31m 53s | Max: 33m 01s | Hits:  31%/5644  
      🔍 nvcc12.8           Pass:  97%/34  | Total: 11h 46m | Avg: 20m 47s | Max: 48m 00s | Hits:  47%/78788 
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 42m 35s | Avg: 21m 17s | Max: 22m 42s | Hits:  26%/5660  
      🔍 nvcc               Pass:  97%/41  | Total: 14h 03m | Avg: 20m 33s | Max: 48m 00s | Hits:  48%/98216 
    🔍 cxx: GCC13 🔍
      🟩 Clang14            Pass: 100%/4   | Total: 55m 18s | Avg: 13m 49s | Max: 23m 55s | Hits:  65%/11290 
      🟩 Clang15            Pass: 100%/2   | Total: 28m 49s | Avg: 14m 24s | Max: 24m 04s | Hits:  65%/5656  
      🟩 Clang16            Pass: 100%/2   | Total: 46m 33s | Avg: 23m 16s | Max: 25m 13s | Hits:  33%/5656  
      🟩 Clang17            Pass: 100%/2   | Total: 48m 07s | Avg: 24m 03s | Max: 24m 21s | Hits:  33%/5656  
      🟩 Clang18            Pass: 100%/6   | Total:  2h 36m | Avg: 26m 07s | Max: 47m 07s | Hits:  31%/14165 
      🟩 GCC7               Pass: 100%/2   | Total: 41m 09s | Avg: 20m 34s | Max: 21m 44s | Hits:  34%/5594  
      🟩 GCC8               Pass: 100%/1   | Total: 20m 45s | Avg: 20m 45s | Max: 20m 45s | Hits:  34%/2807  
      🟩 GCC9               Pass: 100%/2   | Total: 23m 11s | Avg: 11m 35s | Max: 19m 04s | Hits:  65%/5606  
      🟩 GCC10              Pass: 100%/2   | Total: 28m 23s | Avg: 14m 11s | Max: 23m 40s | Hits:  65%/5662  
      🟩 GCC11              Pass: 100%/2   | Total: 42m 48s | Avg: 21m 24s | Max: 22m 44s | Hits:  33%/5658  
      🟩 GCC12              Pass: 100%/2   | Total: 44m 24s | Avg: 22m 12s | Max: 23m 19s | Hits:  33%/5658  
      🔍 GCC13              Pass:  90%/10  | Total:  3h 00m | Avg: 18m 01s | Max: 48m 00s | Hits:  58%/14406 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 50m 13s | Avg: 25m 06s | Max: 25m 12s | Hits:  65%/5128  
      🟩 MSVC14.42          Pass: 100%/2   | Total: 55m 16s | Avg: 27m 38s | Max: 30m 12s | Hits:  33%/5290  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 03m | Avg: 31m 53s | Max: 33m 01s | Hits:  31%/5644  
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/16  | Total:  5h 35m | Avg: 20m 58s | Max: 47m 07s | Hits:  45%/42423 
      🔍 GCC                Pass:  95%/21  | Total:  6h 20m | Avg: 18m 08s | Max: 48m 00s | Hits:  49%/45391 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 45m | Avg: 26m 22s | Max: 30m 12s | Hits:  49%/10418 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 53s | Max: 33m 01s | Hits:  31%/5644  
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/2   | Total: 21m 15s | Avg: 10m 37s | Max: 13m 36s | Hits:  95%/2939  
      🔍 rtx2080            Pass:  97%/41  | Total: 14h 24m | Avg: 21m 04s | Max: 48m 00s | Hits:  45%/100937
    🔍 jobs: NVRTC 🔍
      🟩 Build              Pass: 100%/37  | Total: 12h 24m | Avg: 20m 06s | Max: 33m 01s | Hits:  47%/103856
      🔍 NVRTC              Pass:  50%/2   | Total: 30m 39s | Avg: 15m 19s | Max: 15m 23s | Hits:  90%/20    
      🟩 Test               Pass: 100%/3   | Total:  1h 48m | Avg: 36m 14s | Max: 48m 00s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 06s | Avg:  2m 06s | Max:  2m 06s
    🔍 sm: 75 🔍
      🔍 75                 Pass:  50%/2   | Total: 30m 39s | Avg: 15m 19s | Max: 15m 23s | Hits:  90%/20    
      🟩 90                 Pass: 100%/2   | Total: 21m 15s | Avg: 10m 37s | Max: 13m 36s | Hits:  95%/2939  
      🟩 90;90a;100         Pass: 100%/1   | Total: 30m 03s | Avg: 30m 03s | Max: 30m 03s | Hits:  32%/2939  
    🔍 std: 17 🔍
      🔍 17                 Pass:  95%/21  | Total:  6h 52m | Avg: 19m 39s | Max: 30m 45s | Hits:  46%/55380 
      🟩 20                 Pass: 100%/21  | Total:  7h 50m | Avg: 22m 25s | Max: 48m 00s | Hits:  48%/48496 
    
  • 🟩 cub: Pass: 100%/45 | Total: 1d 17h | Avg: 55m 05s | Max: 1h 19m | Hits: 44%/53485

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 15h | Avg: 54m 41s | Max:  1h 19m | Hits:  45%/51055 
      🟩 arm64              Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 05m | Hits:  34%/2430  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 16m | Avg:  1h 03m | Max:  1h 05m | Hits:  30%/5908  
      🟩 12.5               Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 11m | Hits:  31%/2248  
      🟩 12.8               Pass: 100%/38  | Total:  1d 09h | Avg: 53m 14s | Max:  1h 19m | Hits:  47%/45329 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 09m | Hits:  35%/2100  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 16m | Avg:  1h 03m | Max:  1h 05m | Hits:  30%/5908  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 11m | Hits:  31%/2248  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  1d 07h | Avg: 52m 35s | Max:  1h 19m | Hits:  47%/43229 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 09m | Hits:  35%/2100  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 15h | Avg: 54m 38s | Max:  1h 19m | Hits:  45%/51385 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 01m | Avg:  1h 00m | Max:  1h 03m | Hits:  34%/4868  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 00m | Hits:  34%/2430  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 02m | Hits:  34%/2430  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 57m | Avg: 58m 38s | Max: 58m 50s | Hits:  34%/2430  
      🟩 Clang18            Pass: 100%/7   | Total:  5h 54m | Avg: 50m 38s | Max:  1h 09m | Hits:  54%/8175  
      🟩 GCC7               Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 05m | Hits:  34%/2434  
      🟩 GCC8               Pass: 100%/1   | Total: 58m 13s | Avg: 58m 13s | Max: 58m 13s | Hits:  34%/1217  
      🟩 GCC9               Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 02m | Hits:  34%/2434  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 03m | Hits:  34%/2434  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 58m | Avg: 59m 01s | Max: 59m 34s | Hits:  34%/2430  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 08m | Hits:  34%/2430  
      🟩 GCC13              Pass: 100%/11  | Total:  6h 55m | Avg: 37m 47s | Max:  1h 16m | Hits:  70%/13365 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 12m | Hits:  12%/2080  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 32m | Avg:  1h 16m | Max:  1h 19m | Hits:  12%/2080  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 11m | Hits:  31%/2248  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 15h 54m | Avg: 56m 07s | Max:  1h 09m | Hits:  42%/20333 
      🟩 GCC                Pass: 100%/22  | Total: 18h 14m | Avg: 49m 44s | Max:  1h 16m | Hits:  52%/26744 
      🟩 MSVC               Pass: 100%/4   | Total:  4h 50m | Avg:  1h 12m | Max:  1h 19m | Hits:  12%/4160  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 11m | Hits:  31%/2248  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 14m | Avg: 24m 50s | Max: 27m 31s | Hits:  77%/3645  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 11h | Avg:  1h 03m | Max:  1h 19m | Hits:  32%/40120 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 05m | Avg: 30m 44s | Max:  1h 00m | Hits:  83%/9720  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 14h | Avg:  1h 02m | Max:  1h 19m | Hits:  32%/43765 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 00s | Avg: 21m 00s | Max: 21m 00s | Hits:  99%/1215  
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 32s | Avg: 16m 32s | Max: 16m 32s | Hits:  99%/1215  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 09m | Avg: 23m 14s | Max: 23m 45s | Hits:  99%/3645  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 07m | Avg: 22m 23s | Max: 23m 14s | Hits:  99%/3645  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 14m | Avg: 24m 50s | Max: 27m 31s | Hits:  77%/3645  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 16m | Avg:  1h 16m | Max:  1h 16m | Hits:  34%/1215  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 21h 04m | Avg:  1h 03m | Max:  1h 19m | Hits:  31%/23535 
      🟩 20                 Pass: 100%/25  | Total: 20h 14m | Avg: 48m 34s | Max:  1h 16m | Hits:  54%/29950 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 21h 39m | Avg: 28m 53s | Max: 1h 02m | Hits: 77%/80136

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 35m 58s | Avg: 17m 59s | Max: 24m 47s | Hits:  88%/3564  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 20h 47m | Avg: 29m 00s | Max:  1h 02m | Hits:  77%/76573 
      🟩 arm64              Pass: 100%/2   | Total: 52m 24s | Avg: 26m 12s | Max: 28m 11s | Hits:  77%/3563  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  2h 46m | Avg: 33m 17s | Max: 53m 07s | Hits:  72%/8901  
      🟩 12.5               Pass: 100%/2   | Total:  1h 36m | Avg: 48m 26s | Max: 48m 53s | Hits:  65%/3562  
      🟩 12.8               Pass: 100%/38  | Total: 17h 16m | Avg: 27m 16s | Max:  1h 02m | Hits:  78%/67673 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 46m 20s | Avg: 23m 10s | Max: 24m 02s | Hits:  77%/3562  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  2h 46m | Avg: 33m 17s | Max: 53m 07s | Hits:  72%/8901  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 36m | Avg: 48m 26s | Max: 48m 53s | Hits:  65%/3562  
      🟩 nvcc12.8           Pass: 100%/36  | Total: 16h 30m | Avg: 27m 30s | Max:  1h 02m | Hits:  78%/64111 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 46m 20s | Avg: 23m 10s | Max: 24m 02s | Hits:  77%/3562  
      🟩 nvcc               Pass: 100%/43  | Total: 20h 53m | Avg: 29m 09s | Max:  1h 02m | Hits:  77%/76574 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 49m | Avg: 27m 19s | Max: 27m 52s | Hits:  77%/7124  
      🟩 Clang15            Pass: 100%/2   | Total: 54m 59s | Avg: 27m 29s | Max: 28m 14s | Hits:  77%/3562  
      🟩 Clang16            Pass: 100%/2   | Total: 58m 09s | Avg: 29m 04s | Max: 29m 55s | Hits:  77%/3562  
      🟩 Clang17            Pass: 100%/2   | Total: 57m 11s | Avg: 28m 35s | Max: 29m 21s | Hits:  77%/3562  
      🟩 Clang18            Pass: 100%/7   | Total:  2h 24m | Avg: 20m 38s | Max: 28m 06s | Hits:  83%/12467 
      🟩 GCC7               Pass: 100%/2   | Total: 55m 45s | Avg: 27m 52s | Max: 28m 27s | Hits:  77%/3564  
      🟩 GCC8               Pass: 100%/1   | Total: 28m 26s | Avg: 28m 26s | Max: 28m 26s | Hits:  77%/1782  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 00m | Avg: 30m 00s | Max: 30m 08s | Hits:  77%/3564  
      🟩 GCC10              Pass: 100%/2   | Total: 57m 32s | Avg: 28m 46s | Max: 29m 33s | Hits:  77%/3564  
      🟩 GCC11              Pass: 100%/2   | Total: 56m 50s | Avg: 28m 25s | Max: 28m 51s | Hits:  77%/3564  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 01m | Avg: 30m 32s | Max: 30m 56s | Hits:  77%/3564  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 24m | Avg: 20m 28s | Max: 32m 55s | Hits:  86%/17820 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 41m | Avg: 50m 58s | Max: 53m 07s | Hits:  54%/3550  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 32m | Avg: 50m 55s | Max:  1h 02m | Hits:  60%/5325  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 36m | Avg: 48m 26s | Max: 48m 53s | Hits:  65%/3562  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  7h 04m | Avg: 24m 56s | Max: 29m 55s | Hits:  79%/30277 
      🟩 GCC                Pass: 100%/21  | Total:  8h 44m | Avg: 24m 58s | Max: 32m 55s | Hits:  81%/37422 
      🟩 MSVC               Pass: 100%/5   | Total:  4h 14m | Avg: 50m 56s | Max:  1h 02m | Hits:  58%/8875  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 36m | Avg: 48m 26s | Max: 48m 53s | Hits:  65%/3562  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 27m 42s | Avg: 13m 51s | Max: 16m 26s | Hits:  88%/3564  
      🟩 rtx2080            Pass: 100%/33  | Total: 17h 22m | Avg: 31m 34s | Max: 54m 59s | Hits:  74%/58769 
      🟩 rtx4090            Pass: 100%/10  | Total:  3h 50m | Avg: 23m 00s | Max:  1h 02m | Hits:  85%/17803 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total: 20h 04m | Avg: 31m 41s | Max:  1h 02m | Hits:  74%/67671 
      🟩 TestCPU            Pass: 100%/3   | Total: 51m 02s | Avg: 17m 00s | Max: 35m 27s | Hits:  90%/5338  
      🟩 TestGPU            Pass: 100%/4   | Total: 44m 24s | Avg: 11m 06s | Max: 11m 38s | Hits:  99%/7127  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 27m 42s | Avg: 13m 51s | Max: 16m 26s | Hits:  88%/3564  
      🟩 90;90a;100         Pass: 100%/1   | Total: 32m 55s | Avg: 32m 55s | Max: 32m 55s | Hits:  77%/1782  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 11h 00m | Avg: 33m 00s | Max: 54m 59s | Hits:  73%/35611 
      🟩 20                 Pass: 100%/23  | Total: 10h 03m | Avg: 26m 14s | Max:  1h 02m | Hits:  80%/40961 
    
  • 🟩 cudax: Pass: 100%/22 | Total: 4h 58m | Avg: 13m 34s | Max: 18m 34s | Hits: 58%/11722

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  4h 03m | Avg: 13m 30s | Max: 18m 34s | Hits:  61%/9406  
      🟩 arm64              Pass: 100%/4   | Total: 55m 18s | Avg: 13m 49s | Max: 15m 22s | Hits:  50%/2316  
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 11m 37s | Avg: 11m 37s | Max: 11m 37s | Hits:  55%/277   
      🟩 12.5               Pass: 100%/2   | Total: 17m 26s | Avg:  8m 43s | Max:  8m 50s | Hits:  68%/742   
      🟩 12.8               Pass: 100%/19  | Total:  4h 29m | Avg: 14m 11s | Max: 18m 34s | Hits:  58%/10703 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 11m 37s | Avg: 11m 37s | Max: 11m 37s | Hits:  55%/277   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 17m 26s | Avg:  8m 43s | Max:  8m 50s | Hits:  68%/742   
      🟩 nvcc12.8           Pass: 100%/19  | Total:  4h 29m | Avg: 14m 11s | Max: 18m 34s | Hits:  58%/10703 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/22  | Total:  4h 58m | Avg: 13m 34s | Max: 18m 34s | Hits:  58%/11722 
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total: 15m 53s | Avg: 15m 53s | Max: 15m 53s | Hits:  50%/581   
      🟩 Clang15            Pass: 100%/1   | Total: 15m 21s | Avg: 15m 21s | Max: 15m 21s | Hits:  50%/579   
      🟩 Clang16            Pass: 100%/1   | Total: 16m 14s | Avg: 16m 14s | Max: 16m 14s | Hits:  50%/579   
      🟩 Clang17            Pass: 100%/1   | Total: 15m 15s | Avg: 15m 15s | Max: 15m 15s | Hits:  50%/579   
      🟩 Clang18            Pass: 100%/4   | Total: 54m 29s | Avg: 13m 37s | Max: 15m 37s | Hits:  62%/2316  
      🟩 GCC10              Pass: 100%/1   | Total: 17m 18s | Avg: 17m 18s | Max: 17m 18s | Hits:  50%/581   
      🟩 GCC11              Pass: 100%/1   | Total: 15m 38s | Avg: 15m 38s | Max: 15m 38s | Hits:  49%/579   
      🟩 GCC12              Pass: 100%/2   | Total: 31m 08s | Avg: 15m 34s | Max: 18m 34s | Hits:  74%/1158  
      🟩 GCC13              Pass: 100%/6   | Total:  1h 16m | Avg: 12m 45s | Max: 15m 22s | Hits:  58%/3474  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 37s | Avg: 11m 37s | Max: 11m 37s | Hits:  55%/277   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 11m 38s | Avg: 11m 38s | Max: 11m 38s | Hits:  55%/277   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 17m 26s | Avg:  8m 43s | Max:  8m 50s | Hits:  68%/742   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total:  1h 57m | Avg: 14m 39s | Max: 16m 14s | Hits:  56%/4634  
      🟩 GCC                Pass: 100%/10  | Total:  2h 20m | Avg: 14m 03s | Max: 18m 34s | Hits:  59%/5792  
      🟩 MSVC               Pass: 100%/2   | Total: 23m 15s | Avg: 11m 37s | Max: 11m 38s | Hits:  55%/554   
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 26s | Avg:  8m 43s | Max:  8m 50s | Hits:  68%/742   
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 26m 03s | Avg: 13m 01s | Max: 13m 46s | Hits:  74%/1158  
      🟩 rtx2080            Pass: 100%/20  | Total:  4h 32m | Avg: 13m 37s | Max: 18m 34s | Hits:  57%/10564 
    🟩 jobs
      🟩 Build              Pass: 100%/19  | Total:  4h 19m | Avg: 13m 40s | Max: 18m 34s | Hits:  51%/9985  
      🟩 Test               Pass: 100%/3   | Total: 38m 43s | Avg: 12m 54s | Max: 13m 46s | Hits:  99%/1737  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 36m 27s | Avg: 12m 09s | Max: 13m 46s | Hits:  66%/1737  
      🟩 90a                Pass: 100%/1   | Total: 11m 19s | Avg: 11m 19s | Max: 11m 19s | Hits:  49%/579   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 45m 10s | Avg: 11m 17s | Max: 13m 27s | Hits:  53%/2108  
      🟩 20                 Pass: 100%/18  | Total:  4h 13m | Avg: 14m 04s | Max: 18m 34s | Hits:  60%/9614  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 17m 15s | Avg: 8m 37s | Max: 14m 46s | Hits: 97%/308

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 14m 46s | Hits:  97%/308   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 14m 46s | Hits:  97%/308   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 14m 46s | Hits:  97%/308   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 14m 46s | Hits:  97%/308   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 14m 46s | Hits:  97%/308   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 14m 46s | Hits:  97%/308   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 14m 46s | Hits:  97%/308   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 29s | Avg:  2m 29s | Max:  2m 29s | Hits:  96%/154   
      🟩 Test               Pass: 100%/1   | Total: 14m 46s | Avg: 14m 46s | Max: 14m 46s | Hits:  98%/154   
    
  • 🟩 python: Pass: 100%/1 | Total: 52m 25s | Avg: 52m 25s | Max: 52m 25s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 52m 25s | Avg: 52m 25s | Max: 52m 25s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 52m 25s | Avg: 52m 25s | Max: 52m 25s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 52m 25s | Avg: 52m 25s | Max: 52m 25s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 52m 25s | Avg: 52m 25s | Max: 52m 25s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 52m 25s | Avg: 52m 25s | Max: 52m 25s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 52m 25s | Avg: 52m 25s | Max: 52m 25s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 52m 25s | Avg: 52m 25s | Max: 52m 25s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 52m 25s | Avg: 52m 25s | Max: 52m 25s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 158)

# Runner
111 linux-amd64-cpu16
15 windows-amd64-cpu16
10 linux-arm64-cpu16
8 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

github-actions bot commented Mar 1, 2025

🟩 CI finished in 8h 02m: Pass: 100%/158 | Total: 3d 11h | Avg: 31m 51s | Max: 1h 19m | Hits: 56%/249547
  • 🟩 cub: Pass: 100%/45 | Total: 1d 17h | Avg: 55m 05s | Max: 1h 19m | Hits: 44%/53485

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 15h | Avg: 54m 41s | Max:  1h 19m | Hits:  45%/51055 
      🟩 arm64              Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 05m | Hits:  34%/2430  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 16m | Avg:  1h 03m | Max:  1h 05m | Hits:  30%/5908  
      🟩 12.5               Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 11m | Hits:  31%/2248  
      🟩 12.8               Pass: 100%/38  | Total:  1d 09h | Avg: 53m 14s | Max:  1h 19m | Hits:  47%/45329 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 09m | Hits:  35%/2100  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 16m | Avg:  1h 03m | Max:  1h 05m | Hits:  30%/5908  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 11m | Hits:  31%/2248  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  1d 07h | Avg: 52m 35s | Max:  1h 19m | Hits:  47%/43229 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 09m | Hits:  35%/2100  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 15h | Avg: 54m 38s | Max:  1h 19m | Hits:  45%/51385 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 01m | Avg:  1h 00m | Max:  1h 03m | Hits:  34%/4868  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 00m | Hits:  34%/2430  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 02m | Hits:  34%/2430  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 57m | Avg: 58m 38s | Max: 58m 50s | Hits:  34%/2430  
      🟩 Clang18            Pass: 100%/7   | Total:  5h 54m | Avg: 50m 38s | Max:  1h 09m | Hits:  54%/8175  
      🟩 GCC7               Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 05m | Hits:  34%/2434  
      🟩 GCC8               Pass: 100%/1   | Total: 58m 13s | Avg: 58m 13s | Max: 58m 13s | Hits:  34%/1217  
      🟩 GCC9               Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 02m | Hits:  34%/2434  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 03m | Hits:  34%/2434  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 58m | Avg: 59m 01s | Max: 59m 34s | Hits:  34%/2430  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 08m | Hits:  34%/2430  
      🟩 GCC13              Pass: 100%/11  | Total:  6h 55m | Avg: 37m 47s | Max:  1h 16m | Hits:  70%/13365 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 12m | Hits:  12%/2080  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 32m | Avg:  1h 16m | Max:  1h 19m | Hits:  12%/2080  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 11m | Hits:  31%/2248  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 15h 54m | Avg: 56m 07s | Max:  1h 09m | Hits:  42%/20333 
      🟩 GCC                Pass: 100%/22  | Total: 18h 14m | Avg: 49m 44s | Max:  1h 16m | Hits:  52%/26744 
      🟩 MSVC               Pass: 100%/4   | Total:  4h 50m | Avg:  1h 12m | Max:  1h 19m | Hits:  12%/4160  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 11m | Hits:  31%/2248  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 14m | Avg: 24m 50s | Max: 27m 31s | Hits:  77%/3645  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 11h | Avg:  1h 03m | Max:  1h 19m | Hits:  32%/40120 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 05m | Avg: 30m 44s | Max:  1h 00m | Hits:  83%/9720  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 14h | Avg:  1h 02m | Max:  1h 19m | Hits:  32%/43765 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 00s | Avg: 21m 00s | Max: 21m 00s | Hits:  99%/1215  
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 32s | Avg: 16m 32s | Max: 16m 32s | Hits:  99%/1215  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 09m | Avg: 23m 14s | Max: 23m 45s | Hits:  99%/3645  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 07m | Avg: 22m 23s | Max: 23m 14s | Hits:  99%/3645  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 14m | Avg: 24m 50s | Max: 27m 31s | Hits:  77%/3645  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 16m | Avg:  1h 16m | Max:  1h 16m | Hits:  34%/1215  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 21h 04m | Avg:  1h 03m | Max:  1h 19m | Hits:  31%/23535 
      🟩 20                 Pass: 100%/25  | Total: 20h 14m | Avg: 48m 34s | Max:  1h 16m | Hits:  54%/29950 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 21h 39m | Avg: 28m 53s | Max: 1h 02m | Hits: 77%/80136

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 35m 58s | Avg: 17m 59s | Max: 24m 47s | Hits:  88%/3564  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 20h 47m | Avg: 29m 00s | Max:  1h 02m | Hits:  77%/76573 
      🟩 arm64              Pass: 100%/2   | Total: 52m 24s | Avg: 26m 12s | Max: 28m 11s | Hits:  77%/3563  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  2h 46m | Avg: 33m 17s | Max: 53m 07s | Hits:  72%/8901  
      🟩 12.5               Pass: 100%/2   | Total:  1h 36m | Avg: 48m 26s | Max: 48m 53s | Hits:  65%/3562  
      🟩 12.8               Pass: 100%/38  | Total: 17h 16m | Avg: 27m 16s | Max:  1h 02m | Hits:  78%/67673 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 46m 20s | Avg: 23m 10s | Max: 24m 02s | Hits:  77%/3562  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  2h 46m | Avg: 33m 17s | Max: 53m 07s | Hits:  72%/8901  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 36m | Avg: 48m 26s | Max: 48m 53s | Hits:  65%/3562  
      🟩 nvcc12.8           Pass: 100%/36  | Total: 16h 30m | Avg: 27m 30s | Max:  1h 02m | Hits:  78%/64111 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 46m 20s | Avg: 23m 10s | Max: 24m 02s | Hits:  77%/3562  
      🟩 nvcc               Pass: 100%/43  | Total: 20h 53m | Avg: 29m 09s | Max:  1h 02m | Hits:  77%/76574 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 49m | Avg: 27m 19s | Max: 27m 52s | Hits:  77%/7124  
      🟩 Clang15            Pass: 100%/2   | Total: 54m 59s | Avg: 27m 29s | Max: 28m 14s | Hits:  77%/3562  
      🟩 Clang16            Pass: 100%/2   | Total: 58m 09s | Avg: 29m 04s | Max: 29m 55s | Hits:  77%/3562  
      🟩 Clang17            Pass: 100%/2   | Total: 57m 11s | Avg: 28m 35s | Max: 29m 21s | Hits:  77%/3562  
      🟩 Clang18            Pass: 100%/7   | Total:  2h 24m | Avg: 20m 38s | Max: 28m 06s | Hits:  83%/12467 
      🟩 GCC7               Pass: 100%/2   | Total: 55m 45s | Avg: 27m 52s | Max: 28m 27s | Hits:  77%/3564  
      🟩 GCC8               Pass: 100%/1   | Total: 28m 26s | Avg: 28m 26s | Max: 28m 26s | Hits:  77%/1782  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 00m | Avg: 30m 00s | Max: 30m 08s | Hits:  77%/3564  
      🟩 GCC10              Pass: 100%/2   | Total: 57m 32s | Avg: 28m 46s | Max: 29m 33s | Hits:  77%/3564  
      🟩 GCC11              Pass: 100%/2   | Total: 56m 50s | Avg: 28m 25s | Max: 28m 51s | Hits:  77%/3564  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 01m | Avg: 30m 32s | Max: 30m 56s | Hits:  77%/3564  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 24m | Avg: 20m 28s | Max: 32m 55s | Hits:  86%/17820 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 41m | Avg: 50m 58s | Max: 53m 07s | Hits:  54%/3550  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 32m | Avg: 50m 55s | Max:  1h 02m | Hits:  60%/5325  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 36m | Avg: 48m 26s | Max: 48m 53s | Hits:  65%/3562  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  7h 04m | Avg: 24m 56s | Max: 29m 55s | Hits:  79%/30277 
      🟩 GCC                Pass: 100%/21  | Total:  8h 44m | Avg: 24m 58s | Max: 32m 55s | Hits:  81%/37422 
      🟩 MSVC               Pass: 100%/5   | Total:  4h 14m | Avg: 50m 56s | Max:  1h 02m | Hits:  58%/8875  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 36m | Avg: 48m 26s | Max: 48m 53s | Hits:  65%/3562  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 27m 42s | Avg: 13m 51s | Max: 16m 26s | Hits:  88%/3564  
      🟩 rtx2080            Pass: 100%/33  | Total: 17h 22m | Avg: 31m 34s | Max: 54m 59s | Hits:  74%/58769 
      🟩 rtx4090            Pass: 100%/10  | Total:  3h 50m | Avg: 23m 00s | Max:  1h 02m | Hits:  85%/17803 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total: 20h 04m | Avg: 31m 41s | Max:  1h 02m | Hits:  74%/67671 
      🟩 TestCPU            Pass: 100%/3   | Total: 51m 02s | Avg: 17m 00s | Max: 35m 27s | Hits:  90%/5338  
      🟩 TestGPU            Pass: 100%/4   | Total: 44m 24s | Avg: 11m 06s | Max: 11m 38s | Hits:  99%/7127  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 27m 42s | Avg: 13m 51s | Max: 16m 26s | Hits:  88%/3564  
      🟩 90;90a;100         Pass: 100%/1   | Total: 32m 55s | Avg: 32m 55s | Max: 32m 55s | Hits:  77%/1782  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 11h 00m | Avg: 33m 00s | Max: 54m 59s | Hits:  73%/35611 
      🟩 20                 Pass: 100%/23  | Total: 10h 03m | Avg: 26m 14s | Max:  1h 02m | Hits:  80%/40961 
    
  • 🟩 libcudacxx: Pass: 100%/43 | Total: 14h 46m | Avg: 20m 36s | Max: 48m 00s | Hits: 47%/103896

    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total: 14h 02m | Avg: 20m 32s | Max: 48m 00s | Hits:  47%/98197 
      🟩 arm64              Pass: 100%/2   | Total: 43m 57s | Avg: 21m 58s | Max: 22m 21s | Hits:  33%/5699  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 12m | Avg: 14m 29s | Max: 25m 12s | Hits:  59%/13784 
      🟩 12.5               Pass: 100%/2   | Total:  1h 03m | Avg: 31m 53s | Max: 33m 01s | Hits:  31%/5644  
      🟩 12.8               Pass: 100%/36  | Total: 12h 30m | Avg: 20m 50s | Max: 48m 00s | Hits:  45%/84468 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 42m 35s | Avg: 21m 17s | Max: 22m 42s | Hits:  26%/5660  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 12m | Avg: 14m 29s | Max: 25m 12s | Hits:  59%/13784 
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 03m | Avg: 31m 53s | Max: 33m 01s | Hits:  31%/5644  
      🟩 nvcc12.8           Pass: 100%/34  | Total: 11h 47m | Avg: 20m 48s | Max: 48m 00s | Hits:  47%/78808 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 42m 35s | Avg: 21m 17s | Max: 22m 42s | Hits:  26%/5660  
      🟩 nvcc               Pass: 100%/41  | Total: 14h 03m | Avg: 20m 34s | Max: 48m 00s | Hits:  48%/98236 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 55m 18s | Avg: 13m 49s | Max: 23m 55s | Hits:  65%/11290 
      🟩 Clang15            Pass: 100%/2   | Total: 28m 49s | Avg: 14m 24s | Max: 24m 04s | Hits:  65%/5656  
      🟩 Clang16            Pass: 100%/2   | Total: 46m 33s | Avg: 23m 16s | Max: 25m 13s | Hits:  33%/5656  
      🟩 Clang17            Pass: 100%/2   | Total: 48m 07s | Avg: 24m 03s | Max: 24m 21s | Hits:  33%/5656  
      🟩 Clang18            Pass: 100%/6   | Total:  2h 36m | Avg: 26m 07s | Max: 47m 07s | Hits:  31%/14165 
      🟩 GCC7               Pass: 100%/2   | Total: 41m 09s | Avg: 20m 34s | Max: 21m 44s | Hits:  34%/5594  
      🟩 GCC8               Pass: 100%/1   | Total: 20m 45s | Avg: 20m 45s | Max: 20m 45s | Hits:  34%/2807  
      🟩 GCC9               Pass: 100%/2   | Total: 23m 11s | Avg: 11m 35s | Max: 19m 04s | Hits:  65%/5606  
      🟩 GCC10              Pass: 100%/2   | Total: 28m 23s | Avg: 14m 11s | Max: 23m 40s | Hits:  65%/5662  
      🟩 GCC11              Pass: 100%/2   | Total: 42m 48s | Avg: 21m 24s | Max: 22m 44s | Hits:  33%/5658  
      🟩 GCC12              Pass: 100%/2   | Total: 44m 24s | Avg: 22m 12s | Max: 23m 19s | Hits:  33%/5658  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 00m | Avg: 18m 05s | Max: 48m 00s | Hits:  58%/14426 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 50m 13s | Avg: 25m 06s | Max: 25m 12s | Hits:  65%/5128  
      🟩 MSVC14.42          Pass: 100%/2   | Total: 55m 16s | Avg: 27m 38s | Max: 30m 12s | Hits:  33%/5290  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 03m | Avg: 31m 53s | Max: 33m 01s | Hits:  31%/5644  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/16  | Total:  5h 35m | Avg: 20m 58s | Max: 47m 07s | Hits:  45%/42423 
      🟩 GCC                Pass: 100%/21  | Total:  6h 21m | Avg: 18m 10s | Max: 48m 00s | Hits:  49%/45411 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 45m | Avg: 26m 22s | Max: 30m 12s | Hits:  49%/10418 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 53s | Max: 33m 01s | Hits:  31%/5644  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 21m 15s | Avg: 10m 37s | Max: 13m 36s | Hits:  95%/2939  
      🟩 rtx2080            Pass: 100%/41  | Total: 14h 25m | Avg: 21m 05s | Max: 48m 00s | Hits:  45%/100957
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 12h 24m | Avg: 20m 06s | Max: 33m 01s | Hits:  47%/103856
      🟩 NVRTC              Pass: 100%/2   | Total: 31m 17s | Avg: 15m 38s | Max: 16m 01s | Hits:  90%/40    
      🟩 Test               Pass: 100%/3   | Total:  1h 48m | Avg: 36m 14s | Max: 48m 00s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 06s | Avg:  2m 06s | Max:  2m 06s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 31m 17s | Avg: 15m 38s | Max: 16m 01s | Hits:  90%/40    
      🟩 90                 Pass: 100%/2   | Total: 21m 15s | Avg: 10m 37s | Max: 13m 36s | Hits:  95%/2939  
      🟩 90;90a;100         Pass: 100%/1   | Total: 30m 03s | Avg: 30m 03s | Max: 30m 03s | Hits:  32%/2939  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  6h 53m | Avg: 19m 41s | Max: 30m 45s | Hits:  46%/55400 
      🟩 20                 Pass: 100%/21  | Total:  7h 50m | Avg: 22m 25s | Max: 48m 00s | Hits:  48%/48496 
    
  • 🟩 cudax: Pass: 100%/22 | Total: 4h 58m | Avg: 13m 34s | Max: 18m 34s | Hits: 58%/11722

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  4h 03m | Avg: 13m 30s | Max: 18m 34s | Hits:  61%/9406  
      🟩 arm64              Pass: 100%/4   | Total: 55m 18s | Avg: 13m 49s | Max: 15m 22s | Hits:  50%/2316  
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 11m 37s | Avg: 11m 37s | Max: 11m 37s | Hits:  55%/277   
      🟩 12.5               Pass: 100%/2   | Total: 17m 26s | Avg:  8m 43s | Max:  8m 50s | Hits:  68%/742   
      🟩 12.8               Pass: 100%/19  | Total:  4h 29m | Avg: 14m 11s | Max: 18m 34s | Hits:  58%/10703 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 11m 37s | Avg: 11m 37s | Max: 11m 37s | Hits:  55%/277   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 17m 26s | Avg:  8m 43s | Max:  8m 50s | Hits:  68%/742   
      🟩 nvcc12.8           Pass: 100%/19  | Total:  4h 29m | Avg: 14m 11s | Max: 18m 34s | Hits:  58%/10703 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/22  | Total:  4h 58m | Avg: 13m 34s | Max: 18m 34s | Hits:  58%/11722 
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total: 15m 53s | Avg: 15m 53s | Max: 15m 53s | Hits:  50%/581   
      🟩 Clang15            Pass: 100%/1   | Total: 15m 21s | Avg: 15m 21s | Max: 15m 21s | Hits:  50%/579   
      🟩 Clang16            Pass: 100%/1   | Total: 16m 14s | Avg: 16m 14s | Max: 16m 14s | Hits:  50%/579   
      🟩 Clang17            Pass: 100%/1   | Total: 15m 15s | Avg: 15m 15s | Max: 15m 15s | Hits:  50%/579   
      🟩 Clang18            Pass: 100%/4   | Total: 54m 29s | Avg: 13m 37s | Max: 15m 37s | Hits:  62%/2316  
      🟩 GCC10              Pass: 100%/1   | Total: 17m 18s | Avg: 17m 18s | Max: 17m 18s | Hits:  50%/581   
      🟩 GCC11              Pass: 100%/1   | Total: 15m 38s | Avg: 15m 38s | Max: 15m 38s | Hits:  49%/579   
      🟩 GCC12              Pass: 100%/2   | Total: 31m 08s | Avg: 15m 34s | Max: 18m 34s | Hits:  74%/1158  
      🟩 GCC13              Pass: 100%/6   | Total:  1h 16m | Avg: 12m 45s | Max: 15m 22s | Hits:  58%/3474  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 37s | Avg: 11m 37s | Max: 11m 37s | Hits:  55%/277   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 11m 38s | Avg: 11m 38s | Max: 11m 38s | Hits:  55%/277   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 17m 26s | Avg:  8m 43s | Max:  8m 50s | Hits:  68%/742   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total:  1h 57m | Avg: 14m 39s | Max: 16m 14s | Hits:  56%/4634  
      🟩 GCC                Pass: 100%/10  | Total:  2h 20m | Avg: 14m 03s | Max: 18m 34s | Hits:  59%/5792  
      🟩 MSVC               Pass: 100%/2   | Total: 23m 15s | Avg: 11m 37s | Max: 11m 38s | Hits:  55%/554   
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 26s | Avg:  8m 43s | Max:  8m 50s | Hits:  68%/742   
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 26m 03s | Avg: 13m 01s | Max: 13m 46s | Hits:  74%/1158  
      🟩 rtx2080            Pass: 100%/20  | Total:  4h 32m | Avg: 13m 37s | Max: 18m 34s | Hits:  57%/10564 
    🟩 jobs
      🟩 Build              Pass: 100%/19  | Total:  4h 19m | Avg: 13m 40s | Max: 18m 34s | Hits:  51%/9985  
      🟩 Test               Pass: 100%/3   | Total: 38m 43s | Avg: 12m 54s | Max: 13m 46s | Hits:  99%/1737  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 36m 27s | Avg: 12m 09s | Max: 13m 46s | Hits:  66%/1737  
      🟩 90a                Pass: 100%/1   | Total: 11m 19s | Avg: 11m 19s | Max: 11m 19s | Hits:  49%/579   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 45m 10s | Avg: 11m 17s | Max: 13m 27s | Hits:  53%/2108  
      🟩 20                 Pass: 100%/18  | Total:  4h 13m | Avg: 14m 04s | Max: 18m 34s | Hits:  60%/9614  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 17m 15s | Avg: 8m 37s | Max: 14m 46s | Hits: 97%/308

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 14m 46s | Hits:  97%/308   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 14m 46s | Hits:  97%/308   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 14m 46s | Hits:  97%/308   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 14m 46s | Hits:  97%/308   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 14m 46s | Hits:  97%/308   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 14m 46s | Hits:  97%/308   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 14m 46s | Hits:  97%/308   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 29s | Avg:  2m 29s | Max:  2m 29s | Hits:  96%/154   
      🟩 Test               Pass: 100%/1   | Total: 14m 46s | Avg: 14m 46s | Max: 14m 46s | Hits:  98%/154   
    
  • 🟩 python: Pass: 100%/1 | Total: 52m 25s | Avg: 52m 25s | Max: 52m 25s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 52m 25s | Avg: 52m 25s | Max: 52m 25s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 52m 25s | Avg: 52m 25s | Max: 52m 25s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 52m 25s | Avg: 52m 25s | Max: 52m 25s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 52m 25s | Avg: 52m 25s | Max: 52m 25s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 52m 25s | Avg: 52m 25s | Max: 52m 25s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 52m 25s | Avg: 52m 25s | Max: 52m 25s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 52m 25s | Avg: 52m 25s | Max: 52m 25s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 52m 25s | Avg: 52m 25s | Max: 52m 25s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 158)

# Runner
111 linux-amd64-cpu16
15 windows-amd64-cpu16
10 linux-arm64-cpu16
8 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

Contributor

@ahendriksen left a comment

LGTM!

@bernhardmgruber merged commit c457736 into NVIDIA:main on Mar 3, 2025
172 of 175 checks passed
Labels
3.0 Targeted for 3.0 release
Projects
Status: Done
3 participants