PTX: Remove internal instructions #3583

Merged
merged 3 commits into from
Jan 30, 2025

Conversation

bernhardmgruber
Contributor

@bernhardmgruber bernhardmgruber commented Jan 29, 2025

@bernhardmgruber
Contributor Author

/ok to test

Contributor

🟨 CI finished in 3h 04m: Pass: 99%/152 | Total: 1d 05h | Avg: 11m 48s | Max: 1h 04m | Hits: 515%/21523
  • 🟨 cub: Pass: 97%/44 | Total: 11h 55m | Avg: 16m 16s | Max: 1h 04m | Hits: 442%/3552

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/42  | Total: 11h 46m | Avg: 16m 48s | Max:  1h 04m | Hits: 442%/3552  
      🟩 arm64              Pass: 100%/2   | Total:  9m 45s | Avg:  4m 52s | Max:  4m 54s
    🔍 ctk: 12.6 🔍
      🟩 12.0               Pass: 100%/5   | Total:  1h 08m | Avg: 13m 37s | Max: 47m 02s | Hits: 467%/888   
      🟩 12.5               Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m
      🔍 12.6               Pass:  97%/37  | Total:  8h 46m | Avg: 14m 13s | Max:  1h 04m | Hits: 433%/2664  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 45s | Avg:  4m 22s | Max:  4m 29s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 08m | Avg: 13m 37s | Max: 47m 02s | Hits: 467%/888   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m
      🔍 nvcc12.6           Pass:  97%/35  | Total:  8h 37m | Avg: 14m 47s | Max:  1h 04m | Hits: 433%/2664  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 45s | Avg:  4m 22s | Max:  4m 29s
      🔍 nvcc               Pass:  97%/42  | Total: 11h 47m | Avg: 16m 50s | Max:  1h 04m | Hits: 442%/3552  
    🔍 cxx: GCC13 🔍
      🟩 Clang14            Pass: 100%/4   | Total: 20m 41s | Avg:  5m 10s | Max:  5m 25s
      🟩 Clang15            Pass: 100%/2   | Total: 11m 39s | Avg:  5m 49s | Max:  5m 54s
      🟩 Clang16            Pass: 100%/2   | Total: 11m 15s | Avg:  5m 37s | Max:  5m 42s
      🟩 Clang17            Pass: 100%/2   | Total: 11m 32s | Avg:  5m 46s | Max:  6m 14s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 16m | Avg: 10m 56s | Max: 26m 24s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 02m | Avg: 31m 00s | Max: 56m 43s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 21s | Avg:  5m 21s | Max:  5m 21s
      🟩 GCC9               Pass: 100%/2   | Total: 11m 23s | Avg:  5m 41s | Max:  5m 57s
      🟩 GCC10              Pass: 100%/2   | Total: 10m 55s | Avg:  5m 27s | Max:  5m 31s
      🟩 GCC11              Pass: 100%/2   | Total: 11m 12s | Avg:  5m 36s | Max:  5m 52s
      🟩 GCC12              Pass: 100%/4   | Total: 35m 01s | Avg:  8m 45s | Max: 19m 23s
      🔍 GCC13              Pass:  87%/8   | Total:  1h 41m | Avg: 12m 37s | Max: 28m 18s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 47m | Avg: 53m 55s | Max:  1h 00m | Hits: 453%/1776  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 57m | Avg: 58m 56s | Max:  1h 04m | Hits: 430%/1776  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/17  | Total:  2h 11m | Avg:  7m 45s | Max: 26m 24s
      🔍 GCC                Pass:  95%/21  | Total:  3h 56m | Avg: 11m 16s | Max: 56m 43s
      🟩 MSVC               Pass: 100%/4   | Total:  3h 45m | Avg: 56m 25s | Max:  1h 04m | Hits: 442%/3552  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m
    🔍 gpu: v100 🔍
      🟩 h100               Pass: 100%/2   | Total: 23m 40s | Avg: 11m 50s | Max: 19m 23s
      🔍 v100               Pass:  97%/42  | Total: 11h 32m | Avg: 16m 28s | Max:  1h 04m | Hits: 442%/3552  
    🚨 jobs: GraphCapture 🚨
      🟩 Build              Pass: 100%/37  | Total:  9h 24m | Avg: 15m 15s | Max:  1h 04m | Hits: 442%/3552  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 08s | Avg: 21m 08s | Max: 21m 08s
      🔥 GraphCapture       Pass:   0%/1   | Total:  3m 24s | Avg:  3m 24s | Max:  3m 24s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 13m | Avg: 24m 20s | Max: 28m 18s
      🟩 TestGPU            Pass: 100%/2   | Total: 53m 43s | Avg: 26m 51s | Max: 27m 19s
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/20  | Total:  5h 59m | Avg: 17m 59s | Max:  1h 00m | Hits: 443%/2664  
      🔍 20                 Pass:  95%/24  | Total:  5h 56m | Avg: 14m 50s | Max:  1h 04m | Hits: 438%/888   
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 23m 40s | Avg: 11m 50s | Max: 19m 23s
      🟩 90a                Pass: 100%/1   | Total:  4m 19s | Avg:  4m 19s | Max:  4m 19s
    
  • 🟩 libcudacxx: Pass: 100%/43 | Total: 6h 42m | Avg: 9m 21s | Max: 31m 01s | Hits: 688%/10065

    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total:  6h 35m | Avg:  9m 38s | Max: 31m 01s | Hits: 688%/10065 
      🟩 arm64              Pass: 100%/2   | Total:  7m 02s | Avg:  3m 31s | Max:  3m 39s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 36m 07s | Avg:  7m 13s | Max: 20m 45s | Hits: 689%/2471  
      🟩 12.5               Pass: 100%/2   | Total: 17m 29s | Avg:  8m 44s | Max:  8m 59s
      🟩 12.6               Pass: 100%/36  | Total:  5h 49m | Avg:  9m 41s | Max: 31m 01s | Hits: 688%/7594  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 10m | Avg: 17m 32s | Max: 22m 07s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 36m 07s | Avg:  7m 13s | Max: 20m 45s | Hits: 689%/2471  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 17m 29s | Avg:  8m 44s | Max:  8m 59s
      🟩 nvcc12.6           Pass: 100%/32  | Total:  4h 38m | Avg:  8m 42s | Max: 31m 01s | Hits: 688%/7594  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 10m | Avg: 17m 32s | Max: 22m 07s
      🟩 nvcc               Pass: 100%/39  | Total:  5h 32m | Avg:  8m 31s | Max: 31m 01s | Hits: 688%/10065 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 16m 06s | Avg:  4m 01s | Max:  4m 13s
      🟩 Clang15            Pass: 100%/2   | Total: 10m 00s | Avg:  5m 00s | Max:  5m 23s
      🟩 Clang16            Pass: 100%/2   | Total:  8m 22s | Avg:  4m 11s | Max:  4m 17s
      🟩 Clang17            Pass: 100%/2   | Total:  9m 11s | Avg:  4m 35s | Max:  4m 44s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 40m | Avg: 12m 33s | Max: 22m 07s
      🟩 GCC7               Pass: 100%/2   | Total:  7m 19s | Avg:  3m 39s | Max:  3m 41s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 54s | Avg:  3m 54s | Max:  3m 54s
      🟩 GCC9               Pass: 100%/2   | Total:  7m 43s | Avg:  3m 51s | Max:  4m 02s
      🟩 GCC10              Pass: 100%/2   | Total:  7m 53s | Avg:  3m 56s | Max:  4m 13s
      🟩 GCC11              Pass: 100%/2   | Total:  7m 46s | Avg:  3m 53s | Max:  4m 01s
      🟩 GCC12              Pass: 100%/2   | Total:  8m 12s | Avg:  4m 06s | Max:  4m 07s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 37m | Avg: 12m 13s | Max: 31m 01s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 46m 40s | Avg: 23m 20s | Max: 25m 55s | Hits: 689%/4952  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 53m 52s | Avg: 26m 56s | Max: 28m 28s | Hits: 688%/5113  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 17m 29s | Avg:  8m 44s | Max:  8m 59s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  2h 24m | Avg:  8m 00s | Max: 22m 07s
      🟩 GCC                Pass: 100%/19  | Total:  2h 20m | Avg:  7m 23s | Max: 31m 01s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 40m | Avg: 25m 08s | Max: 28m 28s | Hits: 688%/10065 
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 29s | Avg:  8m 44s | Max:  8m 59s
    🟩 gpu
      🟩 v100               Pass: 100%/43  | Total:  6h 42m | Avg:  9m 21s | Max: 31m 01s | Hits: 688%/10065 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total:  5h 02m | Avg:  7m 57s | Max: 28m 28s | Hits: 688%/10065 
      🟩 NVRTC              Pass: 100%/2   | Total:  1h 01m | Avg: 30m 42s | Max: 31m 01s
      🟩 Test               Pass: 100%/2   | Total: 37m 05s | Avg: 18m 32s | Max: 19m 10s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 56s | Avg:  1m 56s | Max:  1m 56s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 14m 22s | Avg: 14m 22s | Max: 14m 22s
      🟩 90a                Pass: 100%/2   | Total: 18m 10s | Avg:  9m 05s | Max: 14m 07s
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 10m | Avg:  9m 04s | Max: 30m 24s | Hits: 689%/7433  
      🟩 20                 Pass: 100%/21  | Total:  3h 30m | Avg: 10m 00s | Max: 31m 01s | Hits: 687%/2632  
    
  • 🟩 thrust: Pass: 100%/42 | Total: 8h 20m | Avg: 11m 55s | Max: 51m 11s | Hits: 323%/7384

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 21m 21s | Avg: 10m 40s | Max: 15m 41s
    🟩 cpu
      🟩 amd64              Pass: 100%/40  | Total:  8h 11m | Avg: 12m 16s | Max: 51m 11s | Hits: 323%/7384  
      🟩 arm64              Pass: 100%/2   | Total:  9m 42s | Avg:  4m 51s | Max:  5m 03s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 54m 31s | Avg: 10m 54s | Max: 33m 40s | Hits: 337%/1846  
      🟩 12.5               Pass: 100%/2   | Total:  1h 38m | Avg: 49m 26s | Max: 51m 11s
      🟩 12.6               Pass: 100%/35  | Total:  5h 47m | Avg:  9m 55s | Max: 44m 31s | Hits: 318%/5538  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 19s | Avg:  5m 09s | Max:  5m 23s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 54m 31s | Avg: 10m 54s | Max: 33m 40s | Hits: 337%/1846  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 38m | Avg: 49m 26s | Max: 51m 11s
      🟩 nvcc12.6           Pass: 100%/33  | Total:  5h 37m | Avg: 10m 13s | Max: 44m 31s | Hits: 318%/5538  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 19s | Avg:  5m 09s | Max:  5m 23s
      🟩 nvcc               Pass: 100%/40  | Total:  8h 10m | Avg: 12m 15s | Max: 51m 11s | Hits: 323%/7384  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 18s | Avg:  5m 19s | Max:  5m 47s
      🟩 Clang15            Pass: 100%/2   | Total: 11m 36s | Avg:  5m 48s | Max:  5m 58s
      🟩 Clang16            Pass: 100%/2   | Total: 11m 04s | Avg:  5m 32s | Max:  5m 34s
      🟩 Clang17            Pass: 100%/2   | Total: 11m 22s | Avg:  5m 41s | Max:  5m 48s
      🟩 Clang18            Pass: 100%/7   | Total: 56m 44s | Avg:  8m 06s | Max: 23m 17s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 53s | Avg:  5m 26s | Max:  5m 30s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 41s | Avg:  5m 41s | Max:  5m 41s
      🟩 GCC9               Pass: 100%/2   | Total: 11m 29s | Avg:  5m 44s | Max:  5m 58s
      🟩 GCC10              Pass: 100%/2   | Total: 11m 00s | Avg:  5m 30s | Max:  5m 32s
      🟩 GCC11              Pass: 100%/2   | Total: 12m 25s | Avg:  6m 12s | Max:  6m 20s
      🟩 GCC12              Pass: 100%/2   | Total: 12m 22s | Avg:  6m 11s | Max:  6m 17s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 07m | Avg:  8m 28s | Max: 16m 25s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 18m | Avg: 39m 05s | Max: 44m 31s | Hits: 303%/3692  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 20m | Avg: 40m 07s | Max: 41m 56s | Hits: 343%/3692  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 38m | Avg: 49m 26s | Max: 51m 11s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  1h 52m | Avg:  6m 35s | Max: 23m 17s
      🟩 GCC                Pass: 100%/19  | Total:  2h 11m | Avg:  6m 55s | Max: 16m 25s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 38m | Avg: 39m 36s | Max: 44m 31s | Hits: 323%/7384  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 38m | Avg: 49m 26s | Max: 51m 11s
    🟩 gpu
      🟩 v100               Pass: 100%/42  | Total:  8h 20m | Avg: 11m 55s | Max: 51m 11s | Hits: 323%/7384  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  7h 09m | Avg: 11m 36s | Max: 51m 11s | Hits: 323%/7384  
      🟩 TestCPU            Pass: 100%/2   | Total: 16m 13s | Avg:  8m 06s | Max:  8m 54s
      🟩 TestGPU            Pass: 100%/3   | Total: 55m 23s | Avg: 18m 27s | Max: 23m 17s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 17s | Avg:  4m 17s | Max:  4m 17s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  4h 21m | Avg: 13m 03s | Max: 51m 11s | Hits: 314%/5538  
      🟩 20                 Pass: 100%/20  | Total:  3h 38m | Avg: 10m 55s | Max: 47m 41s | Hits: 351%/1846  
    
  • 🟩 cudax: Pass: 100%/20 | Total: 1h 52m | Avg: 5m 36s | Max: 19m 49s | Hits: 388%/522

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  1h 41m | Avg:  6m 20s | Max: 19m 49s | Hits: 388%/522   
      🟩 arm64              Pass: 100%/4   | Total: 10m 30s | Avg:  2m 37s | Max:  2m 45s
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total:  8m 44s | Avg:  8m 44s | Max:  8m 44s | Hits: 388%/261   
      🟩 12.5               Pass: 100%/2   | Total: 10m 44s | Avg:  5m 22s | Max:  5m 29s
      🟩 12.6               Pass: 100%/17  | Total:  1h 32m | Avg:  5m 26s | Max: 19m 49s | Hits: 388%/261   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total:  8m 44s | Avg:  8m 44s | Max:  8m 44s | Hits: 388%/261   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 10m 44s | Avg:  5m 22s | Max:  5m 29s
      🟩 nvcc12.6           Pass: 100%/17  | Total:  1h 32m | Avg:  5m 26s | Max: 19m 49s | Hits: 388%/261   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  1h 52m | Avg:  5m 36s | Max: 19m 49s | Hits: 388%/522   
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 08s | Avg:  3m 08s | Max:  3m 08s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 46s | Avg:  3m 46s | Max:  3m 46s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 13s | Avg:  3m 13s | Max:  3m 13s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 26s | Avg:  3m 26s | Max:  3m 26s
      🟩 Clang18            Pass: 100%/4   | Total: 28m 30s | Avg:  7m 07s | Max: 19m 49s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 23s | Avg:  3m 23s | Max:  3m 23s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 02s | Avg:  3m 02s | Max:  3m 02s
      🟩 GCC12              Pass: 100%/2   | Total: 22m 13s | Avg: 11m 06s | Max: 18m 53s
      🟩 GCC13              Pass: 100%/4   | Total: 10m 37s | Avg:  2m 39s | Max:  2m 54s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  8m 44s | Avg:  8m 44s | Max:  8m 44s | Hits: 388%/261   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 18s | Avg: 11m 18s | Max: 11m 18s | Hits: 388%/261   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 10m 44s | Avg:  5m 22s | Max:  5m 29s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 42m 03s | Avg:  5m 15s | Max: 19m 49s
      🟩 GCC                Pass: 100%/8   | Total: 39m 15s | Avg:  4m 54s | Max: 18m 53s
      🟩 MSVC               Pass: 100%/2   | Total: 20m 02s | Avg: 10m 01s | Max: 11m 18s | Hits: 388%/522   
      🟩 NVHPC              Pass: 100%/2   | Total: 10m 44s | Avg:  5m 22s | Max:  5m 29s
    🟩 gpu
      🟩 v100               Pass: 100%/20  | Total:  1h 52m | Avg:  5m 36s | Max: 19m 49s | Hits: 388%/522   
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  1h 13m | Avg:  4m 04s | Max: 11m 18s | Hits: 388%/522   
      🟩 Test               Pass: 100%/2   | Total: 38m 42s | Avg: 19m 21s | Max: 19m 49s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 36s | Avg:  2m 36s | Max:  2m 36s
      🟩 90a                Pass: 100%/1   | Total:  2m 54s | Avg:  2m 54s | Max:  2m 54s
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 13m 02s | Avg:  3m 15s | Max:  5m 15s
      🟩 20                 Pass: 100%/16  | Total:  1h 39m | Avg:  6m 11s | Max: 19m 49s | Hits: 388%/522   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 12m 45s | Avg: 6m 22s | Max: 10m 39s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 12m 45s | Avg:  6m 22s | Max: 10m 39s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 12m 45s | Avg:  6m 22s | Max: 10m 39s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 12m 45s | Avg:  6m 22s | Max: 10m 39s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 12m 45s | Avg:  6m 22s | Max: 10m 39s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 12m 45s | Avg:  6m 22s | Max: 10m 39s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 12m 45s | Avg:  6m 22s | Max: 10m 39s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 12m 45s | Avg:  6m 22s | Max: 10m 39s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 06s | Avg:  2m 06s | Max:  2m 06s
      🟩 Test               Pass: 100%/1   | Total: 10m 39s | Avg: 10m 39s | Max: 10m 39s
    
  • 🟩 python: Pass: 100%/1 | Total: 50m 14s | Avg: 50m 14s | Max: 50m 14s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 50m 14s | Avg: 50m 14s | Max: 50m 14s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 50m 14s | Avg: 50m 14s | Max: 50m 14s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 50m 14s | Avg: 50m 14s | Max: 50m 14s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 50m 14s | Avg: 50m 14s | Max: 50m 14s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 50m 14s | Avg: 50m 14s | Max: 50m 14s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 50m 14s | Avg: 50m 14s | Max: 50m 14s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 50m 14s | Avg: 50m 14s | Max: 50m 14s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 50m 14s | Avg: 50m 14s | Max: 50m 14s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 152)

# Runner
110 linux-amd64-cpu16
17 linux-amd64-gpu-v100-latest-1
14 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1

This is not supposed to be exposed in CCCL.
Not ready for inclusion yet. This needs to handle the optional extra output mask as well.
This has compiler bugs. We should use intrinsics instead.
@bernhardmgruber bernhardmgruber marked this pull request as ready for review January 29, 2025 22:09
@bernhardmgruber bernhardmgruber requested review from a team as code owners January 29, 2025 22:09
@NVIDIA NVIDIA deleted a comment from copy-pr-bot bot Jan 29, 2025
Contributor

🟨 CI finished in 4h 45m: Pass: 98%/152 | Total: 3d 04h | Avg: 30m 16s | Max: 1h 19m | Hits: 411%/21523
  • 🟨 cub: Pass: 95%/44 | Total: 1d 15h | Avg: 54m 13s | Max: 1h 19m | Hits: 159%/3552

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  95%/42  | Total:  1d 13h | Avg: 53m 35s | Max:  1h 19m | Hits: 159%/3552  
      🟩 arm64              Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 07m
    🔍 ctk: 12.6 🔍
      🟩 12.0               Pass: 100%/5   | Total:  5h 26m | Avg:  1h 05m | Max:  1h 17m | Hits: 160%/888   
      🟩 12.5               Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 12m
      🔍 12.6               Pass:  94%/37  | Total:  1d 08h | Avg: 51m 55s | Max:  1h 19m | Hits: 159%/2664  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 57m | Avg: 58m 58s | Max:  1h 00m
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 26m | Avg:  1h 05m | Max:  1h 17m | Hits: 160%/888   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 12m
      🔍 nvcc12.6           Pass:  94%/35  | Total:  1d 06h | Avg: 51m 31s | Max:  1h 19m | Hits: 159%/2664  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 57m | Avg: 58m 58s | Max:  1h 00m
      🔍 nvcc               Pass:  95%/42  | Total:  1d 13h | Avg: 54m 00s | Max:  1h 19m | Hits: 159%/3552  
    🔍 gpu: v100 🔍
      🟩 h100               Pass: 100%/2   | Total: 42m 22s | Avg: 21m 11s | Max: 23m 21s
      🔍 v100               Pass:  95%/42  | Total:  1d 15h | Avg: 55m 48s | Max:  1h 19m | Hits: 159%/3552  
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/20  | Total: 20h 32m | Avg:  1h 01m | Max:  1h 17m | Hits: 160%/2664  
      🔍 20                 Pass:  91%/24  | Total: 19h 13m | Avg: 48m 03s | Max:  1h 19m | Hits: 159%/888   
    🟨 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 19m | Avg:  1h 04m | Max:  1h 17m
      🟩 Clang15            Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 02m
      🟩 Clang16            Pass: 100%/2   | Total:  1h 53m | Avg: 56m 38s | Max: 57m 19s
      🟩 Clang17            Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 03m
      🟨 Clang18            Pass:  85%/7   | Total:  5h 50m | Avg: 50m 08s | Max:  1h 07m
      🟩 GCC7               Pass: 100%/2   | Total:  1h 51m | Avg: 55m 34s | Max: 57m 41s
      🟩 GCC8               Pass: 100%/1   | Total: 57m 08s | Avg: 57m 08s | Max: 57m 08s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 59m | Avg: 59m 49s | Max:  1h 01m
      🟩 GCC10              Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 13m
      🟩 GCC11              Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 12m
      🟩 GCC12              Pass: 100%/4   | Total:  2h 37m | Avg: 39m 28s | Max: 58m 20s
      🟨 GCC13              Pass:  87%/8   | Total:  4h 49m | Avg: 36m 08s | Max:  1h 07m
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 08m | Hits: 160%/1776  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 30m | Avg:  1h 15m | Max:  1h 19m | Hits: 159%/1776  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 12m
    🟨 cxx_family
      🟨 Clang              Pass:  94%/17  | Total: 16h 05m | Avg: 56m 48s | Max:  1h 17m
      🟨 GCC                Pass:  95%/21  | Total: 16h 38m | Avg: 47m 33s | Max:  1h 13m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 43m | Avg:  1h 10m | Max:  1h 19m | Hits: 159%/3552  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 12m
    🟨 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 13h | Avg:  1h 00m | Max:  1h 19m | Hits: 159%/3552  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 24m 43s | Avg: 24m 43s | Max: 24m 43s
      🟥 GraphCapture       Pass:   0%/1   | Total:  6m 30s | Avg:  6m 30s | Max:  6m 30s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 15m | Avg: 25m 18s | Max: 34m 33s
      🟨 TestGPU            Pass:  50%/2   | Total: 34m 31s | Avg: 17m 15s | Max: 27m 22s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 42m 22s | Avg: 21m 11s | Max: 23m 21s
      🟩 90a                Pass: 100%/1   | Total: 24m 19s | Avg: 24m 19s | Max: 24m 19s
    
  • 🟩 libcudacxx: Pass: 100%/43 | Total: 10h 16m | Avg: 14m 20s | Max: 33m 57s | Hits: 679%/10065

    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total:  9h 52m | Avg: 14m 27s | Max: 33m 57s | Hits: 679%/10065 
      🟩 arm64              Pass: 100%/2   | Total: 24m 09s | Avg: 12m 04s | Max: 20m 49s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 57m 14s | Avg: 11m 26s | Max: 20m 53s | Hits: 682%/2471  
      🟩 12.5               Pass: 100%/2   | Total:  1h 02m | Avg: 31m 23s | Max: 33m 10s
      🟩 12.6               Pass: 100%/36  | Total:  8h 16m | Avg: 13m 47s | Max: 33m 57s | Hits: 678%/7594  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 07m | Avg: 16m 47s | Max: 22m 15s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 57m 14s | Avg: 11m 26s | Max: 20m 53s | Hits: 682%/2471  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 02m | Avg: 31m 23s | Max: 33m 10s
      🟩 nvcc12.6           Pass: 100%/32  | Total:  7h 09m | Avg: 13m 25s | Max: 33m 57s | Hits: 678%/7594  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 07m | Avg: 16m 47s | Max: 22m 15s
      🟩 nvcc               Pass: 100%/39  | Total:  9h 09m | Avg: 14m 05s | Max: 33m 57s | Hits: 679%/10065 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 53m 31s | Avg: 13m 22s | Max: 20m 53s
      🟩 Clang15            Pass: 100%/2   | Total: 31m 44s | Avg: 15m 52s | Max: 24m 11s
      🟩 Clang16            Pass: 100%/2   | Total: 44m 40s | Avg: 22m 20s | Max: 22m 41s
      🟩 Clang17            Pass: 100%/2   | Total: 30m 53s | Avg: 15m 26s | Max: 23m 03s
      🟩 Clang18            Pass: 100%/8   | Total:  2h 41m | Avg: 20m 11s | Max: 33m 57s
      🟩 GCC7               Pass: 100%/2   | Total:  8m 38s | Avg:  4m 19s | Max:  4m 52s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 54s | Avg:  3m 54s | Max:  3m 54s
      🟩 GCC9               Pass: 100%/2   | Total:  8m 30s | Avg:  4m 15s | Max:  4m 42s
      🟩 GCC10              Pass: 100%/2   | Total: 13m 03s | Avg:  6m 31s | Max:  6m 37s
      🟩 GCC11              Pass: 100%/2   | Total: 10m 14s | Avg:  5m 07s | Max:  6m 09s
      🟩 GCC12              Pass: 100%/2   | Total:  8m 01s | Avg:  4m 00s | Max:  4m 06s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 18m | Avg:  9m 45s | Max: 22m 57s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 44m 13s | Avg: 22m 06s | Max: 23m 38s | Hits: 682%/4952  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 56m 53s | Avg: 28m 26s | Max: 30m 28s | Hits: 676%/5113  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 02m | Avg: 31m 23s | Max: 33m 10s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  5h 22m | Avg: 17m 54s | Max: 33m 57s
      🟩 GCC                Pass: 100%/19  | Total:  2h 10m | Avg:  6m 51s | Max: 22m 57s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 41m | Avg: 25m 16s | Max: 30m 28s | Hits: 679%/10065 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 02m | Avg: 31m 23s | Max: 33m 10s
    🟩 gpu
      🟩 v100               Pass: 100%/43  | Total: 10h 16m | Avg: 14m 20s | Max: 33m 57s | Hits: 679%/10065 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total:  8h 39m | Avg: 13m 40s | Max: 33m 10s | Hits: 679%/10065 
      🟩 NVRTC              Pass: 100%/2   | Total: 44m 37s | Avg: 22m 18s | Max: 22m 57s
      🟩 Test               Pass: 100%/2   | Total: 50m 38s | Avg: 25m 19s | Max: 33m 57s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 57s | Avg:  1m 57s | Max:  1m 57s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 20s | Avg: 13m 20s | Max: 13m 20s
      🟩 90a                Pass: 100%/2   | Total: 17m 17s | Avg:  8m 38s | Max: 13m 43s
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  4h 26m | Avg: 12m 42s | Max: 29m 36s | Hits: 682%/7433  
      🟩 20                 Pass: 100%/21  | Total:  5h 47m | Avg: 16m 33s | Max: 33m 57s | Hits: 669%/2632  
    
  • 🟩 thrust: Pass: 100%/42 | Total: 23h 30m | Avg: 33m 35s | Max: 1h 07m | Hits: 177%/7384

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total:  1h 07m | Avg: 33m 39s | Max: 39m 01s
    🟩 cpu
      🟩 amd64              Pass: 100%/40  | Total: 22h 27m | Avg: 33m 41s | Max:  1h 07m | Hits: 177%/7384  
      🟩 arm64              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 36s | Max: 31m 39s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 03m | Avg: 36m 43s | Max: 55m 14s | Hits: 177%/1846  
      🟩 12.5               Pass: 100%/2   | Total:  1h 52m | Avg: 56m 03s | Max: 58m 47s
      🟩 12.6               Pass: 100%/35  | Total: 18h 35m | Avg: 31m 51s | Max:  1h 07m | Hits: 177%/5538  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 52m 11s | Avg: 26m 05s | Max: 26m 22s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 03m | Avg: 36m 43s | Max: 55m 14s | Hits: 177%/1846  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 52m | Avg: 56m 03s | Max: 58m 47s
      🟩 nvcc12.6           Pass: 100%/33  | Total: 17h 42m | Avg: 32m 12s | Max:  1h 07m | Hits: 177%/5538  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 52m 11s | Avg: 26m 05s | Max: 26m 22s
      🟩 nvcc               Pass: 100%/40  | Total: 22h 38m | Avg: 33m 58s | Max:  1h 07m | Hits: 177%/7384  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 01m | Avg: 30m 21s | Max: 31m 31s
      🟩 Clang15            Pass: 100%/2   | Total:  1h 05m | Avg: 32m 35s | Max: 35m 12s
      🟩 Clang16            Pass: 100%/2   | Total: 58m 48s | Avg: 29m 24s | Max: 29m 43s
      🟩 Clang17            Pass: 100%/2   | Total:  1h 04m | Avg: 32m 12s | Max: 32m 50s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 56m | Avg: 25m 10s | Max: 31m 33s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 04m | Avg: 32m 09s | Max: 32m 16s
      🟩 GCC8               Pass: 100%/1   | Total: 30m 12s | Avg: 30m 12s | Max: 30m 12s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 09m | Avg: 34m 36s | Max: 35m 37s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 44s | Max: 31m 56s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 04m | Avg: 32m 27s | Max: 33m 19s
      🟩 GCC12              Pass: 100%/2   | Total:  1h 08m | Avg: 34m 18s | Max: 34m 32s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 33m | Avg: 26m 39s | Max: 39m 01s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 51m | Avg: 55m 36s | Max: 55m 59s | Hits: 177%/3692  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 07m | Hits: 177%/3692  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 52m | Avg: 56m 03s | Max: 58m 47s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  8h 06m | Avg: 28m 35s | Max: 35m 12s
      🟩 GCC                Pass: 100%/19  | Total:  9h 33m | Avg: 30m 12s | Max: 39m 01s
      🟩 MSVC               Pass: 100%/4   | Total:  3h 58m | Avg: 59m 41s | Max:  1h 07m | Hits: 177%/7384  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 52m | Avg: 56m 03s | Max: 58m 47s
    🟩 gpu
      🟩 v100               Pass: 100%/42  | Total: 23h 30m | Avg: 33m 35s | Max:  1h 07m | Hits: 177%/7384  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 21h 57m | Avg: 35m 36s | Max:  1h 07m | Hits: 177%/7384  
      🟩 TestCPU            Pass: 100%/2   | Total: 14m 51s | Avg:  7m 25s | Max:  7m 32s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 18m | Avg: 26m 12s | Max: 39m 01s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 23m 35s | Avg: 23m 35s | Max: 23m 35s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 12h 06m | Avg: 36m 19s | Max: 59m 47s | Hits: 177%/5538  
      🟩 20                 Pass: 100%/20  | Total: 10h 17m | Avg: 30m 51s | Max:  1h 07m | Hits: 177%/1846  
    
  • 🟩 cudax: Pass: 100%/20 | Total: 2h 09m | Avg: 6m 29s | Max: 19m 33s | Hits: 286%/522

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  1h 55m | Avg:  7m 13s | Max: 19m 33s | Hits: 286%/522   
      🟩 arm64              Pass: 100%/4   | Total: 14m 10s | Avg:  3m 32s | Max:  3m 39s
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total:  9m 55s | Avg:  9m 55s | Max:  9m 55s | Hits: 286%/261   
      🟩 12.5               Pass: 100%/2   | Total: 17m 02s | Avg:  8m 31s | Max:  8m 40s
      🟩 12.6               Pass: 100%/17  | Total:  1h 42m | Avg:  6m 02s | Max: 19m 33s | Hits: 286%/261   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total:  9m 55s | Avg:  9m 55s | Max:  9m 55s | Hits: 286%/261   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 17m 02s | Avg:  8m 31s | Max:  8m 40s
      🟩 nvcc12.6           Pass: 100%/17  | Total:  1h 42m | Avg:  6m 02s | Max: 19m 33s | Hits: 286%/261   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  2h 09m | Avg:  6m 29s | Max: 19m 33s | Hits: 286%/522   
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  4m 27s | Avg:  4m 27s | Max:  4m 27s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 09s | Avg:  4m 09s | Max:  4m 09s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 57s | Avg:  3m 57s | Max:  3m 57s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 59s | Avg:  3m 59s | Max:  3m 59s
      🟩 Clang18            Pass: 100%/4   | Total: 30m 41s | Avg:  7m 40s | Max: 19m 33s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 49s | Avg:  3m 49s | Max:  3m 49s
      🟩 GCC11              Pass: 100%/1   | Total:  4m 07s | Avg:  4m 07s | Max:  4m 07s
      🟩 GCC12              Pass: 100%/2   | Total: 21m 10s | Avg: 10m 35s | Max: 16m 54s
      🟩 GCC13              Pass: 100%/4   | Total: 13m 45s | Avg:  3m 26s | Max:  3m 39s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  9m 55s | Avg:  9m 55s | Max:  9m 55s | Hits: 286%/261   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 42s | Avg: 12m 42s | Max: 12m 42s | Hits: 286%/261   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 17m 02s | Avg:  8m 31s | Max:  8m 40s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 47m 13s | Avg:  5m 54s | Max: 19m 33s
      🟩 GCC                Pass: 100%/8   | Total: 42m 51s | Avg:  5m 21s | Max: 16m 54s
      🟩 MSVC               Pass: 100%/2   | Total: 22m 37s | Avg: 11m 18s | Max: 12m 42s | Hits: 286%/522   
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 02s | Avg:  8m 31s | Max:  8m 40s
    🟩 gpu
      🟩 v100               Pass: 100%/20  | Total:  2h 09m | Avg:  6m 29s | Max: 19m 33s | Hits: 286%/522   
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  1h 33m | Avg:  5m 10s | Max: 12m 42s | Hits: 286%/522   
      🟩 Test               Pass: 100%/2   | Total: 36m 27s | Avg: 18m 13s | Max: 19m 33s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 12s | Avg:  3m 12s | Max:  3m 12s
      🟩 90a                Pass: 100%/1   | Total:  3m 23s | Avg:  3m 23s | Max:  3m 23s
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 18m 31s | Avg:  4m 37s | Max:  8m 22s
      🟩 20                 Pass: 100%/16  | Total:  1h 51m | Avg:  6m 57s | Max: 19m 33s | Hits: 286%/522   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 41s | Avg: 4m 50s | Max: 7m 29s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  7m 29s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  7m 29s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  7m 29s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  7m 29s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  7m 29s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  7m 29s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  7m 29s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 12s | Avg:  2m 12s | Max:  2m 12s
      🟩 Test               Pass: 100%/1   | Total:  7m 29s | Avg:  7m 29s | Max:  7m 29s
    
  • 🟩 python: Pass: 100%/1 | Total: 48m 51s | Avg: 48m 51s | Max: 48m 51s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 48m 51s | Avg: 48m 51s | Max: 48m 51s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 48m 51s | Avg: 48m 51s | Max: 48m 51s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 48m 51s | Avg: 48m 51s | Max: 48m 51s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 48m 51s | Avg: 48m 51s | Max: 48m 51s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 48m 51s | Avg: 48m 51s | Max: 48m 51s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 48m 51s | Avg: 48m 51s | Max: 48m 51s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 48m 51s | Avg: 48m 51s | Max: 48m 51s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 48m 51s | Avg: 48m 51s | Max: 48m 51s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 152)

# Runner
110 linux-amd64-cpu16
17 linux-amd64-gpu-v100-latest-1
14 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1

🟩 CI finished in 9h 55m: Pass: 100%/152 | Total: 3d 05h | Avg: 30m 32s | Max: 1h 19m | Hits: 411%/21523
  • 🟩 cub: Pass: 100%/44 | Total: 1d 16h | Avg: 55m 10s | Max: 1h 19m | Hits: 159%/3552

    🟩 cpu
      🟩 amd64              Pass: 100%/42  | Total:  1d 14h | Avg: 54m 35s | Max:  1h 19m | Hits: 159%/3552  
      🟩 arm64              Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 07m
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 26m | Avg:  1h 05m | Max:  1h 17m | Hits: 160%/888   
      🟩 12.5               Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 12m
      🟩 12.6               Pass: 100%/37  | Total:  1d 08h | Avg: 53m 02s | Max:  1h 19m | Hits: 159%/2664  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 57m | Avg: 58m 58s | Max:  1h 00m
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 26m | Avg:  1h 05m | Max:  1h 17m | Hits: 160%/888   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 12m
      🟩 nvcc12.6           Pass: 100%/35  | Total:  1d 06h | Avg: 52m 42s | Max:  1h 19m | Hits: 159%/2664  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 57m | Avg: 58m 58s | Max:  1h 00m
      🟩 nvcc               Pass: 100%/42  | Total:  1d 14h | Avg: 54m 59s | Max:  1h 19m | Hits: 159%/3552  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 19m | Avg:  1h 04m | Max:  1h 17m
      🟩 Clang15            Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 02m
      🟩 Clang16            Pass: 100%/2   | Total:  1h 53m | Avg: 56m 38s | Max: 57m 19s
      🟩 Clang17            Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 03m
      🟩 Clang18            Pass: 100%/7   | Total:  6h 08m | Avg: 52m 36s | Max:  1h 07m
      🟩 GCC7               Pass: 100%/2   | Total:  1h 51m | Avg: 55m 34s | Max: 57m 41s
      🟩 GCC8               Pass: 100%/1   | Total: 57m 08s | Avg: 57m 08s | Max: 57m 08s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 59m | Avg: 59m 49s | Max:  1h 01m
      🟩 GCC10              Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 13m
      🟩 GCC11              Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 12m
      🟩 GCC12              Pass: 100%/4   | Total:  2h 37m | Avg: 39m 28s | Max: 58m 20s
      🟩 GCC13              Pass: 100%/8   | Total:  5h 13m | Avg: 39m 08s | Max:  1h 07m
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 08m | Hits: 160%/1776  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 30m | Avg:  1h 15m | Max:  1h 19m | Hits: 159%/1776  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 12m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 16h 23m | Avg: 57m 49s | Max:  1h 17m
      🟩 GCC                Pass: 100%/21  | Total: 17h 02m | Avg: 48m 41s | Max:  1h 13m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 43m | Avg:  1h 10m | Max:  1h 19m | Hits: 159%/3552  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 12m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 42m 22s | Avg: 21m 11s | Max: 23m 21s
      🟩 v100               Pass: 100%/42  | Total:  1d 15h | Avg: 56m 47s | Max:  1h 19m | Hits: 159%/3552  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 13h | Avg:  1h 00m | Max:  1h 19m | Hits: 159%/3552  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 24m 43s | Avg: 24m 43s | Max: 24m 43s
      🟩 GraphCapture       Pass: 100%/1   | Total: 30m 33s | Avg: 30m 33s | Max: 30m 33s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 15m | Avg: 25m 18s | Max: 34m 33s
      🟩 TestGPU            Pass: 100%/2   | Total: 51m 53s | Avg: 25m 56s | Max: 27m 22s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 42m 22s | Avg: 21m 11s | Max: 23m 21s
      🟩 90a                Pass: 100%/1   | Total: 24m 19s | Avg: 24m 19s | Max: 24m 19s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 20h 32m | Avg:  1h 01m | Max:  1h 17m | Hits: 160%/2664  
      🟩 20                 Pass: 100%/24  | Total: 19h 54m | Avg: 49m 46s | Max:  1h 19m | Hits: 159%/888   
    
  • 🟩 libcudacxx: Pass: 100%/43 | Total: 10h 16m | Avg: 14m 20s | Max: 33m 57s | Hits: 679%/10065

    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total:  9h 52m | Avg: 14m 27s | Max: 33m 57s | Hits: 679%/10065 
      🟩 arm64              Pass: 100%/2   | Total: 24m 09s | Avg: 12m 04s | Max: 20m 49s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 57m 14s | Avg: 11m 26s | Max: 20m 53s | Hits: 682%/2471  
      🟩 12.5               Pass: 100%/2   | Total:  1h 02m | Avg: 31m 23s | Max: 33m 10s
      🟩 12.6               Pass: 100%/36  | Total:  8h 16m | Avg: 13m 47s | Max: 33m 57s | Hits: 678%/7594  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 07m | Avg: 16m 47s | Max: 22m 15s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 57m 14s | Avg: 11m 26s | Max: 20m 53s | Hits: 682%/2471  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 02m | Avg: 31m 23s | Max: 33m 10s
      🟩 nvcc12.6           Pass: 100%/32  | Total:  7h 09m | Avg: 13m 25s | Max: 33m 57s | Hits: 678%/7594  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 07m | Avg: 16m 47s | Max: 22m 15s
      🟩 nvcc               Pass: 100%/39  | Total:  9h 09m | Avg: 14m 05s | Max: 33m 57s | Hits: 679%/10065 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 53m 31s | Avg: 13m 22s | Max: 20m 53s
      🟩 Clang15            Pass: 100%/2   | Total: 31m 44s | Avg: 15m 52s | Max: 24m 11s
      🟩 Clang16            Pass: 100%/2   | Total: 44m 40s | Avg: 22m 20s | Max: 22m 41s
      🟩 Clang17            Pass: 100%/2   | Total: 30m 53s | Avg: 15m 26s | Max: 23m 03s
      🟩 Clang18            Pass: 100%/8   | Total:  2h 41m | Avg: 20m 11s | Max: 33m 57s
      🟩 GCC7               Pass: 100%/2   | Total:  8m 38s | Avg:  4m 19s | Max:  4m 52s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 54s | Avg:  3m 54s | Max:  3m 54s
      🟩 GCC9               Pass: 100%/2   | Total:  8m 30s | Avg:  4m 15s | Max:  4m 42s
      🟩 GCC10              Pass: 100%/2   | Total: 13m 03s | Avg:  6m 31s | Max:  6m 37s
      🟩 GCC11              Pass: 100%/2   | Total: 10m 14s | Avg:  5m 07s | Max:  6m 09s
      🟩 GCC12              Pass: 100%/2   | Total:  8m 01s | Avg:  4m 00s | Max:  4m 06s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 18m | Avg:  9m 45s | Max: 22m 57s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 44m 13s | Avg: 22m 06s | Max: 23m 38s | Hits: 682%/4952  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 56m 53s | Avg: 28m 26s | Max: 30m 28s | Hits: 676%/5113  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 02m | Avg: 31m 23s | Max: 33m 10s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  5h 22m | Avg: 17m 54s | Max: 33m 57s
      🟩 GCC                Pass: 100%/19  | Total:  2h 10m | Avg:  6m 51s | Max: 22m 57s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 41m | Avg: 25m 16s | Max: 30m 28s | Hits: 679%/10065 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 02m | Avg: 31m 23s | Max: 33m 10s
    🟩 gpu
      🟩 v100               Pass: 100%/43  | Total: 10h 16m | Avg: 14m 20s | Max: 33m 57s | Hits: 679%/10065 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total:  8h 39m | Avg: 13m 40s | Max: 33m 10s | Hits: 679%/10065 
      🟩 NVRTC              Pass: 100%/2   | Total: 44m 37s | Avg: 22m 18s | Max: 22m 57s
      🟩 Test               Pass: 100%/2   | Total: 50m 38s | Avg: 25m 19s | Max: 33m 57s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 57s | Avg:  1m 57s | Max:  1m 57s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 20s | Avg: 13m 20s | Max: 13m 20s
      🟩 90a                Pass: 100%/2   | Total: 17m 17s | Avg:  8m 38s | Max: 13m 43s
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  4h 26m | Avg: 12m 42s | Max: 29m 36s | Hits: 682%/7433  
      🟩 20                 Pass: 100%/21  | Total:  5h 47m | Avg: 16m 33s | Max: 33m 57s | Hits: 669%/2632  
    
  • 🟩 thrust: Pass: 100%/42 | Total: 23h 30m | Avg: 33m 35s | Max: 1h 07m | Hits: 177%/7384

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total:  1h 07m | Avg: 33m 39s | Max: 39m 01s
    🟩 cpu
      🟩 amd64              Pass: 100%/40  | Total: 22h 27m | Avg: 33m 41s | Max:  1h 07m | Hits: 177%/7384  
      🟩 arm64              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 36s | Max: 31m 39s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 03m | Avg: 36m 43s | Max: 55m 14s | Hits: 177%/1846  
      🟩 12.5               Pass: 100%/2   | Total:  1h 52m | Avg: 56m 03s | Max: 58m 47s
      🟩 12.6               Pass: 100%/35  | Total: 18h 35m | Avg: 31m 51s | Max:  1h 07m | Hits: 177%/5538  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 52m 11s | Avg: 26m 05s | Max: 26m 22s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 03m | Avg: 36m 43s | Max: 55m 14s | Hits: 177%/1846  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 52m | Avg: 56m 03s | Max: 58m 47s
      🟩 nvcc12.6           Pass: 100%/33  | Total: 17h 42m | Avg: 32m 12s | Max:  1h 07m | Hits: 177%/5538  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 52m 11s | Avg: 26m 05s | Max: 26m 22s
      🟩 nvcc               Pass: 100%/40  | Total: 22h 38m | Avg: 33m 58s | Max:  1h 07m | Hits: 177%/7384  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 01m | Avg: 30m 21s | Max: 31m 31s
      🟩 Clang15            Pass: 100%/2   | Total:  1h 05m | Avg: 32m 35s | Max: 35m 12s
      🟩 Clang16            Pass: 100%/2   | Total: 58m 48s | Avg: 29m 24s | Max: 29m 43s
      🟩 Clang17            Pass: 100%/2   | Total:  1h 04m | Avg: 32m 12s | Max: 32m 50s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 56m | Avg: 25m 10s | Max: 31m 33s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 04m | Avg: 32m 09s | Max: 32m 16s
      🟩 GCC8               Pass: 100%/1   | Total: 30m 12s | Avg: 30m 12s | Max: 30m 12s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 09m | Avg: 34m 36s | Max: 35m 37s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 44s | Max: 31m 56s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 04m | Avg: 32m 27s | Max: 33m 19s
      🟩 GCC12              Pass: 100%/2   | Total:  1h 08m | Avg: 34m 18s | Max: 34m 32s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 33m | Avg: 26m 39s | Max: 39m 01s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 51m | Avg: 55m 36s | Max: 55m 59s | Hits: 177%/3692  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 07m | Hits: 177%/3692  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 52m | Avg: 56m 03s | Max: 58m 47s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  8h 06m | Avg: 28m 35s | Max: 35m 12s
      🟩 GCC                Pass: 100%/19  | Total:  9h 33m | Avg: 30m 12s | Max: 39m 01s
      🟩 MSVC               Pass: 100%/4   | Total:  3h 58m | Avg: 59m 41s | Max:  1h 07m | Hits: 177%/7384  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 52m | Avg: 56m 03s | Max: 58m 47s
    🟩 gpu
      🟩 v100               Pass: 100%/42  | Total: 23h 30m | Avg: 33m 35s | Max:  1h 07m | Hits: 177%/7384  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 21h 57m | Avg: 35m 36s | Max:  1h 07m | Hits: 177%/7384  
      🟩 TestCPU            Pass: 100%/2   | Total: 14m 51s | Avg:  7m 25s | Max:  7m 32s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 18m | Avg: 26m 12s | Max: 39m 01s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 23m 35s | Avg: 23m 35s | Max: 23m 35s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 12h 06m | Avg: 36m 19s | Max: 59m 47s | Hits: 177%/5538  
      🟩 20                 Pass: 100%/20  | Total: 10h 17m | Avg: 30m 51s | Max:  1h 07m | Hits: 177%/1846  
    
  • 🟩 cudax: Pass: 100%/20 | Total: 2h 09m | Avg: 6m 29s | Max: 19m 33s | Hits: 286%/522

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  1h 55m | Avg:  7m 13s | Max: 19m 33s | Hits: 286%/522   
      🟩 arm64              Pass: 100%/4   | Total: 14m 10s | Avg:  3m 32s | Max:  3m 39s
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total:  9m 55s | Avg:  9m 55s | Max:  9m 55s | Hits: 286%/261   
      🟩 12.5               Pass: 100%/2   | Total: 17m 02s | Avg:  8m 31s | Max:  8m 40s
      🟩 12.6               Pass: 100%/17  | Total:  1h 42m | Avg:  6m 02s | Max: 19m 33s | Hits: 286%/261   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total:  9m 55s | Avg:  9m 55s | Max:  9m 55s | Hits: 286%/261   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 17m 02s | Avg:  8m 31s | Max:  8m 40s
      🟩 nvcc12.6           Pass: 100%/17  | Total:  1h 42m | Avg:  6m 02s | Max: 19m 33s | Hits: 286%/261   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  2h 09m | Avg:  6m 29s | Max: 19m 33s | Hits: 286%/522   
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  4m 27s | Avg:  4m 27s | Max:  4m 27s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 09s | Avg:  4m 09s | Max:  4m 09s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 57s | Avg:  3m 57s | Max:  3m 57s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 59s | Avg:  3m 59s | Max:  3m 59s
      🟩 Clang18            Pass: 100%/4   | Total: 30m 41s | Avg:  7m 40s | Max: 19m 33s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 49s | Avg:  3m 49s | Max:  3m 49s
      🟩 GCC11              Pass: 100%/1   | Total:  4m 07s | Avg:  4m 07s | Max:  4m 07s
      🟩 GCC12              Pass: 100%/2   | Total: 21m 10s | Avg: 10m 35s | Max: 16m 54s
      🟩 GCC13              Pass: 100%/4   | Total: 13m 45s | Avg:  3m 26s | Max:  3m 39s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  9m 55s | Avg:  9m 55s | Max:  9m 55s | Hits: 286%/261   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 42s | Avg: 12m 42s | Max: 12m 42s | Hits: 286%/261   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 17m 02s | Avg:  8m 31s | Max:  8m 40s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 47m 13s | Avg:  5m 54s | Max: 19m 33s
      🟩 GCC                Pass: 100%/8   | Total: 42m 51s | Avg:  5m 21s | Max: 16m 54s
      🟩 MSVC               Pass: 100%/2   | Total: 22m 37s | Avg: 11m 18s | Max: 12m 42s | Hits: 286%/522   
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 02s | Avg:  8m 31s | Max:  8m 40s
    🟩 gpu
      🟩 v100               Pass: 100%/20  | Total:  2h 09m | Avg:  6m 29s | Max: 19m 33s | Hits: 286%/522   
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  1h 33m | Avg:  5m 10s | Max: 12m 42s | Hits: 286%/522   
      🟩 Test               Pass: 100%/2   | Total: 36m 27s | Avg: 18m 13s | Max: 19m 33s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 12s | Avg:  3m 12s | Max:  3m 12s
      🟩 90a                Pass: 100%/1   | Total:  3m 23s | Avg:  3m 23s | Max:  3m 23s
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 18m 31s | Avg:  4m 37s | Max:  8m 22s
      🟩 20                 Pass: 100%/16  | Total:  1h 51m | Avg:  6m 57s | Max: 19m 33s | Hits: 286%/522   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 41s | Avg: 4m 50s | Max: 7m 29s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  7m 29s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  7m 29s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  7m 29s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  7m 29s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  7m 29s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  7m 29s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  7m 29s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 12s | Avg:  2m 12s | Max:  2m 12s
      🟩 Test               Pass: 100%/1   | Total:  7m 29s | Avg:  7m 29s | Max:  7m 29s
    
  • 🟩 python: Pass: 100%/1 | Total: 48m 51s | Avg: 48m 51s | Max: 48m 51s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 48m 51s | Avg: 48m 51s | Max: 48m 51s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 48m 51s | Avg: 48m 51s | Max: 48m 51s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 48m 51s | Avg: 48m 51s | Max: 48m 51s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 48m 51s | Avg: 48m 51s | Max: 48m 51s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 48m 51s | Avg: 48m 51s | Max: 48m 51s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 48m 51s | Avg: 48m 51s | Max: 48m 51s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 48m 51s | Avg: 48m 51s | Max: 48m 51s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 48m 51s | Avg: 48m 51s | Max: 48m 51s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 152)

# Runner
110 linux-amd64-cpu16
17 linux-amd64-gpu-v100-latest-1
14 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1

@bernhardmgruber bernhardmgruber merged commit 3e888d8 into NVIDIA:main Jan 30, 2025
164 of 168 checks passed
@bernhardmgruber bernhardmgruber deleted the ptx_remove_internal branch January 30, 2025 08:10
Backport to branch/2.8.x failed because the commit(s) could not be cherry-picked.

Please cherry-pick the changes locally.

git fetch origin branch/2.8.x
git worktree add -d .worktree/backport-3583-to-branch/2.8.x origin/branch/2.8.x
cd .worktree/backport-3583-to-branch/2.8.x
git checkout -b backport-3583-to-branch/2.8.x
ancref=$(git merge-base d21e0c9804ad63d23950c8b0a2462e5b7ebc8701 092fdc691acbe3197e54791ffc21bb51d30598ac)
git cherry-pick -x $ancref..092fdc691acbe3197e54791ffc21bb51d30598ac

bernhardmgruber added a commit that referenced this pull request Jan 31, 2025
* barrier.cluster.aligned: Remove
This is not supposed to be exposed in CCCL.

* elect.sync: Remove
Not ready for inclusion yet. This needs to handle the optional extra
output mask as well.

* mapa: Remove
This has compiler bugs. We should use intrinsics instead.

Co-authored-by: Allard Hendriksen <[email protected]>