Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add span to example and templated block size #2470

Merged
merged 3 commits into from
Sep 28, 2024

Conversation

Kh4ster
Copy link
Contributor

@Kh4ster Kh4ster commented Sep 27, 2024

This commits update the README example:

  1. Update codebolt link
  2. Use cuda::std::span instead of raw pointers
  3. Use templated block size instead of global constexpr

The 3. could be reverted. I personnaly think it's good to remind new users that kernels are templatable.

@Kh4ster Kh4ster requested a review from a team as a code owner September 27, 2024 09:31
@Kh4ster Kh4ster requested a review from miscco September 27, 2024 09:31
Copy link

copy-pr-bot bot commented Sep 27, 2024

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Copy link
Collaborator

@miscco miscco left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks a lot, I found some other improvements

README.md Outdated Show resolved Hide resolved
README.md Outdated Show resolved Hide resolved
README.md Show resolved Hide resolved
README.md Outdated Show resolved Hide resolved
@miscco
Copy link
Collaborator

miscco commented Sep 27, 2024

/ok to test

Copy link
Contributor

🟩 CI finished in 1h 27m: Pass: 100%/368 | Total: 1d 13h | Avg: 6m 06s | Max: 1h 19m | Hits: 99%/25663
  • 🟩 cub: Pass: 100%/104 | Total: 12h 45m | Avg: 7m 21s | Max: 1h 19m | Hits: 99%/2908

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 12h 12m | Avg:  7m 37s | Max:  1h 19m | Hits:  99%/2908  
      🟩 arm64              Pass: 100%/8   | Total: 33m 08s | Avg:  4m 08s | Max:  4m 40s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 08m | Avg:  4m 34s | Max: 15m 24s | Hits:  99%/727   
      🟩 11.8               Pass: 100%/3   | Total: 14m 32s | Avg:  4m 50s | Max:  5m 06s
      🟩 12.6               Pass: 100%/86  | Total: 11h 22m | Avg:  7m 55s | Max:  1h 19m | Hits:  99%/2181  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  7m 54s | Avg:  3m 57s | Max:  4m 02s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 08m | Avg:  4m 34s | Max: 15m 24s | Hits:  99%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 14m 32s | Avg:  4m 50s | Max:  5m 06s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 11h 14m | Avg:  8m 01s | Max:  1h 19m | Hits:  99%/2181  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  7m 54s | Avg:  3m 57s | Max:  4m 02s
      🟩 nvcc               Pass: 100%/102 | Total: 12h 37m | Avg:  7m 25s | Max:  1h 19m | Hits:  99%/2908  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 28m 25s | Avg:  4m 44s | Max:  5m 25s
      🟩 Clang10            Pass: 100%/3   | Total: 15m 55s | Avg:  5m 18s | Max:  5m 22s
      🟩 Clang11            Pass: 100%/4   | Total: 18m 12s | Avg:  4m 33s | Max:  4m 45s
      🟩 Clang12            Pass: 100%/4   | Total: 18m 56s | Avg:  4m 44s | Max:  5m 05s
      🟩 Clang13            Pass: 100%/4   | Total: 18m 23s | Avg:  4m 35s | Max:  4m 59s
      🟩 Clang14            Pass: 100%/4   | Total: 18m 45s | Avg:  4m 41s | Max:  5m 02s
      🟩 Clang15            Pass: 100%/4   | Total: 18m 34s | Avg:  4m 38s | Max:  5m 04s
      🟩 Clang16            Pass: 100%/4   | Total: 18m 40s | Avg:  4m 40s | Max:  5m 00s
      🟩 Clang17            Pass: 100%/4   | Total: 18m 38s | Avg:  4m 39s | Max:  4m 45s
      🟩 Clang18            Pass: 100%/9   | Total:  1h 31m | Avg: 10m 13s | Max: 41m 01s
      🟩 GCC6               Pass: 100%/2   | Total:  6m 55s | Avg:  3m 27s | Max:  3m 29s
      🟩 GCC7               Pass: 100%/6   | Total: 23m 58s | Avg:  3m 59s | Max:  4m 46s
      🟩 GCC8               Pass: 100%/6   | Total: 24m 48s | Avg:  4m 08s | Max:  4m 36s
      🟩 GCC9               Pass: 100%/6   | Total: 25m 17s | Avg:  4m 12s | Max:  4m 45s
      🟩 GCC10              Pass: 100%/4   | Total: 19m 11s | Avg:  4m 47s | Max:  4m 57s
      🟩 GCC11              Pass: 100%/7   | Total: 33m 11s | Avg:  4m 44s | Max:  5m 06s
      🟩 GCC12              Pass: 100%/4   | Total: 18m 53s | Avg:  4m 43s | Max:  4m 48s
      🟩 GCC13              Pass: 100%/16  | Total:  4h 36m | Avg: 17m 16s | Max:  1h 19m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 16m 38s | Avg:  5m 32s | Max:  5m 41s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 24s | Avg: 15m 24s | Max: 15m 24s | Hits:  99%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 24m 52s | Avg: 12m 26s | Max: 12m 29s | Hits:  99%/1454  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 13m 23s | Avg: 13m 23s | Max: 13m 23s | Hits:  99%/727   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  4h 26m | Avg:  5m 47s | Max: 41m 01s
      🟩 GCC                Pass: 100%/51  | Total:  7h 08m | Avg:  8m 24s | Max:  1h 19m
      🟩 Intel              Pass: 100%/3   | Total: 16m 38s | Avg:  5m 32s | Max:  5m 41s
      🟩 MSVC               Pass: 100%/4   | Total: 53m 39s | Avg: 13m 24s | Max: 15m 24s | Hits:  99%/2908  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 12h 45m | Avg:  7m 21s | Max:  1h 19m | Hits:  99%/2908  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  7h 46m | Avg:  4m 51s | Max: 15m 24s | Hits:  99%/2908  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 25m 34s | Avg: 25m 34s | Max: 25m 34s
      🟩 GraphCapture       Pass: 100%/1   | Total: 32m 46s | Avg: 32m 46s | Max: 32m 46s
      🟩 HostLaunch         Pass: 100%/3   | Total:  2h 19m | Avg: 46m 20s | Max:  1h 19m
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 41m | Avg: 33m 46s | Max: 41m 01s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 14m 32s | Avg:  4m 50s | Max:  5m 06s
      🟩 90a                Pass: 100%/4   | Total: 14m 09s | Avg:  3m 32s | Max:  3m 36s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  3h 39m | Avg:  7m 50s | Max:  1h 19m
      🟩 14                 Pass: 100%/27  | Total:  2h 19m | Avg:  5m 09s | Max: 15m 24s | Hits:  99%/1454  
      🟩 17                 Pass: 100%/26  | Total:  2h 07m | Avg:  4m 53s | Max: 12m 23s | Hits:  99%/727   
      🟩 20                 Pass: 100%/23  | Total:  4h 39m | Avg: 12m 08s | Max: 41m 01s | Hits:  99%/727   
    
  • 🟩 libcudacxx: Pass: 100%/104 | Total: 11h 19m | Avg: 6m 32s | Max: 47m 19s | Hits: 99%/11383

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 10h 49m | Avg:  6m 45s | Max: 47m 19s | Hits:  99%/11383 
      🟩 arm64              Pass: 100%/8   | Total: 30m 23s | Avg:  3m 47s | Max:  4m 32s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 02m | Avg:  4m 11s | Max: 20m 59s | Hits:  99%/2648  
      🟩 11.8               Pass: 100%/3   | Total: 10m 01s | Avg:  3m 20s | Max:  3m 38s
      🟩 12.6               Pass: 100%/86  | Total: 10h 06m | Avg:  7m 03s | Max: 47m 19s | Hits:  99%/8735  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 41m 56s | Avg: 20m 58s | Max: 22m 05s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 02m | Avg:  4m 11s | Max: 20m 59s | Hits:  99%/2648  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 10m 01s | Avg:  3m 20s | Max:  3m 38s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  9h 24m | Avg:  6m 43s | Max: 47m 19s | Hits:  99%/8735  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 41m 56s | Avg: 20m 58s | Max: 22m 05s
      🟩 nvcc               Pass: 100%/102 | Total: 10h 37m | Avg:  6m 15s | Max: 47m 19s | Hits:  99%/11383 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 24m 56s | Avg:  4m 09s | Max:  5m 27s
      🟩 Clang10            Pass: 100%/3   | Total: 15m 25s | Avg:  5m 08s | Max:  5m 31s
      🟩 Clang11            Pass: 100%/4   | Total: 18m 04s | Avg:  4m 31s | Max:  4m 55s
      🟩 Clang12            Pass: 100%/4   | Total: 17m 08s | Avg:  4m 17s | Max:  4m 49s
      🟩 Clang13            Pass: 100%/4   | Total: 17m 32s | Avg:  4m 23s | Max:  4m 36s
      🟩 Clang14            Pass: 100%/4   | Total: 17m 29s | Avg:  4m 22s | Max:  4m 38s
      🟩 Clang15            Pass: 100%/4   | Total: 17m 38s | Avg:  4m 24s | Max:  5m 10s
      🟩 Clang16            Pass: 100%/4   | Total: 18m 00s | Avg:  4m 30s | Max:  5m 03s
      🟩 Clang17            Pass: 100%/4   | Total: 17m 00s | Avg:  4m 15s | Max:  4m 58s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 30m | Avg: 11m 16s | Max: 28m 09s
      🟩 GCC6               Pass: 100%/2   | Total:  5m 41s | Avg:  2m 50s | Max:  2m 51s
      🟩 GCC7               Pass: 100%/6   | Total: 19m 14s | Avg:  3m 12s | Max:  3m 50s
      🟩 GCC8               Pass: 100%/6   | Total: 19m 55s | Avg:  3m 19s | Max:  3m 57s
      🟩 GCC9               Pass: 100%/6   | Total: 21m 05s | Avg:  3m 30s | Max:  4m 09s
      🟩 GCC10              Pass: 100%/4   | Total: 16m 18s | Avg:  4m 04s | Max:  4m 26s
      🟩 GCC11              Pass: 100%/7   | Total: 26m 29s | Avg:  3m 47s | Max:  4m 25s
      🟩 GCC12              Pass: 100%/4   | Total: 16m 16s | Avg:  4m 04s | Max:  4m 16s
      🟩 GCC13              Pass: 100%/17  | Total:  3h 35m | Avg: 12m 39s | Max: 47m 19s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 18m 05s | Avg:  6m 01s | Max:  6m 30s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 20m 59s | Avg: 20m 59s | Max: 20m 59s | Hits:  99%/2648  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 30m 14s | Avg: 15m 07s | Max: 15m 13s | Hits:  99%/5658  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 16m 49s | Avg: 16m 49s | Max: 16m 49s | Hits:  99%/3077  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/45  | Total:  4h 13m | Avg:  5m 37s | Max: 28m 09s
      🟩 GCC                Pass: 100%/52  | Total:  5h 40m | Avg:  6m 32s | Max: 47m 19s
      🟩 Intel              Pass: 100%/3   | Total: 18m 05s | Avg:  6m 01s | Max:  6m 30s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 08m | Avg: 17m 00s | Max: 20m 59s | Hits:  99%/11383 
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 11h 19m | Avg:  6m 32s | Max: 47m 19s | Hits:  99%/11383 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  7h 55m | Avg:  4m 57s | Max: 22m 05s | Hits:  99%/11383 
      🟩 NVRTC              Pass: 100%/4   | Total:  2h 17m | Avg: 34m 23s | Max: 47m 19s
      🟩 Test               Pass: 100%/3   | Total:  1h 03m | Avg: 21m 18s | Max: 28m 09s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 17s | Avg:  2m 17s | Max:  2m 17s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 10m 01s | Avg:  3m 20s | Max:  3m 38s
      🟩 90a                Pass: 100%/4   | Total: 16m 35s | Avg:  4m 08s | Max:  4m 36s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 06m | Avg:  4m 30s | Max: 20m 46s
      🟩 14                 Pass: 100%/28  | Total:  2h 38m | Avg:  5m 39s | Max: 22m 12s | Hits:  99%/5397  
      🟩 17                 Pass: 100%/27  | Total:  3h 06m | Avg:  6m 53s | Max: 47m 19s | Hits:  99%/2909  
      🟩 20                 Pass: 100%/20  | Total:  3h 26m | Avg: 10m 20s | Max: 47m 17s | Hits:  99%/3077  
    
  • 🟩 thrust: Pass: 100%/103 | Total: 10h 26m | Avg: 6m 04s | Max: 24m 03s | Hits: 99%/11150

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total:  9h 47m | Avg:  6m 11s | Max: 24m 03s | Hits:  99%/11150 
      🟩 arm64              Pass: 100%/8   | Total: 38m 37s | Avg:  4m 49s | Max:  5m 38s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 18m | Avg:  5m 14s | Max: 20m 44s | Hits:  99%/2230  
      🟩 11.8               Pass: 100%/3   | Total: 14m 20s | Avg:  4m 46s | Max:  5m 15s
      🟩 12.6               Pass: 100%/85  | Total:  8h 53m | Avg:  6m 16s | Max: 24m 03s | Hits:  99%/8920  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  5m 06s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 18m | Avg:  5m 14s | Max: 20m 44s | Hits:  99%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 14m 20s | Avg:  4m 46s | Max:  5m 15s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  8h 43m | Avg:  6m 18s | Max: 24m 03s | Hits:  99%/8920  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  5m 06s
      🟩 nvcc               Pass: 100%/101 | Total: 10h 16m | Avg:  6m 06s | Max: 24m 03s | Hits:  99%/11150 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 29m 42s | Avg:  4m 57s | Max:  5m 57s
      🟩 Clang10            Pass: 100%/3   | Total: 18m 03s | Avg:  6m 01s | Max:  6m 29s
      🟩 Clang11            Pass: 100%/4   | Total: 19m 01s | Avg:  4m 45s | Max:  5m 16s
      🟩 Clang12            Pass: 100%/4   | Total: 20m 15s | Avg:  5m 03s | Max:  5m 21s
      🟩 Clang13            Pass: 100%/4   | Total: 19m 03s | Avg:  4m 45s | Max:  5m 12s
      🟩 Clang14            Pass: 100%/4   | Total: 19m 00s | Avg:  4m 45s | Max:  4m 55s
      🟩 Clang15            Pass: 100%/4   | Total: 19m 36s | Avg:  4m 54s | Max:  5m 07s
      🟩 Clang16            Pass: 100%/4   | Total: 20m 05s | Avg:  5m 01s | Max:  5m 16s
      🟩 Clang17            Pass: 100%/4   | Total: 20m 21s | Avg:  5m 05s | Max:  5m 43s
      🟩 Clang18            Pass: 100%/9   | Total:  1h 06m | Avg:  7m 21s | Max: 24m 03s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 10s | Avg:  4m 05s | Max:  4m 16s
      🟩 GCC7               Pass: 100%/6   | Total: 26m 51s | Avg:  4m 28s | Max:  5m 03s
      🟩 GCC8               Pass: 100%/6   | Total: 25m 45s | Avg:  4m 17s | Max:  4m 54s
      🟩 GCC9               Pass: 100%/6   | Total: 27m 36s | Avg:  4m 36s | Max:  5m 18s
      🟩 GCC10              Pass: 100%/4   | Total: 19m 34s | Avg:  4m 53s | Max:  5m 06s
      🟩 GCC11              Pass: 100%/7   | Total: 36m 42s | Avg:  5m 14s | Max:  7m 52s
      🟩 GCC12              Pass: 100%/4   | Total: 20m 51s | Avg:  5m 12s | Max:  5m 23s
      🟩 GCC13              Pass: 100%/14  | Total:  1h 33m | Avg:  6m 42s | Max: 16m 42s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 18m 58s | Avg:  6m 19s | Max:  6m 55s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 20m 44s | Avg: 20m 44s | Max: 20m 44s | Hits:  99%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 35m 10s | Avg: 17m 35s | Max: 18m 27s | Hits:  99%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 40m 38s | Avg: 20m 19s | Max: 22m 19s | Hits:  99%/4460  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  4h 11m | Avg:  5m 27s | Max: 24m 03s
      🟩 GCC                Pass: 100%/49  | Total:  4h 19m | Avg:  5m 17s | Max: 16m 42s
      🟩 Intel              Pass: 100%/3   | Total: 18m 58s | Avg:  6m 19s | Max:  6m 55s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 36m | Avg: 19m 18s | Max: 22m 19s | Hits:  99%/11150 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total: 10h 26m | Avg:  6m 04s | Max: 24m 03s | Hits:  99%/11150 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  8h 45m | Avg:  5m 28s | Max: 20m 44s | Hits:  99%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 44m 36s | Avg: 11m 09s | Max: 22m 19s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total: 55m 58s | Avg: 18m 39s | Max: 24m 03s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 14m 20s | Avg:  4m 46s | Max:  5m 15s
      🟩 90a                Pass: 100%/4   | Total: 16m 25s | Avg:  4m 06s | Max:  4m 14s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 21m | Avg:  5m 02s | Max: 16m 42s
      🟩 14                 Pass: 100%/27  | Total:  2h 40m | Avg:  5m 56s | Max: 20m 44s | Hits:  99%/4460  
      🟩 17                 Pass: 100%/26  | Total:  2h 23m | Avg:  5m 31s | Max: 16m 43s | Hits:  99%/2230  
      🟩 20                 Pass: 100%/22  | Total:  3h 00m | Avg:  8m 13s | Max: 24m 03s | Hits:  99%/4460  
    
  • 🟩 cudax: Pass: 100%/52 | Total: 2h 23m | Avg: 2m 45s | Max: 10m 08s | Hits: 90%/222

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  2h 16m | Avg:  2m 50s | Max: 10m 08s | Hits:  90%/222   
      🟩 arm64              Pass: 100%/4   | Total:  7m 04s | Avg:  1m 46s | Max:  1m 53s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total: 53m 38s | Avg:  2m 49s | Max: 10m 08s | Hits:  90%/111   
      🟩 12.6               Pass: 100%/33  | Total:  1h 29m | Avg:  2m 43s | Max:  9m 56s | Hits:  90%/111   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total: 53m 38s | Avg:  2m 49s | Max: 10m 08s | Hits:  90%/111   
      🟩 nvcc12.6           Pass: 100%/33  | Total:  1h 29m | Avg:  2m 43s | Max:  9m 56s | Hits:  90%/111   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/52  | Total:  2h 23m | Avg:  2m 45s | Max: 10m 08s | Hits:  90%/222   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  4m 58s | Avg:  2m 29s | Max:  2m 46s
      🟩 Clang10            Pass: 100%/2   | Total:  4m 40s | Avg:  2m 20s | Max:  2m 26s
      🟩 Clang11            Pass: 100%/4   | Total:  9m 20s | Avg:  2m 20s | Max:  2m 32s
      🟩 Clang12            Pass: 100%/4   | Total:  8m 47s | Avg:  2m 11s | Max:  2m 14s
      🟩 Clang13            Pass: 100%/4   | Total:  9m 06s | Avg:  2m 16s | Max:  2m 21s
      🟩 Clang14            Pass: 100%/4   | Total: 11m 24s | Avg:  2m 51s | Max:  4m 11s
      🟩 Clang15            Pass: 100%/2   | Total:  4m 50s | Avg:  2m 25s | Max:  2m 34s
      🟩 Clang16            Pass: 100%/4   | Total:  8m 46s | Avg:  2m 11s | Max:  2m 47s
      🟩 Clang17            Pass: 100%/2   | Total:  5m 17s | Avg:  2m 38s | Max:  2m 50s
      🟩 Clang18            Pass: 100%/2   | Total:  7m 04s | Avg:  3m 32s | Max:  4m 50s
      🟩 GCC9               Pass: 100%/2   | Total:  4m 20s | Avg:  2m 10s | Max:  2m 16s
      🟩 GCC10              Pass: 100%/4   | Total:  8m 31s | Avg:  2m 07s | Max:  2m 14s
      🟩 GCC11              Pass: 100%/4   | Total:  8m 41s | Avg:  2m 10s | Max:  2m 17s
      🟩 GCC12              Pass: 100%/7   | Total: 22m 09s | Avg:  3m 09s | Max:  4m 50s
      🟩 GCC13              Pass: 100%/3   | Total:  5m 22s | Avg:  1m 47s | Max:  2m 00s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 08s | Avg: 10m 08s | Max: 10m 08s | Hits:  90%/111   
      🟩 MSVC14.39          Pass: 100%/1   | Total:  9m 56s | Avg:  9m 56s | Max:  9m 56s | Hits:  90%/111   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 14m | Avg:  2m 28s | Max:  4m 50s
      🟩 GCC                Pass: 100%/20  | Total: 49m 03s | Avg:  2m 27s | Max:  4m 50s
      🟩 MSVC               Pass: 100%/2   | Total: 20m 04s | Avg: 10m 02s | Max: 10m 08s | Hits:  90%/222   
    🟩 gpu
      🟩 v100               Pass: 100%/52  | Total:  2h 23m | Avg:  2m 45s | Max: 10m 08s | Hits:  90%/222   
    🟩 jobs
      🟩 Build              Pass: 100%/47  | Total:  2h 01m | Avg:  2m 35s | Max: 10m 08s | Hits:  90%/222   
      🟩 Test               Pass: 100%/5   | Total: 21m 52s | Avg:  4m 22s | Max:  4m 50s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 08s | Avg:  2m 08s | Max:  2m 08s
      🟩 90a                Pass: 100%/1   | Total:  2m 00s | Avg:  2m 00s | Max:  2m 00s
    🟩 std
      🟩 17                 Pass: 100%/28  | Total:  1h 06m | Avg:  2m 23s | Max:  4m 50s
      🟩 20                 Pass: 100%/24  | Total:  1h 16m | Avg:  3m 11s | Max: 10m 08s | Hits:  90%/222   
    
  • 🟩 cccl: Pass: 100%/4 | Total: 17m 54s | Avg: 4m 28s | Max: 4m 53s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 17m 54s | Avg:  4m 28s | Max:  4m 53s
    🟩 ctk
      🟩 11.1               Pass: 100%/2   | Total:  8m 23s | Avg:  4m 11s | Max:  4m 31s
      🟩 12.6               Pass: 100%/2   | Total:  9m 31s | Avg:  4m 45s | Max:  4m 53s
    🟩 cudacxx
      🟩 nvcc11.1           Pass: 100%/2   | Total:  8m 23s | Avg:  4m 11s | Max:  4m 31s
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 31s | Avg:  4m 45s | Max:  4m 53s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 17m 54s | Avg:  4m 28s | Max:  4m 53s
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  4m 31s | Avg:  4m 31s | Max:  4m 31s
      🟩 Clang18            Pass: 100%/1   | Total:  4m 53s | Avg:  4m 53s | Max:  4m 53s
      🟩 GCC6               Pass: 100%/1   | Total:  3m 52s | Avg:  3m 52s | Max:  3m 52s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 38s | Avg:  4m 38s | Max:  4m 38s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  9m 24s | Avg:  4m 42s | Max:  4m 53s
      🟩 GCC                Pass: 100%/2   | Total:  8m 30s | Avg:  4m 15s | Max:  4m 38s
    🟩 gpu
      🟩 v100               Pass: 100%/4   | Total: 17m 54s | Avg:  4m 28s | Max:  4m 53s
    🟩 jobs
      🟩 Infra              Pass: 100%/4   | Total: 17m 54s | Avg:  4m 28s | Max:  4m 53s
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 17m 24s | Avg: 17m 24s | Max: 17m 24s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 17m 24s | Avg: 17m 24s | Max: 17m 24s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 17m 24s | Avg: 17m 24s | Max: 17m 24s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 17m 24s | Avg: 17m 24s | Max: 17m 24s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 17m 24s | Avg: 17m 24s | Max: 17m 24s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 17m 24s | Avg: 17m 24s | Max: 17m 24s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 17m 24s | Avg: 17m 24s | Max: 17m 24s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 17m 24s | Avg: 17m 24s | Max: 17m 24s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 17m 24s | Avg: 17m 24s | Max: 17m 24s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
libcu++
CUB
Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 368)

# Runner
297 linux-amd64-cpu16
28 linux-amd64-gpu-v100-latest-1
28 linux-arm64-cpu16
15 windows-amd64-cpu16

@pciolkosz
Copy link
Contributor

/ok to test

Copy link
Contributor

🟩 CI finished in 1h 34m: Pass: 100%/368 | Total: 1d 13h | Avg: 6m 06s | Max: 1h 23m | Hits: 99%/25663
  • 🟩 cub: Pass: 100%/104 | Total: 12h 39m | Avg: 7m 18s | Max: 1h 23m | Hits: 99%/2908

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 12h 04m | Avg:  7m 32s | Max:  1h 23m | Hits:  99%/2908  
      🟩 arm64              Pass: 100%/8   | Total: 34m 32s | Avg:  4m 19s | Max:  4m 57s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 09m | Avg:  4m 36s | Max: 15m 46s | Hits:  99%/727   
      🟩 11.8               Pass: 100%/3   | Total: 13m 25s | Avg:  4m 28s | Max:  4m 36s
      🟩 12.6               Pass: 100%/86  | Total: 11h 16m | Avg:  7m 52s | Max:  1h 23m | Hits:  99%/2181  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  7m 56s | Avg:  3m 58s | Max:  4m 03s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 09m | Avg:  4m 36s | Max: 15m 46s | Hits:  99%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 13m 25s | Avg:  4m 28s | Max:  4m 36s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 11h 08m | Avg:  7m 57s | Max:  1h 23m | Hits:  99%/2181  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  7m 56s | Avg:  3m 58s | Max:  4m 03s
      🟩 nvcc               Pass: 100%/102 | Total: 12h 31m | Avg:  7m 21s | Max:  1h 23m | Hits:  99%/2908  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 27m 47s | Avg:  4m 37s | Max:  5m 38s
      🟩 Clang10            Pass: 100%/3   | Total: 17m 02s | Avg:  5m 40s | Max:  5m 53s
      🟩 Clang11            Pass: 100%/4   | Total: 19m 29s | Avg:  4m 52s | Max:  5m 51s
      🟩 Clang12            Pass: 100%/4   | Total: 18m 08s | Avg:  4m 32s | Max:  4m 49s
      🟩 Clang13            Pass: 100%/4   | Total: 19m 04s | Avg:  4m 46s | Max:  4m 57s
      🟩 Clang14            Pass: 100%/4   | Total: 19m 06s | Avg:  4m 46s | Max:  5m 05s
      🟩 Clang15            Pass: 100%/4   | Total: 18m 44s | Avg:  4m 41s | Max:  4m 58s
      🟩 Clang16            Pass: 100%/4   | Total: 18m 01s | Avg:  4m 30s | Max:  4m 36s
      🟩 Clang17            Pass: 100%/4   | Total: 18m 55s | Avg:  4m 43s | Max:  5m 08s
      🟩 Clang18            Pass: 100%/9   | Total:  1h 28m | Avg:  9m 48s | Max: 33m 05s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 30s | Avg:  3m 45s | Max:  3m 47s
      🟩 GCC7               Pass: 100%/6   | Total: 23m 35s | Avg:  3m 55s | Max:  4m 24s
      🟩 GCC8               Pass: 100%/6   | Total: 25m 36s | Avg:  4m 16s | Max:  4m 42s
      🟩 GCC9               Pass: 100%/6   | Total: 24m 48s | Avg:  4m 08s | Max:  4m 49s
      🟩 GCC10              Pass: 100%/4   | Total: 18m 22s | Avg:  4m 35s | Max:  5m 04s
      🟩 GCC11              Pass: 100%/7   | Total: 31m 40s | Avg:  4m 31s | Max:  4m 41s
      🟩 GCC12              Pass: 100%/4   | Total: 18m 50s | Avg:  4m 42s | Max:  4m 55s
      🟩 GCC13              Pass: 100%/16  | Total:  4h 33m | Avg: 17m 04s | Max:  1h 23m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 16m 35s | Avg:  5m 31s | Max:  5m 59s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 46s | Avg: 15m 46s | Max: 15m 46s | Hits:  99%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 24m 58s | Avg: 12m 29s | Max: 12m 47s | Hits:  99%/1454  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 13m 46s | Avg: 13m 46s | Max: 13m 46s | Hits:  99%/727   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  4h 24m | Avg:  5m 45s | Max: 33m 05s
      🟩 GCC                Pass: 100%/51  | Total:  7h 03m | Avg:  8m 18s | Max:  1h 23m
      🟩 Intel              Pass: 100%/3   | Total: 16m 35s | Avg:  5m 31s | Max:  5m 59s
      🟩 MSVC               Pass: 100%/4   | Total: 54m 30s | Avg: 13m 37s | Max: 15m 46s | Hits:  99%/2908  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 12h 39m | Avg:  7m 18s | Max:  1h 23m | Hits:  99%/2908  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  7h 48m | Avg:  4m 52s | Max: 15m 46s | Hits:  99%/2908  
      🟩 DeviceLaunch       Pass: 100%/1   | Total:  1h 23m | Avg:  1h 23m | Max:  1h 23m
      🟩 GraphCapture       Pass: 100%/1   | Total: 24m 09s | Avg: 24m 09s | Max: 24m 09s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 14m | Avg: 24m 44s | Max: 27m 54s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 48m | Avg: 36m 08s | Max: 38m 27s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 13m 25s | Avg:  4m 28s | Max:  4m 36s
      🟩 90a                Pass: 100%/4   | Total: 14m 44s | Avg:  3m 41s | Max:  3m 48s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 50m | Avg:  6m 05s | Max: 36m 53s
      🟩 14                 Pass: 100%/27  | Total:  2h 19m | Avg:  5m 09s | Max: 15m 46s | Hits:  99%/1454  
      🟩 17                 Pass: 100%/26  | Total:  2h 08m | Avg:  4m 57s | Max: 12m 47s | Hits:  99%/727   
      🟩 20                 Pass: 100%/23  | Total:  5h 20m | Avg: 13m 56s | Max:  1h 23m | Hits:  99%/727   
    
  • 🟩 libcudacxx: Pass: 100%/104 | Total: 11h 08m | Avg: 6m 25s | Max: 29m 45s | Hits: 99%/11383

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 10h 38m | Avg:  6m 39s | Max: 29m 45s | Hits:  99%/11383 
      🟩 arm64              Pass: 100%/8   | Total: 30m 08s | Avg:  3m 46s | Max:  4m 10s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 05m | Avg:  4m 20s | Max: 19m 53s | Hits:  99%/2648  
      🟩 11.8               Pass: 100%/3   | Total: 52m 26s | Avg: 17m 28s | Max: 26m 15s
      🟩 12.6               Pass: 100%/86  | Total:  9h 11m | Avg:  6m 24s | Max: 29m 45s | Hits:  99%/8735  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 39m 49s | Avg: 19m 54s | Max: 22m 02s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 05m | Avg:  4m 20s | Max: 19m 53s | Hits:  99%/2648  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 52m 26s | Avg: 17m 28s | Max: 26m 15s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  8h 31m | Avg:  6m 05s | Max: 29m 45s | Hits:  99%/8735  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 39m 49s | Avg: 19m 54s | Max: 22m 02s
      🟩 nvcc               Pass: 100%/102 | Total: 10h 29m | Avg:  6m 10s | Max: 29m 45s | Hits:  99%/11383 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 26m 19s | Avg:  4m 23s | Max:  5m 35s
      🟩 Clang10            Pass: 100%/3   | Total: 16m 24s | Avg:  5m 28s | Max:  5m 55s
      🟩 Clang11            Pass: 100%/4   | Total: 18m 22s | Avg:  4m 35s | Max:  5m 03s
      🟩 Clang12            Pass: 100%/4   | Total: 16m 51s | Avg:  4m 12s | Max:  4m 51s
      🟩 Clang13            Pass: 100%/4   | Total: 18m 21s | Avg:  4m 35s | Max:  4m 51s
      🟩 Clang14            Pass: 100%/4   | Total: 18m 27s | Avg:  4m 36s | Max:  4m 54s
      🟩 Clang15            Pass: 100%/4   | Total: 18m 34s | Avg:  4m 38s | Max:  4m 57s
      🟩 Clang16            Pass: 100%/4   | Total: 17m 45s | Avg:  4m 26s | Max:  5m 05s
      🟩 Clang17            Pass: 100%/4   | Total: 17m 51s | Avg:  4m 27s | Max:  4m 40s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 16m | Avg:  9m 35s | Max: 22m 02s
      🟩 GCC6               Pass: 100%/2   | Total:  6m 18s | Avg:  3m 09s | Max:  3m 43s
      🟩 GCC7               Pass: 100%/6   | Total: 20m 48s | Avg:  3m 28s | Max:  4m 06s
      🟩 GCC8               Pass: 100%/6   | Total: 20m 48s | Avg:  3m 28s | Max:  4m 18s
      🟩 GCC9               Pass: 100%/6   | Total: 21m 55s | Avg:  3m 39s | Max:  4m 39s
      🟩 GCC10              Pass: 100%/4   | Total: 17m 03s | Avg:  4m 15s | Max:  4m 46s
      🟩 GCC11              Pass: 100%/7   | Total:  1h 09m | Avg:  9m 56s | Max: 26m 15s
      🟩 GCC12              Pass: 100%/4   | Total: 17m 16s | Avg:  4m 19s | Max:  4m 46s
      🟩 GCC13              Pass: 100%/17  | Total:  2h 50m | Avg: 10m 00s | Max: 29m 45s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 17m 15s | Avg:  5m 45s | Max:  6m 04s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 19m 53s | Avg: 19m 53s | Max: 19m 53s | Hits:  99%/2648  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 27m 27s | Avg: 13m 43s | Max: 13m 55s | Hits:  99%/5658  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 14m 51s | Avg: 14m 51s | Max: 14m 51s | Hits:  99%/3077  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/45  | Total:  4h 05m | Avg:  5m 27s | Max: 22m 02s
      🟩 GCC                Pass: 100%/52  | Total:  5h 43m | Avg:  6m 36s | Max: 29m 45s
      🟩 Intel              Pass: 100%/3   | Total: 17m 15s | Avg:  5m 45s | Max:  6m 04s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 02m | Avg: 15m 32s | Max: 19m 53s | Hits:  99%/11383 
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 11h 08m | Avg:  6m 25s | Max: 29m 45s | Hits:  99%/11383 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  8h 40m | Avg:  5m 25s | Max: 26m 15s | Hits:  99%/11383 
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 39m | Avg: 24m 47s | Max: 29m 45s
      🟩 Test               Pass: 100%/3   | Total: 47m 29s | Avg: 15m 49s | Max: 19m 12s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 16s | Avg:  2m 16s | Max:  2m 16s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 52m 26s | Avg: 17m 28s | Max: 26m 15s
      🟩 90a                Pass: 100%/4   | Total: 16m 26s | Avg:  4m 06s | Max:  4m 25s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 25m | Avg:  5m 11s | Max: 22m 40s
      🟩 14                 Pass: 100%/28  | Total:  2h 38m | Avg:  5m 40s | Max: 20m 37s | Hits:  99%/5397  
      🟩 17                 Pass: 100%/27  | Total:  3h 11m | Avg:  7m 06s | Max: 29m 45s | Hits:  99%/2909  
      🟩 20                 Pass: 100%/20  | Total:  2h 50m | Avg:  8m 32s | Max: 29m 27s | Hits:  99%/3077  
    
  • 🟩 thrust: Pass: 100%/103 | Total: 10h 29m | Avg: 6m 06s | Max: 24m 26s | Hits: 99%/11150

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total:  9h 52m | Avg:  6m 14s | Max: 24m 26s | Hits:  99%/11150 
      🟩 arm64              Pass: 100%/8   | Total: 36m 19s | Avg:  4m 32s | Max:  4m 48s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 16m | Avg:  5m 07s | Max: 19m 02s | Hits:  99%/2230  
      🟩 11.8               Pass: 100%/3   | Total: 15m 27s | Avg:  5m 09s | Max:  5m 23s
      🟩 12.6               Pass: 100%/85  | Total:  8h 56m | Avg:  6m 18s | Max: 24m 26s | Hits:  99%/8920  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  5m 13s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 16m | Avg:  5m 07s | Max: 19m 02s | Hits:  99%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 15m 27s | Avg:  5m 09s | Max:  5m 23s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  8h 46m | Avg:  6m 20s | Max: 24m 26s | Hits:  99%/8920  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  5m 13s
      🟩 nvcc               Pass: 100%/101 | Total: 10h 19m | Avg:  6m 07s | Max: 24m 26s | Hits:  99%/11150 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 30m 33s | Avg:  5m 05s | Max:  5m 54s
      🟩 Clang10            Pass: 100%/3   | Total: 19m 13s | Avg:  6m 24s | Max:  6m 54s
      🟩 Clang11            Pass: 100%/4   | Total: 20m 10s | Avg:  5m 02s | Max:  5m 16s
      🟩 Clang12            Pass: 100%/4   | Total: 19m 46s | Avg:  4m 56s | Max:  5m 14s
      🟩 Clang13            Pass: 100%/4   | Total: 19m 34s | Avg:  4m 53s | Max:  5m 06s
      🟩 Clang14            Pass: 100%/4   | Total: 19m 41s | Avg:  4m 55s | Max:  4m 59s
      🟩 Clang15            Pass: 100%/4   | Total: 20m 28s | Avg:  5m 07s | Max:  5m 21s
      🟩 Clang16            Pass: 100%/4   | Total: 20m 32s | Avg:  5m 08s | Max:  5m 37s
      🟩 Clang17            Pass: 100%/4   | Total: 20m 01s | Avg:  5m 00s | Max:  5m 45s
      🟩 Clang18            Pass: 100%/9   | Total: 58m 10s | Avg:  6m 27s | Max: 19m 04s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 34s | Avg:  4m 17s | Max:  4m 28s
      🟩 GCC7               Pass: 100%/6   | Total: 26m 03s | Avg:  4m 20s | Max:  5m 07s
      🟩 GCC8               Pass: 100%/6   | Total: 25m 53s | Avg:  4m 18s | Max:  4m 37s
      🟩 GCC9               Pass: 100%/6   | Total: 26m 33s | Avg:  4m 25s | Max:  5m 01s
      🟩 GCC10              Pass: 100%/4   | Total: 20m 20s | Avg:  5m 05s | Max:  5m 29s
      🟩 GCC11              Pass: 100%/7   | Total: 36m 14s | Avg:  5m 10s | Max:  5m 39s
      🟩 GCC12              Pass: 100%/4   | Total: 21m 04s | Avg:  5m 16s | Max:  5m 42s
      🟩 GCC13              Pass: 100%/14  | Total:  1h 43m | Avg:  7m 25s | Max: 16m 30s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 18m 02s | Avg:  6m 00s | Max:  6m 09s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 19m 02s | Avg: 19m 02s | Max: 19m 02s | Hits:  99%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 33m 24s | Avg: 16m 42s | Max: 16m 48s | Hits:  99%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 41m 54s | Avg: 20m 57s | Max: 24m 26s | Hits:  99%/4460  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  4h 08m | Avg:  5m 23s | Max: 19m 04s
      🟩 GCC                Pass: 100%/49  | Total:  4h 28m | Avg:  5m 28s | Max: 16m 30s
      🟩 Intel              Pass: 100%/3   | Total: 18m 02s | Avg:  6m 00s | Max:  6m 09s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 34m | Avg: 18m 52s | Max: 24m 26s | Hits:  99%/11150 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total: 10h 29m | Avg:  6m 06s | Max: 24m 26s | Hits:  99%/11150 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  8h 42m | Avg:  5m 26s | Max: 19m 02s | Hits:  99%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 55m 01s | Avg: 13m 45s | Max: 24m 26s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total: 52m 01s | Avg: 17m 20s | Max: 19m 04s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 15m 27s | Avg:  5m 09s | Max:  5m 23s
      🟩 90a                Pass: 100%/4   | Total: 17m 14s | Avg:  4m 18s | Max:  4m 27s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 33m | Avg:  5m 27s | Max: 16m 27s
      🟩 14                 Pass: 100%/27  | Total:  2h 37m | Avg:  5m 51s | Max: 19m 02s | Hits:  99%/4460  
      🟩 17                 Pass: 100%/26  | Total:  2h 23m | Avg:  5m 32s | Max: 16m 48s | Hits:  99%/2230  
      🟩 20                 Pass: 100%/22  | Total:  2h 54m | Avg:  7m 55s | Max: 24m 26s | Hits:  99%/4460  
    
  • 🟩 cudax: Pass: 100%/52 | Total: 2h 36m | Avg: 3m 00s | Max: 10m 43s | Hits: 90%/222

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  2h 28m | Avg:  3m 05s | Max: 10m 43s | Hits:  90%/222   
      🟩 arm64              Pass: 100%/4   | Total:  8m 08s | Avg:  2m 02s | Max:  2m 18s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total: 59m 12s | Avg:  3m 06s | Max: 10m 43s | Hits:  90%/111   
      🟩 12.6               Pass: 100%/33  | Total:  1h 37m | Avg:  2m 57s | Max: 10m 36s | Hits:  90%/111   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total: 59m 12s | Avg:  3m 06s | Max: 10m 43s | Hits:  90%/111   
      🟩 nvcc12.6           Pass: 100%/33  | Total:  1h 37m | Avg:  2m 57s | Max: 10m 36s | Hits:  90%/111   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/52  | Total:  2h 36m | Avg:  3m 00s | Max: 10m 43s | Hits:  90%/222   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  5m 22s | Avg:  2m 41s | Max:  2m 45s
      🟩 Clang10            Pass: 100%/2   | Total:  4m 42s | Avg:  2m 21s | Max:  2m 24s
      🟩 Clang11            Pass: 100%/4   | Total:  9m 33s | Avg:  2m 23s | Max:  2m 39s
      🟩 Clang12            Pass: 100%/4   | Total: 10m 13s | Avg:  2m 33s | Max:  2m 50s
      🟩 Clang13            Pass: 100%/4   | Total: 11m 21s | Avg:  2m 50s | Max:  3m 06s
      🟩 Clang14            Pass: 100%/4   | Total: 11m 09s | Avg:  2m 47s | Max:  3m 56s
      🟩 Clang15            Pass: 100%/2   | Total:  5m 38s | Avg:  2m 49s | Max:  2m 50s
      🟩 Clang16            Pass: 100%/4   | Total:  9m 32s | Avg:  2m 23s | Max:  2m 55s
      🟩 Clang17            Pass: 100%/2   | Total:  5m 08s | Avg:  2m 34s | Max:  2m 55s
      🟩 Clang18            Pass: 100%/2   | Total:  6m 26s | Avg:  3m 13s | Max:  4m 03s
      🟩 GCC9               Pass: 100%/2   | Total:  5m 11s | Avg:  2m 35s | Max:  2m 58s
      🟩 GCC10              Pass: 100%/4   | Total:  9m 19s | Avg:  2m 19s | Max:  2m 35s
      🟩 GCC11              Pass: 100%/4   | Total: 11m 16s | Avg:  2m 49s | Max:  2m 56s
      🟩 GCC12              Pass: 100%/7   | Total: 23m 56s | Avg:  3m 25s | Max:  5m 45s
      🟩 GCC13              Pass: 100%/3   | Total:  6m 36s | Avg:  2m 12s | Max:  2m 52s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 43s | Avg: 10m 43s | Max: 10m 43s | Hits:  90%/111   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 36s | Avg: 10m 36s | Max: 10m 36s | Hits:  90%/111   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 19m | Avg:  2m 38s | Max:  4m 03s
      🟩 GCC                Pass: 100%/20  | Total: 56m 18s | Avg:  2m 48s | Max:  5m 45s
      🟩 MSVC               Pass: 100%/2   | Total: 21m 19s | Avg: 10m 39s | Max: 10m 43s | Hits:  90%/222   
    🟩 gpu
      🟩 v100               Pass: 100%/52  | Total:  2h 36m | Avg:  3m 00s | Max: 10m 43s | Hits:  90%/222   
    🟩 jobs
      🟩 Build              Pass: 100%/47  | Total:  2h 14m | Avg:  2m 51s | Max: 10m 43s | Hits:  90%/222   
      🟩 Test               Pass: 100%/5   | Total: 22m 24s | Avg:  4m 28s | Max:  5m 45s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 49s | Avg:  2m 49s | Max:  2m 49s
      🟩 90a                Pass: 100%/1   | Total:  2m 52s | Avg:  2m 52s | Max:  2m 52s
    🟩 std
      🟩 17                 Pass: 100%/28  | Total:  1h 14m | Avg:  2m 39s | Max:  4m 23s
      🟩 20                 Pass: 100%/24  | Total:  1h 22m | Avg:  3m 25s | Max: 10m 43s | Hits:  90%/222   
    
  • 🟩 cccl: Pass: 100%/4 | Total: 16m 47s | Avg: 4m 11s | Max: 4m 31s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 16m 47s | Avg:  4m 11s | Max:  4m 31s
    🟩 ctk
      🟩 11.1               Pass: 100%/2   | Total:  8m 01s | Avg:  4m 00s | Max:  4m 03s
      🟩 12.6               Pass: 100%/2   | Total:  8m 46s | Avg:  4m 23s | Max:  4m 31s
    🟩 cudacxx
      🟩 nvcc11.1           Pass: 100%/2   | Total:  8m 01s | Avg:  4m 00s | Max:  4m 03s
      🟩 nvcc12.6           Pass: 100%/2   | Total:  8m 46s | Avg:  4m 23s | Max:  4m 31s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 16m 47s | Avg:  4m 11s | Max:  4m 31s
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  4m 03s | Avg:  4m 03s | Max:  4m 03s
      🟩 Clang18            Pass: 100%/1   | Total:  4m 31s | Avg:  4m 31s | Max:  4m 31s
      🟩 GCC6               Pass: 100%/1   | Total:  3m 58s | Avg:  3m 58s | Max:  3m 58s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 15s | Avg:  4m 15s | Max:  4m 15s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  8m 34s | Avg:  4m 17s | Max:  4m 31s
      🟩 GCC                Pass: 100%/2   | Total:  8m 13s | Avg:  4m 06s | Max:  4m 15s
    🟩 gpu
      🟩 v100               Pass: 100%/4   | Total: 16m 47s | Avg:  4m 11s | Max:  4m 31s
    🟩 jobs
      🟩 Infra              Pass: 100%/4   | Total: 16m 47s | Avg:  4m 11s | Max:  4m 31s
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 16m 04s | Avg: 16m 04s | Max: 16m 04s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 16m 04s | Avg: 16m 04s | Max: 16m 04s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 16m 04s | Avg: 16m 04s | Max: 16m 04s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 16m 04s | Avg: 16m 04s | Max: 16m 04s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 16m 04s | Avg: 16m 04s | Max: 16m 04s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 16m 04s | Avg: 16m 04s | Max: 16m 04s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 16m 04s | Avg: 16m 04s | Max: 16m 04s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 16m 04s | Avg: 16m 04s | Max: 16m 04s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 16m 04s | Avg: 16m 04s | Max: 16m 04s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
libcu++
CUB
Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 368)

# Runner
297 linux-amd64-cpu16
28 linux-amd64-gpu-v100-latest-1
28 linux-arm64-cpu16
15 windows-amd64-cpu16

@miscco
Copy link
Collaborator

miscco commented Sep 28, 2024

Thanks a lot for the improvement 🎉

@miscco miscco merged commit e3800d7 into NVIDIA:main Sep 28, 2024
382 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

3 participants