Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add benchmarking and tuning presets #2856

Merged
merged 1 commit into from
Nov 18, 2024

Conversation

bernhardmgruber
Copy link
Contributor

@bernhardmgruber bernhardmgruber commented Nov 18, 2024

This PR contains two drive-by fixes to CMake.

Fixes: #2839

@bernhardmgruber bernhardmgruber marked this pull request as ready for review November 18, 2024 13:56
@bernhardmgruber bernhardmgruber requested review from a team as code owners November 18, 2024 13:56
@bernhardmgruber bernhardmgruber added the infrastructure Shared CMake, github, etc infrastructure label Nov 18, 2024
Copy link
Contributor

@miscco miscco left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is soo much better.

Thanks a lot for cleaning this up

Copy link
Contributor

🟩 CI finished in 1h 29m: Pass: 100%/400 | Total: 1d 21h | Avg: 6m 48s | Max: 56m 49s | Hits: 88%/25890
  • 🟩 libcudacxx: Pass: 100%/118 | Total: 14h 48m | Avg: 7m 31s | Max: 38m 31s | Hits: 69%/9500

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total: 14h 13m | Avg:  7m 45s | Max: 38m 31s | Hits:  69%/9500  
      🟩 arm64              Pass: 100%/8   | Total: 35m 48s | Avg:  4m 28s | Max: 11m 11s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 16m | Avg:  5m 04s | Max: 33m 36s | Hits:  34%/2181  
      🟩 11.8               Pass: 100%/3   | Total:  9m 37s | Avg:  3m 12s | Max:  3m 21s
      🟩 12.5               Pass: 100%/4   | Total:  1h 07m | Avg: 16m 49s | Max: 25m 11s
      🟩 12.6               Pass: 100%/96  | Total: 12h 15m | Avg:  7m 39s | Max: 38m 31s | Hits:  79%/7319  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/12  | Total:  2h 27m | Avg: 12m 15s | Max: 21m 09s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 16m | Avg:  5m 04s | Max: 33m 36s | Hits:  34%/2181  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  9m 37s | Avg:  3m 12s | Max:  3m 21s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  1h 07m | Avg: 16m 49s | Max: 25m 11s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  9h 48m | Avg:  7m 00s | Max: 38m 31s | Hits:  79%/7319  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/12  | Total:  2h 27m | Avg: 12m 15s | Max: 21m 09s
      🟩 nvcc               Pass: 100%/106 | Total: 12h 21m | Avg:  6m 59s | Max: 38m 31s | Hits:  69%/9500  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 26m 58s | Avg:  4m 29s | Max:  5m 30s
      🟩 Clang10            Pass: 100%/3   | Total: 16m 15s | Avg:  5m 25s | Max:  6m 21s
      🟩 Clang11            Pass: 100%/4   | Total: 16m 49s | Avg:  4m 12s | Max:  4m 29s
      🟩 Clang12            Pass: 100%/4   | Total: 17m 08s | Avg:  4m 17s | Max:  4m 48s
      🟩 Clang13            Pass: 100%/4   | Total: 28m 49s | Avg:  7m 12s | Max: 15m 46s
      🟩 Clang14            Pass: 100%/4   | Total: 19m 04s | Avg:  4m 46s | Max:  5m 34s
      🟩 Clang15            Pass: 100%/4   | Total: 32m 43s | Avg:  8m 10s | Max: 20m 21s
      🟩 Clang16            Pass: 100%/4   | Total: 18m 08s | Avg:  4m 32s | Max:  5m 00s
      🟩 Clang17            Pass: 100%/4   | Total: 17m 12s | Avg:  4m 18s | Max:  4m 49s
      🟩 Clang18            Pass: 100%/18  | Total:  3h 05m | Avg: 10m 19s | Max: 21m 09s
      🟩 GCC6               Pass: 100%/2   | Total:  6m 00s | Avg:  3m 00s | Max:  3m 04s
      🟩 GCC7               Pass: 100%/6   | Total: 18m 35s | Avg:  3m 05s | Max:  3m 40s
      🟩 GCC8               Pass: 100%/6   | Total: 19m 05s | Avg:  3m 10s | Max:  3m 48s
      🟩 GCC9               Pass: 100%/6   | Total: 19m 36s | Avg:  3m 16s | Max:  4m 25s
      🟩 GCC10              Pass: 100%/4   | Total: 23m 27s | Avg:  5m 51s | Max: 12m 03s
      🟩 GCC11              Pass: 100%/7   | Total: 36m 18s | Avg:  5m 11s | Max: 13m 42s
      🟩 GCC12              Pass: 100%/4   | Total: 15m 08s | Avg:  3m 47s | Max:  4m 04s
      🟩 GCC13              Pass: 100%/17  | Total:  2h 39m | Avg:  9m 22s | Max: 25m 05s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 50m 16s | Avg: 16m 45s | Max: 31m 24s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 33m 36s | Avg: 33m 36s | Max: 33m 36s | Hits:  34%/2181  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 22m 48s | Avg: 11m 24s | Max: 12m 20s | Hits:  98%/4725  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 38m 31s | Avg: 38m 31s | Max: 38m 31s | Hits:  45%/2594  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  1h 07m | Avg: 16m 49s | Max: 25m 11s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/55  | Total:  6h 18m | Avg:  6m 53s | Max: 21m 09s
      🟩 GCC                Pass: 100%/52  | Total:  4h 57m | Avg:  5m 43s | Max: 25m 05s
      🟩 Intel              Pass: 100%/3   | Total: 50m 16s | Avg: 16m 45s | Max: 31m 24s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 34m | Avg: 23m 43s | Max: 38m 31s | Hits:  69%/9500  
      🟩 NVHPC              Pass: 100%/4   | Total:  1h 07m | Avg: 16m 49s | Max: 25m 11s
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total: 14h 48m | Avg:  7m 31s | Max: 38m 31s | Hits:  69%/9500  
    🟩 jobs
      🟩 Build              Pass: 100%/110 | Total: 12h 34m | Avg:  6m 51s | Max: 38m 31s | Hits:  69%/9500  
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 25m | Avg: 21m 23s | Max: 25m 05s
      🟩 Test               Pass: 100%/3   | Total: 46m 44s | Avg: 15m 34s | Max: 19m 37s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 01s | Avg:  2m 01s | Max:  2m 01s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  9m 37s | Avg:  3m 12s | Max:  3m 21s
      🟩 90                 Pass: 100%/4   | Total: 40m 58s | Avg: 10m 14s | Max: 11m 45s
      🟩 90a                Pass: 100%/8   | Total: 57m 55s | Avg:  7m 14s | Max: 13m 04s
    🟩 std
      🟩 11                 Pass: 100%/32  | Total:  3h 09m | Avg:  5m 56s | Max: 24m 28s
      🟩 14                 Pass: 100%/32  | Total:  3h 55m | Avg:  7m 20s | Max: 33m 36s | Hits:  67%/4465  
      🟩 17                 Pass: 100%/30  | Total:  3h 33m | Avg:  7m 07s | Max: 31m 24s | Hits:  97%/2441  
      🟩 20                 Pass: 100%/23  | Total:  4h 08m | Avg: 10m 47s | Max: 38m 31s | Hits:  45%/2594  
    
  • 🟩 cub: Pass: 100%/110 | Total: 12h 58m | Avg: 7m 04s | Max: 56m 49s | Hits: 99%/2964

    🟩 cpu
      🟩 amd64              Pass: 100%/102 | Total: 12h 20m | Avg:  7m 15s | Max: 56m 49s | Hits:  99%/2964  
      🟩 arm64              Pass: 100%/8   | Total: 38m 13s | Avg:  4m 46s | Max:  5m 43s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 12m | Avg:  4m 50s | Max: 12m 42s | Hits:  99%/741   
      🟩 11.8               Pass: 100%/3   | Total: 16m 35s | Avg:  5m 31s | Max:  5m 46s
      🟩 12.5               Pass: 100%/4   | Total: 35m 36s | Avg:  8m 54s | Max:  9m 08s
      🟩 12.6               Pass: 100%/88  | Total: 10h 54m | Avg:  7m 26s | Max: 56m 49s | Hits:  99%/2223  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 09m | Avg: 17m 17s | Max: 56m 49s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 12m | Avg:  4m 50s | Max: 12m 42s | Hits:  99%/741   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 16m 35s | Avg:  5m 31s | Max:  5m 46s
      🟩 nvcc12.5           Pass: 100%/4   | Total: 35m 36s | Avg:  8m 54s | Max:  9m 08s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  9h 45m | Avg:  6m 57s | Max: 33m 42s | Hits:  99%/2223  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 09m | Avg: 17m 17s | Max: 56m 49s
      🟩 nvcc               Pass: 100%/106 | Total: 11h 49m | Avg:  6m 41s | Max: 33m 42s | Hits:  99%/2964  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 30m 23s | Avg:  5m 03s | Max:  6m 11s
      🟩 Clang10            Pass: 100%/3   | Total: 18m 18s | Avg:  6m 06s | Max:  6m 36s
      🟩 Clang11            Pass: 100%/4   | Total: 20m 45s | Avg:  5m 11s | Max:  5m 35s
      🟩 Clang12            Pass: 100%/4   | Total: 20m 14s | Avg:  5m 03s | Max:  5m 20s
      🟩 Clang13            Pass: 100%/4   | Total: 19m 37s | Avg:  4m 54s | Max:  5m 02s
      🟩 Clang14            Pass: 100%/4   | Total: 22m 11s | Avg:  5m 32s | Max:  6m 15s
      🟩 Clang15            Pass: 100%/4   | Total: 22m 42s | Avg:  5m 40s | Max:  5m 58s
      🟩 Clang16            Pass: 100%/4   | Total: 21m 36s | Avg:  5m 24s | Max:  5m 34s
      🟩 Clang17            Pass: 100%/4   | Total: 21m 15s | Avg:  5m 18s | Max:  5m 37s
      🟩 Clang18            Pass: 100%/11  | Total:  2h 26m | Avg: 13m 21s | Max: 56m 49s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 33s | Avg:  4m 16s | Max:  4m 25s
      🟩 GCC7               Pass: 100%/6   | Total: 27m 25s | Avg:  4m 34s | Max:  4m 55s
      🟩 GCC8               Pass: 100%/6   | Total: 28m 14s | Avg:  4m 42s | Max:  5m 26s
      🟩 GCC9               Pass: 100%/6   | Total: 28m 08s | Avg:  4m 41s | Max:  5m 13s
      🟩 GCC10              Pass: 100%/4   | Total: 20m 43s | Avg:  5m 10s | Max:  5m 35s
      🟩 GCC11              Pass: 100%/7   | Total: 37m 10s | Avg:  5m 18s | Max:  5m 46s
      🟩 GCC12              Pass: 100%/4   | Total: 21m 15s | Avg:  5m 18s | Max:  5m 45s
      🟩 GCC13              Pass: 100%/16  | Total:  2h 44m | Avg: 10m 15s | Max: 22m 12s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 18m 34s | Avg:  6m 11s | Max:  6m 18s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 12m 42s | Avg: 12m 42s | Max: 12m 42s | Hits:  99%/741   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 21m 05s | Avg: 10m 32s | Max: 10m 53s | Hits:  99%/1482  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 30s | Avg: 11m 30s | Max: 11m 30s | Hits:  99%/741   
      🟩 NVHPC24.7          Pass: 100%/4   | Total: 35m 36s | Avg:  8m 54s | Max:  9m 08s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  5h 43m | Avg:  7m 09s | Max: 56m 49s
      🟩 GCC                Pass: 100%/51  | Total:  5h 35m | Avg:  6m 34s | Max: 22m 12s
      🟩 Intel              Pass: 100%/3   | Total: 18m 34s | Avg:  6m 11s | Max:  6m 18s
      🟩 MSVC               Pass: 100%/4   | Total: 45m 17s | Avg: 11m 19s | Max: 12m 42s | Hits:  99%/2964  
      🟩 NVHPC              Pass: 100%/4   | Total: 35m 36s | Avg:  8m 54s | Max:  9m 08s
    🟩 gpu
      🟩 v100               Pass: 100%/110 | Total: 12h 58m | Avg:  7m 04s | Max: 56m 49s | Hits:  99%/2964  
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total: 10h 08m | Avg:  5m 58s | Max: 56m 49s | Hits:  99%/2964  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 19m 27s | Avg: 19m 27s | Max: 19m 27s
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 50s | Avg: 16m 50s | Max: 16m 50s
      🟩 HostLaunch         Pass: 100%/3   | Total: 56m 40s | Avg: 18m 53s | Max: 20m 22s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 17m | Avg: 25m 47s | Max: 33m 42s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 16m 35s | Avg:  5m 31s | Max:  5m 46s
      🟩 90a                Pass: 100%/4   | Total: 16m 45s | Avg:  4m 11s | Max:  4m 19s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  3h 02m | Avg:  6m 04s | Max: 21m 27s
      🟩 14                 Pass: 100%/29  | Total:  2h 43m | Avg:  5m 37s | Max: 12m 42s | Hits:  99%/1482  
      🟩 17                 Pass: 100%/27  | Total:  3h 21m | Avg:  7m 26s | Max: 56m 49s | Hits:  99%/741   
      🟩 20                 Pass: 100%/24  | Total:  3h 52m | Avg:  9m 41s | Max: 33m 42s | Hits:  99%/741   
    
  • 🟩 thrust: Pass: 100%/109 | Total: 12h 33m | Avg: 6m 54s | Max: 24m 36s | Hits: 99%/13180

    🟩 cpu
      🟩 amd64              Pass: 100%/101 | Total: 11h 50m | Avg:  7m 01s | Max: 24m 36s | Hits:  99%/13180 
      🟩 arm64              Pass: 100%/8   | Total: 42m 50s | Avg:  5m 21s | Max:  6m 15s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 23m | Avg:  5m 33s | Max: 18m 39s | Hits:  99%/2636  
      🟩 11.8               Pass: 100%/3   | Total: 17m 22s | Avg:  5m 47s | Max:  6m 16s
      🟩 12.5               Pass: 100%/4   | Total:  1h 08m | Avg: 17m 04s | Max: 17m 54s
      🟩 12.6               Pass: 100%/87  | Total:  9h 44m | Avg:  6m 42s | Max: 24m 36s | Hits:  99%/10544 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 20m 44s | Avg:  5m 11s | Max:  5m 26s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 23m | Avg:  5m 33s | Max: 18m 39s | Hits:  99%/2636  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 17m 22s | Avg:  5m 47s | Max:  6m 16s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  1h 08m | Avg: 17m 04s | Max: 17m 54s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  9h 23m | Avg:  6m 47s | Max: 24m 36s | Hits:  99%/10544 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 20m 44s | Avg:  5m 11s | Max:  5m 26s
      🟩 nvcc               Pass: 100%/105 | Total: 12h 12m | Avg:  6m 58s | Max: 24m 36s | Hits:  99%/13180 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 35m 05s | Avg:  5m 50s | Max:  7m 11s
      🟩 Clang10            Pass: 100%/3   | Total: 22m 28s | Avg:  7m 29s | Max:  7m 59s
      🟩 Clang11            Pass: 100%/4   | Total: 22m 20s | Avg:  5m 35s | Max:  5m 59s
      🟩 Clang12            Pass: 100%/4   | Total: 22m 53s | Avg:  5m 43s | Max:  5m 59s
      🟩 Clang13            Pass: 100%/4   | Total: 23m 33s | Avg:  5m 53s | Max:  6m 08s
      🟩 Clang14            Pass: 100%/4   | Total: 22m 11s | Avg:  5m 32s | Max:  5m 50s
      🟩 Clang15            Pass: 100%/4   | Total: 23m 53s | Avg:  5m 58s | Max:  6m 22s
      🟩 Clang16            Pass: 100%/4   | Total: 23m 17s | Avg:  5m 49s | Max:  6m 13s
      🟩 Clang17            Pass: 100%/4   | Total: 23m 10s | Avg:  5m 47s | Max:  6m 28s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 06m | Avg:  6m 02s | Max: 11m 25s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 37s | Avg:  4m 18s | Max:  4m 19s
      🟩 GCC7               Pass: 100%/6   | Total: 29m 50s | Avg:  4m 58s | Max:  6m 07s
      🟩 GCC8               Pass: 100%/6   | Total: 29m 50s | Avg:  4m 58s | Max:  5m 33s
      🟩 GCC9               Pass: 100%/6   | Total: 32m 11s | Avg:  5m 21s | Max:  6m 32s
      🟩 GCC10              Pass: 100%/4   | Total: 24m 30s | Avg:  6m 07s | Max:  6m 48s
      🟩 GCC11              Pass: 100%/7   | Total: 41m 10s | Avg:  5m 52s | Max:  6m 16s
      🟩 GCC12              Pass: 100%/4   | Total: 24m 57s | Avg:  6m 14s | Max:  6m 34s
      🟩 GCC13              Pass: 100%/14  | Total:  1h 36m | Avg:  6m 52s | Max: 13m 33s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 22m 00s | Avg:  7m 20s | Max:  7m 58s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 18m 39s | Avg: 18m 39s | Max: 18m 39s | Hits:  99%/2636  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 29m 55s | Avg: 14m 57s | Max: 15m 01s | Hits:  99%/5272  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 41m 40s | Avg: 20m 50s | Max: 24m 36s | Hits:  99%/5272  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  1h 08m | Avg: 17m 04s | Max: 17m 54s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  4h 45m | Avg:  5m 56s | Max: 11m 25s
      🟩 GCC                Pass: 100%/49  | Total:  4h 47m | Avg:  5m 51s | Max: 13m 33s
      🟩 Intel              Pass: 100%/3   | Total: 22m 00s | Avg:  7m 20s | Max:  7m 58s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 30m | Avg: 18m 02s | Max: 24m 36s | Hits:  99%/13180 
      🟩 NVHPC              Pass: 100%/4   | Total:  1h 08m | Avg: 17m 04s | Max: 17m 54s
    🟩 gpu
      🟩 v100               Pass: 100%/109 | Total: 12h 33m | Avg:  6m 54s | Max: 24m 36s | Hits:  99%/13180 
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total: 11h 07m | Avg:  6m 32s | Max: 18m 39s | Hits:  99%/10544 
      🟩 TestCPU            Pass: 100%/4   | Total: 48m 55s | Avg: 12m 13s | Max: 24m 36s | Hits:  99%/2636  
      🟩 TestGPU            Pass: 100%/3   | Total: 37m 08s | Avg: 12m 22s | Max: 13m 33s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 17m 22s | Avg:  5m 47s | Max:  6m 16s
      🟩 90a                Pass: 100%/4   | Total: 19m 35s | Avg:  4m 53s | Max:  5m 25s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  2h 55m | Avg:  5m 51s | Max: 16m 26s
      🟩 14                 Pass: 100%/29  | Total:  3h 19m | Avg:  6m 51s | Max: 18m 39s | Hits:  99%/5272  
      🟩 17                 Pass: 100%/27  | Total:  3h 02m | Avg:  6m 45s | Max: 17m 54s | Hits:  99%/2636  
      🟩 20                 Pass: 100%/23  | Total:  3h 16m | Avg:  8m 31s | Max: 24m 36s | Hits:  99%/5272  
    
  • 🟩 cudax: Pass: 100%/54 | Total: 4h 07m | Avg: 4m 35s | Max: 20m 46s | Hits: 90%/246

    🟩 cpu
      🟩 amd64              Pass: 100%/50  | Total:  3h 57m | Avg:  4m 45s | Max: 20m 46s | Hits:  90%/246   
      🟩 arm64              Pass: 100%/4   | Total: 10m 16s | Avg:  2m 34s | Max:  2m 42s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  1h 31m | Avg:  4m 47s | Max: 20m 25s | Hits:  90%/123   
      🟩 12.5               Pass: 100%/2   | Total:  9m 37s | Avg:  4m 48s | Max:  4m 57s
      🟩 12.6               Pass: 100%/33  | Total:  2h 27m | Avg:  4m 27s | Max: 20m 46s | Hits:  90%/123   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  1h 31m | Avg:  4m 47s | Max: 20m 25s | Hits:  90%/123   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  9m 37s | Avg:  4m 48s | Max:  4m 57s
      🟩 nvcc12.6           Pass: 100%/33  | Total:  2h 27m | Avg:  4m 27s | Max: 20m 46s | Hits:  90%/123   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/54  | Total:  4h 07m | Avg:  4m 35s | Max: 20m 46s | Hits:  90%/246   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  6m 14s | Avg:  3m 07s | Max:  3m 22s
      🟩 Clang10            Pass: 100%/2   | Total:  6m 49s | Avg:  3m 24s | Max:  3m 46s
      🟩 Clang11            Pass: 100%/4   | Total: 11m 22s | Avg:  2m 50s | Max:  3m 09s
      🟩 Clang12            Pass: 100%/4   | Total: 11m 26s | Avg:  2m 51s | Max:  3m 10s
      🟩 Clang13            Pass: 100%/4   | Total: 11m 29s | Avg:  2m 52s | Max:  2m 57s
      🟩 Clang14            Pass: 100%/4   | Total: 29m 41s | Avg:  7m 25s | Max: 20m 25s
      🟩 Clang15            Pass: 100%/2   | Total:  6m 13s | Avg:  3m 06s | Max:  3m 11s
      🟩 Clang16            Pass: 100%/4   | Total: 11m 31s | Avg:  2m 52s | Max:  3m 22s
      🟩 Clang17            Pass: 100%/2   | Total:  6m 29s | Avg:  3m 14s | Max:  3m 27s
      🟩 Clang18            Pass: 100%/2   | Total: 23m 43s | Avg: 11m 51s | Max: 20m 46s
      🟩 GCC9               Pass: 100%/2   | Total:  5m 36s | Avg:  2m 48s | Max:  2m 57s
      🟩 GCC10              Pass: 100%/4   | Total: 11m 28s | Avg:  2m 52s | Max:  3m 00s
      🟩 GCC11              Pass: 100%/4   | Total: 11m 43s | Avg:  2m 55s | Max:  3m 08s
      🟩 GCC12              Pass: 100%/7   | Total:  1h 02m | Avg:  8m 54s | Max: 18m 51s
      🟩 GCC13              Pass: 100%/3   | Total:  7m 37s | Avg:  2m 32s | Max:  2m 38s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  7m 14s | Avg:  7m 14s | Max:  7m 14s | Hits:  90%/123   
      🟩 MSVC14.39          Pass: 100%/1   | Total:  7m 17s | Avg:  7m 17s | Max:  7m 17s | Hits:  90%/123   
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  9m 37s | Avg:  4m 48s | Max:  4m 57s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  2h 04m | Avg:  4m 09s | Max: 20m 46s
      🟩 GCC                Pass: 100%/20  | Total:  1h 38m | Avg:  4m 56s | Max: 18m 51s
      🟩 MSVC               Pass: 100%/2   | Total: 14m 31s | Avg:  7m 15s | Max:  7m 17s | Hits:  90%/246   
      🟩 NVHPC              Pass: 100%/2   | Total:  9m 37s | Avg:  4m 48s | Max:  4m 57s
    🟩 gpu
      🟩 v100               Pass: 100%/54  | Total:  4h 07m | Avg:  4m 35s | Max: 20m 46s | Hits:  90%/246   
    🟩 jobs
      🟩 Build              Pass: 100%/49  | Total:  2h 36m | Avg:  3m 11s | Max:  7m 17s | Hits:  90%/246   
      🟩 Test               Pass: 100%/5   | Total:  1h 31m | Avg: 18m 17s | Max: 20m 46s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 29s | Avg:  2m 29s | Max:  2m 29s
      🟩 90a                Pass: 100%/1   | Total:  2m 38s | Avg:  2m 38s | Max:  2m 38s
    🟩 std
      🟩 17                 Pass: 100%/29  | Total:  1h 56m | Avg:  4m 00s | Max: 18m 51s
      🟩 20                 Pass: 100%/25  | Total:  2h 11m | Avg:  5m 15s | Max: 20m 46s | Hits:  90%/246   
    
  • 🟩 cccl: Pass: 100%/6 | Total: 27m 57s | Avg: 4m 39s | Max: 5m 46s

    🟩 cpu
      🟩 amd64              Pass: 100%/6   | Total: 27m 57s | Avg:  4m 39s | Max:  5m 46s
    🟩 ctk
      🟩 11.1               Pass: 100%/2   | Total:  7m 53s | Avg:  3m 56s | Max:  4m 36s
      🟩 12.0               Pass: 100%/2   | Total:  9m 32s | Avg:  4m 46s | Max:  4m 56s
      🟩 12.6               Pass: 100%/2   | Total: 10m 32s | Avg:  5m 16s | Max:  5m 46s
    🟩 cudacxx
      🟩 nvcc11.1           Pass: 100%/2   | Total:  7m 53s | Avg:  3m 56s | Max:  4m 36s
      🟩 nvcc12.0           Pass: 100%/2   | Total:  9m 32s | Avg:  4m 46s | Max:  4m 56s
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 32s | Avg:  5m 16s | Max:  5m 46s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/6   | Total: 27m 57s | Avg:  4m 39s | Max:  5m 46s
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  4m 36s | Avg:  4m 36s | Max:  4m 36s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 36s | Avg:  4m 36s | Max:  4m 36s
      🟩 Clang18            Pass: 100%/1   | Total:  5m 46s | Avg:  5m 46s | Max:  5m 46s
      🟩 GCC6               Pass: 100%/1   | Total:  3m 17s | Avg:  3m 17s | Max:  3m 17s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 56s | Avg:  4m 56s | Max:  4m 56s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 46s | Avg:  4m 46s | Max:  4m 46s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/3   | Total: 14m 58s | Avg:  4m 59s | Max:  5m 46s
      🟩 GCC                Pass: 100%/3   | Total: 12m 59s | Avg:  4m 19s | Max:  4m 56s
    🟩 gpu
      🟩 v100               Pass: 100%/6   | Total: 27m 57s | Avg:  4m 39s | Max:  5m 46s
    🟩 jobs
      🟩 Infra              Pass: 100%/6   | Total: 27m 57s | Avg:  4m 39s | Max:  5m 46s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 31s | Avg: 4m 45s | Max: 7m 33s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 31s | Avg:  4m 45s | Max:  7m 33s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 31s | Avg:  4m 45s | Max:  7m 33s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 31s | Avg:  4m 45s | Max:  7m 33s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 31s | Avg:  4m 45s | Max:  7m 33s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 31s | Avg:  4m 45s | Max:  7m 33s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 31s | Avg:  4m 45s | Max:  7m 33s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 31s | Avg:  4m 45s | Max:  7m 33s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  1m 58s | Avg:  1m 58s | Max:  1m 58s
      🟩 Test               Pass: 100%/1   | Total:  7m 33s | Avg:  7m 33s | Max:  7m 33s
    
  • 🟩 python: Pass: 100%/1 | Total: 15m 04s | Avg: 15m 04s | Max: 15m 04s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 15m 04s | Avg: 15m 04s | Max: 15m 04s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 15m 04s | Avg: 15m 04s | Max: 15m 04s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 15m 04s | Avg: 15m 04s | Max: 15m 04s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 15m 04s | Avg: 15m 04s | Max: 15m 04s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 15m 04s | Avg: 15m 04s | Max: 15m 04s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 15m 04s | Avg: 15m 04s | Max: 15m 04s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 15m 04s | Avg: 15m 04s | Max: 15m 04s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 15m 04s | Avg: 15m 04s | Max: 15m 04s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 400)

# Runner
326 linux-amd64-cpu16
31 linux-amd64-gpu-v100-latest-1
28 linux-arm64-cpu16
15 windows-amd64-cpu16

@bernhardmgruber bernhardmgruber merged commit 7db6131 into NVIDIA:main Nov 18, 2024
416 checks passed
@bernhardmgruber bernhardmgruber deleted the bench_preset branch November 18, 2024 16:37
@gonidelis
Copy link
Member

gonidelis commented Nov 18, 2024

I am late to the party but I love this. Thanks!

trxcllnt pushed a commit to trxcllnt/cccl that referenced this pull request Nov 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
infrastructure Shared CMake, github, etc infrastructure
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

Create CMake presets for benchmarking and tuning
3 participants