Support FP16 traits on CTK 12.0 #3535

Merged: 3 commits, Jan 30, 2025

@bernhardmgruber (Contributor) commented Jan 27, 2025

libcu++ uses two sets of macros for the FP16 types: _CCCL_HAS_NV[FP|BF]16 indicates general availability (the CTK headers are present), while _LIBCUDACXX_HAS_NV[FP|BF]16 indicates that those types are actually supported in various places. This split leads to problematic results in downstream code.

For example, in the CUB test, _CCCL_HAS_NVFP16 is always defined for the supported CTKs, but _LIBCUDACXX_HAS_NVFP16 is defined only on CTK >= 12.2. Therefore, cuda::is_floating_point<__half> is false for CTK < 12.2 and true from CTK 12.2 onward. Any code relying on _CCCL_HAS_NVFP16 for __half support will therefore likely run into surprising results.

This PR enables all FP16 traits as soon as _CCCL_HAS_NVFP16 is defined.
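
The mismatch described above can be reproduced in miniature without the CTK. The following is a minimal sketch, not libcu++ code: `DEMO_HAS_NVFP16`, `DEMO_LIB_HAS_NVFP16`, `demo_half`, and `demo_is_floating_point` are hypothetical stand-ins for `_CCCL_HAS_NVFP16`, `_LIBCUDACXX_HAS_NVFP16`, `__half`, and `cuda::is_floating_point`. The configuration mimics the CTK < 12.2 case, where only the availability macro is defined.

```cpp
#include <type_traits>

// Hypothetical stand-in for _CCCL_HAS_NVFP16: "the FP16 headers are present".
#define DEMO_HAS_NVFP16 1
// Hypothetical stand-in for _LIBCUDACXX_HAS_NVFP16: "the library supports the type".
// Left undefined here to mimic the CTK < 12.2 situation from the description.
// #define DEMO_LIB_HAS_NVFP16 1

// Hypothetical stand-in for __half.
struct demo_half
{
  unsigned short bits;
};

// Hypothetical stand-in for cuda::is_floating_point.
template <class T>
struct demo_is_floating_point : std::is_floating_point<T>
{};

// The trait is only extended when the *library* macro is defined.
#if defined(DEMO_LIB_HAS_NVFP16)
template <>
struct demo_is_floating_point<demo_half> : std::true_type
{};
#endif

// Downstream code keys its expectations on the *availability* macro instead.
#if defined(DEMO_HAS_NVFP16)
constexpr bool downstream_expects_half = true;
#else
constexpr bool downstream_expects_half = false;
#endif

constexpr bool trait_reports_half = demo_is_floating_point<demo_half>::value;

// The two views disagree: downstream assumes support, the trait denies it.
static_assert(downstream_expects_half != trait_reports_half,
              "availability macro and trait disagree in this configuration");
```

Guarding the specialization with `DEMO_HAS_NVFP16` instead makes the two values agree, which corresponds to what this PR does for the real macros.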

copy-pr-bot bot commented Jan 27, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

@bernhardmgruber
Contributor Author

/ok to test

@@ -53,7 +53,7 @@ _CCCL_INLINE_VAR constexpr bool __is_extended_floating_point_v
# endif // !_CCCL_NO_INLINE_VARIABLES
#endif // !_CCCL_NO_VARIABLE_TEMPLATES

-#if defined(_LIBCUDACXX_HAS_NVFP16)
+#if defined(_CCCL_HAS_FP16)
Collaborator

This will not work, because above we guard with _LIBCUDACXX_HAS_NVFP16

Please also change the include guard

Contributor Author

> This will not work, because above we guard with _LIBCUDACXX_HAS_NVFP16

Fixed.

> Please also change the include guard

Can you elaborate? The include guard _LIBCUDACXX___TYPE_TRAITS_IS_EXTENDED_FLOATING_POINT_H looks good to me.

Collaborator

Oh I meant please also change the guard for the <cuda_fp16.h> include
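
The reviewer's point can be sketched as follows: the guard around the header include and the guard around the trait specialization have to use the same macro, otherwise the specialization may name a type that was never declared. This is a hedged illustration only. `DEMO_HAS_NVFP16`, `demo_half`, and `demo_is_extended_floating_point` are hypothetical stand-ins for `_CCCL_HAS_NVFP16`, `__half`, and the libcu++ trait, and the guarded struct definition plays the role of `#include <cuda_fp16.h>`.

```cpp
#include <type_traits>

// Hypothetical stand-in for _CCCL_HAS_NVFP16.
#define DEMO_HAS_NVFP16 1

// Plays the role of the guarded `#include <cuda_fp16.h>`:
// the type is only declared when this guard passes.
#if defined(DEMO_HAS_NVFP16)
struct demo_half
{
  unsigned short bits;
};
#endif

// Primary template: nothing is an extended floating-point type by default.
template <class T>
struct demo_is_extended_floating_point : std::false_type
{};

// Same macro as the "include" guard above. If this block were guarded by a
// macro that can be defined while the guard above is not, demo_half would be
// undeclared here and the translation unit would fail to compile.
#if defined(DEMO_HAS_NVFP16)
template <>
struct demo_is_extended_floating_point<demo_half> : std::true_type
{};
#endif

static_assert(!demo_is_extended_floating_point<float>::value,
              "float is not an extended floating-point type");
#if defined(DEMO_HAS_NVFP16)
static_assert(demo_is_extended_floating_point<demo_half>::value,
              "demo_half is recognized whenever it is declared");
#endif
```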

@bernhardmgruber
Contributor Author

/ok to test

@bernhardmgruber
Contributor Author

/ok to test

Contributor

🟨 CI finished in 5h 49m: Pass: 86%/153 | Total: 2d 00h | Avg: 18m 53s | Max: 1h 15m | Hits: 414%/8457
  • 🟨 libcudacxx: Pass: 81%/43 | Total: 10h 32m | Avg: 14m 42s | Max: 1h 15m

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  80%/41  | Total: 10h 23m | Avg: 15m 12s | Max:  1h 15m
      🟩 arm64              Pass: 100%/2   | Total:  9m 15s | Avg:  4m 37s | Max:  5m 42s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 09m | Avg: 17m 16s | Max: 22m 36s
      🔍 nvcc               Pass:  79%/39  | Total:  9h 23m | Avg: 14m 26s | Max:  1h 15m
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  78%/38  | Total:  7h 55m | Avg: 12m 31s | Max: 22m 36s
      🟩 NVRTC              Pass: 100%/2   | Total:  1h 00m | Avg: 30m 00s | Max: 36m 33s
      🟩 Test               Pass: 100%/2   | Total:  1h 34m | Avg: 47m 16s | Max:  1h 15m
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 59s | Avg:  1m 59s | Max:  1m 59s
    🟨 ctk
      🟥 12.0               Pass:   0%/5   | Total: 57m 04s | Avg: 11m 24s | Max: 20m 46s
      🟩 12.5               Pass: 100%/2   | Total: 19m 57s | Avg:  9m 58s | Max: 11m 09s
      🟨 12.6               Pass:  91%/36  | Total:  9h 15m | Avg: 15m 25s | Max:  1h 15m
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 09m | Avg: 17m 16s | Max: 22m 36s
      🟥 nvcc12.0           Pass:   0%/5   | Total: 57m 04s | Avg: 11m 24s | Max: 20m 46s
      🟩 nvcc12.5           Pass: 100%/2   | Total: 19m 57s | Avg:  9m 58s | Max: 11m 09s
      🟨 nvcc12.6           Pass:  90%/32  | Total:  8h 06m | Avg: 15m 12s | Max:  1h 15m
    🟨 cxx
      🟨 Clang14            Pass:  50%/4   | Total:  1h 14m | Avg: 18m 34s | Max: 20m 46s
      🟩 Clang15            Pass: 100%/2   | Total: 25m 17s | Avg: 12m 38s | Max: 20m 32s
      🟩 Clang16            Pass: 100%/2   | Total: 34m 48s | Avg: 17m 24s | Max: 17m 39s
      🟩 Clang17            Pass: 100%/2   | Total: 30m 56s | Avg: 15m 28s | Max: 18m 05s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 56m | Avg: 14m 33s | Max: 22m 36s
      🟨 GCC7               Pass:  50%/2   | Total:  5m 59s | Avg:  2m 59s | Max:  3m 57s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 56s | Avg:  3m 56s | Max:  3m 56s
      🟨 GCC9               Pass:  50%/2   | Total:  5m 39s | Avg:  2m 49s | Max:  3m 41s
      🟩 GCC10              Pass: 100%/2   | Total: 25m 13s | Avg: 12m 36s | Max: 21m 39s
      🟩 GCC11              Pass: 100%/2   | Total: 31m 31s | Avg: 15m 45s | Max: 16m 40s
      🟩 GCC12              Pass: 100%/2   | Total: 19m 52s | Avg:  9m 56s | Max: 16m 02s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 04m | Avg: 23m 01s | Max:  1h 15m
      🟥 MSVC14.29          Pass:   0%/2   | Total: 26m 06s | Avg: 13m 03s | Max: 13m 37s
      🟥 MSVC14.39          Pass:   0%/2   | Total: 28m 18s | Avg: 14m 09s | Max: 14m 54s
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 19m 57s | Avg:  9m 58s | Max: 11m 09s
    🟨 cxx_family
      🟨 Clang              Pass:  88%/18  | Total:  4h 41m | Avg: 15m 39s | Max: 22m 36s
      🟨 GCC                Pass:  89%/19  | Total:  4h 36m | Avg: 14m 32s | Max:  1h 15m
      🟥 MSVC               Pass:   0%/4   | Total: 54m 24s | Avg: 13m 36s | Max: 14m 54s
      🟩 NVHPC              Pass: 100%/2   | Total: 19m 57s | Avg:  9m 58s | Max: 11m 09s
    🟨 gpu
      🟨 v100               Pass:  81%/43  | Total: 10h 32m | Avg: 14m 42s | Max:  1h 15m
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 38s | Avg: 13m 38s | Max: 13m 38s
      🟩 90a                Pass: 100%/2   | Total: 17m 27s | Avg:  8m 43s | Max: 13m 15s
    🟨 std
      🟨 17                 Pass:  71%/21  | Total:  4h 00m | Avg: 11m 27s | Max: 23m 28s
      🟨 20                 Pass:  90%/21  | Total:  6h 29m | Avg: 18m 34s | Max:  1h 15m
    
  • 🟨 thrust: Pass: 86%/43 | Total: 10h 01m | Avg: 13m 59s | Max: 45m 34s | Hits: 355%/5532

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  85%/41  | Total:  9h 39m | Avg: 14m 08s | Max: 45m 34s | Hits: 355%/5532  
      🟩 arm64              Pass: 100%/2   | Total: 21m 42s | Avg: 10m 51s | Max: 12m 01s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 21m 53s | Avg: 10m 56s | Max: 11m 57s
      🔍 nvcc               Pass:  85%/41  | Total:  9h 39m | Avg: 14m 08s | Max: 45m 34s | Hits: 355%/5532  
    🟨 ctk
      🟥 12.0               Pass:   0%/5   | Total:  1h 04m | Avg: 12m 53s | Max: 45m 34s
      🟩 12.5               Pass: 100%/2   | Total: 44m 09s | Avg: 22m 04s | Max: 23m 16s
      🟨 12.6               Pass:  97%/36  | Total:  8h 13m | Avg: 13m 41s | Max: 36m 58s | Hits: 355%/5532  
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 21m 53s | Avg: 10m 56s | Max: 11m 57s
      🟥 nvcc12.0           Pass:   0%/5   | Total:  1h 04m | Avg: 12m 53s | Max: 45m 34s
      🟩 nvcc12.5           Pass: 100%/2   | Total: 44m 09s | Avg: 22m 04s | Max: 23m 16s
      🟨 nvcc12.6           Pass:  97%/34  | Total:  7h 51m | Avg: 13m 51s | Max: 36m 58s | Hits: 355%/5532  
    🟨 cxx
      🟨 Clang14            Pass:  50%/4   | Total: 30m 04s | Avg:  7m 31s | Max: 10m 45s
      🟩 Clang15            Pass: 100%/2   | Total: 20m 46s | Avg: 10m 23s | Max: 11m 13s
      🟩 Clang16            Pass: 100%/2   | Total: 20m 25s | Avg: 10m 12s | Max: 10m 25s
      🟩 Clang17            Pass: 100%/2   | Total: 20m 14s | Avg: 10m 07s | Max: 10m 31s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 18m | Avg: 11m 14s | Max: 18m 36s
      🟨 GCC7               Pass:  50%/2   | Total: 14m 27s | Avg:  7m 13s | Max:  9m 55s
      🟩 GCC8               Pass: 100%/1   | Total:  9m 41s | Avg:  9m 41s | Max:  9m 41s
      🟨 GCC9               Pass:  50%/2   | Total: 16m 16s | Avg:  8m 08s | Max: 11m 33s
      🟩 GCC10              Pass: 100%/2   | Total: 24m 26s | Avg: 12m 13s | Max: 13m 18s
      🟩 GCC11              Pass: 100%/2   | Total: 21m 04s | Avg: 10m 32s | Max: 10m 45s
      🟩 GCC12              Pass: 100%/2   | Total: 23m 17s | Avg: 11m 38s | Max: 11m 47s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 40m | Avg: 12m 35s | Max: 20m 24s
      🟨 MSVC14.29          Pass:  50%/2   | Total:  1h 17m | Avg: 38m 31s | Max: 45m 34s | Hits: 355%/1844  
      🟨 MSVC14.39          Pass:  66%/3   | Total:  1h 40m | Avg: 33m 27s | Max: 36m 58s | Hits: 355%/3688  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 44m 09s | Avg: 22m 04s | Max: 23m 16s
    🟨 cxx_family
      🟨 Clang              Pass:  88%/17  | Total:  2h 50m | Avg: 10m 00s | Max: 18m 36s
      🟨 GCC                Pass:  89%/19  | Total:  3h 29m | Avg: 11m 02s | Max: 20m 24s
      🟨 MSVC               Pass:  60%/5   | Total:  2h 57m | Avg: 35m 29s | Max: 45m 34s | Hits: 355%/5532  
      🟩 NVHPC              Pass: 100%/2   | Total: 44m 09s | Avg: 22m 04s | Max: 23m 16s
    🟨 jobs
      🟨 Build              Pass:  86%/37  | Total:  8h 15m | Avg: 13m 23s | Max: 45m 34s | Hits: 355%/5532  
      🟨 TestCPU            Pass:  66%/3   | Total: 47m 20s | Avg: 15m 46s | Max: 31m 25s
      🟩 TestGPU            Pass: 100%/3   | Total: 58m 32s | Avg: 19m 30s | Max: 20m 24s
    🟨 gpu
      🟨 v100               Pass:  86%/43  | Total: 10h 01m | Avg: 13m 59s | Max: 45m 34s | Hits: 355%/5532  
    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 30m 35s | Avg: 15m 17s | Max: 20m 24s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  6m 51s | Avg:  6m 51s | Max:  6m 51s
    🟨 std
      🟨 17                 Pass:  80%/20  | Total:  4h 47m | Avg: 14m 21s | Max: 45m 34s | Hits: 355%/3688  
      🟨 20                 Pass:  90%/21  | Total:  4h 43m | Avg: 13m 31s | Max: 36m 58s | Hits: 355%/1844  
    
  • 🟨 cub: Pass: 88%/44 | Total: 1d 00h | Avg: 33m 38s | Max: 59m 13s | Hits: 539%/2664

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  88%/42  | Total: 23h 14m | Avg: 33m 11s | Max: 59m 13s | Hits: 539%/2664  
      🟩 arm64              Pass: 100%/2   | Total:  1h 26m | Avg: 43m 10s | Max: 43m 13s
    🚨 ctk: 12.0 🚨
      🔥 12.0               Pass:   0%/5   | Total:  3h 08m | Avg: 37m 43s | Max: 59m 13s
      🟩 12.5               Pass: 100%/2   | Total:  1h 22m | Avg: 41m 14s | Max: 43m 02s
      🟩 12.6               Pass: 100%/37  | Total: 20h 09m | Avg: 32m 41s | Max: 51m 40s | Hits: 539%/2664  
    🚨 cudacxx: nvcc12.0 🚨
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 42m | Avg: 51m 22s | Max: 51m 40s
      🔥 nvcc12.0           Pass:   0%/5   | Total:  3h 08m | Avg: 37m 43s | Max: 59m 13s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 22m | Avg: 41m 14s | Max: 43m 02s
      🟩 nvcc12.6           Pass: 100%/35  | Total: 18h 26m | Avg: 31m 37s | Max: 43m 13s | Hits: 539%/2664  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 42m | Avg: 51m 22s | Max: 51m 40s
      🔍 nvcc               Pass:  88%/42  | Total: 22h 57m | Avg: 32m 48s | Max: 59m 13s | Hits: 539%/2664  
    🔍 gpu: v100 🔍
      🟩 h100               Pass: 100%/2   | Total: 24m 07s | Avg: 12m 03s | Max: 19m 32s
      🔍 v100               Pass:  88%/42  | Total:  1d 00h | Avg: 34m 40s | Max: 59m 13s | Hits: 539%/2664  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  86%/37  | Total: 22h 09m | Avg: 35m 55s | Max: 59m 13s | Hits: 539%/2664  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 19m 23s | Avg: 19m 23s | Max: 19m 23s
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 42s | Avg: 17m 42s | Max: 17m 42s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 08m | Avg: 22m 42s | Max: 25m 56s
      🟩 TestGPU            Pass: 100%/2   | Total: 46m 09s | Avg: 23m 04s | Max: 23m 41s
    🟨 cxx
      🟨 Clang14            Pass:  50%/4   | Total:  2h 24m | Avg: 36m 04s | Max: 38m 57s
      🟩 Clang15            Pass: 100%/2   | Total:  1h 10m | Avg: 35m 02s | Max: 35m 48s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 17m | Avg: 38m 31s | Max: 38m 37s
      🟩 Clang17            Pass: 100%/2   | Total:  1h 11m | Avg: 35m 44s | Max: 36m 52s
      🟩 Clang18            Pass: 100%/7   | Total:  4h 22m | Avg: 37m 32s | Max: 51m 40s
      🟨 GCC7               Pass:  50%/2   | Total:  1h 05m | Avg: 32m 48s | Max: 34m 07s
      🟩 GCC8               Pass: 100%/1   | Total: 35m 16s | Avg: 35m 16s | Max: 35m 16s
      🟨 GCC9               Pass:  50%/2   | Total:  1h 13m | Avg: 36m 31s | Max: 42m 00s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 11m | Avg: 35m 49s | Max: 37m 16s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 14m | Avg: 37m 09s | Max: 37m 37s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 35m | Avg: 23m 47s | Max: 35m 47s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 27m | Avg: 25m 52s | Max: 43m 08s
      🟨 MSVC14.29          Pass:  50%/2   | Total:  1h 29m | Avg: 44m 38s | Max: 59m 13s | Hits: 539%/888   
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 00m | Avg: 30m 28s | Max: 31m 15s | Hits: 539%/1776  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 22m | Avg: 41m 14s | Max: 43m 02s
    🟨 cxx_family
      🟨 Clang              Pass:  88%/17  | Total: 10h 25m | Avg: 36m 48s | Max: 51m 40s
      🟨 GCC                Pass:  90%/21  | Total: 10h 22m | Avg: 29m 37s | Max: 43m 08s
      🟨 MSVC               Pass:  75%/4   | Total:  2h 30m | Avg: 37m 33s | Max: 59m 13s | Hits: 539%/2664  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 22m | Avg: 41m 14s | Max: 43m 02s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 24m 07s | Avg: 12m 03s | Max: 19m 32s
      🟩 90a                Pass: 100%/1   | Total:  4m 08s | Avg:  4m 08s | Max:  4m 08s
    🟨 std
      🟨 17                 Pass:  80%/20  | Total: 12h 23m | Avg: 37m 10s | Max: 59m 13s | Hits: 539%/1776  
      🟨 20                 Pass:  95%/24  | Total: 12h 16m | Avg: 30m 42s | Max: 51m 04s | Hits: 539%/888   
    
  • 🟨 cudax: Pass: 95%/20 | Total: 2h 00m | Avg: 6m 01s | Max: 18m 57s | Hits: 383%/261

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  93%/16  | Total:  1h 49m | Avg:  6m 52s | Max: 18m 57s | Hits: 383%/261   
      🟩 arm64              Pass: 100%/4   | Total: 10m 34s | Avg:  2m 38s | Max:  2m 42s
    🚨 ctk: 12.0 🚨
      🔥 12.0               Pass:   0%/1   | Total: 13m 13s | Avg: 13m 13s | Max: 13m 13s
      🟩 12.5               Pass: 100%/2   | Total: 12m 35s | Avg:  6m 17s | Max:  6m 27s
      🟩 12.6               Pass: 100%/17  | Total:  1h 34m | Avg:  5m 34s | Max: 18m 57s | Hits: 383%/261   
    🚨 cudacxx: nvcc12.0 🚨
      🔥 nvcc12.0           Pass:   0%/1   | Total: 13m 13s | Avg: 13m 13s | Max: 13m 13s
      🟩 nvcc12.5           Pass: 100%/2   | Total: 12m 35s | Avg:  6m 17s | Max:  6m 27s
      🟩 nvcc12.6           Pass: 100%/17  | Total:  1h 34m | Avg:  5m 34s | Max: 18m 57s | Hits: 383%/261   
    🚨 cxx: MSVC14.36 🚨
      🟩 Clang14            Pass: 100%/1   | Total:  3m 43s | Avg:  3m 43s | Max:  3m 43s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 02s | Avg:  4m 02s | Max:  4m 02s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 43s | Avg:  3m 43s | Max:  3m 43s
      🟩 Clang17            Pass: 100%/1   | Total:  4m 11s | Avg:  4m 11s | Max:  4m 11s
      🟩 Clang18            Pass: 100%/4   | Total: 25m 40s | Avg:  6m 25s | Max: 16m 21s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 39s | Avg:  3m 39s | Max:  3m 39s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 54s | Avg:  3m 54s | Max:  3m 54s
      🟩 GCC12              Pass: 100%/2   | Total: 23m 16s | Avg: 11m 38s | Max: 18m 57s
      🟩 GCC13              Pass: 100%/4   | Total: 10m 22s | Avg:  2m 35s | Max:  2m 38s
      🔥 MSVC14.36          Pass:   0%/1   | Total: 13m 13s | Avg: 13m 13s | Max: 13m 13s
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 09s | Avg: 12m 09s | Max: 12m 09s | Hits: 383%/261   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 12m 35s | Avg:  6m 17s | Max:  6m 27s
    🔍 cxx_family: MSVC 🔍
      🟩 Clang              Pass: 100%/8   | Total: 41m 19s | Avg:  5m 09s | Max: 16m 21s
      🟩 GCC                Pass: 100%/8   | Total: 41m 11s | Avg:  5m 08s | Max: 18m 57s
      🔍 MSVC               Pass:  50%/2   | Total: 25m 22s | Avg: 12m 41s | Max: 13m 13s | Hits: 383%/261   
      🟩 NVHPC              Pass: 100%/2   | Total: 12m 35s | Avg:  6m 17s | Max:  6m 27s
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  94%/18  | Total:  1h 25m | Avg:  4m 43s | Max: 13m 13s | Hits: 383%/261   
      🟩 Test               Pass: 100%/2   | Total: 35m 18s | Avg: 17m 39s | Max: 18m 57s
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/4   | Total: 14m 21s | Avg:  3m 35s | Max:  6m 27s
      🔍 20                 Pass:  93%/16  | Total:  1h 46m | Avg:  6m 37s | Max: 18m 57s | Hits: 383%/261   
    🟨 cudacxx_family
      🟨 nvcc               Pass:  95%/20  | Total:  2h 00m | Avg:  6m 01s | Max: 18m 57s | Hits: 383%/261   
    🟨 gpu
      🟨 v100               Pass:  95%/20  | Total:  2h 00m | Avg:  6m 01s | Max: 18m 57s | Hits: 383%/261   
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 38s | Avg:  2m 38s | Max:  2m 38s
      🟩 90a                Pass: 100%/1   | Total:  2m 33s | Avg:  2m 33s | Max:  2m 33s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 8m 52s | Avg: 4m 26s | Max: 6m 51s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  8m 52s | Avg:  4m 26s | Max:  6m 51s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  8m 52s | Avg:  4m 26s | Max:  6m 51s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  8m 52s | Avg:  4m 26s | Max:  6m 51s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  8m 52s | Avg:  4m 26s | Max:  6m 51s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  8m 52s | Avg:  4m 26s | Max:  6m 51s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  8m 52s | Avg:  4m 26s | Max:  6m 51s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  8m 52s | Avg:  4m 26s | Max:  6m 51s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 01s | Avg:  2m 01s | Max:  2m 01s
      🟩 Test               Pass: 100%/1   | Total:  6m 51s | Avg:  6m 51s | Max:  6m 51s
    
  • 🟩 python: Pass: 100%/1 | Total: 45m 28s | Avg: 45m 28s | Max: 45m 28s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 45m 28s | Avg: 45m 28s | Max: 45m 28s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 45m 28s | Avg: 45m 28s | Max: 45m 28s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 45m 28s | Avg: 45m 28s | Max: 45m 28s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 45m 28s | Avg: 45m 28s | Max: 45m 28s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 45m 28s | Avg: 45m 28s | Max: 45m 28s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 45m 28s | Avg: 45m 28s | Max: 45m 28s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 45m 28s | Avg: 45m 28s | Max: 45m 28s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 45m 28s | Avg: 45m 28s | Max: 45m 28s
    

👃 Inspect Changes

Modifications in project?

     Project
     CCCL Infrastructure
+/-  libcu++
     CUB
     Thrust
     CUDA Experimental
     python
     CCCL C Parallel Library
     Catch2Helper

Modifications in project or dependencies?

     Project
     CCCL Infrastructure
+/-  libcu++
+/-  CUB
+/-  Thrust
+/-  CUDA Experimental
+/-  python
+/-  CCCL C Parallel Library
+/-  Catch2Helper

🏃‍ Runner counts (total jobs: 153)

# Runner
110 linux-amd64-cpu16
17 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@bernhardmgruber
Contributor Author

/ok to test

Contributor

🟨 CI finished in 1h 57m: Pass: 93%/153 | Total: 1d 16h | Avg: 15m 52s | Max: 1h 17m | Hits: 82%/11450
  • 🟨 libcudacxx: Pass: 81%/43 | Total: 8h 44m | Avg: 12m 12s | Max: 32m 46s

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  80%/41  | Total:  8h 37m | Avg: 12m 37s | Max: 32m 46s
      🟩 arm64              Pass: 100%/2   | Total:  7m 07s | Avg:  3m 33s | Max:  3m 42s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 09m | Avg: 17m 22s | Max: 21m 13s
      🔍 nvcc               Pass:  79%/39  | Total:  7h 35m | Avg: 11m 40s | Max: 32m 46s
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  78%/38  | Total:  6h 58m | Avg: 11m 00s | Max: 32m 46s
      🟩 NVRTC              Pass: 100%/2   | Total: 56m 32s | Avg: 28m 16s | Max: 29m 29s
      🟩 Test               Pass: 100%/2   | Total: 47m 42s | Avg: 23m 51s | Max: 25m 22s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 00s | Avg:  2m 00s | Max:  2m 00s
    🟨 ctk
      🟥 12.0               Pass:   0%/5   | Total:  1h 31m | Avg: 18m 21s | Max: 20m 24s
      🟩 12.5               Pass: 100%/2   | Total:  1h 04m | Avg: 32m 10s | Max: 32m 46s
      🟨 12.6               Pass:  91%/36  | Total:  6h 08m | Avg: 10m 14s | Max: 29m 29s
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 09m | Avg: 17m 22s | Max: 21m 13s
      🟥 nvcc12.0           Pass:   0%/5   | Total:  1h 31m | Avg: 18m 21s | Max: 20m 24s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 04m | Avg: 32m 10s | Max: 32m 46s
      🟨 nvcc12.6           Pass:  90%/32  | Total:  4h 59m | Avg:  9m 20s | Max: 29m 29s
    🟨 cxx
      🟨 Clang14            Pass:  50%/4   | Total: 48m 36s | Avg: 12m 09s | Max: 20m 24s
      🟩 Clang15            Pass: 100%/2   | Total:  9m 17s | Avg:  4m 38s | Max:  4m 39s
      🟩 Clang16            Pass: 100%/2   | Total: 21m 11s | Avg: 10m 35s | Max: 16m 42s
      🟩 Clang17            Pass: 100%/2   | Total: 26m 46s | Avg: 13m 23s | Max: 22m 26s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 47m | Avg: 13m 24s | Max: 25m 22s
      🟨 GCC7               Pass:  50%/2   | Total: 35m 10s | Avg: 17m 35s | Max: 20m 11s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 34s | Avg:  3m 34s | Max:  3m 34s
      🟨 GCC9               Pass:  50%/2   | Total: 23m 31s | Avg: 11m 45s | Max: 19m 46s
      🟩 GCC10              Pass: 100%/2   | Total:  7m 18s | Avg:  3m 39s | Max:  3m 47s
      🟩 GCC11              Pass: 100%/2   | Total: 17m 13s | Avg:  8m 36s | Max: 13m 24s
      🟩 GCC12              Pass: 100%/2   | Total:  8m 05s | Avg:  4m 02s | Max:  4m 04s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 35m | Avg: 11m 58s | Max: 29m 29s
      🟥 MSVC14.29          Pass:   0%/2   | Total: 25m 55s | Avg: 12m 57s | Max: 14m 16s
      🟥 MSVC14.39          Pass:   0%/2   | Total: 30m 43s | Avg: 15m 21s | Max: 15m 38s
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 04m | Avg: 32m 10s | Max: 32m 46s
    🟨 cxx_family
      🟨 Clang              Pass:  88%/18  | Total:  3h 33m | Avg: 11m 50s | Max: 25m 22s
      🟨 GCC                Pass:  89%/19  | Total:  3h 10m | Avg: 10m 02s | Max: 29m 29s
      🟥 MSVC               Pass:   0%/4   | Total: 56m 38s | Avg: 14m 09s | Max: 15m 38s
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 04m | Avg: 32m 10s | Max: 32m 46s
    🟨 gpu
      🟨 v100               Pass:  81%/43  | Total:  8h 44m | Avg: 12m 12s | Max: 32m 46s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 41s | Avg: 13m 41s | Max: 13m 41s
      🟩 90a                Pass: 100%/2   | Total: 17m 49s | Avg:  8m 54s | Max: 14m 10s
    🟨 std
      🟨 17                 Pass:  71%/21  | Total:  4h 28m | Avg: 12m 45s | Max: 31m 34s
      🟨 20                 Pass:  90%/21  | Total:  4h 14m | Avg: 12m 07s | Max: 32m 46s
    
  • 🟨 cub: Pass: 97%/44 | Total: 15h 52m | Avg: 21m 38s | Max: 1h 17m | Hits: 42%/3552

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/42  | Total: 15h 42m | Avg: 22m 26s | Max:  1h 17m | Hits:  42%/3552  
      🟩 arm64              Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  4m 55s
    🔍 ctk: 12.6 🔍
      🟩 12.0               Pass: 100%/5   | Total:  4h 56m | Avg: 59m 19s | Max:  1h 03m | Hits:  38%/888   
      🟩 12.5               Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 16m
      🔍 12.6               Pass:  97%/37  | Total:  8h 29m | Avg: 13m 46s | Max:  1h 17m | Hits:  43%/2664  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 31s | Avg:  4m 15s | Max:  4m 22s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 56m | Avg: 59m 19s | Max:  1h 03m | Hits:  38%/888   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 16m
      🔍 nvcc12.6           Pass:  97%/35  | Total:  8h 21m | Avg: 14m 19s | Max:  1h 17m | Hits:  43%/2664  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 31s | Avg:  4m 15s | Max:  4m 22s
      🔍 nvcc               Pass:  97%/42  | Total: 15h 43m | Avg: 22m 27s | Max:  1h 17m | Hits:  42%/3552  
    🔍 cxx: GCC13 🔍
      🟩 Clang14            Pass: 100%/4   | Total:  2h 02m | Avg: 30m 35s | Max: 56m 00s
      🟩 Clang15            Pass: 100%/2   | Total: 11m 10s | Avg:  5m 35s | Max:  5m 40s
      🟩 Clang16            Pass: 100%/2   | Total: 11m 32s | Avg:  5m 46s | Max:  5m 57s
      🟩 Clang17            Pass: 100%/2   | Total: 11m 46s | Avg:  5m 53s | Max:  5m 57s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 14m | Avg: 10m 42s | Max: 28m 29s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 08m | Avg: 34m 17s | Max:  1h 03m
      🟩 GCC8               Pass: 100%/1   | Total:  5m 31s | Avg:  5m 31s | Max:  5m 31s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 04m | Avg: 32m 13s | Max: 58m 43s
      🟩 GCC10              Pass: 100%/2   | Total: 11m 38s | Avg:  5m 49s | Max:  6m 00s
      🟩 GCC11              Pass: 100%/2   | Total: 11m 48s | Avg:  5m 54s | Max:  6m 03s
      🟩 GCC12              Pass: 100%/4   | Total: 36m 30s | Avg:  9m 07s | Max: 19m 29s
      🔍 GCC13              Pass:  87%/8   | Total:  1h 33m | Avg: 11m 39s | Max: 29m 26s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 16m | Avg:  1h 08m | Max:  1h 13m | Hits:  41%/1776  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 26m | Avg:  1h 13m | Max:  1h 17m | Hits:  43%/1776  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 16m
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/17  | Total:  3h 51m | Avg: 13m 38s | Max: 56m 00s
      🔍 GCC                Pass:  95%/21  | Total:  4h 51m | Avg: 13m 53s | Max:  1h 03m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 42m | Avg:  1h 10m | Max:  1h 17m | Hits:  42%/3552  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 16m
    🔍 gpu: v100 🔍
      🟩 h100               Pass: 100%/2   | Total: 24m 02s | Avg: 12m 01s | Max: 19m 29s
      🔍 v100               Pass:  97%/42  | Total: 15h 28m | Avg: 22m 05s | Max:  1h 17m | Hits:  42%/3552  
    🚨 jobs: DeviceLaunch 🚨
      🟩 Build              Pass: 100%/37  | Total: 13h 30m | Avg: 21m 54s | Max:  1h 17m | Hits:  42%/3552  
      🔥 DeviceLaunch       Pass:   0%/1   | Total:  4m 31s | Avg:  4m 31s | Max:  4m 31s
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 28s | Avg: 16m 28s | Max: 16m 28s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 02m | Avg: 20m 52s | Max: 22m 04s
      🟩 TestGPU            Pass: 100%/2   | Total: 57m 55s | Avg: 28m 57s | Max: 29m 26s
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/20  | Total:  8h 53m | Avg: 26m 41s | Max:  1h 16m | Hits:  41%/2664  
      🔍 20                 Pass:  95%/24  | Total:  6h 58m | Avg: 17m 25s | Max:  1h 17m | Hits:  43%/888   
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 24m 02s | Avg: 12m 01s | Max: 19m 29s
      🟩 90a                Pass: 100%/1   | Total:  4m 33s | Avg:  4m 33s | Max:  4m 33s
    
  • 🟨 thrust: Pass: 97%/43 | Total: 12h 56m | Avg: 18m 02s | Max: 1h 15m | Hits: 95%/7376

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/41  | Total: 12h 46m | Avg: 18m 41s | Max:  1h 15m | Hits:  95%/7376  
      🟩 arm64              Pass: 100%/2   | Total:  9m 39s | Avg:  4m 49s | Max:  4m 57s
    🔍 ctk: 12.6 🔍
      🟩 12.0               Pass: 100%/5   | Total:  3h 13m | Avg: 38m 41s | Max:  1h 02m | Hits:  92%/1844  
      🟩 12.5               Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 15m
      🔍 12.6               Pass:  97%/36  | Total:  7h 13m | Avg: 12m 02s | Max:  1h 03m | Hits:  96%/5532  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 36s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 13m | Avg: 38m 41s | Max:  1h 02m | Hits:  92%/1844  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 15m
      🔍 nvcc12.6           Pass:  97%/34  | Total:  7h 02m | Avg: 12m 25s | Max:  1h 03m | Hits:  96%/5532  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 36s
      🔍 nvcc               Pass:  97%/41  | Total: 12h 45m | Avg: 18m 39s | Max:  1h 15m | Hits:  95%/7376  
    🔍 cxx: MSVC14.39 🔍
      🟩 Clang14            Pass: 100%/4   | Total:  1h 15m | Avg: 18m 45s | Max: 33m 08s
      🟩 Clang15            Pass: 100%/2   | Total: 11m 42s | Avg:  5m 51s | Max:  6m 02s
      🟩 Clang16            Pass: 100%/2   | Total: 11m 33s | Avg:  5m 46s | Max:  5m 53s
      🟩 Clang17            Pass: 100%/2   | Total: 11m 40s | Avg:  5m 50s | Max:  5m 53s
      🟩 Clang18            Pass: 100%/7   | Total: 53m 14s | Avg:  7m 36s | Max: 19m 12s
      🟩 GCC7               Pass: 100%/2   | Total: 38m 46s | Avg: 19m 23s | Max: 33m 19s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 06s | Avg:  5m 06s | Max:  5m 06s
      🟩 GCC9               Pass: 100%/2   | Total: 39m 31s | Avg: 19m 45s | Max: 33m 42s
      🟩 GCC10              Pass: 100%/2   | Total: 11m 09s | Avg:  5m 34s | Max:  5m 36s
      🟩 GCC11              Pass: 100%/2   | Total: 12m 05s | Avg:  6m 02s | Max:  6m 04s
      🟩 GCC12              Pass: 100%/2   | Total: 11m 59s | Avg:  5m 59s | Max:  6m 14s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 02m | Avg:  7m 47s | Max: 13m 21s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 02m | Hits: 101%/3688  
      🔍 MSVC14.39          Pass:  66%/3   | Total:  2h 40m | Avg: 53m 27s | Max:  1h 03m | Hits:  88%/3688  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 15m
    🔍 cxx_family: MSVC 🔍
      🟩 Clang              Pass: 100%/17  | Total:  2h 43m | Avg:  9m 35s | Max: 33m 08s
      🟩 GCC                Pass: 100%/19  | Total:  3h 00m | Avg:  9m 31s | Max: 33m 42s
      🔍 MSVC               Pass:  80%/5   | Total:  4h 42m | Avg: 56m 31s | Max:  1h 03m | Hits:  95%/7376  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 15m
    🔍 jobs: TestCPU 🔍
      🟩 Build              Pass: 100%/37  | Total: 11h 21m | Avg: 18m 25s | Max:  1h 15m | Hits:  95%/7376  
      🔍 TestCPU            Pass:  66%/3   | Total: 50m 02s | Avg: 16m 40s | Max: 34m 14s
      🟩 TestGPU            Pass: 100%/3   | Total: 44m 31s | Avg: 14m 50s | Max: 19m 12s
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/20  | Total:  7h 15m | Avg: 21m 46s | Max:  1h 14m | Hits:  97%/5532  
      🔍 20                 Pass:  95%/21  | Total:  5h 22m | Avg: 15m 21s | Max:  1h 15m | Hits:  88%/1844  
    🟨 gpu
      🟨 v100               Pass:  97%/43  | Total: 12h 56m | Avg: 18m 02s | Max:  1h 15m | Hits:  95%/7376  
    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 18m 09s | Avg:  9m 04s | Max: 11m 58s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 48s | Avg:  4m 48s | Max:  4m 48s
    
  • 🟩 cudax: Pass: 100%/20 | Total: 1h 59m | Avg: 5m 59s | Max: 20m 46s | Hits: 175%/522

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  1h 49m | Avg:  6m 49s | Max: 20m 46s | Hits: 175%/522   
      🟩 arm64              Pass: 100%/4   | Total: 10m 35s | Avg:  2m 38s | Max:  2m 42s
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 10m 05s | Avg: 10m 05s | Max: 10m 05s | Hits: 173%/261   
      🟩 12.5               Pass: 100%/2   | Total: 17m 48s | Avg:  8m 54s | Max:  9m 24s
      🟩 12.6               Pass: 100%/17  | Total:  1h 31m | Avg:  5m 24s | Max: 20m 46s | Hits: 177%/261   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 10m 05s | Avg: 10m 05s | Max: 10m 05s | Hits: 173%/261   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 17m 48s | Avg:  8m 54s | Max:  9m 24s
      🟩 nvcc12.6           Pass: 100%/17  | Total:  1h 31m | Avg:  5m 24s | Max: 20m 46s | Hits: 177%/261   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  1h 59m | Avg:  5m 59s | Max: 20m 46s | Hits: 175%/522   
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 18s | Avg:  3m 18s | Max:  3m 18s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 10s | Avg:  3m 10s | Max:  3m 10s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 29s | Avg:  3m 29s | Max:  3m 29s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 10s | Avg:  3m 10s | Max:  3m 10s
      🟩 Clang18            Pass: 100%/4   | Total: 24m 50s | Avg:  6m 12s | Max: 16m 21s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 01s | Avg:  3m 01s | Max:  3m 01s
      🟩 GCC11              Pass: 100%/1   | Total:  2m 57s | Avg:  2m 57s | Max:  2m 57s
      🟩 GCC12              Pass: 100%/2   | Total: 24m 12s | Avg: 12m 06s | Max: 20m 46s
      🟩 GCC13              Pass: 100%/4   | Total: 10m 55s | Avg:  2m 43s | Max:  2m 54s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 05s | Avg: 10m 05s | Max: 10m 05s | Hits: 173%/261   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 53s | Avg: 12m 53s | Max: 12m 53s | Hits: 177%/261   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 17m 48s | Avg:  8m 54s | Max:  9m 24s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 37m 57s | Avg:  4m 44s | Max: 16m 21s
      🟩 GCC                Pass: 100%/8   | Total: 41m 05s | Avg:  5m 08s | Max: 20m 46s
      🟩 MSVC               Pass: 100%/2   | Total: 22m 58s | Avg: 11m 29s | Max: 12m 53s | Hits: 175%/522   
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 48s | Avg:  8m 54s | Max:  9m 24s
    🟩 gpu
      🟩 v100               Pass: 100%/20  | Total:  1h 59m | Avg:  5m 59s | Max: 20m 46s | Hits: 175%/522   
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  1h 22m | Avg:  4m 35s | Max: 12m 53s | Hits: 175%/522   
      🟩 Test               Pass: 100%/2   | Total: 37m 07s | Avg: 18m 33s | Max: 20m 46s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 54s | Avg:  2m 54s | Max:  2m 54s
      🟩 90a                Pass: 100%/1   | Total:  2m 47s | Avg:  2m 47s | Max:  2m 47s
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 16m 34s | Avg:  4m 08s | Max:  8m 24s
      🟩 20                 Pass: 100%/16  | Total:  1h 43m | Avg:  6m 27s | Max: 20m 46s | Hits: 175%/522   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 49s | Avg: 4m 54s | Max: 7m 49s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 49s | Avg:  4m 54s | Max:  7m 49s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 49s | Avg:  4m 54s | Max:  7m 49s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 49s | Avg:  4m 54s | Max:  7m 49s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 49s | Avg:  4m 54s | Max:  7m 49s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 49s | Avg:  4m 54s | Max:  7m 49s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 49s | Avg:  4m 54s | Max:  7m 49s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 49s | Avg:  4m 54s | Max:  7m 49s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 00s | Avg:  2m 00s | Max:  2m 00s
      🟩 Test               Pass: 100%/1   | Total:  7m 49s | Avg:  7m 49s | Max:  7m 49s
    
  • 🟩 python: Pass: 100%/1 | Total: 45m 27s | Avg: 45m 27s | Max: 45m 27s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 45m 27s | Avg: 45m 27s | Max: 45m 27s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 45m 27s | Avg: 45m 27s | Max: 45m 27s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 45m 27s | Avg: 45m 27s | Max: 45m 27s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 45m 27s | Avg: 45m 27s | Max: 45m 27s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 45m 27s | Avg: 45m 27s | Max: 45m 27s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 45m 27s | Avg: 45m 27s | Max: 45m 27s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 45m 27s | Avg: 45m 27s | Max: 45m 27s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 45m 27s | Avg: 45m 27s | Max: 45m 27s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 153)

# Runner
110 linux-amd64-cpu16
17 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@bernhardmgruber
Contributor Author

/ok to test

Contributor

🟨 CI finished in 1d 01h: Pass: 99%/152 | Total: 1d 13h | Avg: 14m 39s | Max: 1h 15m | Hits: 436%/21515
  • 🟨 cub: Pass: 97%/44 | Total: 15h 30m | Avg: 21m 08s | Max: 1h 15m | Hits: 371%/3552

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/42  | Total: 15h 20m | Avg: 21m 55s | Max:  1h 15m | Hits: 371%/3552  
      🟩 arm64              Pass: 100%/2   | Total:  9m 38s | Avg:  4m 49s | Max:  4m 56s
    🔍 ctk: 12.6 🔍
      🟩 12.0               Pass: 100%/5   | Total:  4h 50m | Avg: 58m 05s | Max:  1h 02m | Hits: 370%/888   
      🟩 12.5               Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 08m
      🔍 12.6               Pass:  97%/37  | Total:  8h 28m | Avg: 13m 43s | Max:  1h 15m | Hits: 372%/2664  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 15s | Avg:  4m 37s | Max:  4m 52s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 50m | Avg: 58m 05s | Max:  1h 02m | Hits: 370%/888   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 08m
      🔍 nvcc12.6           Pass:  97%/35  | Total:  8h 18m | Avg: 14m 15s | Max:  1h 15m | Hits: 372%/2664  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 15s | Avg:  4m 37s | Max:  4m 52s
      🔍 nvcc               Pass:  97%/42  | Total: 15h 21m | Avg: 21m 55s | Max:  1h 15m | Hits: 371%/3552  
    🔍 cxx: GCC12 🔍
      🟩 Clang14            Pass: 100%/4   | Total:  2h 03m | Avg: 30m 49s | Max: 56m 21s
      🟩 Clang15            Pass: 100%/2   | Total: 11m 19s | Avg:  5m 39s | Max:  5m 41s
      🟩 Clang16            Pass: 100%/2   | Total: 11m 24s | Avg:  5m 42s | Max:  5m 49s
      🟩 Clang17            Pass: 100%/2   | Total: 11m 20s | Avg:  5m 40s | Max:  5m 45s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 21m | Avg: 11m 38s | Max: 29m 39s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 00m | Avg: 30m 11s | Max: 55m 00s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 33s | Avg:  5m 33s | Max:  5m 33s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 06m | Avg: 33m 10s | Max:  1h 00m
      🟩 GCC10              Pass: 100%/2   | Total: 11m 28s | Avg:  5m 44s | Max:  6m 00s
      🟩 GCC11              Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 32s
      🔍 GCC12              Pass:  75%/4   | Total: 16m 15s | Avg:  4m 03s | Max:  5m 59s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 54m | Avg: 14m 15s | Max: 27m 07s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 04m | Hits: 371%/1776  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 15m | Hits: 371%/1776  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 08m
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/17  | Total:  3h 58m | Avg: 14m 02s | Max: 56m 21s
      🔍 GCC                Pass:  95%/21  | Total:  4h 45m | Avg: 13m 34s | Max:  1h 00m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 34m | Avg:  1h 08m | Max:  1h 15m | Hits: 371%/3552  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 08m
    🔍 gpu: h100 🔍
      🔍 h100               Pass:  50%/2   | Total:  4m 38s | Avg:  2m 19s | Max:  4m 38s
      🟩 v100               Pass: 100%/42  | Total: 15h 25m | Avg: 22m 02s | Max:  1h 15m | Hits: 371%/3552  
    🔍 jobs: HostLaunch 🔍
      🟩 Build              Pass: 100%/37  | Total: 13h 01m | Avg: 21m 07s | Max:  1h 15m | Hits: 371%/3552  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 22m 37s | Avg: 22m 37s | Max: 22m 37s
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 37s | Avg: 16m 37s | Max: 16m 37s
      🔍 HostLaunch         Pass:  66%/3   | Total: 56m 46s | Avg: 18m 55s | Max: 29m 39s
      🟩 TestGPU            Pass: 100%/2   | Total: 52m 48s | Avg: 26m 24s | Max: 26m 26s
    🔍 sm: 90 🔍
      🔍 90                 Pass:  50%/2   | Total:  4m 38s | Avg:  2m 19s | Max:  4m 38s
      🟩 90a                Pass: 100%/1   | Total:  4m 40s | Avg:  4m 40s | Max:  4m 40s
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/20  | Total:  8h 26m | Avg: 25m 19s | Max:  1h 12m | Hits: 371%/2664  
      🔍 20                 Pass:  95%/24  | Total:  7h 03m | Avg: 17m 39s | Max:  1h 15m | Hits: 371%/888   
    
  • 🟩 libcudacxx: Pass: 100%/43 | Total: 7h 23m | Avg: 10m 18s | Max: 37m 39s | Hits: 639%/10065

    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total:  7h 16m | Avg: 10m 38s | Max: 37m 39s | Hits: 639%/10065 
      🟩 arm64              Pass: 100%/2   | Total:  7m 05s | Avg:  3m 32s | Max:  3m 45s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 37m 12s | Avg:  7m 26s | Max: 21m 52s | Hits: 622%/2471  
      🟩 12.5               Pass: 100%/2   | Total: 40m 40s | Avg: 20m 20s | Max: 32m 02s
      🟩 12.6               Pass: 100%/36  | Total:  6h 05m | Avg: 10m 09s | Max: 37m 39s | Hits: 644%/7594  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 07m | Avg: 16m 56s | Max: 20m 36s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 37m 12s | Avg:  7m 26s | Max: 21m 52s | Hits: 622%/2471  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 40m 40s | Avg: 20m 20s | Max: 32m 02s
      🟩 nvcc12.6           Pass: 100%/32  | Total:  4h 57m | Avg:  9m 18s | Max: 37m 39s | Hits: 644%/7594  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 07m | Avg: 16m 56s | Max: 20m 36s
      🟩 nvcc               Pass: 100%/39  | Total:  6h 15m | Avg:  9m 38s | Max: 37m 39s | Hits: 639%/10065 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 16m 04s | Avg:  4m 01s | Max:  4m 13s
      🟩 Clang15            Pass: 100%/2   | Total:  8m 58s | Avg:  4m 29s | Max:  4m 36s
      🟩 Clang16            Pass: 100%/2   | Total:  8m 26s | Avg:  4m 13s | Max:  4m 15s
      🟩 Clang17            Pass: 100%/2   | Total:  9m 08s | Avg:  4m 34s | Max:  4m 39s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 38m | Avg: 12m 16s | Max: 20m 36s
      🟩 GCC7               Pass: 100%/2   | Total:  7m 18s | Avg:  3m 39s | Max:  3m 42s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 52s | Avg:  3m 52s | Max:  3m 52s
      🟩 GCC9               Pass: 100%/2   | Total:  7m 56s | Avg:  3m 58s | Max:  4m 08s
      🟩 GCC10              Pass: 100%/2   | Total:  7m 48s | Avg:  3m 54s | Max:  3m 56s
      🟩 GCC11              Pass: 100%/2   | Total:  7m 29s | Avg:  3m 44s | Max:  3m 46s
      🟩 GCC12              Pass: 100%/2   | Total:  8m 13s | Avg:  4m 06s | Max:  4m 16s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 49m | Avg: 13m 41s | Max: 37m 39s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 50m 04s | Avg: 25m 02s | Max: 28m 12s | Hits: 622%/4952  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 59m 51s | Avg: 29m 55s | Max: 33m 41s | Hits: 654%/5113  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 40m 40s | Avg: 20m 20s | Max: 32m 02s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  2h 20m | Avg:  7m 49s | Max: 20m 36s
      🟩 GCC                Pass: 100%/19  | Total:  2h 32m | Avg:  8m 00s | Max: 37m 39s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 49m | Avg: 27m 28s | Max: 33m 41s | Hits: 639%/10065 
      🟩 NVHPC              Pass: 100%/2   | Total: 40m 40s | Avg: 20m 20s | Max: 32m 02s
    🟩 gpu
      🟩 v100               Pass: 100%/43  | Total:  7h 23m | Avg: 10m 18s | Max: 37m 39s | Hits: 639%/10065 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total:  5h 31m | Avg:  8m 43s | Max: 33m 41s | Hits: 639%/10065 
      🟩 NVRTC              Pass: 100%/2   | Total:  1h 14m | Avg: 37m 09s | Max: 37m 39s
      🟩 Test               Pass: 100%/2   | Total: 35m 33s | Avg: 17m 46s | Max: 17m 52s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 55s | Avg:  1m 55s | Max:  1m 55s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 03s | Avg: 13m 03s | Max: 13m 03s
      🟩 90a                Pass: 100%/2   | Total: 18m 12s | Avg:  9m 06s | Max: 14m 05s
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 22m | Avg:  9m 37s | Max: 36m 39s | Hits: 645%/7433  
      🟩 20                 Pass: 100%/21  | Total:  3h 59m | Avg: 11m 23s | Max: 37m 39s | Hits: 622%/2632  
    
  • 🟩 thrust: Pass: 100%/42 | Total: 11h 17m | Avg: 16m 07s | Max: 55m 48s | Hits: 194%/7376

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 25m 32s | Avg: 12m 46s | Max: 19m 45s
    🟩 cpu
      🟩 amd64              Pass: 100%/40  | Total: 11h 07m | Avg: 16m 41s | Max: 55m 48s | Hits: 194%/7376  
      🟩 arm64              Pass: 100%/2   | Total:  9m 36s | Avg:  4m 48s | Max:  5m 00s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 07m | Avg: 37m 34s | Max: 54m 18s | Hits: 182%/1844  
      🟩 12.5               Pass: 100%/2   | Total:  1h 49m | Avg: 54m 58s | Max: 55m 48s
      🟩 12.6               Pass: 100%/35  | Total:  6h 19m | Avg: 10m 50s | Max: 53m 20s | Hits: 199%/5532  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 56s | Avg:  4m 58s | Max:  5m 01s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 07m | Avg: 37m 34s | Max: 54m 18s | Hits: 182%/1844  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 49m | Avg: 54m 58s | Max: 55m 48s
      🟩 nvcc12.6           Pass: 100%/33  | Total:  6h 09m | Avg: 11m 12s | Max: 53m 20s | Hits: 199%/5532  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 56s | Avg:  4m 58s | Max:  5m 01s
      🟩 nvcc               Pass: 100%/40  | Total: 11h 07m | Avg: 16m 41s | Max: 55m 48s | Hits: 194%/7376  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 15m | Avg: 18m 54s | Max: 32m 36s
      🟩 Clang15            Pass: 100%/2   | Total: 11m 07s | Avg:  5m 33s | Max:  5m 38s
      🟩 Clang16            Pass: 100%/2   | Total: 11m 22s | Avg:  5m 41s | Max:  5m 59s
      🟩 Clang17            Pass: 100%/2   | Total: 11m 13s | Avg:  5m 36s | Max:  5m 52s
      🟩 Clang18            Pass: 100%/7   | Total: 46m 28s | Avg:  6m 38s | Max: 14m 11s
      🟩 GCC7               Pass: 100%/2   | Total: 41m 42s | Avg: 20m 51s | Max: 35m 40s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 39s | Avg:  5m 39s | Max:  5m 39s
      🟩 GCC9               Pass: 100%/2   | Total: 40m 17s | Avg: 20m 08s | Max: 33m 51s
      🟩 GCC10              Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  6m 10s
      🟩 GCC11              Pass: 100%/2   | Total: 11m 51s | Avg:  5m 55s | Max:  5m 59s
      🟩 GCC12              Pass: 100%/2   | Total: 11m 37s | Avg:  5m 48s | Max:  5m 53s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 18m | Avg:  9m 45s | Max: 22m 32s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 46m | Avg: 53m 13s | Max: 54m 18s | Hits: 190%/3688  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 44m | Avg: 52m 08s | Max: 53m 20s | Hits: 199%/3688  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 49m | Avg: 54m 58s | Max: 55m 48s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 35m | Avg:  9m 09s | Max: 32m 36s
      🟩 GCC                Pass: 100%/19  | Total:  3h 21m | Avg: 10m 35s | Max: 35m 40s
      🟩 MSVC               Pass: 100%/4   | Total:  3h 30m | Avg: 52m 40s | Max: 54m 18s | Hits: 194%/7376  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 49m | Avg: 54m 58s | Max: 55m 48s
    🟩 gpu
      🟩 v100               Pass: 100%/42  | Total: 11h 17m | Avg: 16m 07s | Max: 55m 48s | Hits: 194%/7376  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 10h 06m | Avg: 16m 22s | Max: 55m 48s | Hits: 194%/7376  
      🟩 TestCPU            Pass: 100%/2   | Total: 15m 02s | Avg:  7m 31s | Max:  8m 01s
      🟩 TestGPU            Pass: 100%/3   | Total: 56m 28s | Avg: 18m 49s | Max: 22m 32s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 20s | Avg:  4m 20s | Max:  4m 20s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  6h 30m | Avg: 19m 31s | Max: 54m 18s | Hits: 193%/5532  
      🟩 20                 Pass: 100%/20  | Total:  4h 21m | Avg: 13m 04s | Max: 55m 48s | Hits: 199%/1844  
    
  • 🟩 cudax: Pass: 100%/20 | Total: 2h 04m | Avg: 6m 14s | Max: 22m 14s | Hits: 383%/522

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  1h 50m | Avg:  6m 54s | Max: 22m 14s | Hits: 383%/522   
      🟩 arm64              Pass: 100%/4   | Total: 14m 14s | Avg:  3m 33s | Max:  3m 38s
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total:  9m 52s | Avg:  9m 52s | Max:  9m 52s | Hits: 383%/261   
      🟩 12.5               Pass: 100%/2   | Total: 13m 15s | Avg:  6m 37s | Max:  7m 02s
      🟩 12.6               Pass: 100%/17  | Total:  1h 41m | Avg:  5m 59s | Max: 22m 14s | Hits: 383%/261   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total:  9m 52s | Avg:  9m 52s | Max:  9m 52s | Hits: 383%/261   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 13m 15s | Avg:  6m 37s | Max:  7m 02s
      🟩 nvcc12.6           Pass: 100%/17  | Total:  1h 41m | Avg:  5m 59s | Max: 22m 14s | Hits: 383%/261   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  2h 04m | Avg:  6m 14s | Max: 22m 14s | Hits: 383%/522   
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  4m 01s | Avg:  4m 01s | Max:  4m 01s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 05s | Avg:  4m 05s | Max:  4m 05s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 54s | Avg:  3m 54s | Max:  3m 54s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 56s | Avg:  3m 56s | Max:  3m 56s
      🟩 Clang18            Pass: 100%/4   | Total: 25m 58s | Avg:  6m 29s | Max: 15m 01s
      🟩 GCC10              Pass: 100%/1   | Total:  4m 06s | Avg:  4m 06s | Max:  4m 06s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 44s | Avg:  3m 44s | Max:  3m 44s
      🟩 GCC12              Pass: 100%/2   | Total: 26m 15s | Avg: 13m 07s | Max: 22m 14s
      🟩 GCC13              Pass: 100%/4   | Total: 13m 45s | Avg:  3m 26s | Max:  3m 35s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  9m 52s | Avg:  9m 52s | Max:  9m 52s | Hits: 383%/261   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 01s | Avg: 12m 01s | Max: 12m 01s | Hits: 383%/261   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 13m 15s | Avg:  6m 37s | Max:  7m 02s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 41m 54s | Avg:  5m 14s | Max: 15m 01s
      🟩 GCC                Pass: 100%/8   | Total: 47m 50s | Avg:  5m 58s | Max: 22m 14s
      🟩 MSVC               Pass: 100%/2   | Total: 21m 53s | Avg: 10m 56s | Max: 12m 01s | Hits: 383%/522   
      🟩 NVHPC              Pass: 100%/2   | Total: 13m 15s | Avg:  6m 37s | Max:  7m 02s
    🟩 gpu
      🟩 v100               Pass: 100%/20  | Total:  2h 04m | Avg:  6m 14s | Max: 22m 14s | Hits: 383%/522   
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  1h 27m | Avg:  4m 52s | Max: 12m 01s | Hits: 383%/522   
      🟩 Test               Pass: 100%/2   | Total: 37m 15s | Avg: 18m 37s | Max: 22m 14s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 20s | Avg:  3m 20s | Max:  3m 20s
      🟩 90a                Pass: 100%/1   | Total:  3m 16s | Avg:  3m 16s | Max:  3m 16s
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 16m 46s | Avg:  4m 11s | Max:  6m 13s
      🟩 20                 Pass: 100%/16  | Total:  1h 48m | Avg:  6m 45s | Max: 22m 14s | Hits: 383%/522   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 35s | Avg: 4m 47s | Max: 7m 29s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 35s | Avg:  4m 47s | Max:  7m 29s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 35s | Avg:  4m 47s | Max:  7m 29s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 35s | Avg:  4m 47s | Max:  7m 29s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 35s | Avg:  4m 47s | Max:  7m 29s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 35s | Avg:  4m 47s | Max:  7m 29s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 35s | Avg:  4m 47s | Max:  7m 29s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 35s | Avg:  4m 47s | Max:  7m 29s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 06s | Avg:  2m 06s | Max:  2m 06s
      🟩 Test               Pass: 100%/1   | Total:  7m 29s | Avg:  7m 29s | Max:  7m 29s
    
  • 🟩 python: Pass: 100%/1 | Total: 43m 13s | Avg: 43m 13s | Max: 43m 13s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 43m 13s | Avg: 43m 13s | Max: 43m 13s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 43m 13s | Avg: 43m 13s | Max: 43m 13s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 43m 13s | Avg: 43m 13s | Max: 43m 13s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 43m 13s | Avg: 43m 13s | Max: 43m 13s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 43m 13s | Avg: 43m 13s | Max: 43m 13s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 43m 13s | Avg: 43m 13s | Max: 43m 13s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 43m 13s | Avg: 43m 13s | Max: 43m 13s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 43m 13s | Avg: 43m 13s | Max: 43m 13s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 152)

# Runner
110 linux-amd64-cpu16
17 linux-amd64-gpu-v100-latest-1
14 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@bernhardmgruber bernhardmgruber marked this pull request as ready for review January 29, 2025 19:36
@bernhardmgruber bernhardmgruber requested a review from a team as a code owner January 29, 2025 19:36
Contributor

🟨 CI finished in 2h 57m: Pass: 99%/152 | Total: 1d 13h | Avg: 14m 44s | Max: 1h 16m | Hits: 398%/21523
  • 🟨 cub: Pass: 97%/44 | Total: 15h 46m | Avg: 21m 31s | Max: 1h 16m | Hits: 86%/3552

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/42  | Total: 15h 37m | Avg: 22m 18s | Max:  1h 16m | Hits:  86%/3552  
      🟩 arm64              Pass: 100%/2   | Total:  9m 48s | Avg:  4m 54s | Max:  5m 02s
    🔍 ctk: 12.6 🔍
      🟩 12.0               Pass: 100%/5   | Total:  4h 58m | Avg: 59m 46s | Max:  1h 03m | Hits:  85%/888   
      🟩 12.5               Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 16m
      🔍 12.6               Pass:  97%/37  | Total:  8h 22m | Avg: 13m 34s | Max:  1h 13m | Hits:  86%/2664  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 56s | Avg:  4m 28s | Max:  4m 32s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 58m | Avg: 59m 46s | Max:  1h 03m | Hits:  85%/888   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 16m
      🔍 nvcc12.6           Pass:  97%/35  | Total:  8h 13m | Avg: 14m 05s | Max:  1h 13m | Hits:  86%/2664  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 56s | Avg:  4m 28s | Max:  4m 32s
      🔍 nvcc               Pass:  97%/42  | Total: 15h 38m | Avg: 22m 20s | Max:  1h 16m | Hits:  86%/3552  
    🔍 cxx: Clang18 🔍
      🟩 Clang14            Pass: 100%/4   | Total:  2h 08m | Avg: 32m 10s | Max: 59m 00s
      🟩 Clang15            Pass: 100%/2   | Total: 11m 27s | Avg:  5m 43s | Max:  5m 53s
      🟩 Clang16            Pass: 100%/2   | Total: 11m 09s | Avg:  5m 34s | Max:  5m 44s
      🟩 Clang17            Pass: 100%/2   | Total: 11m 23s | Avg:  5m 41s | Max:  5m 43s
      🔍 Clang18            Pass:  85%/7   | Total: 56m 54s | Avg:  8m 07s | Max: 27m 49s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 06m | Avg: 33m 15s | Max:  1h 00m
      🟩 GCC8               Pass: 100%/1   | Total:  5m 22s | Avg:  5m 22s | Max:  5m 22s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 02m | Avg: 31m 28s | Max: 57m 09s
      🟩 GCC10              Pass: 100%/2   | Total: 12m 44s | Avg:  6m 22s | Max:  6m 48s
      🟩 GCC11              Pass: 100%/2   | Total: 11m 55s | Avg:  5m 57s | Max:  5m 58s
      🟩 GCC12              Pass: 100%/4   | Total: 35m 14s | Avg:  8m 48s | Max: 19m 04s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 45m | Avg: 13m 14s | Max: 24m 00s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 12m | Hits:  87%/1776  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 13m | Hits:  84%/1776  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 16m
    🔍 cxx_family: Clang 🔍
      🔍 Clang              Pass:  94%/17  | Total:  3h 39m | Avg: 12m 54s | Max: 59m 00s
      🟩 GCC                Pass: 100%/21  | Total:  5h 00m | Avg: 14m 18s | Max:  1h 00m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 41m | Avg:  1h 10m | Max:  1h 13m | Hits:  86%/3552  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 16m
    🔍 gpu: v100 🔍
      🟩 h100               Pass: 100%/2   | Total: 23m 29s | Avg: 11m 44s | Max: 19m 04s
      🔍 v100               Pass:  97%/42  | Total: 15h 23m | Avg: 21m 59s | Max:  1h 16m | Hits:  86%/3552  
    🔍 jobs: HostLaunch 🔍
      🟩 Build              Pass: 100%/37  | Total: 13h 31m | Avg: 21m 55s | Max:  1h 16m | Hits:  86%/3552  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 24m 00s | Avg: 24m 00s | Max: 24m 00s
      🟩 GraphCapture       Pass: 100%/1   | Total: 19m 33s | Avg: 19m 33s | Max: 19m 33s
      🔍 HostLaunch         Pass:  66%/3   | Total: 42m 25s | Avg: 14m 08s | Max: 19m 13s
      🟩 TestGPU            Pass: 100%/2   | Total: 49m 35s | Avg: 24m 47s | Max: 27m 49s
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/20  | Total:  8h 49m | Avg: 26m 28s | Max:  1h 13m | Hits:  88%/2664  
      🔍 20                 Pass:  95%/24  | Total:  6h 57m | Avg: 17m 24s | Max:  1h 16m | Hits:  78%/888   
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 23m 29s | Avg: 11m 44s | Max: 19m 04s
      🟩 90a                Pass: 100%/1   | Total:  4m 29s | Avg:  4m 29s | Max:  4m 29s
    
  • 🟩 libcudacxx: Pass: 100%/43 | Total: 7h 04m | Avg: 9m 52s | Max: 31m 01s | Hits: 669%/10065

    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total:  6h 57m | Avg: 10m 10s | Max: 31m 01s | Hits: 669%/10065 
      🟩 arm64              Pass: 100%/2   | Total:  7m 03s | Avg:  3m 31s | Max:  3m 43s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 43m 32s | Avg:  8m 42s | Max: 28m 07s | Hits: 622%/2471  
      🟩 12.5               Pass: 100%/2   | Total: 40m 51s | Avg: 20m 25s | Max: 31m 01s
      🟩 12.6               Pass: 100%/36  | Total:  5h 39m | Avg:  9m 26s | Max: 28m 49s | Hits: 684%/7594  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 07m | Avg: 16m 45s | Max: 20m 22s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 43m 32s | Avg:  8m 42s | Max: 28m 07s | Hits: 622%/2471  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 40m 51s | Avg: 20m 25s | Max: 31m 01s
      🟩 nvcc12.6           Pass: 100%/32  | Total:  4h 32m | Avg:  8m 31s | Max: 28m 49s | Hits: 684%/7594  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 07m | Avg: 16m 45s | Max: 20m 22s
      🟩 nvcc               Pass: 100%/39  | Total:  5h 57m | Avg:  9m 09s | Max: 31m 01s | Hits: 669%/10065 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 16m 55s | Avg:  4m 13s | Max:  4m 31s
      🟩 Clang15            Pass: 100%/2   | Total:  8m 39s | Avg:  4m 19s | Max:  4m 23s
      🟩 Clang16            Pass: 100%/2   | Total:  8m 37s | Avg:  4m 18s | Max:  4m 34s
      🟩 Clang17            Pass: 100%/2   | Total:  9m 01s | Avg:  4m 30s | Max:  4m 33s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 37m | Avg: 12m 11s | Max: 20m 22s
      🟩 GCC7               Pass: 100%/2   | Total:  7m 01s | Avg:  3m 30s | Max:  3m 32s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 38s | Avg:  3m 38s | Max:  3m 38s
      🟩 GCC9               Pass: 100%/2   | Total:  7m 18s | Avg:  3m 39s | Max:  3m 42s
      🟩 GCC10              Pass: 100%/2   | Total:  8m 32s | Avg:  4m 16s | Max:  4m 30s
      🟩 GCC11              Pass: 100%/2   | Total:  7m 46s | Avg:  3m 53s | Max:  4m 03s
      🟩 GCC12              Pass: 100%/2   | Total:  8m 05s | Avg:  4m 02s | Max:  4m 03s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 31m | Avg: 11m 22s | Max: 27m 42s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 52m 32s | Avg: 26m 16s | Max: 28m 07s | Hits: 655%/4952  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 56m 47s | Avg: 28m 23s | Max: 28m 49s | Hits: 682%/5113  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 40m 51s | Avg: 20m 25s | Max: 31m 01s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  2h 20m | Avg:  7m 49s | Max: 20m 22s
      🟩 GCC                Pass: 100%/19  | Total:  2h 13m | Avg:  7m 01s | Max: 27m 42s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 49m | Avg: 27m 19s | Max: 28m 49s | Hits: 669%/10065 
      🟩 NVHPC              Pass: 100%/2   | Total: 40m 51s | Avg: 20m 25s | Max: 31m 01s
    🟩 gpu
      🟩 v100               Pass: 100%/43  | Total:  7h 04m | Avg:  9m 52s | Max: 31m 01s | Hits: 669%/10065 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total:  5h 30m | Avg:  8m 41s | Max: 31m 01s | Hits: 669%/10065 
      🟩 NVRTC              Pass: 100%/2   | Total: 51m 47s | Avg: 25m 53s | Max: 27m 42s
      🟩 Test               Pass: 100%/2   | Total: 40m 03s | Avg: 20m 01s | Max: 22m 09s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 55s | Avg:  1m 55s | Max:  1m 55s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 14m 09s | Avg: 14m 09s | Max: 14m 09s
      🟩 90a                Pass: 100%/2   | Total: 16m 14s | Avg:  8m 07s | Max: 12m 35s
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 18m | Avg:  9m 27s | Max: 28m 07s | Hits: 667%/7433  
      🟩 20                 Pass: 100%/21  | Total:  3h 43m | Avg: 10m 38s | Max: 31m 01s | Hits: 675%/2632  
    
  • 🟩 thrust: Pass: 100%/42 | Total: 11h 23m | Avg: 16m 15s | Max: 57m 46s | Hits: 187%/7384

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 28m 57s | Avg: 14m 28s | Max: 23m 27s
    🟩 cpu
      🟩 amd64              Pass: 100%/40  | Total: 11h 13m | Avg: 16m 50s | Max: 57m 46s | Hits: 187%/7384  
      🟩 arm64              Pass: 100%/2   | Total:  9m 37s | Avg:  4m 48s | Max:  5m 00s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 16m | Avg: 39m 12s | Max: 57m 46s | Hits: 174%/1846  
      🟩 12.5               Pass: 100%/2   | Total:  1h 48m | Avg: 54m 07s | Max: 56m 19s
      🟩 12.6               Pass: 100%/35  | Total:  6h 18m | Avg: 10m 49s | Max: 54m 39s | Hits: 191%/5538  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 31s | Avg:  5m 15s | Max:  5m 16s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 16m | Avg: 39m 12s | Max: 57m 46s | Hits: 174%/1846  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 48m | Avg: 54m 07s | Max: 56m 19s
      🟩 nvcc12.6           Pass: 100%/33  | Total:  6h 08m | Avg: 11m 09s | Max: 54m 39s | Hits: 191%/5538  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 31s | Avg:  5m 15s | Max:  5m 16s
      🟩 nvcc               Pass: 100%/40  | Total: 11h 12m | Avg: 16m 48s | Max: 57m 46s | Hits: 187%/7384  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 22m | Avg: 20m 37s | Max: 39m 32s
      🟩 Clang15            Pass: 100%/2   | Total: 11m 13s | Avg:  5m 36s | Max:  5m 38s
      🟩 Clang16            Pass: 100%/2   | Total: 12m 06s | Avg:  6m 03s | Max:  6m 04s
      🟩 Clang17            Pass: 100%/2   | Total: 11m 01s | Avg:  5m 30s | Max:  5m 40s
      🟩 Clang18            Pass: 100%/7   | Total: 45m 30s | Avg:  6m 30s | Max: 11m 07s
      🟩 GCC7               Pass: 100%/2   | Total: 37m 21s | Avg: 18m 40s | Max: 31m 26s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 42s | Avg:  5m 42s | Max:  5m 42s
      🟩 GCC9               Pass: 100%/2   | Total: 41m 31s | Avg: 20m 45s | Max: 35m 28s
      🟩 GCC10              Pass: 100%/2   | Total: 12m 00s | Avg:  6m 00s | Max:  6m 29s
      🟩 GCC11              Pass: 100%/2   | Total: 12m 00s | Avg:  6m 00s | Max:  6m 06s
      🟩 GCC12              Pass: 100%/2   | Total: 12m 39s | Avg:  6m 19s | Max:  6m 22s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 15m | Avg:  9m 29s | Max: 23m 27s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 48m | Avg: 54m 21s | Max: 57m 46s | Hits: 183%/3692  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 46m | Avg: 53m 19s | Max: 54m 39s | Hits: 191%/3692  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 48m | Avg: 54m 07s | Max: 56m 19s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 42m | Avg:  9m 32s | Max: 39m 32s
      🟩 GCC                Pass: 100%/19  | Total:  3h 17m | Avg: 10m 22s | Max: 35m 28s
      🟩 MSVC               Pass: 100%/4   | Total:  3h 35m | Avg: 53m 50s | Max: 57m 46s | Hits: 187%/7384  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 48m | Avg: 54m 07s | Max: 56m 19s
    🟩 gpu
      🟩 v100               Pass: 100%/42  | Total: 11h 23m | Avg: 16m 15s | Max: 57m 46s | Hits: 187%/7384  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 10h 15m | Avg: 16m 38s | Max: 57m 46s | Hits: 187%/7384  
      🟩 TestCPU            Pass: 100%/2   | Total: 15m 31s | Avg:  7m 45s | Max:  7m 47s
      🟩 TestGPU            Pass: 100%/3   | Total: 52m 07s | Avg: 17m 22s | Max: 23m 27s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 27s | Avg:  4m 27s | Max:  4m 27s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  6h 27m | Avg: 19m 23s | Max: 57m 46s | Hits: 185%/5538  
      🟩 20                 Pass: 100%/20  | Total:  4h 26m | Avg: 13m 18s | Max: 56m 19s | Hits: 191%/1846  
    
  • 🟩 cudax: Pass: 100%/20 | Total: 2h 06m | Avg: 6m 20s | Max: 25m 51s | Hits: 287%/522

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  1h 55m | Avg:  7m 14s | Max: 25m 51s | Hits: 287%/522   
      🟩 arm64              Pass: 100%/4   | Total: 10m 54s | Avg:  2m 43s | Max:  2m 47s
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 10m 18s | Avg: 10m 18s | Max: 10m 18s | Hits: 286%/261   
      🟩 12.5               Pass: 100%/2   | Total: 17m 47s | Avg:  8m 53s | Max:  8m 58s
      🟩 12.6               Pass: 100%/17  | Total:  1h 38m | Avg:  5m 48s | Max: 25m 51s | Hits: 288%/261   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 10m 18s | Avg: 10m 18s | Max: 10m 18s | Hits: 286%/261   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 17m 47s | Avg:  8m 53s | Max:  8m 58s
      🟩 nvcc12.6           Pass: 100%/17  | Total:  1h 38m | Avg:  5m 48s | Max: 25m 51s | Hits: 288%/261   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  2h 06m | Avg:  6m 20s | Max: 25m 51s | Hits: 287%/522   
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 20s | Avg:  3m 20s | Max:  3m 20s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 22s | Avg:  3m 22s | Max:  3m 22s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 26s | Avg:  3m 26s | Max:  3m 26s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 27s | Avg:  3m 27s | Max:  3m 27s
      🟩 Clang18            Pass: 100%/4   | Total: 26m 22s | Avg:  6m 35s | Max: 17m 27s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 06s | Avg:  3m 06s | Max:  3m 06s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 09s | Avg:  3m 09s | Max:  3m 09s
      🟩 GCC12              Pass: 100%/2   | Total: 29m 12s | Avg: 14m 36s | Max: 25m 51s
      🟩 GCC13              Pass: 100%/4   | Total: 11m 11s | Avg:  2m 47s | Max:  2m 56s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 18s | Avg: 10m 18s | Max: 10m 18s | Hits: 286%/261   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 04s | Avg: 12m 04s | Max: 12m 04s | Hits: 288%/261   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 17m 47s | Avg:  8m 53s | Max:  8m 58s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 39m 57s | Avg:  4m 59s | Max: 17m 27s
      🟩 GCC                Pass: 100%/8   | Total: 46m 38s | Avg:  5m 49s | Max: 25m 51s
      🟩 MSVC               Pass: 100%/2   | Total: 22m 22s | Avg: 11m 11s | Max: 12m 04s | Hits: 287%/522   
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 47s | Avg:  8m 53s | Max:  8m 58s
    🟩 gpu
      🟩 v100               Pass: 100%/20  | Total:  2h 06m | Avg:  6m 20s | Max: 25m 51s | Hits: 287%/522   
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  1h 23m | Avg:  4m 38s | Max: 12m 04s | Hits: 287%/522   
      🟩 Test               Pass: 100%/2   | Total: 43m 18s | Avg: 21m 39s | Max: 25m 51s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 56s | Avg:  2m 56s | Max:  2m 56s
      🟩 90a                Pass: 100%/1   | Total:  2m 53s | Avg:  2m 53s | Max:  2m 53s
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 17m 10s | Avg:  4m 17s | Max:  8m 49s
      🟩 20                 Pass: 100%/16  | Total:  1h 49m | Avg:  6m 50s | Max: 25m 51s | Hits: 287%/522   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 35s | Avg: 5m 17s | Max: 8m 26s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 35s | Avg:  5m 17s | Max:  8m 26s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 35s | Avg:  5m 17s | Max:  8m 26s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 35s | Avg:  5m 17s | Max:  8m 26s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 35s | Avg:  5m 17s | Max:  8m 26s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 35s | Avg:  5m 17s | Max:  8m 26s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 35s | Avg:  5m 17s | Max:  8m 26s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 35s | Avg:  5m 17s | Max:  8m 26s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 09s | Avg:  2m 09s | Max:  2m 09s
      🟩 Test               Pass: 100%/1   | Total:  8m 26s | Avg:  8m 26s | Max:  8m 26s
    
  • 🟩 python: Pass: 100%/1 | Total: 48m 55s | Avg: 48m 55s | Max: 48m 55s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 48m 55s | Avg: 48m 55s | Max: 48m 55s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 48m 55s | Avg: 48m 55s | Max: 48m 55s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 48m 55s | Avg: 48m 55s | Max: 48m 55s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 48m 55s | Avg: 48m 55s | Max: 48m 55s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 48m 55s | Avg: 48m 55s | Max: 48m 55s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 48m 55s | Avg: 48m 55s | Max: 48m 55s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 48m 55s | Avg: 48m 55s | Max: 48m 55s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 48m 55s | Avg: 48m 55s | Max: 48m 55s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 152)

# Runner
110 linux-amd64-cpu16
17 linux-amd64-gpu-v100-latest-1
14 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1

bernhardmgruber merged commit a654bc6 into NVIDIA:main on Jan 30, 2025
163 of 167 checks passed
bernhardmgruber deleted the fp16_ctk120 branch on January 30, 2025 07:33

Backport failed for branch/2.8.x, because it was unable to cherry-pick the commit(s).

Please cherry-pick the changes locally.

git fetch origin branch/2.8.x
git worktree add -d .worktree/backport-3535-to-branch/2.8.x origin/branch/2.8.x
cd .worktree/backport-3535-to-branch/2.8.x
git checkout -b backport-3535-to-branch/2.8.x
ancref=$(git merge-base 09b12009d906bdb69f9da60de5196991d0610f9e 86457ca76fdf502d4932d372e6cd49ca751e575f)
git cherry-pick -x $ancref..86457ca76fdf502d4932d372e6cd49ca751e575f

bernhardmgruber added a commit that referenced this pull request Jan 31, 2025
* Support FP16 traits on CTK 12.0
* Only enable constexpr limits when supported
* Support float_eq on CTK < 12.2
bernhardmgruber added a commit that referenced this pull request Jan 31, 2025
* Support FP16 traits on CTK 12.0
* Only enable constexpr limits when supported
* Support float_eq on CTK < 12.2
miscco pushed a commit that referenced this pull request Jan 31, 2025
* Support FP16 traits on CTK 12.0
* Only enable constexpr limits when supported
* Support float_eq on CTK < 12.2
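
For readers skimming the commit list, the first bullet ("Support FP16 traits on CTK 12.0") boils down to keying the trait specializations on the same macro that signals the type is available. The sketch below only illustrates that pattern and is not the libcu++ source: `DEMO_HAS_FP16`, `demo_half`, and `demo_is_floating_point` are hypothetical stand-ins for `_CCCL_HAS_NVFP16`, `__half`, and the real trait, chosen so the snippet compiles without the CUDA toolkit.

```cpp
#include <cassert>
#include <type_traits>

// Hypothetical availability macro, standing in for _CCCL_HAS_NVFP16.
#define DEMO_HAS_FP16 1

// Hypothetical 16-bit storage type, standing in for __half.
struct demo_half
{
  unsigned short bits;
};

// Primary template: defer to the standard trait for built-in types.
template <class T>
struct demo_is_floating_point : std::is_floating_point<T>
{};

// The specialization is guarded by the availability macro itself, so
// "type is available" and "trait reports it" cannot disagree.
#if defined(DEMO_HAS_FP16)
template <>
struct demo_is_floating_point<demo_half> : std::true_type
{};
#endif

// Small helper so the result can be checked at run time as well.
constexpr bool demo_half_is_floating_point()
{
  return demo_is_floating_point<demo_half>::value;
}

static_assert(demo_is_floating_point<float>::value, "built-in float is floating point");
static_assert(!demo_is_floating_point<int>::value, "int is not floating point");
```

With a single guard macro, downstream code that tests the availability macro gets the same answer from the trait. If the guard used a second, stricter macro instead, the two could diverge, which is the situation the PR description reports for CTK < 12.2.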