Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add b200 policies for reduce #3612

Merged
merged 2 commits into from
Jan 31, 2025
Merged

Conversation

bernhardmgruber
Copy link
Contributor

No description provided.

Copy link
Contributor

🟩 CI finished in 2h 16m: Pass: 100%/89 | Total: 2d 14h | Avg: 41m 54s | Max: 1h 13m | Hits: 274%/10936
  • 🟩 cub: Pass: 100%/44 | Total: 1d 14h | Avg: 52m 49s | Max: 1h 13m | Hits: 363%/3552

    🟩 cpu
      🟩 amd64              Pass: 100%/42  | Total:  1d 12h | Avg: 52m 22s | Max:  1h 13m | Hits: 363%/3552  
      🟩 arm64              Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 03m
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 06m | Avg:  1h 01m | Max:  1h 07m | Hits: 363%/888   
      🟩 12.5               Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 10m
      🟩 12.6               Pass: 100%/37  | Total:  1d 07h | Avg: 50m 45s | Max:  1h 13m | Hits: 363%/2664  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 56m | Avg: 58m 11s | Max: 59m 00s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 06m | Avg:  1h 01m | Max:  1h 07m | Hits: 363%/888   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 10m
      🟩 nvcc12.6           Pass: 100%/35  | Total:  1d 05h | Avg: 50m 20s | Max:  1h 13m | Hits: 363%/2664  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 56m | Avg: 58m 11s | Max: 59m 00s
      🟩 nvcc               Pass: 100%/42  | Total:  1d 12h | Avg: 52m 34s | Max:  1h 13m | Hits: 363%/3552  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 44m | Avg: 56m 04s | Max:  1h 00m
      🟩 Clang15            Pass: 100%/2   | Total:  1h 49m | Avg: 54m 41s | Max: 57m 12s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 53m | Avg: 56m 54s | Max: 59m 10s
      🟩 Clang17            Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 07m
      🟩 Clang18            Pass: 100%/7   | Total:  5h 36m | Avg: 48m 05s | Max:  1h 01m
      🟩 GCC7               Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 09m
      🟩 GCC8               Pass: 100%/1   | Total: 53m 19s | Avg: 53m 19s | Max: 53m 19s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 59m | Avg: 59m 58s | Max:  1h 01m
      🟩 GCC10              Pass: 100%/2   | Total:  1h 55m | Avg: 57m 31s | Max: 59m 23s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 54m | Avg: 57m 20s | Max:  1h 01m
      🟩 GCC12              Pass: 100%/4   | Total:  2h 55m | Avg: 43m 59s | Max:  1h 02m
      🟩 GCC13              Pass: 100%/8   | Total:  4h 42m | Avg: 35m 18s | Max:  1h 03m
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 07m | Hits: 363%/1776  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 13m | Hits: 364%/1776  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 10m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 15h 15m | Avg: 53m 51s | Max:  1h 07m
      🟩 GCC                Pass: 100%/21  | Total: 16h 31m | Avg: 47m 11s | Max:  1h 09m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 38m | Avg:  1h 09m | Max:  1h 13m | Hits: 363%/3552  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 10m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 56m 15s | Avg: 28m 07s | Max: 29m 12s
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 05m | Avg: 30m 40s | Max: 58m 30s
      🟩 v100               Pass: 100%/34  | Total:  1d 09h | Avg: 59m 30s | Max:  1h 13m | Hits: 363%/3552  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 12h | Avg: 58m 26s | Max:  1h 13m | Hits: 363%/3552  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 20m 46s | Avg: 20m 46s | Max: 20m 46s
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 20s | Avg: 17m 20s | Max: 17m 20s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 21m | Avg: 27m 00s | Max: 29m 12s
      🟩 TestGPU            Pass: 100%/2   | Total: 43m 29s | Avg: 21m 44s | Max: 22m 02s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 56m 15s | Avg: 28m 07s | Max: 29m 12s
      🟩 90a                Pass: 100%/1   | Total: 24m 51s | Avg: 24m 51s | Max: 24m 51s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 20h 13m | Avg:  1h 00m | Max:  1h 10m | Hits: 364%/2664  
      🟩 20                 Pass: 100%/24  | Total: 18h 31m | Avg: 46m 18s | Max:  1h 13m | Hits: 362%/888   
    
  • 🟩 thrust: Pass: 100%/42 | Total: 22h 46m | Avg: 32m 32s | Max: 1h 02m | Hits: 231%/7384

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 47m 21s | Avg: 23m 40s | Max: 33m 19s
    🟩 cpu
      🟩 amd64              Pass: 100%/40  | Total: 21h 44m | Avg: 32m 36s | Max:  1h 02m | Hits: 231%/7384  
      🟩 arm64              Pass: 100%/2   | Total:  1h 02m | Avg: 31m 16s | Max: 33m 02s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 02m | Avg: 36m 31s | Max: 53m 08s | Hits: 228%/1846  
      🟩 12.5               Pass: 100%/2   | Total:  1h 47m | Avg: 53m 33s | Max: 55m 43s
      🟩 12.6               Pass: 100%/35  | Total: 17h 56m | Avg: 30m 46s | Max:  1h 02m | Hits: 232%/5538  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 50m 02s | Avg: 25m 01s | Max: 25m 01s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 02m | Avg: 36m 31s | Max: 53m 08s | Hits: 228%/1846  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 47m | Avg: 53m 33s | Max: 55m 43s
      🟩 nvcc12.6           Pass: 100%/33  | Total: 17h 06m | Avg: 31m 07s | Max:  1h 02m | Hits: 232%/5538  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 50m 02s | Avg: 25m 01s | Max: 25m 01s
      🟩 nvcc               Pass: 100%/40  | Total: 21h 56m | Avg: 32m 54s | Max:  1h 02m | Hits: 231%/7384  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 12m | Avg: 33m 05s | Max: 38m 50s
      🟩 Clang15            Pass: 100%/2   | Total:  1h 08m | Avg: 34m 18s | Max: 36m 49s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 03m | Avg: 31m 55s | Max: 33m 06s
      🟩 Clang17            Pass: 100%/2   | Total: 59m 29s | Avg: 29m 44s | Max: 30m 16s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 37m | Avg: 22m 27s | Max: 30m 43s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 03m | Avg: 31m 37s | Max: 33m 11s
      🟩 GCC8               Pass: 100%/1   | Total: 32m 41s | Avg: 32m 41s | Max: 32m 41s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 09m | Avg: 34m 58s | Max: 35m 59s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 02m | Avg: 31m 17s | Max: 32m 03s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 45s | Max: 32m 49s
      🟩 GCC12              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 55s | Max: 32m 29s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 02m | Avg: 22m 45s | Max: 33m 19s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 55m | Avg: 57m 51s | Max:  1h 02m | Hits: 228%/3692  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 02m | Hits: 234%/3692  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 47m | Avg: 53m 33s | Max: 55m 43s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  8h 01m | Avg: 28m 19s | Max: 38m 50s
      🟩 GCC                Pass: 100%/19  | Total:  8h 57m | Avg: 28m 18s | Max: 35m 59s
      🟩 MSVC               Pass: 100%/4   | Total:  4h 00m | Avg:  1h 00m | Max:  1h 02m | Hits: 231%/7384  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 47m | Avg: 53m 33s | Max: 55m 43s
    🟩 gpu
      🟩 rtx4090            Pass: 100%/8   | Total:  2h 25m | Avg: 18m 14s | Max: 33m 19s
      🟩 v100               Pass: 100%/34  | Total: 20h 20m | Avg: 35m 54s | Max:  1h 02m | Hits: 231%/7384  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 21h 55m | Avg: 35m 33s | Max:  1h 02m | Hits: 231%/7384  
      🟩 TestCPU            Pass: 100%/2   | Total: 15m 24s | Avg:  7m 42s | Max:  7m 48s
      🟩 TestGPU            Pass: 100%/3   | Total: 35m 51s | Avg: 11m 57s | Max: 14m 02s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 19m 50s | Avg: 19m 50s | Max: 19m 50s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 12h 15m | Avg: 36m 47s | Max:  1h 02m | Hits: 232%/5538  
      🟩 20                 Pass: 100%/20  | Total:  9h 43m | Avg: 29m 10s | Max:  1h 02m | Hits: 228%/1846  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 7m 36s | Avg: 3m 48s | Max: 5m 32s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  7m 36s | Avg:  3m 48s | Max:  5m 32s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  7m 36s | Avg:  3m 48s | Max:  5m 32s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  7m 36s | Avg:  3m 48s | Max:  5m 32s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  7m 36s | Avg:  3m 48s | Max:  5m 32s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  7m 36s | Avg:  3m 48s | Max:  5m 32s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  7m 36s | Avg:  3m 48s | Max:  5m 32s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total:  7m 36s | Avg:  3m 48s | Max:  5m 32s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 04s | Avg:  2m 04s | Max:  2m 04s
      🟩 Test               Pass: 100%/1   | Total:  5m 32s | Avg:  5m 32s | Max:  5m 32s
    
  • 🟩 python: Pass: 100%/1 | Total: 30m 21s | Avg: 30m 21s | Max: 30m 21s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 30m 21s | Avg: 30m 21s | Max: 30m 21s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 30m 21s | Avg: 30m 21s | Max: 30m 21s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 30m 21s | Avg: 30m 21s | Max: 30m 21s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 30m 21s | Avg: 30m 21s | Max: 30m 21s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 30m 21s | Avg: 30m 21s | Max: 30m 21s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 30m 21s | Avg: 30m 21s | Max: 30m 21s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 30m 21s | Avg: 30m 21s | Max: 30m 21s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 30m 21s | Avg: 30m 21s | Max: 30m 21s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 89)

# Runner
65 linux-amd64-cpu16
8 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1
1 linux-amd64-gpu-h100-latest-1

@miscco miscco merged commit de1c340 into NVIDIA:main Jan 31, 2025
104 of 108 checks passed
Copy link
Contributor

Git push to origin failed for branch/2.8.x with exitcode 128

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

4 participants