Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add b200 policies for cub.device.partition.flagged,if,three_way #3617

Draft
wants to merge 5 commits into
base: main
Choose a base branch
from

Conversation

bernhardmgruber
Copy link
Contributor

@bernhardmgruber bernhardmgruber commented Jan 30, 2025

@bernhardmgruber bernhardmgruber requested a review from a team as a code owner January 30, 2025 19:09
@bernhardmgruber bernhardmgruber marked this pull request as draft January 30, 2025 19:09
Copy link

copy-pr-bot bot commented Jan 30, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

Copy link
Contributor

🟨 CI finished in 1h 38m: Pass: 98%/89 | Total: 2d 13h | Avg: 41m 29s | Max: 1h 17m | Hits: 291%/10936
  • 🟨 cub: Pass: 97%/44 | Total: 1d 13h | Avg: 51m 44s | Max: 1h 15m | Hits: 355%/3552

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/42  | Total:  1d 11h | Avg: 51m 22s | Max:  1h 15m | Hits: 355%/3552  
      🟩 arm64              Pass: 100%/2   | Total:  1h 58m | Avg: 59m 23s | Max:  1h 00m
    🔍 ctk: 12.6 🔍
      🟩 12.0               Pass: 100%/5   | Total:  4h 52m | Avg: 58m 32s | Max:  1h 00m | Hits: 355%/888   
      🟩 12.5               Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 09m
      🔍 12.6               Pass:  97%/37  | Total:  1d 06h | Avg: 49m 54s | Max:  1h 15m | Hits: 355%/2664  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 05m
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 52m | Avg: 58m 32s | Max:  1h 00m | Hits: 355%/888   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 09m
      🔍 nvcc12.6           Pass:  97%/35  | Total:  1d 04h | Avg: 49m 11s | Max:  1h 15m | Hits: 355%/2664  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 05m
      🔍 nvcc               Pass:  97%/42  | Total:  1d 11h | Avg: 51m 14s | Max:  1h 15m | Hits: 355%/3552  
    🔍 cxx: GCC13 🔍
      🟩 Clang14            Pass: 100%/4   | Total:  3h 49m | Avg: 57m 17s | Max: 58m 25s
      🟩 Clang15            Pass: 100%/2   | Total:  1h 53m | Avg: 56m 39s | Max: 57m 22s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 53m | Avg: 56m 51s | Max: 59m 42s
      🟩 Clang17            Pass: 100%/2   | Total:  1h 52m | Avg: 56m 23s | Max: 59m 49s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 45m | Avg: 49m 25s | Max:  1h 05m
      🟩 GCC7               Pass: 100%/2   | Total:  1h 56m | Avg: 58m 18s | Max: 59m 36s
      🟩 GCC8               Pass: 100%/1   | Total: 51m 44s | Avg: 51m 44s | Max: 51m 44s
      🟩 GCC9               Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m
      🟩 GCC10              Pass: 100%/2   | Total:  1h 57m | Avg: 58m 45s | Max:  1h 02m
      🟩 GCC11              Pass: 100%/2   | Total:  1h 55m | Avg: 57m 40s | Max:  1h 00m
      🟩 GCC12              Pass: 100%/4   | Total:  2h 39m | Avg: 39m 57s | Max:  1h 02m
      🔍 GCC13              Pass:  87%/8   | Total:  4h 26m | Avg: 33m 15s | Max:  1h 01m
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 11m | Hits: 355%/1776  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 15m | Hits: 355%/1776  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 09m
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/17  | Total: 15h 14m | Avg: 53m 49s | Max:  1h 05m
      🔍 GCC                Pass:  95%/21  | Total: 15h 48m | Avg: 45m 10s | Max:  1h 02m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 35m | Avg:  1h 08m | Max:  1h 15m | Hits: 355%/3552  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 09m
    🔍 gpu: v100 🔍
      🟩 h100               Pass: 100%/2   | Total: 42m 51s | Avg: 21m 25s | Max: 23m 12s
      🔍 v100               Pass:  97%/42  | Total:  1d 13h | Avg: 53m 11s | Max:  1h 15m | Hits: 355%/3552  
    🚨 jobs: DeviceLaunch 🚨
      🟩 Build              Pass: 100%/37  | Total:  1d 11h | Avg: 57m 55s | Max:  1h 15m | Hits: 355%/3552  
      🔥 DeviceLaunch       Pass:   0%/1   | Total:  3m 45s | Avg:  3m 45s | Max:  3m 45s
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 29s | Avg: 17m 29s | Max: 17m 29s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 07m | Avg: 22m 21s | Max: 24m 35s
      🟩 TestGPU            Pass: 100%/2   | Total: 45m 21s | Avg: 22m 40s | Max: 23m 21s
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/20  | Total: 20h 07m | Avg:  1h 00m | Max:  1h 15m | Hits: 356%/2664  
      🔍 20                 Pass:  95%/24  | Total: 17h 49m | Avg: 44m 34s | Max:  1h 10m | Hits: 354%/888   
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 42m 51s | Avg: 21m 25s | Max: 23m 12s
      🟩 90a                Pass: 100%/1   | Total: 23m 27s | Avg: 23m 27s | Max: 23m 27s
    
  • 🟩 thrust: Pass: 100%/42 | Total: 22h 36m | Avg: 32m 17s | Max: 1h 17m | Hits: 261%/7384

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 39m 45s | Avg: 19m 52s | Max: 26m 06s
    🟩 cpu
      🟩 amd64              Pass: 100%/40  | Total: 21h 39m | Avg: 32m 29s | Max:  1h 17m | Hits: 261%/7384  
      🟩 arm64              Pass: 100%/2   | Total: 56m 30s | Avg: 28m 15s | Max: 29m 32s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 02m | Avg: 36m 35s | Max: 57m 27s | Hits: 260%/1846  
      🟩 12.5               Pass: 100%/2   | Total:  1h 44m | Avg: 52m 00s | Max: 54m 59s
      🟩 12.6               Pass: 100%/35  | Total: 17h 49m | Avg: 30m 32s | Max:  1h 17m | Hits: 261%/5538  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 52m 57s | Avg: 26m 28s | Max: 28m 36s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 02m | Avg: 36m 35s | Max: 57m 27s | Hits: 260%/1846  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 44m | Avg: 52m 00s | Max: 54m 59s
      🟩 nvcc12.6           Pass: 100%/33  | Total: 16h 56m | Avg: 30m 47s | Max:  1h 17m | Hits: 261%/5538  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 52m 57s | Avg: 26m 28s | Max: 28m 36s
      🟩 nvcc               Pass: 100%/40  | Total: 21h 43m | Avg: 32m 34s | Max:  1h 17m | Hits: 261%/7384  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 59m | Avg: 29m 58s | Max: 30m 27s
      🟩 Clang15            Pass: 100%/2   | Total: 57m 34s | Avg: 28m 47s | Max: 29m 31s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 02m | Avg: 31m 26s | Max: 32m 58s
      🟩 Clang17            Pass: 100%/2   | Total:  1h 02m | Avg: 31m 26s | Max: 32m 13s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 43m | Avg: 23m 23s | Max: 32m 42s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 02m | Avg: 31m 06s | Max: 31m 35s
      🟩 GCC8               Pass: 100%/1   | Total: 32m 10s | Avg: 32m 10s | Max: 32m 10s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 04m | Avg: 32m 08s | Max: 33m 50s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 06m | Avg: 33m 07s | Max: 35m 12s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 46s | Max: 34m 01s
      🟩 GCC12              Pass: 100%/2   | Total:  1h 07m | Avg: 33m 41s | Max: 34m 44s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 00m | Avg: 22m 34s | Max: 35m 12s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 54m | Avg: 57m 01s | Max: 57m 27s | Hits: 260%/3692  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 17m | Hits: 261%/3692  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 44m | Avg: 52m 00s | Max: 54m 59s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  7h 46m | Avg: 27m 27s | Max: 32m 58s
      🟩 GCC                Pass: 100%/19  | Total:  8h 56m | Avg: 28m 13s | Max: 35m 12s
      🟩 MSVC               Pass: 100%/4   | Total:  4h 08m | Avg:  1h 02m | Max:  1h 17m | Hits: 261%/7384  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 44m | Avg: 52m 00s | Max: 54m 59s
    🟩 gpu
      🟩 v100               Pass: 100%/42  | Total: 22h 36m | Avg: 32m 17s | Max:  1h 17m | Hits: 261%/7384  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 21h 41m | Avg: 35m 10s | Max:  1h 17m | Hits: 261%/7384  
      🟩 TestCPU            Pass: 100%/2   | Total: 16m 30s | Avg:  8m 15s | Max:  8m 16s
      🟩 TestGPU            Pass: 100%/3   | Total: 38m 21s | Avg: 12m 47s | Max: 13m 39s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 19m 09s | Avg: 19m 09s | Max: 19m 09s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 12h 05m | Avg: 36m 16s | Max: 57m 36s | Hits: 261%/5538  
      🟩 20                 Pass: 100%/20  | Total:  9h 50m | Avg: 29m 32s | Max:  1h 17m | Hits: 260%/1846  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 12m 09s | Avg: 6m 04s | Max: 9m 51s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 12m 09s | Avg:  6m 04s | Max:  9m 51s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 12m 09s | Avg:  6m 04s | Max:  9m 51s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 12m 09s | Avg:  6m 04s | Max:  9m 51s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 12m 09s | Avg:  6m 04s | Max:  9m 51s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 12m 09s | Avg:  6m 04s | Max:  9m 51s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 12m 09s | Avg:  6m 04s | Max:  9m 51s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 12m 09s | Avg:  6m 04s | Max:  9m 51s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 18s | Avg:  2m 18s | Max:  2m 18s
      🟩 Test               Pass: 100%/1   | Total:  9m 51s | Avg:  9m 51s | Max:  9m 51s
    
  • 🟩 python: Pass: 100%/1 | Total: 47m 10s | Avg: 47m 10s | Max: 47m 10s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 47m 10s | Avg: 47m 10s | Max: 47m 10s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 47m 10s | Avg: 47m 10s | Max: 47m 10s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 47m 10s | Avg: 47m 10s | Max: 47m 10s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 47m 10s | Avg: 47m 10s | Max: 47m 10s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 47m 10s | Avg: 47m 10s | Max: 47m 10s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 47m 10s | Avg: 47m 10s | Max: 47m 10s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 47m 10s | Avg: 47m 10s | Max: 47m 10s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 47m 10s | Avg: 47m 10s | Max: 47m 10s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 89)

# Runner
65 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
8 windows-amd64-cpu16
4 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1

…ass template parameter to Nominal4BItemsToItems call
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: In Progress
Development

Successfully merging this pull request may close these issues.

2 participants