Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Replace CUB iterators by Thrust ones #3480

Merged
merged 2 commits into from
Feb 5, 2025

Conversation

bernhardmgruber
Copy link
Contributor

@bernhardmgruber bernhardmgruber commented Jan 22, 2025

Copy link

copy-pr-bot bot commented Jan 22, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@bernhardmgruber
Copy link
Contributor Author

/ok to test

Copy link
Contributor

github-actions bot commented Feb 5, 2025

🟩 CI finished in 1h 53m: Pass: 100%/90 | Total: 2d 17h | Avg: 43m 51s | Max: 1h 17m | Hits: 174%/13398
  • 🟩 cub: Pass: 100%/44 | Total: 1d 17h | Avg: 56m 38s | Max: 1h 17m | Hits: 88%/4168

    🟩 cpu
      🟩 amd64              Pass: 100%/42  | Total:  1d 15h | Avg: 56m 19s | Max:  1h 17m | Hits:  88%/4168  
      🟩 arm64              Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 07m | Avg:  1h 01m | Max:  1h 04m | Hits:  90%/1042  
      🟩 12.5               Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 17m
      🟩 12.8               Pass: 100%/37  | Total:  1d 09h | Avg: 55m 01s | Max:  1h 16m | Hits:  87%/3126  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 02m
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 07m | Avg:  1h 01m | Max:  1h 04m | Hits:  90%/1042  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 17m
      🟩 nvcc12.8           Pass: 100%/35  | Total:  1d 07h | Avg: 54m 38s | Max:  1h 16m | Hits:  87%/3126  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 02m
      🟩 nvcc               Pass: 100%/42  | Total:  1d 15h | Avg: 56m 23s | Max:  1h 17m | Hits:  88%/4168  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 04m | Avg:  1h 01m | Max:  1h 03m
      🟩 Clang15            Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 07m
      🟩 Clang16            Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 06m
      🟩 Clang17            Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 06m
      🟩 Clang18            Pass: 100%/7   | Total:  6h 05m | Avg: 52m 09s | Max:  1h 06m
      🟩 GCC7               Pass: 100%/2   | Total:  1h 54m | Avg: 57m 23s | Max: 58m 24s
      🟩 GCC8               Pass: 100%/1   | Total:  1h 03m | Avg:  1h 03m | Max:  1h 03m
      🟩 GCC9               Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 05m
      🟩 GCC10              Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 02m
      🟩 GCC11              Pass: 100%/2   | Total:  1h 59m | Avg: 59m 39s | Max: 59m 49s
      🟩 GCC12              Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 04m
      🟩 GCC13              Pass: 100%/10  | Total:  6h 39m | Avg: 39m 55s | Max:  1h 09m
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 09m | Hits:  90%/2084  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 16m | Hits:  86%/2084  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 17m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 16h 27m | Avg: 58m 03s | Max:  1h 07m
      🟩 GCC                Pass: 100%/21  | Total: 17h 51m | Avg: 51m 02s | Max:  1h 09m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 44m | Avg:  1h 11m | Max:  1h 16m | Hits:  88%/4168  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 17m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 51m 24s | Avg: 25m 42s | Max: 27m 22s
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 12h | Avg:  1h 03m | Max:  1h 17m | Hits:  88%/4168  
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 29m | Avg: 33m 41s | Max:  1h 09m
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 14h | Avg:  1h 03m | Max:  1h 17m | Hits:  88%/4168  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 22m 50s | Avg: 22m 50s | Max: 22m 50s
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 51s | Avg: 17m 51s | Max: 17m 51s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 13m | Avg: 24m 25s | Max: 26m 15s
      🟩 TestGPU            Pass: 100%/2   | Total: 43m 44s | Avg: 21m 52s | Max: 22m 28s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 51m 24s | Avg: 25m 42s | Max: 27m 22s
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 04m | Avg:  1h 04m | Max:  1h 04m
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 21h 00m | Avg:  1h 03m | Max:  1h 13m | Hits:  90%/3126  
      🟩 20                 Pass: 100%/24  | Total: 20h 31m | Avg: 51m 18s | Max:  1h 17m | Hits:  82%/1042  
    
  • 🟩 thrust: Pass: 100%/43 | Total: 23h 42m | Avg: 33m 04s | Max: 1h 03m | Hits: 212%/9230

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 38m 37s | Avg: 19m 18s | Max: 27m 33s
    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total: 22h 42m | Avg: 33m 13s | Max:  1h 03m | Hits: 212%/9230  
      🟩 arm64              Pass: 100%/2   | Total: 59m 55s | Avg: 29m 57s | Max: 31m 40s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 00m | Avg: 36m 11s | Max: 56m 31s | Hits: 174%/1846  
      🟩 12.5               Pass: 100%/2   | Total:  1h 54m | Avg: 57m 07s | Max: 59m 44s
      🟩 12.8               Pass: 100%/36  | Total: 18h 47m | Avg: 31m 18s | Max:  1h 03m | Hits: 222%/7384  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 55m 48s | Avg: 27m 54s | Max: 29m 50s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 00m | Avg: 36m 11s | Max: 56m 31s | Hits: 174%/1846  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 54m | Avg: 57m 07s | Max: 59m 44s
      🟩 nvcc12.8           Pass: 100%/34  | Total: 17h 51m | Avg: 31m 30s | Max:  1h 03m | Hits: 222%/7384  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 55m 48s | Avg: 27m 54s | Max: 29m 50s
      🟩 nvcc               Pass: 100%/41  | Total: 22h 46m | Avg: 33m 19s | Max:  1h 03m | Hits: 212%/9230  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 04m | Avg: 31m 14s | Max: 33m 01s
      🟩 Clang15            Pass: 100%/2   | Total:  1h 01m | Avg: 30m 37s | Max: 31m 36s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 06m | Avg: 33m 20s | Max: 35m 31s
      🟩 Clang17            Pass: 100%/2   | Total: 59m 15s | Avg: 29m 37s | Max: 29m 49s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 44m | Avg: 23m 33s | Max: 31m 11s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 02m | Avg: 31m 14s | Max: 31m 42s
      🟩 GCC8               Pass: 100%/1   | Total: 33m 29s | Avg: 33m 29s | Max: 33m 29s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 02m | Avg: 31m 29s | Max: 31m 40s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 05m | Avg: 32m 55s | Max: 35m 17s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 07m | Avg: 33m 52s | Max: 36m 55s
      🟩 GCC12              Pass: 100%/2   | Total:  1h 09m | Avg: 34m 54s | Max: 36m 25s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 13m | Avg: 24m 12s | Max: 37m 26s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 56m | Avg: 58m 02s | Max: 59m 33s | Hits: 174%/3692  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 39m | Avg: 53m 00s | Max:  1h 03m | Hits: 238%/5538  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 54m | Avg: 57m 07s | Max: 59m 44s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  7h 57m | Avg: 28m 03s | Max: 35m 31s
      🟩 GCC                Pass: 100%/19  | Total:  9h 15m | Avg: 29m 15s | Max: 37m 26s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 35m | Avg: 55m 00s | Max:  1h 03m | Hits: 212%/9230  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 54m | Avg: 57m 07s | Max: 59m 44s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/33  | Total: 19h 37m | Avg: 35m 40s | Max: 59m 44s | Hits: 174%/5538  
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 04m | Avg: 24m 29s | Max:  1h 03m | Hits: 270%/3692  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 22h 17m | Avg: 36m 08s | Max:  1h 03m | Hits: 174%/7384  
      🟩 TestCPU            Pass: 100%/3   | Total: 51m 39s | Avg: 17m 13s | Max: 35m 44s | Hits: 365%/1846  
      🟩 TestGPU            Pass: 100%/3   | Total: 33m 12s | Avg: 11m 04s | Max: 11m 34s
    🟩 sm
      🟩 90;90a;100         Pass: 100%/1   | Total: 34m 02s | Avg: 34m 02s | Max: 34m 02s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 12h 14m | Avg: 36m 44s | Max: 59m 33s | Hits: 174%/5538  
      🟩 20                 Pass: 100%/21  | Total: 10h 48m | Avg: 30m 53s | Max:  1h 03m | Hits: 270%/3692  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 7m 20s | Avg: 3m 40s | Max: 4m 59s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  7m 20s | Avg:  3m 40s | Max:  4m 59s
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total:  7m 20s | Avg:  3m 40s | Max:  4m 59s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total:  7m 20s | Avg:  3m 40s | Max:  4m 59s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  7m 20s | Avg:  3m 40s | Max:  4m 59s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  7m 20s | Avg:  3m 40s | Max:  4m 59s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  7m 20s | Avg:  3m 40s | Max:  4m 59s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total:  7m 20s | Avg:  3m 40s | Max:  4m 59s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 21s | Avg:  2m 21s | Max:  2m 21s
      🟩 Test               Pass: 100%/1   | Total:  4m 59s | Avg:  4m 59s | Max:  4m 59s
    
  • 🟩 python: Pass: 100%/1 | Total: 26m 17s | Avg: 26m 17s | Max: 26m 17s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 26m 17s | Avg: 26m 17s | Max: 26m 17s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 26m 17s | Avg: 26m 17s | Max: 26m 17s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 26m 17s | Avg: 26m 17s | Max: 26m 17s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 26m 17s | Avg: 26m 17s | Max: 26m 17s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 26m 17s | Avg: 26m 17s | Max: 26m 17s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 26m 17s | Avg: 26m 17s | Max: 26m 17s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 26m 17s | Avg: 26m 17s | Max: 26m 17s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 26m 17s | Avg: 26m 17s | Max: 26m 17s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 90)

# Runner
65 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1
1 linux-amd64-gpu-h100-latest-1

@bernhardmgruber bernhardmgruber marked this pull request as ready for review February 5, 2025 09:47
@bernhardmgruber bernhardmgruber requested review from a team as code owners February 5, 2025 09:47
@bernhardmgruber bernhardmgruber enabled auto-merge (squash) February 5, 2025 12:12
Copy link
Contributor

github-actions bot commented Feb 5, 2025

🟩 CI finished in 3h 03m: Pass: 100%/90 | Total: 2d 23h | Avg: 47m 24s | Max: 1h 51m | Hits: 127%/13398
  • 🟩 cub: Pass: 100%/44 | Total: 1d 18h | Avg: 58m 35s | Max: 1h 51m | Hits: 86%/4168

    🟩 cpu
      🟩 amd64              Pass: 100%/42  | Total:  1d 16h | Avg: 58m 25s | Max:  1h 51m | Hits:  86%/4168  
      🟩 arm64              Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 03m
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 06m | Avg:  1h 01m | Max:  1h 04m | Hits:  88%/1042  
      🟩 12.5               Pass: 100%/2   | Total:  2h 26m | Avg:  1h 13m | Max:  1h 18m
      🟩 12.8               Pass: 100%/37  | Total:  1d 11h | Avg: 57m 26s | Max:  1h 51m | Hits:  85%/3126  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 06m
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 06m | Avg:  1h 01m | Max:  1h 04m | Hits:  88%/1042  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 26m | Avg:  1h 13m | Max:  1h 18m
      🟩 nvcc12.8           Pass: 100%/35  | Total:  1d 09h | Avg: 57m 03s | Max:  1h 51m | Hits:  85%/3126  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 06m
      🟩 nvcc               Pass: 100%/42  | Total:  1d 16h | Avg: 58m 19s | Max:  1h 51m | Hits:  86%/4168  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 56m | Avg: 59m 10s | Max:  1h 02m
      🟩 Clang15            Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 06m
      🟩 Clang16            Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 05m
      🟩 Clang17            Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 07m
      🟩 Clang18            Pass: 100%/7   | Total:  5h 56m | Avg: 50m 58s | Max:  1h 06m
      🟩 GCC7               Pass: 100%/2   | Total:  1h 58m | Avg: 59m 22s | Max:  1h 00m
      🟩 GCC8               Pass: 100%/1   | Total:  1h 04m | Avg:  1h 04m | Max:  1h 04m
      🟩 GCC9               Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 02m
      🟩 GCC10              Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 05m
      🟩 GCC11              Pass: 100%/2   | Total:  1h 56m | Avg: 58m 28s | Max: 58m 46s
      🟩 GCC12              Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 04m
      🟩 GCC13              Pass: 100%/10  | Total:  8h 15m | Avg: 49m 31s | Max:  1h 51m
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 12m | Hits:  88%/2084  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 15m | Hits:  84%/2084  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 26m | Avg:  1h 13m | Max:  1h 18m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 16h 15m | Avg: 57m 21s | Max:  1h 07m
      🟩 GCC                Pass: 100%/21  | Total: 19h 30m | Avg: 55m 42s | Max:  1h 51m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 46m | Avg:  1h 11m | Max:  1h 15m | Hits:  86%/4168  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 26m | Avg:  1h 13m | Max:  1h 18m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 54m 36s | Avg: 27m 18s | Max: 30m 03s
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 12h | Avg:  1h 03m | Max:  1h 18m | Hits:  86%/4168  
      🟩 rtxa6000           Pass: 100%/8   | Total:  5h 49m | Avg: 43m 44s | Max:  1h 51m
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 14h | Avg:  1h 03m | Max:  1h 18m | Hits:  86%/4168  
      🟩 DeviceLaunch       Pass: 100%/1   | Total:  1h 51m | Avg:  1h 51m | Max:  1h 51m
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 19s | Avg: 16m 19s | Max: 16m 19s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 11m | Avg: 23m 48s | Max: 24m 33s
      🟩 TestGPU            Pass: 100%/2   | Total: 39m 44s | Avg: 19m 52s | Max: 20m 40s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 54m 36s | Avg: 27m 18s | Max: 30m 03s
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 13m | Avg:  1h 13m | Max:  1h 13m
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 21h 04m | Avg:  1h 03m | Max:  1h 13m | Hits:  88%/3126  
      🟩 20                 Pass: 100%/24  | Total: 21h 53m | Avg: 54m 43s | Max:  1h 51m | Hits:  80%/1042  
    
  • 🟩 thrust: Pass: 100%/43 | Total: 1d 03h | Avg: 38m 22s | Max: 1h 17m | Hits: 146%/9230

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 42m 39s | Avg: 21m 19s | Max: 31m 36s
    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total:  1d 02h | Avg: 38m 35s | Max:  1h 17m | Hits: 146%/9230  
      🟩 arm64              Pass: 100%/2   | Total:  1h 08m | Avg: 34m 03s | Max: 35m 39s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 30m | Avg: 42m 07s | Max:  1h 00m | Hits:  81%/1846  
      🟩 12.5               Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 15m
      🟩 12.8               Pass: 100%/36  | Total: 21h 30m | Avg: 35m 51s | Max:  1h 17m | Hits: 162%/7384  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 08m | Avg: 34m 10s | Max: 35m 44s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 30m | Avg: 42m 07s | Max:  1h 00m | Hits:  81%/1846  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 15m
      🟩 nvcc12.8           Pass: 100%/34  | Total: 20h 22m | Avg: 35m 57s | Max:  1h 17m | Hits: 162%/7384  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 08m | Avg: 34m 10s | Max: 35m 44s
      🟩 nvcc               Pass: 100%/41  | Total:  1d 02h | Avg: 38m 34s | Max:  1h 17m | Hits: 146%/9230  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 26m | Avg: 36m 33s | Max: 38m 12s
      🟩 Clang15            Pass: 100%/2   | Total:  1h 12m | Avg: 36m 19s | Max: 37m 05s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 14m | Avg: 37m 10s | Max: 37m 26s
      🟩 Clang17            Pass: 100%/2   | Total:  1h 11m | Avg: 35m 59s | Max: 36m 12s
      🟩 Clang18            Pass: 100%/7   | Total:  3h 16m | Avg: 28m 07s | Max: 39m 21s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 16m | Avg: 38m 01s | Max: 38m 02s
      🟩 GCC8               Pass: 100%/1   | Total: 34m 24s | Avg: 34m 24s | Max: 34m 24s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 19m | Avg: 39m 41s | Max: 39m 46s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 16m | Avg: 38m 24s | Max: 39m 50s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 20m | Avg: 40m 29s | Max: 40m 58s
      🟩 GCC12              Pass: 100%/2   | Total:  1h 20m | Avg: 40m 18s | Max: 41m 16s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 32m | Avg: 26m 36s | Max: 39m 40s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 01m | Hits: 102%/3692  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 55m | Avg: 58m 39s | Max:  1h 17m | Hits: 176%/5538  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 15m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  9h 22m | Avg: 33m 03s | Max: 39m 21s
      🟩 GCC                Pass: 100%/19  | Total: 10h 41m | Avg: 33m 44s | Max: 41m 16s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 58m | Avg: 59m 39s | Max:  1h 17m | Hits: 146%/9230  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 15m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/33  | Total: 23h 01m | Avg: 41m 51s | Max:  1h 15m | Hits:  95%/5538  
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 28m | Avg: 26m 52s | Max:  1h 17m | Hits: 223%/3692  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 02h | Avg: 42m 24s | Max:  1h 17m | Hits:  91%/7384  
      🟩 TestCPU            Pass: 100%/3   | Total: 48m 00s | Avg: 16m 00s | Max: 32m 08s | Hits: 365%/1846  
      🟩 TestGPU            Pass: 100%/3   | Total: 32m 50s | Avg: 10m 56s | Max: 11m 16s
    🟩 sm
      🟩 90;90a;100         Pass: 100%/1   | Total: 37m 49s | Avg: 37m 49s | Max: 37m 49s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 14h 27m | Avg: 43m 21s | Max:  1h 15m | Hits:  95%/5538  
      🟩 20                 Pass: 100%/21  | Total: 12h 20m | Avg: 35m 15s | Max:  1h 17m | Hits: 223%/3692  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 11m 07s | Avg: 5m 33s | Max: 8m 50s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 11m 07s | Avg:  5m 33s | Max:  8m 50s
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 11m 07s | Avg:  5m 33s | Max:  8m 50s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 11m 07s | Avg:  5m 33s | Max:  8m 50s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 11m 07s | Avg:  5m 33s | Max:  8m 50s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 11m 07s | Avg:  5m 33s | Max:  8m 50s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 11m 07s | Avg:  5m 33s | Max:  8m 50s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 11m 07s | Avg:  5m 33s | Max:  8m 50s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 17s | Avg:  2m 17s | Max:  2m 17s
      🟩 Test               Pass: 100%/1   | Total:  8m 50s | Avg:  8m 50s | Max:  8m 50s
    
  • 🟩 python: Pass: 100%/1 | Total: 26m 43s | Avg: 26m 43s | Max: 26m 43s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 26m 43s | Avg: 26m 43s | Max: 26m 43s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 26m 43s | Avg: 26m 43s | Max: 26m 43s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 26m 43s | Avg: 26m 43s | Max: 26m 43s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 26m 43s | Avg: 26m 43s | Max: 26m 43s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 26m 43s | Avg: 26m 43s | Max: 26m 43s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 26m 43s | Avg: 26m 43s | Max: 26m 43s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 26m 43s | Avg: 26m 43s | Max: 26m 43s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 26m 43s | Avg: 26m 43s | Max: 26m 43s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 90)

# Runner
65 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1
1 linux-amd64-gpu-h100-latest-1

@bernhardmgruber bernhardmgruber merged commit f4545e0 into NVIDIA:main Feb 5, 2025
102 of 105 checks passed
@bernhardmgruber bernhardmgruber deleted the replace_cub_iterators branch February 5, 2025 13:59
bernhardmgruber added a commit to bernhardmgruber/cccl that referenced this pull request Feb 5, 2025
Also consider thrust::discard_iterator's value_type void

Fixes: NVIDIA#3261
miscco added a commit that referenced this pull request Feb 6, 2025
…place CUB iterators by Thrust ones (#3480) (#3697)

* Make thrust iterators work with NVRTC (#3676)

* Make thrust iterators work with NVRTC

As a drive-by, all iterator tags used in CUB and Thrust are replaced with ones from libcu++.

Co-authored-by: Michael Schellenberger Costa <[email protected]>

* More NVRTC fixes

* Replace CUB iterators by Thrust ones (#3480)

Also consider thrust::discard_iterator's value_type void

Fixes: #3261

---------

Co-authored-by: Michael Schellenberger Costa <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

3 participants