Add ForEachN from CUB to cccl/c. #2378

Merged on Oct 8, 2024 (18 commits)

@wmaxey (Member) commented Sep 6, 2024

Description

Adds the For Each algorithm to cccl/c. Later PRs will address the lack of iterator support.

Checklist

  • New or existing tests cover these changes.

@wmaxey requested a review from gevtushenko on September 6, 2024 00:52
github-actions bot (Contributor) commented Sep 6, 2024

🟩 CI finished in 14m 26s: Pass: 100%/1 | Total: 14m 26s | Avg: 14m 26s | Max: 14m 26s
  • 🟩 pycuda: Pass: 100%/1 | Total: 14m 26s | Avg: 14m 26s | Max: 14m 26s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 14m 26s | Avg: 14m 26s | Max: 14m 26s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 14m 26s | Avg: 14m 26s | Max: 14m 26s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 14m 26s | Avg: 14m 26s | Max: 14m 26s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 14m 26s | Avg: 14m 26s | Max: 14m 26s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 14m 26s | Avg: 14m 26s | Max: 14m 26s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 14m 26s | Avg: 14m 26s | Max: 14m 26s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 14m 26s | Avg: 14m 26s | Max: 14m 26s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 14m 26s | Avg: 14m 26s | Max: 14m 26s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
CUDA Experimental
pycuda
+/- CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 1)

# Runner
1 linux-amd64-gpu-v100-latest-1

@gevtushenko (Collaborator) left a comment

This PR has most of the things we need already! Let's add iterator support and mark it ready for review.

c/src/reduce.cu (outdated, resolved)
c/src/reduce.cu (outdated, resolved)
c/src/util/small_storage.h (outdated, resolved)
c/src/for.cu (outdated, resolved)
c/src/for.cu (outdated, resolved)
c/src/for.cu (outdated, resolved)
c/src/for.cu (outdated, resolved)
c/src/util/small_storage.cpp (outdated, resolved)
c/src/util/small_storage.cpp (outdated, resolved)
c/src/for.cu (outdated, resolved)
github-actions bot (Contributor) commented Oct 4, 2024

🟥 CI finished in 12m 33s: Pass: 0%/1 | Total: 12m 33s | Avg: 12m 33s | Max: 12m 33s
  • 🟥 pycuda: Pass: 0%/1 | Total: 12m 33s | Avg: 12m 33s | Max: 12m 33s

    🟥 cpu
      🟥 amd64              Pass:   0%/1   | Total: 12m 33s | Avg: 12m 33s | Max: 12m 33s
    🟥 ctk
      🟥 12.5               Pass:   0%/1   | Total: 12m 33s | Avg: 12m 33s | Max: 12m 33s
    🟥 cudacxx
      🟥 nvcc12.5           Pass:   0%/1   | Total: 12m 33s | Avg: 12m 33s | Max: 12m 33s
    🟥 cudacxx_family
      🟥 nvcc               Pass:   0%/1   | Total: 12m 33s | Avg: 12m 33s | Max: 12m 33s
    🟥 cxx
      🟥 GCC13              Pass:   0%/1   | Total: 12m 33s | Avg: 12m 33s | Max: 12m 33s
    🟥 cxx_family
      🟥 GCC                Pass:   0%/1   | Total: 12m 33s | Avg: 12m 33s | Max: 12m 33s
    🟥 gpu
      🟥 v100               Pass:   0%/1   | Total: 12m 33s | Avg: 12m 33s | Max: 12m 33s
    🟥 jobs
      🟥 Test               Pass:   0%/1   | Total: 12m 33s | Avg: 12m 33s | Max: 12m 33s
    

@wmaxey requested a review from gevtushenko on October 4, 2024 06:11
github-actions bot (Contributor) commented Oct 4, 2024

🟩 CI finished in 15m 01s: Pass: 100%/1 | Total: 15m 01s | Avg: 15m 01s | Max: 15m 01s
  • 🟩 pycuda: Pass: 100%/1 | Total: 15m 01s | Avg: 15m 01s | Max: 15m 01s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 15m 01s | Avg: 15m 01s | Max: 15m 01s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 15m 01s | Avg: 15m 01s | Max: 15m 01s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 15m 01s | Avg: 15m 01s | Max: 15m 01s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 15m 01s | Avg: 15m 01s | Max: 15m 01s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 15m 01s | Avg: 15m 01s | Max: 15m 01s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 15m 01s | Avg: 15m 01s | Max: 15m 01s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 15m 01s | Avg: 15m 01s | Max: 15m 01s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 15m 01s | Avg: 15m 01s | Max: 15m 01s
    

@wmaxey changed the title from "[Draft] Add ForEachN from CUB to cccl/c." to "Add ForEachN from CUB to cccl/c." on Oct 4, 2024
@wmaxey marked this pull request as ready for review on October 4, 2024 23:11
@wmaxey requested review from a team as code owners on October 4, 2024 23:11
@wmaxey requested review from alliepiper and elstehle on October 4, 2024 23:11
github-actions bot (Contributor) commented Oct 4, 2024

🟩 CI finished in 18m 40s: Pass: 100%/1 | Total: 18m 40s | Avg: 18m 40s | Max: 18m 40s
  • 🟩 pycuda: Pass: 100%/1 | Total: 18m 40s | Avg: 18m 40s | Max: 18m 40s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 18m 40s | Avg: 18m 40s | Max: 18m 40s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 18m 40s | Avg: 18m 40s | Max: 18m 40s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 18m 40s | Avg: 18m 40s | Max: 18m 40s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 18m 40s | Avg: 18m 40s | Max: 18m 40s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 18m 40s | Avg: 18m 40s | Max: 18m 40s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 18m 40s | Avg: 18m 40s | Max: 18m 40s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 18m 40s | Avg: 18m 40s | Max: 18m 40s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 18m 40s | Avg: 18m 40s | Max: 18m 40s
    

@wmaxey force-pushed the fea/cub_device_for_c branch from 7e267f0 to ed5dfca on October 7, 2024 21:32
NVIDIA deleted a comment from copy-pr-bot bot on Oct 7, 2024
github-actions bot (Contributor) commented Oct 7, 2024

🟩 CI finished in 14m 29s: Pass: 100%/1 | Total: 14m 29s | Avg: 14m 29s | Max: 14m 29s
  • 🟩 pycuda: Pass: 100%/1 | Total: 14m 29s | Avg: 14m 29s | Max: 14m 29s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 14m 29s | Avg: 14m 29s | Max: 14m 29s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 14m 29s | Avg: 14m 29s | Max: 14m 29s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 14m 29s | Avg: 14m 29s | Max: 14m 29s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 14m 29s | Avg: 14m 29s | Max: 14m 29s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 14m 29s | Avg: 14m 29s | Max: 14m 29s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 14m 29s | Avg: 14m 29s | Max: 14m 29s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 14m 29s | Avg: 14m 29s | Max: 14m 29s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 14m 29s | Avg: 14m 29s | Max: 14m 29s
    

c/test/test_for.cpp (outdated, resolved)
github-actions bot (Contributor) commented Oct 8, 2024

🟩 CI finished in 13m 11s: Pass: 100%/1 | Total: 13m 11s | Avg: 13m 11s | Max: 13m 11s
  • 🟩 pycuda: Pass: 100%/1 | Total: 13m 11s | Avg: 13m 11s | Max: 13m 11s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 13m 11s | Avg: 13m 11s | Max: 13m 11s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 13m 11s | Avg: 13m 11s | Max: 13m 11s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 13m 11s | Avg: 13m 11s | Max: 13m 11s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 13m 11s | Avg: 13m 11s | Max: 13m 11s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 13m 11s | Avg: 13m 11s | Max: 13m 11s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 13m 11s | Avg: 13m 11s | Max: 13m 11s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 13m 11s | Avg: 13m 11s | Max: 13m 11s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 13m 11s | Avg: 13m 11s | Max: 13m 11s
    

c/src/for/for_op_helper.cpp (outdated, resolved)
c/src/for/for_op_helper.cpp (outdated, resolved)
c/src/for.cu (outdated, resolved)
c/src/for.cu (resolved)
github-actions bot (Contributor) commented Oct 8, 2024

🟩 CI finished in 14m 44s: Pass: 100%/1 | Total: 14m 44s | Avg: 14m 44s | Max: 14m 44s
  • 🟩 pycuda: Pass: 100%/1 | Total: 14m 44s | Avg: 14m 44s | Max: 14m 44s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 14m 44s | Avg: 14m 44s | Max: 14m 44s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 14m 44s | Avg: 14m 44s | Max: 14m 44s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 14m 44s | Avg: 14m 44s | Max: 14m 44s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 14m 44s | Avg: 14m 44s | Max: 14m 44s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 14m 44s | Avg: 14m 44s | Max: 14m 44s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 14m 44s | Avg: 14m 44s | Max: 14m 44s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 14m 44s | Avg: 14m 44s | Max: 14m 44s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 14m 44s | Avg: 14m 44s | Max: 14m 44s
    

@wmaxey merged commit ee5dd3e into NVIDIA:main on Oct 8, 2024
17 checks passed
rwgk added a commit to rwgk/cccl that referenced this pull request Oct 8, 2024
@gevtushenko linked an issue on Oct 10, 2024 that may be closed by this pull request
Successfully merging this pull request may close these issues.

Implement C API for device for