[FEA]: Introduce cache-modified input iterator into cuda.parallel #2536

Closed
1 task done
gevtushenko opened this issue Oct 10, 2024 · 0 comments · Fixed by #2788
Labels
2.8.0 target for 2.8.0 release feature request New feature or request.

Comments

@gevtushenko
Collaborator

Is this a duplicate?

Area

Not sure

Is your feature request related to a problem? Please describe.

Usage of cuda.parallel in applications like llm.c (example) is currently blocked by the lack of support for cache-modified iterators.

Describe the solution you'd like

We need a functional equivalent of the cache-modified iterator in cuda.parallel.itertools. The design might follow the API that @fbusato proposed in #2487. For instance:

d_input = cp.array([8, 6, 7, 5, 3, 0, 9], dtype=dtype)
d_streaming_input = cudax.itertools.accessor(d_input, "eviction_policy::no_allocation")
cudax.reduce(d_streaming_input)

This should lead to streaming loads of d_input (the ld.global.cs instruction in PTX).
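To make the intent concrete, here is a minimal host-only sketch of the idea: a wrapper that pairs the input with a load policy, which a backend could later translate into a PTX load instruction. Everything here is hypothetical and for illustration only (the `Accessor` class, the `POLICY_TO_PTX` table, and `reduce_sum` are not part of cuda.parallel or cudax); the only mapping taken from this issue is that "eviction_policy::no_allocation" should produce ld.global.cs. It runs on plain Python lists and issues no GPU loads.

```python
from dataclasses import dataclass
from typing import Sequence

# Hypothetical table: policy string -> PTX load instruction a backend might emit.
# Only the "no_allocation" -> ld.global.cs entry comes from the issue text.
POLICY_TO_PTX = {
    "default": "ld.global",
    "eviction_policy::no_allocation": "ld.global.cs",
}


@dataclass(frozen=True)
class Accessor:
    """Hypothetical wrapper that tags an input sequence with a load policy."""

    data: Sequence
    policy: str = "default"

    def load_instruction(self) -> str:
        # Reject unknown policies instead of silently using default loads.
        if self.policy not in POLICY_TO_PTX:
            raise ValueError(f"unknown load policy: {self.policy!r}")
        return POLICY_TO_PTX[self.policy]

    def __iter__(self):
        # On the host, iteration just walks the wrapped data.
        return iter(self.data)


def reduce_sum(values) -> int:
    """Stand-in for a device reduction; sums whatever iterable it is given."""
    return sum(values)


d_input = [8, 6, 7, 5, 3, 0, 9]
d_streaming_input = Accessor(d_input, "eviction_policy::no_allocation")

total = reduce_sum(d_streaming_input)
instruction = d_streaming_input.load_instruction()
print(total, instruction)
```

The point of the design is that the algorithm (`reduce_sum` here) stays unchanged; only the iterator carries the caching hint, so the same reduction can be used with or without streaming loads.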

Describe alternatives you've considered

No response

Additional context

No response

@gevtushenko gevtushenko added the feature request New feature or request. label Oct 10, 2024
@github-project-automation github-project-automation bot moved this to Todo in CCCL Oct 10, 2024
@jollylili jollylili added the 2.8.0 target for 2.8.0 release label Nov 15, 2024
@jollylili jollylili moved this from Todo to In Progress in CCCL Nov 21, 2024
@github-project-automation github-project-automation bot moved this from In Progress to Done in CCCL Dec 6, 2024
3 participants