-
Notifications
You must be signed in to change notification settings - Fork 195
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[STF] Implement kernel chains in the graph backend without child grap…
…hs (#3707) * Reimplement chain of CUDA kernels in the CUDA graph backend to avoid child graphs * simplify code * Revert compilation environment changes that should not be committed * remove unused var * minor code improvements * Update cudax/include/cuda/experimental/__stf/graph/graph_task.cuh Cleaner code Co-authored-by: Bernhard Manfred Gruber <[email protected]> --------- Co-authored-by: Bernhard Manfred Gruber <[email protected]>
- Loading branch information
1 parent
d19c9a2
commit f745c97
Showing
2 changed files
with
35 additions
and
17 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters