Skip to content

Commit 9b2f7fc

Browse files
committed
run.py script proper directory
With our devcontainers and presets dev environment it's highly likely that `cccl/` is not just a step above.
1 parent 43416ee commit 9b2f7fc

File tree

1 file changed

+10
-3
lines changed

1 file changed

+10
-3
lines changed

docs/cub/benchmarking.rst

+10-3
Original file line numberDiff line numberDiff line change
@@ -174,6 +174,13 @@ Therefore, it's critical that you run it in a clean build directory without any
174174
Running cmake is enough. Alternatively, you can also clean your build directory with.
175175
Furthermore, the tuning scripts require some additional python dependencies, which you have to install.
176176

177+
<<<<<<< Updated upstream
178+
=======
179+
To select the appropriate CUDA GPU, first identify the GPU ID by running `nvidia-smi`, then set the
180+
desired GPU using `export CUDA_VISIBLE_DEVICES=x`, where `x` is the ID of the GPU you want to use (e.g., `1`).
181+
This ensures your application uses only the specified GPU.
182+
183+
>>>>>>> Stashed changes
177184
.. code-block:: bash
178185
179186
ninja clean
@@ -183,7 +190,7 @@ We can then run the full benchmark suite from the build directory with:
183190

184191
.. code-block:: bash
185192
186-
../benchmarks/scripts/run.py
193+
<root_dir_to_cccl>/cccl/benchmarks/scripts/run.py
187194
188195
You can expect the output to look like this:
189196

@@ -205,7 +212,7 @@ It's also possible to benchmark a subset of algorithms and workloads:
205212

206213
.. code-block:: bash
207214
208-
../benchmarks/scripts/run.py -R '.*scan.exclusive.sum.*' -a 'Elements{io}[pow2]=[24,28]' -a 'T{ct}=I32'
215+
<root_dir_to_cccl>/cccl/benchmarks/scripts/run.py -R '.*scan.exclusive.sum.*' -a 'Elements{io}[pow2]=[24,28]' -a 'T{ct}=I32'
209216
&&&& RUNNING bench
210217
ctk: 12.6.77
211218
cccl: v2.7.0-rc0-265-g32aa6aa5a
@@ -229,7 +236,7 @@ The resulting database contains all samples, which can be extracted into JSON fi
229236

230237
.. code-block:: bash
231238
232-
../benchmarks/scripts/analyze.py -o ./cccl_meta_bench.db
239+
<root_dir_to_cccl>/cccl/benchmarks/scripts/analyze.py -o ./cccl_meta_bench.db
233240
234241
This will create a JSON file for each benchmark variant next to the database.
235242
For example:

0 commit comments

Comments
 (0)