[BUG]: #include <thrust/device_vector.h> causes compile error #1783

Olli1080 · 2024-05-28T00:51:53Z

Is this a duplicate?

I confirmed there appear to be no duplicate issues for this bug and that I agree to the Code of Conduct

Type of Bug

Compile-time Error

Component

Thrust

Describe the bug

Including thrust/device_vector.h anywhere visible to a file compiled as plain C++ (host-only) fails, because the header pulls in device-only code.
This issue is present in the Thrust shipped with CUDA 12.5 but did not occur with CUDA 12.3.

How to Reproduce

Add #include <thrust/device_vector.h> to any file compiled by the host C++ compiler.
Compile
C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.5\include\cub\util_ptx.cuh(90,29): error C2059: Syntaxerror: ":" (20 more lines omitted)

Expected behavior

Add #include <thrust/device_vector.h> to any file compiled by the host C++ compiler.
Compile
Compilation is successful

Reproduction link

No response

Operating System

Windows 11 Pro 10.0.22621

nvidia-smi output

NVCC version

nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2024 NVIDIA Corporation
Built on Wed_Apr_17_19:36:51_Pacific_Daylight_Time_2024
Cuda compilation tools, release 12.5, V12.5.40
Build cuda_12.5.r12.5/compiler.34177558_0

jrhemstad · 2024-05-28T18:23:06Z

Thanks for raising this @Olli1080.

Unfortunately, this is expected. See #1374

TL;DR: It was never intended that thrust/device_vector.h (or any Thrust header for that matter) would work with a host-only compiler when the device system is configured to be CUDA.

Out of curiosity, in your use case, what did you expect to be able to do with the thrust/device_vector.h header in a host-only translation unit? Simply include the header but not use anything from it?

Olli1080 · 2024-05-28T21:14:22Z

Thank you for the clarification.
I've been trying very hard to find such guarantees.

My use case is to have thrust::device_vector as a private member of a class that can be consumed by C++ translation units (e.g. outside of a library), while the definition of the class is compiled using CUDA.
That way I can cache certain data (e.g. from point clouds) in device_vectors and read out results either to other CUDA units or, through methods, as host_vectors.
This also makes it much easier to handle storage and to debug.

TL;DR: The classes are consumed in host-only code, while their functionality lives inside the CUDA translation unit.

Olli1080 · 2024-05-29T00:03:44Z

For example, here is one such class I'm compiling with CUDA but consuming in host-only code.
https://github.com/Olli1080/gpu-voxels/blob/ar_integration/packages/gpu_voxels/src/gpu_voxels/helpers/MetaPointCloud.h
https://github.com/Olli1080/gpu-voxels/blob/ar_integration/packages/gpu_voxels/src/gpu_voxels/helpers/MetaPointCloud.cu

bernhardmgruber · 2024-05-29T09:27:02Z

@Olli1080 you can apply the PIMPL idiom to work around this. Something along these lines:

Header:

class MetaPointCloud {
  struct Impl;
  std::unique_ptr<Impl> pimpl;
};

Source:

struct MetaPointCloud::Impl {
  thrust::device_vector<...> ...;
};

Make sure all constructors and destructors of MetaPointCloud are also defined in the source file. In C++26 you can switch std::unique_ptr for std::indirect. The corresponding paper has a section on PIMPL.
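To make the outline above concrete, here is a minimal, self-contained sketch of the same PIMPL layout. It is an illustration under assumptions, not the actual MetaPointCloud code: the class name PointCache, its methods, and the file names in the comments are hypothetical. So that the sketch builds with a plain C++ compiler, std::vector stands in for thrust::device_vector<float> inside Impl; in a real project that member and the Thrust include would live only in the .cu file.

```cpp
// ---- point_cache.h (hypothetical): safe to include from host-only code ----
#include <cassert>
#include <cstddef>
#include <memory>
#include <utility>
#include <vector>

class PointCache {  // hypothetical class, not part of Thrust or gpu-voxels
public:
  explicit PointCache(std::vector<float> points);
  ~PointCache();  // declared here, defined where Impl is a complete type
  PointCache(PointCache&&) noexcept;
  PointCache& operator=(PointCache&&) noexcept;

  std::size_t size() const;
  std::vector<float> to_host() const;  // copies the cached data back out

private:
  struct Impl;                   // incomplete in the header: no Thrust needed
  std::unique_ptr<Impl> pimpl_;
};

// ---- point_cache.cu (hypothetical): compiled by nvcc in a real project ----
// Assumption: std::vector is a stand-in for thrust::device_vector<float>,
// used only so this sketch compiles without the CUDA toolkit.
struct PointCache::Impl {
  std::vector<float> data;
};

PointCache::PointCache(std::vector<float> points)
    : pimpl_(std::make_unique<Impl>(Impl{std::move(points)})) {}

// Special members are defined here because std::unique_ptr<Impl> needs the
// complete Impl type to destroy the object it owns.
PointCache::~PointCache() = default;
PointCache::PointCache(PointCache&&) noexcept = default;
PointCache& PointCache::operator=(PointCache&&) noexcept = default;

std::size_t PointCache::size() const {
  assert(pimpl_ != nullptr);  // a moved-from object no longer owns an Impl
  return pimpl_->data.size();
}

std::vector<float> PointCache::to_host() const {
  assert(pimpl_ != nullptr);
  return pimpl_->data;  // with Thrust this would be a device-to-host copy
}
```

Host-only code then includes only the header part and calls, for example, PointCache cache({1.0f, 2.0f, 3.0f}); followed by cache.to_host(). Returning std::vector from the public interface is a deliberate choice in this sketch: it keeps every Thrust type out of the header that host-only translation units see.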

Olli1080 · 2024-05-30T19:55:34Z

I did some digging in the code (using /showIncludes) and found that the errors are caused by the changes in <cub/util_device.cuh>, namely the added includes of <cub/util_ptx.cuh> and <cuda/discard_memory>. On the main branch, both of these includes are gone again. I'll test whether including host_vector or device_vector works again asap.

For anyone interested, the relevant include tree is:

CUDA\thrust/host_vector.h
CUDA\thrust/detail/vector_base.h
CUDA\thrust/detail/contiguous_storage.h
CUDA\thrust/detail/contiguous_storage.inl
CUDA\thrust/detail/allocator/copy_construct_range.h
CUDA\thrust/detail/allocator/copy_construct_range.inl
CUDA\thrust/detail/copy.h
CUDA\thrust/detail/copy.inl
CUDA\thrust/system/detail/adl/copy.h
CUDA\thrust/system/cuda/detail/copy.h
CUDA\thrust/system/cuda/detail/internal/copy_cross_system.h
CUDA\thrust/system/cuda/detail/util.h
CUDA\cub/util_device.cuh
CUDA\cub/util_ptx.cuh

Olli1080 · 2024-05-31T10:31:47Z

I've tested CUDA 12.5 with the CCCL headers replaced by those from the main branch, and the issue no longer occurs.
The commit as of this comment is bf1a71a318a3dbc4ee522a42e2b82b6d2a5410e4

Olli1080 added the bug Something isn't working right. label May 28, 2024

github-project-automation bot added this to CCCL May 28, 2024

github-project-automation bot moved this to Todo in CCCL May 28, 2024

Olli1080 closed this as completed May 31, 2024

github-project-automation bot moved this from Todo to Done in CCCL May 31, 2024
