-
Notifications
You must be signed in to change notification settings - Fork 91
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RTX4090 and RTX4060Ti no P2P #28
Comments
Seen a few people having issues with mixed cards. One guy has a6000 and 4 4090 and p2p didn't work unrill he disabled the A6000 |
Thank you for a reply. It's a pity that p2p doesn't work on mixed cards. |
"Manual says that both 4090 and 4060Ti should be supported" which manual says that? You may mess up with the p2p mod and the original open-gpu-kernel-modules. In some 3090s and all 4090s, NVIDIA added large BAR support."_ |
For example, here. In the end there is a table of compatible GPUs |
@Ivan04012025 I think he I meaning that list of gpu that are compatible are the ones compatible with the original nvidia open driver. For p2p he mentioned you need a gpu with large BAR support. Perhaps that 4060 gpu does not support that. |
Then is there a way to find out whether 4060 has large BAR or not? |
@Ivan04012025 says on spec page for 4060 Resizable BAR |
In system info or cpu x I do not see any info about BAR support. dmesg gives a lot of output and i don't know which string is about BAR support on 4060. I copied this output, maybe you can help with that? NVIDIA settings software shows: "Resizable BAR: Yes" on both 4090 and 4060 GPUs |
@Ivan04012025 3090s and all 4090s, NVIDIA added large BAR support. tiny@tiny14:~$ lspci -s 01:00.0 -v We do this by bypassing the HAL and calling a bunch of the GH100 methods directly. Methods like kbusEnableStaticBar1Mapping_GH100, which maps the entire VRAM into BAR1. This mostly just works, but we had to disable the use of that region in the MapAperture function for some reason. Shouldn't matter. [ 3491.654009] NVRM: kbusEnableStaticBar1Mapping_GH100: Static bar1 mapped offset 0x0 size 0x5e9200000 [ 3742.840689] NVRM: kbusCreateP2PMappingForBar1P2P_GH100: added PCIe BAR1 P2P mapping between GPU2 and GPU3 cudaMemcpyPeer / cudaMemcpy between GPU0 and GPU1: 24.21GB/s |
Also why is one of you gpu only 8x pcie. |
NVIDIA Open GPU Kernel Modules Version
550.90.07
Please confirm this issue does not happen with the proprietary driver (of the same version). This issue tracker is only for bugs specific to the open kernel driver.
Operating System and Version
Linux Mint 21.3
Kernel Release
6.8.0-50-generic NVIDIA#51~22.04.1-Ubuntu
Please confirm you are running a stable release kernel (e.g. not a -rc). We do not accept bug reports for unreleased kernels.
Hardware: GPU
RTX4090 and RTX4060Ti
Describe the bug
Is it possible to enable P2P between 4090 and 4060Ti cards?
My motherboard is Asus Pro WS X299 SAGE II. I turned on large BAR and disabled IOMMU in bios.
Next I installed open-gpu-kernel-modules-550.90.07-p2p using install.sh script and driver: NVIDIA-Linux-x86_64-550.90.07.run --no-kernel-modules
nvidia-smi works fine, but p2pBandwidthLatencyTest gives following output:
[P2P (Peer-to-Peer) GPU Bandwidth Latency Test]
Device: 0, NVIDIA GeForce RTX 4090, pciBusID: 1a, pciDeviceID: 0, pciDomainID:0
Device: 1, NVIDIA GeForce RTX 4060 Ti, pciBusID: 68, pciDeviceID: 0, pciDomainID:0
Device=0 CANNOT Access Peer Device=1
Device=1 CANNOT Access Peer Device=0
***NOTE: In case a device doesn't have P2P access to other one, it falls back to normal memcopy procedure.
So you can see lesser Bandwidth (GB/s) and unstable Latency (us) in those cases.
P2P Connectivity Matrix
D\D 0 1
0 1 0
1 0 1
Unidirectional P2P=Disabled Bandwidth Matrix (GB/s)
D\D 0 1
0 911.08 6.27
1 6.26 244.87
Unidirectional P2P=Enabled Bandwidth (P2P Writes) Matrix (GB/s)
D\D 0 1
0 913.21 6.27
1 6.25 245.25
Bidirectional P2P=Disabled Bandwidth Matrix (GB/s)
D\D 0 1
0 915.33 8.49
1 8.63 244.46
Bidirectional P2P=Enabled Bandwidth Matrix (GB/s)
D\D 0 1
0 918.31 8.43
1 8.63 244.56
P2P=Disabled Latency Matrix (us)
GPU 0 1
0 1.44 20.43
1 20.54 1.20
CPU 0 1
0 2.25 6.10
1 5.98 2.24
P2P=Enabled Latency (P2P Writes) Matrix (us)
GPU 0 1
0 1.45 20.61
1 11.36 1.20
CPU 0 1
0 2.22 5.93
1 6.19 2.23
NOTE: The CUDA Samples are not meant for performance measurements. Results may vary when GPU Boost is enabled.
This result indicate that P2P is not working. Manual says that both 4090 and 4060Ti should be supported. Is there anythig that can be done to enable P2P?
To Reproduce
I followed the installation instructions for the kernel version 550
Bug Incidence
Always
nvidia-bug-report.log.gz
nvidia-bug-report.log.gz
More Info
No response
The text was updated successfully, but these errors were encountered: