
fix: derive CUDA_CORE_BUILD_MAJOR from headers instead of bindings version #1395


Open
Andy-Jost wants to merge 4 commits into NVIDIA:main from Andy-Jost:build-major-from-headers

Conversation

@Andy-Jost (Contributor) commented Dec 17, 2025 (edited):

Summary

  • Fixes build failures when cuda-bindings reports major version 13 but CUDA headers are version 12, causing missing enum errors for CU_MEM_LOCATION_TYPE_NONE and CU_MEM_ALLOCATION_TYPE_MANAGED
  • The new _get_cuda_core_build_major_version() function prioritizes: env var override → CUDA headers → nvidia-smi → cuda-bindings fallback (a minimal sketch follows this list)
  • Adds unit tests for the version detection logic
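
A minimal sketch of that priority chain, assuming CUDA_HOME or CUDA_PATH points at a toolkit install; the structure and parsing details here are illustrative, not the exact code in this PR:

```python
# Illustrative sketch only; the real implementation lives in cuda-core's
# build hooks. CUDA_VERSION in cuda.h encodes 1000 * major + 10 * minor.
import os
import re
import subprocess
from importlib.metadata import version


def _get_cuda_core_build_major_version():
    # 1. Explicit CUDA_CORE_BUILD_MAJOR env var (CI override)
    override = os.environ.get("CUDA_CORE_BUILD_MAJOR")
    if override:
        return int(override)

    # 2. CUDA_VERSION from cuda.h (matches the compile target)
    for var in ("CUDA_HOME", "CUDA_PATH"):
        root = os.environ.get(var)
        if not root:
            continue
        try:
            with open(os.path.join(root, "include", "cuda.h")) as f:
                match = re.search(r"#define\s+CUDA_VERSION\s+(\d+)", f.read())
        except OSError:
            continue
        if match:
            return int(match.group(1)) // 1000  # e.g. 12090 -> 12

    # 3. nvidia-smi driver-reported CUDA version (fallback); the default
    # banner includes a "CUDA Version: X.Y" field we can parse.
    try:
        banner = subprocess.run(
            ["nvidia-smi"], capture_output=True, text=True, check=True
        ).stdout
        match = re.search(r"CUDA Version:\s*(\d+)", banner)
        if match:
            return int(match.group(1))
    except (OSError, subprocess.CalledProcessError):
        pass

    # 4. cuda-bindings major version (last resort)
    return int(version("cuda-bindings").split(".")[0])
```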

Test plan

  • Unit tests pass: pytest tests/test_build_hooks.py -v --noconftest
  • CI tests pass
  • Manual verification: build succeeds with mismatched cuda-bindings 13.x and CUDA 12 headers

Andy-Jost added the bug (Something isn't working), P0 (High priority - Must do!), and cuda.core (Everything related to the cuda.core module) labels Dec 17, 2025
Andy-Jost self-assigned this Dec 17, 2025
@copy-pr-bot (Contributor) commented:

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@Andy-Jost (Contributor, Author) commented:

/ok to test 0957f91


Andy-Jost added the enhancement (Any code-related improvements) and P1 (Medium priority - Should do) labels and removed the bug (Something isn't working) and P0 (High priority - Must do!) labels Dec 17, 2025
Andy-Jost added this to the cuda.core beta 11 milestone Dec 17, 2025
@github-actions commented:

…rsion

Fixes build failures when cuda-bindings reports major version 13 but CUDA headers are version 12, causing missing enum errors for CU_MEM_LOCATION_TYPE_NONE and CU_MEM_ALLOCATION_TYPE_MANAGED.

The new _get_cuda_core_build_major_version() function prioritizes:

  1. Explicit CUDA_CORE_BUILD_MAJOR env var (CI override)
  2. CUDA_VERSION from cuda.h headers (matches compile target)
  3. nvidia-smi driver-reported version (fallback)
  4. cuda-bindings major version (last resort)

Adds tests for the version detection logic in test_build_hooks.py.
@Andy-Jost (Contributor, Author) commented:

/ok to test ff5644a


@kkraus14 (Collaborator) commented:

  • Fixes build failures when cuda-bindings reports major version 13 but CUDA headers are version 12, causing missing enum errors for CU_MEM_LOCATION_TYPE_NONE and CU_MEM_ALLOCATION_TYPE_MANAGED

Is this not a broken environment? cuda-bindings would presumably end up calling into v12.x DSOs, which have a different ABI than v13.x? What situation are we looking to support here?

@Andy-Jost (Contributor, Author) commented:

  • Fixes build failures when cuda-bindings reports major version 13 but CUDA headers are version 12, causing missing enum errors for CU_MEM_LOCATION_TYPE_NONE and CU_MEM_ALLOCATION_TYPE_MANAGED

Is this not a broken environment? cuda-bindings would presumably end up calling into v12.x DSOs, which have a different ABI than v13.x? What situation are we looking to support here?

When creating an environment with conda create -n test cuda-version=12 and then running pip install cuda-bindings, I end up with cuda-bindings 13.x:

% conda list cuda
# packages in environment at /home/scratch.ajost_sw/miniforge3/envs/test:
#
# Name                    Version                   Build  Channel
cuda-bindings             13.1.1                   pypi_0    pypi
cuda-version              12.9                 h4f385c5_3    conda-forge

(As an aside, if I specify both packages up front with conda create -n test cuda-version=12 cuda-bindings, I get cuda-bindings 12.x instead. I wouldn't have expected a difference between installing it during or after environment creation, but that's what happens.)

This setup shouldn't inherently be a problem. Users generally expect that newer releases (like cuda-bindings 13.x) work with older CUDA toolkits due to backward compatibility guarantees. In practice, cuda-bindings should detect and adapt to the underlying CUDA 12 APIs.

Anecdotally, this configuration has worked fine for me for months with no runtime instability, though it may not be explicitly supported. However, a recent change broke this workflow, requiring either cuda-bindings 12.x or setting CUDA_CORE_BUILD_MAJOR=12 manually when building cuda-core.

Because cuda-core discovers cuda.h relative to CUDA_HOME or CUDA_PATH, it doesn't make sense to tie CUDA_CORE_BUILD_MAJOR to the cuda-bindings version. It's more consistent to derive it from the version indicated by the headers.

So the case we want to support is:

  • The user has an older CUDA toolkit (e.g. 12.x).
  • The user installs the latest cuda-bindings and expects it to work due to backward compatibility.

The proposed fix ensures cuda-core builds correctly in this situation by decoupling its build version logic from the installed cuda-bindings.

@kkraus14 (Collaborator) commented:

When creating an environment with conda create -n test cuda-version=12 and then running pip install cuda-bindings, I end up with cuda-bindings 13.x:

% conda list cuda
# packages in environment at /home/scratch.ajost_sw/miniforge3/envs/test:
#
# Name                    Version                   Build  Channel
cuda-bindings             13.1.1                   pypi_0    pypi
cuda-version              12.9                 h4f385c5_3    conda-forge

(As an aside, if I specify both packages up front with conda create -n test cuda-version=12 cuda-bindings, I get cuda-bindings 12.x instead. I wouldn't have expected a difference between installing it during or after environment creation, but that's what happens.)

Unfortunately, the Python packaging ecosystem is a mess, but this is expected. Conda packages and pip packages are two entirely separate things that aren't necessarily equivalent or compatible with each other. In our case, conda packages can be used for packaging non-Python code, i.e. the CUDA Toolkit native libraries. The cuda-version conda package has a constraint on the __cuda virtual conda package, which detects the version of the toolkit that is compatible with the driver running on the system. Pip unfortunately doesn't have these capabilities (we are trying to change that with https://wheelnext.dev/), so there's no way to control the version of cuda-bindings resolved from a pip install command based on the driver version.

This setup shouldn't inherently be a problem. Users generally expect that newer releases (like cuda-bindings 13.x) work with older CUDA toolkits due to backward compatibility guarantees. In practice, cuda-bindings should detect and adapt to the underlying CUDA 12 APIs.

How do we handle API breaking changes across major versions like 12.x and 13.x? The underlying CTK libraries only guarantee their API and ABI stability within a major version. If any API has a signature change from 12.x --> 13.x, which flavor of the API should we have for Python? Should we dynamically adjust our Python API at runtime based on the detected driver version available on the system? What if someone wants to specifically target the 12.x API and run on a 13.x+ driver? There are a lot of open questions here; the supported path for now is that the cuda-bindings package version follows the API and ABI of the same major version of the CTK.

Anecdotally, this configuration has worked fine for me for months with no runtime instability, though it may not be explicitly supported. However, a recent change broke this workflow, requiring either cuda-bindings 12.x or setting CUDA_CORE_BUILD_MAJOR=12 manually when building cuda-core.

Because cuda-core discovers cuda.h relative to CUDA_HOME or CUDA_PATH, it doesn't make sense to tie CUDA_CORE_BUILD_MAJOR to the cuda-bindings version. It's more consistent to derive it from the version indicated by the headers.

The problem with this is that cuda-core uses the cuda-bindings Cython implementation within it. I.e., in your environment as described above, I imagine this would cause an issue: https://github.com/NVIDIA/cuda-python/blob/main/cuda_core/cuda/core/experimental/_device.pyx#L1097-L1100, since it's trying to use an externed cuDeviceGetUuid_v2 API from cuda.h, which exists in CUDA 12.9 but doesn't exist as of CUDA 13.0 in either cuda.h or in cydriver.pxd.
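
To make the failure mode concrete, here is a hedged, Python-level illustration (not code from cuda-core; the real mismatch would surface when compiling the Cython extern) of probing whether the 12.x-era symbol is exposed by the installed bindings:

```python
# Hedged illustration only: cuda-core externs this symbol at the Cython level
# via cydriver.pxd, so in practice the mismatch surfaces at build time.
from cuda.bindings import driver

# Per the discussion above, cuDeviceGetUuid_v2 is part of the CUDA 12.x API
# surface but is gone as of CUDA 13.0, so 13.x bindings presumably omit it.
if hasattr(driver, "cuDeviceGetUuid_v2"):
    print("12.x-style driver API surface is available")
else:
    print("13.x bindings: cuDeviceGetUuid_v2 is absent; a cuda-core build "
          "targeting the 12.x headers would fail to find it")
```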


So the case we want to support is:

  • The user has an older CUDA toolkit (e.g. 12.x).
  • The user installs the latest cuda-bindings and expects it to work due to backward compatibility.

The proposed fix ensures cuda-core builds correctly in this situation by decoupling its build version logic from the installed cuda-bindings.

cuda-core only uses cuda.h indirectly via the cuda-bindings Cython APIs, which extern APIs from cuda.h and other CUDA headers. But again, as described above, we currently need to match the cuda-bindings and cuda.h (and other CUDA headers) major versions in order to match the APIs.

The backward compatibility guarantees that CUDA makes and we follow are the following:

  • For the driver library, API backward and forward compatibility within a major version
  • For the driver library, ABI backward compatibility forever and forward compatibility within a major version
    • We don't currently support ABI backward compatibility across major versions in cuda.bindings driver modules, but hope to in the future
  • For toolkit libraries, API backward and forward compatibility within a major version
  • For the toolkit libraries, ABI backward and forward compatibility within a major version

@Andy-Jost (Contributor, Author) commented Dec 17, 2025 (edited):

@kkraus14 Thanks for the additional details. In my view, deriving CUDA_CORE_BUILD_MAJOR from the headers that cuda-core actually compiles against is a strict improvement, since it allows previously failing environments to build without weakening the official guidance about matching major versions.

I'd like to suggest the following:

  1. We commit this change, because it turns a hard build failure into a successful build that likely produces a working configuration in an environment users can realistically end up in.
  2. As a follow-on change, we add an import-time check that flags unsupported version combinations and issues an appropriate warning (a sketch follows below).

WDYT?

Edit: For (2), please see #1412.
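
A hedged sketch of what the import-time check in (2) might look like; the function name and warning text are illustrative, and #1412 contains the actual proposal:

```python
# Illustrative sketch only. Compares the installed cuda-bindings major
# version against the CUDA major version reported by the driver.
import warnings
from importlib.metadata import version

from cuda.bindings import driver


def _warn_on_version_mismatch():
    bindings_major = int(version("cuda-bindings").split(".")[0])
    err, driver_version = driver.cuDriverGetVersion()
    if err != driver.CUresult.CUDA_SUCCESS:
        return  # cannot determine the driver version; skip the check
    driver_major = driver_version // 1000  # e.g. 12090 -> 12
    if bindings_major != driver_major:
        warnings.warn(
            f"cuda-bindings {bindings_major}.x with a CUDA {driver_major}.x "
            "driver may be an unsupported combination",
            RuntimeWarning,
        )
```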


Reviewers

Awaiting requested review from @leofang, @rparolin, and @rwgk.

At least 1 approving review is required to merge this pull request.

Assignees

@Andy-Jost

Labels

cuda.core (Everything related to the cuda.core module), enhancement (Any code-related improvements), P1 (Medium priority - Should do)

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants

@Andy-Jost, @kkraus14
