
[gcc15] Update CUDA to version 13.3.0 #10616

Closed
fwyzard wants to merge 6 commits into cms-sw:IB/CMSSW_20_1_X/g15 from fwyzard:IB/CMSSW_20_1_X/g15__CUDA_13_3_0

fwyzard commented Jun 9, 2026

enable gpu

cmsbuild commented Jun 9, 2026

A new Pull Request was created by @fwyzard for branch IB/CMSSW_20_1_X/g15.

@akritkbehera, @cmsbuild, @iarspider, @raoatifshad, @smuzaffar can you please review it and eventually sign? Thanks.
@ftenchini, @mandrenguyen, @sextonkennedy you are the release manager for this.
cms-bot commands are listed here

cmsbuild commented Jun 9, 2026

cms-bot internal usage

fwyzard commented Jun 9, 2026

please test with cms-sw/cms-bot#2785

@cmsbuild

Pull request #10616 was updated.

fwyzard force-pushed the IB/CMSSW_20_1_X/g15__CUDA_13_3_0 branch from 391662d to e4d7f4b on June 12, 2026 10:40
@cmsbuild

Pull request #10616 was updated.

fwyzard force-pushed the IB/CMSSW_20_1_X/g15__CUDA_13_3_0 branch from e4d7f4b to cf446a4 on June 12, 2026 17:05
@cmsbuild

Pull request #10616 was updated.

fwyzard commented Jun 12, 2026

please test with cms-sw/cms-bot#2785 using full cmssw for CMSSW_20_1_X/el9_amd64_gcc15

fwyzard changed the title from "[gcc15] Update to CUDA 13" to "[gcc15] Update CUDA to version 13.3.0" on Jun 12, 2026
@cmsbuild

-1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-36c6df/53913/summary.html
COMMIT: cf446a4
CMSSW: CMSSW_20_1_X_2026-06-11-2300/el9_amd64_gcc15
Additional Tests: GPU,AMD_MI300X,AMD_W7900,NVIDIA_H100,NVIDIA_L40S,NVIDIA_T4
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/10616/53913/install.sh to create a dev area with all the needed externals and cmssw changes.

Failed External Build

I found compilation error when building:

from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/BUILD/el9_amd64_gcc15/external/onnxruntime/1.26.0-ca6d07c6c9ce2fd37ca83c876b2abe25/external_onnxruntime_1.26.0-ca6d07c6c9ce2fd37ca83c876b2abe25-1-build/onnxruntime-1.26.0/onnxruntime/core/providers/cuda/nn/conv_transpose.cc:8:
/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/el9_amd64_gcc15/external/gcc/15.3.1-d0029c3359733fc60a20a68217f2ee30/include/c++/15.3.1/ciso646:49:6: warning: #warning "<ciso646> is not a standard header since C++20, use <version> to detect implementation-specific macros" [-Wcpp]
49 | #    warning "<ciso646> is not a standard header since C++20, use <version> to detect implementation-specific macros"
|      ^~~~~~~
ninja: build stopped: subcommand failed.
error: Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.7WOZKa (%build)

RPM build warnings:
Macro expanded in comment on line 488: %{pkginstroot}/${PYTHON3_LIB_SITE_PACKAGES}




fwyzard force-pushed the IB/CMSSW_20_1_X/g15__CUDA_13_3_0 branch from cf446a4 to c91153d on June 13, 2026 17:32
fwyzard added 3 commits June 14, 2026 13:55
Fix invalid C++ syntax in cub/device/device_transform.cuh in CUDA 13.
Backport NVIDIA/cccl#8771 from the main branch.
Drop the Volta (sm 7.0) architecture, that is not supported by CUDA 13.
Major updates:
  - support for DMA-BUF mmap backend with CUDA 13.3
  - support for AVX2, AVX-512, MOVDIR64B, NEON CPU instructions
  - support for Linux kernel 6.15+

The updates, fixes and changes in v2.5.2 can be found at
https://github.com/NVIDIA/gdrcopy/releases/tag/v2.5.2 .

The updates, fixes and changes in v2.6 can be found at
https://github.com/NVIDIA/gdrcopy/releases/tag/v2.6 .
@cmsbuild

-1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-36c6df/53934/summary.html
COMMIT: 4fc3669
CMSSW: CMSSW_20_1_X_2026-06-13-1100/el9_amd64_gcc15
Additional Tests: GPU,AMD_MI300X,AMD_W7900,NVIDIA_H100,NVIDIA_L40S,NVIDIA_T4
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/10616/53934/install.sh to create a dev area with all the needed externals and cmssw changes.

Failed External Build

I found compilation error when building:

cwd: /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/BUILD/el9_amd64_gcc15/external/py3-torch-cuda/2.11.0-7bc16fc5004765e3f118346ea964570e/external_py3-torch-cuda_2.11.0-7bc16fc5004765e3f118346ea964570e-1-build/cmsdist-pip-src/py3-torch-cuda-2.11.0
Building wheel for torch (pyproject.toml): finished with status 'error'
ERROR: Failed building wheel for torch
Failed to build torch
ERROR: Failed to build one or more wheels
error: Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.toCtHi (%build)

RPM build warnings:
Macro expanded in comment on line 609: %{pkginstroot}/bin/*

Macro expanded in comment on line 614: %{pkginstroot}/${PYTHON3_LIB_SITE_PACKAGES}


fwyzard force-pushed the IB/CMSSW_20_1_X/g15__CUDA_13_3_0 branch from 4fc3669 to d9be15e on June 14, 2026 17:56
@cmsbuild

Pull request #10616 was updated.

fwyzard added 3 commits June 14, 2026 20:07
Update CMake to the latest 3.x bugfix version, v3.31.12.
Backport cmake!12162 CUDA: Add support for cuda_std_23 for nvcc 13.3+.
Backport microsoft/onnxruntime#28736
Patch Abseil to workaround NVIDIA bug #6302392
Do not include <ciso646> in c++20/c++23 mode
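
For context on the last commit above: libstdc++ in GCC 15 emits a `#warning` when `<ciso646>` is included in C++20 or later, and suggests `<version>` for detecting implementation-specific macros. A minimal sketch of a guard of this kind follows. It is an illustration under stated assumptions, not the patch applied in this PR; `EXAMPLE_FEATURE_HEADER` and `selected_feature_header` are hypothetical names introduced only for this example.

```cpp
// Hypothetical sketch, not the actual patch from this PR.
// Pick the header used to pull in implementation-specific macros
// based on the language standard in effect.
#if __cplusplus >= 202002L
#  include <version>   // standard since C++20
#  define EXAMPLE_FEATURE_HEADER "version"
#else
#  include <ciso646>   // pre-C++20 approach; removed from the standard in C++20
#  define EXAMPLE_FEATURE_HEADER "ciso646"
#endif

#include <string_view>

// Reports which header the guard above selected for this translation unit.
constexpr std::string_view selected_feature_header() {
  return EXAMPLE_FEATURE_HEADER;
}
```

Compiled with `-std=c++20` or later, the guard never touches `<ciso646>`, so the warning is not triggered; with `-std=c++17` the older header is still used.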
fwyzard force-pushed the IB/CMSSW_20_1_X/g15__CUDA_13_3_0 branch from d9be15e to 4f0cb4f on June 14, 2026 18:07
@cmsbuild

Pull request #10616 was updated.

fwyzard commented Jun 14, 2026

test parameters:

fwyzard commented Jun 14, 2026

please test

@cmsbuild

-1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-36c6df/53941/summary.html
COMMIT: 4f0cb4f
CMSSW: CMSSW_20_1_X_2026-06-13-1100/el9_amd64_gcc15
Additional Tests: GPU,NVIDIA_H100,NVIDIA_L40S,NVIDIA_T4
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/10616/53941/install.sh to create a dev area with all the needed externals and cmssw changes.

Failed External Build

I found compilation error when building:

cmake/modules/FindVecCore.cmake:12 (find_package_handle_standard_args)
CMakeLists.txt:404 (find_package)


-- Configuring incomplete, errors occurred!
error: Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.LwXJWu (%build)

RPM build errors:
Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.LwXJWu (%build)

Allocating resources {'name': 'build-build-external+py3-onnx+1.21.0-d96e9f670dda4dcfcbe73ab15a7a4d79', 'cpu': 654.5, 'rss': 7717519360, 'time': 3928.14375}


fwyzard commented Jun 15, 2026

Let's directly merge #10617 in the master branch.

fwyzard closed this on Jun 15, 2026
fwyzard deleted the IB/CMSSW_20_1_X/g15__CUDA_13_3_0 branch on June 16, 2026 06:07