Skip to content

[devel] Update to CUDA 13#10613

Closed
fwyzard wants to merge 2 commits into
cms-sw:IB/CMSSW_20_1_X/develfrom
fwyzard:IB/CMSSW_20_0_X/devel_CUDA_13_3_0
Closed

[devel] Update to CUDA 13#10613
fwyzard wants to merge 2 commits into
cms-sw:IB/CMSSW_20_1_X/develfrom
fwyzard:IB/CMSSW_20_0_X/devel_CUDA_13_3_0

Conversation

@fwyzard

@fwyzard fwyzard commented Jun 8, 2026

Copy link
Copy Markdown
Contributor
  • update CUDA to version 13.3.0
    • drop the Volta (sm 7.0) architecture, that is not supported by CUDA 13
  • update cuDNN to version 9.23.0.39
  • update ONNXRuntime to version 1.26.0

@fwyzard

fwyzard commented Jun 8, 2026

Copy link
Copy Markdown
Contributor Author

enable gpu

@cmsbuild

cmsbuild commented Jun 8, 2026

Copy link
Copy Markdown
Contributor

A new Pull Request was created by @fwyzard for branch IB/CMSSW_20_0_X/devel.

@akritkbehera, @cmsbuild, @iarspider, @raoatifshad, @smuzaffar can you please review it and eventually sign? Thanks.
@ftenchini, @mandrenguyen, @sextonkennedy you are the release manager for this.
cms-bot commands are listed here

@cmsbuild

cmsbuild commented Jun 8, 2026

Copy link
Copy Markdown
Contributor

cms-bot internal usage

@fwyzard

fwyzard commented Jun 8, 2026

Copy link
Copy Markdown
Contributor Author

please test

@makortel

makortel commented Jun 8, 2026

Copy link
Copy Markdown
Contributor

Just to clarify, is the plan to test this first in the DEVEL IB, and then deploy in the default IB later in 20_1_X?

@fwyzard

fwyzard commented Jun 9, 2026

Copy link
Copy Markdown
Contributor Author

The interest is to enable c++23.
I guess that requires also ROOT 6.40, but I'm not sure.

@cmsbuild

cmsbuild commented Jun 9, 2026

Copy link
Copy Markdown
Contributor

-1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-ecdaa7/53757/summary.html
COMMIT: e27a764
CMSSW: CMSSW_20_0_DEVEL_X_2026-06-08-2300/el8_amd64_gcc13
Additional Tests: GPU,AMD_MI300X,AMD_W7900,NVIDIA_H100,NVIDIA_L40S,NVIDIA_T4
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/10613/53757/install.sh to create a dev area with all the needed externals and cmssw changes.

Failed External Build

I found compilation error when building:

+ chmod -Rf a+rX,u+w,g-w,o-w .
+ '[' 13 '!=' 12 ']'
+ echo 'Incompatible CUDA version in cudnn.spec!'
Incompatible CUDA version in cudnn.spec!
+ exit 1
error: Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.kTun3j (%prep)

RPM build warnings:
Macro expanded in comment on line 487: %{pkginstroot}/lib64




@fwyzard fwyzard changed the title Update CUDA to version 13.3.0 [devel] Update CUDA to version 13.3.0 Jun 9, 2026
Update CUDA to version 13.3.0 and cuDNN to version 9.23.0.39.
Drop the Volta (sm 7.0) architecture, that is not supported by CUDA 13.
@fwyzard fwyzard force-pushed the IB/CMSSW_20_0_X/devel_CUDA_13_3_0 branch from e27a764 to 25bbe28 Compare June 9, 2026 05:00
@cmsbuild

cmsbuild commented Jun 9, 2026

Copy link
Copy Markdown
Contributor

Pull request #10613 was updated.

@fwyzard

fwyzard commented Jun 9, 2026

Copy link
Copy Markdown
Contributor Author

please test

@fwyzard fwyzard changed the title [devel] Update CUDA to version 13.3.0 [devel] Update to CUDA 13 Jun 9, 2026
@cmsbuild

cmsbuild commented Jun 9, 2026

Copy link
Copy Markdown
Contributor

-1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-ecdaa7/53758/summary.html
COMMIT: 25bbe28
CMSSW: CMSSW_20_0_DEVEL_X_2026-06-08-2300/el8_amd64_gcc13
Additional Tests: GPU,AMD_MI300X,AMD_W7900,NVIDIA_H100,NVIDIA_L40S,NVIDIA_T4
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/10613/53758/install.sh to create a dev area with all the needed externals and cmssw changes.

Failed External Build

I found compilation error when building:

Requested to quit.
Requested to quit.
* The action "build-build-external+onnxruntime+1.25.1-95422ee4e6e44dd99dbfd3c469f5a4e4" was not completed successfully because Failed to build onnxruntime. Log file in /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/BUILD/el8_amd64_gcc13/external/onnxruntime/1.25.1-95422ee4e6e44dd99dbfd3c469f5a4e4/log. Final lines of the log file:
171 |   ABSL_INTERNAL_X(insert_or_assign, insert_or_assign_impl, const &, &&, false,
|                                                                                                                                                                                                                     ^
/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/BUILD/el8_amd64_gcc13/external/onnxruntime/1.25.1-95422ee4e6e44dd99dbfd3c469f5a4e4/build/_deps/abseil_cpp-src/absl/container/internal/raw_hash_map.h:180:200: error: using template type parameter 'absl::lts_20250814::container_internal::IfRRef::AddPtr' after 'typename'
180 |   ABSL_INTERNAL_X(insert_or_assign, insert_or_assign_impl, &&, const &, false,
|                                                                                                                                                                                                        ^         
/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/BUILD/el8_amd64_gcc13/external/onnxruntime/1.25.1-95422ee4e6e44dd99dbfd3c469f5a4e4/build/_deps/abseil_cpp-src/absl/container/internal/raw_hash_map.h:180:211: error: template argument 6 is invalid
180 |   ABSL_INTERNAL_X(insert_or_assign, insert_or_assign_impl, &&, const &, false,
|                                                                                                                                                                                                                   ^


@akritkbehera akritkbehera changed the base branch from IB/CMSSW_20_0_X/devel to IB/CMSSW_20_1_X/devel June 9, 2026 07:44
@cmsbuild

cmsbuild commented Jun 9, 2026

Copy link
Copy Markdown
Contributor

Pull request #10613 was updated.

@fwyzard

fwyzard commented Jun 9, 2026

Copy link
Copy Markdown
Contributor Author

please test

@makortel

makortel commented Jun 9, 2026

Copy link
Copy Markdown
Contributor

(mirroring the discussion in Mattermost) IIRC CUDA is the last major component holding us back from enabling C++23. I think we should test it (and enabling Alpaka's CUDA backend) in CPP23 IB.

I guess we can test CUDA 13.3 first in DEVEL and then in CPP23, or test directly in CPP23.

@fwyzard

fwyzard commented Jun 9, 2026

Copy link
Copy Markdown
Contributor Author

@makortel @smuzaffar do we have a cpp23 branch for 17.x or 21.x ?

I found only IB/CMSSW_16_1_X/cpp23

@smuzaffar

smuzaffar commented Jun 9, 2026

Copy link
Copy Markdown
Contributor

@fwyzard , c++23 is abled for multiple IBs .e.g see https://raw.githubusercontent.com/cms-sw/cms-bot/refs/heads/master/config.map (look for cpp23 flag under e.g. BUILD_OPTS=estats,cpp23,without:cuda,microarchs:x86-64-v2; . el9_amd64_gcc15 normal IBs are also based on c++23. Note that all cpp23 IBs are built without cuda. We can use cmsdistr branch IB/CMSSW_20_1_X/g15 for testing . I need to open cms-bot PR to remove the build without:cuda flag though

@fwyzard

fwyzard commented Jun 9, 2026

Copy link
Copy Markdown
Contributor Author

OK, then let me open a PR for the g15 branch to update CUDA.

Then you can open a PR to upate the bot, and we can see what happens :-)

@fwyzard

fwyzard commented Jun 9, 2026

Copy link
Copy Markdown
Contributor Author

please test

@smuzaffar

Copy link
Copy Markdown
Contributor

bot PR is open cms-sw/cms-bot#2785 . We need cmsdist PR for gcc15 branch to test that

@smuzaffar

smuzaffar commented Jun 9, 2026

Copy link
Copy Markdown
Contributor

https://cmssdt.cern.ch/jenkins/job/ib-run-pr-tests/53784/console is now testing this for el9_amd64_gcc15

(typo, I meant now running)

@fwyzard

fwyzard commented Jun 9, 2026

Copy link
Copy Markdown
Contributor Author

Done, see #10616.

@fwyzard

fwyzard commented Jun 9, 2026

Copy link
Copy Markdown
Contributor Author

I'll close this PR and follow up there.

@fwyzard fwyzard closed this Jun 9, 2026
@fwyzard fwyzard deleted the IB/CMSSW_20_0_X/devel_CUDA_13_3_0 branch June 9, 2026 16:27
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants