[devel] Update to CUDA 13#10613
Conversation
|
enable gpu |
|
A new Pull Request was created by @fwyzard for branch IB/CMSSW_20_0_X/devel. @akritkbehera, @cmsbuild, @iarspider, @raoatifshad, @smuzaffar can you please review it and eventually sign? Thanks. |
|
cms-bot internal usage |
|
please test |
|
Just to clarify, is the plan to test this first in the DEVEL IB, and then deploy in the default IB later in 20_1_X? |
|
The interest is to enable c++23. |
|
-1 Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-ecdaa7/53757/summary.html Failed External BuildI found compilation error when building: + chmod -Rf a+rX,u+w,g-w,o-w .
+ '[' 13 '!=' 12 ']'
+ echo 'Incompatible CUDA version in cudnn.spec!'
Incompatible CUDA version in cudnn.spec!
+ exit 1
error: Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.kTun3j (%prep)
RPM build warnings:
Macro expanded in comment on line 487: %{pkginstroot}/lib64
|
Update CUDA to version 13.3.0 and cuDNN to version 9.23.0.39. Drop the Volta (sm 7.0) architecture, that is not supported by CUDA 13.
e27a764 to
25bbe28
Compare
|
Pull request #10613 was updated. |
|
please test |
|
-1 Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-ecdaa7/53758/summary.html Failed External BuildI found compilation error when building: Requested to quit. Requested to quit. * The action "build-build-external+onnxruntime+1.25.1-95422ee4e6e44dd99dbfd3c469f5a4e4" was not completed successfully because Failed to build onnxruntime. Log file in /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/BUILD/el8_amd64_gcc13/external/onnxruntime/1.25.1-95422ee4e6e44dd99dbfd3c469f5a4e4/log. Final lines of the log file: 171 | ABSL_INTERNAL_X(insert_or_assign, insert_or_assign_impl, const &, &&, false, | ^ /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/BUILD/el8_amd64_gcc13/external/onnxruntime/1.25.1-95422ee4e6e44dd99dbfd3c469f5a4e4/build/_deps/abseil_cpp-src/absl/container/internal/raw_hash_map.h:180:200: error: using template type parameter 'absl::lts_20250814::container_internal::IfRRef::AddPtr' after 'typename' 180 | ABSL_INTERNAL_X(insert_or_assign, insert_or_assign_impl, &&, const &, false, | ^ /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/BUILD/el8_amd64_gcc13/external/onnxruntime/1.25.1-95422ee4e6e44dd99dbfd3c469f5a4e4/build/_deps/abseil_cpp-src/absl/container/internal/raw_hash_map.h:180:211: error: template argument 6 is invalid 180 | ABSL_INTERNAL_X(insert_or_assign, insert_or_assign_impl, &&, const &, false, | ^ |
|
Pull request #10613 was updated. |
|
please test |
|
(mirroring the discussion in Mattermost) IIRC CUDA is the last major component holding us back from enabling C++23. I think we should test it (and enabling Alpaka's CUDA backend) in CPP23 IB. I guess we can test CUDA 13.3 first in DEVEL and then in CPP23, or test directly in CPP23. |
|
@makortel @smuzaffar do we have a cpp23 branch for 17.x or 21.x ? I found only |
|
@fwyzard , |
|
OK, then let me open a PR for the Then you can open a PR to upate the bot, and we can see what happens :-) |
|
please test |
|
bot PR is open cms-sw/cms-bot#2785 . We need cmsdist PR for gcc15 branch to test that |
|
https://cmssdt.cern.ch/jenkins/job/ib-run-pr-tests/53784/console is now testing this for el9_amd64_gcc15 (typo, I meant now running) |
|
Done, see #10616. |
|
I'll close this PR and follow up there. |
Uh oh!
There was an error while loading. Please reload this page.