Skip to content

Releases: NVIDIA/k8s-device-plugin

v0.19.3

22 Jun 18:29
v0.19.3
570e776

Choose a tag to compare

Changelog

Version v0.19.3

  • bump nvcr.io/nvidia/distroless/go to v4.0.8-dev (#1852)
  • revert default enablement of features mofed, gdrcopy and mofed (#1837)
  • bump golang version to 1.26.4 (#1828)
  • bump golang.org/x/net to v0.56.0 (#1867)

v0.19.2

27 May 03:43
5939c7a

Choose a tag to compare

Changelog

Version v0.19.2

  • [release-0.19] Bump k8s-device-plugin version to v0.19.2 (#1814)
  • [release-0.19] Bump toolkit go module to 1.19.1 (#1776)
  • Always use dedicated service account in device-plugin helm chart (#1804)
  • [toolkit] add support for injecting /dev/dri* devices nodes for gfx MIGs (#1785)
  • bump go-nvlib, selinux, x/net, x/mod deps (#1774)

v0.19.1

23 Apr 14:23
v0.19.1
b24f243

Choose a tag to compare

What's Changed

  • [release-0.19] wsl: report a single "all" device to kubelet by @github-actions[bot] in #1699
  • [release-0.19] Fix CDI spec generation to respect driver root for Tegra CSV files by @github-actions[bot] in #1701
  • Bump actions/configure-pages from 5 to 6 by @dependabot[bot] in #1707
  • Bump actions/deploy-pages from 4 to 5 by @dependabot[bot] in #1703
  • Bump golang from 1.26.1 to 1.26.2 in /deployments/container by @dependabot[bot] in #1704
  • Bump actions/upload-pages-artifact from 4 to 5 by @dependabot[bot] in #1705
  • Bump nvidia/distroless/go from v4.0.3-dev to v4.0.4-dev in /deployments/container by @dependabot[bot] in #1702
  • Bump google.golang.org/grpc from 1.79.1 to 1.79.3 by @dependabot[bot] in #1711
  • Bump the k8sio group across 1 directory with 5 updates by @dependabot[bot] in #1710
  • Bump release v0.19.1 by @elezar in #1712

Full Changelog: v0.19.0...v0.19.1

v0.19.0

17 Mar 18:24
v0.19.0
1ae8e5f

Choose a tag to compare

Changelog

  • Add --sleep-interval=infinite support to GFD for running as a pod (#1603)
  • Fix image tag in static deployment (#1604)
  • Add ownerReference to NodeFeature CRs for garbage collection (#1597)
  • Change default value for gds, gdrcopy and mofed flags (#1550)
  • Fix healthchecking on old devices (#1562)
  • Enable NodeFeature API by default in GFD (#1504)
  • Build multiarch images on native GitHub runners (#1468)

v0.18.2

23 Jan 14:24
v0.18.2
fb1242a

Choose a tag to compare

What's Changed

  • Ensure that cdi.FeatureFlags are passed to CDI library
  • Fix race condition in config-manager when label is unset
  • Fix nested container use cases by ensuring that IPC sockets are not mounted readonly
  • Bump NVIDIA Container Toolkit to v1.18.2
  • Bump distroless base image to v3.2.2-dev

Full Changelog: v0.18.1...v0.18.2

v0.18.1

07 Jan 22:10
dfcf728

Choose a tag to compare

Changelog

  • Allow CDI feature flags to be set
  • Pass driver root to nvinfo.New in device plugin main
  • Bump NVIDIA Container Toolkit to v1.18.1
  • Bump distroless base image to v3.2.1-dev
  • Bump github.com/opencontainers/selinux from 1.12.0 to 1.13.1 (#1506)

Full Changelog: v0.18.0...v0.18.1

v0.18.0

21 Oct 13:53
v0.18.0
3c9ffca

Choose a tag to compare

Changelog

  • Rename getHealthCheckXids and clarify documentation
  • Add support for explicitly enabling XIDs in health checks
  • Deduplicate requested device IDs
  • Check for nil before reading boolean config values
  • Make gated modes (GDS, MOFED, GDRCOPY) optional in CDI
  • Add support for setting gdrcopyEnabled
  • Ignore errors getting device memory using NVML
  • Ensure that directory volumes have Directory type
  • Switch to plain golang image for builds
  • Remove unneeded intermediate container
  • Update CI definitions
  • Switch to distroless golang image
  • Update README.md with RuntimeClass
  • Pass a single context throughout the device-plugin method call stack (#1284)
  • Remove internal logger in favour of klog (#1277)
  • Remove FAIL_ON_INIT_ERROR from static examples
  • Detect blackwell architecture
  • Updated .release:staging to stage device-plugin images in nvstaging
  • Use MiB instead of MB for gpu-memory
  • Ignore XID error 109
  • Update README.md adjust set docker runtime default
  • Remove nvidia.com/gpu.imex-domain label
  • Fix containerd runc config error when creating a kind cluster
  • Use stable nividia-container-toolkit repo when creating a kind cluster
  • Switch to context package in go stdlib
  • Raise a warning instead of an error if GPU mode labeler fails
  • Add ada-lovelace architecture label for compute capability 8.9
  • Ensure FAIL_ON_INIT_ERROR boolean env is quoted
  • Honor fail-on-init-error when no resources are found
  • Enable hostPID in the mps-control-daemon pod (#1045)

Full Changelog: v0.17.1...v0.18.0

v0.17.4

09 Sep 18:53
fd56a74

Choose a tag to compare

What's Changed

  • Bump slackapi/slack-github-action from 2.1.0 to 2.1.1 by @dependabot[bot] in #1317
  • Bump github.com/NVIDIA/go-nvlib from 0.7.2 to 0.7.4 by @dependabot[bot] in #1346
  • Bump golang from 1.23.11 to 1.23.12 in /deployments/devel by @dependabot[bot] in #1355
  • Ensure that directory volumes have Directory type by @elezar in #1368
  • Bump nvidia/cuda from 12.9.1-base-ubi9 to 13.0.0-base-ubi9 in /deployments/container by @dependabot[bot] in #1369
  • Ignore errors getting device memory using NVML by @elezar in #1374
  • Bump project version to v0.17.4 by @cdesiniotis in #1402
  • [no-relnote] update ngc publishing logic for release pipelines by @cdesiniotis in #1406

Full Changelog: v0.17.3...v0.17.4

v0.17.3

24 Jul 09:53
v0.17.3
e0a461e

Choose a tag to compare

What's Changed

  • Bump github.com/NVIDIA/nvidia-container-toolkit from 1.17.6 to 1.17.8 by @dependabot[bot] in #1275
  • Bump nvidia/cuda from 12.9.0-base-ubi9 to 12.9.1-base-ubi9 in /deployments/container by @dependabot[bot] in #1300
  • Bump github.com/NVIDIA/go-nvml from 0.12.4-1 to 0.12.9-0 by @dependabot[bot] in #1287
  • Bump golang from 1.23.9 to 1.23.10 in /deployments/devel by @dependabot[bot] in #1283
  • Bump golang from 1.23.10 to 1.23.11 in /deployments/devel by @dependabot[bot] in #1318
  • Bump release v0.17.3 by @elezar in #1326
  • Backport: Bump golang.org/x/oauth2 from 0.23.0 to 0.27.0 by @cdesiniotis in #1328
  • Updated .release:staging to stage device-plugin images in nvstaging by @elezar in #1329

Full Changelog: v0.17.2...v0.17.3

v0.17.2

13 May 18:21
v0.17.2
390b1f6

Choose a tag to compare

What's Changed

  • Update nvidia.com/gpu.product label to include blackwell architectures
  • Update documentation to indicate that nvidia.com/gpu.memory label is in MiB instead of MB

Full Changelog: v0.17.1...v0.17.2