nvidia-container-toolkit

mirror of https://github.com/NVIDIA/nvidia-container-toolkit synced 2025-06-26 18:18:24 +00:00

Author	SHA1	Message	Date
Evan Lezar	7e1beb7aa6	[no-relnote] Update gitlab CI for new image names We now release all images with vX.Y.Z and vX.Y.Z-packaging tags. This change updates the gitlab CI to allow for this. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-18 11:51:52 +02:00
Evan Lezar	d560888f1f	Use single version tag for image Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-17 19:46:42 +02:00
Evan Lezar	81fb7bb9c1	Merge pull request #602 from elezar/use-ubi8-image Use single image for deb and rpm-based systems	2025-06-17 15:09:47 +02:00
Evan Lezar	208896d87d	Merge pull request #1130 from ArangoGutierrez/fix/1049 BUGFIX: modifier: respect GPU volume-mount device requests	2025-06-17 15:04:22 +02:00
Evan Lezar	82b62898bf	[no-relnote] Make devicesFromEnvvars private Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-17 11:55:00 +02:00
Carlos Eduardo Arango Gutierrez	d03a06029a	BUGFIX: modifier: respect GPU volume-mount device requests The gated modifiers used to add support for GDS, Mofed, and CUDA Forward Comatibility only check the NVIDIA_VISIBLE_DEVICES envvar to determine whether GPUs are requested and modifications should be made. This means that use cases where volume mounts are used to request devices are not supported. This change ensures that device extraction is consistent for all use cases. Signed-off-by: Carlos Eduardo Arango Gutierrez <eduardoa@nvidia.com> Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-17 11:55:00 +02:00
Evan Lezar	f4f7da65f1	Ensure consistent sorting of annotation devices Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-17 11:55:00 +02:00
Carlos Eduardo Arango Gutierrez	5fe7b06514	[no-relnote] new helper func visibleEnvVars Signed-off-by: Carlos Eduardo Arango Gutierrez <eduardoa@nvidia.com> Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-17 11:54:53 +02:00
Evan Lezar	5606caa5af	Extract deb and rpm packages to single image This change swithces to using a single image for the NVIDIA Container Toolkit contianer. Here the contents of the architecture-specific deb and rpm packages are extracted to a known root. These contents can then be installed using the updated installation mechanism which has been updated to detect the source root based on the packaging type. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-16 19:15:56 +02:00
Evan Lezar	8149be09ac	Merge pull request #1149 from NVIDIA/dependabot/go_modules/main/github.com/urfave/cli/v2-2.27.7 Bump github.com/urfave/cli/v2 from 2.27.6 to 2.27.7	2025-06-16 16:28:29 +02:00
Evan Lezar	d935648722	Merge pull request #1140 from NVIDIA/dependabot/go_modules/main/github.com/NVIDIA/go-nvlib-0.7.3 Bump github.com/NVIDIA/go-nvlib from 0.7.2 to 0.7.3	2025-06-16 16:26:57 +02:00
dependabot[bot]	1f43b71dd8	Bump github.com/urfave/cli/v2 from 2.27.6 to 2.27.7 Bumps [github.com/urfave/cli/v2](https://github.com/urfave/cli) from 2.27.6 to 2.27.7. - [Release notes](https://github.com/urfave/cli/releases) - [Changelog](https://github.com/urfave/cli/blob/main/docs/CHANGELOG.md) - [Commits](https://github.com/urfave/cli/compare/v2.27.6...v2.27.7) --- updated-dependencies: - dependency-name: github.com/urfave/cli/v2 dependency-version: 2.27.7 dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>	2025-06-16 08:56:44 +00:00
Evan Lezar	b33d475ff3	Merge pull request #1145 from elezar/remove-docker-runc Some checks failed CI Pipeline / code-scanning (push) Has been cancelled Details CI Pipeline / variables (push) Has been cancelled Details CI Pipeline / golang (push) Has been cancelled Details CI Pipeline / image (push) Has been cancelled Details CI Pipeline / e2e-test (push) Has been cancelled Details Remove docker-run as default runtime candidate	2025-06-13 19:27:08 +02:00
Evan Lezar	6359cc9919	Remove docker-run as default runtime candidate This change removes docker-runc as the highest priority default candidate for the low-level runtimes supported by the nvidia-container-runtime. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-13 17:26:19 +02:00
Evan Lezar	4f9c860a37	Merge pull request #927 from elezar/disable-device-node-creation Disable device node creation in CDI mode	2025-06-13 16:44:18 +02:00
Evan Lezar	bab9fdf607	Merge pull request #1144 from NVIDIA/dependabot/submodules/main/third_party/libnvidia-container-710a0f1 Bump third_party/libnvidia-container from `6eda4d7` to `710a0f1`	2025-06-13 16:42:53 +02:00
Evan Lezar	cc7812470f	Merge pull request #1143 from elezar/add-device-ids-to-getspec Add device IDs to nvcdi.GetSpec API	2025-06-13 16:41:43 +02:00
Evan Lezar	bdcdcb7449	Merge pull request #1132 from elezar/make-cdi-device-extraction-consistent Make CDI device requests consistent with other methods	2025-06-13 16:05:44 +02:00
Evan Lezar	8be03cfc41	[no-relnote] Ignore annotation devices for non-CDI modes Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-13 15:26:48 +02:00
Evan Lezar	8650ca6533	[no-relnote] Move hookCreator initialisation for readability Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-13 15:00:51 +02:00
Evan Lezar	1bc2a9fee3	Return annotation devices from VisibleDevices This change includes annotation devices in CUDA.VisibleDevices with the highest priority. This allows for the CDI device request extraction to be consistent across all request mechanisms. Note that this does change behaviour in the following ways: 1. Annotations are considered when resolving the runtime mode. 2. Incorrectly formed device names in annotations are no longer treated as an error. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-13 14:56:08 +02:00
Evan Lezar	dc87dcf786	Make CDI device requests consistent with other methods Following the refactoring of device request extraction, we can now make CDI device requests consistent with other methods. This change moves to using image.VisibleDevices instead of separate calls to CDIDevicesFromMounts and VisibleDevicesFromEnvVar. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-13 14:34:02 +02:00
Evan Lezar	f17d424248	Construct container info once Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-13 14:05:51 +02:00
Evan Lezar	426186c992	Add logic to extract annotation device requests to image type This change updates the image.CUDA type to also extract CDI device requests. These are only relevant IF CDI prefixes are specifically set. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-13 14:05:51 +02:00
Evan Lezar	6849ebd621	Add IsPrivileged function to CUDA container type Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-13 14:05:51 +02:00
dependabot[bot]	4a6685d3a8	Bump third_party/libnvidia-container from `6eda4d7` to `710a0f1` Bumps [third_party/libnvidia-container](https://github.com/NVIDIA/libnvidia-container) from `6eda4d7` to `710a0f1`. - [Release notes](https://github.com/NVIDIA/libnvidia-container/releases) - [Commits](`6eda4d76c8...710a0f1304`) --- updated-dependencies: - dependency-name: third_party/libnvidia-container dependency-version: 710a0f1304dacb9de06716a4e24160698952aa81 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>	2025-06-13 08:14:38 +00:00
Evan Lezar	2ccf67c40f	[no-relnote] Remove test output file Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-12 16:34:08 +02:00
Evan Lezar	0134ba4250	Add device IDs to nvcdi.GetSpec API This change allows device IDs to the specified in the GetSpec API. This simplifies cases where CDI specs are being generated for specific devices by ID. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-12 16:23:11 +02:00
dependabot[bot]	eab9cdf1c2	Bump github.com/NVIDIA/go-nvlib from 0.7.2 to 0.7.3 Bumps [github.com/NVIDIA/go-nvlib](https://github.com/NVIDIA/go-nvlib) from 0.7.2 to 0.7.3. - [Release notes](https://github.com/NVIDIA/go-nvlib/releases) - [Commits](https://github.com/NVIDIA/go-nvlib/compare/v0.7.2...v0.7.3) --- updated-dependencies: - dependency-name: github.com/NVIDIA/go-nvlib dependency-version: 0.7.3 dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>	2025-06-11 12:58:33 +00:00
Evan Lezar	dba15acdcc	Merge pull request #1135 from NVIDIA/dependabot/go_modules/tests/main/golang.org/x/crypto-0.39.0 Some checks failed CI Pipeline / code-scanning (push) Has been cancelled Details CI Pipeline / variables (push) Has been cancelled Details CI Pipeline / golang (push) Has been cancelled Details CI Pipeline / image (push) Has been cancelled Details CI Pipeline / e2e-test (push) Has been cancelled Details Bump golang.org/x/crypto from 0.38.0 to 0.39.0 in /tests	2025-06-11 14:57:38 +02:00
Evan Lezar	8339fb1ec3	Merge pull request #1139 from NVIDIA/dependabot/go_modules/main/github.com/NVIDIA/go-nvml-0.12.9-0 Bump github.com/NVIDIA/go-nvml from 0.12.4-1 to 0.12.9-0	2025-06-11 14:57:18 +02:00
Evan Lezar	b9d646c80d	Merge pull request #1136 from NVIDIA/dependabot/docker/deployments/devel/main/golang-1.24.4 Bump golang from 1.24.3 to 1.24.4 in /deployments/devel	2025-06-11 14:53:57 +02:00
Evan Lezar	7380cff645	Merge pull request #1134 from NVIDIA/dependabot/go_modules/main/golang.org/x/mod-0.25.0 Bump golang.org/x/mod from 0.24.0 to 0.25.0	2025-06-11 14:53:34 +02:00
dependabot[bot]	f91736d832	Bump github.com/NVIDIA/go-nvml from 0.12.4-1 to 0.12.9-0 Bumps [github.com/NVIDIA/go-nvml](https://github.com/NVIDIA/go-nvml) from 0.12.4-1 to 0.12.9-0. - [Release notes](https://github.com/NVIDIA/go-nvml/releases) - [Commits](https://github.com/NVIDIA/go-nvml/compare/v0.12.4-1...v0.12.9-0) --- updated-dependencies: - dependency-name: github.com/NVIDIA/go-nvml dependency-version: 0.12.9-0 dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>	2025-06-11 09:03:11 +00:00
dependabot[bot]	5ccac4da5a	Bump golang from 1.24.3 to 1.24.4 in /deployments/devel Bumps golang from 1.24.3 to 1.24.4. --- updated-dependencies: - dependency-name: golang dependency-version: 1.24.4 dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>	2025-06-06 09:01:57 +00:00
dependabot[bot]	1aee45be2d	Bump golang.org/x/crypto from 0.38.0 to 0.39.0 in /tests Bumps [golang.org/x/crypto](https://github.com/golang/crypto) from 0.38.0 to 0.39.0. - [Commits](https://github.com/golang/crypto/compare/v0.38.0...v0.39.0) --- updated-dependencies: - dependency-name: golang.org/x/crypto dependency-version: 0.39.0 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com>	2025-06-06 08:43:50 +00:00
dependabot[bot]	0d4a7f1d5a	Bump golang.org/x/mod from 0.24.0 to 0.25.0 Bumps [golang.org/x/mod](https://github.com/golang/mod) from 0.24.0 to 0.25.0. - [Commits](https://github.com/golang/mod/compare/v0.24.0...v0.25.0) --- updated-dependencies: - dependency-name: golang.org/x/mod dependency-version: 0.25.0 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com>	2025-06-06 08:43:41 +00:00
Evan Lezar	27f5ec83de	Merge pull request #1125 from elezar/vulkan-target-cpu Some checks failed CI Pipeline / code-scanning (push) Has been cancelled Details CI Pipeline / variables (push) Has been cancelled Details CI Pipeline / golang (push) Has been cancelled Details CI Pipeline / image (push) Has been cancelled Details CI Pipeline / e2e-test (push) Has been cancelled Details Add discovery of arch-specific vulkan ICD	2025-06-05 14:09:50 +02:00
Carlos Eduardo Arango Gutierrez	0a3146f74f	Merge pull request #1076 from ArangoGutierrez/refresh_cdi Add nvidia-cdi-refresh service	2025-06-05 13:08:57 +02:00
Carlos Eduardo Arango Gutierrez	28add0a532	Merge pull request #1123 from ArangoGutierrez/cdi_generate_env_cli Add EnvVars option for all nvidia-ctk cdi commands	2025-06-05 13:05:46 +02:00
Evan Lezar	b55255e31f	Merge pull request #1110 from ArangoGutierrez/i/1049 Refactor extracting requested devices from the container image	2025-06-05 13:00:32 +02:00
Carlos Eduardo Arango Gutierrez	dede03f322	Refactor extracting requested devices from the container image This change consolidates the logic for determining requested devices from the container image. The logic for this has been integrated into the image.CUDA type so that multiple implementations are not required. Signed-off-by: Carlos Eduardo Arango Gutierrez <eduardoa@nvidia.com> Co-authored-by: Evan Lezar <elezar@nvidia.com>	2025-06-05 12:38:45 +02:00
Carlos Eduardo Arango Gutierrez	ce3e2c1ed5	Add EnvVars option for all nvidia-ctk cdi commands Signed-off-by: Carlos Eduardo Arango Gutierrez <eduardoa@nvidia.com>	2025-06-05 11:22:35 +02:00
Carlos Eduardo Arango Gutierrez	a537d0323d	Add nvidia-cdi-refresh service Automatic regeneration of /var/run/cdi/nvidia.yaml New units: • nvidia-cdi-refresh.service – one-shot wrapper for nvidia-ctk cdi generate (adds sleep + required caps). • nvidia-cdi-refresh.path – fires on driver install/upgrade via modules.dep.bin changes. Packaging • RPM %post reloads systemd and enables the path unit on fresh installs. • DEB postinst does the same (configure, skip on upgrade). Result: CDI spec is always up to date Signed-off-by: Carlos Eduardo Arango Gutierrez <eduardoa@nvidia.com>	2025-06-05 10:54:15 +02:00
Evan Lezar	2de997e25b	Add discovery of arch-specific vulkan ICD On some RPM-based platforms, the path of the Vulkan ICD file include an architecture-specific infix to distinguish it from other architectures. (Most notably x86_64 vs i686). This change attempts to discover the arch-specific ICD file in addition to the standard nvidia_icd.json. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-04 23:06:16 +02:00
Evan Lezar	e046d6ae79	Add disabled-device-node-modification hook to CDI spec Some checks failed CI Pipeline / code-scanning (push) Has been cancelled Details CI Pipeline / variables (push) Has been cancelled Details CI Pipeline / golang (push) Has been cancelled Details CI Pipeline / image (push) Has been cancelled Details CI Pipeline / e2e-test (push) Has been cancelled Details This hook is not added to management specs. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-04 19:08:48 +02:00
Evan Lezar	0c8723a93a	Add a hook to disable device node creation in a container If required, this hook creates a modified params file (with ModifyDeviceFiles: 0) in a tmpfs and mounts this over /proc/driver/nvidia/params. This prevents device node creation when running tools such as nvidia-smi. In general the creation of these devices is cosmetic as a container does not have the required cgroup access for the devices. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-04 19:08:48 +02:00
Evan Lezar	fdcd250362	Merge pull request #1129 from elezar/fix-deduplicate-driver-store-wsl Minor cleanup of WSL2 CDI spec generation	2025-06-04 10:16:36 +02:00
Evan Lezar	b66d37bedb	[no-relnote] Minor code cleanup in WSL2 discoverer Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-03 23:43:17 +02:00
Evan Lezar	0c905d0de2	[no-relnote] Remove unneeded indirection Signed-off-by: Evan Lezar <elezar@nvidia.com>	2025-06-03 23:43:17 +02:00

1 2 3 4 5 ...

2425 Commits