nvidia-container-toolkit

mirror of https://github.com/NVIDIA/nvidia-container-toolkit synced 2024-12-01 16:52:54 +00:00

Author	SHA1	Message	Date
Evan Lezar	b18ac09f77	Refactor handling of DriverCapabilities This change consolidates the handling of NVIDIA_DRIVER_CAPABILITIES in the interal/image package. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-08-14 10:40:42 +02:00
Evan Lezar	4dcaa61167	Use internal/config structs in hook This change ensures that the Config structs from internal.Config are used for the NVIDIA Container Runtime Hook config too. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-08-14 10:40:41 +02:00
Evan Lezar	f6a4986c15	Add support for creating oci hook to nvidia-ctk This change extends the nvidia-ctk runtime configure command with a --config-mode=oci-hook that creates an OCI hook json file. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-08-11 16:34:58 +02:00
Evan Lezar	feb069a2e9	Log registry refresh errors in cdi list Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-08-08 16:00:36 +02:00
Evan Lezar	8553fce68a	Specify library search paths for CSV CDI spec generation Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-08-04 16:49:30 +02:00
Evan Lezar	918bd03488	Move tegra-specifics to new package Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-08-04 16:49:30 +02:00
Evan Lezar	e51621aa7f	Handle empty root in config If the config.toml has an empty root specified, this could be passed to the NVIDIA Container CLI through the --root flag which causes argument parsing to fail. This change only adds the --root flag if the config option is specified and is non-empty. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-07-19 14:02:23 +02:00
Evan Lezar	9b64d74f6a	Use functional options when constructing Symlink locator Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-07-18 15:31:15 +02:00
Evan Lezar	3c9d95c62f	Fix usage string in CLI Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-07-18 15:20:24 +02:00
Evan Lezar	1081cecea9	Return empty requirements if NVIDIA_DISABLE_REQUIRE is true Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-07-11 13:47:37 +02:00
Evan Lezar	f78d3a858f	Rework default config generation to not use toml Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-07-05 11:26:55 +02:00
Evan Lezar	65ae6f1dab	Fix generation of default config This change ensures that the nvidia-ctk config default command generates a config file that is compatible with the official documentation to, for example, disable cgroups in the NVIDIA Container CLI. This requires that whitespace around comments is stripped before outputing the contets. This also adds an option to load a config and modify it in-place instead. This can be triggered as a post-install step, for example. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-07-05 11:26:04 +02:00
Evan Lezar	ba24338122	Add quiet mode to nvidia-ctk cli Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-07-05 11:26:04 +02:00
Evan Lezar	baf94181aa	Add engine.Config to encapsulate writing This change adds an engine.Config type to encapsulate the writing of config files for container engines. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-07-03 15:26:47 +02:00
Evan Lezar	d52dbeaa7a	Split internal system package This changes splits the functionality in the internal system package into two packages: one for dealing with devices and one for dealing with kernel modules. This removes ambiguity around the meaning of driver / device roots in each case. In each case, a root can be specified where device nodes are created or kernel modules loaded. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-06-15 09:01:13 +02:00
Evan Lezar	c4d3b13ae2	Update go-nvlib with new constructor Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-06-14 17:55:33 +02:00
Evan Lezar	82347eb9bc	Resolve auto mode as cdi for fully-qualified names Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-06-13 16:05:37 +02:00
Evan Lezar	1d0a733487	Replace logger.Warn(f) with logger.Warning(f) This aligns better with klog used in other projects. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-06-12 10:48:04 +02:00
Evan Lezar	9464953924	Use logger.Interface when resolving auto mode Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-06-12 10:46:11 +02:00
Evan Lezar	c9b05d8fed	Use logger Interface in runtime configuration Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-06-12 10:46:11 +02:00
Evan Lezar	a02bc27c3e	Define a basic logger interface Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-06-12 10:46:10 +02:00
Evan Lezar	6b1e8171c8	Merge branch 'add-mod-probe' into 'main' Add option to load NVIDIA kernel modules See merge request nvidia/container-toolkit/container-toolkit!409	2023-05-31 18:14:45 +00:00
Evan Lezar	2e50b3da7c	Merge branch 'ldcache-resolve-circular' into 'main' Fix infinite recursion when resolving libraries in LDCache Closes #13 See merge request nvidia/container-toolkit/container-toolkit!406	2023-05-31 17:35:27 +00:00
Evan Lezar	b64ba6ac2d	Add option to create device nodes This change adds a --create-device-nodes option to the nvidia-ctk system create-dev-char-symlinks command to create device nodes. The currently only creates control device nodes. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-05-31 19:31:38 +02:00
Evan Lezar	7b801a0ce0	Add option to load NVIDIA kernel modules These changes add a --load-kernel-modules option to the nvidia-ctk system commands. If specified the NVIDIA kernel modules (nvidia, nvidia-uvm, and nvidia-modeset) are loaded before any operations on device nodes are performed. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-05-31 19:31:38 +02:00
Evan Lezar	528cbbb636	Merge branch 'fix-device-symlinks' into 'main' Fix creation of device symlinks in /dev/char See merge request nvidia/container-toolkit/container-toolkit!399	2023-05-31 17:31:04 +00:00
Evan Lezar	39263ea365	Add command to print ldcache Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-05-30 11:02:33 +02:00
Evan Lezar	9ea214d0b3	Correct typo in info command Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-05-30 10:58:30 +02:00
Evan Lezar	315f4adb8f	Check for required device majors Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-05-26 10:24:36 +02:00
Evan Lezar	ac11727ec5	Add nvidia-contianer-runtime-hook.path config option This change adds an nvidia-container-runtime-hook.path config option to allow the path used for the prestart hook to be overridden. This is useful in cases where multiple NVIDIA Container Toolkit installations are present. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-05-25 12:05:33 +02:00
Evan Lezar	927ec78b6e	Add symlinks package with Resolve function This change adds a symlinks.Resolve function for resolving symlinks and updates usages across the code to make use of it. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-05-23 20:42:17 +02:00
Evan Lezar	e30fd0f4ad	Add csv mode to nvidia-ctk cdi generate command This chagne allows the csv mode option to specified in the nvidia-ctk cdi generate command and adds a --csv.file option that can be repeated to specify the CSV files to be processed. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-05-22 13:56:45 +02:00
Evan Lezar	fe37196788	Generate all device using merged transform The nvcid api is extended to allow for merged device options to be specified. If any options are specified, then a merged device is generated. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-05-12 13:52:58 +02:00
Evan Lezar	ba44c50f4e	Add MergedDevice transform to generate all device Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-05-12 13:52:58 +02:00
Evan Lezar	9378d0cd0f	Move discover.FindNvidiaCTK to config.ResolveNVIDIACTKPath Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-05-10 15:12:44 +02:00
Evan Lezar	f9df36c473	Rename config struct to options Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-05-10 15:12:00 +02:00
Evan Lezar	3945abb2f2	Add nvidia-ctk cdi list command Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-05-09 19:59:00 +02:00
Evan Lezar	1bd5798a99	Use toml representation to get defaults Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-05-08 11:26:53 +02:00
Evan Lezar	3e7acec0b4	Add nvidia-ctk config generate-default command This change adds a CLI command to generate a default config. This config checks the host operating system to apply specific modifications that were previously captured in static config files. These include: * select /sbin/ldconfig or /sbin/ldconfig.real depending on which exists on the host * set the user to allow device access on SUSE-based systems Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-05-03 16:11:05 +02:00
Evan Lezar	4165961d31	Rename config struct options to avoid conflict This change renames the struct for storing CLI flag values options over config to avoid a conflict with the config package. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-05-03 15:59:02 +02:00
Carlos Eduardo Arango Gutierrez	6750df8e01	Merge branch 'fix-cdi-spec-permissions' into 'main' Generate CDI specifications with 644 permissions to allow non-root clients to consume them See merge request nvidia/container-toolkit/container-toolkit!381	2023-05-02 19:36:40 +00:00
Elliot Courant	140b1e33ef	chore(cmd): Fixing minor spelling error. Fixed a minor spelling error inside `nvidia-ctk system create-device-nodes`. Signed-off-by: Elliot Courant <me@elliotcourant.dev>	2023-05-02 12:53:45 -05:00
Evan Lezar	3056428eda	Generate spec file with 644 permissions Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-05-02 16:47:44 +02:00
Evan Lezar	d77f46aa09	Create ld.so.conf file with permissions 644 By default, temporary files are created with permissions 600 and this means that the files created when updating the ldcache are not readable in non-root containers. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-05-02 12:51:27 +02:00
Carlos Eduardo Arango Gutierrez	81d8b94cdc	Export pkg config/engine Signed-off-by: Carlos Eduardo Arango Gutierrez <eduardoa@nvidia.com>	2023-04-25 07:16:59 +02:00
Evan Lezar	70920d7a04	Add support for containerd to the runtime configure CLI Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-04-24 18:32:28 +02:00
Evan Lezar	f1e201d368	Refactor runtime configure cli Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-04-24 18:32:04 +02:00
Evan Lezar	29c6288128	Only update ldcache if it exists Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-04-13 17:18:09 +02:00
Evan Lezar	f6983969ad	Merge branch 'nvidia-ctk-cdi-transform' into 'main' Add 'target-driver-root' option to 'nvidia-ctk cdi generate' to transform root... See merge request nvidia/container-toolkit/container-toolkit!363	2023-03-28 20:05:12 +00:00
Evan Lezar	7f7fc35843	Move input and output to transform root subcommand Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-03-28 21:12:48 +02:00
Evan Lezar	f27c33b45f	Remove target-driver-root from generate Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-03-28 11:49:45 -07:00
Evan Lezar	6a83e2ebe5	Add nvidia-ctk cdi transform root command Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-03-28 11:45:58 -07:00
Evan Lezar	e774c51c97	Add nvidia-ctk system create-device-nodes command This change adds an nvidia-ctk system create-device-nodes command for creating NVIDIA device nodes. Currently this is limited to control devices (nvidia-uvm, nvidia-uvm-tools, nvidia-modeset, nvidiactl). A --dry-run mode is included for outputing commands that would be executed and the driver root can be specified. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-03-28 11:29:45 +02:00
Christopher Desiniotis	7f5c9abc1e	Add ability to configure CDI kind with 'nvidia-ctk cdi generate' Signed-off-by: Christopher Desiniotis <cdesiniotis@nvidia.com>	2023-03-27 23:12:00 -07:00
Christopher Desiniotis	92d82ceaee	Add 'target-driver-root' option to 'nvidia-ctk cdi generate' to transform root paths in generated spec Signed-off-by: Christopher Desiniotis <cdesiniotis@nvidia.com>	2023-03-27 22:22:36 -07:00
Evan Lezar	226c54613e	Also return an error from nvcdi.New This change allows nvcdi.New to return an error in addition to the constructed library instead of panicing. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-03-26 16:13:12 +02:00
Evan Lezar	685802b1ce	Only init nvml as required when generating CDI specs CDI generation modes such as management and wsl don't require NVML. This change removes the top-level instantiation of nvmllib and replaces it with an instanitation in the nvml CDI spec generation code. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-03-20 14:24:08 +02:00
Evan Lezar	3a11f6ee0a	Add nvidia-container-runtime-hook.skip-mode-detection option to config Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-03-09 20:15:40 +02:00
Evan Lezar	936fad1d04	Move check for privileged images to config/image/ package Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-03-09 11:15:53 +02:00
Evan Lezar	3bac4fad09	Migrate cri-o config update to engine.Interface Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-03-07 20:59:54 +02:00
Evan Lezar	9fff19da23	Migrate docker config to engine.Interface Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-03-07 20:59:54 +02:00
Evan Lezar	e5bb4d2718	Move runtime config code from config to config/engine Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-03-07 20:59:54 +02:00
Evan Lezar	cb5006c73f	Merge branch 'CNT-3897/generate-management-container-spec' into 'main' Generate CDI specs for management containers See merge request nvidia/container-toolkit/container-toolkit!314	2023-03-06 16:23:13 +00:00
Evan Lezar	20d3bb189b	Rename --discovery-mode to --mode Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-03-06 11:00:22 +02:00
Evan Lezar	f7e817cff6	Support management mode in nvidia-ctk cdi generate Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-03-06 10:53:43 +02:00
Evan Lezar	314059fcf0	Move path manipulation to spec.Save Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-03-01 13:49:04 +02:00
Evan Lezar	221781bd0b	Use full path for output spec Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-03-01 13:48:28 +02:00
Evan Lezar	8be6de177f	Move formatJSON and formatYAML to nvcdi/spec package Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-03-01 13:48:28 +02:00
Evan Lezar	890a519121	Use nvcdi.spec package to write and validate spec Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-03-01 13:48:28 +02:00
Evan Lezar	89321edae6	Add top-level GetSpec function to nvcdi API Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-03-01 13:48:28 +02:00
Evan Lezar	accba4ead5	Merge branch 'CNT-3965/clean-up-by-path-symlinks' into 'main' Improve handling of /dev/dri devices and nested device paths See merge request nvidia/container-toolkit/container-toolkit!307	2023-03-01 10:25:48 +00:00
Evan Lezar	b4dc1f338d	Generate nested device folder permission hooks per device This change generates device folder permission hooks per device instead of at a spec level. This ensures that the hook is not injected for a device that does not have any nested device nodes. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-02-22 17:16:23 +02:00
Evan Lezar	2542224d7b	Skip paths with errors in chmod hook Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-02-21 11:47:11 +02:00
Evan Lezar	2680c45811	Add mode constants to nvcdi Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-02-20 16:33:51 +02:00
Evan Lezar	4ccb0b9a53	Add and resolve auto discovery mode for cdi generation Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-02-20 14:49:58 +02:00
Evan Lezar	b21dc929ef	Add WSL2 discovery and spec generation These changes add a wsl discovery mode to the nvidia-ctk cdi generate command. If wsl mode is enabled, the driver store for the available devices is used as the source for discovered entities. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-02-20 10:30:13 +02:00
Evan Lezar	20d6e9af04	Add --discovery-mode to nvidia-ctk cdi generate command This change adds --discovery-mode flag to the nvidia-ctk cdi generate command and plumbs this through to the CDI API. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-02-20 10:30:13 +02:00
Evan Lezar	a844749791	Ensure that generate uses a consistent nvidia-ctk path Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-02-20 10:28:45 +02:00
Evan Lezar	5b110fba2d	Add nvcdi package with basic CDI generation API This change adds an nvcdi package that exposes a basic API for CDI spec generation. This is used from the nvidia-ctk cdi generate command and can be consumed by DRA implementations and the device plugin. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-02-14 19:52:31 +01:00
Evan Lezar	fdc759f7c2	Add nvidia-container-runtime.legacy executable Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-02-13 16:09:46 +01:00
Evan Lezar	43448bac11	Add nvidia-container-runtime.cdi executable This change adds an nvidia-container-runtime.cdi executable that overrides the runtime mode from the config to "cdi". Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-02-13 16:09:46 +01:00
Evan Lezar	406a5ec76f	Implement runtime package for creating runtime CLI Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-02-13 16:09:46 +01:00
Evan Lezar	f71c419cfb	Move modifying OCI runtime wrapper to oci package Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-02-13 16:09:46 +01:00
Evan Lezar	97008f2db6	Move IPC discoverer into DriverDiscoverer This simplifies the construction of the required common edits when constructing a CDI specification. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-02-08 09:06:07 +01:00
Evan Lezar	076eed7eb4	Update ipcMount to add noexec option Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-02-08 09:06:07 +01:00
Evan Lezar	3b8c40c3e6	Move IPC discoverer to internal/discover package Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-02-08 09:06:07 +01:00
Evan Lezar	daceac9117	Rename discover.Config.Root to discover.Config.DriverRoot Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-02-02 15:57:15 +01:00
Evan Lezar	cfa2647260	Rename root to driverRoot for CDI generation This makes the intent of the command line argument clearer since this relates specifically to the root where the NVIDIA driver is installed. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-02-02 15:42:04 +01:00
Evan Lezar	707e3479f8	Fix lint errors Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-01-30 13:39:57 +01:00
Evan Lezar	201232dae3	Add logging of minimum CDI version Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-01-30 13:39:08 +01:00
Evan Lezar	f768bb5783	Use device index as CDI device names by default This change uses the `index` mode for the --device-name-strategy when generating CDI specifications by default. This generates device names such as nvidia.com/gpu=0 or nvidia.com/gpu=1:0 by default. Note that this requires a CDI spec version of 0.5.0 and for consumers (e.g. podman) that are only compatible with older versions one of the other stragegies (`type-index` or `uuid`) should be used instead to generate a v0.3.0 or v0.4.0 specification. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-01-30 13:36:17 +01:00
Evan Lezar	f0de3ccd9c	Merge branch 'CNT-3718/allow-device-name-to-be-controlled' into 'main' Add --device-name-strategy flag for CDI spec generation See merge request nvidia/container-toolkit/container-toolkit!269	2023-01-30 12:28:38 +00:00
Evan Lezar	8188400c97	Move create-dev-char-symlinks subcommand from hook to system Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-01-27 12:12:54 +01:00
Evan Lezar	962d38e9dd	Add nvidia-ctk system subcommand Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-01-27 12:12:54 +01:00
Evan Lezar	1d7e419008	Add --create-all mode to creation of dev/char symlinks This change adds a --create-all mode to the create-dev-char-symlinks hook. This mode creates all POSSIBLE symlinks to device nodes for regular and cap devices. With the number of GPUs inferred from the PCI device information. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-01-25 13:43:43 +01:00
Evan Lezar	f9330a4c2c	Add --watch option to create-dev-char-symlinks This change adds a --watch option to the create-dev-char-symlinks hook. This installs an fsnotify watcher that creates symlinks for ADDED device nodes under /dev/char. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-01-25 13:43:43 +01:00
Evan Lezar	be0e4667a5	Add create-dev-char-symlinks hook This change adds an nvidia-ctk hook create-dev-char-symlinks subcommand that creates symlinks to device nodes (as required by systemd) under /dev/char. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-01-25 13:43:43 +01:00
Evan Lezar	408eeae70f	Allow locator to be marked as optional Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-01-25 10:38:11 +01:00
Evan Lezar	89bf81a9db	Add --device-name-strategy flag for CDI spec generation This change adds a --device-name-strategy flag for generating a CDI specificaion. This allows a CDI spec to be generated with the following names used for device: * type-index: gpu0 and mig0:1 * index: 0 and 0:1 * uuid: GPU and MIG UUIDs Note that the use of 'index' generates a v0.5.0 CDI specification since this relaxes the restriction on the device names. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-01-20 16:17:32 +01:00
Evan Lezar	7a1cfb48b9	Merge branch 'update-cdi' into 'main' Determine the minumum required spec version See merge request nvidia/container-toolkit/container-toolkit!265	2023-01-19 11:57:19 +00:00

1 2 3 4 5 ...

282 Commits