allegroai
da19ef26c4
Fix pinging running task (and change default to once a minute)
2022-12-13 15:26:26 +02:00
allegroai
f69e16ea9d
Fix clearml-agent build --docker stuck on certain containers
2022-12-13 15:24:32 +02:00
allegroai
b2da639582
Add CLEARML_AGENT_FORCE_SYSTEM_SITE_PACKAGES env var (default true) to allow overriding default "system_site_packages: true" behavior when running tasks in containers (docker mode and k8s-glue)
2022-12-10 20:00:46 +02:00
allegroai
ca2791c65e
Fix pip support allowing multiple pip version constraints (by default, one for <PY3.10 and one for >=PY3.10)
2022-12-07 22:09:25 +02:00
allegroai
5d517c91b5
Add agent.disable_task_docker_override configuration option to disable docker override specified in executing tasks
2022-12-07 22:07:11 +02:00
allegroai
6be75abc86
Add default output URI selection to "clearml-agent init"
2022-12-07 22:06:10 +02:00
allegroai
57cde21c48
Send task.ping for executing tasks every 120 seconds (set using the agent.task_ping_interval_sec configuration option)
2022-12-05 11:22:25 +02:00
allegroai
396abf13b6
Fix get_task_session() may cause an old copy of the APIClient to be used containing a reference to the previous session
2022-12-05 11:20:32 +02:00
allegroai
6e7fb5f331
Fix sending task logs fails when agent is not running in the same tenant
2022-12-05 11:19:14 +02:00
allegroai
76c533a2e8
Fix access to config object
2022-11-11 13:34:17 +02:00
Niels ten Boom
9eee213683
Add option to crash agent on exception using agent.crash_on_exception configuration setting ( #123 )
2022-11-06 17:15:39 +02:00
allegroai
d2f3614ab0
Add support for CLEARML_AGENT_DOCKER_ARGS_HIDE_ENV environment variable (see agent.hide_docker_command_env_vars config option)
2022-10-23 13:04:00 +03:00
allegroai
70771b12a9
Remove unused code
2022-10-23 13:04:00 +03:00
allegroai
2ef5d38b32
Remove future (Python 2 is not supported for clearml-agent)
2022-10-23 13:03:59 +03:00
allegroai
0de10345f7
Moved pyhocon to internal packages
2022-10-23 13:03:59 +03:00
allegroai
a243fa211f
Improve venv cache disabled message
2022-10-23 13:03:59 +03:00
allegroai
f0fd62a28f
Fix docker extra args showing up in configuration printout
2022-10-23 13:03:59 +03:00
allegroai
e61accefb9
PEP8 + refactor
2022-10-07 15:26:31 +03:00
allegroai
5c1543d112
Add agent.disable_ssh_mount configuration option (same as CLEARML_AGENT_DISABLE_SSH_MOUNT env var)
2022-10-07 15:24:39 +03:00
allegroai
7ff6aee20c
Add warning if venv cache is disabled
2022-10-07 15:23:10 +03:00
allegroai
37ea381d98
Add support for docker args filters
2022-10-07 15:22:42 +03:00
allegroai
1e3646b57c
Fix docker command for monitoring child agents
2022-10-07 15:21:32 +03:00
allegroai
a69766bd8b
Add CLEARML_AGENT_CHILD_AGENTS_COUNT_CMD to allow overriding child agent count command in k8s
2022-09-15 20:16:01 +03:00
allegroai
857a750eb1
Fix GCP load balancer not forwarding GET request body; allow changing the default request action to Put/Post/Get (see api.http.default_method or CLEARML_API_DEFAULT_REQ_METHOD)
2022-09-15 20:15:42 +03:00
allegroai
8b4f1eefc2
Add more debug printouts in k8s glue
2022-09-02 23:49:28 +03:00
allegroai
918dd39b87
Add docker ssh_ro_folder (default: "/.ssh") changed docker ssh_folder (default: "~/.ssh")
2022-09-02 23:49:27 +03:00
allegroai
7776e906c4
Fix second .ssh temp mount fails if container changes the files inside
2022-09-02 23:49:27 +03:00
allegroai
1bf865ec08
Fix name not escaped as regex (all services "get_all" use regex for name)
2022-09-02 23:49:27 +03:00
allegroai
9006c2d28f
Add support for abort callback registration
2022-08-29 18:06:59 +03:00
allegroai
ec9d027678
Add support for MIG devices, use 0:1 for GPU 0 slice 1 (or use 0.1)
2022-08-01 18:58:42 +03:00
allegroai
48a145a8bd
Fix messages
2022-08-01 18:57:36 +03:00
allegroai
12a8872b27
Fix Python 3.10+ support
2022-08-01 18:56:37 +03:00
allegroai
c08e2ac0bb
Fix clearml.conf access in non-root containers
2022-05-05 12:23:11 +03:00
allegroai
5f77cad5ac
Fix error message
2022-04-27 15:36:39 +03:00
allegroai
0228ae0494
Set environment variables before expanding path
2022-04-27 15:14:16 +03:00
allegroai
2e5298b737
Add support for use-owner-token in k8s glue
2022-04-27 14:59:27 +03:00
allegroai
2466eed23f
Fix dynamic GPUs with "all" GPUs on the same worker
2022-04-20 08:56:22 +03:00
allegroai
f50ba005b5
Protect dynamic GPUs from failing to parse worker GPU index
2022-04-13 10:01:50 +03:00
allegroai
1011544533
Fix copy breaks agent and nulls the worker name
2022-04-13 10:01:12 +03:00
allegroai
9c7e2aacd0
Fix PYTHONPATH is overwritten when executing a task (append to it instead)
2022-04-09 14:17:49 +03:00
allegroai
b94ec85461
Fix update should run with -y
2022-03-30 20:48:11 +03:00
allegroai
c87da3a079
Fix apt-get update fail causes apt-get install to not be executed
2022-03-30 20:10:57 +03:00
allegroai
c3590a53a8
Fix CLEARML_AGENT_SKIP_PIP_VENV_INSTALL fails to find python executable
2022-03-30 20:10:08 +03:00
allegroai
c901bd331c
Fix git packages are installed even if commit is given and is preinstalled when using cached virtual environment
2022-03-28 18:11:46 +03:00
allegroai
df97f170a2
Fix clearml-agent init
...
Use app.clear.ml as default server
Add git token references
2022-03-24 22:08:06 +02:00
allegroai
2432f5bb68
Add CLEARML_AGENT_PROPAGATE_EXITCODE, set to 1 to let clearml-agent execute return a nonzero exit code on failure (notice by default we keep the return code 0, the exception is the k8s glue with non-restarting Pods, where users would want to get visibility into failing Tasks, do not use unless you know what to expect from k8s)
2022-03-24 22:04:25 +02:00
allegroai
341086d86a
Fix vcs packages are reinstalled when same commit version is already installed
2022-03-24 22:03:25 +02:00
allegroai
1163c96438
Add agent.package_manager.force_original_requirements allowing to only use the "org_pip" coming from dev execution (using this prevents editing the installed packages from the UI)
2022-03-24 22:00:33 +02:00
allegroai
4c120d7cd0
Add ability to override container LOCAL_PYTHON, add auto python support (max 3.15)
2022-03-24 21:58:07 +02:00
allegroai
531e514003
Add custom build script support
...
Add extra configurations when starting daemon
Propagate token to docker in case credentials are not available
2022-03-15 10:04:25 +02:00
allegroai
2cd9e706c8
Fix user-provided " is unnecessarily replaced to \\"
2022-03-15 10:02:28 +02:00
pollfly
36073ad488
Fix links ( #100 )
2022-02-17 12:04:11 +02:00
allegroai
14c48d0a78
Fix FORCE_LOCAL_CLEARML_AGENT_WHEEL when running from a Windows host
2022-02-09 18:28:17 +02:00
allegroai
1f53c4fd1b
Fix agent fails to check out code from main branch when branch/commit is not explicitly specified
2022-02-07 20:04:08 +02:00
allegroai
bb64e4a850
Fix hide_docker_command_env_vars mode to include URL passwords and handle env vars containing docker commands
2022-02-02 16:30:34 +02:00
pollfly
d39e30995a
Fix links ( #93 )
2022-01-27 12:15:36 +02:00
allegroai
fa1307e62c
Add agent.poetry_version to specify poetry version (and force installation of poetry if missing)
2022-01-23 10:40:05 +02:00
allegroai
5afb604e3d
Fix default_python set to None
2022-01-07 15:12:27 +02:00
allegroai
b3e8be6296
Add agent.force_git_root_python_path configuration setting to force adding the git repository root folder to the PYTHONPATH (if set, the working directory is not added to the PYTHONPATH)
2022-01-07 15:11:59 +02:00
allegroai
938fcc4530
Add build --force-docker command line argument to allow ignoring task container data
2021-12-29 13:21:25 +02:00
allegroai
f41ed09dc1
Add support for custom docker image resolving
2021-12-21 14:29:43 +02:00
allegroai
cd4a39d8fc
Fix config example
2021-10-21 12:03:07 +03:00
allegroai
92e3f00435
Add support for truncating task log file after reporting to server
2021-10-21 12:02:31 +03:00
allegroai
bed94ee431
Add support for configuration env and files section
2021-10-19 10:46:43 +03:00
allegroai
175e99b12b
Fix if queue tag default does not exist and --queue not specified, try queue name "default"
2021-10-16 23:21:45 +03:00
allegroai
2a941e3abf
Fix --stop checking default queue tag (issue #80 )
2021-10-16 23:21:12 +03:00
pollfly
e17246d8ea
Fix docstring typos ( #79 )
...
* edit docstring typo
* fix typos
2021-09-14 18:42:18 +03:00
allegroai
db57441c5d
Fix sensitive environment variable values are not masked in "executing docker" printout (issue #67 )
2021-09-13 14:00:11 +03:00
pollfly
21c4857795
Fix docstring typo ( #75 )
2021-08-22 08:19:55 +03:00
allegroai
4149afa896
Add agent.docker_internal_mounts to control containers internal mounts (non-root containers)
2021-08-21 16:03:37 +03:00
allegroai
b196ab5793
Do not overwrite PYTHONIOENCODING if defined
2021-08-20 00:37:21 +03:00
allegroai
b39b54bbaf
Add poetry cache into docker mapping (issue #74 )
2021-08-13 11:02:21 +03:00
allegroai
2fff28845d
Fix support for unicode standalone scripts, changing default 'ascii' encoding to UTF-8.
2021-08-12 13:39:11 +03:00
allegroai
5e4c495d62
Add support for naming docker containers. Use agent.docker_container_name_format to configure the name format (disabled by default) (issue clearml/#412)
...
Add missing entries in docs/clearml.conf
2021-08-12 13:38:26 +03:00
allegroai
06010ef1b7
Disable default demo server (still available with CLEARML_NO_DEFAULT_SERVER=0)
2021-08-12 13:36:49 +03:00
allegroai
5ed47d2d2c
Add support for CLEARML_NO_DEFAULT_SERVER env var to prevent agent from using the demo server
...
Add support for FORCE_CLEARML_AGENT_REPO env var to allow installing agent from a repo url when executing a task
Implement skip venv installation on execute and allow custom binary
Fix services mode limit implementation in docker mode
2021-08-02 22:51:26 +03:00
allegroai
4c9410c5fe
Fix auto mount SSH_AUTH_SOCK into docker (issue #45 )
2021-07-11 09:44:49 +03:00
allegroai
382604e923
Fix services mode killing child processes when running in services mode + venv
2021-06-30 23:58:25 +03:00
allegroai
aaa6b32f9f
Fix support for "-r requirements.txt" inside "installed packages"
2021-06-24 19:26:35 +03:00
allegroai
176b4a4cde
Fix --services-mode when the execute agent fails when starting to run with error code 0
2021-06-16 18:32:29 +03:00
allegroai
8c56777125
Add CLEARML_AGENT_DISABLE_SSH_MOUNT allowing disabling the auto .ssh mount into the docker
2021-06-02 13:16:58 +03:00
allegroai
7e90ebd5db
Fix _dynamic_gpu_get_available worker timeout increase to 10 minutes
2021-06-02 13:16:17 +03:00
allegroai
742cbf5767
Add docker environment arguments log masking support (issue #67 )
2021-05-25 19:31:45 +03:00
allegroai
e93384b99b
Fix --stop with dynamic gpus
2021-05-20 10:58:46 +03:00
allegroai
1e795beec8
Fix support for spaces in docker arguments (issue #358 )
2021-05-19 15:20:03 +03:00
allegroai
4f7407084d
Fix standalone script with pre-existing conda venv
2021-05-12 15:46:25 +03:00
allegroai
e60a6f9d14
Fix --stop support for dynamic gpus
2021-04-25 10:46:43 +03:00
allegroai
6f8d5710d6
Fix dynamic gpus priority queue
2021-04-20 18:11:59 +03:00
allegroai
a671692832
Fix --services-mode with instance limit
2021-04-20 18:11:36 +03:00
allegroai
5c8675e43a
Add support for dynamic gpus opportunistic scheduling (with min/max gpus per queue)
2021-04-20 18:11:16 +03:00
allegroai
60a58f6fad
Fix poetry support (issue #57 )
2021-04-14 11:22:07 +03:00
allegroai
945dd816ad
Fix no docker arguments
2021-04-07 18:47:13 +03:00
allegroai
45009e6cc2
Add support for updating back docker on new API v2.13
2021-04-07 18:46:58 +03:00
allegroai
3774fa6abd
Add support for new container base setup script feature
2021-04-07 18:46:14 +03:00
allegroai
e71e6865d2
Add agent.docker_install_opencv_libs (default: True) to enable auto opencv libs install for faster docker spin-up
2021-04-07 18:45:44 +03:00
allegroai
c331babf51
Add stopping message on Task process termination
...
Fix --stop on dynamic gpus venv mode
2021-04-07 18:44:33 +03:00
allegroai
9e9fcb0ba9
Add dynamic mode terminate dockers on sig_term
2021-04-07 18:43:44 +03:00
allegroai
f33e0b2f78
Verify docker command exists when running in docker mode
2021-04-07 18:42:27 +03:00
allegroai
0e4b99351f
Add --stop support for dynamic gpus
...
Fix --stop mark tasks as aborted (not failed as before)
2021-04-07 18:42:10 +03:00
allegroai
81edd2860f
Fix --dynamic-gpus should keep original queue priority order
2021-03-31 23:55:12 +03:00