Commit Graph

  • fd859bed57
    Merge branch 'allegroai:master' into master pollfly 2021-09-14 10:31:28 +0300
  • f6f043d1ca Version bump to v1.1.0 1.1.0 allegroai 2021-09-13 15:25:25 +0300
  • db57441c5d Fix sensitive environment variable values are not masked in "executing docker" printout (issue #67) allegroai 2021-09-13 14:00:11 +0300
  • 31d90be0a1 Fix package manager config documentation (issue #78) allegroai 2021-09-10 13:11:39 +0300
  • 5a080798cb Add support for overriding initial server connection behavior using the CLEARML_AGENT_INITIAL_CONNECT_RETRY_OVERRIDE env var (defaults to true, allows boolean value or an explicit number specifying the number of connect retries) allegroai 2021-08-27 19:14:53 +0300
  • 21c4857795
    Fix doctring typo (#75) pollfly 2021-08-22 08:19:55 +0300
  • c0e2a1c7e9 edit doctring typo Revital 2021-08-22 07:52:40 +0300
  • 4149afa896 Add agent.docker_internal_mounts to control containers internal mounts (non-root containers) allegroai 2021-08-21 16:03:37 +0300
  • b196ab5793 Do not overwrite PYTHONIOENCODING if defined allegroai 2021-08-20 00:37:21 +0300
  • b39b54bbaf Add poetry cache into docker mapping (issue #74) allegroai 2021-08-13 11:02:21 +0300
  • 26d76f52ac Fix venv cache cannot reinstall package from git with http credentials allegroai 2021-08-13 11:00:54 +0300
  • 2fff28845d Fix support for unicode standalone scripts, changing default 'ascii' encoding to UTF-8. allegroai 2021-08-12 13:39:11 +0300
  • 5e4c495d62 Add support for naming docker containers. Use agent.docker_container_name_format to configure the name format (disabled by default) (issue clearml/#412) Add missing entries in docs/clearml.conf allegroai 2021-08-12 13:38:26 +0300
  • 5c5802c089 Fix python package with git+git:// links or git+ssh:// conversion allegroai 2021-08-12 13:37:10 +0300
  • 06010ef1b7 Disable default demo server (still available with CLEARML_NO_DEFAULT_SERVER=0) allegroai 2021-08-12 13:36:49 +0300
  • bd411a1984 version bump allegroai 2021-08-05 19:23:23 +0300
  • 29d24e3eaa Update docker example for k8s glue allegroai 2021-08-05 19:22:56 +0300
  • 0fbbe774fa Fix support for "-r requirements.txt" in installed packages allegroai 2021-08-05 19:19:54 +0300
  • aede6f4bac Fix README allegroai 2021-08-03 11:30:27 +0300
  • 84706ba66d Add docker example for running the agent k8s glue as a pod in a k8s cluster allegroai 2021-08-03 11:23:33 +0300
  • 6b602889a5 Fix import loop allegroai 2021-08-03 01:28:08 +0300
  • cd046927f3 Add k8s glue update task status_message in hanging pods daemon Fix k8s glue not throwing error when failing to push to queue allegroai 2021-08-02 22:59:31 +0300
  • 5ed47d2d2c Add support for CLEARML_NO_DEFAULT_SERVER env var to prevent agent from using the demo server Add support for FORCE_CLEARML_AGENT_REPO env var to allow installing agent from a repo url when executing a task Implement skip venv installation on execute and allow custom binary Fix services mode limit implementation in docker mode allegroai 2021-08-02 22:51:26 +0300
  • fd068c0933 Add support for env vars containing bash-style string lists using shlex Add support for CLEARML_AGENT_SKIP_PIP_VENV_INSTALL env var to skip venv installation on execute and allow custom binary Add support for CLEARML_AGENT_VENV_CACHE_PATH env var to allow overriding venv cache folder configuration Add support for CLEARML_AGENT_EXTRA_DOCKER_ARGS env var to allow overriding extra docker args configuration allegroai 2021-08-02 22:38:36 +0300
  • 9456e493ac Enable rewriting SSH URLs Simon Gasse 2021-07-27 14:00:33 +0200
  • 3b08a73245
    Update README with artifacthub.io badge Jake Henning 2021-07-27 19:53:16 +0300
  • bb72899b66 Enable rewriting SSH URLs Simon Gasse 2021-07-27 14:00:33 +0200
  • 42606d9247 Fix multiple k8s glue instances with pod limits Version bump allegroai 2021-07-15 10:28:43 +0300
  • 499b3dfa66 Fix k8s glue, do not reset Task before re-enqueuing as it will remove runtime properties allegroai 2021-07-15 10:27:54 +0300
  • ca360b7d43 Improve max pod limit check allegroai 2021-07-15 10:26:49 +0300
  • 6470b16b70 Add k8s set task container if using default image/arguments allegroai 2021-07-15 10:26:09 +0300
  • 4c9410c5fe Fix auto mount SSH_AUTH_SOCK into docker (issue #45) 1.0.1rc4 allegroai 2021-07-11 09:43:57 +0300
  • 351f0657c3
    Update agent gif (#69) pollfly 2021-07-08 09:20:45 +0300
  • b1cf527522 switch gif in readme Revital 2021-07-06 14:09:45 +0300
  • 382604e923 Fix services mode killing child processes when running in services mode + venv allegroai 2021-06-30 23:58:25 +0300
  • f5f70d9650 update clearml-agent gif Revital 2021-06-30 16:43:34 +0300
  • b48f25a7f9
    Merge pull request #68 from pollfly/master Jake Henning 2021-06-29 11:04:52 +0300
  • b76e4fc02b Merge remote-tracking branch 'origin/master' Revital 2021-06-29 07:59:02 +0300
  • 27cf7dd67f add clearml_architecture picture Revital 2021-06-29 07:58:29 +0300
  • 05ec45352c
    Merge branch 'allegroai:master' into master pollfly 2021-06-29 07:37:10 +0300
  • 0e7546f248 Fix docker force pull in k8s glue _kubectl_apply() 1.0.1rc3 allegroai 2021-06-27 09:42:14 +0300
  • e3c8bd5666 Add support for agent.docker_force_pull configuration setting in k8s glue 1.0.1rc2 allegroai 2021-06-25 17:36:08 +0300
  • 3ae1741343 Fix k8s glue task container arguments not supported in kubectl_run command Fix k8s glue not passing required extra_docker_bash_script to string format allegroai 2021-06-25 17:35:01 +0300
  • 53c106c3af Fix k8s glue task container handling fails parsing docker image Fix k8s glue uses task container image arguments when no image is specified allegroai 2021-06-25 17:34:28 +0300
  • 44fc7dffe6 Fix key/secret usage printout allegroai 2021-06-24 19:37:59 +0300
  • aaa6b32f9f Fix support for "-r requirements.txt" inside "installed packages" allegroai 2021-06-24 19:26:35 +0300
  • 821a0c4a2b Fix parsing VCS links starting with "git+git@" (notice "git+git://" was already supported) allegroai 2021-06-24 19:25:41 +0300
  • 6373237960 switch allegro.ai link to clear.ml links Revital 2021-06-22 13:59:37 +0300
  • 1caf7b104f
    Merge branch 'allegroai:master' into master pollfly 2021-06-22 13:47:48 +0300
  • 176b4a4cde Fix --services-mode when the execute agent fails when starting to run with error code 0 allegroai 2021-06-16 18:32:29 +0300
  • 29bf993be7 Add printout when using key/secret from env vars allegroai 2021-06-02 21:15:48 +0300
  • eda597dea5 Version bump allegroai 2021-06-02 13:17:37 +0300
  • 8c56777125 Add CLEARML_AGENT_DISABLE_SSH_MOUNT allowing disabling the auto .ssh mount into the docker allegroai 2021-06-02 13:16:58 +0300
  • 7e90ebd5db Fix _dynamic_gpu_get_available worker timeout increase to 10 minutes allegroai 2021-06-02 13:16:17 +0300
  • 3a07bfe1d7 Version bump 1.0.1rc0 allegroai 2021-05-31 23:19:46 +0300
  • 0694b9e8af Fix PyYAML supported versions allegroai 2021-05-26 18:33:35 +0300
  • 742cbf5767 Add docker environment arguments log masking support (issue #67) allegroai 2021-05-25 19:31:45 +0300
  • e93384b99b Fix --stop with dynamic gpus allegroai 2021-05-20 10:58:46 +0300
  • 3c4e976093 Add agent.ignore_requested_python_version to config file allegroai 2021-05-19 15:20:44 +0300
  • 1e795beec8 Fix support for spaces in docker arguments (issue #358) allegroai 2021-05-19 15:20:03 +0300
  • 4f7407084d Fix standalone script with pre-exiting conda venv allegroai 2021-05-12 15:46:25 +0300
  • ae3d034531 Protect against None in execution.repository allegroai 2021-05-12 15:45:31 +0300
  • a2db1f5ab5 Remove queue name from pod name in k8s glue, add queue name and ID to pod labels (issue #64) allegroai 2021-05-05 11:58:37 +0300
  • cec6420c8f Version bump to v1.0.0 1.0.0 allegroai 2021-05-03 18:33:53 +0300
  • 4f18bb7ea0 Add k8s glue default restartPolicy=Never to template to prevent pods from restarting allegroai 2021-04-28 13:20:13 +0300
  • 3ec2a3a92e Add k8s pod limit to k8s glue example allegroai 2021-04-28 13:19:34 +0300
  • 823b67a3ce Deprecate venv_update (replaced by the more robust venvs_cache) allegroai 2021-04-28 13:17:37 +0300
  • 24dc59e31f add space to help message Revital 2021-04-22 14:14:38 +0300
  • 08ff5e6db7 Add number of pods limit to k8s glue allegroai 2021-04-25 10:47:49 +0300
  • e60a6f9d14 Fix --stop support for dynamic gpus allegroai 2021-04-25 10:46:43 +0300
  • 161656d9e4 add space to help message Revital 2021-04-22 14:14:38 +0300
  • 8569c02b33
    Merge pull request #58 from pollfly/master Allegro AI 2021-04-21 15:27:47 +0300
  • 35e714d8d9 fix --downtime help Revital 2021-04-21 09:13:47 +0300
  • 6f8d5710d6 Fix dynamic gpus priority queue allegroai 2021-04-20 18:11:59 +0300
  • a671692832 Fix --services-mode with instance limit allegroai 2021-04-20 18:11:36 +0300
  • 5c8675e43a Add support for dynamic gpus opportunistic scheduling (with min/max gpus per queue) allegroai 2021-04-20 18:11:16 +0300
  • 60a58f6fad Fix poetry support (issue #57) allegroai 2021-04-14 11:22:07 +0300
  • 948fc4c6ce Add python 3.9 to the support table allegroai 2021-04-12 23:01:40 +0300
  • 5be5f3209d Fix documentation links allegroai 2021-04-12 23:01:22 +0300
  • 537b67e0cd Fix agent can return non-zero error code and pods will end up restarting forever (issue #56) allegroai 2021-04-12 23:00:59 +0300
  • 82c5e55fe4 Fix usage of not_set in k8s template merge allegroai 2021-04-07 21:30:13 +0300
  • 5f0d51d485 Add documentation for agent.docker_install_opencv_libs allegroai 2021-04-07 18:48:30 +0300
  • 945dd816ad Fix no docker arguments allegroai 2021-04-07 18:47:13 +0300
  • 45009e6cc2 Add support for updating back docker on new API v2.13 allegroai 2021-04-07 18:46:58 +0300
  • 8eace6d57b Bump virtualenv dependency version allegroai 2021-04-07 18:46:35 +0300
  • 3774fa6abd Add support for new container base setup script feature allegroai 2021-04-07 18:46:14 +0300
  • e71e6865d2 Add agent.docker_install_opencv_libs (default: True) to enable auto opencv libs install for faster docker spin-up allegroai 2021-04-07 18:45:44 +0300
  • 0e8f1528b1 Remove redundant py2 code allegroai 2021-04-07 18:44:59 +0300
  • c331babf51 Add stopping message on Task process termination Fix --stop on dynamic gpus venv mode allegroai 2021-04-07 18:44:33 +0300
  • c59d268995 Fix venv cache crash on bad symbolic links allegroai 2021-04-07 18:44:11 +0300
  • 9e9fcb0ba9 Add dynamic mode terminate dockers on sig_term allegroai 2021-04-07 18:43:44 +0300
  • f33e0b2f78 Verify docker command exists when running in docker mode allegroai 2021-04-07 18:42:27 +0300
  • 0e4b99351f Add --stop support for dynamic gpus Fix --stop mark tasks as aborted (not failed as before) allegroai 2021-04-07 18:42:10 +0300
  • 81edd2860f Fix --dynamic-gpus should keep original queue priority order allegroai 2021-03-31 23:55:12 +0300
  • 14ac584577 Support k8s glue container env vars merging allegroai 2021-03-31 23:53:58 +0300
  • 9ce6baf074 Fix broken k8s glue docker args parsing Fix empty env prevents override when merging template allegroai 2021-03-26 12:26:15 +0300
  • 92a1e07b33 Fix local path replace back when using cache allegroai 2021-03-26 12:16:05 +0300
  • cb6bdece39 Fix cuda version from driver does not return minor version allegroai 2021-03-18 10:07:20 +0200
  • 2ea38364bb Change the default conda channel order, so it pulls the correct pytorch allegroai 2021-03-18 10:03:36 +0200
  • cf6fdc0d81 Add support for PyJWT v2 allegroai 2021-03-18 10:02:50 +0200