Commit Graph

  • a6ae6b2095 Add initial support for --break-system-packages version detection, but the reality is that we do not need it, because by the time we are running it is too late, so we do rm /usr/lib/python3.*/EXTERNALLY-MANAGED master clearml 2025-01-26 23:07:09 +0200
  • 369b440b96 Reduce required packages clearml 2025-01-26 23:05:57 +0200
  • 28e9280a4f Reduce required packages clearml 2025-01-26 23:03:16 +0200
  • 7e9e3ad08b Add printout when using custom configuration file clearml 2025-01-26 22:51:09 +0200
  • 44709673f4 Add CLEARML_AGENT_CONFIG_VERBOSE for verbose configuration file loading clearml 2025-01-26 22:50:49 +0200
  • 823c92ba0e Add SECURITY.md clearml 2025-01-26 22:49:14 +0200
  • d98002091d
    Merge f640eb6707 into 4158146420 mads-oestergaard 2025-01-20 00:56:37 +0000
  • 4158146420 Version bump to v1.9.3 v1.9.3 clearml 2025-01-19 16:17:56 +0200
  • b9ef1a55cd Fix dependency on windows clearml 2025-01-19 16:16:54 +0200
  • 9fa8d72640 Update github repo link clearml 2025-01-13 18:36:16 +0200
  • e535390815 Add win32file dependency on windows clearml 2025-01-13 18:34:48 +0200
  • 72482ca394
    Merge f7d4e577ac into 91dfa09466 Emir İşman 2025-01-08 15:35:31 -0500
  • 91dfa09466 Fix Python 3.13 support v1.9.3rc2 clearml 2025-01-05 12:14:24 +0200
  • f110bbf5b4 Remove Python 3.5 support clearml 2025-01-05 12:13:57 +0200
  • 070919973b Fix python 3.6 compatibility, no := operator clearml 2025-01-05 12:13:21 +0200
  • 47d35ef48f Fix managed python environment inside container (PEP 668) remove usr/lib/python3.*/EXTERNALLY-MANAGED v1.9.3rc1 clearml 2024-12-26 18:59:42 +0200
  • 54ed234fca Add agent.docker_args_filters to configuration docs clearml 2024-12-26 18:58:58 +0200
  • a26860e79f Fix default value handling in merge_dicts() clearml 2024-12-26 18:58:24 +0200
  • fc1abbab0b Refactor k8s glue clearml 2024-12-26 18:58:00 +0200
  • 4fa61dde1f Support ignoring kubectl errors clearml 2024-12-12 23:41:31 +0200
  • 26d748a4d8 Support creating queue with tags clearml 2024-12-12 23:40:57 +0200
  • 5419fd84ae Add support for Python 3.13 clearml 2024-12-12 23:39:11 +0200
  • d8366dedc6 Fix UV priority Fix UV cache is disabled, UV handles its own cache Fix UV freeze Fix make sure we do not use pip cache if poetry/uv is used (even if we reverted to pip we can't know if someone changed the repository and now in a new version, a lock file exists) clearml 2024-12-12 23:38:42 +0200
  • cc656e2969
    Add support for uv as package manager (#218) mads-oestergaard 2024-11-27 12:44:55 +0100
  • b65e5fed94 Scan more Python 3 versions clearml 2024-11-17 13:55:51 +0200
  • 16a6701d2a rename to sync_extra_args and set UV_CACHE_DIR Mads Oestergaard 2024-11-04 10:05:56 +0100
  • 3d06b410c4 Update configs to highlight sync command Mads Oestergaard 2024-11-04 10:05:39 +0100
  • fb7011fe13 update environ Mads Oestergaard 2024-10-31 11:21:35 +0100
  • dbea22d725 update worker and defs Mads Oestergaard 2024-10-31 11:20:35 +0100
  • 536911b3a8 update configs Mads Oestergaard 2024-10-31 11:20:27 +0100
  • 11bf855547 add uv as a package manager Mads Oestergaard 2024-10-31 11:20:09 +0100
  • 3273f76b46 Version bump to v1.9.2 v1.9.2 clearml 2024-10-28 18:33:04 +0200
  • 9af0f9fe41 Fix reload method is found in the config object clearml 2024-10-28 18:12:22 +0200
  • 205cd47cb9 Fix use req_token_expiration_sec when creating a task session and not the default value clearml 2024-10-28 18:11:42 +0200
  • 0ff428bb96 Fix report index not advancing in resource monitoring causes more than one GPU not to be reported clearml 2024-10-28 18:11:00 +0200
  • f7d4e577ac
    Update requests range to allow <2.33.0 Emir Isman 2024-10-21 15:55:12 +0200
  • bf8d9c96e9
    Handle OSError when checking for is_file (#215) Matteo Destro 2024-10-13 09:08:03 +0200
  • 0044514f72
    Handle OSError when checking for is_file Matteo Destro 2024-10-11 10:40:59 +0200
  • a88487ff25 Add support for pip legacy resolver for versions specified in the agent.package_manager.pip_legacy_resolver configuration option Add skip existing packages allegroai 2024-09-22 22:35:56 +0300
  • 785e22dc87
    Version bump to v1.9.1 v1.9.1 Jake Henning 2024-09-02 01:04:49 +0300
  • 6a2b778d53
    Add default pip version support for Python 3.12 Jake Henning 2024-09-02 01:03:52 +0300
  • b2c3702830 Version bump to v1.9.0 v1.9.0 allegroai 2024-08-28 23:18:26 +0300
  • 6302d43990 Add support for skipping container apt installs using CLEARML_AGENT_SKIP_CONTAINER_APT env var in k8s Add runtime callback support for setting runtime properties per task in k8s Fix remove task from pending queue and set to failed when kubectl apply fails allegroai 2024-08-27 23:01:27 +0300
  • 760bbca74e Fix failed Task in services mode logged "User aborted" instead of failed, add Task reason string allegroai 2024-08-27 22:56:37 +0300
  • e63fd31420 Fix string format allegroai 2024-08-27 22:55:49 +0300
  • 2ff9985db7 Add user ID to the vault loading print allegroai 2024-08-27 22:55:32 +0300
  • b8c762401b Fix use same state transition if supported by the server (instead of stopping the task before re-enqueue) allegroai 2024-08-27 22:54:45 +0300
  • 99e1e54f94 Add support for tasks containing only bash script or python module command allegroai 2024-08-27 22:53:14 +0300
  • a4d3b5bad6 Fix only set Task started status on node rank 0 allegroai 2024-08-27 22:52:31 +0300
  • b21665ed6e Fix do not cache venv cache if venv/python skip env var was set allegroai 2024-08-27 22:52:01 +0300
  • f877aa96e2
    Update Docker base image to Ubuntu 22.04 and Kubectl to 1.29.3 (#201) Surya Teja 2024-07-29 16:41:50 +0100
  • f99344d194
    Add queue priority info to CLI help (#211) pollfly 2024-07-29 18:40:38 +0300
  • d9f2a1999a Fix Only send pip freeze update on RANK 0, only update task status on exit on RANK 0 allegroai 2024-07-29 17:40:24 +0300
  • f866d43693 Add --order-fairness info revital 2024-07-28 09:22:44 +0300
  • 79d0abe707
    Add NO_DOCKER flag to clearml-agent-services entrypoint (#206) Valentin Schabschneider 2024-07-26 18:09:54 +0200
  • 6213ef4c02 Add /bin/bash -c "command" support. Task binary should be set to /bin/bash and entry_point should be set to -c command allegroai 2024-07-24 18:00:13 +0300
  • aef6aa9fc8 Fix a race condition where in rare conditions popping a Task from a queue that was aborted did not set it to started before the watchdog killed it. Does not happen in k8s/slurm allegroai 2024-07-24 17:59:46 +0300
  • 0bb267115b Add venvs_cache.path mount override for non-root containers (use: agent.docker_internal_mounts.venvs_cache) allegroai 2024-07-24 17:59:18 +0300
  • f89a92556f Fix check logger is not None allegroai 2024-07-24 17:55:02 +0300
  • 8ba4d75e80 Add CLEARML_TASK_ID and auth token to pod env vars in original entrypoint flow allegroai 2024-07-24 17:47:48 +0300
  • edc333ba5f Add K8S_GLUE_POD_USE_IMAGE_ENTRYPOINT to allow running images without overriding the entrypoint (useful for agents using prebuilt images in k8s) allegroai 2024-07-24 17:46:27 +0300
  • 2f0553b873 Fix CLEARML_MULTI_NODE_SINGLE_TASK should be read once not every reported line allegroai 2024-07-24 17:45:02 +0300
  • b2a4bf08ac Fix pass --docker only (i.e. no default container image) for --dynamic-gpus feature allegroai 2024-07-24 17:44:35 +0300
  • f18c6b809f Fix slurm multi-node rank detection allegroai 2024-07-24 17:44:05 +0300
  • cd5b4d2186 Add "-m module args" in script entry now supports standalone script, standalone script is converted to "untitled.py" by default or if specified in working_dir such as <dir>:<target_file> for example ".:standalone.py" allegroai 2024-07-24 17:43:21 +0300
  • 5f1bab6711 Add default docker match_rules for enterprise users, NOTICE: matching_rules are ignored if --docker container is passed in command line allegroai 2024-07-24 17:42:55 +0300
  • ab9b9db0c9 Add CLEARML_MULTI_NODE_SINGLE_TASK (values -1, 0, 1, 2) for easier multi-node singe Task workloads allegroai 2024-07-24 17:42:25 +0300
  • 93df021108 Add support for .ipynb script entry files (install nbconvert in runtime, copnvert to python and execute the python script), including CLEARML_AGENT_FORCE_TASK_INIT patching of ipynb files (post python conversion) allegroai 2024-07-24 17:41:59 +0300
  • 700ae85de0 Fix file mode should be optional in configuration files section allegroai 2024-07-24 17:41:06 +0300
  • f367c5a571 Fix git fetch did not update new tags #209 allegroai 2024-07-24 17:39:53 +0300
  • ebc5944b44 Fix setting tasks that someone just marked as aborted to started - only force Task to started after dequeuing it otherwise lease it as is allegroai 2024-07-24 17:39:26 +0300
  • 8f41002845 Add task.script.binary /bin/bash support Fix -m module $env to support parsing the $env before launching allegroai 2024-07-24 17:37:26 +0300
  • 7e8670d57f Find the correct python version when using a pre-installed python environment allegroai 2024-07-21 14:10:38 +0300
  • 77de343863 Use "venv" module if virtualenv is not supported allegroai 2024-07-19 13:18:07 +0300
  • f71f13629c Merge remote-tracking branch 'origin/master' revital 2024-07-03 13:33:54 +0300
  • 07beb13846 add queue priority comment revital 2024-07-03 13:33:11 +0300
  • 3e328d2461 Merge https://github.com/allegroai/clearml-agent revital 2024-07-03 13:20:19 +0300
  • 94a44c34c5 Add NO_DOCKER flag to clearml-agent-services entrypoint Valentin Schabschneider 2024-05-27 08:09:33 +0000
  • 6b31883e45 Fix queue resolution when no queue is passed v1.8.1 allegroai 2024-05-15 18:30:24 +0300
  • e48b4756fa Add Python 3.12 support allegroai 2024-05-15 18:25:29 +0300
  • 16dc08eebd Update Docker base image to Ubuntu 22.04 and Kubectl to 1.29.3 Surya Kasturi 2024-05-10 11:07:00 +0100
  • 101c7976b6
    Merge f9874252cb into 47147e3237 IlyaMescheryakov1402 2024-04-26 11:34:17 +0300
  • 47147e3237 Fix cached repositories were not passing user/token when pulling, agent.vcs_cache.clone_on_pull_fail now defaults to false allegroai 2024-04-19 23:50:17 +0300
  • 41fc4ec646 Fix disabling vcs cache should not add vcs mount point to container allegroai 2024-04-19 23:48:50 +0300
  • 441e5a73b2 Fix conda env should not be cached if installing into base conda or conda existing env exists allegroai 2024-04-19 23:48:10 +0300
  • 27ed6821c4 Add mirrorD config files to gitignore allegroai 2024-04-19 23:47:34 +0300
  • 10c6629982 Support skipping re-enqueue on suspected preempted k8s pods allegroai 2024-04-19 23:46:57 +0300
  • 6fb48a4c6e Revert version to v1.8.1 allegroai 2024-04-19 23:44:31 +0300
  • 105ade31f1 Version bump to v1.8.2 allegroai 2024-04-14 18:18:10 +0300
  • 502e266b6b Fix polling interval missing when not using daemon mode allegroai 2024-04-14 18:17:57 +0300
  • cd9a3b9f4e Version bump to v1.8.1 allegroai 2024-04-12 20:30:11 +0300
  • 4179ac5234 Fix git pulling on cached invalid git entry. On error, re-clone the entire repo again (disable using "agent.vcs_cache.clone_on_pull_fail: false") allegroai 2024-04-12 20:29:36 +0300
  • 98cc0d86ba
    Add option to set daemon polling interval (#197) Liron Ilouz 2024-04-03 14:33:52 +0300
  • 0c8748fd83 polling interval minimum value Liron 2024-04-03 12:46:29 +0300
  • 238290122d
    Merge branch 'allegroai:master' into master Liron Ilouz 2024-04-03 11:57:18 +0300
  • 293cbc0ac6 Version bump to v1.8.0 v1.8.0 allegroai 2024-04-02 16:38:22 +0300
  • 4387ed73b6 Fix None handling when no limits exist allegroai 2024-04-02 16:36:09 +0300
  • 05600504b6
    Merge branch 'allegroai:master' into master Liron Ilouz 2024-04-01 16:59:15 +0300
  • 43443ccf08 Pass task_id when resolving k8s template allegroai 2024-04-01 11:37:01 +0300
  • 3d43240c8f Improve conda package manager support Add agent.package_manager.use_conda_base_env (CLEARML_USE_CONDA_BASE_ENV) allowing to use base conda environment (instead of installing a new one) Fix conda support for python packages with markers and multiple specifications Added "nvidia" conda channel and support for cuda-toolkit >= 12 allegroai 2024-04-01 11:36:26 +0300