allegroai
dc5e0033c8
Remove support for kubectl run
...
Allow customizing pod name prefix and limit pod label
Return deleted pods from cleanup
Some refactoring
2022-12-05 11:40:19 +02:00
allegroai
3dd5973734
Filter by phase when detecting hanging pods
...
More debug print-outs
Use task session when possible
Push task into k8s scheduler queue only if running from the same tenant
Make sure we pass git_user/pass to the task pod
Fix cleanup command not issued when no pods exist in a multi-queue setup
2022-12-05 11:29:59 +02:00
allegroai
53d379205f
Support raise_error
in get_bash_output()
2022-12-05 11:26:40 +02:00
allegroai
57cde21c48
Send task.ping
for executing tasks every 120 seconds (set using the agent.task_ping_interval_sec
configuration option)
2022-12-05 11:22:25 +02:00
allegroai
396abf13b6
Fix get_task_session()
may cause an old copy of the APIClient
to be used containing a reference to the previous session
2022-12-05 11:20:32 +02:00
allegroai
6e7fb5f331
Fix sending task logs fails when agent is not running in the same tenant
2022-12-05 11:19:14 +02:00
allegroai
1d5c118b70
Fix setting CLEARML_API_DEFAULT_REQ_METHOD
raises an error
2022-12-05 11:18:12 +02:00
allegroai
18612aac4d
Improve configuration examples
2022-12-05 11:17:27 +02:00
allegroai
76c533a2e8
Fix access to config object
2022-11-11 13:34:17 +02:00
Niels ten Boom
9eee213683
Add option to crash agent on exception using agent.crash_on_exception
configuration setting ( #123 )
2022-11-06 17:15:39 +02:00
allegroai
e4861fc0fb
Add missing settings in clearml.conf
2022-11-06 12:36:01 +02:00
allegroai
53ef984065
Update README
2022-11-06 11:53:16 +02:00
allegroai
26e62da1a8
version bump to 1.5.0rc0
2022-10-23 13:04:00 +03:00
allegroai
d2f3614ab0
Add support for CLEARML_AGENT_DOCKER_ARGS_HIDE_ENV environment variable (see agent.hide_docker_command_env_vars
config option)
2022-10-23 13:04:00 +03:00
allegroai
c6d767bd64
Make venv caching the default behavior
2022-10-23 13:04:00 +03:00
allegroai
efb06891a8
Add support for PyTorch new extra_index_url repo support. We will find the correct index url based on the cuda version, and let pip do the rest.
2022-10-23 13:04:00 +03:00
allegroai
70771b12a9
Remove unused code
2022-10-23 13:04:00 +03:00
allegroai
3f7a4840cc
Add support for operator != in package version (mostly for pytorch resolving)
2022-10-23 13:04:00 +03:00
allegroai
e28048dc25
Change default pip version used to "pip<21" for better Python 3.10 support
2022-10-23 13:04:00 +03:00
allegroai
2ef5d38b32
Remove future (Python 2 is not supported for clearml-agent)
2022-10-23 13:03:59 +03:00
allegroai
d216d70cdf
Upgrade packages for better Python 3.10 support
2022-10-23 13:03:59 +03:00
allegroai
0de10345f7
Moved pyhocon to internal packages
2022-10-23 13:03:59 +03:00
allegroai
a243fa211f
Improve venv cache disabled message
2022-10-23 13:03:59 +03:00
allegroai
d794b047be
Fix system_site_packages is not turned on in k8s glue
2022-10-23 13:03:59 +03:00
allegroai
f0fd62a28f
Fix docker extra args showing up in configuration printout
2022-10-23 13:03:59 +03:00
allegroai
e8493d3807
Refactor override configuration to a method
2022-10-23 13:03:58 +03:00
Allegro AI
5353e9c44d
Update README.md
2022-10-19 02:47:10 +03:00
Allegro AI
75f5814f9f
Update README.md
2022-10-19 02:44:53 +03:00
Allegro AI
94b8b5520d
Update README.md
2022-10-19 02:18:56 +03:00
allegroai
42450dcbc4
Update clearml.conf
2022-10-07 15:33:19 +03:00
allegroai
ef47225d41
Version bump to v1.4.1
2022-10-07 15:27:49 +03:00
allegroai
e61accefb9
PEP8 + refactor
2022-10-07 15:26:31 +03:00
allegroai
5c1543d112
Add agent.disable_ssh_mount
configuration option (same as CLEARML_AGENT_DISABLE_SSH_MOUNT
env var)
2022-10-07 15:24:39 +03:00
allegroai
7ff6aee20c
Add warning if venv cache is disabled
2022-10-07 15:23:10 +03:00
allegroai
37ea381d98
Add support for docker args filters
2022-10-07 15:22:42 +03:00
allegroai
67fc884895
Fix --gpus all
not reporting GPU stats on worker machine
2022-10-07 15:22:13 +03:00
allegroai
1e3646b57c
Fix docker command for monitoring child agents
2022-10-07 15:21:32 +03:00
allegroai
ba2db4e727
Version bump to v1.4.0
2022-09-29 18:21:04 +03:00
allegroai
077148be00
version bump
2022-09-16 17:29:42 +03:00
allegroai
594ee5842e
Allow to pverride pytorch lookup page: "agent.package_manager.torch_page / torch_nightly_page / torch_url_template_prefix"
2022-09-15 20:16:41 +03:00
allegroai
a69766bd8b
Add CLEARML_AGENT_CHILD_AGENTS_COUNT_CMD to allow overriding child agent count command in k8s
2022-09-15 20:16:01 +03:00
allegroai
857a750eb1
Fix GCP load balancer not fwd GET request body, allow to change default request Action to Put/Post/Get. see api.http.default_method or CLEARML_API_DEFAULT_REQ_METHOD
2022-09-15 20:15:42 +03:00
allegroai
26aa50f1b5
Fix k8s glue extra_bash_init_cmd location in initial bash script
2022-09-02 23:50:03 +03:00
allegroai
8b4f1eefc2
Add more debug printouts in k8s glue
2022-09-02 23:49:28 +03:00
allegroai
97c2e21dcc
Fix resolving k8s pending queue may cause a queue with a uuid name to be created
2022-09-02 23:49:28 +03:00
allegroai
918dd39b87
Add docker ssh_ro_folder (default: "/.ssh") changed docker ssh_folder (default: "~/.ssh")
2022-09-02 23:49:27 +03:00
allegroai
7776e906c4
Fix second .ssh temp mount fails if container changes the files inside
2022-09-02 23:49:27 +03:00
allegroai
1bf865ec08
Fix name not escaped as regex (all services "get_all" use regex for name)
2022-09-02 23:49:27 +03:00
Luca Cerone
3f1ce847dc
Fixed documentation ( #117 )
...
* Fixed documentation
* Update README.md
2022-09-01 17:18:48 +03:00
allegroai
9006c2d28f
Add support for abort callback registration
2022-08-29 18:06:59 +03:00