allegroai
b2da639582
Add CLEARML_AGENT_FORCE_SYSTEM_SITE_PACKAGES
env var (default true) to allow overriding default "system_site_packages: true" behavior when running tasks in containers (docker mode and k8s-glue)
2022-12-10 20:00:46 +02:00
allegroai
dc5e0033c8
Remove support for kubectl run
...
Allow customizing pod name prefix and limit pod label
Return deleted pods from cleanup
Some refactoring
2022-12-05 11:40:19 +02:00
allegroai
3dd5973734
Filter by phase when detecting hanging pods
...
More debug print-outs
Use task session when possible
Push task into k8s scheduler queue only if running from the same tenant
Make sure we pass git_user/pass to the task pod
Fix cleanup command not issued when no pods exist in a multi-queue setup
2022-12-05 11:29:59 +02:00
allegroai
6e7fb5f331
Fix sending task logs fails when agent is not running in the same tenant
2022-12-05 11:19:14 +02:00
allegroai
d794b047be
Fix system_site_packages is not turned on in k8s glue
2022-10-23 13:03:59 +03:00
allegroai
857a750eb1
Fix GCP load balancer not fwd GET request body, allow to change default request Action to Put/Post/Get. see api.http.default_method or CLEARML_API_DEFAULT_REQ_METHOD
2022-09-15 20:15:42 +03:00
allegroai
26aa50f1b5
Fix k8s glue extra_bash_init_cmd location in initial bash script
2022-09-02 23:50:03 +03:00
allegroai
8b4f1eefc2
Add more debug printouts in k8s glue
2022-09-02 23:49:28 +03:00
allegroai
97c2e21dcc
Fix resolving k8s pending queue may cause a queue with a uuid name to be created
2022-09-02 23:49:28 +03:00
allegroai
7292263f86
Add CLEARML_K8S_GLUE_START_AGENT_SCRIPT_PATH to allow customizing the agent startup script location for k8s glue agent
2022-08-23 23:16:36 +03:00
allegroai
820ab4dc0c
Fix k8s glue debug mode, refactoring
2022-08-01 18:55:49 +03:00
allegroai
d96b8ff906
Fix template namespace should override default namespace
2022-07-22 22:44:32 +03:00
allegroai
e687418194
Refactor k8s glue template handling
2022-07-22 22:43:07 +03:00
allegroai
2e5298b737
Add support for use-owner-token in k8s glue
2022-04-27 14:59:27 +03:00
allegroai
4c120d7cd0
Add ability to override container LOCAL_PYTHON, add auto python support (max 3.15)
2022-03-24 21:58:07 +02:00
allegroai
cd046927f3
Add k8s glue update task status_message in hanging pods daemon
...
Fix k8s glue not throwing error when failing to push to queue
2021-08-02 22:59:31 +03:00
allegroai
42606d9247
Fix multiple k8s glue instances with pod limits
...
Version bump
2021-07-15 10:28:43 +03:00
allegroai
499b3dfa66
Fix k8s glue, do not reset Task before re-enqueuing as it will remove runtime properties
2021-07-15 10:27:54 +03:00
allegroai
ca360b7d43
Improve max pod limit check
2021-07-15 10:26:49 +03:00
allegroai
6470b16b70
Add k8s set task container if using default image/arguments
2021-07-15 10:26:09 +03:00
allegroai
0e7546f248
Fix docker force pull in k8s glue _kubectl_apply()
2021-06-27 09:42:14 +03:00
allegroai
e3c8bd5666
Add support for agent.docker_force_pull configuration setting in k8s glue
2021-06-25 17:36:08 +03:00
allegroai
3ae1741343
Fix k8s glue task container arguments not supported in kubectl_run command
...
Fix k8s glue not passing required extra_docker_bash_script to string format
2021-06-25 17:35:01 +03:00
allegroai
53c106c3af
Fix k8s glue task container handling fails parsing docker image
...
Fix k8s glue uses task container image arguments when no image is specified
2021-06-25 17:34:28 +03:00
allegroai
a2db1f5ab5
Remove queue name from pod name in k8s glue, add queue name and ID to pod labels (issue #64 )
2021-05-05 12:03:35 +03:00
allegroai
4f18bb7ea0
Add k8s glue default restartPolicy=Never to template to prevent pods from restarting
2021-04-28 13:20:13 +03:00
allegroai
08ff5e6db7
Add number of pods limit to k8s glue
2021-04-25 10:47:49 +03:00
allegroai
537b67e0cd
Fix agent can return non-zero error code and pods will end up restarting forever (issue #56 )
2021-04-12 23:00:59 +03:00
allegroai
e71e6865d2
Add agent.docker_install_opencv_libs (default: True) to enable auto opencv libs install for faster docker spin-up
2021-04-07 18:45:44 +03:00
allegroai
14ac584577
Support k8s glue container env vars merging
2021-03-31 23:53:58 +03:00
allegroai
9ce6baf074
Fix broken k8s glue docker args parsing
...
Fix empty env prevents override when merging template
2021-03-26 12:26:15 +03:00
allegroai
205f9dd816
Fix k8s glue does not pass docker environment variables
...
Remove deprecated flags
2021-03-03 15:07:06 +02:00
allegroai
8229843018
Add base-pod-number parameter to k8s glue and example
2021-01-26 20:00:18 +02:00
allegroai
0462af6a3d
Allow providing namespace in k8s glue and k8s glue example
2021-01-20 19:01:03 +02:00
allegroai
b7f87fb8d3
Detect and delete "stuck" k8s pods k8s glue
2021-01-10 12:37:13 +02:00
allegroai
b4d143812e
initial clearml-agent v0.17.0
2020-12-22 23:00:57 +02:00
allegroai
6e1f74402e
Rename trains-agent -> clearml-agent
2020-12-22 21:21:29 +02:00