allegroai
829b1d8f15
Use deep copy to clone configuration, always write configuration before launching a docker
2020-05-09 20:12:29 +03:00
allegroai
53f511f536
Improve docker host-mount support, use TRAINS_AGENT_DOCKER_HOST_MOUNT env var
2020-05-09 20:02:46 +03:00
allegroai
7c87797a40
Pass git credentials to dockerized task execution
2020-05-09 19:59:58 +03:00
allegroai
272fa07c29
Fix and enhance "build --docker"
...
- Fix standalone docker execution
- Add --install-globally option to install required packages in the docker's system python
- Add --entry-point option to allow automatic task cloning when running the docker
2020-05-09 19:57:25 +03:00
allegroai
5bb257c46c
Add daemon --create-queue to automatically create a queue and use it if queue name doesn't exist in server
2020-05-09 19:50:53 +03:00
allegroai
9cb71b9526
Add daemon service mode to allow multiple tasks to be launched simultaneously on the same machine (--service-mode)
2020-05-09 19:45:14 +03:00
allegroai
38e02ca5cd
Add worker command state enforcement conforming and verification callback
2020-05-09 19:42:51 +03:00
allegroai
06bfea80bc
Fix read file scope
2020-04-09 11:27:04 +03:00
allegroai
fc28467080
Improve error message when failing to locate a task
2020-04-09 11:23:13 +03:00
allegroai
8d47905982
Show host information when failing to obtain a task
2020-04-01 19:12:45 +03:00
allegroai
a6a0b01f71
Remove deprecated OS environment variables
2020-04-01 19:11:37 +03:00
allegroai
61232d05dd
Fix run as user support in Windows and add fall-back for created user folders
2020-03-22 19:16:11 +02:00
allegroai
b3418e4496
Add daemon detached mode (--detached, -d) that runs agent in the background and returns immediately
2020-03-22 19:00:29 +02:00
allegroai
98a983d9a2
Add TRAINS_AGENT_EXTRA_PYTHON_PATH to allow adding additional python path for task execution (helpful when using extra untracked modules)
2020-03-20 10:46:56 +02:00
allegroai
482007c4ce
Fix run as user feature (TRAINS_AGENT_EXEC_USER)
2020-03-20 10:42:32 +02:00
allegroai
98198b8006
Auto mount ~/.git-credentials into docker container if file exists
2020-03-20 10:39:59 +02:00
allegroai
58ab67ea31
Fix execution output handling
2020-03-20 10:35:25 +02:00
allegroai
15e9e6b778
Fix "execute --clone" support
2020-03-12 18:38:35 +02:00
allegroai
aa75b92e46
Prefer docker image from command line over the one in the experiment
2020-03-12 18:35:49 +02:00
allegroai
757210d5b3
Add support for "execute --docker" and for cloning an experiment before execution
2020-03-12 18:33:07 +02:00
allegroai
3393372b9c
Do not share apt cache among agents on the same machine
2020-03-09 12:38:51 +02:00
allegroai
f2d2d702de
Fix k8s support to allow a specific network for the docker (do not use the parent daemon network definition)
2020-03-09 12:38:32 +02:00
allegroai
e3d0680d39
Improve Unicode/UTF stdout handling
2020-03-09 12:34:48 +02:00
allegroai
0272c4c79c
Add "--force-current-version" daemon command-line flag
2020-03-09 12:31:43 +02:00
allegroai
47bcd3839a
Pass correct GPU limit when skipping gpus flag in docker mode
2020-03-05 14:07:44 +02:00
allegroai
0a3a8a1c52
Add support for mounting dockerized experiment folders to host when running on K8s in daemon mode
2020-03-05 13:13:03 +02:00
allegroai
231a907cff
Add support for running daemon inside a K8s pod in daemon mode
2020-03-05 13:03:36 +02:00
allegroai
8f95eecf2e
Add TRAINS_AGENT_EXEC_USER support for multiple daemon instances
2020-03-05 12:46:53 +02:00
allegroai
81008ee00e
Add support for launching a specific python version based on Task.script.binary
2020-03-01 17:15:18 +02:00
allegroai
f838c8fc70
Allow providing queue names to daemon
2020-02-26 16:58:25 +02:00
allegroai
d558c66d3c
Do not stop experiments if network is down
2020-02-10 10:47:13 +02:00
allegroai
a57a5b151c
Daemon support for conda and poetry
2020-01-26 15:05:20 +02:00
allegroai
284271c654
Support limiting pip version, limit to <20 by default
2020-01-22 12:02:12 +02:00
allegroai
06897f7606
Fix poetry support
2020-01-21 16:23:36 +02:00
allegroai
599219b02d
Add conda support
2020-01-21 16:21:18 +02:00
allegroai
b7e568e299
Fix requirements handling and poetry support
2020-01-16 11:10:38 +02:00
allegroai
224868c9a4
Fix relative requirements "-e" support by installing from the code's cwd
2020-01-14 13:05:12 +02:00
allegroai
7a11c7c165
Make sure logs are sent even in case an exception occurs inside the logging monitor
2020-01-13 18:14:12 +02:00
allegroai
a9f479cfcd
Add extra docker parameters bash script to use when running an experiment using a docker image
2020-01-13 12:17:59 +02:00
allegroai
cbfba6acb2
Do not try to check for virtualenv command, we use it as python package
2020-01-13 12:12:38 +02:00
allegroai
f2e2e1f94a
Add configuration option to force docker pull
2020-01-13 12:11:06 +02:00
allegroai
2b239829de
Add extra_index_url to the configuration wizard
2019-12-24 18:23:59 +02:00
allegroai
5e063c9195
Add docker build command and improve k8s integration
2019-12-15 00:04:15 +02:00
allegroai
3a301b0b6c
Improve docker support and add docker build command
2019-12-15 00:03:04 +02:00
allegroai
1f0bb4906b
Improve configuration wizard
2019-12-15 00:02:04 +02:00
allegroai
0ed7b2a0c8
Fix support for shared cache folder between multiple nodes in the cluster
2019-11-23 23:38:36 +02:00
allegroai
bd73be928a
Improve trains-agent config wizard
2019-11-23 23:37:41 +02:00
allegroai
8b05bb1605
Add initial Conda support
2019-11-15 23:24:47 +02:00
allegroai
a7873705ec
Add --gpus / --cpu-only (equivalent to NVIDIA_VISIBLE_DEVICE)
...
Add agent.python_binary specifying full path to python binary to use for virtual environement creation
Fix Windows support
2019-11-08 22:36:24 +02:00
allegroai
c352c2711c
Improve multiple GPU's docker PyTorch support
...
Fix potential zombie dockers if task is aborted
2019-11-01 20:24:53 +02:00