allegroai
a41ea52f87
Add multiple packages support
2020-12-10 09:52:00 +02:00
allegroai
259113c989
Add PackageCollectorRequirement to allow multiple entries of the same package
2020-12-06 12:16:56 +02:00
allegroai
448e23825c
Fix requirements dict with null entry in pip
A null entry should be treated as None, in which case we install from requirements.txt
2020-12-06 12:14:22 +02:00
allegroai
74b646af9e
Add passing of TRAINS_DOCKER_IMAGE into docker for interactive sessions
2020-11-29 23:16:40 +02:00
allegroai
64e198a57a
Fix nvidia docker support on some Linux distros (SUSE)
2020-11-25 16:44:37 +02:00
allegroai
6df0f81ca0
Fix uid being None causing ValueError in str.startswith(). Fix str.split (should be applied to the filename itself, not the path).
2020-11-11 16:32:47 +02:00
allegroai
92efea6b76
Add agent.package_manager.force_repo_requirements_txt. If True, "Installed Packages" on Task are ignored, and only repo requirements.txt is used
2020-11-11 16:29:00 +02:00
allegroai
89a3020c5e
Fix Ubuntu/Debian support by making sure not to ask for input (fix tzdata install)
2020-10-15 23:32:17 +03:00
allegroai
fc3e47b67e
Add suppress_carriage_return to documentation
Add docker_preprocess_bash_script to allow preprocessing bash to be added
Fix multiple python versions installed in the same docker by finding the highest installed python inside the docker
Fix conda_env_as_base_docker not set to False in docker mode
2020-10-15 23:31:01 +03:00
allegroai
47092c47db
Fix apply git diff from submodule only
2020-10-15 23:26:52 +03:00
allegroai
3c514e3418
Make sure TRAINS_AGENT_K8S_HOST_MOUNT is used only once per mount
2020-10-15 23:24:51 +03:00
allegroai
eb942cfedd
Add agent.package_manager.conda_env_as_base_docker, allowing "docker_cmd" to contain a link to a full pre-packaged conda environment (conda-pack outputs a tar.gz). Use TRAINS_CONDA_ENV_PACKAGE to specify the conda tar.gz file.
2020-10-15 23:23:46 +03:00
allegroai
9fe77f3c28
Fix conda environment support for trains 0.16.3 full env. Add agent.package_manager.conda_full_env_update to allow conda to update back the requirements (default is false, to preserve previous behavior)
2020-10-11 11:24:52 +03:00
allegroai
15f4aa613e
Suppress "\r" when reading a current chunk of a file. Add agent.suppress_carriage_return (default True) to support previous behavior.
2020-10-11 11:21:08 +03:00
allegroai
7170296162
Remove warning on '.' (same as an empty working directory)
2020-10-04 19:32:48 +03:00
allegroai
31a56c71bd
Add preliminary agent uptime/downtime support
2020-09-29 19:34:51 +03:00
allegroai
28f47419b0
Fix incorrect check for spaces in current execution folder (only check in cache folders)
2020-09-15 20:26:02 +03:00
allegroai
6a24da2849
Add post_packages and post_optional_packages to control packages installed after all the rest (e.g. horovod)
Rename CythonReq to PriorityPackageRequirement and HorovodReq to PostRequirement
2020-09-15 20:20:55 +03:00
allegroai
aaf8d802e7
Update documentation
2020-09-05 16:37:17 +03:00
allegroai
6b333202e9
Sync generated conf file with latest Trains
2020-08-08 14:44:45 +03:00
allegroai
699d13bbb3
Fix task status change to queued, which should also never happen during Task runtime
2020-07-14 23:42:11 +03:00
allegroai
17d2bf2a3e
Change daemon --stop without any specific flag to terminate the agents by worker id lexicographic order
2020-07-11 01:43:54 +03:00
allegroai
94997f9c88
Add daemon --order-fairness for round-robin queue pulling
Add daemon --stop to terminate running agent (assume all the rest of the arguments are the same)
Clean up all log files on termination unless executed with --debug
2020-07-11 01:42:56 +03:00
allegroai
09b6b6a9de
Fix non-root docker image usage
Fix broken trains-agent build
Improve support for dockers with preinstalled conda env
Improve trains-agent-docker spinning
2020-07-06 21:09:11 +03:00
allegroai
f693fa165c
Fix .git-credentials and .gitconfig mapping into docker
Add agent.docker_init_bash_script to allow finer control over docker startup script
2020-07-02 01:33:13 +03:00
allegroai
f1abee91dd
Add FORCE_LOCAL_TRAINS_AGENT_WHEEL to force the install of local trains agent wheel into the docker image
2020-06-21 23:23:26 +03:00
allegroai
257dd95401
Add warning on --gpus without detected CUDA version (see issue #24)
2020-06-18 01:52:58 +03:00
allegroai
473a8de8bb
Fix trains-agent init (max two verification retries, then print error)
2020-06-11 15:39:38 +03:00
allegroai
54d9d77294
Allow services mode to re-register (docker can kill it and not exit gracefully)
2020-06-01 16:34:33 +03:00
allegroai
ce02385420
Fix services mode abort docker while installing, detect docker crash
2020-06-01 16:33:47 +03:00
allegroai
522dd85d7b
Fix docker build with no --entry-point to use bash as an entrypoint
2020-06-01 11:05:06 +03:00
allegroai
3651c85fcd
Fix print if no repo (standalone script)
2020-05-31 14:03:31 +03:00
allegroai
566427d550
Fix build failing due to missing session
2020-05-31 14:02:42 +03:00
allegroai
22c5f043aa
Fix detached mode to correctly use cache folder slots
2020-05-31 14:00:14 +03:00
allegroai
860ff8911c
Fix status message check containing "worker" (deprecated test)
2020-05-31 13:58:39 +03:00
allegroai
799b292146
Support running code from module (i.e. '-m' in execution entry point)
2020-05-31 13:54:13 +03:00
allegroai
fffe8e1c3f
Fix init wizard, correctly display the input servers
2020-05-31 13:53:34 +03:00
allegroai
829b1d8f15
Use deep copy to clone configuration, always write configuration before launching a docker
2020-05-09 20:12:29 +03:00
allegroai
53f511f536
Improve docker host-mount support, use TRAINS_AGENT_DOCKER_HOST_MOUNT env var
2020-05-09 20:02:46 +03:00
allegroai
7c87797a40
Pass git credentials to dockerized task execution
2020-05-09 19:59:58 +03:00
allegroai
272fa07c29
Fix and enhance "build --docker"
- Fix standalone docker execution
- Add --install-globally option to install required packages in the docker's system python
- Add --entry-point option to allow automatic task cloning when running the docker
2020-05-09 19:57:25 +03:00
allegroai
5bb257c46c
Add daemon --create-queue to automatically create a queue and use it if queue name doesn't exist in server
2020-05-09 19:50:53 +03:00
allegroai
9cb71b9526
Add daemon service mode to allow multiple tasks to be launched simultaneously on the same machine (--service-mode)
2020-05-09 19:45:14 +03:00
allegroai
38e02ca5cd
Add worker command state enforcement conforming and verification callback
2020-05-09 19:42:51 +03:00
allegroai
06bfea80bc
Fix read file scope
2020-04-09 11:27:04 +03:00
allegroai
fc28467080
Improve error message when failing to locate a task
2020-04-09 11:23:13 +03:00
allegroai
8d47905982
Show host information when failing to obtain a task
2020-04-01 19:12:45 +03:00
allegroai
a6a0b01f71
Remove deprecated OS environment variables
2020-04-01 19:11:37 +03:00
allegroai
61232d05dd
Fix run as user support in Windows and add fall-back for created user folders
2020-03-22 19:16:11 +02:00
allegroai
b3418e4496
Add daemon detached mode (--detached, -d) that runs agent in the background and returns immediately
2020-03-22 19:00:29 +02:00