allegroai
09b6b6a9de
Fix non-root docker image usage
...
Fix broken trains-agent build
Improve support for dockers with preinstalled conda env
Improve trains-agent-docker spinning
2020-07-06 21:09:11 +03:00
allegroai
98ff9a50e6
Changed agent.docker_init_bash_script default value in comment
2020-07-06 21:05:55 +03:00
allegroai
1f4d358316
Changed default docker image from nvidia/cuda to "nvidia/cuda:10.1-runtime-ubuntu18.04" to support cudnn frameworks (TF)
2020-07-02 01:35:57 +03:00
allegroai
f693fa165c
Fix .git-credentials and .gitconfig mapping into docker
...
Add agent.docker_init_bash_script allow finer control over docker startup script
2020-07-02 01:33:13 +03:00
allegroai
c43084825c
Version bump to v0.15.1
2020-06-21 23:23:44 +03:00
allegroai
f1abee91dd
Add FORCE_LOCAL_TRAINS_AGENT_WHEEL to force the install of local trains agent wheel into the docker image
2020-06-21 23:23:26 +03:00
allegroai
c6b04edc34
version bump
2020-06-18 01:55:30 +03:00
allegroai
50b847f4f7
Add trains-agent dockerfile
2020-06-18 01:55:24 +03:00
allegroai
1f53a06299
Add agent.force_git_ssh_protocol option to force all git links to ssh:// (issue #16 )
...
Add git user/pass credentials for pip git packages (git+http and git+ssh) (issue #22 )
2020-06-18 01:55:14 +03:00
allegroai
257dd95401
Add warning on --gpus without detected CUDA version (see issue #24 )
2020-06-18 01:52:58 +03:00
allegroai
1736d205bb
Documentation
2020-06-18 00:31:44 +03:00
allegroai
6fef58df6c
embed jsonmodels 2.4 into trains-agent
2020-06-18 00:30:40 +03:00
allegroai
473a8de8bb
Fix trains-agent init (max two verification retries, then print error)
2020-06-11 15:39:38 +03:00
Allegro AI
ff6272f48f
Merge pull request #23 from H4dr1en/patch-1
...
remove six and pathlib2 dependencies from setup.py
2020-06-05 19:20:09 +03:00
H4dr1en
1b5bcebd10
remove six and pathlib2 dependencies from setup.py
2020-06-05 18:01:35 +02:00
Allegro AI
c4344d3afd
Update README.md
2020-06-02 01:02:34 +03:00
Allegro AI
45a44b087a
Update README.md
2020-06-02 00:58:52 +03:00
allegroai
c58ffdb9f8
Version bump to v0.15.0
2020-06-01 19:56:59 +03:00
allegroai
54d9d77294
Allow services mode to re-register (docker can kill it and not exit gracefully)
2020-06-01 16:34:33 +03:00
allegroai
ce02385420
Fix services mode abort docker while installing, detect docker crash
2020-06-01 16:33:47 +03:00
allegroai
87ffd95eaa
Upgrade default pip version to <20.2
2020-06-01 16:33:00 +03:00
allegroai
522dd85d7b
Fix docker build with no --entry-point to use bash as an entrypoint
2020-06-01 11:05:06 +03:00
allegroai
3651c85fcd
Fix print if no repo (standalone script)
2020-05-31 14:03:31 +03:00
allegroai
566427d550
Fix build failing due to missing session
2020-05-31 14:02:42 +03:00
allegroai
cc99077c92
Do not monitor GPU when running with --cpu-only
2020-05-31 14:01:14 +03:00
allegroai
5f112447f7
CUDA_VISIBLE_DEVICES should not be set to "all"
2020-05-31 14:00:51 +03:00
allegroai
22c5f043aa
Fix detached mode to correctly use cache folder slots
2020-05-31 14:00:14 +03:00
allegroai
860ff8911c
Fix status message check containing "worker" (deprecated test)
2020-05-31 13:58:39 +03:00
allegroai
799b292146
Support running code from module (i.e. '-m' in execution entry point)
2020-05-31 13:54:13 +03:00
allegroai
fffe8e1c3f
Fix init wizard, correctly display the input servers
2020-05-31 13:53:34 +03:00
allegroai
8245293f7f
Fix request endpoint constant version numbers
2020-05-31 13:52:53 +03:00
allegroai
6563ce70c8
Update README
2020-05-09 20:12:53 +03:00
allegroai
829b1d8f15
Use deep copy to clone configuration, always write configuration before launching a docker
2020-05-09 20:12:29 +03:00
allegroai
f6be64a4b5
Print conda install output if running in debug mode, turn on debugging if --debug flag is used
2020-05-09 20:11:01 +03:00
allegroai
21f6a73f66
Include CUDA version in the pytorch package fail error
2020-05-09 20:09:18 +03:00
allegroai
77c4c79a2f
Support pip 20.1 local/http package reference in pip freeze
2020-05-09 20:08:17 +03:00
allegroai
2ad929fa00
Add torch_nightly flag support (if torch wheel is not found on stable try the nightly builds), improve support for torch in freeze (add actually used HTTP link as comment to the original package)
2020-05-09 20:08:05 +03:00
allegroai
53f511f536
Improve docker host-mount support, use TRAINS_AGENT_DOCKER_HOST_MOUNT env var
2020-05-09 20:02:46 +03:00
allegroai
7c87797a40
Pass git credentials to dockerized task execution
2020-05-09 19:59:58 +03:00
allegroai
272fa07c29
Fix and enhance "build --docker"
...
- Fix standalone docker execution
- Add --install-globally option to install required packages in the docker's system python
- Add --entry-point option to allow automatic task cloning when running the docker
2020-05-09 19:57:25 +03:00
allegroai
6ce9cf7c2a
Fix version control links in requirements when using conda
2020-05-09 19:52:51 +03:00
allegroai
abb30ac2b8
Move --gpus and --cpu-only to worker args (used by daemon, execute and build)
2020-05-09 19:51:45 +03:00
allegroai
5bb257c46c
Add daemon --create-queue to automatically create a queue and use it if queue name doesn't exist in server
2020-05-09 19:50:53 +03:00
allegroai
c65b28ed92
Update venv_update URL
2020-05-09 19:47:00 +03:00
allegroai
fce8eb6782
Add OS environment configuration for git user/pass using TRAINS_AGENT_GIT_USER/TRAINS_AGENT_GIT_PASS
2020-05-09 19:46:46 +03:00
allegroai
9cb71b9526
Add daemon service mode to allow multiple tasks to be launched simultaneously on the same machine (--service-mode)
2020-05-09 19:45:14 +03:00
allegroai
38e02ca5cd
Add worker command state enforcement conforming and verification callback
2020-05-09 19:42:51 +03:00
allegroai
06bfea80bc
Fix read file scope
2020-04-09 11:27:04 +03:00
allegroai
e660c7f2be
Fix comments in config files
2020-04-09 11:23:45 +03:00
allegroai
fc28467080
Improve error message when failing to locate a task
2020-04-09 11:23:13 +03:00