Commit Graph

  • 6f078afafd Add Requirement.clone() allegroai 2020-10-11 11:21:49 +0300
  • 15f4aa613e Suppress "\r" when reading a current chunk of a file. Add agent.suppress_carriage_return (default True) to support previous behavior. allegroai 2020-10-11 11:21:08 +0300
  • 7cd9fa6c41 Version bump to v0.16.1 0.16.1 allegroai 2020-10-05 18:27:07 +0300
  • 234d5fac2c When using force ssh protocol, only enforce on git_host if provided, otherwise apply everywhere allegroai 2020-10-05 18:26:21 +0300
  • 6cbfb96ff8 Rename git_domain to git_host allegroai 2020-10-05 11:25:03 +0300
  • 6e54e55c31 Add agent.force_git_ssh_port to control https to ssh link conversion for non standard ssh port allegroai 2020-10-04 19:42:44 +0300
  • 3ff85b7b85 Replace back package version on conda and pip allegroai 2020-10-04 19:41:26 +0300
  • 5640489f57 Replace torch version on pre-installed local file allegroai 2020-10-04 19:40:39 +0300
  • 8135a6facf Add agent.git_domain setting for limiting git credential usage for a specific domain (env var TRAINS_AGENT_GIT_DOMAIN is also supported) allegroai 2020-10-04 19:40:04 +0300
  • b6ae4f211d Fix "package @ " should processed by us (pip will not test pre-installed version of the package compared with the link) allegroai 2020-10-04 19:38:33 +0300
  • a56f032ec4 Fix torch support to not change back the same link allegroai 2020-10-04 19:37:12 +0300
  • 075736de20 Translate downloaded URL back to original link when new pip version is installed (otherwise we end up with file:///... links) allegroai 2020-10-04 19:36:14 +0300
  • d8543c892e When new pip version is installed, no need to install git packages twice (pip freeze will detect the correct git link version) allegroai 2020-10-04 19:35:26 +0300
  • ca0870b048 Allow parsing of "package @ scheme://link" lines in requirements allegroai 2020-10-04 19:34:32 +0300
  • c7a739fafa Add support for detecting new pip version (20+) supporting @ in requirements allegroai 2020-10-04 19:33:52 +0300
  • 7170296162 Remove warning on '.' (same as an empty working directory) allegroai 2020-10-04 19:32:48 +0300
  • 3bed0ef33c Add protection against bad file name parsing in git diff apply allegroai 2020-10-04 19:31:48 +0300
  • d419fa1e4f Update torch version after using system pre-installed version allegroai 2020-10-04 19:29:47 +0300
  • 31a56c71bd Add preliminary agent uptime/downtime support allegroai 2020-09-29 19:34:51 +0300
  • 28f47419b0 Fix incorrect check for spaces in current execution folder (only check in cache folders) allegroai 2020-09-15 20:26:02 +0300
  • 6a24da2849 Add post_packages post_optional_packages to control packages installed after all the rest (e.g. horovod) Rename CythonReq to PriorityPackageRequirement and HorovodReq to PostRequirement allegroai 2020-09-15 20:20:55 +0300
  • 782668fd21 Add sdk.metrics.plot_max_num_digits to reduce plot storage size allegroai 2020-09-05 16:36:33 +0300
  • aaf8d802e7 Update documentation allegroai 2020-09-05 16:35:37 +0300
  • ca89a1e322 Fix pre-installed packages are ignored when installing a git package wheel. Reinstalling a git+http link is enough to make sure all requirements are met/installed (trains issue #196) allegroai 2020-09-05 16:34:36 +0300
  • 121dec2a62 Version bump to v0.16.0 0.16.0 allegroai 2020-08-10 17:28:00 +0300
  • 4aacf9005e Fix GPU Windows monitoring support (Trains Issue #177) allegroai 2020-08-10 08:07:51 +0300
  • 6b333202e9 Sync generated conf file with latest Trains allegroai 2020-08-08 14:44:45 +0300
  • ce6831368f Fix GPU monitoring on Windows machines allegroai 2020-08-08 14:43:25 +0300
  • e4111c830b Fix GIT user/pass in requirements and support for '-e git+http' lines allegroai 2020-07-30 14:30:23 +0300
  • 52c1772b04 Add requirement_parser into trains-agent instead as a dependency. Fix requirement_parser to support 'package @ git+http' lines allegroai 2020-07-30 14:29:37 +0300
  • 699d13bbb3 Fix task status change to queued should also never happen during Task runtime allegroai 2020-07-14 23:42:11 +0300
  • 2c8d7d3d9a Fix --debug to set all specified loggers to DEBUG Add set_urllib_log_level, in debug set urllib log level to DEBUG allegroai 2020-07-11 01:45:46 +0300
  • b13cc1e8e7 Add error message when Trains API Server is not accessible on startup allegroai 2020-07-11 01:44:45 +0300
  • 17d2bf2a3e Change daemon --stop without any specific flag to terminate the agents by worker id lexicographic order allegroai 2020-07-11 01:43:54 +0300
  • 94997f9c88 Add daemon --order-fairness for round-robin queue pulling Add daemon --stop to terminate running agent (assume all the rest of the arguments are the same) Clean up all log files on termination unless executed with --debug allegroai 2020-07-11 01:42:56 +0300
  • c6d998c4df Add terminate process and rmtree utilities allegroai 2020-07-11 01:40:50 +0300
  • f8ea445339 Fix docker to use UTF-8 encoding, so prints won't break it allegroai 2020-07-11 01:40:14 +0300
  • 712efa208b version bump 0.15.2rc0 allegroai 2020-07-06 21:09:21 +0300
  • 09b6b6a9de Fix non-root docker image usage Fix broken trains-agent build Improve support for dockers with preinstalled conda env Improve trains-agent-docker spinning allegroai 2020-07-06 21:09:11 +0300
  • 98ff9a50e6 Changed agent.docker_init_bash_script default value in comment allegroai 2020-07-06 21:05:55 +0300
  • 1f4d358316 Changed default docker image from nvidia/cuda to "nvidia/cuda:10.1-runtime-ubuntu18.04" to support cudnn frameworks (TF) allegroai 2020-07-02 01:35:57 +0300
  • f693fa165c Fix .git-credentials and .gitconfig mapping into docker Add agent.docker_init_bash_script allow finer control over docker startup script allegroai 2020-07-02 01:33:13 +0300
  • c43084825c Version bump to v0.15.1 0.15.1 allegroai 2020-06-21 23:23:44 +0300
  • f1abee91dd Add FORCE_LOCAL_TRAINS_AGENT_WHEEL to force the install of local trains agent wheel into the docker image allegroai 2020-06-21 23:23:26 +0300
  • c6b04edc34 version bump allegroai 2020-06-18 01:55:30 +0300
  • 50b847f4f7 Add trains-agent dockerfile allegroai 2020-06-18 01:55:24 +0300
  • 1f53a06299 Add agent.force_git_ssh_protocol option to force all git links to ssh:// (issue #16) Add git user/pass credentials for pip git packages (git+http and git+ssh) (issue #22) allegroai 2020-06-18 01:55:14 +0300
  • 257dd95401 Add warning on --gpus without detected CUDA version (see issue #24) allegroai 2020-06-18 01:52:58 +0300
  • 1736d205bb Documentation allegroai 2020-06-18 00:31:44 +0300
  • 6fef58df6c embed jsonmodels 2.4 into trains-agent allegroai 2020-06-18 00:30:40 +0300
  • 473a8de8bb Fix trains-agent init (max two verification retries, then print error) allegroai 2020-06-11 15:39:38 +0300
  • ff6272f48f
    Merge pull request #23 from H4dr1en/patch-1 Allegro AI 2020-06-05 19:20:09 +0300
  • 1b5bcebd10
    remove six and pathlib2 dependencies from setup.py H4dr1en 2020-06-05 18:01:35 +0200
  • c4344d3afd
    Update README.md Allegro AI 2020-06-02 01:02:34 +0300
  • 45a44b087a
    Update README.md Allegro AI 2020-06-02 00:58:52 +0300
  • c58ffdb9f8 Version bump to v0.15.0 0.15.0 allegroai 2020-06-01 19:56:59 +0300
  • 54d9d77294 Allow services mode to re-register (docker can kill it and not exit gracefully) allegroai 2020-06-01 16:34:33 +0300
  • ce02385420 Fix services mode abort docker while installing, detect docker crash allegroai 2020-06-01 16:33:47 +0300
  • 87ffd95eaa Upgrade default pip version to <20.2 allegroai 2020-06-01 16:33:00 +0300
  • 522dd85d7b Fix docker build with no --entry-point to use bash as an entrypoint allegroai 2020-06-01 11:05:06 +0300
  • 3651c85fcd Fix print if no repo (standalone script) allegroai 2020-05-31 14:03:31 +0300
  • 566427d550 Fix build failing due to missing session allegroai 2020-05-31 14:02:42 +0300
  • cc99077c92 Do not monitor GPU when running with --cpu-only allegroai 2020-05-31 14:01:14 +0300
  • 5f112447f7 CUDA_VISIBLE_DEVICES should not be set to "all" allegroai 2020-05-31 14:00:51 +0300
  • 22c5f043aa Fix detached mode to correctly use cache folder slots allegroai 2020-05-31 14:00:14 +0300
  • 860ff8911c Fix status message check containing "worker" (deprecated test) allegroai 2020-05-31 13:58:39 +0300
  • 799b292146 Support running code from module (i.e. '-m' in execution entry point) allegroai 2020-05-31 13:54:13 +0300
  • fffe8e1c3f Fix init wizard, correctly display the input servers allegroai 2020-05-31 13:53:34 +0300
  • 8245293f7f Fix request endpoint constant version numbers allegroai 2020-05-31 13:52:53 +0300
  • 6563ce70c8 Update README allegroai 2020-05-09 20:12:53 +0300
  • 829b1d8f15 Use deep copy to clone configuration, always write configuration before launching a docker allegroai 2020-05-09 20:12:29 +0300
  • f6be64a4b5 Print conda install output if running in debug mode, turn on debugging if --debug flag is used allegroai 2020-05-09 20:11:01 +0300
  • 21f6a73f66 Include CUDA version in the pytorch package fail error allegroai 2020-05-09 20:09:18 +0300
  • 77c4c79a2f Support pip 20.1 local/http package reference in pip freeze allegroai 2020-05-09 20:08:17 +0300
  • 2ad929fa00 Add torch_nightly flag support (if torch wheel is not found on stable try the nightly builds), improve support for torch in freeze (add actually used HTTP link as comment to the original package) allegroai 2020-05-09 20:08:05 +0300
  • 53f511f536 Improve docker host-mount support, use TRAINS_AGENT_DOCKER_HOST_MOUNT env var allegroai 2020-05-09 20:02:46 +0300
  • 7c87797a40 Pass git credentials to dockerized task execution allegroai 2020-05-09 19:59:58 +0300
  • 272fa07c29 Fix and enhance "build --docker" - Fix standalone docker execution - Add --install-globally option to install required packages in the docker's system python - Add --entry-point option to allow automatic task cloning when running the docker allegroai 2020-05-09 19:57:25 +0300
  • 6ce9cf7c2a Fix version control links in requirements when using conda allegroai 2020-05-09 19:52:51 +0300
  • abb30ac2b8 Move --gpus and --cpu-only to worker args (used by daemon, execute and build) allegroai 2020-05-09 19:51:45 +0300
  • 5bb257c46c Add daemon --create-queue to automatically create a queue and use it if queue name doesn't exist in server allegroai 2020-05-09 19:50:53 +0300
  • c65b28ed92 Update venv_update URL allegroai 2020-05-09 19:47:00 +0300
  • fce8eb6782 Add OS environment configuration for git user/pass using TRAINS_AGENT_GIT_USER/TRAINS_AGENT_GIT_PASS allegroai 2020-05-09 19:46:46 +0300
  • 9cb71b9526 Add daemon service mode to allow multiple tasks to be launched simultaneously on the same machine (--service-mode) allegroai 2020-05-09 19:45:14 +0300
  • 38e02ca5cd Add worker command state enforcement conforming and verification callback allegroai 2020-05-09 19:42:51 +0300
  • 06bfea80bc Fix read file scope allegroai 2020-04-09 11:27:04 +0300
  • e660c7f2be Fix comments in config files allegroai 2020-04-09 11:23:45 +0300
  • fc28467080 Improve error message when failing to locate a task allegroai 2020-04-09 11:23:13 +0300
  • 8d47905982 Show host information when failing to obtain a task allegroai 2020-04-01 19:12:45 +0300
  • a6a0b01f71 Remove deprecated OS environment variables allegroai 2020-04-01 19:11:37 +0300
  • 2b561f6066 Version bump to v0.14.1 0.14.1 allegroai 2020-03-24 20:37:18 +0200
  • 61232d05dd Fix run as user support in Windows and add fall-back for created user folders allegroai 2020-03-22 19:16:11 +0200
  • b3418e4496 Add daemon detached mode (--detached, -d) that runs agent in the background and returns immediately allegroai 2020-03-22 19:00:29 +0200
  • 5ef627165c Fix PyTorch support to ignore minor versions when looking for package to install or to download allegroai 2020-03-20 10:48:48 +0200
  • 98a983d9a2 Add TRAINS_AGENT_EXTRA_PYTHON_PATH to allow adding additional python path for task execution (helpful when using extra untracked modules) allegroai 2020-03-20 10:46:56 +0200
  • 482007c4ce Fix run as user feature (TRAINS_AGENT_EXEC_USER) allegroai 2020-03-20 10:42:32 +0200
  • 98198b8006 Auto mount ~/.git-credentials into docker container if file exists allegroai 2020-03-20 10:39:59 +0200
  • 94bb11a81a Change message when using local torch allegroai 2020-03-20 10:37:42 +0200
  • 4158d08f6f Fix test allegroai 2020-03-20 10:36:20 +0200
  • 58ab67ea31 Fix execution output handling allegroai 2020-03-20 10:35:25 +0200