Commit Graph

1423 Commits

Author SHA1 Message Date
allegroai
babaf9f1ce Add OpenMPI/Slurm support 2020-03-20 10:23:00 +02:00
allegroai
0adbd79975 Fix StorageHelper upload on shutdown 2020-03-20 10:20:44 +02:00
allegroai
dc915d0241 Fix support for Task init/close multiple times 2020-03-20 10:20:06 +02:00
allegroai
667ddcab88 Fix import for services that do not exist in old versions 2020-03-20 10:16:48 +02:00
allegroai
e05e41c117 Add table reporting to manual reporting example 2020-03-20 10:13:22 +02:00
allegroai
3b1d2d3258 Version bump to v0.14.0 2020-03-12 19:42:48 +02:00
allegroai
afad6a42ea Add initial slurm support (multiple nodes sharing the same task id) 2020-03-12 18:12:16 +02:00
allegroai
5b29aa194c Make sure artifact temporary files names are valid file names 2020-03-12 18:10:03 +02:00
allegroai
84a34428b6 Add trains-init support for config file env override (as well as argument) 2020-03-12 18:09:03 +02:00
allegroai
b3dff9a4eb Support setting task initial iteration for continuing previous runs 2020-03-12 17:40:29 +02:00
allegroai
f3531c1af2 Allow Task.set_credentials() to override configuration file in dev mode 2020-03-12 17:22:09 +02:00
allegroai
5bc39271e3 Fix store uncommitted code configuration option 2020-03-12 17:17:39 +02:00
allegroai
461fbd9df0 Better warning messages for storage errors 2020-03-12 17:13:36 +02:00
allegroai
30cf6b4834 Fix HTTP link quoting in stored links 2020-03-12 17:04:31 +02:00
allegroai
98c9a95338 Add support for reporting tables 2020-03-10 13:30:42 +02:00
allegroai
1043c22d0a Update Jupyter notebook server execution example 2020-03-09 12:20:34 +02:00
allegroai
2bffc24347 Update Jupyter notebook server execution example 2020-03-08 18:56:41 +02:00
allegroai
9e0ea880ce Add missing import 2020-03-08 18:56:28 +02:00
allegroai
8acb236b33 Update examples 2020-03-05 19:56:51 +02:00
allegroai
ef73bc258f Add Jupyter notebook server execution example 2020-03-05 12:35:58 +02:00
allegroai
b7358d7fef Add portalocker for inter-process lock 2020-03-05 12:31:22 +02:00
allegroai
2e3820603a Allow argparser override values with command line even in remote execution (essential for sub-process support) 2020-03-05 12:28:36 +02:00
allegroai
1d9e70bd8b Fix signal hooking registration (cont.) 2020-03-05 12:26:56 +02:00
allegroai
181a0be0af Remove temporary file lock at the end of the execution or in Task.close() 2020-03-05 12:25:17 +02:00
allegroai
b0c602c832 Fix signal hooking registration 2020-03-05 12:24:14 +02:00
allegroai
bcf97afeb9 Forking processes should not pass along the original File based Lock 2020-03-05 12:22:14 +02:00
allegroai
888c53f67d Allow disabling repository detection when calling Task.init() 2020-03-05 12:19:40 +02:00
allegroai
4bca5ccf27 Always reload task section before editing parts of it 2020-03-05 12:11:55 +02:00
allegroai
a2ecb2c75d Only use file based locks for main task. Secondary tasks use traditional multiprocessing lock 2020-03-05 12:10:23 +02:00
allegroai
da804ca75f Add support for Popen subprocesses with task edit protection from multiple processes 2020-03-05 12:05:12 +02:00
allegroai
e3ae4f4e26 Optimize task refresh while pulling task status in local worker and last iteration for Resource Monitoring 2020-03-05 11:40:27 +02:00
Allegro AI
cc1508b2bd Merge pull request #111 from tirkarthi/fix-collections: Import ABC from collections.abc instead of collections for Python 3.9 compatibility 2020-03-05 00:37:04 +02:00
Karthikeyan Singaravelan
a97850e5b6 Import ABC from collections.abc instead of collections for Python 3.9 compatibility. 2020-03-03 21:38:03 +05:30
allegroai
146da439e7 Integrate pigar into Trains 2020-03-01 17:12:28 +02:00
allegroai
8ee2bd1844 Retry sending console logs if session.send() fails (applicable only in local mode where we use the logging handler) 2020-02-26 17:07:07 +02:00
allegroai
cf850020fb Don't print empty line at end of process if there's no artifacts summary 2020-02-26 17:06:17 +02:00
allegroai
baf5fc9e54 Version bump to v0.13.3 2020-02-23 11:20:21 +02:00
allegroai
8972c1f005 Add Task.[get/set]_parameters_as_dict() to allow interaction with non-main task parameters (no need to connect()) 2020-02-20 18:32:12 +02:00
allegroai
98e6c2004c Use standard os environment variables to obtain default credentials for AWS, Google and Azure 2020-02-20 18:29:53 +02:00
allegroai
b5168010e9 Make sure Task.connect() returns the same value it is passed 2020-02-18 11:26:52 +02:00
allegroai
14588e6dec Refactor utility function 2020-02-18 11:25:29 +02:00
allegroai
3ea570cadf Store python binary along with major.minor version in task script section 2020-02-18 11:24:04 +02:00
allegroai
9fd3b98b24 Fix session error to print the instance host and not the class host 2020-02-18 11:23:06 +02:00
allegroai
edc237dad4 Improve support for tensorboard.summarywriter.addscalars binding 2020-02-18 11:21:47 +02:00
allegroai
f5f13658c3 Add binding for tensorboard.summarywriter.addscalars as well as scalars grouping configuration option 2020-02-12 14:04:53 +02:00
allegroai
63ffc09ae0 Fix incorrect upgrade message 2020-02-12 14:03:24 +02:00
allegroai
0bc71fbcf4 Remove title/series naming restrictions (allow '$' and '.') 2020-02-10 10:30:57 +02:00
allegroai
3ee70beea2 Fix URL for uploaded files with '%' in their name to allow proper unquote during HTTP serving 2020-02-10 10:30:57 +02:00
allegroai
c6849985ea Add Task.set_base_docker() and Task.get_base_docker() 2020-02-10 10:30:57 +02:00
Allegro AI
73d011fbba Update README.md 2020-02-06 15:52:05 +02:00