Version 0.12.0
This release mostly features greatly improved reliability of stalled job detection, but also includes some refactoring. Here's the complete list:
- Modified CPU load calculations used to determine if a job is stalled now included all of the children of a process. Before, if a parent process was sleeping and children were doing all the work, the job would get incorrectly detected as stalled and be resubmitted. This was particularly problematic for SKLL.
- CPU usage and memory histories are now reset when a job is resubmitted. This means error emails will contain more sensible graphs for resubmitted jobs.
- Now raise a
JobExceptionif we give up on a job instead of ending up in a bad state. - Renamed
SEND_ERROR_MAILSenvironment variable toSEND_ERROR_MAIL. - Removed deprecated
pg_mapfunction. It was replaced bygrid_mapin 0.9.2 - Removed
runnermodule from generated API documentation, because no one should really need to use it directly. - Renamed
Job.job_idtoJob.id - Added missing
localoption togrid_map. - Added a bunch more unit tests.