Conversation
Reduce columns to set that is less likely to have empty entries that break parsing.
FrankD412
left a comment
There was a problem hiding this comment.
This looks good overall. Aside from some knitpicks I think this makes sense.
| Checks if there is a slurm instance to schedule to. NOT IMPLEMENTED YET. | ||
| """ | ||
| slurm_info_func = 'sinfo' | ||
| try: |
There was a problem hiding this comment.
I'm almost wondering if the version checking could at some point be moved to the adapters. I think the Flux adapter currently has version check logic for verifying Maestro can use it.
There was a problem hiding this comment.
Yeah, capturing that info in the adapters would be quite a useful thing for them to put in to the logs. More on point on this use case of it: do we want to have adapters themselves be able to tell if they have something to schedule to? gets a little muddy with the nested adapters -> flux in slurm/lsf/etc, but could be a potentially interesting use for dependencies too?
|
One other thing to stew on that i was playing with during testing this: maybe having an even more verbose log level would be handy to capture things like the raw sacct/squeue/etc outputs so that the logs are closer to fully reproducing errors. 'trace' level? and maybe part of that is even looking through everything and pushing some debug info up into info for things users might be interested in? |
1.1.10 Release (#432) * Sync up read the docs config with dev environments using poetry (#399) * Print usage on command line when no args are provided (#404) * Add sacct fallback to slurm adapter to improve robustness of job tracking (#405) * Update Flurm Job State mappings for flux versions >= 0.26 (#407) * Bump certifi from 2021.10.8 to 2022.12.7 to address security issue (#409) * Bump cryptography from 37.0.1 to 38.0.3 to address security issue (#410) * Add missing shbang in unscheduled scripts from lsf adapter (#411) * Update poetry lockfile to address dependabot flagged security issues (#412) * Fix for Dockerfile smell DL3006 (#418) * Port Maestro documentation to mkdocs and expand coverage of features and tutorials (#403) * Update version info to be driven from pyproject.toml exclusively, and hook up to command line (#419) * Pin mermaid to < 10.x due to api change (#422) * Bump lock file certifi from 2022.12.7 to 2023.7.22 to address security issue (#426) * Refactor flux adapter to avoid using pickle to talk to flux brokers installed in external environments (#415) Also adds flux integration tests to exercise against real flux brokers * Add pager functionality to status command (#420) * Patch broken flux job cancellation (#428) * Insulate slurm adapters from user customization of squeue and sacct output formats (#431) Also adds live unit and integration tests for slurm adapter --------- Co-authored-by: Francesco Di Natale <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Bruno P. Kinoshita <[email protected]> Co-authored-by: Charles Doutriaux <[email protected]> Co-authored-by: Giovanni Rosa <[email protected]> Co-authored-by: Brian Gunnarson <[email protected]>
1.1.10 Release (#432) * Sync up read the docs config with dev environments using poetry (#399) * Print usage on command line when no args are provided (#404) * Add sacct fallback to slurm adapter to improve robustness of job tracking (#405) * Update Flurm Job State mappings for flux versions >= 0.26 (#407) * Bump certifi from 2021.10.8 to 2022.12.7 to address security issue (#409) * Bump cryptography from 37.0.1 to 38.0.3 to address security issue (#410) * Add missing shbang in unscheduled scripts from lsf adapter (#411) * Update poetry lockfile to address dependabot flagged security issues (#412) * Fix for Dockerfile smell DL3006 (#418) * Port Maestro documentation to mkdocs and expand coverage of features and tutorials (#403) * Update version info to be driven from pyproject.toml exclusively, and hook up to command line (#419) * Pin mermaid to < 10.x due to api change (#422) * Bump lock file certifi from 2022.12.7 to 2023.7.22 to address security issue (#426) * Refactor flux adapter to avoid using pickle to talk to flux brokers installed in external environments (#415) Also adds flux integration tests to exercise against real flux brokers * Add pager functionality to status command (#420) * Patch broken flux job cancellation (#428) * Insulate slurm adapters from user customization of squeue and sacct output formats (#431) Also adds live unit and integration tests for slurm adapter --------- Co-authored-by: Francesco Di Natale <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Bruno P. Kinoshita <[email protected]> Co-authored-by: Charles Doutriaux <[email protected]> Co-authored-by: Giovanni Rosa <[email protected]> Co-authored-by: Brian Gunnarson <[email protected]>
Slurm adapter bugfixes
Test additions