Skip to content

Add controls for attaching nodes/task info to jobspecs and launchers#460

Closed
jwhite242 wants to merge 4 commits intodevelopfrom
feature/flux_binding_mode
Closed

Add controls for attaching nodes/task info to jobspecs and launchers#460
jwhite242 wants to merge 4 commits intodevelopfrom
feature/flux_binding_mode

Conversation

@jwhite242
Copy link
Copy Markdown
Collaborator

Adds additional control on how resource specs are attached to both batch jobs and launchers separately. Enables use of more dynamic resource configuration such as the gpumode introduced with flux for modern AMD machines which can change the number of logical gpus after job scheduling time. Pre 1.1.12 behavior attaches tasks to the jobspec in flux, which can result in 'unsatisfiable' job errors due to there not being sufficient logical gpu's at jobspec validation time to fulfill the job request. This case requires not binding the tasks to the jobspec, but still binding the tasks to the $(LAUNCHER) generated flux run.

This initial version adds support for flux only; slurm/lsf/etc handling to follow in subsequent release (will be no-op everywhere but flux scheduled jobs).

@jwhite242 jwhite242 force-pushed the feature/flux_binding_mode branch from 5a7af94 to 6b7dfa5 Compare February 18, 2026 17:06
@jwhite242 jwhite242 marked this pull request as draft February 18, 2026 17:12
@jwhite242
Copy link
Copy Markdown
Collaborator Author

jwhite242 commented Mar 13, 2026

Closing this for now in favor of #467 to default to single slot per node configurations. Can revisit this idea later if more granular control of slots on the node/allocation level is needed.

@jwhite242 jwhite242 closed this Mar 13, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant