WIP help w/ gpt-oss#653
Merged
Maxusmusti merged 1 commit intofixed-speed-gpt-oss-support-freezing-fix-lossfrom Sep 11, 2025
Merged
WIP help w/ gpt-oss#653Maxusmusti merged 1 commit intofixed-speed-gpt-oss-support-freezing-fix-lossfrom
Maxusmusti merged 1 commit intofixed-speed-gpt-oss-support-freezing-fix-lossfrom
Conversation
df48282
into
fixed-speed-gpt-oss-support-freezing-fix-loss
1 check passed
RobotSail
added a commit
that referenced
this pull request
Sep 12, 2025
RobotSail
added a commit
that referenced
this pull request
Sep 12, 2025
RobotSail
added a commit
that referenced
this pull request
Sep 16, 2025
RobotSail
added a commit
that referenced
this pull request
Sep 16, 2025
RobotSail
added a commit
that referenced
this pull request
Sep 16, 2025
Maxusmusti
added a commit
that referenced
this pull request
Sep 17, 2025
* Adding dequantized load support for gpt_oss models Signed-off-by: Mustafa Eyceoz <[email protected]> * Update gpt oss saving with requantization Signed-off-by: Mustafa Eyceoz <[email protected]> * Adjust data processing for gpt format Signed-off-by: Mustafa Eyceoz <[email protected]> * fix for exact quantization algorithm to replicate OpenAI quantized weights * Speedup replicate implementation Signed-off-by: Mustafa Eyceoz <[email protected]> * router freezing for gpt oss Signed-off-by: Mustafa Eyceoz <[email protected]> * Add corrected loss, aux loss support, and batching updates Signed-off-by: Mustafa Eyceoz <[email protected]> * Cleanup unnecessary test files Signed-off-by: Mustafa Eyceoz <[email protected]> * Fix linting and review feedback Signed-off-by: Mustafa Eyceoz <[email protected]> * Add linting skip for mxfp4 import Signed-off-by: Mustafa Eyceoz <[email protected]> * Fix unit tests with mock configs Signed-off-by: Mustafa Eyceoz <[email protected]> * Switch to mini trainer sampler Signed-off-by: Mustafa Eyceoz <[email protected]> * remove dead code + add defaults (#653) * Refactors train loop, adds padded batch packer, other fixes (#654) * addition of padded batch packer + simplified train loop * update tests + linting * fix tests x2 --------- Signed-off-by: Mustafa Eyceoz <[email protected]> Co-authored-by: Nikhil Nayak <[email protected]> Co-authored-by: Oleg Silkin <[email protected]>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
making this PR early in the AM, will update with rest of changes later