Skip to content

WIP help w/ gpt-oss#653

Merged
Maxusmusti merged 1 commit intofixed-speed-gpt-oss-support-freezing-fix-lossfrom
os-i-show-gpt-oss-speed
Sep 11, 2025
Merged

WIP help w/ gpt-oss#653
Maxusmusti merged 1 commit intofixed-speed-gpt-oss-support-freezing-fix-lossfrom
os-i-show-gpt-oss-speed

Conversation

@RobotSail
Copy link
Copy Markdown
Member

making this PR early in the AM, will update with rest of changes later

Copy link
Copy Markdown
Collaborator

@Maxusmusti Maxusmusti left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@mergify mergify Bot added the one-approval label Sep 11, 2025
@Maxusmusti Maxusmusti merged commit df48282 into fixed-speed-gpt-oss-support-freezing-fix-loss Sep 11, 2025
1 check passed
@Maxusmusti Maxusmusti deleted the os-i-show-gpt-oss-speed branch September 11, 2025 14:43
RobotSail added a commit that referenced this pull request Sep 12, 2025
RobotSail added a commit that referenced this pull request Sep 12, 2025
RobotSail added a commit that referenced this pull request Sep 16, 2025
RobotSail added a commit that referenced this pull request Sep 16, 2025
RobotSail added a commit that referenced this pull request Sep 16, 2025
Maxusmusti added a commit that referenced this pull request Sep 17, 2025
* Adding dequantized load support for gpt_oss models

Signed-off-by: Mustafa Eyceoz <[email protected]>

* Update gpt oss saving with requantization

Signed-off-by: Mustafa Eyceoz <[email protected]>

* Adjust data processing for gpt format

Signed-off-by: Mustafa Eyceoz <[email protected]>

* fix for exact quantization algorithm to replicate OpenAI quantized weights

* Speedup replicate implementation

Signed-off-by: Mustafa Eyceoz <[email protected]>

* router freezing for gpt oss

Signed-off-by: Mustafa Eyceoz <[email protected]>

* Add corrected loss, aux loss support, and batching updates

Signed-off-by: Mustafa Eyceoz <[email protected]>

* Cleanup unnecessary test files

Signed-off-by: Mustafa Eyceoz <[email protected]>

* Fix linting and review feedback

Signed-off-by: Mustafa Eyceoz <[email protected]>

* Add linting skip for mxfp4 import

Signed-off-by: Mustafa Eyceoz <[email protected]>

* Fix unit tests with mock configs

Signed-off-by: Mustafa Eyceoz <[email protected]>

* Switch to mini trainer sampler

Signed-off-by: Mustafa Eyceoz <[email protected]>

* remove dead code + add defaults (#653)

* Refactors train loop, adds padded batch packer, other fixes (#654)

* addition of padded batch packer + simplified train loop

* update tests + linting

* fix tests x2

---------

Signed-off-by: Mustafa Eyceoz <[email protected]>
Co-authored-by: Nikhil Nayak <[email protected]>
Co-authored-by: Oleg Silkin <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants