
Super basic implementation (gist-like) of RLMs with REPL environments.

Python 770 129 Updated Jan 7, 2026

"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"

Python 2,702 488 Updated Mar 29, 2026

Douyin scraper: crawl other people's wonderful lives

Python 813 262 Updated Dec 8, 2022

nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)

Python 142 9 Updated May 8, 2025

A prototype implementation of the "dataset as a queue" pattern for processing web pages into interleaved image/text content.

Python 29 Updated Nov 16, 2025

Ongoing research training transformer models at scale

Python 16,145 3,867 Updated Apr 24, 2026

🚀 PyTorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support

Python 452 134 Updated Apr 24, 2026

Multilingual Document Layout Parsing in a Single Vision-Language Model

Python 8,470 762 Updated Mar 24, 2026

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 1,143 67 Updated Mar 20, 2025

This is the official repo for the paper "LongCat-Flash-Omni Technical Report"

Python 482 31 Updated Apr 15, 2026

Simple IO APIs with pluggable storage backends and rich format handlers.

Python 4 Updated Oct 31, 2025

Official Implementation of the ICCV 2023 paper: Perpetual Humanoid Control for Real-time Simulated Avatars

Python 1,232 121 Updated Aug 21, 2025

Python 165 18 Updated Dec 27, 2024

Tiny AutoEncoder for Hunyuan Video (and other video models)

Python 360 12 Updated Mar 14, 2026

Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialized for Physical AI applications.

Python 406 59 Updated Apr 20, 2026

RES: Refined Exponential Solver. https://arxiv.org/abs/2308.02157

Python 1 Updated Aug 24, 2025

Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"

Python 516 16 Updated Sep 2, 2024

[NeurIPS 2025] Radial Attention: O(nlogn) Sparse Attention with Energy Decay for Long Video Generation

Python 591 34 Updated Nov 11, 2025

[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention

Python 659 46 Updated Mar 6, 2026

Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core.

Python 185 30 Updated Mar 17, 2026

Dion optimizer algorithm

Python 467 55 Updated Apr 18, 2026

Evaluation harness for diffusion world models

TypeScript 14 4 Updated Aug 13, 2025

A place to store reusable transformer components of my own creation or found on the interwebs

Python 77 12 Updated Apr 24, 2026

Kernels, of the mega variety :)

Python 711 56 Updated Apr 23, 2026

Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world models for downstream applications.

Python 768 102 Updated Oct 29, 2025

Open-source coding LLM for software engineering tasks

Python 1,200 153 Updated Sep 30, 2025

[ICML 2025 Spotlight] Direct Discriminative Optimization: Reinforcing Diffusion/Autoregressive with GAN Discrimination

Python 120 2 Updated Jan 27, 2026

The official implementation of CVPR'25 Oral paper "Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise"

Python 1,074 50 Updated Oct 13, 2025