Stars
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds
Information collection for the Happy Horse AI video generator model. Official demo and updates at happyhorses.io.
Unified Codebase for Advanced World Models.
CLIP+MLP Aesthetic Score Predictor
Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models
Helios: Real-Time Long Video Generation Model
DreamWorld: Unified World Modeling in Video Generation
A Curated List of Awesome Video World Models with AR Diffusion: Covering Algorithms, Applications, and Infrastructure, Aimed at Serving as a Comprehensive Resource for Researchers, Practitioners, a…
Video Content Customization Using First Frame
FireRed-Image-Edit is a powerful image editing foundation model achieving open-source state-of-the-art performance with precise instruction following, high-fidelity generation, superior identity co…
Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation"
[NeurIPS 2025 D&B🔥] OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation
[CVPR 2026 Highlight] VideoCoF: Unified Video Editing with Temporal Reasoner
[ICLR 2026] EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling
📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.
[NeurIPS 2025] Sekai: A Video Dataset towards World Exploration
[ICLR 2026 Oral] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling
(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"
[NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance
"MoCA: Mixture-of-Components Attention for Scalable Compositional 3D Generation"
A curated list of recent diffusion models for video generation, editing, and various other applications.
[ICLR 2026] LongLive: Real-time Interactive Long Video Generation
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
Kandinsky 5.0: A family of diffusion models for Video & Image generation
HunyuanVideo-1.5: A leading lightweight video generation model
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720