Skip to content
View txytju's full-sized avatar
  • Ytech Kwai
  • Beijing,China

Block or report txytju

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.

Python 179 7 Updated Mar 18, 2026

PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.

Python 164 11 Updated Apr 5, 2024
Python 42 3 Updated Jan 2, 2025

[CVPR2025]

JavaScript 1 Updated Mar 4, 2025

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 1,143 67 Updated Mar 20, 2025

[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.

Python 323 8 Updated Jul 9, 2024

The best OSS video generation models, created by Genmo

Python 3,643 477 Updated Nov 14, 2025

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 1,131 70 Updated Feb 7, 2025

Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen

438 23 Updated Mar 8, 2025

Official inference repo for FLUX.1 models

Python 25,462 1,877 Updated Jul 31, 2025

[CVPR2024] Official implementation of SplattingAvatar.

Python 554 54 Updated Oct 28, 2024

[ECCV'24] TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting

Python 378 46 Updated Mar 15, 2025

[CVPR '24] DiffusionAvatars: Deferred Diffusion for High-fidelity 3D Head Avatars

Jupyter Notebook 174 17 Updated Jun 26, 2025

[Siggraph '23] NeRSemble: Neural Radiance Field Reconstruction of Human Heads

Python 250 13 Updated Apr 29, 2025

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Python 1,727 144 Updated Dec 17, 2024

Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024

Python 760 38 Updated Nov 16, 2023

[CVPR 2024] The official repo for "GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians"

Python 584 54 Updated Mar 26, 2024

Official Pytorch Implementation of Paper - Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation - NeurIPS 2024

Python 111 7 Updated Dec 23, 2024

Kolors Team

Python 4,614 357 Updated Nov 13, 2024

[ICLR 2025 Oral] On Scaling Up 3D Gaussian Splatting Training

Python 666 42 Updated Sep 24, 2025

Decoupled Video Instance Segmentation Framework, improved version of dvis

Python 11 2 Updated May 22, 2024

Decoupled Video Instance Segmentation Framework

Python 8 1 Updated May 22, 2024

[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Python 4,860 383 Updated Apr 7, 2024

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 5,054 412 Updated Jan 9, 2026

Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models

Python 355 22 Updated Jul 4, 2023

Official implementation for "ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing"

Python 231 12 Updated Jun 12, 2023
Python 136 13 Updated Jul 4, 2024

Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)

Python 1,971 142 Updated Dec 1, 2025

Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"

Python 310 16 Updated Apr 22, 2024
Next