Skip to content
View ZhiyingDu's full-sized avatar

Highlights

  • Pro

Block or report ZhiyingDu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official repo for paper "HiMoE-VLA: Hierarchical Mixture-of-Experts for Generalist Vision-Language-Action Policies"

Python 31 Updated Dec 12, 2025

[CoRL 25] Code for FLOWER VLA for finetuning FLOWER on CALVIN and all LIBERO environments

C++ 87 16 Updated Sep 22, 2025

[CVPR 2026] FlashMotion: Few-Step Controllable Video Generation with Trajectory Guidance

Python 51 3 Updated Mar 13, 2026

VideoLoom: A Video Large Language Model for Joint Spatial-Temporal Understanding

Python 25 Updated Jan 23, 2026

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,953 1,430 Updated Mar 3, 2026

A comprehensive list of excellent research papers, models, datasets, and other resources on Vision-Language-Action (VLA) models in robotics.

470 14 Updated Mar 23, 2026

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

2,998 137 Updated Apr 15, 2026

A GPU-accelerated library that enables random frame access and efficient video decoding for data loading.

CMake 60 4 Updated Apr 20, 2026

We present StableAvatar, the first end-to-end video diffusion transformer, which synthesizes infinite-length high-quality audio-driven avatar videos without any post-processing, conditioned on a re…

Python 1,232 109 Updated Jan 20, 2026

Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Python 261 13 Updated Feb 10, 2026

4-steps distilled version of Wan2.2-TI2V-5B

Python 153 9 Updated Mar 15, 2026
Python 262 20 Updated Aug 25, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 23,539 4,351 Updated Apr 25, 2026

[ICCV 2025] MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance

Python 180 11 Updated Feb 11, 2026
1 Updated Dec 9, 2025
HTML 1 Updated Dec 9, 2025

Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think & UnifiedReward-Flex

Python 769 41 Updated Mar 19, 2026

[CSUR] A Survey on Video Diffusion Models

2,295 113 Updated Apr 15, 2026

FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients

Python 14 Updated Jan 22, 2025

[CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation

Python 182 14 Updated Jun 20, 2025

[Embodied-AI-Survey-2025] Paper List and Resource Repository for Embodied AI

2,024 140 Updated Apr 16, 2026

This repository implements a Retrieval-Augmented Generation (RAG) system using FAISS for vector-based retrieval and GPT for generative response. It is designed to process large datasets, index them…

Python 8 1 Updated Jan 1, 2025

CVPR and NeurIPS poster examples and templates

1,928 170 Updated May 9, 2023

[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"

Python 254 14 Updated Aug 15, 2024

[ACM MM 2023] Little Strokes Fell Great Oaks: Boosting the Hierarchical Features for Multi-exposure Image Fusion

Python 6 Updated Jul 23, 2024

A Collection of Papers and Codes for CVPR2026/CVPR2025/ICCV2025/CVPR2024/ECCV2024 AIGC

659 20 Updated Apr 23, 2026