ZhiyingDu

Zhiying Du ZhiyingDu

13 followers · 2 following

Fudan University
Beijing, China
[email protected]
https://zhiyingdu.github.io/

Highlights

Stars

ZhiyingDu / HiMoE-VLA

Official repo for paper "HiMoE-VLA: Hierarchical Mixture-of-Experts for Generalist Vision-Language-Action Policies"

Python 31 Updated Dec 12, 2025

intuitive-robots / flower_vla_calvin

[CoRL 25] Code for FLOWER VLA for finetuning FLOWER on CALVIN and all LIBERO environments

C++ 87 16 Updated Sep 22, 2025

quanhaol / FlashMotion

[CVPR 2026] FlashMotion: Few-Step Controllable Video Generation with Trajectory Guidance

Python 51 3 Updated Mar 13, 2026

JPShi12 / VideoLoom

VideoLoom: A Video Large Language Model for Joint Spatial-Temporal Understanding

Python 25 Updated Jan 23, 2026

hq-King / Awesome-Affordance-Learning

176 7 Updated Apr 5, 2026

facebookresearch / vggt

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,953 1,430 Updated Mar 3, 2026

Jiaaqiliu / Awesome-VLA-Robotics

A comprehensive list of excellent research papers, models, datasets, and other resources on Vision-Language-Action (VLA) models in robotics.

470 14 Updated Mar 23, 2026

jonyzhang2023 / awesome-embodied-vla-va-vln

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

2,998 137 Updated Apr 15, 2026

yueen-ma / Awesome-VLA

548 23 Updated Feb 27, 2026

AgiBot-World / VideoDataset

A GPU-accelerated library that enables random frame access and efficient video decoding for data loading.

CMake 60 4 Updated Apr 20, 2026

Francis-Rings / StableAvatar

We present StableAvatar, the first end-to-end video diffusion transformer, which synthesizes infinite-length high-quality audio-driven avatar videos without any post-processing, conditioned on a re…

Python 1,232 109 Updated Jan 20, 2026

CodeGoat24 / Pref-GRPO

Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Python 261 13 Updated Feb 10, 2026

quanhaol / Wan2.2-TI2V-5B-Turbo

4-steps distilled version of Wan2.2-TI2V-5B

Python 153 9 Updated Mar 15, 2026

gen-robot / RL4VLA

Python 262 20 Updated Aug 25, 2025

huggingface / lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 23,539 4,351 Updated Apr 25, 2026

quanhaol / MagicMotion

[ICCV 2025] MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance

Python 180 11 Updated Feb 11, 2026

ZhiyingDu / ZhiyingDu

1 Updated Dec 9, 2025

ZhiyingDu / ECCV-2024-Workshop-on-Multimodal-Perception-and-Comprehension-of-Corner-Cases-in-Autonomous-Driving

Python 3 Updated Aug 21, 2024

ZhiyingDu / ZhiyingDu.github.io

HTML 1 Updated Dec 9, 2025

CodeGoat24 / UnifiedReward

Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think & UnifiedReward-Flex

Python 769 41 Updated Mar 19, 2026

ChenHsing / Awesome-Video-Diffusion-Models

[CSUR] A Survey on Video Diffusion Models

2,295 113 Updated Apr 15, 2026

Physical-Intelligence / openpi

Python 11,511 1,831 Updated Apr 16, 2026

nailwatts / FNIN

FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients

Python 14 Updated Jan 22, 2025

PKU-HMI-Lab / LIFT3D

[CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation

Python 182 14 Updated Jun 20, 2025

HCPLab-SYSU / Embodied_AI_Paper_List

[Embodied-AI-Survey-2025] Paper List and Resource Repository for Embodied AI

2,024 140 Updated Apr 16, 2026

dongdongunique / LLM_RAG

This repository implements a Retrieval-Augmented Generation (RAG) system using FAISS for vector-based retrieval and GPT for generative response. It is designed to process large datasets, index them…

Python 8 1 Updated Jan 1, 2025

zhoubolei / bolei_awesome_posters

CVPR and NeurIPS poster examples and templates

1,928 170 Updated May 9, 2023

wenyuqing / panacea

[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"

Python 254 14 Updated Aug 15, 2024

ZhiyingDu / BHFMEF

[ACM MM 2023] Little Strokes Fell Great Oaks: Boosting the Hierarchical Features for Multi-exposure Image Fusion

Python 6 Updated Jul 23, 2024

Kobaayyy / Awesome-CVPR2026-CVPR2025-ICCV2025-CVPR2024-ECCV2024-AIGC

A Collection of Papers and Codes for CVPR2026/CVPR2025/ICCV2025/CVPR2024/ECCV2024 AIGC

659 20 Updated Apr 23, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly