-
Zhejiang University
- Hangzhou, China
- https://jianbiaomei.github.io
Stars
Hy3 preview (295B A21B), a leading reasoning and agent model in its size, with great cost efficiency
The agent that grows with you
Official repository for the paper "Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation"
JoyAI-Image is the unified multimodal foundation model for image understanding, text-to-image generation, and instruction-guided image editing.
The repo is finally unlocked. enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.
Supercharge your AI agents by versioning, tracking, and merging overlapping skills.
OpenClaw-RL: Train any agent simply by talking
Causal video-action world model for generalist robot control
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"
Think Before You Move: Latent Motion Reasoning for Text-to-Motion Generation
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
[ICLR 2026] The official repository for paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"
Co-Reinforcement Learning for Unified Multimodal Understanding and Generation
RynnVLA-002: A Unified Vision-Language-Action and World Model
The Agent’s First Day: Benchmarking Learning, Exploration, and Scheduling in the Workplace Scenarios
✨✨ [ICLR 2026] R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning
[NeurIPS'24 Spotlight] GAIA: Rethinking Action Quality Assessment for AI-Generated Videos
PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation
Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation
A unified inference and post-training framework for accelerated video generation.
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Dream-VL and Dream-VLA, a diffusion VLM and a diffusion VLA.
[CVPR 2026 Highlight] NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
A construction kit for reinforcement learning environment management.