
Hi there 👋

🔭 I’m currently working on visual perception, and my long-term goal is to build general foundation models.

⚡ Recently I’ve been focusing on vision-language models and unified visual models.

📫 If you’re also interested in these topics, feel free to reach out!

Pinned Repositories

  1. Oryx-mllm/Oryx

     [ICLR 2025] MLLM for On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

     Python · 330 stars · 19 forks

  2. Insight-V

     [CVPR 2025 Highlight] Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

     Python · 241 stars · 6 forks

  3. MME-Benchmarks/Video-MME-v2

     Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

     Python · 355 stars · 1 fork

  4. Ola-Omni/Ola

     Ola: Pushing the Frontiers of Omni-Modal Language Model

     Python · 389 stars · 16 forks

  5. Octopus

     [ECCV 2024] 🐙 Octopus, an embodied vision-language model trained with RLEF, excelling at embodied visual planning and programming.

     Python · 298 stars · 20 forks

  6. open-compass/VLMEvalKit

     Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

     Python · 4.1k stars · 684 forks