-
Xi'an Jiaotong University
- Xi'an, China
-
01:52
(UTC -12:00) - https://scholar.google.com/citations?user=bAJIGTcAAAAJ&hl=en&authuser=1
Highlights
- Pro
Lists (24)
Sort Name ascending (A-Z)
3D
Agent
AI4Science
AIGC
AIGC for CV/MultiModal
AutoDrive
Awesome
Books
Change Detection
Datasets
EDIT
ELSE
Embodied AI
Foundation
High Quality Work
Learning
LLM
Low Level
Meta-Learning
MLLM
Non-Euclidean Representation
RL
Segmentation
Tools
Stars
Harness Engineering 学习指南 — 从概念理解到独立实践的深度学习档案
This repository is a curated collection of CVPR 2026 oral papers.
⭐️ A cross-platform CLI All-in-One assistant tool for Claude Code, Codex & Gemini CLI.
A cross-platform desktop All-in-One assistant tool for Claude Code, Codex, OpenCode, openclaw & Gemini CLI.
[CVPR2026]PixDLM: A Dual-Path Multimodal Language Model for UAV Reasoning Segmentation
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
This is the open-sourced link of the TPAMI 2026 paper "SkyFind: A Large-Scale Benchmark Unveiling Referring Expression Comprehension for UAV".
This repository is an official implementation of ADAPT: Action-aware Driving Caption Transformer, accepted by ICRA 2023.
[CVPR2026] Detect Anything via Next Point Prediction
Vision–Language–Action models for Autonomous Driving (VLA4AD) resources, serving as the companion repository to the survey paper “A Survey on Vision–Language–Action Models for Autonomous Driving”.
Referring Change Detection in Remote Sensing Imagery
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
[CVPR 2026] UniChange: Unifying Change Detection with Multimodal Large Language Model
[CVPR 2026] Official codes of "Monet: Reasoning in Latent Visual Space Beyond Image and Language"
🚀🚀🚀Official Repository of Intelligent Remote Sensing Agents: A Survey
🦞 Just talk to your agent — it learns and EVOLVES 🧬.
Pytorch Implementation of Neural Architecture Search with Reinforcement Learning (in dev)
Muggled SAM: Segmentation without the magic
[ISPRS2026] DVGBench: Implicit-to-Explicit Visual Grounding Benchmark in UAV Imagery with Large Vision-Language Models