Stars
[ICCV 2025] Official PyTorch implementation of TrafficLoc
GeoCalib: Learning Single-image Calibration with Geometric Optimization (ECCV 2024)
Contrastive Olfaction-Language-Image Pre-training Model. The first-ever series of embeddings models for olfaction-vision-language applications in robotics and embodied AI - an extension of CLIP wit…
Code and data for the paper "Diffusion Graph Neural Networks for Robustness in Olfaction Sensors and Datasets in Robotics"
[ICCV2025] CoopTrack: Exploring End-to-End Learning for Efficient Cooperative Sequential Perception
[CVPR 2025 Oral] FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video
Optimal Transport Aggregation for Visual Place Recognition
FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry
ViPE: Video Pose Engine for Geometric 3D Perception
Monocular Visual-Inertial State Estimator on Mobile Phones
A Robust and Versatile Monocular Visual-Inertial State Estimator
OpenStereo: A Comprehensive Benchmark for Stereo Matching
KVN: Keypoints Voting Network with Differentiable RANSAC for Stereo Pose Estimation
Deep Stereo RGB-only Dense 6D Object Pose Estimation
StereOBJ-1M: Large-scale Stereo Image Dataset for 6D Object Pose Estimation (ICCV 2021)
source code for paper "Learning Better Keypoints for Multi-Object 6DoF Pose Estimation".
[CVPR Workshop DLGC, 2024] RDPN6D: Residual-based Dense Point-wise Network for 6Dof Object Pose Estimation Based on RGB-D Images
Deep Object Pose Estimation (DOPE) – ROS inference (CoRL 2018)
[arXiv '24] Real-Time 3D Semantic Scene Perception for Egocentric Robots with Binocular Vision
Automated, hardware-independent Hand-Eye Calibration
[ICRA 2023 & IROS 2023] Code release for Keypoint-GraspNet (KGN) and Keypoint-GraspNet-V2 (KGNv2)
(ECCV 2024) Official implementation of the Economic 6-DoF Grasp Detection Framework (EconomicGrasp).
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
To facilitate the research of invisible gas detection, we introduce Gas-DB, an extensive open-source gas detection database including about 1.3K well-annotated RGB-thermal images with eight variant…
Deep Learning model implementation for Fire detection both classification and segmentation from the FLAME dataset.