Skip to content
View wooksu's full-sized avatar

Organizations

@nota-github

Block or report wooksu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,661 8,634 Updated Apr 27, 2026

Official implementation for "PIO-FVLM: Rethinking Training-Free Visual Token Reduction for VLM Acceleration from an Inference-Objective Perspective"

Python 111 8 Updated Apr 22, 2026

omo; the best agent harness - previously oh-my-opencode

TypeScript 54,381 4,419 Updated Apr 27, 2026

A simple yet powerful agent framework that delivers with open-source models

Python 4,524 465 Updated Mar 21, 2026

ERGO (Efficient Reasoning & Guided Observation) is a large vision-language model trained with reinforcement learning on efficiency objectives. [ICLR'26]

Python 18 1 Updated Feb 25, 2026

[ICLR'26] Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

Python 82 2 Updated Jan 26, 2026
Python 41 1 Updated Jul 14, 2025

Nano vLLM

Python 13,142 2,002 Updated Apr 26, 2026

[NeurIPS 2025] Official code for paper: Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs.

Python 99 5 Updated Sep 20, 2025

Official code for NeurIPS 2025 paper "GRIT: Teaching MLLMs to Think with Images"

Python 185 11 Updated Jan 16, 2026
Python 1,199 74 Updated Nov 20, 2025

Open-source unified multimodal model

Python 5,869 520 Updated Oct 27, 2025

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

1,406 62 Updated Apr 19, 2026

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,887 367 Updated Apr 6, 2026

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 2,260 155 Updated Apr 13, 2026

Solve Visual Understanding with Reinforced VLMs

Python 5,946 377 Updated Mar 12, 2026

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’

Jupyter Notebook 2,245 106 Updated Oct 29, 2025

Witness the aha moment of VLM with less than $3.

Python 4,055 285 Updated May 19, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 78,274 16,141 Updated Apr 27, 2026

A paper list of some recent works about Token Compress for Vit and VLM

890 41 Updated Apr 14, 2026

A Compressed Stable Diffusion for Efficient Text-to-Image Generation [ECCV'24]

Python 315 20 Updated Jul 6, 2024

A 28× Compressed Wav2Lip for Efficient Talking Face Generation [ICCV'23 Demo] [MLSys'23 Workshop] [NVIDIA GTC'23]

Python 60 6 Updated Mar 8, 2024

Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]

Python 91 13 Updated Sep 13, 2024

The official NetsPresso Python package.

Jupyter Notebook 48 1 Updated Nov 20, 2025

A library for training, compressing and deploying computer vision models (including ViT) with edge devices

Python 74 12 Updated Sep 29, 2025

Repository for 2023 AI City Challenge (Track1: Multi-Camera People Tracking)

Python 38 6 Updated Oct 7, 2024

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 159,973 33,026 Updated Apr 27, 2026

Polynomial Learning Rate Decay Scheduler for PyTorch

Python 65 13 Updated Dec 25, 2021

Learning Rate Warmup in PyTorch

Python 415 23 Updated Jun 19, 2025

An easy to use PyTorch to TensorRT converter

Python 4,865 700 Updated Aug 17, 2024
Next