Skip to content
View hitcoogle's full-sized avatar

Block or report hitcoogle

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The agent that grows with you

Python 116,606 17,193 Updated Apr 25, 2026
Python 6,540 875 Updated Apr 25, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 363,872 74,502 Updated Apr 25, 2026

DFlash: Block Diffusion for Flash Speculative Decoding

Python 2,282 159 Updated Apr 25, 2026

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

Python 2,721 318 Updated Apr 24, 2026

[NeurIPS 2025] Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration

Python 120 5 Updated Dec 3, 2025

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,136 169 Updated Apr 25, 2026

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 2,029 132 Updated Apr 24, 2026

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 4,333 323 Updated Jan 14, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,467 747 Updated Apr 25, 2026

STEP-GUI: The top GUI agent solution in the galaxy. Developed by the StepFun-GELab team and powered by StepFun’s cutting-edge research capabilities.

Python 2,134 185 Updated Mar 14, 2026

A framework for efficient model inference with omni-modality models

Python 4,488 835 Updated Apr 25, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,604 8,630 Updated Apr 23, 2026

All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.

Python 4,428 379 Updated Apr 10, 2026

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 18,744 1,443 Updated Feb 27, 2026

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 946 83 Updated Feb 28, 2026

Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".

Python 16 1 Updated Sep 15, 2024

Empowering everyone to build reliable and efficient software.

Rust 112,328 14,806 Updated Apr 25, 2026

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 7,811 485 Updated Feb 10, 2026

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

Python 800 214 Updated Apr 2, 2026

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 2,302 392 Updated Apr 23, 2026

Supercharge Your LLM with the Fastest KV Cache Layer

Python 8,122 1,131 Updated Apr 25, 2026

Efficient Triton Kernels for LLM Training

Python 6,303 520 Updated Apr 24, 2026

Fast and memory-efficient exact attention

Python 31 1 Updated Dec 2, 2024

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 5,746 523 Updated Apr 25, 2026

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Python 387 49 Updated Apr 22, 2025

Muon is Scalable for LLM Training

1,462 85 Updated Aug 3, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

10,681 819 Updated Jan 21, 2026
Next