Skip to content
View woodx9's full-sized avatar
🙂
🙂

Block or report woodx9

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The official Go library for the OpenAI API

Go 3,172 309 Updated Apr 23, 2026

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Go 43,929 3,972 Updated Apr 23, 2026

Based on the RV32I ISA, aiming to implement the complete functions of the CPU without considering synthesis, timing, and latency.

Verilog 2 Updated Jun 20, 2025

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

Python 797 212 Updated Apr 2, 2026

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 3,142 490 Updated Apr 22, 2026

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 1,059 86 Updated Sep 4, 2024

从零构建大模型:从预训练到RLHF的完整实践

Python 2,621 203 Updated Mar 19, 2026

Algorithm powering the For You feed on X

Rust 16,366 2,824 Updated Jan 20, 2026

a embedding infer server faster than vllm and sglang

Python 17 1 Updated Feb 10, 2026
Python 1,275 132 Updated Feb 28, 2026

Nano vLLM

Python 13,083 1,979 Updated Apr 13, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,050 593 Updated Mar 13, 2026

FlashInfer: Kernel Library for LLM Serving

Python 5,479 919 Updated Apr 22, 2026

Fast and memory-efficient exact attention

Python 23,487 2,638 Updated Apr 22, 2026
Python 61 9 Updated Jun 19, 2024

LeetGPU Solutions

Python 114 5 Updated Oct 9, 2025

leetTriton

Python 2 Updated Sep 9, 2025

Getting Started with Triton: A Tutorial for Python Beginners

HTML 52 5 Updated Mar 26, 2026

A powerful MCP toolkit for coding, providing semantic retrieval and editing capabilities - the IDE for your agent

Python 23,297 1,559 Updated Apr 23, 2026

Kode Agent — Design for post-human workflows. One unit agent for every human & computer task.

TypeScript 4,960 749 Updated Jan 23, 2026

Merge superpoint、lightglue、MixVPR into VINS-FUSION for loop closure with TensorRT

C++ 153 20 Updated Nov 12, 2024

Code search MCP for Claude Code. Make entire codebase the context for any coding agent.

TypeScript 7,914 641 Updated Apr 23, 2026

Extract and compare system prompts and tools from different Claude Code versions

TypeScript 350 31 Updated Oct 30, 2025

WPF+litegraph.js+Webview实现的混合图节点编辑器

JavaScript 26 4 Updated May 2, 2025

Build a Claude Code–like CLI coding agent from scratch.

Python 140 25 Updated Jan 22, 2026

Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.

TypeScript 32,759 2,612 Updated Mar 4, 2026

a great vscode extension

TypeScript 1 Updated Aug 6, 2025

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript 60,641 6,237 Updated Apr 23, 2026

Kilo is the all-in-one agentic engineering platform. Build, ship, and iterate faster with the most popular open source coding agent.

TypeScript 18,461 2,422 Updated Apr 23, 2026

A modular, documentation-driven framework using Cursor custom modes (VAN, PLAN, CREATIVE, IMPLEMENT) to provide persistent memory and guide AI through a structured development workflow with visual …

3,031 444 Updated Jan 7, 2026
Next