Stars
Search-based optimizer for MLX/Metal on Apple Silicon.
Hugging Face native LLM inference on Apple Silicon via direct Metal
vLLM Metal plugin powered by mlx-swift — high-performance LLM inference on Apple Silicon
Apple Silicon (MLX) port of Karpathy's autoresearch — autonomous AI research loops on Mac, no PyTorch required.
TriAttention — Efficient long reasoning with trigonometric KV cache compression. Enables OpenClaw local deployment on memory-constrained GPUs.
⚡ Native MLX Swift LLM inference server for Apple Silicon. OpenAI-compatible API (see the example sketch after this list), SSD streaming for 100B+ MoE models, TurboQuant KV cache compression, macOS + iOS app.
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
TheTom / llama-cpp-turboquant
Forked from ggml-org/llama.cpp
LLM inference in C/C++
llama.cpp TurboQuant implementation with CUDA support
miolini / autoresearch-macos
Forked from karpathy/autoresearch
AI agents automatically running research on single-GPU nanochat training, adapted for macOS
I'm crazy and trying to make a FORScan OBD reader work on my Mac.
The missing DevTools for Claude Code — inspect session logs, tool calls, token usage, subagents, and context window in a visual UI. Free, open source.
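The MLX Swift inference server above advertises an OpenAI-compatible API, which means any standard OpenAI client should be able to talk to it. Below is a minimal Python sketch of that pattern; the port, base URL, and model id are assumptions for illustration, not values taken from the server's docs.

# Minimal sketch: calling a local OpenAI-compatible server with the
# standard openai Python client. Endpoint and model id are assumed.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8080/v1",  # assumed local endpoint
    api_key="not-needed",                 # local servers typically ignore the key
)

response = client.chat.completions.create(
    model="mlx-community/Qwen2.5-7B-Instruct-4bit",  # hypothetical model id
    messages=[{"role": "user", "content": "Hello from Apple Silicon!"}],
    stream=True,  # stream tokens as they are generated
)

for chunk in response:
    delta = chunk.choices[0].delta.content
    if delta:
        print(delta, end="", flush=True)
print()

Because the wire format matches OpenAI's, the same snippet works against any of the OpenAI-compatible servers in this list (e.g. LocalAI) by changing only the base_url and model name.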