njhill

Organizations

@netty @kserve @vllm-project @llm-d @Inferact

Early-stage Rust drop-in alternative frontend for vLLM

Rust 5 Updated Apr 24, 2026

Tools for Python coroutines and advanced scheduling for `asyncio`

Python 19 1 Updated Dec 29, 2025

TPU inference for vLLM, with unified JAX and PyTorch support.

Python 300 169 Updated Apr 26, 2026

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 3,079 437 Updated Apr 24, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 78,151 16,082 Updated Apr 26, 2026

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 159,926 33,014 Updated Apr 25, 2026

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 10,671 1,076 Updated Apr 24, 2026

High-performance Netty and Thrift-based microservice RPC library for Java

Java 4 4 Updated Sep 17, 2025

Alternative etcd3 Java client

Java 163 43 Updated Sep 17, 2025

Distributed Model Serving Framework

Java 188 78 Updated Apr 14, 2026

Controller for ModelMesh

Go 244 135 Updated Apr 14, 2026

Abstracted helper classes providing consistent key-value store functionality, with ZooKeeper and etcd3 implementations

Java 5 2 Updated Sep 17, 2025

Fake XRandR configurations for multi-head setups with crappy video drivers, like fakexinerama but with xrandr

Python 274 38 Updated Apr 29, 2024

Java utilities for working with CompletionStages

Java 59 13 Updated Jan 17, 2019
Java 3,821 587 Updated Apr 24, 2026

Netty project - an event-driven asynchronous network application framework

Java 34,929 16,242 Updated Apr 25, 2026

The Java gRPC implementation. HTTP/2 based RPC

Java 12,004 3,985 Updated Apr 23, 2026