🎯
Focusing

Organizations

@kubernetes @open-policy-agent @virtual-kubelet @kubernetes-sigs @coreweave


The best-benchmarked open-source AI memory system. And it's free.

Python 49,518 6,493 Updated Apr 25, 2026

llm-d benchmark scripts and tooling

Python 58 70 Updated Apr 25, 2026

Tooling for optimized, validated, and reproducible GPU-accelerated AI runtime in Kubernetes

Go 275 34 Updated Apr 25, 2026

A cloud-agnostic Kubernetes node autoscaler that dynamically scales infrastructure across Azure and emerging neoclouds like Nebius—managed from a single control plane.

Go 8 1 Updated Apr 24, 2026

Rally your AI squad to GitHub issues and PRs via git worktrees

JavaScript 32 2 Updated Apr 23, 2026
Shell 2 1 Updated Feb 14, 2026

💫 Toolkit to help you get started with Spec-Driven Development

Python 90,763 7,830 Updated Apr 24, 2026

Inspektor Gadget is a set of tools and framework for data collection and system inspection on Kubernetes clusters and Linux hosts using eBPF

C 2,797 340 Updated Apr 24, 2026

✈️ Kubernetes-native platform for deploying and managing AI inference across multiple providers.

TypeScript 76 23 Updated Apr 24, 2026

Discover ingress-nginx usage and auto-generate Gateway API migration plans before ingress-nginx reaches end-of-life (March 2026).

Go 16 Updated Nov 26, 2025

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).

Python 9,894 1,045 Updated Apr 25, 2026

The best ChatGPT that $100 can buy.

Python 52,472 6,998 Updated Apr 14, 2026

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Python 3,361 248 Updated Feb 8, 2026

A sample pack of GitHub Agentic Workflows!

Makefile 644 99 Updated Apr 24, 2026
2 Updated Sep 26, 2025

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 3,072 435 Updated Apr 24, 2026

Wassette: A security-oriented runtime that runs WebAssembly Components via MCP

Rust 880 60 Updated Apr 23, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 20,043 2,065 Updated Mar 27, 2026

Next Generation Agentic Proxy for AI Agents and MCP servers

Rust 2,490 417 Updated Apr 24, 2026

LLM inference in C/C++

C++ 106,404 17,329 Updated Apr 25, 2026

Home of the out-of-tree KAITO plugin for Headlamp Kubernetes UI

TypeScript 7 2 Updated Aug 8, 2025

The Security Toolkit for LLM Interactions

Python 2,866 379 Updated Dec 15, 2025

Set of tools to assess and improve LLM security.

Python 4,139 724 Updated Apr 24, 2026

A comprehensive social media management tool designed to help you create, format, and post content across multiple platforms including LinkedIn, Twitter/X, Bluesky, and Mastodon. Features advanced …

TypeScript 92 15 Updated Jan 15, 2026

Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton

Go 434 74 Updated Apr 25, 2026

CRIU based GPU workload migration in Kubernetes

Go 22 6 Updated Apr 22, 2025

OPA Gatekeeper provider for GitHub Artifact Attestations

Go 22 8 Updated Apr 24, 2026
Shell 5 Updated Apr 24, 2026

This repository contains examples and best practices for AI workloads on Azure

Shell 30 14 Updated Apr 22, 2026

Main reference implementation for NLWeb, implemented in Python.

Python 6,191 687 Updated Apr 13, 2026