Repositories starred by sar

C++ 52 1 Updated Apr 13, 2026

The main repository for building Pascal-compatible versions of ML applications and libraries.

Shell 190 31 Updated Aug 23, 2025

A fast high-compression read-only file system for Linux, FreeBSD, macOS and Windows

C++ 2,548 84 Updated Apr 23, 2026

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 3,067 434 Updated Apr 24, 2026

Kimi Code CLI is your next CLI agent.

Python 8,184 928 Updated Apr 24, 2026

Jobs scraper library for LinkedIn, Indeed, Glassdoor, Google, ZipRecruiter & more

Python 3,208 657 Updated Feb 18, 2026

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team.

Python 16,432 1,187 Updated Mar 24, 2026

An open-source AI agent that lives in your terminal.

TypeScript 23,815 2,275 Updated Apr 24, 2026

Flexible I/O Tester

C 6,197 1,399 Updated Apr 22, 2026

A fast JSON parser/generator for C++ with both SAX/DOM style API

C++ 15,037 3,642 Updated Feb 5, 2025

NVIDIA Inference Xfer Library (NIXL)

C++ 1,003 302 Updated Apr 24, 2026

Supercharge Your LLM with the Fastest KV Cache Layer

Python 8,113 1,124 Updated Apr 24, 2026

Distributed reliable key-value store for the most critical data of a distributed system

Go 51,628 10,325 Updated Apr 24, 2026

The etcd-cpp-apiv3 is a C++ library for etcd's v3 client APIs, i.e., ETCDCTL_API=3.

C++ 392 149 Updated Mar 28, 2025

IOR and mdtest

C 476 195 Updated Apr 3, 2026

Magnum IO community repo

C++ 114 19 Updated Mar 23, 2026

NVIDIA GPUDirect Storage Driver

C 344 57 Updated Mar 18, 2026

llama.cpp fork with additional SOTA quants and improved performance

C++ 2,175 275 Updated Apr 24, 2026

Diffusion model (SD, Flux, Wan, Qwen Image, Z-Image, ...) inference in pure C/C++

C++ 5,828 601 Updated Apr 23, 2026

Minimal CLI coding agent by Mistral

Python 3,979 447 Updated Apr 21, 2026

Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs

HTML 926 133 Updated Mar 15, 2026

CUDA Library Samples

Cuda 2,378 457 Updated Apr 20, 2026

Samples for CUDA developers that demonstrate features in the CUDA Toolkit

C 9,112 2,322 Updated Mar 30, 2026

Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.

TypeScript 32,865 2,622 Updated Mar 4, 2026

LLM inference in C/C++

C++ 106,231 17,311 Updated Apr 24, 2026

A lightweight terminal chat interface for the llama.cpp server, written in C++, with many features and Windows/Linux support.

C++ 25 5 Updated Mar 31, 2026

NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes

Go 2,659 492 Updated Apr 24, 2026

RTL, Cmodel, and testbench for NVDLA

Verilog 2,061 644 Updated Mar 2, 2022

Firmware Analysis Tool

Rust 13,877 1,785 Updated Apr 14, 2026

A natural language interface for computers

Python 63,296 5,510 Updated Apr 22, 2026