axrshz

adarsh axrshz

technology brother

India

Stars

Xiaohao-Liu / Awesome-Multi-Token-Prediction

A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Speech-Language Models (SLMs), and more.

82 3 Updated Feb 7, 2026

RiddleHe / nanochat

Forked from karpathy/nanochat

The best ChatGPT that $100 can buy.

Python 33 Updated Apr 23, 2026

huggingface / trl

Train transformer language models with reinforcement learning.

Python 18,161 2,672 Updated Apr 24, 2026

Dao-AILab / quack

A Quirky Assortment of CuTe Kernels

Python 946 119 Updated Apr 24, 2026

hugobowne / build-your-own-deep-research-agent

Python 86 16 Updated Mar 28, 2026

avelino / awesome-go

A curated list of awesome Go frameworks, libraries and software

Go 170,923 13,176 Updated Apr 23, 2026

DravenALG / awesome-vla-wam

A Curated List of Vision-Language-Action (VLA) and World Action Models (WAM) Research and Beyond

328 4 Updated Apr 23, 2026

PufferAI / PufferLib

Puffing up reinforcement learning

C 5,620 443 Updated Apr 23, 2026

sgl-project / sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 26,395 5,543 Updated Apr 25, 2026

Proximal-Labs / frontier-swe

FrontierSWE is an ultra long-horizon coding agent benchmark that tests implementation, performance eng and ML research

C 84 3 Updated Apr 23, 2026

vivekvkashyap / simpleGRPO

Python 1 Updated Apr 16, 2026

stanford-iris-lab / meta-harness

Reference code for the Meta-Harness paper.

Python 643 48 Updated Apr 16, 2026

anakin87 / llm-rl-environments-lil-course

🌱 A little course on Reinforcement Learning Environments for evaluating and training Language Models

Python 182 15 Updated Apr 23, 2026

unslothai / notebooks

250+ Fine-tuning & RL Notebooks for text, vision, audio, embedding, TTS models.

Jupyter Notebook 5,272 860 Updated Apr 24, 2026

tensara / tensara

Competitive GPU kernel optimization platform.

TypeScript 189 19 Updated Apr 24, 2026

fla-org / flash-linear-attention

🚀 Efficient implementations for emerging model architectures

Python 4,975 510 Updated Apr 22, 2026

jmaczan / tiny-vllm

Build your own high performance LLM inference engine in C++ and CUDA - a smaller version of vLLM

C++ 115 7 Updated Apr 14, 2026

cottus-ai / cottus-runtime

C++ 3 Updated Jan 17, 2026

tspeterkim / flash-attention-minimal

Flash Attention in ~100 lines of CUDA (forward pass only)

Cuda 1,124 111 Updated Dec 30, 2024

keon / awesome-physical-ai

A curated list of academic papers and resources on Physical AI — focusing on Vision-Language-Action (VLA) models, world models, embodied ai, and robotic foundation models.

219 20 Updated Mar 30, 2026

HamzaElshafie / gpt-oss-20B

A PyTorch implementation of the GPT-OSS-20B architecture. All components are coded from scratch: RoPE with YaRN, RMSNorm, SwiGLU with clamping and residual connection, Mixture-of-Experts (MoE), Sel…

Python 231 16 Updated Dec 2, 2025

AmberLJC / LLMSys-PaperList

Large Language Model (LLM) Systems Paper List

1,937 101 Updated Apr 17, 2026

mehdihadeli / awesome-software-architecture

📚 A curated list of awesome articles, videos, and other resources to learn and practice software architecture, patterns, and principles.

C# 10,980 954 Updated Feb 1, 2026

theanalyst / awesome-distributed-systems

A curated list to learn about distributed systems

11,785 1,538 Updated Jan 10, 2025

m0at / rvllm

rvLLM: High-performance LLM inference in Rust. Drop-in vLLM replacement.

Rust 685 63 Updated Apr 25, 2026

srush / GPU-Puzzles

Solve puzzles. Learn CUDA.

Jupyter Notebook 12,065 931 Updated Sep 1, 2024

binhnguyennus / awesome-scalability

The Patterns of Scalable, Reliable, and Performant Large-Scale Systems

70,577 6,974 Updated Jan 4, 2026

leofan90 / Awesome-World-Models

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

1,559 51 Updated Apr 24, 2026

goabiaryan / awesome-gpu-engineering

GPU Engineering for AI Systems

HTML 299 35 Updated Apr 21, 2026

aerlabsAI / ai-inference-resources

Curated collection of AI inference engineering resources — LLM serving, GPU kernels, quantization, distributed inference, and production deployment. Compiled from the AER Labs community.

101 9 Updated Feb 4, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly