Starred repositories
On-device Speech AI for Apple Silicon
A hybrid programming language combining Lean4's formal verification with blazing-fast compilation, actor-based agent orchestration, AI-driven optimization, and vector-backed agent memory.
🦄 ai that works - every tuesday 10 AM PST
Machine Learning Engineering Open Book
🤗 smolagents: a barebones library for agents that think in code.
Mirage Persistent Kernel: Compiling LLMs into a MegaKernel
Verified tensor graph optimization in Lean 4: constructive soundness proofs + equality saturation + verified extraction via e-graph↔circuit bijection + multi-target code generation.
Verified GPU programming framework for Lean 4. Write type-safe WebGPU shaders with formal verification, hardware-accelerated matrix ops, and cross-platform support (Metal/Vulkan/D3D12). Build prova…
Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.
A lightweight multi-GPU inference engine for LLMs on mid/low-end GPUs.
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
Hardware accelerator for deep convolutional neural networks
Open-source CUDA compiler targeting multiple GPU architectures. Compiles .cu to AMD and Tenstorrent GPUs
Intel® Nervana™ reference deep learning framework committed to best performance on all hardware
An ARC-AGI solution using Agentica from Symbolica
ALMA (Automated meta-Learning of Memory designs for Agentic systems) is a framework that meta-learns memory designs to replace human-engineered designs for agentic systems.
A huge collection of VHDL/Verilog open-source IP cores scraped from the web
[KernelGYM & Dr. Kernel] A distributed GPU environment and a collection of RL training methods to support RL for Kernel Generations
Multi-agent communication extension for pi coding agent
A collection of GPU kernels and other experiments comparing Torch, Triton, etc. to Modular/Mojo