@Dao-AILab

Dao AI Lab

We are an AI research group led by Prof. Tri Dao

Popular repositories

  1. flash-attention (Public)

    Fast and memory-efficient exact attention

    Python, 23.6k stars, 2.7k forks

  2. quack (Public)

    A Quirky Assortment of CuTe Kernels

    Python, 953 stars, 123 forks

  3. causal-conv1d (Public)

    Causal depthwise conv1d in CUDA, with a PyTorch interface

    CUDA, 857 stars, 180 forks

  4. sonic-moe (Public)

    Accelerating MoE with IO and Tile-aware Optimizations

    Python, 664 stars, 80 forks

  5. fast-hadamard-transform (Public)

    Fast Hadamard transform in CUDA, with a PyTorch interface

    C, 310 stars, 60 forks

  6. gram-newton-schulz (Public)

    Fast Polar Decomposition for Muon

    Python, 142 stars, 13 forks

Repositories

Showing 10 of 11 repositories
