Organizations

@thu-ml

Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.

Python 56 2 Updated Mar 12, 2026

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse–Linear Attention

Python 302 18 Updated Feb 24, 2026

Official repo for vidar and vidarc: video foundation model for robotics.

Python 40 1 Updated Dec 22, 2025

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,472 250 Updated Apr 15, 2026

A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention

293 5 Updated Dec 1, 2025

The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Jupyter Notebook 939 57 Updated Dec 20, 2025

[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.

Cuda 986 91 Updated Feb 25, 2026

PyTorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training

Python 39 4 Updated Jun 20, 2025

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

17,123 1,559 Updated Feb 13, 2023

Official implementation for "Pruning Large Language Models with Semi-Structural Adaptive Sparse Training" (AAAI 2025)

Python 19 2 Updated Jul 1, 2025

[ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.

Python 113 11 Updated Dec 20, 2024

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 6,812 395 Updated Mar 27, 2026

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 3,327 400 Updated Jan 17, 2026

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 17,126 3,402 Updated Apr 26, 2026

Ongoing research training transformer models at scale

Python 16,157 3,874 Updated Apr 26, 2026

Triton-based implementation of Sparse Mixture of Experts.

Python 273 28 Updated Oct 3, 2025

Development repository for the Triton language and compiler

MLIR 19,055 2,803 Updated Apr 25, 2026

[TMLR 2024] Efficient Large Language Models: A Survey

1,258 98 Updated Jun 23, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 3,295 710 Updated Apr 24, 2026

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 42,197 4,806 Updated Apr 24, 2026

Official code for "Efficient Backpropagation with Variance Controlled Adaptive Sampling" (ICLR 2024)

Python 8 2 Updated Mar 8, 2024

Fast and memory-efficient exact attention

Python 23,538 2,644 Updated Apr 25, 2026

Low-bit optimizers for PyTorch

Python 138 9 Updated Oct 9, 2023

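A visual no-code/code-free web crawler/spider. 易采集: a visual tool for browser automation testing, data collection, and web crawling that lets you design and run crawler tasks graphically without writing code. Also known as ServiceWrapper, an intelligent service-wrapping system for web applications.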

JavaScript 43,733 5,321 Updated Apr 22, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 57,185 9,801 Updated Nov 12, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 99,442 27,594 Updated Apr 26, 2026

LaTeX Thesis Template for Tsinghua University

TeX 5,279 1,144 Updated Apr 4, 2026

The JavaScript library that provides a program-friendly interface to Tsinghua web portal

TypeScript 28 5 Updated Sep 24, 2023

Guidance for courses in the Department of Computer Science and Technology, Tsinghua University

HTML 36,977 7,845 Updated Apr 13, 2026