A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (p…

2,360 236 Updated Apr 25, 2026

kssteven418 / I-BERT

[ICML'21 Oral] I-BERT: Integer-only BERT Quantization

Python 268 42 Updated Jan 29, 2023

codertimo / BERT-pytorch

Google AI 2018 BERT pytorch implementation

Python 6,532 1,322 Updated Sep 15, 2023

bentrevett / pytorch-seq2seq

Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.

Jupyter Notebook 5,689 1,357 Updated Jan 20, 2024

thuime / XPEsim

A simulator for RRAM-based neural processor engine.

C++ 36 20 Updated Mar 6, 2018

NVIDIA / apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Python 8,953 1,522 Updated Apr 27, 2026

pytorch / extension-cpp

C++ extensions in PyTorch

Python 1,186 249 Updated Jan 13, 2026

NVIDIA / cutlass

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,655 1,832 Updated Apr 25, 2026

brycexu / BNN_Kernel

This is a BNN_Kernel on PyTorch for 1-bit networks in image data processing

C 23 2 Updated Sep 28, 2019

fpeder / espresso

Efficient forward propagation for BCNNs

Cuda 50 15 Updated Jun 12, 2017

bordaw / H2O-LightGBM-CUDA

IBM Research CUDA Implementation for the H2O version of the LightGBM package (v2.2.4)

C++ 2 4 Updated Jul 29, 2020

polo5 / ZeroShotKnowledgeTransfer

Accompanying code for the paper "Zero-shot Knowledge Transfer via Adversarial Belief Matching"

Jupyter Notebook 142 17 Updated Apr 29, 2020

snudatalab / KegNet

Knowledge Extraction with No Observable Data (NeurIPS 2019)

Python 46 11 Updated Jan 9, 2020

huawei-noah / Efficient-Computing

Efficient computing methods developed by Huawei Noah's Ark Lab

Jupyter Notebook 1,304 219 Updated Nov 5, 2024

anokland / dfa-torch

Training neural networks with back-prop, feedback-alignment and direct feedback-alignment

Lua 105 21 Updated Jan 15, 2018

itayhubara / BinaryNet.tf

BNN implementation in tensorflow

Python 165 54 Updated Jun 9, 2018

NVIDIA / multi-gpu-programming-models

Examples demonstrating available options to program multiple GPUs in a single node or a cluster

Cuda 883 149 Updated Sep 26, 2025

tensorflow / models

Models and examples built with TensorFlow

Python 77,667 45,113 Updated Apr 29, 2026

eladhoffer / convNet.pytorch

ConvNet training using pytorch

Python 348 87 Updated Feb 4, 2021

itayhubara / BinaryNet.pytorch

Binarized Neural Network (BNN) for pytorch

Python 532 128 Updated Nov 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Yulhwa Kim YulhwaKim

Achievements

Achievements

Highlights

Block or report YulhwaKim

Stars

scalesim-project / SCALE-Sim

Seungmin0825 / AnalogToBi

NVIDIA / audio-flamingo

tiny-tpu-v2 / tiny-tpu

seongjunpark17 / GPU-power-log

sheldonucr / ee260_lab

dongwonjo / FastKV

jiwonsong-dev / SLEB

SqueezeBits / QUICK

horseee / Awesome-Efficient-LLM

Efficient-ML / Awesome-Model-Quantization