Skip to content
View YulhwaKim's full-sized avatar

Highlights

  • Pro

Block or report YulhwaKim

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Repository to host and maintain SCALE-Sim code

Python 454 149 Updated Feb 2, 2026

AnalogToBi Project

Python 3 Updated Mar 29, 2026

PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models

1,113 95 Updated Dec 15, 2025

A minimal tensor processing unit (TPU), inspired by Google's TPU V2 and V1

Verilog 1,260 100 Updated Apr 3, 2026

GPU-power-log

Shell 2 Updated Nov 25, 2025

EE 260 Winter 2017: Advanced VLSI Design

Verilog 69 27 Updated Dec 13, 2016

[ACL Findings 2026] Official Implementation of "FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acceleration"

Python 31 5 Updated Apr 14, 2026

[ICML 2024] Official Implementation of SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks

Python 41 6 Updated Feb 4, 2025

QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference

Python 122 5 Updated Mar 6, 2024

A curated list for Efficient Large Language Models

Python 1,998 163 Updated Jun 17, 2025

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (p…

2,360 236 Updated Apr 25, 2026

[ICML'21 Oral] I-BERT: Integer-only BERT Quantization

Python 268 42 Updated Jan 29, 2023

Google AI 2018 BERT pytorch implementation

Python 6,532 1,322 Updated Sep 15, 2023

Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.

Jupyter Notebook 5,689 1,357 Updated Jan 20, 2024

A simulator for RRAM-based neural processor engine.

C++ 36 20 Updated Mar 6, 2018

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Python 8,953 1,522 Updated Apr 27, 2026

C++ extensions in PyTorch

Python 1,186 249 Updated Jan 13, 2026

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,655 1,832 Updated Apr 25, 2026

This is a BNN_Kernel on PyTorch for 1-bit networks in image data processing

C 23 2 Updated Sep 28, 2019

Efficient forward propagation for BCNNs

Cuda 50 15 Updated Jun 12, 2017

IBM Research CUDA Implementation for the H2O version of the LightGBM package (v2.2.4)

C++ 2 4 Updated Jul 29, 2020

Accompanying code for the paper "Zero-shot Knowledge Transfer via Adversarial Belief Matching"

Jupyter Notebook 142 17 Updated Apr 29, 2020

Knowledge Extraction with No Observable Data (NeurIPS 2019)

Python 46 11 Updated Jan 9, 2020

Efficient computing methods developed by Huawei Noah's Ark Lab

Jupyter Notebook 1,304 219 Updated Nov 5, 2024

Training neural networks with back-prop, feedback-alignment and direct feedback-alignment

Lua 105 21 Updated Jan 15, 2018

BNN implementation in tensorflow

Python 165 54 Updated Jun 9, 2018

Examples demonstrating available options to program multiple GPUs in a single node or a cluster

Cuda 883 149 Updated Sep 26, 2025

Models and examples built with TensorFlow

Python 77,667 45,113 Updated Apr 29, 2026

ConvNet training using pytorch

Python 348 87 Updated Feb 4, 2021

Binarized Neural Network (BNN) for pytorch

Python 532 128 Updated Nov 6, 2023