Skip to content
View hvy's full-sized avatar
🏃‍♂️
Focusing
🏃‍♂️
Focusing

Highlights

  • Pro

Organizations

@pfnet @pfnet-research @chainer @cupy @optuna

Block or report hvy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 91,381 14,062 Updated Apr 16, 2026

AIPerf is a comprehensive benchmarking tool that measures the performance of generative AI models served by your preferred inference solution.

Python 247 64 Updated Apr 25, 2026

Inference server benchmarking tool

Rust 152 29 Updated Apr 24, 2026

Preferred Generation Benchmark

Python 94 16 Updated Mar 6, 2026

Renderer for the harmony response format to be used with gpt-oss

Rust 4,340 266 Updated Apr 8, 2026

Nano vLLM

Python 13,121 1,992 Updated Apr 13, 2026
TypeScript 49 Updated May 12, 2025

The code of several works on oimo.io/works

Haxe 1,464 60 Updated Jan 15, 2025

An Intel 8086 Emulator created in Rust.

Rust 429 67 Updated Feb 16, 2024

Collective communications library with various primitives for multi-machine training.

C++ 1,419 355 Updated Apr 21, 2026

Pipeline Parallelism for PyTorch

Python 785 87 Updated Aug 21, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,947 608 Updated May 3, 2024

The registry of the OptunaHub packages

Jupyter Notebook 52 57 Updated Apr 24, 2026

Python library to use and implement packages in OptunaHub

Python 55 15 Updated Apr 3, 2026

DiscoGrad - automatically differentiate across conditional branches in C++ programs

C++ 212 5 Updated Sep 12, 2024

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,881 70 Updated Jun 22, 2025

Development repository for the Triton language and compiler

MLIR 19,046 2,800 Updated Apr 25, 2026

A curated list for Efficient Large Language Models

Python 1,992 164 Updated Jun 17, 2025

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 13,465 894 Updated Dec 17, 2024

LLM inference in C/C++

C++ 106,374 17,325 Updated Apr 24, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,468 2,317 Updated Apr 25, 2026

Code release for NeuS

Python 1,768 223 Updated Feb 28, 2024

Google Research

Jupyter Notebook 37,787 8,396 Updated Apr 24, 2026

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry lead…

Python 549 72 Updated Apr 23, 2026

Inference code for Llama models

Python 59,364 9,820 Updated Jan 26, 2025

Extended functionalities for Optuna in combination with third-party libraries.

Python 68 42 Updated Apr 23, 2026

A curated list of awesome neural radiance fields papers

TeX 6,772 599 Updated Jan 6, 2025

CPU assembly examples

Assembly 90 6 Updated May 19, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,430 774 Updated Apr 21, 2026
Next