Skip to content
View hvy's full-sized avatar
🏃‍♂️
Focusing
🏃‍♂️
Focusing

Highlights

  • Pro

Organizations

@pfnet @pfnet-research @chainer @cupy @optuna

Block or report hvy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A minimal PyTorch re-implementation of Qwen 3.5

Python 415 32 Updated Mar 5, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 91,744 14,135 Updated Apr 16, 2026

AIPerf is a comprehensive benchmarking tool that measures the performance of generative AI models served by your preferred inference solution.

Python 253 67 Updated Apr 30, 2026

Inference server benchmarking tool

Rust 154 29 Updated Apr 24, 2026

Preferred Generation Benchmark

Python 94 16 Updated Mar 6, 2026

Renderer for the harmony response format to be used with gpt-oss

Rust 4,350 277 Updated Apr 8, 2026

Nano vLLM

Python 13,192 2,019 Updated Apr 26, 2026
TypeScript 49 Updated May 12, 2025

The code of several works on oimo.io/works

Haxe 1,465 60 Updated Jan 15, 2025

An Intel 8086 Emulator created in Rust.

Rust 429 66 Updated Feb 16, 2024

Collective communications library with various primitives for multi-machine training.

C++ 1,421 355 Updated Apr 21, 2026

Pipeline Parallelism for PyTorch

Python 787 87 Updated Aug 21, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,954 611 Updated May 3, 2024

The registry of the OptunaHub packages

Jupyter Notebook 52 57 Updated Apr 24, 2026

Python library to use and implement packages in OptunaHub

Python 55 15 Updated Apr 3, 2026

DiscoGrad - automatically differentiate across conditional branches in C++ programs

C++ 212 5 Updated Sep 12, 2024

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,881 70 Updated Jun 22, 2025

Development repository for the Triton language and compiler

MLIR 19,080 2,807 Updated Apr 30, 2026

A curated list for Efficient Large Language Models

Python 1,997 163 Updated Jun 17, 2025

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 13,487 896 Updated Dec 17, 2024

LLM inference in C/C++

C++ 107,529 17,589 Updated Apr 30, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,516 2,333 Updated Apr 30, 2026

Code release for NeuS

Python 1,769 223 Updated Feb 28, 2024

Google Research

Jupyter Notebook 37,826 8,396 Updated Apr 30, 2026

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry lead…

Python 549 72 Updated Apr 29, 2026

Inference code for Llama models

Python 59,378 9,818 Updated Jan 26, 2025

Extended functionalities for Optuna in combination with third-party libraries.

Python 68 43 Updated Apr 23, 2026

A curated list of awesome neural radiance fields papers

TeX 6,772 599 Updated Jan 6, 2025

CPU assembly examples

Assembly 90 6 Updated May 19, 2024
Next