  • Kim Baksa's Lab, South Korea

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,795 310 Updated Apr 29, 2026

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 5,185 371 Updated Apr 20, 2026

Easy, Fast, and Scalable Multimodal AI

Python 124 9 Updated Apr 17, 2026

Nvidia Instruction Set Specification Generator

Python 321 20 Updated Jul 9, 2024

Cataloging released Triton kernels.

302 16 Updated Sep 9, 2025

Learning Deep Representations of Data Distributions

TeX 956 95 Updated Apr 17, 2026

Small scale distributed training of sequential deep learning models, built on Numpy and MPI.

Python 165 10 Updated Oct 19, 2023

Python pdb for multiple processes

Python 82 9 Updated May 24, 2025

Large-scale LLM inference engine

C++ 1,714 194 Updated Apr 28, 2026

Memray is a memory profiler for Python

Python 15,000 438 Updated Apr 28, 2026
Python 7 Updated Jul 26, 2025

Triton Support in Compiler Explorer

TypeScript 5 Updated Aug 5, 2025

Run compilers interactively from your web browser and interact with the assembly

TypeScript 18,723 2,022 Updated Apr 25, 2026

This repo provides implementations of several classic attention variants based on the FlexAttention API.

Python 2 1 Updated May 18, 2025

A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

Python 381 77 Updated Apr 29, 2026

OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.

C 7,412 1,671 Updated Apr 27, 2026

Hacker News

HTML 15 6 Updated Apr 29, 2026

Distribute and run LLMs with a single file.

C++ 1 Updated Jul 23, 2024

Distribute and run LLMs with a single file.

C++ 24,316 1,335 Updated Apr 29, 2026

CUDA on non-NVIDIA GPUs

Rust 14,160 902 Updated Apr 28, 2026

GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm

C 10,584 391 Updated Apr 29, 2026

A .NET MAUI app for displaying the top posts on Hacker News that demonstrates text sentiment analysis gathered using artificial intelligence

C# 279 40 Updated Apr 24, 2026

A curated list of awesome C frameworks, libraries, resources and other shiny things. Inspired by all the other awesome-... projects out there.

11,257 929 Updated Dec 27, 2025

Local AI voice assistant stack for Home Assistant (GPU-accelerated) with persistent memory, follow-up conversation, and Ollama model recommendations - settings designed for low VRAM systems.

237 22 Updated Jul 27, 2025

Debug Module for Embedded Systems

C 1 Updated Mar 20, 2026

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 1 Updated Jul 6, 2025

📝 A curated list of awesome Raspberry Pi tools, projects, images and resources

Shell 16,304 1,114 Updated Apr 15, 2026

Inference Llama 2 in one file of pure C & one file with CUDA

C 33 1 Updated Oct 14, 2023

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 19,288 1,682 Updated Nov 19, 2025