Skip to content
View kykim0's full-sized avatar

Organizations

@sisl @JuliaPOMDP @StanfordVL

Block or report kykim0

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

OpenClaw-RL: Train any agent simply by talking

Python 5,163 551 Updated Apr 28, 2026

The official repository of "A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications".

219 9 Updated Apr 25, 2026

Self-referential self-improving agents that can optimize for any computable task

Python 2,418 306 Updated Apr 26, 2026

Research on Coding Agents

11,760 19,746 Updated Apr 1, 2026

Curated academic CV templates and guidelines for PhD students, researchers, and faculty job applicants.

TeX 1,090 122 Updated Apr 1, 2026

AI agents running research on single-GPU nanochat training automatically

Python 77,606 11,319 Updated Mar 26, 2026

LLM Chess - evaluating Large Language Models' reasoning and instruction-following abilities by simulating chess games

Python 99 9 Updated Apr 28, 2026

A collection of various llm pruning implementations, training code for GPUs & TPUs, and evaluation script.

Python 63 8 Updated Apr 20, 2026

CATArena is an engineering-level tournament evaluation platform for Large Language Model-driven code agents (LLM-driven code agents), based on an iterative competitive peer learning framework.

Python 64 10 Updated Dec 25, 2025

"AI-Trader: 100% Fully-Automated Agent-Native Trading"

Python 13,796 2,342 Updated Apr 24, 2026

Synthetic data curation for post-training and structured data extraction

Python 1,671 139 Updated Apr 18, 2026

Benchmark LLM reasoning capability by solving chess puzzles.

Python 91 5 Updated Apr 26, 2025

Training VLM agents with multi-turn reinforcement learning

Python 453 55 Updated Apr 17, 2026

Harsh Jhamtani*, Varun Gangal*, Eduard Hovy, Graham Neubig, Taylor Berg-Kirkpatrick. Learning to Generate Move-by-Move Commentary for Chess Games from Large-Scale Social Forum Data. ACL 2018

OpenEdge ABL 47 11 Updated Jul 21, 2022

Open source neural network chess engine with GPU acceleration and broad hardware support.

C++ 3,063 586 Updated Apr 20, 2026

A Text-Based Environment for Interactive Debugging

Python 298 39 Updated Apr 14, 2026

This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language Models".

Python 187 6 Updated Apr 18, 2026

Fully open reproduction of DeepSeek-R1

Python 26,012 2,421 Updated Apr 2, 2026

[ICLR 2026] Learning to Reason without External Rewards

Python 408 43 Updated Jan 26, 2026

A library for generative social simulation

Python 1,389 312 Updated Apr 27, 2026

AI paper trading project inspired by nof1 Alpha Arena, using cctx for quotation.

Python 583 146 Updated Nov 21, 2025

Procgen Benchmark: Procedurally-Generated Game-Like Gym-Environments

C++ 1,156 219 Updated Mar 27, 2026

Defeating the Training-Inference Mismatch via FP16

Python 192 17 Updated Nov 14, 2025

Natural Language Reinforcement Learning

Python 102 7 Updated Jul 30, 2025
Python 15 3 Updated Jul 10, 2025

Post-training with Tinker

Python 3,179 402 Updated Apr 29, 2026

A library for mechanistic interpretability of GPT-style language models

Python 3,369 558 Updated Apr 29, 2026

MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs

Python 45 3 Updated Apr 17, 2026

[ICLR 2026] Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play.

Python 126 4 Updated Feb 6, 2026
Next