Skip to content
View longxudou's full-sized avatar

Organizations

@HIT-SCIR @sail-sg @sea-sailor @terminal-agent

Block or report longxudou

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AI demo for playing ARPG/Soul-like game with RL frame

Python 391 71 Updated Sep 24, 2024

AgentSims is an easy-to-use infrastructure for researchers from all disciplines to test the specific capacities they are interested in.

Python 941 120 Updated Nov 18, 2023

The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"

Python 32 Updated Mar 26, 2026

💻 Terminal-Agent with Human-in-the-Loop Learning

Python 39 2 Updated Jan 16, 2026

Multi-agent synthetic data generation pipeline capable of generating and validating long horizon terminal/coding tasks for RL training

Python 60 13 Updated Jul 28, 2025

The official repository for "Rongsheng Wang's Arxiv Template"

TeX 57 5 Updated May 7, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,782 307 Updated Apr 24, 2026

Interleaving Reasoning: Next-Generation Reasoning Systems for AGI

269 11 Updated Oct 17, 2025

Defeating the Training-Inference Mismatch via FP16

Python 190 17 Updated Nov 14, 2025

Scaling Long-Horizon LLM Agent via Context-Folding

Python 147 11 Updated Jan 26, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,449 741 Updated Apr 23, 2026
C 15 Updated Oct 13, 2025

User Profile-Based Long-Term Memory for AI Chatbot Applications.

Python 2,695 210 Updated Jan 11, 2026

A tool for exploring each layer in a docker image

Go 53,818 1,993 Updated Dec 15, 2025

Docker image registry for SWE-bench, created by Epoch AI.

Python 16 1 Updated Aug 21, 2025

Fast, Flexible and Portable Structured Generation

C++ 1,640 142 Updated Apr 17, 2026

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,756 562 Updated Apr 23, 2026

A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources.

347 16 Updated Jan 22, 2026

☑️ A simple and extensible shell script for managing your todo.txt file.

Shell 6,062 734 Updated Nov 24, 2025

A Tool to Visualize Claude Code's LLM Interactions

JavaScript 2,358 404 Updated Aug 26, 2025

The official github repo for "Diffusion Language Models are Super Data Learners".

Python 228 8 Updated Nov 6, 2025

A Gym for Agentic LLMs

Python 477 32 Updated Jan 21, 2026

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 102,273 13,299 Updated Apr 24, 2026

[NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis

TypeScript 163 7 Updated Nov 6, 2025

Understanding R1-Zero-Like Training: A Critical Perspective

Python 1,246 58 Updated Aug 27, 2025

[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale

Python 269 17 Updated Jul 8, 2025

[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents

Python 630 117 Updated Apr 20, 2026

Create knowledge graphs with LLMs

Jupyter Notebook 507 33 Updated Oct 11, 2025

An incremental parsing system for programming tools

Rust 24,964 2,589 Updated Apr 23, 2026

OpenAI Frontier Evals

Python 1,179 149 Updated Apr 21, 2026
Next