longxudou

Longxu Dou longxudou

Researcher @sail-sg. Working on Agent Training.

142 followers · 205 following

Stars

Turing-Project / Black-Myth-Wukong-AI

AI demo for playing ARPG/Soul-like game with RL frame

Python 391 71 Updated Sep 24, 2024

py499372727 / AgentSims

AgentSims is an easy-to-use infrastructure for researchers from all disciplines to test the specific capacities they are interested in.

Python 941 120 Updated Nov 18, 2023

LARK-AI-Lab / CodeScaler

The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"

Python 32 Updated Mar 26, 2026

terminal-agent / reptile

💻 Terminal-Agent with Human-in-the-Loop Learning

Python 39 2 Updated Jan 16, 2026

Danau5tin / tbench-agentic-data-pipeline

Multi-agent synthetic data generation pipeline capable of generating and validating long horizon terminal/coding tasks for RL training

Python 60 13 Updated Jul 28, 2025

WangRongsheng / Arxiv-Template

The official repository for "Rongsheng Wang's Arxiv Template"

TeX 57 5 Updated May 7, 2025

NovaSky-AI / SkyRL

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,782 307 Updated Apr 24, 2026

Osilly / Awesome-Interleaving-Reasoning

Interleaving Reasoning: Next-Generation Reasoning Systems for AGI

269 11 Updated Oct 17, 2025

sail-sg / Precision-RL

Defeating the Training-Inference Mismatch via FP16

Python 190 17 Updated Nov 14, 2025

sunnweiwei / FoldAgent

Scaling Long-Horizon LLM Agent via Context-Folding

Python 147 11 Updated Jan 26, 2026

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 5,449 741 Updated Apr 23, 2026

sail-sg / tty-use

C 15 Updated Oct 13, 2025

memodb-io / memobase

User Profile-Based Long-Term Memory for AI Chatbot Applications.

Python 2,695 210 Updated Jan 11, 2026

wagoodman / dive

A tool for exploring each layer in a docker image

Go 53,818 1,993 Updated Dec 15, 2025

epoch-research / SWE-bench

Docker image registry for SWE-bench, created by Epoch AI.

Python 16 1 Updated Aug 21, 2025

mlc-ai / xgrammar

Fast, Flexible and Portable Structured Generation

C++ 1,640 142 Updated Apr 17, 2026

vllm-project / aibrix

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,756 562 Updated Apr 23, 2026

Saibo-creator / Awesome-LLM-Constrained-Decoding

A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources.

347 16 Updated Jan 22, 2026

todotxt / todo.txt-cli

☑️ A simple and extensible shell script for managing your todo.txt file.

Shell 6,062 734 Updated Nov 24, 2025

Yuyz0112 / claude-code-reverse

A Tool to Visualize Claude Code's LLM Interactions

JavaScript 2,358 404 Updated Aug 26, 2025

JinjieNi / dlms-are-super-data-learners

The official github repo for "Diffusion Language Models are Super Data Learners".

Python 228 8 Updated Nov 6, 2025

axon-rl / gem

A Gym for Agentic LLMs

Python 477 32 Updated Jan 21, 2026

google-gemini / gemini-cli

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 102,273 13,299 Updated Apr 24, 2026

xlang-ai / OSWorld-G

[NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis

TypeScript 163 7 Updated Nov 6, 2025

sail-sg / understand-r1-zero

Understanding R1-Zero-Like Training: A Critical Perspective

Python 1,246 58 Updated Aug 27, 2025

GAIR-NLP / ProX

[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale

Python 269 17 Updated Jul 8, 2025

SWE-bench / SWE-smith

[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents

Python 630 117 Updated Apr 20, 2026

dylanhogg / llmgraph

Create knowledge graphs with LLMs

Jupyter Notebook 507 33 Updated Oct 11, 2025

tree-sitter / tree-sitter

An incremental parsing system for programming tools

Rust 24,964 2,589 Updated Apr 23, 2026

openai / frontier-evals

OpenAI Frontier Evals

Python 1,179 149 Updated Apr 21, 2026
