-
Sea AI Lab
- Singapore
- https://longxudou.github.io/
- in/longxu-dou-6b167410a
- @LongxuDou
Stars
AI demo for playing ARPG/Soul-like game with RL frame
AgentSims is an easy-to-use infrastructure for researchers from all disciplines to test the specific capacities they are interested in.
The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"
💻 Terminal-Agent with Human-in-the-Loop Learning
Multi-agent synthetic data generation pipeline capable of generating and validating long horizon terminal/coding tasks for RL training
The official repository for "Rongsheng Wang's Arxiv Template"
SkyRL: A Modular Full-stack RL Library for LLMs
Interleaving Reasoning: Next-Generation Reasoning Systems for AGI
Defeating the Training-Inference Mismatch via FP16
Scaling Long-Horizon LLM Agent via Context-Folding
slime is an LLM post-training framework for RL Scaling.
User Profile-Based Long-Term Memory for AI Chatbot Applications.
A tool for exploring each layer in a docker image
Docker image registry for SWE-bench, created by Epoch AI.
Fast, Flexible and Portable Structured Generation
Cost-efficient and pluggable Infrastructure components for GenAI inference
A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources.
☑️ A simple and extensible shell script for managing your todo.txt file.
A Tool to Visualize Claude Code's LLM Interactions
The official github repo for "Diffusion Language Models are Super Data Learners".
An open-source AI agent that brings the power of Gemini directly into your terminal.
[NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis
Understanding R1-Zero-Like Training: A Critical Perspective
[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale
[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents
An incremental parsing system for programming tools