Stars

RL

Reinforcement Learning
45 repositories
Python 33 4 Updated Nov 21, 2022

Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)

Python 147 27 Updated Jan 12, 2019

Curiosity-driven Exploration by Self-supervised Prediction

Python 147 32 Updated Mar 12, 2023

PyTorch implementation of deep reinforcement learning algorithms

Python 489 58 Updated Nov 19, 2021

Reinforcement Learning Algorithms Based on PyTorch

Python 452 93 Updated Oct 21, 2021

Series of deep reinforcement learning algorithms 🤖

Jupyter Notebook 29 12 Updated Jun 19, 2021
C++ 43 6 Updated Feb 8, 2026

(JAIR'2022) A mini-scale reproduction code of the AlphaStar program. Note: the original AlphaStar is the AI proposed by DeepMind to play StarCraft II. JAIR = Journal of Artificial Intelligence Rese…

Python 365 59 Updated Nov 9, 2022

StarCraft II Learning Environment

Python 8,276 1,162 Updated Jul 23, 2024

An offline deep reinforcement learning library

Python 1,660 265 Updated Sep 10, 2025

Multi-Agent Reinforcement Learning (MARL) papers

296 40 Updated Sep 19, 2022

A library of reinforcement learning components and agents

Python 3,975 535 Updated Apr 8, 2026

Represented Value Function Approach for Large Scale Multi Agent Reinforcement Learning

Python 17 5 Updated Mar 11, 2020

Learn to Steer through Deep Reinforcement Learning

Python 5 1 Updated Aug 22, 2019

Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019

Python 803 180 Updated May 29, 2022

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Python 3,608 1,028 Updated Apr 24, 2024

Massively Parallel Deep Reinforcement Learning. 🔥

Python 4,317 970 Updated Feb 20, 2026

A modular high-level library to train embodied AI agents across a variety of tasks and environments.

Python 2,957 654 Updated Feb 21, 2026

A pack of reinforcement learning algorithms.

Python 84 13 Updated Oct 26, 2021

This project is an implementation of AlphaStar

Python 207 28 Updated Jan 19, 2024

(AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning

Python 121 22 Updated Feb 3, 2023

This repository provides simulator code for predicting and tracking popular discussion threads on Reddit

Python 21 3 Updated Sep 10, 2016

Multi-Objective Reinforcement Learning

Python 301 57 Updated Aug 10, 2021
Python 1,054 309 Updated Jan 29, 2023
Python 161 47 Updated May 3, 2019

CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning

Python 603 92 Updated Oct 28, 2020

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Python 4,616 899 Updated Mar 24, 2023
Python 4 Updated Mar 3, 2021

Environments for OR and RL Research

Python 442 98 Updated Oct 12, 2023