💾 Archived View for theo.flounder.online › academic › rl.gmi captured on 2023-05-30 at 20:40:50. Gemini links have been rewritten to link to archived content

View Raw

More Information

⬅️ Previous capture (2023-05-24)

-=-=-=-=-=-=-

Reinforcement Learning

Theodore Tollet

2023-05-11

A collection of resources relevant to the study of reinforcement learning.

lunar lander

An RL agent completes a 2D model moon landing in the lunar lander environment.

source

gallery

[1] These are considered the primary introductions to Reinforcement Learning:

2019 | "Reinforcement Learning: An Introduction" by Sutton & Barto

2015 | David Silver's UCL Lectures

[2] Pieter Abbeel’s 6-part lecture series is intuitively explained, giving each topic’s motivation:

2022 | Foundations of Deep RL

[3] Bertsekas' book provides a comprehensive research focussed approach:

2023 | "A Course in Reinforcement Learning"

[4] Hugging Face provide a well structured & popular contemporary DeepRL course:

2018 | Welcome to the 🤗 Deep Reinforcement Learning Course

[5] Lillian Weng's blog post below is a superb reference for the core concepts:

2018 | A (Long) Peek into RL

Python Tools

Gymnasium (Gym)

Provides the standard interface for reinforcement learning environments, in addition to a collection of common environments.

Stable Baselines3 (SB3)

Reliable PyTorch-based implementations of widely used RL algorithms.

Other

[6] REINFORCEjs: Interactive DP, TD-learning & DQN in your browser!

An unmaintained (since 2022) but useful list of varying quality

12 uses of RL in industry

People

[1]

Richard Sutton

Andrew Barto

David Silver

[2]

Pieter Abbeel

[3]

Dmitri Bertsekas

[4]

Thomas Simonini

Omar Sansevier

Sayak Paul

[5]

Lilian Weng

[6]

Andrej Karpathy

---

Feel free to send comments & corrections to me at:

@ttollet@sigmoid.social

Last updated 2023-05-17