Reinforcement Learning

An overview of RL published just a few days ago. 144 pages of goodies covering everything from basic RL theory to modern deep RL algorithms and various related niches.

This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement learning and sequential decision making, covering value-based RL, policy-gradient methods, model-based methods, and various other topics (including a very brief discussion of RL+LLMs).

Keynotes from the 2024 Reinforcement Learning Conference (www.youtube.com)

submitted 8 months ago by howrar to c/reinforcement_learning

0 comments fedilink

Recordings for the RLC keynote talks have been released.

Keynote speakers:

David Silver
Doina Precup (Not recorded)
Peter Stone
Finale Doshi-Velez
Sergey Levine
Emma Brunskill
Andrew Barto

OpenAI: Learning to Reason with LLMs (openai.com)

submitted 8 months ago* (last edited 8 months ago) by howrar to c/reinforcement_learning

0 comments fedilink

OpenAI just put out a blog post about a new model trained via RL (I'm assuming this isn't the usual RLHF) to perform chain of thought reasoning before giving the user its answer. As usual, there's very little detail about how this is accomplished so it's hard for me to get excited about it, but the rest of you might find this interesting.

Introducing SIMA, a Scalable Instructable Multiworld Agent (deepmind.google)

submitted 1 year ago by howrar to c/reinforcement_learning

0 comments fedilink