Home
L1 MDPs, Exact Solution Methods, Max-ent RL (Foundations of Deep RL Series)
Pieter Abbeel
24 ส.ค. 2021
การดู 51,074 ครั้ง
L2 Deep Q-Learning (Foundations of Deep RL Series)
L3 Policy Gradients and Advantage Estimation (Foundations of Deep RL Series)
Reinforcement Learning Series: Overview of Methods
lofi hip hop radio 📚 - beats to relax/study to
MIT 6.S191 (2023): Reinforcement Learning
L4 TRPO and PPO (Foundations of Deep RL Series)
Reinforcement Learning: Machine Learning Meets Control Theory
Stanford CS234: Reinforcement Learning | Winter 2019 | Lecture 1 - Introduction - Emma Brunskill
Multi-Agent Hide and Seek
An introduction to Policy Gradient methods - Deep Reinforcement Learning
Decision Transformer: Reinforcement Learning via Sequence Modeling (Research Paper Explained)
Overview of Deep Reinforcement Learning Methods
Deep Learning: A Crash Course (2018) | SIGGRAPH Courses
Ilya Sutskever: OpenAI Meta-Learning and Self-Play | MIT Artificial General Intelligence (AGI)
Introduction to Multi-Agent Reinforcement Learning
A friendly introduction to deep reinforcement learning, Q-networks and policy gradients
Python + PyTorch + Pygame Reinforcement Learning – Train an AI to Play Snake
Reinforcement Learning, by the Book
CS 285: Lecture 1, Part 1
DeepMind x UCL RL Lecture Series - Introduction to Reinforcement Learning [1/13]