This shows you the differences between two versions of the page.
Both sides previous revision Previous revision | Last revision Both sides next revision | ||
curriculum [2023/04/27 10:31] scornet [Final project (April to September - 30 ECTS)] |
curriculum [2023/06/22 18:40] scornet [Scientific Courses, period 2 (January - March)] |
||
---|---|---|---|
Line 83: | Line 83: | ||
==== Scientific Courses, period 2 (January - March) ==== | ==== Scientific Courses, period 2 (January - March) ==== | ||
- | **MAP/INF641 - Deep Reinforcement Learning (24h, 2 ECTS), Jesse Read** (contact: jesse.read@polytechnique.edu) | + | **INF649 - Deep Reinforcement Learning (24h, 2 ECTS), Jesse Read** (contact: jesse.read@polytechnique.edu) |
Reinforcement learning (RL) is of increasing relevance today, including in games, complex energy systems, recommendation engines, finance, logistics, and for auto-tuning the parameters of other learning frameworks. This course assumes familiarity with the foundations of RL and its main paradigms (temporal-difference learning, Monte Carlo, and policy-gradient methods). We will explore them further, and study modern state-of-the-art variants (such as proximal policy optimization), with a focus on developing RL solutions with deep neural architectures suited to modern applications. We will also take a look at specialized topics such inverse reinforcement learning. | Reinforcement learning (RL) is of increasing relevance today, including in games, complex energy systems, recommendation engines, finance, logistics, and for auto-tuning the parameters of other learning frameworks. This course assumes familiarity with the foundations of RL and its main paradigms (temporal-difference learning, Monte Carlo, and policy-gradient methods). We will explore them further, and study modern state-of-the-art variants (such as proximal policy optimization), with a focus on developing RL solutions with deep neural architectures suited to modern applications. We will also take a look at specialized topics such inverse reinforcement learning. | ||