This shows you the differences between two versions of the page.
Both sides previous revision Previous revision | Next revision Both sides next revision | ||
curriculum [2018/09/18 14:56] cani [Transvere courses and projects (September 2018 to March 2019)] |
curriculum [2019/03/02 10:16] cani [Scientific Courses, period 2 (January - March 2019)] |
||
---|---|---|---|
Line 31: | Line 31: | ||
==== Scientific Courses, period 2 (January - March 2019) ==== | ==== Scientific Courses, period 2 (January - March 2019) ==== | ||
- | **MAP641 - Reinforcement Learning (48h, 5 ECTS), Odalric-Ambrym Maillard (Inria Lille), Bruno Scherrer (Inria), Olivier Pietquin (Google Brain) ** (contact: odalricambrym.maillard@inria.fr) | + | **MAP641 - Reinforcement Learning (48h, 5 ECTS), Odalric-Ambrym Maillard (Inria Lille), Bruno Scherrer (Inria Nancy), Olivier Pietquin (Google Brain) ** (contact: odalricambrym.maillard@inria.fr) |
Reinforcement learning aims at finding at each step of a process the best action to take in order to minimize some regret function. This course will introduce the general notions of reinforcement learning and will present several online algorithms that can be used in real-time to take actions. The specificity and the performance of the different algorithms will be discussed in detail. | Reinforcement learning aims at finding at each step of a process the best action to take in order to minimize some regret function. This course will introduce the general notions of reinforcement learning and will present several online algorithms that can be used in real-time to take actions. The specificity and the performance of the different algorithms will be discussed in detail. | ||