UE MAP670C | Catalogue 2023-2024

Descriptif

Syllabus: This 20-hour course provides an introduction to reinforcement learning. It is based on the new edition of the book "Reinforcement Learning: An Introduction" by R. Sutton and A. Barto. Barto (available online at http://incompleteideas.net/book/the-book-2nd.html).

Outline:

Introduction to reinforcement learning and Markov decision processes
The bandit case
Tabular methods: prediction by dynamic programming, Monte Carlo method and TD Learning
Planning and learning for tabular methods
approximate methods: prediction, planning and learning

Grading: Project based on a research article

Diplôme(s) concerné(s)

M2 Data Science
M2 Optimisation

Parcours de rattachement

M2OPTIM - Semestre 1 - Electifs hors maquette

Format des notes

Numérique sur 20

Littérale/grade réduit

Pour les étudiants du diplôme M2 Optimisation

Pour les étudiants du diplôme M2 Data Science

Le rattrapage est autorisé (Max entre les deux notes)

L'UE est acquise si Note finale >= 10

Crédits ECTS acquis : 3 ECTS

Cours scientifiques - MAP670C : Reinforcement Learning (M2DS)

Domaine > Mathématiques appliquées.