Scientific courses - MAP670C: Reinforcement Learning (M2DS)

Field > Applied Mathematics


Syllabus: This 20-hour course provides an introduction to reinforcement learning. It is based on the second edition of the book "Reinforcement Learning: An Introduction" by R. Sutton and A. Barto (available online at http://incompleteideas.net/book/the-book-2nd.html).


  1. Introduction to reinforcement learning and Markov decision processes
  2. The bandit case
  3. Tabular methods: prediction by dynamic programming, Monte Carlo methods, and TD learning
  4. Planning and learning for tabular methods
  5. Approximate methods: prediction, planning and learning


Grading: Project based on a research article


Relevant degree program(s)

Grade format

Numerical, out of 20

Letter grade / reduced grade

For students in the M2 DS - Data Science program

A resit is allowed (the final grade is the higher of the two grades)
    The course unit (UE) is passed if the final grade >= 10
    • ECTS credits earned: 3 ECTS