A reinforcement learning process in extensive form games

Jean-François Laslier; Bernard Walliser

doi:10.1007/s001820400194

Journal article, International Journal of Game Theory, Year: 2005


Jean-François Laslier

Role: Author
PersonId : 10499
IdHAL : jean-francois-laslier
ORCID : 0000-0001-8334-1350
IdRef : 069975124

Laboratoire d'économétrie de l'École polytechnique

Bernard Walliser

Role: Author
PersonId : 856358

Centre d'enseignement et de recherche en analyse socio-économique

Abstract

The CPR ("cumulative proportional reinforcement") learning rule stipulates that an agent chooses a move with a probability proportional to the cumulative payoff she obtained in the past with that move. Previously considered for strategies in normal form games (Laslier, Topol and Walliser, Games and Econ. Behav., 2001), the CPR rule is here adapted for actions in perfect information extensive form games. The paper shows that the action-based CPR process converges with probability one to the (unique) subgame perfect equilibrium.
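The rule described above can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the two-stage game tree `TREE`, its strictly positive payoffs, the uniform initial weight of 1.0, and the update in which each player adds her own terminal payoff to the weight of every action she took along the realized path. The names `cpr_choice`, `play_round`, and `simulate` are hypothetical.

```python
import random

def cpr_choice(cumulative, rng):
    """Pick a move with probability proportional to its cumulative payoff."""
    moves = list(cumulative)
    weights = [cumulative[m] for m in moves]
    return rng.choices(moves, weights=weights, k=1)[0]

# Hypothetical perfect-information game (illustration only).
# Internal nodes map to (player index, {action: child}); a child is either
# another node name or a leaf given as a tuple of payoffs (player 0, player 1).
TREE = {
    "root": (0, {"L": "n1", "R": (2.0, 2.0)}),
    "n1": (1, {"l": (3.0, 3.0), "r": (1.0, 1.0)}),
}

def play_round(tree, weights, rng):
    """Play the game once with CPR choices, then reinforce the actions used."""
    node = "root"
    path = []
    while isinstance(node, str):  # leaves are payoff tuples, not strings
        player, children = tree[node]
        action = cpr_choice(weights[node], rng)
        path.append((node, player, action))
        node = children[action]
    payoffs = node
    # Assumed update: the mover at each visited node adds her own payoff
    # to the cumulative payoff of the action she chose there.
    for name, player, action in path:
        weights[name][action] += payoffs[player]
    return payoffs

def simulate(tree, rounds, seed=0, initial=1.0):
    """Run the action-based process and return the cumulative payoffs."""
    rng = random.Random(seed)
    weights = {
        name: {action: initial for action in children}
        for name, (_, children) in tree.items()
    }
    for _ in range(rounds):
        play_round(tree, weights, rng)
    return weights

if __name__ == "__main__":
    final = simulate(TREE, rounds=10_000)
    for name, node_weights in final.items():
        total = sum(node_weights.values())
        print(name, {a: round(w / total, 3) for a, w in node_weights.items()})
```

In this toy game, backward induction selects `l` at `n1` and then `L` at the root, so the printed choice probabilities can be compared against that path. A single finite run only suggests the trend; the convergence result stated in the abstract is an almost-sure limit statement.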

Keywords

Learning; Polya process; Reinforcement; Subgame Perfect Equilibrium

Domains

Economics and finance

Contributor: Caroline Bauer

https://pjse.hal.science/halshs-00754083

Submitted on: Tuesday, 20 November 2012, 08:42:35

Last modified on: Tuesday, 2 January 2024, 16:25:09

Dates and versions

halshs-00754083 , version 1 (20-11-2012)

Identifiers

HAL Id : halshs-00754083 , version 1
DOI : 10.1007/s001820400194

Cite

Jean-François Laslier, Bernard Walliser. A reinforcement learning process in extensive form games. International Journal of Game Theory, 2005, 33 (2), pp.219-227. ⟨10.1007/s001820400194⟩. ⟨halshs-00754083⟩


Collections

X ENPC PJSE CNRS X-LEEP X-DEP-ECO PARISTECH

