A reinforcement learning process in extensive form games - Paris-Jourdan Sciences Économiques Accéder directement au contenu
Article Dans Une Revue International Journal of Game Theory Année : 2005

A reinforcement learning process in extensive form games

Résumé

The CPR ("cumulative proportional reinforcement") learning rule stipulates that an agent chooses a move with a probability proportional to the cumulative payoff she obtained in the past with that move. Previously considered for strategies in normal form games (Laslier, Topol and Walliser, Games and Econ. Behav., 2001), the CPR rule is here adapted for actions in perfect information extensive form games. The paper shows that the action-based CPR process converges with probability one to the (unique) subgame perfect equilibrium.

Dates et versions

halshs-00754083 , version 1 (20-11-2012)

Identifiants

Citer

Jean-François Laslier, Bernard Walliser. A reinforcement learning process in extensive form games. International Journal of Game Theory, 2005, 33 (2), pp.219-227. ⟨10.1007/s001820400194⟩. ⟨halshs-00754083⟩
182 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More