Probability Matching and Reinforcement Learning

Rivas, Javier

dp11-20.pdf (160.61 kB)

Probability Matching and Reinforcement Learning

report

posted on 2011-04-20, 13:58 authored by Javier Rivas

Probability matching occurs when an action is chosen with a frequency equivalent to the probability of that action being the best choice. This sub-optimal behavior has been reported repeatedly by psychologist and experimental economist. We provide an evolutionary foundation for this phenomenon by showing that learning by reinforcement can lead to probability matching and, if learning occurs su ciently slowly, probability matching does not only occur in choice frequencies but also in choice probabilities. Our results are completed by proving that there exists no quasi-linear reinforcement learning speci cation such that behavior is optimal for all environments where counterfactuals are observed.

History

Publisher

Dept. of Economics, University of Leicester

Available date

2011-04-20

Publisher version

http://www.le.ac.uk/ec/research/discussion/Papers2011.html

Book series

Papers in Economics;11/20

Language

en

Administrator link

https://leicester.figshare.com/account/articles/10100063

Usage metrics

Keywords

Probability Matching Reinforcement Learning

Licence

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Probability Matching and Reinforcement Learning

History

Publisher

Available date

Publisher version

Book series

Language

Administrator link

Usage metrics

Categories

Keywords

Licence

Exports