Munich Personal RePEc Archive

The economic analysis of a Q-learning model of Cooperation with punishment.

Solferino, Nazaria and Solferino, Viviana and Taurino, Serena Fiona (2015): The economic analysis of a Q-learning model of Cooperation with punishment.

There is a more recent version of this item available.

Preview

PDF
MPRA_paper_66880.pdf
Download (270kB) | Preview

Abstract

A Q-learning model is devised in order to see whether individuals can "learn" how to cooperate, when a virtuous system of punishment and reinforcement is adopted. The paper shows that, if it is possible to free-ride and not being adequately punished, there will always be an incentive to deviate from cooperation. Conversely, even if the others did not cooperate, it is still possible to have someone who cooperates when individuals are pushed by strong intrinsec motivations. Cooperation can be a learning process. It is possible to trigger a learning process that leads individuals to be equally cooperative. This happens much more easily, the more responsible the individuals are. It also depends on proper punishment.

Item Type:	MPRA Paper
Original Title:	The economic analysis of a Q-learning model of Cooperation with punishment.
English Title:	The economic analysis of a Q-learning model of Cooperation with punishment.
Language:	English
Keywords:	Cooperation, punishment, q-learning models.
Subjects:	C - Mathematical and Quantitative Methods > C6 - Mathematical Methods ; Programming Models ; Mathematical and Simulation Modeling C - Mathematical and Quantitative Methods > C7 - Game Theory and Bargaining Theory C - Mathematical and Quantitative Methods > C7 - Game Theory and Bargaining Theory > C71 - Cooperative Games
Item ID:	66880
Depositing User:	Dr Nazaria Solferino
Date Deposited:	23 Sep 2015 17:55
Last Modified:	08 Oct 2019 16:42
References:	Andreoni, J., 1989, Giving with impure altruism:applucations to Charities and ricardian equivalence, Journal of political economy, 97,6: 1447-14. Andreoni, J, 1990, Impure altruism and donation to public goods. A theory of warm glow giving, economic journal, 100: 464-477. Antoci A., Sabatini F, Sodini M., 2014, On line and off line partecipation ans social poverty trap, Crenos wp. 2014. Becchetti, L., Federico, G. and Solferino, N., 2014, What to do in globalised economies if global governance is missing, International economic review, 58,2: 185-211. Becchetti, L., Palestini, A., Solferino. N. and Tessitore, M.E, 2014, The socially responsible choice in a duopolistic market, economic modelling, 43. Becchetti, L., Pelligra V. and Taurino S.F., 2015, Other-Regarding preferences and betrayal aversion, unpublished work. Boyd R. and Richardson P. J. 1988, The evolution of reciprocity n sizeable groups, Journal of Theoretical Biology, 132. Boyd R and Richardson P. J., 1992, Punishment allows the evolution of cooperation, Ethology and sociobiology, 13. Brandt H., Hauert C, Sigmund K., 2006, Punish and abstaining for public goods, Proceedings of the National academy of science, 103. Bruni, L., 2006, Reciprocità, Mondadori editore. Dawes, R. M., 1980, Social Dilemmas, Annual review of psychology, 31. Dercole, F., Decarli M, Della Rossa F, Papadoupolos A. V, 2013, Overpunish is not necessary to fix cooperation in public goods games, Journal of Theoretical Biology, 134. Egas, M. and Riedl, A., 2008, "The economics of altruistic punishment and the maintenance of cooperation," Proceedings of the National Academy of Sciences, 275: 871-878. Fehr, E. and Gachter, S.,2000, "Cooperation and punishment in public goods experiments," American Economic Review, 90: 980-994. Fehr, E. and Gachter, S., 2000, "Altruistic punishment in humans," Nature, 415: 137-140. Fowler, J.H., 2005, "Altruistic punishment and the origin of cooperation," Proc.Natl. Acad. Sci., 102: 7047-7049 Hardin, G.,1968, "The tragedy of the commons," Science, 162: 1243-1248. Hauert, C. and Schuster, P., 1998, "Extending the iterated prisoner's dilemma without synchrony," Journal of Theoretical biology, 192: 155-166. Hauert, C.,Traulsen, A., Brandt, H., Nowak, M.A. and Sigmund, K., 2007, "Via freedom to coercion: the emergence of costly punishment," Science, 316: 1905-1907. Hauert, C., Traulsen, A., Brandt, H., Nowak, M.A. and Sigmund, K., 2008, "Public goods with punishment and abstaining in finite and infinite populations," biol.Theory, 3: 114-122. Kagel, J. and Roth, A., 1997, The Handbook of Experimental Economics, Princeton, NJ: Princeton University Press. Kianercy A, Galstyan A, 2012, Dynamics of Bolznann Q-learning in a two-players two action games, Physical Review E, 85:041145: Nakamaru, M. and Dieckmann, U., 2009, "Runaway selection for cooperation and strict-and-severe punishment," Journal of Theoretical biology, 257: 1-8. Sasaki, T., Brannstrom, A., Dieckmanna, U. and Sigmund, K., 2012, "The take-it-or-leave-it option allows small penalties to overcome social dilemmas," Proceedings of the National Academy of Sciences, 109: 1165-1169. Schuster, P. and Sigmund, K., 1983, "Replicator dynamics," Journal of Theoretical biology, 100: 533-538. Sigmund, K., DeSilva,H., Traulsen,A. and Hauert,C., 2010, "Social learning promotes institutions for governing the commons," Nature, 466: 861-863. Waltman L. and Kaymak U., 2008, "Q-learning agents in a Cournot oligopoly model,"Journal of Economi Dynamics and Control 32(10):3275-3293 · Xie M.C. and Tachibana, A., 2007, "Cooperative Behavior Acquisition for Multi-agent Systems by Q-learning," Foundations of Computational Intelligence, 2007. FOCI 2007.
URI:	https://mpra.ub.uni-muenchen.de/id/eprint/66880

Available Versions of this Item

The economic analysis of a Q-learning model of Cooperation with punishment. (deposited 14 Sep 2015 19:20)
- The economic analysis of a Q-learning model of Cooperation with punishment. (deposited 23 Sep 2015 17:55) [Currently Displayed]
  - The economic analysis of a Q-learning model of Cooperation with punishment. (deposited 02 Jun 2016 09:18)

All papers reproduced by permission. Reproduction and distribution subject to the approval of the copyright owners.

View Item

Atom RSS 1.0 RSS 2.0

Contact us: mpra@ub.uni-muenchen.de

This repository has been built using EPrints software.

MPRA is a RePEc service hosted by .