Skip to main content
Log in

Learning with neighbours

Emergence of convention in a society of learning agents

  • Published:
Synthese Aims and scope Submit manuscript

Abstract

I present a game-theoretical multi-agent system to simulate the evolutionary process responsible for the pragmatic phenomenon division of pragmatic labour (DOPL), a linguistic convention emerging from evolutionary forces. Each agent is positioned on a toroid lattice and communicates via signaling games, where the choice of an interlocutor depends on the Manhattan distance between them. In this framework I compare two learning dynamics: reinforcement learning (RL) and belief learning (BL). An agent’s experiences from previous plays influence his communication behaviour, and RL agents act in a non-rational way whereas BL agents display a small degree of rationality by using best response dynamics. The complete system simulates an evolutionary process of communication strategies, which agents can learn in a structured spatial society. The significant questions are: what circumstances could lead to an evolutionary process that doesn’t result in the expected DOPL convention; and to what extent is interlocutor rationality necessary for the emergence of a society-wide convention à la DOPL?

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Abbreviations

DOPL:

Division of pragmatic labour

BL:

Belief learning

NE:

Nash equilibrium

RD:

Replicator dynamics

RL:

Reinforcement learning

References

  • Barrett J., Zollman K.J.S. (2009) The role of forgetting in the evolution of learning and language. Journal of Experimental and Theoretical Artificial Intelligence 21(4): 293–309

    Article  Google Scholar 

  • Benz, A., Jäger, G., & van Rooy, R. (2005). An introduction to game theory for linguists. In Game theory and pragmatics (pp. 1–82).

  • Blutner, R., & Zeevat, H. (to appear). Optimality-theoretic pragmatics. In Semantics: An international handbook of natural language meaning.

  • Brown G.W. (1951) Iterative solutions of games by fictitious play. In: Koopmans T.C (eds) Activity analysis of production and allocation. Wiley, New York

    Google Scholar 

  • Foster D., Young H. P. (1990) Stochastic evolutionary game dynamics. Theoretical Population Biology 38: 219–232

    Article  Google Scholar 

  • Franke, M., & Jäger, G. (to appear). Bidirectional optimization from reasoning and learning in games. Journal of Logic, Language and Information.

  • Horn L. (1984) Towards a new taxonomy of pragmatic inference: Q-based and R-based implicature form, and use in context: Linguistic applications. Georgetown University Press, Washington, pp 11–42

    Google Scholar 

  • Huttegger S. M., Zollman K. J. S. (2011) Signaling games: Dynamics of evolution and learning language, games, and evolution. Springer, Berlin

    Google Scholar 

  • Jäger G. (2008) Evolutionary stability conditions for signaling games with costly signals. Journal of Theoretical Biology 253(1): 131–141

    Article  Google Scholar 

  • Jäger, G., & Ebert, C. (2009). Pragmatic rationalizibility. In Proceedings of Sinn and Bedeutung 13, SinSpeC. Working papers of the SFB 732 (Vol. 5, pp. 1–15).

  • Levinson S. C. (2000) Presumptive meanings: The theory of generalized conversational implicature. MIT Press, Cambridge, MA

    Google Scholar 

  • Lewis D. (1969) Convention. Harvard University Press, Cambridge

    Google Scholar 

  • Parikh P. (1991) Communication and strategic inference. Linguistics and Philosophy 14: 473–513

    Article  Google Scholar 

  • Roth A. E., Erev I. (1995) Learning in extensive form games: Experimental data and simple dynamical models in the intermediate term. Games and Economic Behavior 8(1): 164–212

    Article  Google Scholar 

  • Schaden, G. (2008). Say hello to markedness. In Proceedings of DEAL II, LiP 28 (pp. 73–97).

  • Skyrms B. (2009) Evolution of signalling systems with multiple senders and receivers. Philosophical Transactions of the Royal Society of London B 364: 771–779

    Article  Google Scholar 

  • Skyrms B. (2010) Signals: Evolution learning and information. Oxford University Press, Oxford

    Google Scholar 

  • Taylor P. D., Jonker L. (1978) Evolutionarily stable strategies and game dynamics. Mathematical Biosciences 40: 145–156

    Article  Google Scholar 

  • Tesar B., Smolensky P. (1998) Learnability in optimality theory. Linguistic Inquiry 29: 229–268

    Article  Google Scholar 

  • van Rooij R. (2008) Evolutionary motivations for semantic universals. In: Eckardt R., Jäger G., Veenstra T (eds) Variation, selection, development—probing the evolutionary model of language change. Mouton de Gruyter, Berlin, pp 103–142

    Google Scholar 

  • van Rooy R. (2004a) Signaling games select Horn strategies. Linguistics and Philosophy 27(4): 493–527

    Article  Google Scholar 

  • van Rooy R. (2004b) Evolution of conventional meaning and conversational principles. Synthese 139(2): 331–366

    Article  Google Scholar 

  • Wagner E. (2009) Communication and structured correlation. Erkenntnis 71: 377–393

    Article  Google Scholar 

  • Zollman K. J. S. (2005) Talking to neighbors: The evolution of regional meaning. Philosophy of Science 72: 69–85

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Roland Mühlenbernd.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Mühlenbernd, R. Learning with neighbours. Synthese 183 (Suppl 1), 87–109 (2011). https://doi.org/10.1007/s11229-011-9980-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11229-011-9980-y

Keywords

Navigation