<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="discussion">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Artif. Intell.</journal-id>
<journal-title>Frontiers in Artificial Intelligence</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Artif. Intell.</abbrev-journal-title>
<issn pub-type="epub">2624-8212</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/frai.2022.886349</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Artificial Intelligence</subject>
<subj-group>
<subject>Opinion</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Emerging Grounded Shared Vocabularies Between Human and Machine, Inspired by Human Language Evolution</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Kouwenhoven</surname> <given-names>Tom</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1514118/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Verhoef</surname> <given-names>Tessa</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/928398/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>de Kleijn</surname> <given-names>Roy</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/116899/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Raaijmakers</surname> <given-names>Stephan</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1314983/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Creative Intelligence Lab, Leiden Institute for Advanced Computer Science, Leiden University</institution>, <addr-line>Leiden</addr-line>, <country>Netherlands</country></aff>
<aff id="aff2"><sup>2</sup><institution>Cognitive Psychology Unit, Institute of Psychology, Leiden University</institution>, <addr-line>Leiden</addr-line>, <country>Netherlands</country></aff>
<aff id="aff3"><sup>3</sup><institution>TNO, Information and Communication Technology</institution>, <addr-line>Delft</addr-line>, <country>Netherlands</country></aff>
<aff id="aff4"><sup>4</sup><institution>Leiden University Centre for Linguistics, Leiden University</institution>, <addr-line>Leiden</addr-line>, <country>Netherlands</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Arkaitz Zubiaga, Queen Mary University of London, United Kingdom</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Limor Raviv, Max Planck Institute for Psycholinguistics, Netherlands</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Tom Kouwenhoven <email>t.kouwenhoven&#x00040;liacs.leidenuniv.nl</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Language and Computation, a section of the journal Frontiers in Artificial Intelligence</p></fn></author-notes>
<pub-date pub-type="epub">
<day>26</day>
<month>04</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>5</volume>
<elocation-id>886349</elocation-id>
<history>
<date date-type="received">
<day>28</day>
<month>02</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>05</day>
<month>04</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2022 Kouwenhoven, Verhoef, de Kleijn and Raaijmakers.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Kouwenhoven, Verhoef, de Kleijn and Raaijmakers</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license> </permissions>
<kwd-group>
<kwd>conversational AI</kwd>
<kwd>language evolution</kwd>
<kwd>human-machine communication</kwd>
<kwd>grounded vocabularies</kwd>
<kwd>human-machine cooperation</kwd>
</kwd-group>
<counts>
<fig-count count="1"/>
<table-count count="0"/>
<equation-count count="0"/>
<ref-count count="45"/>
<page-count count="5"/>
<word-count count="3527"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>1. Introduction</title>
<p>Building conversational AI systems has the goal to teach machines to understand human language and respond naturally. The most common way to train agents to produce and interpret natural language is currently by exposing them to large quantities of data. Although this has resulted in advances in many areas, these systems typically have little understanding of how language is related to the real world (Mordatch and Abbeel, <xref ref-type="bibr" rid="B28">2018</xref>), known as the <italic>grounding problem</italic>. Also, most conversational agents are trained in isolation, while humans are social animals, deeply embedded in culture and surrounded by others. Complex human behaviors, like language, evolved in socio&#x02013;cultural contexts and could not exist without a variety of minds using and transmitting these behaviors.</p>
<p>To overcome this problem, researchers in Computational Linguistics have started modeling emerging communication setups, in which novel signals are created by interacting agents (Lazaridou et al., <xref ref-type="bibr" rid="B23">2018</xref>; Mordatch and Abbeel, <xref ref-type="bibr" rid="B28">2018</xref>; Chaabouni et al., <xref ref-type="bibr" rid="B7">2019</xref>; ter Hoeve et al., <xref ref-type="bibr" rid="B43">2021</xref>). However, the findings in such models do not always match what is found in similar experiments with humans, and features found in human language often do not emerge (Lazaridou et al., <xref ref-type="bibr" rid="B26">2020</xref>).</p>
<p>The mechanisms that influence the emergence of communication and linguistic structure have been studied in the field of Language Evolution. Although the precise origins of human language are widely debated, computer simulations (Boer, <xref ref-type="bibr" rid="B4">2006</xref>; Steels, <xref ref-type="bibr" rid="B38">2012a</xref>; Kirby, <xref ref-type="bibr" rid="B18">2017</xref>) and experiments in which humans use novel communication signals (Galantucci and Garrod, <xref ref-type="bibr" rid="B13">2010</xref>; Scott-Phillips and Kirby, <xref ref-type="bibr" rid="B33">2010</xref>; Kirby et al., <xref ref-type="bibr" rid="B20">2014</xref>), have revealed some key mechanisms that drive the initial emergence of a novel language and the gradual appearance of more complex linguistic structure. We review relevant findings and propose to apply methods that confirm the importance of including micro&#x02013;societies of interacting minds to the emergence of novel human&#x02013;machine communication systems.</p>
<p>A major insight from these studies is that language <italic>adapts</italic> to human biases and how it is learned and used (Kirby et al., <xref ref-type="bibr" rid="B20">2014</xref>, <xref ref-type="bibr" rid="B21">2015</xref>). Similarly, current language models also exhibit biases, free order case-marking languages are for example more challenging to model than fixed-order languages (Bisazza et al., <xref ref-type="bibr" rid="B3">2021</xref>). As such, we suggest that language used in human&#x02013;machine communication should also evolve more naturally, resulting in a grounded communication system adapted to biases and constraints of human and machine learning. We moreover emphasize the importance of co&#x02013;development of shared vocabularies by conversational partners (human or AI&#x02013;based). Doing so might result in a dynamic communication system that is natural to humans and artificial conversational agents. We propose to follow a process of several steps, displayed in <xref ref-type="fig" rid="F1">Figure 1</xref>. Starting from random behaviors, a signal&#x02013;meaning mapping emerges from shared interactions (section 2) which become more structured through horizontal and vertical transmission (section 3) and eventually evolve into an adaptive communication system (sections 4, 5).</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>The proposed road of evolving a natural human&#x02013;machine communication system. First, initially random behaviors obtain meanings and become more structured through recurrent horizontal and vertical transmission. Everyday usage facilitates the continuous evolution of communication systems which are adaptive to biases of humans and machines.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="frai-05-886349-g0001.tif"/>
</fig>
</sec>
<sec id="s2">
<title>2. Emergence of communication</title>
<p>Successful communication happens when the coordinated actions of all participants adhere to the grounding criterion: that interlocutors agree that they have understood what was meant for the current purposes (Clark and Brennan, <xref ref-type="bibr" rid="B8">1991</xref>). This requires a vocabulary that is (partially) aligned between interlocutors of a conversation (Pickering and Garrod, <xref ref-type="bibr" rid="B29">2004</xref>). The emergence of which starts with agreeing on what kind of (initially random) behaviors should be interpreted as communicative and what they refer to (box 1 and 2 in <xref ref-type="fig" rid="F1">Figure 1</xref>).</p>
<p>Experiments with human participants have been conducted to study the emergence of novel communication forms (Galantucci, <xref ref-type="bibr" rid="B12">2005</xref>; Steels, <xref ref-type="bibr" rid="B37">2006</xref>; Scott-Phillips et al., <xref ref-type="bibr" rid="B34">2009</xref>; Galantucci and Garrod, <xref ref-type="bibr" rid="B13">2010</xref>). Here, participants need to invent and negotiate novel signals to solve a communicative or cooperative task. Albeit often bound to the starting conditions of the experiment, even when no conventional signaling device is given, actions may gradually become communicative (Scott-Phillips et al., <xref ref-type="bibr" rid="B34">2009</xref>). Typically, humans quickly establish conventions and settle on a shared set of signals. Sufficient common ground, interactions, and social coordination have been identified as crucial to facilitate the emergence of communication systems.</p>
<p>With computational agents, Quinn (<xref ref-type="bibr" rid="B30">2001</xref>) investigated the emergence of signals and cooperation without dedicated communication channels in a way that is comparable to the work of Scott-Phillips et al. (<xref ref-type="bibr" rid="B34">2009</xref>). Here, robots, only equipped with sensors to observe a shared environment, were tasked to move away from a starting point while maintaining proximity to each other. Initial random behaviors gradually evolved into an iconic signaling system that could establish the allocation of leader&#x02013;follower roles (Quinn, <xref ref-type="bibr" rid="B30">2001</xref>; Quinn et al., <xref ref-type="bibr" rid="B31">2003</xref>).</p>
<p>A large body of work in <italic>evolutionary language games</italic>, as reviewed in Steels (<xref ref-type="bibr" rid="B39">2012b</xref>), has shown that agents without a pre&#x02013;programmed language can develop a communication system from scratch. This happens in a self&#x02013;organizing fashion, as alignment between agents arises from repeated interactions between individuals without central control. In the context of those experiments, Steels already proposed that robots can participate in the ongoing evolution of language and learn from human language users if there are sufficient situated interactions (Steels, <xref ref-type="bibr" rid="B38">2012a</xref>). We think that this is key to developing natural communication between humans and machines.</p>
<p>Although building an initially shared vocabulary is well&#x02013;explored between humans and in agent&#x02013;based models, to the best of our knowledge, it is rarely applied in human&#x02013;machine settings. One exception is a large&#x02013;scale exhibition of Steel&#x00027;s Talking Heads experiment (Steels, <xref ref-type="bibr" rid="B36">1999</xref>), in which both agents and human visitors proposed new words that could become part of an evolving shared vocabulary. We propose to revisit this idea in the context of conversational AI and include this initial step of co&#x02013;developing a shared vocabulary, rather than bypassing it with (random) symbols or pre&#x02013;trained language models, and trust in the process of self&#x02013;organization to facilitate the emergence of conventional signal&#x02013;meaning mappings. Once established, this shared set of signals may be far removed from natural language that humans know today, but just like human languages have, will adapt to their users and usage and become more complex and systematic.</p>
</sec>
<sec id="s3">
<title>3. Emergence of structure in language systems</title>
<p>Human language is uniquely structured and exhibits systematicity at multiple levels (Kirby, <xref ref-type="bibr" rid="B18">2017</xref>). For example, words are combined into sentences such that their meaning is a function of the meanings of the parts and the way they are combined (compositional structure). The origins of this and other types of structure have been studied using computer models and artificial language learning experiments with humans (Kirby, <xref ref-type="bibr" rid="B18">2017</xref>).</p>
<p>Among others, two processes have been found to contribute to the emergence of structure in language (box 2 and 3 in <xref ref-type="fig" rid="F1">Figure 1</xref>). The first is <italic>cumulative cultural evolution</italic> where (cultural) information, such as ideas or linguistic signals, is transmitted vertically along generations. An influential experiment was conducted by Kirby et al. (<xref ref-type="bibr" rid="B19">2008</xref>). Here, the first participant was asked to learn an artificial language and describe images with the acquired words. Subsequent participants learned the output of the previous participant. Through this process, the words gradually changed and became more compositional and learnable. Such results consistently show that increases in learnability and structure arise because languages adapt to human inductive biases to be transmitted faithfully (Griffiths and Kalish, <xref ref-type="bibr" rid="B15">2007</xref>). Words and patterns that are not easily learned or interpreted, will not be reproduced by the next generation and since structured languages are more easily compressible (Kirby et al., <xref ref-type="bibr" rid="B21">2015</xref>; Tamariz and Kirby, <xref ref-type="bibr" rid="B42">2015</xref>), this eventually results in more learnable and structured languages.</p>
<p>A second process contributing to the emergence of structure in human language is <italic>horizontal transmission</italic>. Here, linguistic structure originates and evolves from social coordination through repeated interactions between individuals in micro-societies. While interactions between dyads can lead to shared vocabularies and initial regularities (Theisen-White et al., <xref ref-type="bibr" rid="B44">2011</xref>; Verhoef et al., <xref ref-type="bibr" rid="B45">2016</xref>), a community of users seems to be necessary for the emergence of system-wide compositional structure and efficient coding (Fay et al., <xref ref-type="bibr" rid="B11">2008</xref>; Raviv et al., <xref ref-type="bibr" rid="B32">2019</xref>). In these cases, pressures such as the number of interaction partners and expanding meaning spaces cause initially random languages to become more structured over time.</p>
<p>The effects of horizontal and vertical transmission have also been demonstrated with agent&#x02013;based computer simulations (Steels and Loetzsch, <xref ref-type="bibr" rid="B40">2012</xref>; Kirby, <xref ref-type="bibr" rid="B18">2017</xref>). Altogether, there is overwhelming evidence suggesting that transmission (vertical or horizontal) of signals within communities contributes to the emergence of structure in language. In fact, it has been argued that both types of transmission are necessary to get a language that is learnable and usable (Kirby et al., <xref ref-type="bibr" rid="B21">2015</xref>). These processes should therefore be projected onto the human&#x02013;machine language evolution scenario to evolve a vocabulary that shares features with human language and is equally adapted to be learned and used by machines.</p>
</sec>
<sec id="s4">
<title>4. Human&#x02013;machine language evolution and reinforcement learning</title>
<p>Recent work in Computational Linguistics started to train machines to understand human language through the emergence of communication systems (Lazaridou et al., <xref ref-type="bibr" rid="B24">2017</xref>; Clark et al., <xref ref-type="bibr" rid="B9">2019</xref>; Manning et al., <xref ref-type="bibr" rid="B27">2020</xref>). A range of work has shown that (multi&#x02013;agent) reinforcement learning (RL; Sutton and Barto, <xref ref-type="bibr" rid="B41">2018</xref>) can converge on communication protocols in various game scenarios (Lazaridou et al., <xref ref-type="bibr" rid="B25">2016</xref>; Havrylov and Titov, <xref ref-type="bibr" rid="B17">2017</xref>; Chaabouni et al., <xref ref-type="bibr" rid="B6">2020</xref>). While communicative systems emerge, these often suffer from interpretability issues for humans (e.g., Mordatch and Abbeel, <xref ref-type="bibr" rid="B28">2018</xref>), making its applicability to human&#x02013;machine communication less obvious. The emergent protocols also often do not bear core properties of natural language (Kottur et al., <xref ref-type="bibr" rid="B22">2017</xref>; Chaabouni et al., <xref ref-type="bibr" rid="B7">2019</xref>). As such Lazaridou et al. (<xref ref-type="bibr" rid="B26">2020</xref>) use a pre&#x02013;trained language model in combination with self&#x02013;play to teach RL agents to communicate in natural language. However, without human intervention, this approach suffers from language drift, ultimately causing misunderstandings. While too much is problematic, we argue that some language drift is welcome since it can result in language that is optimized for human&#x02013;machine communication.</p>
<p>A growing trend in RL advocates to include human feedback in the learning loop to improve learning (Arzate Cruz and Igarashi, <xref ref-type="bibr" rid="B1">2020</xref>; ter Hoeve et al., <xref ref-type="bibr" rid="B43">2021</xref>). Bignold et al. (<xref ref-type="bibr" rid="B2">2021</xref>) propose <italic>assisted reinforcement learning</italic> where information external from the environment is used to improve the performance of a learner agent and scale to more complex scenarios. Human feedback can, for example, directly be included in the behavior of an agent instead of learning it from the ground up and potentially prevents too much language drift. This draws parallels to human interactions which offer a means to ground signs through recurrent and reciprocal usage (Garrod et al., <xref ref-type="bibr" rid="B14">2007</xref>), provide feedback on the success of a conversational contribution, and alleviate miscommunications resulting from partially aligned vocabularies due to variations or dialects.</p>
<p>To establish mutual understanding, we propose to use assisted reinforcement learning and revisit signaling games (Scott-Phillips et al., <xref ref-type="bibr" rid="B34">2009</xref>), referential games (Steels and Loetzsch, <xref ref-type="bibr" rid="B40">2012</xref>; Chaabouni et al., <xref ref-type="bibr" rid="B6">2020</xref>), and navigation games (Mordatch and Abbeel, <xref ref-type="bibr" rid="B28">2018</xref>; Dubova and Moskvichev, <xref ref-type="bibr" rid="B10">2020</xref>) to evolve shared vocabularies between humans and machines. The next step is not only to use more complex problems (e.g., increasing the number of objects, interacting partners, or vocabulary size) that necessitate more complex syntax and vocabularies (Mordatch and Abbeel, <xref ref-type="bibr" rid="B28">2018</xref>) but also to continue interacting with conversational agents frequently (box 3 and 4 in <xref ref-type="fig" rid="F1">Figure 1</xref>). While communities of self&#x02013;playing RL agents interact and consolidate the learned behaviors, frequent human&#x02013;agent interactions prevent too much language drift. The evolved communication systems will not take the same form as human language initially, but through iterations, may come closer toward it and evolve into a form that makes human&#x02013;machine interactions more natural, with communication systems adapted to biases in both human and machine learning.</p>
<p>We are aware that this proposition poses challenges, for example, the tutoring role and interactions that humans must supply. However, we assume that humans are willing to do so since we perceive robots as social, communicative partners (Guzman and Lewis, <xref ref-type="bibr" rid="B16">2020</xref>) and linguistically align to computers in several ways (Branigan et al., <xref ref-type="bibr" rid="B5">2010</xref>), an effect that is even stronger when computers themselves also exhibit alignment behavior (Spillner and Wenig, <xref ref-type="bibr" rid="B35">2021</xref>).</p>
</sec>
<sec sec-type="conclusions" id="s5">
<title>5. Conclusion</title>
<p>This article proposed to combine insights from human language evolution, specifically concerning the influence of vertical and horizontal transmission, with assisted reinforcement learning. We have shown how signals and structure emerge in socio&#x02013;cultural contexts and that language adapts to how it is learned and used. We therefore suggest that language used in human&#x02013;machine communication should also evolve naturally, emphasizing the importance of co&#x02013;development of shared conventions during communication. A first step would be to revisit communicative games and evolve successful systems between humans and machines. Doing so allows communication systems to adapt to the biases of both parties while mutual understanding is maintained, ultimately benefiting the communicative capacity of conversational agents.</p>
</sec>
<sec id="s6">
<title>Author Contributions</title>
<p>This opinion piece is written by TK with help of TV. TK, TV, and RK: literature research was conducted. TK, TV, RK, and SR: reviewing and editing was done. TV and SR supervised the process. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of Interest</title>
<p>SR is employed by TNO. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s7">
<title>Publisher&#x00027;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arzate Cruz</surname> <given-names>C.</given-names></name> <name><surname>Igarashi</surname> <given-names>T.</given-names></name></person-group> (<year>2020</year>). <article-title>&#x0201C;A survey on interactive reinforcement learning: design principles and open challenges,&#x0201D;</article-title> in <source>Proceedings of the 2020 ACM Designing Interactive Systems Conference</source>, <fpage>1195</fpage>&#x02013;<lpage>1209</lpage>.</citation>
</ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bignold</surname> <given-names>A.</given-names></name> <name><surname>Cruz</surname> <given-names>F.</given-names></name> <name><surname>Taylor</surname> <given-names>M. E.</given-names></name> <name><surname>Brys</surname> <given-names>T.</given-names></name> <name><surname>Dazeley</surname> <given-names>R.</given-names></name> <name><surname>Vamplew</surname> <given-names>P.</given-names></name> <etal/></person-group>. (<year>2021</year>). <article-title>A conceptual framework for externally-influenced agents: an assisted reinforcement learning review</article-title>. <source>J. Ambient Intell. Humanized Comput</source>. <fpage>1</fpage>&#x02013;<lpage>24</lpage>. <pub-id pub-id-type="doi">10.1007/s12652-021-03489-y</pub-id></citation>
</ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bisazza</surname> <given-names>A.</given-names></name> <name><surname>&#x000DC;st&#x000FC;n</surname> <given-names>A.</given-names></name> <name><surname>Sportel</surname> <given-names>S.</given-names></name></person-group> (<year>2021</year>). <article-title>On the difficulty of translating free-order case-marking languages</article-title>. <source>Trans. Assoc. Comput. Linguist</source>. <volume>9</volume>, <fpage>1233</fpage>&#x02013;<lpage>1248</lpage>. <pub-id pub-id-type="doi">10.1162/tacl_a_00424</pub-id></citation>
</ref>
<ref id="B4">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Boer</surname> <given-names>B. d</given-names></name></person-group>. (<year>2006</year>). <article-title>&#x0201C;Computer modelling as a tool for understanding language evolution,&#x0201D;</article-title> in <source>Evolutionary Epistemology, Language and Culture</source> (<publisher-loc>Springer</publisher-loc>), <fpage>381</fpage>&#x02013;<lpage>406</lpage>.</citation>
</ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Branigan</surname> <given-names>H. P.</given-names></name> <name><surname>Pickering</surname> <given-names>M. J.</given-names></name> <name><surname>Pearson</surname> <given-names>J.</given-names></name> <name><surname>McLean</surname> <given-names>J. F.</given-names></name></person-group> (<year>2010</year>). <article-title>Linguistic alignment between people and computers</article-title>. <source>J. Pragmat</source>. <volume>42</volume>, <fpage>2355</fpage>&#x02013;<lpage>2368</lpage>. <pub-id pub-id-type="doi">10.1016/j.pragma.2009.12.012</pub-id></citation>
</ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chaabouni</surname> <given-names>R.</given-names></name> <name><surname>Kharitonov</surname> <given-names>E.</given-names></name> <name><surname>Bouchacourt</surname> <given-names>D.</given-names></name> <name><surname>Dupoux</surname> <given-names>E.</given-names></name> <name><surname>Baroni</surname> <given-names>M.</given-names></name></person-group> (<year>2020</year>). <article-title>Compositionality and generalization in emergent languages</article-title>. <source>arXiv preprint</source> arXiv:2004.09124. <pub-id pub-id-type="doi">10.18653/v1/2020.acl-main.407</pub-id></citation>
</ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chaabouni</surname> <given-names>R.</given-names></name> <name><surname>Kharitonov</surname> <given-names>E.</given-names></name> <name><surname>Dupoux</surname> <given-names>E.</given-names></name> <name><surname>Baroni</surname> <given-names>M.</given-names></name></person-group> (<year>2019</year>). <article-title>Anti-efficient encoding in emergent communication</article-title>. <source>Adv. Neural Inf. Process. Syst</source>. <fpage>32</fpage>.</citation>
</ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clark</surname> <given-names>H. H.</given-names></name> <name><surname>Brennan</surname> <given-names>S. E.</given-names></name></person-group> (<year>1991</year>). Grounding in communication.</citation>
</ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clark</surname> <given-names>K.</given-names></name> <name><surname>Khandelwal</surname> <given-names>U.</given-names></name> <name><surname>Levy</surname> <given-names>O.</given-names></name> <name><surname>Manning</surname> <given-names>C. D.</given-names></name></person-group> (<year>2019</year>). <article-title>What does bert look at? an analysis of bert&#x00027;s attention</article-title>. <source>arXiv preprint</source> arXiv:1906.04341. <pub-id pub-id-type="doi">10.18653/v1/W19-4828</pub-id></citation>
</ref>
<ref id="B10">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Dubova</surname> <given-names>M.</given-names></name> <name><surname>Moskvichev</surname> <given-names>A.</given-names></name></person-group> (<year>2020</year>). <article-title>&#x0201C;Effects of supervision, population size, and self-play on multi-agent reinforcement learning to communicate,&#x0201D;</article-title> in <source>ALIFE 2020: The 2020 Conference on Artificial Life</source> (<publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>), <fpage>678</fpage>&#x02013;<lpage>686</lpage>.</citation>
</ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fay</surname> <given-names>N.</given-names></name> <name><surname>Garrod</surname> <given-names>S.</given-names></name> <name><surname>Roberts</surname> <given-names>L.</given-names></name></person-group> (<year>2008</year>). <article-title>The fitness and functionality of culturally evolved communication systems</article-title>. <source>Philos. Trans. R. Soc. B Biol. Sci</source>. <volume>363</volume>, <fpage>3553</fpage>&#x02013;<lpage>3561</lpage>. <pub-id pub-id-type="doi">10.1098/rstb.2008.0130</pub-id><pub-id pub-id-type="pmid">18799421</pub-id></citation></ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Galantucci</surname> <given-names>B</given-names></name></person-group>. (<year>2005</year>). <article-title>An experimental study of the emergence of human communication systems</article-title>. <source>Cogn. Sci</source>. <volume>29</volume>, <fpage>737</fpage>&#x02013;<lpage>767</lpage>. <pub-id pub-id-type="doi">10.1207/s15516709cog0000_34</pub-id><pub-id pub-id-type="pmid">21702792</pub-id></citation></ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Galantucci</surname> <given-names>B.</given-names></name> <name><surname>Garrod</surname> <given-names>S.</given-names></name></person-group> (<year>2010</year>). <article-title>Experimental semiotics: a new approach for studying the emergence and the evolution of human communication</article-title>. <source>Interact. Stud</source>. <volume>11</volume>, <fpage>1</fpage>&#x02013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.1075/is.11.1.01gal</pub-id></citation>
</ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Garrod</surname> <given-names>S.</given-names></name> <name><surname>Fay</surname> <given-names>N.</given-names></name> <name><surname>Lee</surname> <given-names>J.</given-names></name> <name><surname>Oberlander</surname> <given-names>J.</given-names></name> <name><surname>MacLeod</surname> <given-names>T.</given-names></name></person-group> (<year>2007</year>). <article-title>Foundations of representation: where might graphical symbol systems come from?</article-title> <source>Cogn. Sci</source>. <volume>31</volume>, <fpage>961</fpage>&#x02013;<lpage>987</lpage>. <pub-id pub-id-type="doi">10.1080/03640210701703659</pub-id><pub-id pub-id-type="pmid">21635324</pub-id></citation></ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Griffiths</surname> <given-names>T. L.</given-names></name> <name><surname>Kalish</surname> <given-names>M. L.</given-names></name></person-group> (<year>2007</year>). <article-title>Language evolution by iterated learning with bayesian agents</article-title>. <source>Cogn. Sci</source>. <volume>31</volume>, <fpage>441</fpage>&#x02013;<lpage>480</lpage>. <pub-id pub-id-type="doi">10.1080/15326900701326576</pub-id><pub-id pub-id-type="pmid">21635304</pub-id></citation></ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Guzman</surname> <given-names>A. L.</given-names></name> <name><surname>Lewis</surname> <given-names>S. C.</given-names></name></person-group> (<year>2020</year>). <article-title>Artificial intelligence and communication: a human-machine communication research agenda</article-title>. <source>New Media Soc</source>. <volume>22</volume>, <fpage>70</fpage>&#x02013;<lpage>86</lpage>. <pub-id pub-id-type="doi">10.1177/1461444819858691</pub-id></citation>
</ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Havrylov</surname> <given-names>S.</given-names></name> <name><surname>Titov</surname> <given-names>I.</given-names></name></person-group> (<year>2017</year>). <article-title>Emergence of language with multi-agent games: learning to communicate with sequences of symbols</article-title>. <source>Adv. Neural Inf. Process. Syst</source>. <fpage>30</fpage>.</citation>
</ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kirby</surname> <given-names>S</given-names></name></person-group>. (<year>2017</year>). <article-title>Culture and biology in the origins of linguistic structure</article-title>. <source>Psychon. Bull. Rev</source>. <volume>24</volume>, <fpage>118</fpage>&#x02013;<lpage>137</lpage>. <pub-id pub-id-type="doi">10.3758/s13423-016-1166-7</pub-id><pub-id pub-id-type="pmid">28120320</pub-id></citation></ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kirby</surname> <given-names>S.</given-names></name> <name><surname>Cornish</surname> <given-names>H.</given-names></name> <name><surname>Smith</surname> <given-names>K.</given-names></name></person-group> (<year>2008</year>). <article-title>Cumulative cultural evolution in the laboratory: an experimental approach to the origins of structure in human language</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>105</volume>, <fpage>10681</fpage>&#x02013;<lpage>10686</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0707835105</pub-id><pub-id pub-id-type="pmid">28242685</pub-id></citation></ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kirby</surname> <given-names>S.</given-names></name> <name><surname>Griffiths</surname> <given-names>T.</given-names></name> <name><surname>Smith</surname> <given-names>K.</given-names></name></person-group> (<year>2014</year>). <article-title>Iterated learning and the evolution of language</article-title>. <source>Curr. Opin. Neurobiol</source>. <volume>28</volume>, <fpage>108</fpage>&#x02013;<lpage>114</lpage>. <pub-id pub-id-type="doi">10.1016/j.conb.2014.07.014</pub-id><pub-id pub-id-type="pmid">25062470</pub-id></citation></ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kirby</surname> <given-names>S.</given-names></name> <name><surname>Tamariz</surname> <given-names>M.</given-names></name> <name><surname>Cornish</surname> <given-names>H.</given-names></name> <name><surname>Smith</surname> <given-names>K.</given-names></name></person-group> (<year>2015</year>). <article-title>Compression and communication in the cultural evolution of linguistic structure</article-title>. <source>Cognition</source> <volume>141</volume>, <fpage>87</fpage>&#x02013;<lpage>102</lpage>. <pub-id pub-id-type="doi">10.1016/j.cognition.2015.03.016</pub-id><pub-id pub-id-type="pmid">25966840</pub-id></citation></ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kottur</surname> <given-names>S.</given-names></name> <name><surname>Moura</surname> <given-names>J. M.</given-names></name> <name><surname>Lee</surname> <given-names>S.</given-names></name> <name><surname>Batra</surname> <given-names>D.</given-names></name></person-group> (<year>2017</year>). <article-title>Natural language does not emerge&#x00027;naturally&#x00027;in multi-agent dialog</article-title>. <source>arXiv preprint</source> arXiv:1706.08502. <pub-id pub-id-type="doi">10.18653/v1/D17-1321</pub-id></citation>
</ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lazaridou</surname> <given-names>A.</given-names></name> <name><surname>Hermann</surname> <given-names>K. M.</given-names></name> <name><surname>Tuyls</surname> <given-names>K.</given-names></name> <name><surname>Clark</surname> <given-names>S.</given-names></name></person-group> (<year>2018</year>). <article-title>Emergence of linguistic communication from referential games with symbolic and pixel input</article-title>. <source>arXiv preprint</source> arXiv:1804.03984.</citation>
</ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lazaridou</surname> <given-names>A.</given-names></name> <name><surname>Peysakhovich</surname> <given-names>A.</given-names></name> <name><surname>Baroni</surname> <given-names>M.</given-names></name></person-group> (<year>2017</year>). <article-title>Multi-agent cooperation and the emergence of (natural) language</article-title>. <source>arXiv preprint</source> arXiv:1612.07182.</citation>
</ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lazaridou</surname> <given-names>A.</given-names></name> <name><surname>Pham</surname> <given-names>N. T.</given-names></name> <name><surname>Baroni</surname> <given-names>M.</given-names></name></person-group> (<year>2016</year>). <article-title>Towards multi-agent communication-based language learning</article-title>. <source>arXiv preprint</source> arXiv:1605.07133.</citation>
</ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lazaridou</surname> <given-names>A.</given-names></name> <name><surname>Potapenko</surname> <given-names>A.</given-names></name> <name><surname>Tieleman</surname> <given-names>O.</given-names></name></person-group> (<year>2020</year>). <article-title>Multi-agent communication meets natural language: synergies between functional and structural language learning</article-title>. <source>arXiv preprint</source> arXiv:2005.07064. <pub-id pub-id-type="doi">10.18653/v1/2020.acl-main.685</pub-id></citation>
</ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Manning</surname> <given-names>C. D.</given-names></name> <name><surname>Clark</surname> <given-names>K.</given-names></name> <name><surname>Hewitt</surname> <given-names>J.</given-names></name> <name><surname>Khandelwal</surname> <given-names>U.</given-names></name> <name><surname>Levy</surname> <given-names>O.</given-names></name></person-group> (<year>2020</year>). <article-title>Emergent linguistic structure in artificial neural networks trained by self-supervision</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>117</volume>, <fpage>30046</fpage>&#x02013;<lpage>30054</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1907367117</pub-id><pub-id pub-id-type="pmid">32493748</pub-id></citation></ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mordatch</surname> <given-names>I.</given-names></name> <name><surname>Abbeel</surname> <given-names>P.</given-names></name></person-group> (<year>2018</year>). <article-title>&#x0201C;Emergence of grounded compositional language in multi-agent populations,&#x0201D;</article-title> in <source>Proceedings of the AAAI Conference on Artificial Intelligence, Vol</source>. <fpage>32</fpage>.</citation>
</ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pickering</surname> <given-names>M. J.</given-names></name> <name><surname>Garrod</surname> <given-names>S.</given-names></name></person-group> (<year>2004</year>). <article-title>Toward a mechanistic psychology of dialogue</article-title>. <source>Behav. Brain Sci</source>. <volume>27</volume>, <fpage>169</fpage>&#x02013;<lpage>190</lpage>. <pub-id pub-id-type="doi">10.1017/S0140525X04000056</pub-id><pub-id pub-id-type="pmid">15595235</pub-id></citation></ref>
<ref id="B30">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Quinn</surname> <given-names>M</given-names></name></person-group>. (<year>2001</year>). <article-title>&#x0201C;Evolving communication without dedicated communication channels,&#x0201D;</article-title> in <source>European Conference on Artificial Life</source> (<publisher-loc>Springer</publisher-loc>), <fpage>357</fpage>&#x02013;<lpage>366</lpage>.</citation>
</ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Quinn</surname> <given-names>M.</given-names></name> <name><surname>Smith</surname> <given-names>L.</given-names></name> <name><surname>Mayley</surname> <given-names>G.</given-names></name> <name><surname>Husbands</surname> <given-names>P.</given-names></name></person-group> (<year>2003</year>). <article-title>Evolving controllers for a homogeneous system of physical robots: Structured cooperation with minimal sensors</article-title>. <source>Philos. Trans. R. Soc. Lond. A Math. Phys. Eng. Sci</source>. <volume>361</volume>, <fpage>2321</fpage>&#x02013;<lpage>2343</lpage>. <pub-id pub-id-type="doi">10.1098/rsta.2003.1258</pub-id><pub-id pub-id-type="pmid">14599322</pub-id></citation></ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Raviv</surname> <given-names>L.</given-names></name> <name><surname>Meyer</surname> <given-names>A.</given-names></name> <name><surname>Lev-Ari</surname> <given-names>S.</given-names></name></person-group> (<year>2019</year>). <article-title>Compositional structure can emerge without generational transmission</article-title>. <source>Cognition</source> <volume>182</volume>, <fpage>151</fpage>&#x02013;<lpage>164</lpage>. <pub-id pub-id-type="doi">10.1016/j.cognition.2018.09.010</pub-id><pub-id pub-id-type="pmid">30267952</pub-id></citation></ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Scott-Phillips</surname> <given-names>T. C.</given-names></name> <name><surname>Kirby</surname> <given-names>S.</given-names></name></person-group> (<year>2010</year>). <article-title>Language evolution in the laboratory</article-title>. <source>Trends Cogn. Sci</source>. <volume>14</volume>, <fpage>411</fpage>&#x02013;<lpage>417</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2010.06.006</pub-id><pub-id pub-id-type="pmid">20675183</pub-id></citation></ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Scott-Phillips</surname> <given-names>T. C.</given-names></name> <name><surname>Kirby</surname> <given-names>S.</given-names></name> <name><surname>Ritchie</surname> <given-names>G. R.</given-names></name></person-group> (<year>2009</year>). <article-title>Signalling signalhood and the emergence of communication</article-title>. <source>Cognition</source> <volume>113</volume>, <fpage>226</fpage>&#x02013;<lpage>233</lpage>. <pub-id pub-id-type="doi">10.1016/j.cognition.2009.08.009</pub-id><pub-id pub-id-type="pmid">19740461</pub-id></citation></ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spillner</surname> <given-names>L.</given-names></name> <name><surname>Wenig</surname> <given-names>N.</given-names></name></person-group> (<year>2021</year>). <article-title>&#x0201C;Talk to me on my level-linguistic alignment for chatbots,&#x0201D;</article-title> in <source>Proceedings of the 23rd International Conference on Mobile Human-Computer Interaction</source>, <fpage>1</fpage>&#x02013;<lpage>12</lpage>.</citation>
</ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Steels</surname> <given-names>L</given-names></name></person-group>. (<year>1999</year>). <source>The Talking Heads Experiment. Vol. 1. Words and Meanings</source>. Special pre-print for the LABORATORIUM Antwerpen.</citation>
</ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Steels</surname> <given-names>L</given-names></name></person-group>. (<year>2006</year>). <article-title>Experiments on the emergence of human communication</article-title>. <source>Trends Cogn. Sci</source>. <volume>10</volume>, <fpage>347</fpage>&#x02013;<lpage>349</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2006.06.002</pub-id><pub-id pub-id-type="pmid">16828573</pub-id></citation></ref>
<ref id="B38">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Steels</surname> <given-names>L</given-names></name></person-group>. (<year>2012a</year>). <article-title>&#x0201C;Grounding language through evolutionary language games,&#x0201D;</article-title> in <source>Language Grounding in Robots</source> (<publisher-loc>Springer</publisher-loc>), <fpage>1</fpage>&#x02013;<lpage>22</lpage>.</citation>
</ref>
<ref id="B39">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Steels</surname> <given-names>L</given-names></name></person-group>. (<year>2012b</year>). <article-title>&#x0201C;Self-organization and selection in cultural language evolution,&#x0201D;</article-title> in <source>Experiments in Cultural Language Evolution</source> (<publisher-loc>John Benjamins</publisher-loc>), <fpage>1</fpage>&#x02013;<lpage>37</lpage>.</citation>
</ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Steels</surname> <given-names>L.</given-names></name> <name><surname>Loetzsch</surname> <given-names>M.</given-names></name></person-group> (<year>2012</year>). <article-title>The grounded naming game</article-title>. <source>Exp. Cult. Lang. Evolut</source>. <volume>3</volume>, <fpage>41</fpage>&#x02013;<lpage>59</lpage>. <pub-id pub-id-type="doi">10.1075/ais.3.04ste</pub-id></citation>
</ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sutton</surname> <given-names>R. S.</given-names></name> <name><surname>Barto</surname> <given-names>A. G.</given-names></name></person-group> (<year>2018</year>). <source>Reinforcement Learning: An Introduction</source>. MIT Press.</citation>
</ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tamariz</surname> <given-names>M.</given-names></name> <name><surname>Kirby</surname> <given-names>S.</given-names></name></person-group> (<year>2015</year>). <article-title>Culture: copying, compression, and conventionality</article-title>. <source>Cogn. Sci</source>. <volume>39</volume>, <fpage>171</fpage>&#x02013;<lpage>183</lpage>. <pub-id pub-id-type="doi">10.1111/cogs.12144</pub-id><pub-id pub-id-type="pmid">25039798</pub-id></citation></ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>ter Hoeve</surname> <given-names>M.</given-names></name> <name><surname>Kharitonov</surname> <given-names>E.</given-names></name> <name><surname>Hupkes</surname> <given-names>D.</given-names></name> <name><surname>Dupoux</surname> <given-names>E.</given-names></name></person-group> (<year>2021</year>). <article-title>Towards interactive language modeling</article-title>. <source>arXiv preprint</source> arXiv:2112.11911.</citation>
</ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Theisen-White</surname> <given-names>C.</given-names></name> <name><surname>Kirby</surname> <given-names>S.</given-names></name> <name><surname>Oberlander</surname> <given-names>J.</given-names></name></person-group> (<year>2011</year>). <article-title>&#x0201C;Integrating the horizontal and vertical cultural transmission of novel communication systems,&#x0201D;</article-title> in <source>Proceedings of the Annual Meeting of the Cognitive Science Society, Vol</source>. <volume>33</volume>, <fpage>956</fpage>&#x02013;<lpage>961</lpage>.</citation>
</ref>
<ref id="B45">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Verhoef</surname> <given-names>T.</given-names></name> <name><surname>Walker</surname> <given-names>E.</given-names></name> <name><surname>Marghetis</surname> <given-names>T.</given-names></name></person-group> (<year>2016</year>). <article-title>&#x0201C;Cognitive biases and social coordination in the emergence of temporal language,&#x0201D;</article-title> in <source>Proceedings of the 38th Annual Conference of the Cognitive Science Society</source> (<publisher-loc>Austin, TX</publisher-loc>: <publisher-name>Cognitive Science Society</publisher-name>), <fpage>2615</fpage>&#x02013;<lpage>2620</lpage>.</citation>
</ref>
</ref-list>
</back>
</article>