<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="editorial">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Psychol.</journal-id>
<journal-title>Frontiers in Psychology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Psychol.</abbrev-journal-title>
<issn pub-type="epub">1664-1078</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpsyg.2013.00233</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Psychology</subject>
<subj-group>
<subject>Opinion Article</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Production, comprehension, and synthesis: a communicative perspective on language</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Ramscar</surname> <given-names>Michael</given-names></name>
<xref ref-type="author-notes" rid="fn001"><sup>&#x0002A;</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Baayen</surname> <given-names>Harald</given-names></name>
</contrib>
</contrib-group>
<aff><institution>Department of Linguistics, University of T&#x000FC;bingen</institution> <country>T&#x000FC;bingen, Germany</country></aff>
<author-notes>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: <email>michael.ramscar&#x00040;uni-tuebingen.de</email></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Frontiers in Language Sciences, a specialty of Frontiers in Psychology.</p></fn>
<fn fn-type="edited-by"><p>Edited by: Charles Jr. Clifton, University of Massachusetts Amherst, USA</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Charles Jr. Clifton, University of Massachusetts Amherst, USA</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>02</day>
<month>05</month>
<year>2013</year>
</pub-date>
<pub-date pub-type="collection">
<year>2013</year>
</pub-date>
<volume>4</volume>
<elocation-id>233</elocation-id>
<history>
<date date-type="received">
<day>12</day>
<month>02</month>
<year>2013</year>
</date>
<date date-type="accepted">
<day>11</day>
<month>04</month>
<year>2013</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2013 Ramscar and Baayen.</copyright-statement>
<copyright-year>2013</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.</p>
</license>
</permissions>
<counts>
<fig-count count="0"/>
<table-count count="0"/>
<equation-count count="0"/>
<ref-count count="56"/>
<page-count count="4"/>
<word-count count="3257"/>
</counts>
</article-meta>
</front>
<body>
<p>MacDonald (<xref ref-type="bibr" rid="B24a">2013</xref>) presents a strong case for speech production having a pivotal role in the cognition of language processing. Experimental research has been strongly biased toward the study of language comprehension, and it is an intellectual pleasure to be invited to rethink the consequences of the constraints imposed by speech production on both the form of utterances and how utterances are produced and understood.</p>
<p>Yet, it seems to us that production and comprehension are much more in balance. For instance, when Latin lost many of its inflectional exponents and morphed into what is now modern French, the pronouns of Latin, which were used for emphasis only, became obligatory. This, it would seem, serves the listener rather than making life easier for the speaker. In the convection cycle of language change over time, speakers time and again opt for articulatorily simpler forms whenever they can. In French, this led to reduced forms (compared to Latin) of the subject and object pronouns. In modern colloquial French, these pronouns can even become prefixoids that are fusing into the verb, leading to structures such as <italic>Jean il l&#x00027;a vue Pierre.</italic> The result is an inflectional system with subject and object marking on the verb, remarkably similar to the forms of Amerindian languages (Vendryes, <xref ref-type="bibr" rid="B53">1921</xref>; Lambrecht, <xref ref-type="bibr" rid="B21">1981</xref>). Simplification by the speaker is followed by diversification for the listener, which is followed by simplification by the speaker. Crucially, in the negotiation of communication, utterances only have a chance of being replicated (in the evolutionary sense) if they are both producible and understandable (cf. Steels, <xref ref-type="bibr" rid="B48">1998</xref>; Steels and Wellens, <xref ref-type="bibr" rid="B49">2006</xref>).</p>
<p>However, rather than attempting to evaluate MacDonald&#x00027;s program by means of individual case studies, in this commentary we take a step back, and argue for a view in which the forces of production and comprehension are not only much more balanced, but in which they are essentially the same. To understand why we think the similarities are much more important than the differences, we turn to learning theory and information theory.</p>
<p>As MacDonald emphasizes, <italic>learning</italic> is a ubiquitous aspect of experience. Although it is often conceptualized abstractly as a process that increases knowledge (like adding entries to an encyclopedia) and that improves performance (by increasing counters in the head, whether conceptualized as Bayesian priors or by serial search in a frequency ordered encyclopedia), it is important to note that the mechanistic picture of learning that has emerged from many lines of inquiry in the cognitive and brain sciences is <italic>discriminative</italic>. At both low- (e.g., O&#x00027;Brien and Raymond, <xref ref-type="bibr" rid="B28">2012</xref>) and high- (e.g., Ramscar et al., <xref ref-type="bibr" rid="B35">2013b</xref>) levels of abstraction, learning is a process that reapportions attentional/representational resources in order to maximize future predictive success (e.g., Rescorla and Wagner, <xref ref-type="bibr" rid="B42">1972</xref>; Pearce and Hall, <xref ref-type="bibr" rid="B29">1980</xref>; Sutton and Barto, <xref ref-type="bibr" rid="B50">1998</xref>; McLaren and Mackintosh, <xref ref-type="bibr" rid="B25">2000</xref>; Schultz and Dickinson, <xref ref-type="bibr" rid="B45">2000</xref>; Kruschke, <xref ref-type="bibr" rid="B19">2001</xref>; Danks, <xref ref-type="bibr" rid="B7">2003</xref>). Prediction error is used to <italic>discriminate</italic> against uninformative cues and to reinforce <italic>informative</italic> cues. These models of learning belong to a broad class of discriminative algorithms, along with the overwhelming majority of biologically based learning models (Schultz, <xref ref-type="bibr" rid="B44">2006</xref>).</p>
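<p>To make the role of prediction error concrete, the following is a minimal sketch of an error-driven update in the style of the Rescorla and Wagner (<xref ref-type="bibr" rid="B42">1972</xref>) rule. The cue names, the learning rate, the number of passes, and the two-trial training set are illustrative assumptions and are not taken from any of the studies cited above.</p>
<preformat>
```python
def rescorla_wagner(trials, rate=0.1, lam=1.0, epochs=500):
    """Return cue weights after repeated error-driven updates.

    trials: list of (cues, outcome_present) pairs (hypothetical data).
    rate:   combined learning-rate parameter (assumed value).
    lam:    maximum associative strength the outcome supports.
    """
    weights = {}
    for _ in range(epochs):
        for cues, outcome_present in trials:
            # The prediction is the summed strength of all cues present.
            prediction = sum(weights.get(cue, 0.0) for cue in cues)
            target = lam if outcome_present else 0.0
            error = target - prediction
            # Every cue present on the trial shares the same error signal.
            for cue in cues:
                weights[cue] = weights.get(cue, 0.0) + rate * error
    return weights


# "background" occurs with and without the outcome, so it is uninformative;
# "informative" occurs only when the outcome does.
trials = [
    (("informative", "background"), True),
    (("background",), False),
]
weights = rescorla_wagner(trials)
```
</preformat>
<p>Under these assumptions the weight of the informative cue approaches the maximum, while the weight of the background cue, after an initial rise, is driven back toward zero. This is the sense in which error-driven learning discriminates against uninformative cues.</p>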
<p>An important, though little-mentioned, feature of this kind of learning is that it yields an inherently lossy form of coding (Ramscar et al., <xref ref-type="bibr" rid="B40">2010</xref>). If languages are learned discriminatively, the representations of relationships between form and meaning that learners acquire from experience will be subject to constant change, and these changes will involve information loss. Learned relationships between forms and meanings will be subject to constant variation, both across different language users, and within language users over time (Ramscar et al., <xref ref-type="bibr" rid="B37">2013d</xref>). As MacDonald rightly observes, in these circumstances, <italic>all</italic> linguistic communication can be expected to involve ambiguity.</p>
<p>A crucial consequence of lossy coding is that linguistic forms do not simply serve as hash codes for mapping form onto meaning. The forms of language are simply not rich enough data structures to formally encode the full richness of the experiences they serve to communicate (Ramscar et al., <xref ref-type="bibr" rid="B40">2010</xref>). It is therefore not at all clear what it means to say, as MacDonald does, that &#x0201C;linguistic utterances clearly differ from other actions in that they have both a goal (e.g., to communicate) and a meaning.&#x0201D; Given what we understand about learning and encoding (see Gr&#x000FC;nwald and Vit&#x000E1;nyi, <xref ref-type="bibr" rid="B14">2003</xref> for an introduction to coding theory), it is clear that utterances neither <italic>encrypt</italic> their meanings, nor do they map onto them in a compositional, or even determinate, way. In spite of the pervasiveness of the structural metaphor (Lakoff and Johnson, <xref ref-type="bibr" rid="B20">1980</xref>) that language is like a conveyor belt transporting boxes with meanings from speaker to listener, and that it is desirable to optimally stack the boxes so that their load is uniformly distributed over the conveyor belt (Hale, <xref ref-type="bibr" rid="B15">2006</xref>; Levy, <xref ref-type="bibr" rid="B22">2008</xref>; Jaeger, <xref ref-type="bibr" rid="B17">2010</xref>; see Ferrer-i-Cancho and Moscoso del Prado Mart&#x000ED;n, <xref ref-type="bibr" rid="B9">2011</xref>; Pellegrino et al., <xref ref-type="bibr" rid="B30">2011</xref> for critiques) there is good reason to believe that meaning is not <italic>in</italic> the words nor <italic>in</italic> the sentences.</p>
<p>This is where Shannon&#x00027;s (<xref ref-type="bibr" rid="B46">1948</xref>) mathematical theory of communication provides insight:
<disp-quote>
<p>&#x0201C;The fundamental problem of communication is that of reproducing at one point either exactly or approximately a message selected at another point. Frequently the messages have meaning; that is they refer to or are correlated according to some system with certain physical or conceptual entities. <italic>These semantic aspects of communication are irrelevant to the engineering problem</italic>. The significant aspect is that the actual message is one selected from a set of possible messages. The system must be designed to operate for each possible selection, not just the one which will actually be chosen since this is unknown at the time of design.&#x0201D; (Our emphasis.)</p>
</disp-quote></p>
<p>In other words, whatever the experiences and goals we wish to communicate might be, a signal should not be assumed to be a compositional deconstruction of them. Instead, an encoding simply needs to enable senders and receivers to discriminate between experiences and goals <italic>on the basis of a shared code</italic>. For example, in a world with just two experiences (being hungry; being satiated) and no noise, a code with just two non-decompositional signals, 0 and 1, suffices.</p>
<p>The relationship between signals and meanings in this kind of system can be summarized as follows (MacKay, <xref ref-type="bibr" rid="B23">2003</xref>):
<list list-type="order">
<list-item><p>A communication <italic>system</italic> requires a sender and a receiver to be in possession of a source code defining the scope of the possible messages that can be transmitted.</p></list-item>
<list-item><p>Communication across the system is not concerned with the <italic>meaning</italic> of messages. In a Shannon system the receiver <italic>reconstructs</italic> the source message from the received signal by <italic>discriminating</italic> the source message from other possible messages that might have been selected and noise introduced by the communication channel.</p></list-item>
<list-item><p>The receiver does not interpret or expand on the source message. It simply reconstructs it at the destination with no loss of signal content. In linguistic terms, a necessary condition for successful communication is that a listener be able to correctly identify the form of the message sent. To the extent that a speaker and listener&#x00027;s codes converge, this will serve to reduce, or even eliminate, a listener&#x00027;s uncertainty about the experiences and goals that led a speaker to select that message, aligning the listener&#x00027;s predictions with the speaker&#x00027;s intentions.</p></list-item>
</list>
</p>
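<p>The three points above can be sketched for the two-experience world described earlier, now with a noisy channel. This is only an illustration: the five-bit repetition codebook, the function names, and the nearest-codeword decoding rule are assumptions introduced here, not part of Shannon&#x00027;s or MacKay&#x00027;s exposition.</p>
<preformat>
```python
# Shared source code: sender and receiver hold the same codebook
# (a hypothetical five-bit repetition code for two possible messages).
CODEBOOK = {
    "hungry": "00000",
    "satiated": "11111",
}


def hamming(a, b):
    """Number of positions at which two equal-length signals differ."""
    return sum(x != y for x, y in zip(a, b))


def encode(message):
    """Sender side: select the signal that stands for the message."""
    return CODEBOOK[message]


def decode(signal):
    """Receiver side: pick the possible message whose codeword is closest.

    Nothing is unpacked from the signal; the receiver only discriminates
    between the alternatives that the shared codebook makes available.
    """
    return min(CODEBOOK, key=lambda m: hamming(CODEBOOK[m], signal))


received = "01000"  # encode("hungry") with one bit flipped by channel noise
decoded = decode(received)
```
</preformat>
<p>In this sketch the signal carries no description of hunger at all. The receiver recovers the message only because both parties share the set of alternatives, and the redundancy in the code lets it discriminate the intended alternative from noise.</p>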
<p>Although this picture is <italic>very</italic> different to most historical approaches to language (Frege, <xref ref-type="bibr" rid="B10">1892</xref>; Russell, <xref ref-type="bibr" rid="B43">1905</xref>; Wittgenstein, <xref ref-type="bibr" rid="B54">1947</xref>; Miller, <xref ref-type="bibr" rid="B27">1951</xref>; Chomsky, <xref ref-type="bibr" rid="B5">1957</xref>, <xref ref-type="bibr" rid="B6">1997</xref>; Tomasello, <xref ref-type="bibr" rid="B52">2005</xref>), there are many reasons to believe that Shannon&#x00027;s theory provides a fruitful framework for the understanding of human communication.</p>
<p>First, as we noted above, learning is a process that leads to the acquisition of exactly the kind of predictive, discriminative codes that information theory specifies for artificial systems (Hentschel and Barlow, <xref ref-type="bibr" rid="B16">1991</xref>; Atick, <xref ref-type="bibr" rid="B1">1992</xref>). The critical difference between human and artificial communication systems is that human communicators <italic>learn</italic> as they go. Indeed, an alternative description of the goal of utterances is that speakers intend listeners to learn something from them. Virtually all utterances&#x02014;even, &#x0201C;Hello!&#x0201D;&#x02014;are intended to reduce a listener&#x00027;s uncertainty, whether about the world, or the thoughts, feelings etc., of a speaker; learning is largely defined in terms of this kind of uncertainty reduction (Rescorla, <xref ref-type="bibr" rid="B41">1988</xref>; Hentschel and Barlow, <xref ref-type="bibr" rid="B16">1991</xref>; Ramscar et al., <xref ref-type="bibr" rid="B35">2013b</xref>).</p>
<p>Second, since learning is a discriminative process, acquiring a language amounts to learning how forms discriminate between the rich experiences and goals that speakers and listeners share (see Baayen et al., <xref ref-type="bibr" rid="B4">2011</xref>, for a proof of concept). From this perspective, MacDonald&#x00027;s suggestion that prediction serves to &#x0201C;guide comprehension,&#x0201D;&#x02014;somehow helping rich semantic understandings to be mysteriously extracted from a few sparse signals (Ramscar, <xref ref-type="bibr" rid="B33">2010</xref>)&#x02014;is unnecessarily vague and complicated when compared to a more straightforward view of comprehension as the reduction of listeners&#x00027; uncertainty about speakers&#x00027; intentions as messages unfold (Ramscar et al., <xref ref-type="bibr" rid="B40">2010</xref>; see also Pickering and Garrod, <xref ref-type="bibr" rid="B32">2007</xref>; McMurray and Jongman, <xref ref-type="bibr" rid="B26">2011</xref>).</p>
<p>Third, not only does learning appear to extract a particular kind of predictive code (Schultz and Dickinson, <xref ref-type="bibr" rid="B45">2000</xref>), but the distributional structures of languages correspond closely to <italic>optimal predictive codes</italic> (Hentschel and Barlow, <xref ref-type="bibr" rid="B16">1991</xref>). In Shannon entropy terms, the least efficient possible code has a uniform distribution (i.e., one in which all alternatives are equiprobable at any given choice point) and the most efficient code is one in which items are distributed in the most non-uniform way possible (i.e., a power law distribution). The distributions of languages approximate the latter at every level so far examined (Zipf, <xref ref-type="bibr" rid="B55">1949</xref>; Genzel and Charniak, <xref ref-type="bibr" rid="B12">2002</xref>, <xref ref-type="bibr" rid="B13">2003</xref>; Aylett and Turk, <xref ref-type="bibr" rid="B2">2004</xref>, <xref ref-type="bibr" rid="B3">2006</xref>; Manin, <xref ref-type="bibr" rid="B24">2006</xref>; Futrell and Ramscar, <xref ref-type="bibr" rid="B11">2011</xref>; Ramscar and Futrell, <xref ref-type="bibr" rid="B38">2011</xref>; Piantadosi et al., <xref ref-type="bibr" rid="B31">2011</xref>).</p>
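<p>The contrast between uniform and highly skewed distributions can be checked numerically. The sketch below computes Shannon entropy for a uniform distribution and for a Zipf-like distribution over the same number of items. The inventory size of 1000 and the exponent of 1 are assumptions chosen for illustration, and the sketch shows only the difference in uncertainty per choice point, not the broader claim about optimality.</p>
<preformat>
```python
import math


def shannon_entropy(probs):
    """Entropy in bits of a discrete distribution; zero entries are skipped."""
    return -sum(p * math.log2(p) for p in probs if p != 0)


def zipf_distribution(n, exponent=1.0):
    """Probabilities proportional to 1 / rank ** exponent for ranks 1..n."""
    raw = [1.0 / rank ** exponent for rank in range(1, n + 1)]
    total = sum(raw)
    return [value / total for value in raw]


n_items = 1000  # assumed inventory size
uniform = [1.0 / n_items] * n_items
zipfian = zipf_distribution(n_items)

h_uniform = shannon_entropy(uniform)  # equals log2(n_items), the maximum
h_zipfian = shannon_entropy(zipfian)  # lower: outcomes are more predictable
```
</preformat>
<p>With all alternatives equiprobable, uncertainty at each choice point is at its maximum for an inventory of that size. Skewing the distribution lowers it, so a receiver who knows the distribution can predict upcoming items more successfully.</p>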
<p>Finally, it is clear that the nature of learning changes across childhood (Ramscar and Gitcho, <xref ref-type="bibr" rid="B39">2007</xref>; Thompson-Schill et al., <xref ref-type="bibr" rid="B51">2009</xref>; Ramscar et al., <xref ref-type="bibr" rid="B36">2013c</xref>). Very young children are deficient in many prefrontal functions that, as MacDonald emphasizes, are important to speech planning. This is a curious adaptation, but it offers at least one benefit: if &#x0201C;simple&#x0201D; discriminative learners are exposed to a highly structured environmental stimulus&#x02014;a language and its experiential correlates&#x02014;and are restricted to sampling it in the same, non-deliberative way, they will learn very similar <italic>systems</italic> of mappings (Ramscar et al., <xref ref-type="bibr" rid="B34">2013a</xref>; see also Shannon, <xref ref-type="bibr" rid="B47">1956</xref>).</p>
<p>In other words, learning, and its developmental trajectory across childhood, are particularly well-adapted for the acquisition of <italic>common</italic> predictive codes (in the Shannon sense), and linguistic distributions appear to have evolved&#x02014;socially&#x02014;to optimize these codes for communication (in the Shannon sense). It is within this information-theoretic rethinking of language that the question of the relative importance of comprehension and production in shaping language comes to stand in a different light.</p>
<p>We immediately acknowledge that linguistic distributions must be optimized for speech production (see also Zipf, <xref ref-type="bibr" rid="B55">1949</xref>). However, we contend that this optimization is totally constrained by what the listener can tolerate. For instance, in spoken Dutch, the word <italic>eigenlijk</italic> (actually) can reduce to <italic>egk</italic>. However, the speaker cannot opt for articulatory laziness in total disregard of the listener. Native speakers of Dutch do not understand <italic>egk</italic> when spoken in isolation (Ernestus et al., <xref ref-type="bibr" rid="B8">2002</xref>; Kemps et al., <xref ref-type="bibr" rid="B18">2004</xref>), and successful comprehension critically depends on its use in appropriate contexts. In other words, <italic>egk</italic> is a functional element of the speech signal by the grace of being part of a code that speakers and listeners share. Thanks to this shared code, what is easy for the speaker to produce is easy for the listener to understand. Likewise, what is more difficult for the speaker to encode, at whatever level of linguistic structure, is more difficult for the listener to decode. These considerations lead to the prediction that for each of the interesting examples discussed by MacDonald where we currently see optimization for production at work, there is a corresponding benefit for comprehension. If, as we suspect, Shannon&#x00027;s view of communication is correct, these benefits <italic>must</italic> be there, even if it is difficult to discern them at present, given our still very limited understanding of the experiences, and their neuro-cognitive instantiations, that we share when communicating with language.</p>
</body>
<back>
<ack>
<p>This research was made possible by an Alexander von Humboldt award to the second author.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Atick</surname> <given-names>J. J.</given-names></name></person-group> (<year>1992</year>). <article-title>Could information theory provide an ecological theory of sensory processing?</article-title> <source>Network</source> <volume>3</volume>, <fpage>213</fpage>&#x02013;<lpage>251</lpage>. <pub-id pub-id-type="doi">10.3109/0954898X.2011.638888</pub-id><pub-id pub-id-type="pmid">22149669</pub-id></citation>
</ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Aylett</surname> <given-names>M. P.</given-names></name> <name><surname>Turk</surname> <given-names>A.</given-names></name></person-group> (<year>2004</year>). <article-title>The smooth signal redundancy hypothesis: a functional explanation for relationships between redundancy, prosodic prominence, and duration in spontaneous speech</article-title>. <source>Lang. Speech</source> <volume>47</volume>, <fpage>31</fpage>&#x02013;<lpage>56</lpage>. <pub-id pub-id-type="pmid">15298329</pub-id></citation>
</ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Aylett</surname> <given-names>M. P.</given-names></name> <name><surname>Turk</surname> <given-names>A.</given-names></name></person-group> (<year>2006</year>). <article-title>Language redundancy predicts syllabic duration and the spectral characteristics of vocalic syllable nuclei</article-title>. <source>J. Acoust. Soc. Am</source>. <volume>119</volume>, <fpage>3048</fpage>&#x02013;<lpage>3058</lpage>. <pub-id pub-id-type="pmid">16708960</pub-id></citation>
</ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baayen</surname> <given-names>R. H.</given-names></name> <name><surname>Milin</surname> <given-names>P.</given-names></name> <name><surname>Durdevic</surname> <given-names>D. F.</given-names></name> <name><surname>Hendrix</surname> <given-names>P.</given-names></name> <name><surname>Marelli</surname> <given-names>M.</given-names></name></person-group> (<year>2011</year>). <article-title>An amorphous model for morphological processing in visual comprehension based on naive discriminative learning</article-title>. <source>Psychol. Rev</source>. <volume>118</volume>, <fpage>438</fpage>&#x02013;<lpage>481</lpage>. <pub-id pub-id-type="doi">10.1037/a0023851</pub-id><pub-id pub-id-type="pmid">21744979</pub-id></citation>
</ref>
<ref id="B5">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Chomsky</surname> <given-names>N.</given-names></name></person-group> (<year>1957</year>). <source>Syntactic Structures</source>. <publisher-loc>The Hague</publisher-loc>: <publisher-name>Mouton</publisher-name>.</citation>
</ref>
<ref id="B6">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Chomsky</surname> <given-names>N.</given-names></name></person-group> (<year>1997</year>). <source>New Horizons in the Study of Language and Mind</source>. <publisher-loc>Cambridge, England</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>.</citation>
</ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Danks</surname> <given-names>D.</given-names></name></person-group> (<year>2003</year>). <article-title>Equilibria of the Rescorla-Wagner model</article-title>. <source>J. Math. Psychol</source>. <volume>47</volume>, <fpage>109</fpage>&#x02013;<lpage>121</lpage>.</citation>
</ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ernestus</surname> <given-names>M.</given-names></name> <name><surname>Baayen</surname> <given-names>R. H.</given-names></name> <name><surname>Schreuder</surname> <given-names>R.</given-names></name></person-group> (<year>2002</year>). <article-title>The recognition of reduced word forms</article-title>. <source>Brain Lang</source>. <volume>81</volume>, <fpage>162</fpage>&#x02013;<lpage>173</lpage>. <pub-id pub-id-type="pmid">12081389</pub-id></citation>
</ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ferrer-i-Cancho</surname> <given-names>R.</given-names></name> <name><surname>Moscoso del Prado Mart&#x000ED;n</surname> <given-names>F.</given-names></name></person-group> (<year>2011</year>). <article-title>Information content versus word length in random typing</article-title>. <source>J. Stat. Mech</source>. <volume>2011</volume>:<fpage>L12002</fpage>. <pub-id pub-id-type="doi">10.1088/1742-5468/2011/12</pub-id></citation>
</ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Frege</surname> <given-names>G.</given-names></name></person-group> (<year>1892</year>). <article-title>&#x000DC;ber Sinn und Bedeutung</article-title>. <source>Zeitschrift f&#x000FC;r Philosophie und Philosophische Kritik</source> <volume>100</volume>, <fpage>25</fpage>&#x02013;<lpage>50</lpage>.</citation>
</ref>
<ref id="B11">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Futrell</surname> <given-names>R.</given-names></name> <name><surname>Ramscar</surname> <given-names>M.</given-names></name></person-group> (<year>2011</year>). <article-title>German grammatical gender manages nominal entropy</article-title>, in <source>Presentation at Information-Theoretic Approaches to Linguistics 2011</source> (<publisher-loc>Columbus</publisher-loc>: <publisher-name>LSA Linguistic Institute, Ohio State University</publisher-name>).</citation>
</ref>
<ref id="B12">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Genzel</surname> <given-names>D.</given-names></name> <name><surname>Charniak</surname> <given-names>E.</given-names></name></person-group> (<year>2002</year>). <article-title>Entropy rate constancy in text</article-title>, in <source>Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL&#x00027;02)</source> (<publisher-loc>Ann Arbor, MI</publisher-loc>: <publisher-name>Association for Computational Linguistics</publisher-name>).</citation>
</ref>
<ref id="B13">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Genzel</surname> <given-names>D.</given-names></name> <name><surname>Charniak</surname> <given-names>E.</given-names></name></person-group> (<year>2003</year>). <article-title>Variation of entropy and parse tree of sentences as a function of the sentence number</article-title>, in <source>Proceedings of the Conference on Empirical Methods in Natural Language Processing</source> (<publisher-loc>Sapporo</publisher-loc>), <fpage>65</fpage>&#x02013;<lpage>72</lpage>.</citation>
</ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gr&#x000FC;nwald</surname> <given-names>P. D.</given-names></name> <name><surname>Vit&#x000E1;nyi</surname> <given-names>P. M.</given-names></name></person-group> (<year>2003</year>). <article-title>Kolmogorov complexity and information theory. With an interpretation in terms of questions and answers</article-title>. <source>J. Logic Lang. Inform</source>. <volume>12</volume>, <fpage>497</fpage>&#x02013;<lpage>529</lpage>.</citation>
</ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hale</surname> <given-names>J.</given-names></name></person-group> (<year>2006</year>). <article-title>Uncertainty about the rest of the sentence</article-title>. <source>Cogn. Sci</source>. <volume>30</volume>, <fpage>643</fpage>&#x02013;<lpage>672</lpage>. <pub-id pub-id-type="doi">10.1207/s15516709cog0000_64</pub-id><pub-id pub-id-type="pmid">21702829</pub-id></citation>
</ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hentschel</surname> <given-names>H. G.</given-names></name> <name><surname>Barlow</surname> <given-names>U. B.</given-names></name></person-group> (<year>1991</year>). <article-title>Minimum entropy coding with Hopfield networks</article-title>. <source>Network</source> <volume>2</volume>, <fpage>135</fpage>&#x02013;<lpage>148</lpage>.</citation>
</ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jaeger</surname> <given-names>T. F.</given-names></name></person-group> (<year>2010</year>). <article-title>Redundancy and reduction: speakers manage syntactic information density</article-title>. <source>Cogn. Psychol</source>. <volume>61</volume>, <fpage>23</fpage>&#x02013;<lpage>62</lpage>. <pub-id pub-id-type="doi">10.1016/j.cogpsych.2010.02.002</pub-id><pub-id pub-id-type="pmid">20434141</pub-id></citation>
</ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kemps</surname> <given-names>R.</given-names></name> <name><surname>Ernestus</surname> <given-names>M.</given-names></name> <name><surname>Schreuder</surname> <given-names>R.</given-names></name> <name><surname>Baayen</surname> <given-names>R. H.</given-names></name></person-group> (<year>2004</year>). <article-title>Processing reduced word forms: the suffix restoration effect</article-title>. <source>Brain Lang</source>. <volume>90</volume>, <fpage>117</fpage>&#x02013;<lpage>127</lpage>. <pub-id pub-id-type="doi">10.1016/S0093-934X(03)00425-5</pub-id><pub-id pub-id-type="pmid">15172530</pub-id></citation>
</ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kruschke</surname> <given-names>J. K.</given-names></name></person-group> (<year>2001</year>). <article-title>Toward a unified model of attention in associative learning</article-title>. <source>J. Math. Psychol</source>. <volume>45</volume>, <fpage>812</fpage>&#x02013;<lpage>863</lpage>.</citation>
</ref>
<ref id="B20">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Lakoff</surname> <given-names>G. J.</given-names></name> <name><surname>Johnson</surname> <given-names>M.</given-names></name></person-group> (<year>1980</year>). <source>Metaphors We Live By</source>. <publisher-loc>Chicago, IL</publisher-loc>: <publisher-name>University of Chicago</publisher-name>.</citation>
</ref>
<ref id="B21">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Lambrecht</surname> <given-names>K.</given-names></name></person-group> (<year>1981</year>). <source>Topic, Antitopic, and Verb Agreement in Non-Standard French</source>. <publisher-loc>Amsterdam</publisher-loc>: <publisher-name>John Benjamins Publishing Company</publisher-name>.</citation>
</ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Levy</surname> <given-names>R.</given-names></name></person-group> (<year>2008</year>). <article-title>Expectation-based syntactic comprehension</article-title>. <source>Cognition</source> <volume>106</volume>, <fpage>1126</fpage>&#x02013;<lpage>1177</lpage>. <pub-id pub-id-type="doi">10.1016/j.cognition.2007.05.006</pub-id><pub-id pub-id-type="pmid">17662975</pub-id></citation>
</ref>
<ref id="B23">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>MacKay</surname> <given-names>D.</given-names></name></person-group> (<year>2003</year>). <source>Information Theory, Inference, and Learning Algorithms</source>. <publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>.</citation>
</ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Manin</surname> <given-names>D.</given-names></name></person-group> (<year>2006</year>). <article-title>Experiments on predictability of word in context and information rate in natural language</article-title>. <source>J. Inform. Process</source>. <volume>6</volume>, <fpage>229</fpage>&#x02013;<lpage>236</lpage>.</citation>
</ref>
<ref id="B24a">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>MacDonald</surname> <given-names>M. C.</given-names></name></person-group> (<year>2013</year>). <article-title>How language production shapes language form and comprehension</article-title>. <source>Front. Psychol</source>. <volume>4</volume>:<issue>226</issue>. <pub-id pub-id-type="doi">10.3389/fpsyg.2013.00226</pub-id></citation>
</ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>McLaren</surname> <given-names>I. P. L.</given-names></name> <name><surname>Mackintosh</surname> <given-names>N. J.</given-names></name></person-group> (<year>2000</year>). <article-title>An elemental model of associative learning: I. Latent inhibition and perceptual learning</article-title>. <source>Anim. Learn. Behav</source>. <volume>28</volume>, <fpage>211</fpage>&#x02013;<lpage>246</lpage>. <pub-id pub-id-type="doi">10.3758/s13420-012-0079-1</pub-id><pub-id pub-id-type="pmid">22927004</pub-id></citation>
</ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>McMurray</surname> <given-names>B.</given-names></name> <name><surname>Jongman</surname> <given-names>A.</given-names></name></person-group> (<year>2011</year>). <article-title>What information is necessary for speech categorization? Harnessing variability in the speech signal by integrating cues computed relative to expectations</article-title>. <source>Psychol. Rev</source>. <volume>118</volume>, <fpage>219</fpage>&#x02013;<lpage>246</lpage>. <pub-id pub-id-type="doi">10.1037/a0022325</pub-id><pub-id pub-id-type="pmid">21417542</pub-id></citation>
</ref>
<ref id="B27">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Miller</surname> <given-names>G. A.</given-names></name></person-group> (<year>1951</year>). <source>Language and Communication</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>McGraw-Hill</publisher-name>.</citation>
</ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>O&#x00027;Brien</surname> <given-names>J. L.</given-names></name> <name><surname>Raymond</surname> <given-names>J. E.</given-names></name></person-group> (<year>2012</year>). <article-title>Learned predictiveness speeds visual processing</article-title>. <source>Psychol. Sci</source>. <volume>23</volume>, <fpage>359</fpage>&#x02013;<lpage>363</lpage>. <pub-id pub-id-type="doi">10.1177/0956797611429800</pub-id><pub-id pub-id-type="pmid">22399415</pub-id></citation>
</ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pearce</surname> <given-names>J. M.</given-names></name> <name><surname>Hall</surname> <given-names>G.</given-names></name></person-group> (<year>1980</year>). <article-title>A model for Pavlovian learning: variations in the effectiveness of conditioned but not of unconditioned stimuli</article-title>. <source>Psychol. Rev</source>. <volume>87</volume>, <fpage>532</fpage>&#x02013;<lpage>552</lpage>. <pub-id pub-id-type="pmid">7443916</pub-id></citation>
</ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pellegrino</surname> <given-names>F.</given-names></name> <name><surname>Coup&#x000E9;</surname> <given-names>C.</given-names></name> <name><surname>Marsico</surname> <given-names>E.</given-names></name></person-group> (<year>2011</year>). <article-title>A cross-language perspective on speech information rate</article-title>. <source>Language</source> <volume>87</volume>, <fpage>539</fpage>&#x02013;<lpage>558</lpage>.</citation>
</ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Piantadosi</surname> <given-names>S. T.</given-names></name> <name><surname>Tily</surname> <given-names>H.</given-names></name> <name><surname>Gibson</surname> <given-names>E.</given-names></name></person-group> (<year>2011</year>). <article-title>The communicative function of ambiguity in language</article-title>. <source>Cognition</source> <volume>122</volume>, <fpage>280</fpage>&#x02013;<lpage>291</lpage>. <pub-id pub-id-type="doi">10.1016/j.cognition.2011.10.004</pub-id><pub-id pub-id-type="pmid">22192697</pub-id></citation>
</ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pickering</surname> <given-names>M. J.</given-names></name> <name><surname>Garrod</surname> <given-names>S.</given-names></name></person-group> (<year>2007</year>). <article-title>Do people use language production to make predictions during comprehension?</article-title> <source>Trends Cogn. Sci</source>. <volume>11</volume>, <fpage>105</fpage>&#x02013;<lpage>110</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2006.12.002</pub-id><pub-id pub-id-type="pmid">17254833</pub-id></citation>
</ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ramscar</surname> <given-names>M.</given-names></name></person-group> (<year>2010</year>). <article-title>Computing machinery and understanding</article-title>. <source>Cogn. Sci</source>. <volume>34</volume>, <fpage>966</fpage>&#x02013;<lpage>971</lpage>. <pub-id pub-id-type="doi">10.1111/j.1551-6709.2010.01120.x</pub-id><pub-id pub-id-type="pmid">21564241</pub-id></citation>
</ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ramscar</surname> <given-names>M.</given-names></name> <name><surname>Dye</surname> <given-names>M.</given-names></name> <name><surname>McCauley</surname> <given-names>S.</given-names></name></person-group> (<year>2013a</year>). <article-title>Error and expectation in language learning: the curious absence of &#x0201C;mouses&#x0201D; in adult speech</article-title>. <source>Language</source>. (in press).</citation>
</ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ramscar</surname> <given-names>M.</given-names></name> <name><surname>Dye</surname> <given-names>M.</given-names></name> <name><surname>Klein</surname> <given-names>J.</given-names></name></person-group> (<year>2013b</year>). <article-title>Children value informativity over logic in word learning</article-title>. <source>Psychol. Sci</source>. (in press). <pub-id pub-id-type="doi">10.1177/0956797612460691</pub-id><pub-id pub-id-type="pmid">23610135</pub-id></citation>
</ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ramscar</surname> <given-names>M.</given-names></name> <name><surname>Dye</surname> <given-names>M.</given-names></name> <name><surname>Gustafson</surname> <given-names>J. W.</given-names></name> <name><surname>Klein</surname> <given-names>J.</given-names></name></person-group> (<year>2013c</year>). <article-title>Dual routes to cognitive flexibility: learning and response conflict resolution in the dimensional change card sort task</article-title>. <source>Child Dev</source>. (in press). <pub-id pub-id-type="doi">10.1111/cdev.12044</pub-id><pub-id pub-id-type="pmid">23311677</pub-id></citation>
</ref>
<ref id="B37">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Ramscar</surname> <given-names>M.</given-names></name> <name><surname>Hendrix</surname> <given-names>P.</given-names></name> <name><surname>Baayen</surname> <given-names>R. H.</given-names></name></person-group> (<year>2013d</year>). <source>Nonlinear Dynamics of Lifelong Learning: the Myth of Cognitive Decline</source>. <publisher-loc>Manuscript, University of T&#x000FC;bingen</publisher-loc>.</citation>
</ref>
<ref id="B38">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Ramscar</surname> <given-names>M.</given-names></name> <name><surname>Futrell</surname> <given-names>R.</given-names></name></person-group> (<year>2011</year>). <source>The Predictive Function of Prenominal Adjectives</source>. Presentation at <conf-name>Information-Theoretic Approaches to Linguistics 2011</conf-name>. <publisher-loc>LSA Linguistic Institute, Ohio State University</publisher-loc>.</citation>
</ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ramscar</surname> <given-names>M.</given-names></name> <name><surname>Gitcho</surname> <given-names>N.</given-names></name></person-group> (<year>2007</year>). <article-title>Developmental change and the nature of learning in childhood</article-title>. <source>Trends Cogn. Sci</source>. <volume>11</volume>, <fpage>274</fpage>&#x02013;<lpage>279</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2007.05.007</pub-id><pub-id pub-id-type="pmid">17560161</pub-id></citation>
</ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ramscar</surname> <given-names>M.</given-names></name> <name><surname>Yarlett</surname> <given-names>D.</given-names></name> <name><surname>Dye</surname> <given-names>M.</given-names></name> <name><surname>Denny</surname> <given-names>K.</given-names></name> <name><surname>Thorpe</surname> <given-names>K.</given-names></name></person-group> (<year>2010</year>). <article-title>The effects of feature-label-order and their implications for symbolic learning</article-title>. <source>Cogn. Sci</source>. <volume>34</volume>, <fpage>909</fpage>&#x02013;<lpage>957</lpage>. <pub-id pub-id-type="doi">10.1111/j.1551-6709.2009.01092.x</pub-id><pub-id pub-id-type="pmid">21564239</pub-id></citation>
</ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rescorla</surname> <given-names>R. A.</given-names></name></person-group> (<year>1988</year>). <article-title>Pavlovian conditioning: it&#x00027;s not what you think it is</article-title>. <source>Am. Psychol</source>. <volume>43</volume>, <fpage>151</fpage>&#x02013;<lpage>160</lpage>. <pub-id pub-id-type="pmid">3364852</pub-id></citation>
</ref>
<ref id="B42">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Rescorla</surname> <given-names>R. A.</given-names></name> <name><surname>Wagner</surname> <given-names>A. R.</given-names></name></person-group> (<year>1972</year>). <article-title>A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement</article-title>, in <source>Classical Conditioning II: Current Research and Theory</source>, eds <person-group person-group-type="editor"><name><surname>Black</surname> <given-names>A. H.</given-names></name> <name><surname>Prokasy</surname> <given-names>W. F.</given-names></name></person-group> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Appleton-Century-Crofts</publisher-name>), <fpage>64</fpage>&#x02013;<lpage>99</lpage>.</citation>
</ref>
<ref id="B43">
<citation citation-type="other"><person-group person-group-type="author"><name><surname>Russell</surname> <given-names>B.</given-names></name></person-group> (<year>1905</year>). <article-title>On denoting</article-title>, in <source>Mind</source>, New Series, <volume>Vol. 14.</volume> Basil Blackwell, <fpage>479</fpage>&#x02013;<lpage>493</lpage>.</citation>
</ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schultz</surname> <given-names>W.</given-names></name></person-group> (<year>2006</year>). <article-title>Behavioral theories and the neurophysiology of reward</article-title>. <source>Annu. Rev. Psychol</source>. <volume>57</volume>, <fpage>87</fpage>&#x02013;<lpage>115</lpage>. <pub-id pub-id-type="doi">10.1146/annurev.psych.56.091103.070229</pub-id><pub-id pub-id-type="pmid">16318590</pub-id></citation>
</ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schultz</surname> <given-names>W.</given-names></name> <name><surname>Dickinson</surname> <given-names>A.</given-names></name></person-group> (<year>2000</year>). <article-title>Neural coding of prediction errors</article-title>. <source>Annu. Rev. Neurosci</source>. <volume>23</volume>, <fpage>473</fpage>&#x02013;<lpage>500</lpage>.</citation>
</ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shannon</surname> <given-names>C. E.</given-names></name></person-group> (<year>1948</year>). <article-title>A mathematical theory of communication</article-title>. <source>Bell Syst. Tech. J</source>. <volume>27</volume>, <fpage>379</fpage>&#x02013;<lpage>423</lpage>, 623&#x02013;656.</citation>
</ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shannon</surname> <given-names>C. E.</given-names></name></person-group> (<year>1956</year>). <article-title>The bandwagon</article-title>. <source>IRE Trans. Inform. Theory</source> <volume>2</volume>, <fpage>3</fpage>.</citation>
</ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Steels</surname> <given-names>L.</given-names></name></person-group> (<year>1998</year>). <article-title>The origins of syntax in visually grounded robotic agents</article-title>. <source>Artif. Intell</source>. <volume>103</volume>, <fpage>133</fpage>&#x02013;<lpage>156</lpage>.</citation>
</ref>
<ref id="B49">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Steels</surname> <given-names>L.</given-names></name> <name><surname>Wellens</surname> <given-names>P.</given-names></name></person-group> (<year>2006</year>). <article-title>How grammar emerges to dampen combinatorial search in parsing</article-title>, in <source>Symbol Grounding and Beyond, Proceedings of the Third EELC</source>, eds <person-group person-group-type="editor"><name><surname>Vogt</surname> <given-names>P.</given-names></name> <name><surname>Sugita</surname> <given-names>Y.</given-names></name> <name><surname>Tuci</surname> <given-names>E.</given-names></name> <name><surname>Nehaniv</surname> <given-names>C.</given-names></name></person-group> (<publisher-loc>Berlin</publisher-loc>: <publisher-name>Springer Verlag</publisher-name>), <fpage>76</fpage>&#x02013;<lpage>88</lpage>.</citation>
</ref>
<ref id="B50">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Sutton</surname> <given-names>R.</given-names></name> <name><surname>Barto</surname> <given-names>A. G.</given-names></name></person-group> (<year>1998</year>). <source>Reinforcement Learning</source>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>.</citation>
</ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thompson-Schill</surname> <given-names>S.</given-names></name> <name><surname>Ramscar</surname> <given-names>M.</given-names></name> <name><surname>Chrysikou</surname> <given-names>E.</given-names></name></person-group> (<year>2009</year>). <article-title>Cognition without control: when a little frontal lobe goes a long way</article-title>. <source>Curr. Dir. Psychol. Sci</source>. <volume>18</volume>, <fpage>259</fpage>&#x02013;<lpage>263</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-8721.2009.01648.x</pub-id><pub-id pub-id-type="pmid">20401341</pub-id></citation>
</ref>
<ref id="B52">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Tomasello</surname> <given-names>M.</given-names></name></person-group> (<year>2005</year>). <source>Constructing a Language: A Usage-Based Theory of Language Acquisition</source>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>Harvard University Press</publisher-name>.</citation>
</ref>
<ref id="B53">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Vendryes</surname> <given-names>J.</given-names></name></person-group> (<year>1921</year>). <source>Le Langage</source>, <volume>Vol. 3.</volume> <publisher-loc>Paris</publisher-loc>: <publisher-name>Albin Michel</publisher-name>.</citation>
</ref>
<ref id="B54">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Wittgenstein</surname> <given-names>L.</given-names></name></person-group> (<year>1947</year>). <source>Tractatus Logico-Philosophicus</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Kegan Paul, Trench, Trubner and Company</publisher-name>.</citation>
</ref>
<ref id="B55">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Zipf</surname> <given-names>G. K.</given-names></name></person-group> (<year>1949</year>). <source>Human Behavior and the Principle of Least Effort</source>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>Addison-Wesley</publisher-name>.</citation>
</ref>
</ref-list>
</back>
</article>