<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="brief-report">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Hum. Neurosci.</journal-id>
<journal-title>Frontiers in Human Neuroscience</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Hum. Neurosci.</abbrev-journal-title>
<issn pub-type="epub">1662-5161</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fnhum.2023.1108354</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Neuroscience</subject>
<subj-group>
<subject>Perspective</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Multimodal processing in face-to-face interactions: A bridging link between psycholinguistics and sensory neuroscience</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Benetti</surname> <given-names>Stefania</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<xref ref-type="author-notes" rid="fn002"><sup>&#x2020;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/115471/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Ferrari</surname> <given-names>Ambra</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="author-notes" rid="fn002"><sup>&#x2020;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/2115581/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Pavani</surname> <given-names>Francesco</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/928/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Centre for Mind/Brain Sciences, University of Trento</institution>, <addr-line>Trento</addr-line>, <country>Italy</country></aff>
<aff id="aff2"><sup>2</sup><institution>Interuniversity Research Centre &#x201C;Cognition, Language, and Deafness&#x201D;, CIRCLeS</institution>, <addr-line>Catania</addr-line>, <country>Italy</country></aff>
<aff id="aff3"><sup>3</sup><institution>Max Planck Institute for Psycholinguistics, Donders Institute for Brain, Cognition, and Behaviour, Radboud University</institution>, <addr-line>Nijmegen</addr-line>, <country>Netherlands</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Elena Nava, University of Milano-Bicocca, Italy</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: James P. Trujillo, Radboud University, Netherlands; Huriye Atilgan, University of Oxford, United Kingdom; Jun-ichiro Watanabe, Hitachi, Japan</p></fn>
<corresp id="c001">&#x002A;Correspondence: Stefania Benetti, <email>stefania.benetti@unitn.it</email></corresp>
<fn fn-type="equal" id="fn002"><p><sup>&#x2020;</sup>These authors have contributed equally to this work and share first authorship</p></fn>
<fn fn-type="other" id="fn004"><p>This article was submitted to Sensory Neuroscience, a section of the journal Frontiers in Human Neuroscience</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>02</day>
<month>02</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>17</volume>
<elocation-id>1108354</elocation-id>
<history>
<date date-type="received">
<day>25</day>
<month>11</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>11</day>
<month>01</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2023 Benetti, Ferrari and Pavani.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Benetti, Ferrari and Pavani</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>In face-to-face communication, humans are faced with multiple layers of discontinuous multimodal signals, such as head, face, hand gestures, speech and non-speech sounds, which need to be interpreted as coherent and unified communicative actions. This implies a fundamental computational challenge: optimally binding only signals belonging to the same communicative action while segregating signals that are not connected by the communicative content. How do we achieve such an extraordinary feat, reliably, and efficiently? To address this question, we need to further move the study of human communication beyond speech-centred perspectives and promote a multimodal approach combined with interdisciplinary cooperation. Accordingly, we seek to reconcile two explanatory frameworks recently proposed in psycholinguistics and sensory neuroscience into a neurocognitive model of multimodal face-to-face communication. First, we introduce a psycholinguistic framework that characterises face-to-face communication at three parallel processing levels: multiplex signals, multimodal gestalts and multilevel predictions. Second, we consider the recent proposal of a lateral neural visual pathway specifically dedicated to the dynamic aspects of social perception and reconceive it from a multimodal perspective (&#x201C;lateral processing pathway&#x201D;). Third, we reconcile the two frameworks into a neurocognitive model that proposes how multiplex signals, multimodal gestalts, and multilevel predictions may be implemented along the lateral processing pathway. Finally, we advocate a multimodal and multidisciplinary research approach, combining state-of-the-art imaging techniques, computational modelling and artificial intelligence for future empirical testing of our model.</p>
</abstract>
<kwd-group>
<kwd>multimodal communication</kwd>
<kwd>face-to-face interactions</kwd>
<kwd>social actions</kwd>
<kwd>lateral cortical processing pathway</kwd>
<kwd>psycholinguistics</kwd>
<kwd>sensory neuroscience</kwd>
</kwd-group>
<counts>
<fig-count count="1"/>
<table-count count="1"/>
<equation-count count="0"/>
<ref-count count="157"/>
<page-count count="9"/>
<word-count count="8078"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="intro">
<title>Introduction</title>
<p>In face-to-face communication, we encounter multiple layers of discontinuous multimodal signals: head, face, mouth movements, hand gestures, speech and non-speech sounds. This implies a fundamental computational challenge: optimally binding only signals belonging to the same communicative action while segregating unrelated signals (<xref ref-type="bibr" rid="B97">Noppeney, 2021</xref>). Within this challenge, the temporal misalignment of fast-changing signals across different sensory channels raises a central binding problem (<xref ref-type="bibr" rid="B22">Chen and Vroomen, 2013</xref>). Finally, each conversational partner is taxed by fast turn-taking dynamics (<xref ref-type="bibr" rid="B85">Levinson, 2016</xref>). Despite these critical constraints, we process multimodal communicative signals faster than speech alone (<xref ref-type="bibr" rid="B69">Holler et al., 2018</xref>; <xref ref-type="bibr" rid="B30">Drijvers and Holler, 2022</xref>). Crucially, we use non-verbal communicative signals to facilitate semantic understanding (<xref ref-type="bibr" rid="B101">&#x00D6;zy&#x00FC;rek, 2014</xref>) and pragmatic inference (<xref ref-type="bibr" rid="B68">Holler, 2022</xref>). How do we achieve such an extraordinary feat?</p>
<p>To address this question, we need to move beyond the prominent speech-centred research perspective on the neurocognitive mechanisms of human communication. Building on previous calls for the need to study language in its multimodal manifestation and ecological context (<xref ref-type="bibr" rid="B86">Levinson and Holler, 2014</xref>; <xref ref-type="bibr" rid="B147">Vigliocco et al., 2014</xref>; <xref ref-type="bibr" rid="B57">Hasson et al., 2018</xref>; <xref ref-type="bibr" rid="B106">Perniss, 2018</xref>), the view we put forward here seeks to reconcile two explanatory frameworks recently proposed in psycholinguistics and sensory neuroscience. Specifically, we first highlight that verbal and non-verbal communicative signals are integrated to represent socially relevant acts (<xref ref-type="bibr" rid="B86">Levinson and Holler, 2014</xref>) through domain-general mechanisms of multimodal integration and prediction (<xref ref-type="bibr" rid="B70">Holler and Levinson, 2019</xref>). Accordingly, we then reconceive the neuroscientific evidence of a third visual pathway, specialised for dynamic aspects of social perception (<xref ref-type="bibr" rid="B111">Pitcher and Ungerleider, 2021</xref>), from a multimodal perspective. Finally, we propose that the resulting brain network implements the sensory processing gateway necessary toward successful multimodal processing and interpretation of face-to-face communicative signals.</p>
</sec>
<sec id="S2">
<title>Multimodal processing in face-to-face interactions: A possible computational framework</title>
<p><xref ref-type="bibr" rid="B70">Holler and Levinson (2019)</xref> recently outlined the key computational principles that support fast and efficient multimodal processing in face-to-face communication, with the ultimate goal of interpreting communicative social actions (<xref ref-type="fig" rid="F1">Figure 1A</xref>). First, domain-general mechanisms of multimodal integration (<xref ref-type="bibr" rid="B136">Stein, 2012</xref>; <xref ref-type="bibr" rid="B97">Noppeney, 2021</xref>) are hypothesised to be co-opted for detecting communicative signals. For example, faster processing of multimodal relative to unimodal communicative inputs mirrors multimodal facilitation outside the domain of communication in humans (<xref ref-type="bibr" rid="B95">Murray et al., 2001</xref>; <xref ref-type="bibr" rid="B131">Senkowski, 2005</xref>; <xref ref-type="bibr" rid="B29">Diederich et al., 2009</xref>) and animals (<xref ref-type="bibr" rid="B48">Gingras et al., 2009</xref>). <xref ref-type="bibr" rid="B70">Holler and Levinson (2019)</xref> proposed that multimodal interactions resting on statistical regularities among sensory inputs allow chunking the stream of concurrent dynamic inputs into <italic>multiplex signals</italic> at a perceptual, pre-semantic level. Further, the statistical regularities between multiplex signals and communicative meanings generate <italic>multimodal gestalts</italic> that bear semantic and pragmatic value, thus signalling a specific social action. For example, eyebrow frowns often accompany a raising voice pitch to signal the intention to ask a question (<xref ref-type="bibr" rid="B99">Nota et al., 2021</xref>). Mechanisms of Gestalt perception (<xref ref-type="bibr" rid="B148">Wagemans et al., 2012</xref>), social affordance (<xref ref-type="bibr" rid="B45">Gallagher, 2020</xref>), and relevance (<xref ref-type="bibr" rid="B135">Sperber and Wilson, 1995</xref>) may jointly contribute to the recognition of multimodal communicative gestalts (<xref ref-type="bibr" rid="B142">Trujillo and Holler, 2023</xref>). Finally, the recognition of a specific social action may trigger top-down <italic>multilevel predictions</italic> about how the message will unfold in time. For example, frowning and pointing at an object typically anticipates a question about that object, triggering top-down hierarchical predictions at multiple sensory levels (e.g., vocal sounds, bodily movements) and linguistic levels (e.g., words, sentential units). Multiplex signals, multimodal gestalts, and multilevel predictions are thought to interact in a continuous, dialectic process, leading to incremental unification while the message unfolds (<xref ref-type="bibr" rid="B53">Hagoort, 2005</xref>, <xref ref-type="bibr" rid="B54">2019</xref>). Specifically, this supports a parallel processing framework whereby the beginning of the message simultaneously activates multiple potential interpretations (i.e., multimodal gestalts). As the message unfolds, concurrent bottom-up sensory processing and multilevel predictions iteratively refine each other toward a final gestalt solution (<xref ref-type="bibr" rid="B142">Trujillo and Holler, 2023</xref>). Such a parallel account accommodates evidence that processing of communicative social actions starts early (<xref ref-type="bibr" rid="B118">Redcay and Carlson, 2015</xref>), perhaps in parallel to semantic comprehension (<xref ref-type="bibr" rid="B141">Tomasello et al., 2022</xref>).</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p><bold>(A)</bold> Face-to-face communication comprises multiple layers of discontinuous multimodal signals emitted by different articulators (eyes, face, etc.) over time (visual in cyan, auditory in yellow). To enable effective communication, interlocutors must bind only signals belonging to the same communicative action while segregating tangential, unrelated signals that do not share the communicative content. Temporal statistical regularities allow chunking coherent communicative inputs into multiplex signals (dashed contours) at a perceptual, pre-semantic level (A1). Further, statistical regularities between multiplex signals and communicative meanings generate multimodal gestalts (solid contours) that bear semantic and pragmatic value (A2) and thus signal a specific communicative intention in conversation, i.e., social action (A3). Finally, social action recognition may trigger top-down multilevel predictions across hierarchically organised linguistic and perceptual levels. Following a parallel processing framework, concurrent bottom-up sensory processing (black arrow) and multilevel predictions (red arrow) iteratively refine each other. <bold>(B)</bold> Schematic representation of the third visual cortical pathway (TVP) specialised for the dynamic aspects of social perception, as proposed by <xref ref-type="bibr" rid="B111">Pitcher and Ungerleider (2021)</xref>. The pathway originates in the primary visual cortex (V1) and dissociates from both the ventral and dorsal pathways by projecting into the posterior portion of the superior temporal sulcus (pSTS) via motion-selective areas (MT). <bold>(C)</bold> Schematic representation of the multimodal lateral processing pathway (LPP) implementing the sensory processing gateway toward successful face-to-face communication, as proposed in the present perspective. The LPP originates in early visual (V1) and auditory (A1) areas and dissociates from ventral and dorsal pathways by projecting to the mid-posterior portion of STS via animacy and motion-selective areas (here, MT and PT only are represented for clarity of visualisation). Black arrows indicate bottom-up processing along the LPP hierarchy from upstream regions (dashed contours) responsible for multiplex signals (panel A1) to portions of the pSTS (solid contours) that contribute toward the implementation of multimodal gestalts (panel A2). Red arrows indicate top-down multilevel predictions via pSTS to upstream visual and auditory areas (in cyan and yellow, respectively). Panel <bold>(A)</bold> is based on <xref ref-type="bibr" rid="B70">Holler and Levinson, 2019</xref>.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnhum-17-1108354-g001.tif"/>
</fig>
<p>Supporting this framework, there is substantial psycholinguistic evidence for systematic associations between facial-bodily signals and social actions (<xref ref-type="bibr" rid="B70">Holler and Levinson, 2019</xref>; <xref ref-type="bibr" rid="B99">Nota et al., 2021</xref>). Moreover, the early emergence of these perceptual associations in infants (<xref ref-type="bibr" rid="B21">Cameron-Faulkner et al., 2015</xref>), as well as parallels in non-human primates (<xref ref-type="bibr" rid="B123">Rossano and Liebal, 2014</xref>), suggest they might be deeply rooted in the human onto- and phylogenesis.</p>
</sec>
<sec id="S3">
<title>Multimodal processing in face-to-face interactions: A possible neural framework</title>
<p>Accumulating evidence (<xref ref-type="bibr" rid="B109">Pitcher et al., 2014</xref>; <xref ref-type="bibr" rid="B149">Walbrin and Koldewyn, 2019</xref>; <xref ref-type="bibr" rid="B83">Landsiedel et al., 2022</xref>) suggests that dynamic visual aspects of social perception (e.g., face, hand and body movements across the visual field) cannot be easily accommodated within the classic dual-stream model for visual perception (<xref ref-type="bibr" rid="B144">Ungerleider and Mishkin, 1982</xref>). Accordingly, resting on both anatomical and functional evidence in humans and non-human primates, <xref ref-type="bibr" rid="B111">Pitcher and Ungerleider (2021)</xref> proposed the existence of a third visual processing pathway (<xref ref-type="fig" rid="F1">Figure 1B</xref>) that projects on the lateral cortical surface from the early visual cortex into the mid-posterior superior temporal sulcus (pSTS) <italic>via</italic> motion-selective occipito-temporal areas (V5/hMT). Consistent evidence shows that pSTS preferentially responds to multiple types of dynamic social bodily inputs including eye, mouth, hands, and body movements (<xref ref-type="bibr" rid="B3">Allison et al., 2000</xref>; <xref ref-type="bibr" rid="B61">Hein and Knight, 2008</xref>; <xref ref-type="bibr" rid="B27">Deen et al., 2020</xref>). Importantly, both anterior hMT (<xref ref-type="bibr" rid="B28">Desimone and Ungerleider, 1986</xref>; <xref ref-type="bibr" rid="B72">Huk et al., 2002</xref>) and pSTS (<xref ref-type="bibr" rid="B19">Bruce et al., 1981</xref>; <xref ref-type="bibr" rid="B110">Pitcher et al., 2020</xref>; <xref ref-type="bibr" rid="B34">Finzi et al., 2021</xref>) respond to dynamic signals across both visual hemifields in human and non-human primates, in opposition to the contralateral field bias that characterises the ventral pathway (<xref ref-type="bibr" rid="B34">Finzi et al., 2021</xref>). Together, these functional properties are thought to support social interaction, which is an inherently dynamic process requiring the integration of sensory information across the entire visual field (<xref ref-type="bibr" rid="B111">Pitcher and Ungerleider, 2021</xref>).</p>
<p>Relevantly, <xref ref-type="bibr" rid="B111">Pitcher and Ungerleider (2021)</xref> note that the <italic>&#x201C;proximity (to pSTS, a.n.) of brain areas computing multisensory information relevant to social interactions further dissociates the third pathway from the established role of the ventral and dorsal pathways.&#x201D;</italic> We further elaborate on this by reconceiving the third visual pathway as a fundamental part of a larger multimodal neural system that implements fast analysis of multisensory communicative signals during face-to-face interactions. This pathway projects from early visual and auditory regions along the lateral brain surface and into the pSTS (lateral processing pathway; LPP). From this perspective, regions in the mid-posterior and lateral superior temporal gyrus, which are sensitive to auditory motion, animacy, sounds of moving bodies and dynamic aspects of human vocalisation (i.e., prosodic intonation), become candidate nodes of the auditory bank of LPP.</p>
<p>Analogously to the third visual pathway, evidence supporting the existence of a third lateral auditory cortical pathway, independent of dorsal/ventral pathways (<xref ref-type="bibr" rid="B116">Rauschecker, 1998</xref>; <xref ref-type="bibr" rid="B117">Rauschecker and Tian, 2000</xref>) and projecting <italic>via</italic> motion-sensitive regions into the posterior STS, comes from both tracer studies in macaques and <italic>in vivo</italic> white matter tractography in humans (see <xref ref-type="table" rid="T1">Table 1</xref>, connectivity profiles). These mid-posterior lateral areas showing anatomical connectivity with the pSTS also show motion-sensitive and voice-sensitive responses, suggesting functional selectivity for dynamic biologically-relevant information along this lateral auditory pathway (see <xref ref-type="table" rid="T1">Table 1</xref>, functional properties). Relevantly, functional interactions and direct anatomical connections have also been observed between auditory and visual motion-sensitive regions (see <xref ref-type="table" rid="T1">Table 1</xref>), suggesting a structural scaffolding for early convergence of multimodal information (<xref ref-type="bibr" rid="B13">Benetti and Collignon, 2022</xref>) within temporo-occipital regions of the LPP that might share the same computational goal: fast and reliable analysis of multimodal information relevant to social interactions.</p>
<table-wrap position="float" id="T1">
<label>TABLE 1</label>
<caption><p>Functional properties and structural connectivity profile of mid-posterior and lateral auditory areas in the superior temporal gyrus as described in (a) non-human and (b) human primates.</p></caption>
<table cellspacing="5" cellpadding="5" frame="box" rules="all">
<thead>
<tr>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;">Auditory area</td>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;">Functional/Connectivity profile</td>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;">References</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left" colspan="3" style="background-color: #dcdcdc;"><bold>(a) In non-human primates</bold></td>
</tr>
<tr>
<td valign="top" align="left">Mid-posterior parabelt</td>
<td valign="top" align="left">Auditory motion processing</td>
<td valign="top" align="left"><xref ref-type="bibr" rid="B112">Poirier et al., 2017</xref></td>
</tr>
<tr>
<td valign="top" align="left">Mid-lateral parabelt</td>
<td valign="top" align="left">Processing of conspecific vocalization</td>
<td valign="top" align="left"><xref ref-type="bibr" rid="B108">Petkov et al., 2008</xref>; <xref ref-type="bibr" rid="B107">Perrodin et al., 2011</xref></td>
</tr>
<tr>
<td valign="top" align="left">Mid-posterior parabelt</td>
<td valign="top" align="left">Connection to the mid-posterior STS</td>
<td valign="top" align="left"><xref ref-type="bibr" rid="B44">Galaburda and Pandya, 1983</xref>; <xref ref-type="bibr" rid="B52">Hackett et al., 1998</xref>; <xref ref-type="bibr" rid="B25">de la Mothe et al., 2006</xref>; <xref ref-type="bibr" rid="B51">Hackett et al., 2007</xref>; <xref ref-type="bibr" rid="B133">Smiley et al., 2007</xref></td>
</tr>
<tr>
<td valign="top" align="left">Motion-sensitive areas</td>
<td valign="top" align="left">Monosynaptic connection to visual MT</td>
<td valign="top" align="left"><xref ref-type="bibr" rid="B143">Ungerleider and Desimone, 1986</xref>; <xref ref-type="bibr" rid="B18">Boussaoud et al., 1990</xref>; <xref ref-type="bibr" rid="B102">Palmer and Rosa, 2006</xref></td>
</tr>
<tr>
<td valign="top" align="left" colspan="3" style="background-color: #dcdcdc;"><bold>(b) In human primates</bold></td>
</tr>
<tr>
<td valign="top" align="left">Bilateral hPT</td>
<td valign="top" align="left">Preferential processing of moving sounds</td>
<td valign="top" align="left"><xref ref-type="bibr" rid="B81">Krumbholz et al., 2005</xref>; <xref ref-type="bibr" rid="B7">Battal et al., 2019</xref></td>
</tr>
<tr>
<td valign="top" align="left">Right lateral hPT</td>
<td valign="top" align="left">Responses to ipsilateral auditory field</td>
<td valign="top" align="left"><xref ref-type="bibr" rid="B81">Krumbholz et al., 2005</xref></td>
</tr>
<tr>
<td valign="top" align="left">Bilateral anterior hPT</td>
<td valign="top" align="left">Encoding of living and human-action sounds categories</td>
<td valign="top" align="left"><xref ref-type="bibr" rid="B49">Giordano et al., 2013</xref></td>
</tr>
<tr>
<td valign="top" align="left">Right anterior hPT and area adjacent to TVA</td>
<td valign="top" align="left">Responses to socially meaningful prosody</td>
<td valign="top" align="left"><xref ref-type="bibr" rid="B12">Belyk and Brown, 2014</xref>; <xref ref-type="bibr" rid="B124">Sammler et al., 2015</xref>; <xref ref-type="bibr" rid="B62">Hellbernd and Sammler, 2018</xref></td>
</tr>
<tr>
<td valign="top" align="left">Bilateral lateral hPT</td>
<td valign="top" align="left">White matter connections to mid- and posterior upper bank of STS</td>
<td valign="top" align="left"><xref ref-type="bibr" rid="B11">Beer et al., 2013</xref></td>
</tr>
<tr>
<td valign="top" align="left">Bilateral mid-lateral STG</td>
<td valign="top" align="left">White matter connections to mid-upper bank of STS</td>
<td valign="top" align="left"><xref ref-type="bibr" rid="B11">Beer et al., 2013</xref></td>
</tr>
<tr>
<td valign="top" align="left">Bil. motion-selective portions of hPT</td>
<td valign="top" align="left">White matter connections to motion-selective hMT</td>
<td valign="top" align="left"><xref ref-type="bibr" rid="B50">Gurtubay-Antolin et al., 2021</xref></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p>STS, superior temporal sulcus; MT, middle temporal visual area; hPT, human planum temporale; TVA, temporal voice area; STG, superior temporal gyrus; Bil., Bilateral.</p></fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="S4">
<title>Toward a neurocognitive model of face-to-face communication</title>
<p>In the following section, we attempt to reconcile the psycholinguistic (<xref ref-type="bibr" rid="B70">Holler and Levinson, 2019</xref>) and sensory neuroscience (<xref ref-type="bibr" rid="B111">Pitcher and Ungerleider, 2021</xref>) frameworks, reviewed so far, toward a coherent neurocognitive model of multimodal face-to-face communication. Accordingly, we propose how key computational principles underlying the perception of multimodal social actions (multiplex signals, multimodal gestalts, and multilevel predictions) might be implemented along the LPP (<xref ref-type="fig" rid="F1">Figure 1C</xref>).</p>
<sec id="S4.SS1">
<title>Detecting multimodal co-occurrences: Multiplex signals <italic>via</italic> upstream sensory regions</title>
<p>Traditionally, it was thought that multimodal integration takes place in higher-order polysensory areas such as parietal or prefrontal cortices, after unimodal processing in early sensory regions (<xref ref-type="bibr" rid="B144">Ungerleider and Mishkin, 1982</xref>; <xref ref-type="bibr" rid="B117">Rauschecker and Tian, 2000</xref>); however, accumulating evidence over the past two decades shows clear cross-modal interactions between early sensory areas (<xref ref-type="bibr" rid="B36">Foxe and Schroeder, 2005</xref>; <xref ref-type="bibr" rid="B46">Ghazanfar and Schroeder, 2006</xref>; <xref ref-type="bibr" rid="B75">Kayser and Logothetis, 2007</xref>; <xref ref-type="bibr" rid="B31">Driver and Noesselt, 2008</xref>). In fact, several studies with humans (<xref ref-type="bibr" rid="B35">Foxe et al., 2000</xref>, <xref ref-type="bibr" rid="B37">2002</xref>; <xref ref-type="bibr" rid="B129">Sch&#x00FC;rmann et al., 2006</xref>; <xref ref-type="bibr" rid="B92">Martuzzi et al., 2007</xref>; <xref ref-type="bibr" rid="B14">Besle et al., 2008</xref>; <xref ref-type="bibr" rid="B87">Lewis and Noppeney, 2010</xref>) and primates (<xref ref-type="bibr" rid="B128">Schroeder et al., 2001</xref>; <xref ref-type="bibr" rid="B43">Fu et al., 2003</xref>; <xref ref-type="bibr" rid="B77">Kayser et al., 2005</xref>, <xref ref-type="bibr" rid="B78">2008</xref>; <xref ref-type="bibr" rid="B82">Lakatos et al., 2007</xref>) have proved driving or modulatory effects of cross-modal stimuli at the bottom of the sensory processing hierarchy. Beyond identifying multimodal interactions, such evidence also revealed their ubiquity across the (sub)cortical hierarchy and called for the need to further characterise the computational principles, neural properties and behavioural relevance of these interactions. One possibility is that they differ at different processing stages (i.e., <italic>multistage integration</italic>) along the (sub)cortical hierarchy (<xref ref-type="bibr" rid="B20">Calvert and Thesen, 2004</xref>; <xref ref-type="bibr" rid="B98">Noppeney et al., 2018</xref>; <xref ref-type="bibr" rid="B97">Noppeney, 2021</xref>).</p>
<p>Since visual bodily signals typically precede speech during natural face-to-face interactions (<xref ref-type="bibr" rid="B99">Nota et al., 2021</xref>), they may modulate the sound-induced activity in the auditory cortex by resetting the phase of ongoing oscillations (<xref ref-type="bibr" rid="B15">Biau et al., 2015</xref>; <xref ref-type="bibr" rid="B93">M&#x00E9;gevand et al., 2020</xref>; <xref ref-type="bibr" rid="B113">Pouw et al., 2021</xref>). In support of a temporally-sensitive mechanism, neurophysiological (<xref ref-type="bibr" rid="B76">Kayser et al., 2010</xref>; <xref ref-type="bibr" rid="B5">Atilgan et al., 2018</xref>), and fMRI studies (<xref ref-type="bibr" rid="B87">Lewis and Noppeney, 2010</xref>; <xref ref-type="bibr" rid="B153">Werner and Noppeney, 2011</xref>) have shown that audiovisual interactions in early auditory cortex and hPT depended on audiovisual temporal coincidence or coherence over time. Sensitivity to temporal co-occurrences is crucial to multiplex signals, which rest on temporal statistical regularities across sensory channels at a perceptual, pre-semantic level (<xref ref-type="bibr" rid="B70">Holler and Levinson, 2019</xref>). Therefore, it seems plausible that upstream sensory regions (e.g., visual and auditory cortices) interact in a temporally-sensitive fashion at corresponding processing stages (i.e., <italic>via</italic> multistage integration) to implement multiplex signals [see also <xref ref-type="bibr" rid="B16">Bizley et al. (2016)</xref>]. Specifically, it may be that primary visual and auditory cortices concur to support the automatic, salience-driven detection of multimodal co-occurrences, while secondary visual and auditory cortices along the LPP (hMT/EBA and hPT/TVA) concur to represent dynamic aspects of audiovisual bodily signals, mirroring results outside the realm of face-to-face communication (<xref ref-type="bibr" rid="B87">Lewis and Noppeney, 2010</xref>).</p>
</sec>
<sec id="S4.SS2">
<title>Recognizing communicative meanings: Multimodal gestalts <italic>via</italic> pSTS</title>
<p>As reviewed above, upstream visual and auditory sensory regions are structurally and functionally interconnected with pSTS. Crucially, this region represents a site of multimodal integration of social and non-social sensory information, as shown in neuroimaging and neurophysiological studies with humans (<xref ref-type="bibr" rid="B8">Beauchamp, 2005</xref>; <xref ref-type="bibr" rid="B10">Beauchamp et al., 2008</xref>; <xref ref-type="bibr" rid="B152">Werner and Noppeney, 2010a</xref>,<xref ref-type="bibr" rid="B151">b</xref>; <xref ref-type="bibr" rid="B65">Hirsch et al., 2018</xref>; <xref ref-type="bibr" rid="B96">Noah et al., 2020</xref>) and non-human primates (<xref ref-type="bibr" rid="B47">Ghazanfar et al., 2008</xref>; <xref ref-type="bibr" rid="B42">Froesel et al., 2021</xref>). While these studies employed non-linguistic but meaningful world categories such as animals, manipulable objects, and human actions, pSTS is also involved in the processing of communicative and meaningful audiovisual stimuli such as lip-speech (<xref ref-type="bibr" rid="B90">MacSweeney et al., 2000</xref>; <xref ref-type="bibr" rid="B155">Wright, 2003</xref>; <xref ref-type="bibr" rid="B89">Macaluso et al., 2004</xref>; <xref ref-type="bibr" rid="B145">van Atteveldt et al., 2004</xref>; <xref ref-type="bibr" rid="B137">Stevenson and James, 2009</xref>; <xref ref-type="bibr" rid="B114">Price, 2012</xref>; <xref ref-type="bibr" rid="B146">Venezia et al., 2017</xref>) and gesture-speech (<xref ref-type="bibr" rid="B66">Holle et al., 2008</xref>, <xref ref-type="bibr" rid="B67">2010</xref>; <xref ref-type="bibr" rid="B71">Hubbard et al., 2009</xref>; <xref ref-type="bibr" rid="B154">Willems et al., 2016</xref>). Consistently, multimodal integration in pSTS may allow the creation of meaningful neural representations (<xref ref-type="bibr" rid="B9">Beauchamp et al., 2004</xref>; <xref ref-type="bibr" rid="B98">Noppeney et al., 2018</xref>), including those bearing semantic and pragmatic values for social communication (i.e., multimodal gestalts; <xref ref-type="bibr" rid="B70">Holler and Levinson, 2019</xref>). In particular, we propose that pSTS might concur toward such (multimodal) neural representations based on Bayesian Causal Inference principles (<xref ref-type="bibr" rid="B79">K&#x00F6;rding et al., 2007</xref>; <xref ref-type="bibr" rid="B132">Shams and Beierholm, 2010</xref>; <xref ref-type="bibr" rid="B97">Noppeney, 2021</xref>), mirroring effects found along the dorsal audiovisual pathways for spatial localisation (<xref ref-type="bibr" rid="B121">Rohe and Noppeney, 2015</xref>, <xref ref-type="bibr" rid="B122">2016</xref>; <xref ref-type="bibr" rid="B2">Aller and Noppeney, 2019</xref>; <xref ref-type="bibr" rid="B32">Ferrari and Noppeney, 2021</xref>).</p>
<p>Intriguingly, pSTS is positioned at the intersection of three brain systems respectively responsible for social perception, action observation, and theory of mind (<xref ref-type="bibr" rid="B157">Yang et al., 2015</xref>). As noticed by <xref ref-type="bibr" rid="B111">Pitcher and Ungerleider (2021)</xref>, perceptual analysis of goal-directed actions in the pSTS likely influences activity in parietal and frontal systems that are responsible for action and intention recognition. As such, after receiving converging inputs from upstream sensory regions of the LPP, pSTS may represent the sensory processing gateway that feeds to higher-order networks for social action recognition during face-to-face communication. As a result, multiplex signals may be processed at the semantic and pragmatic levels, enabling the recognition of multimodal gestalts (<xref ref-type="bibr" rid="B70">Holler and Levinson, 2019</xref>).</p>
</sec>
<sec id="S4.SS3">
<title>Predicting how the conversation unfolds: Multilevel predictions along the cortical hierarchy</title>
<p>Increasing evidence shows that humans, among other species, build on their past experiences to construct predictive models of themselves and their sensory environment (<xref ref-type="bibr" rid="B26">de Lange et al., 2018</xref>). Accordingly, the brain can be conceived as a &#x201C;prediction machine&#x201D; (<xref ref-type="bibr" rid="B23">Clark, 2013</xref>) that attempts to match bottom-up sensory inputs with top-down expectations. Following hierarchical predictive coding (<xref ref-type="bibr" rid="B115">Rao and Ballard, 1999</xref>; <xref ref-type="bibr" rid="B39">Friston, 2005</xref>, <xref ref-type="bibr" rid="B40">2010</xref>), any mismatch between expectation and actual input is signalled as a prediction error that propagates up the processing hierarchy to higher-level areas; vice versa, expected inputs are &#x201C;explained away,&#x201D; resulting in &#x201C;expectation suppression&#x201D; (<xref ref-type="bibr" rid="B138">Summerfield et al., 2008</xref>; <xref ref-type="bibr" rid="B1">Alink et al., 2010</xref>; <xref ref-type="bibr" rid="B120">Richter et al., 2018</xref>; <xref ref-type="bibr" rid="B150">Walsh et al., 2020</xref>). Importantly, expectation suppression reflects the neural tuning properties along a given processing hierarchy. For example, predictions about visual object and face identity are associated with expectation suppression respectively in object-selective regions (<xref ref-type="bibr" rid="B94">Meyer and Olson, 2011</xref>; <xref ref-type="bibr" rid="B74">Kaposvari et al., 2018</xref>; <xref ref-type="bibr" rid="B120">Richter et al., 2018</xref>; <xref ref-type="bibr" rid="B33">Ferrari et al., 2022</xref>; <xref ref-type="bibr" rid="B58">He et al., 2022</xref>) and face-selective regions (<xref ref-type="bibr" rid="B138">Summerfield et al., 2008</xref>; <xref ref-type="bibr" rid="B4">Amado et al., 2016</xref>; <xref ref-type="bibr" rid="B130">Schwiedrzik and Freiwald, 2017</xref>) along the ventral visual stream [for corresponding effects in the auditory domain, see e.g., <xref ref-type="bibr" rid="B73">Jaramillo and Zador (2011)</xref>, <xref ref-type="bibr" rid="B140">Todorovic et al. (2011)</xref>, <xref ref-type="bibr" rid="B6">Barascud et al. (2016)</xref>, <xref ref-type="bibr" rid="B60">Heilbron and Chait (2018)</xref>].</p>
<p>Similarly, multilevel predictions during face-to-face interactions (<xref ref-type="bibr" rid="B70">Holler and Levinson, 2019</xref>) may be implemented <italic>via</italic> mechanisms of hierarchical predictive processing in neural pathways that are responsible for coding the relevant sensory information (e.g., vocal sounds, bodily movements) and linguistic information (e.g., words, sentential units, social actions). Increasing evidence shows signatures of hierarchical predictive processing during language comprehension in left-lateralized fronto-temporal regions of the language network (<xref ref-type="bibr" rid="B17">Blank and Davis, 2016</xref>; <xref ref-type="bibr" rid="B134">Sohoglu and Davis, 2016</xref>; <xref ref-type="bibr" rid="B154">Willems et al., 2016</xref>; <xref ref-type="bibr" rid="B127">Schmitt et al., 2021</xref>; <xref ref-type="bibr" rid="B59">Heilbron et al., 2022</xref>). Accordingly, predictive processing mechanisms may implement multimodal sensory predictions relevant to face-to-face interactions along the cortical hierarchy of the LPP. Initial evidence shows that hMT and pSTS activity is reduced in response to expected than unexpected visual actions (<xref ref-type="bibr" rid="B80">Koster-Hale and Saxe, 2013</xref>), such as human movements violating biomechanical predictions (<xref ref-type="bibr" rid="B24">Costantini et al., 2005</xref>; <xref ref-type="bibr" rid="B125">Saygin et al., 2012</xref>). Further, pSTS activity is reduced in response to actions that fit rather than violate the spatiotemporal structure of the environment (<xref ref-type="bibr" rid="B80">Koster-Hale and Saxe, 2013</xref>), such as shifting head and gaze toward rather than away an abrupt warning signal (<xref ref-type="bibr" rid="B104">Pelphrey et al., 2003</xref>). Interestingly, there is evidence of a functional dissociation between hMT and pSTS, with only the latter being sensitive to violations of action intentions (<xref ref-type="bibr" rid="B105">Pelphrey et al., 2004</xref>). Such dissociation is suggestive of a hierarchy of computations from sensory processing of dynamic inputs in hMT (at the level of multiplex signals) to semantic and pragmatic analysis in pSTS (at the level of multimodal gestalts), which may then be reflected in the respective expectation suppression profiles. Yet, it remains an open question whether and how multimodal (e.g., audiovisual) predictions arising from face-to-face interactions generate neural signatures of hierarchical predictive processing along the entire LPP, down to upstream sensory regions [for complementary evidence, see <xref ref-type="bibr" rid="B84">Lee and Noppeney (2014)</xref>]. Further, it is unknown whether and how higher-order expectations from language, action recognition and theory of mind networks may feed-back to pSTS (<xref ref-type="bibr" rid="B157">Yang et al., 2015</xref>) and thus travel down the LPP.</p>
</sec>
</sec>
<sec id="S5" sec-type="discussion|conclusion">
<title>Discussion and conclusion</title>
<p>The current proposal leaves many aspects of the model un- or under-specified, including issues of hemispheric lateralization (<xref ref-type="bibr" rid="B111">Pitcher and Ungerleider, 2021</xref>) and the exact relationship between LPP and brain networks responsible for language (<xref ref-type="bibr" rid="B63">Hickok and Poeppel, 2000</xref>, <xref ref-type="bibr" rid="B64">2007</xref>; <xref ref-type="bibr" rid="B38">Friederici, 2012</xref>; <xref ref-type="bibr" rid="B54">Hagoort, 2019</xref>), action recognition (<xref ref-type="bibr" rid="B88">Lingnau and Downing, 2015</xref>; <xref ref-type="bibr" rid="B156">Wurm and Caramazza, 2022</xref>), and theory of mind (<xref ref-type="bibr" rid="B41">Frith and Frith, 2006</xref>; <xref ref-type="bibr" rid="B91">Mar, 2011</xref>; <xref ref-type="bibr" rid="B126">Schaafsma et al., 2015</xref>). Future research must provide direct empirical evidence to support our framework, as well as refine and enrich it at the algorithmic and neural levels. To start, neuroimaging and neurostimulation techniques may characterise the functional and representational properties of the LPP as proposed here, as well as its degree of lateralization and interconnection with other brain networks (<xref ref-type="bibr" rid="B139">Thiebaut de Schotten and Forkel, 2022</xref>). Further, it will be crucial to combine these techniques with methodological approaches that enable human motion-tracking and near-to-optimal preservation of naturalistic, ecological contexts of face-to-face social interactions, such as virtual reality (<xref ref-type="bibr" rid="B103">Peeters, 2019</xref>). Complementarily, hyperscanning (<xref ref-type="bibr" rid="B119">Redcay and Schilbach, 2019</xref>; <xref ref-type="bibr" rid="B55">Hamilton, 2021</xref>) and multibrain stimulation techniques (<xref ref-type="bibr" rid="B100">Novembre and Iannetti, 2021</xref>) will be necessary to probe the functional relevance of the LPP during multimodal face-to-face processing across interacting brains. In parallel, the use of computational models (e.g., Bayesian Causal Inference) and neuroscientific-inspired artificial intelligence (i.e., convolutional or deep neural networks) could formalise the empirical evidence and test its role (e.g., necessity, sufficiency) for human behaviour (<xref ref-type="bibr" rid="B56">Hassabis et al., 2017</xref>) during face-to-face interactions. Last, but not least, it will be crucial to further embrace an interdisciplinary perspective in which psycholinguistics and neuroscientific frameworks would be reciprocally validated.</p>
<p>We conclude that the time is mature to accept the challenge we, among others before, advocated in this perspective and move beyond the speech-centred perspective dominating research on the neurocognitive mechanisms of human communication and language. We offer an original perspective bridging two recent propositions in psycholinguistics (<xref ref-type="bibr" rid="B70">Holler and Levinson, 2019</xref>) and sensory neuroscience (<xref ref-type="bibr" rid="B111">Pitcher and Ungerleider, 2021</xref>) into a neurocognitive model of multimodal face-to-face communication. Testing this framework represents a novel and promising endeavour for future research.</p>
</sec>
<sec id="S6" sec-type="author-contributions">
<title>Author contributions</title>
<p>SB and AF contributed equally to the original conception of the perspective and wrote the first draft of the manuscript. FP contributed to further developing the preliminary conception. All authors contributed to manuscript revision, read, and approved the submitted version.</p>
</sec>
</body>
<back>
<sec id="S7" sec-type="funding-information">
<title>Funding</title>
<p>SB was supported by a &#x201C;Starting Grant DM 737/21&#x201D; from the University of Trento (R06). SB and FP were supported by a &#x201C;Progetto di Rilevante Interesse Nazionale (PRIN)&#x201D; from the Italian Ministry for Education, University and Research (MIUR-PRIN 2017 n.20177894ZH).</p>
</sec>
<ack><p>We would like to express our gratitude to Eugenio Parise for providing insightful comments on the preliminary version of this perspective.</p>
</ack>
<sec id="S8" sec-type="COI-statement">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The reviewer JT declared a shared parent affiliation with the author AF, and the handling editor declared a past collaboration with the author SB, at the time of review.</p>
</sec>
<sec id="S9" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Alink</surname> <given-names>A.</given-names></name> <name><surname>Schwiedrzik</surname> <given-names>C. M.</given-names></name> <name><surname>Kohler</surname> <given-names>A.</given-names></name> <name><surname>Singer</surname> <given-names>W.</given-names></name> <name><surname>Muckli</surname> <given-names>L.</given-names></name></person-group> (<year>2010</year>). <article-title>Stimulus predictability reduces responses in primary visual cortex.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>30</volume> <fpage>2960</fpage>&#x2013;<lpage>2966</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.3730-10.2010</pub-id> <pub-id pub-id-type="pmid">20181593</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Aller</surname> <given-names>M.</given-names></name> <name><surname>Noppeney</surname> <given-names>U.</given-names></name></person-group> (<year>2019</year>). <article-title>To integrate or not to integrate: Temporal dynamics of hierarchical Bayesian causal inference.</article-title> <source><italic>PLoS Biol.</italic></source> <volume>17</volume>:<issue>e3000210</issue>. <pub-id pub-id-type="doi">10.1371/journal.pbio.3000210</pub-id> <pub-id pub-id-type="pmid">30939128</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Allison</surname> <given-names>T.</given-names></name> <name><surname>Puce</surname> <given-names>A.</given-names></name> <name><surname>McCarthy</surname> <given-names>G.</given-names></name></person-group> (<year>2000</year>). <article-title>Social perception from visual cues: Role of the STS region.</article-title> <source><italic>Trends Cogn. Sci.</italic></source> <volume>4</volume> <fpage>267</fpage>&#x2013;<lpage>278</lpage>. <pub-id pub-id-type="doi">10.1016/S1364-6613(00)01501-1</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Amado</surname> <given-names>C.</given-names></name> <name><surname>Hermann</surname> <given-names>P.</given-names></name> <name><surname>Kov&#x00E1;cs</surname> <given-names>P.</given-names></name> <name><surname>Grotheer</surname> <given-names>M.</given-names></name> <name><surname>Vidny&#x00E1;nszky</surname> <given-names>Z.</given-names></name> <name><surname>Kov&#x00E1;cs</surname> <given-names>G.</given-names></name></person-group> (<year>2016</year>). <article-title>The contribution of surprise to the prediction based modulation of fMRI responses.</article-title> <source><italic>Neuropsychologia</italic></source> <volume>84</volume> <fpage>105</fpage>&#x2013;<lpage>112</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuropsychologia.2016.02.003</pub-id> <pub-id pub-id-type="pmid">26873275</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Atilgan</surname> <given-names>H.</given-names></name> <name><surname>Town</surname> <given-names>S. M.</given-names></name> <name><surname>Wood</surname> <given-names>K. C.</given-names></name> <name><surname>Jones</surname> <given-names>G. P.</given-names></name> <name><surname>Maddox</surname> <given-names>R. K.</given-names></name> <name><surname>Lee</surname> <given-names>A. K. C.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Integration of visual information in auditory cortex promotes auditory scene analysis through multisensory binding.</article-title> <source><italic>Neuron</italic></source> <volume>97</volume> <fpage>640</fpage>&#x2013;<lpage>655.e4</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2017.12.034</pub-id> <pub-id pub-id-type="pmid">29395914</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Barascud</surname> <given-names>N.</given-names></name> <name><surname>Pearce</surname> <given-names>M. T.</given-names></name> <name><surname>Griffiths</surname> <given-names>T. D.</given-names></name> <name><surname>Friston</surname> <given-names>K. J.</given-names></name> <name><surname>Chait</surname> <given-names>M.</given-names></name></person-group> (<year>2016</year>). <article-title>Brain responses in humans reveal ideal observer-like sensitivity to complex acoustic patterns.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>113</volume>:<fpage>E616</fpage>&#x2013;<lpage>E625</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1508523113</pub-id> <pub-id pub-id-type="pmid">26787854</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Battal</surname> <given-names>C.</given-names></name> <name><surname>Rezk</surname> <given-names>M.</given-names></name> <name><surname>Mattioni</surname> <given-names>S.</given-names></name> <name><surname>Vadlamudi</surname> <given-names>J.</given-names></name> <name><surname>Collignon</surname> <given-names>O.</given-names></name></person-group> (<year>2019</year>). <article-title>Representation of auditory motion directions and sound source locations in the human planum temporale.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>39</volume> <fpage>2208</fpage>&#x2013;<lpage>2220</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.2289-18.2018</pub-id> <pub-id pub-id-type="pmid">30651333</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Beauchamp</surname> <given-names>M. S.</given-names></name></person-group> (<year>2005</year>). <article-title>See me, hear me, touch me: Multisensory integration in lateral occipital-temporal cortex.</article-title> <source><italic>Curr. Opin. Neurobiol.</italic></source> <volume>15</volume> <fpage>145</fpage>&#x2013;<lpage>153</lpage>. <pub-id pub-id-type="doi">10.1016/j.conb.2005.03.011</pub-id> <pub-id pub-id-type="pmid">15831395</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Beauchamp</surname> <given-names>M. S.</given-names></name> <name><surname>Argall</surname> <given-names>B. D.</given-names></name> <name><surname>Bodurka</surname> <given-names>J.</given-names></name> <name><surname>Duyn</surname> <given-names>J. H.</given-names></name> <name><surname>Martin</surname> <given-names>A.</given-names></name></person-group> (<year>2004</year>). <article-title>Unraveling multisensory integration: Patchy organization within human STS multisensory cortex.</article-title> <source><italic>Nat. Neurosci.</italic></source> <volume>7</volume> <fpage>1190</fpage>&#x2013;<lpage>1192</lpage>. <pub-id pub-id-type="doi">10.1038/nn1333</pub-id> <pub-id pub-id-type="pmid">15475952</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Beauchamp</surname> <given-names>M. S.</given-names></name> <name><surname>Yasar</surname> <given-names>N. E.</given-names></name> <name><surname>Frye</surname> <given-names>R. E.</given-names></name> <name><surname>Ro</surname> <given-names>T.</given-names></name></person-group> (<year>2008</year>). <article-title>Touch, sound and vision in human superior temporal sulcus.</article-title> <source><italic>Neuroimage</italic></source> <volume>41</volume> <fpage>1011</fpage>&#x2013;<lpage>1020</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuroimage.2008.03.015</pub-id> <pub-id pub-id-type="pmid">18440831</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Beer</surname> <given-names>A. L.</given-names></name> <name><surname>Plank</surname> <given-names>T.</given-names></name> <name><surname>Meyer</surname> <given-names>G.</given-names></name> <name><surname>Greenlee</surname> <given-names>M. W.</given-names></name></person-group> (<year>2013</year>). <article-title>Combined diffusion-weighted and functional magnetic resonance imaging reveals a temporal-occipital network involved in auditory-visual object processing.</article-title> <source><italic>Front. Integr. Neurosci.</italic></source> <volume>7</volume>:<issue>5</issue>. <pub-id pub-id-type="doi">10.3389/fnint.2013.00005</pub-id> <pub-id pub-id-type="pmid">23407860</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Belyk</surname> <given-names>M.</given-names></name> <name><surname>Brown</surname> <given-names>S.</given-names></name></person-group> (<year>2014</year>). <article-title>Perception of affective and linguistic prosody: An ALE meta-analysis of neuroimaging studies.</article-title> <source><italic>Soc. Cogn. Affect. Neurosci.</italic></source> <volume>9</volume> <fpage>1395</fpage>&#x2013;<lpage>1403</lpage>. <pub-id pub-id-type="doi">10.1093/scan/nst124</pub-id> <pub-id pub-id-type="pmid">23934416</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Benetti</surname> <given-names>S.</given-names></name> <name><surname>Collignon</surname> <given-names>O.</given-names></name></person-group> (<year>2022</year>). <article-title>Cross-modal integration and plasticity in the superior temporal cortex.</article-title> <source><italic>Handb. Clin. Neurol.</italic></source> <volume>187</volume> <fpage>127</fpage>&#x2013;<lpage>143</lpage>. <pub-id pub-id-type="doi">10.1016/B978-0-12-823493-8.00026-2</pub-id> <pub-id pub-id-type="pmid">35964967</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Besle</surname> <given-names>J.</given-names></name> <name><surname>Fischer</surname> <given-names>C.</given-names></name> <name><surname>Bidet-Caulet</surname> <given-names>A.</given-names></name> <name><surname>Lecaignard</surname> <given-names>F.</given-names></name> <name><surname>Bertrand</surname> <given-names>O.</given-names></name> <name><surname>Giard</surname> <given-names>M.-H.</given-names></name></person-group> (<year>2008</year>). <article-title>Visual activation and audiovisual interactions in the auditory cortex during speech perception: Intracranial recordings in humans.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>28</volume> <fpage>14301</fpage>&#x2013;<lpage>14310</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.2875-08.2008</pub-id> <pub-id pub-id-type="pmid">19109511</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Biau</surname> <given-names>E.</given-names></name> <name><surname>Torralba</surname> <given-names>M.</given-names></name> <name><surname>Fuentemilla</surname> <given-names>L.</given-names></name> <name><surname>de Diego Balaguer</surname> <given-names>R.</given-names></name> <name><surname>Soto-Faraco</surname> <given-names>S.</given-names></name></person-group> (<year>2015</year>). <article-title>Speaker&#x2019;s hand gestures modulate speech perception through phase resetting of ongoing neural oscillations.</article-title> <source><italic>Cortex</italic></source> <volume>68</volume> <fpage>76</fpage>&#x2013;<lpage>85</lpage>. <pub-id pub-id-type="doi">10.1016/j.cortex.2014.11.018</pub-id> <pub-id pub-id-type="pmid">25595613</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bizley</surname> <given-names>J. K.</given-names></name> <name><surname>Maddox</surname> <given-names>R. K.</given-names></name> <name><surname>Lee</surname> <given-names>A. K. C.</given-names></name></person-group> (<year>2016</year>). <article-title>Defining auditory-visual objects: Behavioral tests and physiological mechanisms.</article-title> <source><italic>Trends Neurosci.</italic></source> <volume>39</volume> <fpage>74</fpage>&#x2013;<lpage>85</lpage>. <pub-id pub-id-type="doi">10.1016/j.tins.2015.12.007</pub-id> <pub-id pub-id-type="pmid">26775728</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Blank</surname> <given-names>H.</given-names></name> <name><surname>Davis</surname> <given-names>M. H.</given-names></name></person-group> (<year>2016</year>). <article-title>Prediction Errors but Not Sharpened Signals Simulate Multivoxel fMRI Patterns during Speech Perception.</article-title> <source><italic>PLoS Biol.</italic></source> <volume>14</volume>:<issue>e1002577</issue>. <pub-id pub-id-type="doi">10.1371/journal.pbio.1002577</pub-id> <pub-id pub-id-type="pmid">27846209</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Boussaoud</surname> <given-names>D.</given-names></name> <name><surname>Ungerleider</surname> <given-names>L. G.</given-names></name> <name><surname>Desimone</surname> <given-names>R.</given-names></name></person-group> (<year>1990</year>). <article-title>Pathways for motion analysis: Cortical connections of the medial superior temporal and fundus of the superior temporal visual areas in the macaque.</article-title> <source><italic>J. Comp. Neurol.</italic></source> <volume>296</volume> <fpage>462</fpage>&#x2013;<lpage>495</lpage>. <pub-id pub-id-type="doi">10.1002/CNE.902960311</pub-id> <pub-id pub-id-type="pmid">2358548</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bruce</surname> <given-names>C.</given-names></name> <name><surname>Desimone</surname> <given-names>R.</given-names></name> <name><surname>Gross</surname> <given-names>C. G.</given-names></name></person-group> (<year>1981</year>). <article-title>Visual properties of neurons in a polysensory area in superior temporal sulcus of the macaque.</article-title> <source><italic>J. Neurophysiol.</italic></source> <volume>46</volume> <fpage>369</fpage>&#x2013;<lpage>384</lpage>. <pub-id pub-id-type="doi">10.1152/jn.1981.46.2.369</pub-id> <pub-id pub-id-type="pmid">6267219</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Calvert</surname> <given-names>G. A.</given-names></name> <name><surname>Thesen</surname> <given-names>T.</given-names></name></person-group> (<year>2004</year>). <article-title>Multisensory integration: Methodological approaches and emerging principles in the human brain.</article-title> <source><italic>J. Physiol. Paris</italic></source> <volume>98</volume> <fpage>191</fpage>&#x2013;<lpage>205</lpage>. <pub-id pub-id-type="doi">10.1016/j.jphysparis.2004.03.018</pub-id> <pub-id pub-id-type="pmid">15477032</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cameron-Faulkner</surname> <given-names>T.</given-names></name> <name><surname>Theakston</surname> <given-names>A.</given-names></name> <name><surname>Lieven</surname> <given-names>E.</given-names></name> <name><surname>Tomasello</surname> <given-names>M.</given-names></name></person-group> (<year>2015</year>). <article-title>The relationship between infant holdout and gives, and pointing.</article-title> <source><italic>Infancy</italic></source> <volume>20</volume> <fpage>576</fpage>&#x2013;<lpage>586</lpage>. <pub-id pub-id-type="doi">10.1111/infa.12085</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>L.</given-names></name> <name><surname>Vroomen</surname> <given-names>J.</given-names></name></person-group> (<year>2013</year>). <article-title>Intersensory binding across space and time: A tutorial review.</article-title> <source><italic>Atten. Percept. Psychophys.</italic></source> <volume>75</volume> <fpage>790</fpage>&#x2013;<lpage>811</lpage>. <pub-id pub-id-type="doi">10.3758/s13414-013-0475-4</pub-id> <pub-id pub-id-type="pmid">23709064</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clark</surname> <given-names>A.</given-names></name></person-group> (<year>2013</year>). <article-title>Whatever next? predictive brains, situated agents, and the future of cognitive science.</article-title> <source><italic>Behav. Brain Sci.</italic></source> <volume>36</volume> <fpage>181</fpage>&#x2013;<lpage>204</lpage>. <pub-id pub-id-type="doi">10.1017/S0140525X12000477</pub-id> <pub-id pub-id-type="pmid">23663408</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Costantini</surname> <given-names>M.</given-names></name> <name><surname>Galati</surname> <given-names>G.</given-names></name> <name><surname>Ferretti</surname> <given-names>A.</given-names></name> <name><surname>Caulo</surname> <given-names>M.</given-names></name> <name><surname>Tartaro</surname> <given-names>A.</given-names></name> <name><surname>Romani</surname> <given-names>G. L.</given-names></name><etal/></person-group> (<year>2005</year>). <article-title>Neural systems underlying observation of humanly impossible movements: An FMRI study.</article-title> <source><italic>Cereb. Cortex</italic></source> <volume>15</volume> <fpage>1761</fpage>&#x2013;<lpage>1767</lpage>. <pub-id pub-id-type="doi">10.1093/cercor/bhi053</pub-id> <pub-id pub-id-type="pmid">15728741</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de la Mothe</surname> <given-names>L. A.</given-names></name> <name><surname>Blumell</surname> <given-names>S.</given-names></name> <name><surname>Kajikawa</surname> <given-names>Y.</given-names></name> <name><surname>Hackett</surname> <given-names>T. A.</given-names></name></person-group> (<year>2006</year>). <article-title>Cortical connections of the auditory cortex in marmoset monkeys: Core and medial belt regions.</article-title> <source><italic>J. Comp. Neurol.</italic></source> <volume>496</volume> <fpage>27</fpage>&#x2013;<lpage>71</lpage>. <pub-id pub-id-type="doi">10.1002/cne.20923</pub-id> <pub-id pub-id-type="pmid">16528722</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Lange</surname> <given-names>F. P.</given-names></name> <name><surname>Heilbron</surname> <given-names>M.</given-names></name> <name><surname>Kok</surname> <given-names>P.</given-names></name></person-group> (<year>2018</year>). <article-title>How do expectations shape perception?</article-title> <source><italic>Trends Cogn. Sci.</italic></source> <volume>22</volume> <fpage>764</fpage>&#x2013;<lpage>779</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2018.06.002</pub-id> <pub-id pub-id-type="pmid">30122170</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Deen</surname> <given-names>B.</given-names></name> <name><surname>Saxe</surname> <given-names>R.</given-names></name> <name><surname>Kanwisher</surname> <given-names>N.</given-names></name></person-group> (<year>2020</year>). <article-title>Processing communicative facial and vocal cues in the superior temporal sulcus.</article-title> <source><italic>Neuroimage</italic></source> <volume>221</volume>:<issue>117191</issue>. <pub-id pub-id-type="doi">10.1016/J.NEUROIMAGE.2020.117191</pub-id> <pub-id pub-id-type="pmid">32711066</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Desimone</surname> <given-names>R.</given-names></name> <name><surname>Ungerleider</surname> <given-names>L. G.</given-names></name></person-group> (<year>1986</year>). <article-title>Multiple visual areas in the caudal superior temporal sulcus of the macaque.</article-title> <source><italic>J. Comp. Neurol.</italic></source> <volume>248</volume> <fpage>164</fpage>&#x2013;<lpage>189</lpage>. <pub-id pub-id-type="doi">10.1002/cne.902480203</pub-id> <pub-id pub-id-type="pmid">3722457</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Diederich</surname> <given-names>N. J.</given-names></name> <name><surname>F&#x00E9;nelon</surname> <given-names>G.</given-names></name> <name><surname>Stebbins</surname> <given-names>G.</given-names></name> <name><surname>Goetz</surname> <given-names>C. G.</given-names></name></person-group> (<year>2009</year>). <article-title>Hallucinations in Parkinson disease.</article-title> <source><italic>Nat. Rev. Neurol.</italic></source> <volume>5</volume> <fpage>331</fpage>&#x2013;<lpage>342</lpage>.</citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Drijvers</surname> <given-names>L.</given-names></name> <name><surname>Holler</surname> <given-names>J.</given-names></name></person-group> (<year>2022</year>). <article-title>The multimodal facilitation effect in human communication.</article-title> <source><italic>Psychon. Bull. Rev.</italic></source> <comment>[Epub ahead of print]</comment>. <pub-id pub-id-type="doi">10.3758/S13423-022-02178-X</pub-id> <pub-id pub-id-type="pmid">36138282</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Driver</surname> <given-names>J.</given-names></name> <name><surname>Noesselt</surname> <given-names>T.</given-names></name></person-group> (<year>2008</year>). <article-title>Multisensory interplay reveals crossmodal influences on &#x201C;sensory-specific&#x201D; brain regions, neural responses, and judgments.</article-title> <source><italic>Neuron</italic></source> <volume>57</volume> <fpage>11</fpage>&#x2013;<lpage>23</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2007.12.013</pub-id> <pub-id pub-id-type="pmid">18184561</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ferrari</surname> <given-names>A.</given-names></name> <name><surname>Noppeney</surname> <given-names>U.</given-names></name></person-group> (<year>2021</year>). <article-title>Attention controls multisensory perception via two distinct mechanisms at different levels of the cortical hierarchy.</article-title> <source><italic>PLoS Biol.</italic></source> <volume>19</volume>:<issue>e3001465</issue>. <pub-id pub-id-type="doi">10.1371/journal.pbio.3001465</pub-id> <pub-id pub-id-type="pmid">34793436</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ferrari</surname> <given-names>A.</given-names></name> <name><surname>Richter</surname> <given-names>D.</given-names></name> <name><surname>de Lange</surname> <given-names>F. P.</given-names></name></person-group> (<year>2022</year>). <article-title>Updating contextual sensory expectations for adaptive behaviour.</article-title> <source><italic>J. Neurosci</italic>.</source> <volume>42</volume> <fpage>8855</fpage>&#x2013;<lpage>8869</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.1107-22.2022</pub-id> <pub-id pub-id-type="pmid">36280262</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Finzi</surname> <given-names>D.</given-names></name> <name><surname>Gomez</surname> <given-names>J.</given-names></name> <name><surname>Nordt</surname> <given-names>M.</given-names></name> <name><surname>Rezai</surname> <given-names>A. A.</given-names></name> <name><surname>Poltoratski</surname> <given-names>S.</given-names></name> <name><surname>Grill-Spector</surname> <given-names>K.</given-names></name></person-group> (<year>2021</year>). <article-title>Differential spatial computations in ventral and lateral face-selective regions are scaffolded by structural connections.</article-title> <source><italic>Nat. Commun.</italic></source> <volume>12</volume>:<issue>2278</issue>. <pub-id pub-id-type="doi">10.1038/s41467-021-22524-2</pub-id> <pub-id pub-id-type="pmid">33859195</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Foxe</surname> <given-names>J. J.</given-names></name> <name><surname>Morocz</surname> <given-names>I. A.</given-names></name> <name><surname>Murray</surname> <given-names>M. M.</given-names></name> <name><surname>Higgins</surname> <given-names>B. A.</given-names></name> <name><surname>Javitt</surname> <given-names>D. C.</given-names></name> <name><surname>Schroeder</surname> <given-names>C. E.</given-names></name></person-group> (<year>2000</year>). <article-title>Multisensory auditory-somatosensory interactions in early cortical processing revealed by high-density electrical mapping.</article-title> <source><italic>Cogn. Brain Res.</italic></source> <volume>10</volume> <fpage>77</fpage>&#x2013;<lpage>83</lpage>. <pub-id pub-id-type="doi">10.1016/S0926-6410(00)00024-0</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Foxe</surname> <given-names>J. J.</given-names></name> <name><surname>Schroeder</surname> <given-names>C. E.</given-names></name></person-group> (<year>2005</year>). <article-title>The case for feedforward multisensory convergence during early cortical processing.</article-title> <source><italic>Neuroreport</italic></source> <volume>16</volume> <fpage>419</fpage>&#x2013;<lpage>423</lpage>. <pub-id pub-id-type="doi">10.1097/00001756-200504040-00001</pub-id> <pub-id pub-id-type="pmid">15770144</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Foxe</surname> <given-names>J. J.</given-names></name> <name><surname>Wylie</surname> <given-names>G. R.</given-names></name> <name><surname>Martinez</surname> <given-names>A.</given-names></name> <name><surname>Schroeder</surname> <given-names>C. E.</given-names></name> <name><surname>Javitt</surname> <given-names>D. C.</given-names></name> <name><surname>Guilfoyle</surname> <given-names>D.</given-names></name><etal/></person-group> (<year>2002</year>). <article-title>Auditory-somatosensory multisensory processing in auditory association cortex: An fMRI study.</article-title> <source><italic>J. Neurophysiol.</italic></source> <volume>88</volume> <fpage>540</fpage>&#x2013;<lpage>543</lpage>. <pub-id pub-id-type="doi">10.1152/jn.00694.2001</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Friederici</surname> <given-names>A. D.</given-names></name></person-group> (<year>2012</year>). <article-title>The cortical language circuit: From auditory perception to sentence comprehension.</article-title> <source><italic>Trends Cogn. Sci.</italic></source> <volume>16</volume> <fpage>262</fpage>&#x2013;<lpage>268</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2012.04.001</pub-id> <pub-id pub-id-type="pmid">22516238</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Friston</surname> <given-names>K.</given-names></name></person-group> (<year>2005</year>). <article-title>A theory of cortical responses.</article-title> <source><italic>Philos. Trans. R. Soc. B</italic></source> <volume>360</volume> <fpage>815</fpage>&#x2013;<lpage>836</lpage>. <pub-id pub-id-type="doi">10.1098/rstb.2005.1622</pub-id> <pub-id pub-id-type="pmid">15937014</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Friston</surname> <given-names>K.</given-names></name></person-group> (<year>2010</year>). <article-title>The free-energy principle: A unified brain theory?</article-title> <source><italic>Nat. Rev. Neurosci.</italic></source> <volume>11</volume> <fpage>127</fpage>&#x2013;<lpage>138</lpage>. <pub-id pub-id-type="doi">10.1038/nrn2787</pub-id> <pub-id pub-id-type="pmid">20068583</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Frith</surname> <given-names>C. D.</given-names></name> <name><surname>Frith</surname> <given-names>U.</given-names></name></person-group> (<year>2006</year>). <article-title>The neural basis of mentalizing.</article-title> <source><italic>Neuron</italic></source> <volume>50</volume> <fpage>531</fpage>&#x2013;<lpage>534</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2006.05.001</pub-id> <pub-id pub-id-type="pmid">16701204</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Froesel</surname> <given-names>M.</given-names></name> <name><surname>Gacoin</surname> <given-names>M.</given-names></name> <name><surname>Clavagnier</surname> <given-names>S.</given-names></name> <name><surname>Hauser</surname> <given-names>M.</given-names></name> <name><surname>Goudard</surname> <given-names>Q.</given-names></name> <name><surname>Hamed</surname> <given-names>S.</given-names></name></person-group> (<year>2021</year>). <article-title>Neural correlates of audio-visual integration of socially meaningful information in macaque monkeys.</article-title> <source><italic>bioRxiv</italic></source> [<comment>Preprint</comment>]. <pub-id pub-id-type="doi">10.1101/2021.05.02.442333</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fu</surname> <given-names>K. M. G.</given-names></name> <name><surname>Johnston</surname> <given-names>T. A.</given-names></name> <name><surname>Shah</surname> <given-names>A. S.</given-names></name> <name><surname>Arnold</surname> <given-names>L.</given-names></name> <name><surname>Smiley</surname> <given-names>J.</given-names></name> <name><surname>Hackett</surname> <given-names>T. A.</given-names></name><etal/></person-group> (<year>2003</year>). <article-title>Auditory cortical neurons respond to somatosensory stimulation.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>23</volume> <fpage>7510</fpage>&#x2013;<lpage>7515</lpage>.</citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Galaburda</surname> <given-names>A. M.</given-names></name> <name><surname>Pandya</surname> <given-names>D. N.</given-names></name></person-group> (<year>1983</year>). <article-title>The intrinsic architectonic and connectional organization of the superior temporal region of the rhesus monkey.</article-title> <source><italic>J. Comp. Neurol.</italic></source> <volume>221</volume> <fpage>169</fpage>&#x2013;<lpage>184</lpage>. <pub-id pub-id-type="doi">10.1002/CNE.902210206</pub-id> <pub-id pub-id-type="pmid">6655080</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="editor"><name><surname>Gallagher</surname> <given-names>S.</given-names></name></person-group> <role>(ed.)</role> (<year>2020</year>). &#x201C;<article-title>Direct Social Perception</article-title>,&#x201D; in <source><italic>Action and Interaction</italic></source>, (<publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>), <fpage>121</fpage>&#x2013;<lpage>154</lpage>. <pub-id pub-id-type="doi">10.1093/oso/9780198846345.003.0007</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ghazanfar</surname> <given-names>A.</given-names></name> <name><surname>Schroeder</surname> <given-names>C.</given-names></name></person-group> (<year>2006</year>). <article-title>Is neocortex essentially multisensory?</article-title> <source><italic>Trends Cogn. Sci.</italic></source> <volume>10</volume> <fpage>278</fpage>&#x2013;<lpage>285</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2006.04.008</pub-id> <pub-id pub-id-type="pmid">16713325</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ghazanfar</surname> <given-names>A. A.</given-names></name> <name><surname>Chandrasekaran</surname> <given-names>C.</given-names></name> <name><surname>Logothetis</surname> <given-names>N. K.</given-names></name></person-group> (<year>2008</year>). <article-title>Interactions between the superior temporal sulcus and auditory cortex mediate dynamic face/voice integration in rhesus monkeys.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>28</volume> <fpage>4457</fpage>&#x2013;<lpage>4469</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.0541-08.2008</pub-id> <pub-id pub-id-type="pmid">18434524</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gingras</surname> <given-names>G.</given-names></name> <name><surname>Rowland</surname> <given-names>B. A.</given-names></name> <name><surname>Stein</surname> <given-names>B. E.</given-names></name></person-group> (<year>2009</year>). <article-title>The differing impact of multisensory and unisensory integration on behavior.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>29</volume> <fpage>4897</fpage>&#x2013;<lpage>4902</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.4120-08.2009</pub-id> <pub-id pub-id-type="pmid">19369558</pub-id></citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Giordano</surname> <given-names>B. L.</given-names></name> <name><surname>McAdams</surname> <given-names>S.</given-names></name> <name><surname>Zatorre</surname> <given-names>R. J.</given-names></name> <name><surname>Kriegeskorte</surname> <given-names>N.</given-names></name> <name><surname>Belin</surname> <given-names>P.</given-names></name></person-group> (<year>2013</year>). <article-title>Abstract encoding of auditory objects in cortical activity patterns.</article-title> <source><italic>Cereb. Cortex</italic></source> <volume>23</volume> <fpage>2025</fpage>&#x2013;<lpage>2037</lpage>. <pub-id pub-id-type="doi">10.1093/CERCOR/BHS162</pub-id> <pub-id pub-id-type="pmid">22802575</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gurtubay-Antolin</surname> <given-names>A.</given-names></name> <name><surname>Battal</surname> <given-names>C.</given-names></name> <name><surname>Maffei</surname> <given-names>C.</given-names></name> <name><surname>Rezk</surname> <given-names>M.</given-names></name> <name><surname>Mattioni</surname> <given-names>S.</given-names></name> <name><surname>Jovicich</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2021</year>). <article-title>Direct structural connections between auditory and visual motion-selective regions in humans.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>41</volume> <fpage>2393</fpage>&#x2013;<lpage>2405</lpage>. <pub-id pub-id-type="doi">10.1523/jneurosci.1552-20.2021</pub-id> <pub-id pub-id-type="pmid">33514674</pub-id></citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hackett</surname> <given-names>T. A.</given-names></name> <name><surname>Smiley</surname> <given-names>J. F.</given-names></name> <name><surname>Ulbert</surname> <given-names>I.</given-names></name> <name><surname>Karmos</surname> <given-names>G.</given-names></name> <name><surname>Lakatos</surname> <given-names>P.</given-names></name> <name><surname>De La Mothe</surname> <given-names>L. A.</given-names></name><etal/></person-group> (<year>2007</year>). <article-title>Sources of somatosensory input to the caudal belt areas of auditory cortex.</article-title> <source><italic>Perception</italic></source> <volume>36</volume> <fpage>1419</fpage>&#x2013;<lpage>1430</lpage>. <pub-id pub-id-type="doi">10.1068/p5841</pub-id> <pub-id pub-id-type="pmid">18265825</pub-id></citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hackett</surname> <given-names>T. A.</given-names></name> <name><surname>Stepniewska</surname> <given-names>I.</given-names></name> <name><surname>Kaas</surname> <given-names>J. H.</given-names></name></person-group> (<year>1998</year>). <article-title>Subdivisions of auditory cortex and ipsilateral cortical connections of the parabelt auditory cortex in macaque monkeys</article-title>. <source><italic>J. Comp. Neurol</italic></source>. 394, 475-495. <pub-id pub-id-type="doi">10.1002/(SICI)1096-9861(19980518)394:4</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hagoort</surname> <given-names>P.</given-names></name></person-group> (<year>2005</year>). <article-title>On broca, brain, and binding: A new framework.</article-title> <source><italic>Trends Cogn. Sci.</italic></source> <volume>9</volume> <fpage>416</fpage>&#x2013;<lpage>423</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2005.07.004</pub-id> <pub-id pub-id-type="pmid">16054419</pub-id></citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hagoort</surname> <given-names>P.</given-names></name></person-group> (<year>2019</year>). <article-title>The neurobiology of language beyond single-word processing.</article-title> <source><italic>Science</italic></source> <volume>366</volume> <fpage>55</fpage>&#x2013;<lpage>58</lpage>. <pub-id pub-id-type="doi">10.1126/science.aax0289</pub-id> <pub-id pub-id-type="pmid">31604301</pub-id></citation></ref>
<ref id="B55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hamilton</surname> <given-names>A. F. C.</given-names></name></person-group> (<year>2021</year>). <article-title>Hyperscanning: Beyond the Hype.</article-title> <source><italic>Neuron</italic></source> <volume>109</volume> <fpage>404</fpage>&#x2013;<lpage>407</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2020.11.008</pub-id> <pub-id pub-id-type="pmid">33259804</pub-id></citation></ref>
<ref id="B56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hassabis</surname> <given-names>D.</given-names></name> <name><surname>Kumaran</surname> <given-names>D.</given-names></name> <name><surname>Summerfield</surname> <given-names>C.</given-names></name> <name><surname>Botvinick</surname> <given-names>M.</given-names></name></person-group> (<year>2017</year>). <article-title>Neuroscience-Inspired Artificial Intelligence.</article-title> <source><italic>Neuron</italic></source> <volume>95</volume> <fpage>245</fpage>&#x2013;<lpage>258</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2017.06.011</pub-id> <pub-id pub-id-type="pmid">28728020</pub-id></citation></ref>
<ref id="B57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hasson</surname> <given-names>U.</given-names></name> <name><surname>Egidi</surname> <given-names>G.</given-names></name> <name><surname>Marelli</surname> <given-names>M.</given-names></name> <name><surname>Willems</surname> <given-names>R. M.</given-names></name></person-group> (<year>2018</year>). <article-title>Grounding the neurobiology of language in first principles: The necessity of non-language-centric explanations for language comprehension.</article-title> <source><italic>Cognition</italic></source> <volume>180</volume> <fpage>135</fpage>&#x2013;<lpage>157</lpage>. <pub-id pub-id-type="doi">10.1016/J.COGNITION.2018.06.018</pub-id> <pub-id pub-id-type="pmid">30053570</pub-id></citation></ref>
<ref id="B58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>He</surname> <given-names>T.</given-names></name> <name><surname>Richter</surname> <given-names>D.</given-names></name> <name><surname>Wang</surname> <given-names>Z.</given-names></name> <name><surname>de Lange</surname> <given-names>F. P.</given-names></name></person-group> (<year>2022</year>). <article-title>Spatial and temporal context jointly modulate the sensory response within the ventral visual stream.</article-title> <source><italic>J. Cogn. Neurosci.</italic></source> <volume>34</volume> <fpage>332</fpage>&#x2013;<lpage>347</lpage>. <pub-id pub-id-type="doi">10.1162/jocn_a_01792</pub-id></citation></ref>
<ref id="B59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Heilbron</surname> <given-names>M.</given-names></name> <name><surname>Armeni</surname> <given-names>K.</given-names></name> <name><surname>Schoffelen</surname> <given-names>J. M.</given-names></name> <name><surname>Hagoort</surname> <given-names>P.</given-names></name> <name><surname>de Lange</surname> <given-names>F. P.</given-names></name></person-group> (<year>2022</year>). <article-title>A hierarchy of linguistic predictions during natural language comprehension.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>119</volume>:<issue>e2201968119</issue>. <pub-id pub-id-type="doi">10.1073/pnas.2201968119</pub-id> <pub-id pub-id-type="pmid">35921434</pub-id></citation></ref>
<ref id="B60"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Heilbron</surname> <given-names>M.</given-names></name> <name><surname>Chait</surname> <given-names>M.</given-names></name></person-group> (<year>2018</year>). <article-title>Great expectations: Is there evidence for predictive coding in auditory cortex?</article-title> <source><italic>Neuroscience</italic></source> <volume>389</volume> <fpage>54</fpage>&#x2013;<lpage>73</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuroscience.2017.07.061</pub-id> <pub-id pub-id-type="pmid">28782642</pub-id></citation></ref>
<ref id="B61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hein</surname> <given-names>G.</given-names></name> <name><surname>Knight</surname> <given-names>R. T.</given-names></name></person-group> (<year>2008</year>). <article-title>Superior temporal sulcus - it&#x2019;s my area: Or is it?</article-title> <source><italic>J. Cogn. Neurosci.</italic></source> <volume>20</volume> <fpage>2125</fpage>&#x2013;<lpage>2136</lpage>. <pub-id pub-id-type="doi">10.1162/jocn.2008.20148</pub-id> <pub-id pub-id-type="pmid">18457502</pub-id></citation></ref>
<ref id="B62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hellbernd</surname> <given-names>N.</given-names></name> <name><surname>Sammler</surname> <given-names>D.</given-names></name></person-group> (<year>2018</year>). <article-title>Neural bases of social communicative intentions in speech.</article-title> <source><italic>Soc. Cogn. Affect. Neurosci.</italic></source> <volume>13</volume> <fpage>604</fpage>&#x2013;<lpage>615</lpage>. <pub-id pub-id-type="doi">10.1093/SCAN/NSY034</pub-id> <pub-id pub-id-type="pmid">29771359</pub-id></citation></ref>
<ref id="B63"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hickok</surname> <given-names>G.</given-names></name> <name><surname>Poeppel</surname> <given-names>D.</given-names></name></person-group> (<year>2000</year>). <article-title>Towards a functional neuroanatomy of speech perception.</article-title> <source><italic>Trends Cogn. Sci.</italic></source> <volume>4</volume> <fpage>131</fpage>&#x2013;<lpage>138</lpage>. <pub-id pub-id-type="doi">10.1016/S1364-6613(00)01463-7</pub-id></citation></ref>
<ref id="B64"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hickok</surname> <given-names>G.</given-names></name> <name><surname>Poeppel</surname> <given-names>D.</given-names></name></person-group> (<year>2007</year>). <article-title>The cortical organization of speech processing.</article-title> <source><italic>Nat. Rev. Neurosci.</italic></source> <volume>8</volume> <fpage>393</fpage>&#x2013;<lpage>402</lpage>. <pub-id pub-id-type="doi">10.1038/nrn2113</pub-id> <pub-id pub-id-type="pmid">17431404</pub-id></citation></ref>
<ref id="B65"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hirsch</surname> <given-names>J.</given-names></name> <name><surname>Noah</surname> <given-names>J. A.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Dravida</surname> <given-names>S.</given-names></name> <name><surname>Ono</surname> <given-names>Y.</given-names></name></person-group> (<year>2018</year>). <article-title>A cross-brain neural mechanism for human-to-human verbal communication.</article-title> <source><italic>Soc. Cogn. Affect. Neurosci.</italic></source> <volume>13</volume> <fpage>907</fpage>&#x2013;<lpage>920</lpage>. <pub-id pub-id-type="doi">10.1093/scan/nsy070</pub-id> <pub-id pub-id-type="pmid">30137601</pub-id></citation></ref>
<ref id="B66"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Holle</surname> <given-names>H.</given-names></name> <name><surname>Gunter</surname> <given-names>T. C.</given-names></name> <name><surname>R&#x00FC;schemeyer</surname> <given-names>S. A.</given-names></name> <name><surname>Hennenlotter</surname> <given-names>A.</given-names></name> <name><surname>Iacoboni</surname> <given-names>M.</given-names></name></person-group> (<year>2008</year>). <article-title>Neural correlates of the processing of co-speech gestures.</article-title> <source><italic>Neuroimage</italic></source> <volume>39</volume> <fpage>2010</fpage>&#x2013;<lpage>2024</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuroimage.2007.10.055</pub-id> <pub-id pub-id-type="pmid">18093845</pub-id></citation></ref>
<ref id="B67"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Holle</surname> <given-names>H.</given-names></name> <name><surname>Obleser</surname> <given-names>J.</given-names></name> <name><surname>Rueschemeyer</surname> <given-names>S. A.</given-names></name> <name><surname>Gunter</surname> <given-names>T. C.</given-names></name></person-group> (<year>2010</year>). <article-title>Integration of iconic gestures and speech in left superior temporal areas boosts speech comprehension under adverse listening conditions.</article-title> <source><italic>Neuroimage</italic></source> <volume>49</volume> <fpage>875</fpage>&#x2013;<lpage>884</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuroimage.2009.08.058</pub-id> <pub-id pub-id-type="pmid">19733670</pub-id></citation></ref>
<ref id="B68"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Holler</surname> <given-names>J.</given-names></name></person-group> (<year>2022</year>). <article-title>Visual bodily signals as core devices for coordinating minds in interaction.</article-title> <source><italic>Philos. Trans. R. Soc. B</italic></source> <volume>377</volume>:<issue>20210094</issue>. <pub-id pub-id-type="doi">10.1098/rstb.2021.0094</pub-id> <pub-id pub-id-type="pmid">35876208</pub-id></citation></ref>
<ref id="B69"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Holler</surname> <given-names>J.</given-names></name> <name><surname>Kendrick</surname> <given-names>K. H.</given-names></name> <name><surname>Levinson</surname> <given-names>S. C.</given-names></name></person-group> (<year>2018</year>). <article-title>Processing language in face-to-face conversation: Questions with gestures get faster responses.</article-title> <source><italic>Psychon. Bull. Rev.</italic></source> <volume>25</volume> <fpage>1900</fpage>&#x2013;<lpage>1908</lpage>. <pub-id pub-id-type="doi">10.3758/s13423-017-1363-z</pub-id> <pub-id pub-id-type="pmid">28887798</pub-id></citation></ref>
<ref id="B70"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Holler</surname> <given-names>J.</given-names></name> <name><surname>Levinson</surname> <given-names>S. C.</given-names></name></person-group> (<year>2019</year>). <article-title>Multimodal Language Processing in Human Communication.</article-title> <source><italic>Trends Cogn. Sci.</italic></source> <volume>23</volume> <fpage>639</fpage>&#x2013;<lpage>652</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2019.05.006</pub-id> <pub-id pub-id-type="pmid">31235320</pub-id></citation></ref>
<ref id="B71"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hubbard</surname> <given-names>A. L.</given-names></name> <name><surname>Wilson</surname> <given-names>S. M.</given-names></name> <name><surname>Callan</surname> <given-names>D. E.</given-names></name> <name><surname>Dapretto</surname> <given-names>M.</given-names></name></person-group> (<year>2009</year>). <article-title>Giving speech a hand: Gesture modulates activity in auditory cortex during speech perception.</article-title> <source><italic>Hum. Brain Mapp.</italic></source> <volume>30</volume> <fpage>1028</fpage>&#x2013;<lpage>1037</lpage>. <pub-id pub-id-type="doi">10.1002/hbm.20565</pub-id> <pub-id pub-id-type="pmid">18412134</pub-id></citation></ref>
<ref id="B72"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huk</surname> <given-names>A. C.</given-names></name> <name><surname>Dougherty</surname> <given-names>R. F.</given-names></name> <name><surname>Heeger</surname> <given-names>D. J.</given-names></name></person-group> (<year>2002</year>). <article-title>Retinotopy and functional subdivision of human areas MT and MST.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>22</volume> <fpage>7195</fpage>&#x2013;<lpage>7205</lpage>. <pub-id pub-id-type="doi">10.1523/jneurosci.22-16-07195.2002</pub-id> <pub-id pub-id-type="pmid">12177214</pub-id></citation></ref>
<ref id="B73"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jaramillo</surname> <given-names>S.</given-names></name> <name><surname>Zador</surname> <given-names>A. M.</given-names></name></person-group> (<year>2011</year>). <article-title>The auditory cortex mediates the perceptual effects of acoustic temporal expectation</article-title>. <source><italic>Nat. Neurosci</italic></source>. <volume>14</volume>, <fpage>246</fpage>&#x2013;<lpage>253</lpage>. <pub-id pub-id-type="doi">10.1038/nn.2688</pub-id> <pub-id pub-id-type="pmid">21170056</pub-id></citation></ref>
<ref id="B74"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kaposvari</surname> <given-names>P.</given-names></name> <name><surname>Kumar</surname> <given-names>S.</given-names></name> <name><surname>Vogels</surname> <given-names>R.</given-names></name></person-group> (<year>2018</year>). <article-title>Statistical learning signals in macaque inferior temporal cortex.</article-title> <source><italic>Cereb. Cortex</italic></source> <volume>28</volume> <fpage>250</fpage>&#x2013;<lpage>266</lpage>. <pub-id pub-id-type="doi">10.1093/cercor/bhw374</pub-id> <pub-id pub-id-type="pmid">27909007</pub-id></citation></ref>
<ref id="B75"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kayser</surname> <given-names>C.</given-names></name> <name><surname>Logothetis</surname> <given-names>N. K.</given-names></name></person-group> (<year>2007</year>). <article-title>Do early sensory cortices integrate cross-modal information?</article-title> <source><italic>Brain Struct. Funct.</italic></source> <volume>212</volume> <fpage>121</fpage>&#x2013;<lpage>132</lpage>. <pub-id pub-id-type="doi">10.1007/s00429-007-0154-0</pub-id> <pub-id pub-id-type="pmid">17717687</pub-id></citation></ref>
<ref id="B76"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kayser</surname> <given-names>C.</given-names></name> <name><surname>Logothetis</surname> <given-names>N. K.</given-names></name> <name><surname>Panzeri</surname> <given-names>S.</given-names></name></person-group> (<year>2010</year>). <article-title>Visual enhancement of the information representation in auditory cortex.</article-title> <source><italic>Curr. Biol.</italic></source> <volume>20</volume> <fpage>19</fpage>&#x2013;<lpage>24</lpage>. <pub-id pub-id-type="doi">10.1016/j.cub.2009.10.068</pub-id> <pub-id pub-id-type="pmid">20036538</pub-id></citation></ref>
<ref id="B77"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kayser</surname> <given-names>C.</given-names></name> <name><surname>Petkov</surname> <given-names>C. I.</given-names></name> <name><surname>Augath</surname> <given-names>M.</given-names></name> <name><surname>Logothetis</surname> <given-names>N. K.</given-names></name></person-group> (<year>2005</year>). <article-title>Integration of touch and sound in auditory cortex.</article-title> <source><italic>Neuron</italic></source> <volume>48</volume> <fpage>373</fpage>&#x2013;<lpage>384</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2005.09.018</pub-id> <pub-id pub-id-type="pmid">16242415</pub-id></citation></ref>
<ref id="B78"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kayser</surname> <given-names>C.</given-names></name> <name><surname>Petkov</surname> <given-names>C. I.</given-names></name> <name><surname>Logothetis</surname> <given-names>N. K.</given-names></name></person-group> (<year>2008</year>). <article-title>Visual modulation of neurons in auditory cortex.</article-title> <source><italic>Cereb. Cortex</italic></source> <volume>18</volume> <fpage>1560</fpage>&#x2013;<lpage>1574</lpage>. <pub-id pub-id-type="doi">10.1093/cercor/bhm187</pub-id> <pub-id pub-id-type="pmid">18180245</pub-id></citation></ref>
<ref id="B79"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>K&#x00F6;rding</surname> <given-names>K. P.</given-names></name> <name><surname>Beierholm</surname> <given-names>U.</given-names></name> <name><surname>Ma</surname> <given-names>W. J.</given-names></name> <name><surname>Quartz</surname> <given-names>S.</given-names></name> <name><surname>Tenenbaum</surname> <given-names>J. B.</given-names></name> <name><surname>Shams</surname> <given-names>L.</given-names></name></person-group> (<year>2007</year>). <article-title>Causal inference in multisensory perception.</article-title> <source><italic>PLoS One</italic></source> <volume>2</volume>:<issue>e943</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0000943</pub-id> <pub-id pub-id-type="pmid">17895984</pub-id></citation></ref>
<ref id="B80"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Koster-Hale</surname> <given-names>J.</given-names></name> <name><surname>Saxe</surname> <given-names>R.</given-names></name></person-group> (<year>2013</year>). <article-title>Theory of mind: A neural prediction problem.</article-title> <source><italic>Neuron</italic></source> <volume>79</volume> <fpage>836</fpage>&#x2013;<lpage>848</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2013.08.020</pub-id> <pub-id pub-id-type="pmid">24012000</pub-id></citation></ref>
<ref id="B81"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krumbholz</surname> <given-names>K.</given-names></name> <name><surname>Sch&#x00F6;nwiesner</surname> <given-names>M.</given-names></name> <name><surname>Von Cramon</surname> <given-names>D. Y.</given-names></name> <name><surname>R&#x00FC;bsamen</surname> <given-names>R.</given-names></name> <name><surname>Shah</surname> <given-names>N. J.</given-names></name> <name><surname>Zilles</surname> <given-names>K.</given-names></name><etal/></person-group> (<year>2005</year>). <article-title>Representation of interaural temporal information from left and right auditory space in the human planum temporale and inferior parietal lobe.</article-title> <source><italic>Cereb. Cortex</italic></source> <volume>15</volume> <fpage>317</fpage>&#x2013;<lpage>324</lpage>. <pub-id pub-id-type="doi">10.1093/CERCOR/BHH133</pub-id> <pub-id pub-id-type="pmid">15297367</pub-id></citation></ref>
<ref id="B82"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lakatos</surname> <given-names>P.</given-names></name> <name><surname>Chen</surname> <given-names>C. M.</given-names></name> <name><surname>O&#x2019;Connell</surname> <given-names>M. N.</given-names></name> <name><surname>Mills</surname> <given-names>A.</given-names></name> <name><surname>Schroeder</surname> <given-names>C. E.</given-names></name></person-group> (<year>2007</year>). <article-title>Neuronal oscillations and multisensory interaction in primary auditory cortex.</article-title> <source><italic>Neuron</italic></source> <volume>53</volume> <fpage>279</fpage>&#x2013;<lpage>292</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2006.12.011</pub-id> <pub-id pub-id-type="pmid">17224408</pub-id></citation></ref>
<ref id="B83"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Landsiedel</surname> <given-names>J.</given-names></name> <name><surname>Daughters</surname> <given-names>K.</given-names></name> <name><surname>Downing</surname> <given-names>P. E.</given-names></name> <name><surname>Koldewyn</surname> <given-names>K.</given-names></name></person-group> (<year>2022</year>). <article-title>The role of motion in the neural representation of social interactions in the posterior temporal cortex.</article-title> <source><italic>Neuroimage</italic></source> <volume>262</volume> <issue>119533</issue>. <pub-id pub-id-type="doi">10.1016/j.neuroimage.2022.119533</pub-id> <pub-id pub-id-type="pmid">35931309</pub-id></citation></ref>
<ref id="B84"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>H.</given-names></name> <name><surname>Noppeney</surname> <given-names>U.</given-names></name></person-group> (<year>2014</year>). <article-title>Temporal prediction errors in visual and auditory cortices.</article-title> <source><italic>Curr. Biol.</italic></source> <volume>24</volume>:<fpage>R309</fpage>&#x2013;<lpage>R310</lpage>. <pub-id pub-id-type="doi">10.1016/j.cub.2014.02.007</pub-id> <pub-id pub-id-type="pmid">24735850</pub-id></citation></ref>
<ref id="B85"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Levinson</surname> <given-names>S. C.</given-names></name></person-group> (<year>2016</year>). <article-title>Turn-taking in human communication &#x2013; origins and implications for language processing.</article-title> <source><italic>Trends Cogn. Sci.</italic></source> <volume>20</volume> <fpage>6</fpage>&#x2013;<lpage>14</lpage>. <pub-id pub-id-type="doi">10.1016/J.TICS.2015.10.010</pub-id> <pub-id pub-id-type="pmid">26651245</pub-id></citation></ref>
<ref id="B86"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Levinson</surname> <given-names>S. C.</given-names></name> <name><surname>Holler</surname> <given-names>J.</given-names></name></person-group> (<year>2014</year>). <article-title>The origin of human multi-modal communication.</article-title> <source><italic>Philos. Trans. R. Soc. B</italic></source> <volume>369</volume>:<issue>20130302</issue>. <pub-id pub-id-type="doi">10.1098/rstb.2013.0302</pub-id> <pub-id pub-id-type="pmid">25092670</pub-id></citation></ref>
<ref id="B87"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lewis</surname> <given-names>R.</given-names></name> <name><surname>Noppeney</surname> <given-names>U.</given-names></name></person-group> (<year>2010</year>). <article-title>Audiovisual synchrony improves motion discrimination via enhanced connectivity between early visual and auditory areas.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>30</volume> <fpage>12329</fpage>&#x2013;<lpage>12339</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.5745-09.2010</pub-id> <pub-id pub-id-type="pmid">20844129</pub-id></citation></ref>
<ref id="B88"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lingnau</surname> <given-names>A.</given-names></name> <name><surname>Downing</surname> <given-names>P. E.</given-names></name></person-group> (<year>2015</year>). <article-title>The lateral occipitotemporal cortex in action.</article-title> <source><italic>Trends Cogn. Sci.</italic></source> <volume>19</volume> <fpage>268</fpage>&#x2013;<lpage>277</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2015.03.006</pub-id> <pub-id pub-id-type="pmid">25843544</pub-id></citation></ref>
<ref id="B89"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Macaluso</surname> <given-names>E.</given-names></name> <name><surname>George</surname> <given-names>N.</given-names></name> <name><surname>Dolan</surname> <given-names>R.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name> <name><surname>Driver</surname> <given-names>J.</given-names></name></person-group> (<year>2004</year>). <article-title>Spatial and temporal factors during processing of audiovisual speech: A PET study.</article-title> <source><italic>Neuroimage</italic></source> <volume>21</volume> <fpage>725</fpage>&#x2013;<lpage>732</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuroimage.2003.09.049</pub-id> <pub-id pub-id-type="pmid">14980575</pub-id></citation></ref>
<ref id="B90"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>MacSweeney</surname> <given-names>M.</given-names></name> <name><surname>Amaro</surname> <given-names>E.</given-names></name> <name><surname>Calvert</surname> <given-names>G. A.</given-names></name> <name><surname>Campbell</surname> <given-names>R.</given-names></name> <name><surname>David</surname> <given-names>A. S.</given-names></name> <name><surname>McGuire</surname> <given-names>P.</given-names></name><etal/></person-group> (<year>2000</year>). <article-title>Silent speechreading in the absence of scanner noise: An event-related fMRI study.</article-title> <source><italic>Neuroreport</italic></source> <volume>11</volume>:<issue>1729</issue>. <pub-id pub-id-type="doi">10.1097/00001756-200006050-00026</pub-id> <pub-id pub-id-type="pmid">10852233</pub-id></citation></ref>
<ref id="B91"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mar</surname> <given-names>R. A.</given-names></name></person-group> (<year>2011</year>). <article-title>The neural bases of social cognition and story comprehension.</article-title> <source><italic>Annu. Rev. Psychol.</italic></source> <volume>62</volume> <fpage>103</fpage>&#x2013;<lpage>134</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-psych-120709-145406</pub-id> <pub-id pub-id-type="pmid">21126178</pub-id></citation></ref>
<ref id="B92"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Martuzzi</surname> <given-names>R.</given-names></name> <name><surname>Murray</surname> <given-names>M. M.</given-names></name> <name><surname>Michel</surname> <given-names>C. M.</given-names></name> <name><surname>Thiran</surname> <given-names>J. P.</given-names></name> <name><surname>Maeder</surname> <given-names>P. P.</given-names></name> <name><surname>Clarke</surname> <given-names>S.</given-names></name><etal/></person-group> (<year>2007</year>). <article-title>Multisensory interactions within human primary cortices revealed by BOLD dynamics.</article-title> <source><italic>Cereb. Cortex</italic></source> <volume>17</volume> <fpage>1672</fpage>&#x2013;<lpage>1679</lpage>. <pub-id pub-id-type="doi">10.1093/cercor/bhl077</pub-id> <pub-id pub-id-type="pmid">16968869</pub-id></citation></ref>
<ref id="B93"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>M&#x00E9;gevand</surname> <given-names>P.</given-names></name> <name><surname>Mercier</surname> <given-names>M. R.</given-names></name> <name><surname>Groppe</surname> <given-names>D. M.</given-names></name> <name><surname>Golumbic</surname> <given-names>E. Z.</given-names></name> <name><surname>Mesgarani</surname> <given-names>N.</given-names></name> <name><surname>Beauchamp</surname> <given-names>M. S.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Crossmodal phase reset and evoked responses provide complementary mechanisms for the influence of visual speech in auditory cortex.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>40</volume> <fpage>8530</fpage>&#x2013;<lpage>8542</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.0555-20.2020</pub-id> <pub-id pub-id-type="pmid">33023923</pub-id></citation></ref>
<ref id="B94"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meyer</surname> <given-names>T.</given-names></name> <name><surname>Olson</surname> <given-names>C. R.</given-names></name></person-group> (<year>2011</year>). <article-title>Statistical learning of visual transitions in monkey inferotemporal cortex.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>108</volume> <fpage>19401</fpage>&#x2013;<lpage>19406</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1112895108</pub-id> <pub-id pub-id-type="pmid">22084090</pub-id></citation></ref>
<ref id="B95"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Murray</surname> <given-names>M. M.</given-names></name> <name><surname>Foxe</surname> <given-names>J. J.</given-names></name> <name><surname>Higgins</surname> <given-names>B. A.</given-names></name> <name><surname>Javitt</surname> <given-names>D. C.</given-names></name> <name><surname>Schroeder</surname> <given-names>C. E.</given-names></name></person-group> (<year>2001</year>). <article-title>Visuo-spatial neural response interactions in early cortical processing during a simple reaction time task: A high-density electrical mapping study.</article-title> <source><italic>Neuropsychologia</italic></source> <volume>39</volume> <fpage>828</fpage>&#x2013;<lpage>844</lpage>. <pub-id pub-id-type="doi">10.1016/S0028-3932(01)00004-5</pub-id></citation></ref>
<ref id="B96"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Noah</surname> <given-names>J. A.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Dravida</surname> <given-names>S.</given-names></name> <name><surname>Ono</surname> <given-names>Y.</given-names></name> <name><surname>Naples</surname> <given-names>A.</given-names></name> <name><surname>McPartland</surname> <given-names>J. C.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Real-time eye-to-eye contact is associated with cross-brain neural coupling in angular gyrus.</article-title> <source><italic>Front. Hum. Neurosci.</italic></source> <volume>14</volume>:<issue>19</issue>. <pub-id pub-id-type="doi">10.3389/fnhum.2020.00019</pub-id> <pub-id pub-id-type="pmid">32116606</pub-id></citation></ref>
<ref id="B97"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Noppeney</surname> <given-names>U.</given-names></name></person-group> (<year>2021</year>). <article-title>Perceptual inference, learning, and attention in a multisensory world.</article-title> <source><italic>Annu. Rev. Neurosci.</italic></source> <volume>44</volume> <fpage>449</fpage>&#x2013;<lpage>473</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-neuro-100120-085519</pub-id> <pub-id pub-id-type="pmid">33882258</pub-id></citation></ref>
<ref id="B98"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Noppeney</surname> <given-names>U.</given-names></name> <name><surname>Jones</surname> <given-names>S. A.</given-names></name> <name><surname>Rohe</surname> <given-names>T.</given-names></name> <name><surname>Ferrari</surname> <given-names>A.</given-names></name></person-group> (<year>2018</year>). <article-title>See what you hear-How the brain forms representations across the senses.</article-title> <source><italic>Neuroforum</italic></source> <volume>24</volume> <fpage>237</fpage>&#x2013;<lpage>246</lpage>. <pub-id pub-id-type="doi">10.1515/nf-2017-A066</pub-id></citation></ref>
<ref id="B99"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nota</surname> <given-names>N.</given-names></name> <name><surname>Trujillo</surname> <given-names>J. P.</given-names></name> <name><surname>Holler</surname> <given-names>J.</given-names></name></person-group> (<year>2021</year>). <article-title>Facial signals and social actions in multimodal face-to-face interaction.</article-title> <source><italic>Brain Sci.</italic></source> <volume>11</volume>:<issue>1017</issue>. <pub-id pub-id-type="doi">10.3390/brainsci11081017</pub-id> <pub-id pub-id-type="pmid">34439636</pub-id></citation></ref>
<ref id="B100"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Novembre</surname> <given-names>G.</given-names></name> <name><surname>Iannetti</surname> <given-names>G. D.</given-names></name></person-group> (<year>2021</year>). <article-title>Hyperscanning alone cannot prove causality. Multibrain stimulation can.</article-title> <source><italic>Trends Cogn. Sci.</italic></source> <volume>25</volume> <fpage>96</fpage>&#x2013;<lpage>99</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2020.11.003</pub-id> <pub-id pub-id-type="pmid">33293210</pub-id></citation></ref>
<ref id="B101"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>&#x00D6;zy&#x00FC;rek</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>Hearing and seeing meaning in speech and gesture: Insights from brain and behaviour.</article-title> <source><italic>Philos. Trans. R. Soc. B</italic></source> <volume>369</volume>:<issue>20130296</issue>. <pub-id pub-id-type="doi">10.1098/rstb.2013.0296</pub-id> <pub-id pub-id-type="pmid">25092664</pub-id></citation></ref>
<ref id="B102"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Palmer</surname> <given-names>S. M.</given-names></name> <name><surname>Rosa</surname> <given-names>M. G. P.</given-names></name></person-group> (<year>2006</year>). <article-title>Quantitative analysis of the corticocortical projections to the middle temporal area in the marmoset monkey: Evolutionary and functional implications.</article-title> <source><italic>Cereb. Cortex</italic></source> <volume>16</volume> <fpage>1361</fpage>&#x2013;<lpage>1375</lpage>. <pub-id pub-id-type="doi">10.1093/cercor/bhj078</pub-id> <pub-id pub-id-type="pmid">16292001</pub-id></citation></ref>
<ref id="B103"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Peeters</surname> <given-names>D.</given-names></name></person-group> (<year>2019</year>). <article-title>Virtual reality: A game-changing method for the language sciences</article-title>. <source><italic>Psychon. Bull. Rev</italic></source>. <volume>26</volume>, <fpage>894</fpage>&#x2013;<lpage>900</lpage>. <pub-id pub-id-type="doi">10.3758/s13423-019-01571-3</pub-id> <pub-id pub-id-type="pmid">30734158</pub-id></citation></ref>
<ref id="B104"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pelphrey</surname> <given-names>K. A.</given-names></name> <name><surname>Mitchell</surname> <given-names>T. V.</given-names></name> <name><surname>McKeown</surname> <given-names>M. J.</given-names></name> <name><surname>Goldstein</surname> <given-names>J.</given-names></name> <name><surname>Allison</surname> <given-names>T.</given-names></name> <name><surname>McCarthy</surname> <given-names>G.</given-names></name></person-group> (<year>2003</year>). <article-title>Brain activity evoked by the perception of human walking: Controlling for meaningful coherent motion.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>23</volume> <fpage>6819</fpage>&#x2013;<lpage>6825</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.23-17-06819.2003</pub-id> <pub-id pub-id-type="pmid">12890776</pub-id></citation></ref>
<ref id="B105"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pelphrey</surname> <given-names>K. A.</given-names></name> <name><surname>Morris</surname> <given-names>J. P.</given-names></name> <name><surname>McCarthy</surname> <given-names>G.</given-names></name></person-group> (<year>2004</year>). <article-title>Grasping the intentions of others: The perceived intentionality of an action influences activity in the superior temporal sulcus during social perception</article-title>. <source><italic>J. Cogn. Neurosci</italic></source>. <volume>16</volume>, <fpage>1706</fpage>&#x2013;<lpage>1716</lpage>. <pub-id pub-id-type="doi">10.1162/0898929042947900</pub-id> <pub-id pub-id-type="pmid">15701223</pub-id></citation></ref>
<ref id="B106"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Perniss</surname> <given-names>P.</given-names></name></person-group> (<year>2018</year>). <article-title>Why we should study multimodal language.</article-title> <source><italic>Front. Psychol.</italic></source> <volume>9</volume>:<issue>1109</issue>. <pub-id pub-id-type="doi">10.3389/FPSYG.2018.01109/BIBTEX</pub-id></citation></ref>
<ref id="B107"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Perrodin</surname> <given-names>C.</given-names></name> <name><surname>Kayser</surname> <given-names>C.</given-names></name> <name><surname>Logothetis</surname> <given-names>N. K.</given-names></name> <name><surname>Petkov</surname> <given-names>C. I.</given-names></name></person-group> (<year>2011</year>). <article-title>Voice cells in the primate temporal lobe.</article-title> <source><italic>Curr. Biol.</italic></source> <volume>21</volume> <fpage>1408</fpage>&#x2013;<lpage>1415</lpage>. <pub-id pub-id-type="doi">10.1016/j.cub.2011.07.028</pub-id> <pub-id pub-id-type="pmid">21835625</pub-id></citation></ref>
<ref id="B108"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Petkov</surname> <given-names>C. I.</given-names></name> <name><surname>Kayser</surname> <given-names>C.</given-names></name> <name><surname>Steudel</surname> <given-names>T.</given-names></name> <name><surname>Whittingstall</surname> <given-names>K.</given-names></name> <name><surname>Augath</surname> <given-names>M.</given-names></name> <name><surname>Logothetis</surname> <given-names>N. K.</given-names></name></person-group> (<year>2008</year>). <article-title>A voice region in the monkey brain.</article-title> <source><italic>Nat. Neurosci.</italic></source> <volume>11</volume> <fpage>367</fpage>&#x2013;<lpage>374</lpage>.</citation></ref>
<ref id="B109"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pitcher</surname> <given-names>D.</given-names></name> <name><surname>Duchaine</surname> <given-names>B.</given-names></name> <name><surname>Walsh</surname> <given-names>V.</given-names></name></person-group> (<year>2014</year>). <article-title>Combined TMS and fMRI reveal dissociable cortical pathways for dynamic and static face perception.</article-title> <source><italic>Curr. Biol.</italic></source> <volume>24</volume> <fpage>2066</fpage>&#x2013;<lpage>2070</lpage>. <pub-id pub-id-type="doi">10.1016/j.cub.2014.07.060</pub-id> <pub-id pub-id-type="pmid">25131678</pub-id></citation></ref>
<ref id="B110"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pitcher</surname> <given-names>D.</given-names></name> <name><surname>Pilkington</surname> <given-names>A.</given-names></name> <name><surname>Rauth</surname> <given-names>L.</given-names></name> <name><surname>Baker</surname> <given-names>C.</given-names></name> <name><surname>Kravitz</surname> <given-names>D. J.</given-names></name> <name><surname>Ungerleider</surname> <given-names>L. G.</given-names></name></person-group> (<year>2020</year>). <article-title>The human posterior superior temporal sulcus samples visual space differently from other face-selective regions.</article-title> <source><italic>Cereb. Cortex</italic></source> <volume>30</volume> <fpage>778</fpage>&#x2013;<lpage>785</lpage>. <pub-id pub-id-type="doi">10.1093/cercor/bhz125</pub-id> <pub-id pub-id-type="pmid">31264693</pub-id></citation></ref>
<ref id="B111"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pitcher</surname> <given-names>D.</given-names></name> <name><surname>Ungerleider</surname> <given-names>L. G.</given-names></name></person-group> (<year>2021</year>). <article-title>Evidence for a third visual pathway specialized for social perception.</article-title> <source><italic>Trends Cogn. Sci.</italic></source> <volume>25</volume> <fpage>100</fpage>&#x2013;<lpage>110</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2020.11.006</pub-id> <pub-id pub-id-type="pmid">33334693</pub-id></citation></ref>
<ref id="B112"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Poirier</surname> <given-names>C.</given-names></name> <name><surname>Baumann</surname> <given-names>S.</given-names></name> <name><surname>Dheerendra</surname> <given-names>P.</given-names></name> <name><surname>Joly</surname> <given-names>O.</given-names></name> <name><surname>Hunter</surname> <given-names>D.</given-names></name> <name><surname>Balezeau</surname> <given-names>F.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>Auditory motion-specific mechanisms in the primate brain.</article-title> <source><italic>PLoS Biol.</italic></source> <volume>15</volume>:<issue>e2001379</issue>. <pub-id pub-id-type="doi">10.1371/JOURNAL.PBIO.2001379</pub-id> <pub-id pub-id-type="pmid">28472038</pub-id></citation></ref>
<ref id="B113"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pouw</surname> <given-names>W.</given-names></name> <name><surname>Proksch</surname> <given-names>S.</given-names></name> <name><surname>Drijvers</surname> <given-names>L.</given-names></name> <name><surname>Gamba</surname> <given-names>M.</given-names></name> <name><surname>Holler</surname> <given-names>J.</given-names></name> <name><surname>Kello</surname> <given-names>C.</given-names></name><etal/></person-group> (<year>2021</year>). <article-title>Multilevel rhythms in multimodal communication.</article-title> <source><italic>Philos. Trans. R. Soc. B</italic></source> <volume>376</volume>: 20200334. <pub-id pub-id-type="doi">10.31219/OSF.IO/PSMHN</pub-id></citation></ref>
<ref id="B114"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Price</surname> <given-names>C. J.</given-names></name></person-group> (<year>2012</year>). <article-title>A review and synthesis of the first 20years of PET and fMRI studies of heard speech, spoken language and reading.</article-title> <source><italic>Neuroimage</italic></source> <volume>62</volume> <fpage>816</fpage>&#x2013;<lpage>847</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuroimage.2012.04.062</pub-id> <pub-id pub-id-type="pmid">22584224</pub-id></citation></ref>
<ref id="B115"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rao</surname> <given-names>R. P. N.</given-names></name> <name><surname>Ballard</surname> <given-names>D. H.</given-names></name></person-group> (<year>1999</year>). <article-title>Predictive coding in the visual cortex: A functional interpretation of some extra-classical receptive-field effects.</article-title> <source><italic>Nat. Neurosci.</italic></source> <volume>2</volume> <fpage>79</fpage>&#x2013;<lpage>87</lpage>. <pub-id pub-id-type="doi">10.1038/4580</pub-id> <pub-id pub-id-type="pmid">10195184</pub-id></citation></ref>
<ref id="B116"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rauschecker</surname> <given-names>J. P.</given-names></name></person-group> (<year>1998</year>). <article-title>Cortical processing of complex sounds.</article-title> <source><italic>Curr. Opin. Neurobiol.</italic></source> <volume>8</volume> <fpage>516</fpage>&#x2013;<lpage>521</lpage>. <pub-id pub-id-type="doi">10.1016/S0959-4388(98)80040-8</pub-id></citation></ref>
<ref id="B117"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rauschecker</surname> <given-names>J. P.</given-names></name> <name><surname>Tian</surname> <given-names>B.</given-names></name></person-group> (<year>2000</year>). <article-title>Mechanisms and streams for processing of &#x201C;what&#x201D; and &#x201C;where&#x201D; in auditory cortex.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>97</volume> <fpage>11800</fpage>&#x2013;<lpage>11806</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.97.22.11800</pub-id> <pub-id pub-id-type="pmid">11050212</pub-id></citation></ref>
<ref id="B118"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Redcay</surname> <given-names>E.</given-names></name> <name><surname>Carlson</surname> <given-names>T. A.</given-names></name></person-group> (<year>2015</year>). <article-title>Rapid neural discrimination of communicative gestures.</article-title> <source><italic>Soc. Cogn. Affect. Neurosci.</italic></source> <volume>10</volume> <fpage>545</fpage>&#x2013;<lpage>551</lpage>. <pub-id pub-id-type="doi">10.1093/scan/nsu089</pub-id> <pub-id pub-id-type="pmid">24958087</pub-id></citation></ref>
<ref id="B119"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Redcay</surname> <given-names>E.</given-names></name> <name><surname>Schilbach</surname> <given-names>L.</given-names></name></person-group> (<year>2019</year>). <article-title>Using second-person neuroscience to elucidate the mechanisms of social interaction.</article-title> <source><italic>Nat. Rev. Neurosci.</italic></source> <volume>20</volume> <fpage>495</fpage>&#x2013;<lpage>505</lpage>. <pub-id pub-id-type="doi">10.1038/s41583-019-0179-4</pub-id> <pub-id pub-id-type="pmid">31138910</pub-id></citation></ref>
<ref id="B120"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Richter</surname> <given-names>D.</given-names></name> <name><surname>Ekman</surname> <given-names>M.</given-names></name> <name><surname>de Lange</surname> <given-names>F. P.</given-names></name></person-group> (<year>2018</year>). <article-title>Suppressed sensory response to predictable object stimuli throughout the ventral visual stream.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>38</volume> <fpage>7452</fpage>&#x2013;<lpage>7461</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.3421-17.2018</pub-id> <pub-id pub-id-type="pmid">30030402</pub-id></citation></ref>
<ref id="B121"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rohe</surname> <given-names>T.</given-names></name> <name><surname>Noppeney</surname> <given-names>U.</given-names></name></person-group> (<year>2015</year>). <article-title>Cortical hierarchies perform bayesian causal inference in multisensory perception.</article-title> <source><italic>PLoS Biol.</italic></source> <volume>13</volume>:<issue>e1002073</issue>. <pub-id pub-id-type="doi">10.1371/journal.pbio.1002073</pub-id> <pub-id pub-id-type="pmid">25710328</pub-id></citation></ref>
<ref id="B122"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rohe</surname> <given-names>T.</given-names></name> <name><surname>Noppeney</surname> <given-names>U.</given-names></name></person-group> (<year>2016</year>). <article-title>Distinct computational principles govern multisensory integration in primary sensory and association cortices.</article-title> <source><italic>Curr. Biol.</italic></source> <volume>26</volume> <fpage>509</fpage>&#x2013;<lpage>514</lpage>. <pub-id pub-id-type="doi">10.1016/j.cub.2015.12.056</pub-id> <pub-id pub-id-type="pmid">26853368</pub-id></citation></ref>
<ref id="B123"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rossano</surname> <given-names>F.</given-names></name> <name><surname>Liebal</surname> <given-names>K.</given-names></name></person-group> (<year>2014</year>). &#x201C;<article-title>Requests and offers,&#x201D; in orangutans and human infants</article-title>,&#x201D; in <source><italic>Requesting in social interaction</italic></source>, <role>ed.</role> <person-group person-group-type="editor"><name><surname>Drew</surname> <given-names>P.</given-names></name></person-group> (<publisher-loc>Amsterdam</publisher-loc>: <publisher-name>Benjamins</publisher-name>), <fpage>335</fpage>&#x2013;<lpage>364</lpage>. <pub-id pub-id-type="doi">10.1075/slsi.26.13ros</pub-id> <pub-id pub-id-type="pmid">33486653</pub-id></citation></ref>
<ref id="B124"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sammler</surname> <given-names>D.</given-names></name> <name><surname>Grosbras</surname> <given-names>M. H.</given-names></name> <name><surname>Anwander</surname> <given-names>A.</given-names></name> <name><surname>Bestelmeyer</surname> <given-names>P. E. G.</given-names></name> <name><surname>Belin</surname> <given-names>P.</given-names></name></person-group> (<year>2015</year>). <article-title>Dorsal and ventral pathways for prosody.</article-title> <source><italic>Curr. Biol.</italic></source> <volume>25</volume> <fpage>3079</fpage>&#x2013;<lpage>3085</lpage>. <pub-id pub-id-type="doi">10.1016/J.CUB.2015.10.009</pub-id> <pub-id pub-id-type="pmid">26549262</pub-id></citation></ref>
<ref id="B125"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Saygin</surname> <given-names>A. P.</given-names></name> <name><surname>Chaminade</surname> <given-names>T.</given-names></name> <name><surname>Ishiguro</surname> <given-names>H.</given-names></name> <name><surname>Driver</surname> <given-names>J.</given-names></name> <name><surname>Frith</surname> <given-names>C.</given-names></name></person-group> (<year>2012</year>). <article-title>The thing that should not be: Predictive coding and the uncanny valley in perceiving human and humanoid robot actions.</article-title> <source><italic>Soc. Cogn. Affect. Neurosci.</italic></source> <volume>7</volume> <fpage>413</fpage>&#x2013;<lpage>422</lpage>. <pub-id pub-id-type="doi">10.1093/scan/nsr025</pub-id> <pub-id pub-id-type="pmid">21515639</pub-id></citation></ref>
<ref id="B126"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schaafsma</surname> <given-names>S. M.</given-names></name> <name><surname>Pfaff</surname> <given-names>D. W.</given-names></name> <name><surname>Spunt</surname> <given-names>R. P.</given-names></name> <name><surname>Adolphs</surname> <given-names>R.</given-names></name></person-group> (<year>2015</year>). <article-title>Deconstructing and reconstructing theory of mind.</article-title> <source><italic>Trends Cogn. Sci.</italic></source> <volume>19</volume> <fpage>65</fpage>&#x2013;<lpage>72</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2014.11.007</pub-id> <pub-id pub-id-type="pmid">25496670</pub-id></citation></ref>
<ref id="B127"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmitt</surname> <given-names>L. M.</given-names></name> <name><surname>Erb</surname> <given-names>J.</given-names></name> <name><surname>Tune</surname> <given-names>S.</given-names></name> <name><surname>Rysop</surname> <given-names>A. U.</given-names></name> <name><surname>Hartwigsen</surname> <given-names>G.</given-names></name> <name><surname>Obleser</surname> <given-names>J.</given-names></name></person-group> (<year>2021</year>). <article-title>Predicting speech from a cortical hierarchy of event-based time scales.</article-title> <source><italic>Sci. Adv.</italic></source> <volume>7</volume>:<issue>eabi6070</issue>. <pub-id pub-id-type="doi">10.1126/sciadv.abi6070</pub-id> <pub-id pub-id-type="pmid">34860554</pub-id></citation></ref>
<ref id="B128"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schroeder</surname> <given-names>C. E.</given-names></name> <name><surname>Lindsley</surname> <given-names>R. W.</given-names></name> <name><surname>Specht</surname> <given-names>C.</given-names></name> <name><surname>Marcovici</surname> <given-names>A.</given-names></name> <name><surname>Smiley</surname> <given-names>J. F.</given-names></name> <name><surname>Javitt</surname> <given-names>D. C.</given-names></name></person-group> (<year>2001</year>). <article-title>Somatosensory input to auditory association cortex in the macaque monkey.</article-title> <source><italic>J. Neurophysiol.</italic></source> <volume>85</volume> <fpage>1322</fpage>&#x2013;<lpage>1327</lpage>.</citation></ref>
<ref id="B129"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sch&#x00FC;rmann</surname> <given-names>M.</given-names></name> <name><surname>Caetano</surname> <given-names>G.</given-names></name> <name><surname>Hlushchuk</surname> <given-names>Y.</given-names></name> <name><surname>Jousm&#x00E4;ki</surname> <given-names>V.</given-names></name> <name><surname>Hari</surname> <given-names>R.</given-names></name></person-group> (<year>2006</year>). <article-title>Touch activates human auditory cortex.</article-title> <source><italic>Neuroimage</italic></source> <volume>30</volume> <fpage>1325</fpage>&#x2013;<lpage>1331</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuroimage.2005.11.020</pub-id> <pub-id pub-id-type="pmid">16488157</pub-id></citation></ref>
<ref id="B130"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schwiedrzik</surname> <given-names>C. M.</given-names></name> <name><surname>Freiwald</surname> <given-names>W. A.</given-names></name></person-group> (<year>2017</year>). <article-title>High-level prediction signals in a low-level area of the macaque face-processing hierarchy.</article-title> <source><italic>Neuron</italic></source> <volume>96</volume> <fpage>89</fpage>&#x2013;<lpage>97.e4</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2017.09.007</pub-id> <pub-id pub-id-type="pmid">28957679</pub-id></citation></ref>
<ref id="B131"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Senkowski</surname> <given-names>D.</given-names></name></person-group> (<year>2005</year>). <article-title>Oscillatory beta activity predicts response speed during a multisensory audiovisual reaction time task: A high-density electrical mapping study.</article-title> <source><italic>Cereb. Cortex</italic></source> <volume>16</volume> <fpage>1556</fpage>&#x2013;<lpage>1565</lpage>. <pub-id pub-id-type="doi">10.1093/cercor/bhj091</pub-id> <pub-id pub-id-type="pmid">16357336</pub-id></citation></ref>
<ref id="B132"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shams</surname> <given-names>L.</given-names></name> <name><surname>Beierholm</surname> <given-names>U. R.</given-names></name></person-group> (<year>2010</year>). <article-title>Causal inference in perception.</article-title> <source><italic>Trends Cogn. Sci.</italic></source> <volume>14</volume> <fpage>425</fpage>&#x2013;<lpage>432</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2010.07.001</pub-id> <pub-id pub-id-type="pmid">20705502</pub-id></citation></ref>
<ref id="B133"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Smiley</surname> <given-names>J. F.</given-names></name> <name><surname>Hackett</surname> <given-names>T. A.</given-names></name> <name><surname>Ulbert</surname> <given-names>I.</given-names></name> <name><surname>Karmas</surname> <given-names>G.</given-names></name> <name><surname>Lakatos</surname> <given-names>P.</given-names></name> <name><surname>Javitt</surname> <given-names>D. C.</given-names></name><etal/></person-group> (<year>2007</year>). <article-title>Multisensory convergence in auditory cortex, I. Cortical connections of the caudal superior temporal plane in macaque monkeys</article-title>. <source><italic>J. Comp. Neurol</italic></source>. <volume>502</volume>, <fpage>894</fpage>&#x2013;<lpage>923</lpage>. <pub-id pub-id-type="doi">10.1002/CNE.21325</pub-id> <pub-id pub-id-type="pmid">17447261</pub-id></citation></ref>
<ref id="B134"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sohoglu</surname> <given-names>E.</given-names></name> <name><surname>Davis</surname> <given-names>M. H.</given-names></name></person-group> (<year>2016</year>). <article-title>Perceptual learning of degraded speech by minimizing prediction error.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>113</volume>:<fpage>E1747</fpage>&#x2013;<lpage>E1756</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1523266113</pub-id> <pub-id pub-id-type="pmid">26957596</pub-id></citation></ref>
<ref id="B135"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sperber</surname> <given-names>D.</given-names></name> <name><surname>Wilson</surname> <given-names>D.</given-names></name></person-group> (<year>1995</year>). <source><italic>Relevance: Communication and cognition</italic></source>, <edition>2nd Edn</edition>. <publisher-loc>Hoboken, NJ</publisher-loc>: <publisher-name>Blackwell Publishing</publisher-name>.</citation></ref>
<ref id="B136"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stein</surname> <given-names>B. E.</given-names></name></person-group> (<year>2012</year>). <source><italic>The new handbook of multisensory processing</italic></source>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>The MIT Press</publisher-name>. <pub-id pub-id-type="doi">10.7551/mitpress/8466.003.0001</pub-id></citation></ref>
<ref id="B137"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stevenson</surname> <given-names>R. A.</given-names></name> <name><surname>James</surname> <given-names>T. W.</given-names></name></person-group> (<year>2009</year>). <article-title>Audiovisual integration in human superior temporal sulcus: Inverse effectiveness and the neural processing of speech and object recognition.</article-title> <source><italic>Neuroimage</italic></source> <volume>44</volume> <fpage>1210</fpage>&#x2013;<lpage>1223</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuroimage.2008.09.034</pub-id> <pub-id pub-id-type="pmid">18973818</pub-id></citation></ref>
<ref id="B138"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Summerfield</surname> <given-names>C.</given-names></name> <name><surname>Trittschuh</surname> <given-names>E. H.</given-names></name> <name><surname>Monti</surname> <given-names>J. M.</given-names></name> <name><surname>Mesulam</surname> <given-names>M. M.</given-names></name> <name><surname>Egner</surname> <given-names>T.</given-names></name></person-group> (<year>2008</year>). <article-title>Neural repetition suppression reflects fulfilled perceptual expectations.</article-title> <source><italic>Nat. Neurosci.</italic></source> <volume>11</volume> <fpage>1004</fpage>&#x2013;<lpage>1006</lpage>. <pub-id pub-id-type="doi">10.1038/nn.2163</pub-id> <pub-id pub-id-type="pmid">19160497</pub-id></citation></ref>
<ref id="B139"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thiebaut de Schotten</surname> <given-names>M.</given-names></name> <name><surname>Forkel</surname> <given-names>S. J.</given-names></name></person-group> (<year>2022</year>). <article-title>The emergent properties of the connected brain.</article-title> <source><italic>Science</italic></source> <volume>378</volume> <fpage>505</fpage>&#x2013;<lpage>510</lpage>. <pub-id pub-id-type="doi">10.1126/science.abq2591</pub-id> <pub-id pub-id-type="pmid">36378968</pub-id></citation></ref>
<ref id="B140"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Todorovic</surname> <given-names>A.</given-names></name> <name><surname>van Ede</surname> <given-names>F.</given-names></name> <name><surname>Maris</surname> <given-names>E.</given-names></name> <name><surname>de Lange</surname> <given-names>F. P.</given-names></name></person-group> (<year>2011</year>). <article-title>Prior expectation mediates neural adaptation to repeated sounds in the auditory cortex: An MEG study.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>31</volume> <fpage>9118</fpage>&#x2013;<lpage>9123</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.1425-11.2011</pub-id> <pub-id pub-id-type="pmid">21697363</pub-id></citation></ref>
<ref id="B141"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tomasello</surname> <given-names>R.</given-names></name> <name><surname>Grisoni</surname> <given-names>L.</given-names></name> <name><surname>Boux</surname> <given-names>I.</given-names></name> <name><surname>Sammler</surname> <given-names>D.</given-names></name> <name><surname>Pulverm&#x00FC;ller</surname> <given-names>F.</given-names></name></person-group> (<year>2022</year>). <article-title>Instantaneous neural processing of communicative functions conveyed by speech prosody.</article-title> <source><italic>Cereb. Cortex</italic></source> <volume>32</volume> <fpage>4885</fpage>&#x2013;<lpage>4901</lpage>. <pub-id pub-id-type="doi">10.1093/cercor/bhab522</pub-id> <pub-id pub-id-type="pmid">35136980</pub-id></citation></ref>
<ref id="B142"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Trujillo</surname> <given-names>J. P.</given-names></name> <name><surname>Holler</surname> <given-names>J.</given-names></name></person-group> (<year>2023</year>). <article-title>Interactionally embedded gestalt principles of multimodal human communication.</article-title> <source><italic>Perspect. Psychol. Sci.</italic></source> <pub-id pub-id-type="doi">10.1177/17456916221141422</pub-id> <pub-id pub-id-type="pmid">36634318</pub-id></citation></ref>
<ref id="B143"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ungerleider</surname> <given-names>L. G.</given-names></name> <name><surname>Desimone</surname> <given-names>R.</given-names></name></person-group> (<year>1986</year>). <article-title>Cortical connections of visual area MT in the macaque.</article-title> <source><italic>J. Comp. Neurol.</italic></source> <volume>248</volume> <fpage>190</fpage>&#x2013;<lpage>222</lpage>. <pub-id pub-id-type="doi">10.1002/CNE.902480204</pub-id> <pub-id pub-id-type="pmid">3722458</pub-id></citation></ref>
<ref id="B144"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ungerleider</surname> <given-names>L. G.</given-names></name> <name><surname>Mishkin</surname> <given-names>M.</given-names></name></person-group> (<year>1982</year>). &#x201C;<article-title>Two cortical visual systems</article-title>,&#x201D; in <source><italic>Analysis of visual behavior</italic></source>, <role>eds</role> <person-group person-group-type="editor"><name><surname>Ingle</surname> <given-names>D.</given-names></name> <name><surname>Goodale</surname> <given-names>M.</given-names></name> <name><surname>Mansfield</surname> <given-names>R.</given-names></name></person-group> (<publisher-loc>Cambridge</publisher-loc>: <publisher-name>MIT Press</publisher-name>).</citation></ref>
<ref id="B145"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>van Atteveldt</surname> <given-names>N.</given-names></name> <name><surname>Formisano</surname> <given-names>E.</given-names></name> <name><surname>Goebel</surname> <given-names>R.</given-names></name> <name><surname>Blomert</surname> <given-names>L.</given-names></name></person-group> (<year>2004</year>). <article-title>Integration of letters and speech sounds in the human brain.</article-title> <source><italic>Neuron</italic></source> <volume>43</volume> <fpage>271</fpage>&#x2013;<lpage>282</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2004.06.025</pub-id> <pub-id pub-id-type="pmid">15260962</pub-id></citation></ref>
<ref id="B146"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Venezia</surname> <given-names>J. H.</given-names></name> <name><surname>Vaden</surname> <given-names>K. I.</given-names></name> <name><surname>Rong</surname> <given-names>F.</given-names></name> <name><surname>Maddox</surname> <given-names>D.</given-names></name> <name><surname>Saberi</surname> <given-names>K.</given-names></name> <name><surname>Hickok</surname> <given-names>G.</given-names></name></person-group> (<year>2017</year>). <article-title>Auditory, visual and audiovisual speech processing streams in superior temporal sulcus.</article-title> <source><italic>Front. Hum. Neurosci.</italic></source> <volume>11</volume>:<issue>174</issue>. <pub-id pub-id-type="doi">10.3389/fnhum.2017.00174</pub-id> <pub-id pub-id-type="pmid">28439236</pub-id></citation></ref>
<ref id="B147"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vigliocco</surname> <given-names>G.</given-names></name> <name><surname>Perniss</surname> <given-names>P.</given-names></name> <name><surname>Vinson</surname> <given-names>D.</given-names></name></person-group> (<year>2014</year>). <article-title>Language as a multimodal phenomenon: Implications for language learning, processing and evolution.</article-title> <source><italic>Philos. Trans. R. Soc. B</italic></source> <volume>369</volume>:<issue>20130292</issue>. <pub-id pub-id-type="doi">10.1098/RSTB.2013.0292</pub-id> <pub-id pub-id-type="pmid">25092660</pub-id></citation></ref>
<ref id="B148"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wagemans</surname> <given-names>J.</given-names></name> <name><surname>Elder</surname> <given-names>J. H.</given-names></name> <name><surname>Kubovy</surname> <given-names>M.</given-names></name> <name><surname>Palmer</surname> <given-names>S. E.</given-names></name> <name><surname>Peterson</surname> <given-names>M. A.</given-names></name> <name><surname>Singh</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>A century of Gestalt psychology in visual perception: I. Perceptual grouping and figure-ground organization.</article-title> <source><italic>Psychol. Bull.</italic></source> <volume>138</volume> <fpage>1172</fpage>&#x2013;<lpage>1217</lpage>. <pub-id pub-id-type="doi">10.1037/a0029333</pub-id> <pub-id pub-id-type="pmid">22845751</pub-id></citation></ref>
<ref id="B149"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Walbrin</surname> <given-names>J.</given-names></name> <name><surname>Koldewyn</surname> <given-names>K.</given-names></name></person-group> (<year>2019</year>). <article-title>Dyadic interaction processing in the posterior temporal cortex.</article-title> <source><italic>Neuroimage</italic></source> <volume>198</volume> <fpage>296</fpage>&#x2013;<lpage>302</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuroimage.2019.05.027</pub-id> <pub-id pub-id-type="pmid">31100434</pub-id></citation></ref>
<ref id="B150"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Walsh</surname> <given-names>K. S.</given-names></name> <name><surname>McGovern</surname> <given-names>D. P.</given-names></name> <name><surname>Clark</surname> <given-names>A.</given-names></name> <name><surname>O&#x2019;Connell</surname> <given-names>R. G.</given-names></name></person-group> (<year>2020</year>). <article-title>Evaluating the neurophysiological evidence for predictive processing as a model of perception.</article-title> <source><italic>Ann. N. Y. Acad. Sci.</italic></source> <volume>1464</volume> <fpage>242</fpage>&#x2013;<lpage>268</lpage>. <pub-id pub-id-type="doi">10.1111/nyas.14321</pub-id> <pub-id pub-id-type="pmid">32147856</pub-id></citation></ref>
<ref id="B151"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Werner</surname> <given-names>S.</given-names></name> <name><surname>Noppeney</surname> <given-names>U.</given-names></name></person-group> (<year>2010b</year>). <article-title>Superadditive responses in superior temporal sulcus predict audiovisual benefits in object categorization.</article-title> <source><italic>Cereb. Cortex</italic></source> <volume>20</volume> <fpage>1829</fpage>&#x2013;<lpage>1842</lpage>. <pub-id pub-id-type="doi">10.1093/cercor/bhp248</pub-id> <pub-id pub-id-type="pmid">19923200</pub-id></citation></ref>
<ref id="B152"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Werner</surname> <given-names>S.</given-names></name> <name><surname>Noppeney</surname> <given-names>U.</given-names></name></person-group> (<year>2010a</year>). <article-title>Distinct functional contributions of primary sensory and association areas to audiovisual integration in object categorization.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>30</volume> <fpage>2662</fpage>&#x2013;<lpage>2675</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.5091-09.2010</pub-id> <pub-id pub-id-type="pmid">20164350</pub-id></citation></ref>
<ref id="B153"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Werner</surname> <given-names>S.</given-names></name> <name><surname>Noppeney</surname> <given-names>U.</given-names></name></person-group> (<year>2011</year>). <article-title>The contributions of transient and sustained response codes to audiovisual integration.</article-title> <source><italic>Cereb. Cortex</italic></source> <volume>21</volume> <fpage>920</fpage>&#x2013;<lpage>931</lpage>. <pub-id pub-id-type="doi">10.1093/cercor/bhq161</pub-id> <pub-id pub-id-type="pmid">20810622</pub-id></citation></ref>
<ref id="B154"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Willems</surname> <given-names>R. M.</given-names></name> <name><surname>Frank</surname> <given-names>S. L.</given-names></name> <name><surname>Nijhof</surname> <given-names>A. D.</given-names></name> <name><surname>Hagoort</surname> <given-names>P.</given-names></name> <name><surname>van den Bosch</surname> <given-names>A.</given-names></name></person-group> (<year>2016</year>). <article-title>Prediction during natural language comprehension.</article-title> <source><italic>Cereb. Cortex</italic></source> <volume>26</volume> <fpage>2506</fpage>&#x2013;<lpage>2516</lpage>. <pub-id pub-id-type="doi">10.1093/cercor/bhv075</pub-id> <pub-id pub-id-type="pmid">25903464</pub-id></citation></ref>
<ref id="B155"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wright</surname> <given-names>T. M.</given-names></name></person-group> (<year>2003</year>). <article-title>Polysensory interactions along lateral temporal regions evoked by audiovisual speech.</article-title> <source><italic>Cereb. Cortex</italic></source> <volume>13</volume> <fpage>1034</fpage>&#x2013;<lpage>1043</lpage>. <pub-id pub-id-type="doi">10.1093/cercor/13.10.1034</pub-id> <pub-id pub-id-type="pmid">12967920</pub-id></citation></ref>
<ref id="B156"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wurm</surname> <given-names>M. F.</given-names></name> <name><surname>Caramazza</surname> <given-names>A.</given-names></name></person-group> (<year>2022</year>). <article-title>Two &#x2018;what&#x2019; pathways for action and object recognition.</article-title> <source><italic>Trends Cogn. Sci.</italic></source> <volume>26</volume> <fpage>103</fpage>&#x2013;<lpage>116</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2021.10.003</pub-id> <pub-id pub-id-type="pmid">34702661</pub-id></citation></ref>
<ref id="B157"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>D. Y. J.</given-names></name> <name><surname>Rosenblau</surname> <given-names>G.</given-names></name> <name><surname>Keifer</surname> <given-names>C.</given-names></name> <name><surname>Pelphrey</surname> <given-names>K. A.</given-names></name></person-group> (<year>2015</year>). <article-title>An integrative neural model of social perception, action observation, and theory of mind.</article-title> <source><italic>Neurosci. Biobehav. Rev.</italic></source> <volume>51</volume> <fpage>263</fpage>&#x2013;<lpage>275</lpage>. <pub-id pub-id-type="doi">10.1016/j.neubiorev.2015.01.020</pub-id> <pub-id pub-id-type="pmid">25660957</pub-id></citation></ref>
</ref-list>
</back>
</article>