<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3-mathml3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:ali="http://www.niso.org/schemas/ali/1.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="1.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Psychol.</journal-id>
<journal-title-group>
<journal-title>Frontiers in Psychology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Psychol.</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub">1664-1078</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpsyg.2025.1610179</article-id><article-version article-version-type="Version of Record" vocab="NISO-RP-8-2008"/>
<article-categories>
<subj-group subj-group-type="heading"><subject>Original Research</subject></subj-group>
</article-categories>
<title-group>
<article-title>Do frequency and frequency-related measures signal turn completion? An exploratory corpus study</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>R&#x00FC;hlemann</surname>
<given-names>Christoph</given-names>
</name><xref ref-type="aff" rid="aff1"/>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/1559907"/>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="validation" vocab-term-identifier="https://credit.niso.org/contributor-roles/validation/">Validation</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="conceptualization" vocab-term-identifier="https://credit.niso.org/contributor-roles/conceptualization/">Conceptualization</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="methodology" vocab-term-identifier="https://credit.niso.org/contributor-roles/methodology/">Methodology</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; original draft" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-original-draft/">Writing &#x2013; original draft</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Data curation" vocab-term-identifier="https://credit.niso.org/contributor-roles/data-curation/">Data curation</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="supervision" vocab-term-identifier="https://credit.niso.org/contributor-roles/supervision/">Supervision</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="visualization" vocab-term-identifier="https://credit.niso.org/contributor-roles/visualization/">Visualization</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="investigation" vocab-term-identifier="https://credit.niso.org/contributor-roles/investigation/">Investigation</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="resources" vocab-term-identifier="https://credit.niso.org/contributor-roles/resources/">Resources</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Funding acquisition" vocab-term-identifier="https://credit.niso.org/contributor-roles/funding-acquisition/">Funding acquisition</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Project administration" vocab-term-identifier="https://credit.niso.org/contributor-roles/project-administration/">Project administration</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; review &amp; editing" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-review-editing/">Writing &#x2013; review &#x0026; editing</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="software" vocab-term-identifier="https://credit.niso.org/contributor-roles/software/">Software</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Formal analysis" vocab-term-identifier="https://credit.niso.org/contributor-roles/formal-analysis/">Formal analysis</role>
</contrib>
</contrib-group>
<aff id="aff1"><institution>University of Freiburg</institution>, <city>Freiburg</city>, <country country="de">Germany</country></aff>
<author-notes>
<corresp id="c001"><label>&#x002A;</label>Correspondence: Christoph R&#x00FC;hlemann, <email xlink:href="mailto:chrisruehlemann@googlemail.com">chrisruehlemann@googlemail.com</email></corresp>
</author-notes>
<pub-date publication-format="electronic" date-type="pub" iso-8601-date="2025-12-04">
<day>04</day>
<month>12</month>
<year>2025</year>
</pub-date>
<pub-date publication-format="electronic" date-type="collection">
<year>2025</year>
</pub-date>
<volume>16</volume>
<elocation-id>1610179</elocation-id>
<history>
<date date-type="received">
<day>11</day>
<month>04</month>
<year>2025</year>
</date>
<date date-type="accepted">
<day>21</day>
<month>10</month>
<year>2025</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2025 R&#x00FC;hlemann.</copyright-statement>
<copyright-year>2025</copyright-year>
<copyright-holder>R&#x00FC;hlemann</copyright-holder>
<license><ali:license_ref start_date="2025-12-04">https://creativecommons.org/licenses/by/4.0/</ali:license_ref>
<license-p>This is an open-access article distributed under the terms of the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution License (CC BY)</ext-link>. The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</license-p>
</license>
</permissions>
<abstract>
<p>Speakers in conversation have access to word frequency information stored in the mental lexicon. This article examines whether word frequencies play a role as a turn-completion cue in conversation. Based on the Freiburg Multimodal Interaction Corpus (FreMIC), frequencies and frequency-related measures are compared in turn-constructional units (TCUs) from two types of action/turns that are systematically complementary with regard to turn transition: question TCUs, which exert pressure for the next speaker to take over, and storytelling TCUs, which largely resist transition. Based on these systematic tendencies, the focus is on question TCUs that result in speaker change and story TCUs that result in speaker continuation, thereby tying <italic>turn-transition</italic> inevitably to <italic>social action.</italic> We address two research questions: RQ #1 - <italic>Do word frequencies in the TCUs follow an S-shaped pattern?</italic> and RQ #2 - <italic>Which frequency-related measures predict that a TCU will be followed by a turn transition or continuation?</italic> To address RQ #1, a mixed effects model showed the same S-shape found in prior research in large corpora. To address RQ #2, a mixed-effects model was computed, with turn transition (TT) as a binary outcome variable. The model suggested that turn finality in question TCUs co-occurs with a more pronounced drop in word frequency toward the TCU end than in story TCUs. A follow-up analysis revealed a more asymmetrical (right-leaning) distribution of nouns in turn-final question TCUs. Information extracted from word frequencies may hence serve listeners in conversation as cues to anticipate turn completion in questions as opposed to turn continuation in stories.</p>
</abstract>
<kwd-group>
<kwd>word frequencies</kwd>
<kwd>turn-constructional unit</kwd>
<kwd>questions</kwd>
<kwd>storytelling</kwd>
<kwd>turn-transition</kwd>
</kwd-group><funding-group><funding-statement>The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by a grant from the Deutsche Forschungsgemeinschaft (DFG): <ext-link xlink:href="https://gepris.dfg.de/gepris/projekt/497779797" ext-link-type="uri">https://gepris.dfg.de/gepris/projekt/497779797</ext-link>; grant number 497779797.</funding-statement></funding-group>
<counts>
<fig-count count="5"/>
<table-count count="8"/>
<equation-count count="0"/>
<ref-count count="84"/>
<page-count count="17"/>
<word-count count="13571"/>
</counts>
<custom-meta-group>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Psychology of Language</meta-value>
</custom-meta>
</custom-meta-group>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="sec1">
<label>1</label>
<title>Introduction</title>
<p>Speakers in conversation across the world manage to produce a response to a prior turn with a small gap of around 200&#x202F;ms (<xref ref-type="bibr" rid="ref75">Stivers et al., 2009</xref>, p. 10588; <xref ref-type="bibr" rid="ref27">Heldner and Edlund, 2010</xref>, p. 564). How is this precision-timing achieved? It is commonly assumed that listeners dual-task, predicting the unfolding action (speech act) and its time course while pre-planning their own response (<xref ref-type="bibr" rid="ref45">Levinson and Torreira, 2015</xref>). The pre-planned response is launched as soon as the speaker gives the ultimate &#x201C;go-signal&#x201D; (<xref ref-type="bibr" rid="ref4">Barthel et al., 2017</xref>: <xref ref-type="bibr" rid="ref30">Holler and Levinson, 2019</xref>; <xref ref-type="bibr" rid="ref45">Levinson and Torreira, 2015</xref>; <xref ref-type="bibr" rid="ref48">Magyari et al., 2014</xref>; <xref ref-type="bibr" rid="ref18">Gisladottir et al., 2018</xref>; <xref ref-type="bibr" rid="ref7">B&#x00F6;gels and Torreira, 2015</xref>). The model is schematically depicted in <xref ref-type="fig" rid="fig1">Figure 1</xref>.<xref ref-type="fn" rid="fn0001"><sup>1</sup></xref></p>
<fig position="float" id="fig1">
<label>Figure 1</label>
<caption>
<p>Schematic representation of the current consensus model on the synergy of early prediction and planning by the listener and late occurrence of <italic>go</italic>-signals that facilitate precision timing in turn transition.</p>
</caption>
<graphic xlink:href="fpsyg-16-1610179-g001.tif" mimetype="image" mime-subtype="tiff">
<alt-text content-type="machine-generated">Diagram illustrating turn-taking in conversation. Speaker A's turn is followed by a transition space, marked by "A gives go-signal(s)," then Speaker B's turn. Below, B's actions: starting predictive comprehension, production planning, and launching production are shown sequentially.</alt-text>
</graphic>
</fig>
<p>Previous research on resources that listeners exploit in order to determine when a turn has, or is about to, come to a close has suggested a large number of such resources in all modalities. These resources do not only comprise &#x201C;one&#x2013;off&#x201C;cues issued by the speaker upon turn completion, for example, a trail-off conjunctional or turn-final lengthening, but also include indexes derived from the turn as a whole that allow <italic>long-distance projection,</italic> such as lexico-syntactic predictability or rallentando (<xref ref-type="bibr" rid="ref45">Levinson and Torreira, 2015</xref>, p. 13; <xref ref-type="bibr" rid="ref63">R&#x00FC;hlemann and Gries, 2020</xref>; cf. also <xref ref-type="bibr" rid="ref66">Sacks et al., 1974</xref>; <xref ref-type="bibr" rid="ref10">Clayman, 2013</xref>; <xref ref-type="bibr" rid="ref48">Magyari et al., 2014</xref>).<xref ref-type="fn" rid="fn0002"><sup>2</sup></xref></p>
<p>In this study, we examine word frequency and related measures as another verbal resource to project and predict turn-completion. Frequency effects can be observed at almost any level of inquiry into language processing (<xref ref-type="bibr" rid="ref15">Ellis, 2002</xref>). Our concern with frequency in this article is motivated by prior research suggesting an S-shaped distribution of word frequencies in conversational turns-at-talk (<xref ref-type="bibr" rid="ref83">Yu et al., 2016</xref>; <xref ref-type="bibr" rid="ref38">Klafka and Yurovsky, 2021</xref>; <xref ref-type="bibr" rid="ref59">R&#x00FC;hlemann, 2020a</xref>, <xref ref-type="bibr" rid="ref60">2020b</xref>; <xref ref-type="bibr" rid="ref62">R&#x00FC;hlemann and Barthel, 2024</xref>): frequencies start very high in turn-first position, then drop and level out until the last position in the turn, until they drop again steeply.</p>
<p>The S-shape pattern emerges very clearly and with little variation in the large conversational subcorpus of the British National Corpus (cf. <xref ref-type="bibr" rid="ref29">Hoffmann et al., 2008</xref>) and is strong enough to also appear in the much smaller Freiburg Multimodal Interaction Corpus (FreMIC) (<xref ref-type="bibr" rid="ref62">R&#x00FC;hlemann and Barthel, 2024</xref>).</p>
<p>Conversationalists are sensitive to word frequencies (<xref ref-type="bibr" rid="ref24">Hasher and Chromiak, 1977</xref>; <xref ref-type="bibr" rid="ref25">Hasher and Zacks, 1984</xref>). This transpires, for example, from the word frequency effect, that is, the fact that rare words are more slowly processed than common words (<xref ref-type="bibr" rid="ref52">Oldfield and Wingfield, 1965</xref>; <xref ref-type="bibr" rid="ref36">Jescheniak and Levelt, 1994</xref>; <xref ref-type="bibr" rid="ref32">Indefrey and Levelt, 2004</xref>; <xref ref-type="bibr" rid="ref44">Levelt et al., 1999</xref>; <xref ref-type="bibr" rid="ref37">Johns et al., 2012</xref>). Given this sensitivity, the S-shaped distribution of word frequencies in turn would suggest the possibility that the drop in frequency in turn-last position represents a go-signal, that is, a one-off cue occurring upon turn completion, similar to an adress term (<xref ref-type="bibr" rid="ref66">Sacks et al., 1974</xref>), or the return of the speaker&#x2019;s gaze (e.g., <xref ref-type="bibr" rid="ref1">Auer, 2018</xref>, <xref ref-type="bibr" rid="ref2">2021a</xref>, <xref ref-type="bibr" rid="ref3">2021b</xref>) However, given that frequencies decrease not just on the last word but overall within turns we wish to allow for the possibility that frequency serves as a resource for <italic>advance-projecting turn completion</italic> very much like syntax: just as syntax provides a structural envelope allowing the listener to predict the structural contour of the turn-in-progress, so frequency may provide a statistical envelope for the listener to predict the time course of the turn.</p>
<p>We thus hypothesize that the dynamic changes in frequency, including but not restricted to the drop in frequency on the turn-final word and the changes in frequency-related measures, do not go unnoticed by the listener and can be used by the listener as resources to (advance-)project (imminent) turn completion. As we have no access to recipients&#x2019; internal processes, to test the hypothesis, we investigate word frequencies and related measures in turns and their potential correlation with the actual occurrence or non-occurrence of turn transition observed in the sequence.</p>
<p>Specifically, we address two research questions: RQ #1 - <italic>Do word frequencies in the TCUs follow an S-shaped pattern?</italic> and RQ #2 - <italic>Which frequency-related measures predict that a TCU will be followed by turn transition or continuation?</italic></p>
<p>Crucially, RQ #2 is examined by comparing questions and stories. These kinds of turns/actions differ fundamentally: questions are short, they consist mostly of a single turn-constructional unit, and they exert maximal pressure on the listener to respond (<xref ref-type="bibr" rid="ref76">Stivers and Rossano, 2010</xref>, p. 29). Stories, by contrast, are extended turns, consisting of multiple TCUs, during most of which turn transition is avoided&#x2014;typically until the climax, where assessments by the recipient are normatively relevant (<xref ref-type="bibr" rid="ref72">Stivers, 2008</xref>). In (information-seeking) questions, the pressure is maximal: the provision of the sought information is normatively relevant; non-provision of the information may get negatively sanctioned (<xref ref-type="bibr" rid="ref74">Stivers, 2013</xref>, p. 204). In <xref ref-type="bibr" rid="ref73">Stivers (2010)</xref>, for example, 93% of all questions were indeed followed by a turn transition. Storytellings provide a very stark contrast: they can be &#x201C;very long stretches of talk being properly understood as being organized under the scope of a single sequence&#x201D; (<xref ref-type="bibr" rid="ref67">Schegloff, 2007</xref>, p. 215). They require the suspension of ordinary turn-taking (e.g., <xref ref-type="bibr" rid="ref34">Jefferson, 1978</xref>) and entail a structural asymmetry, with the storyteller building up a succession of turn-constructional units (TCUs), and the listener filling the places between the units with recipient feedback in the form of vocal continuers (e.g.&#x2018;mm&#x2019;, &#x2018;uhu&#x2019;, and &#x2018;yeah&#x2019;) (<xref ref-type="bibr" rid="ref19">Goodwin, 1984</xref>) and/or visual continuers, such as nods (<xref ref-type="bibr" rid="ref72">Stivers, 2008</xref>) and blinks (<xref ref-type="bibr" rid="ref31">H&#x00F6;mke et al., 2017</xref>). Obviously, question turns can also be built out of multiple TCUs, and storytellings also come to a point where more action than issuing a continuer is expected from the interlocutor (namely at the story&#x2019;s climax; cf. <xref ref-type="bibr" rid="ref72">Stivers, 2008</xref>) and where, then, turn transition does occur. In addressing RQ #2, we therefore focus entirely on question TCUs that result in turn transfer and on story TCUs that do not lead to turn transition.</p>
<p>This methodological decision has important implications. The decision effectively means that turn transition is perfectly correlated with the type of action. Therefore, the present analysis does not claim to separate frequency-related features of transition <italic>per se</italic> from those associated with the social action of asking a question versus telling a story. Instead, what the study aims to identify are candidate frequency-related features that <italic>co-occur</italic> with transition-likely actions (questions) versus transition-resistant actions (stories). So, while we are using predictive modeling, prediction is used as a means to <italic>discriminate</italic> frequency-related features associated with turn-final question TCUs and, respectively, turn-medial story TCUs.</p>
</sec>
<sec id="sec2">
<label>2</label>
<title>Data</title>
<sec id="sec3">
<label>2.1</label>
<title>The Freiburg multimodal interaction corpus</title>
<p>The data underlying the analyses in this article are part of the Freiburg Multimodal Interaction Corpus (FreMIC). Although small, FreMIC holds information of a breadth and level of detail not commonly seen in linguistic corpora (for a full description, see <xref ref-type="bibr" rid="ref64">R&#x00FC;hlemann and Ptak, 2023</xref>).</p>
<p>FreMIC comprises ~30&#x202F;h of video-recordings in 38 files transcribed and annotated in detail and featuring large streams of automatically generated multimodal data (e.g., eye gaze and pupil size). FreMIC&#x2019;s total word count is 375,637. All conversations were annotated and transcribed in ELAN (<xref ref-type="bibr" rid="ref82">Wittenburg et al., 2006</xref>). Two types of transcriptions were used: orthographic and conversation-analytic (e.g., <xref ref-type="bibr" rid="ref35">Jefferson, 2004</xref>); the latter renders verbal content and interactionally relevant details of sequencing (e.g., overlap and latching), temporal aspects (pauses and acceleration/deceleration), phonological aspects (e.g., intensity, pitch, stretching, truncation and voice quality), and laughter. The underlying unit of analysis for transcription was the interpausal unit (IPU); that is, whenever a speaker stopped speaking for longer than 180&#x202F;ms a new annotation was begun, a threshold that reflects the human 120 to 200&#x202F;ms threshold for the detection of acoustic silence (<xref ref-type="bibr" rid="ref26">Heldner, 2011</xref>; <xref ref-type="bibr" rid="ref81">Walker and Trimboli, 1982</xref>; cf. also <xref ref-type="bibr" rid="ref45">Levinson and Torreira, 2015</xref> and <xref ref-type="bibr" rid="ref55">Roberts et al., 2015</xref>, who also work with IPUs).</p>
</sec>
<sec id="sec4">
<label>2.2</label>
<title>Participants</title>
<p>Forty-one individual participants were recruited to contribute to one or more of the 38 recorded conversations (total run time 30&#x202F;h). Recordings lasted between 30 and 98&#x202F;min (mean&#x202F;=&#x202F;46.75&#x202F;min, SD&#x202F;=&#x202F;13.80).</p>
<p>The participants were explicitly told they were free to talk about anything that came to their minds. They were mainly students at Albert-Ludwigs-University Freiburg, as well as their friends and relatives [17 men, 21 women, 3 diverse/NA; mean age&#x202F;=&#x202F;26&#x202F;years (SD&#x202F;=&#x202F;5.7&#x202F;years)]. Most participants&#x2019; first language was English (<italic>n</italic>&#x202F;=&#x202F;38, out of 41). All participants had normal or corrected-to-normal vision and hearing. Participants gave their informed consent about the use of the recorded data, stating their individual choices as to which of their data can be used and for what specific purposes. They received a compensation of &#x20AC;15 per hour for their participation.</p>
</sec>
<sec id="sec5">
<label>2.3</label>
<title>The c7 tag set</title>
<p>All orthographic transcripts in FreMIC were part-of-speech tagged using the CLAWS web tagger (<xref ref-type="bibr" rid="ref17">Garside and Smith, 1997</xref>) and its <italic>c7</italic> tag set.<xref ref-type="fn" rid="fn0003"><sup>3</sup></xref> The <italic>c7</italic> tag set is a fine-grained tag set providing a total of 138 PoS categories (cf. <xref rid="SM1" ref-type="supplementary-material">Supplementary Material 1</xref>). The major advantage of such a fine-grained set is that it helps distinguish distinct morpho-syntactic functions of one and the same word form. For example, the word form <italic>that</italic> in English can take on a number of functions in context, for example, as a demonstrative as in <italic>when was that?</italic>, where in <italic>c7</italic> it is tagged <italic>that_DD1</italic>, a relativizer as in <italic>the day that follows Christmas</italic> (<italic>that_CST</italic>), a complex subordinating conjunction as in <italic>now that you talk it&#x2019;s fine</italic> (<italic>that_CS22</italic>), and an adverb as in <italic>it&#x2019;s not that far</italic> (<italic>that_RG</italic>).<xref ref-type="fn" rid="fn0004"><sup>4</sup></xref> The accuracy rate for the <italic>c7</italic> tagset is 96&#x2013;97% (Rayson, personal email communication; cf. also <xref ref-type="bibr" rid="ref42">Leech et al., 1994</xref>; <xref ref-type="bibr" rid="ref17">Garside and Smith, 1997</xref>).</p>
</sec>
<sec id="sec6">
<label>2.4</label>
<title>The data subsets</title>
<sec id="sec7">
<label>2.4.1</label>
<title>Data selection</title>
<p>Question turns can be used to do a wide range of things, such as initiating repair, confirming, and assessing (e.g., <xref ref-type="bibr" rid="ref73">Stivers, 2010</xref>). This study focuses on information-seeking questions.<xref ref-type="fn" rid="fn0005"><sup>5</sup></xref> Four syntactic types were targeted: <italic>wh</italic>-questions, polar questions, declarative questions, and multi-clausal <italic>or</italic>-questions, such as <italic>is it!mult!iple singers for the band or is she like the main one &#x00B0;then&#x00B0;&#x202F;=</italic>.</p>
<p>The stories for this analysis were selected from the data used for a prior analysis (<xref ref-type="bibr" rid="ref9001">R&#x00FC;hlemann and Trujillo, 2024</xref>) based on the condition that they be &#x2018;big-package&#x2019; stories involving canonical story structure with (optional) story abstract, background, complicating events, and climax (<xref ref-type="bibr" rid="ref41">Labov and Waletzky, 1967</xref>; <xref ref-type="bibr" rid="ref40">Labov, 1972</xref>; cf. also <xref ref-type="bibr" rid="ref19">Goodwin, 1984</xref>).</p>
<p>Both the questions and the storytellings selected were elaborately pre-processed in a joint effort by multiple researchers (<xref ref-type="bibr" rid="ref61">R&#x00FC;hlemann et al., n.d.</xref>). The pre-processing is detailed in the following.</p>
</sec>
</sec>
<sec id="sec8">
<label>2.5</label>
<title>Data pre-processing</title>
<p>Turns can be single-unit turns or multi-unit turns (cf. <xref ref-type="bibr" rid="ref56">Robinson et al., 2022</xref>). Storytellings are virtually always such extended turns stretching over multiple turn-constructional units (TCUs), and question turns can harbor a complex structure too. The questions and storytellings that form our data were therefore manually segmented into TCUs and whatever other units were found.</p>
<p>TCUs were operationalized as &#x201C;coherent and self-contained utterance[s], recognizable in context as &#x2018;possibly complete&#x2019;&#x201D; (<xref ref-type="bibr" rid="ref10">Clayman, 2013</xref>, p. 151) so that another speaker could legitimately step in. &#x201C;Completeness&#x201D; was investigated in terms of syntax, prosody, and/or pragmatics (<xref ref-type="bibr" rid="ref10">Clayman, 2013</xref>). While syntax served as the main guide for identifying TCU boundaries, prosody could override it in certain cases&#x2014;specifically when (i) an extension, though grammatically complete, was bound to the prior unit through intonation, and (ii) the break between the core TCU and its extension was made audible by a shift in pitch or contour. The TCU segmentation in questions and stories is detailed in the following.</p>
<sec id="sec9">
<label>2.5.1</label>
<title>TCU segmentation</title>
<sec id="sec10">
<label>2.5.1.1</label>
<title>TCU segmentation in questions</title>
<p>Question <italic>turns</italic> can be single-TCU turns or multi-unit turns exhibiting a more complex structure due not only to the occurrence of more than one question-TCU but also to the speaker&#x2019;s use of other, non-TCU or non-question material. As is generally the case (cf. <xref ref-type="bibr" rid="ref56">Robinson et al., 2022</xref>), most questions in the data were single-TCU turns; question turns with two or more question-TCUs were less frequent. Consider extract (1), where the distinct turn components are separated by <bold>|</bold>:</p><preformat>(1) [F04, Sequ 35]
01 	B:	[but] =&#x00B0;but&#x00B0; w- if you say it's a Dachgeschoss top floor is it like 	02			(0.493) slanted? <bold>| pol</bold>
03		and can you actually [walk?] <bold>| pol</bold></preformat>
<p>In multi-unit question turns, speakers often also use TCUs that do not perform the action of asking a question but that do other things (labeled <italic>non-Q</italic>), as in extract (2):</p>
<preformat>(2) [F01, Sequ 1]

01 	A:	&#x003E;like I do n't understand&#x003C; <bold>| nonQ</bold>
02		sorry <bold>| nonQ</bold>
03		like how old's your mom&#x00BF; <bold>| wh</bold></preformat>
<p>The first TCU <italic>&#x003E;&#x202F;like I do n&#x2019;t understand&#x003C;</italic> as well as the following TCU <italic>sorry</italic> are clearly not questions; only the third TCU <italic>like how old&#x2019;s your mom&#x00BF;</italic> serves to request information.</p>
<p>Question-TCUs are sometimes extended by a turn increment; to the extent that these were syntactically and/or prosodically separated from the preceding question TCU, they were treated as a separate, extension TCU (labeled <italic>ext</italic>), as shown in extract (3):</p>
<preformat>(3) [F07, Sequ 109]

01 	C:	&#x003C;what would you call it&#x003E; <bold>| wh</bold>
02		this <bold>| frg</bold>
03		you know when you don't clean your sink [like ever] <bold>| ext</bold></preformat>
<p>Here, the first segment represents the question TCU; it is followed by the fragment <italic>this</italic>, and finally extended with <italic>you know when you do not clean your sink [like ever].</italic></p>
<p>Not all verbal material a speaker uses in a turn may be part of a TCU; these components are referred to as fragments (labeled <italic>frg</italic>). They include syntactically incomplete utterances, turn-initial particles, as well as turn-final particles. Such particles are treated as fragments only if they are separated from the TCU by an intonation boundary (indicated in the transcripts by &#x201C;,&#x201D; &#x201C;?&#x201D; or &#x201C;&#x00BF;&#x201D;). Contrarily, if they are intonationally integrated into the TCU, they are treated as part of the TCU. For example, in extract (4), the (repeated) particle <italic>so</italic> heading the question-TCU <italic>[so] so do you just stay on the cruise ship</italic>, is intonationally integrated into the TCU and therefore considered a part of it. By contrast, the trail-off conjunctional <italic>o:r&#x202F;=&#x202F;following</italic> the question-TCU is intonationally separated and therefore a fragment:</p>
<preformat>(4) [F08, Sequ 207]

01 	C:	[so] so do you just stay on the cruise ship, <bold>| pol</bold>
02		o::r= <bold>| frg</bold></preformat>
</sec>
<sec id="sec11">
<label>2.5.1.2</label>
<title>TCU segmentation in storytellings</title>
<p>Storytellings are often considered multi-unit turns as they are of extended length and consist of several, often numerous TCUs. Storytellings are thus large &#x201C;projects,&#x201D; whose completion is potentially projected by a story preface adumbrating the story&#x2019;s high point and/or the storyteller&#x2019;s stance toward it (<xref ref-type="bibr" rid="ref72">Stivers, 2008</xref>). Once the co-participants grant permission to carry out the telling project, they also implicitly agree to a suspension of ordinary turn-taking for the duration of the story, giving the storyteller the right to an extended turn, involving a series of narrative TCUs.</p>
<p>However, not all TCUs a storyteller uses in telling their story are <italic>per se</italic> a narrative TCU. Story recipients may insert comments or ask questions in mid-story position, which the storyteller responds to; alternatively, storytellers themselves may interrupt the telling, for example, to recruit story recipients in a word search. These actions/TCUs by the storyteller are not narrative TCUs with suspended turn-taking. Rather, in that the storyteller responds to or seeks to initiate a recipient&#x2019;s action, these TCUs are interactive ones in which normal turn-taking is briefly resumed. Moreover, even in uninterrupted, smoothly delivered storytelling, turn transition is not avoided everywhere. On the contrary, based on a conceptualization of &#x201C;storytelling as an activity that both takes a stance toward what is being reported and makes the taking of a stance by the recipient relevant&#x201D; (<xref ref-type="bibr" rid="ref72">Stivers, 2008</xref>, p. 32), the story climax can be considered the transition-relevance point in storytelling interaction. For it is here, at or around the story&#x2019;s high point, that story recipients are expected to actively take a stance on the story events&#x2014;a stance that, preferably, &#x201C;mirrors&#x201D; the storyteller&#x2019;s. That is, those narrative TCUs that depict the story&#x2019;s high point are then designed, not to avoid, but to initiate turn transfer.</p>
<p>To illustrate, in (5), (where narrative TCUs are labeled <italic>narr</italic> and interactive TCUs are labeled <italic>int</italic>), speaker A is telling a story about his father&#x2019;s career as a diplomat, which the storyteller bills as a <italic>sad story</italic> (not shown in the transcript). The father&#x2019;s career hit a bump when the US reached its <italic>maximum budget deficit</italic> (line 04). At this point in the telling, the storyteller changes into interactive mode by asking <italic>&#x00B0;what&#x2019;s&#x00B0;</italic>(.) (line 05) <italic>&#x00B0;what&#x2019;s that called again?&#x00B0;</italic> (line 06), to which none of the two recipients respond immediately, so he continues with the story <italic>so there&#x2019;s a government shutdown&#x003C;</italic> (line 08) before, finally, recipient A does proffer <italic>the [fiscal cliff]</italic> (line 09) as a candidate term. Speaker A immediately confirms this as the searched-for term by repeating it emphatically (line 10) and reaffirming it (line 11), and then resumes the telling (lines 13 and 15). In line 15, the telling reaches (the beginning of) the story climax: as a result of the fiscal cliff, the father&#x2019;s position <italic>as a diplomat is cut</italic>&#x2014;which is the &#x201C;sad&#x201D; event that the story set out to relate. As per preference structure (<xref ref-type="bibr" rid="ref72">Stivers, 2008</xref>), recipient A answers empathically <italic>wow</italic>:</p>
<preformat>(5) [F16, story &#x201C;Sad story&#x201D;]

01	C:	=u:m and then they had a &#x2191;budget&#x2191; cut (.) <bold>| narr</bold>
02		um oh I mean u:h: <bold>| frg</bold>
03		the US reaches its um budget deficit, <bold>| narr</bold>
04		&#x003E;its maximum budget deficit&#x003C; <bold>| narr</bold>
05	<bold>&#x2014;&#x003E;</bold>	&#x00B0;what's&#x00B0; (.) <bold>| frg</bold>
06	<bold>&#x2014;&#x003E;</bold>	&#x00B0;what's that called again?&#x00B0; <bold>| int</bold>
07	<bold>&#x2014;&#x003E;</bold>	&#x00B0;governmental&#x00B0; tt &#x003E;I don't know <bold>| int</bold>
08		so there's a government shutdown&#x003C; <bold>| narr</bold>
09	A:	the [ fiscal   cliff  ]
10	C:	    [a:nd the !fiscal!] <bold>| frg</bold>
11	<bold>&#x2014;&#x003E;</bold>	yeah &#x00B0;ye[ah]&#x00B0; (.) <bold>| int</bold>
12		and u:h <bold>| frg</bold>
13		and so as a result all new positions are cut <bold>| narr</bold>
14	A:	[mm    ]
15	C:	[uh and] his position as a as a diplomat is cut <bold>| narr</bold>
16	A:	wow</preformat>
<p>Storytellers frequently, especially around story climaxes, use direct speech (or constructed dialog or enactments, which clusters around climaxes; cf. <xref ref-type="bibr" rid="ref40">Labov, 1972</xref>; <xref ref-type="bibr" rid="ref46">Li, 1986</xref>; <xref ref-type="bibr" rid="ref49">Mathis and Yule, 1994</xref>; <xref ref-type="bibr" rid="ref50">Mayes, 1990</xref>; <xref ref-type="bibr" rid="ref51">Norrick, 2000</xref>; <xref ref-type="bibr" rid="ref11">Clift and Holt, 2007</xref>; <xref ref-type="bibr" rid="ref58">R&#x00FC;hlemann, 2013</xref>), as exemplified in extract (6); the content of direct speech (or, as in lines 01 and 06, silent gesture) is indicated by ~; TCUs containing direct speech are labeled <italic>dr</italic>:</p>
<preformat>(6) [F27, story &#x201C;Black Forest&#x201D;]

01	A:	I was like ~yo&#x00BF; ((imitates typing on keyboard))~ <bold>| dr</bold>
02		~YO GUYS I think I'm gonna go to this place called &#x003C;!Frei!:bu:rg 		03		a:nd&#x003E; there's !some!thing here called the Black !Fo!rest~<bold>| dr</bold>
04		and it's &#x003C;almost like&#x003E; everything STOPped <bold>| narr</bold>
05		like ~&#x2191;weow&#x2191;~ <bold>| dr</bold>
06		and everyone just stopped like ~((freezes/2.5))~ <bold>| dr</bold>
07	B:	[((laughs))]
08	A:	[it  got like] !NO! REACtion <bold>| narr</bold></preformat>
<p>Another critical part of the data pre-processing was the annotation of <italic>Turn Transition</italic> (TT), the response variable in model #2, addressing RQ #2.</p>
</sec>
</sec>
<sec id="sec12">
<label>2.5.2</label>
<title>Turn-transition coding</title>
<sec id="sec13">
<label>2.5.2.1</label>
<title>Turn-transition coding in questions</title>
<p>The critical variable in this study, indeed the outcome variable of the model addressing RQ #2, is <italic>Turn Transition</italic> (TT), a binary variable recording whether a TCU led to a speaker change and turn transition or not. In single-TCU questions, the coding as such was obvious (except for the few cases where the first response was by the non-selected third participant; cf. <xref ref-type="bibr" rid="ref43">Lerner, 2019</xref>). In complex question turns, TCU segmentation allowed us to identify the TCU that the speaker&#x2019;s response was a response to:</p>
<preformat>(7) [F01, Sequ 5]

01 	C:	[what] type of:: tours is it <bold>|  wh</bold>
02		is it [(like    a long)] ti:me&#x00BF; <bold>| pol</bold>
03		[ or  ] <bold>| frg</bold>
04 	A:	      [it's cruise ship]
05		[tours]</preformat>
<p>In extract (7), speaker A&#x2019;s response &#x201C;<italic>it&#x2019;s cruise ship tours&#x201D;</italic> specifically responds to speaker C&#x2019;s first question-TCU &#x201C;<italic>what type of: tours is it&#x201D;</italic> for two reasons: first, the response overlaps with key lexical elements of the second question-TCU <italic>is it (like a long) ti:me&#x00BF;,</italic> and it is therefore unlikely that speaker C can even hear this question-TCU, let alone process it. Second, the response &#x201C;<italic>it&#x2019;s cruise ship tours&#x201D;</italic> is both syntactically and semantically fitted to the <italic>wh</italic>-question &#x201C;<italic>what type of tours is it&#x201D;</italic> but not to the polar question <italic>is it like a long time&#x00BF;</italic>, which would require a <italic>yes/no</italic>-type answer. In QA sequences such as these, the variable Turn Transition (TT) was coded &#x201C;yes&#x201D; only for the responded-to question-TCU; the TCU(s) to which the response was not fitted were coded &#x201C;no.&#x201D; In cases where the response was fitted syntactically and semantically to more than one TCU, the question&#x2019;s <italic>last</italic> and fully audible question TCU was coded as the one leading to the turn transition.</p>
<p>In extract (8), for instance, the question turn is made up of a sequence of three question-TCUs (two declarative question-TCUs and one <italic>or</italic> question-TCU), all three syntactically aligned (i.e., answerable by <italic>yes/no</italic>), but only the last (<italic>or</italic>-)TCU is coded as the one leading to turn transfer:</p>
<preformat>(8) [F08, Sequ 167]

01 	A:	so&#x2009;it&#x00A0;'s&#x2009;like&#x2009;not&#x2009;really&#x2009;like&#x2009;Fra:nce <bold>|  decl</bold>
02		it&#x00A0;'s&#x2009;like&#x2009;a&#x2009;mix&#x2009;&#x003C;&#x00B0;of&#x2009;the&#x2009;two&#x00B0;&#x003E; <bold>|  decl</bold>
03		or&#x2009;is&#x2009;it&#x2009;like&#x00A0;!real!ly&#x2009;French <bold>|  or</bold>
04		[like&#x2009;a&#x2009;r-] <bold>|  frg</bold>
05 	B:	[no   it&#x2019;s ] it's I I guess it's a bit like (.) Alsace=</preformat>
<p>Two types of sequences were excluded from the analysis. Sequences such as (9), where a gap of more than 1&#x202F;s ensued between the (final) question-TCU and the answer, were omitted from further analysis, as a gap of this length is far beyond the &#x201C;regular&#x201D; gap of around 200&#x202F;ms, potentially indicating comprehension problems, a dispreferred answer, uncertainty as to who is selected as the next speaker, and so on. In extract (9), it appears that the gap of 1.19&#x202F;s is a harbinger of a disaligned answer (an answer, in this case, whose truth value is compromised due to it being <italic>individual</italic> and <italic>subjective</italic> only):</p>
<preformat>(9) [F12, Sequ 226]

01 	B:	but how is it for you&#x00BF; <bold>|  wh</bold>
02		do you feel like &#x003C;you: remember more than&#x003E; fifty percent of what you 03		learned in your bachelor 's degree? <bold>|  pol</bold>
04		or <bold>|  frg</bold>
05		like what what would you say&#x00BF; <bold>|  wh</bold>
06	<bold>&#x2014;&#x003E;</bold>	(1.190)
07 	A: 	&#x00B0;so&#x00B0; !ob!viously this is very like
08 	B:	like it 's
09 	A:	[individual  (.) ye:ah  exactly  so  it's  very]
10 	B:	[very sub!ject!ive (depending on &#x00B0;how it goes&#x00B0;)]</preformat>
<p>Sequences as in extract (10), where the answer is referenced to a TCU-extension, were removed from the data set given their lack of syntactic and semantic independence from the preceding question-TCU (indeed, they cannot &#x2018;survive&#x2019; without them):</p>
<preformat>(10) [F04, Sequ 50]

01	A:	u:h the guy (.) <bold>| frg</bold>
02		you remember Urick? <bold>| pol</bold>
03		and there was like a room directly 	across of me&#x00BF; <bold>| ext</bold>
04		that a guy moved out and his girlfriend?= <bold>| ext</bold>
05	B:	=&#x00B0;&#x00B0;yeah&#x00B0;&#x00B0;=</preformat>
<sec id="sec9000">
<label>2.5.2.2</label>
<title>Turn transition coding in stories</title>
<p>TCUs labeled <italic>int</italic> were coded as facilitating speaker change (Turn Transition&#x202F;=&#x202F;&#x201C;yes&#x201D;) regardless of their position in the story. By contrast, TCUs labeled <italic>narr</italic> and <italic>dr</italic> were both coded as avoiding speaker change (<italic>Turn Transition</italic>&#x202F;=&#x202F;&#x201C;no&#x201D;) only in pre-climax position; narrative TCUs at or around the story climax eliciting engaged recipient response, such as the one in line 15 in extract (9), were coded as inviting turn transition (<italic>Turn Transition</italic>&#x202F;=&#x202F;&#x201C;yes&#x201D;).</p>
<p>To ensure replicability, interrater-reliability (IRR) analyses were carried out both for TCU-segmentation and <italic>Turn Transition</italic> (TT) coding.</p>
</sec>
</sec>
</sec>
<sec id="sec14">
<label>2.5.3</label>
<title>Interrater-reliability analyses</title>
<sec id="sec15">
<label>2.5.3.1</label>
<title>Interrater-reliability for TCU segmentation</title>
<p>From the 457 QA sequences, 92 sequences (20%) were randomly sampled, and the IPU transcriptions available in FreMIC were TCU-segmented by a second rater. The 13 stories were each divided into three same-size intervals (c. 33%), and one interval was randomly sampled from each story. The IPU transcriptions available in FreMIC for those intervals were TCU-segmented by a second rater.</p>
<p>The agreement percentage for question-TCUs in which both raters segmented exactly the same words was 83.58%, and the percentage for storytelling-TCUs with the exact same segments and hence the same words was 71.68%. This lower agreement rate likely reflects the fact that the IPUs underlying the segmentation in stories tend to be markedly longer than the IPUs underlying questions, thus allowing more divergent codings. This greater length of IPUs also transpires from the greater length of storytelling TCUs: as shown in <xref ref-type="table" rid="tab1">Table 1</xref> (cf. Section 2.5.6), the mean number of words in story TCUs is 7.20 (median&#x202F;=&#x202F;6, SD&#x202F;=&#x202F;4.58) as opposed to 6.04 in questions (median&#x202F;=&#x202F;5, SD&#x202F;=&#x202F;3.45) and the mean duration is 1,851&#x202F;ms (median&#x202F;=&#x202F;1,440&#x202F;ms, SD&#x202F;=&#x202F;1,506) as opposed to 1,450&#x202F;ms in questions (median&#x202F;=&#x202F;1,218, SD&#x202F;=&#x202F;1,005).</p>
<table-wrap position="float" id="tab1">
<label>Table 1</label>
<caption>
<p>Descriptive statistics: Number of words (<italic>N_w</italic>) and durations of TCUs in original data (1,074 TCUs).</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top" rowspan="2">Type</th>
<th align="center" valign="top" colspan="4">N_w</th>
<th align="center" valign="top" colspan="3">Duration (ms)</th>
</tr>
<tr>
<th align="center" valign="top">Range</th>
<th align="center" valign="top">Mean</th>
<th align="center" valign="top">Median</th>
<th align="center" valign="top">SD</th>
<th align="center" valign="top">Median</th>
<th align="center" valign="top">Mean</th>
<th align="center" valign="top">SD</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">all</td>
<td align="center" valign="top">1&#x2013;39</td>
<td align="center" valign="top">6.53</td>
<td align="center" valign="top">6</td>
<td align="center" valign="top">4.01</td>
<td align="center" valign="top">1,300</td>
<td align="center" valign="top">1,621</td>
<td align="center" valign="top">1258.34</td>
</tr>
<tr>
<td align="left" valign="top">question</td>
<td align="center" valign="top">1&#x2013;33</td>
<td align="center" valign="top">6.04</td>
<td align="center" valign="top">5</td>
<td align="center" valign="top">3.45</td>
<td align="center" valign="top">1218.5</td>
<td align="center" valign="top">1450.43</td>
<td align="center" valign="top">1005.54</td>
</tr>
<tr>
<td align="left" valign="top">story</td>
<td align="center" valign="top">1&#x2013;39</td>
<td align="center" valign="top">7.20</td>
<td align="center" valign="top">6</td>
<td align="center" valign="top">4.58</td>
<td align="center" valign="top">1,440</td>
<td align="center" valign="top">1851.35</td>
<td align="center" valign="top">1506.74</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="sec16">
<label>2.5.3.2</label>
<title>Interrater-reliability for turn transition (TT)</title>
<p>In the questions subset, the IRR analysis for <italic>Turn Transition</italic> (TT) was carried out only on QA sequences with more than one question-TCU (coded <italic>wh</italic>, <italic>pol</italic>, <italic>decl</italic>, or <italic>or</italic>), as there is no choice as to which TCU is answered if there is just one. This subset consisted of 72 sequences; 24 of them (c. 33%) were rated by a second rater. In the storytellings subset, the narrative TCUs (coded <italic>narr</italic> or <italic>dr</italic>) as well as the interactive TCUs (coded <italic>int</italic>) were selected; a proportion of 33% of them were randomly sampled and coded for <italic>Turn Transition</italic> by a second rater.</p>
<p>The agreement percentage for <italic>Turn Transition</italic> coding in questions and storytellings taken together was 91.2%, yielding a Cohen&#x2019;s Kappa of 0.706 (<italic>p</italic>&#x202F;&#x003C;&#x202F;0.001), which indicates substantial interrater agreement (cf. <xref ref-type="bibr" rid="ref9002">Landis and Koch, 1977</xref>).</p>
</sec>
</sec>
<sec id="sec17">
<label>2.5.4</label>
<title>Statistical overview of the data</title>
<p>The analysis started out with a total of 1,074 TCUs. The descriptive statistics for this original data are shown in <xref ref-type="table" rid="tab1">Table 1</xref>.</p>
<p>The mean number of words in the TCUs was 6.5, their mean duration 1,621&#x202F;ms; for comparison, TCU mean length in <xref ref-type="bibr" rid="ref31">H&#x00F6;mke et al. (2017)</xref> was 1,754&#x202F;ms.</p>
<p>To address RQ #1 &#x2014; <italic>Do word frequencies in the TCUs follow an S-shaped pattern?</italic>&#x2014;TCUs with fewer than three words were excluded as no development of frequencies can be read-off of them; the number of thus-excluded TCUs was 100 (or 9.31% of the total 1,074 TCUs), leaving model #1 with 974 TCUs produced by 29 distinct participants. (For RQ #1, the distinction between question- and story-TCU was not relevant.)</p>
<p>Addressing RQ #2&#x2014;<italic>Which frequency-related measures predict that a TCU will be followed by turn transition or continuation?</italic>&#x2014;the data set was further reduced. Given the focus of RQ #2 on the (potential) effect of frequency-related measures on <italic>Turn Transition</italic>, question-TCUs that did not result in turn transition were excluded, thus keeping only question-TCUs coded &#x201C;yes&#x201D; on <italic>Turn Transition</italic> (TT), as were story-TCUs that did result in turn transition, thus keeping only story-TCUs coded &#x201C;no&#x201D; on <italic>Turn Transition</italic>. As noted, this decision intimately ties the results of the predictive modeling undertaken to address RQ #2 to the social-action type: whatever significant effects we may observe cannot be taken as features of turn transition in itself, independent of the type of social action in which it occurred, but will discriminate frequency-related features of turn transition in (i) transition-ready question TCUs and (ii) transition-resistant story TCUs.</p>
<p>After all reductions were made, model #2 was based on 876 TCUs. Of these, 457 were question-TCUs asked by 29 distinct participants and 419 story-TCUs occurring in 18 stories told by 13 participants (who were a subgroup of the 29 questioners). The participants&#x2019; demographic details are given in <xref ref-type="table" rid="tab2">Table 2</xref>.</p>
<table-wrap position="float" id="tab2">
<label>Table 2</label>
<caption>
<p>Participants&#x2019; gender, age, and L1 (first language).</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Gender</th>
<th align="left" valign="top">Age</th>
<th align="left" valign="top">L1</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Male: 13</td>
<td align="left" valign="top">Range: 20&#x2013;49</td>
<td align="left" valign="top">English only: 20</td>
</tr>
<tr>
<td align="left" valign="top">Female: 13</td>
<td align="left" valign="top">Mean: 26.5</td>
<td align="left" valign="top">English + other: 6</td>
</tr>
<tr>
<td align="left" valign="top">cis-Fe/Male: 2</td>
<td align="left" valign="top">Median: 26</td>
<td align="left" valign="top">not English: 2</td>
</tr>
<tr>
<td align="left" valign="top"><italic>NA</italic>: 1</td>
<td align="left" valign="top">SD: 6.42</td>
<td align="left" valign="top"><italic>NA</italic>: 1</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="sec18">
<label>2.5.5</label>
<title>Computation of word frequencies</title>
<p>As noted, FreMIC&#x2019;s total word token count is 375,637. A frequency list was computed for the whole corpus, based on <italic>c7</italic> word-tag combinations, giving the absolute word token frequencies for any <italic>c7</italic> word-tag combination. Frequencies were normalized per 1,000 words and log-transformed (to the base of 2). The top 10 most frequent <italic>c7</italic> word-tag combinations in FreMIC are shown in <xref ref-type="table" rid="tab3">Table 3</xref>: as is to be expected from a conversational corpus, personal pronouns as well as interjections such as <italic>yeah_UH</italic> are ranked highly, whereas noun-related items such as <italic>the_AT</italic> and a<italic>_AT1</italic> are less highly-ranked than in general or written corpora (e.g., <xref ref-type="bibr" rid="ref5">Biber et al., 1999</xref>; <xref ref-type="bibr" rid="ref77">Stubbs, 2001</xref>; <xref ref-type="bibr" rid="ref57">R&#x00FC;hlemann, 2007</xref>):</p>
<table-wrap position="float" id="tab3">
<label>Table 3</label>
<caption>
<p>Top 10 most highly-ranked c7 word-tag combinations in FreMIC.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">w_c7</th>
<th align="center" valign="top">freq</th>
<th align="center" valign="top">f_norm</th>
<th align="center" valign="top">rank</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">I_PPIS1</td>
<td align="center" valign="top">16,448</td>
<td align="center" valign="middle">43.7869539</td>
<td align="center" valign="top">1</td>
</tr>
<tr>
<td align="left" valign="top">it_PPH1</td>
<td align="center" valign="top">10,440</td>
<td align="center" valign="middle">27.8460322</td>
<td align="center" valign="top">2</td>
</tr>
<tr>
<td align="left" valign="top">yeah_UH</td>
<td align="center" valign="top">10,270</td>
<td align="center" valign="middle">27.4413862</td>
<td align="center" valign="top">3</td>
</tr>
<tr>
<td align="left" valign="top">and_CC</td>
<td align="center" valign="top">10,094</td>
<td align="center" valign="middle">26.9009709</td>
<td align="center" valign="top">4</td>
</tr>
<tr>
<td align="left" valign="top">the_AT</td>
<td align="center" valign="top">9,660</td>
<td align="center" valign="middle">25.8228023</td>
<td align="center" valign="top">5</td>
</tr>
<tr>
<td align="left" valign="top">you_PPY</td>
<td align="center" valign="top">8,583</td>
<td align="center" valign="middle">22.9290512</td>
<td align="center" valign="top">6</td>
</tr>
<tr>
<td align="left" valign="top">&#x2018;s_VBZ</td>
<td align="center" valign="top">8,315</td>
<td align="center" valign="middle">22.2368936</td>
<td align="center" valign="top">7</td>
</tr>
<tr>
<td align="left" valign="top">like_II</td>
<td align="center" valign="top">6,945</td>
<td align="center" valign="middle">18.5418369</td>
<td align="center" valign="top">8</td>
</tr>
<tr>
<td align="left" valign="top">a_AT1</td>
<td align="center" valign="top">6,602</td>
<td align="center" valign="middle">17.6100863</td>
<td align="center" valign="top">9</td>
</tr>
<tr>
<td align="left" valign="top">was_VBDZ</td>
<td align="center" valign="top">4,781</td>
<td align="center" valign="middle">12.7782939</td>
<td align="center" valign="top">10</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Assigning the corpus frequencies to the words in the TCUs presented a challenge because, as noted, in FreMIC, the underlying unit of observation is the IPU, and the <italic>c7</italic> word-tag &#x2018;transcriptions&#x2019; available in FreMIC are for IPUs as well. Large numbers, however, of the TCUs obtained from manual segmentation in ELAN did not map onto these IPUs either because a TCU was just one part of an IPU or a TCU spanned two or more IPUs. Mapping <italic>c7</italic> word-tags and their frequencies to the words in the TCUs, therefore, required additional work.</p>
<p>To illustrate, the utterance <italic>so wait (wha-) [when was this]</italic> in excerpt (11.a) represented one uninterrupted IPU in FreMIC. It is associated with the string of <italic>c7</italic> word-tags shown in (11.b). During the TCU-segmentation process, the IPU was broken up into three segments, as shown in (11.c).</p>
<preformat>(11.a)
so wait (wha-) [when was this]

(11.b)
so_RR wait_VV0 wha-_UNC when_RRQ was_VBDZ this_DD1

(11.c) [F36, Sequ 574]

so wait <bold>| nonQ</bold>
(wha-)  <bold>| frg</bold>
[when was this] <bold>| wh</bold></preformat>
<p>To map the c7 word-tags to each TCU segment, the c7 word-tag strings in (11.b) had to be separated into the exact same segments using a multi-step coding procedure in R so that the c7 word-tag segments could be matched to their corresponding TCU segments, as shown in (11.d):<xref ref-type="fn" rid="fn0006"><sup>6</sup></xref></p>
<preformat>(11.d):

so wait <bold>| nonQ</bold> so_RR wait_VV0
(wha-)  <bold>| frg</bold>  wha-_UNC
[when was this] <bold>| wh</bold>  when_RRQ was_VBDZ this_DD1</preformat>
<p>The next pre-processing step was to assign to each <italic>c7</italic> word-tag in the TCU segments their total corpus frequencies.</p>
</sec>
<sec id="sec19">
<label>2.5.6</label>
<title>Computation of frequency-related measures</title>
<p>While there is some agreement that conversationalists constantly monitor relative word frequencies during conversation (<xref ref-type="bibr" rid="ref70">Shapiro, 1969</xref>; <xref ref-type="bibr" rid="ref24">Hasher and Chromiak, 1977</xref>; <xref ref-type="bibr" rid="ref25">Hasher and Zacks, 1984</xref>), the question of <italic>how</italic> they do it is largely an open question.</p>
<p>It is, for example, unclear whether conversationalists monitor frequencies relative to the turn-so-far (i.e., the Saussurian <italic>parole</italic>) or the language as such (i.e., the Saussurian <italic>langue</italic>). If word frequencies are monitored relative to <italic>langue</italic>, the relative word frequencies are &#x2018;simply&#x2019; retrieved from the mental lexicon in which they are stored (e.g., <xref ref-type="bibr" rid="ref33">Jaeger, 2010</xref>; <xref ref-type="bibr" rid="ref69">Seyfarth, 2014</xref>), to the extent that the corpus can be seen as a microcosm reflecting the macrocosm of <italic>la langue</italic>,<xref ref-type="fn" rid="fn0007"><sup>7</sup></xref> this would suggest that speakers make use directly of corpus frequency values <italic>independently of one another</italic>. Consider, for example, the question-turn <italic>What&#x2019;s a mountain for you?.</italic> As shown in <xref ref-type="table" rid="tab4">Table 4</xref>, the lowest normalized frequency is for the noun <italic>mountain</italic>, a rather rare noun (and, in English, rarity is highly correlated with nouns; cf. <xref ref-type="bibr" rid="ref62">R&#x00FC;hlemann and Barthel, 2024</xref>), whereas the highest frequencies are for the shortened form of the verb <italic>is</italic> and the pronoun <italic>you.</italic><xref ref-type="fn" rid="fn0008"><sup>8</sup></xref></p>
<table-wrap position="float" id="tab4">
<label>Table 4</label>
<caption>
<p>Log-transformed normalized rank and frequency values for <italic>What&#x2019;s a mountain for you?</italic> [F01, Sequ 9].</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Word token</th>
<th align="left" valign="top"><italic>c7</italic> word-tag</th>
<th align="center" valign="top">f_norm</th>
<th align="center" valign="top">f_norm_log</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">what</td>
<td align="left" valign="top">what_DDQ</td>
<td align="center" valign="top">5.5586</td>
<td align="center" valign="top">1.7153</td>
</tr>
<tr>
<td align="left" valign="top">&#x2018;s</td>
<td align="left" valign="top">&#x2018;s_VBZ</td>
<td align="center" valign="top">22.2369</td>
<td align="center" valign="top">3.1018</td>
</tr>
<tr>
<td align="left" valign="top">a</td>
<td align="left" valign="top">a_AT1</td>
<td align="center" valign="top">17.6101</td>
<td align="center" valign="top">2.8685</td>
</tr>
<tr>
<td align="left" valign="top">mountain</td>
<td align="left" valign="top">mountain_NN1</td>
<td align="center" valign="top">0.0426</td>
<td align="center" valign="top">&#x2212;3.156</td>
</tr>
<tr>
<td align="left" valign="top">for</td>
<td align="left" valign="top">for_IF</td>
<td align="center" valign="top">5.3243</td>
<td align="center" valign="top">1.6723</td>
</tr>
<tr>
<td align="left" valign="top">you</td>
<td align="left" valign="top">you_PPY</td>
<td align="center" valign="top">22.9291</td>
<td align="center" valign="top">3.1324</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>If, by contrast, frequencies are monitored with reference to <italic>parole</italic>, that is, to their immediate context of use, the frequencies are still retrieved from the mental lexicon but are additionally put in relation to one another.</p>
<p>An established method to capture speakers&#x2019; monitoring of relative frequencies in turns/TCUs is <italic>surprisal</italic> (e.g., <xref ref-type="bibr" rid="ref53">Piantadosi et al., 2011</xref>; <xref ref-type="bibr" rid="ref69">Seyfarth, 2014</xref>). Surprisal may be part of the resources listeners deploy to predict the TCU&#x2019;s lexico-syntactic path so as to be able to anticipate the TCU end and speed up their response (cf. <xref ref-type="bibr" rid="ref48">Magyari et al., 2014</xref>: 2537; cf. also <xref ref-type="bibr" rid="ref13">De Ruiter et al., 2006</xref>). To measure <italic>surprisal</italic>, the Conditional Probability of each word is calculated given the word or words preceding it; that probability then is converted to <italic>surprisal</italic> by taking the negative log of each probability.</p>
<p>We calculated <italic>surprisal</italic> based on bigrams, establishing how unexpected word B is given word A, C given B, D given C, and so forth. This method and the related unigram and trigram-based methods have some currency in linguistic research (e.g., <xref ref-type="bibr" rid="ref38">Klafka and Yurovsky, 2021</xref>; <xref ref-type="bibr" rid="ref63">R&#x00FC;hlemann and Gries, 2020</xref>; <xref ref-type="bibr" rid="ref80">Trujillo and Holler, 2025</xref>); it implies that upon listening to a current speaker, conversationalists experience an increment to a turn-so-far (i.e., the next word) as more or less surprising based on a comparison of that increment&#x2019;s frequency with the frequency of its combination with the immediately prior word(s).<xref ref-type="fn" rid="fn0009"><sup>9</sup></xref></p>
<p>To illustrate, as shown in <xref ref-type="table" rid="tab5">Table 5</xref>, in the question <italic>What&#x2019;s a mountain for you?</italic>, it is to be expected that <italic>surprisal</italic> is highest on the word <italic>mountain</italic>, given that the indefinite article preceding it is highly common, whereas the noun is rare.</p>
<table-wrap position="float" id="tab5">
<label>Table 5</label>
<caption>
<p>Bigrams, Surprisal, Cumulative ngram, (log-transformed) Cumulative Ngram Frequency (CNF) for <italic>What&#x2019;s a mountain for you?</italic> [F01, Sequ 9].</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Bigram</th>
<th align="center" valign="top">Surprisal</th>
<th align="left" valign="top">Cumulative ngram</th>
<th align="center" valign="top">Cumulative Ngram Frequency (CNF; log-transformed)</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">what_DDQ</td>
<td align="center" valign="top">7.4911</td>
<td align="left" valign="top">what_DDQ</td>
<td align="center" valign="top">7.643962</td>
</tr>
<tr>
<td align="left" valign="top">what_DDQ &#x2018;s_VBZ</td>
<td align="center" valign="top">3.2205</td>
<td align="left" valign="top">what_DDQ &#x2018;s_VBZ</td>
<td align="center" valign="top">5.411646</td>
</tr>
<tr>
<td align="left" valign="top">&#x2019;s_VBZ a_AT1</td>
<td align="center" valign="top">3.6639</td>
<td align="left" valign="top">what_DDQ &#x2018;s_VBZ a_AT1</td>
<td align="center" valign="top">2.484907</td>
</tr>
<tr>
<td align="left" valign="top">a_AT1 mountain_NN1</td>
<td align="center" valign="top">10.106</td>
<td align="left" valign="top">what_DDQ &#x2018;s_VBZ a_AT1 mountain_NN1</td>
<td align="center" valign="top">0.000000</td>
</tr>
<tr>
<td align="left" valign="top">mountain_NN1 for_IF</td>
<td align="center" valign="top">4.0000</td>
<td align="left" valign="top">what_DDQ &#x2018;s_VBZ a_AT1 mountain_NN1 for_IF</td>
<td align="center" valign="top">0.000000</td>
</tr>
<tr>
<td align="left" valign="top">for_IF you_PPY</td>
<td align="center" valign="top">4.5734</td>
<td align="left" valign="top">what_DDQ &#x2018;s_VBZ a_AT1 mountain_NN1 for_IF you_PPY</td>
<td align="center" valign="top">0.000000</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Another frequency-based measure used here is the <italic>number of once-attested ngrams</italic> per TCU (<italic>N_0_CNF</italic>). This novel measure is based on the following rationale.</p>
<p>As noted, listeners seek to predict the TCU&#x2019;s lexico-syntactic path in order to anticipate how and when the TCU is going to end (cf. <xref ref-type="bibr" rid="ref48">Magyari et al., 2014</xref>: 2537; cf. also <xref ref-type="bibr" rid="ref13">De Ruiter et al., 2006</xref>). While, clearly, successful anticipation and hence response speed may depend on a number of factors, such as syntactic affordances (<xref ref-type="bibr" rid="ref9003">Barthel and Sauppe, 2019</xref>) and early or late placement of key information (<xref ref-type="bibr" rid="ref6">B&#x00F6;gels et al., 2015</xref>), a likely additional factor is the extent to which an unfolding utterance aligns with pre-established phraseological usage that members of a language community have accumulated and stored through their experience as language users (<xref ref-type="bibr" rid="ref14">DeLong et al., 2005</xref>: <xref ref-type="bibr" rid="ref28">Hoey, 2005</xref>). Based on this resource, they will more easily predict the trajectory of common word combinations than that of unusual or even novel combinations they have never experienced before (e.g., <xref ref-type="bibr" rid="ref12">Corps et al., 2018</xref>; <xref ref-type="bibr" rid="ref48">Magyari et al., 2014</xref>, p. 2537).</p>
<p>The variable recording the number of only once-attested ngrams per TCU, <italic>N_0_CNF,</italic> aims to capture the moment when the TCU-so-far has left behind the &#x2018;trodden paths&#x2019; of everyday usage and presents the listener with a sequence of words that is, beyond this one occurrence, not yet attested&#x2014; at least not in the corpus. We refer to this moment as the 0-point (as the logarithm of 1 is 0). To the extent that a corpus can be seen as a microcosm reflecting the macrocosm of a language (cf. Section 5), that 0-point would demarcate the entry point into uncharted phraseological territory: a stringing together of words that has no precedent in a language user&#x2019;s experience. Listeners, lacking that experience, have no blueprint to rely on, and predicting the TCU&#x2019;s lexico-syntactic path from that point onwards likely becomes a challenging task.</p>
<p>To illustrate, consider <xref ref-type="table" rid="tab5">Table 5</xref>, which, for the example question <italic>What&#x2019;s a mountain for you?</italic> gives the number of only once-attested ngrams, <italic>N_0_CNF,</italic> and Cumulative Ngram Frequencies (CNF) representing the total log-transformed frequencies of each ngram (1-gram, 2-gram, 3-gram, 4-gram, etc.) in the TCU. The log-transformed CNF values for <italic>What&#x2019;s a mountain for you?</italic> already on <italic>mountain</italic> hit the floor, that is, the minimum value 0, indicating that the ngram token <italic>what_DDQ &#x2018;s_VBZ a_AT1 mountain_NN1</italic> occurs just once in the corpus. Inevitably, the subsequent 4-gram <italic>what_DDQ &#x2018;s_VBZ a_AT1 mountain_NN1 for_IF</italic> and the 5-gram <italic>what_DDQ &#x2018;s_VBZ a_AT1 mountain_NN1 for_IF you_PPY</italic> also occur just once in the corpus. Thus, the total number of only once-attested ngrams for which there is no prior attestation in the listener&#x2019;s language experience, in this example, is 3.</p>
<p>As shown in <xref ref-type="fig" rid="fig2">Figure 2</xref>, in the 856 TCUs on which model #2 is based, the first ngram in each TCU that is attested only once (and, hence, has CNF_log&#x202F;=&#x202F;0) occurs early on: the average word position of once-attested ngrams is 3.62. Note, however, that this average reflects the 733 TCUs (out of 856) in which the 0-point <italic>is</italic> reached; in 123 TCUs, all ngrams are attested more than once and the 0-point is never reached.</p>
<fig position="float" id="fig2">
<label>Figure 2</label>
<caption>
<p>Quintic slope of word frequencies in TCUs (three-word minimum length) in the question and storytelling subsets; <italic>position_rel</italic>: relative positions of words in the TCU (0&#x2013;1); <italic>F_norm_log</italic>: log-transformed normalized frequencies.</p>
</caption>
<graphic xlink:href="fpsyg-16-1610179-g002.tif" mimetype="image" mime-subtype="tiff">
<alt-text content-type="machine-generated">Line graph titled "Slope of word frequencies in TCUs" showing a downward trend. The x-axis represents "position_rel" from 0.00 to 1.00, and the y-axis is "F_norm_log" ranging from negative one to two. A red line with shading denotes this decreasing trend.</alt-text>
</graphic>
</fig>
<p>The measure for the number of only once-attested ngrams, <italic>N_0_CNF,</italic> is exploratory in character, and we feel justified to use it in the analyses, considering that, essentially, how conversationalists use word frequencies in conversation and what role frequencies play, if any, in turn transition is still largely <italic>terra incognita</italic>.</p>
</sec>
<sec id="sec20">
<label>2.5.7</label>
<title>Statistical analysis</title>
<p>RQ #1&#x2014;<italic>Do word frequencies in TCUs follow an S-shaped pattern?</italic>&#x2014;was addressed using a mixed-effects model. To handle the variance in lengths of the TCUs (as measured in terms of number of words), a relative positional measure <italic>position_rel was</italic> computed for each TCU, assigning as many equi-distanced values between 0 and 1 as there were words in the TCU (e.g., the relative positions of the five words in a 5-word TCU are 0, 0.25, 0.5, 0.75, and 1). The fixed effects in the model were the log-transformed normalized frequencies <italic>(F_norm_log)</italic> (as the dependent variable) and <italic>position_rel</italic> (the independent variable); file/participant was modeled as a nested random factor. To account for (the expected) non-linear effects of relative position within the TCU (<italic>position_rel</italic>), we modeled this predictor using orthogonal polynomial terms. Models including polynomial terms of increasing order (from 1st to 6th) were fit successively. Model comparisons were conducted using AIC, BIC, and likelihood ratio tests to determine the appropriate degree of polynomial to retain. We restricted the analysis to TCUs with at least three words. This ensured that the trajectory of word frequencies could, in principle, display the hypothesized three-step pattern. Model comparisons (AIC/BIC) further indicated improved fit when two-word TCUs were excluded.</p>
<p>To address RQ #2&#x2014;<italic>Which frequency-related measures predict that a TCU will be followed by turn transition or continuation?</italic>&#x2014;a generalized mixed-effects logistic regression model was fitted to the data, with <italic>Turn Transition</italic> (TT) as the binary outcome variable. The predictor variables were:</p>
<list list-type="simple">
<list-item>
<p>- <italic>S_DiffSecndFirstHalf</italic>: The difference of the mean <italic>surprisal</italic> in the second half of the TCU minus the mean of <italic>surprisal</italic> in the first half. This conceptualization of <italic>surprisal</italic> is based on <xref ref-type="bibr" rid="ref79">Trujillo and Holler&#x2019;s (2024)</xref> finding that, in English conversation, <italic>surprisal</italic> in a turn&#x2019;s second half is greater than in the first half.</p>
</list-item>
<list-item>
<p>- <italic>F_DropLastThird</italic>: The difference of the largest word frequency in the first two-thirds of a TCU minus the smallest word frequency in the last third of the TCU. This conceptualization of word frequency builds directly on the assumption that the drop at turn/TCU endings might be used as a turn completion cue.</p>
</list-item>
<list-item>
<p>- <italic>N_0_CNF</italic>: The number of once-attested ngrams in the TCU. As noted, the assumption here is that the listener&#x2019;s task of predicting the trajectory and, finally, the end point of the TCU is becoming challenging once the speaker&#x2019;s talk arrives at, and extends beyond, the first 0-point (the first only once-attested ngram). How that challenge impacts the anticipation of turn completion is yet an open question.</p>
</list-item>
</list>
<p>The random variable was <italic>FileSpeakerID</italic>, a combination of participant and recording ID.</p>
<p>In the remainder of this article, we will describe, in Section 3, the results of our enquiries into our two research questions, and then, in Section 4, discuss these results, before we conclude the study in Section 5.</p>
</sec>
</sec>
</sec>
<sec sec-type="results" id="sec21">
<label>3</label>
<title>Results</title>
<sec id="sec22">
<label>3.1</label>
<title>RQ#1 - do word frequencies in TCUs follow an S-shaped pattern?</title>
<p>Our mixed-effects model predicts log-transformed normalized word frequency (<italic>F_norm_log</italic>) based on a fifth-degree polynomial of relative position in the turn (<italic>position_rel</italic>), with random intercepts for individuals (<italic>Person_anon</italic>) nested within files (<italic>File</italic>). Model comparison using AIC/BIC and likelihood ratio tests indicated that including up to the fifth-order polynomial significantly improved model fit over lower-order models, while including the sixth-order polynomial did not. The model confirms that word frequency follows a complex non-linear pattern across turn positions, which seems to align with the S-shaped effect reported in prior research.</p>
<p>The model summary is given in <xref ref-type="table" rid="tab6">Table 6</xref>.</p>
<table-wrap position="float" id="tab6">
<label>Table 6</label>
<caption>
<p>Model summary for Model RQ#1; Formula: <italic>F_norm_log ~ poly(position_rel, 5)&#x202F;+&#x202F;(1 | File/Person_anon)</italic>.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top" colspan="4">Random effects</th>
</tr>
<tr>
<th align="left" valign="top">Groups</th>
<th align="center" valign="top">Name</th>
<th align="center" valign="top">Variance</th>
<th align="center" valign="top">Std. Dev.</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Person_anon: File</td>
<td align="center" valign="top">(Intercept)</td>
<td align="center" valign="top">0.01351</td>
<td align="center" valign="top">0.1162</td>
</tr>
<tr>
<td align="left" valign="top">File</td>
<td align="center" valign="top">(Intercept)</td>
<td align="center" valign="top">0.01153</td>
<td align="center" valign="top">0.1074</td>
</tr>
<tr>
<td align="left" valign="top">Residual</td>
<td/>
<td align="center" valign="top">5.11856</td>
<td align="center" valign="top">2.2624</td>
</tr>
<tr>
<td align="left" valign="top" colspan="4">Number of obs: 6824, groups: Person_anon: File, 44; File, 16</td>
</tr>
</tbody>
</table>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top" colspan="6">Fixed effects</th>
</tr>
<tr>
<th/>
<th align="center" valign="top">
<italic>&#x03B2;</italic>
</th>
<th align="center" valign="top">Std. Error</th>
<th align="center" valign="top">df t value</th>
<th align="center" valign="top">Pr(&#x003E;|t|)</th>
<th align="center" valign="top"><italic>p</italic>-value</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">(Intercept)</td>
<td align="center" valign="top">0.53032</td>
<td align="center" valign="top">0.04499</td>
<td align="center" valign="top">13.479701</td>
<td align="center" valign="top">1.789</td>
<td align="center" valign="top">1.76e-08 &#x002A;&#x002A;&#x002A;</td>
</tr>
<tr>
<td align="left" valign="top">position_rel<sup>1</sup></td>
<td align="center" valign="top">&#x2212;74.52082</td>
<td align="center" valign="top">2.27249</td>
<td align="center" valign="top">6789.94250</td>
<td align="center" valign="top">&#x2212;32.793</td>
<td align="center" valign="top">&#x003C; 2e-16 &#x002A;&#x002A;&#x002A;</td>
</tr>
<tr>
<td align="left" valign="top">position_rel<sup>2</sup></td>
<td align="center" valign="top">&#x2212;7.12678</td>
<td align="center" valign="top">2.27354</td>
<td align="center" valign="top">6806.90308</td>
<td align="center" valign="top">&#x2212;3.135</td>
<td align="center" valign="top">0.00173 &#x002A;&#x002A;</td>
</tr>
<tr>
<td align="left" valign="top">position_rel<sup>3</sup></td>
<td align="center" valign="top">&#x2212;14.39053</td>
<td align="center" valign="top">2.26841</td>
<td align="center" valign="top">6789.61424</td>
<td align="center" valign="top">&#x2212;6.344</td>
<td align="center" valign="top">2.38e-10 &#x002A;&#x002A;&#x002A;</td>
</tr>
<tr>
<td align="left" valign="top">position_rel<sup>4</sup></td>
<td align="center" valign="top">&#x2212;10.83934</td>
<td align="center" valign="top">2.26781</td>
<td align="center" valign="top">6816.12194</td>
<td align="center" valign="top">&#x2212;4.780</td>
<td align="center" valign="top">1.79e-06 &#x002A;&#x002A;&#x002A;</td>
</tr>
<tr>
<td align="left" valign="top">position_rel<sup>5</sup></td>
<td align="center" valign="top">&#x2212;6.59510</td>
<td align="center" valign="top">2.26469</td>
<td align="center" valign="top">6789.67182</td>
<td align="center" valign="top">&#x2212;2.912</td>
<td align="center" valign="top">0.00360 &#x002A;&#x002A;</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The Random Effects suggest that there is some variability in word frequency across different individuals within files (Variance&#x202F;=&#x202F;0.01351, SD&#x202F;=&#x202F;0.1162) and that differences in files contribute to variability in word frequency (Variance&#x202F;=&#x202F;0.01153, SD&#x202F;=&#x202F;0.1074); the largest source of variation is residual (unexplained) variation, suggesting that factors other than position in the TCU may also influence word frequency (5.15657, SD&#x202F;=&#x202F;2.2708).</p>
<p>Regarding the Fixed Effects, all polynomial terms up to the fifth order were statistically significant, providing strong evidence that the relationship between relative word position and normalized word frequency is highly non-linear. Although the large negative coefficient for the first-degree term reflects a strong overall downward trend from the beginning to the end of the TCU, the additional higher-order terms (quadratic through quintic) reveal systematic departures from this monotonic decline. Since the model employs orthogonal polynomials, the individual coefficients are not directly interpretable in terms of slope or curvature. Instead, their joint significance demonstrates that the trajectory of word frequency across positions contains multiple inflection points. To judge by the curve depicted in <xref ref-type="fig" rid="fig2">Figure 2</xref>, these inflection points are largely consistent with an S-shaped distribution reported in previous research, which could indicate an initial drop, a plateau, and then a sharp final drop.</p>
<fig position="float" id="fig3">
<label>Figure 3</label>
<caption>
<p>Cumulative Ngram Frequency (<italic>CNF_log</italic>) in the data used for model #2 (addressing RQ #2); dotted line: mean word position of once-attested ngram in TCU (mean&#x202F;=&#x202F;3.62).</p>
</caption>
<graphic xlink:href="fpsyg-16-1610179-g003.tif" mimetype="image" mime-subtype="tiff">
<alt-text content-type="machine-generated">Graph showing cumulative ngram frequency (log-transformed) versus the number of words in TCU, clipped at 20. Dense blue lines hover around zero after three words, with a vertical dashed line at N_w at 3.6, indicating the mean.</alt-text>
</graphic>
</fig>
</sec>
<sec id="sec23">
<label>3.2</label>
<title>RQ #2 - which frequency-related measures predict that a TCU will be followed by a turn transition or continuation?</title>
<p>The logistic fixed-effects model, model #2, to address RQ #2 builds on the back of the results of the model to address RQ #1. While model #1 confirms the S-shape pattern for TCUs, including specifically the steep drop at TCU ends, model #2 takes as its starting point that steep drop in frequency and operationalizes it as <italic>F_DropLastThird</italic> as one predictor beside the difference of the mean <italic>surprisal</italic> in the second half of the TCU minus the mean of <italic>surprisal</italic> in the first half, <italic>S_DiffSecndFirstHalf,</italic> and the number of only once-attested ngrams, <italic>N_0_CNF</italic>.</p>
<p>The model included <italic>FileSpeakerID</italic> as a random intercept to account for variability across speakers and files. However, the estimated variance for this effect was notably large (70.09), suggesting it might not be essential for explaining variation in turn transitions. To assess whether <italic>FileSpeakerID</italic> significantly improved model fit, we compared the revised model with a reduced model excluding this random effect using a likelihood ratio test (LRT). The model comparison revealed that removing <italic>FileSpeakerID</italic> resulted in a significantly poorer fit (&#x03C7;<sup>2</sup>&#x202F;=&#x202F;601.26, df&#x202F;=&#x202F;1, <italic>p</italic>&#x202F;&#x003C;&#x202F;0.001), justifying its inclusion in the model.</p>
<p>The summary of the model is given in <xref ref-type="table" rid="tab7">Table 7</xref>; the reference level for turn transition (<italic>TT</italic>) is TT&#x202F;=&#x202F;&#x201C;yes&#x201D;:</p>
<table-wrap position="float" id="tab7">
<label>Table 7</label>
<caption>
<p>Model summary RQ#2; <italic>TT ~ S_DiffSecndFirstHalf + N_0_CNF + F_DropLastThird + (1 | FileSpeakerID).</italic></p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top" colspan="4">Random effects</th>
</tr>
<tr>
<th align="left" valign="top">Groups</th>
<th align="center" valign="top">Name</th>
<th align="center" valign="top">Variance</th>
<th align="center" valign="top">Std. Dev.</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">FileSpeakerID</td>
<td align="center" valign="top">(Intercept)</td>
<td align="center" valign="top">70.09</td>
<td align="center" valign="top">8.372</td>
</tr>
<tr>
<td align="left" valign="top" colspan="4">Number of obs: 856, groups: FileSpeakerID, 44</td>
</tr>
</tbody>
</table>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top" colspan="5">Fixed effects</th>
</tr>
<tr>
<th/>
<th align="center" valign="top">
<italic>&#x03B2;</italic>
</th>
<th align="center" valign="top">Std. Error</th>
<th align="center" valign="top"><italic>z</italic> value</th>
<th align="center" valign="top">Pr(&#x003E;|z|)</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">(Intercept)</td>
<td align="center" valign="top">&#x2212;9.483750</td>
<td align="center" valign="top">1.604535</td>
<td align="center" valign="top">&#x2212;5.911</td>
<td align="center" valign="top">3.41e-09 &#x002A;&#x002A;&#x002A;</td>
</tr>
<tr>
<td align="left" valign="top">S_DiffSecndFirstHalf</td>
<td align="center" valign="top">0.008203</td>
<td align="center" valign="top">0.046911</td>
<td align="center" valign="top">0.175</td>
<td align="center" valign="top">0.861</td>
</tr>
<tr>
<td align="left" valign="top">N_0_CNF</td>
<td align="center" valign="top">0.024144</td>
<td align="center" valign="top">0.035064</td>
<td align="center" valign="top">0.689</td>
<td align="center" valign="top">0.491</td>
</tr>
<tr>
<td align="left" valign="top">F_DropLastThird</td>
<td align="center" valign="top">0.063434</td>
<td align="center" valign="top">0.012198</td>
<td align="center" valign="top">5.200</td>
<td align="center" valign="top">1.99e-07 &#x002A;&#x002A;&#x002A;</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Among the three predictors, the difference of the mean <italic>surprisal</italic> in the second half of the TCU minus the mean of <italic>surprisal</italic> in the first half, <italic>S_DiffSecndFirstHalf,</italic> (<italic>&#x03B2;</italic>&#x202F;=&#x202F;0.008203, <italic>p</italic>&#x202F;&#x003E;&#x202F;0.5), and the number of only once-attested ngrams in the TCU, <italic>N_0_CNF,</italic> (&#x03B2;&#x202F;=&#x202F;0.024144, p&#x202F;&#x003E;&#x202F;0.5) do not have a significant effect. The only significant predictor of <italic>Turn Transition</italic> (TT) is <italic>F_DropLastThird</italic> (&#x03B2;&#x202F;=&#x202F;0.063434, p&#x202F;&#x003C;&#x202F;0.001). Its effect is positive, that is, increases in the frequency drop in the last third of the TCU are associated with increases in the log-odds that turn transition (in questions as opposed to stories) will occur.</p>
</sec>
</sec>
<sec sec-type="discussion" id="sec24">
<label>4</label>
<title>Discussion</title>
<p>In this article, we explored the possibility that frequency and frequency-related measures serve as resources for the listener to (advance-)project (imminent) turn completion. We approached this possibility from two angles relating to two research questions.</p>
<p>Our first research question&#x2014;<italic>Do word frequencies in TCUs follow an S-shaped distribution?</italic>&#x2014;was answered in the positive: on analyzing the log-transformed normalized word frequencies in TCUs, we found an S-shaped distribution, exhibiting a drop in initial position(s), a more level stretch in mid-TCU position(s), and a sharp drop in final position(s). For illustration, consider <xref ref-type="fig" rid="fig4">Figure 4</xref>, showing the trajectories of word frequencies of two questions:</p>
<fig position="float" id="fig4">
<label>Figure 4</label>
<caption>
<p>Examples of question TCUs with S-shaped word frequencies; <italic>f_norm</italic>: word frequencies in FreMIC normalized by 1,000.</p>
</caption>
<graphic xlink:href="fpsyg-16-1610179-g004.tif" mimetype="image" mime-subtype="tiff">
<alt-text content-type="machine-generated">Two line graphs showing word frequency labeled as "f_norm" against word positions "w1" to "w6." The left graph shows words "what_DDQ," "about_II," "your_APPGE," and "projects_NN2" decreasing in frequency. The right graph shows "do_VD0," "they_PPHS2," "have_VH0," "their_APPGE," "own_DA," and "arabic_NN1," also decreasing in frequency.</alt-text>
</graphic>
</fig>
<p>This finding is noteworthy with regard to previous findings of a similar S-shape of frequencies in two ways. First, the S-shape in the literature was found in much larger datasets: in <xref ref-type="bibr" rid="ref62">R&#x00FC;hlemann and Barthel (2024)</xref>, for example, the underlying data comprised almost 300,000 utterances from the conversational component of the British National Corpus (BNC); in the present study, the pattern emerged from only 974&#x202F;units. This indicates the robust strength of the pattern. Second, the underlying units of observation in the literature were quite different. In <xref ref-type="bibr" rid="ref83">Yu et al. (2016)</xref>, for example, it was the (written) sentence (in data from the written component of the BNC); in <xref ref-type="bibr" rid="ref38">Klafka and Yurovsky (2021)</xref> and in <xref ref-type="bibr" rid="ref62">R&#x00FC;hlemann and Barthel (2024)</xref>, it was utterances (bounded by speaker change and/or pauses) but not turns in any strict conversation-analytic sense; in the present study, the pattern was found in the smallest interactionally significant unit, the TCU. Given that the frequency of a word is negatively correlated with its information content (<xref ref-type="bibr" rid="ref83">Yu et al., 2016</xref>; <xref ref-type="bibr" rid="ref62">R&#x00FC;hlemann and Barthel, 2024</xref>), the S-shape distribution of frequencies in TCUs suggests that information content is climactically ordered not only in sentences or utterances but even in TCUs. For illustration, in the two TCUs in <xref ref-type="fig" rid="fig4">Figure 4</xref>, the informational peak is clearly on the last words, <italic>projects</italic> and <italic>arabic</italic>. Further, assuming that conversation represents the &#x201C;core matrix for human social life&#x201D; (<xref ref-type="bibr" rid="ref75">Stivers et al., 2009</xref>) and the central context of language use from which others are departures (<xref ref-type="bibr" rid="ref20">Goodwin and Heriage, 1990</xref>, p. 298), the finding points to the possibility that the informational asymmetry in sentences in writing may have formed in the mold of the TCU.</p>
<p>To address the second research question&#x2014;<italic>Which frequency-related measures predict that a TCU will be followed by turn transition or continuation?&#x2014;</italic>a logistic mixed-effects model was fitted with <italic>Turn Transition</italic> (<italic>TT</italic>) as the binary outcome variable. The model with the three factors suggested that neither <italic>S_DiffSecndFirstHalf</italic>, which captures <italic>surprisal,</italic> nor <italic>N_0_CNF</italic>, which captures the number of once-attested ngrams per TCU, discriminate significantly between turn-yielding in questions (TT&#x202F;=&#x202F;&#x201C;yes&#x201D;) and turn-holding in storytelling (TT&#x202F;=&#x202F;&#x201C;no&#x201D;). The only predictor that was found to have that discriminatory power was <italic>F_DropLastThird:</italic> the larger the drop in frequency in the last third of the TCU, the larger the log-odds that turn transition in questions will occur.</p>
<p>How to make sense of these findings? To reiterate, the findings were based on a juxtaposition of question-TCUs in QA sequences that did result in speaker change (TT&#x202F;=&#x202F;&#x201C;yes&#x201D;) and narrative TCUs in storytellings that did not lead to speaker change (TT&#x202F;=&#x202F;&#x201C;no&#x201D;). So all the findings, be they negative or positive, strictly relate to that action-transition nexus.</p>
<p>The <italic>suprisal</italic> variable <italic>S_DiffSecndFirstHalf</italic> and the phraseological variable <italic>N_0_CNF</italic> have in common that they represent resources listeners may deploy to predict the lexico-syntactic trajectory and anticipate the end point of the speaker&#x2019;s talk (<xref ref-type="bibr" rid="ref48">Magyari et al., 2014</xref>; <xref ref-type="bibr" rid="ref13">De Ruiter et al., 2006</xref>). In the present study, these two variables fail to predict the turn transition in questions as opposed to stories. This failure does not invalidate these variables for future studies of turn transition. In different research scenarios, the variables may well be capable of discriminating turn-yielding TCUs from turn-holding ones.<xref ref-type="fn" rid="fn0010"><sup>10</sup></xref> Particularly, the novel variable for only once-attested ngrams, <italic>N_0_CNF</italic>, is promising enough to be tested in future studies for its impact on listeners and their ability to predict a TCU&#x2019;s lexico-syntactic course.</p>
<p>The main finding of the model is that the drop in frequency is sharper in turn-transitioning questions than in turn-holding story TCUs. This is intriguing and, at first sight, counterintuitive as storytelling epitomizes &#x201C;displaced talk,&#x201D; which may require extending the &#x201C;discoursal horizon&#x201D; beyond the here-and-now; that extension may necessitate a more diverse vocabulary (indicating time and place, giving characters&#x2019; names, describing story objects and characters&#x2019; actions) than asking an information-seeking question related to the immediate situational or sequential context. A greater diversity of the vocabulary inevitably entails less-frequent words. Tentatively, however, story TCUs and question TCUs might differ in how rarer words are distributed within the TCU: while in story TCUs, the rarer (and more informative) words might be distributed more uniformly, their distribution in question TCUs might be more asymmetrical, with greater weight toward the TCU end. This hypothesis is explored in a keyness analysis in the following section.</p>
<sec id="sec25">
<label>4.1</label>
<title>Follow-up analysis: key c7 tags in TCU intervals</title>
<p>Keyness analysis (<xref ref-type="bibr" rid="ref68">Scott and Tribble, 2006</xref>) is a statistical method that identifies items of unusual frequency in a target corpus in comparison with a reference corpus. While in most analyses of keyness, the aim is to work out <italic>words</italic> that are key, we are going to apply the keynesss method to the c7 PoS tags. The aim is to test the hypothesis that the distribution of rarer word classes in question TCUs is more asymmetrical, with greater weight toward the TCU end, than in story TCUs.</p>
<p>To this end, word-tag combinations (e.g., <italic>how_RGQ</italic>) were stripped of the word part so that only the c7 tag remained (<italic>RGQ</italic>). Further, two subcorpora were compiled: one for the first two-thirds of TCUs, one for the last third of TCUs, in which model #2 above found a more pronounced drop in frequency for question TCUs than for story TCUs. Finally, using the R packages <italic>quanteda</italic> and <italic>quanteda.textplots</italic>, questions were defined as the target corpus and story TCUs as the reference corpus and <italic>key</italic> c7 tags in questions, as compared to stories, were computed using G<sup>2</sup> (likelihood ratio), a measure of how strongly the observed frequency of a tag deviates from what would be expected by chance between the target and reference corpus. Also, log ratios were computed as an effect size measure (cf. <xref ref-type="bibr" rid="ref8">Brezina, 2018</xref>). The top-most key c7 tags are shown in <xref ref-type="fig" rid="fig5">Figure 5</xref>.<xref ref-type="fn" rid="fn0011"><sup>11</sup></xref></p>
<fig position="float" id="fig5">
<label>Figure 5</label>
<caption>
<p>Top-most key c7 tags (with <italic>p</italic>&#x202F;&#x003C;&#x202F;0.05 and absolute log ratio &#x003E;&#x202F;=&#x202F;1) in different intervals in question TCUs (target corpus) compared to story TCUs (reference corpus): <italic>left panel:</italic> top 10 most key c7 tags in first two-thirds of question TCUs (blue bars) compared to first two-thirds of story TCUs (grey); <italic>right panel:</italic> all key c7 tags in last third of question TCUs (blue) compared to last third of story TCUs (grey).</p>
</caption>
<graphic xlink:href="fpsyg-16-1610179-g005.tif" mimetype="image" mime-subtype="tiff">
<alt-text content-type="machine-generated">Bar charts comparing key c7 tags in question TCUs to story TCUs. The left chart shows significant positive (target) tags like "ppy" and negative (reference) like "ppis1." Right chart highlights positive tags "np1," "rt," and negative tags "ppio1." Both charts use G2 (likelihood ratio) for measurement.</alt-text>
</graphic>
</fig>
<p>As shown in <xref ref-type="fig" rid="fig5">Figure 5</xref>, the most key c7 tags in the early intervals are PPY in questions and, respectively, PPIS1 in stories, with the former designating the second-person personal pronoun, <italic>you</italic> (the sixth most common word in FreMIC, cf. <xref ref-type="table" rid="tab3">Table 3</xref> above), and the latter, the first-person pronoun, <italic>I</italic> (by far the most common word in FreMIC; cf. <xref ref-type="table" rid="tab3">Table 3</xref> above). These are very strong but obvious differences, as most questions are addressed to the interlocutor(s) (e.g., <italic>are you guys brothers?</italic>) and many stories are first-person stories in which the storyteller is the main protagonist. The second most key c7 tags in the early intervals are DDQ in questions, i.e., <italic>wh</italic>-determiners, and VBDZ in stories, i.e., the past tense form <italic>was</italic>. These are also to be expected, as a large chunk of the questions are <italic>wh</italic>-questions, and most stories relate events that happened in the past (see also the key tag VVD for stories). What is notably <italic>missing</italic> from the early intervals, both in questions and stories (at least among the top 10 most key tags; s. <xref rid="SM1" ref-type="supplementary-material">Supplementary Materials 2 and 3</xref> for the full lists of key tags), are tags for any type of nouns. This absence is noteworthy not only because nouns are by far the most type-rich category (cf., for example, the small inventory of pronouns) and by far the most hapax-rich category (hapax legomena are words that occur just once in a corpus and have hence the lowest possible frequency; cf. <xref ref-type="bibr" rid="ref62">R&#x00FC;hlemann and Barthel, 2024</xref>). The absence is also noteworthy because nouns &#x201C;carry most of the lexical content, in the sense of being able to make reference outside language&#x201D; (<xref ref-type="bibr" rid="ref77">Stubbs, 2001</xref>, p. 40; <xref ref-type="bibr" rid="ref5">Biber et al., 1999</xref>, p. 232), and their use is &#x201C;felicitous only in contexts of information novelty, disambiguation needs, or topic and perspective shifts&#x201D; (<xref ref-type="bibr" rid="ref9004">Seifart et al., 2018</xref>, p. 5721). So, nouns do not play a key role in the early intervals, either in question TCUs and story TCUs. Where nouns <italic>do</italic> come in is in the last interval&#x2014;but only in question TCUs, not in story TCUs (see the full key tag lists in <xref rid="SM1" ref-type="supplementary-material">Supplementary Materials 2 and 3</xref>). In the last third in question TCUs, by far the most key tag is NP1 (for singular proper noun), and the fifth most key tag is NN1 (for singular common noun). In the late interval in story TCUs, by contrast, it is the c7 tag UH, that is, interjections (often at the beginning of direct speech), VVN, that is, the past participle of lexical verbs, and VV0, that is, the base form of lexical verbs, that are key. Here, now lies the explanation to the result of model #2, which indicated that the frequency drop is more pronounced in question TCUs than in story TCUs: the drop in frequency is sharper as nouns, the most informative and potentially rarest type of word, are more asymmetrically distributed toward the TCU end in question TCUs than in story TCUs.</p>
<p><xref ref-type="table" rid="tab8">Table 8</xref> shows for each social action type, four TCU examples that are &#x201C;prototypical&#x201D; in the sense that they include words with key c7 tags for the first two-thirds and, respectively, the last third.</p>
<table-wrap position="float" id="tab8">
<label>Table 8</label>
<caption>
<p>Example TCUs with key c7 tags; emboldened items represent the w_c7 tag that had the highest frequency in the early intervals (F_max) and, respectively, the w_c7 tag that had the lowest frequency in the late interval (F_min); F_Drop <italic>(F_DropLastThird)</italic> is calculated from the difference of F_max and F_min.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Type</th>
<th align="left" valign="top">Early intervals (first two thirds)</th>
<th align="left" valign="top">Late interval (last third)</th>
<th align="center" valign="middle">F_max</th>
<th align="center" valign="middle">F_min</th>
<th align="center" valign="middle">F_Drop</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="middle">question</td>
<td align="left" valign="middle">do_VD0 <bold>you_PPY</bold> guys_NN2 need_VV0 to_TO go_VVI back_RP</td>
<td align="left" valign="middle"><bold>ikea_NP1</bold> anytime_NNT1 soon_RR</td>
<td align="center" valign="middle">22.92</td>
<td align="center" valign="middle">0.03</td>
<td align="center" valign="middle">22.89</td>
</tr>
<tr>
<td align="left" valign="middle">question</td>
<td align="left" valign="middle">did_VDD you_PPY get_VVI <bold>the_AT</bold></td>
<td align="left" valign="middle"><bold>poem_NN1</bold> email_NN1</td>
<td align="center" valign="middle">25.82</td>
<td align="center" valign="middle">0.01</td>
<td align="center" valign="middle">25.82</td>
</tr>
<tr>
<td align="left" valign="middle">question</td>
<td align="left" valign="middle"><bold>you_PPY</bold> ever_RR played_VVD like_II</td>
<td align="left" valign="middle">a_AT1 <bold>banjo_NN1</bold></td>
<td align="center" valign="middle">22.92</td>
<td align="center" valign="middle">0.02</td>
<td align="center" valign="middle">22.90</td>
</tr>
<tr>
<td align="left" valign="middle">question</td>
<td align="left" valign="middle">did_VDD <bold>you_PPY</bold> get_VVI anything_PN1 out_II21 of_II22</td>
<td align="left" valign="middle">that_DD1 <bold>relationship_NN1</bold></td>
<td align="center" valign="middle">22.92</td>
<td align="center" valign="middle">0.10</td>
<td align="center" valign="middle">22.82</td>
</tr>
<tr>
<td align="left" valign="middle">story</td>
<td align="left" valign="middle"><bold>and_CC</bold> he_PPHS1 immediately_RR the_AT second_NNT1 we_PPIS2 got_VVD on_II</td>
<td align="left" valign="middle">just_RR <bold>zoned_VVN</bold> in_II us_PPIO2</td>
<td align="center" valign="middle">26.90</td>
<td align="center" valign="middle">0.01</td>
<td align="center" valign="middle">26.89</td>
</tr>
<tr>
<td align="left" valign="middle">story</td>
<td align="left" valign="middle"><bold>i_PPIS1</bold> do_VD0 n&#x2019;t_XX think_VVI</td>
<td align="left" valign="middle">they_PPHS2 <bold>care_VV0</bold></td>
<td align="center" valign="middle">43.78</td>
<td align="center" valign="middle">0.02</td>
<td align="center" valign="middle">43.76</td>
</tr>
<tr>
<td align="left" valign="middle">story</td>
<td align="left" valign="middle">she_PPHS1 said:VVD oh_UH <bold>i_PPIS1</bold> was_VBDZ</td>
<td align="left" valign="middle"><bold>invited_VVN</bold> too_RR</td>
<td align="center" valign="middle">43.78</td>
<td align="center" valign="middle">0.04</td>
<td align="center" valign="middle">43.74</td>
</tr>
<tr>
<td align="left" valign="middle">story</td>
<td align="left" valign="middle">uh_UH <bold>and_CC</bold> his_APPGE position_NN1 as_II a_AT1</td>
<td align="left" valign="middle">diplomat_NN1 is_VBZ <bold>cut_VVN</bold></td>
<td align="center" valign="middle">26.90</td>
<td align="center" valign="middle">0.01</td>
<td align="center" valign="middle">26.89</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Is the TCU-final frequency drop a <italic>turn</italic>-completion cue, regardless of social action type? This question cannot definitively be answered by this study, which compared turn-final question TCUs with turn-medial story TCUs. A <italic>general</italic> turn-completion signaling function for frequency is, however, unlikely. For it would presuppose that speakers manipulate frequencies depending on whether they wish to yield or keep the turn. A manipulation of frequencies could only be achieved if the speaker were skilled enough to use one way of phrasing for one purpose and another way of phrasing for the other purpose. That certainly overestimates a speaker&#x2019;s conscious control over what they say and their stylistic versatility, and it underestimates the constraints imposed by constituent order, which is strict in English, leaving little room for <italic>in situ</italic> variation. It appears more plausible that the frequency drop observed in this study, both in response to RQ #1 and RQ #2, functions as a <italic>TCU</italic> completion cue. Whether that TCU is (intended by the speaker) as the turn-final one is likely signaled by other, far less rules-governed <italic>prosodic</italic> cues such as turn-final lengthening (<xref ref-type="bibr" rid="ref9005">Duncan, 1972</xref>; <xref ref-type="bibr" rid="ref9006">Local and Walker, 2012</xref>; <xref ref-type="bibr" rid="ref7">B&#x00F6;gels and Torreira, 2015</xref>), creaky voice (<xref ref-type="bibr" rid="ref9007">Ogden, 2001</xref>; <xref ref-type="bibr" rid="ref9008">Redi and Shattuck-Hufnagel, 2001</xref>), audible outbreath (<xref ref-type="bibr" rid="ref9006">Local and Walker, 2012</xref>; <xref ref-type="bibr" rid="ref9009">Torreira et al., 2015</xref>), and pitch drop (<xref ref-type="bibr" rid="ref9010">Beattie et al., 1982</xref>; <xref ref-type="bibr" rid="ref9005">Duncan, 1972</xref>; <xref ref-type="bibr" rid="ref7">B&#x00F6;gels and Torreira, 2015</xref>). On this view, turn-completion is most likely signaled by the speaker and processed by the listener in <italic>multimodal clusters</italic>, in which the TCU-final drop in word frequency is one of the several components.</p>
</sec>
</sec>
<sec sec-type="conclusions" id="sec26">
<label>5</label>
<title>Conclusion</title>
<p>FreMIC is a small corpus. Its smallness suggests that the findings should be treated with caution. For example, normalized frequencies may not yet be completely stable, and the speed with which, in the present data, cumulative ngrams become attested only once&#x2014;on average, on the fourth word&#x2014;may be exaggerated in FreMIC compared to larger corpora, where multi-word combinations that occur just once in FreMIC have a higher chance of occurring more frequently. In larger corpora, TCUs will likely reach that juncture at a later point.</p>
<p>The present findings hold for English conversation. To what extent they can be generalized to more languages is an open question. The generalizability may already prove difficult with closely related SVO languages such as, for example, German, which may be among the &#x201C;front-loaded information languages&#x201D; (<xref ref-type="bibr" rid="ref79">Trujillo and Holler, 2024</xref>), in which the first half of utterances is information-heavier than the second half (unlike in English, which is &#x201C;back-loaded,&#x201D; meaning the informational peak occurs in the second half of utterances) In the relatively few languages of the world where the basic constituent order does not start with the subject constituent (c. 17% of all languages; cf. <xref ref-type="bibr" rid="ref23">Hammarstr&#x00F6;m, 2016</xref>), such as Jarawa (spoken on the Andaman Islands, India; OSV), the distribution of frequencies and related measures across words in turns will likely diverge substantially from that in English conversation (where the subject is typically a high-frequency pronominal form; cf. <xref ref-type="bibr" rid="ref62">R&#x00FC;hlemann and Barthel, 2024</xref>), and it is doubtful whether in these languages any similar TCU-final frequency drop can be observed. This, however, is not to suggest that frequency patterns in these languages can never play any role in signaling that the current speaker is about to stop speaking and ready to hand over to another participant. The patterns, if any, might simply be of a different kind (for example, in an OVS language, a TCU-final <italic>rise</italic> in frequency might be construed by listeners as a cue that the speaker is done).<xref ref-type="fn" rid="fn0012"><sup>12</sup></xref></p>
<p>Finally, frequency and frequency-related measures cannot in themselves fully explain turn completion or continuation. Frequency measures will no doubt enter into important interactions with other turn-completion cues (<xref ref-type="bibr" rid="ref7">B&#x00F6;gels and Torreira, 2015</xref>, p. 55) and/or form multimodal packages. Future studies should therefore exhaustively incorporate the diverse set of turn-completion cues not only on the lexical/verbal level but also on the gestural/visual and prosodic/vocal levels. Only thus will it be possible to gain a <italic>comprehensive</italic> view of how speakers give the green light to their interlocutors that they are done and that someone else can now speak.</p>
<p>These limitations notwithstanding, this study does suggest that, in English conversation, word frequencies form an S-shaped pattern in TCUs (RQ #1) and they do discriminate turn-final question TCUs and turn-medial storytelling TCUs (RQ #2). Information extracted from word frequencies may hence serve listeners in conversation as cues to anticipate turn completion in questions as opposed to turn continuation in stories. Whether that information also discriminates other types of social action remains to be investigated in future research.</p>
</sec>
</body>
<back>
<sec sec-type="data-availability" id="sec27">
<title>Data availability statement</title>
<p>The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found at: The data and the R code are openly available in Open Science Framework at <ext-link xlink:href="https://osf.io/ygnze/" ext-link-type="uri">https://osf.io/ygnze/</ext-link>.</p>
</sec>
<sec sec-type="ethics-statement" id="sec28">
<title>Ethics statement</title>
<p>Ethical approval was not required for the studies in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.</p>
</sec>
<sec sec-type="author-contributions" id="sec29">
<title>Author contributions</title>
<p>CR: Validation, Conceptualization, Methodology, Writing &#x2013; original draft, Data curation, Supervision, Visualization, Investigation, Resources, Funding acquisition, Project administration, Writing &#x2013; review &#x0026; editing, Software, Formal analysis.</p>
</sec>

<sec sec-type="COI-statement" id="sec31">
<title>Conflict of interest</title>
<p>The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="ai-statement" id="sec32">
<title>Generative AI statement</title>
<p>The author declares that Gen AI was used in the creation of this manuscript. During the preparation of this work the author used ChatGPT3 in order to explore ways to examine multicollinearity. After using this tool/service, the author reviewed and edited the content as needed and takes full responsibility for the content of the publication.</p>
<p>Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.</p>
</sec>
<sec sec-type="disclaimer" id="sec33">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec sec-type="supplementary-material" id="sec34">
<title>Supplementary material</title>
<p>The Supplementary material for this article can be found online at: <ext-link xlink:href="https://www.frontiersin.org/articles/10.3389/fpsyg.2025.1610179/full#supplementary-material" ext-link-type="uri">https://www.frontiersin.org/articles/10.3389/fpsyg.2025.1610179/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.pdf" id="SM1" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="ref1"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Auer</surname><given-names>P.</given-names></name></person-group> (<year>2018</year>). &#x201C;<article-title>Gaze, addressee selection and turn-taking in three-party interaction</article-title>&#x201D; in <source>Eye-tracking in interaction. Studies on the role of eye gaze in dialogue</source>. eds. <person-group person-group-type="editor"><name><surname>Br&#x00F4;ne</surname><given-names>G.</given-names></name> <name><surname>Oben</surname><given-names>B.</given-names></name></person-group> (<publisher-loc>Amsterdam</publisher-loc>: <publisher-name>John Benjamins</publisher-name>), <fpage>197</fpage>&#x2013;<lpage>231</lpage>.</mixed-citation></ref>
<ref id="ref2"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Auer</surname><given-names>P.</given-names></name></person-group> (<year>2021a</year>). <article-title>Turn-allocation and gaze: a multimodal revision of the &#x201C;current-speaker- selects next&#x201D; rule of the turn-taking system of conversation analysis</article-title>. <source>Discourse Stud.</source> <volume>23</volume>, <fpage>117</fpage>&#x2013;<lpage>140</lpage>. doi: <pub-id pub-id-type="doi">10.1177/146144562096692</pub-id></mixed-citation></ref>
<ref id="ref3"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Auer</surname><given-names>P.</given-names></name></person-group> (<year>2021b</year>). <article-title>Gaze selects the next speaker in answers to questions pronominally addressed to more than one co-participant</article-title>. <source>Interact. Linguist.</source> <volume>2021</volume>:<fpage>21002</fpage>. doi: <pub-id pub-id-type="doi">10.1075/il.21002.aue</pub-id></mixed-citation></ref>
<ref id="ref4"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Barthel</surname><given-names>M.</given-names></name> <name><surname>Meyer</surname><given-names>A. S.</given-names></name> <name><surname>Levinson</surname><given-names>S. C.</given-names></name></person-group> (<year>2017</year>). <article-title>Next speakers plan their turn early and speak after turn-final &#x201C;go-signals.&#x201D;</article-title>. <source>Front. Psychol.</source> <volume>8</volume>:<fpage>393</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fpsyg.2017.00393</pub-id>, PMID: <pub-id pub-id-type="pmid">28443035</pub-id></mixed-citation></ref>
<ref id="ref9003"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Barthel</surname><given-names>M.</given-names></name> <name><surname>Sauppe</surname><given-names>S.</given-names></name></person-group> (<year>2019</year>). <article-title>Speech planning at turn transitions in dialog is associated with increased processing load</article-title>. <source>Cogn. Sci.</source> <volume>43</volume>:<fpage>e12768</fpage>. doi: <pub-id pub-id-type="doi">10.1111/cogs.12768</pub-id></mixed-citation></ref>
<ref id="ref9010"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Beattie</surname><given-names>G.</given-names></name> <name><surname>Cutler</surname><given-names>A.</given-names></name> <name><surname>Pearson</surname><given-names>M.</given-names></name></person-group> (<year>1982</year>). <article-title>Why is Mrs. Thatcher interrupted so often?</article-title> <source>Nature</source> <volume>300</volume>, <fpage>744</fpage>&#x2013;<lpage>747</lpage>.</mixed-citation></ref>
<ref id="ref5"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Biber</surname><given-names>D.</given-names></name> <name><surname>Johansson</surname><given-names>S.</given-names></name> <name><surname>Leech</surname><given-names>G.</given-names></name> <name><surname>Conrad</surname><given-names>S.</given-names></name> <name><surname>Finegan</surname><given-names>E.</given-names></name></person-group> (<year>1999</year>). <source>Long- man grammar of spoken and written English</source>. <publisher-loc>Harlow</publisher-loc>: <publisher-name>Pearson Education Limited</publisher-name>.</mixed-citation></ref>
<ref id="ref6"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>B&#x00F6;gels</surname><given-names>S.</given-names></name> <name><surname>Magyari</surname><given-names>L.</given-names></name> <name><surname>Levinson</surname><given-names>S. C.</given-names></name></person-group> (<year>2015</year>). <article-title>Neural signatures of response planning occur midway through an incoming question in conversation</article-title>. <source>Sci. Rep.</source> <volume>5</volume>, <fpage>1</fpage>&#x2013;<lpage>11</lpage>. doi: <pub-id pub-id-type="doi">10.1038/srep12881</pub-id></mixed-citation></ref>
<ref id="ref7"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>B&#x00F6;gels</surname><given-names>S.</given-names></name> <name><surname>Torreira</surname><given-names>F.</given-names></name></person-group> (<year>2015</year>). <article-title>Listeners use intonational phrase boundaries to project turn ends in spoken interaction</article-title>. <source>J. Phon.</source> <volume>52</volume>, <fpage>46</fpage>&#x2013;<lpage>57</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.wocn.2015.04.004</pub-id></mixed-citation></ref>
<ref id="ref8"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Brezina</surname><given-names>V.</given-names></name></person-group> (<year>2018</year>). <source>Statistics in Corpus linguistics: A practical guide</source>. <publisher-loc>Cambridge</publisher-loc>: <publisher-name>CUP</publisher-name>.</mixed-citation></ref>
<ref id="ref10"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Clayman</surname><given-names>S. E.</given-names></name></person-group> (<year>2013</year>). &#x201C;<article-title>Turn-constructional units and the transition-relevance place</article-title>&#x201D; in <source>The handbook of conversation analysis</source>. eds. <person-group person-group-type="editor"><name><surname>Sidnell</surname><given-names>J.</given-names></name> <name><surname>Stivers</surname><given-names>T.</given-names></name></person-group> (<publisher-loc>Hoboken, NJ</publisher-loc>: <publisher-name>Malden/MA and Oxford, Wiley Blackwell</publisher-name>), <fpage>150</fpage>&#x2013;<lpage>166</lpage>.</mixed-citation></ref>
<ref id="ref11"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Clift</surname><given-names>R.</given-names></name> <name><surname>Holt</surname><given-names>E.</given-names></name></person-group> (<year>2007</year>). <source>Reporting talk. Reported speech in interaction</source>. <publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>, <fpage>1</fpage>&#x2013;<lpage>15</lpage>.</mixed-citation></ref>
<ref id="ref12"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Corps</surname><given-names>R. E.</given-names></name> <name><surname>Crossley</surname><given-names>A.</given-names></name> <name><surname>Gambi</surname><given-names>C.</given-names></name> <name><surname>Pickering</surname><given-names>M. J.</given-names></name></person-group> (<year>2018</year>). <article-title>Early preparation during turn-taking: listeners use content predictions to determine what to say but not when to say it</article-title>. <source>Cognition</source> <volume>175</volume>, <fpage>77</fpage>&#x2013;<lpage>95</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.cognition.2018.01.015</pub-id>, PMID: <pub-id pub-id-type="pmid">29477750</pub-id></mixed-citation></ref>
<ref id="ref13"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>De Ruiter</surname><given-names>J. P.</given-names></name> <name><surname>Mitterer</surname><given-names>H.</given-names></name> <name><surname>Enfield</surname><given-names>N. J.</given-names></name></person-group> (<year>2006</year>). <article-title>Projecting the end of a speaker's turn: a cognitive cornerstone of conversation</article-title>. <source>Language</source> <volume>82</volume>, <fpage>515</fpage>&#x2013;<lpage>535</lpage>. doi: <pub-id pub-id-type="doi">10.1353/lan.2006.0130</pub-id></mixed-citation></ref>
<ref id="ref14"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>DeLong</surname><given-names>K. A.</given-names></name> <name><surname>Urbach</surname><given-names>T. P.</given-names></name> <name><surname>Kutas</surname><given-names>M.</given-names></name></person-group> (<year>2005</year>). <article-title>Probabilistic word pre-activation during language comprehension inferred from electrical brain activity</article-title>. <source>Nat. Neurosci.</source> <volume>8</volume>, <fpage>1117</fpage>&#x2013;<lpage>1121</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nn1504</pub-id>, PMID: <pub-id pub-id-type="pmid">16007080</pub-id></mixed-citation></ref>
<ref id="ref9005"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Duncan</surname><given-names>S.</given-names></name></person-group> (<year>1972</year>). <article-title>Some signals and rules for taking speaking turns in conversations</article-title>. <source>Journal of Personality and Social Psychology</source> <volume>23</volume>, <fpage>283</fpage>&#x2013;<lpage>292</lpage>.</mixed-citation></ref>
<ref id="ref15"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ellis</surname><given-names>N. C.</given-names></name></person-group> (<year>2002</year>). <article-title>Frequency effects in language processing: a review with implications for theoriesof implicit and explicit language acquisition</article-title>. <source>Stud. Second. Lang. Acquis.</source> <volume>24</volume>:<fpage>188</fpage>. doi: <pub-id pub-id-type="doi">10.1017/S0272263102002024</pub-id></mixed-citation></ref>
<ref id="ref17"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Garside</surname><given-names>R.</given-names></name> <name><surname>Smith</surname><given-names>N.</given-names></name></person-group> (<year>1997</year>). &#x201C;<article-title>A hybrid grammatical tagger: CLAWS4</article-title>&#x201D; in <source>Corpus annotation: Linguistic information from computer text corpora</source>. eds. <person-group person-group-type="editor"><name><surname>Garside</surname><given-names>R.</given-names></name> <name><surname>Leech</surname><given-names>G.</given-names></name> <name><surname>McEnery</surname><given-names>A.</given-names></name></person-group> (<publisher-loc>London</publisher-loc>: <publisher-name>Longman</publisher-name>), <fpage>102</fpage>&#x2013;<lpage>121</lpage>.</mixed-citation></ref>
<ref id="ref18"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gisladottir</surname><given-names>R. S.</given-names></name> <name><surname>B&#x00F6;gels</surname><given-names>S.</given-names></name> <name><surname>Levinson</surname><given-names>S. C.</given-names></name></person-group> (<year>2018</year>). <article-title>Oscillatory brain responses reflect anticipation during comprehension of speech acts in spoken dialog</article-title>. <source>Front. Hum. Neurosci.</source> <volume>12</volume>:<fpage>34</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fnhum.2018.00034</pub-id></mixed-citation></ref>
<ref id="ref19"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Goodwin</surname><given-names>C.</given-names></name></person-group> (<year>1984</year>). &#x201C;<article-title>Notes on story structure and the organization of participation</article-title>&#x201D; in <source>Structures of social action: Studies in conversation analysis</source>. eds. <person-group person-group-type="editor"><name><surname>Atkinson</surname><given-names>J. M.</given-names></name> <name><surname>Heritage</surname><given-names>J.</given-names></name></person-group> (<publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>), <fpage>225</fpage>&#x2013;<lpage>246</lpage>.</mixed-citation></ref>
<ref id="ref20"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Goodwin</surname><given-names>C.</given-names></name> <name><surname>Heriage</surname><given-names>J.</given-names></name></person-group> (<year>1990</year>). <article-title>Conversation analysis</article-title>. <source>Annu. Rev. Anthropol.</source> <volume>19</volume>, <fpage>283</fpage>&#x2013;<lpage>307</lpage>. doi: <pub-id pub-id-type="doi">10.1146/annurev.an.19.100190.001435</pub-id></mixed-citation></ref>
<ref id="ref23"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hammarstr&#x00F6;m</surname><given-names>H.</given-names></name></person-group> (<year>2016</year>). <article-title>Linguistic diversity and language evolution</article-title>. <source>J. Lang. Evol.</source> <volume>1</volume>, <fpage>19</fpage>&#x2013;<lpage>29</lpage>. doi: <pub-id pub-id-type="doi">10.1093/jole/lzw002</pub-id></mixed-citation></ref>
<ref id="ref24"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hasher</surname><given-names>L.</given-names></name> <name><surname>Chromiak</surname><given-names>W.</given-names></name></person-group> (<year>1977</year>). <article-title>The processing of frequency information: an automatic mechanism?</article-title> <source>J. Verbal Learn. Verbal Behav.</source> <volume>16</volume>, <fpage>173</fpage>&#x2013;<lpage>184</lpage>. doi: <pub-id pub-id-type="doi">10.1016/S0022-5371(77)80045-5</pub-id></mixed-citation></ref>
<ref id="ref25"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hasher</surname><given-names>L.</given-names></name> <name><surname>Zacks</surname><given-names>R. T.</given-names></name></person-group> (<year>1984</year>). <article-title>Automatic processing of fundamental information: the case of frequency of occurrence</article-title>. <source>Am. Psychol.</source> <volume>39</volume>, <fpage>1372</fpage>&#x2013;<lpage>1388</lpage>. doi: <pub-id pub-id-type="doi">10.1037/0003-066X.39.12.1372</pub-id>, PMID: <pub-id pub-id-type="pmid">6395744</pub-id></mixed-citation></ref>
<ref id="ref26"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Heldner</surname><given-names>M.</given-names></name></person-group> (<year>2011</year>). <article-title>Detection thresholds for gaps, overlaps and no-gap-no-overlaps</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>130</volume>, <fpage>508</fpage>&#x2013;<lpage>513</lpage>. doi: <pub-id pub-id-type="doi">10.1121/1.3598457</pub-id></mixed-citation></ref>
<ref id="ref27"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Heldner</surname><given-names>M.</given-names></name> <name><surname>Edlund</surname><given-names>J.</given-names></name></person-group> (<year>2010</year>). <article-title>Pauses, gaps and overlaps in conversations</article-title>. <source>J. Phon.</source> <volume>38</volume>, <fpage>555</fpage>&#x2013;<lpage>568</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.wocn.2010.08.002</pub-id></mixed-citation></ref>
<ref id="ref28"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Hoey</surname><given-names>M.</given-names></name></person-group> (<year>2005</year>). <source>Lexical priming: A new theory of words and language</source>. <publisher-loc>Abingdon</publisher-loc>: <publisher-name>Routledge</publisher-name>.</mixed-citation></ref>
<ref id="ref29"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Hoffmann</surname><given-names>S.</given-names></name> <name><surname>Evert</surname><given-names>S.</given-names></name> <name><surname>Smith</surname><given-names>N.</given-names></name> <name><surname>Lee</surname><given-names>D.</given-names></name> <name><surname>Prytz</surname><given-names>Y. B.</given-names></name></person-group> (<year>2008</year>). <source>Corpus linguistics with BNCweb &#x2013; A practical guide</source>. <publisher-loc>Frankfurt am Main</publisher-loc>: <publisher-name>Peter Lang</publisher-name>.</mixed-citation></ref>
<ref id="ref30"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Holler</surname><given-names>J.</given-names></name> <name><surname>Levinson</surname><given-names>S. C.</given-names></name></person-group> (<year>2019</year>). <article-title>Multimodal language processing in human communication</article-title>. <source>Trends Cogn. Sci.</source> <volume>23</volume>, <fpage>639</fpage>&#x2013;<lpage>652</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.tics.2019.05.006</pub-id>, PMID: <pub-id pub-id-type="pmid">31235320</pub-id></mixed-citation></ref>
<ref id="ref31"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>H&#x00F6;mke</surname><given-names>P.</given-names></name> <name><surname>Holler</surname><given-names>J.</given-names></name> <name><surname>Levinson</surname><given-names>S. C.</given-names></name></person-group> (<year>2017</year>). <article-title>Eye blinking as addressee feedback in face-to-face conversation</article-title>. <source>Res. Lang. Soc. Interact.</source> <volume>2017</volume>:<fpage>2143</fpage>. doi: <pub-id pub-id-type="doi">10.1080/08351813.2017.1262143</pub-id></mixed-citation></ref>
<ref id="ref32"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Indefrey</surname><given-names>P.</given-names></name> <name><surname>Levelt</surname><given-names>W. J. M.</given-names></name></person-group> (<year>2004</year>). <article-title>The spatial and temporal signatures of word production components</article-title>. <source>Cognition</source> <volume>92</volume>, <fpage>101</fpage>&#x2013;<lpage>144</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.cognition.2002.06.001</pub-id></mixed-citation></ref>
<ref id="ref33"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Jaeger</surname><given-names>T. F.</given-names></name></person-group> (<year>2010</year>). <article-title>Redundancy and reduction: speakers manage syntactic information density</article-title>. <source>Cogn. Psychol.</source> <volume>61</volume>:<fpage>23e62</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.cogpsych.2010.02.002</pub-id></mixed-citation></ref>
<ref id="ref34"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Jefferson</surname><given-names>G.</given-names></name></person-group> (<year>1978</year>). &#x201C;<article-title>Sequential aspects of storytelling in conversation</article-title>&#x201D; in <source>Studies in the organization of conversational interaction</source>. ed. <person-group person-group-type="editor"><name><surname>Schenkein</surname><given-names>J.</given-names></name></person-group> (<publisher-loc>New York</publisher-loc>: <publisher-name>Academic Press</publisher-name>), <fpage>219</fpage>&#x2013;<lpage>248</lpage>.</mixed-citation></ref>
<ref id="ref35"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Jefferson</surname><given-names>G.</given-names></name></person-group> (<year>2004</year>). &#x201C;<article-title>Glossary of transcript symbols with an introduction</article-title>&#x201D; in <source>Conversation analysis: Studies from the first generation</source>. ed. <person-group person-group-type="editor"><name><surname>Lerner</surname><given-names>G. H.</given-names></name></person-group> (<publisher-loc>Amsterdam, Netherlands</publisher-loc>: <publisher-name>John Benjamins</publisher-name>), <fpage>13</fpage>&#x2013;<lpage>31</lpage>.</mixed-citation></ref>
<ref id="ref36"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Jescheniak</surname><given-names>J. D.</given-names></name> <name><surname>Levelt</surname><given-names>W. J. M.</given-names></name></person-group> (<year>1994</year>). <article-title>Word frequency effects in speech production: retrieval of syntactic information and of phonological form</article-title>. <source>J. Exp. Psychol. Learn. Mem. Cogn.</source> <volume>20</volume>, <fpage>824</fpage>&#x2013;<lpage>843</lpage>.</mixed-citation></ref>
<ref id="ref37"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Johns</surname><given-names>B. T.</given-names></name> <name><surname>Gruenenfelder</surname><given-names>T. M.</given-names></name> <name><surname>Pisoni</surname><given-names>D. B.</given-names></name> <name><surname>Jones</surname><given-names>M. N.</given-names></name></person-group> (<year>2012</year>). <article-title>Effects of word frequency, contextual diversity, and semantic distinctiveness on spoken word recognition</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>132</volume>, <fpage>EL74</fpage>&#x2013;<lpage>EL80</lpage>. doi: <pub-id pub-id-type="doi">10.1121/1.4731641</pub-id></mixed-citation></ref>
<ref id="ref38"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Klafka</surname><given-names>J.</given-names></name> <name><surname>Yurovsky</surname><given-names>D.</given-names></name></person-group> (<year>2021</year>). <article-title>Characterizing the typical information curves of diverse languages</article-title>. <source>Entropy</source> <volume>23</volume>:<fpage>1300</fpage>. doi: <pub-id pub-id-type="doi">10.3390/e23101300</pub-id>, PMID: <pub-id pub-id-type="pmid">34682024</pub-id></mixed-citation></ref>
<ref id="ref40"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Labov</surname><given-names>W.</given-names></name></person-group> (<year>1972</year>). <source>Language in the Inner City</source>. <publisher-loc>Oxford</publisher-loc>: <publisher-name>Basil Blackwell</publisher-name>.</mixed-citation></ref>
<ref id="ref41"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Labov</surname><given-names>W.</given-names></name> <name><surname>Waletzky</surname><given-names>J.</given-names></name></person-group> (<year>1967</year>). &#x201C;<article-title>Narrative analysis: Oral versions of personal experience</article-title>&#x201D; in <source>Essays on the verbal and visual arts</source>. ed. <person-group person-group-type="editor"><name><surname>June</surname><given-names>H.</given-names></name></person-group> (<publisher-loc>Seattle</publisher-loc>: <publisher-name>University of Washington Press</publisher-name>), <fpage>12</fpage>&#x2013;<lpage>44</lpage>.</mixed-citation></ref>
<ref id="ref9002"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Landis</surname><given-names>J. R.</given-names></name> <name><surname>Koch</surname><given-names>G. G.</given-names></name></person-group> (<year>1977</year>). <article-title>The Measurement of Observer Agreement for Categorical Data</article-title>. <source>Biometrics</source> <volume>33</volume>, <fpage>159</fpage>&#x2013;<lpage>174</lpage>.</mixed-citation></ref>
<ref id="ref42"><mixed-citation publication-type="other"><person-group person-group-type="author"><name><surname>Leech</surname><given-names>G.</given-names></name> <name><surname>Garside</surname><given-names>R.</given-names></name> <name><surname>Bryant</surname><given-names>M.</given-names></name></person-group> (<year>1994</year>). <italic>CLAWS4: the tagging of the British national corpus</italic>. COLING &#x2018;94: proceedings of the 15th conference on computational linguistics &#x2013; Volume 1, pp. 622&#x2013;628.</mixed-citation></ref>
<ref id="ref43"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lerner</surname><given-names>G. H.</given-names></name></person-group> (<year>2019</year>). <article-title>When someone other than the addressed recipient speaks next: three kinds of intervening action after the selection of next speaker</article-title>. <source>Res. Lang. Soc. Interact.</source> <volume>52</volume>, <fpage>388</fpage>&#x2013;<lpage>405</lpage>. doi: <pub-id pub-id-type="doi">10.1080/08351813.2019.1657280</pub-id></mixed-citation></ref>
<ref id="ref44"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Levelt</surname><given-names>W. J.</given-names></name> <name><surname>Roelofs</surname><given-names>A.</given-names></name> <name><surname>Meyer</surname><given-names>A. S.</given-names></name></person-group> (<year>1999</year>). <article-title>A theory of lexical access in speech production</article-title>. <source>Behav. Brain Sci.</source> <volume>22</volume>, <fpage>1</fpage>&#x2013;<lpage>75</lpage>. doi: <pub-id pub-id-type="doi">10.1017/s0140525x99001776</pub-id>, PMID: <pub-id pub-id-type="pmid">11301520</pub-id></mixed-citation></ref>
<ref id="ref45"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Levinson</surname><given-names>S. C.</given-names></name> <name><surname>Torreira</surname><given-names>F.</given-names></name></person-group> (<year>2015</year>). <article-title>Timing in turn-taking and its implications for processing models of language</article-title>. <source>Front. Psychol.</source> <volume>6</volume>:<fpage>731</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fpsyg.2015.00731</pub-id>, PMID: <pub-id pub-id-type="pmid">26124727</pub-id></mixed-citation></ref>
<ref id="ref46"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Li</surname><given-names>C. L.</given-names></name></person-group> (<year>1986</year>). &#x201C;<article-title>Direct and indirect speech: a functional study</article-title>&#x201D; in <source>Direct and indirect speech</source>. ed. <person-group person-group-type="editor"><name><surname>Coulmas</surname><given-names>F.</given-names></name></person-group> (<publisher-loc>Berlin</publisher-loc>: <publisher-name>Mouton de Gruyter</publisher-name>), <fpage>29</fpage>&#x2013;<lpage>45</lpage>.</mixed-citation></ref>
<ref id="ref9006"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Local</surname><given-names>J.</given-names></name> <name><surname>Walker</surname><given-names>G.</given-names></name></person-group> (<year>2012</year>). <article-title>How phonetic features project more talk</article-title>. <source>Journal of the International Phonetic Association</source> <volume>42</volume>, <fpage>255</fpage>&#x2013;<lpage>280</lpage>.</mixed-citation></ref>
<ref id="ref48"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Magyari</surname><given-names>L.</given-names></name> <name><surname>Bastiaansen</surname><given-names>M. C. M.</given-names></name> <name><surname>de Ruiter</surname><given-names>J. P.</given-names></name> <name><surname>Levinson</surname><given-names>S. C.</given-names></name></person-group> (<year>2014</year>). <article-title>Early anticipation lies behind the speed of response in conversation</article-title>. <source>J. Cogn. Neurosci.</source> <volume>26</volume>, <fpage>2530</fpage>&#x2013;<lpage>2539</lpage>. doi: <pub-id pub-id-type="doi">10.1162/jocn_a_00673</pub-id>, PMID: <pub-id pub-id-type="pmid">24893743</pub-id></mixed-citation></ref>
<ref id="ref49"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Mathis</surname><given-names>T.</given-names></name> <name><surname>Yule</surname><given-names>G.</given-names></name></person-group> (<year>1994</year>). <article-title>Zero quotatives</article-title>. <source>Discourse Process.</source> <volume>18</volume>, <fpage>63</fpage>&#x2013;<lpage>76</lpage>. doi: <pub-id pub-id-type="doi">10.1080/01638539409544884</pub-id></mixed-citation></ref>
<ref id="ref50"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Mayes</surname><given-names>P.</given-names></name></person-group> (<year>1990</year>). <article-title>Quotation in spoken English</article-title>. <source>Stud. Lang.</source> <volume>14</volume>, <fpage>325</fpage>&#x2013;<lpage>363</lpage>. doi: <pub-id pub-id-type="doi">10.1075/sl.14.2.04may</pub-id></mixed-citation></ref>
<ref id="ref51"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Norrick</surname><given-names>N. R.</given-names></name></person-group> (<year>2000</year>). <source>Conversational narrative storytelling in everyday talk</source>. <publisher-loc>Amsterdam</publisher-loc>: <publisher-name>John Benjamins</publisher-name>.</mixed-citation></ref>
<ref id="ref9007"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ogden</surname><given-names>R.</given-names></name></person-group> (<year>2001</year>). <article-title>Turn transition, creak and glottal stop in Finnish talk-in-interaction</article-title>. <source>Journal of the International Phonetic Association</source> <volume>31</volume>, <fpage>139</fpage>&#x2013;<lpage>152</lpage>.</mixed-citation></ref>
<ref id="ref52"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Oldfield</surname><given-names>R. C.</given-names></name> <name><surname>Wingfield</surname><given-names>A.</given-names></name></person-group> (<year>1965</year>). <article-title>Response latencies in naming objects</article-title>. <source>Q. J. Exp. Psychol.</source> <volume>17</volume>, <fpage>273</fpage>&#x2013;<lpage>228</lpage>. doi: <pub-id pub-id-type="doi">10.1080/17470216508416445</pub-id>, PMID: <pub-id pub-id-type="pmid">5852918</pub-id></mixed-citation></ref>
<ref id="ref53"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Piantadosi</surname><given-names>S. T.</given-names></name> <name><surname>Tily</surname><given-names>H.</given-names></name> <name><surname>Gibson</surname><given-names>E.</given-names></name></person-group> (<year>2011</year>). <article-title>Word lengths are optimized for efficient communication</article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source> <volume>108</volume>, <fpage>3526</fpage>&#x2013;<lpage>3529</lpage>. doi: <pub-id pub-id-type="doi">10.1073/pnas.1012551108</pub-id>, PMID: <pub-id pub-id-type="pmid">21278332</pub-id></mixed-citation></ref>
<ref id="ref9008"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Redi</surname><given-names>L.</given-names></name> <name><surname>Shattuck-Hufnagel</surname><given-names>S.</given-names></name></person-group> (<year>2001</year>). <article-title>Variation in the realization of glottalization in normal speakers</article-title>. <source>Journal of Phonetics</source> <volume>29</volume>, <fpage>407</fpage>&#x2013;<lpage>429</lpage>.</mixed-citation></ref>
<ref id="ref55"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Roberts</surname><given-names>S. G.</given-names></name> <name><surname>Torreira</surname><given-names>F.</given-names></name> <name><surname>Levinson</surname><given-names>S. C.</given-names></name></person-group> (<year>2015</year>). <article-title>The effects of processing and sequence organization on the timing of turn taking: a corpus study</article-title>. <source>Front. Psychol.</source> <volume>6</volume>, <fpage>1</fpage>&#x2013;<lpage>16</lpage>. doi: <pub-id pub-id-type="doi">10.3389/fpsyg.2015.00509</pub-id></mixed-citation></ref>
<ref id="ref56"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Robinson</surname><given-names>J. D.</given-names></name> <name><surname>R&#x00FC;hlemann</surname><given-names>C.</given-names></name> <name><surname>Rodriguez</surname><given-names>D. T.</given-names></name></person-group> (<year>2022</year>). <article-title>The bias toward single-unit turns in conversation</article-title>. <source>Res. Lang. Soc. Interact.</source> <volume>2022</volume>:<fpage>7436</fpage>. doi: <pub-id pub-id-type="doi">10.1080/08351813.2022.2067436</pub-id></mixed-citation></ref>
<ref id="ref57"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>R&#x00FC;hlemann</surname><given-names>C.</given-names></name></person-group> (<year>2007</year>). <source>Conversation in context: A corpus-driven approach</source>. <publisher-loc>London</publisher-loc>: <publisher-name>Continuum</publisher-name>.</mixed-citation></ref>
<ref id="ref58"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>R&#x00FC;hlemann</surname><given-names>C.</given-names></name></person-group> (<year>2013</year>). <source>Narrative in English conversation: A corpus analysis of storytelling</source>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>.</mixed-citation></ref>
<ref id="ref59"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>R&#x00FC;hlemann</surname><given-names>C.</given-names></name></person-group> (<year>2020a</year>). <article-title>Turn structure and inserts</article-title>. <source>Int. J. Corpus Linguist.</source> <volume>25</volume>, <fpage>185</fpage>&#x2013;<lpage>213</lpage>. doi: <pub-id pub-id-type="doi">10.1075/ijcl.19098.ruh</pub-id></mixed-citation></ref>
<ref id="ref60"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>R&#x00FC;hlemann</surname><given-names>C.</given-names></name></person-group> (<year>2020b</year>). <source>Visual linguistics with R. An introduction to quantitative interactional linguistics</source>. <publisher-loc>Amsterdam</publisher-loc>: <publisher-name>Benjamins</publisher-name>.</mixed-citation></ref>
<ref id="ref61"><mixed-citation publication-type="other"><person-group person-group-type="author"><name><surname>R&#x00FC;hlemann</surname><given-names>C.</given-names></name> <name><surname>Auer</surname><given-names>P.</given-names></name> <name><surname>Gries</surname><given-names>S. T.</given-names></name> <name><surname>Holler</surname><given-names>J.</given-names></name> <name><surname>Schulte</surname><given-names>M</given-names></name></person-group>. (<year>n.d.</year>). <italic>Which multimodal clusters discriminate between turn-final question units and turn-medial story units?</italic></mixed-citation></ref>
<ref id="ref62"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>R&#x00FC;hlemann</surname><given-names>C.</given-names></name> <name><surname>Barthel</surname><given-names>M.</given-names></name></person-group> (<year>2024</year>). <article-title>Word frequency and cognitive effort in turns-at-talk: turn structure affects processing load in natural conversation</article-title>. <source>Front. Psychol.</source> <volume>15</volume>:<fpage>29</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fpsyg.2024.1208029</pub-id>, PMID: <pub-id pub-id-type="pmid">38899128</pub-id></mixed-citation></ref>
<ref id="ref63"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>R&#x00FC;hlemann</surname><given-names>C.</given-names></name> <name><surname>Gries</surname><given-names>S. T.</given-names></name></person-group> (<year>2020</year>). <article-title>Speakers advance-project turn completion by slowing down: a multifactorial corpus analysis</article-title>. <source>J. Phon.</source> <volume>2020</volume>:<fpage>976</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.wocn.2020.100976</pub-id></mixed-citation></ref>
<ref id="ref64"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>R&#x00FC;hlemann</surname><given-names>C.</given-names></name> <name><surname>Ptak</surname><given-names>A.</given-names></name></person-group> (<year>2023</year>). <article-title>Reaching below the tip of the iceberg: a guide to the Freiburg multimodal interaction Corpus (FreMIC)</article-title>. <source>Open Linguist.</source> <volume>2023</volume>:<fpage>245</fpage>. doi: <pub-id pub-id-type="doi">10.1515/opli-2022-0245</pub-id></mixed-citation></ref>
<ref id="ref65"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>R&#x00FC;hlemann</surname><given-names>C.</given-names></name> <name><surname>Schweinberger</surname><given-names>M.</given-names></name></person-group> (<year>2021</year>). <article-title>Which word gets the nuclear stress in a turn-at-talk?</article-title> <source>J. Pragmat.</source> <volume>178</volume>, <fpage>426</fpage>&#x2013;<lpage>439</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.pragma.2021.04.005</pub-id></mixed-citation></ref>
<ref id="ref9001"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>R&#x00FC;hlemann</surname><given-names>C.</given-names></name> <name><surname>Trujillo</surname><given-names>J.</given-names></name></person-group> (<year>2024</year>). <article-title>The effects of gesture expressivity on emotional resonance in storytelling interaction</article-title>. <source>Frontiers in Psychology (Sec. Psychology of Language)</source>. Available online at: <ext-link xlink:href="https://www.frontiersin.org/journals/psychology/articles/10.3389/fpsyg.2024.1477263/full" ext-link-type="uri">https://www.frontiersin.org/journals/psychology/articles/10.3389/fpsyg.2024.1477263/full</ext-link></mixed-citation></ref>
<ref id="ref66"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sacks</surname><given-names>H.</given-names></name> <name><surname>Schegloff</surname><given-names>E. A.</given-names></name> <name><surname>Jefferson</surname><given-names>G.</given-names></name></person-group> (<year>1974</year>). <article-title>A simplest systematics for the organisation of turn-taking for conversation</article-title>. <source>Language</source> <volume>50</volume>, <fpage>696</fpage>&#x2013;<lpage>735</lpage>. doi: <pub-id pub-id-type="doi">10.1353/lan.1974.0010</pub-id></mixed-citation></ref>
<ref id="ref67"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Schegloff</surname><given-names>E. A.</given-names></name></person-group> (<year>2007</year>). <source>Sequence organisation in interaction: A primer in conversation-analysis</source>. <publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>.</mixed-citation></ref>
<ref id="ref68"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Scott</surname><given-names>M.</given-names></name> <name><surname>Tribble</surname><given-names>C.</given-names></name></person-group> (<year>2006</year>). <source>Textual patterns: Key words and corpus analysis in language education</source>. <publisher-loc>Amsterdam, Philadelphia</publisher-loc>: <publisher-name>Benjamins</publisher-name>.</mixed-citation></ref>
<ref id="ref9004"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Seifart</surname><given-names>F.</given-names></name> <name><surname>Strunk</surname><given-names>J.</given-names></name> <name><surname>Danielsen</surname><given-names>S.</given-names></name> <name><surname>Bickel</surname><given-names>B.</given-names></name></person-group> (<year>2018</year>). <article-title>Nouns slow down speech across structurally and culturally diverse languages</article-title>. <source>PNAS</source> <volume>115</volume>, <fpage>5720</fpage>&#x2013;<lpage>5725</lpage>. doi: <pub-id pub-id-type="doi">10.1073/pnas.1800708115</pub-id></mixed-citation></ref>
<ref id="ref69"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Seyfarth</surname><given-names>S.</given-names></name></person-group> (<year>2014</year>). <article-title>Word informativity influences acoustic duration: effects of contextual predictability on lexical representation</article-title>. <source>Cognition</source> <volume>133</volume>, <fpage>140</fpage>&#x2013;<lpage>155</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.cognition.2014.06.013</pub-id>, PMID: <pub-id pub-id-type="pmid">25019178</pub-id></mixed-citation></ref>
<ref id="ref70"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Shapiro</surname><given-names>B. J.</given-names></name></person-group> (<year>1969</year>). <article-title>The subjective estimate of relative word frequency</article-title>. <source>J. Verbal Learn. Verbal Behav.</source> <volume>8</volume>, <fpage>248</fpage>&#x2013;<lpage>251</lpage>. doi: <pub-id pub-id-type="doi">10.1016/S0022-5371(69)80070-8</pub-id></mixed-citation></ref>
<ref id="ref72"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Stivers</surname><given-names>T.</given-names></name></person-group> (<year>2008</year>). <article-title>Stance, alignment, and affiliation during storytelling: when nodding is a token of affiliation</article-title>. <source>Res. Lang. Soc. Interact.</source> <volume>41</volume>, <fpage>31</fpage>&#x2013;<lpage>57</lpage>. doi: <pub-id pub-id-type="doi">10.1080/08351810701691123</pub-id></mixed-citation></ref>
<ref id="ref73"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Stivers</surname><given-names>T.</given-names></name></person-group> (<year>2010</year>). <article-title>An overview of question response system in American English conversation</article-title>. <source>J. Pragmat.</source> <volume>42</volume>, <fpage>2772</fpage>&#x2013;<lpage>2781</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.pragma.2010.04.011</pub-id></mixed-citation></ref>
<ref id="ref74"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Stivers</surname><given-names>T.</given-names></name></person-group> (<year>2013</year>). &#x201C;<article-title>Sequence organization</article-title>&#x201D; in <source>The handbook of conversation analysis</source>. eds. <person-group person-group-type="editor"><name><surname>Sidnell</surname><given-names>J.</given-names></name> <name><surname>Stivers</surname><given-names>T.</given-names></name></person-group> (<publisher-loc>Malden/MA</publisher-loc>: <publisher-name>Blackwell</publisher-name>), <fpage>191</fpage>&#x2013;<lpage>209</lpage>.</mixed-citation></ref>
<ref id="ref75"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Stivers</surname><given-names>T.</given-names></name> <name><surname>Enfield</surname><given-names>N. J.</given-names></name> <name><surname>Brown</surname><given-names>P.</given-names></name> <name><surname>Englert</surname><given-names>C.</given-names></name> <name><surname>Hayashi</surname><given-names>M.</given-names></name> <name><surname>Heinemann</surname><given-names>T.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>Universals and cultural variation in turn-taking in conversation</article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source> <volume>106</volume>, <fpage>10587</fpage>&#x2013;<lpage>10592</lpage>. doi: <pub-id pub-id-type="doi">10.1073/pnas.0903616106</pub-id>, PMID: <pub-id pub-id-type="pmid">19553212</pub-id></mixed-citation></ref>
<ref id="ref76"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Stivers</surname><given-names>T.</given-names></name> <name><surname>Rossano</surname><given-names>F.</given-names></name></person-group> (<year>2010</year>). <article-title>Mobilizing response</article-title>. <source>Res. Lang. Soc. Interact.</source> <volume>43</volume>, <fpage>3</fpage>&#x2013;<lpage>31</lpage>. doi: <pub-id pub-id-type="doi">10.1080/08351810903471258</pub-id></mixed-citation></ref>
<ref id="ref77"><mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Stubbs</surname><given-names>M.</given-names></name></person-group> (<year>2001</year>). <source>Words and phrases. Corpus studies of lexical semantics</source>, vol. <volume>20</volume>. <publisher-loc>Malden/MA</publisher-loc>: <publisher-name>Blackwell studies in English</publisher-name>, <fpage>71</fpage>&#x2013;<lpage>87</lpage>.</mixed-citation></ref>
<ref id="ref9009"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Torreira</surname><given-names>F.</given-names></name> <name><surname>B&#x00F6;gels</surname><given-names>S.</given-names></name> <name><surname>Levinson</surname><given-names>S. C.</given-names></name></person-group> (<year>2015</year>). <article-title>Breathing for answering: the time course of response planning in conversation</article-title>. <source>Frontiers in Psychology</source>. doi: <pub-id pub-id-type="doi">10.3389/fpsyg.2015.00284</pub-id></mixed-citation></ref>
<ref id="ref79"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Trujillo</surname><given-names>J. P.</given-names></name> <name><surname>Holler</surname><given-names>J. J.</given-names></name></person-group> (<year>2024</year>). <article-title>Information distribution patterns in naturalistic dialogue differ across languages</article-title>. <source>Psychon. Bull. Rev.</source> <volume>2024</volume>, <fpage>1723</fpage>&#x2013;<lpage>1734</lpage>. doi: <pub-id pub-id-type="doi">10.3758/s13423-024-02452-0</pub-id></mixed-citation></ref>
<ref id="ref80"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Trujillo</surname><given-names>J. P.</given-names></name> <name><surname>Judith Holler</surname><given-names>J.</given-names></name></person-group> (<year>2025</year>). <article-title>Multimodal information density is highest in question beginnings, and early entropy is associated with fewer but longer visual signals</article-title>. <source>Discourse Process.</source> <volume>2025</volume>:<fpage>13314</fpage>. doi: <pub-id pub-id-type="doi">10.1080/0163853X.2024.2413314</pub-id></mixed-citation></ref>
<ref id="ref81"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Walker</surname><given-names>M. B.</given-names></name> <name><surname>Trimboli</surname><given-names>C.</given-names></name></person-group> (<year>1982</year>). <article-title>Smooth transitions in conversational interactions</article-title>. <source>J. Soc. Psychol.</source> <volume>117</volume>, <fpage>305</fpage>&#x2013;<lpage>306</lpage>.</mixed-citation></ref>
<ref id="ref82"><mixed-citation publication-type="other"><person-group person-group-type="author"><name><surname>Wittenburg</surname><given-names>P.</given-names></name> <name><surname>Brugman</surname><given-names>H.</given-names></name> <name><surname>Russel</surname><given-names>A.</given-names></name> <name><surname>Klassmann</surname><given-names>A.</given-names></name> <name><surname>Sloetjes</surname><given-names>H.</given-names></name></person-group> (<year>2006</year>). <italic>Elan: a professional framework for multimodality research</italic>. In Proceedings of LREC, 2006 (Genoa).</mixed-citation></ref>
<ref id="ref83"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yu</surname><given-names>S.</given-names></name> <name><surname>Cong</surname><given-names>J.</given-names></name> <name><surname>Liang</surname><given-names>J.</given-names></name> <name><surname>Liu</surname><given-names>H.</given-names></name></person-group> (<year>2016</year>). <article-title>The distribution of information content in English sentences</article-title>. <source>arXiv</source> <volume>2016</volume>:<fpage>7681</fpage>. doi: <pub-id pub-id-type="doi">10.48550/arXiv.1609.07681</pub-id></mixed-citation></ref>
</ref-list>
<fn-group>
<fn id="fn0013" fn-type="custom" custom-type="edited-by"><p>Edited by: <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1384755/overview">Anne Pycha</ext-link>, University of Wisconsin&#x2013;Milwaukee, United States</p></fn>
<fn id="fn0014" fn-type="custom" custom-type="reviewed-by"><p>Reviewed by: <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1857374/overview">Sara B&#x00F6;gels</ext-link>, Tilburg University, Netherlands</p><p>Matthew Brook O&#x2019;Donnell, University of Pennsylvania, United States</p></fn>
</fn-group>
<fn-group>
<fn id="fn0001"><p><sup>1</sup>The schematic representation only depicts turn-final go-signals; it does not depict advance-projecting turn completion cues, whose onset may be much earlier in the turn.</p></fn>
<fn id="fn0002"><p><sup>2</sup>Long-distance projection appears to play a smaller role in estimating turn endings than one-off final cues. <xref ref-type="bibr" rid="ref12">Corps et al. (2018)</xref> found that while content predictability enables listeners to prepare a response early, it does not guide them in deciding when to begin articulating it. Likewise, <xref ref-type="bibr" rid="ref7">B&#x00F6;gels and Torreira (2015)</xref> demonstrated that prosodic features in the final word&#x2014;but not in earlier ones&#x2014;shaped turn-end judgments, indicating that final cues carry more weight than long-range anticipation.</p></fn>
<fn id="fn0003"><p><sup>3</sup><ext-link xlink:href="http://ucrel-api.lancaster.ac.uk/claws/free.html" ext-link-type="uri">http://ucrel-api.lancaster.ac.uk/claws/free.html</ext-link></p></fn>
<fn id="fn0004"><p><sup>4</sup>Another advantage is that the underlying grammatical words in contracted forms are recognized and tagged separately; e.g., <italic>gonna</italic> is tagged <italic>gon_VVGK na_TO.</italic></p></fn>
<fn id="fn0005"><p><sup>5</sup>Identifying such QA sequences is anything but trivial. Questions may remain unanswered, involve a <italic>wh</italic>-pronoun but do not seek information but affirmation of the stance displayed in the question (as in rhetorical questions); or they may get responded to but not in a type-fitted manner by the selected nex-speaker but by a third, non-selected party (<xref ref-type="bibr" rid="ref43">Lerner, 2019</xref>) who inserts some (intrusive) talk that does not provide the sought information. Another complicating factor is the occurrence of questions in turbulent turn-taking, for example due to multiple overlap, which make identification of question and particularly answer difficult.</p></fn>
<fn id="fn0006"><p><sup>6</sup>The steps involved were: (i) map the first segment (for example, <italic>so wait</italic>, in extract (11.c)) to the (full) IPU of which it is a part thereby also mapping it to the <italic>c7</italic> word-tag string associated with the full IPU; this mapping utilizes the fact that both units start at the same time in the recording and therefore have the same starting time in ELAN; (ii) convert CA transcription in TCU segments into orthographic transcription by removing all CA-related characters, comments, pauses etc. making use of regular expression; (iii) collapse all orthographic TCU segments into a single string; (iv) devise a function to map <italic>c7</italic> word-tags in the IPU to the matching orthographic TCU segments; (v) apply the mapping function.</p></fn>
<fn id="fn0007"><p><sup>7</sup>FreMIC is a small corpus, with less than 400,000 word tokens. This smallness may be seen as compromising its ability to reflect the macrocosm of <italic>la langue</italic>. However, the normalized frequencies obtained for the question <italic>what&#x2019;s a mountain for you?</italic> from FreMIC shown in <xref ref-type="table" rid="tab4">Table 4</xref>. roughly follow the same trajectory as the normalized frequencies for the same question obtained from the much larger conversational subcorpus of the British National Corpus, which comprises 4.2 million word tokens, where the frequencies are, in the order of the words in the question: 9.09, 25.4,18.3,0.025, 5.43, and 31.9.</p></fn>
<fn id="fn0008"><p><sup>8</sup>Note that the frequencies in this example do not neatly follow the S-shaped distribution that will be demonstrated in Section 3. The example is hence representative of (the many other) cases in the question and story samples that do not behave prototypically. Examples of TCUs in which the frequencies are more closely aligned with the S-shape will be given in Section 4.</p></fn>
<fn id="fn0009"><p><sup>9</sup><italic>Surprisal</italic> on the TCU-first word, for which there is no prior word(s), is obtained from the negative log of the word&#x2019;s frequency in FreMIC divided by the total number of words in FreMIC (cf. <xref ref-type="bibr" rid="ref63">R&#x00FC;hlemann and Gries, 2020</xref>). An alternative method, which takes into account the fact that turn/TCU-first words are taken from a rather specialized portion of the vocabulary, may be more precise (cf. <xref ref-type="bibr" rid="ref65">R&#x00FC;hlemann and Schweinberger, 2021</xref>). This method, however, could not be adapted to the present data (due to the unavailability of units identiied as <italic>turns</italic> in FreMIC).</p></fn>
<fn id="fn0010"><p><sup>10</sup>The number of once-attested ngrams (N_0_CNF) was found an important variable in a Random Forest analysis of <italic>multimodal packages</italic> discriminating between transition-ready question TCUs and transition-averse story TCUs; this analysis incorporated 14 predictors from the verbal, visual, and vocal modalities (R&#x00FC;hlemann, Auer, Gries, Holler, &#x0026; Schulte, <italic>In preparation</italic>).</p></fn>
<fn id="fn0011"><p><sup>11</sup>The plot also includes the results for NN1 although the log ratio is &#x003C;1 (<italic>p</italic>&#x202F;&#x003C;&#x202F;0.05); see <xref rid="SM1" ref-type="supplementary-material">Supplementary Materials 2 and 3</xref></p></fn>
<fn id="fn0012"><p><sup>12</sup>I owe this idea to an anonymous reviewer.</p></fn>
</fn-group>
</back>
</article>