<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Psychol.</journal-id>
<journal-title>Frontiers in Psychology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Psychol.</abbrev-journal-title>
<issn pub-type="epub">1664-1078</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpsyg.2017.00395</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Psychology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Auditory-Motor Rhythms and Speech Processing in French and German Listeners</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Falk</surname> <given-names>Simone</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="author-notes" rid="fn001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/135250/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Volpi-Moncorger</surname> <given-names>Chlo&#x000E9;</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Dalla Bella</surname> <given-names>Simone</given-names></name>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
<xref ref-type="aff" rid="aff5"><sup>5</sup></xref>
<xref ref-type="aff" rid="aff6"><sup>6</sup></xref>
<xref ref-type="aff" rid="aff7"><sup>7</sup></xref>
<xref ref-type="author-notes" rid="fn002"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/24326/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Institut f&#x000FC;r Deutsche Philologie, Ludwig-Maximilians-University</institution> <country>Munich, Germany</country></aff>
<aff id="aff2"><sup>2</sup><institution>Laboratoire Parole et Langage, UMR 7309, Centre National de la Recherche Scientifique, Aix-Marseille University</institution> <country>Aix-en-Provence, France</country></aff>
<aff id="aff3"><sup>3</sup><institution>Laboratoire Phon&#x000E9;tique et Phonologie, UMR 7018, CNRS, Universit&#x000E9; Sorbonne Nouvelle Paris-3</institution> <country>Paris, France</country></aff>
<aff id="aff4"><sup>4</sup><institution>EuroMov, University of Montpellier</institution> <country>Montpellier, France</country></aff>
<aff id="aff5"><sup>5</sup><institution>Institut Universitaire de France</institution> <country>Paris, France</country></aff>
<aff id="aff6"><sup>6</sup><institution>International Laboratory for Brain, Music, and Sound Research</institution> <country>Montreal, QC, Canada</country></aff>
<aff id="aff7"><sup>7</sup><institution>Department of Cognitive Psychology, Wy&#x0017C;sza Szko&#x00142;a Finans&#x000F3;w i Zarz&#x00105;dzania w Warszawie (WSFiZ)</institution> <country>Warsaw, Poland</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Daniela Sammler, Max Planck Institute for Human Cognitive and Brain Sciences, Germany</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Maren Schmidt-Kassow, Goethe University, Germany; Cyrille Magne, Middle Tennessee State University, USA</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Simone Falk <email>simone.falk&#x00040;univ-paris3.fr</email></p></fn>
<fn fn-type="corresp" id="fn002"><p>Simone Dalla Bella <email>simone.dalla-bella&#x00040;umontpellier.fr</email></p></fn>
<fn fn-type="other" id="fn003"><p>This article was submitted to Auditory Cognitive Neuroscience, a section of the journal Frontiers in Psychology</p></fn></author-notes>
<pub-date pub-type="epub">
<day>11</day>
<month>04</month>
<year>2017</year>
</pub-date>
<pub-date pub-type="collection">
<year>2017</year>
</pub-date>
<volume>8</volume>
<elocation-id>395</elocation-id>
<history>
<date date-type="received">
<day>01</day>
<month>10</month>
<year>2016</year>
</date>
<date date-type="accepted">
<day>02</day>
<month>03</month>
<year>2017</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2017 Falk, Volpi-Moncorger and Dalla Bella.</copyright-statement>
<copyright-year>2017</copyright-year>
<copyright-holder>Falk, Volpi-Moncorger and Dalla Bella</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract><p>Moving to a speech rhythm can enhance verbal processing in the listener by increasing temporal expectancies (Falk and Dalla Bella, <xref ref-type="bibr" rid="B18">2016</xref>). Here we tested whether this hypothesis holds for prosodically diverse languages such as German (a lexical stress-language) and French (a non-stress language). Moreover, we examined the relation between motor performance and the benefits for verbal processing as a function of language. Sixty-four participants, 32 German and 32 French native speakers detected subtle word changes in accented positions in metrically structured sentences to which they previously tapped with their index finger. Before each sentence, they were cued by a metronome to tap either congruently (i.e., to accented syllables) or incongruently (i.e., to non-accented parts) to the following speech stimulus. Both French and German speakers detected words better when cued to tap congruently compared to incongruent tapping. Detection performance was predicted by participants&#x00027; motor performance in the non-verbal cueing phase. Moreover, tapping rate while participants tapped to speech predicted detection differently for the two language groups, in particular in the incongruent tapping condition. We discuss our findings in light of the rhythmic differences of both languages and with respect to recent theories of expectancy-driven and multisensory speech processing.</p></abstract>
<kwd-group>
<kwd>speech perception</kwd>
<kwd>multisensory rhythm</kwd>
<kwd>verbal processing</kwd>
<kwd>temporal expectancies</kwd>
<kwd>temporal prediction</kwd>
</kwd-group>
<contract-num rid="cn001">327586</contract-num>
<contract-num rid="cn004">2011/01/M/HS6/04158</contract-num>
<contract-sponsor id="cn001">Seventh Framework Programme<named-content content-type="fundref-id">10.13039/501100004963</named-content></contract-sponsor>
<contract-sponsor id="cn002">Ludwig-Maximilians-Universit&#x000E4;t M&#x000FC;nchen<named-content content-type="fundref-id">10.13039/501100005722</named-content></contract-sponsor>
<contract-sponsor id="cn003">Institut Universitaire de France<named-content content-type="fundref-id">10.13039/501100004795</named-content></contract-sponsor>
<contract-sponsor id="cn004">DAAD and Polish NCN</contract-sponsor>
<counts>
<fig-count count="7"/>
<table-count count="3"/>
<equation-count count="0"/>
<ref-count count="94"/>
<page-count count="14"/>
<word-count count="10221"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>In everyday communication and interaction, we often experience our sound environment through movement. We sway with music, move our eyebrows, head, or body while participating in a conversation, we gesture while speaking (e.g., McNeill, <xref ref-type="bibr" rid="B47">1992</xref>; Janata et al., <xref ref-type="bibr" rid="B28">2012</xref>). In particular, rhythmic sounds such as music featuring a regular beat or speech with a metrical structure (e.g., as in rhymes or poetry) are often accompanied by coordinated rhythmic movement (e.g., Ong, <xref ref-type="bibr" rid="B55">2002</xref>; Maes et al., <xref ref-type="bibr" rid="B44">2014</xref>). Parents stimulate their infants via concurrent rhythmic movements in verbal games and nursery rhymes (e.g., Stern, <xref ref-type="bibr" rid="B81">1974</xref>; Opie and Opie, <xref ref-type="bibr" rid="B57">1997</xref>). Older children utter words and move simultaneously during rhyme games and song by clapping their hands or stamping their feet (e.g., Opie and Opie, <xref ref-type="bibr" rid="B56">1988</xref>). Past research has shown that rhythm facilitates multisensory coordination and temporal perception (Manning and Schutz, <xref ref-type="bibr" rid="B45">2013</xref>, <xref ref-type="bibr" rid="B46">2016</xref>). In return, aligning motor to verbal rhythms facilitates verbal processing (Falk and Dalla Bella, <xref ref-type="bibr" rid="B18">2016</xref>). The mechanisms underlying the beneficial outcomes of aligned auditory-motor rhythms, particularly in the verbal domain, are still unclear. Here we aim at contributing to this issue by examining auditory-motor alignment and its effects on verbal processing in two rhythmically diverse languages, namely French and German.</p>
<p>There is evidence that cognitive benefits of auditory-motor rhythms can be driven by temporal expectancies. In general, when we expect something to happen at a certain time, we attend more to that particular moment than to another time (e.g., Large and Jones, <xref ref-type="bibr" rid="B37">1999</xref>). Auditory rhythms such as metrical speech or music with a salient beat structure feature accent patterns of syllables and notes that recur (quasi-) periodically in time (e.g., Lehiste, <xref ref-type="bibr" rid="B39">1977</xref>; London, <xref ref-type="bibr" rid="B42">2004</xref>) and are perceived as highly regular. An &#x0201C;accent&#x0201D; refers to a prominent syllable or note marked by the expansion of pitch, intensity, duration or other heightened acoustic properties (e.g., articulatory clarity in speech). Thanks to the recurrence of these accent patterns, a predictable temporal structure (rhythm) emerges across several timescales. That these auditory rhythms enhance attending is supported by both behavioral and psychophysiological evidence (Jones et al., <xref ref-type="bibr" rid="B30">2002</xref>; Ellis and Jones, <xref ref-type="bibr" rid="B17">2010</xref>). EEG studies show that periodic sequences of tones and syllables evoke larger amplitudes and shorter latencies of attention-related brain potentials (i.e., the P3b) than sequences with aperiodic rhythmic organization (Schwartze et al., <xref ref-type="bibr" rid="B79">2011</xref>; Otterbein et al., <xref ref-type="bibr" rid="B58">2012</xref>). In addition, higher attending to temporally predictable sounds is underpinned by neural oscillatory activity that phase-locks to rhythmic periodicities in the auditory signal at hierarchically nested frequencies (e.g., Lakatos et al., <xref ref-type="bibr" rid="B36">2008</xref>; Fujioka et al., <xref ref-type="bibr" rid="B23">2010</xref>; Peelle et al., <xref ref-type="bibr" rid="B60">2013</xref>). Independently of the exact neural basis of rhythmic attending, which is still debated, several studies point to behavioral advantages of temporally predictable sound sequences in both speech and music. In metrical speech, information is better processed and remembered (Quen&#x000E9; and Port, <xref ref-type="bibr" rid="B66">2005</xref>; Dilley and McAuley, <xref ref-type="bibr" rid="B14">2008</xref>; Roncaglia-Denissen et al., <xref ref-type="bibr" rid="B74">2013</xref>), in particular when it occurs on accented compared to unaccented syllables (Pitt and Samuel, <xref ref-type="bibr" rid="B63">1990</xref>; Zheng and Pierrehumbert, <xref ref-type="bibr" rid="B94">2010</xref>; Falk and Dalla Bella, <xref ref-type="bibr" rid="B18">2016</xref>). In sum, these findings lend support to the conclusion that temporal expectancies driven by an auditory rhythm in speech may help enhancing the perceptual salience of prominent syllables, and, thereby, facilitate verbal processing at these points in time.</p>
<p>Movement coordinated with the rhythm of sound sequences can also modify the way listeners attend to and encode auditory stimuli. This has been shown with musical material (e.g., Phillips-Silver and Trainor, <xref ref-type="bibr" rid="B62">2005</xref>). For example, synchronous motor activity with periodic tone sequences enhances the amplitude of attention-related brain potentials (Schmidt-Kassow et al., <xref ref-type="bibr" rid="B76">2013</xref>; Conradi et al., <xref ref-type="bibr" rid="B9">2016</xref>). In another EEG study, Chemin et al. (<xref ref-type="bibr" rid="B8">2014</xref>) showed that participants moving their hand to either a binary or ternary beat pattern of rhythmically ambiguous music encoded the rhythm of the sound through the rhythmic movement pattern. When the same participants listened to the music later, their brain responses were enhanced at exactly those points in time when their movements had previously occurred. This finding is compatible with the &#x0201C;active sensing&#x0201D; framework which posits that the motor system shapes processing of sensory information by linking auditory information to temporal predictions generated by action planning and execution (see Morillon et al., <xref ref-type="bibr" rid="B52">2015</xref>, for an overview). The results of Chemin et al. (<xref ref-type="bibr" rid="B8">2014</xref>) also fit with the common coding theory (Prinz, <xref ref-type="bibr" rid="B65">1990</xref>; Hommel et al., <xref ref-type="bibr" rid="B26">2001</xref>), which states that predictions associated with motor action planning and execution are jointly coded with auditory predictions in the cognitive system (see also Maes et al., <xref ref-type="bibr" rid="B44">2014</xref>, for a review).</p>
<p>While these theoretical approaches have been used to explain the role of rhythm and auditory-motor coupling in music, evidence for multisensory rhythmic effects in speech is still scarce. In a recent study, we investigated the effects of shared predictions generated by a temporally aligned auditory-motor rhythm on speech perception (Falk and Dalla Bella, <xref ref-type="bibr" rid="B18">2016</xref>). Participants were cued to align or misalign a motor rhythm (i.e., finger tapping) with the accented syllables of a metrically structured German sentence. The sentence was then repeated without tapping and the participants detected a verb change occurring either on accented or unaccented syllables. Results showed that participants were most successful in detecting the changes when their finger taps were aligned with the accented syllables compared to misalignment, and when the changes also occurred on accented syllables. Thus, finger tapping aligned to accented syllables is a critical factor leading to improved verbal performance. These findings suggest that the benefits of aligning motor and auditory rhythms may be driven by overlapping temporal expectancies. The result would be a maximal enhancement of attentional resources at the rhythmic reference points (i.e., accented syllables) in the speech signal.</p>
<p>One of the open questions is whether the observed auditory-motor effects are confined to German prosody, or generalize to other languages. German is described as a &#x0201C;stress&#x0201D;-language (Jessen, <xref ref-type="bibr" rid="B29">1999</xref>; Wiese, <xref ref-type="bibr" rid="B93">2000</xref>). At the lexical level, each word is learned with a specific pattern of stressed and unstressed syllables. Thus, stress accents may distinguish meaning and thereby fulfill a <italic>contrastive</italic> function in German. In the context of spoken discourse, the German stress-accents also have a <italic>cumulative (i.e., head-marking)</italic> function within their prosodic domain (i.e., feet, prosodic words, and phrases, intonation phrases, etc.), resulting in complex metrical relations between subsequent accents that mark different hierarchical levels, similar to English (Liberman and Prince, <xref ref-type="bibr" rid="B40">1977</xref>; Hayes, <xref ref-type="bibr" rid="B24">1995</xref>). In the German stress system, de-stressing as well as heightened stress are flexibly used to signal communicative deviations from syntactic and semantic default interpretations (e.g., to signal focus; F&#x000E9;ry and Ishihara, <xref ref-type="bibr" rid="B20">2008</xref>; F&#x000E9;ry and K&#x000FC;gler, <xref ref-type="bibr" rid="B21">2008</xref>). Analysis of the profile of stress-accents (i.e., their position and strength) as a key to comprehend the meaning of words and nested discourse units is common in German listeners. Hence, the German prosodic system may be particularly well-suited to drive rhythmic expectancies and to lead to the ensuing benefits on verbal processing. Here we tested whether predictions driven by auditory-motor rhythms extend to French, a language that differs substantially from the prosody of German. French is most often described as lacking lexical stress (Rossi, <xref ref-type="bibr" rid="B75">1980</xref>). Indeed, accentuation in French has predominantly a <italic>demarcative (i.e., boundary-marking)</italic> function within the prosodic domain of the accentual phrase (AP, Vaissi&#x000E8;re, <xref ref-type="bibr" rid="B84">1991</xref>; Jun and Fougeron, <xref ref-type="bibr" rid="B31">2000</xref>, <xref ref-type="bibr" rid="B32">2002</xref>). Primary accents in French are assigned to the last syllable (except Schwa-syllables) of a phrasal segment, thereby marking the right edge of the AP. Secondary accents, though not obligatory, can be found toward the left edge of an AP, depending on the number of syllables and on the lexical composition of the phrase (e.g., Ast&#x000E9;sano, <xref ref-type="bibr" rid="B2">2001</xref>; Welby, <xref ref-type="bibr" rid="B89">2003</xref>, <xref ref-type="bibr" rid="B90">2006</xref>). Hence, accent placement in French is tightly linked to grouping and phrasing, and, ultimately, to syntactic segmentation (e.g., Millotte et al., <xref ref-type="bibr" rid="B51">2008</xref>; Michelas and D&#x00027;Imperio, <xref ref-type="bibr" rid="B49">2015</xref>). Importantly, compared to German, stress conflicts between words and phrases such as stress clash and shift (e.g., Mengel, <xref ref-type="bibr" rid="B48">2000</xref>; Bohn et al., <xref ref-type="bibr" rid="B5">2013</xref>) are more rare (Post, <xref ref-type="bibr" rid="B64">2000</xref>). Peperkamp and Dupoux (<xref ref-type="bibr" rid="B61">2002</xref>) have suggested that because French lacks lexical relevance of accents and has fixed accent placement, French listeners pay less attention to accentual variations as compared to listeners of languages with lexical/variable stress. This intriguing hypothesis is supported by the observation of difficulties in encoding and memorizing stress contrasts (i.e., &#x0201C;stress-deafness&#x0201D;) in native French listeners when learning a foreign language with lexical/variable stress (e.g., Dupoux et al., <xref ref-type="bibr" rid="B16">2008</xref>; Schmidt-Kassow et al., <xref ref-type="bibr" rid="B77">2011</xref>; Domahs et al., <xref ref-type="bibr" rid="B15">2012</xref>; but see also Michelas et al., <xref ref-type="bibr" rid="B50">2016</xref>). Thus, we anticipate that French listeners will attend less to accented syllables while tapping to them than German listeners did. As a result, the benefit of aligning movement and speech rhythms may be less visible in French than in German.</p>
<p>We tested this hypothesis by asking French and German listeners to detect a verbal change in their native language in the context of an aligned or misaligned auditory-motor rhythm. The paradigm is the same as in Falk and Dalla Bella (<xref ref-type="bibr" rid="B18">2016</xref>). A second goal was to closely examine whether individual differences in motor performance could predict the success in this task for both languages.</p></sec>
<sec sec-type="materials and methods" id="s2">
<title>Materials and methods</title>
<sec>
<title>Participants</title>
<p>Thirty-two French-speaking students from the Aix-Marseille University (4 males, <italic>M</italic> &#x0003D; 22.5 years, <italic>SD</italic> &#x0003D; 4.0 years, 3 left-handed) and 32 German-speaking students from the Ludwig-Maximilians-University in Munich (3 males, <italic>M</italic> &#x0003D; 24.6 years, <italic>SD</italic> &#x0003D; 5.5 years, 4 left-handed) took part in the Experiment. The German-speaking group was a subgroup of a larger study (see Falk and Dalla Bella, <xref ref-type="bibr" rid="B18">2016</xref>). None of the participants was a professional musician, although the German speakers overall had more musical experience than the French speakers [French group: 10 participants with music lessons, range &#x0003D; 4&#x02013;20 years; German group: 27 participants with music lessons, range 2&#x02013;18 years, Mann&#x02013;Whitney <italic>U</italic><sub>(63)</sub> &#x0003D; 713.5, <italic>p</italic> &#x0003D; 0.001]. Sixteen additional French-speaking students (7 males, <italic>M</italic> &#x0003D; 21.8 years, <italic>SD</italic> &#x0003D; 2.8 years, 4 left-handed and 1 ambidexter, 9 participants with music lessons, range: 1&#x02013;20 years) and 16 additional German-speaking students (9 males, <italic>M</italic> &#x0003D; 25.0 years, <italic>SD</italic> &#x0003D; 3.3 years, all right-handed, 11 participants with music lessons; range: 1.5&#x02013;12 years) participated in a perceptual control experiment (&#x0201C;Baseline&#x0201D;). All the participants gave informed written consent. The study was approved by the Ethics Committee of Aix-Marseille University.</p>
<sec>
<title>Materials</title>
<p>Twenty-four German and 24 French metrical speech stimuli with regular distributions of accented and unaccented syllables were created (examples in Supplemental Materials). German stimuli (Figure <xref ref-type="fig" rid="F1">1A</xref>) consisted each of 16 syllables with an alternating binary strong-weak (i.e., stressed&#x02013;unstressed syllables) metrical pattern. Syntactically, each stimulus was formed by two short simple sentences (8 syllables each, only main clauses) one of which contained the target word (i.e., a monosyllabic verb) that served to test change detection. The occurrence of the verb in the first or in the second sentence was equally balanced across the stimuli. The verb was always placed on a metrically strong position. French stimuli (Figure <xref ref-type="fig" rid="F1">1B</xref>) consisted of 20 syllables each, divided into four accentual phrases (AP) of five syllables. Each accentual phrase followed a LHLH pattern (Welby, <xref ref-type="bibr" rid="B91">2007</xref>) with an initial and a final accent on the second and on the fifth syllable, respectively. Syntactically, as done with German material, each stimulus included two short simple sentences formed by two accentual phrases each. Across the stimuli, the first and the second sentence contained equally often the monosyllabic target verb. In French, the verb always occurred on an initial accent position.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p><bold>Examples of (A)</bold> German and <bold>(B)</bold> French stimuli. The bold lines in the spectrogram display the F0 contour. The position of the change verb is indicated by the arrow. In the German example, big triangles mark stressed syllables, small triangles unstressed syllables. In the French example, big triangles mark the main final accents, medium-sized triangles the initial accents, and small triangles, the unaccented syllables.</p></caption>
<graphic xlink:href="fpsyg-08-00395-g0001.tif"/>
</fig>
<p>The stimuli were recorded by two native French- or German-speaking female speakers reading at a regular pace (100 beats/min). The speakers were cued by a metronome prior to reading each stimulus to produce an accented syllable at regular temporal intervals, every 600 ms. The recordings were then examined to ensure that accented syllables occurred every 600 ms, on average (&#x000B1;20 ms). To this aim, the perceptual centers (i.e., p-centers) of accented syllables were estimated using an automated procedure proposed by Cummins and Port (<xref ref-type="bibr" rid="B10">1998</xref>). A p-center is defined as the time at which the occurrence of a syllable is perceived, which is most often around the vowel onset (Morton et al., <xref ref-type="bibr" rid="B54">1976</xref>). It can be roughly estimated as the point corresponding to half of the amplitude rise before reaching maximum amplitude for a nuclear vowel (Cummins and Port, <xref ref-type="bibr" rid="B10">1998</xref>). The intervals between the estimated p-centers of accented syllables were calculated, and, if deviating from the 600 ms interval, they were manually adjusted by slightly shortening or lengthening silences or segmental material in the interval, using Praat software (Boersma, <xref ref-type="bibr" rid="B4">2001</xref>). A trial was created in which a stimulus was repeated with a 2-s pause between presentations. In the second presentation (&#x0201C;detection phase&#x0201D;), the target verb was replaced by another verb that had the same morpho-syntactic structure and very similar meaning (e.g., <italic>jault</italic> &#x0201C;yowls&#x0201D;&#x02014;<italic>heult</italic> &#x0201C;howls&#x0201D;; Figure 2, Sturt et al., <xref ref-type="bibr" rid="B82">2004</xref>).</p>
<p>Semantic closeness between the change verbs and the original verbs (on a scale from 1 &#x0003D; very distant to 10 &#x0003D; very close in meaning) was confirmed in a pilot experiment with two groups of 10 native French and German speakers each (see Table <xref ref-type="table" rid="T1">1</xref>). Both verbs were comparable in number of phonemes and frequency (Table <xref ref-type="table" rid="T1">1</xref>). For each language, additional filler trials were created to ensure that participants did not pay selectively attention to the verbs. Fillers had the same metrical and syntactic structure as described above. In 24 fillers, a noun was changed, and in 12 fillers, there was no change. Overall, there were 60 trials, including experimental stimuli and fillers, per language.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p><bold>Characteristics of verb changes for French and German material</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Language (Database)</bold></th>
<th valign="top" align="center" colspan="2" style="border-bottom: thin solid #000000;"><bold>German material (Dlexdb)</bold></th>
<th valign="top" align="center" colspan="2" style="border-bottom: thin solid #000000;"><bold>French material (Lexique-3)</bold></th>
</tr>
<tr>
<th/>
<th valign="top" align="center"><bold>Original verb Mean (<italic>SD</italic>)</bold></th>
<th valign="top" align="center"><bold>Changed verb Mean (<italic>SD</italic>)</bold></th>
<th valign="top" align="center"><bold>Original verb Mean (<italic>SD</italic>)</bold></th>
<th valign="top" align="center"><bold>Changed verb Mean (<italic>SD</italic>)</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Number of phonemes</td>
<td valign="top" align="center">4.1 (0.58)</td>
<td valign="top" align="center">4.3 (0.38)</td>
<td valign="top" align="center">4.5 (2.0)</td>
<td valign="top" align="center">4.3 (1.1)</td>
</tr>
<tr>
<td valign="top" align="left">Frequency</td>
<td valign="top" align="center">11 per million (28)</td>
<td valign="top" align="center">16 per million (49)</td>
<td valign="top" align="center">58 per million (85)</td>
<td valign="top" align="center">41 per million (56)</td>
</tr>
<tr>
<td valign="top" align="left">Semantic distance rating (1&#x02013;10)</td>
<td valign="top" align="center" colspan="2">7.62 (1.25) &#x0201C;semantically close meaning&#x0201D;</td>
<td valign="top" align="center" colspan="2">7.71 (1.02) &#x0201C;semantically close meaning&#x0201D;</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec></sec>
<sec>
<title>Procedure</title>
<p>As described in Falk and Dalla Bella (<xref ref-type="bibr" rid="B18">2016</xref>), participants performed a rhythmic finger tapping task (i.e., synchronization-continuation task; Wing, <xref ref-type="bibr" rid="B92">2002</xref>) combined with a verbal change detection task (Sturt et al., <xref ref-type="bibr" rid="B82">2004</xref>; for an illustration, see Figure <xref ref-type="fig" rid="F2">2</xref>). Prior to each verbal stimulus, participants were asked to tap with the index finger of their dominant hand to 12 isochronous metronome tones (<italic>synchronization phase</italic>; tone duration &#x0003D; 30 ms, Inter-Onset-Interval, IOI &#x0003D; 600 ms). The time of their taps was recorded on the left panel of a Roland SPD-6 MIDI percussion pad. When the metronome stopped, a speech stimulus started. Participants were instructed to continue their tapping at the rate previously indicated by the metronome (<italic>continuation phase</italic>), while carefully listening to the verbal stimulus. The continuation of the taps either resulted in congruent or incongruent alignment of the motor rhythm with the verbal rhythm, depending on the onset of the verbal stimulus. When the alignment condition was <italic>congruent</italic>, the first metrically strong syllable (i.e., its p-center) started 600 ms after the last metronome tone, corresponding to one IOI of the metronome. Thus, participants who continued tapping at the pace of the metronome, aligned their taps to the accented syllables of the speech stimulus. Their taps also occurred on the syllable that contained the change verb. In contrast, when the alignment condition was <italic>incongruent</italic>, the speech stimulus started with a delay of 300 ms, as compared to the congruent condition (Figure <xref ref-type="fig" rid="F2">2</xref>). As a result, participants&#x00027; taps continuing at the pace of the metronome fell between accented syllables and did not coincide with the target verb. During the pause preceding the detection phase, participants stopped tapping and prepared for detecting a verbal change. When perceiving a change, they tapped as fast as possible on the right panel of the percussion pad, thereby stopping the stimulus. At the end of the trial, participants recalled the original and the changed word. Verbal answers were recorded with a head-mounted microphone and written down by the Experimenter. In addition, participants summarized the content of the stimuli every three trials on average to ensure that meaning was processed.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p><bold>Procedure used in the Experiment</bold>. A trial (German) is displayed. Accented positions of the metrical speech pattern are marked by black triangles and unaccented positions by small dots. The position of the verb change is marked by a big &#x0201C;V.&#x0201D; The alignment cue (big black dots) and alignment of finger taps (gray triangles) with the speech are displayed.</p></caption>
<graphic xlink:href="fpsyg-08-00395-g0002.tif"/>
</fig>
<p>Participants were only tested on the stimulus set of their native language. A block of 30 trials (12 stimuli, 12 fillers, and 6 no-change stimuli) per alignment condition was preceded by three practice trials. The stimulus set for each language was organized in 8 randomization lists which were presented equally often under both alignment conditions in counterbalanced order across participants.</p>
<p>A perceptual Baseline condition, without cues and tapping for the French and German stimuli was additionally tested with two other groups of participants in order to evaluate motor effects on detection sensitivity (see Falk and Dalla Bella, <xref ref-type="bibr" rid="B18">2016</xref>). The verbal change detection procedure followed the same protocol and randomization procedure as described above. However, no metronome cue was presented before the stimuli and participants only listened to the stimuli before giving their verbal response.</p>
<p>The tapping Experiment lasted approximately 45 min, and the Baseline assessment 35 min. Both were run on an IBM-compatible computer using MaxMSP 5.1.9. Auditory stimuli were presented via Beyer-Dynamic DT-770 Pro 250 headphones (German group) and Beyer-Dynamic MMX 300 headphones (French group). Testing was done in Munich, Germany, and Aix-en-Provence, France.</p></sec>
<sec>
<title>Tapping analyses<xref ref-type="fn" rid="fn0001"><sup>1</sup></xref></title>
<sec>
<title>General tapping measures</title>
<p>Tapping rate was obtained by calculating the mean inter-tap-interval (mean ITI). The expected mean ITI was 600 ms, as cued by the metronome, in both the synchronization and continuation phases. Motor variability was obtained by computing the Coefficient of Variation of the ITI (CV ITI), namely the standard deviation of the ITI divided by the mean ITI. Higher CV indicates higher motor variability during the task.</p></sec>
<sec>
<title>Synchronization phase</title>
<p>We examined whether participants&#x00027; taps were well-aligned with the metronome tones. To this goal, synchronization accuracy (i.e., the synchronization error and its direction) and synchronization consistency were calculated (e.g., Aschersleben, <xref ref-type="bibr" rid="B1">2002</xref>; Repp and Penel, <xref ref-type="bibr" rid="B70">2004</xref>; Repp, <xref ref-type="bibr" rid="B68">2005</xref>; Sowi&#x00144;ski and Dalla Bella, <xref ref-type="bibr" rid="B80">2013</xref>; Dalla Bella et al., <xref ref-type="bibr" rid="B13">2016</xref>). The synchronization error and its direction were obtained by computing the absolute and the signed (positive or negative) mean delays between the taps and the metronome, respectively. Negative direction of the synchronization error indicates that the taps precede the metronome tone, on average. Synchronization consistency is measured by the standard error (<italic>SE</italic>) of the asynchrony between taps and metronome tones.</p></sec>
<sec>
<title>Continuation phase</title>
<p>Performance in the continuation phase was measured to assess how well the participants followed the instruction to continue tapping at the pace indicated by the metronome. Similar to the measures in the synchronization phase, measures of continuation accuracy and consistency were obtained, in spite of the absence of an explicit pacing stimulus. For accuracy, we measured whether the taps occurred on average before or after the <italic>expected</italic> tap times as indicated by the preceding metronome. Thus, mean signed asynchrony (positive or negative) was calculated between the actual tap times and the <italic>expected</italic> tap times from the synchronization phase. Consistency in the continuation phase was determined as the <italic>SE</italic> of the asynchrony between the actual taps and the <italic>expected</italic> tap times.</p></sec></sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec>
<title>Detection performance</title>
<p>Verbal responses in the detection phase were analyzed by calculating sensitivity (<italic>d</italic>&#x02032;) and response bias (<italic>C</italic>, MacMillan and Creelman, <xref ref-type="bibr" rid="B43">2005</xref>) as done in our previous study (Falk and Dalla Bella, <xref ref-type="bibr" rid="B18">2016</xref>). A Hit occurred when both the changed and the original verb were provided in their semantically and phonetically accurate form. A False alarm occurred when a change was reported in a no-change trial. Data of one French-speaking participant in the main Experiment were discarded because she performed at chance in the detection task (<italic>d</italic>&#x02032; &#x0003D; 0).</p>
<p>We first examined if detection of test words (<italic>d</italic>&#x02032;) was influenced by motor alignment in the tapping task. Data were entered in a 2 &#x000D7; 2 mixed-design Analysis of Variance (ANOVA) taking Congruency (congruent vs. incongruent) as the within-subject factor, and Language (French vs. German) as the between-subject factor. As shown in Figure <xref ref-type="fig" rid="F3">3</xref>, regardless of language, participants were overall more efficient in detecting verbal changes when their taps were congruently aligned to speech accents, as compared to the incongruent alignment [<italic>F</italic><sub>(1, 61)</sub> &#x0003D; 4.024, <italic>p</italic> &#x0003D; 0.049]. Overall, greater sensitivity to change detection was found in the German than in the French-speaking group<xref ref-type="fn" rid="fn0002"><sup>2</sup></xref> [<italic>F</italic><sub>(1, 61)</sub> &#x0003D; 5.15, <italic>p</italic> &#x0003D; 0.027]. The Congruency &#x000D7; Language interaction did not reach significance. Response bias (<italic>C</italic>) was entered into a similar ANOVA. No differences in response bias were found, as indicated by lack of significant effects of Congruency, Language, and their interaction.</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p><bold>Detection sensitivity for change words in congruent and incongruent tapping conditions for French and German speakers</bold>. The perceptual baseline (averaged across language groups) is displayed as a dotted line. Error bars represent 95%-Confidence intervals. Stars indicate significant differences between alignment conditions. Arrows indicate significant differences between the Baseline and the motor alignment conditions.</p></caption>
<graphic xlink:href="fpsyg-08-00395-g0003.tif"/>
</fig>
<p>Second, we compared the tapping data in the congruent and incongruent conditions taken separately to the perceptual Baseline data (French: <italic>d</italic>&#x02032; &#x0003D; 1.51, 95%-CI &#x0003D; &#x000B1;0.469; German: 1.65, 95%-CI &#x0003D; &#x000B1;0.469). Two 2 &#x000D7; 2 ANOVAs were run using Language (French vs. German) and Condition (Motor alignment vs. Baseline) as between-subject factors. Results revealed that the congruent motor alignment enhanced detection sensitivity relative to the Baseline [<italic>F</italic><sub>(1, 91)</sub> &#x0003D; 8.16, <italic>p</italic> &#x0003D; 0.005] in both languages (no main effect of Language, no interaction). No difference in sensitivity was found between the incongruent alignment and the Baseline (<italic>p</italic> &#x0003E; 0.27).</p></sec>
<sec>
<title>Tapping performance<xref ref-type="fn" rid="fn0003"><sup>3</sup></xref></title>
<sec>
<title>General motor performance</title>
<p>To assess motor performance, tapping rate (mean ITI) and motor variability (CV of ITI) in the synchronization and continuation phases were analyzed for all trials. Data were entered in two independent 2 &#x000D7; 2 &#x000D7; 2 mixed-design ANOVAs. Congruency (congruent vs. incongruent) and Phase (synchronization vs. continuation) were the within-subject factors, while Language (French vs. German) was the between-subject factor.</p>
<p>For tapping rate, the triple interaction Congruency &#x000D7; Language &#x000D7; Phase was highly significant [<italic>F</italic><sub>(1, 61)</sub> &#x0003D; 16.15, <italic>p</italic> &#x0003C; 0.001]. This interaction was decomposed by holding Language constant, and by running two additional 2 &#x000D7; 2 ANOVAs. As can be seen in Figure <xref ref-type="fig" rid="F4">4</xref> (right panel), German participants tapped in the vicinity of the expected interval (mean ITI &#x0003D; 599 ms, 95 %-CI &#x0003D; &#x000B1;3 ms) without showing any differences between the synchronization and continuation phase, nor between congruent and incongruent alignments. In contrast, in the French group, analyses revealed a significant Congruency &#x000D7; Phase interaction [<italic>F</italic><sub>(1, 30)</sub> &#x0003D; 32.96, <italic>p</italic> &#x0003C; 0.001; main effect of Congruency: <italic>F</italic><sub>(1, 30)</sub> &#x0003D; 42.37, <italic>p</italic> &#x0003C; 0.001; main effect of Phase: <italic>F</italic><sub>(1, 30)</sub> &#x0003D; 4.58, <italic>p</italic> &#x0003D; 0.041]. French-speaking participants (Figure <xref ref-type="fig" rid="F4">4</xref>, left panel) slightly accelerated when tapping with speech as compared to the metronome in the congruent condition [<italic>F</italic><sub>(1, 30)</sub> &#x0003D; 4.93, <italic>p</italic> &#x0003D; 0.034], but slowed down on average by &#x0007E;15 ms per ITI when tapping to speech in the incongruent condition [<italic>F</italic><sub>(1, 30)</sub> &#x0003D; 11.20, <italic>p</italic> &#x0003D; 0.001].</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p><bold>Tapping rate (mean ITI) in the synchronization phase (with metronome) and the continuation phase (with speech) in the two alignment conditions, for French and German speakers</bold>. The horizontal line indicates the expected ITI at 600 ms. Error bars represent 95%-Confidence intervals. Stars indicate significant differences.</p></caption>
<graphic xlink:href="fpsyg-08-00395-g0004.tif"/>
</fig>
<p>The two language groups did not differ in terms of motor variability (CV ITI<sub>French</sub> &#x0003D; 0.054, CI &#x0003D; &#x000B1;0.005; CV ITI<sub>German</sub> &#x0003D; 0.053, CI &#x0003D; &#x000B1;0.005). Overall, participants&#x00027; motor variability only differed in the continuation phase depending on the congruency condition [Congruency &#x000D7; Phase interaction: <italic>F</italic><sub>(1, 61)</sub> &#x0003D; 8.91, <italic>p</italic> &#x0003D; 0.004; main effects for Congruency: <italic>F</italic><sub>(1, 61)</sub> &#x0003D; 6.02, <italic>p</italic> &#x0003D; 0.017; and Phase: <italic>F</italic><sub>(1, 61)</sub> &#x0003D; 70.00, <italic>p</italic> &#x0003C; 0.001]. Participants&#x00027; motor variability was higher when tapping incongruently to speech in the continuation phase than when tapping congruently [CV<sub>con</sub> &#x0003D; 0.048, CI &#x0003D; &#x000B1;0.004 vs. CV<sub>inc</sub> &#x0003D; 0.064, CI &#x0003D; &#x000B1;0.005, <italic>F</italic><sub>(1, 62)</sub> &#x0003D; 11.35, <italic>p</italic> &#x0003D; 0.001].</p></sec>
<sec>
<title>Tapping in the synchronization phase as a predictor of detection performance</title>
<p>An additional goal of the present study was to examine whether synchronization with the metronome tones was a predictor of participants&#x00027; success in later detection performance. We tested first if the two language groups differed in overall synchronization skills across all trials. Synchronization accuracy (synchronization error and its direction) and consistency (<italic>SE</italic> of asynchrony) were entered into three separate 2 &#x000D7; 2 mixed-design ANOVAs, with Congruency as the within-subject factor and Language as the between-subject factor. Overall, the results showed good synchronization with no differences between language groups or congruency conditions. All the participants tapped with a synchronization error of 57.44 ms on average (95%-CI &#x0003D; &#x000B1;6.72 ms). Their taps rather preceded the metronome tones (synchronization direction &#x0003D; &#x02212;53.15 ms; 95%-CI &#x0003D; &#x000B1;7.52 ms), as typically reported in tapping studies (negative mean asynchrony, e.g., Aschersleben, <xref ref-type="bibr" rid="B1">2002</xref>). Synchronization consistency was within the normal range (mean <italic>SE</italic> &#x0003D; 9.05; 95%-CI &#x0003D; &#x000B1;0.5) for a comparable population of young adults (e.g., Dalla Bella et al., <xref ref-type="bibr" rid="B13">2016</xref>).</p>
<p>After having discarded the possibility of group differences in synchronization skills, we examined whether participants&#x00027; synchronization performance across language groups could predict sensitivity in change detection (<italic>d</italic>&#x02032;) in the congruent and incongruent alignment conditions, respectively. To this aim, we fitted linear regression models in which <italic>d</italic>&#x02032; was the dependent variable and the predictors were Language and the variables reflecting the tapping performance during the synchronization phase: synchronization accuracy (i.e., signed asynchrony), synchronization consistency, tapping rate and motor variability. The best-fitting model is reported below (Table <xref ref-type="table" rid="T2">2</xref>)<xref ref-type="fn" rid="fn0004"><sup>4</sup></xref>. In the congruent condition, <italic>d</italic>&#x02032; was predicted by the direction of synchronization error: the more the participants tapped in advance of the metronome tones in the synchronization phase (i.e., the more negative their signed asynchrony), the higher the <italic>d</italic>&#x02032; for both language groups in the continuation phase. This model was not a good fit to the data from the incongruent condition (<italic>p</italic> &#x0003E; 0.60).</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p><bold>Predictors of sensitivity in change detection (<italic>d</italic>&#x02032;) in the congruent alignment condition</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th/>
<th valign="top" align="center"><bold>B/Beta</bold></th>
<th valign="top" align="center"><bold>Std. Error</bold></th>
<th valign="top" align="center"><italic><bold>t</bold></italic></th>
<th valign="top" align="center"><italic><bold>p</bold></italic></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Constant</td>
<td valign="top" align="center">1.455</td>
<td valign="top" align="center">0.217</td>
<td valign="top" align="center">6.690</td>
<td valign="top" align="center">&#x0003C;0.001<xref ref-type="table-fn" rid="TN2"><sup>&#x0002A;&#x0002A;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left">Language</td>
<td valign="top" align="center">0.578/0.362</td>
<td valign="top" align="center">0.192</td>
<td valign="top" align="center">3.004</td>
<td valign="top" align="center">0.004<xref ref-type="table-fn" rid="TN2"><sup>&#x0002A;&#x0002A;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left">Signed asynchrony</td>
<td valign="top" align="center">&#x02212;0.006/&#x02212;0.259</td>
<td valign="top" align="center">0.003</td>
<td valign="top" align="center">&#x02212;2.151</td>
<td valign="top" align="center">0.036<xref ref-type="table-fn" rid="TN1"><sup>&#x0002A;</sup></xref></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>Model: R<sup>2</sup> &#x0003D; 0.16, F<sub>(2, 60)</sub> &#x0003D; 5.81, p &#x0003D; 0.005, AIC &#x0003D; 147.41, BIC &#x0003D; 155.98</italic>.</p>
<fn id="TN1">
<label>&#x0002A;</label>
<p><italic>p &#x0003C; 0.05</italic>;</p></fn>
<fn id="TN2">
<label>&#x0002A;&#x0002A;</label>
<p><italic>p &#x0003C; 0.01</italic>.</p></fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec>
<title>Tapping in the continuation phase as a predictor of detection performance</title>
<p>Here, we examined whether tapping performance in the continuation phase predicted participants&#x00027; detection performance. As before, we tested potential differences in the tapping task, in terms of accuracy and consistency during continuation in the two language groups with two 2 (Congruency) &#x000D7; 2 (Language) repeated-measures ANOVAs. Figure <xref ref-type="fig" rid="F5">5</xref> shows the results for accuracy (signed asynchrony) for the two language groups. As can be seen, all the participants tapped in advance relative to the expected tap time in the congruent condition compared to the incongruent condition [main effect of Congruency, <italic>F</italic><sub>(1, 61)</sub> &#x0003D; 96.85, <italic>p</italic> &#x0003C; 0.001]. However, continuation accuracy differed between French and German speakers as a function of the alignment condition [Congruency &#x000D7; Language interaction, <italic>F</italic><sub>(1, 61)</sub> &#x0003D; 4.67, <italic>p</italic> &#x0003D; 0.035]. With incongruent alignment, the French-speaking participants tapped later (i.e., after the expected tap time) than the German-speaking participants who were still slightly in advance of the expected tap time [<italic>F</italic><sub>(1, 61)</sub> &#x0003D; 4.52, <italic>p</italic> &#x0003D; 0.038]. All participants, irrespective of language group, were less consistent when their taps were incongruently aligned with the speech accents than when they were congruently aligned [<italic>F</italic><sub>(1, 61)</sub> &#x0003D; 96.85, <italic>p</italic> &#x0003C; 0.001]. No differences in consistency were found between language groups nor interactions with the alignment condition (French group, mean consistency &#x0003D; 23.69, CI &#x0003D; &#x000B1;3.20; German group, mean consistency &#x0003D; 21.81, CI &#x0003D; &#x000B1;3.15).</p>
<fig id="F5" position="float">
<label>Figure 5</label>
<caption><p><bold>Continuation accuracy (signed asynchrony) for French and German speakers in relation to the expected moment of the tap (dotted vertical line at 0 ms) during the continuation phase (i.e., with speech) depending on the alignment condition</bold>. Triangles represent the mean asynchrony, error bars display 95%-Confidence intervals.</p></caption>
<graphic xlink:href="fpsyg-08-00395-g0005.tif"/>
</fig>
<p>Participants&#x00027; tapping performance in the continuation phase for test trials was used to predict detection success (<italic>d</italic>&#x02032;). We fitted linear regression models in which the dependent variable was sensitivity (<italic>d</italic>&#x02032;) and the predictors were language, continuation accuracy, consistency as well as tapping rate and motor variability. Again, the best-fitting model is reported<sup>4</sup>. Tapping in the continuation phase predicted detection performance only in the incongruent condition. The best model (Table <xref ref-type="table" rid="T3">3</xref>) showed that the tapping rate was a significant predictor for <italic>d</italic>&#x02032;, and that the relation between <italic>d</italic>&#x02032; and tapping rate differed with respect to language, as can be seen in Figure <xref ref-type="fig" rid="F6">6</xref>.</p>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p><bold>Predictors of sensitivity in change detection (<italic>d</italic>&#x02032;) in the incongruent alignment condition</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th/>
<th valign="top" align="center"><bold>B/Beta</bold></th>
<th valign="top" align="center"><bold>Std. Error</bold></th>
<th valign="top" align="center"><italic><bold>T</bold></italic></th>
<th valign="top" align="center"><italic><bold>p</bold></italic></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Constant</td>
<td valign="top" align="center">13.336</td>
<td valign="top" align="center">3.951</td>
<td valign="top" align="center">3.375</td>
<td valign="top" align="center">0.001<xref ref-type="table-fn" rid="TN3"><sup>&#x0002A;&#x0002A;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left">Language</td>
<td valign="top" align="center">&#x02212;17.238/&#x02212;9.586</td>
<td valign="top" align="center">5.530</td>
<td valign="top" align="center">&#x02212;3.117</td>
<td valign="top" align="center">0.003<xref ref-type="table-fn" rid="TN3"><sup>&#x0002A;&#x0002A;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left">Tapping rate</td>
<td valign="top" align="center">&#x02212;0.019/&#x02212;0.529</td>
<td valign="top" align="center">0.006</td>
<td valign="top" align="center">&#x02212;2.933</td>
<td valign="top" align="center">0.005<xref ref-type="table-fn" rid="TN3"><sup>&#x0002A;&#x0002A;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left">Language<sup>&#x0002A;</sup>Tapping rate</td>
<td valign="top" align="center">0.029/0.016</td>
<td valign="top" align="center">0.009</td>
<td valign="top" align="center">3.145</td>
<td valign="top" align="center">0.003<xref ref-type="table-fn" rid="TN3"><sup>&#x0002A;&#x0002A;</sup></xref></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>Model: R<sup>2</sup> &#x0003D; 0.17, F<sub>(3, 58)</sub> &#x0003D; 4.00, p &#x0003D; 0.01, AIC &#x0003D; 161.06, BIC &#x0003D; 171.70</italic>.</p>
<fn id="TN3">
<label>&#x0002A;&#x0002A;</label>
<p><italic>p &#x0003C; 0.01</italic>.</p></fn>
</table-wrap-foot>
</table-wrap>
<fig id="F6" position="float">
<label>Figure 6</label>
<caption><p><bold>Individual detection performance (<italic>d</italic>&#x02032;) in relation to tapping rate (mean ITI) in the continuation phase/incongruent alignment by language group</bold>. The regression lines are displayed in bold. The dotted lines mark the expected ITI.</p></caption>
<graphic xlink:href="fpsyg-08-00395-g0006.tif"/>
</fig>
<p>In light of these results, we further examined whether there was a relation between tapping rate and continuation accuracy in test trials for the incongruent alignment condition. Note that there are overall differences in continuation accuracy and also tapping rate between language groups, as observed before (see Figures <xref ref-type="fig" rid="F4">4</xref>, <xref ref-type="fig" rid="F5">5</xref>). Simple linear regressions were performed separately for each language group with tapping rate as the predictor and continuation accuracy as the dependent variable. Tapping rate accounted for a high proportion of variance in continuation accuracy for the incongruent alignment for both French [<italic>R</italic><sup>2</sup> &#x0003D; 0.58, <italic>F</italic><sub>(1, 29)</sub> &#x0003D; 39.2, <italic>p</italic> &#x0003C; 0.001] and German [<italic>R</italic><sup>2</sup> &#x0003D; 0.56, <italic>F</italic><sub>(1, 30)</sub> &#x0003D; 38.8, <italic>p</italic> &#x0003C; 0.001] participants, as shown in Figure <xref ref-type="fig" rid="F7">7</xref>. Thus, participants tapping at a slower rate than expected were also those who lagged more behind the expected moment of the tap in the incongruent continuation phase.</p>
<fig id="F7" position="float">
<label>Figure 7</label>
<caption><p><bold>Continuation accuracy (signed asynchrony) in relation to tapping rate (mean ITI) in the continuation phase (incongruent alignment) for the two language groups</bold>. The regression lines are displayed in bold. The dotted lines mark the expected moment of tap on the y-axis and the expected tapping rate (continued from the synchronization phase) on the x-axis.</p></caption>
<graphic xlink:href="fpsyg-08-00395-g0007.tif"/>
</fig></sec></sec></sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>The aim of the present study was to test whether the beneficial effects of aligning a motor rhythm to a speech rhythm on verbal processing are found in rhythmically diverse languages (i.e., French, a non-stress language vs. German, a lexical stress language). In addition, we investigated the links between individual motor performance and benefits in speech processing. French and German native speakers were cued to tap their finger with metrical speech stimuli they heard in their native language. Tapping was either aligned with the speech accents (congruently) or non-aligned (incongruently; see Falk and Dalla Bella, <xref ref-type="bibr" rid="B18">2016</xref>). After tapping, participants detected subtle verbal changes in the speech stimuli.</p>
<p>Our results showed that native speakers of both a lexical stress- and a non-stress language were more successful in detecting subtle verbal changes in speech, when their finger taps were congruently aligned with the accented syllables of the sentences compared to incongruent alignment and to a perceptual baseline. Thus, the benefits of aligning motor and speech rhythms on verbal processing, already shown in German (Falk and Dalla Bella, <xref ref-type="bibr" rid="B18">2016</xref>), extend to a non-stress language (French). Generalization of the effect to rhythmically different languages supports the idea that motor alignment with prominent syllables in speech enhances attending to expected moments in time (e.g., Chemin et al., <xref ref-type="bibr" rid="B8">2014</xref>), and thereby verbal processing. Although French and German accentuation substantially differs in prosodic embedding, function, and acoustic correlates, the present findings support the idea that both types of rhythmic organization similarly serve as anchors for linking perception and action in time through shared predictions in a metrical context. This is in line with previous results pointing to a close link between speech prominences and manual movements (Kelso et al., <xref ref-type="bibr" rid="B33">1985</xref>; Rochet-Capellan et al., <xref ref-type="bibr" rid="B72">2008</xref>; Parrell et al., <xref ref-type="bibr" rid="B59">2014</xref>). Yet, note that the present study did not allow us to directly compare sensitivity in French vs. German listeners, as the material in the two languages was very different in terms of semantics, syntax, and phonetics/phonology.</p>
<p>We further investigated which aspects of the motor performance can predict individual success in the verbal task. We identified two motor predictors for verbal detection, synchronization performance with the cue and tapping rate during continuation, which differed depending on the alignments between motor and auditory rhythms. First, when rhythms were aligned (congruent condition), participants who tapped earlier to the metronome tones preceding the verbal stimulus had more success in detecting a verbal change than the other participants. Tapping in advance of a discrete, periodically recurring and, thereby, <italic>expected</italic> sound event is a common finding in tapping research (negative mean asynchrony; Fraisse and Voillaume, <xref ref-type="bibr" rid="B22">1971</xref>; Aschersleben, <xref ref-type="bibr" rid="B1">2002</xref>; Repp and Su, <xref ref-type="bibr" rid="B71">2013</xref>). It is indicative of anticipatory behavior during synchronization (Repp, <xref ref-type="bibr" rid="B68">2005</xref>; van der Steen and Keller, <xref ref-type="bibr" rid="B85">2013</xref>). The possibility that expectancies emerging during the synchronization phase can influence subsequent speech processing is supported by brain research on auditory cueing with speech (e.g., Cason and Sch&#x000F6;n, <xref ref-type="bibr" rid="B7">2012</xref>). In a recent perceptual EEG-study with the same French material (Falk et al., <xref ref-type="bibr" rid="B19">in press</xref>), sentences were cued by an auditory cue matching or non-matching the rhythm of speech (no motor condition). Akin to the present study, the authors found that participants&#x00027; success in behavioral response (a word memory task) was correlated with neural activity during the cueing but not during the speech phase. In particular, participants whose neural oscillatory activity showed better phase-locking (i.e., &#x0201C;neural entrainment,&#x0201D; Lakatos et al., <xref ref-type="bibr" rid="B36">2008</xref>) to the rhythm of the cue, also had higher success in the following speech memory task. Neural entrainment has been described as a process that drives neural excitability toward expected times and thereby engages higher attentional resources at these points in time (e.g., Calderone et al., <xref ref-type="bibr" rid="B6">2014</xref>). This neural effect on attending is even more pronounced when motor synchronization is added (e.g., Morillon et al., <xref ref-type="bibr" rid="B53">2014</xref>). Therefore, in our experiment, participants generating stronger expectancies during the synchronization phase may also be those that are more likely to maintain these expectancies during the continuation phase and having higher attentional resources to allocate. Thus, they may be more prone to get cognitive and neural benefits from rhythmically induced attending to speech when temporal expectancies are met by the speech structure (i.e., in the congruent condition). By contrast, in the incongruent condition, expectancies from the synchronization phase cannot be used to enhance attending because of the subsequent misalignment of taps and speech accent structure. The misalignment may disrupt expectancies directed to the relevant verbs and, hence, no relation between the synchronization phase and detection performance is found. Finally, it is interesting that we failed to observe a relation between continuation accuracy and verbal detection in the congruent condition. This may be linked to additional processes intervening during speech processing (e.g., comprehension, syntactic, or semantic processes) or to task-related factors (e.g., participants were not explicitly instructed to synchronize with stressed syllables while listening to speech). These possibilities deserve further investigation in future studies.</p>
<p>Second, when rhythms were not aligned (incongruent condition), tapping rate in the continuation phase (i.e., while participants listened to speech) predicted detection of a verb change, but differently depending on the language. Slowing down tapping in the continuation phase worsened detection in French speakers, while this was not the case in German speakers. This group difference may be linked to the fact that French speakers generally showed a considerable drift in tapping rate (a deceleration of 15 ms per ITI, on average) in the incongruent alignment condition which naturally also affected continuation accuracy. German speakers did not show any drift. Moreover, slower tapping rate in the continuation phase was associated with more positive asynchrony (i.e., lag) relative to the expected moments of the tap. In sum, French speakers, unlike German speakers, tended to lag behind the expected moment of the taps when motor and speech rhythms were incongruently aligned and this drift was linked to worse detection performance. In the following, we will discuss different explanations that can account for these findings.</p>
<p>A relatively trivial explanation is that pre-existing differences in musical experience between language groups may have led to worse tapping performance. However, musical experience should have produced group differences in all aspects of tapping performance, which was not the case (e.g., in the synchronization phase). Moreover, including musical experience as an additional predictor in the regression models did not change the pattern of results. Hence, differences in musical experience are unlikely to account for the observed group differences.</p>
<p>A more interesting explanation is that a dual task (i.e., tapping &#x0002B; listening to speech) during the continuation phase affected French and German speakers differently in our experiment. Generally, as already shown in Falk and Dalla Bella (<xref ref-type="bibr" rid="B18">2016</xref>), dual task conditions when auditory and motor rhythms are aligned lead to <italic>enhanced</italic> detection performance compared to a single task perceptual baseline. This finding indicates efficient integration of motor with speech accent information when both are temporally aligned. Similar beneficial multisensory effects of manual gesturing were reported in speech perception (Holle et al., <xref ref-type="bibr" rid="B25">2012</xref>; Biau and Soto-Faraco, <xref ref-type="bibr" rid="B3">2013</xref>; Ito et al., <xref ref-type="bibr" rid="B27">2014</xref>). In contrast, incongruent alignment, by temporally dissociating prominent verbal and motor information, creates less ideal conditions for efficient auditory-motor integration. Although, a stable rhythmic relation may still be established between speech and motor rhythms when finger taps are cued to occur at the anti-phase between accents (see Vos and Helsper, <xref ref-type="bibr" rid="B87">1992</xref>; Volman and Geuze, <xref ref-type="bibr" rid="B86">2000</xref>; Repp, <xref ref-type="bibr" rid="B68">2005</xref>), more difficulties in maintaining a stable relation may be encountered by participants. In our study, both higher motor variability in the continuation phase and lower continuation consistency were indicators of increased difficulty (or lower stability) in this condition. On the other hand, it appears that destabilizing the speech-motor relation in the incongruent alignment condition showed interesting differences between language groups, possibly because listeners had to readjust their coordination with the auditory stimulus. An important difference was the remarkable tapping deceleration found in French, but not in German speakers. One possibility is that, in the German material, the alternating strong-weak pattern made it easier for listeners to track strong syllables as reference points when tapping on the weak syllable between them. In the French material, the ternary pattern may have made it more difficult to find the anchor for tapping between two metrically strong syllables. However, there is another explanation that takes accentual properties of the two languages into account. Previous studies reported that a concurrent speech or musical rhythm (distractor sequence) makes listeners deviate from a synchronized tapping pattern to a metronome (target sequence; Dalla Bella et al., <xref ref-type="bibr" rid="B12">2013</xref>). In this kind of task, a lagging distractor rhythm typically leads to positive asynchronies in concurrent synchronization performance (Repp, <xref ref-type="bibr" rid="B67">2003</xref>; Repp and Penel, <xref ref-type="bibr" rid="B70">2004</xref>). Similarly, positive asynchronies were observed in our experiment during incongruent continuation tapping, particularly for the French speakers that also showed significant deceleration. This suggests that properties inherent to the language-specific metrical pattern may have attracted the French speakers more to the upcoming speech accent(s) than the German speakers. Final accents (right-edge of the AP) could have played a major role in this process. Indeed, these final accents feature higher and steeper rises and vocalic lengthening and appear to be more prominent than initial accents (left-edge of the AP, e.g., Rolland and L&#x00153;venbruck, <xref ref-type="bibr" rid="B73">2002</xref>). In addition, in corpora of colloquial French, initial accents are more optional and seem to depend on stylistic variation in contrast to the obligatory and highly predictable phrase-final accents (e.g., Ast&#x000E9;sano, <xref ref-type="bibr" rid="B2">2001</xref>). This imbalance in accentual acoustics and predictability may have led French speakers to perceive more prominently the periodicity of primary (i.e., final) accents occurring at the right edges of the AP in our study. These accents occurred at a periodicity of 1,200 ms, and thereby, on a higher metrical level compared to the 600-ms periodicity of both initial and final accents. Note that, in a previous study, French speakers were also more likely to spontaneously tap to a higher periodicity in metrical speech compared to English speakers, although it was not possible to identify the specific linguistic landmarks attracting the tapping in this study (Lidji et al., <xref ref-type="bibr" rid="B41">2011</xref>). In sum, tapping deceleration for French speakers observed in the incongruent condition may have been driven by attending to the higher metrical level of final accents (i.e., to a higher periodicity). Moreover, as the verbal changes exclusively occurred on initial accents, participants with higher attraction to final accents may have displayed worse detection performance. Future studies, placing verbal changes in final and initial AP positions, will help to examine this possibility. Alternatively, including languages with different accentual properties in a future study (such as Spanish, which possesses lexical stress, but different acoustic and positional properties than French and German, e.g., Toro et al., <xref ref-type="bibr" rid="B83">2009</xref>) may further clarify the potential of motor alignment tasks to inform us about the perception and adaption to accentual salience.</p>
<p>Finally, it is noteworthy that the congruent alignment condition did not produce a similar drift in French speakers. Previous synchronization studies using finger tapping have shown that <italic>1:n</italic> metrical subdivisions are generally not deleterious to synchronization capacity, rather the contrary, provided that different periodicities are coupled in a hierarchical system (Large and Jones, <xref ref-type="bibr" rid="B37">1999</xref>; Large and Palmer, <xref ref-type="bibr" rid="B38">2002</xref>; Repp, <xref ref-type="bibr" rid="B69">2008</xref>). Thereby, the congruent condition may have provided a metrical template with clear nesting of temporal reference frames during auditory-motor integration for French speakers even when they were attracted by the higher metrical level.</p>
<p>To conclude, differences in auditory-motor coordination between French and German speakers open interesting perspectives for further investigating differences in accent perception and metrical reference frames for multisensory processes. Overall, our results support recent models of expectancy-driven speech perception underscoring the role of the motor system. Our findings are compatible with Kotz and Schwartze&#x00027;s (<xref ref-type="bibr" rid="B34">2010</xref>, <xref ref-type="bibr" rid="B35">2016</xref>) subcortical-cortico framework for speech perception. The model encompasses a few critical regions typically involved in motor control, such as the (pre)supplementary motor area and basal ganglia and cerebellar circuitries. According to the model, the motor timing elements together with the other elements of the neural network converge in their function to optimize predictive timing of verbal behavior. Thereby, the network serves to precisely coordinate speech perception and production in time, and also explains the role of auditory-motor coupling and learning during language acquisition. Thus, motor benefits in speech perception such as found in our study may be the expression of an enhanced stimulation of these underlying neural mechanisms of predictive timing.</p>
<p>Finally, our findings are likely to extend to those ecological settings, whereby metrical speech and auditory-motor interactions are used to foster verbal memory and learning. Children&#x00027;s lore, in particular, exploits metrical speech with synchronized movements (e.g., hand clapping, stamping) in a wide variety of games that enhance children&#x00027;s social and verbal skills (Opie and Opie, <xref ref-type="bibr" rid="B56">1988</xref>). Oratory and joint speech in groups are other domains in which auditory-motor coupling with speech is naturally found which may help to convey verbal messages to a group and improve social cohesion and inter-subjectivity (e.g., Cummins, <xref ref-type="bibr" rid="B11">2014</xref>). Ultimately, our results may encourage novel multisensory rhythm-based interventions that are currently under investigation for fostering language acquisition and learning in developmental populations with speech and language disorders, such as dyslexic children (Sch&#x000F6;n and Tillmann, <xref ref-type="bibr" rid="B78">2015</xref>) or autistic children (Wan et al., <xref ref-type="bibr" rid="B88">2011</xref>). To conclude, the present results help to advance our understanding of the language-specific bases and learning of synchronized auditory-motor rhythms and predictive timing in multisensory speech processing.</p></sec>
<sec id="s5">
<title>Author contributions</title>
<p>SF and SDB designed the study. SF and CV developed the material and collected the data. All the authors contributed to data analysis. SF and SDB outlined the article and all the authors contributed to the final manuscript.</p>
<sec>
<title>Conflict of interest statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p></sec>
</sec>
</body>
<back>
<ack><p>This work was supported by the People Programme (Marie Curie Actions) of the European Union&#x00027;s Seventh Framework Programme (FP7/2007&#x02013;2013) under REA grant agreement No. 327586 to SF, the LMU Munich&#x00027;s Institutional Strategy LMUexcellent within the framework of the German Excellence Initiative to SF, and the Institut Universitaire de France (IUF) to SDB. Both authors were financially supported by a bilateral travel grant from the DAAD and Polish NCN (no. 2011/01/M/HS6/04158). We thank the IPS (LMU Munich), Franziska Holzheimer, Elena Maslow, Karine Valette, and Virginie Epting for help with stimulus recording and testing, and Amandine Michelas, Pauline Welby, Christine Meunier, and Mireille Bastien-Toniazzo for helpful discussions.</p>
</ack>
<sec sec-type="supplementary-material" id="s6">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="http://journal.frontiersin.org/article/10.3389/fpsyg.2017.00395/full#supplementary-material">http://journal.frontiersin.org/article/10.3389/fpsyg.2017.00395/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Audio1.WAV" mimetype="audio/x-wav" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Audio2.WAV" mimetype="audio/x-wav" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Aschersleben</surname> <given-names>G.</given-names></name></person-group> (<year>2002</year>). <article-title>Temporal control of movements in sensorimotor synchronization</article-title>. <source>Brain Cogn.</source> <volume>48</volume>, <fpage>66</fpage>&#x02013;<lpage>79</lpage>. <pub-id pub-id-type="doi">10.1006/brcg.2001.1304</pub-id><pub-id pub-id-type="pmid">11812033</pub-id></citation>
</ref>
<ref id="B2">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Ast&#x000E9;sano</surname> <given-names>C.</given-names></name></person-group> (<year>2001</year>). <source>Rythme et Accentuation en Fran&#x000E7;ais. Invariance et Variabilit&#x000E9; Stylistique</source>. <publisher-loc>Paris</publisher-loc>: <publisher-name>Editions L&#x00027;Harmattan</publisher-name>.</citation>
</ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Biau</surname> <given-names>E.</given-names></name> <name><surname>Soto-Faraco</surname> <given-names>S.</given-names></name></person-group> (<year>2013</year>). <article-title>Beat gestures modulate auditory integration in speech perception</article-title>. <source>Brain Lang.</source> <volume>124</volume>, <fpage>143</fpage>&#x02013;<lpage>152</lpage>. <pub-id pub-id-type="doi">10.1016/j.bandl.2012.10.008</pub-id><pub-id pub-id-type="pmid">23333667</pub-id></citation>
</ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Boersma</surname> <given-names>P.</given-names></name></person-group> (<year>2001</year>). <article-title>Praat, a system for doing phonetics by computer</article-title>. <source>Glot Int.</source> <volume>5</volume>, <fpage>341</fpage>&#x02013;<lpage>345</lpage>.</citation>
</ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bohn</surname> <given-names>K.</given-names></name> <name><surname>Knaus</surname> <given-names>J.</given-names></name> <name><surname>Wiese</surname> <given-names>R.</given-names></name> <name><surname>Domahs</surname> <given-names>U.</given-names></name></person-group> (<year>2013</year>). <article-title>The influence of rhythmic (ir)regularities on speech processing: evidence from an ERP study on German phrases</article-title>. <source>Neuropsychologia</source> <volume>51</volume>, <fpage>760</fpage>&#x02013;<lpage>771</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuropsychologia.2013.01.006</pub-id><pub-id pub-id-type="pmid">23333869</pub-id></citation>
</ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Calderone</surname> <given-names>D. J.</given-names></name> <name><surname>Lakatos</surname> <given-names>P.</given-names></name> <name><surname>Butler</surname> <given-names>P. D.</given-names></name> <name><surname>Castellanos</surname> <given-names>F. X.</given-names></name></person-group> (<year>2014</year>). <article-title>Entrainment of neural oscillations as a modifiable substrate of attention</article-title>. <source>Trends Cogn. Sci.</source> <volume>18</volume>, <fpage>300</fpage>&#x02013;<lpage>309</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2014.02.005</pub-id><pub-id pub-id-type="pmid">24630166</pub-id></citation>
</ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cason</surname> <given-names>N.</given-names></name> <name><surname>Sch&#x000F6;n</surname> <given-names>D.</given-names></name></person-group> (<year>2012</year>). <article-title>Rhythmic priming enhances the phonological processing of speech</article-title>. <source>Neuropsychologia</source> <volume>50</volume>, <fpage>2652</fpage>&#x02013;<lpage>2658</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuropsychologia.2012.07.018</pub-id><pub-id pub-id-type="pmid">22828660</pub-id></citation>
</ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chemin</surname> <given-names>B.</given-names></name> <name><surname>Mouraux</surname> <given-names>A.</given-names></name> <name><surname>Nozaradan</surname> <given-names>S.</given-names></name></person-group> (<year>2014</year>). <article-title>Body movement selectively shapes the neural representation of musical rhythms</article-title>. <source>Psychol. Sci.</source> <volume>25</volume>, <fpage>2147</fpage>&#x02013;<lpage>2159</lpage>. <pub-id pub-id-type="doi">10.1177/0956797614551161</pub-id><pub-id pub-id-type="pmid">25344346</pub-id></citation>
</ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Conradi</surname> <given-names>N.</given-names></name> <name><surname>Abel</surname> <given-names>C.</given-names></name> <name><surname>Frisch</surname> <given-names>S.</given-names></name> <name><surname>Kell</surname> <given-names>C. A.</given-names></name> <name><surname>Kaiser</surname> <given-names>J.</given-names></name> <name><surname>Schmidt-Kassow</surname> <given-names>M.</given-names></name></person-group> (<year>2016</year>). <article-title>Actively but not passively synchronized motor activity amplifies predictive timing</article-title>. <source>Neuroimage</source> <volume>139</volume>, <fpage>211</fpage>&#x02013;<lpage>217</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuroimage.2016.06.033</pub-id><pub-id pub-id-type="pmid">27329809</pub-id></citation>
</ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cummins</surname> <given-names>F.</given-names></name> <name><surname>Port</surname> <given-names>R. F.</given-names></name></person-group> (<year>1998</year>). <article-title>Rhythmic constraints on stress timing in English</article-title>. <source>J. Phon.</source> <volume>26</volume>, <fpage>145</fpage>&#x02013;<lpage>171</lpage>. <pub-id pub-id-type="doi">10.1006/jpho.1998.0070</pub-id></citation>
</ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cummins</surname> <given-names>F.</given-names></name></person-group> (<year>2014</year>). <article-title>Voice, (inter-)subjectivity, and real-time recurrent interaction</article-title>. <source>Front. Psychol.</source> <volume>5</volume>:<fpage>760</fpage>. <pub-id pub-id-type="doi">10.3389/fpsyg.2014.00760</pub-id><pub-id pub-id-type="pmid">25101028</pub-id></citation>
</ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dalla Bella</surname> <given-names>S.</given-names></name> <name><surname>Bialu&#x00144;ska</surname> <given-names>A.</given-names></name> <name><surname>Sowi&#x00144;ski</surname> <given-names>J.</given-names></name></person-group> (<year>2013</year>). <article-title>Why movement is captured by music, but less by speech: role of temporal regularity</article-title>. <source>PLoS ONE</source> <volume>8</volume>:<fpage>e71945</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0071945</pub-id><pub-id pub-id-type="pmid">23936534</pub-id></citation>
</ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dalla Bella</surname> <given-names>S.</given-names></name> <name><surname>Farrugia</surname> <given-names>N.</given-names></name> <name><surname>Benoit</surname> <given-names>C.-E.</given-names></name> <name><surname>B&#x000E9;gel</surname> <given-names>V.</given-names></name> <name><surname>Verga</surname> <given-names>L.</given-names></name> <name><surname>Harding</surname> <given-names>E.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>BAASTA: Battery for the Assessment of Auditory Sensorimotor and Timing Abilities</article-title>. <source>Behav. Res. Methods.</source> [Epub ahead of print]. <pub-id pub-id-type="doi">10.3758/s13428-016-0773-6</pub-id><pub-id pub-id-type="pmid">27443353</pub-id></citation>
</ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dilley</surname> <given-names>M. C.</given-names></name> <name><surname>McAuley</surname> <given-names>J. D.</given-names></name></person-group> (<year>2008</year>). <article-title>Distal prosodic context affects word segmentation and lexical processing</article-title>. <source>J. Mem. Lang.</source> <volume>59</volume>, <fpage>294</fpage>&#x02013;<lpage>311</lpage>. <pub-id pub-id-type="doi">10.1016/j.jml.2008.06.006</pub-id></citation>
</ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Domahs</surname> <given-names>U.</given-names></name> <name><surname>Knaus</surname> <given-names>J.</given-names></name> <name><surname>Orzechowska</surname> <given-names>P.</given-names></name> <name><surname>Wiese</surname> <given-names>R.</given-names></name></person-group> (<year>2012</year>). <article-title>Stress &#x0201C;deafness&#x0201D; in a language with fixed word stress: an ERP study on Polish</article-title>. <source>Front. Psychol.</source> <volume>3</volume>:<fpage>439</fpage>. <pub-id pub-id-type="doi">10.3389/fpsyg.2012.00439</pub-id><pub-id pub-id-type="pmid">23125839</pub-id></citation>
</ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dupoux</surname> <given-names>E.</given-names></name> <name><surname>Sebasti&#x000E1;n-Gall&#x000E9;s</surname> <given-names>N.</given-names></name> <name><surname>Navarrete</surname> <given-names>E.</given-names></name> <name><surname>Peperkamp</surname> <given-names>S.</given-names></name></person-group> (<year>2008</year>). <article-title>Persistent stress &#x0201C;deafness&#x0201D;: the case of French learners of Spanish</article-title>. <source>Cognition</source> <volume>106</volume>, <fpage>682</fpage>&#x02013;<lpage>706</lpage>. <pub-id pub-id-type="doi">10.1016/j.cognition.2007.04.001</pub-id><pub-id pub-id-type="pmid">17592731</pub-id></citation>
</ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ellis</surname> <given-names>R. J.</given-names></name> <name><surname>Jones</surname> <given-names>M. R.</given-names></name></person-group> (<year>2010</year>). <article-title>Rhythmic context modulates foreperiod effects</article-title>. <source>Atten. Percept. Psychophys.</source> <volume>72</volume>, <fpage>2274</fpage>&#x02013;<lpage>2288</lpage>. <pub-id pub-id-type="doi">10.3758/BF03196701</pub-id><pub-id pub-id-type="pmid">21097869</pub-id></citation>
</ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Falk</surname> <given-names>S.</given-names></name> <name><surname>Dalla Bella</surname> <given-names>S.</given-names></name></person-group> (<year>2016</year>). <article-title>It is better when expected: aligning verbal and motor rhythms enhances verbal processing</article-title>. <source>Lang. Cogn. Neurosci.</source> <volume>31</volume>, <fpage>699</fpage>&#x02013;<lpage>708</lpage>. <pub-id pub-id-type="doi">10.1080/23273798.2016.1144892</pub-id></citation>
</ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Falk</surname> <given-names>S.</given-names></name> <name><surname>Lanzilotti</surname> <given-names>C.</given-names></name> <name><surname>Sch&#x000F6;n</surname> <given-names>D.</given-names></name></person-group> (<year>in press</year>). <article-title>Tuning neural phase entrainment to speech</article-title>. <source>J. Cogn. Neurosci</source>.</citation>
</ref>
<ref id="B20">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>F&#x000E9;ry</surname> <given-names>C.</given-names></name> <name><surname>Ishihara</surname> <given-names>S.</given-names></name></person-group> (<year>2008</year>). <article-title>How focus and givenness shape prosody</article-title>, in <source>Information Structure from Different Perspectives</source>, eds <person-group person-group-type="editor"><name><surname>Zimmerman</surname> <given-names>M.</given-names></name> <name><surname>F&#x000E9;ry</surname> <given-names>C.</given-names></name></person-group>(<publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>), <fpage>36</fpage>&#x02013;<lpage>63</lpage>.</citation>
</ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>F&#x000E9;ry</surname> <given-names>C.</given-names></name> <name><surname>K&#x000FC;gler</surname> <given-names>F.</given-names></name></person-group> (<year>2008</year>). <article-title>Pitch accent scaling on given, new and focused constituents in German</article-title>. <source>J. Phon.</source> <volume>36</volume>, <fpage>680</fpage>&#x02013;<lpage>703</lpage>. <pub-id pub-id-type="doi">10.1016/j.wocn.2008.05.001</pub-id></citation>
</ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fraisse</surname> <given-names>P.</given-names></name> <name><surname>Voillaume</surname> <given-names>C.</given-names></name></person-group> (<year>1971</year>). <article-title>Les r&#x000E9;p&#x000E8;res du sujet dans la synchronisation et dans la pseudo-synchronisation</article-title>. <source>L&#x00027;Ann&#x000E9;e Psychol.</source> <volume>71</volume>, <fpage>359</fpage>&#x02013;<lpage>369</lpage>. <pub-id pub-id-type="doi">10.3406/psy.1971.27747</pub-id></citation>
</ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fujioka</surname> <given-names>T.</given-names></name> <name><surname>Zendel</surname> <given-names>B. R.</given-names></name> <name><surname>Ross</surname> <given-names>B.</given-names></name></person-group> (<year>2010</year>). <article-title>Endogenous neuromagnetic activity for mental hierarchy of timing</article-title>. <source>J. Neurosci.</source> <volume>30</volume>, <fpage>3458</fpage>&#x02013;<lpage>3466</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.3086-09.2010</pub-id><pub-id pub-id-type="pmid">20203205</pub-id></citation>
</ref>
<ref id="B24">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Hayes</surname> <given-names>B.</given-names></name></person-group> (<year>1995</year>). <source>Metrical Stress Theory: Principles and Case Studies</source>. <publisher-loc>Chicago, IL</publisher-loc>: <publisher-name>University of Chicago Press</publisher-name>.</citation>
</ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Holle</surname> <given-names>H.</given-names></name> <name><surname>Obermeier</surname> <given-names>C.</given-names></name> <name><surname>Schmidt-Kassow</surname> <given-names>M.</given-names></name> <name><surname>Friederici</surname> <given-names>A. D.</given-names></name> <name><surname>Ward</surname> <given-names>J.</given-names></name> <name><surname>Gunter</surname> <given-names>T. C.</given-names></name></person-group> (<year>2012</year>). <article-title>Gesture facilitates the syntactic analysis of speech</article-title>. <source>Front. Psychol.</source> <volume>3</volume>:<fpage>74</fpage>. <pub-id pub-id-type="doi">10.3389/fpsyg.2012.00074</pub-id><pub-id pub-id-type="pmid">22457657</pub-id></citation>
</ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hommel</surname> <given-names>B.</given-names></name> <name><surname>M&#x000FC;sseler</surname> <given-names>J.</given-names></name> <name><surname>Aschersleben</surname> <given-names>G.</given-names></name> <name><surname>Prinz</surname> <given-names>W.</given-names></name></person-group> (<year>2001</year>). <article-title>The Theory of Event Coding (TEC): a framework for perception and action planning</article-title>. <source>Behav. Brain Sci.</source> <volume>24</volume>, <fpage>849</fpage>&#x02013;<lpage>878</lpage>. <pub-id pub-id-type="doi">10.1017/S0140525X01000103</pub-id><pub-id pub-id-type="pmid">12239891</pub-id></citation>
</ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ito</surname> <given-names>T.</given-names></name> <name><surname>Gracco</surname> <given-names>V. L.</given-names></name> <name><surname>Ostry</surname> <given-names>D. J.</given-names></name></person-group> (<year>2014</year>). <article-title>Temporal factors affecting somatosensory-auditory interactions in speech processing</article-title>. <source>Front. Psychol.</source> <volume>5</volume>:<fpage>1198</fpage>. <pub-id pub-id-type="doi">10.3389/fpsyg.2014.01198</pub-id><pub-id pub-id-type="pmid">25452733</pub-id></citation>
</ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Janata</surname> <given-names>P.</given-names></name> <name><surname>Tomic</surname> <given-names>S. T.</given-names></name> <name><surname>Haberman</surname> <given-names>J. M.</given-names></name></person-group> (<year>2012</year>). <article-title>Sensorimotor coupling in music and the psychology of the groove</article-title>. <source>J. Exp. Psychol.</source> <volume>141</volume>, <fpage>54</fpage>&#x02013;<lpage>75</lpage>. <pub-id pub-id-type="doi">10.1037/a0024208</pub-id><pub-id pub-id-type="pmid">21767048</pub-id></citation>
</ref>
<ref id="B29">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Jessen</surname> <given-names>M.</given-names></name></person-group> (<year>1999</year>). <article-title>German</article-title>, in <source>Word Prosodic Systems in the Languages of Europe</source>, ed <person-group person-group-type="editor"><name><surname>van der Hulst</surname> <given-names>H.</given-names></name></person-group>(<publisher-loc>Berlin; New York, NY</publisher-loc>: <publisher-name>Mouton de Gruyter</publisher-name>), <fpage>515</fpage>&#x02013;<lpage>544</lpage>.</citation>
</ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jones</surname> <given-names>M.</given-names></name> <name><surname>Moynihan</surname> <given-names>H.</given-names></name> <name><surname>MacKenzie</surname> <given-names>N.</given-names></name> <name><surname>Puente</surname> <given-names>J.</given-names></name></person-group> (<year>2002</year>). <article-title>Temporal aspects of stimulus-driven attending in dynamic arrays</article-title>. <source>Psychol. Sci.</source> <volume>13</volume>, <fpage>313</fpage>&#x02013;<lpage>319</lpage>. <pub-id pub-id-type="doi">10.1111/1467-9280.00458</pub-id><pub-id pub-id-type="pmid">12137133</pub-id></citation>
</ref>
<ref id="B31">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Jun</surname> <given-names>S.-A.</given-names></name> <name><surname>Fougeron</surname> <given-names>C.</given-names></name></person-group> (<year>2000</year>). <article-title>A phonological model of French intonation</article-title>, in <source>Intonation: Analysis, Modelling and Technology</source>, ed <person-group person-group-type="editor"><name><surname>Botinis</surname> <given-names>A.</given-names></name></person-group>(<publisher-loc>Boston, MA</publisher-loc>: <publisher-name>Kluwer</publisher-name>), <fpage>209</fpage>&#x02013;<lpage>242</lpage>.</citation>
</ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jun</surname> <given-names>S.-A.</given-names></name> <name><surname>Fougeron</surname> <given-names>C.</given-names></name></person-group> (<year>2002</year>). <article-title>Realizations of accentual phrase in French intonation</article-title>. <source>Probus</source> <volume>14</volume>, <fpage>147</fpage>&#x02013;<lpage>172</lpage>. <pub-id pub-id-type="doi">10.1515/prbs.2002.002</pub-id></citation>
</ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kelso</surname> <given-names>J. A.</given-names></name> <name><surname>Vatikiotis-Bateson</surname> <given-names>E.</given-names></name> <name><surname>Saltzman</surname> <given-names>E. L.</given-names></name> <name><surname>Kay</surname> <given-names>B.</given-names></name></person-group> (<year>1985</year>). <article-title>A qualitative dynamic analysis of reiterant speech production: phase portraits, kinematics, and dynamic modeling</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>77</volume>, <fpage>266</fpage>&#x02013;<lpage>280</lpage>. <pub-id pub-id-type="doi">10.1121/1.392268</pub-id><pub-id pub-id-type="pmid">3973219</pub-id></citation>
</ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kotz</surname> <given-names>S. A.</given-names></name> <name><surname>Schwartze</surname> <given-names>M.</given-names></name></person-group> (<year>2010</year>). <article-title>Cortical speech processing unplugged: a timely subcortico-cortical framework</article-title>. <source>Trends Cogn. Sci.</source> <volume>14</volume>, <fpage>392</fpage>&#x02013;<lpage>399</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2010.06.005</pub-id><pub-id pub-id-type="pmid">20655802</pub-id></citation>
</ref>
<ref id="B35">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Kotz</surname> <given-names>S. A.</given-names></name> <name><surname>Schwartze</surname> <given-names>M.</given-names></name></person-group> (<year>2016</year>). <article-title>Motor timing and sequencing in speech production. A general-purpose framework</article-title>, in <source>Neurobiology of Language</source>, eds <person-group person-group-type="editor"><name><surname>Hickock</surname> <given-names>G.</given-names></name> <name><surname>Small</surname> <given-names>S.</given-names></name></person-group>(<publisher-loc>Amsterdam</publisher-loc>: <publisher-name>Elsevier</publisher-name>), <fpage>717</fpage>&#x02013;<lpage>724</lpage>.</citation>
</ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lakatos</surname> <given-names>P.</given-names></name> <name><surname>Karmos</surname> <given-names>G.</given-names></name> <name><surname>Mehta</surname> <given-names>A. D.</given-names></name> <name><surname>Ulbert</surname> <given-names>I.</given-names></name> <name><surname>Schroeder</surname> <given-names>C. E.</given-names></name></person-group> (<year>2008</year>). <article-title>Entrainment of neuronal oscillations as a mechanism of attentional selection</article-title>. <source>Science</source> <volume>320</volume>, <fpage>110</fpage>&#x02013;<lpage>113</lpage>. <pub-id pub-id-type="doi">10.1126/science.1154735</pub-id><pub-id pub-id-type="pmid">18388295</pub-id></citation>
</ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Large</surname> <given-names>E. W.</given-names></name> <name><surname>Jones</surname> <given-names>M. R.</given-names></name></person-group> (<year>1999</year>). <article-title>The dynamics of attending: how people track time-varying events</article-title>. <source>Psychol. Rev.</source> <volume>106</volume>, <fpage>119</fpage>&#x02013;<lpage>159</lpage>. <pub-id pub-id-type="doi">10.1037/0033-295X.106.1.119</pub-id></citation>
</ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Large</surname> <given-names>E. W.</given-names></name> <name><surname>Palmer</surname> <given-names>C.</given-names></name></person-group> (<year>2002</year>). <article-title>Perceiving temporal regularity in music</article-title>. <source>Cogn. Sci.</source> <volume>26</volume>, <fpage>1</fpage>&#x02013;<lpage>37</lpage>. <pub-id pub-id-type="doi">10.1207/s15516709cog2601_1</pub-id></citation>
</ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lehiste</surname> <given-names>I.</given-names></name></person-group> (<year>1977</year>). <article-title>Isochrony reconsidered</article-title>. <source>J. Phon.</source> <volume>5</volume>, <fpage>253</fpage>&#x02013;<lpage>263</lpage>.</citation>
</ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liberman</surname> <given-names>A.</given-names></name> <name><surname>Prince</surname> <given-names>A.</given-names></name></person-group> (<year>1977</year>). <article-title>On stress and linguistic rhythm</article-title>. <source>Linguist. Inq.</source> <volume>8</volume>, <fpage>249</fpage>&#x02013;<lpage>336</lpage>.</citation>
</ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lidji</surname> <given-names>P.</given-names></name> <name><surname>Palmer</surname> <given-names>C.</given-names></name> <name><surname>Peretz</surname> <given-names>I.</given-names></name> <name><surname>Morningstar</surname> <given-names>M.</given-names></name></person-group> (<year>2011</year>). <article-title>Listeners feel the beat: entrainment to English and French speech rhythms</article-title>. <source>Psychon. Bull. Rev.</source> <volume>18</volume>, <fpage>1035</fpage>&#x02013;<lpage>1041</lpage>. <pub-id pub-id-type="doi">10.3758/s13423-011-0163-0</pub-id><pub-id pub-id-type="pmid">21912999</pub-id></citation>
</ref>
<ref id="B42">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>London</surname> <given-names>J.</given-names></name></person-group> (<year>2004</year>). <source>Hearing in Time.</source> <publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>.</citation>
</ref>
<ref id="B43">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>MacMillan</surname> <given-names>N. A.</given-names></name> <name><surname>Creelman</surname> <given-names>C. D.</given-names></name></person-group> (<year>2005</year>). <source>Detection Theory. A User&#x00027;s Guide</source>. <publisher-loc>Mahwah, NJ</publisher-loc>: <publisher-name>Laurence Erlbaum</publisher-name>.</citation>
</ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maes</surname> <given-names>P.-J.</given-names></name> <name><surname>Leman</surname> <given-names>M.</given-names></name> <name><surname>Palmer</surname> <given-names>C.</given-names></name> <name><surname>Wanderley</surname> <given-names>M. M.</given-names></name></person-group> (<year>2014</year>). <article-title>Action-based effects on music perception</article-title>. <source>Front. Psychol.</source> <volume>4</volume>:<fpage>1008</fpage>. <pub-id pub-id-type="doi">10.3389/fpsyg.2013.01008</pub-id><pub-id pub-id-type="pmid">24454299</pub-id></citation>
</ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Manning</surname> <given-names>F. C.</given-names></name> <name><surname>Schutz</surname> <given-names>M.</given-names></name></person-group> (<year>2013</year>). <article-title>&#x0201C;Moving to the beat&#x0201D; improves timing perception</article-title>. <source>Psychon. Bull. Rev.</source> <volume>20</volume>, <fpage>1133</fpage>&#x02013;<lpage>1139</lpage>. <pub-id pub-id-type="doi">10.3758/s13423-013-0439-7</pub-id><pub-id pub-id-type="pmid">23670284</pub-id></citation>
</ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Manning</surname> <given-names>F. C.</given-names></name> <name><surname>Schutz</surname> <given-names>M.</given-names></name></person-group> (<year>2016</year>). <article-title>Trained to keep a beat: movement-related enhancements to timing perception in percussionists and non-percussionists</article-title>. <source>Psychol. Res.</source> <volume>80</volume>, <fpage>532</fpage>&#x02013;<lpage>542</lpage>. <pub-id pub-id-type="doi">10.1007/s00426-015-0678-5</pub-id><pub-id pub-id-type="pmid">26067889</pub-id></citation>
</ref>
<ref id="B47">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>McNeill</surname> <given-names>D.</given-names></name></person-group> (<year>1992</year>). <source>Hand and Mind: What Gestures Reveal About Thought</source>. <publisher-loc>Chicago, IL</publisher-loc>: <publisher-name>University of Chicago Press</publisher-name>.</citation>
</ref>
<ref id="B48">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Mengel</surname> <given-names>A.</given-names></name></person-group> (<year>2000</year>). <source>Deutscher Wortakzent</source>. <publisher-loc>Norderstedt</publisher-loc>: <publisher-name>Books on Demand</publisher-name>.</citation>
</ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Michelas</surname> <given-names>A.</given-names></name> <name><surname>D&#x00027;Imperio</surname> <given-names>M.</given-names></name></person-group> (<year>2015</year>). <article-title>Prosodic boundary strength guides syntactic parsing of French utterances</article-title>. <source>Lab. Phonol.</source> <volume>6</volume>, <fpage>119</fpage>&#x02013;<lpage>146</lpage>. <pub-id pub-id-type="doi">10.1515/lp-2015-0003</pub-id></citation>
</ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Michelas</surname> <given-names>A.</given-names></name> <name><surname>Frauenfelder</surname> <given-names>U. H.</given-names></name> <name><surname>Sch&#x000F6;n</surname> <given-names>D.</given-names></name> <name><surname>Dufour</surname> <given-names>S.</given-names></name></person-group> (<year>2016</year>). <article-title>How deaf are French speakers to stress?</article-title> <source>J. Acoust. Soc. Am.</source> <volume>139</volume>, <fpage>1333</fpage>&#x02013;<lpage>1342</lpage>. <pub-id pub-id-type="doi">10.1121/1.4944574</pub-id><pub-id pub-id-type="pmid">27036270</pub-id></citation>
</ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Millotte</surname> <given-names>S.</given-names></name> <name><surname>Rene</surname> <given-names>A.</given-names></name> <name><surname>Wales</surname> <given-names>R.</given-names></name> <name><surname>Christophe</surname> <given-names>A.</given-names></name></person-group> (<year>2008</year>). <article-title>Phonological phrase boundaries constrain the online syntactic analysis of spoken sentences</article-title>. <source>J. Exp. Psychol. Learn. Mem. Cogn.</source> <volume>34</volume>, <fpage>874</fpage>&#x02013;<lpage>885</lpage>. <pub-id pub-id-type="doi">10.1037/0278-7393.34.4.874</pub-id><pub-id pub-id-type="pmid">18605875</pub-id></citation>
</ref>
<ref id="B52">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Morillon</surname> <given-names>B.</given-names></name> <name><surname>Hackett</surname> <given-names>T. A.</given-names></name> <name><surname>Kajikawa</surname> <given-names>Y.</given-names></name> <name><surname>Schroeder</surname> <given-names>C. E.</given-names></name></person-group> (<year>2015</year>). <article-title>Predictive motor control of sensory dynamics in auditory active sensing</article-title>. <source>Curr. Opin. Neurobiol.</source> <volume>31</volume>, <fpage>230</fpage>&#x02013;<lpage>238</lpage>. <pub-id pub-id-type="doi">10.1016/j.conb.2014.12.005</pub-id><pub-id pub-id-type="pmid">25594376</pub-id></citation>
</ref>
<ref id="B53">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Morillon</surname> <given-names>B.</given-names></name> <name><surname>Schroeder</surname> <given-names>C. E.</given-names></name> <name><surname>Wyart</surname> <given-names>V.</given-names></name></person-group> (<year>2014</year>). <article-title>Motor contributions to the temporal precision of auditory attention</article-title>. <source>Nat. Commun.</source> <volume>5</volume>:<fpage>5255</fpage>. <pub-id pub-id-type="doi">10.1038/ncomms6255</pub-id><pub-id pub-id-type="pmid">25314898</pub-id></citation>
</ref>
<ref id="B54">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Morton</surname> <given-names>J.</given-names></name> <name><surname>Martin</surname> <given-names>S. M.</given-names></name> <name><surname>Frankish</surname> <given-names>C.</given-names></name></person-group> (<year>1976</year>). <article-title>Perceptual centers (P-centers)</article-title>. <source>Psychol. Rev.</source> <volume>83</volume>, <fpage>405</fpage>&#x02013;<lpage>408</lpage>. <pub-id pub-id-type="doi">10.1037/0033-295X.83.5.405</pub-id></citation>
</ref>
<ref id="B55">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Ong</surname> <given-names>W. J.</given-names></name></person-group> (<year>2002</year>). <source>Orality and Literacy: The Technologizing of the Word</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Routledge</publisher-name>.</citation>
</ref>
<ref id="B56">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Opie</surname> <given-names>I.</given-names></name> <name><surname>Opie</surname> <given-names>P.</given-names></name></person-group> (<year>1988</year>). <source>The Singing Game</source>. <publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>.</citation>
</ref>
<ref id="B57">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Opie</surname> <given-names>I.</given-names></name> <name><surname>Opie</surname> <given-names>P.</given-names></name></person-group> (<year>1997</year>). <source>The Oxford Dictionary of Rhymes</source>. <publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>.</citation>
</ref>
<ref id="B58">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Otterbein</surname> <given-names>S.</given-names></name> <name><surname>Abel</surname> <given-names>C.</given-names></name> <name><surname>Heinemann</surname> <given-names>L. V.</given-names></name> <name><surname>Kaiser</surname> <given-names>J.</given-names></name> <name><surname>Schmidt-Kassow</surname> <given-names>M.</given-names></name></person-group> (<year>2012</year>). <article-title>P3b reflects periodicity in linguistic sequences</article-title>. <source>PLoS ONE</source> <volume>7</volume>:<fpage>e51419</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0051419</pub-id><pub-id pub-id-type="pmid">23251527</pub-id></citation>
</ref>
<ref id="B59">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parrell</surname> <given-names>B.</given-names></name> <name><surname>Goldstein</surname> <given-names>L.</given-names></name> <name><surname>Lee</surname> <given-names>S.</given-names></name> <name><surname>Byrd</surname> <given-names>D.</given-names></name></person-group> (<year>2014</year>). <article-title>Spatiotemporal coupling between speech and manual motor actions</article-title>. <source>J. Phon.</source> <volume>42</volume>, <fpage>1</fpage>&#x02013;<lpage>11</lpage>. <pub-id pub-id-type="doi">10.1016/j.wocn.2013.11.002</pub-id><pub-id pub-id-type="pmid">24465063</pub-id></citation>
</ref>
<ref id="B60">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Peelle</surname> <given-names>J. E.</given-names></name> <name><surname>Gross</surname> <given-names>J.</given-names></name> <name><surname>Davis</surname> <given-names>M. H.</given-names></name></person-group> (<year>2013</year>). <article-title>Phase-locked responses to speech in human auditory cortex are enhanced during comprehension</article-title>. <source>Cereb. Cortex</source> <volume>23</volume>, <fpage>1378</fpage>&#x02013;<lpage>1387</lpage>. <pub-id pub-id-type="doi">10.1093/cercor/bhs118</pub-id><pub-id pub-id-type="pmid">22610394</pub-id></citation>
</ref>
<ref id="B61">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Peperkamp</surname> <given-names>S.</given-names></name> <name><surname>Dupoux</surname> <given-names>E.</given-names></name></person-group> (<year>2002</year>). <article-title>A typological study of stress &#x00027;deafness&#x00027;</article-title>. <source>Lab. Phonol.</source> <volume>7</volume>, <fpage>203</fpage>&#x02013;<lpage>240</lpage>. <pub-id pub-id-type="doi">10.1515/9783110197105.203</pub-id></citation>
</ref>
<ref id="B62">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Phillips-Silver</surname> <given-names>J.</given-names></name> <name><surname>Trainor</surname> <given-names>L. J.</given-names></name></person-group> (<year>2005</year>). <article-title>Feeling the beat: movement influences infant rhythm perception</article-title>. <source>Science</source> <volume>308</volume>:<fpage>1430</fpage>. <pub-id pub-id-type="doi">10.1126/science.1110922</pub-id><pub-id pub-id-type="pmid">15933193</pub-id></citation>
</ref>
<ref id="B63">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pitt</surname> <given-names>M. A.</given-names></name> <name><surname>Samuel</surname> <given-names>A. G.</given-names></name></person-group> (<year>1990</year>). <article-title>The use of rhythm in attending to speech</article-title>. <source>J. Exp. Psychol. Hum. Percept. Perform.</source> <volume>16</volume>, <fpage>564</fpage>&#x02013;<lpage>573</lpage>. <pub-id pub-id-type="doi">10.1037/0096-1523.16.3.564</pub-id><pub-id pub-id-type="pmid">2144571</pub-id></citation>
</ref>
<ref id="B64">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Post</surname> <given-names>B. M. B.</given-names></name></person-group> (<year>2000</year>). <source>Tonal and Phrasal Structures in French Intonation</source>. <publisher-loc>The Hague</publisher-loc>: <publisher-name>Thesus</publisher-name>.</citation>
</ref>
<ref id="B65">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Prinz</surname> <given-names>W.</given-names></name></person-group> (<year>1990</year>). <article-title>A common coding approach to perception and action</article-title>, in <source>Relationships Between Perception and Action: Current Approaches</source>, eds <person-group person-group-type="editor"><name><surname>Neumann</surname> <given-names>O.</given-names></name> <name><surname>Prinz</surname> <given-names>W.</given-names></name></person-group>(<publisher-loc>Berlin; Heidelberg</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>167</fpage>&#x02013;<lpage>201</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-642-75348-0_7</pub-id></citation>
</ref>
<ref id="B66">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Quen&#x000E9;</surname> <given-names>H.</given-names></name> <name><surname>Port</surname> <given-names>R. F.</given-names></name></person-group> (<year>2005</year>). <article-title>Effects of timing regularity and metrical expectancy on spoken-word perception</article-title>. <source>Phonetica</source> <volume>62</volume>, <fpage>1</fpage>&#x02013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.1159/000087222</pub-id><pub-id pub-id-type="pmid">16116301</pub-id></citation>
</ref>
<ref id="B67">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Repp</surname> <given-names>B. H.</given-names></name></person-group> (<year>2003</year>). <article-title>Phase attraction in sensorimotor synchronization with auditory sequences: effects of single and periodic distractors on synchronization accuracy</article-title>. <source>J. Exp. Psychol. Hum. Percept. Perform.</source> <volume>29</volume>, <fpage>290</fpage>&#x02013;<lpage>309</lpage>. <pub-id pub-id-type="doi">10.1037/0096-1523.29.2.290</pub-id><pub-id pub-id-type="pmid">12760616</pub-id></citation>
</ref>
<ref id="B68">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Repp</surname> <given-names>B. H.</given-names></name></person-group> (<year>2005</year>). <article-title>Sensorimotor synchronization: a review of the tapping literature</article-title>. <source>Psychon. Bull. Rev.</source> <volume>12</volume>, <fpage>969</fpage>&#x02013;<lpage>992</lpage>. <pub-id pub-id-type="doi">10.3758/BF03206433</pub-id><pub-id pub-id-type="pmid">16615317</pub-id></citation>
</ref>
<ref id="B69">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Repp</surname> <given-names>B. H.</given-names></name></person-group> (<year>2008</year>). <article-title>Multiple temporal references in sensorimotor synchronization with metrical auditory sequences</article-title>. <source>Psychol. Res.</source> <volume>72</volume>, <fpage>79</fpage>&#x02013;<lpage>98</lpage>. <pub-id pub-id-type="doi">10.1007/s00426-006-0067-1</pub-id><pub-id pub-id-type="pmid">16786353</pub-id></citation>
</ref>
<ref id="B70">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Repp</surname> <given-names>B. H.</given-names></name> <name><surname>Penel</surname> <given-names>A.</given-names></name></person-group> (<year>2004</year>). <article-title>Rhythmic movement is attracted more strongly to auditory than to visual rhythms</article-title>. <source>Psychol. Res.</source> <volume>68</volume>, <fpage>252</fpage>&#x02013;<lpage>270</lpage>. <pub-id pub-id-type="doi">10.1007/s00426-003-0143-8</pub-id><pub-id pub-id-type="pmid">12955504</pub-id></citation>
</ref>
<ref id="B71">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Repp</surname> <given-names>B. H.</given-names></name> <name><surname>Su</surname> <given-names>Y.-H.</given-names></name></person-group> (<year>2013</year>). <article-title>Sensorimotor synchronization: a review of recent research (2006&#x02013;2012)</article-title>. <source>Psychon. Bull. Rev.</source> <volume>20</volume>, <fpage>403</fpage>&#x02013;<lpage>452</lpage>. <pub-id pub-id-type="doi">10.3758/s13423-012-0371-2</pub-id><pub-id pub-id-type="pmid">23397235</pub-id></citation>
</ref>
<ref id="B72">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rochet-Capellan</surname> <given-names>A.</given-names></name> <name><surname>Laboissi&#x000E8;re</surname> <given-names>R.</given-names></name> <name><surname>Galv&#x000E1;n</surname> <given-names>A.</given-names></name> <name><surname>Schwartz</surname> <given-names>J. L.</given-names></name></person-group> (<year>2008</year>). <article-title>The speech focus position effect on jaw-finger coordination in a pointing task</article-title>. <source>J. Speech Lang. Hear. Res.</source> <volume>51</volume>, <fpage>1507</fpage>&#x02013;<lpage>1521</lpage>. <pub-id pub-id-type="doi">10.1044/1092-4388(2008/07-0173)</pub-id><pub-id pub-id-type="pmid">18695015</pub-id></citation>
</ref>
<ref id="B73">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Rolland</surname> <given-names>G.</given-names></name> <name><surname>L&#x00153;venbruck</surname> <given-names>H.</given-names></name></person-group> (<year>2002</year>). <article-title>Characteristics of the accentual phrase in French: an acoustic, articulatory and perceptual study</article-title>, in <source>Proceedings of the Speech Prosody 2002</source>, eds <person-group person-group-type="editor"><name><surname>Bel</surname> <given-names>B.</given-names></name> <name><surname>Marlien</surname> <given-names>I.</given-names></name></person-group>(<publisher-loc>Aix-en-Provence</publisher-loc>), <fpage>611</fpage>&#x02013;<lpage>614</lpage>.</citation>
</ref>
<ref id="B74">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roncaglia-Denissen</surname> <given-names>M. P.</given-names></name> <name><surname>Schmidt-Kassow</surname> <given-names>M.</given-names></name> <name><surname>Kotz</surname> <given-names>S. A.</given-names></name></person-group> (<year>2013</year>). <article-title>Speech rhythm facilitates syntactic ambiguity resolution: ERP evidence</article-title>. <source>PLoS ONE</source> <volume>8</volume>:<fpage>e56000</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0056000</pub-id><pub-id pub-id-type="pmid">23409109</pub-id></citation>
</ref>
<ref id="B75">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Rossi</surname> <given-names>M.</given-names></name></person-group> (<year>1980</year>). <article-title>Le fran&#x000E7;ais, langue sans accent?</article-title> in <source>L&#x00027; accent en Fran&#x000E7;ais Contemporain</source>, eds <person-group person-group-type="editor"><name><surname>F&#x000F3;nagy</surname> <given-names>I.</given-names></name> <name><surname>L&#x000E9;on</surname> <given-names>P. P.</given-names></name></person-group>(<publisher-loc>Montr&#x000E9;al, QC</publisher-loc>: <publisher-name>Didier</publisher-name>), <fpage>13</fpage>&#x02013;<lpage>51</lpage>.</citation>
</ref>
<ref id="B76">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmidt-Kassow</surname> <given-names>M.</given-names></name> <name><surname>Heinemann</surname> <given-names>L. V.</given-names></name> <name><surname>Abel</surname> <given-names>C.</given-names></name> <name><surname>Kaiser</surname> <given-names>J.</given-names></name></person-group> (<year>2013</year>). <article-title>Auditory-motor synchronization facilitates attention allocation</article-title>. <source>Neuroimage</source> <volume>15</volume>, <fpage>101</fpage>&#x02013;<lpage>106</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuroimage.2013.05.111</pub-id></citation>
</ref>
<ref id="B77">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmidt-Kassow</surname> <given-names>M.</given-names></name> <name><surname>Rothermich</surname> <given-names>K.</given-names></name> <name><surname>Schwartze</surname> <given-names>M.</given-names></name> <name><surname>Kotz</surname> <given-names>S. A.</given-names></name></person-group> (<year>2011</year>). <article-title>Did you get the beat? Late proficient French-German learners extract strong-weak patterns in tonal but not in linguistic sequences</article-title>. <source>Neuroimage</source> <volume>54</volume>, <fpage>568</fpage>&#x02013;<lpage>576</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuroimage.2010.07.062</pub-id><pub-id pub-id-type="pmid">20692349</pub-id></citation>
</ref>
<ref id="B78">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sch&#x000F6;n</surname> <given-names>D.</given-names></name> <name><surname>Tillmann</surname> <given-names>B.</given-names></name></person-group> (<year>2015</year>). <article-title>Short- and long-term rhythmic interventions: perspectives for language rehabilitation</article-title>. <source>Ann. N.Y. Acad. Sci.</source> <volume>1337</volume>, <fpage>32</fpage>&#x02013;<lpage>39</lpage>. <pub-id pub-id-type="doi">10.1111/nyas.12635</pub-id><pub-id pub-id-type="pmid">25773614</pub-id></citation>
</ref>
<ref id="B79">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schwartze</surname> <given-names>M.</given-names></name> <name><surname>Rothermich</surname> <given-names>K.</given-names></name> <name><surname>Schmidt-Kassow</surname> <given-names>M.</given-names></name> <name><surname>Kotz</surname> <given-names>S. A.</given-names></name></person-group> (<year>2011</year>). <article-title>Temporal regularity effects on pre-attentive and attentive processing of deviance</article-title>. <source>Biol. Psychol.</source> <volume>87</volume>, <fpage>146</fpage>&#x02013;<lpage>151</lpage>. <pub-id pub-id-type="doi">10.1016/j.biopsycho.2011.02.021</pub-id><pub-id pub-id-type="pmid">21382437</pub-id></citation>
</ref>
<ref id="B80">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sowi&#x00144;ski</surname> <given-names>J.</given-names></name> <name><surname>Dalla Bella</surname> <given-names>S.</given-names></name></person-group> (<year>2013</year>). <article-title>Poor synchronization to the beat may result from deficient auditory-motor mapping</article-title>. <source>Neuropsychologia</source> <volume>51</volume>, <fpage>1952</fpage>&#x02013;<lpage>1963</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuropsychologia.2013.06.027</pub-id><pub-id pub-id-type="pmid">23838002</pub-id></citation>
</ref>
<ref id="B81">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stern</surname> <given-names>D. N.</given-names></name></person-group> (<year>1974</year>). <article-title>The goal and structure of mother-infant play</article-title>. <source>J. Am. Acad. Child. Psychiatry.</source> <volume>13</volume>, <fpage>402</fpage>&#x02013;<lpage>421</lpage> <pub-id pub-id-type="doi">10.1016/S0002-7138(09)61348-0</pub-id><pub-id pub-id-type="pmid">4427039</pub-id></citation>
</ref>
<ref id="B82">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sturt</surname> <given-names>P.</given-names></name> <name><surname>Sanford</surname> <given-names>A. J.</given-names></name> <name><surname>Stewart</surname> <given-names>A.</given-names></name> <name><surname>Dawydiak</surname> <given-names>E.</given-names></name></person-group> (<year>2004</year>). <article-title>Linguistic focus and good-enough representations: an application of the change-detection paradigm</article-title>. <source>Psychon. Bull. Rev.</source> <volume>11</volume>, <fpage>882</fpage>&#x02013;<lpage>888</lpage>. <pub-id pub-id-type="doi">10.3758/BF03196716</pub-id><pub-id pub-id-type="pmid">15732698</pub-id></citation>
</ref>
<ref id="B83">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Toro</surname> <given-names>J. M.</given-names></name> <name><surname>Sebastian-Gall&#x000E8;s</surname> <given-names>N.</given-names></name> <name><surname>Mattys</surname> <given-names>S. L.</given-names></name></person-group> (<year>2009</year>). <article-title>The role of perceptual salience during the segmentation of connected speech</article-title>. <source>Eur. J. Cogn. Psychol.</source> <volume>21</volume>, <fpage>786</fpage>&#x02013;<lpage>800</lpage>. <pub-id pub-id-type="doi">10.1080/09541440802405584</pub-id></citation>
</ref>
<ref id="B84">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Vaissi&#x000E8;re</surname> <given-names>J.</given-names></name></person-group> (<year>1991</year>). <article-title>Rhythm, accentuation and final lengthening in French</article-title>, in <source>Music, Language, Speech and Brain: Proceedings of an International Symposium at the Wenner-Gren Center, Stockholm, 5-8 September 1990</source>, eds <person-group person-group-type="editor"><name><surname>Sundberg</surname> <given-names>J.</given-names></name> <name><surname>Nord</surname> <given-names>L.</given-names></name> <name><surname>Carlson</surname> <given-names>R.</given-names></name></person-group>(<publisher-loc>London</publisher-loc>: <publisher-name>MacMillan Education</publisher-name>), <fpage>108</fpage>&#x02013;<lpage>120</lpage>.</citation>
</ref>
<ref id="B85">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>van der Steen</surname> <given-names>M.</given-names></name> <name><surname>Keller</surname> <given-names>P. E.</given-names></name></person-group> (<year>2013</year>). <article-title>The ADaptation and Anticipation Model (ADAM) of sensorimotor synchronization</article-title>. <source>Front. Hum. Neurosci.</source> <volume>7</volume>:<fpage>253</fpage>. <pub-id pub-id-type="doi">10.3389/fnhum.2013.00253</pub-id><pub-id pub-id-type="pmid">23772211</pub-id></citation>
</ref>
<ref id="B86">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Volman</surname> <given-names>M. J.</given-names></name> <name><surname>Geuze</surname> <given-names>R. H.</given-names></name></person-group> (<year>2000</year>). <article-title>Temporal stability of rhythmic tapping &#x0201C;on&#x0201D; and &#x0201C;off the beat&#x0201D;: a developmental study</article-title>. <source>Psychol. Res.</source> <volume>63</volume>, <fpage>62</fpage>&#x02013;<lpage>69</lpage>. <pub-id pub-id-type="doi">10.1007/PL00008168</pub-id><pub-id pub-id-type="pmid">10743387</pub-id></citation>
</ref>
<ref id="B87">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Vos</surname> <given-names>P. G.</given-names></name> <name><surname>Helsper</surname> <given-names>E. L.</given-names></name></person-group> (<year>1992</year>). <article-title>Tracking simple rhythms: in-phase versus anti-phase performance</article-title>, in <source>Time, Action, and Cognition: Towards Bridging the Gap</source>, eds <person-group person-group-type="editor"><name><surname>Macar</surname> <given-names>F.</given-names></name> <name><surname>Pouthas</surname> <given-names>V.</given-names></name> <name><surname>Friedman</surname> <given-names>W. J.</given-names></name></person-group>(<publisher-loc>Dordrecht</publisher-loc>: <publisher-name>Kluwer</publisher-name>), <fpage>287</fpage>&#x02013;<lpage>299</lpage>.</citation>
</ref>
<ref id="B88">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wan</surname> <given-names>C. Y.</given-names></name> <name><surname>Bazen</surname> <given-names>L.</given-names></name> <name><surname>Baars</surname> <given-names>R.</given-names></name> <name><surname>Libenson</surname> <given-names>A.</given-names></name> <name><surname>Zipse</surname> <given-names>L.</given-names></name> <name><surname>Zuk</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Auditory-motor mapping training as an intervention to facilitate speech output in non-verbal children with autism: a proof of concept study</article-title>. <source>PLoS ONE</source> <volume>6</volume>:<fpage>e25505</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0025505</pub-id><pub-id pub-id-type="pmid">21980480</pub-id></citation>
</ref>
<ref id="B89">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Welby</surname> <given-names>P.</given-names></name></person-group> (<year>2003</year>). <article-title>Effects of pitch accent position, type, and status on focus projection</article-title>. <source>Lang. Speech</source> <volume>46</volume>, <fpage>53</fpage>&#x02013;<lpage>81</lpage>. <pub-id pub-id-type="doi">10.1177/00238309030460010401</pub-id><pub-id pub-id-type="pmid">14529111</pub-id></citation>
</ref>
<ref id="B90">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Welby</surname> <given-names>P.</given-names></name></person-group> (<year>2006</year>). <article-title>French intonational structure: evidence from tonal alignment</article-title>. <source>J. Phon.</source> <volume>34</volume>, <fpage>343</fpage>&#x02013;<lpage>371</lpage>. <pub-id pub-id-type="doi">10.1016/j.wocn.2005.09.001</pub-id></citation>
</ref>
<ref id="B91">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Welby</surname> <given-names>P.</given-names></name></person-group> (<year>2007</year>). <article-title>The role of early fundamental frequency rises and elbows in French word segmentation</article-title>. <source>Speech Commun.</source> <volume>49</volume>, <fpage>28</fpage>&#x02013;<lpage>48</lpage>. <pub-id pub-id-type="doi">10.1016/j.specom.2006.10.005</pub-id></citation>
</ref>
<ref id="B92">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wing</surname> <given-names>A. M.</given-names></name></person-group> (<year>2002</year>). <article-title>Voluntary timing and brain function: an information processing account</article-title>. <source>Brain Cogn.</source> <volume>48</volume>, <fpage>7</fpage>&#x02013;<lpage>30</lpage>. <pub-id pub-id-type="doi">10.1006/brcg.2001.1301</pub-id><pub-id pub-id-type="pmid">11812030</pub-id></citation>
</ref>
<ref id="B93">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Wiese</surname> <given-names>R.</given-names></name></person-group> (<year>2000</year>). <source>The Phonology of German</source>. <publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>.</citation>
</ref>
<ref id="B94">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zheng</surname> <given-names>X.</given-names></name> <name><surname>Pierrehumbert</surname> <given-names>J. B.</given-names></name></person-group> (<year>2010</year>). <article-title>The effects of prosodic prominence and serial position on duration perception</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>128</volume>, <fpage>851</fpage>&#x02013;<lpage>859</lpage>. <pub-id pub-id-type="doi">10.1121/1.3455796</pub-id><pub-id pub-id-type="pmid">20707454</pub-id></citation>
</ref>
</ref-list>
<fn-group>
<fn id="fn0001"><p><sup>1</sup>A MIDI delay of 81 ms was subtracted from all tapping data of the German-speaking group, and of 90 ms for the French-speaking group.</p></fn>
<fn id="fn0002"><p><sup>2</sup>Note that this main effect is not readily interpretable because of the different verbal materials used in the two languages.</p></fn>
<fn id="fn0003"><p><sup>3</sup>Language groups differed in musical experience. Because musical training can affect tapping performance, we re-ran all models reported below showing Language effects by including musical experience as a covariate. Musical experience did not change any of the results.</p></fn>
<fn id="fn0004"><p><sup>4</sup>No differences in the results were found when the regression model was run with and without one extreme value of <italic>d</italic>&#x02032; in the incongruent condition in the French group. The results without the extreme value are reported.</p></fn>
</fn-group>
</back>
</article>