<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Psychol.</journal-id>
<journal-title>Frontiers in Psychology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Psychol.</abbrev-journal-title>
<issn pub-type="epub">1664-1078</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpsyg.2021.680889</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Psychology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Durational Differences of Word-Final /s/ Emerge From the Lexicon: Modelling Morpho-Phonetic Effects in Pseudowords With Linear Discriminative Learning</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Schmitz</surname> <given-names>Dominic</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1219281/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Plag</surname> <given-names>Ingo</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/802788/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Baer-Henney</surname> <given-names>Dinah</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/168570/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Stein</surname> <given-names>Simon David</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1231783/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>English Language and Linguistics, Heinrich Heine University</institution>, <addr-line>D&#x00FC;sseldorf</addr-line>, <country>Germany</country></aff>
<aff id="aff2"><sup>2</sup><institution>Linguistics and Information Science, Heinrich Heine University</institution>, <addr-line>D&#x00FC;sseldorf</addr-line>, <country>Germany</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Vito Pirrelli, National Research Council (CNR), Italy</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: LouAnn Gerken, University of Arizona, United States; Yu-Ying Chuang, University of T&#x00FC;bingen, Germany</p></fn>
<corresp id="c001">&#x002A;Correspondence: Dominic Schmitz, <email>dominic.schmitz@uni-duesseldorf.de</email></corresp>
<fn fn-type="other" id="fn004"><p>This article was submitted to Language Sciences, a section of the journal Frontiers in Psychology</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>09</day>
<month>08</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>12</volume>
<elocation-id>680889</elocation-id>
<history>
<date date-type="received">
<day>15</day>
<month>03</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>24</day>
<month>06</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2021 Schmitz, Plag, Baer-Henney and Stein.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Schmitz, Plag, Baer-Henney and Stein</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Recent research has shown that seemingly identical suffixes such as word-final /s/ in English show systematic differences in their phonetic realisations. Most recently, durational differences between different types of /s/ have been found to also hold for pseudowords: the duration of /s/ is longest in non-morphemic contexts, shorter with suffixes, and shortest in clitics. At the theoretical level, such systematic differences are unexpected and unaccounted for in current theories of speech production. Following a recent approach, we implemented a linear discriminative learning (LDL) network trained on real word data in order to predict the duration of word-final non-morphemic and plural /s/ in pseudowords, using production data from a previous production study. We demonstrate that the duration of word-final /s/ in pseudowords can be predicted by LDL networks trained on real word data. That is, the duration of word-final /s/ in pseudowords can be predicted on the basis of the pseudowords&#x2019; relations to the lexicon.</p>
</abstract>
<kwd-group>
<kwd>morphology</kwd>
<kwd>speech production</kwd>
<kwd>linear discriminative learning</kwd>
<kwd>computational modelling</kwd>
<kwd>pseudoword paradigm</kwd>
<kwd>subphonemic differences</kwd>
</kwd-group>
<contract-num rid="cn001">BA 6523/1-1</contract-num>
<contract-num rid="cn001">PL151/9-1</contract-num>
<contract-num rid="cn001">PL 151/7-2</contract-num>
<contract-sponsor id="cn001">Deutsche Forschungsgemeinschaft<named-content content-type="fundref-id">10.13039/501100001659</named-content></contract-sponsor>
<counts>
<fig-count count="5"/>
<table-count count="9"/>
<equation-count count="10"/>
<ref-count count="78"/>
<page-count count="20"/>
<word-count count="0"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1">
<title>Introduction</title>
<p>Many studies on the acoustic properties of phonologically homophonous elements have shown unexpected effects of their morphological structure on their phonetic realisation. Such effects were shown for seemingly homophonous lexemes (<xref ref-type="bibr" rid="B25">Gahl, 2008</xref>; <xref ref-type="bibr" rid="B23">Drager, 2011</xref>), for free and bound variants of stems (<xref ref-type="bibr" rid="B31">Kemps et al., 2005a</xref>, <xref ref-type="bibr" rid="B32">b</xref>), and for prefixes (<xref ref-type="bibr" rid="B12">Ben Hedia and Plag, 2017</xref>; <xref ref-type="bibr" rid="B11">Ben Hedia, 2019</xref>).</p>
<p>At the level of individual segments, a number of studies have shown that the acoustic realisation of word-final /s/ and /z/ (henceforth S) in English depends on its morphological status and category. Corpus studies (<xref ref-type="bibr" rid="B77">Zimmermann, 2016</xref>; <xref ref-type="bibr" rid="B47">Plag et al., 2017</xref>) found that non-morphemic word-final S shows the longest acoustic durations, followed by suffixes, which in turn are followed by clitics. Experimental studies (<xref ref-type="bibr" rid="B73">Walsh and Parker, 1983</xref>; <xref ref-type="bibr" rid="B28">Hsieh et al., 1999</xref>; <xref ref-type="bibr" rid="B60">Seyfarth et al., 2017</xref>; <xref ref-type="bibr" rid="B48">Plag et al., 2020</xref>) confirm durational differences between different types of S. However, their results are mostly not as clear as those of the corpus studies. Only recently did a study by <xref ref-type="bibr" rid="B57">Schmitz et al. (2020)</xref> on word-final S in pseudowords confirm the pattern of durational differences previously found only in corpus studies.</p>
<p>Most importantly, none of the aforementioned studies of word-final S was able to explain the differences found on a theoretical level. Traditional models of speech production assume that no morphological information is available in phonetic processing (<xref ref-type="bibr" rid="B39">Levelt et al., 1999</xref>; <xref ref-type="bibr" rid="B54">Roelofs and Ferreira, 2019</xref>; <xref ref-type="bibr" rid="B68">Turk and Shattuck-Hufnagel, 2020</xref>), which renders an explanation on the basis of differing morphological categories improbable. Other accounts, such as standard feed-forward theories of morphology-phonology interaction (e.g., <xref ref-type="bibr" rid="B19">Chomsky and Halle, 1968</xref>; <xref ref-type="bibr" rid="B33">Kiparsky, 1982</xref>) or prosodic phonology (e.g., <xref ref-type="bibr" rid="B15">Booij, 1983</xref>; <xref ref-type="bibr" rid="B58">Selkirk, 1996</xref>; <xref ref-type="bibr" rid="B26">Goad, 1998</xref>, <xref ref-type="bibr" rid="B27">2002</xref>), do not offer a satisfying explanation for such durational differences, either.</p>
<p>Only recently, <xref ref-type="bibr" rid="B64">Tomaschek et al. (2019)</xref> analysed durational differences between types of S by means of an implementation of na&#x00EF;ve discriminative learning (<xref ref-type="bibr" rid="B49">Ramscar and Yarlett, 2007</xref>; <xref ref-type="bibr" rid="B50">Ramscar et al., 2010</xref>; <xref ref-type="bibr" rid="B6">Baayen et al., 2011</xref>). Their results indicate that the duration of a word-final S in English can be sufficiently approximated by considering the support for its morphological function from the word&#x2019;s sublexical and collocational properties.</p>
<p>This paper continues this line of evidence by making use of the computational model of linear discriminative learning (<xref ref-type="bibr" rid="B5">Baayen et al., 2019b</xref>; <xref ref-type="bibr" rid="B20">Chuang et al., 2020</xref>), the more advanced successor of na&#x00EF;ve discriminative learning. We analyse the durational differences between non-morphemic and plural word-final /s/ found not in real words, but in pseudowords. By using nonce words, we want to rule out potentially confounding effects of the lexical and contextual properties of the individual utterances (e.g., <xref ref-type="bibr" rid="B17">Caselli et al., 2016</xref>). Making use of measures derived from this implementation of linear discriminative learning, the present study demonstrates that the effects found by <xref ref-type="bibr" rid="B64">Tomaschek et al. (2019)</xref> can be confirmed. Differences in phonetic duration emerge from differences in the strengths of associations between form and meaning.</p>
<p>We proceed as follows. The next section will give an overview on studies on the duration of word-final S, and possibilities and obstacles of theoretical accounts. Section &#x201C;Introduction to LDL&#x201D; introduces linear discriminative learning on a theoretical level, while Section &#x201C;Combining Real Words and Pseudowords in an LDL Implementation&#x201D; presents the implementation of linear discriminative learning used in the present study. The analysis and results of our study are given in Sections &#x201C;Analysis&#x201D; and &#x201C;Results.&#x201D; A discussion of the obtained results and a conclusion follow in Section &#x201C;Discussion.&#x201D;</p>
</sec>
<sec id="S2">
<title>Word-Final /s/ and Its Duration</title>
<p>A number of morphological categories can take the phonological form of /s/ in English, i.e., plural, genitive, genitive plural, third person singular, and the clitics of is, has, and us. In itself, there is nothing in the phonological form of these morphological categories that indicates systematic differences in realisation on the phonetic level between different S morphemes or a non-morphemic S. Yet, a number of studies report on durational differences between different types of S.</p>
<p>Corpus studies on word-final S in English find differences in duration between non-morphemic, suffix, and clitic variants. <xref ref-type="bibr" rid="B77">Zimmermann (2016)</xref> on New Zealand English, and <xref ref-type="bibr" rid="B47">Plag et al. (2017)</xref> and <xref ref-type="bibr" rid="B64">Tomaschek et al. (2019)</xref> on North American English find that non-morphemic S (as in grace, cheese, bus) shows longer durations than plural S and the clitic S of has and is, while plural S in turn shows longer durations than clitic S.</p>
<p>Turning to experimental studies, results are not as consistent. <xref ref-type="bibr" rid="B73">Walsh and Parker (1983)</xref> conducted a production experiment with three homophonous word pairs with all words ending in either a non-morphemic or morphemic word-final S. Tested in three different contexts, they find durational differences in two of them. They conclude that morphemic S in English is systematically lengthened by speakers (<xref ref-type="bibr" rid="B73">Walsh and Parker, 1983</xref>: 204). However, their conclusion relies on only a small number of 110 observations, a mixture of common and proper nouns as items, and lacks appropriate inferential statistical methods as well as an integration of covariates.</p>
<p><xref ref-type="bibr" rid="B28">Hsieh et al. (1999)</xref> find that plural S is longer than third person singular S in child-directed speech. However, as their data was originally elicited for another study (<xref ref-type="bibr" rid="B62">Swanson and Leonard, 1994</xref>), half of all plural items occurred sentence-finally, while almost all third person singular items occurred sentence-medially. Thus, the durational differences found by <xref ref-type="bibr" rid="B28">Hsieh et al. (1999)</xref> may be attributed to effects of phrase-final lengthening (e.g., <xref ref-type="bibr" rid="B34">Klatt, 1976</xref>; <xref ref-type="bibr" rid="B74">Wightman et al., 1992</xref>) rather than to phonetic differences between different types of S.</p>
<p>In another study, <xref ref-type="bibr" rid="B60">Seyfarth et al. (2017)</xref> conducted a production experiment on word-final /s/ and /z/ in non-morphemic, plural, and third person singular contexts. Their results indicate that non-morphemic S is shorter than morphemic S. However, they do not find a difference between voiced and voiceless instances, even though previous studies confirm differences dependent on voicing (e.g., <xref ref-type="bibr" rid="B47">Plag et al., 2017</xref>). With only six items ending in /s/, but twenty items ending in /z/, it is questionable how meaningful their results on different types of S are.</p>
<p>Comparing affixes, <xref ref-type="bibr" rid="B48">Plag et al. (2020)</xref> find that plural and genitive plural S differ in duration. That is, in their study the genitive plural suffix shows a longer duration than the plural suffix.</p>
<p>Most recently, <xref ref-type="bibr" rid="B57">Schmitz et al. (2020)</xref> conducted a production experiment on pseudowords carrying either a non-morphemic, plural, or is- or has-clitic S. Their results are in line with those of aforementioned corpus studies. That is, non-morphemic S shows longest S durations, followed by plural S, which in turn is followed by clitic S, while there is no significant durational difference between the two clitics. An overview of the durational differences found in corpus and experimental studies is given in <xref ref-type="table" rid="T1">Table 1</xref>.</p>
<table-wrap position="float" id="T1">
<label>TABLE 1</label>
<caption><p>Overview of durational differences of word-final /s/ found in previous studies.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Study</td>
<td valign="top" align="left">Findings</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><xref ref-type="bibr" rid="B77">Zimmermann, 2016</xref>; <xref ref-type="bibr" rid="B47">Plag et al., 2017</xref>; <xref ref-type="bibr" rid="B64">Tomaschek et al., 2019</xref>; <xref ref-type="bibr" rid="B57">Schmitz et al., 2020</xref></td>
<td valign="top" align="left">Non-morphemic &#x003E; plural &#x003E; clitics</td>
</tr>
<tr>
<td valign="top" align="left"><xref ref-type="bibr" rid="B73">Walsh and Parker, 1983</xref></td>
<td valign="top" align="left">Plural &#x003E; non-morphemic</td>
</tr>
<tr>
<td valign="top" align="left"><xref ref-type="bibr" rid="B28">Hsieh et al., 1999</xref></td>
<td valign="top" align="left">Plural &#x003E; third person singular</td>
</tr>
<tr>
<td valign="top" align="left"><xref ref-type="bibr" rid="B60">Seyfarth et al., 2017</xref></td>
<td valign="top" align="left">Plural &#x003E; non-morphemic</td>
</tr>
<tr>
<td valign="top" align="left"><xref ref-type="bibr" rid="B48">Plag et al., 2020</xref></td>
<td valign="top" align="left">Genitive plural &#x003E; plural</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>There is a noteworthy discrepancy between experimental results and the results based on conversational speech data. Results of corpus studies are in line with each other, but they might be flawed due to imbalanced data sets. Experimental studies, on the other hand, often rely on small data sets, and lack phonetic covariates, appropriate statistical methods, or a proper distinction of voiced and voiceless segments. Additionally, previous experimental studies rely on different experimental methods, making their results subject to their pertinent limitations. Another crucial difference between corpus and experimental studies is the use of homophones. While corpus studies take into consideration all words, most experimental studies restrict their data to homophone pairs. This limitation to homophones and the competition between their representations might be a problem of its own as it is unclear how members of such homophone pairs may influence each other in speech production. Lastly, differences in results might also arise due to potentially confounding effects of the lexical properties and contextual effects of the items under investigation.</p>
<p>But even if the direction of durational differences between different types of S is not entirely clear yet, it appears that there are indeed durational differences of some sort. How is one to explain such differences? In standard feed-forward theories of morphology-phonology interaction (e.g., <xref ref-type="bibr" rid="B19">Chomsky and Halle, 1968</xref>; <xref ref-type="bibr" rid="B33">Kiparsky, 1982</xref>) all types of S, morphemic and non-morphemic, are treated in a similar way. For morphologically complex words, e.g., words ending in a morphemic word-final S, a process named &#x201C;bracket erasure&#x201D; is said to remove any morphological information. This leaves speech production with no information on the morphology of a complex word (e.g., the plural form cats), so that such a word is treated just like a morphologically simple word ending in a non-morphemic word-final S (e.g., the singular form bus). In such a system, there is nothing that could account for realisational differences between phonologically identical forms of suffixes, clitics, and non-morphemic segments.</p>
<p>A similar distinction of lexical and post-lexical processing is also found in established theories of psycholinguistics. According to models of speech production (e.g., <xref ref-type="bibr" rid="B39">Levelt et al., 1999</xref>; <xref ref-type="bibr" rid="B54">Roelofs and Ferreira, 2019</xref>), morphemic types of word-final S do not differ in their realisation from non-morphemic instances of word-final S. For a plural form, e.g., cats, the lemma of the lexical concept CAT and a plural specification are retrieved. Then, during morphological encoding, the plural specification is mapped onto the base lemma, i.e., cat, and the plural suffix, &#x003C; -s &#x003E;. During phonological encoding, phonemes are selected for the corresponding morphemes, i.e., /k/, /&#x00E6;/, /t/, and /s/. Finally, the phonemes are syllabified, resulting in a phonological word representation. Such phonological forms are then forwarded and used in speech production. Thus, no information on the morphological origin of particular segments is contained in the phonetic realisation, rendering an explanation on durational differences between types of S on morphological grounds improbable.</p>
<p>In prosodic phonology (e.g., <xref ref-type="bibr" rid="B15">Booij, 1983</xref>), differences in phonetic realisation may arise from the position of sounds in different configurations of prosodic constituency. For instance, different types of word-final S can be analysed as being integrated at different levels of the hierarchical prosodic configuration. In the case of word-final S, different levels co-determine differing degrees of integration of an S to the word it belongs to. Non-morphemic S, uncontroversially, is an integral part of the prosodic word itself (<xref ref-type="bibr" rid="B58">Selkirk, 1996</xref>), see (A) of <xref ref-type="fig" rid="F1">Figure 1</xref>. For plural S, <xref ref-type="bibr" rid="B26">Goad (1998)</xref> analyses it as an &#x201C;internal clitic&#x201D;, see (B), while <xref ref-type="bibr" rid="B27">Goad (2002)</xref> analyses it as an &#x201C;affixal clitic&#x201D;, see (C).</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>Prosodic configuration for <bold>(A)</bold> non-morphemic and <bold>(B,C)</bold> plural S.</p></caption>
<graphic xlink:href="fpsyg-12-680889-g001.tif"/>
</fig>
<p>Thus, the prosodic approach posits a structural prosodic difference between types of S. However, it is not so clear what particular phonetic effects these differences would predict. Most plausibly, a higher degree of integration would correlate with shorter durations, predicting shortest S durations in monomorphemic words. Yet, findings on S duration show the opposite (e.g., <xref ref-type="bibr" rid="B77">Zimmermann, 2016</xref>; <xref ref-type="bibr" rid="B47">Plag et al., 2017</xref>; <xref ref-type="bibr" rid="B64">Tomaschek et al., 2019</xref>; <xref ref-type="bibr" rid="B57">Schmitz et al., 2020</xref>), i.e., the duration of non-morphemic S is longest.</p>
<p>An alternative explanation for durational differences between different types of S can be found within the computational modelling framework of na&#x00EF;ve discriminative learning (NDL; e.g., <xref ref-type="bibr" rid="B49">Ramscar and Yarlett, 2007</xref>; <xref ref-type="bibr" rid="B50">Ramscar et al., 2010</xref>; <xref ref-type="bibr" rid="B6">Baayen et al., 2011</xref>). NDL is based on simple but powerful principles of discriminative learning theory (<xref ref-type="bibr" rid="B72">Wagner and Rescorla, 1972</xref>; <xref ref-type="bibr" rid="B52">Rescorla, 1988</xref>), i.e., learning results from exposure to informative relations among events in the individual&#x2019;s environment. Such events are used to form associations between them, while the associations and their resulting representations are constantly updated on the basis of new experiences. Associations are built between features (&#x201C;cues&#x201D;, e.g., biphones) and content lexemes or morphological functions (&#x201C;outcomes&#x201D;, e.g., different types of S), which co-occur in events in which the individual is predicting outcomes from cues (<xref ref-type="bibr" rid="B64">Tomaschek et al., 2019</xref>: 11). Using the Rescorla-Wagner equations (<xref ref-type="bibr" rid="B53">Rescorla and Wagner, 1972</xref>; <xref ref-type="bibr" rid="B72">Wagner and Rescorla, 1972</xref>; <xref ref-type="bibr" rid="B52">Rescorla, 1988</xref>), relations between cues and outcomes are modelled. That is, the weight of an association, i.e., its strength, increases every time a cue and an outcome co-occur, while it decreases if a cue occurs without the outcome. The result of this process is a continuous recalibration of association strengths, which is a crucial part of discriminative learning.</p>
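<p>The update rule described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the implementation used in the NDL studies cited: the function name <monospace>rescorla_wagner_update</monospace>, the learning rate of 0.1, the maximum association strength of 1, and the two biphone cues are hypothetical choices made for the example, and only a single outcome is tracked.</p>
<preformat preformat-type="code" xml:space="preserve"><![CDATA[
```python
def rescorla_wagner_update(weights, cues, outcome_present, rate=0.1, lam=1.0):
    """Update cue-to-outcome weights for one learning event (single outcome).

    rate and lam are illustrative values, not parameters taken from the paper.
    """
    # Summed support for the outcome from all cues present in this event.
    total = sum(weights.get(cue, 0.0) for cue in cues)
    # The target is lam if the outcome occurs in the event, otherwise 0.
    target = lam if outcome_present else 0.0
    delta = rate * (target - total)
    # Every cue that is present receives the same adjustment.
    for cue in cues:
        weights[cue] = weights.get(cue, 0.0) + delta
    return weights


weights = {}
cues = ["ts", "s#"]  # hypothetical biphone cues

# Event 1: the cues co-occur with the outcome, so their weights increase.
rescorla_wagner_update(weights, cues, outcome_present=True)
after_cooccurrence = dict(weights)

# Event 2: the cues occur without the outcome, so their weights decrease.
rescorla_wagner_update(weights, cues, outcome_present=False)
after_absence = dict(weights)

print(after_cooccurrence)
print(after_absence)
```
]]></preformat>
<p>In this toy run, both weights rise to 0.1 after the co-occurrence and fall to roughly 0.08 after the event in which the outcome is absent, because the adjustment depends on the summed support of all cues that are present.</p>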
<p>NDL has been used successfully to model various morphological phenomena, e.g., reaction times in studies on morphological processing (e.g., <xref ref-type="bibr" rid="B6">Baayen et al., 2011</xref>; <xref ref-type="bibr" rid="B13">Blevins et al., 2016</xref>; see <xref ref-type="bibr" rid="B46">Plag, 2018</xref>, chapter 7 for an introduction to NDL in morphological research). For word-final S, <xref ref-type="bibr" rid="B64">Tomaschek et al. (2019)</xref> reproduce the differences in duration found by <xref ref-type="bibr" rid="B47">Plag et al. (2017)</xref> by means of NDL measures. Their study shows that the duration of different types of S can be approximated by considering the support for these morphological functions from a word&#x2019;s sublexical and collocational properties. In the NDL network, all words and their diphones within a five word window centred on the target word that contained the S served as cues, and were associated with the morphological functions, which served as outcomes. Two main measurements from this network emerged as predictive for S duration. First, the so-called &#x201C;activation&#x201D; as a measure of an outcome&#x2019;s baseline activation, i.e., of how well an outcome is entrenched in the lexicon. Second, the so-called &#x201C;activation diversity&#x201D; as a measure to quantify the extent to which the cues in a given context also support other targets. Taken together, the following pattern for S duration emerges: When the uncertainty about a targeted outcome increases, i.e., the level of &#x201C;activation&#x201D; decreases and the level of &#x201C;activation diversity&#x201D; increases, the duration of S decreases. In other words: The stronger the support for a morphological function is, both from long-term entrenchment and short-term from the context, the longer its duration.</p>
<p>While NDL implementations apparently offer some form of explanation for different durations of different types of S, they also come with shortcomings and limitations. In NDL, a word&#x2019;s meaning is defined in terms of the presence or absence of an outcome, i.e., NDL &#x201C;adopted a stark form of naive realism&#x201D; (<xref ref-type="bibr" rid="B5">Baayen et al., 2019b</xref>: 4) just for computational reasons. That is, NDL takes into account that words tend to have similar forms, but ignores that words are also similar in meaning. Thus, <xref ref-type="bibr" rid="B5">Baayen et al. (2019b)</xref> introduced semantic vectors of reals replacing the binarily coded row vectors of the semantic matrix (see Section &#x201C;The S Matrix: Semantic Vectors&#x201D;), naming their new implementation linear discriminative learning (LDL) instead of na&#x00EF;ve discriminative learning. Outcomes are no longer assumed to be independent, i.e., semantic similarities are now reflected, and networks are mathematically equivalent to linear mappings of matrices, i.e., vector spaces. It is the implementation of such linear discriminative learning that the present paper makes use of for analysing the duration of word-final types of S. Our paper explores whether measures derived from an LDL implementation are predictive of different types of S and their durations. In order to better understand the relation between traditional psycholinguistic variables (such as lexical frequencies, neighbourhood densities, bigram probabilities, morphological category etc.) and LDL measurements we also compare models that use measures derived from an LDL implementation with models that use traditional measures to predict S durations. Finally, we test whether measures derived from an LDL implementation render the specification of morphological structure proper (affix vs. no affix) as predictor variable for S duration unnecessary.</p>
</sec>
<sec id="S3">
<title>Introduction to LDL</title>
<sec id="S3.SS1">
<title>Overview</title>
<p>Linear discriminative learning as a computational model implements a discriminative view of learning. In contrast to deep learning models that have multiple hidden layers based on non-linear functions, LDL networks are very simple two-layer networks and are linguistically transparent and interpretable. In LDL, the mental lexicon consists of five high-dimensional numeric matrices, each of which represents a different subsystem: the visual matrix, retina; the auditory matrix, cochlea; the semantic matrix; the speech matrix, speaking; and the spelling matrix, typing. For the current implementation, the semantic and the speech matrix are most important.</p>
<p>With regard to the mappings between vectors, linear mappings are implemented. These mappings are estimated using the linear algebra of multivariate regression. Thus, each mapping is defined by a matrix <italic>A</italic> that transforms the row vectors in a matrix <italic>X</italic> into the row vectors of a matrix <italic>Y</italic>, i.e., <italic>Y</italic> = <italic>XA</italic>. Then, <italic>A</italic> = <italic>X</italic>&#x2032;<italic>Y</italic>, where <italic>X</italic>&#x2032; is the generalised inverse of <italic>X</italic>. We will return to the mapping of matrices in Section &#x201C;Comprehension and Production,&#x201D; and refer the interested reader to <xref ref-type="bibr" rid="B5">Baayen et al. (2019b)</xref> for an introduction to the mathematical details, as well as to <xref ref-type="bibr" rid="B41">Milin et al. (2017)</xref> for a detailed discussion on the restrictions and possibilities of linear mappings.</p>
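<p>As a rough numerical illustration of the mapping <italic>Y</italic> = <italic>XA</italic>, the following Python sketch estimates <italic>A</italic> for two toy matrices. It assumes that <italic>X</italic> has full column rank, in which case the generalised inverse can be computed as the inverse of the product of the transpose of <italic>X</italic> with <italic>X</italic>, multiplied by the transpose of <italic>X</italic>. The matrices and all helper names are invented for the example and are unrelated to the data and the R implementation of the present study.</p>
<preformat preformat-type="code" xml:space="preserve"><![CDATA[
```python
from fractions import Fraction


def transpose(m):
    return [list(col) for col in zip(*m)]


def matmul(a, b):
    b_cols = transpose(b)
    return [[sum(x * y for x, y in zip(row, col)) for col in b_cols] for row in a]


def inverse(m):
    """Gauss-Jordan inverse of a square, non-singular matrix (exact arithmetic)."""
    n = len(m)
    # Augment m with the identity matrix.
    aug = [[Fraction(v) for v in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(m)]
    for col in range(n):
        # Find a row with a non-zero pivot and swap it into place.
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        pivot_value = aug[col][col]
        aug[col] = [v / pivot_value for v in aug[col]]
        # Eliminate the current column from all other rows.
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def estimate_mapping(x, y):
    """Estimate A in Y = XA, assuming X has full column rank."""
    xt = transpose(x)
    x_ginv = matmul(inverse(matmul(xt, x)), xt)  # generalised inverse of X
    return matmul(x_ginv, y)


# Toy data: three rows ("words"), two input and two output dimensions.
X = [[1, 0], [0, 1], [1, 1]]
Y = [[2, 1], [0, 3], [2, 4]]

A = estimate_mapping(X, Y)
Y_hat = matmul(X, A)  # row vectors of X mapped through A
print(A)
print(Y_hat)
```
]]></preformat>
<p>Because the toy <italic>Y</italic> was constructed to lie exactly in the range of the mapping, the predicted matrix reproduces <italic>Y</italic> without error; with real data, the mapping would in general only approximate the target row vectors.</p>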
<p>Another important feature of LDL is its notion of lexomes, i.e., basic semantic units corresponding to words or morphological functions. As outlined in <xref ref-type="bibr" rid="B20">Chuang et al. (2020)</xref>, lexomes fall into two groups: content lexomes, and inflectional and derivational lexomes. Content lexomes can be morphologically simple or complex forms, e.g., <italic>cat</italic> and <italic>cats</italic>. Inflectional lexomes represent inflectional functions, e.g., number, tense, and aspect. Derivational lexomes represent derivational functions, e.g., morphological categories such as -<sc>NESS</sc>, -<sc>LESS</sc>, or <sc>UN</sc>-. Each lexome is paired with a vector in each of the aforementioned five subsystems. That is, for the semantic matrix, each lexome is paired with a semantic vector, making each lexome a pointer to a semantic vector on the one hand (<xref ref-type="bibr" rid="B41">Milin et al., 2017</xref>), and a location in a high-dimensional space on the other hand. For monomorphemic words, the semantic vector is identical to the semantic vector of the corresponding lexome. That is, the semantic vector of the word <italic>cat</italic>, <inline-formula><mml:math id="INEQ4"><mml:mover accent="true"><mml:mrow><mml:mi>c</mml:mi><mml:mi>a</mml:mi><mml:mi>t</mml:mi></mml:mrow><mml:mo>&#x2192;</mml:mo></mml:mover></mml:math></inline-formula>, is identical to the vector of the lexome <sc>CAT</sc>. For complex words, the semantic vector is the sum of its corresponding lexome vectors.
That is, the semantic vector of the word <italic>cats</italic>, <inline-formula><mml:math id="INEQ5"><mml:mover accent="true"><mml:mrow><mml:mi>c</mml:mi><mml:mi>a</mml:mi><mml:mi>t</mml:mi><mml:mi>s</mml:mi></mml:mrow><mml:mo>&#x2192;</mml:mo></mml:mover></mml:math></inline-formula>, is the sum of the semantic vectors of the lexomes <sc>CAT</sc> and <sc>PLURAL</sc>, <inline-formula><mml:math id="INEQ6"><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>c</mml:mi><mml:mi>a</mml:mi><mml:mi>t</mml:mi></mml:mrow><mml:mo>&#x2192;</mml:mo></mml:mover><mml:mo>+</mml:mo><mml:mover accent="true"><mml:mrow><mml:mi>p</mml:mi><mml:mi>l</mml:mi><mml:mi>u</mml:mi><mml:mi>r</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi></mml:mrow><mml:mo>&#x2192;</mml:mo></mml:mover></mml:mrow></mml:math></inline-formula>. The implementation of LDL and the matrices necessary for the present paper are introduced in the subsequent sections. Please refer to <ext-link ext-link-type="uri" xlink:href="https://osf.io/zy7ar/?view_only=ef43a5caf6444270a56074027d7d6482">https://osf.io/zy7ar/?view_only=ef43a5caf6444270a56074027d7d6482</ext-link> for the full documentation of the data set, the implementation in R (<xref ref-type="bibr" rid="B51">R Core Team, 2020</xref>), and the R script.</p>
<sec id="S3.SS1.SSS1">
<title>The S Matrix: Semantic Vectors</title>
<p>The semantic matrix <italic>S</italic> contains semantic vectors of word forms on the basis of their corresponding lexomes. That is, the semantic vector <inline-formula><mml:math id="INEQ7"><mml:mover accent="true"><mml:mi>s</mml:mi><mml:mo>&#x2192;</mml:mo></mml:mover></mml:math></inline-formula> in <italic>S</italic> for a simplex word is identical to the vector of its corresponding lexome, while the semantic vector <inline-formula><mml:math id="INEQ8"><mml:mover accent="true"><mml:mi>s</mml:mi><mml:mo>&#x2192;</mml:mo></mml:mover></mml:math></inline-formula> in <italic>S</italic> for a complex word is the sum of the vectors of its corresponding lexomes, e.g., <inline-formula><mml:math id="INEQ9"><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>a</mml:mi><mml:mi>p</mml:mi><mml:mi>p</mml:mi><mml:mi>l</mml:mi><mml:mi>e</mml:mi></mml:mrow><mml:mo>&#x2192;</mml:mo></mml:mover><mml:mo>+</mml:mo><mml:mover accent="true"><mml:mrow><mml:mi>p</mml:mi><mml:mi>l</mml:mi><mml:mi>u</mml:mi><mml:mi>r</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi></mml:mrow><mml:mo>&#x2192;</mml:mo></mml:mover></mml:mrow></mml:math></inline-formula> for <italic>apples</italic> (<xref ref-type="bibr" rid="B5">Baayen et al., 2019b</xref>). Semantic vectors of lexomes can be derived in different ways (e.g., <xref ref-type="bibr" rid="B37">Landauer and Dumais, 1997</xref>; <xref ref-type="bibr" rid="B30">Jones and Mewhort, 2007</xref>; <xref ref-type="bibr" rid="B61">Shaoul and Westbury, 2010</xref>; <xref ref-type="bibr" rid="B40">Mikolov et al., 2013</xref>).</p>
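<p>The additive construction of semantic vectors can be sketched in a few lines. The sketch below is written in plain Python purely for illustration (the authors' implementation is in R); the three-dimensional lexome vectors and the names <italic>LEXOMES</italic> and <italic>semantic_vector</italic> are hypothetical and do not come from the study.</p>
<preformat>
```python
# Hypothetical three-dimensional lexome vectors; the values are made up
# for illustration and are not taken from the study.
LEXOMES = {
    "APPLE": [0.25, 0.5, 0.0],
    "PLURAL": [0.5, 0.125, 0.25],
}


def semantic_vector(labels):
    """Sum the vectors of all lexomes that make up a word form."""
    vectors = [LEXOMES[label] for label in labels]
    return [sum(values) for values in zip(*vectors)]


# Simplex word: the semantic vector equals the lexome vector.
apple = semantic_vector(["APPLE"])
# Complex word: the semantic vector is the sum of both lexome vectors.
apples = semantic_vector(["APPLE", "PLURAL"])
```
</preformat>
<p>Under these assumed values, <italic>apple</italic> keeps the vector of <sc>APPLE</sc>, whereas <italic>apples</italic> is shifted by the vector of <sc>PLURAL</sc>.</p>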
</sec>
<sec id="S3.SS1.SSS2">
<title>The C Matrix: Form Vectors</title>
<p>The present study uses triphones to represent form, as previous studies (<xref ref-type="bibr" rid="B41">Milin et al., 2017</xref>; <xref ref-type="bibr" rid="B5">Baayen et al., 2019b</xref>; <xref ref-type="bibr" rid="B20">Chuang et al., 2020</xref>) have shown that triphones capture the variability of neighbouring phonological information well for English. Triphones are sequences of three phones within a word form. They overlap and can be understood as proxies for phonetic transitions. The cue matrix <italic>C</italic> encodes the forms of words in a binary fashion, giving information on which triphones are part of which word. This is illustrated in (1). In each word&#x2019;s individual form vector <inline-formula><mml:math id="INEQ10"><mml:mover accent="true"><mml:mi>c</mml:mi><mml:mo>&#x2192;</mml:mo></mml:mover></mml:math></inline-formula>, the presence of a triphone is marked with 1, while the absence is marked with 0. The cue vectors of all words of a set of words constitute its <italic>C</italic> matrix. That is, each row in such a <italic>C</italic> matrix represents a word form, while the columns of the respective <italic>C</italic> matrix represent all triphones of its underlying word set.</p>
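<p>As a minimal sketch of this encoding, the following plain Python code splits a transcription into overlapping triphones and builds a binary word-by-triphone matrix for the three-word toy lexicon shown in (1). The helper names <italic>triphones</italic> and <italic>toy_cue_matrix</italic> are hypothetical; in the study itself, cue matrices are created with the WpmWithLdl package in R, whose internals are not reproduced here.</p>
<preformat>
```python
def triphones(form):
    """Return the overlapping triphones of a transcription, using # as boundary."""
    padded = "#" + form + "#"
    return [padded[i:i + 3] for i in range(len(padded) - 2)]


def toy_cue_matrix(forms):
    """Build a binary word-by-triphone matrix.

    Columns are ordered by first occurrence of each triphone (an assumption
    made here only to mirror the layout of the toy example).
    """
    cues = []
    for form in forms.values():
        for tri in triphones(form):
            if tri not in cues:
                cues.append(tri)
    rows = {}
    for word, form in forms.items():
        present = set(triphones(form))
        rows[word] = [1 if cue in present else 0 for cue in cues]
    return cues, rows


# Toy lexicon in DISC notation: cat = k{t, bus = bVs, eel = il
toy = {"cat": "k{t", "bus": "bVs", "eel": "il"}
cues, C = toy_cue_matrix(toy)
```
</preformat>
<p>Each row of the resulting matrix marks the triphones of one word with 1 and all other triphones of the word set with 0.</p>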
</sec>
<sec id="S3.SS1.SSS3">
<title>Comprehension and Production</title>
<p>In LDL, comprehension refers to a model that has form vectors as input and semantic vectors as output. We illustrate the <italic>C</italic> matrix of a set of words with a toy lexicon containing the words <italic>cat</italic>, <italic>bus</italic>, and <italic>eel</italic> in (1). Here, the DISC keyboard phonetic alphabet (the &#x201C;Distinct Single Character&#x201D; representation introduced by <xref ref-type="bibr" rid="B16">Burnage, 1988</xref>) is used for triphone representation. Word boundaries are marked by the # symbol.</p>
<disp-formula id="S3.Ex1"><mml:math id="M1"><mml:mtable columnalign="center center center center center center center center" rowspacing="4pt" columnspacing="1em" /><mml:mi>C</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mtable rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd><mml:mtext>&#x00A0;</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>c</mml:mi><mml:mi>a</mml:mi><mml:mi>t</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>b</mml:mi><mml:mi>u</mml:mi><mml:mi>s</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>e</mml:mi><mml:mi>e</mml:mi><mml:mi>l</mml:mi></mml:mtd></mml:mtr></mml:mtable><mml:mtable rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd><mml:mspace width="-6pt" /><mml:mi mathvariant="normal">&#x0023;</mml:mi><mml:mi>k</mml:mi><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mtext>&#x00A0;&#x00A0;</mml:mtext><mml:mi>k</mml:mi><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mi>t</mml:mi><mml:mtext>&#x00A0;&#x00A0;</mml:mtext><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mi>t</mml:mi><mml:mi mathvariant="normal">&#x0023;</mml:mi><mml:mtext>&#x00A0;&#x00A0;</mml:mtext><mml:mi mathvariant="normal">&#x0023;</mml:mi><mml:mi>b</mml:mi><mml:mi>V</mml:mi><mml:mtext>&#x00A0;&#x00A0;</mml:mtext><mml:mi>b</mml:mi><mml:mi>V</mml:mi><mml:mi>s</mml:mi><mml:mtext>&#x00A0;&#x00A0;</mml:mtext><mml:mi>V</mml:mi><mml:mi>s</mml:mi><mml:mi mathvariant="normal">&#x0023;</mml:mi><mml:mtext>&#x00A0;&#x00A0;</mml:mtext><mml:mi mathvariant="normal">&#x0023;</mml:mi><mml:mi>i</mml:mi><mml:mi>l</mml:mi><mml:mtext>&#x00A0;&#x00A0;</mml:mtext><mml:mi>i</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">&#x0023;</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mo>(</mml:mo><mml:mtable columnalign="center center center center center center center center" rowspacing="4pt" 
columnspacing="1em"><mml:mtr><mml:mtd><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext><mml:mn>1</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd><mml:mtd><mml:mn>1</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd><mml:mtd><mml:mn>1</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd><mml:mtd><mml:mn>0</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd><mml:mtd><mml:mn>0</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd><mml:mtd><mml:mn>0</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd><mml:mtd><mml:mn>0</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd><mml:mtd><mml:mn>0</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext><mml:mn>0</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd><mml:mtd><mml:mn>0</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd><mml:mtd><mml:mn>0</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd><mml:mtd><mml:mn>1</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd><mml:mtd><mml:mn>1</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd><mml:mtd><mml:mn>1</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd><mml:mtd><mml:mn>0</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd><mml:mtd><mml:mn>0</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext><mml:mn>0</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd><mml:mtd><mml:mn>0</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd><mml:mtd><mml:mn>0</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd><mml:mtd><mml:mn>0</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd><mml:mtd><mml:mn>0</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd><mml:mtd><mml:mn>0</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd><mml:mtd><mml:mn>1</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd><mml:mtd><mml:mn>1</mml:mn><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext></mml:mtd></mml:mtr></mml:mtable><mml:mo>)</mml:mo></mml:mrow><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>For the same toy lexicon, suppose that the semantic vectors for these three words are the row vectors of the following <italic>S</italic> matrix:</p>
<disp-formula id="S3.Ex2"><mml:math id="M2"><mml:mtable columnalign="center center center" rowspacing="4pt" columnspacing="1em" /><mml:mi>S</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mtable rowspacing="4pt" columnspacing="1em" /><mml:mtable rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd><mml:mtext>&#x00A0;</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>c</mml:mi><mml:mi>a</mml:mi><mml:mi>t</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>b</mml:mi><mml:mi>u</mml:mi><mml:mi>s</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>e</mml:mi><mml:mi>e</mml:mi><mml:mi>l</mml:mi></mml:mtd></mml:mtr></mml:mtable><mml:mtable rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd><mml:mspace width="-5pt" /><mml:mi>c</mml:mi><mml:mi>a</mml:mi><mml:mi>t</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>b</mml:mi><mml:mi>u</mml:mi><mml:mi>s</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>e</mml:mi><mml:mi>e</mml:mi><mml:mi>l</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mo>(</mml:mo><mml:mtable columnalign="center center center" rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd><mml:mn>1.0</mml:mn></mml:mtd><mml:mtd><mml:mn>0.2</mml:mn></mml:mtd><mml:mtd><mml:mn>0.5</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0.4</mml:mn></mml:mtd><mml:mtd><mml:mn>1.0</mml:mn></mml:mtd><mml:mtd><mml:mn>0.1</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0.2</mml:mn></mml:mtd><mml:mtd><mml:mn>0.3</mml:mn></mml:mtd><mml:mtd><mml:mn>1.0</mml:mn></mml:mtd></mml:mtr></mml:mtable><mml:mo>)</mml:mo></mml:mrow><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>To map forms onto meanings we need transformation matrix <italic>F</italic>, such that</p>
<disp-formula id="S3.Ex3"><mml:math id="M3"><mml:mrow><mml:mrow><mml:mrow><mml:mi>C</mml:mi><mml:mpadded width="+3.3pt"><mml:mi>F</mml:mi></mml:mpadded></mml:mrow><mml:mo rspace="5.8pt">=</mml:mo><mml:mi>S</mml:mi></mml:mrow><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
<p>The transformation matrix <italic>F</italic> is straightforward to obtain. Let <italic>C</italic>&#x2032; denote the Moore-Penrose generalised inverse<sup><xref ref-type="fn" rid="footnote1">1</xref></sup> of <italic>C</italic>, available in R as the <italic>ginv</italic> function of the MASS package (<xref ref-type="bibr" rid="B70">Venables and Ripley, 2002</xref>). Then,</p>
<disp-formula id="S3.Ex4"><mml:math id="M4"><mml:mrow><mml:mrow><mml:mpadded width="+3.3pt"><mml:mi>F</mml:mi></mml:mpadded><mml:mo rspace="5.8pt">=</mml:mo><mml:mrow><mml:msup><mml:mi>C</mml:mi><mml:mo>&#x2032;</mml:mo></mml:msup><mml:mi>S</mml:mi></mml:mrow></mml:mrow><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
<p>For the toy lexicon example,</p>
<disp-formula id="S3.Ex5"><mml:math id="M5"><mml:mtable columnalign="center center center" rowspacing="4pt" columnspacing="1em" /><mml:mi>F</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mtable rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd><mml:mtext>&#x00A0;</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi mathvariant="normal">&#x0023;</mml:mi><mml:mi>k</mml:mi><mml:mo fence="false" stretchy="false">{</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>k</mml:mi><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mi>t</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mi>t</mml:mi><mml:mi mathvariant="normal">&#x0023;</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi mathvariant="normal">&#x0023;</mml:mi><mml:mi>b</mml:mi><mml:mi>V</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>b</mml:mi><mml:mi>V</mml:mi><mml:mi>s</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>V</mml:mi><mml:mi>s</mml:mi><mml:mi mathvariant="normal">&#x0023;</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi mathvariant="normal">&#x0023;</mml:mi><mml:mi>i</mml:mi><mml:mi>l</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>i</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">&#x0023;</mml:mi></mml:mtd></mml:mtr></mml:mtable><mml:mtable rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd><mml:mspace width="-5pt" /><mml:mi>c</mml:mi><mml:mi>a</mml:mi><mml:mi>t</mml:mi><mml:mspace width="1em" /><mml:mi>b</mml:mi><mml:mi>u</mml:mi><mml:mi>s</mml:mi><mml:mspace width="1em" /><mml:mi>e</mml:mi><mml:mi>e</mml:mi><mml:mi>l</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mo>(</mml:mo><mml:mtable columnalign="center center center" rowspacing="4pt" 
columnspacing="1em"><mml:mtr><mml:mtd><mml:mn>0.33</mml:mn></mml:mtd><mml:mtd><mml:mn>0.06</mml:mn></mml:mtd><mml:mtd><mml:mn>0.16</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0.33</mml:mn></mml:mtd><mml:mtd><mml:mn>0.06</mml:mn></mml:mtd><mml:mtd><mml:mn>0.16</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0.33</mml:mn></mml:mtd><mml:mtd><mml:mn>0.06</mml:mn></mml:mtd><mml:mtd><mml:mn>0.16</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0.13</mml:mn></mml:mtd><mml:mtd><mml:mn>0.33</mml:mn></mml:mtd><mml:mtd><mml:mn>0.03</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0.13</mml:mn></mml:mtd><mml:mtd><mml:mn>0.33</mml:mn></mml:mtd><mml:mtd><mml:mn>0.03</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0.13</mml:mn></mml:mtd><mml:mtd><mml:mn>0.33</mml:mn></mml:mtd><mml:mtd><mml:mn>0.03</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0.10</mml:mn></mml:mtd><mml:mtd><mml:mn>0.15</mml:mn></mml:mtd><mml:mtd><mml:mn>0.50</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0.10</mml:mn></mml:mtd><mml:mtd><mml:mn>0.15</mml:mn></mml:mtd><mml:mtd><mml:mn>0.50</mml:mn></mml:mtd></mml:mtr></mml:mtable><mml:mo>)</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>with <italic>CF</italic> being exactly equal to <italic>S</italic> in this simple example. That is, taking form vectors as input for the prediction of semantic vectors as output, i.e., solving <inline-formula><mml:math id="INEQ12"><mml:mrow><mml:mpadded width="+3.3pt"><mml:mover accent="true"><mml:mi>S</mml:mi><mml:mo stretchy="false">^</mml:mo></mml:mover></mml:mpadded><mml:mo rspace="5.8pt">=</mml:mo><mml:mrow><mml:mi>C</mml:mi><mml:mi>F</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>, this toy example correctly predicts 100% of all (three) words&#x2019; semantics, i.e., <inline-formula><mml:math id="INEQ13"><mml:mrow><mml:mpadded width="+3.3pt"><mml:msub><mml:mover accent="true"><mml:mi>s</mml:mi><mml:mo stretchy="false">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub></mml:mpadded><mml:mo rspace="5.8pt">=</mml:mo><mml:msub><mml:mi>s</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. In more complex cases, predicted and targeted semantic vectors are only approximately identical. Thus, for a word <italic>i</italic> and its predicted semantic vector <inline-formula><mml:math id="INEQ14"><mml:msub><mml:mover accent="true"><mml:mi>s</mml:mi><mml:mo stretchy="false">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub></mml:math></inline-formula>, comprehension is successful if <inline-formula><mml:math id="INEQ15"><mml:msub><mml:mover accent="true"><mml:mi>s</mml:mi><mml:mo stretchy="false">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub></mml:math></inline-formula> shows the highest correlation with the targeted semantic vector <italic>s<sub>i</sub></italic> (<xref ref-type="bibr" rid="B5">Baayen et al., 2019b</xref>). Following this method, one can report the percentage of comprehension accuracy.</p>
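<p>The comprehension mapping for the toy lexicon can be retraced with the following sketch in plain Python. It is not the authors' R code: instead of a general Moore-Penrose routine such as <italic>ginv</italic>, it uses the identity <italic>C</italic>&#x2032; = <italic>C</italic><sup>T</sup>(<italic>CC</italic><sup>T</sup>)<sup>&#x2212;1</sup>, which holds only because the rows of the toy <italic>C</italic> matrix are linearly independent. All helper names (<italic>transpose</italic>, <italic>matmul</italic>, <italic>inverse</italic>, <italic>pearson</italic>) are hypothetical.</p>
<preformat>
```python
def transpose(m):
    return [list(col) for col in zip(*m)]


def matmul(a, b):
    cols = transpose(b)
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def inverse(m):
    """Gauss-Jordan inverse of a small, invertible square matrix."""
    n = len(m)
    aug = [[float(v) for v in row] + [float(i == j) for j in range(n)]
           for i, row in enumerate(m)]
    for c in range(n):
        pivot = max(range(c, n), key=lambda r: abs(aug[r][c]))
        aug[c], aug[pivot] = aug[pivot], aug[c]
        p = aug[c][c]
        aug[c] = [v / p for v in aug[c]]
        for r in range(n):
            if r != c:
                f = aug[r][c]
                aug[r] = [v - f * w for v, w in zip(aug[r], aug[c])]
    return [row[n:] for row in aug]


def pearson(u, v):
    mu, mv = sum(u) / len(u), sum(v) / len(v)
    num = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    den = (sum((a - mu) ** 2 for a in u) * sum((b - mv) ** 2 for b in v)) ** 0.5
    return num / den


# Toy matrices from (1) and the S matrix above (rows: cat, bus, eel).
C = [[1, 1, 1, 0, 0, 0, 0, 0],
     [0, 0, 0, 1, 1, 1, 0, 0],
     [0, 0, 0, 0, 0, 0, 1, 1]]
S = [[1.0, 0.2, 0.5],
     [0.4, 1.0, 0.1],
     [0.2, 0.3, 1.0]]

# Pseudo-inverse via C^T (C C^T)^-1; valid here because C has full row rank.
Ct = transpose(C)
C_pinv = matmul(Ct, inverse(matmul(C, Ct)))
F = matmul(C_pinv, S)   # F = C'S
S_hat = matmul(C, F)    # predicted semantics

# A word counts as understood if its predicted vector correlates most
# strongly with its own targeted vector.
hits = sum(
    max(range(len(S)), key=lambda j: pearson(S_hat[i], S[j])) == i
    for i in range(len(S))
)
accuracy = hits / len(S)
```
</preformat>
<p>For realistic lexicons, where <italic>C</italic> is large and not of full row rank, a general pseudo-inverse routine would be needed in place of this shortcut.</p>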
<p>Production as modelled in LDL takes semantic vectors as input and delivers form vectors as output. Using the same toy lexicon as before, we adapt its <italic>C</italic> matrix, i.e., we borrow the notation by <xref ref-type="bibr" rid="B5">Baayen et al. (2019b)</xref> and henceforth call it <italic>T</italic> as it contains the Targeted triphones. For production, the transformation matrix <italic>G</italic> is of interest. Similar to <italic>F</italic> for comprehension, it is straightforward to obtain. Let <italic>S</italic>&#x2032; denote the Moore-Penrose generalised inverse of <italic>S</italic>. Then,</p>
<disp-formula id="S3.Ex6"><mml:math id="M6"><mml:mrow><mml:mrow><mml:mpadded width="+3.3pt"><mml:mi>G</mml:mi></mml:mpadded><mml:mo rspace="5.8pt">=</mml:mo><mml:mrow><mml:msup><mml:mi>S</mml:mi><mml:mo>&#x2032;</mml:mo></mml:msup><mml:mi>T</mml:mi></mml:mrow></mml:mrow><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
<p>Given <italic>G</italic>, one can then predict the triphone matrix <inline-formula><mml:math id="INEQ17"><mml:mover accent="true"><mml:mi>T</mml:mi><mml:mo stretchy="false">^</mml:mo></mml:mover></mml:math></inline-formula> from the semantic matrix <italic>S</italic> by solving</p>
<disp-formula id="S3.Ex7"><mml:math id="M7"><mml:mrow><mml:mrow><mml:mpadded width="+3.3pt"><mml:mover accent="true"><mml:mi>T</mml:mi><mml:mo stretchy="false">^</mml:mo></mml:mover></mml:mpadded><mml:mo rspace="5.8pt">=</mml:mo><mml:mrow><mml:mi>S</mml:mi><mml:mi>G</mml:mi></mml:mrow></mml:mrow><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
<p>For our toy lexicon example, the <italic>G</italic> transformation matrix is</p>
<disp-formula id="S3.Ex8"><mml:math id="M8"><mml:mtable columnalign="right center left" rowspacing="3pt" columnspacing="0 thickmathspace" displaystyle="true"><mml:mtr><mml:mtd><mml:mtable columnalign="center center center center center center center center" rowspacing="4pt" columnspacing="1em" /><mml:mtable rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd><mml:mtext>&#x00A0;</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mtext>&#x00A0;</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>c</mml:mi><mml:mi>a</mml:mi><mml:mi>t</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>b</mml:mi><mml:mi>u</mml:mi><mml:mi>s</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>e</mml:mi><mml:mi>e</mml:mi><mml:mi>l</mml:mi></mml:mtd></mml:mtr></mml:mtable><mml:mtable columnalign="left" rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd><mml:mi>G</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x00A0;</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext><mml:mi mathvariant="normal">&#x0023;</mml:mi><mml:mi>k</mml:mi><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mspace width="1em" /><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext><mml:mi>k</mml:mi><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mi>t</mml:mi><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext><mml:mspace width="1em" /><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mi>t</mml:mi><mml:mi mathvariant="normal">&#x0023;</mml:mi><mml:mtext>&#x00A0;&#x00A0;</mml:mtext><mml:mspace width="1em" /><mml:mi mathvariant="normal">&#x0023;</mml:mi><mml:mi>b</mml:mi><mml:mi>V</mml:mi><mml:mtext>&#x00A0;&#x00A0;&#x00A0;&#x00A0;&#x00A0;&#x00A0;</mml:mtext><mml:mi>b</mml:mi><mml:mi>V</mml:mi><mml:mi>s</mml:mi><mml:mspace width="1em" /><mml:mtext>&#x00A0;&#x00A0;</mml:mtext><mml:mi>V</mml:mi><mml:mi>s</mml:mi><mml:mi mathvariant="normal">&#x0023;</mml:mi><mml:mspace width="1em" 
/><mml:mtext>&#x00A0;</mml:mtext><mml:mi mathvariant="normal">&#x0023;</mml:mi><mml:mi>i</mml:mi><mml:mi>l</mml:mi><mml:mspace width="1em" /><mml:mtext>&#x00A0;&#x00A0;</mml:mtext><mml:mi>i</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">&#x0023;</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mo>(</mml:mo><mml:mtable columnalign="right right right right right right right right" rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd><mml:mn>1.14</mml:mn></mml:mtd><mml:mtd><mml:mn>1.14</mml:mn></mml:mtd><mml:mtd><mml:mn>1.14</mml:mn></mml:mtd><mml:mtd><mml:mo>&#x2212;</mml:mo><mml:mn>0.06</mml:mn></mml:mtd><mml:mtd><mml:mo>&#x2212;</mml:mo><mml:mn>0.06</mml:mn></mml:mtd><mml:mtd><mml:mo>&#x2212;</mml:mo><mml:mn>0.06</mml:mn></mml:mtd><mml:mtd><mml:mo>&#x2212;</mml:mo><mml:mn>0.56</mml:mn></mml:mtd><mml:mtd><mml:mo>&#x2212;</mml:mo><mml:mn>0.56</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mo>&#x2212;</mml:mo><mml:mn>0.44</mml:mn></mml:mtd><mml:mtd><mml:mo>&#x2212;</mml:mo><mml:mn>0.44</mml:mn></mml:mtd><mml:mtd><mml:mo>&#x2212;</mml:mo><mml:mn>0.44</mml:mn></mml:mtd><mml:mtd><mml:mn>1.05</mml:mn></mml:mtd><mml:mtd><mml:mn>1.05</mml:mn></mml:mtd><mml:mtd><mml:mn>1.05</mml:mn></mml:mtd><mml:mtd><mml:mn>0.12</mml:mn></mml:mtd><mml:mtd><mml:mn>0.12</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mo>&#x2212;</mml:mo><mml:mn>0.09</mml:mn></mml:mtd><mml:mtd><mml:mo>&#x2212;</mml:mo><mml:mn>0.09</mml:mn></mml:mtd><mml:mtd><mml:mo>&#x2212;</mml:mo><mml:mn>0.09</mml:mn></mml:mtd><mml:mtd><mml:mo>&#x2212;</mml:mo><mml:mn>0.30</mml:mn></mml:mtd><mml:mtd><mml:mo>&#x2212;</mml:mo><mml:mn>0.30</mml:mn></mml:mtd><mml:mtd><mml:mo>&#x2212;</mml:mo><mml:mn>0.30</mml:mn></mml:mtd><mml:mtd><mml:mn>1.08</mml:mn></mml:mtd><mml:mtd><mml:mn>1.08</mml:mn></mml:mtd></mml:mtr></mml:mtable><mml:mo>)</mml:mo></mml:mrow><mml:mspace width="negativethinmathspace" /><mml:mspace width="negativethinmathspace" 
/><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>As this is a toy example, <italic>SG</italic> is identical to <italic>T</italic>. For more complex cases, <inline-formula><mml:math id="INEQ18"><mml:mover accent="true"><mml:mi>T</mml:mi><mml:mo stretchy="false">^</mml:mo></mml:mover></mml:math></inline-formula> will not be identical to <italic>T</italic> &#x201C;but will be an approximation of it that is optimal in the least squares sense&#x201D; (<xref ref-type="bibr" rid="B5">Baayen et al., 2019b</xref>: 21). Triphones with strongest support are expected to be the triphones making up a word&#x2019;s form. As triphones are not ordered, it is also checked whether the sequence of phones can be constructed correctly. Both checks, of triphone support and of sequence, are conveniently done by the functions of the WpmWithLdl package (<xref ref-type="bibr" rid="B4">Baayen et al., 2019a</xref>). Following this method, one can report the percentage of production accuracy.</p>
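<p>The production mapping of the toy lexicon can be retraced in the same way. The sketch below is plain Python rather than the authors' R code. Because the toy <italic>S</italic> matrix is square and invertible, its generalised inverse coincides with the ordinary inverse, which is what the hypothetical helper <italic>inverse</italic> computes. The support threshold of 0.5 is an assumption made only for this illustration, and the check of phone order performed by WpmWithLdl is not reproduced.</p>
<preformat>
```python
def transpose(m):
    return [list(col) for col in zip(*m)]


def matmul(a, b):
    cols = transpose(b)
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def inverse(m):
    """Gauss-Jordan inverse of a small, invertible square matrix."""
    n = len(m)
    aug = [[float(v) for v in row] + [float(i == j) for j in range(n)]
           for i, row in enumerate(m)]
    for c in range(n):
        pivot = max(range(c, n), key=lambda r: abs(aug[r][c]))
        aug[c], aug[pivot] = aug[pivot], aug[c]
        p = aug[c][c]
        aug[c] = [v / p for v in aug[c]]
        for r in range(n):
            if r != c:
                f = aug[r][c]
                aug[r] = [v - f * w for v, w in zip(aug[r], aug[c])]
    return [row[n:] for row in aug]


words = ["cat", "bus", "eel"]
cues = ["#k{", "k{t", "{t#", "#bV", "bVs", "Vs#", "#il", "il#"]
T = [[1, 1, 1, 0, 0, 0, 0, 0],
     [0, 0, 0, 1, 1, 1, 0, 0],
     [0, 0, 0, 0, 0, 0, 1, 1]]
S = [[1.0, 0.2, 0.5],
     [0.4, 1.0, 0.1],
     [0.2, 0.3, 1.0]]

G = matmul(inverse(S), T)   # G = S'T, with S' = S^-1 for this invertible S
T_hat = matmul(S, G)        # predicted triphone support

# Assumed decision rule: keep triphones whose support exceeds 0.5.
produced = {
    word: [cue for cue, support in zip(cues, row) if support > 0.5]
    for word, row in zip(words, T_hat)
}
```
</preformat>
<p>Rounded to two decimals, the first row of <italic>G</italic> obtained this way agrees with the values given for <italic>cat</italic> in the <italic>G</italic> matrix above.</p>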
<p><xref ref-type="fig" rid="F2">Figure 2</xref> summarises the mapping between form and meaning by the <italic>F</italic> and <italic>G</italic> transformation matrices for comprehension and production modelling.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>Illustration of mapping between <italic>C</italic> and <italic>S</italic> matrix via <italic>F</italic> (i.e., comprehension), and <italic>S</italic> and <italic>C</italic> matrix via <italic>G</italic> (i.e., production). In production, <italic>C</italic> is referred to as <italic>T</italic>.</p></caption>
<graphic xlink:href="fpsyg-12-680889-g002.tif"/>
</fig>
</sec>
</sec>
</sec>
<sec id="S4">
<title>Combining Real Words and Pseudowords in an LDL Implementation</title>
<sec id="S4.SS1">
<title>The Semantics of Pseudowords</title>
<p>The present paper follows the implementational basics outlined in Section &#x201C;Introduction to LDL.&#x201D; However, as we are interested in /s/ durations in pseudowords (and not in real words), there are a number of complications. The most important complication arises from the widely shared belief that pseudowords do not have meaning. So how can we map form and meaning with forms that have no meaning? In a recent study (<xref ref-type="bibr" rid="B20">Chuang et al., 2020</xref>) it was shown that the assumption that pseudowords are devoid of meaning is most probably wrong. Due to their formal similarity with existing words, pseudowords resonate with the lexicon. As a result, they may in fact carry meaning. The authors demonstrate that quantitative measures gauging the semantic neighbourhoods of pseudowords predict lexical decision reaction times and acoustic durations. The present study is inspired by these results and implements a similar architecture. To model resonance of pseudowords with the lexicon, both real words and pseudowords must be included in the networks. The following sections will detail the combined LDL implementation of real words and pseudowords.</p>
</sec>
<sec id="S4.SS2">
<title>Data Set: Real Words and Pseudowords</title>
<p>The pseudowords and their phonetic realisations that this paper is based on are taken from the study of word-final /s/ production by <xref ref-type="bibr" rid="B57">Schmitz et al. (2020)</xref>. In their study, participants were given pictures of &#x201C;alien creatures&#x201D; and their respective names (which were the target pseudowords), a short explanation of a situation, and a question relevant to the situation which was to be answered aloud. For each participant, pairings of pictures and pseudowords were randomised. That is, each pseudoword was represented by different pictures across participants. By button-press, a question was given to elicit an answer with the pertinent type of S while the context slowly faded out. The fading out of the question forced participants not to rely on the reading-aloud of the given context. In total, 24 pairs of pseudowords were used in that study. Each pseudoword form can act as singular or plural noun, e.g., <italic>glaits</italic> is either realised as singular, i.e., <italic>a glaits</italic>, or as plural, i.e., <italic>two glaits</italic>. Additionally, some pseudowords show a number of different realisations by the participants in the experiment, e.g., <italic>prups</italic> is sometimes produced as /p&#x0279;&#x028C;ps/, and sometimes it is produced as /p&#x0279;ups/. Thus, not 48 (i.e., 2 &#x00D7; 24) but 78 different phonological forms are included in the pseudoword set. <xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref> gives an overview of all pseudowords and their phonological forms.</p>
<p>The second set of words contains real words and their phonetic realisations. Following <xref ref-type="bibr" rid="B20">Chuang et al. (2020)</xref>, we extracted these words from the MALD corpus (<xref ref-type="bibr" rid="B66">Tucker et al., 2019a</xref>). While the MALD corpus contains 26,793 real words, only a subset of 8,285 words is used for a number of reasons. First, some 7,577 words in the corpus contain multiple affixes. As it was unclear how to handle such words, these were excluded. Second, only words for which we have semantic vectors could be used, leading to the exclusion of a further 6,828 words. Third, only words with transcriptions available in the CELEX corpus (<xref ref-type="bibr" rid="B7">Baayen et al., 1995</xref>) were retained; no transcription was available for 818 words, which were therefore excluded. Fourth, 3,285 words showed ambiguities regarding their morphology, e.g., <italic>walks</italic> as a third person singular verb versus the plural of a noun. As huge numbers of words lead to extensive computation times, we decided to exclude such cases as well. The final set of real words contains 6,165 simple and 2,120 complex word forms.</p>
</sec>
<sec id="S4.SS3">
<title>Cue Matrices</title>
<p>As introduced in Section &#x201C;The C Matrix: Form Vectors,&#x201D; cue matrices are coded in binary form, giving information on which triphones are part of which word. For the current implementation, two such cue matrices are created using the <italic>make_cue_matrix</italic> function of the WpmWithLdl package (<xref ref-type="bibr" rid="B4">Baayen et al., 2019a</xref>). First, <italic>C</italic><sub><italic>rw</italic></sub>, the real word cue matrix, is created for the set of real words. Then, a second cue matrix, <italic>C</italic><sub><italic>pw</italic></sub>, is created for the set of pseudowords. However, <italic>C</italic><sub><italic>pw</italic></sub> is considerably smaller than <italic>C</italic><sub><italic>rw</italic></sub> as there are only 78 phonological forms for pseudowords, but more than 8,000 for real words. Thus, <italic>C</italic><sub><italic>rw</italic></sub> is of dimension 8285&#x00D7;7610, while <italic>C</italic><sub><italic>pw</italic></sub> is of dimension 78&#x00D7;78. We will come back to this issue of differing dimensions in the next section.</p>
</sec>
<sec id="S4.SS4">
<title>Semantic Matrices</title>
<p>To introduce semantics, i.e., semantic vectors, for the present set of real words, a pre-built semantic matrix <italic>A</italic> from <xref ref-type="bibr" rid="B5">Baayen et al. (2019b)</xref> was used. These authors derived semantic vectors based on the TASA corpus (<xref ref-type="bibr" rid="B29">Ivens and Koslin, 1991</xref>). For this, words were parsed into their lexomes, i.e., inflected words were represented by their stem and sense-disambiguated labels for their respective inflectional functions. Ambiguous forms, e.g., <italic>walks</italic>, were disambiguated using part-of-speech tagging (<xref ref-type="bibr" rid="B56">Schmid, 1999</xref>). Derived words were assigned a lexome for their stem and a lexome for derivational function. Then, following <xref ref-type="bibr" rid="B8">Baayen et al. (2016)</xref> and <xref ref-type="bibr" rid="B41">Milin et al. (2017)</xref>, Na&#x00EF;ve Discriminative Learning (<xref ref-type="bibr" rid="B6">Baayen et al., 2011</xref>; <xref ref-type="bibr" rid="B59">Sering et al., 2019</xref>) was used to build semantic vectors. The Rescorla-Wagner update rule (<xref ref-type="bibr" rid="B53">Rescorla and Wagner, 1972</xref>; <xref ref-type="bibr" rid="B72">Wagner and Rescorla, 1972</xref>; <xref ref-type="bibr" rid="B52">Rescorla, 1988</xref>) was applied incrementally to the sentences of the TASA corpus. That is, for each sentence the algorithm was given the task to predict the lexomes in that sentence from all lexomes of that sentence. This resulted in a 23562&#x00D7;23562 weight matrix <italic>A</italic>. This matrix lists all lexomes as rows and columns. Thus, row <italic>i</italic> contains the association strengths of the lexome at that row with all other lexomes, which are given as columns. In this state of the <italic>A</italic> matrix, lexomes predict themselves. 
Thus, the diagonal of the <italic>A</italic> matrix is set to zero (see <xref ref-type="bibr" rid="B5">Baayen et al., 2019b</xref>, for a discussion on this procedure). Lastly, columns which mostly contain zeros, i.e., no information, and show small variances (&#x03C3; &#x003C; 3.4 &#x00D7; 10<sup>&#x2212;8</sup>) are removed. The resulting <italic>A</italic> matrix is of dimension 23,562 &#x00D7; 5030. Following the method outlined in Section &#x201C;The S Matrix: Semantic Vectors,&#x201D; a semantic matrix for real words <italic>S</italic><sub><italic>rw</italic></sub> can be constructed based on <italic>A</italic>. That is, the semantic vector <inline-formula><mml:math id="INEQ24"><mml:mover accent="true"><mml:mi>s</mml:mi><mml:mo>&#x2192;</mml:mo></mml:mover></mml:math></inline-formula> in <italic>S</italic><sub><italic>rw</italic></sub> for a simplex word is identical to its corresponding lexome, while the semantic vector <inline-formula><mml:math id="INEQ25"><mml:mover accent="true"><mml:mi>s</mml:mi><mml:mo>&#x2192;</mml:mo></mml:mover></mml:math></inline-formula> in <italic>S</italic><sub><italic>rw</italic></sub> for a complex word is the sum of its corresponding lexomes. 
That is, the semantic vector of <italic>apple</italic> is <inline-formula><mml:math id="INEQ26"><mml:mover accent="true"><mml:mrow><mml:mi>a</mml:mi><mml:mi>p</mml:mi><mml:mi>p</mml:mi><mml:mi>l</mml:mi><mml:mi>e</mml:mi></mml:mrow><mml:mo>&#x2192;</mml:mo></mml:mover></mml:math></inline-formula>, while the semantic vector of <italic>apples</italic> is the sum of the vectors of the lexomes <sc>APPLE</sc> and <sc>PLURAL</sc>, i.e., <inline-formula><mml:math id="INEQ27"><mml:mrow><mml:mpadded width="+3.3pt"><mml:mover accent="true"><mml:mrow><mml:mi>a</mml:mi><mml:mi>p</mml:mi><mml:mi>p</mml:mi><mml:mi>l</mml:mi><mml:mi>e</mml:mi><mml:mi>s</mml:mi></mml:mrow><mml:mo>&#x2192;</mml:mo></mml:mover></mml:mpadded><mml:mo rspace="5.8pt">=</mml:mo><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>a</mml:mi><mml:mi>p</mml:mi><mml:mi>p</mml:mi><mml:mi>l</mml:mi><mml:mi>e</mml:mi></mml:mrow><mml:mo>&#x2192;</mml:mo></mml:mover><mml:mo>+</mml:mo><mml:mover accent="true"><mml:mrow><mml:mi>p</mml:mi><mml:mi>l</mml:mi><mml:mi>u</mml:mi><mml:mi>r</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi></mml:mrow><mml:mo>&#x2192;</mml:mo></mml:mover></mml:mrow></mml:mrow></mml:math></inline-formula>. As a set of real words is used, <italic>S</italic><sub><italic>rw</italic></sub> contains only semantic vectors for this set of real words (instead of, e.g., all word forms of the TASA corpus). The final real word semantic matrix <italic>S</italic><sub><italic>rw</italic></sub> is of dimension 8285 &#x00D7; 5487.</p>
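<p>The two clean-up steps applied to <italic>A</italic>, i.e., zeroing the diagonal and removing near-constant columns, can be sketched as follows in plain Python. The 3&#x00D7;3 matrix and the helper names are hypothetical. The statistic that the text labels &#x03C3; is treated here as a sample standard deviation, which is an assumption about the original procedure; only the threshold value is taken from the text.</p>
<preformat>
```python
def zero_diagonal(a):
    """Remove the self-prediction of lexomes by setting the diagonal to zero."""
    return [[0.0 if i == j else v for j, v in enumerate(row)]
            for i, row in enumerate(a)]


def column_sd(col):
    """Sample standard deviation of one column."""
    mean = sum(col) / len(col)
    return (sum((v - mean) ** 2 for v in col) / (len(col) - 1)) ** 0.5


def drop_flat_columns(a, threshold):
    """Keep only columns whose spread reaches the threshold."""
    keep = [j for j, col in enumerate(zip(*a)) if column_sd(col) >= threshold]
    return [[row[j] for j in keep] for row in a]


# Hypothetical lexome-by-lexome weights; the last column is uninformative
# once the diagonal has been zeroed.
A = [[0.9, 0.2, 0.0],
     [0.1, 0.8, 0.0],
     [0.3, 0.4, 0.7]]

reduced = drop_flat_columns(zero_diagonal(A), threshold=3.4e-8)
```
</preformat>
<p>In this made-up example, the third column contains only zeros after the diagonal is cleared and is therefore dropped, leaving a 3&#x00D7;2 matrix.</p>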
<p>While this procedure is rather straightforward, the creation of a pseudoword semantic matrix <italic>S</italic><sub><italic>pw</italic></sub> is not. Due to the nature of pseudowords, their lexomes are not contained within any corpus or, for that matter, within our <italic>A</italic> matrix. Instead, one can estimate a pseudoword&#x2019;s semantic content by utilising the semantic and phonological information on real words, i.e., their <italic>C</italic> and <italic>S</italic> matrices (<xref ref-type="bibr" rid="B20">Chuang et al., 2020</xref>). That is, the same transformation matrix <italic>F</italic> that is used for mapping real word cues onto predicted real word meanings (see Section &#x201C;Comprehension and Production&#x201D;) can be used to map pseudoword cues onto their estimated semantics. To this end, one must first solve</p>
<disp-formula id="S4.Ex9"><mml:math id="M9"><mml:mrow><mml:mpadded width="+3.3pt"><mml:mi>F</mml:mi></mml:mpadded><mml:mo rspace="5.8pt">=</mml:mo><mml:mrow><mml:mmultiscripts><mml:mi>C</mml:mi><mml:none/><mml:mo>&#x2032;</mml:mo><mml:mrow><mml:mi>r</mml:mi><mml:mi>w</mml:mi></mml:mrow><mml:none/></mml:mmultiscripts><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi>r</mml:mi><mml:mi>w</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mrow></mml:math></disp-formula>
<p>to obtain <italic>F</italic>. Then, one can make use of the pseudoword cue matrix <italic>C</italic><sub><italic>pw</italic></sub>, and estimate pseudoword semantics, as</p>
<disp-formula id="S4.Ex10"><mml:math id="M10"><mml:mrow><mml:mrow><mml:mpadded width="+3.3pt"><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi>p</mml:mi><mml:mi>w</mml:mi></mml:mrow></mml:msub></mml:mpadded><mml:mo rspace="5.8pt">=</mml:mo><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>p</mml:mi><mml:mi>w</mml:mi></mml:mrow></mml:msub><mml:mi>F</mml:mi></mml:mrow></mml:mrow><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
<p>with <italic>S</italic><sub><italic>pw</italic></sub> denoting the originally estimated semantic matrix for pseudowords. In this semantic matrix, pseudowords of identical segmental makeup show identical semantics as semantics are calculated only based on triphone occurrence, i.e., the semantics of <italic>pleeps</italic><sub><italic>singular</italic></sub> is identical to the semantics of <italic>pleeps</italic><sub><italic>plural</italic></sub>. To differentiate between singular and plural pseudowords, the semantic vector of the <sc>PLURAL</sc> lexome is added to all plural pseudowords in <italic>S</italic><sub><italic>pw</italic></sub>. Similarly, the semantic vectors of <sc>ALIEN</sc> and <sc>CREATURE</sc> are added to all pseudoword semantic vectors as participants in the original production experiment were told that pseudowords describe alien creatures. As explained in Section &#x201C;Model B: LDL Measures and Affix Specification,&#x201D; the pairing of the pictures with pseudowords representing the alien creature was randomised during the experiment by <xref ref-type="bibr" rid="B57">Schmitz et al. (2020)</xref>. A pertinent pseudoword thus only contains the semantics of &#x201C;alien creature&#x201D; as a constant part of its own semantics, while other factors such as appearance, e.g., colour, shape, or number of eyes, differ across participants. We can assume that in the course of the experiment, participants gradually came to realise that the looks of these alien creatures, i.e., colour, shape, etc., are not relevant to their label names. Thus, participants were just aware of the fact that these are all alien creatures, without paying much attention to their individual features. Please see the aforementioned complementary material for a detailed implementation.</p>
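<p>The mapping of pseudoword cues onto semantics and the subsequent addition of lexome vectors can be sketched as follows. This is a toy illustration in plain Python rather than the implementation used for the study: the matrices, their values, and the helper names <italic>matmul</italic> and <italic>add_vectors</italic> are assumptions made for the example, and <italic>F</italic> is simply given here instead of being estimated from real words.</p>
<preformat>
```python
def matmul(left, right):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*right)]
            for row in left]


def add_vectors(*vectors):
    """Element-wise sum of any number of equally long vectors."""
    return [sum(values) for values in zip(*vectors)]


# Hypothetical toy data: two pseudowords with the same triphone cues
# (a singular and a plural form), three cues, two semantic dimensions.
C_pw = [[1, 0, 1],
        [1, 0, 1]]
F = [[0.5, 0.0],
     [0.1, 0.2],
     [0.0, 0.4]]
PLURAL = [0.0, 1.0]
ALIEN = [0.3, 0.0]
CREATURE = [0.2, 0.1]

# Step 1: estimate semantics from form alone; both rows come out identical.
S_pw = matmul(C_pw, F)

# Step 2: add ALIEN and CREATURE to every pseudoword, PLURAL to plurals only.
is_plural = [False, True]
S_pw_final = []
for row, plural in zip(S_pw, is_plural):
    extras = [ALIEN, CREATURE] + ([PLURAL] if plural else [])
    S_pw_final.append(add_vectors(row, *extras))
```
</preformat>
<p>In this toy case the two rows of <italic>S_pw</italic> are identical after the first step and differ only by the <sc>PLURAL</sc> vector after the second.</p>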
</sec>
<sec id="S4.SS5">
<title>Comprehension and Production</title>
<p>Pseudoword comprehension and production are not computed and evaluated in isolation but in combination with real words, simulating a real person&#x2019;s lexicon in a pseudoword comprehension and production situation, respectively. For this, we created a cue matrix <italic>C</italic><sub><italic>comb</italic></sub> based on a combined set of words, containing all aforementioned real words and pseudowords. In total, 8440 word forms are part of this set of words. A combined semantic matrix <italic>S</italic><sub><italic>comb</italic></sub> is created by attaching <italic>S</italic><sub><italic>pw</italic></sub> to <italic>S</italic><sub><italic>rw</italic></sub>, and reordering its rows to reflect the same order of words as found in <italic>C</italic><sub><italic>comb</italic></sub>.</p>
<p>Then, using the functions of the WpmWithLdl package (<xref ref-type="bibr" rid="B4">Baayen et al., 2019a</xref>) in R, a comprehension model is trained and checked for accuracy. That is, taking form vectors as input for the prediction of semantic vectors as output, <inline-formula><mml:math id="INEQ29"><mml:mrow><mml:mpadded width="+3.3pt"><mml:msub><mml:mover accent="true"><mml:mi>S</mml:mi><mml:mo stretchy="false">^</mml:mo></mml:mover><mml:mrow><mml:mi>c</mml:mi><mml:mi>o</mml:mi><mml:mi>m</mml:mi><mml:mi>b</mml:mi></mml:mrow></mml:msub></mml:mpadded><mml:mo rspace="5.8pt">=</mml:mo><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>c</mml:mi><mml:mi>o</mml:mi><mml:mi>m</mml:mi><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mi>F</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> is solved. Comprehension is successfully modelled for a word <italic>i</italic> if its predicted semantic vector <inline-formula><mml:math id="INEQ30"><mml:msub><mml:mover accent="true"><mml:mi>s</mml:mi><mml:mo stretchy="false">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub></mml:math></inline-formula> is most highly correlated with its targeted semantic vector <italic>s<sub>i</sub></italic>. This is true for 74.41% of cases (i.e., 6,165 word forms) in our comprehension model. In total, 25.59% of cases (i.e., 2,120 word forms) are incorrectly predicted, with 1,912 simple and 208 complex word forms. None of the incorrectly predicted word forms is a pseudoword.</p>
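<p>The accuracy criterion for comprehension can be illustrated with a short Python sketch. It assumes that &#x201C;correlated&#x201D; refers to the Pearson correlation and uses invented three-dimensional vectors; the function names are hypothetical and do not correspond to functions of the WpmWithLdl package.</p>
<preformat>
```python
import math


def pearson(x, y):
    """Pearson correlation of two equally long, non-constant vectors."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def comprehension_accuracy(predicted, targets):
    """Share of words whose predicted vector correlates best with its own target."""
    correct = 0
    for i, s_hat in enumerate(predicted):
        correlations = [pearson(s_hat, s) for s in targets]
        best = max(range(len(targets)), key=correlations.__getitem__)
        if best == i:
            correct += 1
    return correct / len(predicted)


# Invented toy data: row i of `predicted` belongs to row i of `targets`.
targets = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
predicted = [[0.9, 0.1, 0.0], [0.2, 0.7, 0.1], [0.6, 0.1, 0.3]]
accuracy = comprehension_accuracy(predicted, targets)
```
</preformat>
<p>In the toy data, the third predicted vector correlates most highly with the first target, so only two of the three words count as correctly comprehended.</p>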
<p>Similarly, a production model is trained and checked for accuracy using functions of the aforementioned R package. Thus, semantic vectors are provided as input to predict form vectors as output, i.e., to solve <inline-formula><mml:math id="INEQ31"><mml:mrow><mml:mpadded width="+3.3pt"><mml:msub><mml:mover accent="true"><mml:mi>T</mml:mi><mml:mo stretchy="false">^</mml:mo></mml:mover><mml:mrow><mml:mi>c</mml:mi><mml:mi>o</mml:mi><mml:mi>m</mml:mi><mml:mi>b</mml:mi></mml:mrow></mml:msub></mml:mpadded><mml:mo rspace="5.8pt">=</mml:mo><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi>c</mml:mi><mml:mi>o</mml:mi><mml:mi>m</mml:mi><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mi>G</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>. Production is successfully modelled for a word <italic>i</italic> if its predicted triphones are those triphones present in its targeted cue vector in the correct sequence (possible sequences of triphones will be referred to below as &#x201C;paths&#x201D;). This is true for 97.3% of cases (i.e., 8,061 word forms) in our production model. In total, 2.7% of cases (i.e., 224 word forms) are incorrectly predicted, with 98 simple and 126 complex word forms. None of the incorrectly predicted word forms is a pseudoword.</p>
</sec>
<sec id="S4.SS6">
<title>Measures</title>
<p>In order to explore the potential of different measures emerging from the network to predict phonetic duration, we extracted a whole range of measures, based on the measures introduced by the WpmWithLdl package (<xref ref-type="bibr" rid="B4">Baayen et al., 2019a</xref>) and by <xref ref-type="bibr" rid="B20">Chuang et al. (2020)</xref>. Please see the <xref ref-type="supplementary-material" rid="TS1">Supplementary Material</xref> for exploratory analyses of individual measures.</p>
<p>In the following, we first describe the semantic measures before we turn to the phonetic measures.</p>
<p><sc>L</sc>1<sc>NORM</sc> and <sc>L</sc>2<sc>NORM</sc>: The <sc>L</sc>1<sc>NORM</sc> is the sum of the absolute values of vector elements of a given word&#x2019;s predicted semantic vector <inline-formula><mml:math id="INEQ32"><mml:mover accent="true"><mml:mi>s</mml:mi><mml:mo stretchy="false">^</mml:mo></mml:mover></mml:math></inline-formula>, i.e., its city-block distance. The <sc>L</sc>2<sc>NORM</sc> is the square root of the sum of the squared values of a given word&#x2019;s predicted vector <inline-formula><mml:math id="INEQ33"><mml:mover accent="true"><mml:mi>s</mml:mi><mml:mo stretchy="false">^</mml:mo></mml:mover></mml:math></inline-formula>, i.e., its Euclidean distance. For both variables, higher values imply more strong links to many other lexomes. Thus, both measures may be interpreted as semantic activation diversity.</p>
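<p>Both norms follow directly from their definitions. The snippet below is a small illustration with hypothetical function names and an invented vector.</p>
<preformat>
```python
import math


def l1_norm(vector):
    """Sum of the absolute values of the vector elements."""
    return sum(abs(x) for x in vector)


def l2_norm(vector):
    """Square root of the sum of the squared vector elements."""
    return math.sqrt(sum(x * x for x in vector))


s_hat = [3.0, -4.0]       # invented predicted semantic vector
l1 = l1_norm(s_hat)       # 3 + 4
l2 = l2_norm(s_hat)       # sqrt(9 + 16)
```
</preformat>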
<p><sc>DENSITY</sc>: For <sc>DENSITY</sc>, the correlation values of a word&#x2019;s predicted semantic vector <inline-formula><mml:math id="INEQ34"><mml:mover accent="true"><mml:mi>s</mml:mi><mml:mo stretchy="false">^</mml:mo></mml:mover></mml:math></inline-formula> and its eight nearest neighbours&#x2019; semantic vectors <italic>s</italic><sub><italic>n</italic>1</sub>&#x22EF;<italic>s</italic><sub><italic>n</italic>8</sub> are taken into consideration. The mean of these eight correlation values describes <sc>DENSITY</sc>, with higher values indicating a denser semantic neighbourhood.</p>
<p>ALC: The Average Lexical Correlation, ALC, is the mean value of all correlation values of a pseudoword&#x2019;s estimated semantic vector as contained in <italic>S</italic><sub><italic>pw</italic></sub> with each of the real word semantic vectors as contained in <italic>S</italic><sub><italic>rw</italic></sub>. Higher ALC values indicate that a pseudoword&#x2019;s semantics are part of a denser semantic neighbourhood. Thus, ALC may be interpreted as a measure of semantic activation diversity for pseudowords.</p>
<p>EDNN: This variable describes the Euclidean Distance between a pseudoword&#x2019;s estimated semantic vector <italic>s</italic> and its Nearest semantic real word or pseudoword Neighbour. Thus, higher values indicate a larger distance to the nearest semantic neighbour. EDNN may be regarded as a measure of semantic neighbourhood density.</p>
<p>NNC: The Nearest Neighbour Correlation is computed by taking a pseudoword&#x2019;s estimated semantic vector as given in <italic>S</italic><sub><italic>pw</italic></sub> and checking it for the highest correlation value against all real word semantic vectors as given in <italic>S</italic><sub><italic>rw</italic></sub>. This highest correlation value is taken as NNC value. Thus, higher values indicate that a pseudoword is semantically close to a real word. Additionally, one can tell which real word a pseudoword&#x2019;s semantics are closest to. This measure may be interpreted as a measure of similarity between nonce and real words, indicating the co-activation of a real word when confronted with a pseudoword.</p>
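<p>The four neighbourhood measures <sc>DENSITY</sc>, ALC, EDNN, and NNC can be sketched in a few lines of Python. The sketch assumes Pearson correlations and assumes that the nearest neighbours for <sc>DENSITY</sc> are the vectors with the highest correlations; all function names and vectors are invented for illustration.</p>
<preformat>
```python
import math


def pearson(x, y):
    """Pearson correlation of two equally long, non-constant vectors."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def euclidean(x, y):
    """Euclidean distance between two equally long vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def density(s_hat, neighbour_vectors, k=8):
    """Mean correlation of s_hat with its k most highly correlated neighbours."""
    correlations = sorted((pearson(s_hat, s) for s in neighbour_vectors),
                          reverse=True)
    top = correlations[:k]
    return sum(top) / len(top)


def alc(s_pw, real_word_vectors):
    """Average Lexical Correlation: mean correlation with all real words."""
    correlations = [pearson(s_pw, s) for s in real_word_vectors]
    return sum(correlations) / len(correlations)


def nnc(s_pw, real_word_vectors):
    """Nearest Neighbour Correlation: highest correlation with any real word."""
    return max(pearson(s_pw, s) for s in real_word_vectors)


def ednn(s_pw, other_vectors):
    """Euclidean Distance to the Nearest Neighbour."""
    return min(euclidean(s_pw, s) for s in other_vectors)


# Invented toy vectors standing in for rows of S_pw and S_rw.
s = [1, 2, 3]
real_words = [[2, 4, 6], [3, 2, 1], [1, 2, 4]]
```
</preformat>
<p>For these toy vectors, <italic>nnc(s, real_words)</italic> and <italic>ednn(s, real_words)</italic> pick out different neighbours, which shows that correlation-based and distance-based proximity need not coincide.</p>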
<p><sc>SUPPORT</sc>: This measure describes the amount of support the word-final triphone (i.e., fs#, ks#, ps#, ts#) obtains for each pseudoword. The value of <sc>SUPPORT</sc> is extracted from <inline-formula><mml:math id="INEQ36"><mml:mover accent="true"><mml:mi>T</mml:mi><mml:mo stretchy="false">^</mml:mo></mml:mover></mml:math></inline-formula>. Higher values of this variable indicate a higher semantic support for the word-final triphone which includes the segment of interest, i.e., word-final S.</p>
<p><sc>PATH_COUNTS</sc>: <sc>PATH_COUNTS</sc> describes the number of paths, i.e., possible sequences of triphones, detected for the production of a word by the production model. <sc>PATH_COUNTS</sc> may be interpreted as a measure of phonological activation diversity, as higher values indicate the existence of multiple candidates (and thus paths) in production.</p>
<p><sc>PATH_SUM</sc>: <sc>PATH_SUM</sc> describes the summed support of paths for a predicted form. <sc>PATH_SUM</sc> may be interpreted as a measure of phonological certainty, with higher values indicating a higher certainty in the candidate form.</p>
<p><sc>PATH_ENTROPIES</sc>: <sc>PATH_ENTROPIES</sc> contains the Shannon entropy values which are calculated over the path supports of the predicted form in <inline-formula><mml:math id="INEQ37"><mml:mover accent="true"><mml:mi>T</mml:mi><mml:mo stretchy="false">^</mml:mo></mml:mover></mml:math></inline-formula>. Thus, <sc>PATH_ENTROPIES</sc> may be interpreted as a measure of phonological uncertainty, with higher values indicating a higher level of disorder, i.e., uncertainty.</p>
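<p>Given the supports of all candidate paths for a word, the three path-based measures reduce to a count, a sum, and an entropy. The following sketch assumes that the supports are normalised to probabilities before the Shannon entropy is computed and that the logarithm is taken to base 2; neither detail is specified above, and the function names and support values are hypothetical.</p>
<preformat>
```python
import math


def path_counts(supports):
    """Number of candidate paths."""
    return len(supports)


def path_sum(supports):
    """Summed support of all candidate paths."""
    return sum(supports)


def path_entropy(supports):
    """Shannon entropy (in bits) over normalised, positive path supports."""
    total = sum(supports)
    probabilities = [s / total for s in supports if s > 0]
    return -sum(p * math.log2(p) for p in probabilities)


one_path = [0.8]            # a single candidate: no uncertainty
two_paths = [0.4, 0.4]      # two equally supported candidates
```
</preformat>
<p>A single candidate yields an entropy of zero, whereas two equally supported candidates yield one bit, in line with the interpretation of higher values as higher uncertainty.</p>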
<p>ALDC: The Average Levenshtein Distance of all Candidate productions, ALDC, is the mean of all Levenshtein distances of a word and its candidate forms. That is, for a word with only one candidate form, the Levenshtein distance between that word and its candidate form is its ALDC. For words with multiple candidates, the mean of the individual Levenshtein distances between candidates and targeted form constitutes the ALDC. Thus, higher values indicate that a word&#x2019;s candidate forms are very different from the intended pronunciation. ALDC may be interpreted as a measure of phonological neighbourhood density as it takes into account real word neighbourhoods for pseudowords, i.e., large values indicate sparse real word neighbourhoods.</p>
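<p>The computation of ALDC can be sketched as follows, using the standard dynamic-programming formulation of the Levenshtein distance. The candidate forms are invented and written orthographically for readability; in the model they would be strings of phones.</p>
<preformat>
```python
def levenshtein(a, b):
    """Minimum number of insertions, deletions, and substitutions from a to b."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def aldc(target, candidates):
    """Mean Levenshtein distance between the target and its candidate forms."""
    distances = [levenshtein(target, candidate) for candidate in candidates]
    return sum(distances) / len(distances)


# Hypothetical candidates for the pseudoword "pleeps".
single = aldc("pleeps", ["pleeps"])
multiple = aldc("pleeps", ["pleeps", "sleeps", "leeps"])
```
</preformat>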
</sec>
</sec>
<sec id="S5">
<title>Analysis</title>
<p>The data set by <xref ref-type="bibr" rid="B57">Schmitz et al. (2020)</xref> contains non-morphemic, plural, or clitic word-final S as final segment of a pseudoword. As our LDL implementation does not include information on clitics, we only consider durational data on non-morphemic and plural S for the present study. A subset of 666 data points remains, with 303 observations with non-morphemic S and 363 observations with plural S. Due to some variable pronunciations requiring triphones not included in our LDL implementation, 13 data points had to be excluded, resulting in a final data set with non-morphemic and plural S durations of 653 data points, i.e., 300 entries on non-morphemic S and 353 entries on plural S.</p>
<sec id="S5.SS1">
<title>Covariates</title>
<p>Besides the aforementioned variables extracted and computed from the LDL implementation itself (see Section &#x201C;Measures&#x201D;), the following covariates, adopted from previous analyses of word-final S (e.g., <xref ref-type="bibr" rid="B47">Plag et al., 2017</xref>; <xref ref-type="bibr" rid="B64">Tomaschek et al., 2019</xref>; <xref ref-type="bibr" rid="B57">Schmitz et al., 2020</xref>), are included in the analysis. The main reason for this is to allow us to compare the performance of these predictors with the performance of LDL predictors. LDL measures often correlate with traditional measures (such as lexical frequencies, transitional probabilities, or neighbourhood densities), but the traditional measures have no clear correlating mechanisms in learning or processing.</p>
<p>There are, however, also covariates that do not tap into lexical properties, but that control for other influences, such as speech rate, the speaker, gender, the order of stimuli in an experiment, etc. These will be referred to as &#x201C;non-lexical covariates&#x201D; and they will also be included in our regression models.</p>
<p><sc>AFFIX</sc>: This binary variable indicates whether a word contains an affix, i.e., whether the pertinent pseudoword is a singular or plural form. It takes the value NM for pseudowords without affix, and PL for pseudowords with affix.</p>
<p><sc>SPEAKING</sc>R<sc>ATE</sc>: In an analysis of durational data, speech rate is an obvious variable to consider. As speech rate is not an inherent part of any LDL measure, we calculated speaking rate as the number of syllables in an utterance divided by the duration of the utterance (e.g., <xref ref-type="bibr" rid="B64">Tomaschek et al., 2019</xref>; <xref ref-type="bibr" rid="B57">Schmitz et al., 2020</xref>). This was done automatically using a script in Praat (<xref ref-type="bibr" rid="B22">de Jong and Wempe, 2008</xref>; <xref ref-type="bibr" rid="B14">Boersma and Weenink, 2019</xref>).</p>
<p><sc>BASE</sc>D<sc>UR</sc>L<sc>OG</sc>: Base duration was taken as a more local measure of speech rate (e.g., <xref ref-type="bibr" rid="B47">Plag et al., 2017</xref>, <xref ref-type="bibr" rid="B48">2020</xref>; <xref ref-type="bibr" rid="B57">Schmitz et al., 2020</xref>). Here, the term &#x201C;base&#x201D; refers to the string of segments preceding the word-final S, for both non-morphemic and morphemic pseudowords. Base duration was then log-transformed to achieve a closer to normal distribution.</p>
<p><sc>PAUSE</sc>B<sc>IN</sc>: To account for final-lengthening effects, stretches of silence between the offset of the word-final S and the onset of the following word were measured. Silence of 50 ms and above was considered a pause (<xref ref-type="bibr" rid="B38">Lee and Oh, 1999</xref>; <xref ref-type="bibr" rid="B35">Krivokapi&#x0107;, 2007</xref>). In order to make sure that closures of following plosives were not mistaken for pauses, their average closure duration (see <xref ref-type="bibr" rid="B76">Yao, 2007</xref>) was subtracted from the pertinent measured silence. Following the results by <xref ref-type="bibr" rid="B57">Schmitz et al. (2020)</xref>, pause information was included as a binary variable with the values PAUSE / NO PAUSE.</p>
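<p>The three rate-related covariates can be derived from raw measurements as sketched below. The function names are hypothetical, the natural logarithm is an assumption (the text only states that base duration was log-transformed), and the closure duration used in the example is an invented placeholder rather than a value taken from the literature.</p>
<preformat>
```python
import math

PAUSE_THRESHOLD_MS = 50.0  # silence of 50 ms and above counts as a pause


def speaking_rate(n_syllables, utterance_duration_s):
    """Syllables per second for one utterance."""
    return n_syllables / utterance_duration_s


def base_dur_log(base_duration_s):
    """Log-transformed base duration (natural logarithm assumed)."""
    return math.log(base_duration_s)


def pause_bin(silence_ms, following_is_plosive=False, mean_closure_ms=0.0):
    """Binary pause coding; plosive closure is subtracted before thresholding."""
    corrected = silence_ms - mean_closure_ms if following_is_plosive else silence_ms
    return "PAUSE" if corrected >= PAUSE_THRESHOLD_MS else "NO PAUSE"


rate = speaking_rate(10, 2.5)
# 80 ms of silence before a plosive with an invented 40 ms mean closure.
before_plosive = pause_bin(80.0, following_is_plosive=True, mean_closure_ms=40.0)
before_vowel = pause_bin(80.0)
```
</preformat>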
<p>DISC: As some pseudowords were produced with multiple pronunciations, their transcription was incorporated as a categorical variable. This variable is called DISC after the DISC keyboard phonetic alphabet (<xref ref-type="bibr" rid="B16">Burnage, 1988</xref>).</p>
<p><sc>BIPHONE</sc>P<sc>ROB</sc>S<sc>UM</sc>B<sc>IN</sc>: The summed biphone probability for each pseudoword and its phonological variants is included as the binary variable <sc>BIPHONE</sc>P<sc>ROB</sc>S<sc>UM</sc>B<sc>IN</sc>. It was calculated using the Phonotactic Probability Calculator (<xref ref-type="bibr" rid="B71">Vitevitch and Luce, 2004</xref>). The rationale for this variable is that more probable biphones should lead to shorter durations (e.g., <xref ref-type="bibr" rid="B57">Schmitz et al., 2020</xref>).</p>
<p><sc>LIST</sc> &#x0026; <sc>SLIDE</sc>N<sc>UMBER</sc>: To account for priming effects, the list number (1&#x2013;12) and the point of occurrence during the original experiment by <xref ref-type="bibr" rid="B57">Schmitz et al. (2020)</xref> are included.</p>
<p><sc>PRE</sc>C: To account for potential effects of the consonant preceding the word-final S (<xref ref-type="bibr" rid="B69">Umeda, 1977</xref>), it is included as <sc>PRE</sc>C variable (similar to e.g., <xref ref-type="bibr" rid="B64">Tomaschek et al., 2019</xref>).</p>
<p><sc>BIPHONE</sc>P<sc>ROB</sc>: The probability of the final biphones /fs/, /ks/, /ps/ and /ts/ in monomorphemic words is included as covariate to account for potential effects of phonotactics (see <xref ref-type="bibr" rid="B57">Schmitz et al., 2020</xref>, for a detailed explanation).</p>
<p><sc>FOL</sc>T<sc>YPE</sc>: As the segment following the word-final S is not part of the individual pseudoword, it is also not considered in LDL measures. Thus, the covariate <sc>FOL</sc>T<sc>YPE</sc> is introduced (similar to e.g., <xref ref-type="bibr" rid="B64">Tomaschek et al., 2019</xref>), coding the following segment by its segmental class (i.e., approximant APP for <italic>listen</italic>, fricative F for <italic>find</italic>, nasal N for <italic>know</italic>, plosive P for <italic>cook</italic>, and vowel V for <italic>eat</italic>), to account for potential effects of the following word (<xref ref-type="bibr" rid="B34">Klatt, 1976</xref>; <xref ref-type="bibr" rid="B69">Umeda, 1977</xref>).</p>
<p><sc>SPEAKER, GENDER, AGE, LOCATION</sc> and <sc>MONO</sc>M<sc>ULTILINGUAL</sc>: S<sc>PEAKER</sc> ID was included to account for general inter-speaker differences in production. G<sc>ENDER</sc>, <sc>AGE</sc>, and <sc>LOCATION</sc>, i.e., the place in which the pertinent participant spent the bigger part of their life, were included as well. Additionally, participants who were early bilinguals were categorised as multilingual, while all other participants were categorised as monolingual in <sc>MONO</sc>M<sc>ULTILINGUAL</sc>.</p>
<p><sc>REAL</sc>: Some of the pseudowords in Schmitz et al.&#x2019;s data set have an orthographically different, but phonologically identical real word counterpart. We introduced the variable <sc>REAL</sc> to control for this potential confound. This variable is TRUE for pseudowords with such a real word counterpart, and FALSE for those without. We considered the following real words as counterparts as given in <xref ref-type="bibr" rid="B57">Schmitz et al. (2020)</xref>: <italic>glits</italic> corresponds to <italic>glitz</italic>, <italic>glaiks</italic> corresponds to <italic>Gleicks</italic>, <italic>glifs</italic> corresponds to <italic>glyphs</italic>, and <italic>pleets</italic> corresponds to <italic>pleats</italic>.</p>
<p>All of the following analyses make use of the following non-lexical covariates: <sc>BASE</sc>D<sc>UR</sc>L<sc>OG</sc>, <sc>SPEAKING</sc>R<sc>ATE</sc>, <sc>SLIDE</sc>N<sc>UMBER</sc>, and <sc>PAUSE</sc>B<sc>IN</sc> as variables concerning speech rate and continuity, <sc>PRE</sc>C and <sc>FOL</sc>T<sc>YPE</sc> accounting for coarticulatory effects, <sc>LIST</sc> taking into consideration potential priming effects, <sc>MONO</sc>M<sc>ULTILINGUAL</sc>, <sc>GENDER</sc>, <sc>LOCATION</sc>, <sc>AGE</sc>, and <sc>SPEAKER</sc> to account for speaker-individual differences, and <sc>REAL</sc> to include potential effects of real word counterparts.</p>
</sec>
<sec id="S5.SS2">
<title>Modelling Strategy</title>
<p>We devised three kinds of model: First, a baseline model with the traditional predictor variables (plus the non-lexical covariates). Second, a model with LDL predictors that also includes <sc>AFFIX</sc> as a covariate (plus the non-lexical covariates). Third, a model that contains only the LDL predictors (plus the non-lexical covariates).</p>
<p>The three kinds of model will allow us to answer our research questions. Recall that our ultimate goal is to understand how systematic durational differences emerge between words of different, but homophonous morphological categories. Traditional lexical variables are predictive but cannot explain how morphology can make its way into durational differences. Models with such variables can nevertheless show that these differences exist, namely through the effect of the variable <sc>AFFIX</sc>. Such a model serves as our baseline. As an alternative we implement a model that uses LDL measures. If these measures are predictive, they offer an explanation of the morphologically-induced phonetic differences: they emerge as a by-product of the association of form and meaning in the mental lexicon, and this association is the outcome of discriminative learning. By having a model that also includes <sc>AFFIX</sc> as an additional predictor, we can see whether the LDL measures completely capture the morphological effect, or whether there is a residue of morphological information that is predictive of duration but is still not captured by the LDL measures.</p>
</sec>
<sec id="S5.SS3">
<title>Model A: Traditional Measures</title>
<p>This model is meant to resemble those in previous studies on word-final S duration (e.g., <xref ref-type="bibr" rid="B47">Plag et al., 2017</xref>; <xref ref-type="bibr" rid="B57">Schmitz et al., 2020</xref>). Thus, we make use of similar variables: A<sc>FFIX</sc>, <sc>BIPHONE</sc>P<sc>ROB</sc>S<sc>UM</sc>B<sc>IN</sc>, and <sc>BIPHONE</sc>P<sc>ROB</sc>, as well as those control variables included in all analyses of this paper. None of these covariates showed high correlation coefficients. Hence, no cautionary measures regarding collinearity were taken before an initial full model was constructed. The model selection process proceeded as explained in section &#x201C;Model B: LDL Measures and Affix Specification.&#x201D; That is, non-significant variables were excluded in a controlled step-wise fashion.</p>
<p>Then, variance inflation factors were checked. The covariates <sc>BIPHONE</sc>P<sc>ROB</sc> and <sc>PRE</sc>C showed high VIF values (i.e., 46.53 and 46.88, respectively), indicating problematic collinearity in the model (e.g., <xref ref-type="bibr" rid="B78">Zuur et al., 2010</xref>; <xref ref-type="bibr" rid="B24">Fox and Weisberg, 2019</xref>). Consequently, <sc>PRE</sc>C was removed from the model as it showed the highest VIF value, following the procedure described by <xref ref-type="bibr" rid="B78">Zuur et al. (2010)</xref>. Re-fitting the model without <sc>PRE</sc>C and re-checking the new variance inflation factor values revealed only non-problematic values.</p>
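<p>To illustrate why two strongly related predictors produce large variance inflation factors, the sketch below computes the VIF for the simplest case of two numeric predictors, where it equals 1 / (1 &#x2212; <italic>r</italic><sup>2</sup>) with <italic>r</italic> being their correlation. With more predictors, <italic>r</italic><sup>2</sup> is replaced by the <italic>R</italic><sup>2</sup> obtained from regressing one predictor on all others. The data and function names are invented, and the sketch does not reproduce the values reported above.</p>
<preformat>
```python
import math


def pearson(x, y):
    """Pearson correlation of two equally long, non-constant vectors."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def vif_two_predictors(x1, x2):
    """VIF shared by two predictors: 1 / (1 - r^2)."""
    r = pearson(x1, x2)
    return 1.0 / (1.0 - r ** 2)


# Invented predictor values that are almost perfectly linearly related.
predictor_a = [1, 2, 3, 4]
predictor_b = [1, 2, 3, 5]
vif = vif_two_predictors(predictor_a, predictor_b)
```
</preformat>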
<p>Finally, the resulting model&#x2019;s residuals were trimmed (e.g., <xref ref-type="bibr" rid="B2">Baayen and Milin, 2010</xref>). Data points with residuals larger than 2.5 standard deviations were removed, ensuring a satisfactory distribution of residuals. This procedure led to a loss of 4 data points, i.e., 0.61% of all data points. An overview of all variables used in the initial model is given in <xref ref-type="supplementary-material" rid="TS1">Supplementary Table 2</xref>.</p>
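<p>The trimming step can be sketched as follows. The sketch assumes that the cut-off is applied to the distance of each residual from the residual mean, measured in sample standard deviations; the residual values and the function name are invented.</p>
<preformat>
```python
import statistics


def trim_by_residuals(observations, residuals, threshold_sd=2.5):
    """Keep observations whose residual lies within threshold_sd standard deviations."""
    mean = statistics.mean(residuals)
    sd = statistics.stdev(residuals)
    limit = threshold_sd * sd
    return [obs for obs, res in zip(observations, residuals)
            if limit >= abs(res - mean)]


# Invented residuals: nine unremarkable values and one extreme value.
residuals = [0.1, -0.2, 0.0, 0.15, -0.1, 0.05, -0.05, 0.2, -0.15, 5.0]
observations = list(range(len(residuals)))  # stand-ins for data points
kept = trim_by_residuals(observations, residuals)
```
</preformat>
<p>After trimming, the model would be re-fitted on the retained data points.</p>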
</sec>
<sec id="S5.SS4">
<title>Model B: LDL Measures and Affix Specification</title>
<p>This model makes use of all LDL measures as well as of the A<sc>FFIX</sc> variable. Additionally, the non-lexical covariates are included. One issue to address when considering a model with such a multitude of variables is collinearity (e.g., <xref ref-type="bibr" rid="B1">Baayen, 2008</xref>; <xref ref-type="bibr" rid="B63">Tomaschek et al., 2018</xref>). To avoid collinearity-related problems later on, all variables were tested for correlation using the languageR package (<xref ref-type="bibr" rid="B3">Baayen and Shafaei-Bajestan, 2019</xref>). This correlation check resulted in eight correlation coefficients indicating a high degree of correlation, for which we assume the threshold to be |<italic>rho</italic>| &#x2265; 0.5. The pairs of correlated covariates as well as their correlation coefficients are given in <xref ref-type="table" rid="T2">Table 2</xref>.</p>
<table-wrap position="float" id="T2">
<label>TABLE 2</label>
<caption><p>Correlated variables and their correlation coefficients.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left" colspan="2">variables</td>
<td valign="top" align="center">rho</td>
<td valign="top" align="center" colspan="2">variables</td>
<td valign="top" align="center">rho</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><sc>L</sc>1<sc>NORM</sc></td>
<td valign="top" align="left"><sc>L</sc>2<sc>NORM</sc></td>
<td valign="top" align="center">0.98</td>
<td valign="top" align="left"><sc>AFFIX</sc></td>
<td valign="top" align="left">NNC</td>
<td valign="top" align="center">&#x2013;0.89</td>
</tr>
<tr>
<td valign="top" align="left"><sc>PATH_COUNTS</sc></td>
<td valign="top" align="left"><sc>PATH_ENTROPIES</sc></td>
<td valign="top" align="center">0.95</td>
<td valign="top" align="left"><sc>PATH_COUNTS</sc></td>
<td valign="top" align="left"><sc>SUPPORT</sc></td>
<td valign="top" align="center">&#x2013;0.65</td>
</tr>
<tr>
<td valign="top" align="left"><sc>PATH_COUNTS</sc></td>
<td valign="top" align="left">ALDC</td>
<td valign="top" align="center">0.89</td>
<td valign="top" align="left"><sc>PATH_SUM</sc></td>
<td valign="top" align="left"><sc>SUPPORT</sc></td>
<td valign="top" align="center">0.73</td>
</tr>
<tr>
<td valign="top" align="left"><sc>PATH_ENTROPIES</sc></td>
<td valign="top" align="left">ALDC</td>
<td valign="top" align="center">0.90</td>
<td valign="top" align="left"><sc>PATH_ENTROPIES</sc></td>
<td valign="top" align="left"><sc>SUPPORT</sc></td>
<td valign="top" align="center">&#x2013;0.63</td>
</tr>
</tbody>
</table>
</table-wrap>
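<p>A pairwise screening of this kind can be sketched in plain Python. The sketch assumes that <italic>rho</italic> denotes Spearman&#x2019;s rank correlation, computed here as the Pearson correlation of average ranks; the variable values are invented and the function names are hypothetical, so the output does not reproduce the coefficients in the table.</p>
<preformat>
```python
import math
from itertools import combinations


def pearson(x, y):
    """Pearson correlation of two equally long, non-constant vectors."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def ranks(values):
    """Ranks starting at 1, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=values.__getitem__)
    result = [0.0] * len(values)
    i = 0
    while i != len(order):
        j = i
        while j + 1 != len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(x, y):
    """Spearman's rho as the Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def correlated_pairs(variables, threshold=0.5):
    """All variable pairs whose absolute rho reaches the threshold."""
    pairs = []
    for (name_a, a), (name_b, b) in combinations(variables.items(), 2):
        rho = spearman(a, b)
        if abs(rho) >= threshold:
            pairs.append((name_a, name_b, rho))
    return pairs


# Invented values for three measures across five word forms.
data = {
    "L1NORM": [1, 2, 3, 4, 5],
    "L2NORM": [2, 4, 5, 8, 10],
    "SUPPORT": [5, 1, 4, 2, 3],
}
flagged = correlated_pairs(data)
```
</preformat>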
<p>Due to the high number of correlated variables, we opted for a principal component analysis (PCA; e.g., <xref ref-type="bibr" rid="B70">Venables and Ripley, 2002</xref>; <xref ref-type="bibr" rid="B1">Baayen, 2008</xref>; <xref ref-type="bibr" rid="B63">Tomaschek et al., 2018</xref>) to address collinearity issues. In a PCA, the dimensionality of the data is reduced by transforming the included variables into principal components. These transformations result in linear combinations of the predictors that are orthogonal to each other. Thus, the resulting principal components are not correlated.</p>
<p>The PCA was carried out using the <italic>PCAmix</italic> function of the PCAmixdata package (<xref ref-type="bibr" rid="B18">Chavent et al., 2017</xref>) in R, allowing the simultaneous integration of continuous and discrete variables. All variables given in <xref ref-type="table" rid="T2">Table 2</xref> were included in the computation of the principal component analysis, which yields nine principal components. The next step of the PCA is to determine how many of these principal components are meaningful and thus should be retained for further use. For this decision, we followed several rules of thumb (e.g., <xref ref-type="bibr" rid="B44">O&#x2019;Rourke et al., 2005</xref>; <xref ref-type="bibr" rid="B1">Baayen, 2008</xref>). First, any component that displays an Eigenvalue greater than 1 accounts for a greater amount of variance than had been contributed by one variable. Such a component is therefore potentially meaningful. Second, one should retain enough components so that the cumulative percentage of variance explained reaches some minimal value. Following other implementations of principal component analyses, we aim at a value of 80% (e.g., <xref ref-type="bibr" rid="B44">O&#x2019;Rourke et al., 2005</xref>). Third, only interpretable components are to be retained. That is, each component is made up of loadings, i.e., parts of the variables included in the PCA&#x2019;s computation represented by correlation coefficient values. If none of these variables is strongly represented in a component, the interpretability of that component is extremely low, rendering the component of small interest for further analyses. Following these three criteria, we find that the first three of the principal components show an Eigenvalue greater than 1. Also, the first three components account for 84% of the variance. Considering the third criterion, all three components are strongly correlated with input variables. 
We therefore retain components 1 to 3 for further analysis, all of which show an Eigenvalue greater than 1, account for more than eighty percent of variance, and contain strong representations of variables in their loadings.<sup><xref ref-type="fn" rid="footnote2">2</xref></sup> But what do these principal components mean? The highest loadings of the principal components, i.e., the correlation of the original variables to the pertinent component, are given in <xref ref-type="table" rid="T3">Table 3</xref>.</p>
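<p>The first two retention criteria are mechanical and can be sketched as follows; the third criterion, interpretability, requires inspecting the loadings and is not automated here. The eigenvalues are invented for a nine-variable PCA and are not the eigenvalues of the analysis reported above; the function name is hypothetical.</p>
<preformat>
```python
def retained_components(eigenvalues, min_eigenvalue=1.0, min_cumulative=0.80):
    """Apply the eigenvalue and cumulative-variance rules of thumb.

    Returns the 1-based indices of components with an eigenvalue above
    min_eigenvalue, the share of variance they explain together, and
    whether that share reaches min_cumulative.
    """
    total = sum(eigenvalues)
    kept = [i + 1 for i, value in enumerate(eigenvalues) if value > min_eigenvalue]
    explained = sum(eigenvalues[i - 1] for i in kept) / total
    return kept, explained, explained >= min_cumulative


# Invented eigenvalues summing to 9, i.e., one unit of variance per variable.
eigenvalues = [4.0, 2.3, 1.2, 0.6, 0.4, 0.25, 0.15, 0.06, 0.04]
kept, explained, enough = retained_components(eigenvalues)
```
</preformat>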
<table-wrap position="float" id="T3">
<label>TABLE 3</label>
<caption><p>Loadings of original predictor variables in the three retained principal components of the first principal component analysis.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td/>
<td valign="top" align="center">Component1</td>
<td valign="top" align="center">Component2</td>
<td valign="top" align="center">Component3</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><sc>L</sc>1<sc>NORM</sc></td>
<td/>
<td valign="top" align="center">0.397</td>
<td valign="top" align="center">0.348</td>
</tr>
<tr>
<td valign="top" align="left"><sc>L</sc>2<sc>NORM</sc></td>
<td/>
<td valign="top" align="center">0.405</td>
<td valign="top" align="center">0.363</td>
</tr>
<tr>
<td valign="top" align="left"><sc>PATH_COUNTS</sc></td>
<td valign="top" align="center">0.813</td>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left"><sc>PATH_ENTROPIES</sc></td>
<td valign="top" align="center">0.828</td>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left"><sc>PATH_SUM</sc></td>
<td valign="top" align="center">&#x2212;0.430</td>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">ALDC</td>
<td valign="top" align="center">0.710</td>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">NNC</td>
<td/>
<td valign="top" align="center">0.698</td>
<td/>
</tr>
<tr>
<td valign="top" align="left"><sc>SUPPORT</sc></td>
<td valign="top" align="center">&#x2212;0.650</td>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left"><sc>AFFIX</sc></td>
<td/>
<td valign="top" align="center">0.421</td>
<td valign="top" align="center">0.517</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>C<sc>OMPONENT</sc>1 is most strongly positively correlated with <sc>PATH_COUNTS</sc>, <sc>PATH_ENTROPIES</sc>, and ALDC, while it is most strongly negatively correlated with <sc>PATH_SUM</sc> and <sc>SUPPORT</sc>. For <sc>PATH_COUNTS</sc>, higher values indicate the existence of multiple candidates (and thus paths) in production. It thus functions as an indicator of phonological uncertainty. Values of <sc>PATH_ENTROPIES</sc> relate to the level of uncertainty concerning the path supports of the predicted candidate form, with higher values indicating a higher level of uncertainty. For ALDC, higher values mean that a word&#x2019;s candidate forms are very different from the intended pronunciation, indicating uncertainty in production. <sc>PATH_SUM</sc> describes the summed support of paths for a predicted form, with higher values indicating a higher certainty in the candidate form. Higher values for <sc>SUPPORT</sc> suggest more certainty in the choice of the word-final triphone. C<sc>OMPONENT</sc>1 can thus be described as a dimension that represents phonological or articulatory certainty, with higher values indicating greater uncertainty.</p>
<p>C<sc>OMPONENT</sc>2 is most strongly correlated with <sc>L</sc>1<sc>NORM</sc>, <sc>L</sc>2<sc>NORM</sc>, NNC, and A<sc>FFIX</sc>. For both <sc>L</sc>1<sc>NORM</sc> and <sc>L</sc>2<sc>NORM</sc>, higher values imply stronger links to many other lexomes and thus a higher semantic activation diversity. Higher values of NNC suggest a close real word neighbour, which leads to higher levels of co-activation of that real word when a speaker is confronted with the pseudoword, also leading to higher semantic activation diversity. As for A<sc>FFIX</sc>, C<sc>OMPONENT</sc>2 is positively correlated with the presence of non-morphemic S data points.</p>
<p>C<sc>OMPONENT</sc>3 is similar to C<sc>OMPONENT</sc>2 as it is also strongly correlated with <sc>L</sc>1<sc>NORM</sc>, <sc>L</sc>2<sc>NORM</sc>, and A<sc>FFIX</sc>. Again, for <sc>L</sc>1<sc>NORM</sc> and <sc>L</sc>2<sc>NORM</sc> higher values indicate higher semantic activation diversity. As for A<sc>FFIX</sc>, C<sc>OMPONENT</sc>3 is positively correlated with the presence of plural S data points. We will come back to the interpretation of this correlation in Section &#x201C;Model B: LDL Measures and <sc>AFFIX</sc> Specification.&#x201D;</p>
<p>In a next step, linear mixed-effects regression models were fitted to the data on non-morphemic and plural S duration in R (<xref ref-type="bibr" rid="B51">R Core Team, 2020</xref>) via RStudio (<xref ref-type="bibr" rid="B55">RStudio Team, 2021</xref>), using the packages lme4 (<xref ref-type="bibr" rid="B10">Bates et al., 2015</xref>), lmerTest (<xref ref-type="bibr" rid="B36">Kuznetsova et al., 2017</xref>), and LMERConvenienceFunctions (<xref ref-type="bibr" rid="B65">Tremblay and Ransijn, 2020</xref>). The dependent variable, duration of S, was log-transformed following standard procedures to reduce the potentially harmful effect of skewed distributions in linear regression models (e.g., <xref ref-type="bibr" rid="B75">Winter, 2019</xref>). The name of this variable is <sc>S</sc>D<sc>UR</sc>L<sc>OG</sc>.</p>
<p>Following the standard backward step-wise selection process for model selection (e.g., <xref ref-type="bibr" rid="B1">Baayen, 2008</xref>), a first model containing all remaining variables was created. That is, C<sc>OMPONENT</sc>1, C<sc>OMPONENT</sc>2, C<sc>OMPONENT</sc>3, <sc>DENSITY</sc>, ALC, EDNN, <sc>BASE</sc>D<sc>UR</sc>L<sc>OG</sc>, <sc>SPEAKING</sc>R<sc>ATE</sc>, <sc>PAUSE</sc>B<sc>IN</sc>, <sc>FOL</sc>T<sc>YPE</sc>, <sc>PRE</sc>C, and <sc>REAL</sc> were included as fixed effects. The remaining variables, <sc>GENDER, LOCATION</sc>, <sc>MONO</sc>M<sc>ULTILINGUAL, AGE, LIST</sc>, and S<sc>PEAKER</sc>, were included as random intercepts.</p>
<p>This full model was then successively reduced through step-wise exclusion of non-significant variables. A variable was considered significant if it passed all three of the following tests. First, its F-value in the pertinent model had to yield a value below &#x2212;2 or above 2. Second, the AIC value, i.e., the Akaike information criterion value, of the model including the variable had to be lower than the AIC value of a comparable model without the pertinent variable. Third, the log-likelihood test comparing the model with the pertinent variable to a model without it had to yield a p-value below the 0.05 threshold, thus indicating a significant improvement of model fit. This process was verified using the <italic>step</italic> function of R, which resulted in an identical model.</p>
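<p>The AIC and log-likelihood criteria can be sketched in Python as follows. This is a simplified illustration, not the lme4 or lmerTest code used for the analysis: the function names and the log-likelihood values are hypothetical, and the likelihood ratio test is restricted to predictors that add a single parameter (a multi-level factor such as <sc>FOL</sc>T<sc>YPE</sc> would require more degrees of freedom).</p>
<preformat>
```python
import math


def aic(log_likelihood, n_parameters):
    """Akaike information criterion: 2k - 2 log L (lower is better)."""
    return 2 * n_parameters - 2 * log_likelihood


def lrt_p_value_df1(loglik_reduced, loglik_full):
    """p-value of a likelihood ratio test for nested models that differ
    by one parameter.

    The upper tail of the chi-square distribution with 1 degree of
    freedom equals erfc(sqrt(x / 2)).
    """
    statistic = 2 * (loglik_full - loglik_reduced)
    if statistic > 0:
        return math.erfc(math.sqrt(statistic / 2))
    return 1.0


def keep_variable(loglik_reduced, k_reduced, loglik_full, alpha=0.05):
    """Apply the AIC and likelihood ratio criteria to one predictor
    that adds a single parameter to the reduced model."""
    lower_aic = aic(loglik_reduced, k_reduced) > aic(loglik_full, k_reduced + 1)
    significant = alpha > lrt_p_value_df1(loglik_reduced, loglik_full)
    return lower_aic and significant


# Invented log-likelihoods; NOT values from this study.
print(keep_variable(-105.0, 5, -101.0))  # clear improvement in fit
print(keep_variable(-105.0, 5, -104.5))  # negligible improvement in fit
```
</preformat>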
<p>Then, variance inflation factors (VIFs) were computed. Predictors showing variance inflation factor values equal to or greater than 3 are to be excluded due to the high risk of introducing multicollinearity and thus overfitting of the model (e.g., <xref ref-type="bibr" rid="B78">Zuur et al., 2010</xref>). For the present model, all variance inflation factor values are below 3.</p>
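<p>As a rough illustration of what a variance inflation factor measures, the following Python sketch computes VIFs for plain numeric predictors as the diagonal of the inverse of their correlation matrix. This is an assumption-laden simplification: it ignores the mixed-effects structure and categorical predictors of the actual models, and all function names and data are hypothetical.</p>
<preformat>
```python
import math


def pearson(x, y):
    """Pearson correlation of two equally long numeric sequences."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def invert(matrix):
    """Invert a small square matrix by Gauss-Jordan elimination
    with partial pivoting."""
    n = len(matrix)
    aug = [list(row) + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def variance_inflation_factors(columns):
    """VIF of each predictor: diagonal of the inverse correlation matrix."""
    corr = [[pearson(a, b) for b in columns] for a in columns]
    inverse = invert(corr)
    return [inverse[i][i] for i in range(len(columns))]


# Invented predictor values; NOT data from this study.
x1 = [1, 2, 3, 4, 5, 6]
x2 = [2, 1, 4, 3, 6, 5]
print(variance_inflation_factors([x1, x2]))
```
</preformat>
<p>With only two predictors, each VIF reduces to 1 / (1 &#x2212; r<sup>2</sup>), where r is their correlation, so the two invented predictors above would exceed the threshold of 3.</p>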
<p>Finally, the resulting model needed trimming of its residuals (e.g., <xref ref-type="bibr" rid="B2">Baayen and Milin, 2010</xref>). Data points with residuals larger than 2.5 standard deviations were removed to ensure a more satisfactory residual distribution. This procedure resulted in a loss of six data points (0.92%). An overview of all variables used in the initial model and their distribution is given in <xref ref-type="supplementary-material" rid="TS1">Supplementary Table 2</xref>.</p>
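<p>The trimming step can be sketched in Python as follows, assuming that residuals are standardised by their mean and standard deviation before the 2.5 threshold is applied. The function name and the residual values are hypothetical; the sketch does not reproduce the LMERConvenienceFunctions routine and omits refitting the model on the trimmed data.</p>
<preformat>
```python
import statistics


def trim_by_residuals(rows, residuals, threshold=2.5):
    """Keep rows whose residual lies within `threshold` standard
    deviations of the mean residual; report how many were removed."""
    mean = statistics.mean(residuals)
    sd = statistics.stdev(residuals)
    kept = [row for row, res in zip(rows, residuals)
            if threshold * sd >= abs(res - mean)]
    removed = len(rows) - len(kept)
    return kept, removed, 100.0 * removed / len(rows)


# Invented residuals with one obvious outlier; NOT data from this study.
example_residuals = [0.1, -0.1] * 10 + [3.0]
example_rows = list(range(len(example_residuals)))
kept, removed, percent = trim_by_residuals(example_rows, example_residuals)
print(removed, round(percent, 2))
```
</preformat>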
</sec>
<sec id="S5.SS5">
<title>Model C: LDL Measures Only</title>
<p>This model uses all LDL measures but does not incorporate the A<sc>FFIX</sc> covariate. As in the previous model, there was a high number of highly correlated variables (see <xref ref-type="table" rid="T2">Table 2</xref> with the exception of the correlation of A<sc>FFIX</sc> and NNC, as A<sc>FFIX</sc> is not included in this analysis). We therefore again computed a principal component analysis, following the procedure outlined in Section &#x201C;Model B: LDL Measures and Affix Specification.&#x201D; Following the first two criteria, we find that two principal components are to be retained. However, considering the third criterion, we find that the two components are not readily interpretable as they show relatively high positive or negative correlations with all or almost all variables, without indicating a clearly discernible dimension underlying the patterns of correlations. We therefore turned to another procedure to reduce collinearity issues.</p>
<p>For each set of variables with a correlation of |<italic>rho</italic>| &#x003E; 0.5, models containing only the pertinent variable and a random intercept for subject were fitted and compared. Using log-likelihood tests for model comparison, the variable contained in the significantly better-fitting model was retained, while the variables highly correlated with it were no longer used. In case of a non-significant difference, the variable of the model with the lower AIC value was retained. This procedure led to the exclusion of <sc>L</sc>2<sc>NORM</sc>, <sc>PATH_COUNTS</sc>, <sc>PATH_ENTROPIES</sc>, and <sc>PATH_SUM</sc>.</p>
<p>Linear mixed-effects regression models were fitted according to the procedure given in Section &#x201C;Model B: LDL Measures and Affix Specification.&#x201D; That is, an initial full model was fitted with the following variables: <sc>L</sc>1<sc>NORM</sc>, ALDC, <sc>SUPPORT</sc>, <sc>DENSITY</sc>, ALC, EDNN, NNC, <sc>BASE</sc>D<sc>UR</sc>L<sc>OG</sc>, <sc>SPEAKING</sc>R<sc>ATE</sc>, <sc>PAUSE</sc>B<sc>IN</sc>, <sc>FOL</sc>T<sc>YPE</sc>, <sc>PRE</sc>C and <sc>REAL</sc>. As for random effects, random intercepts for <sc>GENDER, LOCATION</sc>, <sc>MONO</sc>M<sc>ULTILINGUAL, AGE, LIST</sc>, and <sc>SPEAKER</sc> were included.</p>
<p>This full model was then continuously reduced through step-wise exclusion of non-significant variables, following the aforementioned criteria. Then, variance inflation factors were computed, resulting only in non-problematic values (e.g., <xref ref-type="bibr" rid="B78">Zuur et al., 2010</xref>). Finally, the resulting model needed trimming of its residuals (e.g., <xref ref-type="bibr" rid="B2">Baayen and Milin, 2010</xref>). That is, data points with residuals larger than 2.5 standard deviations were removed, ensuring a more satisfactory residual distribution. This procedure led to a loss of 8 data points, i.e., 1.2% of all data points. An overview of all variables used in the initial model and their distribution is given in <xref ref-type="supplementary-material" rid="TS1">Supplementary Table 2</xref>.</p>
</sec>
</sec>
<sec id="S6">
<title>Results</title>
<sec id="S6.SS1">
<title>Model A: Traditional Measures</title>
<p>The final model of traditional measures includes main effects of the following variables: type of S (A<sc>FFIX</sc>), speaking rate (<sc>SPEAKING</sc>R<sc>ATE</sc>), log-transformed base duration (<sc>BASE</sc>D<sc>UR</sc>L<sc>OG</sc>), pause (<sc>PAUSE</sc>B<sc>IN</sc>), the summed biphone probability (<sc>BIPHONE</sc>P<sc>ROB</sc>S<sc>UM</sc>B<sc>IN</sc>), and following segmental type (<sc>FOL</sc>T<sc>YPE</sc>). As for random effects, random intercepts for <sc>SPEAKER</sc> and random slopes for <sc>AFFIX</sc> are included. The p-values of the analysis of variance of the final model are given in <xref ref-type="table" rid="T4">Table 4</xref>.</p>
<table-wrap position="float" id="T4">
<label>TABLE 4</label>
<caption><p><italic>p</italic>-values of fixed effects in the final &#x201C;traditional&#x201D; model, fitted to the log-transformed durations of S.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td/>
<td valign="top" align="left">Sum Sq</td>
<td valign="top" align="left">Mean Sq</td>
<td valign="top" align="center">NumDF</td>
<td valign="top" align="center">DenDF</td>
<td valign="top" align="center">F.value</td>
<td valign="top" align="left">Pr (&#x003E; F)</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">A<sc>FFIX</sc></td>
<td valign="top" align="left">0.711</td>
<td valign="top" align="left">0.711</td>
<td valign="top" align="center">1</td>
<td valign="top" align="right">37.90</td>
<td valign="top" align="right">13.845</td>
<td valign="top" align="left">0.001</td>
</tr>
<tr>
<td valign="top" align="left"><sc>SPEAKING</sc>R<sc>ATE</sc></td>
<td valign="top" align="left">0.163</td>
<td valign="top" align="left">0.163</td>
<td valign="top" align="center">1</td>
<td valign="top" align="right">604.07</td>
<td valign="top" align="right">3.165</td>
<td valign="top" align="left">0.076</td>
</tr>
<tr>
<td valign="top" align="left"><sc>BASE</sc>D<sc>UR</sc>L<sc>OG</sc></td>
<td valign="top" align="left">6.278</td>
<td valign="top" align="left">6.278</td>
<td valign="top" align="center">1</td>
<td valign="top" align="right">572.80</td>
<td valign="top" align="right">122.247</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>PAUSE</sc>B<sc>IN</sc></td>
<td valign="top" align="left">5.430</td>
<td valign="top" align="left">5.430</td>
<td valign="top" align="center">1</td>
<td valign="top" align="right">635.92</td>
<td valign="top" align="right">105.722</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>BIPHONE</sc>P<sc>ROB</sc>S<sc>UM</sc>B<sc>IN</sc></td>
<td valign="top" align="left">0.646</td>
<td valign="top" align="left">0.646</td>
<td valign="top" align="center">1</td>
<td valign="top" align="right">596.28</td>
<td valign="top" align="right">12.580</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>FOL</sc>T<sc>YPE</sc></td>
<td valign="top" align="left">2.199</td>
<td valign="top" align="left">0.550</td>
<td valign="top" align="center">4</td>
<td valign="top" align="right">605.15</td>
<td valign="top" align="right">10.703</td>
<td valign="top" align="left">0.000</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The marginal R-squared value of the model is 0.43, i.e., fixed effects explain 43% of variation in the data. Taking random effects into account as well, the conditional R-squared value is 0.62. That is, the model explains 62% of data variation in total (see <xref ref-type="bibr" rid="B43">Nakagawa et al., 2017</xref>, for details on marginal and conditional R-squared computation). Both R-squared values were computed using the MuMIn package (<xref ref-type="bibr" rid="B9">Barton, 2020</xref>). The R-squared values are similar to the values found by <xref ref-type="bibr" rid="B57">Schmitz et al. (2020)</xref> on their complete data set.</p>
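<p>The relation between the two R-squared values can be made explicit with a small Python sketch of the variance-component formulation for Gaussian models with random intercepts. It is not the MuMIn implementation; the function name and the variance components are hypothetical.</p>
<preformat>
```python
def marginal_conditional_r2(var_fixed, var_random, var_residual):
    """Marginal and conditional R-squared from variance components.

    marginal    = fixed / (fixed + random + residual)
    conditional = (fixed + random) / (fixed + random + residual)
    """
    total = var_fixed + var_random + var_residual
    return var_fixed / total, (var_fixed + var_random) / total


# Invented variance components; NOT estimates from this study.
marginal, conditional = marginal_conditional_r2(2.0, 1.0, 1.0)
print(marginal, conditional)
```
</preformat>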
<p>The estimates of the final model and their p-values are given in <xref ref-type="table" rid="T5">Table 5</xref>. The reference levels for the categorical predictors are: for A<sc>FFIX</sc> it is NM, for <sc>PAUSE</sc>B<sc>IN</sc> it is no-pause, for <sc>BIPHONE</sc>P<sc>ROB</sc>S<sc>UM</sc>B<sc>IN</sc> it is high, and for <sc>FOL</sc>T<sc>YPE</sc> it is APP.</p>
<table-wrap position="float" id="T5">
<label>TABLE 5</label>
<caption><p>Fixed-effect coefficients and <italic>p</italic>-values as computed by the final &#x201C;traditional&#x201D; model (mixed-effects model fitted to the log-transformed duration of S).</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td/>
<td valign="top" align="center">Estimate</td>
<td valign="top" align="left">Std. Error</td>
<td valign="top" align="center"><italic>df</italic></td>
<td valign="top" align="center"><italic>t</italic>-value</td>
<td valign="top" align="left">Pr (&#x003E; |t|)</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">(Intercept)</td>
<td valign="top" align="right">&#x2212;1.202</td>
<td valign="top" align="left">0.083</td>
<td valign="top" align="right">407.927</td>
<td valign="top" align="right">&#x2212;14.520</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left">A<sc>FFIX</sc>PL</td>
<td valign="top" align="right">&#x2212;0.087</td>
<td valign="top" align="left">0.023</td>
<td valign="top" align="right">37.896</td>
<td valign="top" align="right">&#x2212;3.721</td>
<td valign="top" align="left">0.001</td>
</tr>
<tr>
<td valign="top" align="left"><sc>SPEAKING</sc>R<sc>ATE</sc></td>
<td valign="top" align="right">&#x2212;0.022</td>
<td valign="top" align="left">0.012</td>
<td valign="top" align="right">604.072</td>
<td valign="top" align="right">&#x2212;1.779</td>
<td valign="top" align="left">0.076</td>
</tr>
<tr>
<td valign="top" align="left"><sc>BASE</sc>D<sc>UR</sc>L<sc>OG</sc></td>
<td valign="top" align="right">0.635</td>
<td valign="top" align="left">0.057</td>
<td valign="top" align="right">572.805</td>
<td valign="top" align="right">11.057</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>PAUSE</sc>B<sc>IN</sc>pause</td>
<td valign="top" align="right">0.234</td>
<td valign="top" align="left">0.023</td>
<td valign="top" align="right">635.917</td>
<td valign="top" align="right">10.282</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>BIPHONE</sc>P<sc>ROB</sc>S<sc>UM</sc>B<sc>IN</sc>low</td>
<td valign="top" align="right">&#x2212;0.076</td>
<td valign="top" align="left">0.021</td>
<td valign="top" align="right">596.279</td>
<td valign="top" align="right">&#x2212;3.547</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>FOL</sc>T<sc>YPE</sc>F</td>
<td valign="top" align="right">&#x2212;0.001</td>
<td valign="top" align="left">0.073</td>
<td valign="top" align="right">610.436</td>
<td valign="top" align="right">&#x2212;0.007</td>
<td valign="top" align="left">0.994</td>
</tr>
<tr>
<td valign="top" align="left"><sc>FOL</sc>T<sc>YPE</sc>N</td>
<td valign="top" align="right">&#x2212;0.004</td>
<td valign="top" align="left">0.028</td>
<td valign="top" align="right">600.528</td>
<td valign="top" align="right">&#x2212;0.134</td>
<td valign="top" align="left">0.893</td>
</tr>
<tr>
<td valign="top" align="left"><sc>FOL</sc>T<sc>YPE</sc>P</td>
<td valign="top" align="right">&#x2212;0.027</td>
<td valign="top" align="left">0.025</td>
<td valign="top" align="right">599.182</td>
<td valign="top" align="right">&#x2212;1.107</td>
<td valign="top" align="left">0.269</td>
</tr>
<tr>
<td valign="top" align="left"><sc>FOL</sc>T<sc>YPE</sc>V</td>
<td valign="top" align="right">&#x2212;0.145</td>
<td valign="top" align="left">0.025</td>
<td valign="top" align="right">610.241</td>
<td valign="top" align="right">&#x2212;5.852</td>
<td valign="top" align="left">0.000</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The predictor strength of individual covariates was checked by taking the final model as template. For each predictor variable, a model was fitted lacking the particular variable. This resulted in one reduced model per predictor. Then, R-squared values were computed for these models and compared with that of the final model. The variable leading to the highest decrease in R-squared value as compared to the final model is thus the variable showing the highest predictor strength. The results of this comparison are reflected in the hierarchy given in (1). The decrease in R-squared is greatest when removing <sc>BASE</sc>D<sc>UR</sc>L<sc>OG</sc>, followed by <sc>PAUSE</sc>B<sc>IN</sc>, and so forth. The resulting order is identical to the one found by <xref ref-type="bibr" rid="B57">Schmitz et al. (2020)</xref> for the complete data set.</p>
<list list-type="simple">
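<list-item><p>(1) <sc>BASE</sc>D<sc>UR</sc>L<sc>OG</sc> &#x003E; &#x003E; <sc>PAUSE</sc>B<sc>IN</sc> &#x003E; &#x003E; A<sc>FFIX</sc> &#x003E; &#x003E; <sc>FOL</sc>T<sc>YPE</sc> &#x003E; &#x003E; <sc>SPEAKING</sc>R<sc>ATE</sc> &#x003E; &#x003E; <sc>BIPHONE</sc>P<sc>ROB</sc>S<sc>UM</sc>B<sc>IN</sc></p>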
</list-item>
</list>
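<p>The ranking logic behind such a hierarchy can be sketched in Python: each predictor is scored by the drop in R-squared observed when it is removed from the final model. The function name, predictor names, and R-squared values below are hypothetical and do not correspond to the models reported here.</p>
<preformat>
```python
def rank_by_predictor_strength(full_r2, reduced_r2):
    """Order predictors by the drop in R-squared caused by removing them.

    reduced_r2 maps a predictor name to the R-squared of the model
    fitted without that predictor.
    """
    drops = {name: full_r2 - r2 for name, r2 in reduced_r2.items()}
    return sorted(drops, key=drops.get, reverse=True)


# Invented R-squared values; NOT results from this study.
reduced = {"predictorA": 0.40, "predictorB": 0.31, "predictorC": 0.42}
ranking = rank_by_predictor_strength(0.43, reduced)
print(" >> ".join(ranking))
```
</preformat>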
</sec>
<sec id="S6.SS2">
<title>Model B: LDL Measures and <sc>AFFIX</sc> Specification</title>
<p>In the final model including LDL measures as well as the A<sc>FFIX</sc> covariate as parts of the individual components resulting from the principal component analysis, and fitted according to the procedure described in Section &#x201C;Model B: LDL Measures and Affix Specification,&#x201D; we find main effects of the first principal component (C<sc>OMPONENT</sc>1), the third principal component (C<sc>OMPONENT</sc>3), <sc>DENSITY</sc>, ALC, base duration (<sc>BASE</sc>D<sc>UR</sc>L<sc>OG</sc>), following pause (<sc>PAUSE</sc>B<sc>IN</sc>), following segmental type (<sc>FOL</sc>T<sc>YPE</sc>), and preceding consonant (<sc>PRE</sc>C). Regarding random effects, only a <sc>SPEAKER</sc>-specific random intercept turns out to significantly improve model fit. The p-values of the analysis of variance of the final model are given in <xref ref-type="table" rid="T6">Table 6</xref>.</p>
<table-wrap position="float" id="T6">
<label>TABLE 6</label>
<caption><p><italic>p</italic>-values of fixed effects in the final &#x201C;LDL measures and Affix&#x201D; model, fitted to the log-transformed durations of S.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td/>
<td valign="top" align="left">Sum Sq</td>
<td valign="top" align="left">Mean Sq</td>
<td valign="top" align="center">NumDF</td>
<td valign="top" align="right">DenDF</td>
<td valign="top" align="right">F.value</td>
<td valign="top" align="left">Pr (&#x003E; F)</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">C<sc>OMPONENT</sc>1</td>
<td valign="top" align="left">0.376</td>
<td valign="top" align="left">0.376</td>
<td valign="top" align="center">1</td>
<td valign="top" align="right">618.06</td>
<td valign="top" align="right">6.970</td>
<td valign="top" align="left">0.008</td>
</tr>
<tr>
<td valign="top" align="left">C<sc>OMPONENT</sc>3</td>
<td valign="top" align="left">1.340</td>
<td valign="top" align="left">1.340</td>
<td valign="top" align="center">1</td>
<td valign="top" align="right">627.71</td>
<td valign="top" align="right">24.819</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>BASE</sc>D<sc>UR</sc>L<sc>OG</sc></td>
<td valign="top" align="left">6.751</td>
<td valign="top" align="left">6.751</td>
<td valign="top" align="center">1</td>
<td valign="top" align="right">620.55</td>
<td valign="top" align="right">125.080</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>PAUSE</sc>B<sc>IN</sc></td>
<td valign="top" align="left">5.805</td>
<td valign="top" align="left">5.805</td>
<td valign="top" align="center">1</td>
<td valign="top" align="right">642.19</td>
<td valign="top" align="right">107.568</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>FOL</sc>T<sc>YPE</sc></td>
<td valign="top" align="left">2.093</td>
<td valign="top" align="left">0.523</td>
<td valign="top" align="center">4</td>
<td valign="top" align="right">617.98</td>
<td valign="top" align="right">9.695</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>PRE</sc>C</td>
<td valign="top" align="left">0.702</td>
<td valign="top" align="left">0.234</td>
<td valign="top" align="center">3</td>
<td valign="top" align="right">615.33</td>
<td valign="top" align="right">4.334</td>
<td valign="top" align="left">0.005</td>
</tr>
<tr>
<td valign="top" align="left"><sc>DENSITY</sc></td>
<td valign="top" align="left">0.219</td>
<td valign="top" align="left">0.219</td>
<td valign="top" align="center">1</td>
<td valign="top" align="right">621.79</td>
<td valign="top" align="right">4.067</td>
<td valign="top" align="left">0.044</td>
</tr>
<tr>
<td valign="top" align="left">ALC</td>
<td valign="top" align="left">0.293</td>
<td valign="top" align="left">0.293</td>
<td valign="top" align="center">1</td>
<td valign="top" align="right">623.25</td>
<td valign="top" align="right">5.425</td>
<td valign="top" align="left">0.020</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The marginal R-squared value of the final model is 0.42, i.e., fixed effects explain 42% of the variation in our data. The conditional R-squared value of the final model is 0.60, that is, fixed and random effects taken together explain 60% of the variation.</p>
<p>The estimates of the final model and their p-values are given in <xref ref-type="table" rid="T7">Table 7</xref>. The reference levels for the categorical predictors are: for <sc>PAUSE</sc>B<sc>IN</sc> it is no-pause, for <sc>FOL</sc>T<sc>YPE</sc> it is APP, and for <sc>PRE</sc>C it is f.</p>
<table-wrap position="float" id="T7">
<label>TABLE 7</label>
<caption><p>Fixed-effect coefficients and <italic>p</italic>-values as computed by the final &#x201C;LDL measures and Affix&#x201D; model (mixed-effects model fitted to the log-transformed duration of S).</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td/>
<td valign="top" align="center">Estimate</td>
<td valign="top" align="left">Std. Error</td>
<td valign="top" align="center"><italic>df</italic></td>
<td valign="top" align="left"><italic>t</italic>-value</td>
<td valign="top" align="left">Pr (&#x003E; |t|)</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">(Intercept)</td>
<td valign="top" align="right">&#x2212;1.106</td>
<td valign="top" align="left">0.124</td>
<td valign="top" align="left">635.215</td>
<td valign="top" align="right">&#x2212;8.952</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left">C<sc>OMPONENT</sc>1</td>
<td valign="top" align="right">0.014</td>
<td valign="top" align="left">0.005</td>
<td valign="top" align="left">618.057</td>
<td valign="top" align="right">2.640</td>
<td valign="top" align="left">0.008</td>
</tr>
<tr>
<td valign="top" align="left">C<sc>OMPONENT</sc>3</td>
<td valign="top" align="right">&#x2212;0.041</td>
<td valign="top" align="left">0.008</td>
<td valign="top" align="left">627.708</td>
<td valign="top" align="right">&#x2212;4.982</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>BASE</sc>D<sc>UR</sc>L<sc>OG</sc></td>
<td valign="top" align="right">0.652</td>
<td valign="top" align="left">0.058</td>
<td valign="top" align="left">620.548</td>
<td valign="top" align="right">11.184</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>PAUSE</sc>B<sc>IN</sc>pause</td>
<td valign="top" align="right">0.237</td>
<td valign="top" align="left">0.023</td>
<td valign="top" align="left">642.193</td>
<td valign="top" align="right">10.371</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>FOL</sc>T<sc>YPE</sc>F</td>
<td valign="top" align="right">&#x2212;0.014</td>
<td valign="top" align="left">0.075</td>
<td valign="top" align="left">621.463</td>
<td valign="top" align="right">&#x2212;0.180</td>
<td valign="top" align="left">0.857</td>
</tr>
<tr>
<td valign="top" align="left"><sc>FOL</sc>T<sc>YPE</sc>N</td>
<td valign="top" align="right">&#x2212;0.006</td>
<td valign="top" align="left">0.029</td>
<td valign="top" align="left">614.760</td>
<td valign="top" align="right">&#x2212;0.198</td>
<td valign="top" align="left">0.843</td>
</tr>
<tr>
<td valign="top" align="left"><sc>FOL</sc>T<sc>YPE</sc>P</td>
<td valign="top" align="right">&#x2212;0.028</td>
<td valign="top" align="left">0.025</td>
<td valign="top" align="left">615.172</td>
<td valign="top" align="right">&#x2212;1.126</td>
<td valign="top" align="left">0.261</td>
</tr>
<tr>
<td valign="top" align="left"><sc>FOL</sc>T<sc>YPE</sc>V</td>
<td valign="top" align="right">&#x2212;0.141</td>
<td valign="top" align="left">0.025</td>
<td valign="top" align="left">620.352</td>
<td valign="top" align="right">&#x2212;5.612</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>PRE</sc>Ck</td>
<td valign="top" align="right">&#x2212;0.023</td>
<td valign="top" align="left">0.027</td>
<td valign="top" align="left">614.436</td>
<td valign="top" align="right">&#x2212;0.835</td>
<td valign="top" align="left">0.404</td>
</tr>
<tr>
<td valign="top" align="left"><sc>PRE</sc>Cp</td>
<td valign="top" align="right">&#x2212;0.040</td>
<td valign="top" align="left">0.027</td>
<td valign="top" align="left">614.491</td>
<td valign="top" align="right">&#x2212;1.475</td>
<td valign="top" align="left">0.141</td>
</tr>
<tr>
<td valign="top" align="left"><sc>PRE</sc>Ct</td>
<td valign="top" align="right">&#x2212;0.095</td>
<td valign="top" align="left">0.028</td>
<td valign="top" align="left">615.916</td>
<td valign="top" align="right">&#x2212;3.414</td>
<td valign="top" align="left">0.001</td>
</tr>
<tr>
<td valign="top" align="left"><sc>DENSITY</sc></td>
<td valign="top" align="right">&#x2212;0.241</td>
<td valign="top" align="left">0.119</td>
<td valign="top" align="left">621.790</td>
<td valign="top" align="right">&#x2212;2.017</td>
<td valign="top" align="left">0.044</td>
</tr>
<tr>
<td valign="top" align="left">ALC</td>
<td valign="top" align="right">&#x2212;5.302</td>
<td valign="top" align="left">2.277</td>
<td valign="top" align="left">623.246</td>
<td valign="top" align="right">&#x2212;2.329</td>
<td valign="top" align="left">0.020</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Similar to the procedure in Section &#x201C;Model A: Traditional Measures,&#x201D; the predictor strength of individual covariates was checked by taking the final model as template. For each predictor variable, a model was fitted lacking the pertinent variable. This resulted in one reduced model per covariate. Then, marginal R-squared values were computed and compared. The model showing the lowest of these values in turn lacked the covariate with the highest predictor strength. The result of this procedure is reflected in the hierarchy in (2). The decrease in R-squared is greatest when removing <sc>BASE</sc>D<sc>UR</sc>L<sc>OG</sc>, followed by <sc>PAUSE</sc>B<sc>IN</sc>, and so forth. In sum, variables containing measures obtained by our LDL analysis appear to be meaningful predictors of S duration.</p>
<list list-type="simple">
<list-item><p>(2) <sc>BASE</sc>D<sc>UR</sc>L<sc>OG</sc> &#x003E; &#x003E; <sc>PAUSE</sc>B<sc>IN</sc> &#x003E; &#x003E; C<sc>OMPONENT</sc>3 &#x003E; &#x003E; <sc>FOL</sc>T<sc>YPE</sc> &#x003E; &#x003E; ALC &#x003E; &#x003E; <sc>DENSITY</sc> &#x003E; &#x003E; C<sc>OMPONENT</sc>1 &#x003E; &#x003E; <sc>PRE</sc>C</p>
</list-item>
</list>
<p><xref ref-type="fig" rid="F3">Figure 3</xref> shows the effect on S duration of the numerical variables included in the model. The estimated values of the dependent variable <sc>S</sc>D<sc>UR</sc>L<sc>OG</sc>, i.e., S duration, and <sc>BASE</sc>D<sc>UR</sc>L<sc>OG</sc>, i.e., base duration, are back-transformed into seconds. For C<sc>OMPONENT</sc>1 (panel A), higher values lead to longer S durations, while for C<sc>OMPONENT</sc>3 (panel B), higher values lead to shorter S durations. Higher values of <sc>DENSITY</sc> (panel C) and ALC (panel D) come with shorter S durations. Longer bases come with longer S durations (panel E).</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>Partial effects of the numerical variables included in the final &#x201C;LDL measures and AFFIX&#x201D; model, fitted to the log-transformed values of duration of S. <bold>(A)</bold> C<sc>OMPONENT</sc>1 <bold>(B)</bold> C<sc>OMPONENT</sc>3 <bold>(C)</bold> <sc>DENSITY</sc> <bold>(D)</bold> ALC <bold>(E)</bold> back-transformed <sc>BASE</sc>D<sc>UR</sc>L<sc>OG</sc>.</p></caption>
<graphic xlink:href="fpsyg-12-680889-g003.tif"/>
</fig>
<p>The partial effects of the categorical variables included in the final model are illustrated in <xref ref-type="fig" rid="F4">Figure 4</xref>. Pauses lead to longer S durations (panel A), which is most likely a case of phrase-final lengthening (e.g., <xref ref-type="bibr" rid="B21">Cooper and Danly, 1981</xref>). There is also an effect of the following segment type, with S being shorter when followed by a vowel (panel B). This difference is significant for all consonant types being compared against vowels with the exception of fricatives. However, as there is only a small number of fricative cases in our data, this non-significant difference is potentially not meaningful. Lastly, there is an effect of preceding consonant on S duration (panel C). S duration is significantly longer if preceded by a voiceless labiodental fricative /f/ or a voiceless velar stop /k/ as compared to cases where S is preceded by a voiceless alveolar stop /t/. All other comparisons are non-significant.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p>Partial effects of the categorical variables included in the final &#x201C;LDL measures and AFFIX&#x201D; model, fitted to the log-transformed values of duration of S. <bold>(A)</bold> <sc>PAUSE</sc>B<sc>IN</sc> <bold>(B)</bold> <sc>FOL</sc>T<sc>YPE</sc> <bold>(C)</bold> <sc>PRE</sc>C.</p></caption>
<graphic xlink:href="fpsyg-12-680889-g004.tif"/>
</fig>
<p>Let us turn to the variables of interest, i.e., those derived from our LDL network. C<sc>OMPONENT</sc>1 acts as a general measure of phonological certainty. High values of C<sc>OMPONENT</sc>1 come with high values of <sc>PATH_COUNTS</sc>, <sc>PATH_ENTROPIES</sc>, and ALDC, indicating a high level of phonological uncertainty. At the other end of the C<sc>OMPONENT</sc>1 dimension, high values of <sc>PATH_SUM</sc> and <sc>SUPPORT</sc> indicate a high level of phonological certainty. Higher uncertainty appears to lead to longer S durations, while higher certainty appears to lead to shorter S durations.</p>
<p>Recall from Section &#x201C;Model B: LDL Measures and Affix Specification&#x201D; that C<sc>OMPONENT</sc>3 relates to semantic activation diversity and to the presence of the plural suffix. Higher values of C<sc>OMPONENT</sc>3 indicate a higher level of semantic activation diversity. Higher levels of activation diversity then lead to shorter S durations (see panel B of <xref ref-type="fig" rid="F3">Figure 3</xref>). High values of C<sc>OMPONENT</sc>3 are positively correlated with the presence of plural S. It appears that the presence of the plural makes words semantically more similar to each other as they share this meaning component. Hence it is to be expected that plural words live in a space of greater semantic activation diversity. C<sc>OMPONENT</sc>3 is not only a measure of semantic activation diversity, but also indicates that plural pseudowords tend to have a higher degree of semantic activation diversity than monomorphemic pseudowords in general. D<sc>ENSITY</sc> and ALC also tap into the semantics of pseudowords. That is, similar to C<sc>OMPONENT</sc>3, higher values indicate higher levels of semantic activation diversity. These higher levels then lead to shorter S durations.</p>
</sec>
<sec id="S6.SS3">
<title>Model C: LDL Measures Only</title>
<p>The final model of LDL measures only is fitted with main effects of the following variables: <sc>L</sc>1<sc>NORM</sc>, ALC, NNC, log-transformed base duration (<sc>BASE</sc>D<sc>UR</sc>L<sc>OG</sc>), pause (<sc>PAUSE</sc>B<sc>IN</sc>), following segmental type (<sc>FOL</sc>T<sc>YPE</sc>), and preceding consonant (<sc>PRE</sc>C). The <sc>SPEAKER</sc> variable is included as a random intercept. The <italic>p</italic>-values of the analysis of variance of the final model are given in <xref ref-type="table" rid="T8">Table 8</xref>.</p>
<table-wrap position="float" id="T8">
<label>TABLE 8</label>
<caption><p><italic>p</italic>-values of fixed effects in the final &#x201C;LDL measures only&#x201D; model, fitted to the log-transformed durations of S.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td/>
<td valign="top" align="center">Sum Sq</td>
<td valign="top" align="center">Mean Sq</td>
<td valign="top" align="center">NumDF</td>
<td valign="top" align="left">DenDF</td>
<td valign="top" align="right"><italic>F</italic>-value</td>
<td valign="top" align="left">Pr(&#x003E;F)</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><sc>L</sc>1<sc>NORM</sc></td>
<td valign="top" align="center">0.685</td>
<td valign="top" align="center">0.685</td>
<td valign="top" align="center">1</td>
<td valign="top" align="left">611.07</td>
<td valign="top" align="right">13.473</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>BASE</sc>D<sc>UR</sc>L<sc>OG</sc></td>
<td valign="top" align="center">6.047</td>
<td valign="top" align="center">6.047</td>
<td valign="top" align="center">1</td>
<td valign="top" align="left">627.51</td>
<td valign="top" align="right">118.901</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>PAUSE</sc>B<sc>IN</sc></td>
<td valign="top" align="center">5.440</td>
<td valign="top" align="center">5.440</td>
<td valign="top" align="center">1</td>
<td valign="top" align="left">632.72</td>
<td valign="top" align="right">106.956</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>FOL</sc>T<sc>YPE</sc></td>
<td valign="top" align="center">2.056</td>
<td valign="top" align="center">0.514</td>
<td valign="top" align="center">4</td>
<td valign="top" align="left">610.10</td>
<td valign="top" align="right">10.105</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>PRE</sc>C</td>
<td valign="top" align="center">0.761</td>
<td valign="top" align="center">0.254</td>
<td valign="top" align="center">3</td>
<td valign="top" align="left">607.96</td>
<td valign="top" align="right">4.985</td>
<td valign="top" align="left">0.002</td>
</tr>
<tr>
<td valign="top" align="left">ALC</td>
<td valign="top" align="center">0.534</td>
<td valign="top" align="center">0.534</td>
<td valign="top" align="center">1</td>
<td valign="top" align="left">615.51</td>
<td valign="top" align="right">10.504</td>
<td valign="top" align="left">0.001</td>
</tr>
<tr>
<td valign="top" align="left">NNC</td>
<td valign="top" align="center">0.778</td>
<td valign="top" align="center">0.778</td>
<td valign="top" align="center">1</td>
<td valign="top" align="left">619.67</td>
<td valign="top" align="right">15.296</td>
<td valign="top" align="left">0.000</td>
</tr>
</tbody>
</table>
</table-wrap>
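<p>As a reading aid, the entries of <xref ref-type="table" rid="T8">Table 8</xref> can be checked against each other. The following Python sketch assumes the common convention for such tables, in which the mean square equals the sum of squares divided by the numerator degrees of freedom, and the F value equals the mean square divided by the residual variance of the model. This convention is an assumption, as it is not spelled out in the text. The function names are hypothetical; the numbers are copied from the table.</p>
<preformat>
```python
# Rows copied from Table 8: (Sum Sq, Mean Sq, NumDF, F value).
TABLE_8 = {
    "L1NORM": (0.685, 0.685, 1, 13.473),
    "BASEDURLOG": (6.047, 6.047, 1, 118.901),
    "PAUSEBIN": (5.440, 5.440, 1, 106.956),
    "FOLTYPE": (2.056, 0.514, 4, 10.105),
    "PREC": (0.761, 0.254, 3, 4.985),
    "ALC": (0.534, 0.534, 1, 10.504),
    "NNC": (0.778, 0.778, 1, 15.296),
}


def mean_square(sum_sq, num_df):
    """Mean square as sum of squares per numerator degree of freedom."""
    return sum_sq / num_df


def implied_residual_variance(mean_sq, f_value):
    """Residual variance implied by a row, ASSUMING F = Mean Sq / residual variance."""
    return mean_sq / f_value


for name, (sum_sq, mean_sq, num_df, f_value) in TABLE_8.items():
    recomputed = mean_square(sum_sq, num_df)
    sigma2 = implied_residual_variance(mean_sq, f_value)
    print(f"{name:12s} Mean Sq = {recomputed:.3f}   implied residual variance = {sigma2:.4f}")
```
</preformat>
<p>Under this assumption, every row implies a residual variance of roughly 0.051 on the log scale, which suggests that the rows are mutually consistent up to rounding.</p>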
<p>With a marginal R-squared value of 0.41, the fixed effects of this model explain 41% of the variation in the data. The conditional R-squared value of the model is 0.61, that is, the complete model, including the random intercept for speaker, accounts for 61% of the variation.</p>
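<p>The relation between the two R-squared values can be illustrated with a short Python sketch. It assumes the definition commonly used for mixed-effects models, in which the marginal value is the share of variance attributed to the fixed effects and the conditional value is the share attributed to fixed and random effects together. Whether exactly this definition underlies the reported values is an assumption, and the function names are hypothetical.</p>
<preformat>
```python
def r_squared_pair(var_fixed, var_random, var_residual):
    """Return (marginal, conditional) R-squared from three variance components."""
    total = var_fixed + var_random + var_residual
    marginal = var_fixed / total
    conditional = (var_fixed + var_random) / total
    return marginal, conditional


def variance_shares(marginal, conditional):
    """Split the total variance into shares, given the two R-squared values."""
    return {
        "fixed": marginal,
        "random": conditional - marginal,
        "residual": 1.0 - conditional,
    }


# Values reported for the final "LDL measures only" model.
shares = variance_shares(0.41, 0.61)
for component, share in shares.items():
    print(f"{component:9s} {share:.2f}")
```
</preformat>
<p>Read this way, the difference between the two values (0.61 minus 0.41) is the share of variation attributable to between-speaker differences, and the remainder is left unexplained.</p>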
<p>The coefficients of the final model and their p-values are given in <xref ref-type="table" rid="T9">Table 9</xref>. The reference levels for the categorical covariates are: for <sc>PAUSE</sc>B<sc>IN</sc> it is no-pause; for <sc>FOL</sc>T<sc>YPE</sc> it is APP, and for <sc>PRE</sc>C it is f.</p>
<table-wrap position="float" id="T9">
<label>TABLE 9</label>
<caption><p>Fixed-effect coefficients and <italic>p</italic>-values as computed by the final &#x201C;LDL measures only&#x201D; model (mixed-effects model fitted to the log-transformed duration of S).</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td/>
<td valign="top" align="center">Estimate</td>
<td valign="top" align="left">Std. Error</td>
<td valign="top" align="center"><italic>df</italic></td>
<td valign="top" align="center"><italic>t</italic>-value</td>
<td valign="top" align="left">Pr(&#x003E;|t|)</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">(Intercept)</td>
<td valign="top" align="right">&#x2212;2.334</td>
<td valign="top" align="left">0.320</td>
<td valign="top" align="left">625.440</td>
<td valign="top" align="right">&#x2212;7.301</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>L</sc>1<sc>NORM</sc></td>
<td valign="top" align="right">&#x2212;0.044</td>
<td valign="top" align="left">0.012</td>
<td valign="top" align="left">611.066</td>
<td valign="top" align="right">&#x2212;3.671</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>BASE</sc>D<sc>UR</sc>L<sc>OG</sc></td>
<td valign="top" align="right">0.624</td>
<td valign="top" align="left">0.057</td>
<td valign="top" align="left">627.514</td>
<td valign="top" align="right">10.904</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>PAUSE</sc>B<sc>IN</sc>pause</td>
<td valign="top" align="right">0.233</td>
<td valign="top" align="left">0.022</td>
<td valign="top" align="left">632.719</td>
<td valign="top" align="right">10.342</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>FOL</sc>T<sc>YPE</sc>F</td>
<td valign="top" align="right">&#x2212;0.019</td>
<td valign="top" align="left">0.073</td>
<td valign="top" align="left">613.088</td>
<td valign="top" align="right">&#x2212;0.267</td>
<td valign="top" align="left">0.790</td>
</tr>
<tr>
<td valign="top" align="left"><sc>FOL</sc>T<sc>YPE</sc>N</td>
<td valign="top" align="right">&#x2212;0.005</td>
<td valign="top" align="left">0.028</td>
<td valign="top" align="left">607.324</td>
<td valign="top" align="right">&#x2212;0.195</td>
<td valign="top" align="left">0.845</td>
</tr>
<tr>
<td valign="top" align="left"><sc>FOL</sc>T<sc>YPE</sc>P</td>
<td valign="top" align="right">&#x2212;0.023</td>
<td valign="top" align="left">0.024</td>
<td valign="top" align="left">607.817</td>
<td valign="top" align="right">&#x2212;0.950</td>
<td valign="top" align="left">0.343</td>
</tr>
<tr>
<td valign="top" align="left"><sc>FOL</sc>T<sc>YPE</sc>V</td>
<td valign="top" align="right">&#x2212;0.140</td>
<td valign="top" align="left">0.025</td>
<td valign="top" align="left">611.952</td>
<td valign="top" align="right">&#x2212;5.693</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left"><sc>PRE</sc>Ck</td>
<td valign="top" align="right">&#x2212;0.029</td>
<td valign="top" align="left">0.027</td>
<td valign="top" align="left">607.726</td>
<td valign="top" align="right">&#x2212;1.058</td>
<td valign="top" align="left">0.291</td>
</tr>
<tr>
<td valign="top" align="left"><sc>PRE</sc>Cp</td>
<td valign="top" align="right">&#x2212;0.053</td>
<td valign="top" align="left">0.027</td>
<td valign="top" align="left">607.478</td>
<td valign="top" align="right">&#x2212;1.950</td>
<td valign="top" align="left">0.052</td>
</tr>
<tr>
<td valign="top" align="left"><sc>PRE</sc>Ct</td>
<td valign="top" align="right">&#x2212;0.101</td>
<td valign="top" align="left">0.028</td>
<td valign="top" align="left">608.068</td>
<td valign="top" align="right">&#x2212;3.632</td>
<td valign="top" align="left">0.000</td>
</tr>
<tr>
<td valign="top" align="left">ALC</td>
<td valign="top" align="right">&#x2212;6.663</td>
<td valign="top" align="left">2.056</td>
<td valign="top" align="left">615.511</td>
<td valign="top" align="right">&#x2212;3.241</td>
<td valign="top" align="left">0.001</td>
</tr>
<tr>
<td valign="top" align="left">NNC</td>
<td valign="top" align="right">1.221</td>
<td valign="top" align="left">0.312</td>
<td valign="top" align="left">619.671</td>
<td valign="top" align="right">3.911</td>
<td valign="top" align="left">0.000</td>
</tr>
</tbody>
</table>
</table-wrap>
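<p>Because the model is fitted to log-transformed durations, the coefficients in <xref ref-type="table" rid="T9">Table 9</xref> act multiplicatively on the duration scale. The following Python sketch illustrates this reading. It assumes that the natural logarithm was used, which the text does not state explicitly; the function names are hypothetical, and the coefficients are copied from the table.</p>
<preformat>
```python
import math

# Selected coefficients copied from Table 9 (log scale).
CATEGORICAL = {"PAUSEBINpause": 0.233, "FOLTYPEV": -0.140, "PRECt": -0.101}
BASEDURLOG_SLOPE = 0.624


def duration_ratio(coefficient):
    """Multiplicative change in S duration relative to the reference level.

    ASSUMPTION: durations were transformed with the natural logarithm.
    """
    return math.exp(coefficient)


def percent_change(coefficient):
    """The same change expressed as a percentage of the reference duration."""
    return (math.exp(coefficient) - 1.0) * 100.0


def scale_for_base_ratio(slope, base_ratio):
    """Change in S duration when base duration is multiplied by base_ratio.

    Both variables are log-transformed, so the slope acts as an exponent;
    this holds for any log base as long as both use the same one.
    """
    return base_ratio ** slope


for name, coefficient in CATEGORICAL.items():
    print(f"{name:14s} ratio = {duration_ratio(coefficient):.3f} "
          f"({percent_change(coefficient):+.1f}%)")
print(f"doubling base duration: S duration x {scale_for_base_ratio(BASEDURLOG_SLOPE, 2.0):.3f}")
```
</preformat>
<p>On this reading, a positive coefficient corresponds to a ratio above 1 (longer S) and a negative coefficient to a ratio below 1 (shorter S), with all other predictors held constant.</p>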
<p>As for the other two final models, the predictor strength of the individual predictors was checked. Models with one of the predictor variables removed were constructed based on the complete final model. Then, marginal R-squared values were computed for each of these reduced models. A comparison of R-squared values then revealed the hierarchy of predictor strength given in (3). That is, the decrease in R-squared is greatest when removing <sc>BASE</sc>D<sc>UR</sc>L<sc>OG</sc>, followed by <sc>PAUSE</sc>B<sc>IN</sc>, and so forth.</p>
<list list-type="simple">
<list-item><p>(3) <sc>BASE</sc>D<sc>UR</sc>L<sc>OG</sc> &#x003E;&#x003E; <sc>PAUSE</sc>B<sc>IN</sc> &#x003E;&#x003E; <sc>FOL</sc>T<sc>YPE</sc> &#x003E;&#x003E; NNC &#x003E;&#x003E; <sc>L</sc>1<sc>NORM</sc> &#x003E;&#x003E; ALC &#x003E;&#x003E; <sc>PRE</sc>C</p>
</list-item>
</list>
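<p>The logic of ranking predictors by the drop in R-squared when each is removed can be sketched as follows. This is a simplified stand-in, not the analysis reported here: it uses ordinary least squares on simulated data rather than a mixed-effects model with marginal R-squared, and all variable names, effect sizes, and helper functions are hypothetical.</p>
<preformat>
```python
import random


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def r_squared(columns, y):
    """R-squared of a least-squares fit of y on the given columns plus an intercept."""
    rows = [[1.0] + [col[i] for col in columns] for i in range(len(y))]
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(k)]
    beta = solve(xtx, xty)
    fitted = [sum(b * v for b, v in zip(beta, r)) for r in rows]
    mean_y = sum(y) / len(y)
    ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot


def drop_one_ranking(predictors, y):
    """Rank predictor names by the decrease in R-squared when each is removed."""
    full = r_squared(list(predictors.values()), y)
    drops = {}
    for name in predictors:
        reduced = [col for other, col in predictors.items() if other != name]
        drops[name] = full - r_squared(reduced, y)
    return sorted(drops, key=drops.get, reverse=True)


# Simulated data with hypothetical effect sizes (3.0, 1.0, 0.1) and unit noise.
rng = random.Random(1)
n = 200
predictors = {name: [rng.gauss(0, 1) for _ in range(n)]
              for name in ("strong", "medium", "weak")}
y = [3.0 * s + 1.0 * m + 0.1 * w + rng.gauss(0, 1)
     for s, m, w in zip(predictors["strong"], predictors["medium"], predictors["weak"])]

ranking = drop_one_ranking(predictors, y)
print(" >> ".join(ranking))
```
</preformat>
<p>The printed ordering has the same form as the hierarchy in (3): the predictor whose removal costs the most explained variance comes first.</p>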
<p>Base duration and speaking rate show the same effects as in the model fitted in Section &#x201C;Model B: LDL Measures and <sc>AFFIX</sc> Specification,&#x201D; i.e., longer base durations come with longer S durations, while higher speaking rates lead to shorter S durations. As for categorical variables, pauses again come with longer S durations, and S is shorter if followed by a vowel. There is also an effect of the preceding consonant, with S duration being significantly longer if preceded by a voiceless labiodental fricative /f/ or a voiceless velar stop /k/ as compared to cases where S is preceded by a voiceless alveolar stop /t/. These results are generally in line with those of the analysis in the previous section.</p>
<p>Taking a closer look at the variables of interest, we find that, as in model B, higher values of <sc>L</sc>1<sc>NORM</sc> and ALC, i.e., higher levels of semantic activation diversity, come with shorter S durations. For NNC, we find that S duration is longer if a pseudoword is semantically similar to a real word. The effects of <sc>L</sc>1<sc>NORM</sc>, ALC, and NNC are illustrated in <xref ref-type="fig" rid="F5">Figure 5</xref>.</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption><p>Partial effects of LDL derived variables contained in the final &#x201C;LDL measures only&#x201D; model, fitted to the log-transformed values of duration of S. <bold>(A)</bold> <sc>L</sc>1<sc>NORM</sc> <bold>(B)</bold> ALC <bold>(C)</bold> NNC.</p></caption>
<graphic xlink:href="fpsyg-12-680889-g005.tif"/>
</fig>
</sec>
</sec>
<sec id="S7">
<title>Discussion</title>
<sec id="S7.SS1">
<title>The Present Results</title>
<p>Previous studies (<xref ref-type="bibr" rid="B77">Zimmermann, 2016</xref>; <xref ref-type="bibr" rid="B60">Seyfarth et al., 2017</xref>; <xref ref-type="bibr" rid="B64">Tomaschek et al., 2019</xref>; <xref ref-type="bibr" rid="B48">Plag et al., 2020</xref>, <xref ref-type="bibr" rid="B47">2017</xref>; <xref ref-type="bibr" rid="B57">Schmitz et al., 2020</xref>) reported that there are significant differences in the acoustic duration between different types of word-final S in English. Such durational differences challenge established feed-forward theories of morphology-phonology interaction (e.g., <xref ref-type="bibr" rid="B19">Chomsky and Halle, 1968</xref>; <xref ref-type="bibr" rid="B33">Kiparsky, 1982</xref>) as well as theories of psycholinguistics (e.g., <xref ref-type="bibr" rid="B39">Levelt et al., 1999</xref>; <xref ref-type="bibr" rid="B54">Roelofs and Ferreira, 2019</xref>; <xref ref-type="bibr" rid="B68">Turk and Shattuck-Hufnagel, 2020</xref>). The present study investigated whether measures derived on the basis of a discriminative learning theory are predictive of S durations in nonce words. In particular, we implemented LDL networks that model the production of a word based on its relation to the rest of the lexicon.</p>
<p>We explored the predictive possibilities of LDL measures by fitting three different models: a) a model based on the traditional predictors as used in previous studies (<xref ref-type="bibr" rid="B47">Plag et al., 2017</xref>; <xref ref-type="bibr" rid="B64">Tomaschek et al., 2019</xref>; <xref ref-type="bibr" rid="B57">Schmitz et al., 2020</xref>); b) a model with LDL measures and a variable <sc>AFFIX</sc> specifying the presence or absence of an affix; and c) a model with LDL measures but without a variable specifying the presence or absence of an affix. Both models with LDL measures show that such measures are predictive of S durations. This result is the most important of our study. While traditional variables such as lexical frequencies, bigram frequencies, transitional probabilities or neighbourhood densities measure important lexical properties, it is unclear why they would manifest themselves in a particular morphological effect in speech production. In LDL such effects can emerge through the mapping of form and meaning in a clearly defined process of discriminative learning.</p>
<p>All regression models showed a similar hierarchy of predictor strength for the variables included in the models. For the traditional model A, <sc>AFFIX</sc> is the third strongest predictor of S duration and for model B this spot is taken by C<sc>OMPONENT</sc>3, while there is no comparable variable included in model C. Comparing the variance explained by the fixed effects of the different models, we find that the traditional model accounts for most variation, i.e., 43%, while the LDL model including the <sc>AFFIX</sc> variable accounts for 42%, and the LDL model without the <sc>AFFIX</sc> variable accounts for 41% of variation. Thus, in terms of marginal R-squared values, all three models are close to each other. To check whether these differences in marginal R-squared values are of significance, the three models were refitted to the untrimmed data set and then compared with an analysis of variance. The results suggest that there is no significant difference between the traditional model and the LDL model including the <sc>AFFIX</sc> variable. However, the LDL model without the <sc>AFFIX</sc> variable shows a significantly worse fit (<italic>p</italic> &#x003C; 0.01). This seems to indicate that the LDL measures do not capture the full amount of the variance that is captured by the variable <sc>AFFIX</sc>. This means that there is still something about the morphological function that translates into duration and that is not properly modelled by the associative measurements of the learning network. The same problem holds, incidentally, for the traditional model (model A), in which the usual lexical measures (such as lexical frequencies, neighbourhood densities, etc.) and phonetic covariates (such as pauses, speech rate, etc.) are also not able to cover all durational variance. The morphological residue in both types of analysis remains a conundrum that calls for more sophisticated approaches in future research.</p>
</sec>
<sec id="S7.SS2">
<title>Comparison of Results to Other Studies</title>
<p>The LDL measures included in our final models are concerned with semantic activation diversity (C<sc>OMPONENT</sc>3, ALC, and <sc>DENSITY</sc> in model B; <sc>L</sc>1<sc>NORM</sc> and ALC in model C), with semantic similarity (NNC in model C), or with phonological certainty (C<sc>OMPONENT</sc>1 in model B).</p>
<p>Higher degrees of semantic activation diversity come with shorter S durations. This effect is similar to the one which was reported by <xref ref-type="bibr" rid="B67">Tucker et al. (2019b)</xref> in a study on stem vowels, and <xref ref-type="bibr" rid="B64">Tomaschek et al. (2019)</xref> in their NDL study on S duration. A higher degree of activation diversity makes it &#x201C;more difficult to discriminate the targeted outcome from its competitors&#x201D; (<xref ref-type="bibr" rid="B64">Tomaschek et al., 2019</xref>:27). As for production, a prolongation of the acoustic signal is dysfunctional if the prolongation maintains or increases the discrimination problem instead of contributing to resolving it (<xref ref-type="bibr" rid="B64">Tomaschek et al., 2019</xref>).</p>
<p>In the model without A<sc>FFIX</sc> as predictor variable, NNC (i.e., a pseudoword&#x2019;s semantic similarity to its closest semantic real word neighbour) emerges as significant (see model C). Why so? As reported in <xref ref-type="table" rid="T2">Table 2</xref>, the A<sc>FFIX</sc> variable and NNC are strongly negatively correlated (rho = &#x2212;0.89). Post-hoc analysis shows that plural S has significantly lower NNC values as compared to non-morphemic S (Wilcoxon test, <italic>p</italic> &#x003C; 0.001). It therefore appears that NNC takes over the role of differentiating between plural and non-morphemic S in model C.</p>
<p>As for phonological certainty, we find that higher phonological certainty leads to shorter S durations, while higher phonological uncertainty leads to longer S durations. Shorter durations in contexts of high phonological certainty may be related to effects of frequency, i.e., highly frequent forms are produced with higher certainty and are thus shorter.</p>
</sec>
<sec id="S7.SS3">
<title>Directions for Future Research and Conclusion</title>
<p>The results of the present study raise further questions. First, are the predictive measures found for word-final S duration in pseudowords also predictive for word-final S duration in real words? <xref ref-type="bibr" rid="B64">Tomaschek et al.&#x2019;s (2019)</xref> NDL implementation suggests that they are, but LDL networks still need to be implemented. It would be especially interesting to model those data sets that have yielded seemingly contradictory effects. Second, taking into account that the specification of A<sc>FFIX</sc> in the modelling process leads to a significantly better model fit, one may ask what the underlying reasons for this significant effect are. This then automatically leads to another question: Is it possible to capture the effect of the A<sc>FFIX</sc> specification in terms of (new) LDL measures?</p>
<p>To summarize, this paper was the first to investigate durational differences between different types of word-final S (non-morphemic vs. plural S) in pseudowords by means of an LDL implementation, the measures derived from it, and the resulting statistical analyses. The findings yielded important evidence on the question of how such durational differences come to be, i.e., they can be predicted based on the pseudoword&#x2019;s relations to the lexicon. We demonstrated that durational differences emerge from the pseudoword&#x2019;s resonance with the lexicon by way of differing degrees of semantic activation diversity and phonological uncertainty. These manifestations of the relations to other words in the lexicon in turn are the result of discriminative learning.</p>
</sec>
</sec>
<sec id="S8">
<title>Data Availability Statement</title>
<p>The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: <ext-link ext-link-type="uri" xlink:href="https://osf.io/zy7ar/?view_only=ef43a5caf6444270a56074027d7d6482">https://osf.io/zy7ar/?view_only=ef43a5caf6444270a56074027d7d6482</ext-link>.</p>
</sec>
<sec id="S9">
<title>Author Contributions</title>
<p>DS, IP, and DB-H contributed to the conception and design of the study and to manuscript revisions. DS retrieved the data and performed the computational implementation supported by SS. DS carried out the modelling and statistical analysis, and wrote the first draft of the manuscript. All authors read and approved the submitted version.</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="S10">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<fn-group>
<fn fn-type="financial-disclosure">
<p><bold>Funding.</bold> This research was funded by the Deutsche Forschungsgemeinschaft (Research Unit FOR2373 &#x2018;Spoken Morphology&#x2019;, grant PL 151/7-2 &#x2018;Central project&#x2019;, grants BA 6523/1-1 and PL 151/9-1 &#x2018;Final S in English: The role of acoustic detail in morphological learning&#x2019;, and grant PL 151/8-2 &#x2018;Morpho-Phonetic variation in English&#x2019;), which we gratefully acknowledge.</p>
</fn>
</fn-group>
<ack>
<p>The authors are grateful to the members of the DFG Research Unit FOR2373, the audience of the Words in the World Conference 2020, Janina Esser, Yu-Ying Chuang, and one reviewer for valuable input. The usual disclaimers apply.</p>
</ack>
<sec id="S12" sec-type="supplementary material">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fpsyg.2021.680889/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fpsyg.2021.680889/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table_1.DOCX" id="TS1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baayen</surname> <given-names>R. H.</given-names></name></person-group> (<year>2008</year>). <source><italic>Analyzing Linguistic Data: A Practical Introduction to Statistics Using R.</italic></source> <publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>, <pub-id pub-id-type="doi">10.1017/CBO9780511801686</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baayen</surname> <given-names>R. H.</given-names></name> <name><surname>Milin</surname> <given-names>P.</given-names></name></person-group> (<year>2010</year>). <article-title>Analyzing reaction times.</article-title> <source><italic>Int. J. Psychol. Res.</italic></source> <volume>3</volume> <fpage>12</fpage>&#x2013;<lpage>28</lpage>. <pub-id pub-id-type="doi">10.21500/20112084.807</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baayen</surname> <given-names>R. H.</given-names></name> <name><surname>Shafaei-Bajestan</surname> <given-names>E.</given-names></name></person-group> (<year>2019</year>). <source><italic>languageR: Analyzing Linguistic Data: A Practical Introduction to Statistics (1.5.0)</italic></source>. <ext-link ext-link-type="uri" xlink:href="https://cran.r-project.org/package=languageR">https://cran.r-project.org/package=languageR</ext-link> <comment>(accessed June 09, 2021)</comment>.</citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baayen</surname> <given-names>R. H.</given-names></name> <name><surname>Chuang</surname> <given-names>Y.-Y.</given-names></name> <name><surname>Heitmeier</surname> <given-names>M.</given-names></name></person-group> (<year>2019a</year>). <source><italic>WpmWithLdl: Implementation of Word and Paradigm Morphology with Linear Discriminative Learning (1.3.17.1).</italic></source> Available online at: <ext-link ext-link-type="uri" xlink:href="http://www.sfs.uni-tuebingen.de/~hbaayen/software.html">http://www.sfs.uni-tuebingen.de/~hbaayen/software.html</ext-link> <comment>(accessed June 09, 2021)</comment>.</citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baayen</surname> <given-names>R. H.</given-names></name> <name><surname>Chuang</surname> <given-names>Y.-Y.</given-names></name> <name><surname>Shafaei-Bajestan</surname> <given-names>E.</given-names></name> <name><surname>Blevins</surname> <given-names>J. P.</given-names></name></person-group> (<year>2019b</year>). <article-title>The discriminative lexicon: a unified computational model for the lexicon and lexical processing in comprehension and production grounded not in (De)composition but in linear discriminative learning.</article-title> <source><italic>Complexity</italic></source> <volume>2019</volume> <fpage>1</fpage>&#x2013;<lpage>39</lpage>. <pub-id pub-id-type="doi">10.1155/2019/4895891</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baayen</surname> <given-names>R. H.</given-names></name> <name><surname>Milin</surname> <given-names>P.</given-names></name> <name><surname>&#x00D0;ur&#x0111;evi&#x0107;</surname> <given-names>D. F.</given-names></name> <name><surname>Hendrix</surname> <given-names>P.</given-names></name> <name><surname>Marelli</surname> <given-names>M.</given-names></name></person-group> (<year>2011</year>). <article-title>An amorphous model for morphological processing in visual comprehension based on naive discriminative learning.</article-title> <source><italic>Psychol. Rev.</italic></source> <volume>118</volume> <fpage>438</fpage>&#x2013;<lpage>481</lpage>. <pub-id pub-id-type="doi">10.1037/a0023851</pub-id> <pub-id pub-id-type="pmid">21744979</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baayen</surname> <given-names>R. H.</given-names></name> <name><surname>Piepenbrock</surname> <given-names>R.</given-names></name> <name><surname>Gulikers</surname> <given-names>L.</given-names></name></person-group> (<year>1995</year>). <source><italic>The CELEX Lexical Database (CD-ROM).</italic></source> <publisher-loc>Philadelphia, PA</publisher-loc>: <publisher-name>Linguistic Data Consortium, University of Pennsylvania</publisher-name>.</citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baayen</surname> <given-names>R. H.</given-names></name> <name><surname>Shaoul</surname> <given-names>C.</given-names></name> <name><surname>Willits</surname> <given-names>J.</given-names></name> <name><surname>Ramscar</surname> <given-names>M.</given-names></name></person-group> (<year>2016</year>). <article-title>Comprehension without segmentation: a proof of concept with naive discriminative learning.</article-title> <source><italic>Lang. Cogn. Neurosci.</italic></source> <volume>31</volume> <fpage>106</fpage>&#x2013;<lpage>128</lpage>. <pub-id pub-id-type="doi">10.1080/23273798.2015.1065336</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Barton</surname> <given-names>K.</given-names></name></person-group> (<year>2020</year>). <source><italic>MuMIn: Multi-Model Inference (1.43.17).</italic></source> Available online at: <ext-link ext-link-type="uri" xlink:href="https://cran.r-project.org/package=MuMIn">https://cran.r-project.org/package=MuMIn</ext-link> <comment>(accessed June 09, 2021)</comment>.</citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bates</surname> <given-names>D.</given-names></name> <name><surname>M&#x00E4;chler</surname> <given-names>M.</given-names></name> <name><surname>Bolker</surname> <given-names>B. M.</given-names></name> <name><surname>Walker</surname> <given-names>S. C.</given-names></name></person-group> (<year>2015</year>). <article-title>Fitting linear mixed-effects models using lme4.</article-title> <source><italic>J. Statist. Softw.</italic></source> <volume>67</volume>:<issue>51</issue>. <pub-id pub-id-type="doi">10.18637/jss.v067.i01</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ben Hedia</surname> <given-names>S.</given-names></name></person-group> (<year>2019</year>). <source><italic>Gemination and Degemination in English Affixation: Investigating the Interplay Between Morphology, Phonology and Phonetics.</italic></source> <publisher-loc>Berlin</publisher-loc>: <publisher-name>Language Science Press</publisher-name>, <pub-id pub-id-type="doi">10.5281/zenodo.3232849</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ben Hedia</surname> <given-names>S.</given-names></name> <name><surname>Plag</surname> <given-names>I.</given-names></name></person-group> (<year>2017</year>). <article-title>Gemination and degemination in English prefixation: phonetic evidence for morphological organization.</article-title> <source><italic>J. Phonetics</italic></source> <volume>62</volume> <fpage>34</fpage>&#x2013;<lpage>49</lpage>. <pub-id pub-id-type="doi">10.1016/j.wocn.2017.02.002</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Blevins</surname> <given-names>J. P.</given-names></name> <name><surname>Ackerman</surname> <given-names>F.</given-names></name> <name><surname>Malouf</surname> <given-names>R.</given-names></name></person-group> (<year>2016</year>). &#x201C;<article-title>Morphology as an adaptive discriminative system</article-title>,&#x201D; in <source><italic>Morphological Metatheory</italic></source>, <role>eds</role> <person-group person-group-type="editor"><name><surname>Siddiqi</surname> <given-names>D.</given-names></name> <name><surname>Harley</surname> <given-names>H.</given-names></name></person-group> (<publisher-name>John Benjamins</publisher-name>), <fpage>271</fpage>&#x2013;<lpage>302</lpage>. <pub-id pub-id-type="doi">10.1075/la.229</pub-id> <pub-id pub-id-type="pmid">33486653</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Boersma</surname> <given-names>P.</given-names></name> <name><surname>Weenink</surname> <given-names>D.</given-names></name></person-group> (<year>2019</year>). <source><italic>Praat: doing phonetics by computer (6.1.27).</italic></source> Available online at: <ext-link ext-link-type="uri" xlink:href="http://www.praat.org/">http://www.praat.org/</ext-link> <comment>(accessed October 13, 2020)</comment>.</citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Booij</surname> <given-names>G. E.</given-names></name></person-group> (<year>1983</year>). <article-title>Principles and parameters in prosodic phonology.</article-title> <source><italic>Linguistics</italic></source> <volume>21</volume> <fpage>249</fpage>&#x2013;<lpage>280</lpage>. <pub-id pub-id-type="doi">10.1515/ling.1983.21.1.249</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Burnage</surname> <given-names>G.</given-names></name></person-group> (<year>1988</year>). <source><italic>CELEX, A Guide for Users.</italic></source> <publisher-loc>Nijmegen, Netherlands</publisher-loc>: <publisher-name>Centre for Lexical Information</publisher-name>.</citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Caselli</surname> <given-names>N. K.</given-names></name> <name><surname>Caselli</surname> <given-names>M. K.</given-names></name> <name><surname>Cohen-Goldberg</surname> <given-names>A. M.</given-names></name></person-group> (<year>2016</year>). <article-title>Inflected words in production: evidence for a morphologically rich lexicon.</article-title> <source><italic>Q. J. Exp. Psychol.</italic></source> <volume>69</volume> <fpage>434</fpage>&#x2013;<lpage>454</lpage>. <pub-id pub-id-type="doi">10.1080/17470218.2015.1054847</pub-id> <pub-id pub-id-type="pmid">26018493</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chavent</surname> <given-names>M.</given-names></name> <name><surname>Kuentz</surname> <given-names>V.</given-names></name> <name><surname>Labenne</surname> <given-names>A.</given-names></name> <name><surname>Liquet</surname> <given-names>B.</given-names></name> <name><surname>Saracco</surname> <given-names>J.</given-names></name></person-group> (<year>2017</year>). <source><italic>PCAmixdata: Multivariate Analysis of Mixed Data (3.1).</italic></source> Available online at: <ext-link ext-link-type="uri" xlink:href="https://cran.r-project.org/package=PCAmixdata">https://cran.r-project.org/package=PCAmixdata</ext-link> <comment>(accessed June 09, 2021)</comment>.</citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chomsky</surname> <given-names>N.</given-names></name> <name><surname>Halle</surname> <given-names>M.</given-names></name></person-group> (<year>1968</year>). <source><italic>The Sound Pattern of English.</italic></source> <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Harper and Row</publisher-name>.</citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chuang</surname> <given-names>Y.-Y.</given-names></name> <name><surname>Vollmer</surname> <given-names>M. L.</given-names></name> <name><surname>Shafaei-Bajestan</surname> <given-names>E.</given-names></name> <name><surname>Gahl</surname> <given-names>S.</given-names></name> <name><surname>Hendrix</surname> <given-names>P.</given-names></name> <name><surname>Baayen</surname> <given-names>R. H.</given-names></name></person-group> (<year>2020</year>). <article-title>The processing of pseudoword form and meaning in production and comprehension: a computational modeling approach using linear discriminative learning.</article-title> <source><italic>Behav. Res. Methods</italic></source> <volume>53</volume> <fpage>945</fpage>&#x2013;<lpage>976</lpage>. <pub-id pub-id-type="doi">10.3758/s13428-020-01356-w</pub-id> <pub-id pub-id-type="pmid">32377973</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cooper</surname> <given-names>W. E.</given-names></name> <name><surname>Danly</surname> <given-names>M.</given-names></name></person-group> (<year>1981</year>). <article-title>Segmental and temporal aspects of utterance-final lengthening.</article-title> <source><italic>Phonetica</italic></source> <volume>38</volume> <fpage>106</fpage>&#x2013;<lpage>115</lpage>.</citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Jong</surname> <given-names>N.</given-names></name> <name><surname>Wempe</surname> <given-names>T.</given-names></name></person-group> (<year>2008</year>). <source><italic>Praat Script Syllable Nuclei [Praat Script].</italic></source> Available online at: <ext-link ext-link-type="uri" xlink:href="https://sites.google.com/site/speechrate/Home/praat-script-syllable-nuclei-v2">https://sites.google.com/site/speechrate/Home/praat-script-syllable-nuclei-v2</ext-link> <comment>(accessed August 19, 2020)</comment>.</citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Drager</surname> <given-names>K. K.</given-names></name></person-group> (<year>2011</year>). <article-title>Sociophonetic variation and the lemma.</article-title> <source><italic>J. Phonetics</italic></source> <volume>39</volume> <fpage>694</fpage>&#x2013;<lpage>707</lpage>. <pub-id pub-id-type="doi">10.1016/j.wocn.2011.08.005</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fox</surname> <given-names>J.</given-names></name> <name><surname>Weisberg</surname> <given-names>S.</given-names></name></person-group> (<year>2019</year>). <source><italic>An R Companion to Applied Regression.</italic></source> <publisher-loc>Thousand Oaks, CA</publisher-loc>: <publisher-name>Sage Publishing</publisher-name>.</citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gahl</surname> <given-names>S.</given-names></name></person-group> (<year>2008</year>). <article-title>Time and thyme are not homophones: the effect of lemma frequency on word durations in spontaneous speech.</article-title> <source><italic>Language</italic></source> <volume>84</volume> <fpage>474</fpage>&#x2013;<lpage>496</lpage>. <pub-id pub-id-type="doi">10.1353/lan.0.0035</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goad</surname> <given-names>H.</given-names></name></person-group> (<year>1998</year>). <article-title>Plurals in SLI: prosodic deficit or morphological deficit?</article-title> <source><italic>Lang. Acquisition</italic></source> <volume>7</volume> <fpage>247</fpage>&#x2013;<lpage>284</lpage>. <pub-id pub-id-type="doi">10.1207/s15327817la0702-4_6</pub-id> <pub-id pub-id-type="pmid">26627889</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goad</surname> <given-names>H.</given-names></name></person-group> (<year>2002</year>). <article-title>Markedness in right-edge syllabification: parallels across populations.</article-title> <source><italic>Canad. J. Linguistics</italic></source> <volume>47</volume> <fpage>151</fpage>&#x2013;<lpage>186</lpage>.</citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hsieh</surname> <given-names>L.</given-names></name> <name><surname>Leonard</surname> <given-names>L. B.</given-names></name> <name><surname>Swanson</surname> <given-names>L. L.</given-names></name></person-group> (<year>1999</year>). <article-title>Some differences between English plural noun inflections and third singular verb inflections in the input: the contributions of frequency, sentence position, and duration.</article-title> <source><italic>J. Child Lang.</italic></source> <volume>26</volume> <fpage>531</fpage>&#x2013;<lpage>543</lpage>. <pub-id pub-id-type="doi">10.1017/S030500099900392X</pub-id> <pub-id pub-id-type="pmid">10603695</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ivens</surname> <given-names>S. H.</given-names></name> <name><surname>Koslin</surname> <given-names>B. L.</given-names></name></person-group> (<year>1991</year>). <source><italic>Demands for Reading Literacy Require New Accountability Methods.</italic></source> <publisher-loc>Valley</publisher-loc>: <publisher-name>Touchstone Applied Science Associates</publisher-name>.</citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jones</surname> <given-names>M. N.</given-names></name> <name><surname>Mewhort</surname> <given-names>D. J. K.</given-names></name></person-group> (<year>2007</year>). <article-title>Representing word meaning and order information in a composite holographic lexicon.</article-title> <source><italic>Psychol. Rev.</italic></source> <volume>114</volume> <fpage>1</fpage>&#x2013;<lpage>37</lpage>. <pub-id pub-id-type="doi">10.1037/0033-295X.114.1.1</pub-id> <pub-id pub-id-type="pmid">17227180</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kemps</surname> <given-names>R. J. J. K.</given-names></name> <name><surname>Ernestus</surname> <given-names>M.</given-names></name> <name><surname>Schreuder</surname> <given-names>R.</given-names></name> <name><surname>Baayen</surname> <given-names>R. H.</given-names></name></person-group> (<year>2005a</year>). <article-title>Prosodic cues for morphological complexity: the case of Dutch plural nouns.</article-title> <source><italic>Memory Cogn.</italic></source> <volume>33</volume> <fpage>430</fpage>&#x2013;<lpage>446</lpage>. <pub-id pub-id-type="doi">10.3758/BF03193061</pub-id> <pub-id pub-id-type="pmid">16156179</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kemps</surname> <given-names>R. J. J. K.</given-names></name> <name><surname>Wurm</surname> <given-names>L. H.</given-names></name> <name><surname>Ernestus</surname> <given-names>M.</given-names></name> <name><surname>Schreuder</surname> <given-names>R.</given-names></name> <name><surname>Baayen</surname> <given-names>R. H.</given-names></name></person-group> (<year>2005b</year>). <article-title>Prosodic cues for morphological complexity in Dutch and English.</article-title> <source><italic>Lang. Cogn. Proc.</italic></source> <volume>20</volume> <fpage>43</fpage>&#x2013;<lpage>73</lpage>. <pub-id pub-id-type="doi">10.1080/01690960444000223</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kiparsky</surname> <given-names>P.</given-names></name></person-group> (<year>1982</year>). &#x201C;<article-title>Lexical morphology and phonology</article-title>,&#x201D; in <source><italic>Linguistics in the Morning Calm: Selected Papers From SICOL1</italic></source>, <role>ed.</role> <person-group person-group-type="editor"><name><surname>Yang</surname> <given-names>I.</given-names></name></person-group> (<publisher-name>Hanshin</publisher-name>), <fpage>3</fpage>&#x2013;<lpage>91</lpage>.</citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Klatt</surname> <given-names>D. H.</given-names></name></person-group> (<year>1976</year>). <article-title>Linguistic uses of segmental duration in English: acoustic and perceptual evidence.</article-title> <source><italic>J. Acoust. Soc. Am.</italic></source> <volume>59</volume> <fpage>1208</fpage>&#x2013;<lpage>1221</lpage>. <pub-id pub-id-type="doi">10.1121/1.380986</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krivokapi&#x0107;</surname> <given-names>J.</given-names></name></person-group> (<year>2007</year>). <article-title>Prosodic planning: effects of phrasal length and complexity on pause duration.</article-title> <source><italic>J. Phonetics</italic></source> <volume>35</volume> <fpage>162</fpage>&#x2013;<lpage>179</lpage>. <pub-id pub-id-type="doi">10.1016/j.wocn.2006.04.001</pub-id> <pub-id pub-id-type="pmid">18379639</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kuznetsova</surname> <given-names>A.</given-names></name> <name><surname>Brockhoff</surname> <given-names>P. B.</given-names></name> <name><surname>Christensen</surname> <given-names>R. H. B.</given-names></name></person-group> (<year>2017</year>). <article-title>lmerTest package: tests in linear mixed effects models.</article-title> <source><italic>J. Statist. Softw.</italic></source> <volume>82</volume>:<issue>35902</issue>. <pub-id pub-id-type="doi">10.18637/jss.v082.i13</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Landauer</surname> <given-names>T. K.</given-names></name> <name><surname>Dumais</surname> <given-names>S. T.</given-names></name></person-group> (<year>1997</year>). <article-title>A solution to Plato&#x2019;s problem: the latent semantic analysis theory of acquisition, induction, and representation of knowledge.</article-title> <source><italic>Psychol. Rev.</italic></source> <volume>104</volume> <fpage>211</fpage>&#x2013;<lpage>240</lpage>. <pub-id pub-id-type="doi">10.1037/0033-295X.104.2.211</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>S.</given-names></name> <name><surname>Oh</surname> <given-names>Y. H.</given-names></name></person-group> (<year>1999</year>). <article-title>Tree-based modeling of prosodic phrasing and segmental duration for Korean TTS systems.</article-title> <source><italic>Speech Commun.</italic></source> <volume>28</volume> <fpage>283</fpage>&#x2013;<lpage>300</lpage>. <pub-id pub-id-type="doi">10.1016/S0167-6393(99)00014-X</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Levelt</surname> <given-names>W. J. M.</given-names></name> <name><surname>Roelofs</surname> <given-names>A.</given-names></name> <name><surname>Meyer</surname> <given-names>A. S.</given-names></name></person-group> (<year>1999</year>). <article-title>A theory of lexical access in speech production.</article-title> <source><italic>Behav. Brain Sci.</italic></source> <volume>22</volume> <fpage>1</fpage>&#x2013;<lpage>75</lpage>. <pub-id pub-id-type="doi">10.1017/S0140525X99001776</pub-id> <pub-id pub-id-type="pmid">11301520</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mikolov</surname> <given-names>T.</given-names></name> <name><surname>Sutskever</surname> <given-names>I.</given-names></name> <name><surname>Chen</surname> <given-names>K.</given-names></name> <name><surname>Corrado</surname> <given-names>G.</given-names></name> <name><surname>Dean</surname> <given-names>J.</given-names></name></person-group> (<year>2013</year>). <article-title>Distributed representations of words and phrases and their compositionality.</article-title> <source><italic>arXiv</italic> [preprint]</source>. <comment>Advances in Neural Information Processing Systems; arXiv:1310.4546</comment>.</citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Milin</surname> <given-names>P.</given-names></name> <name><surname>Feldman</surname> <given-names>L. B.</given-names></name> <name><surname>Ramscar</surname> <given-names>M.</given-names></name> <name><surname>Hendrix</surname> <given-names>P.</given-names></name> <name><surname>Baayen</surname> <given-names>R. H.</given-names></name></person-group> (<year>2017</year>). <article-title>Discrimination in lexical decision.</article-title> <source><italic>PLoS One</italic></source> <volume>12</volume>:<issue>e0171935</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0171935</pub-id> <pub-id pub-id-type="pmid">28235015</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Moore</surname> <given-names>E. H.</given-names></name></person-group> (<year>1920</year>). <article-title>On the reciprocal of the general algebraic matrix.</article-title> <source><italic>Bull. Am. Mathemat. Soc.</italic></source> <volume>26</volume> <fpage>394</fpage>&#x2013;<lpage>395</lpage>.</citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nakagawa</surname> <given-names>S.</given-names></name> <name><surname>Johnson</surname> <given-names>P. C. D.</given-names></name> <name><surname>Schielzeth</surname> <given-names>H.</given-names></name></person-group> (<year>2017</year>). <article-title>The coefficient of determination R<sup>2</sup> and intra-class correlation coefficient from generalized linear mixed-effects models revisited and expanded.</article-title> <source><italic>J. R. Soc. Interface</italic></source> <volume>14</volume>. <pub-id pub-id-type="doi">10.1098/rsif.2017.0213</pub-id> <pub-id pub-id-type="pmid">28904005</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>O&#x2019;Rourke</surname> <given-names>N.</given-names></name> <name><surname>Hatcher</surname> <given-names>L.</given-names></name> <name><surname>Stepanski</surname> <given-names>E. J.</given-names></name></person-group> (<year>2005</year>). <source><italic>A Step-by-Step Approach to Using SAS for Univariate &#x0026; Multivariate Statistics.</italic></source> <publisher-loc>Cary</publisher-loc>: <publisher-name>SAS Publishing</publisher-name>.</citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Penrose</surname> <given-names>R.</given-names></name></person-group> (<year>1955</year>). <article-title>A generalized inverse for matrices.</article-title> <source><italic>Mathemat. Proc. Cambridge Philos. Soc.</italic></source> <volume>51</volume> <fpage>406</fpage>&#x2013;<lpage>413</lpage>. <pub-id pub-id-type="doi">10.1017/S0305004100030401</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Plag</surname> <given-names>I.</given-names></name></person-group> (<year>2018</year>). <source><italic>Word-Formation in English (Second Edition).</italic></source> <publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>, <pub-id pub-id-type="doi">10.1017/CBO9780511841323</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Plag</surname> <given-names>I.</given-names></name> <name><surname>Homann</surname> <given-names>J.</given-names></name> <name><surname>Kunter</surname> <given-names>G.</given-names></name></person-group> (<year>2017</year>). <article-title>Homophony and morphology: the acoustics of word-final S in English.</article-title> <source><italic>J. Linguistics</italic></source> <volume>53</volume> <fpage>181</fpage>&#x2013;<lpage>216</lpage>. <pub-id pub-id-type="doi">10.1017/S0022226715000183</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Plag</surname> <given-names>I.</given-names></name> <name><surname>Lohmann</surname> <given-names>A.</given-names></name> <name><surname>Ben Hedia</surname> <given-names>S.</given-names></name> <name><surname>Zimmermann</surname> <given-names>J.</given-names></name></person-group> (<year>2020</year>). &#x201C;<article-title>An &#x003C;s&#x003E; is an &#x003C;s&#x2019;&#x003E;, or is it? Plural and genitive-plural are not homophonous</article-title>,&#x201D; in <source><italic>Complex Words</italic></source>, <role>eds</role> <person-group person-group-type="editor"><name><surname>K&#x00F6;rtv&#x00E9;lyessy</surname> <given-names>L.</given-names></name> <name><surname>&#x0160;tekauer</surname> <given-names>P.</given-names></name></person-group> (<publisher-name>Cambridge University Press</publisher-name>).</citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ramscar</surname> <given-names>M.</given-names></name> <name><surname>Yarlett</surname> <given-names>D.</given-names></name></person-group> (<year>2007</year>). <article-title>Linguistic self-correction in the absence of feedback: a new approach to the logical problem of language acquisition.</article-title> <source><italic>Cogn. Sci.</italic></source> <volume>31</volume> <fpage>927</fpage>&#x2013;<lpage>960</lpage>. <pub-id pub-id-type="doi">10.1080/03640210701703576</pub-id> <pub-id pub-id-type="pmid">21635323</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ramscar</surname> <given-names>M.</given-names></name> <name><surname>Yarlett</surname> <given-names>D.</given-names></name> <name><surname>Dye</surname> <given-names>M.</given-names></name> <name><surname>Denny</surname> <given-names>K.</given-names></name> <name><surname>Thorpe</surname> <given-names>K.</given-names></name></person-group> (<year>2010</year>). <article-title>The effects of feature-label-order and their implications for symbolic learning.</article-title> <source><italic>Cogn. Sci.</italic></source> <volume>34</volume> <fpage>909</fpage>&#x2013;<lpage>957</lpage>. <pub-id pub-id-type="doi">10.1111/j.1551-6709.2009.01092.x</pub-id> <pub-id pub-id-type="pmid">21564239</pub-id></citation></ref>
<ref id="B51"><citation citation-type="journal"><collab>R Core Team</collab> (<year>2020</year>). <source><italic>R: A Language and Environment for Statistical Computing.</italic></source> <publisher-name>R Foundation for Statistical Computing</publisher-name>. Available online at: <ext-link ext-link-type="uri" xlink:href="https://www.r-project.org/">https://www.r-project.org/</ext-link> <comment>(accessed February 15, 2021)</comment>.</citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rescorla</surname> <given-names>R. A.</given-names></name></person-group> (<year>1988</year>). <article-title>Pavlovian conditioning: it&#x2019;s not what you think it is.</article-title> <source><italic>Am. Psychol.</italic></source> <volume>43</volume> <fpage>151</fpage>&#x2013;<lpage>160</lpage>. <pub-id pub-id-type="doi">10.1037/0003-066X.43.3.151</pub-id> <pub-id pub-id-type="pmid">3364852</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rescorla</surname> <given-names>R. A.</given-names></name> <name><surname>Wagner</surname> <given-names>A. R.</given-names></name></person-group> (<year>1972</year>). &#x201C;<article-title>A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement</article-title>,&#x201D; in <source><italic>Classical Conditioning II: Current Research and Theory</italic></source>, <role>eds</role> <person-group person-group-type="editor"><name><surname>Black</surname> <given-names>A. H.</given-names></name> <name><surname>Prokasy</surname> <given-names>W. F.</given-names></name></person-group> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Appleton-Century-Crofts</publisher-name>), <fpage>64</fpage>&#x2013;<lpage>99</lpage>.</citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roelofs</surname> <given-names>A.</given-names></name> <name><surname>Ferreira</surname> <given-names>V. S.</given-names></name></person-group> (<year>2019</year>). &#x201C;<article-title>The architecture of speaking</article-title>,&#x201D; in <source><italic>Human Language: From Genes and Brains to Behavior</italic></source>, <role>ed.</role> <person-group person-group-type="editor"><name><surname>Hagoort</surname> <given-names>P.</given-names></name></person-group> (<publisher-name>MIT Press</publisher-name>), <fpage>35</fpage>&#x2013;<lpage>50</lpage>.</citation></ref>
<ref id="B55"><citation citation-type="journal"><collab>RStudio Team</collab> (<year>2021</year>). <source><italic>RStudio: Integrated Development for R (1.4.1103).</italic></source> Available online at: <ext-link ext-link-type="uri" xlink:href="http://www.rstudio.com/">http://www.rstudio.com/</ext-link> <comment>(accessed January 20, 2021)</comment>.</citation></ref>
<ref id="B56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmid</surname> <given-names>H.</given-names></name></person-group> (<year>1999</year>). &#x201C;<article-title>Improvements in part-of-speech tagging with an application to German</article-title>,&#x201D; in <source><italic>Natural Language Processing Using Very Large Corpora. Text, Speech and Language Technology</italic></source>, <role>eds</role> <person-group person-group-type="editor"><name><surname>Armstrong</surname> <given-names>S.</given-names></name> <name><surname>Church</surname> <given-names>K.</given-names></name> <name><surname>Isabelle</surname> <given-names>P.</given-names></name> <name><surname>Manzi</surname> <given-names>S.</given-names></name> <name><surname>Tzoukermann</surname> <given-names>E.</given-names></name> <name><surname>Yarowsky</surname> <given-names>D.</given-names></name></person-group> (<publisher-name>Springer</publisher-name>), <fpage>11</fpage>. <pub-id pub-id-type="doi">10.1007/978-94-017-2390-9_2</pub-id></citation></ref>
<ref id="B57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmitz</surname> <given-names>D.</given-names></name> <name><surname>Baer-Henney</surname> <given-names>D.</given-names></name> <name><surname>Plag</surname> <given-names>I.</given-names></name></person-group> (<year>2020</year>). <source><italic>The Duration of Word-Final /s/ Differs Across Morphological Categories in English: Evidence From Pseudowords [Manuscript submitted for publication].</italic></source> <publisher-loc>Germany</publisher-loc>: <publisher-name>English Language and Linguistics, Heinrich Heine University D&#x00FC;sseldorf</publisher-name>.</citation></ref>
<ref id="B58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Selkirk</surname> <given-names>E.</given-names></name></person-group> (<year>1996</year>). &#x201C;<article-title>The prosodic structure of function words</article-title>,&#x201D; in <source><italic>Signal to Syntax: Bootstrapping From Speech to Grammar in Early Acquisition</italic></source>, <role>eds</role> <person-group person-group-type="editor"><name><surname>Demuth</surname> <given-names>K.</given-names></name> <name><surname>Morgan</surname> <given-names>J.</given-names></name></person-group> (<publisher-name>Routledge</publisher-name>), <fpage>187</fpage>&#x2013;<lpage>213</lpage>.</citation></ref>
<ref id="B59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sering</surname> <given-names>T.</given-names></name> <name><surname>Milin</surname> <given-names>P.</given-names></name> <name><surname>Baayen</surname> <given-names>R. H.</given-names></name></person-group> (<year>2019</year>). <article-title>Language comprehension as a multiple label classification problem.</article-title> <source><italic>Statistica Neerlandica</italic></source> <fpage>1</fpage>&#x2013;<lpage>15</lpage>.</citation></ref>
<ref id="B60"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Seyfarth</surname> <given-names>S.</given-names></name> <name><surname>Garellek</surname> <given-names>M.</given-names></name> <name><surname>Gillingham</surname> <given-names>G.</given-names></name> <name><surname>Ackerman</surname> <given-names>F.</given-names></name> <name><surname>Malouf</surname> <given-names>R.</given-names></name></person-group> (<year>2017</year>). <article-title>Acoustic differences in morphologically-distinct homophones.</article-title> <source><italic>Lang. Cogn. Neurosci.</italic></source> <volume>33</volume> <fpage>32</fpage>&#x2013;<lpage>49</lpage>. <pub-id pub-id-type="doi">10.1080/23273798.2017.1359634</pub-id></citation></ref>
<ref id="B61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shaoul</surname> <given-names>C.</given-names></name> <name><surname>Westbury</surname> <given-names>C.</given-names></name></person-group> (<year>2010</year>). <article-title>Exploring lexical co-occurrence space using HiDEx.</article-title> <source><italic>Behav. Res. Methods</italic></source> <volume>42</volume> <fpage>393</fpage>&#x2013;<lpage>413</lpage>. <pub-id pub-id-type="doi">10.3758/BRM.42.2.393</pub-id> <pub-id pub-id-type="pmid">20479171</pub-id></citation></ref>
<ref id="B62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Swanson</surname> <given-names>L. A.</given-names></name> <name><surname>Leonard</surname> <given-names>L. B.</given-names></name></person-group> (<year>1994</year>). <article-title>Duration of function-word vowels in mothers&#x2019; speech to young children.</article-title> <source><italic>J. Speech Hearing Res.</italic></source> <volume>37</volume> <fpage>1394</fpage>&#x2013;<lpage>1405</lpage>. <pub-id pub-id-type="doi">10.1044/jshr.3706.1394</pub-id> <pub-id pub-id-type="pmid">7877296</pub-id></citation></ref>
<ref id="B63"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tomaschek</surname> <given-names>F.</given-names></name> <name><surname>Hendrix</surname> <given-names>P.</given-names></name> <name><surname>Baayen</surname> <given-names>R. H.</given-names></name></person-group> (<year>2018</year>). <article-title>Strategies for addressing collinearity in multivariate linguistic data.</article-title> <source><italic>J. Phonetics</italic></source> <volume>71</volume> <fpage>249</fpage>&#x2013;<lpage>267</lpage>. <pub-id pub-id-type="doi">10.1016/j.wocn.2018.09.004</pub-id></citation></ref>
<ref id="B64"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tomaschek</surname> <given-names>F.</given-names></name> <name><surname>Plag</surname> <given-names>I.</given-names></name> <name><surname>Ernestus</surname> <given-names>M.</given-names></name> <name><surname>Baayen</surname> <given-names>R. H.</given-names></name></person-group> (<year>2019</year>). <article-title>Phonetic effects of morphology and context: modeling the duration of word-final S in English with na&#x00EF;ve discriminative learning.</article-title> <source><italic>J. Linguistics</italic></source> <volume>2019</volume> <fpage>1</fpage>&#x2013;<lpage>39</lpage>. <pub-id pub-id-type="doi">10.1017/S0022226719000203</pub-id></citation></ref>
<ref id="B65"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tremblay</surname> <given-names>A.</given-names></name> <name><surname>Ransijn</surname> <given-names>J.</given-names></name></person-group> (<year>2020</year>). <source><italic>LMERConvenienceFunctions: Model Selection and Post-Hoc Analysis for (G)LMER Models (3.0).</italic></source> Available online at: <ext-link ext-link-type="uri" xlink:href="https://cran.r-project.org/package=LMERConvenienceFunctions">https://cran.r-project.org/package=LMERConvenienceFunctions</ext-link> <comment>(accessed June 09, 2021)</comment>.</citation></ref>
<ref id="B66"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tucker</surname> <given-names>B. V.</given-names></name> <name><surname>Brenner</surname> <given-names>D.</given-names></name> <name><surname>Danielson</surname> <given-names>D. K.</given-names></name> <name><surname>Kelley</surname> <given-names>M. C.</given-names></name> <name><surname>Nenadi&#x0107;</surname> <given-names>F.</given-names></name> <name><surname>Sims</surname> <given-names>M.</given-names></name></person-group> (<year>2019a</year>). <article-title>The massive auditory lexical decision (MALD) database.</article-title> <source><italic>Behav. Res. Methods</italic></source> <volume>51</volume> <fpage>1187</fpage>&#x2013;<lpage>1204</lpage>. <pub-id pub-id-type="doi">10.3758/s13428-018-1056-1</pub-id> <pub-id pub-id-type="pmid">29916041</pub-id></citation></ref>
<ref id="B67"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tucker</surname> <given-names>B.</given-names></name> <name><surname>Sims</surname> <given-names>M.</given-names></name> <name><surname>Baayen</surname> <given-names>R. H.</given-names></name></person-group> (<year>2019b</year>). <article-title>Opposing forces on acoustic duration.</article-title> <source><italic>PsyArXiv</italic> [preprint]</source> <fpage>1</fpage>&#x2013;<lpage>38</lpage>. <pub-id pub-id-type="doi">10.31234/osf.io/jc97w</pub-id></citation></ref>
<ref id="B68"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Turk</surname> <given-names>A.</given-names></name> <name><surname>Shattuck-Hufnagel</surname> <given-names>S.</given-names></name></person-group> (<year>2020</year>). <source><italic>Speech Timing.</italic></source> <publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>, <pub-id pub-id-type="doi">10.1093/oso/9780198795421.001.0001</pub-id> <pub-id pub-id-type="pmid">33782627</pub-id></citation></ref>
<ref id="B69"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Umeda</surname> <given-names>N.</given-names></name></person-group> (<year>1977</year>). <article-title>Consonant duration in American English.</article-title> <source><italic>J. Acoust. Soc. Am.</italic></source> <volume>61</volume> <fpage>846</fpage>&#x2013;<lpage>858</lpage>. <pub-id pub-id-type="doi">10.1121/1.381374</pub-id></citation></ref>
<ref id="B70"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Venables</surname> <given-names>W. N.</given-names></name> <name><surname>Ripley</surname> <given-names>B. D.</given-names></name></person-group> (<year>2002</year>). <source><italic>Modern Applied Statistics With S.</italic></source> <publisher-loc>New York</publisher-loc>: <publisher-name>Springer</publisher-name>, <pub-id pub-id-type="doi">10.1007/978-0-387-21706-2</pub-id></citation></ref>
<ref id="B71"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vitevitch</surname> <given-names>M. S.</given-names></name> <name><surname>Luce</surname> <given-names>P. A.</given-names></name></person-group> (<year>2004</year>). <article-title>A Web-based interface to calculate phonotactic probability for words and nonwords in English.</article-title> <source><italic>Behav. Res. Methods Instru. Comput.</italic></source> <volume>36</volume> <fpage>481</fpage>&#x2013;<lpage>487</lpage>. <pub-id pub-id-type="doi">10.3758/BF03195594</pub-id> <pub-id pub-id-type="pmid">15641436</pub-id></citation></ref>
<ref id="B72"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wagner</surname> <given-names>A. R.</given-names></name> <name><surname>Rescorla</surname> <given-names>R. A.</given-names></name></person-group> (<year>1972</year>). &#x201C;<article-title>Inhibition in Pavlovian conditioning: application of a theory</article-title>,&#x201D; in <source><italic>Inhibition and Learning</italic></source>, <role>eds</role> <person-group person-group-type="editor"><name><surname>Boakes</surname> <given-names>R. A.</given-names></name> <name><surname>Halliday</surname> <given-names>M. S.</given-names></name></person-group> (<publisher-loc>London</publisher-loc>: <publisher-name>Academic Press</publisher-name>), <fpage>301</fpage>&#x2013;<lpage>334</lpage>.</citation></ref>
<ref id="B73"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Walsh</surname> <given-names>T.</given-names></name> <name><surname>Parker</surname> <given-names>F.</given-names></name></person-group> (<year>1983</year>). <article-title>The duration of morphemic and non-morphemic /s/ in English.</article-title> <source><italic>J. Phonetics</italic></source> <volume>11</volume> <fpage>201</fpage>&#x2013;<lpage>206</lpage>. <pub-id pub-id-type="doi">10.1016/s0095-4470(19)30816-2</pub-id></citation></ref>
<ref id="B74"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wightman</surname> <given-names>C. W.</given-names></name> <name><surname>Shattuck-Hufnagel</surname> <given-names>S.</given-names></name> <name><surname>Ostendorf</surname> <given-names>M.</given-names></name> <name><surname>Price</surname> <given-names>P. J.</given-names></name></person-group> (<year>1992</year>). <article-title>Segmental durations in the vicinity of prosodic phrase boundaries.</article-title> <source><italic>J. Acoust. Soc. Am.</italic></source> <volume>91</volume> <fpage>1707</fpage>&#x2013;<lpage>1717</lpage>. <pub-id pub-id-type="doi">10.1121/1.402450</pub-id></citation></ref>
<ref id="B75"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Winter</surname> <given-names>B.</given-names></name></person-group> (<year>2019</year>). <source><italic>Statistics for Linguists: An Introduction Using R.</italic></source> <publisher-loc>Milton Park</publisher-loc>: <publisher-name>Routledge</publisher-name>, <pub-id pub-id-type="doi">10.4324/9781315165547</pub-id></citation></ref>
<ref id="B76"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yao</surname> <given-names>Y.</given-names></name></person-group> (<year>2007</year>). <article-title>Closure duration and VOT of word-initial voiceless plosives in English in spontaneous connected speech.</article-title> <source><italic>UC Berkeley Phonol. Lab Ann. Rep.</italic></source> <volume>8</volume> <fpage>183</fpage>&#x2013;<lpage>225</lpage>.</citation></ref>
<ref id="B77"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zimmermann</surname> <given-names>J.</given-names></name></person-group> (<year>2016</year>). &#x201C;<article-title>Morphological status and acoustic realization</article-title>,&#x201D; in <source><italic>Proceedings of the Sixteenth Australasian International Conference on Speech Science and Technology (SST-2016)</italic></source>, <role>eds</role> <person-group person-group-type="editor"><name><surname>Carignan</surname> <given-names>C.</given-names></name> <name><surname>Tyler</surname> <given-names>M. D.</given-names></name></person-group> (<publisher-name>ASSTA</publisher-name>), <fpage>201</fpage>&#x2013;<lpage>204</lpage>.</citation></ref>
<ref id="B78"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zuur</surname> <given-names>A. F.</given-names></name> <name><surname>Ieno</surname> <given-names>E. N.</given-names></name> <name><surname>Elphick</surname> <given-names>C. S.</given-names></name></person-group> (<year>2010</year>). <article-title>A protocol for data exploration to avoid common statistical problems.</article-title> <source><italic>Methods Ecol. Evolu.</italic></source> <volume>1</volume> <fpage>3</fpage>&#x2013;<lpage>14</lpage>. <pub-id pub-id-type="doi">10.1111/j.2041-210X.2009.00001.x</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="footnote1">
<label>1</label>
<p>A matrix need not have an inverse; a matrix without one is called singular. Most matrices used in LDL implementations are singular. Thus, an approximation of the inverse must be used instead of the inverse itself. One such approximation is the Moore-Penrose generalized inverse (<xref ref-type="bibr" rid="B42">Moore, 1920</xref>; <xref ref-type="bibr" rid="B45">Penrose, 1955</xref>).</p></fn>
<fn id="footnote2">
<label>2</label>
<p>In addition, a cluster analysis was performed. It revealed clusters that align well with the components retained in the principal component analysis. The cluster analysis is also documented in the materials available at <ext-link ext-link-type="uri" xlink:href="https://osf.io/zy7ar/?view_only=ef43a5caf6444270a56074027d7d6482">https://osf.io/zy7ar/?view_only=ef43a5caf6444270a56074027d7d6482</ext-link>.</p></fn>
</fn-group>
</back>
</article>
