<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Psychol.</journal-id>
<journal-title>Frontiers in Psychology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Psychol.</abbrev-journal-title>
<issn pub-type="epub">1664-1078</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpsyg.2023.1234525</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Psychology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Corpus of Mandarin Child Language: a preliminary study on the acquisition of semantic content categories in Mandarin-speaking preschoolers</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Tang</surname> <given-names>Tempo Po-Yi</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/2330241/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Lau</surname> <given-names>Dustin Kai-Yan</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/149307/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Leung</surname> <given-names>Man-Tak</given-names></name>
</contrib>
</contrib-group>
<aff><institution>Department of Chinese and Bilingual Studies, The Hong Kong Polytechnic University</institution>, <addr-line>Kowloon</addr-line>, <country>Hong Kong SAR, China</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Xiaowei Zhao, Emmanuel College, United States</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Meiling Hao, Beijing Language and Culture University, China; Yibin Zhang, East China Normal University, China</p></fn>
<corresp id="c001">&#x002A;Correspondence: Tempo Po-Yi Tang, <email>tempo.tang@gmail.com</email></corresp>
</author-notes>
<pub-date pub-type="epub">
<day>10</day>
<month>11</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>14</volume>
<elocation-id>1234525</elocation-id>
<history>
<date date-type="received">
<day>04</day>
<month>06</month>
<year>2023</year>
</date>
<date date-type="accepted">
<day>23</day>
<month>10</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2023 Tang, Lau and Leung.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Tang, Lau and Leung</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>In studying language acquisition in children, sizable research studies have been focusing on the investigation of form and lexical semantics. This study aims to establish a child language database annotated both syntactically with part of speech and semantically with semantic content category to supplement the study of child language acquisition in the semantic domain beyond lexical level. The Corpus of Mandarin Child Language (CMCL) that documented the production of different semantic content categories by Mandarin-speaking children was established. Naturalistic language samples of 82 native Mandarin-speaking children aged 25&#x2013;60 months, divided into three age groups, were obtained. The corresponding semantic content categories coded in each utterance were tagged according to previous studies, in addition to the annotations of part of speech. MLU and lexical diversity were examined and the usage and acquisition of different semantic content categories were also analyzed. The results regarding syntactic complexity and lexical diversity replicated the typical language acquisition pattern from previous studies, which supported the validity of the data obtained in the CMCL. To investigate the trajectory of acquisition of various semantic content categories by age, a 90% acquisition criterion was used. Our findings regarding the acquisition order of semantic content category were basically in line with previous studies in general, with some minor differences. This acquisition order observed is largely explained by the cognitive and syntactic complexity associated with the semantic content category, with additional influence from language specific properties and cultural specific factors of Mandarin. In addition, with the tags in both part-of-speech and semantic content category, the CMCL potentially provides a platform for examining the form-content interface in early child language acquisition, which also implies significantly on the theoretical and clinical ground.</p>
</abstract>
<kwd-group>
<kwd>semantic content category</kwd>
<kwd>language corpus</kwd>
<kwd>Mandarin-speaking children</kwd>
<kwd>cognitive and syntactic complexity</kwd>
<kwd>acquisition</kwd>
</kwd-group>
<counts>
<fig-count count="0"/>
<table-count count="9"/>
<equation-count count="0"/>
<ref-count count="97"/>
<page-count count="16"/>
<word-count count="13411"/>
</counts>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Psychology of Language</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="intro">
<title>Introduction</title>
<p>Language is a unique and conventional tool for human to communicate effectively. Children acquire language rapidly in their early ages. They gain the ability to communicate using words, phrases and gradually more complex sentences to express various meanings efficiently. The underlying processes by which children achieve this remarkable language acquisition has long been a hot topic for linguists, psycholinguists and neurolinguists. To understand the role of linguistic knowledge in language acquisition, <xref ref-type="bibr" rid="B7">Bloom and Lahey (1978)</xref> proposed that content, form and use are three basic dimensions of language. In general, language content concerns the meaning of language, typically referred as the topics and ideas that are encoded in the linguistic messages. Language form concerns the structure of language and represents the rules that govern how particular language features are arranged. Language use, on the other hand, refers to the goals for communication, as well as how people express the goals in various contexts.</p>
<p>Existing research in child language acquisition had focused primarily on the use of language forms. In particular, large number of studies have been conducted to investigate the sentence length in young children. The pioneer study by <xref ref-type="bibr" rid="B9">Brown (1973)</xref> found that children&#x2019;s mean length of utterance in morphemes (MLU) increased with age, and the children progressed through stages of grammatical development with increase in utterance length. Besides, the emergence of different parts of speech in the early vocabulary of young children has also been examined extensively (e.g., <xref ref-type="bibr" rid="B49">Lieven et al., 1992</xref>; <xref ref-type="bibr" rid="B4">Bleses et al., 2008</xref>). Likewise, sizable number of studies have also been conducted to investigate the acquisition of various morphological and syntactic constructions. For instance, early studies on child language (e.g., <xref ref-type="bibr" rid="B9">Brown, 1973</xref>; <xref ref-type="bibr" rid="B15">de Villiers and de Villiers, 1973</xref>) have established that different English inflections are acquired in a consistent order among young children. Similarly, the acquisition of various syntactic structures and categories was also investigated by <xref ref-type="bibr" rid="B70">Scarborough (1985)</xref>. They found that children&#x2019;s syntactic complexity and diversity increased with age, indicating that the older children produced more advanced and a larger variety of syntactic structures than the younger ones. In another study by <xref ref-type="bibr" rid="B16">Demuth and McCullough (2009)</xref>, it was reported that preschool children acquired more complex sentence structures with auxiliary verbs gradually when they grew up. These studies contribute greatly to our understanding of child language acquisition from the perspective of language form.</p>
<p>In addition to the above, the importance of semantic contents in the study of child language has also been emphasized. <xref ref-type="bibr" rid="B8">Bowerman (1973)</xref> claimed that semantic but not syntactic knowledge should attribute the early sentence expressions in children. Besides, <xref ref-type="bibr" rid="B9">Brown (1973)</xref> also proposed that young children start to combine words semantically rather than grammatically when they have acquired around 50 words in an early stage. One example is that young children may produce telegraphic speech like &#x201C;daddy chair&#x201D; to express the content of possession, in which the grammatical possessive marker &#x201C;s&#x201D; is missing. <xref ref-type="bibr" rid="B5">Bloom (1970)</xref> further argued that children use meaning relations between words to discover syntactic rules. It was suggested that they first acquire the syntax of simple sentences to express a core of simple meanings, and subsequently the syntax of complex sentences in order to express the relational meanings between different propositions conveyed by the simple sentences. More recently, a growing body of literature on the integral role of semantic knowledge in child syntax acquisition has also been reported (<xref ref-type="bibr" rid="B63">P&#x00E9;rez-Leroux et al., 2008</xref>; <xref ref-type="bibr" rid="B58">Mok and Kipka, 2009</xref>). Given the importance mentioned above, it will be invaluable to examine the role of semantic content in the acquisition of early expressions in children.</p>
<p>In view of the above, some studies attempted to investigate lexical semantics in the acquisition of early vocabulary in children. For instance, <xref ref-type="bibr" rid="B35">Jim&#x00E9;nez et al. (2021)</xref> investigated the lexical profiles of typically developing children by adopting the MacArthur-Bates Communicative Development Inventories (CDI; <xref ref-type="bibr" rid="B24">Fenson et al., 1993</xref>) to assess the production of twenty-two semantic categories. Apart from the investigation of semantic categories, the impact of various semantic measures and semantic features in language acquisition was also examined. <xref ref-type="bibr" rid="B78">Stockman and Vaughn-Cooke (1992)</xref> suggested that the acquisition of locative words was influenced by the meaning and relational roles of the words, while <xref ref-type="bibr" rid="B31">Horvath et al. (2022)</xref> also reported the impact of manner/result features of verbs on their acquisition in children. The results of these studies provided an alternative window for us to understand the acquisition process in young children in addition to the study of the acquisition of surface form. Nevertheless, existing studies investigating the semantic domains in language acquisition primarily focus on lexical semantics in terms of semantic categories and different semantic features. Only a limited number of studies attempted to go beyond the lexical level and examine the acquisition of various semantic contents directly (e.g., <xref ref-type="bibr" rid="B28">Green, 1975</xref>; <xref ref-type="bibr" rid="B25">Freedman and Carpenter, 1976</xref>; <xref ref-type="bibr" rid="B77">Stockman and Vaughn-Cooke, 1986</xref>; <xref ref-type="bibr" rid="B6">Bloom, 1991</xref>).</p>
<p>Mandarin, the official language spoken in China, is the language with the most native speakers in the world. It is of particular importance to understand how young children acquire Mandarin. A vast amount of acquisition studies on Mandarin-speaking young children have also been conducted. Regarding studies on language form, <xref ref-type="bibr" rid="B9">Brown&#x2019;s (1973)</xref> findings on the increase of sentence length across ages were replicated in Chinese-speaking children (e.g., <xref ref-type="bibr" rid="B12">Cheung, 1998</xref>; <xref ref-type="bibr" rid="B40">Klee et al., 2004</xref>; <xref ref-type="bibr" rid="B36">Jin and Jin, 2008</xref>; <xref ref-type="bibr" rid="B90">Wu, 2020</xref>). Besides, <xref ref-type="bibr" rid="B33">Huang et al. (2022)</xref> investigated the early grammatical markings in toddlers who speak Mandarin. It was found that the acquisition trajectories of these markers were different from those in English and other languages, supporting the linguistic specificity in the acquisition of morphology. In addition, it was also indicated that more perceptually salient and obligatory markers were acquired earlier. On the other side, studies have also been conducted to investigate the acquisition of various syntactic structures including SVO utterances and <italic>ba</italic> constructions (<xref ref-type="bibr" rid="B32">Hsu, 2014</xref>) as well as relative clauses (<xref ref-type="bibr" rid="B11">Chen and Shirai, 2015</xref>). As for lexical semantic domains, <xref ref-type="bibr" rid="B29">Hao et al. (2008)</xref> and <xref ref-type="bibr" rid="B83">Tardif et al. (2009)</xref> reported the acquisition of different semantic categories in the early vocabulary inventory of young Mandarin-speaking children using parent-report questionnaire. On the other hand, <xref ref-type="bibr" rid="B52">Ma et al. (2009)</xref> also demonstrated the effect of the semantic measure of imageability on the acquisition of verb in both Chinese and English. Likewise, semantic features of temporal remoteness and specificity on the acquisition of different temporal-marker categories were also reported (<xref ref-type="bibr" rid="B81">Tang et al., 2021</xref>).</p>
<p>To this end, the acquisition of semantic content beyond lexical level in Mandarin remains relatively unknown. Given the importance of semantic content in young children&#x2019;s language acquisition (e.g., <xref ref-type="bibr" rid="B9">Brown, 1973</xref>; <xref ref-type="bibr" rid="B6">Bloom, 1991</xref>), the current study aimed to investigate the acquisition of various semantic content categories (<xref ref-type="bibr" rid="B42">Lahey, 1988</xref>) in Mandarin-speaking children, in order to fill the research gap by contributing additional information from the semantic perspective. A database annotated both syntactically and semantically was established to investigate the above issues. In addition, the potential use of the database in examining the interaction between language form and content in the acquisition process was also explored. A brief introduction to various semantic content categories, followed by a review of previous studies on the acquisition of semantic content categories are given below.</p>
<sec id="S1.SS1">
<title>Semantic content category in young children</title>
<p>Semantic content is one component of language that concerns with the meaning of information, including the concepts and themes that language expresses. Young children typically express contents that reflect their immediate environment and experiences (<xref ref-type="bibr" rid="B7">Bloom and Lahey, 1978</xref>). A wave of crosslinguistic studies in the 1970s has proposed that young children, regardless of their native language, exhibit common topics and ideas in their early expressions (e.g., <xref ref-type="bibr" rid="B8">Bowerman, 1973</xref>; <xref ref-type="bibr" rid="B9">Brown, 1973</xref>). Similarly, <xref ref-type="bibr" rid="B73">Slobin (1985)</xref> crosslinguistic acquisition studies also echoed the restricted set of meanings expressed by young children. Other studies also reported cross-culturally common aspects of semantic relations (<xref ref-type="bibr" rid="B65">Raybeck and Herrmann, 1990</xref>) and antonymic meanings (<xref ref-type="bibr" rid="B66">Raybeck and Herrmann, 1996</xref>). As such, <xref ref-type="bibr" rid="B42">Lahey (1988)</xref> identified 21 semantic content categories in the production of young children according to the general kinds of meaning they share. The semantic content categories are listed in <xref ref-type="table" rid="T1">Table 1</xref> with examples.</p>
<table-wrap position="float" id="T1">
<label>TABLE 1</label>
<caption><p>List of semantic content categories with examples, modified from <xref ref-type="bibr" rid="B42">Lahey (1988)</xref>.</p></caption>
<table cellspacing="5" cellpadding="5" frame="box" rules="all">
<thead>
<tr>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;">No.</td>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;">Semantic content category</td>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;">Abbreviation</td>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;">Example with Chinese translation</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1.</td>
<td valign="top" align="left"><italic>Existence</italic></td>
<td valign="top" align="left">exist</td>
<td valign="top" align="left">&#x6C7D;&#x8F66; &#x201C;car&#x201D;</td>
</tr>
<tr>
<td valign="top" align="left">2.</td>
<td valign="top" align="left"><italic>Recurrence</italic></td>
<td valign="top" align="left">recur</td>
<td valign="top" align="left">&#x518D; &#x201C;again&#x201D;</td>
</tr>
<tr>
<td valign="top" align="left">3.</td>
<td valign="top" align="left"><italic>Nonexistence</italic></td>
<td valign="top" align="left">nonE</td>
<td valign="top" align="left">&#x6CA1;&#x6709; &#x201C;none&#x201D;</td>
</tr>
<tr>
<td valign="top" align="left">4.</td>
<td valign="top" align="left"><italic>Rejection</italic></td>
<td valign="top" align="left">rej</td>
<td valign="top" align="left">&#x522B; &#x201C;do not&#x201D;</td>
</tr>
<tr>
<td valign="top" align="left">5.</td>
<td valign="top" align="left"><italic>Denial</italic></td>
<td valign="top" align="left">den</td>
<td valign="top" align="left">&#x4E0D;&#x662F; &#x201C;is not&#x201D;</td>
</tr>
<tr>
<td valign="top" align="left">6.</td>
<td valign="top" align="left"><italic>Attribution</italic></td>
<td valign="top" align="left">attri</td>
<td valign="top" align="left">&#x5927;&#x7684; &#x201C;big&#x201D;</td>
</tr>
<tr>
<td valign="top" align="left">7.</td>
<td valign="top" align="left"><italic>Possession</italic></td>
<td valign="top" align="left">poss</td>
<td valign="top" align="left">&#x6211;&#x7684; &#x201C;My&#x201D;</td>
</tr>
<tr>
<td valign="top" align="left">8.</td>
<td valign="top" align="left"><italic>Locative action</italic></td>
<td valign="top" align="left">locA</td>
<td valign="top" align="left">&#x53BB; &#x201C;go&#x201D;</td>
</tr>
<tr>
<td valign="top" align="left">9.</td>
<td valign="top" align="left"><italic>Action 12.</italic></td>
<td valign="top" align="left">act</td>
<td valign="top" align="left">&#x7761;&#x89C9; &#x201C;sleep&#x201D;</td>
</tr>
<tr>
<td valign="top" align="left">10.</td>
<td valign="top" align="left"><italic>Locative state</italic></td>
<td valign="top" align="left">locS</td>
<td valign="top" align="left">&#x4E0A;&#x9762; &#x201C;above&#x201D;</td>
</tr>
<tr>
<td valign="top" align="left">11.</td>
<td valign="top" align="left"><italic>State</italic></td>
<td valign="top" align="left">state</td>
<td valign="top" align="left">&#x997F; &#x201C;hungry,&#x201D; &#x9ED1;&#x6697; &#x201C;dark&#x201D;</td>
</tr>
<tr>
<td valign="top" align="left">12.</td>
<td valign="top" align="left"><italic>Quantity</italic></td>
<td valign="top" align="left">quan</td>
<td valign="top" align="left">&#x8BB8;&#x591A; &#x201C;many&#x201D;</td>
</tr>
<tr>
<td valign="top" align="left">13.</td>
<td valign="top" align="left"><italic>Notice</italic></td>
<td valign="top" align="left">noti</td>
<td valign="top" align="left">&#x770B;&#x5230; (&#x4ED6;&#x5728;&#x8DD1;)&#x201D;see (him running)&#x201D;</td>
</tr>
<tr>
<td valign="top" align="left">14.</td>
<td valign="top" align="left"><italic>Dative</italic></td>
<td valign="top" align="left">dat</td>
<td valign="top" align="left">&#x8DD1;&#x4F60; (&#x4E00;&#x652F;&#x7B14;) &#x201C;give you (a pen)&#x201D;</td>
</tr>
<tr>
<td valign="top" align="left">15.</td>
<td valign="top" align="left"><italic>Additive</italic></td>
<td valign="top" align="left">add</td>
<td valign="top" align="left">&#x2026;&#x548C;&#x2026; &#x201C;and&#x201D;</td>
</tr>
<tr>
<td valign="top" align="left">16.</td>
<td valign="top" align="left"><italic>Temporal</italic></td>
<td valign="top" align="left">temp</td>
<td valign="top" align="left">&#x521A;&#x521A; &#x201C;just&#x201D;</td>
</tr>
<tr>
<td valign="top" align="left">17.</td>
<td valign="top" align="left"><italic>Causal</italic></td>
<td valign="top" align="left">caus</td>
<td valign="top" align="left">&#x56E0;&#x4E3A;&#x2026; &#x201C;because&#x201D;</td>
</tr>
<tr>
<td valign="top" align="left">18.</td>
<td valign="top" align="left"><italic>Adversative</italic></td>
<td valign="top" align="left">adver</td>
<td valign="top" align="left">&#x4F46;&#x662F;&#x2026; &#x201C;but&#x201D;</td>
</tr>
<tr>
<td valign="top" align="left">19.</td>
<td valign="top" align="left"><italic>Epistemic</italic></td>
<td valign="top" align="left">epis</td>
<td valign="top" align="left">&#x6000;&#x7591; (&#x662F;&#x4ED6;&#x505A;&#x7684;) &#x201C;suspect (it is done by him)&#x201D;</td>
</tr>
<tr>
<td valign="top" align="left">20.</td>
<td valign="top" align="left"><italic>Specification</italic></td>
<td valign="top" align="left">spec</td>
<td valign="top" align="left">&#x7A7F;&#x7EA2;&#x8863;&#x670D;&#x7684;&#x7537;&#x5B69; &#x201C;the boy who wears in red&#x201D;</td>
</tr>
<tr>
<td valign="top" align="left">21</td>
<td valign="top" align="left"><italic>Communication</italic></td>
<td valign="top" align="left">com</td>
<td valign="top" align="left">&#x544A;&#x8BC9; (&#x6211;&#x4E00;&#x4E2A;&#x79D8;&#x5BC6;)&#x201C;tell (me a secret)&#x201D;</td>
</tr>
</tbody>
</table></table-wrap>
<p>Adopting the framework of semantic content category, <xref ref-type="bibr" rid="B9">Brown (1973)</xref> analyzed the first fifty words produced by children speaking English, Finnish and Spanish. Despite differences in the specific words used across languages, the semantic content categories were remarkably similar across different languages. Besides, the studies of <xref ref-type="bibr" rid="B79">Stockman and Vaughn-Cooke (1982</xref>, <xref ref-type="bibr" rid="B77">1986)</xref> also provided evidence on the equivalent set of semantic content categories among young African American children, for which forms are mapped onto the same underlying concepts, regardless of culture and whether standard forms are used.</p>
<p>To the best of our knowledge, no previous studies have been done on the acquisition of semantic content categories by Mandarin-speaking young children. Despite a few previous studies have suggested cultural influences in particular meanings like emotion (e.g., <xref ref-type="bibr" rid="B30">Herrmann and Raybeck, 1981</xref>; <xref ref-type="bibr" rid="B68">Russell, 1991</xref>), the major studies appeared to indicate that the above-mentioned semantic content categories were found in early child language in general (e.g., <xref ref-type="bibr" rid="B8">Bowerman, 1973</xref>; <xref ref-type="bibr" rid="B9">Brown, 1973</xref>). Therefore, the current study adopted this specific set of semantic content categories and investigated their acquisition in Mandarin accordingly.</p>
</sec>
<sec id="S1.SS2">
<title>Acquisition of semantic content categories</title>
<p>There are a few studies examining the acquisition of semantic content categories by English-speaking young children. The study conducted by <xref ref-type="bibr" rid="B77">Stockman and Vaughn-Cooke (1986)</xref> investigated the order of acquisition of various semantic content categories with twenty-two standard and non-standard English speakers aged 18&#x2013;54 months. Results indicated that <italic>existence</italic> and <italic>action</italic> were the earliest acquired semantic content categories and were the only ones productively produced by every child. After that, the semantic content categories of <italic>locative action</italic>, <italic>state</italic>, <italic>locative state</italic>, <italic>negation</italic>, <italic>possession</italic>, <italic>attribution</italic>, <italic>notice</italic>, and <italic>recurrence</italic> also emerged, whereas semantic content categories of <italic>additive</italic>, <italic>causal</italic>, and <italic>epistemic</italic> were acquired last. This study did not only present preliminary acquisition trends of various semantic content categories in young children, but also indicated that both standard and non-standard speakers shared the same semantic base in early language although the forms used were different.</p>
<p><xref ref-type="bibr" rid="B6">Bloom (1991)</xref> also carried out a series of longitudinal studies to investigate the acquisition of semantic content categories of children aged 24&#x2013;38 months by using naturalistic language samples. The expressions of various semantic content categories related to early sentences and complex sentences were examined, respectively. First, looking at the earliest appeared semantic content categories of simple sentences, contents <italic>of existence</italic>, <italic>non-existence</italic>, and <italic>recurrence</italic> were found to emerge before <italic>action</italic>, <italic>locative state</italic>, and <italic>locative action</italic>. Specifically, among the semantic content categories related to verb, it was found that <italic>action</italic> emerged first, followed by <italic>state</italic> and then <italic>locative action</italic>. Finally, <italic>dative</italic> and <italic>specification</italic> came along after the above basic verb relations. The emergence of <italic>possessive</italic> and <italic>attribution</italic>, on the other hand, varied among the participants. Apart from the above, a specific order of acquisition for semantic content categories of negation was also suggested, with <italic>non-existence</italic> emerged before <italic>rejection</italic>, while <italic>denial</italic> emerged last.</p>
<p>Next, eight semantic content categories associated with complex sentences were observed in the samples, including <italic>additive</italic>, <italic>temporal</italic>, <italic>causal</italic>, <italic>adversative</italic>, <italic>specification</italic>, <italic>epistemic</italic>, <italic>notice</italic>, and <italic>communication.</italic> Among those involving the use of conjunctions, the acquisition followed the sequence: <italic>additive &#x003E; temporal &#x003E; causal &#x003E; adversative</italic>. On the other side, <italic>epistemic</italic>, <italic>notice</italic>, and <italic>communication</italic> were observed to emerge after the above, while the content category of <italic>specification</italic> was infrequent in the language samples of all participants.</p>
</sec>
<sec id="S1.SS3">
<title>Factors affecting the acquisition of semantic content categories</title>
<p>The above acquisition order of semantic content category allows us to understand how content affects language acquisition in young children. This relative order of acquisition is affected by two major factors in general: syntactic complexity and cognitive complexity (<xref ref-type="bibr" rid="B6">Bloom, 1991</xref>). From the syntactic view, semantic content categories related to complex sentences in general appeared later than those associated with simple sentences (e.g., <italic>existence</italic> and <italic>possessive</italic> emerged before <italic>causal</italic> and <italic>specification)</italic>. Specifically, among the semantic content categories expressed using verbs, utterances expressing <italic>locative action</italic>, which usually involves the description of places, are syntactically more complex than descriptions of the motion or location alone in utterances expressing <italic>action</italic> or <italic>locative state</italic>. Likewise, the expressions of <italic>dative</italic> content, which describe two different kinds of relations between recipients and affected objects, are also syntactically more complex than utterances expressing only <italic>action</italic> and <italic>locative action</italic>, and thus emerged later. Regarding semantic content categories associated with complex sentences, those involving the use of syntactic structures with conjunctions (i.e., <italic>additive</italic>, <italic>temporal</italic>, <italic>causal</italic>, <italic>adversative</italic>) generally appeared earlier than those related to more complex complementation (i.e., <italic>epistemic</italic>, <italic>notice</italic>, <italic>communication).</italic> Semantic content category related to relativization (i.e., <italic>specification</italic>) is regarded as more complex syntactically and thus emerged last.</p>
<p>Cognitive complexity was suggested to be another factor accounting for the acquisition order of various semantic content categories according to <xref ref-type="bibr" rid="B6">Bloom (1991)</xref>. Children learn that objects exist by acting in ways that make them disappear and recur in the sensorimotor period (<xref ref-type="bibr" rid="B64">Piaget, 2000</xref>). This relatively simple object permanence explains the earliest emergence of <italic>existence</italic>, <italic>non-existence</italic>, and <italic>recurrence</italic>. Then, children learn that objects can be acted upon and located in space, as reflected in the emergence of verb relations including <italic>action</italic>, <italic>state</italic>, <italic>locative state</italic>, and <italic>locative action</italic>, which came immediately after the previous functional relations. In particular, <italic>locative action</italic> representing the dynamic concept of the movement of objects between two places, usually involving source, path and goal, is more complex cognitively and appears later than <italic>action</italic> and <italic>locative state</italic>, which are static in nature. Likewise, the expression of <italic>attribution</italic> requires discrimination among similar objects, and thus higher-level cognitive processes of categorization and seriation, whereas <italic>action</italic> only involves comparatively simpler sensorimotor schemas in its production. Therefore, <italic>action</italic> preceded <italic>attribution</italic> in the acquisition. Finally, the involvement of a cognitively more complex symbolic referent in <italic>denial</italic> may also partially account for its later acquisition than <italic>non-existence</italic> and <italic>rejection</italic>, which negate more concrete objects or actions, respectively.</p>
<p>The principle of cognitive complexity also applies to the acquisition among semantic content categories associated with complex sentences. <xref ref-type="bibr" rid="B6">Bloom (1991)</xref> suggested that there is a progressive increase in the cognitive complexity among the acquisition of <italic>additive</italic>, <italic>temporal</italic>, <italic>causal</italic>, and <italic>adversative</italic>. Firstly, <italic>additive</italic> simply represents the indefinite joining of two events and is regarded as relatively simple. <italic>Temporal</italic> conveys the relations of two or more events with designated temporal sequences, while <italic>causal</italic> relations express the antecedent and consequence of events on top of the additive and temporal concepts. <italic>Adversative</italic> appears to be the most complex one, which involves additive, temporal and sometimes, causal concepts, as well as expressing the new meaning of opposition which involves the cognitive process of comparing and contrasting. As a result, the cumulative cognitive complexity among these semantic content categories tends to explain their corresponding status in the acquisition process.</p>
<p>To achieve the goal of investigating the acquisition of semantic content categories in Mandarin-speaking children, the acquisition trajectory reported in English will be used as a reference. In light of the above factors of syntactic and cognitive complexity, it is predicted that the acquisition of semantic content categories among Mandarin-speaking children basically follows the trajectory reported in English, with some minor differences due to the syntactic properties of Mandarin and cultural specific factors.</p>
</sec>
<sec id="S1.SS4">
<title>Methodological concerns</title>
<p>To study child language acquisition, language sample analysis (LSA) has been used extensively. LSA based on the spontaneous speech of children was found to have the advantage of being an ecologically valid and authentic method of assessment (<xref ref-type="bibr" rid="B61">Owens, 2010</xref>) and the language samples are suggested to be usually reflecting a more naturalistic and representative picture of the child&#x2019;s language over the standardized assessments (<xref ref-type="bibr" rid="B21">Evans and Craig, 1992</xref>). Extensive research studies have employed LSA to examine the acquisition of different linguistic parameters, such as early lexical development (e.g., <xref ref-type="bibr" rid="B50">Liu, 2007</xref>; <xref ref-type="bibr" rid="B80">Stoll et al., 2012</xref>), early grammatical development (e.g., <xref ref-type="bibr" rid="B44">Lee and Wong, 1998</xref>), syntactic complexity (e.g., <xref ref-type="bibr" rid="B19">Diessel, 2004</xref>; <xref ref-type="bibr" rid="B51">Lu, 2009</xref>; <xref ref-type="bibr" rid="B18">Deng et al., 2018</xref>), morphology (e.g., <xref ref-type="bibr" rid="B55">Maslen et al., 2004</xref>; <xref ref-type="bibr" rid="B34">Jia and Fuse, 2007</xref>), and discourse relationship (e.g., <xref ref-type="bibr" rid="B96">Zhou and Xue, 2015</xref>).</p>
<p>Language corpora of transcribed and usually annotated language samples, on the other hand, are also essential for the study of child language acquisition. According to <xref ref-type="bibr" rid="B3">Biber et al. (1998)</xref>, using corpus to study language is empirical as a large collection of samples from real-life situations is utilized. Availability of computer analytical techniques also speeds up and enhances various quantitative and qualitative analyses of samples in different language aspects. Quantitatively, the distribution and properties of different structures can be calculated for types, tokens and percentages whereas qualitatively, both target and non-target structures can also be investigated (<xref ref-type="bibr" rid="B17">Deng and Yip, 2018</xref>). In addition, the corpora can be used for multiple times and be available to others if made publicly. Therefore, in the current study, both LSA and language corpora were adopted to document the production of Mandarin-speaking children in both syntactic and semantic domains.</p>
<p>Among currently available cross-sectional Mandarin corpora, many of them included good sample size and naturalistic procedures for collecting children&#x2019;s language samples (e.g., <xref ref-type="bibr" rid="B47">Li and Zhou, 2008</xref>; <xref ref-type="bibr" rid="B46">Li and Zhou, 2015</xref>), which provide important resources for studying child language acquisition. Nevertheless, these corpora were only annotated syntactically and none of them possessed annotations on semantic content category. Most importantly, it seems that expressions of certain semantic content categories were not specifically elicited in the language samples of the existing databases, which did not allow the investigation from the semantic perspective.</p>
<p>As discussed above, the semantic content of language also contributes potential values in language acquisition studies. While <italic>form</italic> and <italic>content</italic> are equally important domains of language, semantically annotated data provides an alternative window for the investigation of the acquisition process in child language. <xref ref-type="bibr" rid="B69">Sagae et al. (2010)</xref> also suggested that adding semantic information in the corpora should be a future direction in language acquisition studies. Therefore, it would be of great importance to establish a child language database with corresponding annotations of semantic content categories on the samples collected. To avoid the issue that children&#x2019;s language productions were dictated dominantly by the topics initiated by the interviewers, a standardized protocol was used to minimize the influence of different interviewers, as well as to create equal opportunities across children to elicit all the intended semantic content categories specifically. Besides, the typical measures of sentence complexity and lexical diversity in language acquisition were also calculated, to ensure the representativeness of the language samples collected.</p>
</sec>
<sec id="S1.SS5">
<title>The current study</title>
<p>In short, while the majority of previous studies on child language acquisition had been focusing on the study of form or lexical semantics, relatively few studies examine the semantic domain beyond lexical level in early language acquisition. In particular, there is currently no study investigating the acquisition of semantic content categories among Mandarin-speaking children. As a result, the current study aimed to establish the Corpus of Mandarin Child Language (CMCL), a corpus annotated with both part-of-speech and semantic content category, and adopted language sample analysis by extracting data from the corpus, for the study of the acquisition of semantic content categories in Mandarin-speaking children. The following research questions were addressed in the study:</p>
<list list-type="simple">
<list-item>
<label>1.</label>
<p>What is the acquisition trajectory of various semantic content categories among Mandarin-speaking children?</p>
</list-item>
<list-item>
<label>2.</label>
<p>What are the similarity and difference in the acquisition trajectories of semantic content category between Mandarin-speaking and English-speaking children?</p>
</list-item>
</list>
<p>To address the concern on the applicability of the language data in CMCL, the measures on sentence complexity and lexical diversity of the children&#x2019;s productions were also obtained. It was expected that the age differences in these traditional measures from previous studies would be replicated in the current study. On the other side, the acquisition trajectory of semantic content category among Mandarin-speaking children was also predicted to follow the sequences suggested in <xref ref-type="bibr" rid="B6">Bloom (1991)</xref> in general. Syntactically, semantic content categories related to simple sentences would appear earlier than those related to complex sentences. Similarly, semantic content categories associated with lower cognitive complexity would also precede those cognitively more complex ones in the acquisition. Nevertheless, subtle differences might be expected, due to the linguistic and cultural issues which are specific to Mandarin. This current study not only sheds new light on the understanding of language acquisition in the semantic domain, but also allows the investigation of the form-content interface and contributes to validating different theories in the early language of children. Moreover, clinical implications for the assessment and intervention of children with language disorders are also provided.</p>
</sec>
</sec>
<sec id="S2" sec-type="materials|methods">
<title>Materials and methods</title>
<sec id="S2.SS1">
<title>Participants</title>
<p>The Corpus of Mandarin Child Language (CMCL) is established with 82 native Mandarin-speaking children aged 25&#x2013;60 months (48 boys and 34 girls) recruited from early education centers and kindergartens in Shenzhen and Guangzhou, China. According to their caregivers, none had any sensory or intellectual disabilities or language problems. All participants were divided into three age groups by 1-year intervals (i.e., 25&#x2013;36, 37&#x2013;48, and 49&#x2013;60 months). Information on the subjects is shown in <xref ref-type="table" rid="T2">Table 2</xref>.</p>
<table-wrap position="float" id="T2">
<label>TABLE 2</label>
<caption><p>Subject information of CMCL (<italic>N</italic> = 82).</p></caption>
<table cellspacing="5" cellpadding="5" frame="box" rules="all">
<thead>
<tr>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;">Age group</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><italic>n</italic></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">Age range (months)</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">Mean age (months)</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><italic>n</italic> male</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><italic>n</italic> female</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="center">19</td>
<td valign="top" align="center">25&#x2013;36</td>
<td valign="top" align="center">30</td>
<td valign="top" align="center">13</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="center">41</td>
<td valign="top" align="center">37&#x2013;48</td>
<td valign="top" align="center">42</td>
<td valign="top" align="center">24</td>
<td valign="top" align="center">17</td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="center">22</td>
<td valign="top" align="center">49&#x2013;60</td>
<td valign="top" align="center">55</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">11</td>
</tr>
</tbody>
</table></table-wrap>
</sec>
<sec id="S2.SS2">
<title>Language sample collection and transcription</title>
<p>Participants&#x2019; language samples were elicited individually by speech therapists, speech therapy students and research assistants who had received prior training. First, a warm-up period with a doll set or train set was used for rapport building before conducting the three tasks of taking actual language samples (i.e., free play with toys, storytelling with pictures, and conversations). Semi-spontaneous speech of each child was then collected through one-to-one interactions with the examiner on three tasks following the standardized procedures in CMCL. Identical sets of toys, including a cooking set, food, utensils, puppets and a mystery bag, were provided during each 20-min freeplay session. Each child was encouraged to play and communicate with the examiner. Importantly, the play was specially designed to provide a scenario for eliciting different semantic content categories. Open-ended questions, parallel play and parallel talk were employed to facilitate the children&#x2019;s own language production. A color Cookie Theft picture (<xref ref-type="bibr" rid="B27">Goodglass and Kaplan, 1972</xref>) and a set of four card stories were then provided to elicit each child&#x2019;s narrative speech, which lasted about 5 min. Finally, the examiner initiated a 5-min talk about daily life according to the child&#x2019;s interest and experience. A Peppa Pig storybook was also presented to provide topics for the chat (e.g., picnicking and favorite cartoon characters).</p>
<p>All sessions were audio- and video-recorded and all utterances produced by both the participants and interviewers were transcribed orthographically. Intonation contours, pauses of more than 2 s, and speaker turns were used to determine the utterance boundaries (<xref ref-type="bibr" rid="B38">Klee and Fitzgerald, 1985</xref>). All the utterances produced by the participants were then analyzed by the trained speech therapy students and research assistants, but short responses, self-repetitions, unintelligible and incomplete utterances which did not reflect the children&#x2019;s own language ability, were not analyzed (<xref ref-type="bibr" rid="B14">Crystal et al., 1989</xref>). The physical context was provided by the descriptions of the events and the actions of participants, while the linguistic context was shown with the examiner&#x2019;s utterances.</p>
<p>To ensure transcription accuracy, the transcribers, all holding a degree in linguistics, were extensively trained prior to the beginning of the study. Language samples obtained from a pilot study were transcribed together and any discrepancies were resolved through discussion. A manual with a detailed set of transcription guidelines was eventually agreed. The audio- and video-taped samples also allowed multiple viewing when necessary. Finally, a number of transcripts independently transcribed by each of the transcribers were compared using the RELY function in CLAN (Codes for Human Analysis of Transcripts) to check for transcription consistency. The percentage of the overall match of words between the transcribers was 95.4%.</p>
</sec>
<sec id="S2.SS3">
<title>Database</title>
<p>All the above data were imported to the software Filemaker Pro which provides an interface to display the content of CMCL. The orthographic transcription of child utterances formed the first main layer in the database. Every single utterance produced by the child and interviewer was represented as one entry. Next, words within each utterance were identified according to the principles of boundedness, expandability, and versatility proposed by <xref ref-type="bibr" rid="B97">Zhu (1982)</xref>. For each word, tags on phonological form using <italic>pinyin</italic>, English translation and part of speech were added with reference to the MOR databank provided by CHILDES (<xref ref-type="bibr" rid="B53">MacWhinney, 2000</xref>). Words that could not be found in the &#x201C;zho&#x201D; dictionary in the MOR databank were identified and added manually. The newly added words mainly included names of specific people, places, cartoon programmes and cartoon characters. Specific food items (e.g., &#x751C;&#x751C;&#x5708; &#x201C;donut&#x201D;) and kitchen utensils (e.g., &#x84B8;&#x9505; &#x201C;steamer&#x201D;), as well as reduplication of words [e.g., &#x9E21;&#x9E21; &#x201C;chicken&#x201D; (noun); &#x795E;&#x79D8;&#x79D8; &#x201C;mysterious&#x201D; (adjective)] were also added.</p>
<p>Finally, the utterances were annotated with 21 semantic content categories based on the descriptions in <xref ref-type="bibr" rid="B42">Lahey (1988)</xref>, to form the third semantic tier. Assignment of semantic content categories was done by trained speech therapist students. Notably, a particular semantic content category can be encoded with different syntactic forms [e.g., content of <italic>temporal</italic> can be expressed with aspect marker, temporal adverb and temporal noun by Chinese-speaking young children (<xref ref-type="bibr" rid="B95">Zhou, 2004</xref>; <xref ref-type="bibr" rid="B86">Tse et al., 2012</xref>)], whereas a particular form can also be used to encode different semantic content categories (e.g., the phrasal structure &#x516C;&#x56ED;&#x7684;&#x6811;&#x6728; represents location, &#x7EA2;&#x8272;&#x7684;&#x676F;&#x5B50; represents attribution, while &#x7238;&#x7238;&#x7684;&#x5934;&#x53D1; represents possession). In this study, the semantic content categories associated with different utterances were annotated based on the participants&#x2019; intended meaning determined according to the corresponding physical and language contexts shown in the video. After all the semantic coding had been done, 10 percent of the speech samples were randomly selected and independently coded by a second rater to develop inter-rater reliability. A relatively high agreement of 93.0% of semantic content category coding was achieved between raters.</p>
</sec>
<sec id="S2.SS4">
<title>Data analysis</title>
<p>Data were extracted from the CMCL for further analysis. Each child&#x2019;s major utterances except the deviant ones were used to calculate the mean length of utterance in word (MLU), following <xref ref-type="bibr" rid="B97">Zhu (1982)</xref> and <xref ref-type="bibr" rid="B12">Cheung (1998)</xref> procedures. Traditional measures on lexical diversity including number of different words produced (NDW), total number of words produced (TNW), type-token ratio (TTR) and number of different open/closed class words were examined. In addition, vocD (<xref ref-type="bibr" rid="B54">Malvern and Richards, 2002</xref>) was also calculated automatically by running the MOR command in CLAN. Finally, the usage and acquisition of different semantic content categories were also analyzed.</p>
</sec>
</sec>
<sec id="S3" sec-type="results">
<title>Results</title>
<p>There are altogether 13, 630 utterances produced by 82 children from the age of 25&#x2013;60 months (mean age = 43 months, 48 boys and 34 girls). Among all the child utterances, only 10,643 were analyzed in CMCL, while the remaining 2,987 utterances (22% of all child utterances) including short response, repetition, incomplete or unintelligible utterances were not analyzed. Results on utterance length and lexical diversity, as well as the usage and acquisition of different semantic content categories were illustrated as follow.</p>
<sec id="S3.SS1">
<title>Utterance length and lexical diversity</title>
<p><xref ref-type="table" rid="T3">Table 3</xref> presents the descriptive statistic information of children&#x2019;s MLU, NDW, TNW, TTR, and vocD across different age groups. To investigate the effect of age on sentence length, a one-way analysis of variance (ANOVA) was conducted. Results showed a significant effect of age on MLU [<italic>F</italic><sub>(2,79)</sub> = 11.02, <italic>p</italic> &#x003C; 0.001]. <italic>Post-hoc</italic> analyses using the Bonferroni&#x2019;s test at a significance level of 0.05 indicated that the 2 year olds possessed lower MLU than the 3 year olds, but the difference between the 3 year olds and the 4 year olds was not significant.</p>
<table-wrap position="float" id="T3">
<label>TABLE 3</label>
<caption><p>Different language measures of child utterances across age group.</p></caption>
<table cellspacing="5" cellpadding="5" frame="box" rules="all">
<thead>
<tr>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;"></td>
<td valign="top" align="center" colspan="3" style="color:#ffffff;background-color: #7f8080;">Age group 1</td>
<td valign="top" align="center" colspan="3" style="color:#ffffff;background-color: #7f8080;">Age group 2</td>
<td valign="top" align="center" colspan="3" style="color:#ffffff;background-color: #7f8080;">Age group 3</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;"></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>Mean</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>S.D.</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>Range</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>Mean</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>S.D.</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>Range</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>Mean</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>S.D.</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>Range</bold></td>
</tr>
<tr>
<td valign="top" align="left">MLU</td>
<td valign="top" align="center">2.73</td>
<td valign="top" align="center">0.55</td>
<td valign="top" align="center">1.74&#x2013;4.3</td>
<td valign="top" align="center">3.61</td>
<td valign="top" align="center">0.82</td>
<td valign="top" align="center">2.26&#x2013;5.86</td>
<td valign="top" align="center">3.78</td>
<td valign="top" align="center">0.77</td>
<td valign="top" align="center">2.14&#x2013;5.63</td>
</tr>
<tr>
<td valign="top" align="left">NDW</td>
<td valign="top" align="center">113.61</td>
<td valign="top" align="center">27.46</td>
<td valign="top" align="center">64&#x2013;179</td>
<td valign="top" align="center">168.26</td>
<td valign="top" align="center">40.45</td>
<td valign="top" align="center">102&#x2013;265</td>
<td valign="top" align="center">163.05</td>
<td valign="top" align="center">45.26</td>
<td valign="top" align="center">108&#x2013;258</td>
</tr>
<tr>
<td valign="top" align="left">TNW</td>
<td valign="top" align="center">507.78</td>
<td valign="top" align="center">186.32</td>
<td valign="top" align="center">192&#x2013;883</td>
<td valign="top" align="center">679.14</td>
<td valign="top" align="center">272.83</td>
<td valign="top" align="center">216&#x2013;1446</td>
<td valign="top" align="center">537.09</td>
<td valign="top" align="center">249.7</td>
<td valign="top" align="center">277&#x2013;1106</td>
</tr>
<tr>
<td valign="top" align="left">TTR</td>
<td valign="top" align="center">0.24</td>
<td valign="top" align="center">0.06</td>
<td valign="top" align="center">0.14&#x2013;0.38</td>
<td valign="top" align="center">0.27</td>
<td valign="top" align="center">0.07</td>
<td valign="top" align="center">0.15&#x2013;0.50</td>
<td valign="top" align="center">0.33</td>
<td valign="top" align="center">0.07</td>
<td valign="top" align="center">0.20&#x2013;0.46</td>
</tr>
<tr>
<td valign="top" align="left">TNU</td>
<td valign="top" align="center">136.39</td>
<td valign="top" align="center">46.02</td>
<td valign="top" align="center">36&#x2013;193</td>
<td valign="top" align="center">142.5</td>
<td valign="top" align="center">42.22</td>
<td valign="top" align="center">62&#x2013;241</td>
<td valign="top" align="center">116.59</td>
<td valign="top" align="center">40.69</td>
<td valign="top" align="center">62&#x2013;206</td>
</tr>
<tr>
<td valign="top" align="left">vocD</td>
<td valign="top" align="center">33.5</td>
<td valign="top" align="center">8.53</td>
<td valign="top" align="center">15.9&#x2013;47.0</td>
<td valign="top" align="center">41.1</td>
<td valign="top" align="center">12</td>
<td valign="top" align="center">16.0&#x2013;64.7</td>
<td valign="top" align="center">48.3</td>
<td valign="top" align="center">6.93</td>
<td valign="top" align="center">34.6&#x2013;60.2</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p>MLU, mean length of utterances in word; NDW, number of different words; TNW, total number of words; TTR, type token ratio; TNU, total number of utterances.</p></fn>
</table-wrap-foot>
</table-wrap>
<p>Regarding lexical diversity, a one-way ANOVA was conducted to analyze the effect of age on each of the measures of NDW, TNW, TTR and vocD. Results showed a significant age effect on NDW [<italic>F</italic><sub>(2,79)</sub> = 15.58, <italic>p</italic> &#x003C; 0.001], TTR [<italic>F</italic><sub>(2,79)</sub> = 9.41, <italic>p</italic> &#x003C; 0.001], and vocD [<italic>F</italic><sub>(2,79)</sub> = 18.35, <italic>p</italic> &#x003C; 0.001]. <italic>Post-hoc</italic> Bonferroni&#x2019;s test at a significance level of 0.05 revealed that the observed NDW was lower in the 2 year olds than the 3 year olds, while the 3 year olds also possessed lower TTR than the 4 year olds. On the other hand, lower vocD was also observed in both the 2 year olds and the 3 year olds when compared with their older counterparts, respectively. Pearson&#x2019;s correlation was also conducted to investigate the relations between sentence length and various measures of lexical diversity. <xref ref-type="table" rid="T4">Table 4</xref> presents the correlations among children&#x2019;s MLU, NDW, TNW, TTR, and vocD. The results indicated that MLU is significantly correlated with NDW (<italic>r</italic> = 0.662, <italic>p</italic> &#x003C; 0.01) and TTR (<italic>r</italic> = 0.662, <italic>p</italic> &#x003C; 0.01), but not with vocD (<italic>p</italic> &#x003E; 0.05).</p>
<table-wrap position="float" id="T4">
<label>TABLE 4</label>
<caption><p>Correlation between children&#x2019;s age, mean length of utterances, number of different words, total number of words, and type-token ratio.</p></caption>
<table cellspacing="5" cellpadding="5" frame="box" rules="all">
<thead>
<tr>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;">Variables</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">1</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">2</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">3</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">4</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Mean length of utterance in word</td>
<td valign="top" align="center">&#x2013;</td>
<td/>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">Number of different words</td>
<td valign="top" align="center">0.662<xref ref-type="table-fn" rid="t4fns2">&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">&#x2013;</td>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">Total number of words</td>
<td valign="top" align="center">0.509<xref ref-type="table-fn" rid="t4fns2">&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">0.807<xref ref-type="table-fn" rid="t4fns2">&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">&#x2013;</td>
<td/>
</tr>
<tr>
<td valign="top" align="left">Type-token ratio</td>
<td valign="top" align="center">0.662<xref ref-type="table-fn" rid="t4fns2">&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">1.0<xref ref-type="table-fn" rid="t4fns2">&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">0.807<xref ref-type="table-fn" rid="t4fns2">&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">vocD</td>
<td valign="top" align="center">0.144</td>
<td valign="top" align="center">0.145</td>
<td valign="top" align="center">&#x2212;0.081</td>
<td valign="top" align="center">0.334<xref ref-type="table-fn" rid="t4fns2">&#x002A;&#x002A;</xref></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="t4fns2"><p>&#x002A;&#x002A;<italic>p</italic> &#x003C; 0.01.</p></fn>
</table-wrap-foot>
</table-wrap>
<p>The number of different open class words (nouns, verbs and adjectives) and closed class words produced by children in each age group was shown in <xref ref-type="table" rid="T5">Table 5</xref>. Results of correlations (Pearson&#x2019;s r), as shown in <xref ref-type="table" rid="T6">Table 6</xref>, indicated that children&#x2019;s age is significantly correlated with number of different nouns produced (<italic>r</italic> = 0.415, <italic>p</italic> &#x003C; 0.01), number of different verbs produced (<italic>r</italic> = 0.282, <italic>p</italic> &#x003C; 0.01), and number of different adjectives produced (<italic>r</italic> = 0.451, <italic>p</italic> &#x003C; 0.01), but not with number of different closed class words produced (<italic>p</italic> &#x003E; 0.05).</p>
<table-wrap position="float" id="T5">
<label>TABLE 5</label>
<caption><p>Average number of different lexical items across age group.</p></caption>
<table cellspacing="5" cellpadding="5" frame="box" rules="all">
<thead>
<tr>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;"></td>
<td valign="top" align="center" colspan="2" style="color:#ffffff;background-color: #7f8080;">Age group 1</td>
<td valign="top" align="center" colspan="2" style="color:#ffffff;background-color: #7f8080;">Age group 2</td>
<td valign="top" align="center" colspan="2" style="color:#ffffff;background-color: #7f8080;">Age group 3</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;"><bold></bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>Mean</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>S.D.</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>Mean</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>S.D.</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>Mean</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>S.D.</bold></td>
</tr>
<tr>
<td valign="top" align="left">Noun<xref ref-type="table-fn" rid="t5fns1"><sup>&#x2227;</sup></xref></td>
<td valign="top" align="center">28.28</td>
<td valign="top" align="center">10.86</td>
<td valign="top" align="center">47.5</td>
<td valign="top" align="center">12.67</td>
<td valign="top" align="center">47.77</td>
<td valign="top" align="center">15.57</td>
</tr>
<tr>
<td valign="top" align="left">Verb<xref ref-type="table-fn" rid="t5fns1"><sup>&#x2227;</sup></xref></td>
<td valign="top" align="center">21.78</td>
<td valign="top" align="center">7.12</td>
<td valign="top" align="center">34.05</td>
<td valign="top" align="center">9.86</td>
<td valign="top" align="center">32.36</td>
<td valign="top" align="center">9.51</td>
</tr>
<tr>
<td valign="top" align="left">Adjective<xref ref-type="table-fn" rid="t5fns1"><sup>&#x2227;</sup></xref></td>
<td valign="top" align="center">8.06</td>
<td valign="top" align="center">3.32</td>
<td valign="top" align="center">17.19</td>
<td valign="top" align="center">6.33</td>
<td valign="top" align="center">17.95</td>
<td valign="top" align="center">5.71</td>
</tr>
<tr>
<td valign="top" align="left">Closed class words<xref ref-type="table-fn" rid="t5fns1"><sup>&#x2227;</sup></xref></td>
<td valign="top" align="center">55.5</td>
<td valign="top" align="center">10.42</td>
<td valign="top" align="center">69.52</td>
<td valign="top" align="center">16.02</td>
<td valign="top" align="center">64.95</td>
<td valign="top" align="center">18.68</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="t5fns1"><p><sup>&#x2227;</sup>Number of different items in that particular lexical class.</p></fn>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="T6">
<label>TABLE 6</label>
<caption><p>Correlation between children&#x2019;s age and number of different lexical items.</p></caption>
<table cellspacing="5" cellpadding="5" frame="box" rules="all">
<thead>
<tr>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;">Variables</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">1</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">2</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">3</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">4</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">5</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Age</td>
<td valign="top" align="center">&#x2013;</td>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">Noun<xref ref-type="table-fn" rid="t6fns3"><sup>&#x2227;</sup></xref></td>
<td valign="top" align="center">0.415<xref ref-type="table-fn" rid="t6fns2">&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">&#x2013;</td>
<td/>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">Verb<xref ref-type="table-fn" rid="t6fns3"><sup>&#x2227;</sup></xref></td>
<td valign="top" align="center">0.282<xref ref-type="table-fn" rid="t6fns1">&#x002A;</xref></td>
<td valign="top" align="center">0.775<xref ref-type="table-fn" rid="t6fns2">&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">&#x2013;</td>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">Adjective<xref ref-type="table-fn" rid="t6fns3"><sup>&#x2227;</sup></xref></td>
<td valign="top" align="center">0.451<xref ref-type="table-fn" rid="t6fns2">&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">0.748<xref ref-type="table-fn" rid="t6fns2">&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">0.729<xref ref-type="table-fn" rid="t6fns2">&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">&#x2013;</td>
<td/>
</tr>
<tr>
<td valign="top" align="left">Closed class words<xref ref-type="table-fn" rid="t6fns3"><sup>&#x2227;</sup></xref></td>
<td valign="top" align="center">0.103</td>
<td valign="top" align="center">0.792<xref ref-type="table-fn" rid="t6fns2">&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">0.805<xref ref-type="table-fn" rid="t6fns2">&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">0.727<xref ref-type="table-fn" rid="t6fns2">&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">&#x2013;</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="t6fns1"><p>&#x002A;<italic>p</italic> &#x003C; 0.05.</p></fn>
<fn id="t6fns2"><p>&#x002A;&#x002A;<italic>p</italic> &#x003C; 0.01.</p></fn>
<fn id="t6fns3"><p><sup>&#x2227;</sup>Number of utterances expressing that particular lexical item.</p></fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="S3.SS2">
<title>Semantic content categories</title>
<p><xref ref-type="table" rid="T7">Table 7</xref> presents the average number of different semantic content categories across age groups. Results of correlations (Pearson&#x2019;s r) indicated that the number of different semantic content categories is significantly correlated with age (<italic>r</italic> = 0.269, <italic>p</italic> &#x003C; 0.05) and MLU (<italic>r</italic> = 0.332, <italic>p</italic> &#x003C; 0.001). The average token count and the number of different semantic content categories per utterance produced by children in different age groups can also be found in <xref ref-type="supplementary-material" rid="TS1">Supplementary Tables 1</xref>, <xref ref-type="supplementary-material" rid="TS2">2</xref>, respectively. It was noted that semantic content category of <italic>specification</italic> was absent in all age groups.</p>
<table-wrap position="float" id="T7">
<label>TABLE 7</label>
<caption><p>Average number of different semantic content categories across age groups.</p></caption>
<table cellspacing="5" cellpadding="5" frame="box" rules="all">
<thead>
<tr>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;"></td>
<td valign="top" align="center" colspan="3" style="color:#ffffff;background-color: #7f8080;">Age group 1</td>
<td valign="top" align="center" colspan="3" style="color:#ffffff;background-color: #7f8080;">Age group 2</td>
<td valign="top" align="center" colspan="3" style="color:#ffffff;background-color: #7f8080;">Age group 3</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;"></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>Mean</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>S.D.</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>Range</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>Mean</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>S.D.</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>Range</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>Mean</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>S.D.</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>Range</bold></td>
</tr>
<tr>
<td valign="top" align="left">Number of different semantic content categories</td>
<td valign="top" align="center">14.5</td>
<td valign="top" align="center">2.2</td>
<td valign="top" align="center">9&#x2013;19</td>
<td valign="top" align="center">16.5</td>
<td valign="top" align="center">1.90</td>
<td valign="top" align="center">13&#x2013;20</td>
<td valign="top" align="center">16.4</td>
<td valign="top" align="center">1.40</td>
<td valign="top" align="center">14&#x2013;19</td>
</tr>
</tbody>
</table></table-wrap>
<p>To investigate the acquisition trajectory of different concepts within certain linguistic domains among children, one common approach is the use of a 90% criterion (e.g., <xref ref-type="bibr" rid="B9">Brown, 1973</xref>; <xref ref-type="bibr" rid="B76">So and Dodd, 1995</xref>), in which a concept is considered acquired by the majority of children in a particular age group if 90% of the children in that age group demonstrate the correct usage of that concept in the elicitation experiment. For example, researchers have examined the order of morpheme acquisition by looking at the age at which 90% of the children were reported to produce the morphemes (e.g., <xref ref-type="bibr" rid="B9">Brown, 1973</xref>; <xref ref-type="bibr" rid="B15">de Villiers and de Villiers, 1973</xref>). Likewise, a 90% criterion has also been used in research documenting the acquisition trajectory of different phonemes among young children (e.g., <xref ref-type="bibr" rid="B76">So and Dodd, 1995</xref>; <xref ref-type="bibr" rid="B93">Zarifian et al., 2015</xref>; <xref ref-type="bibr" rid="B13">Crowe and McLeod, 2020</xref>). To investigate the age of acquisition for different semantic content categories, the 90% criterion was therefore adopted. A semantic content category was regarded as acquired by the particular age group if 90% of the participants in the group produced the semantic content category at least once in the sample collected. <xref ref-type="table" rid="T8">Table 8</xref> summarizes the semantic content categories acquired by participants from 2 to 5 years old. Among the 2-year-old participants, nine semantic content categories were acquired (<italic>existence</italic>, <italic>non-existence</italic>, <italic>reject</italic>, <italic>attribution</italic>, <italic>action</italic>, <italic>locative state</italic>, <italic>state</italic>, <italic>quantity</italic>, <italic>temporal</italic>). Notably, the semantic content category of <italic>reject</italic> reached 90% occurrence in this age group but declined and did not meet the acquisition criterion in the two older age groups. Three more categories (<italic>denial</italic>, <italic>possession</italic>, <italic>locative action</italic>) were added to the inventory of the 3 year olds while another three categories (<italic>dative</italic>, <italic>additive</italic>, <italic>causal</italic>) were further acquired by the 4 year olds. Six semantic content categories, namely <italic>recurrence</italic>, <italic>notice</italic>, <italic>adversative</italic>, <italic>epistemic</italic>, <italic>specification</italic>, and <italic>communication</italic>, were not fully acquired even by the 5-year-old group.</p>
<table-wrap position="float" id="T8">
<label>TABLE 8</label>
<caption><p>Occurrence of different semantic content categories by age group.</p></caption>
<table cellspacing="5" cellpadding="5" frame="box" rules="all">
<thead>
<tr>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;"></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">2 year olds</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">3 year olds</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">4 year olds</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Existence</td>
<td valign="top" align="center" colspan="3"><inline-graphic xlink:href="fpsyg-14-1234525-i001.jpg"/></td>
</tr>
<tr>
<td valign="top" align="left">Non-existence</td>
<td valign="top" align="center" colspan="3"><inline-graphic xlink:href="fpsyg-14-1234525-i002.jpg"/></td>
</tr>
<tr>
<td valign="top" align="left">Attribution</td>
<td valign="top" align="center" colspan="3"><inline-graphic xlink:href="fpsyg-14-1234525-i003.jpg"/></td>
</tr>
<tr>
<td valign="top" align="left">Action</td>
<td valign="top" align="center" colspan="3"><inline-graphic xlink:href="fpsyg-14-1234525-i004.jpg"/></td>
</tr>
<tr>
<td valign="top" align="left">Locative state</td>
<td valign="top" align="center" colspan="3"><inline-graphic xlink:href="fpsyg-14-1234525-i005.jpg"/></td>
</tr>
<tr>
<td valign="top" align="left">State</td>
<td valign="top" align="center" colspan="3"><inline-graphic xlink:href="fpsyg-14-1234525-i006.jpg"/></td>
</tr>
<tr>
<td valign="top" align="left">Quantity</td>
<td valign="top" align="center" colspan="3"><inline-graphic xlink:href="fpsyg-14-1234525-i007.jpg"/></td>
</tr>
<tr>
<td valign="top" align="left">Temporal</td>
<td valign="top" align="center" colspan="3"><inline-graphic xlink:href="fpsyg-14-1234525-i008.jpg"/></td>
</tr>
<tr>
<td valign="top" align="left">Reject</td>
<td valign="top" align="center" colspan="3"><inline-graphic xlink:href="fpsyg-14-1234525-i009.jpg"/></td>
</tr>
<tr>
<td valign="top" align="left">Denial</td>
<td valign="top" align="center" colspan="3"><inline-graphic xlink:href="fpsyg-14-1234525-i010.jpg"/></td>
</tr>
<tr>
<td valign="top" align="left">Locative action</td>
<td valign="top" align="center" colspan="3"><inline-graphic xlink:href="fpsyg-14-1234525-i011.jpg"/></td>
</tr>
<tr>
<td valign="top" align="left">Possession</td>
<td valign="top" align="center" colspan="3"><inline-graphic xlink:href="fpsyg-14-1234525-i012.jpg"/></td>
</tr>
<tr>
<td valign="top" align="left">Additive</td>
<td valign="top" align="center" colspan="3"><inline-graphic xlink:href="fpsyg-14-1234525-i013.jpg"/></td>
</tr>
<tr>
<td valign="top" align="left">Causal</td>
<td valign="top" align="center" colspan="3"><inline-graphic xlink:href="fpsyg-14-1234525-i014.jpg"/></td>
</tr>
<tr>
<td valign="top" align="left">Dative</td>
<td valign="top" align="center" colspan="3"><inline-graphic xlink:href="fpsyg-14-1234525-i015.jpg"/></td>
</tr>
<tr>
<td valign="top" align="left">Recurrence</td>
<td valign="top" align="center" colspan="3"><inline-graphic xlink:href="fpsyg-14-1234525-i016.jpg"/></td>
</tr>
<tr>
<td valign="top" align="left">Notice</td>
<td valign="top" align="center" colspan="3"><inline-graphic xlink:href="fpsyg-14-1234525-i017.jpg"/></td>
</tr>
<tr>
<td valign="top" align="left">Adversative</td>
<td valign="top" align="center" colspan="3"><inline-graphic xlink:href="fpsyg-14-1234525-i018.jpg"/></td>
</tr>
<tr>
<td valign="top" align="left">Epistemic</td>
<td valign="top" align="center" colspan="3"><inline-graphic xlink:href="fpsyg-14-1234525-i019.jpg"/></td>
</tr>
<tr>
<td valign="top" align="left">Communication</td>
<td valign="top" align="center" colspan="3"><inline-graphic xlink:href="fpsyg-14-1234525-i020.jpg"/></td>
</tr>
<tr>
<td valign="top" align="left">Specification</td>
<td valign="top" align="center" colspan="3"><inline-graphic xlink:href="fpsyg-14-1234525-i021.jpg"/></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p>A dotted line indicates that the semantic content category had &#x003C;20% occurrence, a dashed line indicates 20%&#x2013;89% occurrence, and a solid line indicates &#x2265;90% occurrence.</p></fn>
</table-wrap-foot>
</table-wrap>
</sec>
</sec>
<sec id="S4" sec-type="discussion">
<title>Discussion</title>
<p>The current study reported a database, the CMCL, that documents the language samples obtained from Mandarin-speaking children aged between 2 and 5 years old. The orthographically transcribed language samples were tagged with their corresponding part of speech and semantic content category. Before discussing the major focus of the acquisition pattern of semantic content categories, to ensure the representativeness of the content of the corpus, typical measures of utterance length and lexical diversity were obtained. Results indicated that the effect of age on sentence length and lexical diversity from previous studies were replicated. Details were elaborated in the following section.</p>
<sec id="S4.SS1">
<title>Utterance length and lexical diversity</title>
<p>In the current study, it was observed that shorter utterance length was associated with the younger age group in general. The increase in utterance length moving from the 2 year olds to the 3 year olds, as reflected in the MLU measure, is consistent with reports of previous studies on both English (<xref ref-type="bibr" rid="B56">Miller, 1981</xref>; <xref ref-type="bibr" rid="B62">Paul, 2000</xref>; <xref ref-type="bibr" rid="B67">Rice et al., 2010</xref>; <xref ref-type="bibr" rid="B59">Moyle et al., 2011</xref>) and Chinese children (<xref ref-type="bibr" rid="B12">Cheung, 1998</xref>; <xref ref-type="bibr" rid="B40">Klee et al., 2004</xref>; <xref ref-type="bibr" rid="B36">Jin and Jin, 2008</xref>; <xref ref-type="bibr" rid="B90">Wu, 2020</xref>). On the other hand, observations of the 4 year olds appeared to indicate a plateau in sentence length in this age group and replicated previous findings that MLU may be less sensitive in capturing the progress in language abilities of children beyond the earliest stage of language acquisition (e.g., <xref ref-type="bibr" rid="B38">Klee and Fitzgerald, 1985</xref>; <xref ref-type="bibr" rid="B75">Smith and Jackins, 2014</xref>).</p>
<p>Regarding the lexical diversity among children, NDW, especially for the open class words and TTR, appeared to reflect some age effects in the current study. However, it is noteworthy that the results also demonstrated positive correlations between NDW and TTR with MLU, but not between vocD and MLU. This echoed previous findings that suggested the possible confounds of NDW and TTR in representing the vocabulary skills of children, as both of them may be affected by sentence length (<xref ref-type="bibr" rid="B39">Klee, 1992</xref>). To this end, the current findings tended to support the notion that vocD is more suitable for measuring lexical diversity in the context of Mandarin (<xref ref-type="bibr" rid="B94">Zhang and Zhou, 2020</xref>). Besides, vocD also appeared to be a more sensitive measure in reflecting the progress of lexical diversity across age groups, as significant differences were found between both the 2-year-old and 3-year-old groups, as well as between the 3-year-old and 4-year-old groups. In sum, similar to the measure of sentence length, the increase in lexical diversity across age groups, as indicated in the measure of vocD, is also consistent with previous results with English-speaking (<xref ref-type="bibr" rid="B57">Miller, 1991</xref>; <xref ref-type="bibr" rid="B60">Owen and Leonard, 2002</xref>) and Chinese-speaking children (<xref ref-type="bibr" rid="B40">Klee et al., 2004</xref>; <xref ref-type="bibr" rid="B36">Jin and Jin, 2008</xref>; <xref ref-type="bibr" rid="B89">Wu et al., 2019</xref>; <xref ref-type="bibr" rid="B94">Zhang and Zhou, 2020</xref>). Generally, the results obtained from the CMCL replicated the typical language acquisition pattern from previous studies, which confirm that the procedures we applied in data collecting language samples provide a valid data source for studying language acquisition in young Mandarin-speaking children.</p>
<p>Open class words primarily convey the concrete content of the sentences whereas closed class words are usually more related to the grammatical aspects of sentences, and include relatively few members (<xref ref-type="bibr" rid="B87">Weber-Fox and Neville, 2001</xref>). It was observed that young children produced proportionally more open class words than closed class words in a previous study using parent questionnaires (<xref ref-type="bibr" rid="B41">Klintfors et al., 2009</xref>). In the current study, however, the productions of the open class words and closed class words were more comparable among the 2 year olds. The standardized elicitation probes used in the current study may be one possible reason that accounts for the different observations compared with previous literature. With the standardized elicitation probes that focused mainly on here and now contexts in the current study, the 2 year olds, who are more capable of producing contextualized expressions, may show higher tendency to produce only open class words targeted in the elicitation probes, and consequently produced only a limited number of them. On the other hand, closed class words include aspect markers, numerals, classifiers, determiners, pronouns, sentence final particles and prepositions. Some of them, like aspect markers and sentence final particles, are particularly prominent in Mandarin and are used extensively by young children (<xref ref-type="bibr" rid="B20">Erbaugh, 1992</xref>; <xref ref-type="bibr" rid="B23">Fang and Hengeveld, 2022</xref>). This may also lead to the production of apparently more closed class words among the 2 year old, resulting in a relatively even ratio between the number of open and closed class words observed.</p>
<p>The results from the current study indicated consistent findings on the traditional measures like MLU and vocD in the language acquisition of Mandarin-speaking children, thus confirming the representativeness of the corpus. In the following, we discussed the findings of our major focus regarding the acquisition of semantic content category in Mandarin-speaking children.</p>
</sec>
<sec id="S4.SS2">
<title>Acquisition of semantic content categories and the underlying factors</title>
<p>Given the importance of the semantic domain in language acquisition mentioned before, the usage and acquisition of various semantic content categories were investigated to supplement the study of child language acquisition from the semantic perspective.</p>
<p>Results showed that the number of different semantic content categories increased with both age and sentence length. The increase in the unique semantic content categories thus offers a glimpse into the language acquisition of typically developing children from the semantic perspective along with the syntactic analysis, providing a more comprehensive picture of language acquisition. Moreover, by using the 90% criterion, the acquisition pattern of different semantic content categories in Mandarin-speaking young children were also investigated. Among the 21 semantic content categories, nine of them, namely <italic>existence</italic>, <italic>non-existence</italic>, <italic>reject</italic>, <italic>attribution</italic>, <italic>action</italic>, <italic>locative state</italic>, <italic>state</italic>, <italic>quantity</italic>, and <italic>temporal</italic>, were acquired early by the 2-year-old group. The early acquisition of these semantic content categories, except <italic>temporal</italic>, is generally coherent with previous studies (<xref ref-type="bibr" rid="B77">Stockman and Vaughn-Cooke, 1986</xref>; <xref ref-type="bibr" rid="B6">Bloom, 1991</xref>). The later acquisition of <italic>temporal</italic> content reported in <xref ref-type="bibr" rid="B6">Bloom (1991)</xref> may probably be due to their investigation of the temporal content related to the use of conjunctions only in that particular study, while aspect markers, which occur in Mandarin-speaking children as young as 18 months (<xref ref-type="bibr" rid="B95">Zhou, 2004</xref>), were included in our study of the temporal content category. Besides, according to <xref ref-type="bibr" rid="B29">Hao et al. (2008)</xref>, Mandarin-speaking children aged 17&#x2013;30 months experience a substantial vocabulary growth, and are capable in expressing vocabulary of objects, people, places and actions, as well as using some quantifiers, descriptive words, and words about time. By relating these vocabularies to their corresponding semantic content category (e.g., content of <italic>existence</italic> can be expressed with objects and people; content of <italic>locative state</italic> can be expressed with place; content of <italic>attribution</italic> and <italic>state</italic> can be expressed with descriptive words; content of <italic>action</italic> can be expressed by action words; content of <italic>quantity</italic> can be expressed by quantifier; content of <italic>temporal</italic> can be expressed by time words and aspect markers), it is found that the acquisition trajectory on semantic content categories in our study generally replicates previous research findings.</p>
<p>Adopting <xref ref-type="bibr" rid="B6">Bloom&#x2019;s (1991)</xref> framework, the early acquisition of the abovementioned semantic content categories can be explained by the relatively simple cognitive complexity of the concept being coded. <xref ref-type="bibr" rid="B37">Johnston (1985)</xref> suggested that conceptual development is the prerequisite for semantic growth, while <xref ref-type="bibr" rid="B72">Slobin (1973)</xref> also proposed that complexity of the concepts affects the acquisition of linguistic terms, in which more abstract and complex ideas are acquired later. As a result, the cognitive complexity of the underlying concepts inevitably affects children&#x2019;s semantic representations, and thus the acquisition of various semantic content categories. Specifically, acquisition of the semantic content categories of <italic>existence</italic>, <italic>non-existence</italic>, and <italic>action</italic> require the sensorimotor knowledge related to object permanence and should therefore be acquired in the sensorimotor period before age of two (<xref ref-type="bibr" rid="B64">Piaget, 2000</xref>). In addition, concepts of objects and events are perceptually easy as concrete referents are usually involved. <xref ref-type="bibr" rid="B42">Lahey (1988)</xref> also claimed that young children have possessed the basic knowledge on objects and events at an early age. Among the earliest acquired semantic content categories, <italic>existence</italic> and <italic>non-existence</italic> mainly represent objects, people and events; <italic>locative state</italic> is related to different locations; <italic>action</italic> conveys the meaning of motion which does not involve changes of location. Besides, <italic>reject</italic>, <italic>attribution</italic>, <italic>state</italic>, <italic>quantity</italic>, and <italic>temporal</italic> mostly serve to provide additional information of the object, action and event. The mapping between this concrete sensorimotor and cognitively simple knowledge of single object and event with words can therefore be established at an early age, and are reflected in their naturalistic speech expression.</p>
<p>Alternatively, the acquisition of these semantic content categories may also be related to linguistic factors. As children&#x2019;s early vocabularies contain proportionally more open class words than closed class ones (<xref ref-type="bibr" rid="B41">Klintfors et al., 2009</xref>), it is not surprised to observe that some of the above semantic content categories which are mostly coded using open class words (e.g., <italic>existence</italic>, <italic>action</italic>, <italic>attribution)</italic> were acquired early. Furthermore, most of these semantic content categories can be coded with simple syntactic units, such as single words (e.g., <italic>existence</italic>, <italic>non-existence</italic>, <italic>attribution</italic>, <italic>action</italic>, <italic>locative state</italic>, <italic>state</italic>), simple noun phrase [e.g., &#x201C;&#x4E00;&#x4E2A;&#x201D; (numeral + classifier) to code <italic>quantity</italic>] and simple verb-phrase [e.g., &#x201C;&#x5403;&#x4E86;&#x201D; (verb + aspect marker) to code <italic>temporal</italic>]. In line with <xref ref-type="bibr" rid="B6">Bloom&#x2019;s (1991)</xref> proposal on the effect of syntactic complexity, semantic content categories related to simple sentences were therefore acquired first.</p>
<p>Moving on to the 3-year-old group, three more semantic content categories, namely <italic>denial</italic>, <italic>locative action</italic>, and <italic>possession</italic> were subsequently acquired. The emergence of these semantic content categories also followed the acquisition sequence reported in <xref ref-type="bibr" rid="B77">Stockman and Vaughn-Cooke (1986)</xref> and <xref ref-type="bibr" rid="B6">Bloom (1991)</xref> in general. The above acquisition is again considered to be related to the cognitive and syntactic complexity. Firstly, children progressively acquired the simple relations between objects and events on top of the knowledge in single objects and events (<xref ref-type="bibr" rid="B42">Lahey, 1988</xref>). To this end, the content of <italic>denial</italic> which negates attribute, identity and state of events (<xref ref-type="bibr" rid="B10">Chang, 1992</xref>), and the content of <italic>locative action</italic>, which expresses the change in locations including the source, goal and/or path, are used to represent the relations between objects and events. Similarly, the content of <italic>possession</italic> also expresses the relations between the owner or possessor and the entity. Acquisition of these semantic content categories representing the simple relational knowledge between objects and events tended to involve higher demand in cognitive processing, and therefore these semantic content categories emerged later than those acquired by the 2-year-old group.</p>
<p>The syntactic complexity associated with these semantic content categories may also play a role in the acquisition. To express the content of <italic>possession</italic> in Mandarin, it is proposed that the genitive marker &#x201C;<italic>de</italic>&#x201D; is the most common linguistic device being used (<xref ref-type="bibr" rid="B71">Shi and Zhou, 2018</xref>). <xref ref-type="bibr" rid="B48">Li (2004)</xref> also reported that the comprehensive use of possessive expressions with &#x201C;<italic>de</italic>&#x201D; was acquired by age of three, which appear to be in line with our observations on the acquisition of <italic>possession</italic>. Meanwhile, this grammatical marking &#x201C;<italic>de</italic>&#x201D; is often regarded as a bigger challenge than the content words in the acquisition by young children (<xref ref-type="bibr" rid="B33">Huang et al., 2022</xref>). On the other side, the content of <italic>locative action</italic> may also utilize co-verbs or prepositional phrases to encode both manner and path of the motion in Mandarin (<xref ref-type="bibr" rid="B74">Slobin, 2004</xref>). The unique thematic roles associated with different prepositions in Chinese may further impose difficulties in the acquisition of the corresponding constructions (<xref ref-type="bibr" rid="B43">Lau et al., 2023</xref>). These specific language properties of Mandarin thus constitute a higher syntactic complexity in encoding both <italic>possession</italic> and <italic>locative action</italic>, and account for their acquisition beyond 2 year old in Mandarin-speaking children.</p>
<p>Next, the 4-year-old group further acquired the semantic content categories of <italic>additive</italic>, <italic>causal</italic>, and <italic>dative</italic>. This again replicates the acquisition order reported in both <xref ref-type="bibr" rid="B77">Stockman and Vaughn-Cooke (1986)</xref> and <xref ref-type="bibr" rid="B6">Bloom (1991)</xref>. The acquisition trend in this age group can also be explained according to the cognitive and syntactic complexity associated with the semantic content categories. First, both additive and causal represent specific relations of two events or states and should be cognitively more complex that those representing only one event. Besides, the content of causal additionally indicates a dependency between the events and gives the reason or result of the events, thus requiring more cognitive resources to process (<xref ref-type="bibr" rid="B6">Bloom, 1991</xref>). The above contributes an initial explanation for the acquisitions of additive and causal in this group of children.</p>
<p>On the other hand, acquisition of <italic>dative</italic> seems to be more related to the syntactic complexity. The coding of <italic>dative</italic> in Mandarin usually involves the use of different syntactic structures and prepositions with different themes [e.g., &#x201C;&#x9001;&#x82B1;&#x7ED9;&#x59B9;&#x59B9;&#x201D; (verb + noun + preposition + noun)] (Details about different syntactic structures and use of prepositions are elaborated in later section). As mentioned before, Chinese prepositions are usually associated with specific thematic roles, which impose additional challenges on the syntax constructions (<xref ref-type="bibr" rid="B43">Lau et al., 2023</xref>). As a result, a more complex syntactic structure entailing the relations between recipients and the affected objects is usually involved with <italic>dative</italic> expressions. Similarly, the coding of <italic>additive</italic> and <italic>causal</italic> contents also requires the conjoining of phrases or clauses with conjunctions [e.g., &#x201C;&#x9762;&#x5305;&#x548C;&#x9E21;&#x86CB;&#x201D; (noun + conjunction + noun) to code <italic>additiv</italic>e; &#x201C;&#x751F;&#x75C5;&#x5C31;&#x770B;&#x533B;&#x751F;&#x201D; (verb + conjunction + verb) to code <italic>causal</italic>]. This seemingly higher syntactic complexity provides further explanations to their acquisition in this later stage of early childhood.</p>
<p>Finally, the semantic content categories of <italic>recurrence</italic>, <italic>notice</italic>, <italic>adversative</italic>, <italic>epistemic</italic>, <italic>specification</italic>, and <italic>communication</italic> were found not to be acquired by the 4-year-old group, using the 90% criterion. Once again, the late acquisition of the above semantic content categories, except <italic>recurrence</italic>, aligns the order suggested by <xref ref-type="bibr" rid="B77">Stockman and Vaughn-Cooke (1986)</xref> and <xref ref-type="bibr" rid="B6">Bloom (1991)</xref>. From the cognitive perspective, as noted in previous section, an increment in the cumulative cognitive complexity among <italic>additive</italic>, <italic>temporal</italic>, <italic>causal</italic>, and <italic>adversative</italic> is suggested. The content of <italic>adversative</italic> does not only represent definite events with highest cumulative cognitive complexity, but additionally denotes complex contrastive relationship beyond the here-and-now context (<xref ref-type="bibr" rid="B6">Bloom, 1991</xref>). The highest cognitive complexity of <italic>adversative</italic> thus explained its later acquisition. On the other hand, it is also proposed that the semantic content categories about how people think (<italic>epistemic</italic>) or talk (<italic>communication</italic>) about events, which involve implicit and non-transparent reasoning processes, may be complex for young children to interpret because it is considered relatively difficult for them to take the perspective of others (<xref ref-type="bibr" rid="B22">Evers-Vermeul and Sanders, 2011</xref>). In addition, some specific lexicons are also required in representing the definite relations of events in certain semantic content categories (e.g., perceptual verb to code content of <italic>notice</italic>; mental state verb to code content of <italic>epistemic</italic>; verb for conversation to describe what is to be express in the content of <italic>communication</italic>). The challenges in acquiring these specific and abstract words therefore pose extra difficulties in acquiring the corresponding semantic content categories.</p>
<p>Syntactically, in Mandarin, expressions of <italic>adversative</italic> usually require the use of conjunctions between clauses, and generally appear after those associated with simple sentences, but earlier than those related with more complex complementation (i.e., <italic>epistemic</italic>, <italic>notice</italic>, <italic>communication).</italic> Semantic content categories related with complementation, subsequently, also appeared earlier than those related to relativization (i.e., <italic>specification</italic>). The higher syntactic complexity of these semantic content categories may therefore explain why they are not acquired by the 4-year-old children.</p>
<p>It is noted that the semantic content category of <italic>specification</italic> was absent at all ages in the current study, which is coherent with the infrequent occurrence in young children reported in <xref ref-type="bibr" rid="B6">Bloom (1991)</xref>. According to <xref ref-type="bibr" rid="B42">Lahey (1988)</xref>, <italic>specification</italic> is usually expressed to indicate a particular person, object or event in terms of their functions, places or activities. More specific relational knowledge is thus required for its production. In addition, expression of <italic>specification</italic> usually involves complex sentences with relativization (e.g., &#x7A7F;&#x7EA2;&#x88D9;&#x5B50;&#x7684;&#x5973;&#x5B69;&#x5728;&#x54ED;) (<xref ref-type="bibr" rid="B6">Bloom, 1991</xref>). According to <xref ref-type="bibr" rid="B2">Arndt and Schuele (2013)</xref>, typically developing children usually start to form sentences containing relative clauses between the ages of 4 and 5 years of age and they continue to master the productions through the school-age year. It is therefore believed that this semantic content category only emerged at an older age, probably after age of five due to higher demand in cognitive ability and syntactic knowledge.</p>
<p>Unexpectedly, the semantic content category of <italic>recurrence</italic> was not acquired by the age of five in the current study. Likewise, it is also interesting to observe that the occurrence of the early acquired semantic content category of <italic>reject</italic> declined across ages and its occurrence did not reach 90% after the age of three. Both contents represent knowledge of single object and event and possess relatively low cognitive complexity. Syntactically, both can be encoded with simple phrases [e.g., &#x518D;&#x5403; (adverb + verb) to code <italic>recurrence</italic>; &#x4E0D;&#x73A9; (negative marker + verb) to code <italic>reject</italic>]. With relatively simple cognitive complexity and syntactic structure, these contents are expected to be acquired early. One possible reason is that the provided communicative context may not be obligatory enough to elicit the content categories of <italic>recurrence</italic> and <italic>reject</italic>. In addition, the high occurrence of <italic>reject</italic> in the 2-year-old group may also be related to the stage of &#x201C;trouble two&#x201D; in children&#x2019;s development. It has been proposed that frequent non-compliance is common in toddlerhood and peaks during the second year (<xref ref-type="bibr" rid="B85">Tremblay, 2004</xref>; <xref ref-type="bibr" rid="B1">Alink et al., 2006</xref>). Most children then learn to regulate their behavior as they grow into preschool years and their non-compliance declines (<xref ref-type="bibr" rid="B84">Tremblay, 2003</xref>). Our findings appear to be consistent with this pattern of child development. On the other hand, it is also possible that some cultural specific factors may affect the production of these semantic content categories. Accordingly, social harmony and fitting in with others are more encouraged in the Chinese context than in the Western ones (<xref ref-type="bibr" rid="B88">Wu, 1996</xref>). According to <xref ref-type="bibr" rid="B92">Xu and Farver (2009)</xref>, Chinese children tend to exhibit the regulated shyness, as a self-controlled form of social restraint to maintain harmonious group functioning and exemplifies self-regulation. Since <italic>reject</italic> and <italic>recurrence</italic> are mostly expressed when children intended to refuse or make additional request on objects or events, these may not adhere the social harmony in the Chinese culture. Similarly, it is also suggested that Chinese children are instructed to be more obedient due to the influence of Confucian teachings in traditional Chinese culture (<xref ref-type="bibr" rid="B91">Xiao, 1999</xref>). Older children may therefore tend to produce fewer negative expressions to reject, especially when interacting with our examiners that are all adults during the data collection. Considering all of the above, young Mandarin-speaking children therefore appeared to express fewer <italic>recurrence</italic> and <italic>reject</italic> contents in the current study.</p>
<p>To conclude, the acquisition trajectory of semantic content category in Mandarin-speaking-children aligns the sequence suggested by <xref ref-type="bibr" rid="B6">Bloom (1991)</xref> for English-speaking children. Acquisitions in both languages are predominantly predicted by the associated cognitive and syntactic complexity, while specific linguistic properties in Mandarin and Chinese culture also seem to have a role in modulating the acquisition of particular semantic content categories.</p>
</sec>
<sec id="S4.SS3">
<title>Potential usage of the database in investigating the form-content interface in language acquisition of young children</title>
<p>According to <xref ref-type="bibr" rid="B6">Bloom (1991)</xref>, young children acquire syntax and semantics together to lay the foundations of early expressions, and these two domains interact in a bidirectional manner to regulate the acquisition in early child language (<xref ref-type="bibr" rid="B58">Mok and Kipka, 2009</xref>). As mentioned, a certain form can be used to express various semantic content categories while an individual semantic content category can also be expressed with different forms. On some occasions, children who fail to encode the concept with specific grammatical forms, may still be able to possess the semantic content category with the expressions of the non-standard or ungrammatical forms (e.g., &#x7238;&#x7238;&#x978B;&#x978B; &#x201C;daddy shoe&#x201D; with absence of the genitive marker &#x201C;<italic>de</italic>&#x201D; to express <italic>possession</italic>). In view of the above, the importance to consider both language form and semantic content, as well as their interaction in child language study should not be underestimated.</p>
<p>The current study adopted language sample analysis, which allows the elicitation of various semantic content categories even among the youngest children by using standardized protocol. The established CMCL does not only provide important information on the acquisition of various semantic content categories in Mandarin-speaking children, but allows further investigation on the interaction between language form and content. By analyzing language samples annotated both syntactically (with parts of speech) and semantically (with semantic content categories), the relations between form and content produced by Mandarin-speaking preschoolers were further investigated. In the following, the syntactic category of verb and the semantic content category of <italic>dative</italic> were used as examples to illustrate the usage of the database to explore the interaction between form and content in the language acquisition of Mandarin-speaking children.</p>
</sec>
<sec id="S4.SS4">
<title>Expression of verb (form) with different semantic content categories</title>
<p>In the past, vast amount of research studies have been conducted to investigate the acquisition of early vocabulary by young children but conflicting results in the acquisition of nouns and verbs in early child language have been documented in the literature. <xref ref-type="bibr" rid="B26">Gentner (1982)</xref> proposed that nouns are universally acquired before verbs, while the others found that verbs can also appear in children&#x2019;s earliest vocabularies (<xref ref-type="bibr" rid="B52">Ma et al., 2009</xref>). Besides, <xref ref-type="bibr" rid="B82">Tardif (1996)</xref> also argued that young Mandarin-speaking children actually produced more verbs than nouns in their naturalistic speech. In view of this, the different semantic content categories encoded using verbs in the CMCL were explored in the current study. By searching utterances annotated with the part of speech &#x201C;verb&#x201D; in the CMCL, it is found that verb was used to express semantic content category of <italic>action</italic>, <italic>state</italic>, <italic>locative action</italic>, <italic>dative</italic>, <italic>notice</italic>, <italic>communication</italic>, and <italic>epistemic</italic>. Consider the proposed order of acquisition, <italic>action</italic> and <italic>state</italic> were the earliest semantic content categories to be acquired among all. <italic>Locative action</italic> and <italic>dative</italic> followed these two and were acquired by the 3-year-old and 4-year old children, respectively. Finally, the content of <italic>notice</italic>, <italic>communication</italic>, and <italic>epistemic</italic> were not acquired till the age of five. As such, it will not be adequate to look at the surface form of the vocabulary alone, but to examine different semantic content categories being expressed by the form, in order to have a more comprehensive picture on early language acquisition. By classifying verb according to the different semantic content categories being represented, it is possible to explain why some verbs appear earlier in age whereas the others emerge later. It appeared that the acquisition of different lexical forms in young children is affected by the semantic content being encoded. With the syntactically and semantically annotated database, it is possible to investigate the form-content interface more thoroughly.</p>
</sec>
<sec id="S4.SS5">
<title>Expression of dative (semantic content category) with different forms</title>
<p>Next, the expressions of the semantic content category of <italic>dative</italic> with different forms were investigated. Utterances expressing the content of <italic>dative</italic> designate the recipient of an object or action with or without a preposition (<xref ref-type="bibr" rid="B42">Lahey, 1988</xref>). These utterances were extracted from the CMCL for further investigation of different lexical items and sentence structures produced by participants across ages. The lists of lexical items produced were shown in <xref ref-type="table" rid="T9">Table 9</xref>.</p>
<table-wrap position="float" id="T9">
<label>TABLE 9</label>
<caption><p>Lexical items expressing dative content produced by each age group.</p></caption>
<table cellspacing="5" cellpadding="5" frame="box" rules="all">
<thead>
<tr>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;">Age group</td>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;">Lexical items expressing dative content</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="left">&#x7ED9;<sub>1</sub><xref ref-type="table-fn" rid="t9fni"><sup>i</sup></xref>, &#x7ED9;<sub>2</sub><xref ref-type="table-fn" rid="t9fnii"><sup>ii</sup></xref>, &#x7ED9;<sub>3</sub><xref ref-type="table-fn" rid="t9fniii"><sup>iii</sup></xref>, &#x5E2E;</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="left">&#x7ED9;<sub>1</sub><xref ref-type="table-fn" rid="t9fni"><sup>i</sup></xref>, &#x7ED9;<sub>2</sub><xref ref-type="table-fn" rid="t9fnii"><sup>ii</sup></xref>, &#x7ED9;<sub>3</sub><xref ref-type="table-fn" rid="t9fniii"><sup>iii</sup></xref>, &#x5E2E;, &#x5582;, &#x966A;, &#x8BA9;, &#x8BF7;</td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="left">&#x7ED9;<sub>1</sub><xref ref-type="table-fn" rid="t9fni"><sup>i</sup></xref>, &#x7ED9;<sub>2</sub><xref ref-type="table-fn" rid="t9fnii"><sup>ii</sup></xref>, &#x7ED9;<sub>3</sub><xref ref-type="table-fn" rid="t9fniii"><sup>iii</sup></xref>, &#x5E2E;, &#x5582;, &#x966A;, &#x8BA9;, &#x8BF7;, &#x9001;, &#x5BF9;, &#x7ED9;</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="t9fni"><p>i. &#x7ED9;<sub>1</sub> serves as a verb (e.g., &#x7ED9;&#x6211;&#x4E00;&#x672C;&#x4E66;).</p></fn>
<fn id="t9fnii"><p>ii. &#x7ED9;<sub>2</sub> serves as a preverbal preposition (e.g., &#x7ED9;&#x4ED6;&#x9001;&#x4E00;&#x672C;&#x4E66;).</p></fn>
<fn id="t9fniii"><p>iii. &#x7ED9;<sub>3</sub> serves as a post verbal preposition (e.g., &#x9001;&#x7ED9;&#x4ED6;&#x4E00;&#x672C;&#x4E66; or &#x9001;&#x4E00;&#x672C;&#x4E66;&#x7ED9;&#x4ED6;) (<xref ref-type="bibr" rid="B45">Li and Thompson, 1981</xref>).</p></fn>
</table-wrap-foot>
</table-wrap>
<p>The repertoire of lexical items expressing <italic>dative</italic> content was observed to be expanding across age groups. Older children were more capable in using a larger variety of lexical items, including verbs and prepositions, to express the content of <italic>dative</italic>. Considering the syntactic structures, it was found that the 2 year olds were able to produce sentences with subject-predicate (e.g., &#x6211;&#x7ED9;&#x5B83;&#x8FD9;&#x4E2A;), prepositional phrase (e.g., &#x7ED9;&#x5979;&#x7A7F;&#x4E0A;&#x8863;&#x670D;) and serial verb construction (e.g., &#x5976;&#x5976;&#x5E2E;&#x5B83;&#x5237;) in expressing the content of <italic>dative</italic>. On the other hand, the older children were capable in using pivotal construction (e.g., &#x6211;&#x4EEC;&#x8981;&#x8BF7;&#x4F60;&#x5403;&#x996D;) in addition to the above sentence types. In addition to earlier findings that the content of <italic>dative</italic> was only fully acquired by children in the oldest age group, these results indicate that older children were able to express the content with a larger variety of lexical items and syntactic structures.</p>
<p>From the above examples, by searching utterances annotated with verbs, it is possible to examine different semantic content categories being expressed. On the other hand, by searching utterances annotated with <italic>dative</italic> content, different lexical items and syntactic structures produced by participants across age groups can also be analyzed. Thus, the CMCL provides a useful and convenient tool for us to study the language acquisition of young children, from both syntactic and semantic perspectives. A preliminary platform for investigating this interaction between syntactic form and semantic content in the language acquisition process of Mandarin-speaking children is provided. It is further proposed that children acquire language in a two-dimensional way with various form-content interactions. When acquiring a particular form, children progressively broaden the semantic content categories being expressed by that form. Similarly, when acquiring a particular semantic content category, children also gradually expand the use of different lexical items and syntactic structures to represent that semantic content category.</p>
<p>In this study, the CMCL that documented the production of both content (semantic content category) and form (part of speech) was established. By analyzing data extracted from the database, our findings do not only provide insights in the acquisition trajectory of semantic content category in Mandarin-speaking children, but also shed new light on the understanding of language acquisition in the semantic domains. Apart from the influence of cognitive and syntactic complexity, specific language properties in Mandarin and Chinese culture also seem to play a role in the acquisition. More in-depth investigation on the acquisition of particular semantic content category in Mandarin is therefore inspired and supported by our preliminary findings. The CMCL will also be interesting for the broader research community and allow further research including the investigation of the form-content interface in the early language of children. It may further contribute to validate different theories in child language acquisition. Clinically, our findings inspire a potentially more comprehensive approach in profiling the language ability of children with typical development, as well as those with language disorders. Assessment and intervention for children with language disorders can be planned with a balanced consideration between form and content.</p>
</sec>
</sec>
<sec id="S5">
<title>Limitations and future directions</title>
<p>Our results present an initial acquisition order on various semantic content categories in Mandarin-speaking children, and support that same set of factors, i.e., syntactic and cognitive complexity, predominantly affect the acquisition of semantic content categories in Mandarin. Nevertheless, some semantic content categories such as <italic>specification</italic> and <italic>epistemic</italic> may be too complex and abstract to acquire at an early age (<xref ref-type="bibr" rid="B6">Bloom, 1991</xref>; <xref ref-type="bibr" rid="B22">Evers-Vermeul and Sanders, 2011</xref>). Future research should therefore be extended to include older children, so as to gain a clear overall picture on the usage and acquisition of various semantic content categories. On the other hand, LSA provides a rich communicative context and allows naturalistic language production in children (<xref ref-type="bibr" rid="B21">Evans and Craig, 1992</xref>; <xref ref-type="bibr" rid="B61">Owens, 2010</xref>), but some cultural-sensitive semantic content category such as <italic>recurrence</italic> and <italic>reject</italic> may not be elicited within the context provided. It may be necessary in future studies to include some structural language-eliciting procedures, to capture some particular semantic content categories. Finally, other limitations of the current study included its cross-sectional nature and the uneven number of subjects in each age group. More participants with a more even distribution of subjects across ages, as well as longitudinal data, would be preferable in future study.</p>
</sec>
<sec id="S6" sec-type="conclusion">
<title>Conclusion</title>
<p>The current study reported a preliminary result on language acquisition in Mandarin-speaking children using a syntactically and semantically annotated database&#x2014;CMCL. The replication of findings on sentence complexity and lexical diversity in typical language acquisition confirms the quality of the language sample data in the corpus for studying child language acquisition. Coherent findings in the acquisition of semantic content categories also appear to suggest that the acquisition trajectory of semantic content categories in Mandarin-speaking children mostly resembles that of English-speaking peers. Notably, the acquisition trend is largely explained by the cognitive and syntactic complexity, with additional influence from the language specific properties and cultural specific factors in Mandarin. In addition, with the tags in both part-of-speech and semantic content category, the CMCL does not only contributes additional perspective in studying child language from the semantic domain, but provides a useful platform for examining the interaction between language form and semantic content in early child language acquisition, which also implies significantly on the theoretical and clinical ground.</p>
</sec>
<sec id="S7" sec-type="data-availability">
<title>Data availability statement</title>
<p>The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.</p>
</sec>
<sec id="S8" sec-type="ethics-statement">
<title>Ethics statement</title>
<p>The studies involving humans were approved by The Human Subjects Ethics Sub-Committee of The Hong Kong Polytechnic University (Reference Number: HSEARS20191004001). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants&#x2019; legal guardians/next of kin.</p>
</sec>
<sec id="S9" sec-type="author-contributions">
<title>Author contributions</title>
<p>TT and DL contributed to the conception and design of the study, discussed the results, and contributed to the final manuscript. DL and M-TL supervised the whole project. TT conducted the experiment, organized the database and performed the statistical analysis, and wrote the first draft of the manuscript. All authors contributed to manuscript revision, read, and approved the submitted version.</p>
</sec>
</body>
<back>
<sec id="S10" sec-type="COI-statement">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="S11" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="S12" sec-type="supplementary-material">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fpsyg.2023.1234525/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fpsyg.2023.1234525/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table_1.docx" id="TS1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_2.docx" id="TS2" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Alink</surname> <given-names>L. R.</given-names></name> <name><surname>Mesman</surname> <given-names>J.</given-names></name> <name><surname>Van Zeijl</surname> <given-names>J.</given-names></name> <name><surname>Stolk</surname> <given-names>M. N.</given-names></name> <name><surname>Juffer</surname> <given-names>F.</given-names></name> <name><surname>Koot</surname> <given-names>H. M.</given-names></name><etal/></person-group> (<year>2006</year>). <article-title>The early childhood aggression curve: Development of physical aggression in 10-to 50-month-old children.</article-title> <source><italic>Child Dev.</italic></source> <volume>77</volume> <fpage>954</fpage>&#x2013;<lpage>966</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-8624.2006.00912.x</pub-id> <pub-id pub-id-type="pmid">16942499</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arndt</surname> <given-names>K. B.</given-names></name> <name><surname>Schuele</surname> <given-names>C. M.</given-names></name></person-group> (<year>2013</year>). <article-title>Multiclausal utterances aren&#x2019;t just for big kids: A framework for analysis of complex syntax production in spoken language of preschool-and early school-age children.</article-title> <source><italic>Top. Lang. Disord.</italic></source> <volume>33</volume> <fpage>125</fpage>&#x2013;<lpage>139</lpage>.</citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Biber</surname> <given-names>D.</given-names></name> <name><surname>Conrad</surname> <given-names>S.</given-names></name> <name><surname>Reppen</surname> <given-names>R.</given-names></name></person-group> (<year>1998</year>). <source><italic>Corpus linguistics: Investigating language structure and use.</italic></source> <publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>.</citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bleses</surname> <given-names>D.</given-names></name> <name><surname>Vach</surname> <given-names>W.</given-names></name> <name><surname>Slott</surname> <given-names>M.</given-names></name> <name><surname>Wehberg</surname> <given-names>S.</given-names></name> <name><surname>Thomsen</surname> <given-names>P.</given-names></name> <name><surname>Madsen</surname> <given-names>T. O.</given-names></name><etal/></person-group> (<year>2008</year>). <article-title>Early vocabulary development in Danish and other languages: A CDI-based comparison.</article-title> <source><italic>J. Child Lang.</italic></source> <volume>35</volume> <fpage>619</fpage>&#x2013;<lpage>650</lpage>. <pub-id pub-id-type="doi">10.1017/S0305000908008714</pub-id> <pub-id pub-id-type="pmid">18588717</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bloom</surname> <given-names>L.</given-names></name></person-group> (<year>1970</year>). <source><italic>Language development; Form and function in emerging grammars.</italic></source> <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>.</citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bloom</surname> <given-names>L.</given-names></name></person-group> (<year>1991</year>). <source><italic>Language development from two to three.</italic></source> <publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>.</citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bloom</surname> <given-names>L.</given-names></name> <name><surname>Lahey</surname> <given-names>M.</given-names></name></person-group> (<year>1978</year>). <source><italic>Language development and language disorders.</italic></source> <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Wiley</publisher-name>.</citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bowerman</surname> <given-names>M.</given-names></name></person-group> (<year>1973</year>). &#x201C;<article-title>Structural relationships in children&#x2019;s utterances: Syntactic or semantic?</article-title>,&#x201D; in <source><italic>Cognitive development and acquisition of language</italic></source>, <role>ed.</role> <person-group person-group-type="editor"><name><surname>Moore</surname> <given-names>T. E.</given-names></name></person-group> (<publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>Academic Press</publisher-name>), <fpage>197</fpage>&#x2013;<lpage>213</lpage>.</citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brown</surname> <given-names>R.</given-names></name></person-group> (<year>1973</year>). <source><italic>A first language: The early stages.</italic></source> <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>Harvard University Press</publisher-name>.</citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chang</surname> <given-names>H. W.</given-names></name></person-group> (<year>1992</year>). &#x201C;<article-title>The acquisition of Chinese syntax</article-title>,&#x201D; in <source><italic>Advances in psychology</italic></source>, <role>eds</role> <person-group person-group-type="editor"><name><surname>Chen</surname> <given-names>H. C.</given-names></name> <name><surname>Tzeng</surname> <given-names>O. J. L.</given-names></name></person-group>, <volume>Vol. 90</volume> (<publisher-loc>North-Holland</publisher-loc>: <publisher-name>Oxford</publisher-name>), <fpage>277</fpage>&#x2013;<lpage>311</lpage>.</citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>J.</given-names></name> <name><surname>Shirai</surname> <given-names>Y.</given-names></name></person-group> (<year>2015</year>). <article-title>The acquisition of relative clauses in spontaneous child speech in Mandarin Chinese.</article-title> <source><italic>J. Child Lang.</italic></source> <volume>42</volume> <fpage>394</fpage>&#x2013;<lpage>422</lpage>.</citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cheung</surname> <given-names>H.</given-names></name></person-group> (<year>1998</year>). &#x201C;<article-title>Utterance length and the development of Mandarin Chinese</article-title>,&#x201D; in <source><italic>Paper presented at The first Asia pacific conference on speech language and hearing</italic></source>, <publisher-loc>Hong Kong</publisher-loc>.</citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Crowe</surname> <given-names>K.</given-names></name> <name><surname>McLeod</surname> <given-names>S.</given-names></name></person-group> (<year>2020</year>). <article-title>Children&#x2019;s English consonant acquisition in the United States: A review.</article-title> <source><italic>Am. J. Speech Lang. Pathol.</italic></source> <volume>29</volume> <fpage>2155</fpage>&#x2013;<lpage>2169</lpage>.</citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Crystal</surname> <given-names>D.</given-names></name> <name><surname>Fletcher</surname> <given-names>P.</given-names></name> <name><surname>Garman</surname> <given-names>M.</given-names></name></person-group> (<year>1989</year>). <source><italic>Grammatical analysis of language disability</italic></source>, <edition>2nd Edn</edition>. <publisher-loc>London</publisher-loc>: <publisher-name>Cole and Whurr</publisher-name>.</citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Villiers</surname> <given-names>J. G.</given-names></name> <name><surname>de Villiers</surname> <given-names>P. A.</given-names></name></person-group> (<year>1973</year>). <article-title>A cross-sectional study of the acquisition of grammatical morphemes in child speech.</article-title> <source><italic>J. Psycholinguist. Res.</italic></source> <volume>2</volume> <fpage>267</fpage>&#x2013;<lpage>278</lpage>.</citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Demuth</surname> <given-names>K.</given-names></name> <name><surname>McCullough</surname> <given-names>E.</given-names></name></person-group> (<year>2009</year>). <article-title>The acquisition of auxiliary syntax: A longitudinal elicitation study.</article-title> <source><italic>J. Child Lang.</italic></source> <volume>36</volume> <fpage>83</fpage>&#x2013;<lpage>108</lpage>.</citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Deng</surname> <given-names>X.</given-names></name> <name><surname>Yip</surname> <given-names>V.</given-names></name></person-group> (<year>2018</year>). <article-title>An event-semantic account of spatial modification in the acquisition of Mandarin.</article-title> <source><italic>J. Child Lang.</italic></source> <volume>45</volume> <fpage>829</fpage>&#x2013;<lpage>857</lpage>. <pub-id pub-id-type="doi">10.1017/S0305000917000496</pub-id> <pub-id pub-id-type="pmid">29444722</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Deng</surname> <given-names>X.</given-names></name> <name><surname>Mai</surname> <given-names>Z.</given-names></name> <name><surname>Yip</surname> <given-names>V.</given-names></name></person-group> (<year>2018</year>). <article-title>An aspectual account of ba and bei constructions in child Mandarin.</article-title> <source><italic>First Lang.</italic></source> <volume>38</volume> <fpage>243</fpage>&#x2013;<lpage>262</lpage>.</citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Diessel</surname> <given-names>H.</given-names></name></person-group> (<year>2004</year>). <source><italic>The acquisition of complex sentences.</italic></source> <publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>.</citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Erbaugh</surname> <given-names>M.</given-names></name></person-group> (<year>1992</year>). &#x201C;<article-title>The acquisition of Mandarin</article-title>,&#x201D; in <source><italic>The crosslinguistic study of language acquisition</italic></source> (<volume>Vol. 3</volume>). <role>ed.</role> <person-group person-group-type="editor"><name><surname>Slobin</surname> <given-names>D. I.</given-names></name></person-group> (<publisher-loc>Hillsdale, NJ</publisher-loc>: <publisher-name>Lawrence Erlbaum Associates</publisher-name>), <fpage>373</fpage>&#x2013;<lpage>455</lpage>.</citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Evans</surname> <given-names>J. L.</given-names></name> <name><surname>Craig</surname> <given-names>H. K.</given-names></name></person-group> (<year>1992</year>). <article-title>Language sample collection and analysis: Interview compared to freeplay assessment contexts.</article-title> <source><italic>J. Speech Lang. Hear. Res.</italic></source> <volume>35</volume> <fpage>343</fpage>&#x2013;<lpage>353</lpage>. <pub-id pub-id-type="doi">10.1044/jshr.3502.343</pub-id> <pub-id pub-id-type="pmid">1573874</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Evers-Vermeul</surname> <given-names>J.</given-names></name> <name><surname>Sanders</surname> <given-names>T.</given-names></name></person-group> (<year>2011</year>). <article-title>Discovering domains - on the acquisition of causal connectives.</article-title> <source><italic>J. Pragmat.</italic></source> <volume>43</volume> <fpage>1645</fpage>&#x2013;<lpage>1662</lpage>.</citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fang</surname> <given-names>H.</given-names></name> <name><surname>Hengeveld</surname> <given-names>K.</given-names></name></person-group> (<year>2022</year>). <article-title>Sentence-final particles in Mandarin.</article-title> <source><italic>Stud. Linguist.</italic></source> <volume>76</volume> <fpage>873</fpage>&#x2013;<lpage>913</lpage>.</citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fenson</surname> <given-names>L.</given-names></name> <name><surname>Dale</surname> <given-names>P.</given-names></name> <name><surname>Reznick</surname> <given-names>J. S.</given-names></name> <name><surname>Thal</surname> <given-names>D.</given-names></name> <name><surname>Bates</surname> <given-names>E.</given-names></name> <name><surname>Hartung</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>1993</year>). <source><italic>The MacArthur communicative development inventories: User&#x2019;s guide and technical manual.</italic></source> <publisher-loc>San Diego, CA</publisher-loc>: <publisher-name>Singular Publishing Group</publisher-name>.</citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Freedman</surname> <given-names>P. P.</given-names></name> <name><surname>Carpenter</surname> <given-names>R. L.</given-names></name></person-group> (<year>1976</year>). <article-title>Semantic relations used by normal and language-impaired children at stage I.</article-title> <source><italic>J. Speech Hear. Res.</italic></source> <volume>19</volume> <fpage>784</fpage>&#x2013;<lpage>795</lpage>. <pub-id pub-id-type="doi">10.1044/jshr.1904.784</pub-id> <pub-id pub-id-type="pmid">1003956</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gentner</surname> <given-names>D.</given-names></name></person-group> (<year>1982</year>). <source><italic>Why nouns are learned before verbs: Linguistic relativity versus natural partitioning. Center for the Study of Reading Technical Report; No. 257.</italic></source> <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>Bolt Beranek and Newman, Inc</publisher-name>.</citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goodglass</surname> <given-names>H.</given-names></name> <name><surname>Kaplan</surname> <given-names>E.</given-names></name></person-group> (<year>1972</year>). <source><italic>Boston Diagnostic Aphasia Examination (BDAE)</italic>.</source> <publisher-loc>Philadelphia, PA</publisher-loc>: <publisher-name>Lea and Febiger</publisher-name>.</citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Green</surname> <given-names>M.</given-names></name></person-group> (<year>1975</year>). <article-title>Semantic acquisition in language impaired and normal speaking four year old children.</article-title> <source><italic>S. Afric. J. Commun. Disord.</italic></source> <volume>22</volume> <fpage>3</fpage>&#x2013;<lpage>21</lpage>.</citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hao</surname> <given-names>M.</given-names></name> <name><surname>Shu</surname> <given-names>H.</given-names></name> <name><surname>Xing</surname> <given-names>A.</given-names></name> <name><surname>Li</surname> <given-names>P.</given-names></name></person-group> (<year>2008</year>). <article-title>Early vocabulary inventory for Mandarin Chinese.</article-title> <source><italic>Behav. Res. Methods</italic></source> <volume>40</volume> <fpage>728</fpage>&#x2013;<lpage>733</lpage>.</citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Herrmann</surname> <given-names>D. J.</given-names></name> <name><surname>Raybeck</surname> <given-names>D.</given-names></name></person-group> (<year>1981</year>). <article-title>Similarities and differences in meaning in six cultures.</article-title> <source><italic>Journal of Cross Cult. Psychol.</italic></source> <volume>12</volume> <fpage>194</fpage>&#x2013;<lpage>206</lpage>.</citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Horvath</surname> <given-names>S.</given-names></name> <name><surname>Kueser</surname> <given-names>J. B.</given-names></name> <name><surname>Kelly</surname> <given-names>J.</given-names></name> <name><surname>Borovsky</surname> <given-names>A.</given-names></name></person-group> (<year>2022</year>). <article-title>Difference or delay? Syntax, semantics, and verb vocabulary development in typically developing and late-talking toddlers.</article-title> <source><italic>Lang. Learn. Dev.</italic></source> <volume>18</volume> <fpage>352</fpage>&#x2013;<lpage>376</lpage>.</citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hsu</surname> <given-names>D. B.</given-names></name></person-group> (<year>2014</year>). <article-title>Mandarin-speaking three-year-olds&#x2019; demonstration of productive knowledge of syntax: Evidence from syntactic productivity and structural priming with the SVO-ba alternation.</article-title> <source><italic>J. Child Lang.</italic></source> <volume>41</volume> <fpage>1115</fpage>&#x2013;<lpage>1146</lpage>. <pub-id pub-id-type="doi">10.1017/S0305000913000408</pub-id> <pub-id pub-id-type="pmid">24160845</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>R. L.</given-names></name> <name><surname>Fletcher</surname> <given-names>P.</given-names></name> <name><surname>Zhang</surname> <given-names>Z.</given-names></name> <name><surname>Liang</surname> <given-names>W.</given-names></name> <name><surname>Marchman</surname> <given-names>V.</given-names></name> <name><surname>Tardif</surname> <given-names>T.</given-names></name></person-group> (<year>2022</year>). <article-title>Early grammatical marking development in Mandarin-speaking toddlers.</article-title> <source><italic>Dev. Psychol.</italic></source> <volume>58</volume> <issue>631</issue>. <pub-id pub-id-type="doi">10.1037/dev0001316</pub-id> <pub-id pub-id-type="pmid">35343712</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jia</surname> <given-names>G.</given-names></name> <name><surname>Fuse</surname> <given-names>A.</given-names></name></person-group> (<year>2007</year>). <article-title>Acquisition of English grammatical morphology by native Mandarin-Speaking children and adolescents: Age-related differences.</article-title> <source><italic>J. Speech Lang. Hear. Res.</italic></source> <volume>50</volume> <fpage>1280</fpage>&#x2013;<lpage>1299</lpage>. <pub-id pub-id-type="doi">10.1044/1092-4388(2007/090)</pub-id> <pub-id pub-id-type="pmid">17905912</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jim&#x00E9;nez</surname> <given-names>E.</given-names></name> <name><surname>Haebig</surname> <given-names>E.</given-names></name> <name><surname>Hills</surname> <given-names>T. T.</given-names></name></person-group> (<year>2021</year>). <article-title>Identifying areas of overlap and distinction in early lexical profiles of children with autism spectrum disorder, late talkers, and typical talkers.</article-title> <source><italic>J. Autism Dev. Disorde.</italic></source> <volume>51</volume> <fpage>3109</fpage>&#x2013;<lpage>3125</lpage>. <pub-id pub-id-type="doi">10.1007/s10803-020-04772-1</pub-id> <pub-id pub-id-type="pmid">33156473</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jin</surname> <given-names>Z. J.</given-names></name> <name><surname>Jin</surname> <given-names>X. M.</given-names></name></person-group> (<year>2008</year>). <article-title>Research on mean length of utterance and lexical diversity in preschool Mandarin-speaking children.</article-title> <source><italic>Chin. J. Evid. Based Pediatr.</italic></source> <volume>3</volume> <fpage>261</fpage>&#x2013;<lpage>266</lpage>. [In Chinese: &#x91D1;&#x5FD7;&#x5A1F;, and &#x91D1;&#x661F;&#x660E;. (<year>2008</year>). &#x5B66;&#x9F84;&#x524D;&#x513F;&#x7AE5;&#x666E;&#x901A;&#x8BDD;&#x5E73;&#x5747;&#x53E5;&#x5B50;&#x957F;&#x5EA6;&#x548C;&#x8BCD;&#x6C47;&#x5E7F;&#x5EA6;&#x7814;&#x7A76;. <italic>&#x4E2D;&#x56FD;&#x5FAA;&#x8BC1;&#x513F;&#x79D1;&#x6742;&#x5FD7;</italic>, <volume>3</volume>(<issue>4</issue>), <fpage>261</fpage>&#x2013;<lpage>266</lpage>].</citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Johnston</surname> <given-names>J. R.</given-names></name></person-group> (<year>1985</year>). &#x201C;<article-title>Cognitive prerequisites: The evidence from children learning English</article-title>,&#x201D; in <source><italic>The crosslinguistic study of language acquisition, Vol. 1. The data; Vol. 2. Theoretical issues</italic></source>, <role>ed.</role> <person-group person-group-type="editor"><name><surname>Slobin</surname> <given-names>D. I.</given-names></name></person-group> (<publisher-loc>Mahwah, NJ</publisher-loc>: <publisher-name>Lawrence Erlbaum Associates, Inc</publisher-name>), <fpage>961</fpage>&#x2013;<lpage>1004</lpage>.</citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Klee</surname> <given-names>T</given-names></name> <name><surname>Fitzgerald</surname> <given-names>M. D.</given-names></name></person-group> (<year>1985</year>). <article-title>The relation between grammatical development and mean length of utterance in morphemes.</article-title> <source><italic>J. Child Lang.</italic></source> <volume>12</volume> <fpage>251</fpage>&#x2013;<lpage>269</lpage>.</citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Klee</surname> <given-names>T.</given-names></name></person-group> (<year>1992</year>). <article-title>Developmental and diagnostic characteristics of quantitative measures of children&#x2019;s language production.</article-title> <source><italic>Top. Lang. Disord.</italic></source> <volume>12</volume> <fpage>28</fpage>&#x2013;<lpage>41</lpage>.</citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Klee</surname> <given-names>T.</given-names></name> <name><surname>Stokes</surname> <given-names>S. F.</given-names></name> <name><surname>Wong</surname> <given-names>A. M. Y.</given-names></name> <name><surname>Fletcher</surname> <given-names>P.</given-names></name> <name><surname>Gavin</surname> <given-names>W. J.</given-names></name></person-group> (<year>2004</year>). <article-title>Utterance length and lexical diversity in Cantonese-speaking children with and without specific language impairment.</article-title> <source><italic>J. Speech Lang. Hear. Res.</italic></source> <volume>47</volume> <fpage>1396</fpage>&#x2013;<lpage>1410</lpage>. <pub-id pub-id-type="doi">10.1044/1092-4388(2004/104)</pub-id> <pub-id pub-id-type="pmid">15842018</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Klintfors</surname> <given-names>E.</given-names></name> <name><surname>Lacerda</surname> <given-names>F.</given-names></name> <name><surname>Sundberg</surname> <given-names>U.</given-names></name></person-group> (<year>2009</year>). <article-title>Major parts-of-speech in child language&#x2013;division in open and close class words</article-title>. <source><italic>FONETIK</italic></source>, 126.</citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lahey</surname> <given-names>M.</given-names></name></person-group> (<year>1988</year>). <source><italic>Language disorders and language development.</italic></source> <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Macmillan</publisher-name>.</citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lau</surname> <given-names>D. K. Y.</given-names></name> <name><surname>Tang</surname> <given-names>T. P. Y.</given-names></name> <name><surname>Wong</surname> <given-names>C. C. Y.</given-names></name> <name><surname>Yau</surname> <given-names>T. S. K.</given-names></name> <name><surname>Lam</surname> <given-names>I. H. C.</given-names></name></person-group> (<year>2023</year>). <article-title>Systematic language input improved productions of elaborated verb phrases of Cantonese-speaking children with language difficulties.</article-title> <source><italic>Clin. Linguist. Phonet.</italic></source> <volume>37</volume> <fpage>473</fpage>&#x2013;<lpage>490</lpage>. <pub-id pub-id-type="doi">10.1080/02699206.2022.2157330</pub-id> <pub-id pub-id-type="pmid">36592049</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>H. T.</given-names></name> <name><surname>Wong</surname> <given-names>C.</given-names></name></person-group> (<year>1998</year>). <article-title>Cancorp: The Hong Kong Cantonese child language corpus.</article-title> <source><italic>Cahiers Linguistique Asie Orientale</italic></source> <volume>27</volume> <fpage>211</fpage>&#x2013;<lpage>228</lpage>.</citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>C.</given-names></name> <name><surname>Thompson</surname> <given-names>S.</given-names></name></person-group> (<year>1981</year>). <source><italic>Mandarin Chinese: A functional reference grammar.</italic></source> <publisher-loc>Berkeley, CA</publisher-loc>: <publisher-name>University of California Press</publisher-name>.</citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Zhou</surname> <given-names>J.</given-names></name></person-group> (<year>2015</year>). <source><italic>Study on dinner table talk of preschool children family in Shanghai.</italic> [Unpublished Master dissertation]</source>. <publisher-loc>Shanghai</publisher-loc>: <publisher-name>East China Normal University</publisher-name>.</citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>L.</given-names></name> <name><surname>Zhou</surname> <given-names>J.</given-names></name></person-group> (<year>2008</year>). <source><italic>Meta communication development of children aged from three to six during their collaborative pretend play</italic></source>. <comment>[Unpublished Master dissertation]</comment>. <publisher-loc>Hong Kong</publisher-loc>: <publisher-name>The University of Hong Kong</publisher-name>.</citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>Y.</given-names></name></person-group> (<year>2004</year>). <source><italic>The development of child language.</italic></source> <publisher-loc>Wuhan</publisher-loc>: <publisher-name>Huazhong Normal University Press</publisher-name>.</citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lieven</surname> <given-names>E. V. M.</given-names></name> <name><surname>Pine</surname> <given-names>J. M.</given-names></name> <name><surname>Barnes</surname> <given-names>H. D.</given-names></name></person-group> (<year>1992</year>). <article-title>Individual differences in early vocabulary development: Redefining the referential-expressive distinction.</article-title> <source><italic>J. Child Lang.</italic></source> <volume>19</volume> <fpage>287</fpage>&#x2013;<lpage>310</lpage>. <pub-id pub-id-type="doi">10.1017/s0305000900011429</pub-id> <pub-id pub-id-type="pmid">1527204</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>S.</given-names></name></person-group> (<year>2007</year>). <source><italic>Early vocabulary development in English, Mandarin, and Cantonese: A cross-linguistic study based on CHILDES. Paper 705. Master&#x2019;s Theses.</italic></source> <publisher-loc>Richmond, VA</publisher-loc>: <publisher-name>University of Richmond</publisher-name>.</citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lu</surname> <given-names>X.</given-names></name></person-group> (<year>2009</year>). <article-title>Automatic measurement of syntactic complexity in child language acquisition.</article-title> <source><italic>Int. J. Corpus Linguist.</italic></source> <volume>14</volume> <fpage>3</fpage>&#x2013;<lpage>28</lpage>.</citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ma</surname> <given-names>W.</given-names></name> <name><surname>Golinkoff</surname> <given-names>R.</given-names></name> <name><surname>Hirsh-Pasek</surname> <given-names>K.</given-names></name> <name><surname>Mcdonough</surname> <given-names>C.</given-names></name> <name><surname>Tardif</surname> <given-names>T.</given-names></name></person-group> (<year>2009</year>). <article-title>Image ability predicts the age of acquisition of verbs in Chinese children.</article-title> <source><italic>J. Child Lang.</italic></source> <volume>36</volume> <fpage>405</fpage>&#x2013;<lpage>423</lpage>. <pub-id pub-id-type="doi">10.1017/S0305000908009008</pub-id> <pub-id pub-id-type="pmid">18937878</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>MacWhinney</surname> <given-names>B.</given-names></name></person-group> (<year>2000</year>). <source><italic>The CHILDES project: Tools for analyzing talk</italic></source>, <edition>3rd Edn</edition>. <publisher-loc>Mahwah, NJ</publisher-loc>: <publisher-name>Lawrence Erlbaum Associates</publisher-name>.</citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Malvern</surname> <given-names>D.</given-names></name> <name><surname>Richards</surname> <given-names>B.</given-names></name></person-group> (<year>2002</year>). <article-title>Investigating accommodation in language proficiency interviews using a new measure of lexical diversity.</article-title> <source><italic>Lang. Test.</italic></source> <volume>19</volume> <fpage>85</fpage>&#x2013;<lpage>104</lpage>.</citation></ref>
<ref id="B55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maslen</surname> <given-names>R. J. C.</given-names></name> <name><surname>Theakston</surname> <given-names>A. L.</given-names></name> <name><surname>Lieven</surname> <given-names>E. V. M.</given-names></name> <name><surname>Tomasello</surname> <given-names>M.</given-names></name></person-group> (<year>2004</year>). <article-title>A dense corpus study of past tense and plural overregularization in English.</article-title> <source><italic>J. Speech Lang. Hear. Res.</italic></source> <volume>47</volume> <fpage>1319</fpage>&#x2013;<lpage>1333</lpage>. <pub-id pub-id-type="doi">10.1044/1092-4388(2004/099)</pub-id> <pub-id pub-id-type="pmid">15842013</pub-id></citation></ref>
<ref id="B56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Miller</surname> <given-names>J.F.</given-names></name></person-group> (<year>1981</year>). <source><italic>Assessing language production in children.</italic></source> <publisher-loc>Baltimore, MD</publisher-loc>: <publisher-name>University Park Press</publisher-name>.</citation></ref>
<ref id="B57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Miller</surname> <given-names>J.</given-names></name></person-group> (<year>1991</year>). &#x201C;<article-title>Quantifying productive language disorders</article-title>,&#x201D; in <source><italic>Research on Child Language Disorders: A Decade, of Progress</italic></source>, <role>ed.</role> <person-group person-group-type="editor"><name><surname>Miller</surname> <given-names>J.</given-names></name></person-group> (<publisher-loc>Austin</publisher-loc>: <publisher-name>Pro-Ed</publisher-name>).</citation></ref>
<ref id="B58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mok</surname> <given-names>Z.</given-names></name> <name><surname>Kipka</surname> <given-names>P. F.</given-names></name></person-group> (<year>2009</year>). <article-title>Lexical-semantic immaturities manifesting as grammatical disorders: Evidence from a child language sample.</article-title> <source><italic>Clin. Linguist. Phonet.</italic></source> <volume>23</volume> <fpage>808</fpage>&#x2013;<lpage>824</lpage>. <pub-id pub-id-type="doi">10.3109/02699200903242970</pub-id> <pub-id pub-id-type="pmid">19891522</pub-id></citation></ref>
<ref id="B59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Moyle</surname> <given-names>M.</given-names></name> <name><surname>Karasinski</surname> <given-names>C.</given-names></name> <name><surname>Weismer</surname> <given-names>S.</given-names></name> <name><surname>Gorman</surname> <given-names>B.</given-names></name></person-group> (<year>2011</year>). <article-title>Grammatical morphology in school-age children with and without language impairment: A discriminant function analysis.</article-title> <source><italic>Lang. Speech Hear. Serv. Schl.</italic></source> <volume>42</volume> <fpage>550</fpage>&#x2013;<lpage>560</lpage>. <pub-id pub-id-type="doi">10.1044/0161-1461(2011/10-0029)</pub-id> <pub-id pub-id-type="pmid">21969530</pub-id></citation></ref>
<ref id="B60"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Owen</surname> <given-names>A. J.</given-names></name> <name><surname>Leonard</surname> <given-names>L. B.</given-names></name></person-group> (<year>2002</year>). <article-title>Lexical diversity in the spontaneous speech of children with specific language impairment: Application of D.</article-title> <source><italic>J. Speech Lang. Hear. Res.</italic></source> <volume>45</volume> <fpage>927</fpage>&#x2013;<lpage>937</lpage>. <pub-id pub-id-type="doi">10.1044/1092-4388(2002/075)</pub-id> <pub-id pub-id-type="pmid">12381050</pub-id></citation></ref>
<ref id="B61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Owens</surname> <given-names>R. E.</given-names></name></person-group> (<year>2010</year>). <source><italic>Language disorders: A functional approach to assessment and intervention</italic></source>, <edition>5th Edn</edition>. <publisher-loc>Boston</publisher-loc>: <publisher-name>Pearson/Allyn and Bacon</publisher-name>.</citation></ref>
<ref id="B62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Paul</surname> <given-names>R.</given-names></name></person-group> (<year>2000</year>). <source><italic>Language disorders from infancy through adolescents</italic></source>, <edition>2nd Edn</edition>. <publisher-loc>Saint Louis, MO</publisher-loc>: <publisher-name>Mosby-Year Book</publisher-name>.</citation></ref>
<ref id="B63"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>P&#x00E9;rez-Leroux</surname> <given-names>A. T.</given-names></name> <name><surname>Pirvulescu</surname> <given-names>M.</given-names></name> <name><surname>Roberge</surname> <given-names>Y.</given-names></name></person-group> (<year>2008</year>). <article-title>Null objects in child language: Syntax and the lexicon.</article-title> <source><italic>Lingua</italic></source> <volume>118</volume> <fpage>370</fpage>&#x2013;<lpage>398</lpage>. <pub-id pub-id-type="doi">10.1016/0010-0277(93)90021-m</pub-id> <pub-id pub-id-type="pmid">8432089</pub-id></citation></ref>
<ref id="B64"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Piaget</surname> <given-names>J.</given-names></name></person-group> (<year>2000</year>). <source><italic>The construction of reality in the child.</italic></source> <publisher-loc>London</publisher-loc>: <publisher-name>Routledge</publisher-name>.</citation></ref>
<ref id="B65"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Raybeck</surname> <given-names>D.</given-names></name> <name><surname>Herrmann</surname> <given-names>D.</given-names></name></person-group> (<year>1990</year>). <article-title>A cross-cultural examination of semantic relations.</article-title> <source><italic>J. Cross Cult. Psychol.</italic></source> <volume>21</volume> <fpage>452</fpage>&#x2013;<lpage>473</lpage>.</citation></ref>
<ref id="B66"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Raybeck</surname> <given-names>D.</given-names></name> <name><surname>Herrmann</surname> <given-names>D.</given-names></name></person-group> (<year>1996</year>). <article-title>Antonymy and semantic relations: The case for a linguistic universal.</article-title> <source><italic>Cross Cult. Res.</italic></source> <volume>30</volume> <fpage>154</fpage>&#x2013;<lpage>183</lpage>.</citation></ref>
<ref id="B67"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rice</surname> <given-names>M. L.</given-names></name> <name><surname>Smolik</surname> <given-names>F.</given-names></name> <name><surname>Perpich</surname> <given-names>D.</given-names></name> <name><surname>Thompson</surname> <given-names>T.</given-names></name> <name><surname>Rytting</surname> <given-names>N.</given-names></name> <name><surname>Blossom</surname> <given-names>M.</given-names></name></person-group> (<year>2010</year>). <article-title>Mean length of utterance levels in 6-month intervals for children 3 to 9 years with and without language impairments.</article-title> <source><italic>J. Speech Lang. Hear. Res.</italic></source> <volume>53</volume> <fpage>333</fpage>&#x2013;<lpage>349</lpage>. <pub-id pub-id-type="doi">10.1044/1092-4388(2009/08-0183)</pub-id> <pub-id pub-id-type="pmid">20360460</pub-id></citation></ref>
<ref id="B68"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Russell</surname> <given-names>J. A.</given-names></name></person-group> (<year>1991</year>). <article-title>Culture and the categorization of emotions.</article-title> <source><italic>Psychol. Bull.</italic></source> <volume>110</volume>:<issue>426</issue>.</citation></ref>
<ref id="B69"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sagae</surname> <given-names>K.</given-names></name> <name><surname>Davis</surname> <given-names>E.</given-names></name> <name><surname>Lavie</surname> <given-names>A.</given-names></name> <name><surname>MacWhinney</surname> <given-names>B.</given-names></name> <name><surname>Wintner</surname> <given-names>S.</given-names></name></person-group> (<year>2010</year>). <article-title>Morphosyntactic annotation of CHILDES transcripts.</article-title> <source><italic>J. Child Lang.</italic></source> <volume>37</volume> <fpage>705</fpage>&#x2013;<lpage>729</lpage>. <pub-id pub-id-type="doi">10.1017/S0305000909990407</pub-id> <pub-id pub-id-type="pmid">20334720</pub-id></citation></ref>
<ref id="B70"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Scarborough</surname> <given-names>H. S.</given-names></name></person-group> (<year>1985</year>). &#x201C;<article-title>Measuring syntactic development: Index of productive syntax</article-title>,&#x201D; in <source><italic>Paper presented to the Society for Research in Child Development</italic></source>, <publisher-loc>Toronto, ON</publisher-loc>.</citation></ref>
<ref id="B71"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shi</surname> <given-names>J.</given-names></name> <name><surname>Zhou</surname> <given-names>P.</given-names></name></person-group> (<year>2018</year>). <article-title>How possessive relations are mapped onto child language: A view from Mandarin Chinese.</article-title> <source><italic>J. Psycholinguist. Res.</italic></source> <volume>47</volume> <fpage>1321</fpage>&#x2013;<lpage>1341</lpage>. <pub-id pub-id-type="doi">10.1007/s10936-017-9521-z</pub-id> <pub-id pub-id-type="pmid">28929452</pub-id></citation></ref>
<ref id="B72"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Slobin</surname> <given-names>D. I.</given-names></name></person-group> (<year>1973</year>). &#x201C;<article-title>Cognitive prerequisites for the development of grammar</article-title>,&#x201D; in <source><italic>Studies of child language development</italic></source>, <role>eds</role> <person-group person-group-type="editor"><name><surname>Ferguson</surname> <given-names>C. A.</given-names></name> <name><surname>Slobin</surname> <given-names>D. I.</given-names></name></person-group> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Holt, Rinehart, and Winston</publisher-name>), <fpage>175</fpage>&#x2013;<lpage>208</lpage>. <pub-id pub-id-type="doi">10.3389/fnevo.2012.00003</pub-id> <pub-id pub-id-type="pmid">22347185</pub-id></citation></ref>
<ref id="B73"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Slobin</surname> <given-names>D. I.</given-names></name></person-group> (<year>1985</year>). &#x201C;<article-title>Crosslinguistic evidence for the language-making capacity</article-title>,&#x201D; in <source><italic>The crosslinguistic study of language acquisition</italic></source>, <volume>Vol. 2</volume> <role>ed.</role> <person-group person-group-type="editor"><name><surname>Slobin</surname> <given-names>D. I.</given-names></name></person-group> (<publisher-loc>Hillsdale, NJ</publisher-loc>: <publisher-name>Lawrence Erlbaum Associates</publisher-name>), <fpage>1157</fpage>&#x2013;<lpage>1249</lpage>. <pub-id pub-id-type="doi">10.3389/fpsyg.2015.00852</pub-id> <pub-id pub-id-type="pmid">26157406</pub-id></citation></ref>
<ref id="B74"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Slobin</surname> <given-names>D. I.</given-names></name></person-group> (<year>2004</year>). &#x201C;<article-title>The many ways to search for a frog: Linguistic typology and the expression of motion events</article-title>,&#x201D; in <source><italic>Typological and contextual perspectives.</italic></source> <publisher-name>Psychology Press</publisher-name>, <fpage>219</fpage>&#x2013;<lpage>257</lpage>.</citation></ref>
<ref id="B75"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Smith</surname> <given-names>A. B.</given-names></name> <name><surname>Jackins</surname> <given-names>M.</given-names></name></person-group> (<year>2014</year>). <article-title>Relationship between longest utterances and later MLU in late talkers.</article-title> <source><italic>Clin. Linguist. Phonet.</italic></source> <volume>28</volume> <fpage>143</fpage>&#x2013;<lpage>152</lpage>. <pub-id pub-id-type="doi">10.3109/02699206.2013.839746</pub-id> <pub-id pub-id-type="pmid">24160753</pub-id></citation></ref>
<ref id="B76"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>So</surname> <given-names>L. K.</given-names></name> <name><surname>Dodd</surname> <given-names>B. J.</given-names></name></person-group> (<year>1995</year>). <article-title>The acquisition of phonology by Cantonese-speaking children.</article-title> <source><italic>J. Child Lang.</italic></source> <volume>22</volume> <fpage>473</fpage>&#x2013;<lpage>495</lpage>.</citation></ref>
<ref id="B77"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stockman</surname> <given-names>I. J.</given-names></name> <name><surname>Vaughn-Cooke</surname> <given-names>F. B.</given-names></name></person-group> (<year>1986</year>). <article-title>Implications of semantic category research for the language assessment of nonstandard speakers.</article-title> <source><italic>Top. Lang. Disord.</italic></source> <volume>6</volume> <fpage>15</fpage>&#x2013;<lpage>26</lpage>.</citation></ref>
<ref id="B78"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stockman</surname> <given-names>I. J.</given-names></name> <name><surname>Vaughn-Cooke</surname> <given-names>F. B.</given-names></name></person-group> (<year>1992</year>). <article-title>Lexical elaboration in children&#x2019;s locative action expressions.</article-title> <source><italic>Child Dev.</italic></source> <volume>63</volume> <fpage>1104</fpage>&#x2013;<lpage>1125</lpage>.</citation></ref>
<ref id="B79"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stockman</surname> <given-names>I.</given-names></name> <name><surname>Vaughn-Cooke</surname> <given-names>F.</given-names></name></person-group> (<year>1982</year>). <article-title>A re-examination of research on the language of black children: The need for a new framework.</article-title> <source><italic>J. Educ.</italic></source> <volume>164</volume> <fpage>157</fpage>&#x2013;<lpage>172</lpage>.</citation></ref>
<ref id="B80"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stoll</surname> <given-names>S.</given-names></name> <name><surname>Bickel</surname> <given-names>B.</given-names></name> <name><surname>Lieven</surname> <given-names>E.</given-names></name> <name><surname>Paudyal</surname> <given-names>N. P.</given-names></name> <name><surname>Banjade</surname> <given-names>G.</given-names></name> <name><surname>Bhatta</surname> <given-names>T. N.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Nouns and verbs in Chintang: Children&#x2019;s usage and surrounding adult speech.</article-title> <source><italic>J. Child Lang.</italic></source> <volume>39</volume> <fpage>284</fpage>&#x2013;<lpage>321</lpage>. <pub-id pub-id-type="doi">10.1017/S0305000911000080</pub-id> <pub-id pub-id-type="pmid">21854689</pub-id></citation></ref>
<ref id="B81"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tang</surname> <given-names>P.Y.</given-names></name> <name><surname>Hsu</surname> <given-names>Y.Y.</given-names></name> <name><surname>Lau</surname> <given-names>K.Y.</given-names></name> <name><surname>Leung</surname> <given-names>M.T.</given-names></name></person-group> (<year>2021</year>). &#x201C;<article-title>Interaction between form and content in the acquisition of temporal expressions among Mandarin-speakin children</article-title>,&#x201D; in <source><italic>Poster presented at the IASCL 2021</italic></source>, <publisher-loc>Philadelphia</publisher-loc>.</citation></ref>
<ref id="B82"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tardif</surname> <given-names>T.</given-names></name></person-group> (<year>1996</year>). <article-title>Nouns are not always learned before verbs: Evidence from Mandarin speakers&#x2019; early vocabularies.</article-title> <source><italic>Dev. Psychol.</italic></source> <volume>32</volume>:<issue>492</issue>.</citation></ref>
<ref id="B83"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tardif</surname> <given-names>T.</given-names></name> <name><surname>Fletcher</surname> <given-names>P.</given-names></name> <name><surname>Liang</surname> <given-names>W.</given-names></name> <name><surname>Kaciroti</surname> <given-names>N.</given-names></name></person-group> (<year>2009</year>). <article-title>Early vocabulary development in Mandarin (Putonghua) and Cantonese.</article-title> <source><italic>J. Child Lang.</italic></source> <volume>36</volume> <fpage>1115</fpage>&#x2013;<lpage>1144</lpage>.</citation></ref>
<ref id="B84"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tremblay</surname> <given-names>R. E.</given-names></name></person-group> (<year>2003</year>). &#x201C;<article-title>Why socialization fails. The case of chronic physical aggression</article-title>,&#x201D; in <source><italic>Causes of conduct disorder and juvenile delinquency</italic></source>, <role>eds</role> <person-group person-group-type="editor"><name><surname>Lahey</surname> <given-names>B. B.</given-names></name> <name><surname>Moffitt</surname> <given-names>T. E.</given-names></name> <name><surname>Caspi</surname> <given-names>A.</given-names></name></person-group> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Guilford Press)</publisher-name>, <fpage>182</fpage>&#x2013;<lpage>224</lpage>.</citation></ref>
<ref id="B85"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tremblay</surname> <given-names>R. E.</given-names></name></person-group> (<year>2004</year>). <article-title>Decade of behavior distinguished lecture: Development of physical aggression during infancy.</article-title> <source><italic>Infant Ment. Health J.</italic></source> <volume>25</volume> <fpage>399</fpage>&#x2013;<lpage>407</lpage>.</citation></ref>
<ref id="B86"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tse</surname> <given-names>S. K.</given-names></name> <name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Leung</surname> <given-names>S. O.</given-names></name></person-group> (<year>2012</year>). <article-title>Tense and temporality: How young children express time in Cantonese.</article-title> <source><italic>Chin. Lang. Discourse</italic></source> <volume>3</volume> <fpage>35</fpage>&#x2013;<lpage>56</lpage>.</citation></ref>
<ref id="B87"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Weber-Fox</surname> <given-names>C.</given-names></name> <name><surname>Neville</surname> <given-names>H. J.</given-names></name></person-group> (<year>2001</year>). <article-title>Sensitive periods differentiate processing of open- and closed-class words: An ERP study of bilinguals.</article-title> <source><italic>J. Speech Lang. Hear. Res.</italic></source> <volume>44</volume> <fpage>1338</fpage>&#x2013;<lpage>1353</lpage>. <pub-id pub-id-type="doi">10.1044/1092-4388(2001/104)</pub-id> <pub-id pub-id-type="pmid">11776369</pub-id></citation></ref>
<ref id="B88"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>D.Y.H.</given-names></name></person-group> (<year>1996</year>). &#x201C;<article-title>Chinese childhood socialization</article-title>,&#x201D; in <source><italic>The handbook of Chinese psychology</italic></source>, <role>ed.</role> <person-group person-group-type="editor"><name><surname>Bond</surname> <given-names>M.H.</given-names></name></person-group> (<publisher-loc>London</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>), <fpage>143</fpage>&#x2013;<lpage>154</lpage>.</citation></ref>
<ref id="B89"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>S. Y.</given-names></name> <name><surname>Huang</surname> <given-names>R. J.</given-names></name> <name><surname>Tsai</surname> <given-names>I. F.</given-names></name></person-group> (<year>2019</year>). <article-title>The applicability of D, MTLD, and MATTR in Mandarin&#x2013;speaking children.</article-title> <source><italic>J. Commun. Disord.</italic></source> <volume>77</volume> <fpage>71</fpage>&#x2013;<lpage>79</lpage>. <pub-id pub-id-type="doi">10.1016/j.jcomdis.2018.10.002</pub-id> <pub-id pub-id-type="pmid">30686328</pub-id></citation></ref>
<ref id="B90"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>S.-Y.</given-names></name></person-group> (<year>2020</year>). <article-title>Mean length of utterance among Mandarin-speaking children with and without DLD.</article-title> <source><italic>Child Lang. Teach. Ther.</italic></source> <volume>36</volume> <fpage>165</fpage>&#x2013;<lpage>179</lpage>. <pub-id pub-id-type="doi">10.1016/j.jcomdis.2022.106199</pub-id> <pub-id pub-id-type="pmid">35227976</pub-id></citation></ref>
<ref id="B91"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xiao</surname> <given-names>H.</given-names></name></person-group> (<year>1999</year>). <article-title>Independence and obedience: An analysis of child socialization values in the United States and China.</article-title> <source><italic>J. Comp. Fam. Stud.</italic></source> <volume>30</volume> <fpage>641</fpage>&#x2013;<lpage>657</lpage>.</citation></ref>
<ref id="B92"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname> <given-names>Y.</given-names></name> <name><surname>Farver</surname> <given-names>J. A. M.</given-names></name></person-group> (<year>2009</year>). <article-title>&#x201C;What makes you shy?&#x201D;: Understanding situational elicitors of shyness in Chinese children.</article-title> <source><italic>Int. J. Behav. Dev.</italic></source> <volume>33</volume> <fpage>97</fpage>&#x2013;<lpage>104</lpage>.</citation></ref>
<ref id="B93"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zarifian</surname> <given-names>T.</given-names></name> <name><surname>Modarresi</surname> <given-names>Y.</given-names></name> <name><surname>Tehrani</surname> <given-names>L. G.</given-names></name> <name><surname>Kazemi</surname> <given-names>M. D.</given-names></name></person-group> (<year>2015</year>). <article-title>Phonetic and phonological acquisition in Persian speaking children.</article-title> <source><italic>Monolingual Biling. Speech</italic></source> <volume>2015</volume>:<issue>430</issue>.</citation></ref>
<ref id="B94"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Zhou</surname> <given-names>J.</given-names></name></person-group> (<year>2020</year>). <article-title>Building a norm-referenced dataset for vocabulary assessment based on Chinese vocD and word classes.</article-title> <source><italic>J. Chin. Writ. Syst.</italic></source> <volume>4</volume> <fpage>5</fpage>&#x2212;<lpage>17</lpage>.</citation></ref>
<ref id="B95"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>G. G.</given-names></name></person-group> (<year>2004</year>). <article-title>An investigation of the acquisition of Chinese time system.</article-title> <source><italic>Appl. Linguist.</italic></source> <volume>4</volume> <fpage>33</fpage>&#x2013;<lpage>40</lpage> [In Chinese: &#x5468;&#x56FD;&#x5149; (<year>2004</year>). &#x6C49;&#x8BED;&#x65F6;&#x95F4;&#x7CFB;&#x7EDF;&#x4E60;&#x5F97;&#x72B6;&#x51B5;&#x7684;&#x8003;&#x5BDF;. <italic>&#x8BED;&#x8A00;&#x6587;&#x5B57;&#x5E94;&#x7528;</italic> <volume>4</volume>: <fpage>33</fpage>&#x2013;<lpage>40</lpage>].</citation></ref>
<ref id="B96"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>Y.</given-names></name> <name><surname>Xue</surname> <given-names>N.</given-names></name></person-group> (<year>2015</year>). <article-title>The Chinese discourse Treebank: A Chinese corpus annotated with discourse relations.</article-title> <source><italic>Lang. Resour. Eval.</italic></source> <volume>49</volume> <fpage>397</fpage>&#x2212;<lpage>431</lpage>.</citation></ref>
<ref id="B97"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhu</surname> <given-names>D. X.</given-names></name></person-group> (<year>1982</year>). <source><italic>YuFa Jiangyi.</italic></source> <publisher-loc>Beijing</publisher-loc>: <publisher-name>Commercial Press</publisher-name> [In Chinese: &#x6731;&#x5FB7;&#x7199; (<year>1982</year>). <italic>&#x8BED;&#x6CD5;&#x8BB2;&#x4E49;</italic>. &#x5317;&#x4EAC;&#xFF1A;&#x5546;&#x52A1;&#x5370;&#x4E66;&#x9986;].].</citation></ref>
</ref-list>
</back>
</article>