<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Psychol.</journal-id>
<journal-title>Frontiers in Psychology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Psychol.</abbrev-journal-title>
<issn pub-type="epub">1664-1078</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpsyg.2022.769415</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Psychology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>The Assessment of Chinese Children&#x2019;s English Vocabulary&#x2014;A Culturally Appropriate Receptive Vocabulary Test for Young Chinese Learners of English</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>de Ruiter</surname> <given-names>Laura E.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1519037/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Wen</surname> <given-names>Peizhi</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1511994/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Chen</surname> <given-names>Si</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1442109/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Division of Human Communication, Development &#x0026; Hearing, School of Health Sciences, The University of Manchester</institution>, <addr-line>Manchester</addr-line>, <country>United Kingdom</country></aff>
<aff id="aff2"><sup>2</sup><institution>Harvard Graduate School of Education</institution>, <addr-line>Cambridge, MA</addr-line>, <country>United States</country></aff>
<aff id="aff3"><sup>3</sup><institution>PACE Research Institute</institution>, <addr-line>Shenzhen</addr-line>, <country>China</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Marianne Gullberg, Lund University, Sweden</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Ghada M. Awada, Lebanese American University, Lebanon; Dorota Campfield, University of Warsaw, Poland</p></fn>
<corresp id="c001">&#x002A;Correspondence: Si Chen, <email>sic773@mail.harvard.edu</email></corresp>
<fn fn-type="other" id="fn004"><p>This article was submitted to Language Sciences, a section of the journal Frontiers in Psychology</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>22</day>
<month>03</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>13</volume>
<elocation-id>769415</elocation-id>
<history>
<date date-type="received">
<day>02</day>
<month>09</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>21</day>
<month>01</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2022 de Ruiter, Wen and Chen.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>de Ruiter, Wen and Chen</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Millions of Chinese children learn English at increasingly younger ages. Yet when it comes to measuring proficiency, educators and researchers rely on assessments that have been developed for L1 learners and/or for different cultural contexts, or on non-validated, individually designed tests. We developed the Assessment of Chinese Children&#x2019;s English Vocabulary test (ACCE-V) to address the need for a validated, culturally appropriate receptive vocabulary test, designed specifically for young Chinese learners. The items are drawn from current teaching materials used in China, and the depictions of people and objects are culturally appropriate. We evaluated the instrument&#x2019;s reliability and validity in two field tests with a combined sample size of 1,092 children (181 children in the first field test and 911 children in the second field test; age range: 3.1 to 7.7 years; mean age: 5.2 years). Item Response Theory (IRT) analyses show that the ACCE-V is sufficiently sensitive to capture different proficiency levels and that it has good psychometric properties. ACCE-V scores were correlated with Peabody Picture Vocabulary Test scores, indicating concurrent validity. We found that both children&#x2019;s age and their English learning experience significantly predict ACCE-V scores, but that the effect of English learning experience is greater. The ACCE-V thus offers an alternative to existing vocabulary tests. We argue that culturally appropriate assessments like the ACCE-V are fairer to learners and help promote an English learning and teaching environment that is less dominated by Western cultures and native speaker norms.</p>
</abstract>
<kwd-group>
<kwd>English as a foreign language</kwd>
<kwd>receptive vocabulary</kwd>
<kwd>assessment</kwd>
<kwd>young language learners</kwd>
<kwd>China</kwd>
</kwd-group>
<counts>
<fig-count count="7"/>
<table-count count="8"/>
<equation-count count="0"/>
<ref-count count="40"/>
<page-count count="15"/>
<word-count count="10834"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="intro">
<title>Introduction</title>
<sec id="S1.SS1">
<title>Young English Learners in China</title>
<p>Over the past 20 years, English has gained importance as an academic subject in the Chinese education system. In 2001, the Ministry of Education (MOE) of China launched the &#x201C;Guideline for Promoting English Teaching in Elementary Schools,&#x201D; which made English a compulsory subject starting from the third (rural areas) or the first grade (urban areas) of elementary school (<xref ref-type="bibr" rid="B24">Ministry of Education of the People&#x2019;s Republic of China, 2001</xref>). In the Chinese College Entrance Examination (Gao Kao), English is weighted as much as Chinese and mathematics.</p>
<p>Because of the importance of English in children&#x2019;s academic achievement, many parents try to boost their children&#x2019;s English skills by sending them to after-school programs and cram schools (<xref ref-type="bibr" rid="B16">Feng, 2012</xref>). Many parents also believe that the best way to achieve good English proficiency is to start learning English at an early age, that is, before children enter elementary school and receive English instruction as part of the regular curriculum. As a consequence, more children at increasingly younger ages are learning English as a foreign language (EFL) (<xref ref-type="bibr" rid="B19">Hu and McKay, 2012</xref>). In 2016, more than 210 million young EFL learners under the age of six were taking English courses in more than 50,000 private English institutes in mainland China (<xref ref-type="bibr" rid="B37">Sun et al., 2016</xref>), and the numbers have almost certainly only increased since then. Recently, China has seen significant growth in online English tutoring. One report showed that more than 26.9% of all users of K-12 (kindergarten through 12th grade) online English teaching courses in China are aged 3&#x2013;6 years (<xref ref-type="bibr" rid="B3">Aurora Mobile, 2020</xref>).</p>
<p>In China, public kindergartens are prohibited from teaching English (<xref ref-type="bibr" rid="B25">Ministry of Education of the People&#x2019;s Republic of China, 2018</xref>). Private kindergartens, in contrast, are allowed to teach English (<xref ref-type="bibr" rid="B8">Chen et al., 2020</xref>). Although China has recently issued a series of policies that prohibit online classes for children, including English (<xref ref-type="bibr" rid="B26">Ministry of Education of the People&#x2019;s Republic of China, 2021</xref>), many parents are still enthusiastic about having their young children learn English. Some kindergartens even hire foreign teachers to carry out immersive English teaching. Although private schools charge much higher fees than public schools, they are still popular with parents, especially with those with higher socio-economic status (SES). Since the government introduced restrictions on early English teaching in 2021 (<xref ref-type="bibr" rid="B26">Ministry of Education of the People&#x2019;s Republic of China, 2021</xref>), English is being learned by Chinese preschoolers exclusively in private kindergartens.</p>
</sec>
<sec id="S1.SS2">
<title>Challenges for Early English Teaching in China</title>
<p>Early English teaching in China faces several challenges with respect to quality. First, there are no educational standards for English teaching in kindergartens. Individual kindergartens or even individual teachers determine the content of teaching. High-quality kindergartens may decide to have courses and class plans reviewed and checked by specialized teaching and research departments of their local Ministry of Education. However, this kind of quality control is entirely voluntary, and in many kindergartens, there is no quality control whatsoever. This problem is compounded by the fact that foreign teachers in Chinese kindergartens are generally highly mobile, and that kindergartens have a high turnover rate. These constant changes in teaching personnel make it difficult to deliver high-quality education in a consistent manner.</p>
<p>Second, children in Chinese kindergartens are automatically grouped by age rather than proficiency level. However, unlike with Mandarin, age is not necessarily a good indicator of proficiency level in English: children come to class with different backgrounds because parents pursue different strategies to have their children learn English, ranging from apps to private tutors. This age-based grouping can make it difficult to cater to the needs of children with different levels of English proficiency.</p>
<p>In this environment, it is very difficult for teachers to provide differentiated content that is appropriate for each child&#x2019;s proficiency level. Very experienced teachers may be able to gauge children&#x2019;s English proficiency through regular interaction and observation. However, given that most teachers are inexperienced and often teach the children only for a relatively short period, it is not easy for them to assess the actual proficiency level of individual children. Under these circumstances, an appropriate English proficiency test can provide this information. Currently, however, there is a lack of assessment tools suitable for young Chinese children.</p>
</sec>
<sec id="S1.SS3">
<title>Currently Available Tests</title>
<sec id="S1.SS3.SSS1">
<title>L2-Specific Tests</title>
<p>There are currently three sets of tests aimed at young learners of English that are also available in China: the Cambridge Young Learners English Test (CYLE), developed by Cambridge ESOL Examinations, the Pearson Test of English Young Learners (PTE Young Learners), developed by Pearson, and the TOEFL Primary test, developed by ETS.</p>
<p>The CYLE test series tests how well 7- to 12-year-olds perform in the skills of listening, speaking, reading, and writing. It has so far been administered in 68 countries and has more than 360,000 test takers annually (<xref ref-type="bibr" rid="B10">Chik and Besser, 2011</xref>). The PTE Young Learners, called Starters, is aimed at learners between 6 and 13 years and also assesses the four language skills. TOEFL Primary is for learners 8 years and up and assesses reading, writing, and speaking. All three tests are administered at designated test centers, with TOEFL Primary being the only one that also offers institutional testing at schools.</p>
<p>Although these three tests are aimed at children above kindergarten age (3&#x2013;6 years in China), they are nonetheless often used in Chinese kindergartens. This is highly problematic for several reasons. First, the format of these tests requires that children can already read and write, which is not the case for most kindergarten children in China who are just beginning to learn English. When used with preliterate children, these tests will therefore produce inaccurate results: a child may have a certain level of oral proficiency, but this will not be captured by a test that requires literacy. Second, these tests are designed for children of elementary school age and older and lack the play-based interface that is necessary to engage young children. Finally, the tests are designed according to the cultural norms of Western countries (specifically the United States, the United Kingdom, Canada, and Australia), which makes them less accessible for Chinese children who lack the cultural background knowledge, as we will discuss in more detail in the next section.</p>
<p>Another issue with these tests is that they are designed to cover all areas of language ability, with a focus on phonological awareness and grammar. However, the consensus in early language teaching is that for very young children, teaching should focus on developing lexical competence rather than on explicit grammar instruction, as children do not develop the necessary metalinguistic skills until much later (<xref ref-type="bibr" rid="B12">Curtain and Dahlberg, 2010</xref>; <xref ref-type="bibr" rid="B34">Shin and Crandall, 2014</xref>). In line with this, any instrument for assessing young learners&#x2019; proficiency should focus on vocabulary.</p>
</sec>
<sec id="S1.SS3.SSS2">
<title>Vocabulary Tests</title>
<p>When it comes to measuring the vocabulary of young Chinese EFL learners, researchers and teachers typically rely on assessments such as the Peabody Picture Vocabulary Test, PPVT (<xref ref-type="bibr" rid="B15">Dunn and Dunn, 2007</xref>), or the British Picture Vocabulary Scale, BPVS (<xref ref-type="bibr" rid="B14">Dunn et al., 2009</xref>). These tests have been developed over many years and typically have very good psychometric properties. However, they are aimed at children learning English as their first language (L1). Using these tests for EFL learners is problematic for two different but related reasons.</p>
<p>The first reason is that the learning environment for children who learn English in an educational setting differs from that of children learning it in their home environment. Children who initially acquire words in their home environment learn these from conversations with and among adults as well as through activities such as pretend play and book reading (<xref ref-type="bibr" rid="B29">Ninio, 1983</xref>; <xref ref-type="bibr" rid="B35">Snow et al., 1991</xref>). Their early vocabulary is characterized by words that reflect what is relevant for the child, such as family members (e.g., <italic>mommy, daddy</italic>), toys (e.g., <italic>ball, teddy</italic>), body parts (e.g., <italic>toe, hand</italic>), or food items (e.g., <italic>sandwich, cookie</italic>). In line with this, the structure of vocabulary tests like the PPVT is based on the idea that in the child&#x2019;s experience, some words are more frequent than others, and that children will acquire more frequent words before less frequent words (<xref ref-type="bibr" rid="B15">Dunn and Dunn, 2007</xref>).</p>
<p>However, the words used in English lessons in Chinese schools are different from what children may encounter when they learn English from parents and others at home. English lessons often introduce school-related vocabulary (e.g., <italic>classroom, pencil, ruler</italic>), places (e.g., <italic>library, hospital, post office</italic>), shapes and colors (e.g., <italic>rectangle, red</italic>), and animals (e.g., <italic>elephant, snake</italic>). Thus, testing young Chinese EFL learners with words that children growing up in an English-speaking environment typically learn at home will not provide an accurate picture of their vocabulary knowledge.</p>
<p>The second, related reason why these tests are problematic is that the items are based on the dominant culture of the country for which the test was developed (e.g., United States, United Kingdom). This applies both to the types of items used and to the way items are depicted. Examples of the first issue are items (&#x2018;items&#x2019; being used here to mean both targets and distractors) like <italic>muffin</italic> and <italic>pretzel</italic> (both used in the PPVT), which are unlikely to be known to young Chinese children. Examples of culturally specific depictions are a traditional English teacup with a handle on a saucer (used in the BPVS), which is not common in China, or a castle in European medieval style (used in the PPVT), which is very different from the way castles look in China.</p>
<p>Recent research found the PPVT to be less reliable for L2 learners with limited English experience and proficiency. <xref ref-type="bibr" rid="B40">Wood et al. (2015)</xref> tested both Spanish-speaking kindergarteners&#x2019; and monolingual kindergarteners&#x2019; vocabulary using the PPVT-4. They observed that the difficulty level of the items in the PPVT (which is indicated by the order of items in the test) was positively related to children&#x2019;s error scores in both groups (i.e., children made more errors with more difficult items). However, this relation was much stronger in English monolingual children than in Spanish-speaking L2 learners of English. In other words, the findings suggest that the difficulty assumptions that the test is based on do not hold to the same extent for L2 learners as they do for L1 learners. <xref ref-type="bibr" rid="B17">Goriot et al. (2018)</xref> administered the PPVT-4 to pupils in the Netherlands in six different age groups (4&#x2013;15 years old). They found that the test had low reliability scores (as measured by Cronbach&#x2019;s alpha) for learners with limited proficiency; these were predominantly children in the youngest age group (4&#x2013;5 years). They also found an effect of the children&#x2019;s L1: English words that had cognates in Dutch (e.g., <italic>penguin</italic> and <italic>pinguin</italic>) tended to be easier for participants than words that did not, most likely because participants were able to guess the meaning. Like the study by <xref ref-type="bibr" rid="B40">Wood et al. (2015)</xref>, this study showed that the test&#x2019;s reliability is lower with L2 speakers. In addition, the participants in <xref ref-type="bibr" rid="B17">Goriot et al. (2018)</xref> had the advantage of speaking an L1 that is typologically close to English and of being culturally closer to the United States. Potential issues with tests like the PPVT are arguably more pronounced in young Chinese learners of English, whose L1 does not have any linguistic similarities with English and who have less experience with Western culture.</p>
<p>The learning environment of young Chinese English learners is different also from that of dual language learners in an English-speaking country (e.g., Hispanic English language learners in the United States, who speak Spanish at home, but who learn and speak English at school, or EAL learners in the United Kingdom who speak Pashtu at home but speak English at school). Children whose home language is not English have less exposure to English than children whose home language is English (e.g., <xref ref-type="bibr" rid="B11">Cote and Bornstein, 2014</xref>), and they score consistently lower on English vocabulary tests than their monolingual counterparts (REF). But unlike foreign language learners in China, these children are immersed in English continuously at school (not only during English lessons), and are likely to interact with others (peers, teachers) in English on a regular basis. Researchers who work with this population have pointed out that test validity is threatened when available norms are based on monolingual children, when the child&#x2019;s cultural experiences do not match test expectations, or when the items are not presented in a way that allows the child to demonstrate competence (<xref ref-type="bibr" rid="B30">Pe&#x00F1;a and Halle, 2011</xref>). This problem is exacerbated for children who do not even have minimal experience with the cultural norms reflected in test items, such as young Chinese learners of English living in China.</p>
<p>Against this background, we developed a new vocabulary test, designed specifically for young Chinese learners of English. But before we move on to describing this new instrument, we briefly want to discuss what it means for a young learner to &#x201C;know a word&#x201D; (in comprehension).</p>
</sec>
</sec>
<sec id="S1.SS4">
<title>&#x201C;Knowing a Word&#x201D;</title>
<p>To know a word usually means that someone knows its basic meaning (denotation). For L1 speakers or more advanced learners, one would assume that they also have an understanding of evaluative meanings (connotation), an understanding of its grammatical form, an awareness that the word can have multiple meanings (e.g., <italic>to run across the field</italic> vs. <italic>to run a business</italic> vs. <italic>he had a good run</italic>), or knowledge of which register a word belongs to (e.g., formal, casual).</p>
<p>In the case of young learners, we assume knowledge only of a word&#x2019;s denotation in its most frequent use, for example, understanding that <italic>run</italic> means &#x201C;move using your feet/limbs at a speed faster than a walk.&#x201D; Traditionally, vocabulary tests use word families as their unit of recognition (<xref ref-type="bibr" rid="B33">Schmitt, 2010</xref>). A word family comprises the base word and its inflections and most common derivations. For example, the words <italic>run</italic>, <italic>runs</italic>, <italic>ran</italic>, <italic>running</italic>, and <italic>runner</italic> would all be assumed to belong to the same word family. In other words, if a learner knows a word such as <italic>run</italic>, it is assumed that they will also know the meaning of <italic>runner</italic>, or at least be in a position to guess its meaning. However, some studies have challenged this assumption, finding that learners may in fact not know the other family members (<xref ref-type="bibr" rid="B28">Nation, 2006</xref>). <xref ref-type="bibr" rid="B38">Ward and Chuenjundaeng (2009)</xref>, for example, conducted a study with Thai EFL learners, focusing on their suffix knowledge. They conclude that their findings &#x201C;contradict the assumption that knowledge of headwords implies knowledge of word families, at least with lower-level students from non-Latinate L1 [first language] backgrounds&#x201D; (p. 465). For this reason, we refrain from making assumptions regarding the size of a learner&#x2019;s lexicon.</p>
</sec>
<sec id="S1.SS5">
<title>The Current Study: Purpose and Use of the Assessment of Chinese Children&#x2019;s English Vocabulary</title>
<p>As we discussed, existing vocabulary tests are not well suited for young Chinese learners of English. Both educators and researchers would benefit from a receptive vocabulary test that is specifically designed for this growing population. For educators, a suitable assessment tool will help understand the level of children&#x2019;s English development accurately, and this information can help educators set English learning goals and design curriculum content suitable for children&#x2019;s developmental level. For researchers, assessment tools are also needed to estimate children&#x2019;s English ability, for example in the context of evaluating the effects of educational experiments and intervention projects. We (a group of early childhood education, psychology, and psycholinguistic researchers) therefore developed the Assessment of Chinese Children&#x2019;s English&#x2014;Vocabulary (ACCE-V). The test was commissioned by the PACE Research Institute, which focuses on research on early childhood education in China.</p>
<p>The ACCE-V is a multiple-choice, receptive vocabulary test for young Chinese (Cantonese- and Mandarin-speaking) learners of English between 4 and 7 years. It assesses vocabulary knowledge that is relevant in the context of Chinese primary English education. Because it does not require reading, it can be used with preliterate children. Since the purpose of the test in educational settings is to provide teachers with information about the children&#x2019;s proficiency to allow them to tailor their teaching accordingly, scores are meant only for educators and will not be communicated to parents. Educators will receive standardized scores that interpret children&#x2019;s scores based on the group means and standard deviations. By avoiding communicating the scores to parents, we believe that the ACCE-V will not add to the existing &#x2018;testing culture&#x2019; in China.</p>
<p>The current study describes the design and validation of the ACCE-V. We ask two research questions:</p>
<p>(1) Does the ACCE-V have acceptable psychometric characteristics?</p>
<p>(2) What is the relationship between the ACCE-V, children&#x2019;s demographic features (age and gender), and children&#x2019;s English learning experience?</p>
<p>Regarding question (2), we hypothesize that there is little correlation between children&#x2019;s age and their vocabulary scores, while the correlation between children&#x2019;s English learning experience and vocabulary scores is greater.</p>
</sec>
</sec>
<sec id="S2" sec-type="materials|methods">
<title>Materials and Methods</title>
<sec id="S2.SS1">
<title>Participants</title>
<sec id="S2.SS1.SSS1">
<title>First Field Test Participants</title>
<p>One-hundred-and-eighty-one children between 3 and 7 years of age (<italic>M</italic> = 5;04<sup><xref ref-type="fn" rid="footnote1">1</xref></sup>) were recruited from two preschools in two major cities in China (see <xref ref-type="table" rid="T1">Table 1</xref>): one in southern China (<italic>n</italic> = 109) and one in eastern China (<italic>n</italic> = 72). Socio-economic background is often correlated with educational outcomes. We therefore wanted to include some information on the participants&#x2019; SES. As it was not possible to collect information about parents&#x2019; income or their educational background, we used the tuition costs of the preschools and publicly available economic information about the catchment areas of the preschools (here: housing prices) as indicators. The first preschool (southern China) served predominantly middle SES families: its tuition was 28% higher than the average tuition in the city, while the average housing price in its catchment area was &#x00A5; 45,280/m<sup>2</sup>, slightly lower than the average housing price of &#x00A5; 50,000/m<sup>2</sup> that applies to both cities. The second preschool (eastern China) served predominantly high SES families, as evidenced by the fact that its tuition was ten times the average tuition in its city and the housing price in its catchment area was more than two times the average (&#x00A5; 108,920/m<sup>2</sup>). Based on the total number of children enrolled in each preschool, 40% of all children were randomly selected from each grade. Of the 181 children in our sample, 75 were female and 106 were male.</p>
<table-wrap position="float" id="T1">
<label>TABLE 1</label>
<caption><p>Descriptive statistics.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Field test</td>
<td valign="top" align="center"><italic>N</italic> (%)</td>
<td valign="top" align="center">Mean</td>
<td valign="top" align="center">Standard deviation</td>
<td valign="top" align="center">Standard error of the mean</td>
<td valign="top" align="center">Skewness</td>
<td valign="top" align="center">Kurtosis</td>
<td valign="top" align="center">95% CI</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">First field test</td>
<td valign="top" align="center">181</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">Gender (female)</td>
<td valign="top" align="center">75 (41.43%)</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">Children&#x2019;s age</td>
<td valign="top" align="center">181 (100.00%)</td>
<td valign="top" align="center">5.30</td>
<td valign="top" align="center">0.69</td>
<td valign="top" align="center">0.05</td>
<td valign="top" align="center">&#x2013;0.24</td>
<td valign="top" align="center">2.42</td>
<td valign="top" align="center">5.20&#x2013;5.40</td>
</tr>
<tr>
<td valign="top" align="left" colspan="8"><bold>SES (housing price, thousand per square meter in RMB)</bold></td>
</tr>
<tr>
<td valign="top" align="left">Middle SES (Southern China)</td>
<td valign="top" align="center">109 (60.22%)</td>
<td valign="top" align="center">45.28</td>
<td valign="top" align="center">8.10</td>
<td valign="top" align="center">1.81</td>
<td valign="top" align="center">0.15</td>
<td valign="top" align="center">2.26</td>
<td valign="top" align="center">41.49&#x2013;49.07</td>
</tr>
<tr>
<td valign="top" align="left">High SES (Eastern China)</td>
<td valign="top" align="center">72 (39.78%)</td>
<td valign="top" align="center">108.92</td>
<td valign="top" align="center">22.10</td>
<td valign="top" align="center">6.38</td>
<td valign="top" align="center">0.06</td>
<td valign="top" align="center">1.54</td>
<td valign="top" align="center">94.87&#x2013;122.96</td>
</tr>
<tr>
<td valign="top" align="left">Second field test</td>
<td valign="top" align="center">911</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">Gender (female)</td>
<td valign="top" align="center">405 (44.46%)</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">Children&#x2019;s age</td>
<td valign="top" align="center">911 (100.00%)</td>
<td valign="top" align="center">5.12</td>
<td valign="top" align="center">0.96</td>
<td valign="top" align="center">0.03</td>
<td valign="top" align="center">&#x2013;0.10</td>
<td valign="top" align="center">2.06</td>
<td valign="top" align="center">5.06&#x2013;5.19</td>
</tr>
<tr>
<td valign="top" align="left" colspan="8"><bold>SES (housing price, thousand per square meter in RMB)</bold></td>
</tr>
<tr>
<td valign="top" align="left">Low SES (Southern China)</td>
<td valign="top" align="center">199 (21.84%)</td>
<td valign="top" align="center">26.50</td>
<td valign="top" align="center">11.49</td>
<td valign="top" align="center">3.19</td>
<td valign="top" align="center">&#x2013;0.02</td>
<td valign="top" align="center">1.51</td>
<td valign="top" align="center">19.56&#x2013;33.45</td>
</tr>
<tr>
<td valign="top" align="left">Middle SES (Southern China)</td>
<td valign="top" align="center">112 (12.29%)</td>
<td valign="top" align="center">54.58</td>
<td valign="top" align="center">3.85</td>
<td valign="top" align="center">1.28</td>
<td valign="top" align="center">&#x2013;0.35</td>
<td valign="top" align="center">1.85</td>
<td valign="top" align="center">51.62&#x2013;57.54</td>
</tr>
<tr>
<td valign="top" align="left">Middle SES (Eastern China)</td>
<td valign="top" align="center">130 (14.27%)</td>
<td valign="top" align="center">77.93</td>
<td valign="top" align="center">6.16</td>
<td valign="top" align="center">2.05</td>
<td valign="top" align="center">0.30</td>
<td valign="top" align="center">2.01</td>
<td valign="top" align="center">73.19&#x2013;82.66</td>
</tr>
<tr>
<td valign="top" align="left">Middle-to-high SES (Southern China)</td>
<td valign="top" align="center">383 (42.04%)</td>
<td valign="top" align="center">93.41</td>
<td valign="top" align="center">25.80</td>
<td valign="top" align="center">6.66</td>
<td valign="top" align="center">&#x2013;2.04</td>
<td valign="top" align="center">8.18</td>
<td valign="top" align="center">79.12&#x2013;107.70</td>
</tr>
<tr>
<td valign="top" align="left">High SES (Eastern China)</td>
<td valign="top" align="center">87 (9.54%)</td>
<td valign="top" align="center">108.92</td>
<td valign="top" align="center">22.10</td>
<td valign="top" align="center">6.38</td>
<td valign="top" align="center">0.06</td>
<td valign="top" align="center">1.54</td>
<td valign="top" align="center">94.87&#x2013;122.96</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="S2.SS1.SSS2">
<title>Second Field Test Participants</title>
<p>Nine-hundred-and-eleven children participated in the second field test. The children were randomly selected from each grade in each preschool. Of these, 405 were female and 506 were male (mean age = 5;01).</p>
<p>We employed the same selection criteria as in the first field test and recruited six kindergartens and one elementary school for the second field test. Four kindergartens were located in a metropolis in southern China (<italic>N</italic> = 694): one from a low SES neighborhood with tuition 36% lower than the average school tuition in the city and an average housing price of &#x00A5; 26,500/m<sup>2</sup>; two from middle SES neighborhoods with tuition 28% higher than the average tuition and an average housing price of &#x00A5; 54,580/m<sup>2</sup>; and one from a middle-to-high SES neighborhood with tuition 28% higher than the average tuition and an average housing price of &#x00A5; 93,410/m<sup>2</sup>. The elementary school (with tuition 28% higher than the average tuition in the city) was located in a middle-to-high SES neighborhood. Two kindergartens were located in a metropolis in eastern China (<italic>N</italic> = 217): one from a middle SES neighborhood with tuition six times the average tuition and an average housing price of &#x00A5; 77,930/m<sup>2</sup>, and one from a high SES neighborhood with tuition ten times the average tuition and an average housing price of &#x00A5; 108,920/m<sup>2</sup>.</p>
<p>Some children did not attend preschool on the second day of testing, or they did not want to complete one or several of the tests, often because they were shy. Of the children in the first cohort (<italic>N</italic> = 558), 22 children did not complete either of the two forms of the ACCE-V, and 21 of these also did not complete the PPVT-4. One child completed only Form A and none of the other tests. Of the children in the second cohort (<italic>N</italic> = 353), 12 did not complete either of the two forms of the ACCE-V, and of these 12, four did not complete these tests on the re-test date, either. One child did not complete Form B on the first day but did complete Form A and both forms on the re-test date.</p>
</sec>
</sec>
<sec id="S2.SS2">
<title>Test Construction and Item Development</title>
<sec id="S2.SS2.SSS1">
<title>Target Item Selection</title>
<p>The main purpose of the ACCE-V is to assess the level of receptive vocabulary knowledge relevant in the context of Chinese primary English education. Our search for target items therefore began by surveying the most widely used English textbooks and workbooks developed for first and second graders in China (see <xref ref-type="table" rid="T2">Table 2</xref>). From these books, we extracted the English words used in exercises, texts, and instructions (excluding pronouns and conjunctions). Altogether, 595 words (nouns, pronouns, adjectives, adverbs, verbs, and prepositions) were extracted. The procedure of the ACCE-V development is illustrated in <xref ref-type="fig" rid="F1">Figure 1</xref>.</p>
<table-wrap position="float" id="T2">
<label>TABLE 2</label>
<caption><p>Titles and grades of the English text- and workbooks used in the item development of the ACCE-V.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Book title English</td>
<td valign="top" align="left">Book title Chinese</td>
<td valign="top" align="left">Grade level</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">English Textbook&#x2014;Starting Line (First Grade First Semester)</td>
<td valign="top" align="left">&#x4E49;&#x52A1;&#x6559;&#x80B2;&#x8BFE;&#x7A0B;&#x6807;&#x51C6;&#x5B9E;&#x9A8C;&#x6559;&#x79D1;&#x4E66;&#xFF1A;&#x82F1;&#x8BED;&#x65B0;&#x8D77;&#x70B9;&#xFF08;&#x4E00;&#x5E74;&#x7EA7;&#x4E0A;&#x518C;&#xFF09;</td>
<td valign="top" align="left">Grade 1</td>
</tr>
<tr>
<td valign="top" align="left">English Textbook&#x2014;Starting Line (First Grade Second Semester)</td>
<td valign="top" align="left">&#x4E49;&#x52A1;&#x6559;&#x80B2;&#x8BFE;&#x7A0B;&#x6807;&#x51C6;&#x5B9E;&#x9A8C;&#x6559;&#x79D1;&#x4E66;&#xFF1A;&#x82F1;&#x8BED;&#x65B0;&#x8D77;&#x70B9;&#xFF08;&#x4E00;&#x5E74;&#x7EA7;&#x4E0B;&#x518C;&#xFF09;</td>
<td valign="top" align="left">Grade 1</td>
</tr>
<tr>
<td valign="top" align="left">English Textbook&#x2014;Starting Line (Second Grade First Semester)</td>
<td valign="top" align="left">&#x4E49;&#x52A1;&#x6559;&#x80B2;&#x8BFE;&#x7A0B;&#x6807;&#x51C6;&#x5B9E;&#x9A8C;&#x6559;&#x79D1;&#x4E66;&#xFF1A;&#x82F1;&#x8BED;&#x65B0;&#x8D77;&#x70B9;&#xFF08;&#x4E8C;&#x5E74;&#x7EA7;&#x4E0A;&#x518C;&#xFF09;</td>
<td valign="top" align="left">Grade 2</td>
</tr>
<tr>
<td valign="top" align="left">English Textbook&#x2014;Starting Line (Second Grade Second Semester)</td>
<td valign="top" align="left">&#x4E49;&#x52A1;&#x6559;&#x80B2;&#x8BFE;&#x7A0B;&#x6807;&#x51C6;&#x5B9E;&#x9A8C;&#x6559;&#x79D1;&#x4E66;&#xFF1A;&#x82F1;&#x8BED;&#x65B0;&#x8D77;&#x70B9;&#xFF08;&#x4E8C;&#x5E74;&#x7EA7;&#x4E0B;&#x518C;&#xFF09;</td>
<td valign="top" align="left">Grade 2</td>
</tr>
<tr>
<td valign="top" align="left">English Reading Comprehension Series (First Grade)</td>
<td valign="top" align="left">&#x5168;&#x65B0;&#x82F1;&#x8BED;&#x9605;&#x8BFB;&#xFF08;&#x4E00;&#x5E74;&#x7EA7;&#x9605;&#x8BFB;&#x7406;&#x89E3;&#xFF09;&#x534E;&#x4E1C;&#x5E08;&#x8303;&#x5927;&#x5B66;&#x51FA;&#x7248;&#x793E;</td>
<td valign="top" align="left">Grade 1</td>
</tr>
<tr>
<td valign="top" align="left">English Reading Comprehension Series (Second Grade)</td>
<td valign="top" align="left">&#x5168;&#x65B0;&#x82F1;&#x8BED;&#x9605;&#x8BFB;&#xFF08;&#x4E8C;&#x5E74;&#x7EA7;&#x9605;&#x8BFB;&#x7406;&#x89E3;&#xFF09;&#x534E;&#x4E1C;&#x5E08;&#x8303;&#x5927;&#x5B66;&#x51FA;&#x7248;&#x793E;</td>
<td valign="top" align="left">Grade 2</td>
</tr>
<tr>
<td valign="top" align="left">English Listening Series (First Grade)</td>
<td valign="top" align="left">&#x5168;&#x65B0;&#x82F1;&#x8BED;&#x542C;&#x529B;&#xFF08;&#x4E00;&#x5E74;&#x7EA7;&#x63D0;&#x9AD8;&#x7248;&#xFF09;&#x534E;&#x4E1C;&#x5E08;&#x8303;&#x5927;&#x5B66;&#x51FA;&#x7248;&#x793E;</td>
<td valign="top" align="left">Grade 1</td>
</tr>
<tr>
<td valign="top" align="left">English Listening Series (Second Grade)</td>
<td valign="top" align="left">&#x5168;&#x65B0;&#x82F1;&#x8BED;&#x542C;&#x529B;&#xFF08;&#x4E8C;&#x5E74;&#x7EA7;&#x63D0;&#x9AD8;&#x7248;&#xFF09;&#x534E;&#x4E1C;&#x5E08;&#x8303;&#x5927;&#x5B66;&#x51FA;&#x7248;&#x793E;</td>
<td valign="top" align="left">Grade 2</td>
</tr>
<tr>
<td valign="top" align="left">English Oral Communication Workbook (First Grade First Semester)</td>
<td valign="top" align="left">&#x53E3;&#x8BED;&#x4EA4;&#x9645;&#x82F1;&#x8BED;&#x6D3B;&#x52A8;&#x624B;&#x518C;&#xFF08;&#x4E00;&#x5E74;&#x7EA7;&#x4E0A;&#x518C;&#xFF09;</td>
<td valign="top" align="left">Grade 1</td>
</tr>
<tr>
<td valign="top" align="left">English Oral Communication Workbook (First Grade Second Semester)</td>
<td valign="top" align="left">&#x53E3;&#x8BED;&#x4EA4;&#x9645;&#x82F1;&#x8BED;&#x6D3B;&#x52A8;&#x624B;&#x518C;&#xFF08;&#x4E00;&#x5E74;&#x7EA7;&#x4E0B;&#x518C;&#xFF09;</td>
<td valign="top" align="left">Grade 1</td>
</tr>
<tr>
<td valign="top" align="left">English Oral Communication Workbook (Second Grade First Semester)</td>
<td valign="top" align="left">&#x53E3;&#x8BED;&#x4EA4;&#x9645;&#x82F1;&#x8BED;&#x6D3B;&#x52A8;&#x624B;&#x518C;&#xFF08;&#x4E8C;&#x5E74;&#x7EA7;&#x4E0A;&#x518C;&#xFF09;</td>
<td valign="top" align="left">Grade 2</td>
</tr>
<tr>
<td valign="top" align="left">English Oral Communication Workbook (Second Grade Second Semester)</td>
<td valign="top" align="left">&#x53E3;&#x8BED;&#x4EA4;&#x9645;&#x82F1;&#x8BED;&#x6D3B;&#x52A8;&#x624B;&#x518C;&#xFF08;&#x4E8C;&#x5E74;&#x7EA7;&#x4E0B;&#x518C;&#xFF09;</td>
<td valign="top" align="left">Grade 2</td>
</tr>
<tr>
<td valign="top" align="left">One Lesson One Practice (English): First Grade First Semester</td>
<td valign="top" align="left">&#x534E;&#x4E1C;&#x5E08;&#x5927;&#x7248;&#xFF1A;&#x4E00;&#x8BFE;&#x4E00;&#x7EC3;&#x4E00;&#x5E74;&#x7EA7;&#x82F1;&#x8BED;&#xFF08;&#x7B2C;&#x4E00;&#x5B66;&#x671F;&#xFF09;</td>
<td valign="top" align="left">Grade 1</td>
</tr>
<tr>
<td valign="top" align="left">One Lesson One Practice (English): First Grade Second Semester</td>
<td valign="top" align="left">&#x534E;&#x4E1C;&#x5E08;&#x5927;&#x7248;&#xFF1A;&#x4E00;&#x8BFE;&#x4E00;&#x7EC3;&#x4E00;&#x5E74;&#x7EA7;&#x82F1;&#x8BED;&#xFF08;&#x7B2C;&#x4E8C;&#x5B66;&#x671F;&#xFF09;</td>
<td valign="top" align="left">Grade 1</td>
</tr>
<tr>
<td valign="top" align="left">One Lesson One Practice (English): Second Grade First Semester</td>
<td valign="top" align="left">&#x534E;&#x4E1C;&#x5E08;&#x5927;&#x7248;&#xFF1A;&#x4E00;&#x8BFE;&#x4E00;&#x7EC3;&#x4E8C;&#x5E74;&#x7EA7;&#x82F1;&#x8BED;&#xFF08;&#x7B2C;&#x4E00;&#x5B66;&#x671F;&#xFF09;</td>
<td valign="top" align="left">Grade 2</td>
</tr>
<tr>
<td valign="top" align="left">One Lesson One Practice (English): Second Grade Second Semester</td>
<td valign="top" align="left">&#x534E;&#x4E1C;&#x5E08;&#x5927;&#x7248;&#xFF1A;&#x4E00;&#x8BFE;&#x4E00;&#x7EC3;&#x4E8C;&#x5E74;&#x7EA7;&#x82F1;&#x8BED;&#xFF08;&#x7B2C;&#x4E8C;&#x5B66;&#x671F;&#xFF09;</td>
<td valign="top" align="left">Grade 2</td>
</tr>
<tr>
<td valign="top" align="left">New Concept: First Things First!</td>
<td valign="top" align="left">&#x65B0;&#x6982;&#x5FF5;&#x82F1;&#x8BED;&#xFF1A;&#x82F1;&#x8BED;&#x521D;&#x9636;</td>
<td valign="top" align="left">Grade 1</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>Procedure of the ACCE-V development.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpsyg-13-769415-g001.tif"/>
</fig>
<p>Each word was assigned to one of 18 content categories (actions, animals, body parts, attributes<sup><xref ref-type="fn" rid="footnote2">2</xref></sup>, people, buildings and spaces, vehicles, household objects, clothing and accessories, shapes and colors, nature and landscapes, food, plants and fruit, books and money, toys and recreation, times, numbers, prepositions, pronouns, abstract concepts). We assessed the usability of the words based on frequency measures, concreteness, and imageability.</p>
<p>A word&#x2019;s frequency is measured by counting how often it occurs in a corpus (e.g., in a collection of books or newspapers). We included frequency measures as a criterion because it seems fair to assume that more frequent words will more likely be found in different textbooks (and consequently be used in more English language classes and known to more children), whereas less frequent words reflect more idiosyncratic choices by a textbook publisher.</p>
<p>In the absence of a corpus of words used in Chinese English-language classes, we used two widely used databases: the CELEX database (<xref ref-type="bibr" rid="B4">Baayen et al., 1996</xref>), which is based on the COBUILD corpus with around 17.9 million tokens (from both written and spoken sources), and the SubtlexUS database (<xref ref-type="bibr" rid="B5">Brysbaert and New, 2009</xref>), which contains 50 million tokens and is based on American movies and TV series subtitles. We used the Cob frequency (from CELEX) and the SUBTLwf (from SubtlexUS) to identify and exclude low-frequency words (e.g., <italic>narrator</italic>, <italic>magnificent</italic>), and to decide between semantically similar words (e.g., <italic>lollipop</italic> and <italic>candy</italic>), assuming that more frequent words were more likely to be taught (in this case <italic>candy</italic> has a higher frequency than <italic>lollipop</italic>).</p>
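<p>The frequency-based screening described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the frequency values are invented (they are not CELEX or SubtlexUS values), the cutoff is arbitrary, and averaging the two measures is our simplification, since the text does not specify how the two measures were combined.</p>
<preformat>
```python
# Illustrative frequencies per million words. These numbers are invented
# for the example and are NOT taken from CELEX or SubtlexUS.
TOY_FREQUENCIES = {
    "candy": {"cob": 10.0, "subtl": 25.0},
    "lollipop": {"cob": 1.0, "subtl": 2.0},
    "narrator": {"cob": 0.5, "subtl": 0.8},
    "banana": {"cob": 8.0, "subtl": 12.0},
}


def mean_frequency(word, table=TOY_FREQUENCIES):
    """Average the two frequency measures for one word (an assumed rule)."""
    entry = table[word]
    return (entry["cob"] + entry["subtl"]) / 2.0


def drop_low_frequency(words, cutoff, table=TOY_FREQUENCIES):
    """Keep only words whose mean frequency reaches the (assumed) cutoff."""
    return [w for w in words if mean_frequency(w, table) >= cutoff]


def pick_more_frequent(candidates, table=TOY_FREQUENCIES):
    """Among semantically similar words, return the most frequent one."""
    return max(candidates, key=lambda w: mean_frequency(w, table))
```
</preformat>
<p>With these toy values, <italic>narrator</italic> falls below a cutoff of 1.0 and is dropped, and <italic>candy</italic> is preferred over <italic>lollipop</italic>.</p>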
<p>Concreteness and imageability were used to restrict the pool of possible target words to those that would be more likely to be known to children and that could be depicted clearly using drawings. Imageability is defined as the ease with which a word gives rise to a sensory mental image (<xref ref-type="bibr" rid="B100">Paivio et al., 1968</xref>), while concreteness refers to the ability to see, hear, and touch something. Empirically, words like <italic>difference</italic> or <italic>against</italic> tend to get lower concreteness ratings than words like <italic>banana</italic> or <italic>running</italic>.<sup><xref ref-type="fn" rid="footnote3">3</xref></sup> Not surprisingly, words with lower concreteness ratings are also typically more difficult to illustrate. We gauged a word&#x2019;s imageability and concreteness using our own and the illustrator&#x2019;s (see below) introspection and our experience with creating visual stimuli for young children.</p>
</sec>
<sec id="S2.SS2.SSS2">
<title>Test Construction</title>
<sec id="S2.SS2.SSS2.Px1">
<title>Distractors</title>
<p>We then selected a subset of 48 words as potential target words. For each word, three distractor words were selected: a phonological distractor, a semantic distractor, and an unrelated distractor. The phonological distractors share the initial phoneme or onset with the target (e.g., <italic>skirt</italic> and <italic>square</italic>). The semantic distractors were from the same content category and semantically related to the target word, for instance by being a subordinate of the same superordinate (e.g., <italic>eye</italic> and <italic>nose</italic> being subordinates of <italic>face</italic>), or by being the opposite of the target word (e.g., <italic>losing</italic> and <italic>finding</italic>). Unrelated distractors were neither semantically nor phonologically related. The distractors were always of the same part of speech as the target word. Where possible, distractors were selected that had a similar frequency to the target word. Distractors were also selected to be concrete and to have high imageability.</p>
</sec>
<sec id="S2.SS2.SSS2.Px2">
<title>Illustration</title>
<p>A professional illustrator created color pictures for all target and distractor words. The illustrator, a Chinese&#x2013;English bilingual born and raised in China, is familiar with the living environment of typical Chinese children. This is important, as a main goal of the ACCE-V is to be culturally appropriate both in terms of the items used and in terms of the illustrations. In other words, the illustrations should be in a style that is familiar to Chinese children. An example is provided in <xref ref-type="fig" rid="F2">Figure 2</xref>.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>Example of an item from ACCE-V. The target word is body. The body is depicted using a traditional schematization of the body as often used in the context of Traditional Chinese Medicine.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpsyg-13-769415-g002.tif"/>
</fig>
<p>We tested whether the illustrations would evoke the intended concepts in Chinese native speakers using a picture naming paradigm: Each picture was presented to 14 native speakers of Mandarin Chinese (four adults, ten children between 5 and 7 years; eight females, six males) living in China, and they were asked to name in Mandarin what was shown in each picture. The overall agreement was 74.6%. Pictures were modified if they showed low naming agreement (less than 50%) or high agreement on a word other than the intended one, that is, the Mandarin translation of the English target (e.g., most participants named the picture intended to show <italic>face</italic> as <italic>head</italic>, which led to removing the hair). Modified pictures were tested again to ensure that they had more than 80% agreement.</p>
</sec>
<sec id="S2.SS2.SSS2.Px3">
<title>Test Forms</title>
<p>In the next step, we selected 30 target words and their distractors. We also created a second form of the test (Form B), in which some of the semantic distractors of Form A served as targets, and the targets of Form A served as semantic distractors. The rationale for having two different but equivalent forms is that it allows re-testing without practice effects.</p>
</sec>
</sec>
</sec>
<sec id="S2.SS3">
<title>Measures</title>
<sec id="S2.SS3.SSS1">
<title>Criterion Validity</title>
<p>To compare the ACCE-V with other vocabulary tests, we chose the Chinese version of the PPVT (<xref ref-type="bibr" rid="B22">Lu and Liu, 1998</xref>, henceforth: PPVT-C) and the English version of the PPVT (<xref ref-type="bibr" rid="B15">Dunn and Dunn, 2007</xref>, the PPVT-4).</p>
<p>The PPVT-C contains 115 items, with possible scores ranging from 0 to 115. It was translated and validated in Taiwan. Like the ACCE-V, the PPVT-C uses a forced-choice picture selection format, in which the child is presented with a word and then asked to select the picture matching that word from an array of four pictures. If a child answers five out of seven consecutive items incorrectly, the examiner stops the test and records the score. The internal consistency of the PPVT-C was 0.83, and that of the PPVT-4 was 0.89.</p>
<p>In addition to the revised ACCE-V vocabulary test (Form A and Form B) and the PPVT-C (<xref ref-type="bibr" rid="B22">Lu and Liu, 1998</xref>), we also included the English PPVT-4 (<xref ref-type="bibr" rid="B15">Dunn and Dunn, 2007</xref>) and the Chinese Expressive Vocabulary Test (EVT) to test criterion validity (concurrent validity). The EVT in Chinese was adapted by the Child Language Research Center (CLRC) at East China Normal University (see <xref ref-type="bibr" rid="B7">Chen et al., 2018</xref> for details). Possible scores range from 0 to 124. The internal consistencies (Cronbach&#x2019;s alpha) of the PPVT-C, the English PPVT-4, and the Chinese EVT in our sample were 0.82, 0.96, and 0.73, respectively.</p>
</sec>
<sec id="S2.SS3.SSS2">
<title>Demographic Information</title>
<list list-type="simple">
<list-item><p><bold>Children&#x2019;s age</bold> was calculated by subtracting children&#x2019;s birth date from the test date, counted in months.</p>
</list-item>
<list-item><p><bold>Children&#x2019;s gender</bold> was a binary variable with 1 for girl and 0 for boy.</p>
</list-item>
<list-item><p><bold>Foreign teacher</bold> was a binary variable representing whether there was a foreign teacher in the children&#x2019;s classroom (1 = yes, 0 = no).</p>
</list-item>
<list-item><p><bold>Housing price</bold> was the average price in the catchment area of the preschool, in Yuan per square meter.</p>
</list-item>
<list-item><p><bold>Tuition</bold> was the tuition of children&#x2019;s preschool, counted in Yuan per month.</p>
</list-item>
</list>
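<p>As an illustration of the age variable, the following Python sketch computes age in completed months from a birth date and a test date. Counting only completed months and the function names are our assumptions; the text above states only that the birth date was subtracted from the test date and expressed in months.</p>
<preformat>
```python
from datetime import date


def age_in_months(birth_date, test_date):
    """Completed months between birth date and test date."""
    months = (test_date.year - birth_date.year) * 12
    months += test_date.month - birth_date.month
    # One month fewer if the birthday's day of month has not been reached yet.
    if birth_date.day > test_date.day:
        months -= 1
    return months


def format_years_months(months):
    """Render months in the 'years;months' notation, e.g. 61 -> '5;01'."""
    years, rest = divmod(months, 12)
    return f"{years};{rest:02d}"
```
</preformat>
<p>For example, a child born on 15 October 2014 and tested on 20 November 2019 would be 61 months old, written as 5;01 in the years;months notation.</p>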
</sec>
</sec>
<sec id="S2.SS4">
<title>Analytic Approach</title>
<sec id="S2.SS4.SSS1">
<title>RQ1: Assessment of Chinese Children&#x2019;s English Test Reliability and Validity</title>
<p>To assess the psychometric properties of the ACCE-V, we used both Item Response Theory (IRT) and classical test theory to determine the test&#x2019;s reliability and validity. After the first field test, we modified or removed items that did not have satisfactory properties, and then tested the modified version again in a second field test. We used a two-parameter (2PL) IRT model to fit the data.</p>
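<p>For readers less familiar with the 2PL model, the sketch below shows its item response function in Python. The symbols theta (child ability), a (item discrimination), and b (item difficulty) follow common IRT notation and are not taken from the text above; the code illustrates the model form only and is not the estimation routine used in the analyses.</p>
<preformat>
```python
import math


def prob_correct_2pl(theta, a, b):
    """2PL item response function: 1 / (1 + exp(-a * (theta - b)))."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def log_likelihood(theta, responses, items):
    """Log-likelihood of a 0/1 response pattern for one ability value.

    responses: list of 0/1 scores; items: list of (a, b) pairs.
    """
    total = 0.0
    for x, (a, b) in zip(responses, items):
        p = prob_correct_2pl(theta, a, b)
        total += math.log(p) if x == 1 else math.log(1.0 - p)
    return total
```
</preformat>
<p>When ability equals item difficulty, the probability of a correct response is 0.5 regardless of discrimination; a larger a makes the probability change more steeply around b. Note that the 2PL form has no guessing parameter.</p>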
</sec>
<sec id="S2.SS4.SSS2">
<title>RQ2: Assessment of Chinese Children&#x2019;s English and Children&#x2019;s Demographic</title>
<p>Once we had established that the ACCE-V had good psychometric properties, we used multiple regression models to examine how children&#x2019;s English vocabulary scores were related to their age and gender. In the regression models, the outcome (English vocabulary score) was modeled as a linear combination of the predictor variable (age), controlling for gender, foreign teacher, housing price, and tuition.</p>
<p>Analyses were conducted using STATA 15.0 (<xref ref-type="bibr" rid="B36">StataCorp, 2017</xref>) and R 3.6.2 (<xref ref-type="bibr" rid="B31">R Core Team, 2019</xref>).</p>
</sec>
</sec>
<sec id="S2.SS5">
<title>Procedure</title>
<p>The first data collection took place between April and June 2019. Ten Chinese-English bilingual research assistants working in the preschool (each with at least a bachelor&#x2019;s degree) were selected and trained by the authors. The training covered the administration of the test, including the pronunciation of each target word. Information about the purpose of the study, the testing procedure, potential risks, and privacy protection was sent to the preschool administrators. The administrators informed the parents through the parent committee, which collected the parents&#x2019; consent to the testing. Of the children whose parents agreed, only those who also agreed themselves were tested. All children were told that they could stop the test at any time.</p>
<p>Children were tested individually by research assistants in their kindergartens. Of the 181 children, 143 completed both forms, which allowed us to gauge alternate form reliability. The two forms were administered in random order to counterbalance the test order. In addition, these 143 children completed the PPVT-C, and 34 completed the PPVT-4. The PPVT-C was administered before the ACCE-V test, and the PPVT-4 at the end. The administration of one form of the ACCE-V test took about 10 min. The administration of the PPVT-C and the PPVT-4 took between 5 and 15 min, depending on the children&#x2019;s vocabulary level. There was a short break between tests.</p>
<p>The same group of research assistants conducted the second field test. All children completed all tests in 2 days. From November 2019 to January 2020, 558 children were tested on the PPVT-C and the Chinese EVT on the first day, and on both forms of the ACCE-V (order counter-balanced) and the English PPVT-4 on the second day. In November and December 2020, the other 353 children were tested on both forms of the ACCE-V on one day, and then again on both forms 1 week later (order counter-balanced) to gauge the instrument&#x2019;s test&#x2013;retest reliability.</p>
</sec>
</sec>
<sec id="S3" sec-type="results">
<title>Results</title>
<sec id="S3.SS1">
<title>RQ1: Assessment of Chinese Children&#x2019;s English Test Reliability and Validity</title>
<p><xref ref-type="table" rid="T3">Table 3</xref> shows the correlations and descriptive statistics for the ACCE-V Basic Form A and Form B, and for the PPVT-4 and PPVT-C. As can be seen in <xref ref-type="table" rid="T3">Table 3</xref>, both forms had medium to high correlations with the PPVT-4, and medium correlations with the PPVT-C. In addition, scores on the two forms were highly correlated with each other.</p>
<table-wrap position="float" id="T3">
<label>TABLE 3</label>
<caption><p>Correlations and descriptive statistics for the four tests (ACCE-V Form A, ACCE-V Form B, PPVT-4, and PPVT-C).</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td/>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">4</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">(1) ACCE-V Form A</td>
<td valign="top" align="center">&#x2013;</td>
<td/>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">(2) ACCE-V Form B</td>
<td valign="top" align="center">0.909<xref ref-type="table-fn" rid="t3fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">&#x2013;</td>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">(3) English PPVT</td>
<td valign="top" align="center">0.622<xref ref-type="table-fn" rid="t3fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">0.491<xref ref-type="table-fn" rid="t3fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">&#x2013;</td>
<td/>
</tr>
<tr>
<td valign="top" align="left">(4) PPVT-C</td>
<td valign="top" align="center">0.331<xref ref-type="table-fn" rid="t3fns1">&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">0.43<xref ref-type="table-fn" rid="t3fns1">&#x002A;</xref></td>
<td valign="top" align="center">0.462<xref ref-type="table-fn" rid="t3fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">Max. achievable score</td>
<td valign="top" align="center">30</td>
<td valign="top" align="center">30</td>
<td valign="top" align="center">228</td>
<td valign="top" align="center">115</td>
</tr>
<tr>
<td valign="top" align="left">Mean</td>
<td valign="top" align="center">15.56</td>
<td valign="top" align="center">13.92</td>
<td valign="top" align="center">42.35</td>
<td valign="top" align="center">50.93</td>
</tr>
<tr>
<td valign="top" align="left">Median</td>
<td valign="top" align="center">15</td>
<td valign="top" align="center">12</td>
<td valign="top" align="center">42</td>
<td valign="top" align="center">47.5</td>
</tr>
<tr>
<td valign="top" align="left">Standard deviation</td>
<td valign="top" align="center">7.05</td>
<td valign="top" align="center">7.14</td>
<td valign="top" align="center">10.3</td>
<td valign="top" align="center">19.14</td>
</tr>
<tr>
<td valign="top" align="left">Standard error of the mean</td>
<td valign="top" align="center">0.54</td>
<td valign="top" align="center">0.58</td>
<td valign="top" align="center">1.27</td>
<td valign="top" align="center">1.41</td>
</tr>
<tr>
<td valign="top" align="left">Skewness</td>
<td valign="top" align="center">0.15</td>
<td valign="top" align="center">0.43</td>
<td valign="top" align="center">&#x2013;0.12</td>
<td valign="top" align="center">0.63</td>
</tr>
<tr>
<td valign="top" align="left">Kurtosis</td>
<td valign="top" align="center">1.85</td>
<td valign="top" align="center">2.15</td>
<td valign="top" align="center">2.42</td>
<td valign="top" align="center">3.30</td>
</tr>
<tr>
<td valign="top" align="left">95% CI</td>
<td valign="top" align="center">14.59&#x2013;16.72</td>
<td valign="top" align="center">12.77&#x2013;15.05</td>
<td valign="top" align="center">39.53&#x2013;44.59</td>
<td valign="top" align="center">48.15&#x2013;53.71</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="t3fns1"><p><italic>N = 181. &#x002A;p &#x003C; 0.05; &#x002A;&#x002A;p &#x003C; 0.01; &#x002A;&#x002A;&#x002A;p &#x003C; 0.001.</italic></p></fn>
</table-wrap-foot>
</table-wrap>
<sec id="S3.SS1.SSS1">
<title>Test Form Modifications After the First Field Test</title>
<p>The unidimensionality assumption was evaluated using a principal components analysis (PCA). For Form A, the PCA showed that the first component (26.89%) accounted for substantially more variance than the second component (7.00%) and subsequent components, indicating that Form A measured a unidimensional ability. Results were similar for Form B, where the first component accounted for 28.43% of the total variance whereas the second component accounted for only 7.72%. The unidimensionality assumption was thus met. We used a 2PL model to fit the data, allowing us to estimate difficulty and discrimination parameters for each item. Items with low discrimination parameter estimates and items that were too easy were removed. Examples are shown in <xref ref-type="fig" rid="F3">Figure 3</xref>.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>Examples of items removed due to low discrimination indices. The target for the <bold>left panel</bold> was to turn (option 4), the target for the <bold>right panel</bold> was to talk (option 3).</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpsyg-13-769415-g003.tif"/>
</fig>
<p>To shorten the test while preserving reliability and information, we reduced the original 30-item test to 20 items using the Test Information Function (TIF) and the Conditional Standard Error of Measurement (CSEM). Both measures allowed us to compare the total information provided by different shortened versions of the test (see <xref ref-type="fig" rid="F4">Figure 4</xref>). For each form in the field test, we selected the 20 items out of 30 that minimized the CSEM and maximized the TIF. In addition, difficulty parameter estimates were used to arrange the items from easy to difficult in each form.</p>
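<p>The relation between the TIF and the CSEM under a 2PL model can be sketched as follows. Under that model, the information of an item at ability theta is a squared times P times (1 minus P), the TIF is the sum of the item information values, and the CSEM is the reciprocal of the square root of the TIF. The selection helper at the end is a simple heuristic of our own (ranking items by information summed over a grid of ability values) and is not necessarily the procedure used for the ACCE-V; all item parameters in the usage note are hypothetical.</p>
<preformat>
```python
import math


def item_information(theta, a, b):
    """Fisher information of one 2PL item at ability theta."""
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)


def tif(theta, items):
    """Test information: sum of item information over (a, b) pairs."""
    return sum(item_information(theta, a, b) for a, b in items)


def csem(theta, items):
    """Conditional standard error of measurement at ability theta."""
    return 1.0 / math.sqrt(tif(theta, items))


def select_items(items, k, thetas):
    """Keep the k items with the most information summed over a theta grid.

    This ranking rule is an illustrative assumption, not the authors' method.
    """
    def total_info(item):
        a, b = item
        return sum(item_information(t, a, b) for t in thetas)

    return sorted(items, key=total_info, reverse=True)[:k]
```
</preformat>
<p>For instance, two hypothetical items with a = 2 and b = 0 each contribute an information of 1.0 at theta = 0, giving a TIF of 2.0 and a CSEM of about 0.71 at that ability level. Dropping an item always lowers the TIF, so a shortened form trades test length against measurement precision.</p>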
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p>Test Information Functions (TIF) for Form A and Form B of the first version of the ACCE-V for 30 items, 28 items, 25 items, and 20 items, respectively.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpsyg-13-769415-g004.tif"/>
</fig>
</sec>
<sec id="S3.SS1.SSS2">
<title>Internal Consistency</title>
<p>Internal consistency measures were calculated for each 6-month age band. <xref ref-type="table" rid="T4">Table 4</xref> shows the internal consistency measures (split-half reliability, Cronbach&#x2019;s &#x03B1;, and standard error of measurement) for each of the age bands. All measures indicate high to very high internal consistency of the ACCE-V, suggesting that it is suitable for all age bands tested.</p>
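<p>The three internal consistency measures can be computed from a matrix of item scores (one row per child, one column per item, scored 0 or 1) as in the following Python sketch. The odd-even split with Spearman-Brown correction and the formula SD times the square root of (1 minus reliability) for the standard error of measurement are common textbook choices that we assume here; the text above does not state which variants were used, so the sketch is not expected to reproduce the values in the table.</p>
<preformat>
```python
import math
from statistics import mean, stdev, variance


def cronbach_alpha(scores):
    """Cronbach's alpha for rows of item scores (one row per child)."""
    k = len(scores[0])
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1.0 - sum(item_vars) / total_var)


def pearson_r(x, y):
    """Pearson correlation between two equally long score lists."""
    mx, my = mean(x), mean(y)
    cov = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sx = math.sqrt(sum((xi - mx) ** 2 for xi in x))
    sy = math.sqrt(sum((yi - my) ** 2 for yi in y))
    return cov / (sx * sy)


def split_half_reliability(scores):
    """Odd-even split-half reliability with Spearman-Brown correction."""
    first = [sum(row[0::2]) for row in scores]
    second = [sum(row[1::2]) for row in scores]
    r = pearson_r(first, second)
    return 2.0 * r / (1.0 + r)


def standard_error_of_measurement(scores, reliability):
    """SEM = SD of total scores * sqrt(1 - reliability)."""
    sd = stdev([sum(row) for row in scores])
    return sd * math.sqrt(max(0.0, 1.0 - reliability))
```
</preformat>
<p>In practice, these functions would be applied separately to the item scores of each 6-month age band and each form.</p>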
<table-wrap position="float" id="T4">
<label>TABLE 4</label>
<caption><p>Split-half reliability, Cronbach&#x2019;s <bold>&#x03B1;</bold>, standard error of measurement, and alternate form reliability for each 6-month age band.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Age</td>
<td valign="top" align="center"><italic>N</italic></td>
<td valign="top" align="center" colspan="2">Split-half reliability<hr/></td>
<td valign="top" align="center" colspan="2">Cronbach&#x2019;s alpha<hr/></td>
<td valign="top" align="center" colspan="2">Standard error of measurement<hr/></td>
<td valign="top" align="center" colspan="2">Form A<hr/></td>
<td valign="top" align="center" colspan="2">Form B<hr/></td>
<td valign="top" align="center">Mean difference</td>
<td valign="top" align="center"><italic>r</italic></td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">Form A</td>
<td valign="top" align="center">Form B</td>
<td valign="top" align="center">Form A</td>
<td valign="top" align="center">Form B</td>
<td valign="top" align="center">Form A</td>
<td valign="top" align="center">Form B</td>
<td valign="top" align="center">Mean</td>
<td valign="top" align="center"><italic>SD</italic></td>
<td valign="top" align="center">Mean</td>
<td valign="top" align="center"><italic>SD</italic></td>
<td/>
<td/>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">3.0&#x2013;3.5</td>
<td valign="top" align="center">36</td>
<td valign="top" align="center">0.73</td>
<td valign="top" align="center">0.79</td>
<td valign="top" align="center">0.82</td>
<td valign="top" align="center">0.81</td>
<td valign="top" align="center">1.4</td>
<td valign="top" align="center">1.45</td>
<td valign="top" align="center">7.78</td>
<td valign="top" align="center">4.3</td>
<td valign="top" align="center">8.08</td>
<td valign="top" align="center">4.21</td>
<td valign="top" align="center">&#x2013;0.3</td>
<td valign="top" align="center">0.65</td>
</tr>
<tr>
<td valign="top" align="left">3.6&#x2013;3.11</td>
<td valign="top" align="center">109</td>
<td valign="top" align="center">0.78</td>
<td valign="top" align="center">0.8</td>
<td valign="top" align="center">0.85</td>
<td valign="top" align="center">0.85</td>
<td valign="top" align="center">1.42</td>
<td valign="top" align="center">1.35</td>
<td valign="top" align="center">7.83</td>
<td valign="top" align="center">4.6</td>
<td valign="top" align="center">8.61</td>
<td valign="top" align="center">4.66</td>
<td valign="top" align="center">&#x2013;0.78</td>
<td valign="top" align="center">0.82</td>
</tr>
<tr>
<td valign="top" align="left">4.0&#x2013;4.5</td>
<td valign="top" align="center">107</td>
<td valign="top" align="center">0.85</td>
<td valign="top" align="center">0.83</td>
<td valign="top" align="center">0.87</td>
<td valign="top" align="center">0.87</td>
<td valign="top" align="center">1.26</td>
<td valign="top" align="center">1.37</td>
<td valign="top" align="center">9</td>
<td valign="top" align="center">4.96</td>
<td valign="top" align="center">9.82</td>
<td valign="top" align="center">4.96</td>
<td valign="top" align="center">&#x2013;0.82</td>
<td valign="top" align="center">0.86</td>
</tr>
<tr>
<td valign="top" align="left">4.6&#x2013;4.11</td>
<td valign="top" align="center">124</td>
<td valign="top" align="center">0.86</td>
<td valign="top" align="center">0.78</td>
<td valign="top" align="center">0.89</td>
<td valign="top" align="center">0.87</td>
<td valign="top" align="center">1.29</td>
<td valign="top" align="center">1.69</td>
<td valign="top" align="center">9.28</td>
<td valign="top" align="center">5.27</td>
<td valign="top" align="center">10.26</td>
<td valign="top" align="center">4.97</td>
<td valign="top" align="center">&#x2013;0.98</td>
<td valign="top" align="center">0.87</td>
</tr>
<tr>
<td valign="top" align="left">5.0&#x2013;5.5</td>
<td valign="top" align="center">152</td>
<td valign="top" align="center">0.89</td>
<td valign="top" align="center">0.87</td>
<td valign="top" align="center">0.9</td>
<td valign="top" align="center">0.88</td>
<td valign="top" align="center">1.28</td>
<td valign="top" align="center">1.23</td>
<td valign="top" align="center">10.33</td>
<td valign="top" align="center">5.51</td>
<td valign="top" align="center">11.1</td>
<td valign="top" align="center">5.13</td>
<td valign="top" align="center">&#x2013;0.77</td>
<td valign="top" align="center">0.88</td>
</tr>
<tr>
<td valign="top" align="left">5.6&#x2013;5.11</td>
<td valign="top" align="center">163</td>
<td valign="top" align="center">0.83</td>
<td valign="top" align="center">0.84</td>
<td valign="top" align="center">0.89</td>
<td valign="top" align="center">0.87</td>
<td valign="top" align="center">1.32</td>
<td valign="top" align="center">1.27</td>
<td valign="top" align="center">9.14</td>
<td valign="top" align="center">5.31</td>
<td valign="top" align="center">9.55</td>
<td valign="top" align="center">5.16</td>
<td valign="top" align="center">&#x2013;0.41</td>
<td valign="top" align="center">0.9</td>
</tr>
<tr>
<td valign="top" align="left">6.0&#x2013;6.5</td>
<td valign="top" align="center">119</td>
<td valign="top" align="center">0.84</td>
<td valign="top" align="center">0.83</td>
<td valign="top" align="center">0.88</td>
<td valign="top" align="center">0.88</td>
<td valign="top" align="center">1.29</td>
<td valign="top" align="center">1.27</td>
<td valign="top" align="center">8.49</td>
<td valign="top" align="center">5.2</td>
<td valign="top" align="center">9.21</td>
<td valign="top" align="center">5.16</td>
<td valign="top" align="center">&#x2013;0.72</td>
<td valign="top" align="center">0.89</td>
</tr>
<tr>
<td valign="top" align="left">6.6&#x2013;6.11</td>
<td valign="top" align="center">37</td>
<td valign="top" align="center">0.82</td>
<td valign="top" align="center">0.89</td>
<td valign="top" align="center">0.91</td>
<td valign="top" align="center">0.9</td>
<td valign="top" align="center">1.45</td>
<td valign="top" align="center">1.4</td>
<td valign="top" align="center">9.85</td>
<td valign="top" align="center">5.97</td>
<td valign="top" align="center">10.31</td>
<td valign="top" align="center">5.89</td>
<td valign="top" align="center">&#x2013;0.46</td>
<td valign="top" align="center">0.93</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p><italic>N = 911.</italic></p></fn>
</table-wrap-foot>
</table-wrap>
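<p>As a rough illustration of how two of the measures in <xref ref-type="table" rid="T4">Table 4</xref> are commonly obtained, the following Python sketch computes Cronbach&#x2019;s &#x03B1; from a matrix of dichotomous item scores and a classical standard error of measurement (SD times the square root of 1 minus the reliability). The function names and the toy response matrix are hypothetical, and the SEM formula is an assumption: the exact procedure behind the tabulated values is not stated in this section and may differ.</p>
<preformat>
```python
from math import sqrt
from statistics import pstdev, pvariance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of per-child lists of item scores (0 or 1)."""
    k = len(responses[0])  # number of items
    item_variances = [pvariance([row[i] for row in responses]) for i in range(k)]
    total_variance = pvariance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


def standard_error_of_measurement(sd, reliability):
    """Classical SEM (assumed formula): SD * sqrt(1 - reliability)."""
    return sd * sqrt(1 - reliability)


# Toy data (hypothetical): five children, four items.
toy_responses = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]
alpha = cronbach_alpha(toy_responses)
total_sd = pstdev([sum(row) for row in toy_responses])
sem = standard_error_of_measurement(total_sd, alpha)
print(f"alpha = {alpha:.2f}, SEM = {sem:.2f}")
```
</preformat>
<p>Population variances are used throughout; because &#x03B1; depends only on a ratio of variances, using sample variances consistently would give the same value.</p>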
</sec>
<sec id="S3.SS1.SSS3">
<title>Test&#x2013;Retest Reliability</title>
<p>Both forms of the ACCE-V were completed twice by 340 children, with a test interval of 1 week. The correlation between the two administrations was <italic>r</italic> = 0.86 for Form A and <italic>r</italic> = 0.85 for Form B, indicating good test&#x2013;retest reliability.</p>
</sec>
<sec id="S3.SS1.SSS4">
<title>Alternate Form Reliability</title>
<p>We determined alternate form reliability by correlating scores from Form A and Form B. Overall, the alternate form reliability was 0.85. <xref ref-type="table" rid="T4">Table 4</xref> shows the correlations for each individual age band. There were moderate to strong positive correlations between the two forms, indicating good alternate form reliability. The correlation was lowest in the youngest age band (3.0&#x2013;3.5 years) at <italic>r</italic> = 0.65; this was the only band with a correlation below 0.8. We believe that the youngest children may have had more difficulty maintaining their concentration throughout the multiple assessments, despite the pauses in between. In addition, this age band had the smallest sample size (<italic>N</italic> = 36).</p>
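<p>The per-band statistics described above (mean difference between forms and the correlation <italic>r</italic>) can be sketched as follows. This is a minimal example with hypothetical toy scores and a hand-written Pearson correlation; it is not the analysis script used for the study.</p>
<preformat>
```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equally long score lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Toy scores (hypothetical): the same five children on both forms.
form_a = [3, 7, 9, 12, 15]
form_b = [4, 8, 9, 14, 15]
mean_difference = sum(form_a) / len(form_a) - sum(form_b) / len(form_b)
r = pearson_r(form_a, form_b)
print(f"mean difference = {mean_difference:.2f}, r = {r:.2f}")
```
</preformat>
<p>In practice the same two quantities would be computed separately within each 6-month age band.</p>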
</sec>
<sec id="S3.SS1.SSS5">
<title>Concurrent Validity</title>
<p>The ACCE-V scores were correlated with the PPVT-4, the PPVT-C, and the Chinese EVT. <xref ref-type="table" rid="T5">Table 5</xref> shows the correlation of the test scores with each other.</p>
<table-wrap position="float" id="T5">
<label>TABLE 5</label>
<caption><p>Correlations and descriptive statistics for the five tests (ACCE-V Form A, ACCE-V Form B, PPVT-4, PPVT-C, and Chinese EVT).</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td/>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">5</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">(1) ACCE-V Form A</td>
<td valign="top" align="center">&#x2013;</td>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">(2) ACCE-V Form B</td>
<td valign="top" align="center">0.879<xref ref-type="table-fn" rid="t5fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">&#x2013;</td>
<td/>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">(3) English PPVT-4</td>
<td valign="top" align="center">0.808<xref ref-type="table-fn" rid="t5fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">0.802<xref ref-type="table-fn" rid="t5fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">&#x2013;</td>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">(4) Chinese PPVT</td>
<td valign="top" align="center">0.396<xref ref-type="table-fn" rid="t5fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">0.380<xref ref-type="table-fn" rid="t5fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">0.341<xref ref-type="table-fn" rid="t5fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">&#x2013;</td>
<td/>
</tr>
<tr>
<td valign="top" align="left">(5) Chinese EVT</td>
<td valign="top" align="center">0.346<xref ref-type="table-fn" rid="t5fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">0.333<xref ref-type="table-fn" rid="t5fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">0.297<xref ref-type="table-fn" rid="t5fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">0.615<xref ref-type="table-fn" rid="t5fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">Max. achievable score</td>
<td valign="top" align="center">20</td>
<td valign="top" align="center">20</td>
<td valign="top" align="center">228</td>
<td valign="top" align="center">115</td>
<td valign="top" align="center">124</td>
</tr>
<tr>
<td valign="top" align="left">Mean</td>
<td valign="top" align="center">9.09</td>
<td valign="top" align="center">9.78</td>
<td valign="top" align="center">25.16</td>
<td valign="top" align="center">40.44</td>
<td valign="top" align="center">56.84</td>
</tr>
<tr>
<td valign="top" align="left">Median</td>
<td valign="top" align="center">8.00</td>
<td valign="top" align="center">9.00</td>
<td valign="top" align="center">22.00</td>
<td valign="top" align="center">36.00</td>
<td valign="top" align="center">58.00</td>
</tr>
<tr>
<td valign="top" align="left">Standard deviation</td>
<td valign="top" align="center">5.25</td>
<td valign="top" align="center">5.12</td>
<td valign="top" align="center">14.95</td>
<td valign="top" align="center">20.25</td>
<td valign="top" align="center">14.92</td>
</tr>
<tr>
<td valign="top" align="left">Standard error of the mean</td>
<td valign="top" align="center">0.18</td>
<td valign="top" align="center">0.17</td>
<td valign="top" align="center">0.65</td>
<td valign="top" align="center">0.86</td>
<td valign="top" align="center">0.72</td>
</tr>
<tr>
<td valign="top" align="left">Skewness</td>
<td valign="top" align="center">0.33</td>
<td valign="top" align="center">0.34</td>
<td valign="top" align="center">0.47</td>
<td valign="top" align="center">0.61</td>
<td valign="top" align="center">&#x2013;0.38</td>
</tr>
<tr>
<td valign="top" align="left">Kurtosis</td>
<td valign="top" align="center">2.00</td>
<td valign="top" align="center">2.11</td>
<td valign="top" align="center">2.22</td>
<td valign="top" align="center">2.79</td>
<td valign="top" align="center">3.33</td>
</tr>
<tr>
<td valign="top" align="left">95% CI</td>
<td valign="top" align="center">8.75&#x2013;9.44</td>
<td valign="top" align="center">9.44&#x2013;10.12</td>
<td valign="top" align="center">23.89&#x2013;26.43</td>
<td valign="top" align="center">38.75&#x2013;42.12</td>
<td valign="top" align="center">55.43&#x2013;58.26</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="t5fns1"><p><italic>N = 911. &#x002A;&#x002A;&#x002A;p &#x003C; 0.001.</italic></p></fn>
</table-wrap-foot>
</table-wrap>
<p>ACCE-V scores were most strongly correlated with PPVT-4 scores. Since the PPVT and the ACCE-V both assess children&#x2019;s English receptive vocabulary, this is both expected and desirable. Correlations with the PPVT-C were moderate, suggesting that children&#x2019;s Mandarin skills and their L2 English skills are related. The strength of the correlation is in line with what is generally found for receptive L1 and L2 vocabulary in young learners (<xref ref-type="bibr" rid="B2">Atwill et al., 2007</xref>; <xref ref-type="bibr" rid="B21">Karlsen et al., 2017</xref>; <xref ref-type="bibr" rid="B18">Gr&#x00F8;ver et al., 2018</xref>). The correlations with the Chinese Expressive Vocabulary Test were lowest, at around 0.3. This is not surprising, given that the two tests measure proficiency in different languages (English vs. Mandarin) and in different modalities (receptive vs. productive).</p>
</sec>
<sec id="S3.SS1.SSS6">
<title>Item Analysis</title>
<p>We again used a 2PL model to measure the items&#x2019; difficulty and discrimination. All items had very good discrimination indices and provided a good range of difficulty indices (see <xref ref-type="table" rid="T6">Table 6</xref>; <xref ref-type="fig" rid="F5">Figure 5</xref>). The most difficult items tended to be verbs (e.g., <italic>putting</italic>) and prepositions (e.g., <italic>on</italic>).</p>
<table-wrap position="float" id="T6">
<label>TABLE 6</label>
<caption><p>Range, mean, and standard deviation of the discrimination and difficulty indices of the ACCE-V Form A and Form B.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td/>
<td valign="top" align="center" colspan="3">Discrimination<hr/></td>
<td valign="top" align="center" colspan="3">Difficulty<hr/></td>
</tr>
<tr>
<td/>
<td valign="top" align="center">Range</td>
<td valign="top" align="center">Mean</td>
<td valign="top" align="center"><italic>SD</italic></td>
<td valign="top" align="center">Range</td>
<td valign="top" align="center">Mean</td>
<td valign="top" align="center"><italic>SD</italic></td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Form A</td>
<td valign="top" align="center">0.99&#x2013;3.23</td>
<td valign="top" align="center">2.04</td>
<td valign="top" align="center">0.58</td>
<td valign="top" align="center">&#x2013;1.66 to 2.32</td>
<td valign="top" align="center">&#x2013;0.02</td>
<td valign="top" align="center">0.93</td>
</tr>
<tr>
<td valign="top" align="left">Form B</td>
<td valign="top" align="center">0.94&#x2013;2.87</td>
<td valign="top" align="center">1.91</td>
<td valign="top" align="center">0.55</td>
<td valign="top" align="center">&#x2013;1.54 to 1.28</td>
<td valign="top" align="center">&#x2013;0.18</td>
<td valign="top" align="center">0.78</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption><p>Item Characteristic Curves for Form A and Form B of the ACCE-V. Plot created with Shiny Item Analysis.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpsyg-13-769415-g005.tif"/>
</fig>
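<p>For readers less familiar with the 2PL model, the sketch below shows how a discrimination parameter and a difficulty parameter translate into a probability of a correct response and into item information. It assumes the common logistic parameterization with exponent a(&#x03B8; &#x2212; b); the parameterization used by the fitting software may differ, and the example item simply reuses the Form A mean parameters from <xref ref-type="table" rid="T6">Table 6</xref> for illustration.</p>
<preformat>
```python
from math import exp


def p_correct(theta, a, b):
    """2PL probability of a correct response (assumed a * (theta - b) form)."""
    return 1.0 / (1.0 + exp(-a * (theta - b)))


def item_information(theta, a, b):
    """Fisher information of a 2PL item at ability theta."""
    p = p_correct(theta, a, b)
    return a * a * p * (1.0 - p)


# Hypothetical item built from the Form A mean parameters in Table 6.
a_mean, b_mean = 2.04, -0.02
for theta in (-1.0, b_mean, 1.0):
    p = p_correct(theta, a_mean, b_mean)
    info = item_information(theta, a_mean, b_mean)
    print(f"theta = {theta:+.2f}: P(correct) = {p:.3f}, information = {info:.3f}")
```
</preformat>
<p>Under this parameterization, a child whose ability equals the item difficulty has a 50% chance of answering correctly, and that is also where the item is most informative.</p>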
<p>We also compared the ACCE-V&#x2019;s difficulty parameters with those of the PPVT-4. As can be seen in <xref ref-type="fig" rid="F6">Figure 6</xref>, some items from the PPVT that are presented early in the test and thus assumed to be relatively easy were in fact quite difficult for the children in our sample. Examples are item 3 (<italic>spoon</italic>) and item 8 (<italic>cup</italic>). At the same time, some items that are assumed to be relatively difficult turned out to be less difficult than some of the items presented earlier. Examples are item 38 (<italic>penguin</italic>) and item 58 (<italic>panda</italic>). Note that these items were assumed to be more difficult by the developers of the PPVT, as indicated by their position in the test (easy items occur early, harder items later). Overall, the items in the ACCE-V increase monotonically in difficulty, whereas the items in the PPVT do not.</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption><p>Difficulty parameters of items in the ACCE-V Form A, ACCE-V Form B, and the PPVT-4 in the order in which the items are presented.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpsyg-13-769415-g006.tif"/>
</fig>
<p>Note that the current version of the ACCE-V does not have a stopping rule, since the test is brief. We re-analyzed the data using two (hypothetical) stopping rules: three incorrect responses in a row and four incorrect responses in a row. Applying either stopping rule did not affect the instrument&#x2019;s reliability.</p>
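<p>A hypothetical stopping rule of this kind can be applied to existing response data by re-scoring each child&#x2019;s response sequence. The sketch below is one possible implementation; it assumes that items after the stopping point are treated as not administered and contribute no points, which is an assumption rather than a documented detail of the re-analysis. The function name and the toy sequence are illustrative.</p>
<preformat>
```python
def score_with_stopping_rule(responses, max_consecutive_errors):
    """Score a sequence of 0/1 responses, stopping after a run of errors.

    Testing ends once `max_consecutive_errors` incorrect responses occur
    in a row; later items are ignored (assumed to count as zero).
    """
    score = 0
    error_run = 0
    for response in responses:
        if response == 1:
            score += 1
            error_run = 0
        else:
            error_run += 1
            if error_run == max_consecutive_errors:
                break
    return score


# Toy response sequence (hypothetical) for one child, in presentation order.
toy_sequence = [1, 1, 0, 0, 0, 1, 1, 0, 1, 0]
full_score = sum(toy_sequence)
score_rule_3 = score_with_stopping_rule(toy_sequence, 3)
score_rule_4 = score_with_stopping_rule(toy_sequence, 4)
print(full_score, score_rule_3, score_rule_4)
```
</preformat>
<p>Reliability statistics can then be recomputed on the re-scored data and compared with those from the full test.</p>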
<p>Finally, we used the Mantel&#x2013;Haenszel method (<xref ref-type="bibr" rid="B23">Mantel and Haenszel, 1959</xref>) for Differential Item Functioning (DIF) to test for each item whether it was more difficult for either boys or girls. Three items in Form A were potential DIF items. Two showed slight advantages for girls and one an advantage for boys: <italic>grandmother</italic> (65% correct responses among girls, 56.6% among boys), <italic>butterfly</italic> (68.1% correct responses among girls, 56.8% among boys), and <italic>triangle</italic> (41.1% correct responses among girls, 49.2% among boys).</p>
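<p>The core of the Mantel&#x2013;Haenszel procedure is a common odds ratio pooled across strata of children with the same total score. The following sketch computes that odds ratio and the ETS delta transformation for a single item. The counts are hypothetical toy values, the choice of reference and focal group is arbitrary, and the significance test that normally accompanies the statistic is omitted.</p>
<preformat>
```python
from math import log


def mantel_haenszel_odds_ratio(strata):
    """Common odds ratio across total-score strata for one item.

    Each stratum is a tuple
    (ref_correct, ref_incorrect, focal_correct, focal_incorrect).
    """
    numerator = 0.0
    denominator = 0.0
    for a, b, c, d in strata:
        n = a + b + c + d
        if n == 0:
            continue  # skip empty strata
        numerator += a * d / n
        denominator += b * c / n
    return numerator / denominator


# Toy counts (hypothetical) for three score strata; the reference group
# could be girls and the focal group boys, or the other way round.
toy_strata = [(10, 20, 8, 22), (20, 10, 15, 15), (28, 2, 25, 5)]
odds_ratio = mantel_haenszel_odds_ratio(toy_strata)
delta_mh = -2.35 * log(odds_ratio)  # ETS delta scale
print(f"MH odds ratio = {odds_ratio:.2f}, delta = {delta_mh:.2f}")
```
</preformat>
<p>An odds ratio above 1 (a negative delta) indicates that, among children of comparable overall score, the item is easier for the reference group.</p>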
</sec>
</sec>
<sec id="S3.SS2">
<title>RQ2: Assessment of Chinese Children&#x2019;s English, Children&#x2019;s Demographic and English Learning Experiences</title>
<p>The overall mean score for Form A was 9.09 points (range: 0&#x2013;20 points, <italic>SD</italic> = 5.25), and 9.78 points for Form B (range: 0&#x2013;20 points, <italic>SD</italic> = 5.12). <xref ref-type="fig" rid="F7">Figure 7</xref> shows the distribution of scores for both forms.</p>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption><p>Density plots showing the distribution of scores for the ACCE-V Form A and Form B. <italic>N</italic> = 1724 (877 for Form A, 875 for Form B).</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpsyg-13-769415-g007.tif"/>
</fig>
<p>The ACCE is not designed to provide any age or grade norms, as the quality and quantity of English language instruction vary widely between preschools, and some children may have started English instruction at a later age than others. However, to provide an overview of children&#x2019;s performance across different ages in the sample, <xref ref-type="table" rid="T7">Table 7</xref> shows the measures of central tendency for each 6-month age band.</p>
<table-wrap position="float" id="T7">
<label>TABLE 7</label>
<caption><p>Number of children, mean score, median score, and standard deviation for 6-month age bands.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td/>
<td valign="top" align="center" colspan="4">ACCE-V Form A<hr/></td>
<td valign="top" align="center" colspan="4">ACCE-V Form B<hr/></td>
</tr>
<tr>
<td/>
<td valign="top" align="center"><italic>N</italic></td>
<td valign="top" align="center">Mean</td>
<td valign="top" align="center">Median</td>
<td valign="top" align="center"><italic>SD</italic></td>
<td valign="top" align="center"><italic>N</italic></td>
<td valign="top" align="center">Mean</td>
<td valign="top" align="center">Median</td>
<td valign="top" align="center"><italic>SD</italic></td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">3.0&#x2013;3.5</td>
<td valign="top" align="center">36</td>
<td valign="top" align="center">7.78</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">4.30</td>
<td valign="top" align="center">36</td>
<td valign="top" align="center">8.08</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">4.21</td>
</tr>
<tr>
<td valign="top" align="left">3.6&#x2013;3.11</td>
<td valign="top" align="center">109</td>
<td valign="top" align="center">7.83</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">4.60</td>
<td valign="top" align="center">109</td>
<td valign="top" align="center">8.61</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">4.66</td>
</tr>
<tr>
<td valign="top" align="left">4.0&#x2013;4.5</td>
<td valign="top" align="center">107</td>
<td valign="top" align="center">9.00</td>
<td valign="top" align="center">9</td>
<td valign="top" align="center">4.96</td>
<td valign="top" align="center">107</td>
<td valign="top" align="center">9.82</td>
<td valign="top" align="center">9</td>
<td valign="top" align="center">4.96</td>
</tr>
<tr>
<td valign="top" align="left">4.6&#x2013;4.11</td>
<td valign="top" align="center">124</td>
<td valign="top" align="center">9.28</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">5.27</td>
<td valign="top" align="center">124</td>
<td valign="top" align="center">10.25</td>
<td valign="top" align="center">9</td>
<td valign="top" align="center">4.97</td>
</tr>
<tr>
<td valign="top" align="left">5.0&#x2013;5.5</td>
<td valign="top" align="center">152</td>
<td valign="top" align="center">10.33</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">5.51</td>
<td valign="top" align="center">152</td>
<td valign="top" align="center">11.10</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">5.13</td>
</tr>
<tr>
<td valign="top" align="left">5.6&#x2013;5.11</td>
<td valign="top" align="center">165</td>
<td valign="top" align="center">9.14</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">5.31</td>
<td valign="top" align="center">163</td>
<td valign="top" align="center">9.55</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">5.16</td>
</tr>
<tr>
<td valign="top" align="left">6.0&#x2013;6.5</td>
<td valign="top" align="center">119</td>
<td valign="top" align="center">8.49</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">5.20</td>
<td valign="top" align="center">119</td>
<td valign="top" align="center">9.21</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">5.16</td>
</tr>
<tr>
<td valign="top" align="left">6.6&#x2013;6.11</td>
<td valign="top" align="center">65</td>
<td valign="top" align="center">6.76</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">4.70</td>
<td valign="top" align="center">65</td>
<td valign="top" align="center">7.38</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">4.63</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>In addition to reporting by age group, we used regression models to examine how children&#x2019;s demographic characteristics (gender, age) and English learning experience (whether the kindergarten has foreign teachers, kindergarten tuition) predict children&#x2019;s English vocabulary skills.</p>
<p>In Form A, girls scored an average of 9.36 points (range: 0&#x2013;20, <italic>SD</italic> = 5.28), and boys scored an average of 8.88 points (range: 0&#x2013;20 points, <italic>SD</italic> = 5.23). In Form B, girls scored an average of 10.01 points (range: 0&#x2013;20, <italic>SD</italic> = 5.13), and boys an average of 9.6 points (range: 0&#x2013;20, <italic>SD</italic> = 5.10). We used a Bayesian <italic>t</italic>-test from the BayesFactor package (<xref ref-type="bibr" rid="B27">Morey et al., 2015</xref>) to test if either gender performed better than the other. A traditional <italic>t</italic>-test can determine if there are significant differences between the two groups. However, a non-significant result does not prove that there is no difference, as a non-significant result can also be due to insufficient data (<xref ref-type="bibr" rid="B13">Dienes, 2014</xref>). In contrast, a Bayesian <italic>t</italic>-test allows quantifying the evidence for or against there being a difference between two groups. The Bayes factor (i.e., the ratio of the likelihood of there being a difference between girls and boys to the likelihood of there not being a difference) was 0.19 for Form A and 0.15 for Form B, both of which are considered &#x201C;substantial evidence&#x201D; (<xref ref-type="bibr" rid="B39">Wetzels et al., 2011</xref>) for the absence of a gender difference.</p>
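<p>To make the interpretation of these Bayes factors concrete, the sketch below inverts each reported value to obtain the evidence in favor of no difference and assigns an evidence category. The cut-offs (3, 10, 30, and 100) follow the Jeffreys-style scheme commonly used with Bayes factors and are stated here as an assumption; the function names are hypothetical.</p>
<preformat>
```python
def evidence_for_null(bf10):
    """Convert BF10 (difference vs. no difference) into BF01 = 1 / BF10."""
    return 1.0 / bf10


def evidence_label(bf):
    """Evidence category for a Bayes factor of at least 1 (assumed cut-offs)."""
    if bf >= 100:
        return "decisive"
    if bf >= 30:
        return "very strong"
    if bf >= 10:
        return "strong"
    if bf >= 3:
        return "substantial"
    return "anecdotal"


# Bayes factors reported in the text for the gender comparison.
for form, bf10 in (("Form A", 0.19), ("Form B", 0.15)):
    bf01 = evidence_for_null(bf10)
    print(f"{form}: BF01 = {bf01:.2f} ({evidence_label(bf01)})")
```
</preformat>
<p>Both inverted values fall between 3 and 10, that is, the data are roughly five to seven times more likely under the hypothesis of no gender difference than under the alternative.</p>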
<p>To test the assumptions of the regression models, we computed the variance inflation factor (VIF) to check for multicollinearity, inspected a normal predicted probability plot to check for normality, and examined a residuals plot to evaluate the homoscedasticity of errors. Both tuition and foreign teacher showed signs of multicollinearity. However, we decided to retain these two variables in our models for two reasons. First, tuition and foreign teacher were theoretically important to our model and were potentially able to explain variance on different levels (classroom level and school level). Second, multicollinearity inflates the variance of the estimates and thus the risk of Type II errors, but as will be shown below, Type II errors did not occur (as there were no null results). The assumptions of normality and homoscedasticity were both met.</p>
<p><xref ref-type="table" rid="T8">Table 8</xref> shows the results of the multiple regression models. When controlling for foreign teacher, gender, housing price, and tuition, children&#x2019;s age was a significant positive predictor of Form A and Form B scores (standardized &#x03B2; = 0.29 and 0.28, respectively; both <italic>p</italic> &#x003C; 0.001). When controlling for foreign teacher, children&#x2019;s age, housing price, and tuition, girls scored higher than boys in Form A and Form B (&#x03B2; = 0.06 and 0.05, respectively; <italic>p</italic> &#x003C; 0.05). It is worth noting that when controlling for age, gender, housing price, and tuition, having a foreign teacher predicted higher scores in both Form A and Form B (&#x03B2; = 1.12 and 1.08, respectively; <italic>p</italic> &#x003C; 0.001). The standardized coefficient for having a foreign teacher is thus much larger than that for age (0.29 and 0.28).</p>
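<p>The variance inflation factor of a predictor is 1 / (1 &#x2212; R&#x00B2;), where R&#x00B2; comes from regressing that predictor on all other predictors. As a simplified sketch, the example below covers only the two-predictor case, in which R&#x00B2; reduces to the squared correlation between the two predictors. The data are hypothetical toy values, not study data, and the full models contain more predictors than shown here.</p>
<preformat>
```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equally long lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def vif_two_predictors(x1, x2):
    """VIF = 1 / (1 - R^2); with two predictors R^2 is the squared correlation."""
    r = pearson_r(x1, x2)
    return 1.0 / (1.0 - r * r)


# Toy data (hypothetical): foreign teacher (0/1) and tuition per kindergarten.
foreign_teacher = [0, 0, 0, 1, 1, 1]
tuition = [1000, 1200, 1100, 3000, 3500, 2800]
vif = vif_two_predictors(foreign_teacher, tuition)
print(f"VIF = {vif:.1f}")
```
</preformat>
<p>A VIF of 1 means the predictor is uncorrelated with the others; values above common rules of thumb such as 5 or 10 are usually read as a sign of multicollinearity.</p>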
<table-wrap position="float" id="T8">
<label>TABLE 8</label>
<caption><p>Results of multiple linear regression models predicting the ACCE-V scores.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td/>
<td valign="top" align="center" colspan="3">ACCE-V Form A<hr/></td>
<td valign="top" align="center" colspan="3">ACCE-V Form B<hr/></td>
</tr>
<tr>
<td/>
<td valign="top" align="center"><italic>b</italic></td>
<td valign="top" align="center">Standard error</td>
<td valign="top" align="center">&#x03B2;</td>
<td valign="top" align="center"><italic>b</italic></td>
<td valign="top" align="center">Standard error</td>
<td valign="top" align="center">&#x03B2;</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Intercept</td>
<td valign="top" align="center">&#x2013;4.81</td>
<td valign="top" align="center">0.90</td>
<td/>
<td valign="top" align="center">&#x2013;3.40</td>
<td valign="top" align="center">0.88</td>
<td/>
</tr>
<tr>
<td valign="top" align="left">Children&#x2019;s age (in months)</td>
<td valign="top" align="center">0.13<xref ref-type="table-fn" rid="t8fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">0.01</td>
<td valign="top" align="center">0.29<xref ref-type="table-fn" rid="t8fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">0.12<xref ref-type="table-fn" rid="t8fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">0.01</td>
<td valign="top" align="center">0.28<xref ref-type="table-fn" rid="t8fns1">&#x002A;&#x002A;&#x002A;</xref></td>
</tr>
<tr>
<td valign="top" align="left">Foreign teacher</td>
<td valign="top" align="center">13.68<xref ref-type="table-fn" rid="t8fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">1.25</td>
<td valign="top" align="center">1.12<xref ref-type="table-fn" rid="t8fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">12.79<xref ref-type="table-fn" rid="t8fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">1.23</td>
<td valign="top" align="center">1.08<xref ref-type="table-fn" rid="t8fns1">&#x002A;&#x002A;&#x002A;</xref></td>
</tr>
<tr>
<td valign="top" align="left">Gender (girl)</td>
<td valign="top" align="center">0.64<xref ref-type="table-fn" rid="t8fns1">&#x002A;</xref></td>
<td valign="top" align="center">0.25</td>
<td valign="top" align="center">0.06<xref ref-type="table-fn" rid="t8fns1">&#x002A;</xref></td>
<td valign="top" align="center">0.53<xref ref-type="table-fn" rid="t8fns1">&#x002A;</xref></td>
<td valign="top" align="center">0.25</td>
<td valign="top" align="center">0.05<xref ref-type="table-fn" rid="t8fns1">&#x002A;</xref></td>
</tr>
<tr>
<td valign="top" align="left">Housing price</td>
<td valign="top" align="center">0.06<xref ref-type="table-fn" rid="t8fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">0.01</td>
<td valign="top" align="center">0.33<xref ref-type="table-fn" rid="t8fns1">&#x002A;</xref></td>
<td valign="top" align="center">0.06<xref ref-type="table-fn" rid="t8fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">0.01</td>
<td valign="top" align="center">0.32<xref ref-type="table-fn" rid="t8fns1">&#x002A;&#x002A;&#x002A;</xref></td>
</tr>
<tr>
<td valign="top" align="left">Tuition</td>
<td valign="top" align="center">&#x2013;0.0005<xref ref-type="table-fn" rid="t8fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">&#x2013;0.58<xref ref-type="table-fn" rid="t8fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">&#x2013;0.0005<xref ref-type="table-fn" rid="t8fns1">&#x002A;&#x002A;&#x002A;</xref></td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">&#x2013;0.52<xref ref-type="table-fn" rid="t8fns1">&#x002A;&#x002A;&#x002A;</xref></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="t8fns1"><p><italic>&#x002A;p &#x003C; 0.05; &#x002A;&#x002A;&#x002A;p &#x003C; 0.001.</italic></p></fn>
</table-wrap-foot>
</table-wrap>
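<p><xref ref-type="table" rid="T8">Table 8</xref> reports both unstandardized coefficients (<italic>b</italic>, in score points) and standardized coefficients (&#x03B2;). The sketch below shows the usual relation between the two, &#x03B2; = <italic>b</italic> &#x00D7; SD(predictor) / SD(outcome), using gender in Form A as an example. The predictor standard deviation of 0.5 is an assumption (a 0/1 variable in a roughly balanced sample), and the standardization actually applied in the models may differ.</p>
<preformat>
```python
def standardized_beta(b, sd_predictor, sd_outcome):
    """Standardized coefficient from an unstandardized slope (assumed relation)."""
    return b * sd_predictor / sd_outcome


b_gender_form_a = 0.64  # unstandardized coefficient for gender, Form A (Table 8)
sd_form_a = 5.25        # overall Form A standard deviation reported in the text
sd_gender = 0.5         # assumption: 0/1 coding with a roughly balanced sample

beta_gender = standardized_beta(b_gender_form_a, sd_gender, sd_form_a)
print(f"approximate standardized beta for gender = {beta_gender:.2f}")
```
</preformat>
<p>Read this way, <italic>b</italic> gives the expected change in raw score points per unit of the predictor, while &#x03B2; expresses the change in standard deviations of the score per standard deviation of the predictor.</p>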
<p>As we hypothesized earlier, although significant, a child&#x2019;s age was not the most important predictor of English vocabulary knowledge. More important than age was the children&#x2019;s English learning environment. Children who studied in classes with foreign teachers had significantly higher scores than children without foreign teachers (Form A: <italic>b</italic> = 13.68, <italic>p</italic> &#x003C; 0.001; Form B: <italic>b</italic> = 12.79, <italic>p</italic> &#x003C; 0.001).</p>
</sec>
</sec>
<sec id="S4" sec-type="discussion">
<title>Discussion</title>
<p>Despite the growing demand for English instruction for young children in China, educators and researchers do not have the right tools that allow them to assess children&#x2019;s vocabulary knowledge. Existing tests like the PPVT and the BPVS are not suitable, because they were developed for first language learners. This is reflected both in the selection of items, which is based on the age of acquisition in an English-speaking environment, and in the depiction of items, which is in accordance with American and British cultural norms.</p>
<p>Our goal with the ACCE-V was to develop an English vocabulary test specifically for young Chinese learners of English. Drawn from textbooks used in Chinese elementary schools, the items are selected to capture school-relevant vocabulary rather than vocabulary that a child growing up in an English-speaking, Western environment would acquire through daily interactions. By using Chinese cultural visual conventions in the drawings, the ACCE-V improves children&#x2019;s chances of recognizing the intended meaning of a drawing. The multiple-choice format is familiar to Chinese children, and the test is easy to administer. Taking only 5&#x2013;10 min to complete, the ACCE-V is a short test and can be used in combination with other tests.</p>
<p>The authors and the PACE Research Institute will be responsible for holding and distributing the test. We expect two groups to be interested in this test: researchers and schools. Eligible researchers from universities or research institutions may contact the authors to request the test, and the PACE Research Institute is responsible for training the investigators who will administer it. Schools or school districts that wish to use the ACCE-V can contact PACE, and PACE will arrange for trained personnel to administer the test.</p>
<p>Two field studies with a combined sample size of more than 1,000 children showed that the ACCE-V has very good psychometric properties with respect to alternate form reliability, test&#x2013;retest reliability, and internal consistency. IRT analyses indicate the range of difficulty of the items is appropriate for the target population, and that the items are good at discriminating between children with lower ability and those with higher ability.</p>
<p>The test scores showed high correlations with the PPVT-4. Both the ACCE-V and the PPVT are intended to measure children&#x2019;s English receptive vocabulary, so a high correlation is expected and desirable as an indicator of concurrent (criterion) validity. However, as the analysis of the difficulty indices of items in the PPVT showed, some items that are supposed to be relatively easy for L1 speakers were difficult for our L2 learners, so the assumed progression in difficulty in the PPVT does not necessarily hold for L2 learners.</p>
<p>In our view, observations like these make the case for dedicated L2 vocabulary assessments like the ACCE-V. It is important to note that this is not a critique of the PPVT and similar tests as they are intended&#x2014;as assessments for L1 learners. Researchers and educators tend to use instruments meant for L1 learners, because they are widely used and because they have been psychometrically validated. However, the validation pertains to the target population (L1 learners), and the tests should not unquestioningly be assumed to be suitable for other populations. While the ACCE-V&#x2019;s concurrent validity has been demonstrated, we are planning to collect data on children&#x2019;s English grades in primary school to evaluate the ACCE-V&#x2019;s predictive validity.</p>
<p>The development of the ACCE-V receptive vocabulary test is part of a larger effort to develop culturally appropriate assessments for young EFL learners. This challenge is not limited to young Chinese learners of English.</p>
<p>We see our effort as related to the Global Englishes Language Teaching (GELT) framework (<xref ref-type="bibr" rid="B32">Rose and Galloway, 2019</xref>). The framework problematizes the conception of English as primarily the language of &#x201C;Inner Circle&#x201D; countries (<xref ref-type="bibr" rid="B20">Kachru, 1992</xref>) rather than as the globalized language it is, with many different contexts, uses, and users. Within the conventional approach to EFL teaching and testing, the ultimate goal of learning English is to achieve native-like proficiency, with native-likeness usually being defined by the (idealized) standards of American and British English. The target audience in this approach is also native speakers. This goal is often accompanied by the superimposition of certain norms of native English-speaking cultures on teaching materials and language tests. In contrast, approaches like GELT assume that the goals for learning English can be manifold. In the case of young EFL learners, the goal is usually to be able to follow and participate in English classes in elementary and middle school, with the longer-term goal of communicating with both native and non-native speakers, and of consuming and producing content from their own culture and from other cultures.</p>
<p>Moving toward a GELT approach in teaching and assessment in the early years is not a call to abandon any inclusion of culture from countries like Britain, Australia, or the United States. Rather, the appeal is to focus more on what is relevant to young L2 learners in their immediate environment and to become aware of and reduce the biases inherent to the traditional approaches. Tests and assessments are obviously only one aspect in the larger system of foreign language learning. However, there is general consensus that testing has an influence on teaching and learning, an effect termed &#x2018;washback&#x2019; (<xref ref-type="bibr" rid="B1">Alderson and Wall, 1993</xref>). Tests and examinations have a long tradition in China, and the Chinese educational system is geared toward tests and assessments, including English (<xref ref-type="bibr" rid="B9">Cheng and Curtis, 2009</xref>). With the growing demand for early English instruction and the current lack of evaluation criteria, new assessments in that domain are likely to influence teacher and parent education decisions. Introducing culturally appropriate language assessments may therefore be a useful way to initiate changes in early English language teaching. Thus, while the primary goal of the ACCE-V is to provide educators and researchers with a valid, culturally appropriate instrument to measure young Chinese children&#x2019;s English vocabulary, we hope that developments like this one can also have a positive influence on English teaching and testing culture in China in general.</p>
</sec>
<sec id="S5">
<title>Limitations</title>
<p>We recruited more than 1,000 children for this study, all of whom came from the two most economically developed cities in China. Therefore, the conclusions of this study should be extended with caution to children learning English in other areas, especially rural areas of China. In addition, our data are cross-sectional, and we did not collect longitudinal data; thus, we cannot determine whether the ACCE-V can capture the long-term development of children&#x2019;s English proficiency. In future research, we will expand our sample, recruit children from other cities, and track the English development of these children longitudinally.</p>
<p>We note that a few children (fewer than 3% of the total) achieved maximum scores, indicating the possibility of a ceiling effect. We plan to expand the current version of the ACCE-V to include more difficult items. Depending on the resulting length, future versions of the ACCE-V may include a stopping rule.</p>
</sec>
<sec id="S6" sec-type="conclusion">
<title>Conclusion</title>
<p>The goal of the ACCE-V is to provide educators and researchers with a valid, culturally appropriate instrument to measure young Chinese children&#x2019;s English vocabulary. In this study, we documented the design process of the ACCE-V and demonstrated its reliability and validity. We showed that the ACCE-V has good psychometric properties. The authors and the PACE Research Institute plan to make the ACCE-V available to qualified educators and researchers (e.g., certified practitioners at English education institutions, researchers with sufficient training in educational psychology) and to provide them with ACCE-V-related training. Before using the ACCE-V, testers must pass an examination set by the ACCE-V design team. We will use the ACCE-V, as an alternative vocabulary test for young English learners in China, to answer research questions related to Chinese children&#x2019;s English development, such as its relationship with Chinese proficiency, family socioeconomic status, and family literacy environment.</p>
</sec>
<sec id="S7" sec-type="data-availability">
<title>Data Availability Statement</title>
<p>The datasets presented in this article are not publicly available. The data that support the findings of this study are held by the PACE Research Institute and were used under license for the current study, so restrictions apply to their availability. Data are, however, available from the authors upon reasonable request and with the permission of the PACE Research Institute. Requests to access the datasets should be directed to PW, <email>lukewpz@gmail.com</email>.</p>
</sec>
<sec id="S8">
<title>Ethics Statement</title>
<p>The studies involving human participants were reviewed and approved by the PACE Research Institute. Written informed consent to participate in this study was provided by the participants&#x2019; legal guardian/next of kin.</p>
</sec>
<sec id="S9">
<title>Author Contributions</title>
<p>LR: conceptualization, methodology, formal analysis, writing&#x2014;original draft, and visualization. PW: resources, data curation, methodology, formal analysis, project administration, and visualization. SC: conceptualization, methodology, formal analysis, supervision, writing&#x2014;review and editing, project administration, and funding acquisition. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec id="conf1" sec-type="COI-statement">
<title>Conflict of Interest</title>
<p>The PACE Research Institute is applying for a patent in China for the ACCE-V and intends to use the test commercially. The authors have received consulting fees and travel grants from the PACE Research Institute and would receive royalties from the ACCE-V should the patent be granted and the test be used commercially.</p>
</sec>
<sec id="pudiscl1" sec-type="disclaimer">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<sec id="S10" sec-type="funding-information">
<title>Funding</title>
<p>This study was funded by PACE Research Institute (Shenzhen, China).</p>
</sec>
<ack>
<p>We thank all research assistants from PACE Research Institute for collecting data. We also thank all schools and administrators who participated in this study.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Alderson</surname> <given-names>J. C.</given-names></name> <name><surname>Wall</surname> <given-names>D.</given-names></name></person-group> (<year>1993</year>). <article-title>Does washback exist?</article-title> <source><italic>Appl. Linguist.</italic></source> <volume>14</volume> <fpage>115</fpage>&#x2013;<lpage>129</lpage>. <pub-id pub-id-type="doi">10.1093/applin/14.2.115</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Atwill</surname> <given-names>K.</given-names></name> <name><surname>Blanchard</surname> <given-names>J.</given-names></name> <name><surname>Gorin</surname> <given-names>J. S.</given-names></name> <name><surname>Burstein</surname> <given-names>K.</given-names></name></person-group> (<year>2007</year>). <article-title>Receptive vocabulary and cross-language transfer of phonemic awareness in kindergarten children.</article-title> <source><italic>J. Educ. Res.</italic></source> <volume>100</volume> <fpage>336</fpage>&#x2013;<lpage>346</lpage>. <pub-id pub-id-type="doi">10.3200/JOER.100.6.336-346</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><collab>Aurora Mobile</collab> (<year>2020</year>). <source><italic>2020 China Online K-12 English Education Industry Research Report.</italic></source> <publisher-loc>Shenzhen</publisher-loc>: <publisher-name>Aurora Mobile</publisher-name>.</citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baayen</surname> <given-names>R. H.</given-names></name> <name><surname>Piepenbrock</surname> <given-names>R.</given-names></name> <name><surname>Gulikers</surname> <given-names>L.</given-names></name></person-group> (<year>1996</year>). <source><italic>The CELEX Lexical Database (CD-Rom).</italic></source> <publisher-loc>Philadelphia, PA</publisher-loc>: <publisher-name>University of Pennsylvania</publisher-name>.</citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brysbaert</surname> <given-names>M.</given-names></name> <name><surname>New</surname> <given-names>B.</given-names></name></person-group> (<year>2009</year>). <article-title>Moving beyond Ku&#x010D;era and Francis: a critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English.</article-title> <source><italic>Behav. Res. Methods</italic></source> <volume>41</volume> <fpage>977</fpage>&#x2013;<lpage>990</lpage>. <pub-id pub-id-type="doi">10.3758/BRM.41.4.977</pub-id> <pub-id pub-id-type="pmid">19897807</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brysbaert</surname> <given-names>M.</given-names></name> <name><surname>Warriner</surname> <given-names>A. B.</given-names></name> <name><surname>Kuperman</surname> <given-names>V.</given-names></name></person-group> (<year>2014</year>). <article-title>Concreteness ratings for 40 thousand generally known English word lemmas.</article-title> <source><italic>Behav. Res. Methods</italic></source> <volume>46</volume> <fpage>904</fpage>&#x2013;<lpage>911</lpage>. <pub-id pub-id-type="doi">10.3758/s13428-013-0403-5</pub-id> <pub-id pub-id-type="pmid">24142837</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>S.</given-names></name> <name><surname>Lawrence</surname> <given-names>J. F.</given-names></name> <name><surname>Zhou</surname> <given-names>J.</given-names></name> <name><surname>Min</surname> <given-names>L.</given-names></name> <name><surname>Snow</surname> <given-names>C. E.</given-names></name></person-group> (<year>2018</year>). <article-title>The efficacy of a school-based book-reading intervention on vocabulary development of young Uyghur children: a randomized controlled trial.</article-title> <source><italic>Early Child. Res. Q.</italic></source> <volume>44</volume> <fpage>206</fpage>&#x2013;<lpage>219</lpage>. <pub-id pub-id-type="doi">10.1016/j.ecresq.2017.12.008</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>S.</given-names></name> <name><surname>Zhao</surname> <given-names>J.</given-names></name> <name><surname>de Ruiter</surname> <given-names>L.</given-names></name> <name><surname>Zhou</surname> <given-names>J.</given-names></name> <name><surname>Huang</surname> <given-names>J.</given-names></name></person-group> (<year>2020</year>). <article-title>A burden or a boost: the impact of early childhood English learning experience on lower elementary English and Chinese achievement.</article-title> <source><italic>Int. J. Biling. Educ. Biling.</italic></source> <fpage>1</fpage>&#x2013;<lpage>18</lpage>. <pub-id pub-id-type="doi">10.1080/13670050.2020.1749230</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="editor"><name><surname>Cheng</surname> <given-names>L.</given-names></name> <name><surname>Curtis</surname> <given-names>A.</given-names></name></person-group> <role>(eds)</role>. (<year>2009</year>). &#x201C;<article-title>The realities of English language assessment and the Chinese learner in China and beyond</article-title>,&#x201D; in <source><italic>English Language Assessment and the Chinese Learner</italic></source>, (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Routledge</publisher-name>), <fpage>3</fpage>&#x2013;<lpage>12</lpage>.</citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chik</surname> <given-names>A.</given-names></name> <name><surname>Besser</surname> <given-names>S.</given-names></name></person-group> (<year>2011</year>). <article-title>International language test taking among young learners: a Hong Kong case study.</article-title> <source><italic>Lang. Assess. Q.</italic></source> <volume>8</volume> <fpage>73</fpage>&#x2013;<lpage>91</lpage>. <pub-id pub-id-type="doi">10.1080/15434303.2010.537417</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cote</surname> <given-names>L. R.</given-names></name> <name><surname>Bornstein</surname> <given-names>M. H.</given-names></name></person-group> (<year>2014</year>). <article-title>Productive vocabulary among three groups of bilingual American children: comparison and prediction.</article-title> <source><italic>First Lang.</italic></source> <volume>34</volume> <fpage>467</fpage>&#x2013;<lpage>485</lpage>. <pub-id pub-id-type="doi">10.1177/0142723714560178</pub-id> <pub-id pub-id-type="pmid">25620820</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Curtain</surname> <given-names>H.</given-names></name> <name><surname>Dahlberg</surname> <given-names>C. A.</given-names></name></person-group> (<year>2010</year>). <source><italic>Languages and Children: Making the Match, New Languages for Young Learners, Grades K-8</italic></source>, <edition>4th Edn</edition>. <publisher-loc>London</publisher-loc>: <publisher-name>Pearson</publisher-name>.</citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dienes</surname> <given-names>Z.</given-names></name></person-group> (<year>2014</year>). <article-title>Using Bayes to get the most out of non-significant results.</article-title> <source><italic>Front. Psychol.</italic></source> <volume>5</volume>:<issue>781</issue>. <pub-id pub-id-type="doi">10.3389/fpsyg.2014.00781</pub-id> <pub-id pub-id-type="pmid">25120503</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dunn</surname> <given-names>D. M.</given-names></name> <name><surname>Dunn</surname> <given-names>L. M.</given-names></name> <name><surname>Styles</surname> <given-names>B.</given-names></name> <name><surname>Sewell</surname> <given-names>J.</given-names></name></person-group> (<year>2009</year>). <source><italic>The British Picture Vocabulary Scale III</italic></source>, <edition>3rd Edn</edition>. <publisher-loc>London</publisher-loc>: <publisher-name>GL Assessment</publisher-name>.</citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dunn</surname> <given-names>L. M.</given-names></name> <name><surname>Dunn</surname> <given-names>D. M.</given-names></name></person-group> (<year>2007</year>). <source><italic>PPVT-4: Peabody Picture Vocabulary Test.</italic></source> <publisher-loc>San Antonio, TX</publisher-loc>: <publisher-name>Pearson Assessments</publisher-name>.</citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Feng</surname> <given-names>A.</given-names></name></person-group> (<year>2012</year>). <article-title>Spread of English across greater China.</article-title> <source><italic>J. Multiling. Multicult. Dev.</italic></source> <volume>33</volume> <fpage>363</fpage>&#x2013;<lpage>377</lpage>.</citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goriot</surname> <given-names>C.</given-names></name> <name><surname>van Hout</surname> <given-names>R.</given-names></name> <name><surname>Broersma</surname> <given-names>M.</given-names></name> <name><surname>Lobo</surname> <given-names>V.</given-names></name> <name><surname>McQueen</surname> <given-names>J. M.</given-names></name> <name><surname>Unsworth</surname> <given-names>S.</given-names></name></person-group> (<year>2018</year>). <article-title>Using the Peabody Picture Vocabulary Test in L2 children and adolescents: effects of L1.</article-title> <source><italic>Int. J. Biling. Educ. Biling.</italic></source> <volume>24</volume> <fpage>546</fpage>&#x2013;<lpage>568</lpage>. <pub-id pub-id-type="doi">10.1080/13670050.2018.1494131</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gr&#x00F8;ver</surname> <given-names>V.</given-names></name> <name><surname>Lawrence</surname> <given-names>J.</given-names></name> <name><surname>Rydland</surname> <given-names>V.</given-names></name></person-group> (<year>2018</year>). <article-title>Bilingual preschool children&#x2019;s second-language vocabulary development: the role of first-language vocabulary skills and second-language talk input.</article-title> <source><italic>Int. J. Biling.</italic></source> <volume>22</volume> <fpage>234</fpage>&#x2013;<lpage>250</lpage>. <pub-id pub-id-type="doi">10.1177/1367006916666389</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hu</surname> <given-names>G.</given-names></name> <name><surname>McKay</surname> <given-names>S. L.</given-names></name></person-group> (<year>2012</year>). <article-title>English language education in East Asia: some recent developments.</article-title> <source><italic>J. Multiling. Multicult. Dev.</italic></source> <volume>33</volume> <fpage>345</fpage>&#x2013;<lpage>362</lpage>. <pub-id pub-id-type="doi">10.1080/01434632.2012.661434</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="editor"><name><surname>Kachru</surname> <given-names>B. B.</given-names></name></person-group> <role>(ed.)</role>. (<year>1992</year>). &#x201C;<article-title>Teaching world Englishes</article-title>,&#x201D; in <source><italic>The Other Tongue: English Across Cultures</italic></source>, <volume>Vol. 2</volume> (<publisher-loc>Urbana</publisher-loc>: <publisher-name>University of Illinois Press</publisher-name>), <fpage>355</fpage>&#x2013;<lpage>366</lpage>.</citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Karlsen</surname> <given-names>J.</given-names></name> <name><surname>Lyster</surname> <given-names>S.-A. H.</given-names></name> <name><surname>Lerv&#x00E5;g</surname> <given-names>A.</given-names></name></person-group> (<year>2017</year>). <article-title>Vocabulary development in Norwegian L1 and L2 learners in the kindergarten&#x2013;school transition.</article-title> <source><italic>J. Child Lang.</italic></source> <volume>44</volume> <fpage>402</fpage>&#x2013;<lpage>426</lpage>. <pub-id pub-id-type="doi">10.1017/S0305000916000106</pub-id> <pub-id pub-id-type="pmid">26951479</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lu</surname> <given-names>L.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name></person-group> (<year>1998</year>). <source><italic>The Peabody Picture Vocabulary Test-Revised in Chinese.</italic></source> <publisher-loc>Taipei</publisher-loc>: <publisher-name>Psychological Publishing</publisher-name>.</citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mantel</surname> <given-names>N.</given-names></name> <name><surname>Haenszel</surname> <given-names>W.</given-names></name></person-group> (<year>1959</year>). <article-title>Statistical aspects of the analysis of data from retrospective studies of disease.</article-title> <source><italic>J. Natl. Cancer Inst.</italic></source> <volume>22</volume> <fpage>719</fpage>&#x2013;<lpage>748</lpage>. <pub-id pub-id-type="pmid">13655060</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><collab>Ministry of Education of the People&#x2019;s Republic of China</collab> (<year>2001</year>). <source><italic>Guideline for Promoting English Teaching in Primary Schools.</italic></source> Available Online at: <ext-link ext-link-type="uri" xlink:href="http://www.moe.gov.cn/srcsite/A26/s7054/200101/t20010120_166075.html">http://www.moe.gov.cn/srcsite/A26/s7054/200101/t20010120_166075.html</ext-link> <comment>[accessed January 20, 2001]</comment>.</citation></ref>
<ref id="B25"><citation citation-type="journal"><collab>Ministry of Education of the People&#x2019;s Republic of China</collab> (<year>2018</year>). <source><italic>Notice of the General Office of the Ministry of Education on Launching the Special Governance Work for &#x201C;Primary Schooling.&#x201D;</italic></source> Available Online at: <ext-link ext-link-type="uri" xlink:href="http://www.moe.gov.cn/srcsite/A06/s3327/201807/t20180713_342997.html">http://www.moe.gov.cn/srcsite/A06/s3327/201807/t20180713_342997.html</ext-link> <comment>[accessed July 5, 2018]</comment>.</citation></ref>
<ref id="B26"><citation citation-type="journal"><collab>Ministry of Education of the People&#x2019;s Republic of China</collab> (<year>2021</year>). <source>Opinions Issued to Further Alleviate Homework Burden and Off-Campus Tutoring for Students Undergoing Compulsory Education</source>. Available Online at: <ext-link ext-link-type="uri" xlink:href="http://www.moe.gov.cn/jyb_xxgk/moe_1777/moe_1778/202107/t20210724_546576.html">http://www.moe.gov.cn/jyb_xxgk/moe_1777/moe_1778/202107/t20210724_546576.html</ext-link></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Morey</surname> <given-names>R. D.</given-names></name> <name><surname>Rouder</surname> <given-names>J. N.</given-names></name> <name><surname>Jamil</surname> <given-names>T.</given-names></name></person-group> (<year>2015</year>). <source><italic>BayesFactor: Computation of Bayes Factors for Common Designs.</italic></source> Available Online at: <ext-link ext-link-type="uri" xlink:href="https://cran.r-project.org/package=BayesFactor">https://cran.r-project.org/package=BayesFactor</ext-link></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nation</surname> <given-names>I.</given-names></name></person-group> (<year>2006</year>). <article-title>How large a vocabulary is needed for reading and listening?</article-title> <source><italic>Can. Mod. Lang. Rev.</italic></source> <volume>63</volume> <fpage>59</fpage>&#x2013;<lpage>82</lpage>. <pub-id pub-id-type="doi">10.3138/cmlr.63.1.59</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ninio</surname> <given-names>A.</given-names></name></person-group> (<year>1983</year>). <article-title>Joint book reading as a multiple vocabulary acquisition device.</article-title> <source><italic>Dev. Psychol.</italic></source> <volume>19</volume> <fpage>445</fpage>&#x2013;<lpage>451</lpage>. <pub-id pub-id-type="doi">10.1037/0012-1649.19.3.445</pub-id></citation></ref>
<ref id="B100"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Paivio</surname> <given-names>A.</given-names></name> <name><surname>Yuille</surname> <given-names>J. C.</given-names></name> <name><surname>Madigan</surname> <given-names>S. A.</given-names></name></person-group> (<year>1968</year>). <article-title>Concreteness, imagery, and meaningfulness values for 925 nouns</article-title>. <source><italic>J. Exp. Psychol.</italic></source> <volume>76</volume>, <fpage>1</fpage>&#x2013;<lpage>25</lpage>. <pub-id pub-id-type="doi">10.1037/h0025327</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pe&#x00F1;a</surname> <given-names>E. D.</given-names></name> <name><surname>Halle</surname> <given-names>T. G.</given-names></name></person-group> (<year>2011</year>). <article-title>Assessing preschool dual language learners: traveling a multiforked road: assessing preschool dual language learners.</article-title> <source><italic>Child Dev. Perspect.</italic></source> <volume>5</volume> <fpage>28</fpage>&#x2013;<lpage>32</lpage>. <pub-id pub-id-type="doi">10.1111/j.1750-8606.2010.00143.x</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><collab>R Core Team</collab> (<year>2019</year>). <source><italic>R: A Language and Environment for Statistical Computing.</italic></source> <publisher-loc>Vienna</publisher-loc>: <publisher-name>R Foundation for Statistical Computing</publisher-name>.</citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rose</surname> <given-names>H.</given-names></name> <name><surname>Galloway</surname> <given-names>N.</given-names></name></person-group> (<year>2019</year>). <source><italic>Global Englishes for Language Teaching</italic></source>, <edition>1st Edn</edition>. <publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>. <pub-id pub-id-type="doi">10.1017/9781316678343</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmitt</surname> <given-names>N.</given-names></name></person-group> (<year>2010</year>). <source><italic>Researching Vocabulary: A Vocabulary Research Manual.</italic></source> <publisher-loc>Berlin</publisher-loc>: <publisher-name>Springer</publisher-name>.</citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shin</surname> <given-names>J. K.</given-names></name> <name><surname>Crandall</surname> <given-names>J. A.</given-names></name></person-group> (<year>2014</year>). <source><italic>Teaching Young Learners English: From Theory to Practice.</italic></source> <publisher-loc>Boston, MA</publisher-loc>: <publisher-name>Cengage</publisher-name>.</citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Snow</surname> <given-names>C. E.</given-names></name> <name><surname>Barnes</surname> <given-names>W. S.</given-names></name> <name><surname>Chandler</surname> <given-names>J.</given-names></name> <name><surname>Goodman</surname> <given-names>I. F.</given-names></name> <name><surname>Hemphill</surname> <given-names>L.</given-names></name></person-group> (<year>1991</year>). <source><italic>Unfulfilled Expectations: Home and School Influences on Literacy.</italic></source> <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>Harvard University Press</publisher-name>.</citation></ref>
<ref id="B36"><citation citation-type="journal"><collab>StataCorp</collab> (<year>2017</year>). <source><italic>Stata 15.0 [Computer Software].</italic></source> <publisher-loc>College Station, TX</publisher-loc>: <publisher-name>StataCorp</publisher-name>.</citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sun</surname> <given-names>H.</given-names></name> <name><surname>Steinkrauss</surname> <given-names>R.</given-names></name> <name><surname>Tendeiro</surname> <given-names>J.</given-names></name> <name><surname>De Bot</surname> <given-names>K.</given-names></name></person-group> (<year>2016</year>). <article-title>Individual differences in very young children&#x2019;s English acquisition in China: internal and external factors.</article-title> <source><italic>Bilingualism</italic></source> <volume>19</volume> <fpage>550</fpage>&#x2013;<lpage>566</lpage>. <pub-id pub-id-type="doi">10.1017/S1366728915000243</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ward</surname> <given-names>J.</given-names></name> <name><surname>Chuenjundaeng</surname> <given-names>J.</given-names></name></person-group> (<year>2009</year>). <article-title>Suffix knowledge: acquisition and applications.</article-title> <source><italic>System</italic></source> <volume>37</volume> <fpage>461</fpage>&#x2013;<lpage>469</lpage>. <pub-id pub-id-type="doi">10.1016/j.system.2009.01.004</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wetzels</surname> <given-names>R.</given-names></name> <name><surname>Matzke</surname> <given-names>D.</given-names></name> <name><surname>Lee</surname> <given-names>M. D.</given-names></name> <name><surname>Rouder</surname> <given-names>J. N.</given-names></name> <name><surname>Iverson</surname> <given-names>G. J.</given-names></name> <name><surname>Wagenmakers</surname> <given-names>E.-J.</given-names></name></person-group> (<year>2011</year>). <article-title>Statistical evidence in experimental psychology: an empirical comparison using 855 t tests.</article-title> <source><italic>Perspect. Psychol. Sci.</italic></source> <volume>6</volume> <fpage>291</fpage>&#x2013;<lpage>298</lpage>. <pub-id pub-id-type="doi">10.1177/1745691611406923</pub-id> <pub-id pub-id-type="pmid">26168519</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wood</surname> <given-names>C.</given-names></name> <name><surname>Stockholm</surname> <given-names>M.</given-names></name> <name><surname>Cearley</surname> <given-names>J.</given-names></name> <name><surname>Sheffield-Anderson</surname> <given-names>L.</given-names></name></person-group> (<year>2015</year>). <article-title>Lexical considerations for standardized vocabulary testing with young Spanish-English speakers.</article-title> <source><italic>Contemp. Issues Commun. Sci. Disord.</italic></source> <volume>42</volume> <fpage>202</fpage>&#x2013;<lpage>214</lpage>. <pub-id pub-id-type="doi">10.1044/cicsd_42_f_202</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="footnote1">
<label>1</label>
<p>Age is reported in the format years;months.</p></fn>
<fn id="footnote2">
<label>2</label>
<p>The category &#x2018;attributes&#x2019; comprises both adjectives and adverbs.</p></fn>
<fn id="footnote3">
<label>3</label>
<p>The mean concreteness scores for these words have been reported as 2.15 for <italic>difference</italic>, 2 for <italic>again</italic>, 5 for <italic>banana</italic>, and 3.75 for <italic>running</italic> (<xref ref-type="bibr" rid="B6">Brysbaert et al., 2014</xref>).</p></fn>
</fn-group>
</back>
</article>
