<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Psychol.</journal-id>
<journal-title>Frontiers in Psychology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Psychol.</abbrev-journal-title>
<issn pub-type="epub">1664-1078</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpsyg.2023.1096399</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Psychology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>The difficulty of oral speech act production tasks in second language pragmatics testing</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Huang</surname>
<given-names>Weiying</given-names>
</name>
<xref rid="aff1" ref-type="aff"><sup>1</sup></xref>
<xref rid="c001" ref-type="corresp"><sup>&#x002A;</sup></xref>
<xref rid="fn0001" ref-type="author-notes"><sup>&#x2020;</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/1777142/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Lu</surname>
<given-names>Xiaofei</given-names>
</name>
<xref rid="aff2" ref-type="aff"><sup>2</sup></xref>
<xref rid="fn0001" ref-type="author-notes"><sup>&#x2020;</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/1247418/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>School of Foreign Languages, East China University of Technology</institution>, <addr-line>Nanchang</addr-line>, <country>China</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Applied Linguistics, The Pennsylvania State University</institution>, <addr-line>University Park, PA</addr-line>, <country>United States</country></aff>
<author-notes>
<fn id="fn0002" fn-type="edited-by"><p>Edited by: Gary Morgan, Fundaci&#x00F3; per a la Universitat Oberta de Catalunya, Spain</p></fn>
<fn id="fn0003" fn-type="edited-by"><p>Reviewed by: Eliseo Diez-Itza, University of Oviedo, Spain; Musa Nushi, Shahid Beheshti University, Iran; Balachandran Vadivel, Cihan University-Duhok, Iraq</p></fn>
<corresp id="c001">&#x002A;Correspondence: Weiying Huang, &#x02709; <email>huangariel@163.com</email></corresp>
<fn id="fn0001" fn-type="equal"><p><sup>&#x2020;</sup>These authors have contributed equally to this work and share first authorship</p></fn>
<fn id="fn0004" fn-type="other"><p>This article was submitted to Language Sciences, a section of the journal Frontiers in Psychology</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>03</day>
<month>02</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>14</volume>
<elocation-id>1096399</elocation-id>
<history>
<date date-type="received">
<day>12</day>
<month>11</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>17</day>
<month>01</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2023 Huang and Lu.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Huang and Lu</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>This study examined the relative difficulty of oral speech act production tasks involving eight different types of speech acts for Chinese English as a foreign language (EFL) learners and the effects of three contextual variables, namely, power, social distance, and imposition, on such difficulty. Eight Oral Discourse Completion Task items, each representing a unique combination of the three contextual variables, were designed for each speech act. Eighty Chinese EFL learners responded to these items and their responses were rated for appropriateness by two native-speaking college English instructors. A Many-facet Rasch Measurement analysis suggested that the eight speech acts can be ordered by ascending difficulty as follows: Thank, Request, Suggestion, Disagreement, Invitation, Refusal, Offer, and Apology. Significant effects on performance scores were found for the interaction between each of the three contextual variables and speech act, and the specific effects observed varied by speech act. The implications of our findings for L2 pragmatics testing are discussed.</p>
</abstract>
<kwd-group>
<kwd>pragmatic ability</kwd>
<kwd>speech acts</kwd>
<kwd>situational variables</kwd>
<kwd>task difficulty estimates</kwd>
<kwd>L2 pragmatics testing</kwd>
</kwd-group>
<counts>
<fig-count count="4"/>
<table-count count="8"/>
<equation-count count="0"/>
<ref-count count="63"/>
<page-count count="10"/>
<word-count count="8909"/>
</counts>
</article-meta>
</front>
<body>
<sec id="sec1" sec-type="intro">
<title>Introduction</title>
<p>Pragmatic ability, that is, the ability to understand the intended meanings communicated by the speaker and to use language appropriately in various communicative contexts (<xref ref-type="bibr" rid="ref51">Ross and Kasper, 2013</xref>; <xref ref-type="bibr" rid="ref44">Ren, 2022</xref>), is a crucial component in models of communicative language ability (<xref ref-type="bibr" rid="ref42">Purpura, 2004</xref>; <xref ref-type="bibr" rid="ref6">Bachman and Palmer, 2010</xref>). Albeit recent developments in second language pragmatics testing have shown a growing interest in interactive, discursively oriented assessment of interactional competence (for instance, <xref ref-type="bibr" rid="ref25">Grabowski, 2009</xref>, <xref ref-type="bibr" rid="ref26">2013</xref>; <xref ref-type="bibr" rid="ref61">Youn, 2015</xref>, <xref ref-type="bibr" rid="ref62">2019</xref>; <xref ref-type="bibr" rid="ref31">Ikeda, 2017</xref>; <xref ref-type="bibr" rid="ref22">Galaczi and Taylor, 2018</xref>), an important part of second language (L2) pragmatics testing involves assessing L2 learners&#x2019; ability to realize different speech acts under different circumstances (<xref ref-type="bibr" rid="ref51">Ross and Kasper, 2013</xref>). Research in this area has attended to the effects of different task features and contextual variables on the difficulty of pragmatic tasks (e.g., <xref ref-type="bibr" rid="ref28">Hudson, 2001</xref>; <xref ref-type="bibr" rid="ref55">Taguchi, 2007</xref>; <xref ref-type="bibr" rid="ref62">Youn, 2019</xref>). At the same time, while language users&#x2019; ability to perform various speech acts has been recognized as the universality of pragmatics (<xref ref-type="bibr" rid="ref52">Searle, 1969</xref>), linguistic means to engage in those speech acts and the socio-pragmatic norms associated with them exhibit considerable variation across languages and cultures (<xref ref-type="bibr" rid="ref56">Taguchi, 2012</xref>). This variation poses challenges for learning L2 speech acts and points to the need to take first language (L1) cultural background into account in assessing task difficulty. As identified in <xref ref-type="bibr" rid="ref49">Roever&#x2019;s (2007)</xref> study, one fourth of his test items in a pragmatics test showed differential functioning for test takers of Asian and European background. Indeed, a few studies have designed or evaluated L2 pragmatics tests with learners&#x2019; L1 background in mind (e.g., <xref ref-type="bibr" rid="ref21">Fulcher and Reiter, 2003</xref>; <xref ref-type="bibr" rid="ref37">Liu, 2006</xref>, <xref ref-type="bibr" rid="ref38">2007</xref>). However, systematical explorations of the difficulty of L2 oral production tasks involving a diverse range of speech acts and representing diverse combinations of contextual factors for learners from a specific L1 cultural background remain scant.</p>
<sec id="sec2">
<title>Task difficulty in oral proficiency assessment</title>
<p>Commonly used frameworks of task difficulty within second language acquisition (SLA) have focused on analyzing the degree of cognitive load and complexity of tasks (e.g., <xref ref-type="bibr" rid="ref54">Skehan, 1998</xref>; <xref ref-type="bibr" rid="ref45">Robinson, 2001</xref>). <xref ref-type="bibr" rid="ref54">Skehan&#x2019;s (1998)</xref> Limited Attentional Capacity Model and <xref ref-type="bibr" rid="ref45">Robinson&#x2019;s (2001)</xref> Cognition Hypothesis both hypothesize that manipulating the cognitive complexity and communicative requirements of a task will produce differential cognitive and communicative demands and affect the accuracy and complexity of the language that learners use to perform the task. <xref ref-type="bibr" rid="ref54">Skehan (1998)</xref> proposed three dimensions of task difficulty: code complexity (i.e., the variety and difficulty of the linguistic forms required for performing the task), cognitive complexity (i.e., the cognitive processing demands of the task content, such as the type of information to be processed), and communicative stress (i.e., stress caused by task-related factors such as time pressure). His model predicts a competition between accuracy and complexity as a result of limited attentional resources. <xref ref-type="bibr" rid="ref45">Robinson&#x2019;s (2001)</xref> triadic framework distinguishes task complexity features affected by cognitive factors (e.g., number of elements to deal with) from task condition features affected by interactional factors (e.g., power difference of the interlocutors) and task difficulty features affected by learner factors (e.g., learner motivation). His Cognition Hypothesis claims that increased task complexity may simultaneously promote linguistic complexity and accuracy as learners will activate and allocate more attentional resources to handle the higher cognitive load.</p>
<p>A few language assessment studies have applied these cognitive models of task complexity to examine the effect of varying task conditions on task difficulty in speaking tests. Based on <xref ref-type="bibr" rid="ref54">Skehan&#x2019;s (1998)</xref> cognitive complexity framework, <xref ref-type="bibr" rid="ref32">Iwashita et al. (2001)</xref> manipulated the performance conditions of a series of picture-based narrative task in terms of perspective (first vs. third person perspective), immediacy (here and now vs. there and then), adequacy (a complete set of pictures vs. an incomplete set), and planning time (no planning time vs. 3&#x2009;min planning time). They found no significant effect of the varying performance conditions on either the test-takers&#x2019; discourse in terms of fluency, complexity, or accuracy or the quality ratings of their performance. <xref ref-type="bibr" rid="ref19">Elder et al. (2002)</xref> further reported that the varying performance conditions did not affect task difficulty as perceived by the test-takers. They concluded that their results did not support Skehan&#x2019;s framework in the case of oral proficiency assessment. The lack of score sensitivity to varying task conditions in speaking tests has also been reported in other studies (<xref ref-type="bibr" rid="ref20">Fulcher, 1996</xref>; <xref ref-type="bibr" rid="ref21">Fulcher and Reiter, 2003</xref>). Accordingly, <xref ref-type="bibr" rid="ref21">Fulcher and Reiter (2003)</xref> suggested that L2 pragmatics test designers &#x201C;may look to pragmatic categories and cultural factors to develop task types&#x201D; (p. 339).</p>
</sec>
<sec id="sec3">
<title>Speech acts, contextual variables, and task difficulty in L2 pragmatics testing</title>
<p>A common way to attend to pragmatic categories in L2 pragmatics testing has been to look at different speech acts. Indeed, the speech act paradigm has played an important role in pragmatics testing since the 1980s, with the influence of studies in the Cross-Cultural Speech Act Realization Patterns (CCSARP) project initiated to investigate cross-cultural variations in speech act realization (<xref ref-type="bibr" rid="ref17">Cohen and Olshtain, 1981</xref>; <xref ref-type="bibr" rid="ref10">Blum-Kulka et al., 1989</xref>). Given that the linguistic realization patterns of speech acts have been found to differ from culture to culture (<xref ref-type="bibr" rid="ref23">Gass and Neu, 1996</xref>; <xref ref-type="bibr" rid="ref56">Taguchi, 2012</xref>), L2 learners&#x2019; pragmatic ability to realize different speech acts in the target language has been recognized as an essential component of their L2 communicative language ability (<xref ref-type="bibr" rid="ref4">Bachman, 1990</xref>; <xref ref-type="bibr" rid="ref5">Bachman and Palmer, 1996</xref>, <xref ref-type="bibr" rid="ref6">2010</xref>) and a prominent target construct of L2 pragmatics testing (<xref ref-type="bibr" rid="ref50">Roever, 2011</xref>).</p>
<p>Pragmatics tests of speech act realization have drawn heavily from Speech Act theory (<xref ref-type="bibr" rid="ref52">Searle, 1969</xref>) and Politeness theory (<xref ref-type="bibr" rid="ref14">Brown and Levinson, 1987</xref>). Speech Act theory views as the minimum unit of human communication the performance of different acts through language (e.g., apology and refusal) and distinguishes direct speech acts, where the speaker directly states the intended meaning, usually with certain conventionalized linguistic forms, from indirect ones, where the speaker says more than or something other than the intended meaning (<xref ref-type="bibr" rid="ref53">Searle, 1975</xref>). In Politeness theory, the directness of speech acts is seen to vary systematically with three contextual properties defined <italic>a priori</italic>, i.e., power, social distance, and rank of imposition (<xref ref-type="bibr" rid="ref14">Brown and Levinson, 1987</xref>). L2 pragmatics tests commonly examine L2 learners&#x2019; realization of different speech acts in situations with different contextual properties, although the most commonly investigated types of speech acts have centered around apology, refusal, and request (<xref ref-type="bibr" rid="ref29">Hudson et al., 1992</xref>, <xref ref-type="bibr" rid="ref30">1995</xref>; <xref ref-type="bibr" rid="ref58">Yamashita, 1996</xref>; <xref ref-type="bibr" rid="ref59">Yoshitake, 1997</xref>; <xref ref-type="bibr" rid="ref2">Ahn, 2005</xref>; <xref ref-type="bibr" rid="ref47">Roever, 2005</xref>, <xref ref-type="bibr" rid="ref48">2006</xref>; <xref ref-type="bibr" rid="ref37">Liu, 2006</xref>, <xref ref-type="bibr" rid="ref38">2007</xref>).</p>
<p>Among the task types used to test speech act production in pragmatics testing, Discourse Completion Tasks (DCTs) are used more widely than other types such as role plays and sociopragmatic judgment tasks (<xref ref-type="bibr" rid="ref40">Mart&#x00ED;nez-Flor and Us&#x00F3;-Juan, 2010</xref>). Although DCTs are artificial in nature (<xref ref-type="bibr" rid="ref11">Brown, 2001</xref>; <xref ref-type="bibr" rid="ref24">Golato, 2003</xref>), they allow for the evaluation of learners&#x2019; pragmatic knowledge and are the most prevalent data collection method in L2 pragmatics. <xref ref-type="bibr" rid="ref29">Hudson et al. (1992</xref>, <xref ref-type="bibr" rid="ref30">1995)</xref> designed a prototypical pragmatics test battery for apology, refusal, and request, which included six types of DCTs, namely, Written Discourse Completion Tasks (WDCT), Multiple-Choice Discourse Completion Tasks (MDCT), Oral Discourse Completion Tasks (ODCTs), Discourse Role-Play Tasks (DRPT), Discourse Self-Assessment Tasks (DSAT), and Role-Play Self-assessments (RPSA). All tasks other than self-assessments were designed around high/low settings of power, social distance, and imposition (<xref ref-type="bibr" rid="ref14">Brown and Levinson, 1987</xref>), rendering eight combinations of these contextual variables. Each task required test-takers to produce an oral or written response to a specific scenario representing a particular combination of contextual variables.</p>
<p>A limited number of studies have examined how pragmatic production tasks involving different speech acts compared with each other in terms of difficulty or how different contextual variables affect the difficulty of such tasks, sometimes with attention to the effects of assessment methods and/or L1 cultural background. <xref ref-type="bibr" rid="ref28">Hudson (2001)</xref> examined the effects of three assessment methods (i.e., WDCTs, language lab DCTs, and role-play scenarios) and three contextual variables (i.e., power, social distance, and imposition) on the scores assigned to pragmatic productions tasks involving three speech acts (i.e., apologies, refusals, and requests) among Japanese English as a second language (ESL) learners. He found that lab DCTs were slightly more difficult than the other two methods and that apologies were rated slightly higher than refusals and requests. He reported minimal effects of the contextual variables on the scores, with only imposition showing a slight effect, and attributed the lack of effects to the homogeneity of the participants&#x2019; proficiency level. <xref ref-type="bibr" rid="ref21">Fulcher and Reiter (2003)</xref> examined how social power and imposition as well as their interaction with learners&#x2019; L1 background affect test-takers&#x2019; pragmatic performance. Six role-play tasks representing six combinations of the two contextual variables were used to elicit L2 English learners&#x2019; realization of request. Significant effects were found for both contextual variables, the two-way interaction between social power and L1 background, and the three-way interaction between social power, imposition and L1 background. <xref ref-type="bibr" rid="ref46">Roever (2004)</xref> reviewed item difficulty in pragmatics tests including learners&#x2019; interpretation of routines, implicature and production of speech acts and identified degree of imposition as a source of speech act difficulty. The effect of degree of imposition on the difficulty of speech act performance was also evident in <xref ref-type="bibr" rid="ref55">Taguchi&#x2019;s (2007)</xref> study, in which she examined the effects of task difficulty on Japanese EFL learners&#x2019; oral production of requests and refusals. She operationalized task difficulty as two situation types, one with an equal power relationship, small social distance, and a small degree of imposition (PDR-low), and the other with greater power for the listener, large social distance, and a large degree of imposition (PDR-high). She reported that L2 learners produced speech acts significantly more easily and quickly in the PDR-low situation than in the PDR-high situation. In a study designed to evaluate the reliability of three test methods (WDCT, MDCT, and DST) for assessing the pragmatic knowledge of Chinese EFL learners, <xref ref-type="bibr" rid="ref37">Liu (2006)</xref> reported that the three methods were reasonably reliable, and that the apology subtest proved consistently more difficult than the request subtest across three test methods. However, compliment responses and refusals were found relatively easy while requests were more difficult for L2 Chinese learners in <xref ref-type="bibr" rid="ref35">Li et al. (2019)</xref>. <xref ref-type="bibr" rid="ref33">Krish and May (2020)</xref> identified interference of L1 cultural knowledge and linguistic rules in L2 Chinese learners&#x2019; pragmatic performance of five speech acts: compliments, requests, refusals, apologies, and complaints.</p>
<p>Taken together, these studies have provided evidence that pragmatic tasks involving different speech acts may have varying degrees of difficulty for L2 learners and that their relative difficulty may be affected by the learners&#x2019; L1 background and proficiency level, the assessment method used, and the contextual variables of power, social distance, and imposition. Meanwhile, it can also be seen that the range of speech acts and the range of combinations of different contextual variables that have been investigated in previous studies were both small, and the interaction between the contextual variables and speech acts has been underexamined. How learners&#x2019; native culture may influence their performance in pragmatics tests has barely been touched upon.</p>
</sec>
<sec id="sec4">
<title>Objectives</title>
<p>The current study contributes to the limited body of research in this area by examining the difficulty of oral production tasks involving different types of speech acts for Chinese English as foreign language (EFL) learners. In response to the call for broadening the range of pragmatic tasks and attending to the effects of relevant contextual variables in assessing task difficulty in pragmatics testing (<xref ref-type="bibr" rid="ref55">Taguchi, 2007</xref>; <xref ref-type="bibr" rid="ref62">Youn, 2019</xref>), we include eight speech acts and three contextual variables in designing the oral production tasks. It is our hope that our analysis will provide useful insight into the relative difficulty of oral production tasks involving different speech acts for Chinese EFL learners and the effects of the interaction between the contextual variables and speech act on task difficulty in L2 pragmatics tests. Informed by findings of previous studies, we explored these issues with a single assessment method and a group of learners from a single L1 background (i.e., Chinese EFL learners) representing diverse proficiency levels.</p>
</sec>
<sec id="sec5">
<title>Research questions</title>
<p>The present study explores the difficulty of oral speech act production tasks for Chinese EFL learners in L2 pragmatics testing by addressing the following research questions:</p>
<list list-type="order">
<list-item>
<p>What is the order of the difficulty estimates for oral speech act production tasks involving the speech acts of Apology, Disagreement, Thank, Request, Suggestion, Invitation, Offer and Refusal?</p>
</list-item>
<list-item>
<p>How do social distance, relative power, and imposition interact with speech act to affect the difficulty of oral speech act production tasks?</p>
</list-item>
</list>
</sec>
</sec>
<sec id="sec6" sec-type="methods">
<title>Methodology</title>
<sec id="sec7">
<title>Participants</title>
<p>Eighty Chinese EFL learners (24 male, 56 female) with an average age of 20.6 from three universities in south China responded to an open call to participate in the current study. The participants represented a range of disciplinary backgrounds, years in college, and language proficiency levels, with 35 first-and second-year non-English major undergraduate students from various arts and science disciplines, 40 first-and third-year English major undergraduate students, and five applied linguistics postgraduate students who majored in English in college. No participant had been abroad for over 1 month.</p>
</sec>
<sec id="sec8">
<title>Instruments</title>
<p>Given that our participants were all undergraduate and postgraduate students, we decided to test their pragmatic performance on speech acts commonly used in university settings. To this end, we identified 20 speech acts commonly discussed in the Interlanguage Pragmatics (ILP) literature and invited 28 L1 English American college students to rate the frequency of using each of them in their university life on a five-point scale. Based on their ratings, we included the following eight highest ranked speech acts in the current study: Apology, Disagreement, Thank, Request, Suggestion, Invitation, Offer, and Refusal.</p>
<p>We elicited the participants&#x2019; performance in producing target speech acts orally using Oral Discourse Completion Tasks (ODCTs). DCTs have been criticized for limited generalizability (<xref ref-type="bibr" rid="ref50">Roever, 2011</xref>), but ODCTs can measure online performance under time pressure (<xref ref-type="bibr" rid="ref46">Roever, 2004</xref>), which improves their authenticity and generalizability. To test the participants&#x2019; pragmatic ability to cope with different contexts, we incorporated different combinations of three contextual variables, i.e., relative power, social distance, and imposition in the ODCTs, with the values of these variables specified for each speech act production task. Relative power (P) refers to the power of the speaker with respect to the hearer (<xref ref-type="bibr" rid="ref14">Brown and Levinson, 1987</xref>), and P+, P&#x2212;, and P=&#x2009;denote the speaker has more, less, or equal power relative to the hearer, respectively, with more power defined as a higher rank, title, or social position or greater control of the assets in the situation. We excluded scenarios with the P+ feature in the current study as we limited the discourse context to the university setting, in which such scenarios were uncommon for our participants. Common scenarios with the P=&#x2009;feature included talking to classmates and roommates, and common scenarios with the P&#x2212; feature included talking to faculty and staff members. Social Distance (D) refers to the degree of familiarity and solidarity between the speaker and the hearer (<xref ref-type="bibr" rid="ref14">Brown and Levinson, 1987</xref>). D+ indicates that the speaker and hearer are unfamiliar with each other, and D-indicates that they are familiar with each other. Imposition (R) refers to the expenditure of goods and/or services by the hearer or the obligation of the speaker to perform an act (<xref ref-type="bibr" rid="ref14">Brown and Levinson, 1987</xref>). Given that the nature of this variable varies with different speech acts, we determined the value of this variable for each item in two steps. The speech events in the ODCTs were first ranked for imposition by two native speaker consultants through collaborative discussion. The rankings were then used to code the task items pertaining to the same speech act as either R+ (high imposition) or R&#x2212; (low imposition), depending on whether each item was ranked in the top or bottom half among the items for that speech act.</p>
<p>We initially developed eight ODCT items for each target speech act, each with a scenario reflecting a unique combination of the three contextual variables, as summarized in <xref rid="tab1" ref-type="table">Table 1</xref>. Each item was checked by two native speaker consultants for authenticity. The consultants recommended the removal of four items for Disagreement on the basis that they represented unrealistic scenarios. One consultant indicated that &#x201C;it&#x2019;s better to remain quiet if you do not agree in these cases.&#x201D; Therefore, only four items were retained for Disagreement (Item 1, 2, 3, 5). All other items were accepted by the consultants as authentic. The final test battery thus consisted of 60 ODCT items (see <xref ref-type="supplementary-material" rid="SM1">Appendix</xref>).</p>
<table-wrap position="float" id="tab1">
<label>Table 1</label>
<caption>
<p>Combinations of the three contextual variables represented by the eight ODCT items for each speech act.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th/>
<th align="center" valign="top">Item 1</th>
<th align="center" valign="top">Item 2</th>
<th align="center" valign="top">Item 3</th>
<th align="center" valign="top">Item 4</th>
<th align="center" valign="top">Item 5</th>
<th align="center" valign="top">Item 6</th>
<th align="center" valign="top">Item 7</th>
<th align="center" valign="top">Item 8</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">D</td>
<td align="center" valign="top">&#x2212;</td>
<td align="center" valign="top">+</td>
<td align="center" valign="top">&#x2212;</td>
<td align="center" valign="top">+</td>
<td align="center" valign="top">&#x2212;</td>
<td align="center" valign="top">+</td>
<td align="center" valign="top">&#x2212;</td>
<td align="center" valign="top">+</td>
</tr>
<tr>
<td align="left" valign="top">P</td>
<td align="center" valign="top">=</td>
<td align="center" valign="top">=</td>
<td align="center" valign="top">&#x2212;</td>
<td align="center" valign="top">&#x2212;</td>
<td align="center" valign="top">=</td>
<td align="center" valign="top">=</td>
<td align="center" valign="top">&#x2212;</td>
<td align="center" valign="top">&#x2212;</td>
</tr>
<tr>
<td align="left" valign="top">R</td>
<td align="center" valign="top">&#x2212;</td>
<td align="center" valign="top">&#x2212;</td>
<td align="center" valign="top">&#x2212;</td>
<td align="center" valign="top">&#x2212;</td>
<td align="center" valign="top">+</td>
<td align="center" valign="top">+</td>
<td align="center" valign="top">+</td>
<td align="center" valign="top">+</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>D, social distance; P, relative power; R, imposition.</p>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="sec9">
<title>Procedure</title>
<p>The pragmatics test was first piloted with five Chinese EFL learners enrolled in the same university who did not participate in the actual study. They all found the scenario descriptions clear, but two participants identified several words in the descriptions that caused some comprehension difficulties. We thus added Chinese glosses to those words to minimize potential comprehension problems. Based on the maximum time they took to complete each item, we set the time limit to 20&#x2009;s for the first 50 items and 50&#x2009;s for the last 10 items due to the extended length of these items.</p>
<p>The final test was administered to the 80 participants in a large language lab in 12 groups of six to seven, with ample space between any two participants to minimize interference from each other. At the beginning of each session, one researcher provided instructions in English, illustrated the scenario descriptions and the types of oral response expected with an example, and confirmed that all participants understood the instructions and requirements. The researcher then presented the scenario descriptions and their corresponding time limits using PowerPoint slides on a screen in the front of the lab one by one. There was a signal for the participants to stop speaking at the end of the time limit for each item, and the next slide was shown. The entire session lasted about 1 h for each group. Each participant&#x2019;s responses were recorded by the computer and then saved in a separate audio file for rating and further analysis.</p>
</sec>
<sec id="sec10">
<title>Data analysis</title>
<p>Each participant&#x2019;s oral response to each item was firstly transcribed and their written responses were independently rated for pragmatic appropriateness by two native speakers of American English, both of whom were experienced English instructors at the university. A holistic five-point scale was adopted from the five-level rating scale constructed to evaluate Chinese EFL&#x2019;s written speech act performance by <xref ref-type="bibr" rid="ref15">Chen and Liu (2016)</xref>. Inter-rater reliability, assessed using Spearman&#x2019;s rank correlation, reached 0.823 (<italic>p</italic>&#x2009;&#x003C;&#x2009;0.001). The final score of each response was the mean of the two scores, and the overall test score of each participant was the sum of the scores for all responses by that participant.</p>
<p>We subjected the scores of the 80 participants&#x2019; responses to the 60 ODCT items to a Many-facet Rasch Measurement (MFRM) analysis within Item Response Theory (<xref ref-type="bibr" rid="ref41">McNamara and Knoch, 2012</xref>) using the FACETS 3.71.3 (<xref ref-type="bibr" rid="ref36">Linacre, 2013</xref>) for the analyses, with participants, speech acts, and item types as facets to assess the difficulty of items for each speech act as well as items of each of the eight types representing a specific combination of the three contextual variables. We further performed a series of two-way ANOVAs, each with speech act and one of the three contextual variables as independent variables and participants&#x2019; response scores as the dependent variable, to examine the effects of the interaction between each contextual variable and speech act on the difficulty of oral speech act production tasks. Cohen&#x2019;s <italic>D</italic>, or standardized mean difference, was adopted as an effect size measure. Following <xref ref-type="bibr" rid="ref16">Cohen (1969)</xref>, we characterized effect sizes as small, medium, and large if the &#x03B7;<sub>p</sub><sup>2</sup> values were larger than 0.0099, 0.0588, and 0.1379, respectively.</p>
</sec>
</sec>
<sec id="sec11" sec-type="results">
<title>Results</title>
<sec id="sec12">
<title>Research question 1: Order of the difficulty estimates for tasks involving different speech tasks</title>
<p>The MFRM analysis placed the estimates of the three facets (i.e., participants, speech acts, and item types) on a single measurement scale, as shown in <xref rid="fig1" ref-type="fig">Figure 1</xref>. The range of the measurements was within two logits, likely due to the narrow range of the ILP competence of our participants. The average person measure was 0.16, with a standard deviation of 0.22. Only four misfitting persons were identified with Z scores larger than two.</p>
<fig position="float" id="fig1">
<label>Figure 1</label>
<caption>
<p>Results of the Many-facet Rasch Measurement analysis of participant performance.</p>
</caption>
<graphic xlink:href="fpsyg-14-1096399-g001.tif"/>
</fig>
<p>For the speech act measures, the mean measure was set at zero and the standard deviation was calculated to be 0.30. Thank and Request were found to be the easiest, followed by Suggestion, Disagreement, and Invitation. Refusal, Offer, and Apology were found to be the most difficult among the eight speech acts.</p>
<p>Facets also generates an overall estimate of the extent to which items are at reliably different levels of difficulty. The reliability of separation index denotes the reliability with which the items included in the analysis are separated (i.e., how different the item difficulty measures are), and the fixed chi-square test for the items tests the hypothesis that all items are of the same level of difficulty, after accounting for measurement error. The reliability of separation was reported as 0.90 [&#x03C7;<sup>2</sup>(7)&#x2009;=&#x2009;74.0, <italic>p</italic>&#x2009;=&#x2009;0.000], indicating significant differences among the test items in terms of difficulty.</p>
<p>For the item type measures, the mean measure was set at zero and the standard deviation was calculated to be 0.11, indicating a low range of difficulty. Item 3 (D&#x2212;, P&#x2212;, and R&#x2212;) was the easiest item type, followed by items 1 (D&#x2212;, P=, R&#x2212;) and 4 (D+, P&#x2212;, R&#x2212;). Item 5 (D&#x2212;, P=, R+) was the most difficult item type, followed by item 8 (D+, P&#x2212;, R+). These results suggest that items with lower imposition (R&#x2212;) tended to be easier than those with higher imposition (R+).</p>
<p>To sum up, the MFRM analysis results suggested that the eight speech acts can be ordered by ascending difficulty as follows: Thank, Request, Suggestion, Disagreement, Invitation, Refusal, Offer, and Apology. The results also suggested a potential effect of imposition on learners&#x2019; oral speech act production performance.</p>
</sec>
<sec id="sec13">
<title>Research question 2: Effects of the interaction between each of the three contextual variables and speech act on the difficulty of oral speech act production tasks</title>
<p>Three separate two-way ANOVAs were conducted to investigate the effects of the interaction between each contextual factor and speech act on the difficulty of oral speech act production tasks. The four items for Disagreement were excluded from these analyses because not all values for all three variables were represented among these items as a result of the removal of four Disagreement items. The Levene test indicated that the assumption of equal variance across groups was violated (<italic>p</italic>&#x2009;&#x003C;&#x2009;0.05). However, the ANOVA <italic>F</italic> test has been shown to be robust if the sample is large, the group sizes are equal, and the largest group standard deviation is not larger than twice the smallest group standard deviation (e.g., <xref ref-type="bibr" rid="ref1">Agresti et al., 2017</xref>). Given that our dataset met these criteria, we proceeded with the two-way ANOVAs followed by pairwise comparisons using the Tamhane&#x2019;s T2 <italic>post hoc</italic> test, which does not assume equal variances across groups.</p>
</sec>
<sec id="sec14">
<title>Social distance</title>
<p>As shown in <xref rid="tab2" ref-type="table">Table 2</xref>, the main effect of speech act was statistically significant with a large effect size [<italic>F</italic>(6,153)&#x2009;=&#x2009;68.243, <italic>p</italic>&#x2009;=&#x2009;0.000, &#x03B7;<sub>p</sub><sup>2</sup>&#x2009;=&#x2009;0.270], but the main effect of social distance was insignificant [<italic>F</italic>(1,158)&#x2009;=&#x2009;0.316, <italic>p</italic>&#x2009;=&#x2009;0.574, &#x03B7;<sub>p</sub><sup>2</sup>&#x2009;=&#x2009;000]. The interaction effect between the two factors was significant with a medium effect size [<italic>F</italic>(1,158)&#x2009;=&#x2009;12.127, <italic>p</italic>&#x2009;=&#x2009;0.000, &#x03B7;<sub>p</sub><sup>2</sup>&#x2009;=&#x2009;0.062]. Pairwise comparisons revealed that, compared to items with the D+ feature, those with the D-feature were significantly easier for Offer and Request but significantly harder for Suggestion and Thank. These results are also visualized in <xref rid="fig2" ref-type="fig">Figure 2</xref>.</p>
<table-wrap position="float" id="tab2">
<label>Table 2</label>
<caption>
<p>Comparison of mean task performance by speech act and social distance.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top" rowspan="2">Speech act</th>
<th align="center" valign="top" rowspan="2"><italic>N</italic></th>
<th align="center" valign="top" colspan="2">Mean/SD</th>
<th align="center" valign="top">Pairwise comparisons</th>
<th align="center" valign="top" colspan="5">Analysis of variance</th>
</tr>
<tr>
<th align="center" valign="top">D&#x2212;</th>
<th align="center" valign="top">D+</th>
<th align="center" valign="top"><italic>p</italic></th>
<th/>
<th align="center" valign="top">df</th>
<th align="center" valign="top"><italic>F</italic></th>
<th align="center" valign="top"><italic>p</italic></th>
<th align="center" valign="top">&#x03B7;<sub>p</sub><sup>2</sup></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Apology</td>
<td align="center" valign="top">80</td>
<td align="center" valign="top">2.788/0.63</td>
<td align="center" valign="top">2.728/0.73</td>
<td align="center" valign="top">0.581</td>
<td align="left" valign="top">Speech act</td>
<td align="center" valign="top">6</td>
<td align="center" valign="top">68.243</td>
<td align="center" valign="top">0.000</td>
<td align="center" valign="top">0.270</td>
</tr>
<tr>
<td align="left" valign="top">Invitation</td>
<td align="center" valign="top">80</td>
<td align="center" valign="top">3.024/0.51</td>
<td align="center" valign="top">3.155/0.53</td>
<td align="center" valign="top">0.114</td>
<td align="left" valign="top">Social distance</td>
<td align="center" valign="top">1</td>
<td align="center" valign="top">0.316</td>
<td align="center" valign="top">0.574</td>
<td align="center" valign="top">0.000</td>
</tr>
<tr>
<td align="left" valign="top">Offer</td>
<td align="center" valign="top">80</td>
<td align="center" valign="top">3.203/0.59</td>
<td align="center" valign="top">2.775/0.67</td>
<td align="center" valign="top">0.000</td>
<td align="left" valign="top">Interaction</td>
<td align="center" valign="top">6</td>
<td align="center" valign="top">12.127</td>
<td align="center" valign="top">0.000</td>
<td align="center" valign="top">0.062</td>
</tr>
<tr>
<td align="left" valign="top">Refusal</td>
<td align="center" valign="top">80</td>
<td align="center" valign="top">3.123/0.48</td>
<td align="center" valign="top">2.963/0.65</td>
<td align="center" valign="top">0.080</td>
<td colspan="5" rowspan="4"/>
</tr>
<tr>
<td align="left" valign="top">Request</td>
<td align="center" valign="top">80</td>
<td align="center" valign="top">3.798/0.57</td>
<td align="center" valign="top">3.444/0.50</td>
<td align="center" valign="top">0.000</td>
</tr>
<tr>
<td align="left" valign="top">Suggestion</td>
<td align="center" valign="top">80</td>
<td align="center" valign="top">3.188/0.49</td>
<td align="center" valign="top">3.580/0.54</td>
<td align="center" valign="top">0.000</td>
</tr>
<tr>
<td align="left" valign="top">Thank</td>
<td align="center" valign="top">80</td>
<td align="center" valign="top">3.662/0.57</td>
<td align="center" valign="top">4.003/0.61</td>
<td align="center" valign="top">0.000</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig position="float" id="fig2">
<label>Figure 2</label>
<caption>
<p>Profile plots for the interaction between speech act and social distance. Speech act codes: 1&#x2009;=&#x2009;Apology; 2&#x2009;=&#x2009;Invitation; 3&#x2009;=&#x2009;Offer; 4&#x2009;=&#x2009;Refusal; 5&#x2009;=&#x2009;Request; 6&#x2009;=&#x2009;Suggestion; 7&#x2009;=&#x2009;Thank. Social distance codes: 1&#x2009;=&#x2009;D&#x2212;; 2&#x2009;=&#x2009;D+.</p>
</caption>
<graphic xlink:href="fpsyg-14-1096399-g002.tif"/>
</fig>
</sec>
<sec id="sec15">
<title>Power</title>
<p>As shown in <xref rid="tab3" ref-type="table">Table 3</xref>, the main effect of speech act was statistically significant with a large effect size [<italic>F</italic>(6,153)&#x2009;=&#x2009;65.843, <italic>p</italic>&#x2009;=&#x2009;0.000, &#x03B7;<sub>p</sub><sup>2</sup>&#x2009;=&#x2009;0.263], but the main effect of power was insignificant [<italic>F</italic>(1,158)&#x2009;=&#x2009;1.986, <italic>p</italic>&#x2009;=&#x2009;0.159, &#x03B7;<sub>p</sub><sup>2</sup>&#x2009;=&#x2009;0.002]. The interaction effect between the two factors was significant with a medium effect size [<italic>F</italic>(1,158)&#x2009;=&#x2009;23.575, <italic>p</italic>&#x2009;=&#x2009;0.000, &#x03B7;<sub>p</sub><sup>2</sup>&#x2009;=&#x2009;0.113]. Pairwise comparisons revealed that, compared with items with the P=&#x2009;feature, those with the P-feature were significantly easier for Offer and Suggestion but significantly harder for Refusal. These results are also visualized in <xref rid="fig3" ref-type="fig">Figure 3</xref>.</p>
<table-wrap position="float" id="tab3">
<label>Table 3</label>
<caption>
<p>Comparison of mean task performance by speech act and power.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top" rowspan="2">Speech act</th>
<th align="center" valign="top" rowspan="2"><italic>N</italic></th>
<th align="center" valign="top" colspan="2">Mean/SD</th>
<th align="center" valign="top">Pairwise comparisons</th>
<th align="center" valign="top" colspan="5">Analysis of variance</th>
</tr>
<tr>
<th align="center" valign="top">P&#x2212;</th>
<th align="center" valign="top">P=</th>
<th align="center" valign="top"><italic>p</italic></th>
<th/>
<th align="center" valign="top">df</th>
<th align="center" valign="top"><italic>F</italic></th>
<th align="center" valign="top"><italic>p</italic></th>
<th align="center" valign="top">&#x03B7;<sub>p</sub><sup>2</sup></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Apology</td>
<td align="center" valign="top">80</td>
<td align="center" valign="top">2.753/0.66</td>
<td align="center" valign="top">2.752/0.72</td>
<td align="center" valign="top">0.991</td>
<td align="left" valign="top">Speech act</td>
<td align="center" valign="top">6</td>
<td align="center" valign="top">65.843</td>
<td align="center" valign="top">0.000</td>
<td align="center" valign="top">0.263</td>
</tr>
<tr>
<td align="left" valign="top">Invitation</td>
<td align="center" valign="top">80</td>
<td align="center" valign="top">3.125/0.59</td>
<td align="center" valign="top">3.056/0.53</td>
<td align="center" valign="top">0.434</td>
<td align="left" valign="top">Power</td>
<td align="center" valign="top">1</td>
<td align="center" valign="top">1.986</td>
<td align="center" valign="top">0.574</td>
<td align="center" valign="top">0.002</td>
</tr>
<tr>
<td align="left" valign="top">Offer</td>
<td align="center" valign="top">80</td>
<td align="center" valign="top">3.100/0.64</td>
<td align="center" valign="top">2.873/0.71</td>
<td align="center" valign="top">0.036</td>
<td align="left" valign="top">Interaction</td>
<td align="center" valign="top">6</td>
<td align="center" valign="top">23.575</td>
<td align="center" valign="top">0.000</td>
<td align="center" valign="top">0.113</td>
</tr>
<tr>
<td align="left" valign="top">Refusal</td>
<td align="center" valign="top">80</td>
<td align="center" valign="top">2.623/0.54</td>
<td align="center" valign="top">3.458/0.55</td>
<td align="center" valign="top">0.000</td>
<td colspan="5" rowspan="4"/>
</tr>
<tr>
<td align="left" valign="top">Request</td>
<td align="center" valign="top">80</td>
<td align="center" valign="top">3.630/0.52</td>
<td align="center" valign="top">3.614/0.58</td>
<td align="center" valign="top">0.852</td>
</tr>
<tr>
<td align="left" valign="top">Suggestion</td>
<td align="center" valign="top">80</td>
<td align="center" valign="top">3.735/0.54</td>
<td align="center" valign="top">3.027/0.52</td>
<td align="center" valign="top">0.000</td>
</tr>
<tr>
<td align="left" valign="top">Thank</td>
<td align="center" valign="top">80</td>
<td align="center" valign="top">3.917/0.61</td>
<td align="center" valign="top">3.751/0.58</td>
<td align="center" valign="top">0.081</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig position="float" id="fig3">
<label>Figure 3</label>
<caption>
<p>Profile plots for the interaction between speech act and power. Speech act codes: 1&#x2009;=&#x2009;Apology; 2&#x2009;=&#x2009;Invitation; 3&#x2009;=&#x2009;Offer; 4&#x2009;=&#x2009;Refusal; 5&#x2009;=&#x2009;Request; 6&#x2009;=&#x2009;Suggestion; 7&#x2009;=&#x2009;Thank. Power codes: 1&#x2009;=&#x2009;P&#x2212;; 2&#x2009;=&#x2009;<italic>p</italic>&#x2009;=&#x2009;.</p>
</caption>
<graphic xlink:href="fpsyg-14-1096399-g003.tif"/>
</fig>
</sec>
<sec id="sec16">
<title>Rank of imposition</title>
<p>As shown in <xref rid="tab4" ref-type="table">Table 4</xref>, the main effects of speech act [<italic>F</italic>(6,153)&#x2009;=&#x2009;63.918, <italic>p</italic>&#x2009;=&#x2009;0.000, &#x03B7;<sub>p</sub><sup>2</sup>&#x2009;=&#x2009;0.257] and Imposition [<italic>F</italic>(6,153)&#x2009;=&#x2009;39.300, <italic>p</italic>&#x2009;=&#x2009;0.000, &#x03B7;<sub>p</sub><sup>2</sup>&#x2009;=&#x2009;0.034] were both statistically significant, with large and small effect sizes, respectively. The interaction effect between the factors was also statistically significant with a medium effect size [<italic>F</italic>(6,153)&#x2009;=&#x2009;23.635, <italic>p</italic>&#x2009;=&#x2009;0.000, &#x03B7;<sub>p</sub><sup>2</sup>&#x2009;=&#x2009;0.114]. Pairwise comparisons revealed that, compared with items with the R+ feature, those with the R-feature were significantly easier for Offer, Request, and Suggestion but significantly harder for Refusal. These results are also visualized in <xref rid="fig4" ref-type="fig">Figure 4</xref>.</p>
<table-wrap position="float" id="tab4">
<label>Table 4</label>
<caption>
<p>Comparison of mean task performance by speech act and rank of imposition.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top" rowspan="2">Speech act</th>
<th align="center" valign="top" rowspan="2"><italic>N</italic></th>
<th align="center" valign="top" colspan="2">Mean/SD</th>
<th align="center" valign="top">Pairwise comparisons</th>
<th align="center" valign="top" colspan="5">Analysis of variance</th>
</tr>
<tr>
<th align="center" valign="top">R&#x2212;</th>
<th align="center" valign="top">R+</th>
<th align="center" valign="top"><italic>p</italic></th>
<th/>
<th align="center" valign="top">df</th>
<th align="center" valign="top"><italic>F</italic></th>
<th align="center" valign="top"><italic>p</italic></th>
<th align="center" valign="top">&#x03B7;<sub>p</sub><sup>2</sup></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Apology</td>
<td align="center" valign="top">80</td>
<td align="center" valign="top">2.827/0.84</td>
<td align="center" valign="top">2.679/0.57</td>
<td align="center" valign="top">0.193</td>
<td align="left" valign="top">Speech act</td>
<td align="center" valign="top">6</td>
<td align="center" valign="top">63.918</td>
<td align="center" valign="top">0.000</td>
<td align="center" valign="top">0.257</td>
</tr>
<tr>
<td align="left" valign="top">Invitation</td>
<td align="center" valign="top">80</td>
<td align="center" valign="top">3.079/0.69</td>
<td align="center" valign="top">3.106/0.53</td>
<td align="center" valign="top">0.763</td>
<td align="left" valign="top">Imposition</td>
<td align="center" valign="top">1</td>
<td align="center" valign="top">39.300</td>
<td align="center" valign="top">0.000</td>
<td align="center" valign="top">0.034</td>
</tr>
<tr>
<td align="left" valign="top">Offer</td>
<td align="center" valign="top">80</td>
<td align="center" valign="top">3.121/0.64</td>
<td align="center" valign="top">2.857/0.59</td>
<td align="center" valign="top">0.008</td>
<td align="left" valign="top">Interaction</td>
<td align="center" valign="top">6</td>
<td align="center" valign="top">23.635</td>
<td align="center" valign="top">0.000</td>
<td align="center" valign="top">0.114</td>
</tr>
<tr>
<td align="left" valign="top">Refusal</td>
<td align="center" valign="top">80</td>
<td align="center" valign="top">2.829/0.64</td>
<td align="center" valign="top">3.256/0.51</td>
<td align="center" valign="top">0.000</td>
<td colspan="5" rowspan="4"/>
</tr>
<tr>
<td align="left" valign="top">Request</td>
<td align="center" valign="top">80</td>
<td align="center" valign="top">3.822/0.50</td>
<td align="center" valign="top">3.422/0.57</td>
<td align="center" valign="top">0.000</td>
</tr>
<tr>
<td align="left" valign="top">Suggestion</td>
<td align="center" valign="top">80</td>
<td align="center" valign="top">3.928/0.53</td>
<td align="center" valign="top">2.835/0.53</td>
<td align="center" valign="top">0.000</td>
</tr>
<tr>
<td align="left" valign="top">Thank</td>
<td align="center" valign="top">80</td>
<td align="center" valign="top">3.894/0.69</td>
<td align="center" valign="top">3.762/0.58</td>
<td align="center" valign="top">0.195</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig position="float" id="fig4">
<label>Figure 4</label>
<caption>
<p>Profile plots for the interaction between speech act and rank of imposition. Speech act codes: 1&#x2009;=&#x2009;Apology; 2&#x2009;=&#x2009;Invitation; 3&#x2009;=&#x2009;Offer; 4&#x2009;=&#x2009;Refusal; 5&#x2009;=&#x2009;Request; 6&#x2009;=&#x2009;Suggestion; 7&#x2009;=&#x2009;Thank. Imposition codes: 1&#x2009;=&#x2009;R&#x2212;; 2&#x2009;=&#x2009;R+.</p>
</caption>
<graphic xlink:href="fpsyg-14-1096399-g004.tif"/>
</fig>
</sec>
</sec>
<sec id="sec17" sec-type="discussions">
<title>Discussion</title>
<p>ODCTs are a special type of oral assessment that elicit one-sided responses in hypothesized conversations. Following the suggestion by <xref ref-type="bibr" rid="ref21">Fulcher and Reiter (2003)</xref>, we included both pragmatic categories (i.e., the eight speech acts) and cultural factors (i.e., the combinations of the three social variables in different scenarios) in developing ODCT tasks in the current study. The analysis of the appropriateness ratings of our participants&#x2019; responses to the ODCT items revealed several substantive findings. First, the MFRM analysis showed that the eight speech acts investigated can be ranked in ascending order of difficulty for Chinese EFL learners as follows: Thank, Request, Suggestion, Disagreement, Invitation, Refusal, Offer, and Apology. Second, the two-way ANOVAs revealed significant main effects of speech act and rank of imposition (R), but not of power (P) and social distance (D). These analyses also revealed significant interaction effects between speech act and each of the three contextual variables, confirming the importance of including both pragmatic categories and cultural factors in ODCT task design (<xref ref-type="bibr" rid="ref21">Fulcher and Reiter, 2003</xref>). We discuss our findings on the relative difficulty of the tasks for different speech acts and the interaction effects between speech act and the three contextual variables below.</p>
<sec id="sec18">
<title>Difficulty of ODCTs for different speech acts</title>
<p>Previous findings on the relative difficulty of pragmatic tasks on different speech acts are limited and inconsistent. In testing learners&#x2019; pragmatic knowledge of three speech acts: apology, request, and refusal, <xref ref-type="bibr" rid="ref28">Hudson (2001)</xref> found that apologies were slightly easier than requests and refusals for Japanese ESL learners, which was echoed by Roever&#x2019;s pragmatics test of ESL/EFL learners with diverse language background (<xref ref-type="bibr" rid="ref46">Roever, 2004</xref>). Hudson accounted for this difference with the explanation that apologies tended to be more formulaic than the other two speech acts and attributed the absence of other difficulty differences to the homogeneity of the participants&#x2019; proficiency level. Using data from <xref ref-type="bibr" rid="ref2">Ahn (2005)</xref> on L1 English learners of Korean as a foreign language (KFL) at diverse proficiency levels, <xref ref-type="bibr" rid="ref12">Brown (2008)</xref> and <xref ref-type="bibr" rid="ref13">Brown and Ahn (2011)</xref> reported that the average ratings of apologies, requests, and refusals were comparable. <xref ref-type="bibr" rid="ref37">Liu (2006)</xref>, however, found apologies to be consistently more difficult across three test formats (MDCT, DSAT, and WDCT) than requests for Chinese EFL learners at diverse proficiency levels. The different findings pertaining to the difficulty of apologies relative to other speech acts on learners with different L1 backgrounds and the agreement between Liu&#x2019;s finding and our finding that apologies were harder than requests for Chinese EFL learners suggest a potential effect of the learners&#x2019; L1 cultural background on speech act production task difficulty. This conclusion aligns with the prediction that the culture-specific nature of pragmatic ability may give rise to unique challenges for learning L2 speech acts (<xref ref-type="bibr" rid="ref56">Taguchi, 2012</xref>). <xref ref-type="bibr" rid="ref63">Youn and Brown&#x2019;s (2013)</xref> finding that pragmatics test item difficulty remained consistent across two different studies by <xref ref-type="bibr" rid="ref2">Ahn (2005)</xref> and <xref ref-type="bibr" rid="ref60">Youn (2008)</xref> on two different groups of L1 English KFL learners also offers support for this conclusion, as it suggests more consistency of task difficulty among learners of the same L1 background.</p>
<p>Apology was found to be the most difficult speech act for Chinese EFL learners in the present study. A closer examination of the production data revealed that our participants had no difficulty in using the formulaic head act strategy (i.e., <italic>I&#x2019;m sorry</italic>), but many struggled with producing appropriate supporting moves. As illustrated in Example 1, many students followed <italic>I&#x2019;m sorry</italic> with an explanation that the cause was accidental, often with the structure &#x201C;didn&#x2019;t &#x2026; on purpose&#x201D;, likely translated from the Chinese expression <italic>b&#x00FA;sh&#x00EC; g&#x00F9;y&#x00EC; de</italic> (&#x4E0D;&#x662F;&#x6545;&#x610F;&#x7684;, &#x201C;didn&#x2019;t do it on purpose&#x201D;), which is commonly used in apologies in Chinese. This strategy, however, was not considered conventional by the L1 English raters.</p>
<table-wrap position="anchor" id="tab5">
<table frame="hsides" rules="groups">
<tbody>
<tr>
<td align="left" valign="top">(1)</td>
<td align="left" valign="top">a. I&#x2019;m sorry. I did it by accident.</td>
</tr>
<tr>
<td/>
<td align="left" valign="top">b. I&#x2019;m so sorry. I did not do it on purpose. I promise it will not happen again.</td>
</tr>
<tr>
<td/>
<td align="left" valign="top">c. I&#x2019;m so sorry. I did not knock over the cup on purpose.</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>In addition, some participants provided grounders that were considered by the L1 English raters to be too casual to the extent that they jeopardize the sincerity of the apology, as illustrated by Example 2:</p>
<table-wrap position="anchor" id="tab6">
<table frame="hsides" rules="groups">
<tbody>
<tr>
<td align="left" valign="top">(2)</td>
<td align="left" valign="top">a. Sorry, Miss May, I had something important to do just now. So I&#x2019;m coming late.</td>
</tr>
<tr>
<td/>
<td align="left" valign="top">b. Sorry, Miss May, I had something on the way. I&#x2019;m very sorry.</td>
</tr>
<tr>
<td/>
<td align="left" valign="top">c. Sorry, I have something urgent. Please forgive me.</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>These grounders also appeared to display an L1 transfer effect, as the expressions <italic>y&#x01D2;udi&#x01CE;n sh&#x00EC;</italic> (&#x6709;&#x70B9;&#x4E8B;, &#x201C;have something&#x201D;) and <italic>y&#x01D2;udi&#x01CE;n j&#x00ED;sh&#x00EC;</italic> (&#x6709;&#x70B9;&#x6025;&#x4E8B;, &#x201C;have something urgent&#x201D;) are commonly used excuses in apologies in Chinese. These examples support <xref ref-type="bibr" rid="ref9">Blum-Kulka&#x2019;s (1982)</xref> claim that L2 learners&#x2019; speech act production is often influenced by pragmatic transfer from their L1 and that negative transfer may result in pragmatic failures and cross-cultural communication breakdowns.</p>
<p>Offer was found to be the second most difficult speech act for Chinese EFL learners. Previous research on L2 learners&#x2019; realization of offers is scant. As offers have a directive nature in that they involve the speaker attempting to persuade the hearer to accept the offer in question, the use of head act strategies for offers resembles that for requests. However, a major difference between offers and requests is that offers presumably benefit the hearer while requests impose on the hearer. As such, the use of direct strategies may be considered more acceptable for offers than for requests, which is also the case in Chinese. Additionally, it has been noted that in some cultures, Chinese included, an offer is not considered sincere until it has been reiterated (<xref ref-type="bibr" rid="ref7">Barron, 2003</xref>). As noted by the L1 English raters, the participants&#x2019; offers received low ratings primarily because they sometimes sounded overly direct and eager to help to the extent that the hearer might feel being imposed on. In Example 3, one participant offered to help a sick classmate with the use of <italic>must</italic>, which the raters felt was overly strong.</p>
<table-wrap position="anchor" id="tab7">
<table frame="hsides" rules="groups">
<tbody>
<tr>
<td align="left" valign="top">(3)</td>
<td align="left" valign="top">You are sick. I must take you to the hospital.</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Refusals were found to be the third most difficult among the eight speech acts. As a typical face-threatening speech act (<xref ref-type="bibr" rid="ref14">Brown and Levinson, 1987</xref>), refusals have been recognized as a major cross-cultural obstacle (<xref ref-type="bibr" rid="ref3">Babai Shishavan and Sharifian, 2016</xref>). <xref ref-type="bibr" rid="ref18">Ekiert et al. (2018)</xref> reported that advanced L1 Japanese and Spanish ESL learners achieved comparable pragmatic appropriacy for refusals, complaints, and advice, but lower proficiency ESL learners with those L1 backgrounds achieved lower pragmatic appropriacy for refusals than for complaints and advice. Our results showed that refusals were harder than suggestions for Chinese EFL learners. Refusal was again found more difficult than most speech acts in the present study. Previous research found that grounder and regret strategies are the most frequently used for refusals by Greek foreign language learners (<xref ref-type="bibr" rid="ref8">Bella, 2014</xref>) as well as by Chinese learners of English in both at-home and study aboard contexts (<xref ref-type="bibr" rid="ref43">Ren, 2015</xref>). A close analysis of the participants&#x2019; production data indicated that they relied heavily on expressions of gratitude but rarely used empathetic or positive statements, as illustrated in the participant&#x2019;s response to the item on refusing a chance to take part in a speech contest in Example 4. One L1 English rater commented that a positive statement before the refusal (e.g., <italic>I know the speech contest is a great opportunity for me to practice my English</italic>, <italic>but&#x2026;</italic>) would improve its pragmatic appropriacy.</p>
<table-wrap position="anchor" id="tab8">
<table frame="hsides" rules="groups">
<tbody>
<tr>
<td align="left" valign="top">(4)</td>
<td align="left" valign="top">I&#x2019;m sorry. I do not think I can take part in it. Thank you for your trust.</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Request, Suggestion, Disagreement, and Invitation were found to be relatively easier, and Thank was found to be the easiest speech act. The participants demonstrated good familiarity with the pragmatic formulas associated with these speech acts, and they used the most formulaic expressions for Thank among all speech acts. The higher frequency of use of these speech acts in the university setting in general and in the language classroom in particular may have also contributed to the lower difficulty of these speech acts.</p>
</sec>
<sec id="sec19">
<title>The interaction effects between speech act and The three contextual variables</title>
<p>The difficulty of the ODCT items was found to be affected by the interaction between speech act and each of the three contextual variables. This finding is consistent with <xref ref-type="bibr" rid="ref55">Taguchi&#x2019;s (2007)</xref> finding that social factors may make certain types of situations for pragmatic tasks more demanding than others. The finding also supports <xref ref-type="bibr" rid="ref21">Fulcher and Reiter&#x2019;s (2003)</xref> claim that different contextual variables may have distinct effects on particular speech acts.</p>
<p>Social distance exhibited different effects on different speech acts. Compared with items with the D+ feature, items with the D-feature were significantly easier for Offer and Request but significantly harder than Suggestion and Thank. These results indicate that the participants produced more appropriate offers and requests to familiar hearers but more appropriate suggestions and thanks to unfamiliar hearers. A close analysis of the learner production data suggested that the participants tended to use similar types of formulaic strategies for items with D+ and D&#x2212; features. For example, they frequently used &#x201C;Would you like to &#x2026;&#x201D; for Suggestion and &#x201C;Thank you very much&#x201D; for Thank, which were considered more appropriate for unfamiliar hearers (D+) but sometimes overly polite for very familiar peers (D&#x2212;). <xref ref-type="bibr" rid="ref34">Li (2010)</xref>, for example, indicated that native Australian students tended to use ability statements such as &#x201C;You can&#x201D; to realize suggestions in D-scenarios.</p>
<p>With respect to power, items with the P-feature were significantly easier for Offer and Suggestion, while items with the P=&#x2009;feature were significantly easier for Refusal. These results indicate that the participants produced more appropriate offers and suggestions to hears with more power but more appropriate refusals to hears with equal power. These results may not be surprising, as they align with the common understanding that it is easier to make an offer to than to refuse someone with more power in the university setting (e.g., a teacher) in the Chinese culture. Overall, our participants demonstrated some struggle with consistently deploying politeness strategies appropriate for these speech acts to hearers with different power status, sometimes showing negative pragmatic transfer from Chinese. For example, they tended to extend offers to teachers using polite, indirect forms and to their peers using highly direct forms (e.g., <italic>Come to dinner with me</italic>). While such direct strategies for making offers to peers are commonly used to show sincerity and hospitality or to preserve the speaker&#x2019;s positive face in the Chinese culture, they may sound intruding in western cultures where the hearer prefers to be left alone (<xref ref-type="bibr" rid="ref27">Gu, 1990</xref>; <xref ref-type="bibr" rid="ref39">Mao, 1994</xref>).</p>
<p>Imposition was the only contextual variable that showed a significant main effect, with items with the R+ feature showing a higher level of difficulty than those with the R&#x2212; feature overall. <xref ref-type="bibr" rid="ref28">Hudson (2001)</xref> and <xref ref-type="bibr" rid="ref37">Liu (2006</xref>, <xref ref-type="bibr" rid="ref38">2007)</xref> also reported that R+ items received lower scores than R&#x2212; items across multiple test methods, although they did not examine the interaction between speech act and imposition. Our analysis showed that, compared to R&#x2212; items, R+ items were significantly harder for Offer, Request, and Suggestion, significantly easier for Refusal, and comparably difficult for other speech acts. While these findings are not necessarily surprising (e.g., as the degree of imposition increases, requests become harder while refusals become easier), they nonetheless provide evidence for the need and usefulness to look at the interaction effect between speech act and individual situational variables.</p>
</sec>
<sec id="sec20">
<title>Limitations</title>
<p>The current study has several limitations that can be addressed in future research. First, while we included participants with diverse levels of English proficiency in the study to have a heterogenous sample, we did not systematically examine the effect of proficiency on the difficulty of speech act production tasks, a topic that can be useful to investigate in future research. Second, our analysis focused on the appropriateness ratings of the participants&#x2019; responses only, and it may be useful for future research to consider learners&#x2019; perceptions of task difficulty and to qualitatively explore the reasons why learners see certain speech acts and contextual variable combinations are more difficulty than others. Third, we employed two raters in the current study only, and greater reliability in the judgments of language learners&#x2019; pragmatic performance could be achieved by using a larger pool of raters. Fourth, a certain degree of interference existed in the data collection phase as oral samples of a group of participants were elicited simultaneously in a language lab, which can be avoided by applying headphones or collecting data separately. Finally, given that the difficulty of oral speech act production tasks may vary by L1 cultural background, the order of relative difficulty established in the current study for the eight speech acts may not be directly applicable to English learners of other L1 backgrounds. Future research can investigate how the order of relative difficulty may vary by L1 background by including participants from diverse L1 backgrounds.</p>
</sec>
</sec>
<sec id="sec21" sec-type="conclusions">
<title>Conclusion</title>
<p>This study examined the relative difficulty of oral speech act production tasks involving eight types of speech acts for Chinese EFL learners and the effects of three situational variables, namely, power, social distance, and imposition, on such difficulty. A Many-facet Rasch Measurement analysis suggested that the eight speech acts can be ordered by ascending difficulty as follows: Thank, Request, Suggestion, Disagreement, Invitation, Refusal, Offer, and Apology. Significant effects on performance scores were found for the interaction between each of the three contextual variables and speech act, and the specific effects observed varied by speech act. Learner responses also reflected influences of their L1 cultural background. Our findings on the relative difficulty of oral production tasks involving different speech acts and the effects of relevant situational variables on such difficulty have useful implications for L2 pragmatics test design.</p>
<p>Our findings have useful implications for L2 pragmatics testing. Given that different speech act types are not equally difficult to EFL learners, it is important to not generalize results from testing the realization of a particular speech act or a small set of speech acts to the learners&#x2019; pragmatic ability in performing other speech acts. Furthermore, given the effects of the situational variables on the task difficulty for different speech acts, it is critical to test learners&#x2019; speech act production with different combinations of contextual variables. Finally, the evaluation of task difficulty in L2 pragmatics assessment need to take learners&#x2019; L1 background into account.</p>
<p>Our findings also have useful implications for L2 pragmatics pedagogy in the Chinese EFL context. From a task-based language teaching perspective, as advocated by <xref ref-type="bibr" rid="ref57">Taguchi and Kim (2018)</xref>, the relative difficulty of tasks provides highly useful information for task selection and task sequencing in teaching L2 pragmatics. The rank of difficulty estimates of the pragmatic tasks for different speech acts observed in the present study can be used to inform the order in which the speech acts are introduced and the allocation of classroom time to different speech acts in L2 pragmatics pedagogy. Our findings regarding the effects of the three contextual factors on the task difficulty for different speech acts can be used to inform the design of different situation types in teaching speech acts. Our findings further showed the need to help Chinese EFL learners become more sensitive to different situation types and to avoid negative L1 transfer in their choices of speech act strategies. To this end, it will be especially helpful to deploy learning activities designed to help learners become more aware of the pragmatic appropriacy of different speech act strategies in different situation types as well as differences between the pragmatic appropriacy of different speech act realizations in the learners&#x2019; L1 and the target language.</p>
</sec>
<sec id="sec22" sec-type="data-availability">
<title>Data availability statement</title>
<p>The original contributions presented in the study are included in the article/<xref ref-type="supplementary-material" rid="SM1">Supplementary material</xref>, further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="sec23">
<title>Author contributions</title>
<p>All authors listed have made a substantial, direct, and intellectual contribution to the work and approved it for publication.</p>
</sec>
<sec id="conf1" sec-type="COI-statement">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="sec100" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="sec25" sec-type="supplementary-material">
<title>Supplementary material</title>
<p>The Supplementary material for this article can be found online at: <ext-link xlink:href="https://www.frontiersin.org/articles/10.3389/fpsyg.2023.1096399/full#supplementary-material" ext-link-type="uri">https://www.frontiersin.org/articles/10.3389/fpsyg.2023.1096399/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table_1.DOCX" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
</body>
<back>
<ref-list>
<title>References</title>
<ref id="ref1"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Agresti</surname> <given-names>A.</given-names></name> <name><surname>Franklin</surname> <given-names>C. A.</given-names></name> <name><surname>Klingenberg</surname> <given-names>B.</given-names></name></person-group> (<year>2017</year>). <source><italic>Statistics</italic>: <italic>The art and science of learning from data</italic></source>. <publisher-loc>Boston</publisher-loc>: <publisher-name>Pearson</publisher-name>.</citation></ref>
<ref id="ref2"><citation citation-type="other"><person-group person-group-type="author"><name><surname>Ahn</surname> <given-names>R. C.</given-names></name></person-group> (<year>2005</year>). Five measures of interlanguage pragmatics in KFL (Korean as a foreign language) learners. PhD thesis. University of Hawai&#x2019;i at Manoa, USA.</citation></ref>
<ref id="ref3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Babai Shishavan</surname> <given-names>H.</given-names></name> <name><surname>Sharifian</surname> <given-names>F.</given-names></name></person-group> (<year>2016</year>). <article-title>The refusal speech act in a cross-cultural perspective: a study of Iranian English-language learners and Anglo-Australian speakers</article-title>. <source>Lang. Commun.</source> <volume>47</volume>, <fpage>75</fpage>&#x2013;<lpage>88</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.langcom.2016.01.001</pub-id></citation></ref>
<ref id="ref4"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Bachman</surname> <given-names>L. F.</given-names></name></person-group> (<year>1990</year>). <source>Fundamental considerations in language testing</source>. <publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>.</citation></ref>
<ref id="ref5"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Bachman</surname> <given-names>L. F.</given-names></name> <name><surname>Palmer</surname> <given-names>A. S.</given-names></name></person-group> (<year>1996</year>). <source>Language testing in practice: Designing and developing useful language tests</source> (Vol. <volume>1</volume>). <publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>.</citation></ref>
<ref id="ref6"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Bachman</surname> <given-names>L. F.</given-names></name> <name><surname>Palmer</surname> <given-names>A. S.</given-names></name></person-group> (<year>2010</year>). <source>Language assessment in practice</source>. <publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>.</citation></ref>
<ref id="ref7"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Barron</surname> <given-names>A.</given-names></name></person-group> (<year>2003</year>). <source>Acquisition in interlanguage pragmatics: Learning how to do things with words in a study abroad context</source>. <publisher-loc>Amsterdam</publisher-loc>: <publisher-name>John Benjamins</publisher-name>.</citation></ref>
<ref id="ref8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bella</surname> <given-names>S.</given-names></name></person-group> (<year>2014</year>). <article-title>Developing the ability to refuse: a cross-sectional study of Greek FL refusals</article-title>. <source>J. Pragmat.</source> <volume>61</volume>, <fpage>35</fpage>&#x2013;<lpage>62</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.pragma.2013.11.015</pub-id></citation></ref>
<ref id="ref9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Blum-Kulka</surname> <given-names>S.</given-names></name></person-group> (<year>1982</year>). <article-title>Learning to say what you mean in a second language: a study of the speech act performance of learners of Hebrew as a second language</article-title>. <source>Appl. Linguis.</source> <volume>3</volume>, <fpage>29</fpage>&#x2013;<lpage>59</lpage>. doi: <pub-id pub-id-type="doi">10.1093/applin/3.1.29</pub-id></citation></ref>
<ref id="ref10"><citation citation-type="book"><person-group person-group-type="editor"><name><surname>Blum-Kulka</surname> <given-names>S.</given-names></name> <name><surname>House</surname> <given-names>J.</given-names></name> <name><surname>Kasper</surname> <given-names>G.</given-names></name></person-group> (Eds.). (<year>1989</year>). <source>Cross-cultural pragmatics: Requests and apologies</source>. <publisher-loc>Norwood, NJ</publisher-loc>: <publisher-name>Ablex Publishing Corporation</publisher-name>.</citation></ref>
<ref id="ref11"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Brown</surname> <given-names>J. D.</given-names></name></person-group> (<year>2001</year>). &#x201C;<article-title>Six types of pragmatics tests in two different contexts</article-title>,&#x201D; in <source>Pragmatics in language teaching</source>. eds. <person-group person-group-type="editor"><name><surname>Kasper</surname> <given-names>G.</given-names></name> <name><surname>Rose</surname> <given-names>K.</given-names></name></person-group> (<publisher-loc>New York</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>), <fpage>301</fpage>&#x2013;<lpage>325</lpage>.</citation></ref>
<ref id="ref12"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Brown</surname> <given-names>J. D.</given-names></name></person-group> (<year>2008</year>). &#x201C;<article-title>Raters, functions, item types and dependability of L2 pragmatics tests</article-title>,&#x201D; in <source>Investigating pragmatics in foreign language learning, teaching and testing</source>. eds. <person-group person-group-type="editor"><name><surname>Soler</surname> <given-names>E. A.</given-names></name> <name><surname>Martinez-Flor</surname> <given-names>A.</given-names></name></person-group> (<publisher-loc>Clevedon</publisher-loc>: <publisher-name>Multilingual Matters</publisher-name>), <fpage>224</fpage>&#x2013;<lpage>248</lpage>.</citation></ref>
<ref id="ref13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brown</surname> <given-names>J. D.</given-names></name> <name><surname>Ahn</surname> <given-names>R. C.</given-names></name></person-group> (<year>2011</year>). <article-title>Variables that affect the dependability of L2 pragmatics tests</article-title>. <source>J. Pragmat.</source> <volume>43</volume>, <fpage>198</fpage>&#x2013;<lpage>217</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.pragma.2010.07.026</pub-id></citation></ref>
<ref id="ref14"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Brown</surname> <given-names>P.</given-names></name> <name><surname>Levinson</surname> <given-names>S. C.</given-names></name></person-group> (<year>1987</year>). <source>Politeness: Some universals in language usage</source>. <publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>.</citation></ref>
<ref id="ref15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>Y.</given-names></name> <name><surname>Liu</surname> <given-names>J.</given-names></name></person-group> (<year>2016</year>). <article-title>Constructing a scale to assess L2 written speech act performance: WDCT and E-mail tasks</article-title>. <source>Lang. Assess. Q.</source> <volume>13</volume>, <fpage>231</fpage>&#x2013;<lpage>250</lpage>. doi: <pub-id pub-id-type="doi">10.1080/15434303.2016.1213844</pub-id></citation></ref>
<ref id="ref16"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Cohen</surname> <given-names>J.</given-names></name></person-group> (<year>1969</year>). <source>Statistical power analysis for the behavioral sciences</source>. <publisher-loc>New York</publisher-loc>: <publisher-name>Academic Press</publisher-name>.</citation></ref>
<ref id="ref17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cohen</surname> <given-names>A. D.</given-names></name> <name><surname>Olshtain</surname> <given-names>E.</given-names></name></person-group> (<year>1981</year>). <article-title>Developing a measure of sociocultural competence: the case of apology</article-title>. <source>Lang. Learn.</source> <volume>31</volume>, <fpage>113</fpage>&#x2013;<lpage>134</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.1467-1770.1981.tb01375.x</pub-id></citation></ref>
<ref id="ref18"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Ekiert</surname> <given-names>M.</given-names></name> <name><surname>Lampropoulou</surname> <given-names>S.</given-names></name> <name><surname>R&#x00E9;v&#x00E9;sz</surname> <given-names>A.</given-names></name> <name><surname>Torgersen</surname> <given-names>E.</given-names></name></person-group> (<year>2018</year>). &#x201C;<article-title>The effects of task type and L2 proficiency on discourse appropriacy in oral task performance</article-title>,&#x201D; in <source>Task-based approaches to teaching and assessing pragmatics</source>. eds. <person-group person-group-type="editor"><name><surname>Taguchi</surname> <given-names>N.</given-names></name> <name><surname>Kim</surname> <given-names>Y.</given-names></name></person-group> (<publisher-loc>John Benjamins</publisher-loc>: <publisher-name>Amsterdam/New York</publisher-name>), <fpage>247</fpage>&#x2013;<lpage>264</lpage>.</citation></ref>
<ref id="ref19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Elder</surname> <given-names>C.</given-names></name> <name><surname>Iwashita</surname> <given-names>N.</given-names></name> <name><surname>McNamara</surname> <given-names>T.</given-names></name></person-group> (<year>2002</year>). <article-title>Estimating the difficulty of oral proficiency tasks: what does the test-taker have to offer?</article-title> <source>Lang. Test.</source> <volume>19</volume>, <fpage>347</fpage>&#x2013;<lpage>368</lpage>. doi: <pub-id pub-id-type="doi">10.1191/0265532202lt235oa</pub-id></citation></ref>
<ref id="ref20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fulcher</surname> <given-names>G.</given-names></name></person-group> (<year>1996</year>). <article-title>Testing tasks: issues in task design and the group oral</article-title>. <source>Lang. Test.</source> <volume>13</volume>, <fpage>23</fpage>&#x2013;<lpage>51</lpage>. doi: <pub-id pub-id-type="doi">10.1177/026553229601300103</pub-id></citation></ref>
<ref id="ref21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fulcher</surname> <given-names>G.</given-names></name> <name><surname>Reiter</surname> <given-names>R. M.</given-names></name></person-group> (<year>2003</year>). <article-title>Task difficulty in speaking tests</article-title>. <source>Lang. Test.</source> <volume>20</volume>, <fpage>321</fpage>&#x2013;<lpage>344</lpage>. doi: <pub-id pub-id-type="doi">10.1191/0265532203lt259oa</pub-id></citation></ref>
<ref id="ref22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Galaczi</surname> <given-names>E.</given-names></name> <name><surname>Taylor</surname> <given-names>L.</given-names></name></person-group> (<year>2018</year>). <article-title>Interactional competence: Conceptualisations, Operationalisations, and outstanding questions</article-title>. <source>Lang. Assess. Q.</source> <volume>15</volume>, <fpage>219</fpage>&#x2013;<lpage>236</lpage>. doi: <pub-id pub-id-type="doi">10.1080/15434303.2018.1453816</pub-id></citation></ref>
<ref id="ref23"><citation citation-type="book"><person-group person-group-type="editor"><name><surname>Gass</surname> <given-names>S. M.</given-names></name> <name><surname>Neu</surname> <given-names>J.</given-names></name></person-group> (Eds.) (<year>1996</year>). <source>Speech acts across cultures: Challenges to communication in a second language</source>. <publisher-loc>Berlin</publisher-loc>: <publisher-name>Mouton de Gruyter</publisher-name>.</citation></ref>
<ref id="ref24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Golato</surname> <given-names>A.</given-names></name></person-group> (<year>2003</year>). <article-title>Studying compliment responses: a comparison of DCTs and recordings of naturally occurring talk</article-title>. <source>Appl. Linguis.</source> <volume>24</volume>, <fpage>90</fpage>&#x2013;<lpage>121</lpage>. doi: <pub-id pub-id-type="doi">10.1093/applin/24.1.90</pub-id></citation></ref>
<ref id="ref25"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Grabowski</surname> <given-names>K. C.</given-names></name></person-group> (<year>2009</year>). <source>Investigating the construct validity of a test designed to measure grammatical and pragmatic knowledge in the context of speaking (unpublished dissertation)</source>. <publisher-name>Columbia University</publisher-name>, <publisher-loc>New York</publisher-loc>.</citation></ref>
<ref id="ref26"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Grabowski</surname> <given-names>K.</given-names></name></person-group> (<year>2013</year>). &#x201C;<article-title>Investigating the construct validity of a role-play test designed to measure grammatical and pragmatic knowledge at multiple proficiency levels</article-title>,&#x201D; in <source>Assessing second language pragmatics</source>. eds. <person-group person-group-type="editor"><name><surname>Ross</surname> <given-names>S.</given-names></name> <name><surname>Kasper</surname> <given-names>G.</given-names></name></person-group> (<publisher-loc>New York</publisher-loc>: <publisher-name>Palgrave Macmillan</publisher-name>), <fpage>149</fpage>&#x2013;<lpage>171</lpage>.</citation></ref>
<ref id="ref27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gu</surname> <given-names>Y.</given-names></name></person-group> (<year>1990</year>). <article-title>Politeness phenomena in modern Chinese</article-title>. <source>J. Pragmat.</source> <volume>14</volume>, <fpage>237</fpage>&#x2013;<lpage>257</lpage>. doi: <pub-id pub-id-type="doi">10.1016/0378-2166(90)90082-O</pub-id></citation></ref>
<ref id="ref28"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Hudson</surname> <given-names>T.</given-names></name></person-group> (<year>2001</year>). &#x201C;<article-title>Indicators for pragmatics instruction: some quantitative tools</article-title>,&#x201D; in <source>Pragmatics in language teaching</source>. eds. <person-group person-group-type="editor"><name><surname>Kasper</surname> <given-names>G.</given-names></name> <name><surname>Rose</surname> <given-names>K.</given-names></name></person-group> (<publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge uniersity press</publisher-name>)</citation></ref>
<ref id="ref29"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Hudson</surname> <given-names>T.</given-names></name> <name><surname>Detmer</surname> <given-names>E.</given-names></name> <name><surname>Brown</surname> <given-names>J. D.</given-names></name></person-group> (<year>1992</year>). <source>A framework for testing cross-cultural pragmatics</source> (Vol. <volume>2</volume>). <publisher-loc>Honolulu</publisher-loc>: <publisher-name>University of Hawaii, Second Language Teaching and Curriculum Center</publisher-name>.</citation></ref>
<ref id="ref30"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Hudson</surname> <given-names>T.</given-names></name> <name><surname>Detmer</surname> <given-names>E.</given-names></name> <name><surname>Brown</surname> <given-names>J. D.</given-names></name></person-group> (<year>1995</year>). <source>Developing prototypic measures of cross-cultural pragmatics</source> (Vol. <volume>7</volume>). <publisher-loc>Honolulu</publisher-loc>: <publisher-name>University of Hawaii, Second Language Teaching and Curriculum Center</publisher-name>.</citation></ref>
<ref id="ref31"><citation citation-type="other"><person-group person-group-type="author"><name><surname>Ikeda</surname> <given-names>N.</given-names></name></person-group> (<year>2017</year>). Measuring L2 oral pragmatic abilities for use in social contexts: Development and validation of an assessment instrument for L2 pragmatics performance in university settings. Unpublished PhD thesis. University of Melbourne, Australia.</citation></ref>
<ref id="ref32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Iwashita</surname> <given-names>N.</given-names></name> <name><surname>McNamara</surname> <given-names>T.</given-names></name> <name><surname>Catherine</surname> <given-names>E.</given-names></name></person-group> (<year>2001</year>). <article-title>Can we predict task difficulty in an oral proficiency test? Exploring the potential of an information-processing approach to task design</article-title>. <source>Lang. Learn.</source> <volume>51</volume>, <fpage>401</fpage>&#x2013;<lpage>436</lpage>. doi: <pub-id pub-id-type="doi">10.1111/0023-8333.00160</pub-id></citation></ref>
<ref id="ref33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krish</surname> <given-names>P.</given-names></name> <name><surname>May</surname> <given-names>O. C.</given-names></name></person-group> (<year>2020</year>). <article-title>A case study of L1 interference in speech acts among Chinese L2 students. <italic>3L</italic></article-title>. <source>Lang. Linguist. Literature</source> <volume>26</volume>, <fpage>106</fpage>&#x2013;<lpage>118</lpage>. doi: <pub-id pub-id-type="doi">10.17576/3L-2020-2601-08</pub-id></citation></ref>
<ref id="ref34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>E. S.</given-names></name></person-group> (<year>2010</year>). <article-title>Making suggestions: a contrastive study of young Hong Kong and Australian students</article-title>. <source>J. Pragmat.</source> <volume>42</volume>, <fpage>598</fpage>&#x2013;<lpage>616</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.pragma.2009.07.014</pub-id></citation></ref>
<ref id="ref35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>S.</given-names></name> <name><surname>Taguchi</surname> <given-names>N.</given-names></name> <name><surname>Xiao</surname> <given-names>F.</given-names></name></person-group> (<year>2019</year>). <article-title>Variations in rating scale functioning in assessing speech act production in L2 Chinese</article-title>. <source>Lang. Assess. Q.</source> <volume>16</volume>, <fpage>271</fpage>&#x2013;<lpage>293</lpage>. doi: <pub-id pub-id-type="doi">10.1080/15434303.2019.1648473</pub-id></citation></ref>
<ref id="ref36"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Linacre</surname> <given-names>M.</given-names></name></person-group> (<year>2013</year>). <source>A user&#x2019;s guide to FACETS Rasch-model computer programs (Version 3.71.0)</source>. <publisher-loc>Chicago, IL</publisher-loc>. <publisher-name>Winsteps.Com</publisher-name>.</citation></ref>
<ref id="ref37"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>J.</given-names></name></person-group> (<year>2006</year>). <source>Measuring interlanguage pragmatic knowledge of EFL learners</source>. <publisher-loc>Frankfurt am Main</publisher-loc>: <publisher-name>Peter Lang</publisher-name>.</citation></ref>
<ref id="ref38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>J.</given-names></name></person-group> (<year>2007</year>). <article-title>Developing a pragmatics test for Chinese EFL learners</article-title>. <source>Lang. Test.</source> <volume>24</volume>, <fpage>391</fpage>&#x2013;<lpage>415</lpage>. doi: <pub-id pub-id-type="doi">10.1177/0265532207077206</pub-id></citation></ref>
<ref id="ref39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mao</surname> <given-names>L. R.</given-names></name></person-group> (<year>1994</year>). <article-title>Beyond politeness theory: &#x2018;face&#x2019; revisited and renewed</article-title>. <source>J. Pragmat.</source> <volume>21</volume>, <fpage>451</fpage>&#x2013;<lpage>486</lpage>. doi: <pub-id pub-id-type="doi">10.1016/0378-2166(94)90025-6</pub-id></citation></ref>
<ref id="ref40"><citation citation-type="book"><person-group person-group-type="editor"><name><surname>Mart&#x00ED;nez-Flor</surname> <given-names>A.</given-names></name> <name><surname>Us&#x00F3;-Juan</surname> <given-names>E.</given-names></name></person-group> (Eds.). (<year>2010</year>). <source>Speech act performance: Theoretical, empirical, and methodological issues</source>. <publisher-loc>Amsterdam</publisher-loc>: <publisher-name>John Benjamins</publisher-name>.</citation></ref>
<ref id="ref41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McNamara</surname> <given-names>T.</given-names></name> <name><surname>Knoch</surname> <given-names>U.</given-names></name></person-group> (<year>2012</year>). <article-title>The Rasch wars: the emergence of Rasch measurement in language testing</article-title>. <source>Lang. Test.</source> <volume>29</volume>, <fpage>555</fpage>&#x2013;<lpage>4576</lpage>. doi: <pub-id pub-id-type="doi">10.1177/0265532211430367</pub-id></citation></ref>
<ref id="ref42"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Purpura</surname> <given-names>J. E.</given-names></name></person-group> (<year>2004</year>). <source>Assessing grammar</source>. <publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>.</citation></ref>
<ref id="ref43"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Ren</surname> <given-names>W.</given-names></name></person-group> (<year>2015</year>). <source>L2 pragmatic development in study abroad contexts</source>. <publisher-loc>Berlin</publisher-loc>: <publisher-name>Peter Lang</publisher-name>.</citation></ref>
<ref id="ref44"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Ren</surname> <given-names>W.</given-names></name></person-group> (<year>2022</year>). <source>Second language pragmatics</source>. <publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>.</citation></ref>
<ref id="ref45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Robinson</surname> <given-names>P.</given-names></name></person-group> (<year>2001</year>). <article-title>Task complexity, task difficulty and task production: exploring interactions in a componential framework</article-title>. <source>Appl. Linguis.</source> <volume>22</volume>, <fpage>27</fpage>&#x2013;<lpage>57</lpage>. doi: <pub-id pub-id-type="doi">10.1093/applin/22.1.27</pub-id></citation></ref>
<ref id="ref46"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Roever</surname> <given-names>C.</given-names></name></person-group> (<year>2004</year>). &#x201C;<article-title>Difficulty and practicality in tests of interlanguage pragmatics</article-title>,&#x201D; in <source>Studying speaking to inform language learning</source>. eds. <person-group person-group-type="editor"><name><surname>Boxer</surname> <given-names>D.</given-names></name> <name><surname>Cohen</surname> <given-names>A.</given-names></name></person-group> (<publisher-loc>Clevedon</publisher-loc>: <publisher-name>Multilingual Matters Ltd</publisher-name>), <fpage>283</fpage>&#x2013;<lpage>301</lpage>.</citation></ref>
<ref id="ref47"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Roever</surname> <given-names>C.</given-names></name></person-group> (<year>2005</year>). <source>Testing ESL pragmatics</source>. <publisher-loc>Frankfurt-am-Main</publisher-loc>: <publisher-name>Peter Lang</publisher-name>.</citation></ref>
<ref id="ref48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roever</surname> <given-names>C.</given-names></name></person-group> (<year>2006</year>). <article-title>Validation of a web-based test of ESL pragmalinguistics</article-title>. <source>Lang. Test.</source> <volume>23</volume>, <fpage>229</fpage>&#x2013;<lpage>256</lpage>. doi: <pub-id pub-id-type="doi">10.1191/0265532206lt329oa</pub-id></citation></ref>
<ref id="ref49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roever</surname> <given-names>C.</given-names></name></person-group> (<year>2007</year>). <article-title>DIF in the assessment of second language pragmatics</article-title>. <source>Lang. Assess. Q.</source> <volume>4</volume>, <fpage>165</fpage>&#x2013;<lpage>189</lpage>. doi: <pub-id pub-id-type="doi">10.1080/15434300701375733</pub-id></citation></ref>
<ref id="ref50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roever</surname> <given-names>C.</given-names></name></person-group> (<year>2011</year>). <article-title>Testing of second language pragmatics: past and future</article-title>. <source>Lang. Test.</source> <volume>28</volume>, <fpage>463</fpage>&#x2013;<lpage>481</lpage>. doi: <pub-id pub-id-type="doi">10.1177/0265532210394633</pub-id></citation></ref>
<ref id="ref51"><citation citation-type="book"><person-group person-group-type="editor"><name><surname>Ross</surname> <given-names>S. J.</given-names></name> <name><surname>Kasper</surname> <given-names>G.</given-names></name></person-group> (Eds.) (<year>2013</year>). <source>Assessing second language pragmatics</source>. <publisher-loc>Hampshire, UK</publisher-loc>: <publisher-name>Palgrave Macmillan</publisher-name>.</citation></ref>
<ref id="ref52"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Searle</surname> <given-names>J.</given-names></name></person-group> (<year>1969</year>). <source>Speech acts: An essay in the philosophy of language</source>. <publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>.</citation></ref>
<ref id="ref53"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Searle</surname> <given-names>J.</given-names></name></person-group> (<year>1975</year>). &#x201C;<article-title>Indirect speech acts</article-title>,&#x201D; in <source>Syntax and semantics 3: Speech acts</source>. eds. <person-group person-group-type="editor"><name><surname>Cole</surname> <given-names>P.</given-names></name> <name><surname>Morgan</surname> <given-names>J.</given-names></name></person-group> (<publisher-loc>New York</publisher-loc>: <publisher-name>Academic Press</publisher-name>), <fpage>59</fpage>&#x2013;<lpage>82</lpage>.</citation></ref>
<ref id="ref54"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Skehan</surname> <given-names>P.</given-names></name></person-group> (<year>1998</year>). <source>A cognitive approach to language learning</source>. <publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>.</citation></ref>
<ref id="ref55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Taguchi</surname> <given-names>N.</given-names></name></person-group> (<year>2007</year>). <article-title>Task difficulty in oral speech act production</article-title>. <source>Appl. Linguis.</source> <volume>28</volume>, <fpage>113</fpage>&#x2013;<lpage>135</lpage>. doi: <pub-id pub-id-type="doi">10.1093/applin/aml051</pub-id></citation></ref>
<ref id="ref56"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Taguchi</surname> <given-names>N.</given-names></name></person-group> (<year>2012</year>). <source>Context, individual differences and pragmatic competence</source>. <publisher-loc>Bristol</publisher-loc>: <publisher-name>Multilingual Matters</publisher-name>.</citation></ref>
<ref id="ref57"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Taguchi</surname> <given-names>N.</given-names></name> <name><surname>Kim</surname> <given-names>Y.</given-names></name></person-group> (<year>2018</year>). &#x201C;<article-title>Task-based approaches to teaching and assessing pragmatics: an overview</article-title>,&#x201D; in <source>Task-based approaches to teaching and assessing pragmatics</source>. eds. <person-group person-group-type="editor"><name><surname>Taguchi</surname> <given-names>N.</given-names></name> <name><surname>Kim</surname> <given-names>Y.</given-names></name></person-group> (<publisher-loc>Amsterdam/New York</publisher-loc>: <publisher-name>John Benjamins</publisher-name>), <fpage>1</fpage>&#x2013;<lpage>26</lpage>.</citation></ref>
<ref id="ref58"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Yamashita</surname> <given-names>S.</given-names></name></person-group> (<year>1996</year>). <source><italic>Six measures of JSL pragmatics</italic> (technical report #14)</source>. <publisher-loc>Honolulu</publisher-loc>: <publisher-name>University of Hawaii, Second Language Teaching and Curriculum Center</publisher-name>.</citation></ref>
<ref id="ref59"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Yoshitake</surname> <given-names>S. S.</given-names></name></person-group> (<year>1997</year>). <source><italic>Measuring interlanguage pragmatic competence of Japanese students of English as a foreign language: A multi-test framework evaluation</italic> (unpublished doctoral dissertation)</source>. <publisher-name>Columbia Pacific University</publisher-name>, <publisher-loc>Novata, CA</publisher-loc>.</citation></ref>
<ref id="ref60"><citation citation-type="other"><person-group person-group-type="author"><name><surname>Youn</surname> <given-names>S. J.</given-names></name></person-group> (<year>2008</year>). Rater variation in paper vs. web-based KFL pragmatic assessment using FACETS analysis. Unpublished munuscript, University of Hawai&#x2019;i, Honolulu, HI.</citation></ref>
<ref id="ref61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Youn</surname> <given-names>S. J.</given-names></name></person-group> (<year>2015</year>). <article-title>Validity argument for assessing L2 pragmatics in interaction using mixed methods</article-title>. <source>Lang. Test.</source> <volume>32</volume>, <fpage>199</fpage>&#x2013;<lpage>225</lpage>. doi: <pub-id pub-id-type="doi">10.1177/0265532214557113</pub-id></citation></ref>
<ref id="ref62"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Youn</surname> <given-names>S. J.</given-names></name></person-group> (<year>2019</year>). &#x201C;<article-title>Assessment in L2 pragmatics</article-title>&#x201D; in <source>The Routledge handbook of second language acquisition and pragmatics</source>. ed. <person-group person-group-type="editor"><name><surname>Taguchi</surname> <given-names>N.</given-names></name></person-group> (<publisher-loc>New York</publisher-loc>: <publisher-name>Routledge</publisher-name>), <fpage>308</fpage>&#x2013;<lpage>321</lpage>.</citation></ref>
<ref id="ref63"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Youn</surname> <given-names>S. J.</given-names></name> <name><surname>Brown</surname> <given-names>J. D.</given-names></name></person-group> (<year>2013</year>). &#x201C;<article-title>Item difficulty and heritage language learner status in pragmatic tests for Korean as a foreign language</article-title>,&#x201D; in <source>Assessing second language pragmatics</source>. eds. <person-group person-group-type="editor"><name><surname>Ross</surname> <given-names>S. J.</given-names></name> <name><surname>Kasper</surname> <given-names>G.</given-names></name></person-group> (<publisher-loc>Hampshire, UK</publisher-loc>: <publisher-name>Palgrave Macmillan</publisher-name>), <fpage>98</fpage>&#x2013;<lpage>123</lpage>.</citation></ref>
</ref-list>
</back>
</article>