<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="review-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Commun.</journal-id>
<journal-title>Frontiers in Communication</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Commun.</abbrev-journal-title>
<issn pub-type="epub">2297-900X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">626985</article-id>
<article-id pub-id-type="doi">10.3389/fcomm.2020.626985</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Communication</subject>
<subj-group>
<subject>Mini Review</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Ease and Difficulty in L2 Pronunciation Teaching: A Mini-Review</article-title>
<alt-title alt-title-type="left-running-head">O&#x2019;Brien</alt-title>
<alt-title alt-title-type="right-running-head">L2 Pronunciation Teaching</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>O&#x2019;Brien</surname>
<given-names>Mary Grantham</given-names>
</name>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="http://loop.frontiersin.org/people/78353/overview"/>
</contrib>
</contrib-group>
<aff>School of Languages, Linguistics, Literatures and Cultures, University of Calgary, <addr-line>Calgary</addr-line>, <addr-line>AB</addr-line>, <country>Canada</country>
</aff>
<author-notes>
<corresp id="c001">&#x2a;Correspondence: Mary Grantham O&#x2019;Brien, <email>mgobrien@ucalgary.ca</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Language Sciences, a section of the journal Frontiers in Communication</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/112714">Antonio Ben&#xed;tez-Burraco</ext-link>, Sevilla University, Spain</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1000949">Murray J Munro</ext-link>, Simon Fraser University, Canada</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1028018">John M. Levis</ext-link>, Iowa State University, United States</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>16</day>
<month>02</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2020</year>
</pub-date>
<volume>5</volume>
<elocation-id>626985</elocation-id>
<history>
<date date-type="received">
<day>07</day>
<month>11</month>
<year>2020</year>
</date>
<date date-type="accepted">
<day>24</day>
<month>12</month>
<year>2020</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2021 O&#x2019;Brien.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>O&#x2019;Brien</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>Both L2 learners and their teachers are concerned about pronunciation. While an unspoken classroom goal is often native-accented speech (i.e., a spoken variety of the mother tongue that it not geographically confined to a place within a particular country), pronunciation researchers tend to agree that comprehensible speech (i.e., speech that can be easily understood by an interlocutor) is a more realistic goal. A host of studies have demonstrated that certain types of training can result in more comprehensible L2 speech. This contribution considers research on training the perception and production of both segmental (i.e., speech sounds) and suprasegmental features (i.e., stress, rhythm, tone, intonation). Before we can determine whether a given pronunciation feature is easy or difficult to teach and&#x2014;more importantly&#x2014;to learn, we must focus on: 1) setting classroom priorities that place comprehensibility of L2 speech at the forefront; and 2) relying upon insights gained through research into L2 pronunciation training. The goal of the mini-review is to help contextualize the papers presented in this collection.</p>
</abstract>
<kwd-group>
<kwd>second language</kwd>
<kwd>pronunciation</kwd>
<kwd>training</kwd>
<kwd>priorities</kwd>
<kwd>effectiveness</kwd>
<kwd>comprehensibility</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p>Researchers and teachers alike agree that most adult second language (L2) learners will not sound like native speakers and that speaking with a nonnative accent is normal (<xref ref-type="bibr" rid="B19">Derwing and Munro, 2009</xref>). Nonetheless, both teachers and students express a desire for learners to achieve native-accented speech (<xref ref-type="bibr" rid="B81">Timmis, 2002</xref>; <xref ref-type="bibr" rid="B75">Sifakis and Sougari, 2005</xref>; <xref ref-type="bibr" rid="B74">Scales et al., 2006</xref>). Thus, the nativeness principle (i.e., a belief that nativelike pronunciation is both achievable and enviable (<xref ref-type="bibr" rid="B51">Levis, 2005</xref>; <xref ref-type="bibr" rid="B50">Levis, 2020</xref>)), serves as an implied objective in many language classrooms. In spite of this, recent studies demonstrate that teachers engage only intermittently in classroom pronunciation training, primarily because they lack training (<xref ref-type="bibr" rid="B20">Derwing and Munro, 2015</xref>) or confidence (<xref ref-type="bibr" rid="B4">Baker, 2011</xref>) or because they have relatively little knowledge about how to teach and assess pronunciation (<xref ref-type="bibr" rid="B6">Baker and Murphy, 2011</xref>; <xref ref-type="bibr" rid="B5">Baker, 2014</xref>; <xref ref-type="bibr" rid="B16">Couper, 2017</xref>). When they do teach pronunciation in their classrooms, teachers tend to focus on segmental production (<xref ref-type="bibr" rid="B27">Foote et al., 2016</xref>; <xref ref-type="bibr" rid="B52">Levis, 2016</xref>; <xref ref-type="bibr" rid="B16">Couper, 2017</xref>), most probably because materials&#x2014;especially textbooks&#x2014;tend to focus on segments (<xref ref-type="bibr" rid="B17">Derwing et al., 2012a</xref>; <xref ref-type="bibr" rid="B27">Foote et al., 2016</xref>).</p>
<p>It is not surprising that teachers might be reluctant to teach pronunciation if their ultimate objective is native-accented speech. However, a host of recent studies have demonstrated that being understood is a more realistic goal (<xref ref-type="bibr" rid="B20">Derwing and Munro, 2015</xref>). The intelligibility principle, with its acknowledgment that most foreign-accented speech is comprehensible<xref ref-type="fn" rid="FN1">
<sup>1</sup>
</xref>, thus guides recent L2 pronunciation research (<xref ref-type="bibr" rid="B51">Levis, 2005</xref>; <xref ref-type="bibr" rid="B50">Levis, 2020</xref>). Researchers generally agree that both segments and suprasegmental features play an important role in being understood (<xref ref-type="bibr" rid="B20">Derwing and Munro, 2015</xref>) and that explicit pronunciation training can have a positive impact on the comprehensibility of L2 speech (<xref ref-type="bibr" rid="B22">Derwing et al., 1998</xref>; <xref ref-type="bibr" rid="B38">Isaacs, 2009</xref>; <xref ref-type="bibr" rid="B46">Lee et al., 2014</xref>; <xref ref-type="bibr" rid="B79">Thomson and Derwing, 2015</xref>).</p>
<p>Given the unspoken classroom goal of native-accented speech coupled with the sporadic attention paid to pronunciation on the one hand, and the research focus on comprehensible speech and a recommendation for regular pronunciation instruction on the other hand, there is clearly a disconnect between pedagogical practice and research findings. This contribution&#x2019;s focus on teaching pronunciation therefore considers the notions of ease and difficulty from two perspectives: 1) setting classroom priorities that place comprehensibility of L2 speech at the forefront; and 2) relying upon insights into research-informed L2 pronunciation training.</p>
</sec>
<sec id="s2">
<title>Defining Ease and Difficulty in L2 Pronunciation Teaching<xref ref-type="fn" rid="FN2">
<sup>2</sup>
</xref>
</title>
<p>Determining whether a given pronunciation feature&#x2014;segmental or suprasegmental&#x2014;is more or less difficult to learn depends on the extent to which improvement is shown after training. Given the variation in how pronunciation features are trained, how speech samples are elicited (e.g., reading individual words, sentences or paragraphs; repetition of a model speaker; semi-spontaneous or spontaneous utterances), and how improvement is measured (e.g., acoustic analyses, listener intelligibility tasks, listener ratings of comprehensibility and/or foreign accentedness), the field of L2 pronunciation research does not have an agreed-upon standard for determining whether a given type of training is successful. Nonetheless, the results of two recent meta-analyses have shown that pronunciation instruction almost always leads to improvement (<xref ref-type="bibr" rid="B46">Lee et al., 2014</xref>; <xref ref-type="bibr" rid="B79">Thomson and Derwing, 2015</xref>).</p>
<p>As a starting point in distinguishing between easy and difficult pronunciation features, it is important to consider the factors that may play a role in L2 pronunciation. First among these is language pairings: the combination of a learner&#x2019;s first language (L1) and their L2. Studies investigating similar groups of L1 learners of the same L2 often report conflicting results. For example, although the Japanese speakers in <xref ref-type="bibr" rid="B35">Haslam (2011)</xref> did not show improvement in English /l/ and /&#x279;/ production even after training, other studies have shown improvement on these same segments among Japanese learners (e.g., <xref ref-type="bibr" rid="B33">Hardison, 2003</xref>; <xref ref-type="bibr" rid="B36">Hazan et al., 2005</xref>). The Mandarin native speakers who were trained in English vowel perception in <xref ref-type="bibr" rid="B83">Wang (2002)</xref> did not improve in their production of English vowels, but those in <xref ref-type="bibr" rid="B78">Thomson (2011)</xref> did. Given these inconsistent findings, it is clear that other factors must be at play in the ultimate success of pronunciation training. As such, L2 pronunciation researchers look beyond language pairings in their assessments of success of a given type of training. Additional factors may include participant&#x2019;s age of learning (<xref ref-type="bibr" rid="B2">Aoyama et al., 2008</xref>; <xref ref-type="bibr" rid="B7">Baker, 2010</xref>), quality of target language interactions (<xref ref-type="bibr" rid="B20">Derwing and Munro, 2015</xref>), motivational factors (<xref ref-type="bibr" rid="B62">Nagle, 2018</xref>), and learners&#x2019; involvement in instructional decisions (<xref ref-type="bibr" rid="B41">Jenkins, 2004</xref>).</p>
</sec>
<sec id="s3">
<title>Setting Priorities</title>
<p>When it comes to determining which pronunciation features are easy and which are hard to learn, some research has shown that certain features are so easy to learn that they do not need to be trained. For example, the Mandarin- and Slavic-speaking learners of English in <xref ref-type="bibr" rid="B18">Derwing et al. (2012b)</xref> demonstrated an ability to accurately perceive sentence stress, intonation and the -teen/-ty distinction in the absence of instruction. While we should not deduce from such findings that accurate perception will result in accurate production, it makes little sense to train such features&#x2014;in this case the perception thereof&#x2014;in the classroom or to investigate their development. Moreover, individual variation is also quite common, and certain exceptional learners may not require training. For example, two Dutch-speaking learners of Slovak in <xref ref-type="bibr" rid="B32">Hanul&#xed;kov&#xe1; et al. (2012)</xref> demonstrated nativelike perception and pronunciation of Slovak consonant clusters after only 15&#xa0;min of exposure to the language. It is thus important to know which pronunciation features learners have mastered so that teachers do not waste time focusing on features that do not need to be trained.</p>
<p>In order to determine which pronunciation features learners have difficulty with and thus which should be the focus of classroom training, instructors are encouraged to develop a pronunciation needs assessment as described by <xref ref-type="bibr" rid="B20">Derwing and Munro (2015)</xref>. Instructors should consider collecting both read and extemporaneous speech samples and assessing the samples both globally and analytically to determine learners&#x2019; difficulties. The authors note that a perceptual task that requires learners to demonstrate their ability to perceive relevant segmental and suprasegmental distinctions can further guide the development of a pronunciation curriculum.</p>
<p>With the results of an assessment in hand, teachers are able to set priorities for their classrooms. Those pronunciation features that both cause difficulty and affect learners&#x2019; comprehensibility&#x2014;or those with the highest functional load (<xref ref-type="bibr" rid="B11">Catford, 1987</xref>)&#x2014;should be the focus of training. At the segmental level, functional load can be determined, among other things, on the basis of the number of minimal pairs that are distinguished by two segments. For example, contrasting /l/ and /n/ distinguishes more English words than does producing a contrast between /d/ and /&#xf0;/ (<xref ref-type="bibr" rid="B61">Munro and Derwing, 2006</xref>). Although researchers have not established a functional load hierarchy for prosodic features of English, lexical (<xref ref-type="bibr" rid="B85">Zielinski, 2008</xref>; <xref ref-type="bibr" rid="B39">Isaacs and Trofimovich, 2012</xref>) and sentential stress assignment<xref ref-type="fn" rid="FN3">
<sup>3</sup>
</xref> (<xref ref-type="bibr" rid="B31">Hahn, 2004</xref>) both play an important role in being understood. While we have a good idea of which pronunciation features of English play a central role in understanding speech, that work is lacking for other target languages. Thus, when setting both segmental and suprasegmental pronunciation priorities in classes with target languages other than English, teachers are encouraged in their evaluation of their students&#x0027; pronunciation needs assessments to consider the extent to which producing given distinctions plays a role in their ability to understand their students&#x2019; speech.</p>
</sec>
<sec id="s4">
<title>Evaluating the Effectiveness of Training</title>
<p>Language learners&#x2014;especially those in the early stages of language learning&#x2014;tend to show improvement in their pronunciation over time. Thus, in order to determine whether a given type of training is effective, it is important when conducting research to include both a comparison group that receives a different type of training and a control group that receives no training. In addition, a delayed posttest allows researchers to determine whether the effects of training are long lasting (<xref ref-type="bibr" rid="B79">Thomson and Derwing, 2015</xref>).</p>
<p>Pronunciation improvement can be determined in two main ways: listener ratings and acoustic analyses. While listener ratings of understanding are considered the gold standard in pronunciation research (<xref ref-type="bibr" rid="B19">Derwing and Munro, 2009</xref>), some training studies also make use of acoustic analyses. Much of the research investigating the effectiveness of pronunciation training uses measures of understanding including comprehensibility ratings (e.g., <xref ref-type="bibr" rid="B25">Foote and McDonough, 2017</xref>; <xref ref-type="bibr" rid="B57">Martin, 2018</xref>) or intelligibility tasks (e.g., <xref ref-type="bibr" rid="B21">Derwing et al., 2014</xref>), often together with ratings of fluency and/or foreign accentedness. Acoustic analyses, completed by hand (e.g., <xref ref-type="bibr" rid="B15">Counselman, 2015</xref>) or automatically (e.g., <xref ref-type="bibr" rid="B76">Suemitsu et al., 2015</xref>; <xref ref-type="bibr" rid="B77">Tejedor-Garc&#xed;a et al., 2020</xref>) are also common and can be used to determine the extent to which certain pronunciation features change over time. Researchers note, however, that significant acoustic differences may not align with listener judgments (<xref ref-type="bibr" rid="B20">Derwing and Munro, 2015</xref>).</p>
<p>While few classroom teachers are able to carry out systematic analyses of their students&#x2019; pronunciation development, they are encouraged to rely upon pronunciation training methods whose effectiveness has been demonstrated via research. Some of this work is outlined below.</p>
</sec>
<sec id="s5">
<title>Research-Informed Pronunciation Training</title>
<p>After setting priorities, the next step is to choose how to most effectively train pronunciation. While a teacher&#x2019;s status as a native or nonnative speaker of the target language does not play a role in learners&#x2019; ultimate pronunciation (<xref ref-type="bibr" rid="B53">Levis et al., 2016</xref>), the results of research have generally demonstrated that explicit, form-focused instruction along with corrective feedback provides the greatest benefits to learners (<xref ref-type="bibr" rid="B72">Saito and Lyster, 2012</xref>; <xref ref-type="bibr" rid="B71">Saito, 2013</xref>). <xref ref-type="bibr" rid="B21">Derwing et al. (2014)</xref> describe an emergent training program designed to meet English language learners&#x2019; (L1 &#x3d; Vietnamese or Khmer) workplace needs. The classroom instruction, which targeted both perception and production, focused on those aspects of the participants&#x2019; speech that affected their intelligibility (i.e., consonant clusters, rhythm and intonation). Participants&#x2019; comprehensibility improved after only 17&#xa0;h of classroom-based training.</p>
<p>A relatively large number of recent studies have investigated the effectiveness of ways to train pronunciation outside of the classroom. Researchers point to a number of benefits of computer-assisted pronunciation training (CAPT). These include unlimited practice time and flexibility as well as opportunities for varied input and immediate feedback (<xref ref-type="bibr" rid="B24">Engwall et al, 2004</xref>; <xref ref-type="bibr" rid="B49">Levis, 2007</xref>). <xref ref-type="bibr" rid="B28">Gao and Hanna (2016)</xref> indicate a further benefit: a computer&#x2019;s capacity for providing &#x201c;infinite, patient modeling&#x201d; (p. 214). An element of fun is also often added to CAPT. For example, <xref ref-type="bibr" rid="B8">Barcomb and Cardoso (2020)</xref> demonstrate the effectiveness of gamified pronunciation training (i.e., training that includes elements of a game but that is not actually a game). The Japanese junior high school learners of English in that study were rewarded with points and badges as they completed a series of metalinguistic tasks and perception and pronunciation activities focusing on English /l/ and /&#x279;/. Learners in the study demonstrated both increased metalinguistic awareness and improved pronunciation accuracy over time. While a range of CAPT activity types exist, this contribution will focus on three that have been shown to play a positive role in improving learners&#x2019; production: 1) listen and repeat; 2) perceptual training; and 3) visualization.</p>
<p>Although the effectiveness of traditional listen and repeat pronunciation tasks may be limited (<xref ref-type="bibr" rid="B68">O&#x2019;Brien, 2019</xref>), a popular and effective way of training pronunciation by listening to a recording and then recording oneself is shadowing. The English learners in <xref ref-type="bibr" rid="B25">Foote and McDonough (2017)</xref> completed eight weeks of shadowing tasks in which they immediately repeated and recorded themselves while echoing dialogues from a sitcom as closely as possible. The task encouraged learners to focus on suprasegmental aspects of speech. Listeners rated pre-test, mid-training and post-test extemporaneous recordings for comprehensibility, accentedness and fluency. The authors found that learners had positive attitudes toward the activities and that learners&#x2019; comprehensibility and fluency improved over time. A number of additional researchers have demonstrated the effectiveness of shadowing for the development of both segments (<xref ref-type="bibr" rid="B84">Zaj&#x105;c and Rojczyk, 2014</xref>) and suprasegmental features (<xref ref-type="bibr" rid="B56">Lima, 2015</xref>).</p>
<p>Studies have investigated the efficacy of perceptual training for improving production (e.g., <xref ref-type="bibr" rid="B15">Counselman, 2015</xref>; <xref ref-type="bibr" rid="B44">Lee and Lyster, 2016</xref>; <xref ref-type="bibr" rid="B73">Sakai and Moorman, 2018</xref>). A popular and effective means of improving primarily segmental production through perceptual training is high variability phonetic training (HVPT), which trains listeners&#x2019; perception with a relatively large quantity of speech samples that are produced by multiple speakers in a range of phonetic contexts (<xref ref-type="bibr" rid="B80">Thomson, 2018</xref>). The results of HVPT studies speak in its favor for the improvement of English vowels by native speakers of Greek (<xref ref-type="bibr" rid="B47">Lengeris, 2018</xref>), Mandarin (<xref ref-type="bibr" rid="B78">Thomson, 2011</xref>) and French (<xref ref-type="bibr" rid="B40">Iverson et al., 2011</xref>), as well as for the improvement of English consonants including English /l/ and /&#x279;/ by Japanese speakers (<xref ref-type="bibr" rid="B10">Bradlow et al., 1997</xref>) and a number of English consonants by Korean learners (e.g., <xref ref-type="bibr" rid="B37">Huensch and Tremblay, 2015</xref>; <xref ref-type="bibr" rid="B45">Lee and Hwang, 2016</xref>). An additional type of perceptual training that has shown positive results is the use of speech synthesis systems (<xref ref-type="bibr" rid="B59">Mixdorff and Munro, 2013</xref>). For example, <xref ref-type="bibr" rid="B55">Liakin et al., (2017)</xref> found that L2 learners of French who made use of a simple text-to-speech (TTS) app on their mobile devices improved similarly to those learners who engaged in conversational practice with, and received feedback on their pronunciation from, their teachers in their in their production of French liaison. A highly innovative synthesis system that has demonstrated great promise generates a synthetic, native-accented version of a speaker&#x2019;s own voice (<xref ref-type="bibr" rid="B23">Ding et al., 2019</xref>). Participants in the study who made use of this so-called &#x201c;golden speaker&#x201d; version of their own voices showed improved comprehensibility and fluency.</p>
<p>Visualization techniques&#x2014;including the use of acoustic displays (i.e., waveforms, spectrograms, and pitch tracks), ultrasound images that provide feedback on articulatory processes, and talking heads that provide learners with access to facial movements&#x2014;allow learners to receive real-time visual feedback on productions. Tools used for visualization can include those designed for acoustic analyses such as Praat (<xref ref-type="bibr" rid="B9">Boersma and Weenink, 2020</xref>) and Audacity (<xref ref-type="bibr" rid="B3">Audacity Team, 2020</xref>) along with software that has been designed specifically to focus on L2 learners&#x2019; pronunciation (e.g., <xref ref-type="bibr" rid="B30">Godfroid et al., 2017</xref>). At the segmental level, researchers have demonstrated that teaching learners how to interpret formant frequencies may enable them to improve their vowel productions, as demonstrated the native speakers of Japanese learning American English /&#xe6;/ in <xref ref-type="bibr" rid="B76">Suemitsu et al. (2015)</xref>.<xref ref-type="fn" rid="FN4">
<sup>4</sup>
</xref> The English-Spanish L2 learners in <xref ref-type="bibr" rid="B66">Olson (2019)</xref>, <xref ref-type="bibr" rid="B64">Offerman and Olson (2016)</xref>, and <xref ref-type="bibr" rid="B67">Olson and Offerman (2020)</xref> who learned to interpret waveforms and spectrograms showing Spanish voice onset time also showed improvement after instruction. A number of researchers advocate for the use of waveforms and spectrograms for the teaching of suprasegmentals, especially duration and intonation (e.g., <xref ref-type="bibr" rid="B48">Levis, 1999</xref>; <xref ref-type="bibr" rid="B34">Hardison, 2004</xref>; <xref ref-type="bibr" rid="B13">Chun, 2013</xref>). For example, <xref ref-type="bibr" rid="B54">Levis and Pickering (2004)</xref> demonstrated the effectiveness of teaching contextualized discourse intonation to L2 learners of English by tracking intonation contours. The L2 Japanese learners in <xref ref-type="bibr" rid="B65">Okuno and Hardison (2016)</xref> received either audiovisual training consisting of audio files and waveform displays, audio-only training, or no training on vowel duration in Japanese. While participants in both experimental groups showed improvement and the ability to generalize what they learned to novel stimuli and new voices, participants in the audiovisual group improved their productions more than participants in the audio-only group. Similarly, <xref ref-type="bibr" rid="B60">Motohashi-Saigo and Hardison (2009)</xref> demonstrated the effectiveness of visualizations in learning vowel length and singleton/geminate distinctions. <xref ref-type="bibr" rid="B14">Chun et al. (2015)</xref> showed that L2 learners of Mandarin who compared the pitch contours of their own tone production with those of native speakers improved in their production of tones.</p>
<p>The type of feedback learners receive plays an important role in the extent of their improvement. <xref ref-type="bibr" rid="B44">Lee and Lyster (2016)</xref> investigated the effect of different types of corrective feedback on a series of perceptual tasks on the production accuracy of Korean-English L2 learners&#x2019; vowels. Corrective feedback that took the form of either 1) rejection (i.e., indicating that the chosen answer was wrong) together with the target form; or 2) rejection together with the nontarget form was more effective than feedback that included either 3) a rejection along with both the target and nontarget forms; or 4) rejection only. The authors take this as evidence that providing learners with feedback indicating that their responses are incorrect is not sufficient for learning to occur.</p>
<p>It is important to consider that computer software designed to assess pronunciation &#x201c;is not based on any particular theory or model of pronunciation which differentiates variation from (true) error&#x201d; (<xref ref-type="bibr" rid="B70">Pennington, 1999</xref>; p. 431). As such, most CAPT promotes accuracy over intelligibility (<xref ref-type="bibr" rid="B49">Levis, 2007</xref>). Finally, although automatic speech recognition (ASR), which relies on a combination of acoustic analyses and artificial intelligence, has been touted as a promising way to evaluate and provide feedback on pronunciation (<xref ref-type="bibr" rid="B69">O&#x2019;Brien et al., 2018</xref>), a number of researchers point to the relatively few studies that align ASR error detection and human judgments of speech (e.g., <xref ref-type="bibr" rid="B13">Chun, 2013</xref>; <xref ref-type="bibr" rid="B12">Chen and Li, 2016</xref>; <xref ref-type="bibr" rid="B42">Johnson and Kang, 2017</xref>; <xref ref-type="bibr" rid="B58">McCrocklin and Edalatishams, 2020</xref>).<xref ref-type="fn" rid="FN5">
<sup>5</sup>
</xref>
</p>
</sec>
<sec id="s6">
<title>Additional Factors</title>
<p>In addition to the type of pronunciation training and feedback learners receive, a number of other factors play a role in the success of training. Central among these is learner awareness. Although research has generally shown that learners have difficulty assessing their own pronunciation (e.g., <xref ref-type="bibr" rid="B82">Trofimovich et al., 2016</xref>), learners&#x2019; awareness of pronunciation features may be positively related to listeners&#x0027; comprehensibility ratings of their speech (<xref ref-type="bibr" rid="B43">Kennedy and Trofimovich, 2010</xref>). Explicit tasks that encourage awareness may be especially beneficial. For example, <xref ref-type="bibr" rid="B1">A&#xf1;orga and Benander (2015)</xref> demonstrated the effectiveness of tasks that encourage learners to compare their own productions with models. Along similar lines, in addition to carrying out a range of production tasks, the German L2 learners in <xref ref-type="bibr" rid="B57">Martin (2018)</xref> completed tasks that required them to distinguish between foreign-accented and native speech. Their comprehensibility improved over time.</p>
<p>Additional factors that may play a role in the effectiveness of pronunciation training can include learners&#x2019; proficiency levels, the length of training, and number of trained phonemes (<xref ref-type="bibr" rid="B73">Sakai and Moorman, 2018</xref>). Research has demonstrated that learners at lower levels of proficiency tend to make faster progress than more advanced learners (<xref ref-type="bibr" rid="B73">Sakai and Moorman, 2018</xref>), that there is an optimal length of pronunciation training (<xref ref-type="bibr" rid="B46">Lee et al., 2014</xref>; <xref ref-type="bibr" rid="B67">Olson and Offerman, 2020</xref>), and that the number of targeted phonemes should be constrained, possibly to as few as three (<xref ref-type="bibr" rid="B73">Sakai and Moorman, 2018</xref>).<xref ref-type="fn" rid="FN6">
<sup>6</sup>
</xref>
</p>
</sec>
<sec sec-type="conclusion" id="s7">
<title>Conclusion</title>
<p>Accessing tools to train pronunciation has never been easier. Any language learner has easy access to a multitude of apps that promise to reduce accents quickly and easily. The focus of many of these tools, however, is often highly salient sounds that often do not play a role in comprehensibility and that may never improve after hours of training (<xref ref-type="bibr" rid="B26">Foote and Smith, 2013</xref>). This mini-review was written to provide readers of this collection with a background into the field of pronunciation training. Distinguishing between the notions of ease and difficulty in pronunciation teaching is overall much less important than distinguishing between effective and ineffective types of training. This is especially true if we consider the ultimate goal of pronunciation training to be comprehensible L2 speech.</p>
</sec>
</body>
<back>
<sec id="s8">
<title>Author Contributions</title>
<p>The author confirms being the sole contributor of this work and has approved it for publication.</p>
</sec>
<sec sec-type="COI-statement" id="s9">
<title>Conflict of Interest</title>
<p>The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>A&#xf1;orga</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Benander</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Creating a pronunciation profile of first-year Spanish students</article-title>. <source>Foreign Lang. Ann.</source> <volume>48</volume> (<issue>3</issue>), <fpage>434</fpage>&#x2013;<lpage>446</lpage>. <pub-id pub-id-type="doi">10.1111/flan.12151</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Aoyama</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Guion</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Flege</surname>
<given-names>J. E.</given-names>
</name>
<name>
<surname>Yamada</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Akahane-Yamada</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>The first years in an L2-speaking environment: a comparison of Japanese children and adults learning American English</article-title>. <source>Int. Rev. Appl. Linguist.</source> <volume>46</volume> (<issue>1</issue>), <fpage>61</fpage>&#x2013;<lpage>90</lpage>. <pub-id pub-id-type="doi">10.1515/IRAL.2008.003</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="web">
<collab>Audacity Team</collab> (<year>2020</year>). <article-title>Audacity</article-title>. <comment>Available at: <ext-link ext-link-type="uri" xlink:href="https://www.audacityteam.org/">https://www.audacityteam.org/</ext-link>
</comment>. </citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Baker</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Discourse prosody and teachers&#x2019; stated beliefs and practices</article-title>. <source>TESOL J.</source> <volume>2</volume> (<issue>3</issue>), <fpage>263</fpage>&#x2013;<lpage>292</lpage>. <pub-id pub-id-type="doi">10.5054/tj.2011.259955</pub-id> </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Baker</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Exploring teachers&#x2019; knowledge of second language pronunciation techniques: teacher cognitions, observed classroom practices, and student perceptions</article-title>. <source>Tesol Q.</source> <volume>48</volume> (<issue>1</issue>), <fpage>136</fpage>&#x2013;<lpage>163</lpage>. <pub-id pub-id-type="doi">10.1002/tesq.99</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Baker</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Murphy</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Knowledge base of pronunciation teaching: staking out the territory</article-title>. <source>TESL Can. J.</source> <volume>28</volume> (<issue>2</issue>), <fpage>29</fpage>&#x2013;<lpage>50</lpage>. <pub-id pub-id-type="doi">10.18806/tesl.v28i2.1071</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Baker</surname>
<given-names>W.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Effects of age and experience on the production of English word&#x2013;final stops by Korean speakers</article-title>. <source>Biling. Lang. Cognit.</source> <volume>13</volume> (<issue>3</issue>), <fpage>263</fpage>&#x2013;<lpage>278</lpage>. <pub-id pub-id-type="doi">10.1017/S136672890999006X</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Barcomb</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Cardoso</surname>
<given-names>W.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Rock or lock? Gamifying an online course management system for pronunciation instruction: focus on English/r/and/l/</article-title>. <source>CALICO J.</source> <volume>37</volume> (<issue>2</issue>), <fpage>127</fpage>&#x2013;<lpage>147</lpage>. <pub-id pub-id-type="doi">10.1558/cj.36996</pub-id> </citation>
</ref>
<ref id="B9">
<citation citation-type="web">
<person-group person-group-type="author">
<name>
<surname>Boersma</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Weenink</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Praat: Doing phonetics by computer [Computer program] version 6.1.27</article-title>. <comment>Available at: <ext-link ext-link-type="uri" xlink:href="http://www.praat.org/">http://www.praat.org/</ext-link>
</comment> (<comment>Accessed</comment> <month>October</month> <day>25</day>, <year>2020</year>). </citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bradlow</surname>
<given-names>A. R.</given-names>
</name>
<name>
<surname>Pisoni</surname>
<given-names>D. B.</given-names>
</name>
<name>
<surname>Akahane-Yamada</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Tohkura</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>1997</year>). <article-title>Training Japanese listeners to identify English/r/and/l/: IV. Some effects of perceptual learning on speech production</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>101</volume> (<issue>4</issue>), <fpage>2299</fpage>&#x2013;<lpage>2310</lpage>. <pub-id pub-id-type="doi">10.1121/1.418276</pub-id> </citation>
</ref>
<ref id="B11">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Catford</surname>
<given-names>J. C.</given-names>
</name>
</person-group> (<year>1987</year>). &#x201c;<article-title>Phonetics and the teaching of pronunciation</article-title>,&#x201d; in <source>Current perspectives on pronunciation: practices anchored in theory</source>. Editor <person-group person-group-type="editor">
<name>
<surname>Morley</surname>
<given-names>J.</given-names>
</name>
</person-group> (<publisher-loc>Alexandria, VA</publisher-loc>: <publisher-name>Teachers of English to Speakers of Other Languages</publisher-name>), <fpage>87</fpage>&#x2013;<lpage>100</lpage>. </citation>
</ref>
<ref id="B12">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>N. F.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2016</year>). &#x201c;<article-title>Computer-assisted pronunciation training: from pronunciation scoring towards spoken language learning</article-title>,&#x201d; in <conf-name>Asia-Pacific signal and information processing association annual summit and conference (APSIPA)</conf-name>, <conf-loc>Jeju, South Korea</conf-loc>, <conf-date>December 13&#x2013;16, 2016</conf-date> (<publisher-loc>Seoul, South Korea</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>1</fpage>&#x2013;<lpage>7</lpage>. <pub-id pub-id-type="doi">10.1109/APSIPA.2016.7820782</pub-id> </citation>
</ref>
<ref id="B13">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Chun</surname>
<given-names>D. M.</given-names>
</name>
</person-group> (<year>2013</year>). &#x201c;<article-title>Computer-assisted pronunciation teaching</article-title>,&#x201d; in <source>Encyclopedia of applied linguistics</source>. Editor <person-group person-group-type="editor">
<name>
<surname>Chapelle</surname>
<given-names>C. A.</given-names>
</name>
</person-group> (<publisher-loc>Oxford, United Kingdom</publisher-loc>: <publisher-name>Wiley-Blackwell</publisher-name>), <fpage>823</fpage>&#x2013;<lpage>834</lpage>. <pub-id pub-id-type="doi">10.1002/9781405198431.wbeal0172</pub-id> </citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chun</surname>
<given-names>D. M.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Meyr</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Acquisition of L2 Mandarin Chinese tones with learner-created tone visualizations</article-title>. <source>J. Sec. Lang. Pron.</source> <volume>1</volume> (<issue>1</issue>), <fpage>86</fpage>&#x2013;<lpage>114</lpage>. <pub-id pub-id-type="doi">10.1075/jslp.1.1.04chu</pub-id> </citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Counselman</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Directing attention to pronunciation in the second language classroom</article-title>. <source>Hispania</source> <volume>98</volume> (<issue>1</issue>), <fpage>31</fpage>&#x2013;<lpage>46</lpage>. <pub-id pub-id-type="doi">10.1353/hpn.2015.0006</pub-id> </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Couper</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Teacher cognition of pronunciation teaching: teachers&#x2019; concerns and issues</article-title>. <source>Tesol Q.</source> <volume>51</volume> (<issue>4</issue>), <fpage>820</fpage>&#x2013;<lpage>843</lpage>. <pub-id pub-id-type="doi">10.1002/tesq.354</pub-id> </citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Derwing</surname>
<given-names>T. M.</given-names>
</name>
<name>
<surname>Diepenbroek</surname>
<given-names>L. G.</given-names>
</name>
<name>
<surname>Foote</surname>
<given-names>J. A.</given-names>
</name>
</person-group> (<year>2012a</year>). <article-title>How well do general skills ESL textbooks address pronunciation?</article-title> <source>TESL Can. J.</source> <volume>30</volume> (<issue>1</issue>), <fpage>22</fpage>&#x2013;<lpage>44</lpage>. <pub-id pub-id-type="doi">10.18806/tesl.v30i1.1124</pub-id> </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Derwing</surname>
<given-names>T. M.</given-names>
</name>
<name>
<surname>Thomson</surname>
<given-names>R. I.</given-names>
</name>
<name>
<surname>Foote</surname>
<given-names>J. A.</given-names>
</name>
<name>
<surname>Munro</surname>
<given-names>M. J.</given-names>
</name>
</person-group> (<year>2012b</year>). <article-title>A longitudinal study of listening perception in adult learners of English: implications for teachers</article-title>. <source>Can. Mod. Lang. Rev.</source> <volume>68</volume> (<issue>3</issue>), <fpage>247</fpage>&#x2013;<lpage>266</lpage>. <pub-id pub-id-type="doi">10.3138/cmlr.1215</pub-id> </citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Derwing</surname>
<given-names>T. M.</given-names>
</name>
<name>
<surname>Munro</surname>
<given-names>M. J.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Putting accent in its place: Rethinking obstacles to communication</article-title>. <source>Lang. Teach.</source> <volume>42</volume> (<issue>4</issue>), <fpage>476</fpage>&#x2013;<lpage>490</lpage>. <pub-id pub-id-type="doi">10.1017/S026144480800551X</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Derwing</surname>
<given-names>T. M.</given-names>
</name>
<name>
<surname>Munro</surname>
<given-names>M. J.</given-names>
</name>
</person-group> (<year>2015</year>). <source>Pronunciation fundamentals: evidence-based perspectives for L2 teaching and research</source>. <publisher-loc>Amsterdam, Netherlands</publisher-loc>: <publisher-name>John Benjamins</publisher-name>. </citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Derwing</surname>
<given-names>T. M.</given-names>
</name>
<name>
<surname>Munro</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Foote</surname>
<given-names>J. A.</given-names>
</name>
<name>
<surname>Waugh</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Fleming</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Opening the window on comprehensible pronunciation after 19 years: a workplace training study</article-title>. <source>Lang. Learn.</source> <volume>64</volume> (<issue>3</issue>), <fpage>526</fpage>&#x2013;<lpage>548</lpage>. <pub-id pub-id-type="doi">10.1111/lang.12053</pub-id> </citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Derwing</surname>
<given-names>T. M.</given-names>
</name>
<name>
<surname>Munro</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Wiebe</surname>
<given-names>G. E.</given-names>
</name>
</person-group> (<year>1998</year>). <article-title>Evidence in favor of a broad framework for pronunciation instruction</article-title>. <source>Lang. Learn.</source> <volume>48</volume> (<issue>3</issue>), <fpage>393</fpage>&#x2013;<lpage>410</lpage>. <pub-id pub-id-type="doi">10.1111/0023-8333.00047</pub-id> </citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ding</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Liberatore</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Sonsaat</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Luci&#x10d;</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Silpachai</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>G.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Golden speaker builder: an interactive tool for pronunciation training</article-title>. <source>Speech Commun.</source> <volume>115</volume>, <fpage>51</fpage>&#x2013;<lpage>66</lpage>. <pub-id pub-id-type="doi">10.1016/j.specom.2019.10.005</pub-id> </citation>
</ref>
<ref id="B24">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Engwall</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Wik</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Beskow</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Granstr&#xf6;m</surname>
<given-names>B.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Design strategies for a virtual language tutor</article-title>. <conf-name>8th international conference on spoken</conf-name>, <conf-loc>Jiju Island, South Korea</conf-loc>, <conf-date>March 31, 2004</conf-date> (<publisher-loc>Seoul, South Korea</publisher-loc>: <publisher-name>ISCA</publisher-name>), <fpage>1</fpage>&#x2013;<lpage>4</lpage>. <comment>Available at: <ext-link ext-link-type="uri" xlink:href="http://www.speech.kth.se/ctt/publications/papers04/icslp2004_tutor.pdf">http://www.speech.kth.se/ctt/publications/papers04/icslp2004_tutor.pdf</ext-link>
</comment>. </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Foote</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>McDonough</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Using shadowing with mobile technology to improve L2 pronunciation</article-title>. <source>J. Sec. Lang. Pron.</source> <volume>3</volume> (<issue>1</issue>), <fpage>34</fpage>&#x2013;<lpage>56</lpage>. <pub-id pub-id-type="doi">10.1075/jslp.3.1.02foo</pub-id> </citation>
</ref>
<ref id="B26">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Foote</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Smith</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Is there an app for that?</article-title> <comment>PhD thesis</comment>. <publisher-loc>Montreal, QC, Canada</publisher-loc>: <publisher-name>Concordia University</publisher-name>. </citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Foote</surname>
<given-names>J. A.</given-names>
</name>
<name>
<surname>Trofimovich</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Collins</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Soler Urz&#xfa;a</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Pronunciation teaching practices in communicative second language classes</article-title>. <source>Lang. Learn. J.</source> <volume>44</volume> (<issue>2</issue>), <fpage>181</fpage>&#x2013;<lpage>196</lpage>. <pub-id pub-id-type="doi">10.1080/09571736.2013.784345</pub-id> </citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Hanna</surname>
<given-names>B. E.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Exploring optimal pronunciation teaching: Integrating instructional software into intermediate-level EFL classes in China</article-title>. <source>CALICO J.</source> <volume>33</volume> (<issue>2</issue>), <fpage>201</fpage>&#x2013;<lpage>230</lpage>. <pub-id pub-id-type="doi">10.1558/cj.v33i2.26054</pub-id> </citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Garcia</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Nickolai</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Jones</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Traditional versus ASR-based pronunciation instruction: an empirical study</article-title>. <source>CALICO J.</source> <volume>37</volume> (<issue>3</issue>), <fpage>213</fpage>&#x2013;<lpage>232</lpage>. <pub-id pub-id-type="doi">10.1558/cj.40379</pub-id> </citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Godfroid</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>C.-H.</given-names>
</name>
<name>
<surname>Ryu</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Hearing and seeing tone through color: an efficacy study of web&#x2013;based, multimodal Chinese tone perception training</article-title>. <source>Lang. Learn.</source> <volume>67</volume> (<issue>4</issue>), <fpage>819</fpage>&#x2013;<lpage>857</lpage>. <pub-id pub-id-type="doi">10.1111/lang.12246</pub-id> </citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hahn</surname>
<given-names>L. D.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Primary stress and intelligibility: research to motivate the teaching of suprasegmentals</article-title>. <source>Tesol Q.</source> <volume>38</volume> (<issue>2</issue>), <fpage>201</fpage>&#x2013;<lpage>232</lpage>. <pub-id pub-id-type="doi">10.2307/3588378</pub-id> </citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hanul&#xed;kov&#xe1;</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Dediu</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Fang</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Basnakov&#xe1;</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Huettig</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Individual differences in the acquisition of a complex L2 phonology: a training study</article-title>. <source>Lang. Learn.</source> <volume>62</volume> (<issue>2</issue>), <fpage>79</fpage>&#x2013;<lpage>109</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-9922.2012.00707.x</pub-id> </citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hardison</surname>
<given-names>D. M.</given-names>
</name>
</person-group> (<year>2003</year>). <article-title>Acquisition of second-language speech: effects of visual cues, context, and talker variability</article-title>. <source>Appl. Psycholinguist.</source> <volume>24</volume> (<issue>4</issue>), <fpage>495</fpage>&#x2013;<lpage>522</lpage>. <pub-id pub-id-type="doi">10.1017/S0142716403000250</pub-id> </citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hardison</surname>
<given-names>D. M.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Generalization of computer-assisted prosody training: quantitative and qualitative findings</article-title>. <source>Lang. Learn. Technol.</source> <volume>8</volume> (<issue>1</issue>), <fpage>34</fpage>&#x2013;<lpage>52</lpage>. <pub-id pub-id-type="doi">10.125/25228</pub-id> </citation>
</ref>
<ref id="B35">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Haslam</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>The effect of perceptual training including required lexical access and meaningful linguistic context on second language phonology</article-title>. <comment>PhD dissertation</comment>. <publisher-loc>Salt Lake City, Utah</publisher-loc>: <publisher-name>University of Utah</publisher-name>. </citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hazan</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Sennema</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Iba</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Faulkner</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Effect of audiovisual perceptual training on the perception and production of consonants by Japanese learners of English</article-title>. <source>Speech Commun.</source> <volume>47</volume> (<issue>3</issue>), <fpage>360</fpage>&#x2013;<lpage>378</lpage>. <pub-id pub-id-type="doi">10.1016/j.specom.2005.04.007</pub-id> </citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Huensch</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Tremblay</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Effects of perceptual phonetic training on the perception and production of second language syllable structure</article-title>. <source>J. Phonetics</source> <volume>52</volume>, <fpage>105</fpage>&#x2013;<lpage>120</lpage>. <pub-id pub-id-type="doi">10.1016/j.wocn.2015.06.007</pub-id> </citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Isaacs</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Integrating form and meaning in L2 pronunciation instruction</article-title>. <source>TESL Can. J.</source> <volume>27</volume> (<issue>1</issue>), <fpage>1</fpage>&#x2013;<lpage>12</lpage>. <pub-id pub-id-type="doi">10.18806/tesl.v27i1.1034</pub-id> </citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Isaacs</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Trofimovich</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Deconstructing comprehensibility: Identifying the linguistic influences on listeners&#x2019; L2 comprehensibility ratings</article-title>. <source>Stud. Sec. Lang. Acquis.</source> <volume>34</volume> (<issue>3</issue>), <fpage>475</fpage>&#x2013;<lpage>505</lpage>. <pub-id pub-id-type="doi">10.1017/S0272263112000150</pub-id> </citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Iverson</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Pinet</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Evans</surname>
<given-names>B. G.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Auditory training for experienced and inexperienced second language learners: native French speakers learning English vowels</article-title>. <source>Appl. Psycholinguist.</source> <volume>33</volume> (<issue>1</issue>), <fpage>145</fpage>&#x2013;<lpage>160</lpage>. <pub-id pub-id-type="doi">10.1017/S0142716411000300</pub-id> </citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jenkins</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Research in teaching pronunciation and intonation</article-title>. <source>Annu. Rev. Appl. Ling.</source> <volume>24</volume>, <fpage>109</fpage>&#x2013;<lpage>125</lpage>. <pub-id pub-id-type="doi">10.1017/S0267190504000054</pub-id> </citation>
</ref>
<ref id="B42">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Johnson</surname>
<given-names>D. O.</given-names>
</name>
<name>
<surname>Kang</surname>
<given-names>O.</given-names>
</name>
</person-group> (<year>2017</year>). &#x201c;<article-title>Measures of intelligibility in different varieties of English: human vs. machine</article-title>,&#x201d; in <conf-name>Proceedings of the 8th pronunciation in second language learning and teaching conference</conf-name> Editors <person-group person-group-type="editor">
<name>
<surname>O&#x2019;Brien</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Levis</surname>
<given-names>J.</given-names>
</name>
</person-group>, <conf-loc>Santa Barbara, CA</conf-loc>, <conf-date>September, 2017</conf-date>. (<publisher-loc>Ames, IA</publisher-loc>: <publisher-name>Iowa State University</publisher-name>), <fpage>58</fpage>&#x2013;<lpage>72</lpage>. <comment>Available at: <ext-link ext-link-type="uri" xlink:href="https://apling.engl.iastate.edu/alt-content/uploads/2017/05/PSLLT_2016_Proceedings_finalB.pdf">https://apling.engl.iastate.edu/alt-content/uploads/2017/05/PSLLT_2016_Proceedings_finalB.pdf</ext-link>
</comment>. </citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kennedy</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Trofimovich</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Language awareness and second language pronunciation: a classroom study</article-title>. <source>Lang. Aware.</source> <volume>19</volume> (<issue>3</issue>), <fpage>171</fpage>&#x2013;<lpage>185</lpage>. <pub-id pub-id-type="doi">10.1080/09658416.2010.48643</pub-id> </citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lee</surname>
<given-names>A. H.</given-names>
</name>
<name>
<surname>Lyster</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Can corrective feedback on second language speech perception errors affect production accuracy?</article-title> <source>Appl. Psycholinguist.</source> <volume>38</volume> (<issue>2</issue>), <fpage>371</fpage>&#x2013;<lpage>393</lpage>. <pub-id pub-id-type="doi">10.1017/S0142716416000254</pub-id> </citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lee</surname>
<given-names>H. Y.</given-names>
</name>
<name>
<surname>Hwang</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Gradient of learnability in teaching English pronunciation to Korean learners</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>139</volume>, <fpage>1859</fpage>&#x2013;<lpage>1872</lpage>. <pub-id pub-id-type="doi">10.1121/1.4945716</pub-id> </citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lee</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Jang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Plonsky</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>The effectiveness of second language pronunciation instruction: a meta-analysis</article-title>. <source>Appl. Ling.</source> <volume>36</volume> (<issue>3</issue>), <fpage>1</fpage>&#x2013;<lpage>23</lpage>. <pub-id pub-id-type="doi">10.1093/applin/amu040</pub-id> </citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lengeris</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Computer-based auditory training improves second-language vowel production in spontaneous speech</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>144</volume> (<issue>3</issue>), <fpage>EL165</fpage>&#x2013;<lpage>EL171</lpage>. <pub-id pub-id-type="doi">10.1121/1.5052201</pub-id> </citation>
</ref>
<ref id="B48">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Levis</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Intonation in theory and practice, revisited</article-title>. <source>Tesol Q.</source> <volume>33</volume> (<issue>1</issue>), <fpage>37</fpage>&#x2013;<lpage>63</lpage>. <pub-id pub-id-type="doi">10.2307/3588190</pub-id> </citation>
</ref>
<ref id="B49">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Levis</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Computer technology in teaching and researching</article-title>. <source>Annu. Rev. Appl. Ling.</source> <volume>27</volume>, <fpage>184</fpage>&#x2013;<lpage>202</lpage>. <pub-id pub-id-type="doi">10.1017/S0267190508070098</pub-id> </citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Levis</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Revisiting the intelligibility and nativeness principles</article-title>. <source>J. Sec. Lang. Pronunciation</source> <volume>6</volume> (<issue>3</issue>), <fpage>310</fpage>&#x2013;<lpage>328</lpage>. <pub-id pub-id-type="doi">10.1075/jslp.20050.lev</pub-id> </citation>
</ref>
<ref id="B51">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Levis</surname>
<given-names>J. M.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Changing contexts and shifting paradigms in pronunciation teaching</article-title>. <source>Tesol Q.</source> <volume>39</volume> (<issue>3</issue>), <fpage>369</fpage>&#x2013;<lpage>377</lpage>. <pub-id pub-id-type="doi">10.2307/3588485</pub-id> </citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Levis</surname>
<given-names>J. M.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Research into practice: how research appears in pronunciation teaching materials</article-title>. <source>Lang. Teach.</source> <volume>49</volume> (<issue>3</issue>), <fpage>423</fpage>&#x2013;<lpage>437</lpage>. <pub-id pub-id-type="doi">10.1017/S0261444816000045</pub-id> </citation>
</ref>
<ref id="B53">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Levis</surname>
<given-names>J. M.</given-names>
</name>
<name>
<surname>Sonsaat</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Link</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Barriuso</surname>
<given-names>T. A.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Native and nonnative teachers of L2 pronunciation: effects on learner performance</article-title>. <source>Tesol Q.</source> <volume>50</volume> (<issue>4</issue>), <fpage>894</fpage>&#x2013;<lpage>951</lpage>. <pub-id pub-id-type="doi">10.1002/tesq.272</pub-id> </citation>
</ref>
<ref id="B54">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Levis</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Pickering</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Teaching intonation in discourse using speech visualization technology</article-title>. <source>System</source> <volume>32</volume>, <fpage>505</fpage>&#x2013;<lpage>524</lpage>. <pub-id pub-id-type="doi">10.1016/j.system.2004.09.009</pub-id> </citation>
</ref>
<ref id="B55">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liakin</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Cardoso</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Liakina</surname>
<given-names>N.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>The pedagogical use of mobile speech synthesis (TTS): focus on French liaison</article-title>. <source>Comput. Assist. Lang. Learn.</source> <volume>30</volume> (<issue>3&#x2013;4</issue>), <fpage>348</fpage>&#x2013;<lpage>365</lpage>. <pub-id pub-id-type="doi">10.1080/09588221.2017.1312463</pub-id> </citation>
</ref>
<ref id="B56">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Lima</surname>
<given-names>E. F.</given-names>
</name>
</person-group> (<year>2015</year>). &#x201c;<article-title>Feel the rhythm! Fun and effective pronunciation practice using Audacity and sitcom scenes (teaching tip)</article-title>,&#x201d; in <conf-name>Proceedings of the 6th pronunciation in second language learning and teaching conference</conf-name> Editors <person-group person-group-type="editor">
<name>
<surname>Levis</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Mohammed</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Qian</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>Z.</given-names>
</name>
</person-group>, <conf-loc>Santa Barbara, CA</conf-loc>, <conf-date>September 5&#x2013;6, 2014</conf-date> (<publisher-loc>Ames, IA</publisher-loc>: <publisher-name>Iowa State University</publisher-name>), <fpage>277</fpage>&#x2013;<lpage>284</lpage>. <comment>Available at: <ext-link ext-link-type="uri" xlink:href="https://apling.engl.iastate.edu/alt-content/uploads/2015/05/PSLLT_6th_Proceedings_2014.pdf">https://apling.engl.iastate.edu/alt-content/uploads/2015/05/PSLLT_6th_Proceedings_2014.pdf</ext-link>
</comment>. </citation>
</ref>
<ref id="B57">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Martin</surname>
<given-names>I. A.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Bridging the gap between L2 pronunciation research and teaching: using iCPRs to improve German learners&#x2019; pronunciation in distance and face-to-face classrooms</article-title>. <comment>PhD dissertation</comment>. <publisher-loc>State College, PA</publisher-loc>: <publisher-name>The Pennsylvania State University</publisher-name>. </citation>
</ref>
<ref id="B58">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>McCrocklin</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Edalatishams</surname>
<given-names>I.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Revisiting popular speech recognition software for ESL speech</article-title>. <source>Tesol Q.</source> <volume>54</volume> (<issue>4</issue>), <fpage>1086</fpage>&#x2013;<lpage>1097</lpage>. <pub-id pub-id-type="doi">10.1002/tesq.3006</pub-id> </citation>
</ref>
<ref id="B59">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Mixdorff</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Munro</surname>
<given-names>M. J.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Quantifying and evaluating the impact of prosodic differences of foreign&#x2013;accented English</article-title>. <conf-name>Proceedings of the workshop on speech and language technology in education (SLaTE)</conf-name>, <conf-loc>Gernoble, France</conf-loc>, <conf-date>August 30&#x2013;September 1, 2013</conf-date> (<publisher-loc>Valencia, Spain</publisher-loc>: <publisher-name>ISCA</publisher-name>), <fpage>147</fpage>&#x2013;<lpage>152</lpage>. </citation>
</ref>
<ref id="B60">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Motohashi-Saigo</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Hardison</surname>
<given-names>D. M.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Acquisition of L2 Japanese geminates: training with waveform displays</article-title>. <source>Lang. Learn. Technol.</source> <volume>13</volume> (<issue>2</issue>), <fpage>29</fpage>&#x2013;<lpage>47</lpage>. <pub-id pub-id-type="doi">10.125/44179</pub-id>
</citation>
</ref>
<ref id="B61">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Munro</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Derwing</surname>
<given-names>T. M.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>The functional load principle in ESL pronunciation instruction: an exploratory study</article-title>. <source>System</source> <volume>34</volume> (<issue>4</issue>), <fpage>520</fpage>&#x2013;<lpage>531</lpage>. <pub-id pub-id-type="doi">10.1016/j.system.2006.09.004</pub-id> </citation>
</ref>
<ref id="B62">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nagle</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Motivation, comprehensibility, and accentedness in L2 Spanish: investigating motivation as a time&#x2010;varying predictor of pronunciation development</article-title>. <source>Mod. Lang. J.</source> <volume>102</volume> (<issue>1</issue>), <fpage>199</fpage>&#x2013;<lpage>217</lpage>. <pub-id pub-id-type="doi">10.1111/modl.12461</pub-id> </citation>
</ref>
<ref id="B63">
<citation citation-type="web">
<person-group person-group-type="author">
<name>
<surname>Nishi</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Kewley-Port</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Second language vowel production training: effects of set size, training order and native language</article-title>. <comment>Available at <ext-link ext-link-type="uri" xlink:href="http://www.icphs2007.de/conference/Papers/1018/1018.pdf">http://www.icphs2007.de/conference/Papers/1018/1018.pdf</ext-link>
</comment>. </citation>
</ref>
<ref id="B64">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Offerman</surname>
<given-names>H. M.</given-names>
</name>
<name>
<surname>Olson</surname>
<given-names>D. J.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Visual feedback and second language segmental production: the generalizability of pronunciation gains</article-title>. <source>System</source> <volume>59</volume>, <fpage>45</fpage>&#x2013;<lpage>60</lpage>. <pub-id pub-id-type="doi">10.1016/j.system.2016.03.003</pub-id> </citation>
</ref>
<ref id="B65">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Okuno</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Hardison</surname>
<given-names>D. M.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Perception&#x2013;production link in L2 Japanese vowel duration: training with technology</article-title>. <source>Lang. Learn. Technol.</source> <volume>20</volume> (<issue>2</issue>), <fpage>61</fpage>&#x2013;<lpage>80</lpage>. <pub-id pub-id-type="doi">10.125/44461</pub-id>
</citation>
</ref>
<ref id="B66">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Olson</surname>
<given-names>D. J.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Feature acquisition in second language phonetic development: evidence from phonetic training</article-title>. <source>Lang. Learn.</source> <volume>69</volume> (<issue>2</issue>), <fpage>366</fpage>&#x2013;<lpage>404</lpage>. <pub-id pub-id-type="doi">10.1111/lang.12336</pub-id> </citation>
</ref>
<ref id="B67">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Olson</surname>
<given-names>D. J.</given-names>
</name>
<name>
<surname>Offerman</surname>
<given-names>H. M.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Maximizing the effect of visual feedback for pronunciation instruction: a comparative analysis of three approaches</article-title>. <source>J. Sec. Lang. Pronunciation</source>. <pub-id pub-id-type="doi">10.1075/jslp.20005.ols</pub-id> </citation>
</ref>
<ref id="B68">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>O&#x2019;Brien</surname>
<given-names>M. G.</given-names>
</name>
</person-group> (<year>2019</year>). &#x201c;<article-title>Targeting pronunciation (and perception) with technology</article-title>,&#x201d; in <source>Engaging language learners through CALL</source>. Editors <person-group person-group-type="editor">
<name>
<surname>Arnold</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Ducate</surname>
<given-names>L.</given-names>
</name>
</person-group> (<publisher-loc>Sheffield, United Kingdom</publisher-loc>: <publisher-name>Equinox</publisher-name>), <fpage>309</fpage>&#x2013;<lpage>352</lpage>. </citation>
</ref>
<ref id="B69">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>O&#x2019;Brien</surname>
<given-names>M. G.</given-names>
</name>
<name>
<surname>Derwing</surname>
<given-names>T. M.</given-names>
</name>
<name>
<surname>Cucchiarini</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Hardison</surname>
<given-names>D. M.</given-names>
</name>
<name>
<surname>Mixdorff</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Thomson</surname>
<given-names>R.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Directions for the future of technology in pronunciation research and teaching</article-title>. <source>J. Sec. Lang. Pronunciation</source> <volume>4</volume> (<issue>2</issue>), <fpage>182</fpage>&#x2013;<lpage>207</lpage>. <pub-id pub-id-type="doi">10.1075/jslp.17001.obr</pub-id> </citation>
</ref>
<ref id="B70">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pennington</surname>
<given-names>M. C.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Computer-aided pronunciation pedagogy: promise, limitations, directions</article-title>. <source>Comput. Assist. Lang. Learn.</source> <volume>12</volume> (<issue>5</issue>), <fpage>427</fpage>&#x2013;<lpage>440</lpage>. <pub-id pub-id-type="doi">10.1076/call.12.5.427.5693</pub-id> </citation>
</ref>
<ref id="B71">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Saito</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Reexamining effects of form-focused instruction on L2 pronunciation development</article-title>. <source>Stud. Sec. Lang. Acquis.</source> <volume>35</volume> (<issue>1</issue>), <fpage>1</fpage>&#x2013;<lpage>29</lpage>. <pub-id pub-id-type="doi">10.1017/S0272263112000666</pub-id> </citation>
</ref>
<ref id="B72">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Saito</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Lyster</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Effects of form&#x2010;focused instruction and corrective feedback on L2 pronunciation development of/&#x279;/by Japanese learners of English</article-title>. <source>Lang. Learn.</source> <volume>62</volume> (<issue>2</issue>), <fpage>595</fpage>&#x2013;<lpage>633</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-9922.2011.00639.x</pub-id> </citation>
</ref>
<ref id="B73">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sakai</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Moorman</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Can perception training improve the production of second-language phonemes? A meta-analytic review of 25 years of perception training research</article-title>. <source>Appl. Psycholinguist.</source> <volume>39</volume> (<issue>1</issue>), <fpage>187</fpage>&#x2013;<lpage>224</lpage>. <pub-id pub-id-type="doi">10.1017/S0142716417000418</pub-id> </citation>
</ref>
<ref id="B74">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Scales</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wennerstrom</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Richard</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>S. H.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Language learners&#x2019; perceptions of accent</article-title>. <source>Tesol Q.</source> <volume>40</volume>, <fpage>715</fpage>&#x2013;<lpage>738</lpage>. <pub-id pub-id-type="doi">10.2307/40264305</pub-id> </citation>
</ref>
<ref id="B75">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sifakis</surname>
<given-names>N. C.</given-names>
</name>
<name>
<surname>Sougari</surname>
<given-names>A.-M.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Pronunciation issues and EIL pedagogy in the periphery: a survey of Greek state school teachers&#x2019; beliefs</article-title>. <source>Tesol Q.</source>, <volume>39</volume>, <fpage>467</fpage>&#x2013;<lpage>488</lpage>. <pub-id pub-id-type="doi">10.2307/3588490</pub-id> </citation>
</ref>
<ref id="B76">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Suemitsu</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Dang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Ito</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Tiede</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>A real-time articulatory visual feedback approach with target presentation for second language pronunciation learning</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>138</volume> (<issue>4</issue>), <fpage>EL382</fpage>&#x2013;<lpage>EL387</lpage>. <pub-id pub-id-type="doi">10.1121/1.4931827</pub-id> </citation>
</ref>
<ref id="B77">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tejedor-Garc&#xed;a</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Escudero-Mancebo</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Carde&#xf1;oso-Payo</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Gonz&#xe1;lez-Ferreras</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Using challenges to enhance a learning game for pronunciation training of English as a second language</article-title>. <source>IEEE Access</source> <volume>8</volume>, <fpage>74250</fpage>&#x2013;<lpage>74266</lpage>. <pub-id pub-id-type="doi">10.1109/ACCESS.2020.2988406</pub-id> </citation>
</ref>
<ref id="B78">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Thomson</surname>
<given-names>R. I.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Computer-assisted pronunciation training: targeting second language vowel perception improves pronunciation</article-title>. <source>CALICO J.</source> <volume>28</volume> (<issue>3</issue>), <fpage>744</fpage>&#x2013;<lpage>765</lpage>. <pub-id pub-id-type="doi">10.11139/cj.28.3.744-765</pub-id> </citation>
</ref>
<ref id="B79">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Thomson</surname>
<given-names>R. I.</given-names>
</name>
<name>
<surname>Derwing</surname>
<given-names>T. M.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>The effectiveness of L2 pronunciation instruction: a narrative review</article-title>. <source>Appl. Ling.</source> <volume>36</volume> (<issue>3</issue>), <fpage>326</fpage>&#x2013;<lpage>344</lpage>. <pub-id pub-id-type="doi">10.1093/applin/amu076</pub-id> </citation>
</ref>
<ref id="B80">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Thomson</surname>
<given-names>R. I.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>High variability [pronunciation] training. (HVPT). A proven technique about which every language teacher and learner ought to know</article-title>. <source>Journal of Second Language Pronunciation</source> <volume>4</volume> (<issue>2</issue>), <fpage>208</fpage>&#x2013;<lpage>231</lpage>. <pub-id pub-id-type="doi">10.1075/jslp.17038.tho</pub-id> </citation>
</ref>
<ref id="B81">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Timmis</surname>
<given-names>I.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>Native-speaker norms and international English: a classroom view</article-title>. <source>ELT J.</source> <volume>56</volume> (<issue>3</issue>), <fpage>240</fpage>&#x2013;<lpage>249</lpage>. <pub-id pub-id-type="doi">10.1093/elt/56.3.240</pub-id> </citation>
</ref>
<ref id="B82">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Trofimovich</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Iaacs</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Kennedy</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Saito</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Crowther</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Flawed self-assessment: investigating self-and other-perception of second language speech</article-title>. <source>Bilingualism</source> <volume>19</volume> (<issue>1</issue>), <fpage>122</fpage>&#x2013;<lpage>140</lpage>. <pub-id pub-id-type="doi">10.17/S1366728914000832</pub-id> </citation>
</ref>
<ref id="B83">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2002</year>). <source>Training Mandarin and Cantonese speakers to identify English vowel contrasts: long-term retention and effects on production</source>
<italic>.</italic> <publisher-loc>Burnaby, Canada</publisher-loc>: <publisher-name>Simon Fraser University</publisher-name>. </citation>
</ref>
<ref id="B84">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zaj&#x105;c</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Rojczyk</surname>
</name>
</person-group> (<year>2014</year>). <article-title>Imitation of English vowel duration upon exposure to native and non-native speech</article-title>. <source>Pozna&#x144; Stud. Contemp. Linguis.</source> <volume>50</volume> (<issue>4</issue>), <fpage>495</fpage>&#x2013;<lpage>514</lpage>. <pub-id pub-id-type="doi">10.1515/psicl-2014&#x2013;0025</pub-id> </citation>
</ref>
<ref id="B85">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zielinski</surname>
<given-names>B.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>The listener: No longer the silent partner in reduced intelligibility</article-title>. <source>System</source> <volume>36</volume> (<issue>1</issue>), <fpage>69</fpage>&#x2013;<lpage>84</lpage>. <pub-id pub-id-type="doi">10.1016/j.system.2007.11.004</pub-id> </citation>
</ref>
</ref-list>
<fn-group>
<fn id="FN1">
<label>1</label>
<p>Intelligibility and comprehensibility are terms that are used in research to describe methods for testing listeners&#x2019; understanding of speech. <xref ref-type="bibr" rid="B51">Levis&#x2019;s (2005</xref>, <xref ref-type="bibr" rid="B50">2020)</xref> intelligibility principle incorporates both intelligibility and comprehensibility.</p>
</fn>
<fn id="FN2">
<label>2</label>
<p>An anonymous reviewer brought up the important point that it is possible to teach something well and for learners not to learn it. As such, the issue that we are most concerned with is that of learnability.</p>
</fn>
<fn id="FN3">
<label>3</label>
<p>Readers are reminded that L2 learners of English may not require training in the perception of sentential stress assignment as demonstrated by <xref ref-type="bibr" rid="B18">Derwing et al. (2012b)</xref>.</p>
</fn>
<fn id="FN4">
<label>4</label>
<p>Making use of spectrograms to interpret formant frequencies requires specialized knowledge, and this may be difficult for some teachers and learners (<xref ref-type="bibr" rid="B69">O&#x2019;Brien et al., 2018</xref>).</p>
</fn>
<fn id="FN5">
<label>5</label>
<p>
<xref ref-type="bibr" rid="B29">Garcia et al. (2020)</xref> demonstrated that the effectiveness of ASR training for the development of some L2 segments.</p>
</fn>
<fn id="FN6">
<label>6</label>
<p>Note, however, that <xref ref-type="bibr" rid="B63">Nishi and Kewley-Port (2007)</xref> report detrimental effects for training only a subset of vowels or consonants and advocate instead for training the entire set of vowels.</p>
</fn>
</fn-group>
</back>
</article>