<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Psychol.</journal-id>
<journal-title>Frontiers in Psychology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Psychol.</abbrev-journal-title>
<issn pub-type="epub">1664-1078</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpsyg.2023.1232262</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Psychology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Phonological discrimination and contrast detection in pupillometry</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Chiossi</surname>
<given-names>Julia S. C.</given-names>
</name>
<xref rid="aff1" ref-type="aff"><sup>1</sup></xref>
<xref rid="aff2" ref-type="aff"><sup>2</sup></xref>
<xref rid="c001" ref-type="corresp"><sup>&#x002A;</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/2062164/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Patou</surname>
<given-names>Fran&#x00E7;ois</given-names>
</name>
<xref rid="aff3" ref-type="aff"><sup>3</sup></xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Ng</surname>
<given-names>Elaine Hoi Ning</given-names>
</name>
<xref rid="aff1" ref-type="aff"><sup>1</sup></xref>
<xref rid="aff4" ref-type="aff"><sup>4</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/835953/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Faulkner</surname>
<given-names>Kathleen F.</given-names>
</name>
<xref rid="aff1" ref-type="aff"><sup>1</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/2541635/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Lyxell</surname>
<given-names>Bj&#x00F6;rn</given-names>
</name>
<xref rid="aff2" ref-type="aff"><sup>2</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/84158/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Oticon A/S</institution>, <addr-line>Sm&#x00F8;rum</addr-line>, <country>Denmark</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Special Needs Education, University of Oslo</institution>, <addr-line>Oslo</addr-line>, <country>Norway</country></aff>
<aff id="aff3"><sup>3</sup><institution>Oticon Medical</institution>, <addr-line>Sm&#x00F8;rum</addr-line>, <country>Denmark</country></aff>
<aff id="aff4"><sup>4</sup><institution>Department of Behavioural Sciences and Learning, Linnaeus Centre HEAD, Swedish Institute for Disability Research, Link&#x00F6;ping University</institution>, <addr-line>Link&#x00F6;ping</addr-line>, <country>Sweden</country></aff>
<author-notes>
<fn fn-type="edited-by" id="fn0001">
<p>Edited by: Bruno L. Giordano, UMR7289 Institut de Neurosciences de la Timone (INT), France</p>
</fn>
<fn fn-type="edited-by" id="fn0002">
<p>Reviewed by: Isabella Poggi, Roma Tre University, Italy; Riki Taitelbaum-Swead, Ariel University, Israel</p>
</fn>
<corresp id="c001">&#x002A;Correspondence: Julia S. C. Chiossi, <email>jschioss@uio.no</email></corresp>
</author-notes>
<pub-date pub-type="epub">
<day>01</day>
<month>11</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>14</volume>
<elocation-id>1232262</elocation-id>
<history>
<date date-type="received">
<day>31</day>
<month>05</month>
<year>2023</year>
</date>
<date date-type="accepted">
<day>12</day>
<month>10</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2023 Chiossi, Patou, Ng, Faulkner and Lyxell.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Chiossi, Patou, Ng, Faulkner and Lyxell</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<sec id="sec1">
<title>Introduction</title>
<p>The perception of phonemes is guided by both low-level acoustic cues and high-level linguistic context. However, differentiating between these two types of processing can be challenging. In this study, we explore the utility of pupillometry as a tool to investigate both low- and high-level processing of phonological stimuli, with a particular focus on its ability to capture novelty detection and cognitive processing during speech perception.</p>
</sec>
<sec id="sec2">
<title>Methods</title>
<p>Pupillometric traces were recorded from a sample of 22 Danish-speaking adults, with self-reported normal hearing, while performing two phonological-contrast perception tasks: a nonword discrimination task, which included minimal-pair combinations specific to the Danish language, and a nonword detection task involving the detection of phonologically modified words within sentences. The study explored the perception of contrasts in both unprocessed speech and degraded speech input, processed with a vocoder.</p>
</sec>
<sec id="sec3">
<title>Results</title>
<p>No difference in peak pupil dilation was observed when the contrast occurred between two isolated nonwords in the nonword discrimination task. For unprocessed speech, higher peak pupil dilations were measured when phonologically modified words were detected within a sentence compared to sentences without the nonwords. For vocoded speech, higher peak pupil dilation was observed for sentence stimuli, but not for the isolated nonwords, although performance decreased similarly for both tasks.</p>
</sec>
<sec id="sec4">
<title>Conclusion</title>
<p>Our findings demonstrate the complexity of pupil dynamics in the presence of acoustic and phonological manipulation. Pupil responses seemed to reflect higher-level cognitive and lexical processing related to phonological perception rather than low-level perception of acoustic cues. However, the incorporation of multiple talkers in the stimuli, coupled with the relatively low task complexity, may have affected the pupil dilation.</p>
</sec>
</abstract>
<kwd-group>
<kwd>pupillometry</kwd>
<kwd>speech perception</kwd>
<kwd>phoneme perception</kwd>
<kwd>acoustic cues</kwd>
<kwd>novelty detection</kwd>
<kwd>linguistic context</kwd>
</kwd-group>
<counts>
<fig-count count="5"/>
<table-count count="2"/>
<equation-count count="0"/>
<ref-count count="78"/>
<page-count count="12"/>
<word-count count="10383"/>
</counts>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Auditory Cognitive Neuroscience</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="sec5">
<label>1.</label>
<title>Introduction</title>
<p>The perception of contrast between phonemes is a fundamental aspect of speech perception and the basis for language acquisition (<xref ref-type="bibr" rid="ref35">Kuhl et al., 2008</xref>; <xref ref-type="bibr" rid="ref10">Casserly and Pisoni, 2010</xref>). By gradually extracting patterns from speech, infants learn to divide acoustic input into phonetic categories and sequences of phoneme combinations into words (<xref ref-type="bibr" rid="ref35">Kuhl et al., 2008</xref>; <xref ref-type="bibr" rid="ref53">Romberg and Saffran, 2010</xref>). As vocabulary grows, consistent perception of acoustic/phonetic patterns in the speech input will activate lexical processing for word recognition or pathways for learning a novel word (<xref ref-type="bibr" rid="ref50">Pittman et al., 2017</xref>). However, the perception of phonological contrasts is not uniquely driven by its acoustic properties. Phonological perception evolves to accommodate predictions from the linguistic context and the inherent phonetic variability in speech (<xref ref-type="bibr" rid="ref52">Repp, 1982</xref>; <xref ref-type="bibr" rid="ref2">Allen et al., 2003</xref>; <xref ref-type="bibr" rid="ref12">Clarke and Garrett, 2004</xref>; <xref ref-type="bibr" rid="ref26">Jesse, 2021</xref>). In terms of cognitive processing, low-level acoustic perception and high-level lexical processing are integrated to determine the presence or significance of a specific contrast (<xref ref-type="bibr" rid="ref8">Borsky et al., 1998</xref>; <xref ref-type="bibr" rid="ref13">Coleman, 2003</xref>).</p>
<p>Low-level processing involves interpreting the acoustic properties of speech sounds. Acoustic cues refer to distinct auditory features that convey information. Phoneme contrasts are marked by a variety of cues in spectral and temporal acoustic dimensions. These acoustic cues are redundant, such that several distinct cues occur for a particular contrast and can be traded with each other for the perception of a particular phoneme (<xref ref-type="bibr" rid="ref39">Liberman et al., 1967</xref>; <xref ref-type="bibr" rid="ref52">Repp, 1982</xref>; <xref ref-type="bibr" rid="ref68">Winn et al., 2012</xref>). As an example, place of articulation for stop consonants can be cued by F2 and F3 transitions, burst frequency, or burst amplitude (<xref ref-type="bibr" rid="ref52">Repp, 1982</xref>, for multiple examples). These cues covary in natural speech, and listeners must integrate them to achieve the most reliable identification of the incoming speech stimuli. Although this process may remain robust while a cue is missing or degraded, it would be expected that the demand for cognitive processing would increase with higher ambiguity of the stimulus and degradation of differential cues.</p>
<p>High-level phonological processing is guided by top-down knowledge of linguistic rules and context (<xref ref-type="bibr" rid="ref10">Casserly and Pisoni, 2010</xref>; <xref ref-type="bibr" rid="ref26">Jesse, 2021</xref>). The presence of high-level processing on phoneme discrimination leads to a perceptual bias in which listeners disambiguate underspecified phonemes toward meaningful compositions (<xref ref-type="bibr" rid="ref19">Ganong, 1980</xref>; <xref ref-type="bibr" rid="ref26">Jesse, 2021</xref>). For example, individuals may identify ambiguous speech sounds as a real word rather than a nonsense word if context is present.</p>
<p>For this high-level component in phonological perception, it is challenging to separate the specific contribution of each type of processing on task accuracy, as both high-level and low-level processing may be involved in successful performance. However, an ambiguous stimulus, which theoretically requires higher levels of processing, would likely demand the allocation of more cognitive resources, thus increasing listening effort (<xref ref-type="bibr" rid="ref33">Kramer et al., 2013</xref>; <xref ref-type="bibr" rid="ref27">Johnsrude and Rodd, 2016</xref>). Therefore, having an objective method that is sensitive to the individual demands for both low and high-level phonological processing could help to explain the variability in speech perception performance attributed to hearing impairment and challenging auditory environments (<xref ref-type="bibr" rid="ref49">Phatak and Grant, 2014</xref>; <xref ref-type="bibr" rid="ref20">Gianakas and Winn, 2019</xref>).</p>
<p>Assessing cognitive processes objectively during phonological contrast perception involves measuring responses to speech stimuli occurring at a cortical level. Physiological markers like late-latency auditory event-related potentials and mismatch negativity (MMN) have provided evidence of a pre-attentive component of phonological discrimination which could capture changes in the phonological pattern before conscious perception (<xref ref-type="bibr" rid="ref45">N&#x00E4;&#x00E4;t&#x00E4;nen et al., 2007</xref>; <xref ref-type="bibr" rid="ref57">Steinhauer and Connolly, 2008</xref>). However, cortical measures can be time-consuming and uncomfortable for participants. As an alternative, pupillometry could be used as a tool to investigate the temporal dynamics between low- and high-level processing of phonological stimuli. The pupil dilation is linked to increasing the norepinephrine release from the locus coeruleus, an area associated with attentional prioritization and perception of novelties (<xref ref-type="bibr" rid="ref16">Eckstein et al., 2017</xref>; <xref ref-type="bibr" rid="ref29">Kafkas and Montaldi, 2018</xref>). Task-evoked pupil dilation is measured by videorecording the pupil size during a task. It has the advantages of being cost-effective and providing good temporal resolution (<xref ref-type="bibr" rid="ref72">Winn et al., 2018</xref>). In terms of cognitive processing, at a low level, pupil dilation is sensitive to the perception of novelty that arises from a mismatch between stimulus and context, as changes in the stimulus frequency, intensity or pitch (<xref ref-type="bibr" rid="ref62">Virtala et al., 2018</xref>; <xref ref-type="bibr" rid="ref3">Bala et al., 2020</xref>). At a high level, pupil dilation has been shown to increase with linguistic processing demand, working memory load, and the effort required to resolve ambiguity in speech (<xref ref-type="bibr" rid="ref64">Wendt et al., 2016</xref>; <xref ref-type="bibr" rid="ref75">Zekveld et al., 2018</xref>; <xref ref-type="bibr" rid="ref28">Kadem et al., 2020</xref>; <xref ref-type="bibr" rid="ref44">Micula et al., 2022</xref>).</p>
<p>Previous research on the use of pupillometry to assess the discrimination of speech sounds, particularly phonemes, is limited and mainly focused on perception within word or sentence context (<xref ref-type="bibr" rid="ref63">Wagner et al., 2016</xref>; <xref ref-type="bibr" rid="ref31">Kinzuka et al., 2020</xref>; <xref ref-type="bibr" rid="ref70">Winn and Teece, 2021</xref>). <xref ref-type="bibr" rid="ref31">Kinzuka et al. (2020)</xref> used an oddball paradigm to evaluate the correlation between the perceptual ability of Japanese speakers to discriminate the /r/ and /l/ sounds in real words and their pupillometric responses. The study found that higher language proficiency is associated with earlier occurring differences in peak of pupil dilation (PPD) between target frequent and infrequent stimuli. In another study addressing the effects of phonological manipulation on pupil dynamics, <xref ref-type="bibr" rid="ref70">Winn and Teece (2021</xref>, <xref ref-type="bibr" rid="ref71">2022)</xref> measured pupil dilation during sentence recognition using embedded phonologically altered words in both normal hearing and cochlear implant subjects. The authors reported a steeper increase in pupil size in response to phonological alterations, with larger differences when the target phoneme was substituted by noise instead of another phoneme. For cochlear implant users, the contrast in pupil responses between sentences with and without phonological substitutions was shallower than for normal hearing peers, suggesting a relationship between degraded speech perception and the pupil response. This effect of speech degradation was also observed by <xref ref-type="bibr" rid="ref63">Wagner et al. (2016)</xref>, who reported steeper pupil dilation curve slopes for unexpected word prosody in full-frequency-spectrum speech but not for cochlear implant simulated speech.</p>
<p>It is important to note that in the paradigms described above, participants&#x2019; attention was directed toward processing the entire sentence or word, actively engaging high-level processing to interpret its meaning. However, the presence of context makes it difficult to distinguish the pupil variation caused by the perception of a phonological contrast, from that caused by the perception of lexical meaning variation which is known to cause pupil dilation independently (<xref ref-type="bibr" rid="ref30">Kamp and Donchin, 2015</xref>). Additionally, sentence comprehension in adverse listening conditions is known to produce higher pupil dilation (<xref ref-type="bibr" rid="ref64">Wendt et al., 2016</xref>; <xref ref-type="bibr" rid="ref47">Ohlenforst et al., 2018</xref>; <xref ref-type="bibr" rid="ref60">Trau-Margalit et al., 2023</xref>). Therefore, directing attention to the whole sentence might obscure the responses related to phonological contrast detection, since pupillometry seems to reflect responses to attended stimuli rather than passive listening (<xref ref-type="bibr" rid="ref33">Kramer et al., 2013</xref>) and is larger for perceived rather than unperceived errors (<xref ref-type="bibr" rid="ref30">Kamp and Donchin, 2015</xref>). On the contrary, it is possible that by directing attention to the presence of phonologically altered stimuli, responses may better reflect the low-level processing independently of the presence of context.</p>
<p>This study aimed to investigate the pupil temporal dynamics during the auditory processing of phonological contrasts, in an effort to differentiate low- and high-level processing of phonological information. For that, two paradigms were contrasted. First, we investigated the pupillometric response to low-level processing of phonological contrasts, measured during the perception of lexically decontextualized phoneme contrasts (phonological discrimination task). Second, we explored the possibility to record similar responses in the presence of lexical information, without prompting high-level sentence processing (detection task). Additionally, to introduce an acoustic challenge to the perception of the phonological contrast, we investigate how those responses are impacted a sub-optimal speech input, using a vocoded speech signal.</p>
<p>To minimize the influence of lexical knowledge on phonological perception, we chose to explore phonological contrasts using nonwords as our tokens. This approach aimed to preserve the real-world relevance of word-like items, while removing their lexical meaning. Traditional methods for assessing phonological identification and recognition often employ syllabic continua (<xref ref-type="bibr" rid="ref24">Iverson, 2003</xref>; <xref ref-type="bibr" rid="ref1">Abada et al., 2008</xref>; <xref ref-type="bibr" rid="ref37">Lewis and Bidelman, 2020</xref>). However, syllables may not demand the same level of cognitive processing for their phonological contrasts as longer stimuli. Therefore, we chose to use nonwords, as they would enhance the ecological validity of the pupillometry measures.</p>
<p>We hypothesized that if pupil dynamics were sensitive to the low-level acoustic properties of a phonological contrast, larger pupil dilations would be measured in conditions where the phonological contrast is present. These differences would be maintained independently of the presence of context and under a vocoded speech signal, whenever the phonological contrast was correctly perceived. However, if the pupil dynamics reflect the high-level linguistic processing required to disambiguate a phonological contrast, larger pupil dilations would be measured for a phonological contrast only in the presence of lexical information. We expect that the results provided here enhance the understanding in pupillary responses to phonological processing, shedding light on their sensitivity to different processing levels and adverse speech conditions.</p>
</sec>
<sec sec-type="materials|methods" id="sec6">
<label>2.</label>
<title>Materials and methods</title>
<sec id="sec7">
<label>2.1.</label>
<title>Participants</title>
<p>A convenient sample was recruited across the researcher&#x2019;s place of employment, in the Capital Region of Denmark. A sample of 22 adults (age: [25; 0&#x2013;65;0], median: 50&#x2009;years; females: 39%; all with more than 11&#x2009;years of education) who reported Danish as their first language, and self-reported normal hearing, were included. Participation was voluntary, and the researchers were contacted directly by the participants after an online announcement in an internal website. Participants joined during working hours and were compensated with their regular salary for their time.</p>
<p>This study was waived from ethical review by the Regional Committee of Health Research Ethics - Capital Region, Denmark, after inquiry submission, as it was considered to be research in the social domain. All participants gave active consent to the study, after receiving written and oral information, in accordance with the Declaration of Helsinki. Requirements regarding the General Data Protection Regulation (GDPR) were carefully followed.</p>
</sec>
<sec id="sec8">
<label>2.2.</label>
<title>Stimuli</title>
<sec id="sec9">
<label>2.2.1.</label>
<title>Phonological discrimination task</title>
<p>The discrimination task was composed of two lists containing 40 pairs of disyllabic-nonwords, selected from the nonword corpus published by <xref ref-type="bibr" rid="ref46">Nielsen and Dau (2019)</xref>. From the original material, C&#x2212;/a/-C&#x2212;/a/ nonwords starting with one of the 14 main initial phonemes for the Danish language (/p t k b d g m n l f v s r h /) were selected. The recordings selected had 100% speech intelligibility score reported in the original study by <xref ref-type="bibr" rid="ref46">Nielsen and Dau (2019)</xref>, for both first and second consonants. The present study focuses only on the first phoneme contrast.</p>
<p>For the task, nonwords were combined in pairs, to account for all the minimal-pair combinations in Danish for which just one distinctive production feature is present (place, voice/aspiration, or manner, as in &#x2018;<italic>bafi &#x2013; &#x2018;pafi&#x2019;</italic>). To facilitate phonological &#x2500; instead of acoustical &#x2500; comparison of the word pairs, recordings of each nonword from three different speakers were selected from the original material and each pair was presented using audio from two different speakers, selected randomly among the three possible recordings. Post-hoc analysis revealed no effect of the speaker-pair used on participants&#x2019; performance.</p>
<p>All the audio recordings were normalized by root mean square (RMS) and silence was added in the beginning of each file to align the nonwords&#x2019; offset during the task and to randomize the nonword start and the interval between nonwords.</p>
</sec>
<sec id="sec10">
<label>2.2.2.</label>
<title>Detection task</title>
<p>A second task explored the effect of context, in which the participants were asked to track a phoneme substitution in a word within a sentence (<xref ref-type="bibr" rid="ref9001">Pittman and Schuett, 2013</xref>). The detection task was composed of two lists of 36 four-word-sentences. The original sentences in the lists were composed of simple words from a 3&#x2009;year-old child&#x2019;s vocabulary (<xref ref-type="bibr" rid="ref5">Bleses et al., 2008a</xref>, <xref ref-type="bibr" rid="ref6">2008b</xref>) to guarantee that the words included would be well known by the participants. The sentences were evaluated as highly meaningful by a group of 25 native Danish speakers, in a pre-study conducted by our group. For half of the sentences in each list, the first phoneme of the second word was substituted for another phoneme with similar phonotactic probability (e.g., <italic>Hunden finder altid maden</italic> [the dog always finds the food] -&#x2009;&#x003E;&#x2009;<italic>Hunden <underline>sinder</underline> altid maden</italic>, an equivalent example in English would be &#x2018;Dad buys new shirts&#x2019; -&#x2009;&#x003E;&#x2009;&#x2018;Dad <underline>fuys</underline> new shirts&#x2019;, from <xref ref-type="bibr" rid="ref9001">Pittman and Schuett (2013)</xref>). We were careful to choose phonemes that would generate a nonword when replacing the original phoneme, which was confirmed by a group of 14 native speakers who listened to the generated nonwords in isolation and were asked to write the first real word it would remind them of (less than 50% of the participants could point to same original or other real word). Due to this requirement, the phonemes selected had contrasts in one or more production features with the original phoneme, which potentially added cues that may have aided detection in the sentence context. Moreover, to avoid that the second word in the sentence could be predicted by the sentence context, the same group of 25 native speakers were asked to complete the sentences where the target word was missing. Only the sentences with less than 10% of participants filling in the same real word (defined as low cloze probability in <xref ref-type="bibr" rid="ref36">Kutas and Hillyard, 1984</xref>) were included in the lists.</p>
<p>The final 72 sentences, half with embedded nonwords, were recorded by a female native Danish speaker with an accent from the Danish capital region. She was instructed to pronounce the sentences in a natural prosody but in a slow speaking pace. All the recordings were normalized by RMS and silence was added in the beginning of each file to randomize the sentences&#x2019; start.</p>
</sec>
<sec id="sec11">
<label>2.2.3.</label>
<title>Stimuli vocoding</title>
<p>In order to reduce the acoustic features, challenging the detection and discrimination of phonological contrasts (<xref ref-type="bibr" rid="ref58">Stenfelt and R&#x00F6;nnberg, 2009</xref>), one list was randomly vocoded for each participant. The vocoding process includes dividing the speech signal into frequency bands, extracting the amplitude envelope for each band, and using it to modulate a noise band, resynthesizing the bands to create a new audio file. For this study, the vocoded versions of the stimuli were generated using the software Praat (<xref ref-type="bibr" rid="ref7">Boersma and Weenink, 1992</xref>) and the open-source code provided by <xref ref-type="bibr" rid="ref67">Winn (2021)</xref> (version 45). An 8-channel vocoder was used, with flat-spectrum noise-carrier, and corner frequencies set between 0.2-8&#x2009;kHz. This number of bands was chosen to add challenges to the transmission of spectral information while approaching the asymptotic speech-recognition performance in quiet (<xref ref-type="bibr" rid="ref15">Dorman et al., 1997</xref>; <xref ref-type="bibr" rid="ref18">Friesen et al., 2001</xref>; <xref ref-type="bibr" rid="ref73">Xu et al., 2005</xref>).</p>
</sec>
<sec id="sec12">
<label>2.2.4.</label>
<title>Vocoded real-word recognition</title>
<p>Considering that a participant&#x2019;s inability to recognize real words in the vocoded condition could influence their performance on nonword detection, vocoded word recognition scores were also calculated. The first list of the clinical test Dantale I (<xref ref-type="bibr" rid="ref17">Elberling et al., 1989</xref>) in silence was vocoded using the method described above. Participants were presented with the recorded monosyllabic words in isolation and were asked to repeat them aloud. The participant&#x2019;s response was recorded and transcribed, for offline scoring.</p>
</sec>
</sec>
<sec id="sec13">
<label>2.3.</label>
<title>Pupillometry</title>
<p>Pupil size was continually measured by the Pupil Core&#x00AE; platform (Pupil Labs GmbH, Berlin). The glasses-mounted solution includes one front camera recording the gaze direction and two infra-red cameras that record the pupils at a sampling frequency of 200&#x2009;Hz. Pupil tracking is done in dark mode. The software provides the pupil size for each eye in arbitrary units (pixels) and a confidence score, defined as an index, between 0 and 1, indicating the quality of the acquired value.</p>
</sec>
<sec id="sec14">
<label>2.4.</label>
<title>Procedure</title>
<p>The study protocol was implemented via computer on the OpenSesame platform (<xref ref-type="bibr" rid="ref41">Math&#x00F4;t et al., 2012</xref>), using the features developed by <xref ref-type="bibr" rid="ref59">Sulas et al. (2022)</xref>.</p>
<p>The experiment was conducted in an acoustically treated sound studio. Participants indicated their responses using a touchscreen monitor placed on a table in front of them. The monitor was positioned to have the top &#x00BE; of the screen aligned to the participant&#x2019;s eye. Sound was presented from a loudspeaker positioned 1&#x2009;m directly in front of the participant (0-degrees Azimuth). Test participants wore the pupillometry glasses with the cameras adjusted so that the pupils were in the middle of the cameras respective field of view. The glasses were worn during the whole session and adjusted as needed between tasks in case of displacement. Lighting conditions and the screen luminance were kept constant at 200 lumens.</p>
<p>Prior to starting the experimental tasks, the participants were familiarized with vocoded speech. Sentences were presented back-to-back in non-vocoded and vocoded conditions, for about 3&#x2009;min, until the participant reported feeling comfortable recognizing the sentence in the vocoded version. The full testing session included other speech perception tasks not reported here and took approximately 1.5&#x2009;h. Task order and sequence were randomized to counterbalance fatigue effects. All tests were preceded by verbal and written instructions, plus a training phase during which direct verbal feedback and clarifications were provided.</p>
<p>In the phonological discrimination task, word pairs were presented one by one. The participant was asked to indicate if the second word in the pair was the same as the first in a &#x2018;yes/no&#x2019; paradigm. A fixation dot was kept in the screen from 2&#x2009;s before until 2&#x2009;s after the presentation of each word pair (detailed in <xref rid="fig1" ref-type="fig">Figure 1</xref>). Participants were asked to look at the dot in order to reduce eye movements and improve the quality of the pupillometry data. After each pair presentation, participants indicated their response via touchscreen. This task took approximately 7&#x2009;min to complete in each condition, and conditions were randomized across participants.</p>
<fig position="float" id="fig1">
<label>Figure 1</label>
<caption>
<p>Example of sequence of screens and actions on the phonological discrimination and detection tasks.</p>
</caption>
<graphic xlink:href="fpsyg-14-1232262-g001.tif"/>
</fig>
<p>In the detection task, participants were asked to indicate if the sentence contained a nonword (the phonologically modified word) in a &#x2018;yes/no&#x2019; paradigm. Participants listened to a list of 36 sentences in each condition. The trial sequence is illustrated in <xref rid="fig1" ref-type="fig">Figure 1</xref>. Two seconds of silence were added before and after each sentence, while a fixation dot was kept on the screen. Testing took approximately 5&#x2009;min in each condition, and conditions were randomized across participants.</p>
</sec>
<sec id="sec15">
<label>2.5.</label>
<title>Analysis</title>
<sec id="sec16">
<label>2.5.1.</label>
<title>Task performance</title>
<p>For both phonological discrimination and the detection tasks, accuracy for &#x2018;yes/no&#x2019; responses was recorded. Analysis was conducted in terms of signal detection theory (<xref ref-type="bibr" rid="ref40">Macmillan and Creelman, 2005</xref>). The &#x2018;signal&#x2019; in the stimulus was defined as the presence of a phonological contrast, namely, the presence of a nonword in the sentence or a phonological substitution in the second token of the nonword-pair. Responses were classified as &#x2018;hits&#x2019;: correct responses when the signal was present, &#x2018;misses&#x2019;: incorrect responses when the signal was present, &#x2018;correct rejections&#x2019;: correct responses when the signal was absent, and &#x2018;false alarms&#x2019;: incorrectly reporting the presence of the signal when it was absent. The proportion of correct responses was calculated as the sum of &#x2018;hits&#x2019; and &#x2018;correct rejections&#x2019;, divided by the total number of trials (<xref ref-type="bibr" rid="ref40">Macmillan and Creelman, 2005</xref>).</p>
<p>The discrimination score (<italic>d&#x2019;</italic>) was calculated as measure of the participants&#x2019; sensitivity to the presence of a signal (<xref ref-type="bibr" rid="ref40">Macmillan and Creelman, 2005</xref>). It was estimated by subtracting the <italic>z</italic>-transformed &#x2018;hit&#x2019; rates and &#x2018;false-alarm&#x2019; rates. To avoid floor and ceiling effects in <italic>d&#x2019;</italic> calculation, a correction for the extreme values was performed using the log linear approach described by <xref ref-type="bibr" rid="ref56">Stanislaw and Todorov (1999)</xref>, by adding 0.5 to both the number of &#x2018;hits&#x2019; and &#x2018;false alarms&#x2019; and adding 1 to the number of trials, before calculating the <italic>d&#x2019;</italic> score. Additionally, to analyze a possible response bias toward selecting one of the two options (&#x2018;yes&#x2019;/&#x2018;no&#x2019;), the criterion location was calculated as minus half of the sum of <italic>z-</italic>transformed &#x2018;hit&#x2019; and &#x2018;false-alarms&#x2019;. A positive criterion value indicates a bias to &#x2018;miss&#x2019; the signal although it is present, while negative values represent bias toward accusing the presence of the signal despite its absence (&#x2018;false alarms&#x2019;). Together, <italic>d&#x2019;</italic> and criterion location give a parameter of participants&#x2019; strategy in the phonological discrimination and detection tasks.</p>
</sec>
<sec id="sec17">
<label>2.5.2.</label>
<title>Pupillometry pre-processing and analysis.</title>
<p>Pupil data were segmented by trial, and data were analyzed from the eye with best overall confidence during the task, calculated by the percentage of data points over 0.85 of confidence, as reported by the equipment software. The data were cleaned of blinks and artifacts by detecting dilation speed outliers with the method described by <xref ref-type="bibr" rid="ref34">Kret and Sjak-Shie (2019)</xref> and excluding the flagged data points with a backward and forward margin of 50&#x2009;ms. Data reconstruction was done using Piecewise Cubic Hermite Interpolating Polynomial (Pchip) or linear interpolation when the Pchip was not possible (where there were not enough points available before or after the region to interpolate), considered the good reconstruction properties of both methods reported by <xref ref-type="bibr" rid="ref14">Dan et al. (2020)</xref>. Blinks above 500&#x2009;ms were not reconstructed. The individual data points were downsampled to 30&#x2009;Hz, as the pupil response latency of is over 200&#x2009;ms (<xref ref-type="bibr" rid="ref72">Winn et al., 2018</xref>; <xref ref-type="bibr" rid="ref42">Math&#x00F4;t and Vilotijevi&#x0107;, 2022</xref>), and smoothed using a moving-average filter of 0.1&#x2009;s.</p>
<p>Trials with more than 45% interpolated data were excluded from the analysis (<xref ref-type="bibr" rid="ref9">Burg et al., 2021</xref>; <xref ref-type="bibr" rid="ref76">Zhang et al., 2022</xref>). Baseline pupil size was calculated per-trial by taking the mean pupil size during the 500&#x2009;ms right before stimulus onset (<xref ref-type="bibr" rid="ref55">Seropian et al., 2022</xref>). All subsequent data points in the trial were calculated as the proportional change relative to that baseline pupil size. As a last step, raw and processed data were visually inspected to identify and exclude trials with potential contamination, as artifacts in the baseline estimation period or absolute changes in pupil size over 40% of the baseline (<xref ref-type="bibr" rid="ref65">Winn, 2016</xref>; <xref ref-type="bibr" rid="ref72">Winn et al., 2018</xref>).</p>
<p>Subjects with more than 50% of the trials excluded from one task condition, had their results excluded from the analysis in that specific task. This criterion excluded pupillometric data from three subjects in both conditions of the phonological discrimination task only. For the remaining participants and tests, the aggregated trace of the pupil response for correct answers was calculated. Data was extracted regarding the value and time of the maximum pupil size &#x2013; respectively, peak pupil dilation (PPD) and the peak pupil dilation latency (PPL) &#x2013; from the time window spamming from the target stimulus onset (the second word) to 1&#x2009;s after the audio offset. To compare with studies with similar methodology (<xref ref-type="bibr" rid="ref63">Wagner et al., 2016</xref>; <xref ref-type="bibr" rid="ref70">Winn and Teece, 2021</xref>), a growth curve analysis (GCA) was carried out, which models the quadratic fit of the pupil curve between the target-stimulus onset and the PPD.</p>
</sec>
<sec id="sec18">
<label>2.5.3.</label>
<title>Inferential analysis</title>
<p>Inferential analysis was conducted in Python 3.9, using &#x2018;SciPy&#x2019; (v. 1.7.3) and &#x2018;Statsmodels&#x2019; (v. 0.13.2) packages. Normality in distribution was assessed using Shapiro&#x2013;Wilk, for the subsequent choice of parametric or nonparametric statistical tests described in the results. Paired comparisons were conducted for mean/median comparison of signal detection performance (<italic>d&#x2019;</italic>) in vocoded and non-vocoded conditions, and the sequential points in the pupillometric curve in &#x2018;yes&#x2019; versus &#x2018;no&#x2019; tasks. Logistic regression was used to investigate how GCA parameters (intercept, slope, and quadratic term) could be modeled to determine the type of pair (&#x2018;yes&#x2019; or &#x2018;no&#x2019;) identified. Additionally, effects of vocoding in the pupillometry metrics were analyzed using a linear mixed effect model in a matrix of auditory condition (&#x2018;vocoded&#x2019; or &#x2018;non-vocoded&#x2019;) and pair type (&#x2018;yes&#x2019; or &#x2018;no&#x2019;), with participants attributed as random effects. The inclusion of pair type in the model derives from the assumption that the detection of a phonological contrast in the target word (&#x2018;yes&#x2019; tasks) would produce a more prominent response in task evoked pupillometry (<xref ref-type="bibr" rid="ref31">Kinzuka et al., 2020</xref>).</p>
</sec>
</sec>
</sec>
<sec sec-type="results" id="sec19">
<label>3.</label>
<title>Results</title>
<sec id="sec20">
<label>3.1.</label>
<title>Performance results</title>
<p>The participants had near ceiling scores on the perception of phonological contrasts for non-vocoded speech, with mean <italic>d&#x2019;</italic> scores of 3.39 (SD&#x2009;=&#x2009;0.56) for the phonological discrimination (<xref rid="fig2" ref-type="fig">Figure 2</xref>) and 3.66 (SD&#x2009;=&#x2009;0.46) for the detection task (<xref rid="fig3" ref-type="fig">Figure 3</xref>). For vocoded speech, the performance decreased significantly in both tests, with mean d&#x2019; scores of 1.04 (SD&#x2009;=&#x2009;0.48) for the phonological discrimination and 1.26 (SD&#x2009;=&#x2009;0.54) for the detection task. The difference between non-vocoded and vocoded conditions was confirmed by paired comparison <italic>t</italic>-tests, <italic>t</italic> (21)&#x2009;=&#x2009;16.15 for phonological discrimination and <italic>t</italic> (21)&#x2009;=&#x2009;15.39 for detection task, <italic>p</italic>&#x2009;&#x003C;&#x2009;0.001 for both tasks. Despite lower scores, mean performance was above the 50% chance level in the vocoded condition (mean 70% correct responses for phonological discrimination and 73% correct responses for detection task), confirming that the participants were able to perform both tasks with the vocoded stimuli. Vocoded real-word recognition in the Dantale test had an average accuracy of 34% (SD&#x2009;=&#x2009;16%). In a simple regression model, the word recognition of vocoded speech alone accounted for over 20% of the variance in the phonological discrimination <italic>d&#x2019;</italic> scores, <italic>R<sup>2</sup></italic>&#x2009;=&#x2009;0.21, <italic>F</italic> (1,19)&#x2009;=&#x2009;4.97, <italic>p</italic>&#x2009;=&#x2009;0.04, but did not explain the variance in the detection task, <italic>R<sup>2</sup></italic>&#x2009;=&#x2009;0.01, <italic>F</italic> (1,19)&#x2009;=&#x2009;0.14, <italic>p</italic>&#x2009;=&#x2009;0.71.</p>
<fig position="float" id="fig2">
<label>Figure 2</label>
<caption>
<p>Performance in the discrimination task on vocoded and non-vocoded conditions. Stacked bar plot. White bars represent nonword pairs containing a phonological contrast in the second nonword and gray bars pairs with the same nonword. Full bars represent correct responses, while dashed bars represent errors.</p>
</caption>
<graphic xlink:href="fpsyg-14-1232262-g002.tif"/>
</fig>
<fig position="float" id="fig3">
<label>Figure 3</label>
<caption>
<p>Performance in the detection task on vocoded and non-vocoded conditions. Stacked bar plot. White bars represent sentences containing a nonword and gray bars sentences without a nonword. Full bars represent correct responses, while dashed bars represent errors.</p>
</caption>
<graphic xlink:href="fpsyg-14-1232262-g003.tif"/>
</fig>
<p>Analyzing the effect of lexical context in the detection of a phonological contrast, Wilcoxon signed-ranks test showed no difference in performance with or without lexical context, when comparing the <italic>d&#x2019;</italic> scores of the phonological discrimination and detection tasks, <italic>z</italic>&#x2009;=&#x2009;80.0, <italic>p</italic>&#x2009;=&#x2009;0.13. Nevertheless, in the response bias analysis, criterion was located positively at a mean of 0.24 (SD&#x2009;=&#x2009;0.26) for the detection task, suggesting that participants were biased toward not detecting the nonword despite its presence, while for the phonological discrimination task, criterion was placed much closer to zero, at a mean of &#x2212;0.04 (SD&#x2009;=&#x2009;0.18), suggesting no bias on the response.</p>
</sec>
<sec id="sec21">
<label>3.2.</label>
<title>Pupillometry responses</title>
<p>The analysis of the pupil data was restricted to trials with correct responses to determine whether successful responses could be differentiated based on pupil dynamics. The aggregated pupillometry response traces across time for both tasks, encompassing data from all participants, are presented in <xref rid="fig4" ref-type="fig">Figure 4</xref> and <xref rid="fig5" ref-type="fig">Figure 5</xref> with respective detailed information in <xref rid="tab1" ref-type="table">Table 1</xref> and <xref rid="tab2" ref-type="table">Table 2</xref>. In the phonological discrimination task, PPL occurred at a mean of 678&#x2009;ms (SD&#x2009;=&#x2009;867) after the presentation of the second word. In the detection task, the PPL for all conditions occurred at a mean of 2.04&#x2009;s (SD&#x2009;=&#x2009;1.07) after the onset of the nonword, or after 2.18&#x2009;s (SD&#x2009;=&#x2009;1.13) of the onset of the second word for all-real-word sentences when the same alignment was used, which aligns roughly with the offset of the sentence.</p>
<fig position="float" id="fig4">
<label>Figure 4</label>
<caption>
<p>Pupil size over time in vocoded and non-vocoded conditions, for the phonological discrimination task, aggregated between participants.</p>
</caption>
<graphic xlink:href="fpsyg-14-1232262-g004.tif"/>
</fig>
<fig position="float" id="fig5">
<label>Figure 5</label>
<caption>
<p>Pupil size over time in vocoded and non-vocoded conditions, for the detection task, aggregated between participants. &#x002A; Timeframes with significant difference between nonword and all-real word sentence types in non-vocoded condition (<italic>p</italic>&#x2009;&#x003C;&#x2009;0.05).</p>
</caption>
<graphic xlink:href="fpsyg-14-1232262-g005.tif"/>
</fig>
<table-wrap position="float" id="tab1">
<label>Table 1</label>
<caption>
<p>Pupillometry measures for the phonological discrimination task in the non-vocoded and vocoded conditions, for each pair type.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Condition</th>
<th/>
<th align="center" valign="top" colspan="2">Contrast</th>
<th align="center" valign="top" colspan="2">Equal</th>
<th align="center" valign="top"><italic>t</italic> (18)/<italic>z</italic></th>
<th align="center" valign="top">
<italic>p</italic>
</th>
<th align="center" valign="top">n</th>
</tr>
</thead>
<tbody>
<tr>
<td/>
<td/>
<td align="center" valign="top">M</td>
<td align="center" valign="top">SD</td>
<td align="center" valign="top">M</td>
<td align="center" valign="top">SD</td>
<td/>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top" rowspan="2">Non-vocoded</td>
<td align="left" valign="top">PPD (%)</td>
<td align="center" valign="top">0.13</td>
<td align="center" valign="top">0.34</td>
<td align="center" valign="top">0.24</td>
<td align="center" valign="top">0.41</td>
<td align="center" valign="top">-1.2<sup>a</sup></td>
<td align="center" valign="top">0.247</td>
<td align="center" valign="top">19</td>
</tr>
<tr>
<td align="left" valign="top">PPL (ms)</td>
<td align="center" valign="top">516</td>
<td align="center" valign="top">811</td>
<td align="center" valign="top">787</td>
<td align="center" valign="top">926</td>
<td align="center" valign="top">12.5<sup>b</sup></td>
<td align="center" valign="top">
<bold>0.037</bold><sup>&#x002A;</sup></td>
<td align="center" valign="top">19</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="2">Vocoded</td>
<td align="left" valign="top">PPD (%)</td>
<td align="center" valign="top">0.22</td>
<td align="center" valign="top">0.46</td>
<td align="center" valign="top">0.17</td>
<td align="center" valign="top">0.34</td>
<td align="center" valign="top">1.06<sup>a</sup></td>
<td align="center" valign="top">0.303</td>
<td align="center" valign="top">19</td>
</tr>
<tr>
<td align="left" valign="top">PPL (ms)</td>
<td align="center" valign="top">592</td>
<td align="center" valign="top">844</td>
<td align="center" valign="top">817</td>
<td align="center" valign="top">919</td>
<td align="center" valign="top">14.5<sup>b</sup></td>
<td align="center" valign="top">0.054</td>
<td align="center" valign="top">19</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>Statistical tests: <sup>a</sup> paired t test; <sup>b</sup> Wilcoxon signed ranks test; PPD&#x2009;=&#x2009;peak pupil dilation; PPL&#x2009;=&#x2009;peak pupil latency.</p>
<p><sup>&#x002A;</sup><italic>p</italic> &#x003C; 0.05.</p>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="tab2">
<label>Table 2</label>
<caption>
<p>Pupillometry measures for the detection task in the non-vocoded and vocoded conditions, for each pair type.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Condition</th>
<th/>
<th align="center" valign="top" colspan="2">Nonword</th>
<th align="center" valign="top" colspan="2">All-real</th>
<th align="center" valign="top"><italic>t</italic> (21)/<italic>z</italic></th>
<th align="center" valign="top">
<italic>p</italic>
</th>
<th align="center" valign="top">n</th>
</tr>
</thead>
<tbody>
<tr>
<td/>
<td/>
<td align="center" valign="top">M</td>
<td align="center" valign="top">SD</td>
<td align="center" valign="top">M</td>
<td align="center" valign="top">SD</td>
<td/>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top" rowspan="2">Non-vocoded</td>
<td align="left" valign="top">PPD (%)</td>
<td align="center" valign="top">0.20</td>
<td align="center" valign="top">0.36</td>
<td align="center" valign="top">0.05</td>
<td align="center" valign="top">0.37</td>
<td align="center" valign="top">2.32<sup>a</sup></td>
<td align="center" valign="top">
<bold>0.030</bold><sup>&#x002A;</sup></td>
<td align="center" valign="top">22</td>
</tr>
<tr>
<td align="left" valign="top">PPL (ms)</td>
<td align="center" valign="top">1844</td>
<td align="center" valign="top">1,090</td>
<td align="center" valign="top">2013</td>
<td align="center" valign="top">1,190</td>
<td align="center" valign="top">96.0<sup>b</sup></td>
<td align="center" valign="top">0.498</td>
<td align="center" valign="top">22</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="2">Vocoded</td>
<td align="left" valign="top">PPD (%)</td>
<td align="center" valign="top">0.25</td>
<td align="center" valign="top">0.39</td>
<td align="center" valign="top">0.28</td>
<td align="center" valign="top">0.42</td>
<td align="center" valign="top">&#x2212;122.0<sup>b</sup></td>
<td align="center" valign="top">0.898</td>
<td align="center" valign="top">22</td>
</tr>
<tr>
<td align="left" valign="top">PPL (ms)</td>
<td align="center" valign="top">2,233</td>
<td align="center" valign="top">1,030</td>
<td align="center" valign="top">2,342</td>
<td align="center" valign="top">1,067</td>
<td align="center" valign="top">&#x2212;0.31<sup>a</sup></td>
<td align="center" valign="top">0.754</td>
<td align="center" valign="top">22</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>Statistical tests: <sup>a</sup> paired t test; <sup>b</sup> Wilcoxon signed ranks test; PPD&#x2009;=&#x2009;peak pupil dilation; PPL&#x2009;=&#x2009;peak pupil latency.</p>
<p><sup>&#x002A;</sup><italic>p</italic> &#x003C; 0.05.</p>
</table-wrap-foot>
</table-wrap>
<sec id="sec22">
<label>3.2.1.</label>
<title>Differences in pupil responses due to phonological contrast</title>
<p>In the non-vocoded condition, pupil parameters were sensitive to the presence or absence of the phonological contrast. In the phonological discrimination task, the PPL for pairs without contrast occurred, on average, 217&#x2009;ms later than pairs with contrast, while PPD values had no significant difference (<xref rid="tab1" ref-type="table">Table 1</xref>). A logistic regression analysis showed no significant effects on the pupil curve intercept, slope, quadratic term, or their interactions, between pairs with and without contrast, <italic>&#x03C7;</italic>2 (7, <italic>n</italic>&#x2009;=&#x2009;44)&#x2009;=&#x2009;42.6, <italic>p</italic>&#x2009;=&#x2009;0.67. In the detection task, participants exhibited greater PPD for sentences containing a nonword compared to sentences containing all real words, with no significant difference in PPL (<xref rid="tab2" ref-type="table">Table 2</xref>). The differences in pupil dilation were significant in the interval of 610&#x2009;ms to 1750&#x2009;ms after the nonword onset (<xref rid="fig5" ref-type="fig">Figure 5</xref>), although the logistic regression analysis did not show any significant differences in the parameters of the fitted curve between trials with and without the phonological contrast, <italic>&#x03C7;</italic>2 (7, <italic>n</italic>&#x2009;=&#x2009;37)&#x2009;=&#x2009;33.9, <italic>p</italic>&#x2009;=&#x2009;0.45. No differences between trials with or without the phonological contrast were found for the vocoded stimuli.</p>
</sec>
<sec id="sec23">
<label>3.2.2.</label>
<title>Differences in pupil responses due to speech degradation</title>
<p>In contrast to the perceptual results, there was no effect of speech degradation on pupil measures in the phonological discrimination task. This was found when analyzed with a linear mixed effects model that included auditory condition (vocoded or non-vocoded) and pair type (phonological contrast present or absent) as fixed effects, and participant as a random effect (models&#x2019; marginal <italic>R</italic><sup>2</sup>s&#x2009;=&#x2009;0.001 and 0.021, conditional <italic>R</italic><sup>2</sup>s&#x2009;=&#x2009;0.394 and 0.311, for PPD and PPL, respectively).</p>
<p>For the detection task, a higher PPD (<italic>&#x03B2;</italic>&#x2009;=&#x2009;0.014, <italic>p</italic>&#x2009;=&#x2009;0.007) was seen as an effect of speech degradation, when considered in a similar condition x pair type linear mixed effects model, with participants as random effects (marginal <italic>R</italic><sup>2</sup>&#x2009;=&#x2009;0.037, conditional <italic>R</italic><sup>2</sup>&#x2009;=&#x2009;0.611). There was no effect of vocoding on PPL (<italic>&#x03B2;</italic>&#x2009;=&#x2009;0.359, <italic>p</italic>&#x2009;=&#x2009;0.102, marginal <italic>R</italic><sup>2</sup>&#x2009;=&#x2009;0.032, conditional <italic>R</italic><sup>2</sup>&#x2009;=&#x2009;0.103).</p>
</sec>
</sec>
</sec>
<sec sec-type="discussions" id="sec24">
<label>4.</label>
<title>Discussion</title>
<p>This study aimed to investigate the sensitivity of pupil temporal dynamics to multiple levels of auditory processing of phonological information. At the low-level, by analyzing the possibility of detecting phonological discrimination in the absence of lexical context, and at the high-level by directing attention to the phonological contrasts in the presence of context and lack of complete acoustic information.</p>
<sec id="sec25">
<label>4.1.</label>
<title>Performance results</title>
<p>Performance data showed similar performance in the perception of phonological contrasts in both isolated nonword-pairs and nonwords embedded in sentences. Performance was equally affected by speech degradation, as indicated by changes in accuracy and <italic>d&#x2019;</italic> scores in the vocoded speech condition. Although performance was poorer when the stimuli were degraded with a vocoder, participants were still able to perform the task above-chance, with over 70% accuracy. These results demonstrate participants&#x2019; ability to utilize both low- and high-level strategies to perform speech tasks effectively. However, it indicates that accuracy and sensitivity alone do not provide sufficient information to differentiate between the type of strategy used by individual participants.</p>
<p>The false alarm rate in the vocoded phonological discrimination task (<xref rid="fig2" ref-type="fig">Figure 2</xref>) suggests that perceiving two speakers as producing the same nonword was challenging in the degraded speech condition, although the material used in this study has been evaluated as not containing ambiguous phonemes in unprocessed speech (<xref ref-type="bibr" rid="ref46">Nielsen and Dau, 2019</xref>). False alarms occur when pairs were mistakenly perceived as having a contrast when they did not contain a contrast (i.e., two different speakers producing the same word), revealing a failure to perceive stable signal characteristics during phoneme recognition. Moreover, single-word recognition under vocoded condition appeared as a predictive factor for phonological discrimination, indicating similarity in the tasks&#x2019; underlying processes. Participants in both tasks were forced to rely on the variable acoustic characteristics of the speech signal to make phonological decisions, and failures occurred unbiased, regardless of the presence or absence of contrast.</p>
<p>Previous studies of phoneme confusion, employing similar 8-channel noise vocoders, have documented higher consonant recognition performance compared to the results observed in this study (<xref ref-type="bibr" rid="ref18">Friesen et al., 2001</xref>; <xref ref-type="bibr" rid="ref73">Xu et al., 2005</xref>; <xref ref-type="bibr" rid="ref77">Zhou et al., 2010</xref>; <xref ref-type="bibr" rid="ref25">Jahn et al., 2019</xref>; <xref ref-type="bibr" rid="ref21">Goupell et al., 2020</xref>). These prior studies reported consonant recognition accuracy ranging from approximately 60% (<xref ref-type="bibr" rid="ref77">Zhou et al., 2010</xref>) in &#x2018;consonant-vowel&#x2019; contexts, to 68% for monosyllables (<xref ref-type="bibr" rid="ref21">Goupell et al., 2020</xref>) and around 80% (<xref ref-type="bibr" rid="ref73">Xu et al., 2005</xref>; <xref ref-type="bibr" rid="ref25">Jahn et al., 2019</xref>) within &#x2018;vowel-consonant-vowel&#x2019; contexts. However, those studies have used a closed set of syllables or words for the consonant recognition task. The open-set word recognition used in our study was more difficult for participants when attempting to identify the target word. The open-set task increased the number of potential responses, which enhances the activation of neighboring words in the word recognition task. Moreover, for the discrimination of contrasts, the vocoder might be more detrimental to the identification of initial consonants rather than medial consonants, as the highest accuracies were reported in studies using medial consonant identification (<xref ref-type="bibr" rid="ref18">Friesen et al., 2001</xref>; <xref ref-type="bibr" rid="ref25">Jahn et al., 2019</xref>). In medial positions, the transition information from vowel to consonant is readily available and contributes to phoneme recognition (<xref ref-type="bibr" rid="ref73">Xu et al., 2005</xref>). Therefore, the participants&#x2019; ability to predict the consonants may have been compromised in our study, shown by the reduced accuracy scores in the phonological discrimination task.</p>
<p>As expected, the presence of context led to a bias toward reporting nonwords as real words in the vocoded detection task, causing the participants to ignore or miss the phonological contrast. A degraded signal amplifies the perceptual bias in phonological perception, increasing the reliance on non-acoustic information such as lexical information and context when categorizing phonological contrasts (<xref ref-type="bibr" rid="ref20">Gianakas and Winn, 2019</xref>; <xref ref-type="bibr" rid="ref61">Vickery et al., 2022</xref>; <xref ref-type="bibr" rid="ref71">Winn and Teece, 2022</xref>), producing the effects observed.</p>
</sec>
<sec id="sec26">
<label>4.2.</label>
<title>Pupillometry responses</title>
<sec id="sec27">
<label>4.2.1.</label>
<title>Differences in pupil responses due to phonological contrast</title>
<p>The presence of a phonological contrast did not elicit higher pupil dilation in the phonological discrimination task, as it would be expected in a presence of a variant stimuli (<xref ref-type="bibr" rid="ref63">Wagner et al., 2016</xref>; <xref ref-type="bibr" rid="ref31">Kinzuka et al., 2020</xref>). The differences here can be attributed to the demands of the tasks. The simplicity of the forced-choice task might not have elicited sufficient differences in the demand for cognitive processing to capture the effect of the phonological contrast. Additionally, in contrast to previous studies which used words as material for discrimination, in our study the participants could not use lexical information to support the decision regarding the change in the phoneme category. Therefore, their judgment was forced to occur solely at the phonological level. The absence of a significant difference in pupil parameters suggests that pupil dynamics may be more sensitive to higher-level cognitive and language processing, as to lexical categorization (<xref ref-type="bibr" rid="ref30">Kamp and Donchin, 2015</xref>), rather than lower-level phonological categorization.</p>
<p>Moreover, the pupil response to phonological contrasts may be indistinguishable from the response for the perception of acoustic contrasts. The contrast between two speakers in our paradigm, one in each token in the nonword-pair, was done to ensure that discrimination was occurring at a phonological rather than acoustical level. It is known that different speakers possess a natural variability in multiple acoustic domains as voice-onset-time, vowel formants, consonant intensity, among others (<xref ref-type="bibr" rid="ref2">Allen et al., 2003</xref>; <xref ref-type="bibr" rid="ref11">Christiansen and Henrichsen, 2011</xref>). Therefore, identifying two nonwords as the same would require their processing at the phonological level. However, as the pupil dilates for acoustic deviants, such as pure tones and noise varying in frequency (<xref ref-type="bibr" rid="ref38">Liao et al., 2016</xref>; <xref ref-type="bibr" rid="ref54">Selezneva et al., 2021</xref>), pupil dilation could also be an index of the processing of the dynamic acoustic characteristics of speech in an effort to solve ambiguity caused by interspeaker variations in phoneme production and boundaries (<xref ref-type="bibr" rid="ref37">Lewis and Bidelman, 2020</xref>; <xref ref-type="bibr" rid="ref66">Winn, 2020</xref>; <xref ref-type="bibr" rid="ref51">Reese and Reinisch, 2022</xref>; <xref ref-type="bibr" rid="ref74">Yu, 2022</xref>). Such a response to acoustic differences would explain the comparable PPDs recorded for both pairs with and without phonological contrast, since for both types of pairs the acoustic variability was present.</p>
<p>Interestingly, in the phonological discrimination task, PPLs were shorter for pairs with phonological contrast than for pairs without contrast. <xref ref-type="bibr" rid="ref32">Koelewijn et al. (2017)</xref> describe the PPL as a measure of the speed of cognitive processing, with shorter latencies indicating faster cognitive processing or the need for processing less information. One explanation for our results is that to correctly identify a phonological contrast, the participant would only need to identify the first phoneme of the second word in the pair, but to correctly identify the absence of a contrast required the processing of the whole nonword in a pair. Therefore, a decision could be taken quicker with far less information for pairs with contrast.</p>
<p>The presence of context, in the detection task, led to higher PPD in sentences containing a phonological altered word (nonword). This effect was expected as it had been previously reported by <xref ref-type="bibr" rid="ref63">Wagner et al. (2016)</xref> and <xref ref-type="bibr" rid="ref70">Winn and Teece (2021</xref>, <xref ref-type="bibr" rid="ref71">2022)</xref>. These studies found that substituted and distorted phonemes within words in a sentence lead to steeper pupil dilation. As discussed in <xref ref-type="bibr" rid="ref71">Winn and Teece (2022)</xref>, the presence of sentence context makes it difficult to determine if the higher dilation occurs due to increased cognitive demand for sentence processing introduced by the ambiguous lexical entry, or due to the detection of the phonological contrast. However, the absence of difference in the results of the phonological discrimination task suggests that the pupil response may be more closely linked to the violation of the lexical expectation rather than the phonological contrast.</p>
<p>Remarkably, in our study, participants were not asked to process the whole sentence in any manner (they did not repeat it back, nor derived its meaning). Therefore, it could be expected that after detecting the nonword in the second position of the sentence, the participants&#x2019; demand for processing would immediately decrease, which should have resulted in a reduction in the pupil size. Yet, the observed pupil behavior indicates that the whole sentence was processed before the response was given. Despite the different protocols used, these results are consistent with <xref ref-type="bibr" rid="ref70">Winn and Teece (2021</xref>, <xref ref-type="bibr" rid="ref71">2022)</xref>, in which participants were asked to repeat the whole sentence back to the experimenter. These findings suggest that, despite being instructed to track individual words in the sentence, listeners may have used the whole sentence context to make decisions regarding the presence or absence of the phonological contrast. As an anecdotal report, during the experiment session, several participants reported attempting to &#x2018;repair&#x2019; the nonword or &#x2018;figure out the correct word&#x2019;.</p>
</sec>
<sec id="sec28">
<label>4.2.2.</label>
<title>Differences in pupil responses due to speech degradation</title>
<p>The results in the vocoded condition support the argument that the pupil response reflects processing at the lexical and sentence level. The high accuracy scores for identifying the presence of a nonword within a sentence shows that participants were able to detect the phonological alteration despite the vocoded speech, indicating that phonological discrimination was occurring at a low-level. However, the lack of difference in the pupil parameters between sentences with and without phonologically modified words suggests that the pupil response captured the increase in cognitive processing required to understand the vocoded sentences, rather than the detection of a phonological contrast. Furthermore, the trend of interpreting nonwords as real words in the performance results suggests that the participants were likely attempting phonological restoration throughout the vocoded experiment. In other words, it is possible that the absence of differences in the pupillary response between stimuli with and without phonological contrast reflects the registration of a different type of response besides the detection of the contrast. The physiological mechanisms underlying pupil dilation are also involved in the process of decision-making (<xref ref-type="bibr" rid="ref29">Kafkas and Montaldi, 2018</xref>). As such, when decisions require greater cognitive processing and memory demand, pupil size increases. It is important to note that the signal restoration of the vocoded stimuli comes at a cost even for real words (<xref ref-type="bibr" rid="ref69">Winn et al., 2015</xref>; <xref ref-type="bibr" rid="ref4">Balling et al., 2017</xref>). This global response, which is related to the processing of the auditory stimulus as a whole, may be more pronounced than the response to the detection of the phonological contrast, thereby masking its signal.</p>
<p>Another possible explanation for the lack of difference in pupil metrics between stimuli with or without phonological contrast is that errors in detecting the contrast may have occurred at different moments in the stimuli presentation. Since participants were not instructed about the possible location of the phonological contrast, it was not possible to track the exact moment when errors occurred. As a result, the effect of the phonological contrast may have been distributed across the time series average (<xref ref-type="bibr" rid="ref70">Winn and Teece, 2021</xref>), which could not be tracked by our analysis.</p>
</sec>
</sec>
<sec id="sec29">
<label>4.3.</label>
<title>Study limitations</title>
<p>As in any forced-choice task, the methodology used opens the possibility for participants to &#x2018;guess&#x2019; the responses. This effect can be considered during the signal detection analysis of the performance but might influence the amplitude and morphology of the pupil responses. Responses based on chance, with low or no processing of the stimulus, can contaminate the time-series average during pupil analysis and effects be missed. Additionally, pupil responses are modulated by the sympathetic nervous system, which can be influenced by a range of factors such as engagement, fatigue, or self-perception of performance (<xref ref-type="bibr" rid="ref23">Hopstaken et al., 2015</xref>; <xref ref-type="bibr" rid="ref75">Zekveld et al., 2018</xref>; <xref ref-type="bibr" rid="ref43">McGarrigle et al., 2021</xref>). Although we attempted to counterbalance for fatigue effects by randomizing the order of the presentation of the tasks and stimuli, it is possible that the low scores in speech perception, achieved in the vocoded speech condition, have led to disengagement from the task, which would be reflected in an overall reduction of pupil dilation (<xref ref-type="bibr" rid="ref23">Hopstaken et al., 2015</xref>; <xref ref-type="bibr" rid="ref48">Ohlenforst et al., 2017</xref>).</p>
<p>It is worth noting that the characteristics of the phonological contrast in the phonological discrimination task and the detection task were not the same. While in the phonological discrimination task the contrast was defined by a change in one production feature, multiple production features were modified in the detection task. In terms of acoustic differences, this might mean that the acoustic degradation would affect different aspects of the phonological perception in each task (<xref ref-type="bibr" rid="ref73">Xu et al., 2005</xref>; <xref ref-type="bibr" rid="ref77">Zhou et al., 2010</xref>). Furthermore, it raises the possibility that pupil dilation would be sensitive to the distance between the expected stimulus and the contrast, as previously observed for non-speech stimuli (<xref ref-type="bibr" rid="ref38">Liao et al., 2016</xref>; <xref ref-type="bibr" rid="ref70">Winn and Teece, 2021</xref>).</p>
<p>Furthermore, participants in our study were exposed to only a brief practice session with the vocoded stimuli. While this training was conducted similarly as previous studies (<xref ref-type="bibr" rid="ref22">Hervais-Adelman et al., 2011</xref>; <xref ref-type="bibr" rid="ref69">Winn et al., 2015</xref>), adapting to vocoded speech may require longer practice (<xref ref-type="bibr" rid="ref22">Hervais-Adelman et al., 2011</xref>). Thus, it is possible that the immediate results produced by the spectral degradation would not have been sustained in a longer task, which would have induced phonological accommodation and potentially have led to better speech recognition (<xref ref-type="bibr" rid="ref26">Jesse, 2021</xref>).</p>
</sec>
</sec>
<sec sec-type="conclusions" id="sec30">
<label>5.</label>
<title>Conclusion</title>
<p>The present study offers insights on the pupil temporal dynamics from the processing of phonological information. The lack of differences in the pupil dilation to the presence of a phonological contrast in lexically decontextualized nonwords (phonological discrimination task) could suggest that pupil dynamics are more sensitive to higher-level cognitive and language processing, such as lexical categorization, rather than lower-level phonological categorization. Nevertheless, the pupil response to phonological contrasts may overlap with responses to acoustic differences, indicating that pupil dilation may reflect the processing of dynamic acoustic characteristics of speech.</p>
<p>In the presence of lexical/contextual information (detection task), phonological contrasts led to higher pupil dilation. This increase in pupil dilation could be attributed either to an increase in cognitive demand for processing a sentence containing a nonword, or to a response to the detection of the phonological contrast. The inability to distinguish between high and low-level processing in the detection task stemmed from participants&#x2019; apparent reliance on sentence context when making decisions about phonological contrasts, despite explicit instructions to track individual words.</p>
<p>These findings bring important considerations to the use of pupillometry when investigating phonological perception in the presence of lexical meaning or acoustic variability. Further research is needed to gain a comprehensive understanding of the intricate interactions among acoustic, phonological, and linguistic factors and their influence on pupil dynamics during speech perception.</p>
</sec>
<sec sec-type="data-availability" id="sec31">
<title>Data availability statement</title>
<p>The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.</p>
</sec>
<sec sec-type="ethics-statement" id="sec32">
<title>Ethics statement</title>
<p>The requirement of ethical approval was waived by the Scientific Ethics Committees, Center for Regional Development, Capital Region - Denmark for the studies involving humans because the Scientific Ethics Committees, Center for Regional Development, Capital Region - Denmark considered it to be a study in the social domain. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.</p>
</sec>
<sec sec-type="author-contributions" id="sec33">
<title>Author contributions</title>
<p>JC, FP, EN, KF, and BL: conceptualization and design. JC: data collection and statistical analysis and writing&#x2014;original draft preparation. FP, EN, KF, and BL: writing&#x2014;review and editing. All authors approved the submitted version.</p>
</sec>
</body>
<back>
<sec sec-type="funding-information" id="sec34">
<title>Funding</title>
<p>The project from which this study originated has received funding from the European Union&#x2019;s Horizon 2020 research and innovation program under the Marie Sklodowska-Curie Grant Agreement n. 860755.</p>
</sec>
<ack>
<p>The authors would like to thank Prof Andrea Pittman, from the Dept of Communication Sciences and Disorders, School of Health and Rehabilitation Sciences, MGH Institute of Health Professions, Boston, MA, for the valuable input on the paradigm&#x2019;s design; and to thank Yue Zhang and Pierre-Yves Hassan, from Oticon A/S, for expert technical support and helpful discussions on pupillometry analysis.</p>
</ack>
<sec sec-type="COI-statement" id="sec35">
<title>Conflict of interest</title>
<p>JC, FP, EN, and KF were employed by the company Oticon A/S, Sm&#x00F8;rum, Denmark, while this study was conducted.</p>
<p>The remaining author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="sec100" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="ref1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Abada</surname> <given-names>S. H.</given-names></name> <name><surname>Baum</surname> <given-names>S. R.</given-names></name> <name><surname>Titone</surname> <given-names>D.</given-names></name></person-group> (<year>2008</year>). <article-title>The effects of contextual strength on phonetic identification in younger and older listeners</article-title>. <source>Exp. Aging Res.</source> <volume>34</volume>, <fpage>232</fpage>&#x2013;<lpage>250</lpage>. doi: <pub-id pub-id-type="doi">10.1080/03610730802070183</pub-id>, PMID: <pub-id pub-id-type="pmid">18568981</pub-id></citation>
</ref>
<ref id="ref2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Allen</surname> <given-names>J. S.</given-names></name> <name><surname>Miller</surname> <given-names>J. L.</given-names></name> <name><surname>DeSteno</surname> <given-names>D.</given-names></name></person-group> (<year>2003</year>). <article-title>Individual talker differences in voice-onset-time</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>113</volume>, <fpage>544</fpage>&#x2013;<lpage>552</lpage>. doi: <pub-id pub-id-type="doi">10.1121/1.1528172</pub-id>, PMID: <pub-id pub-id-type="pmid">12558290</pub-id></citation>
</ref>
<ref id="ref3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bala</surname> <given-names>A. D. S.</given-names></name> <name><surname>Whitchurch</surname> <given-names>E. A.</given-names></name> <name><surname>Takahashi</surname> <given-names>T. T.</given-names></name></person-group> (<year>2020</year>). <article-title>Human auditory detection and discrimination measured with the pupil dilation response</article-title>. <source>J. Assoc. Res. Otolaryngol.</source> <volume>21</volume>, <fpage>43</fpage>&#x2013;<lpage>59</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s10162-019-00739-x</pub-id>, PMID: <pub-id pub-id-type="pmid">31792632</pub-id></citation>
</ref>
<ref id="ref4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Balling</surname> <given-names>L. W.</given-names></name> <name><surname>Morris</surname> <given-names>D. J.</given-names></name> <name><surname>T&#x00F8;ndering</surname> <given-names>J.</given-names></name></person-group> (<year>2017</year>). <article-title>Investigating lexical competition and the cost of phonemic restoration</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>142</volume>, <fpage>3603</fpage>&#x2013;<lpage>3612</lpage>. doi: <pub-id pub-id-type="doi">10.1121/1.5017603</pub-id>, PMID: <pub-id pub-id-type="pmid">29289097</pub-id></citation>
</ref>
<ref id="ref5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bleses</surname> <given-names>D.</given-names></name> <name><surname>Vach</surname> <given-names>W.</given-names></name> <name><surname>Slott</surname> <given-names>M.</given-names></name> <name><surname>Wehberg</surname> <given-names>S.</given-names></name> <name><surname>Thomsen</surname> <given-names>P.</given-names></name> <name><surname>Madsen</surname> <given-names>T. O.</given-names></name> <etal/></person-group>. (<year>2008a</year>). <article-title>Early vocabulary development in Danish and other languages: a CDI-based comparison</article-title>. <source>J. Child Lang.</source> <volume>35</volume>, <fpage>619</fpage>&#x2013;<lpage>650</lpage>. doi: <pub-id pub-id-type="doi">10.1017/S0305000908008714</pub-id>, PMID: <pub-id pub-id-type="pmid">18588717</pub-id></citation>
</ref>
<ref id="ref6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bleses</surname> <given-names>D.</given-names></name> <name><surname>Vach</surname> <given-names>W.</given-names></name> <name><surname>Slott</surname> <given-names>M.</given-names></name> <name><surname>Wehberg</surname> <given-names>S.</given-names></name> <name><surname>Thomsen</surname> <given-names>P.</given-names></name> <name><surname>Madsen</surname> <given-names>T. O.</given-names></name> <etal/></person-group>. (<year>2008b</year>). <article-title>The Danish communicative developmental inventories: validity and main developmental trends</article-title>. <source>J. Child Lang.</source> <volume>35</volume>, <fpage>651</fpage>&#x2013;<lpage>669</lpage>. doi: <pub-id pub-id-type="doi">10.1017/S0305000907008574</pub-id>, PMID: <pub-id pub-id-type="pmid">18588718</pub-id></citation>
</ref>
<ref id="ref7">
<citation citation-type="other"><person-group person-group-type="author"><name><surname>Boersma</surname> <given-names>P.</given-names></name> <name><surname>Weenink</surname> <given-names>D.</given-names></name></person-group> (<year>1992</year>). <source>Praat: doing phonetics by computer</source>. <comment>Available at: </comment><ext-link xlink:href="https://www.praat.org" ext-link-type="uri">https://www.praat.org</ext-link> (Accessed January 2, 2022).</citation>
</ref>
<ref id="ref8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Borsky</surname> <given-names>S.</given-names></name> <name><surname>Tuller</surname> <given-names>B.</given-names></name> <name><surname>Shapiro</surname> <given-names>L. P.</given-names></name></person-group> (<year>1998</year>). <article-title>&#x201C;How to milk a coat:&#x201D; the effects of semantic and acoustic information on phoneme categorization</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>103</volume>, <fpage>2670</fpage>&#x2013;<lpage>2676</lpage>. doi: <pub-id pub-id-type="doi">10.1121/1.422787</pub-id>, PMID: <pub-id pub-id-type="pmid">9604360</pub-id></citation>
</ref>
<ref id="ref9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Burg</surname> <given-names>E. A.</given-names></name> <name><surname>Thakkar</surname> <given-names>T.</given-names></name> <name><surname>Fields</surname> <given-names>T.</given-names></name> <name><surname>Misurelli</surname> <given-names>S. M.</given-names></name> <name><surname>Kuchinsky</surname> <given-names>S. E.</given-names></name> <name><surname>Roche</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2021</year>). <article-title>Systematic comparison of trial exclusion criteria for Pupillometry data analysis in individuals with single-sided deafness and Normal hearing</article-title>. <source>Trends Hear.</source> <volume>25</volume>:<fpage>23312165211013256</fpage>. doi: <pub-id pub-id-type="doi">10.1177/23312165211013256</pub-id>, PMID: <pub-id pub-id-type="pmid">34024219</pub-id></citation>
</ref>
<ref id="ref10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Casserly</surname> <given-names>E. D.</given-names></name> <name><surname>Pisoni</surname> <given-names>D. B.</given-names></name></person-group> (<year>2010</year>). <article-title>Speech perception and production</article-title>. <source>Wiley Interdiscip. Rev. Cogn. Sci.</source> <volume>1</volume>, <fpage>629</fpage>&#x2013;<lpage>647</lpage>. doi: <pub-id pub-id-type="doi">10.1002/wcs.63</pub-id>, PMID: <pub-id pub-id-type="pmid">23946864</pub-id></citation>
</ref>
<ref id="ref11">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Christiansen</surname> <given-names>T. U.</given-names></name> <name><surname>Henrichsen</surname> <given-names>P. J.</given-names></name></person-group> (<year>2011</year>). <source>Objective evaluation of consonant-vowel pairs produced by native speakers of Danish</source>. <publisher-name>European Acoustics Association, EAA</publisher-name>. <publisher-loc>Madrid</publisher-loc></citation>
</ref>
<ref id="ref12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clarke</surname> <given-names>C. M.</given-names></name> <name><surname>Garrett</surname> <given-names>M. F.</given-names></name></person-group> (<year>2004</year>). <article-title>Rapid adaptation to foreign-accented English</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>116</volume>, <fpage>3647</fpage>&#x2013;<lpage>3658</lpage>. doi: <pub-id pub-id-type="doi">10.1121/1.1815131</pub-id>, PMID: <pub-id pub-id-type="pmid">15658715</pub-id></citation>
</ref>
<ref id="ref13">
<citation citation-type="journal"><person-group person-group-type="author">
<name><surname>Coleman</surname> <given-names>J.</given-names></name>
</person-group> (<year>2003</year>). <article-title>Discovering the acoustic correlates of phonological contrasts</article-title>. <source>J. Phon.</source> <volume>31</volume>, <fpage>351</fpage>&#x2013;<lpage>372</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.wocn.2003.10.001</pub-id></citation>
</ref>
<ref id="ref14">
<citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Dan</surname> <given-names>E. L.</given-names></name> <name><surname>D&#x00EE;n&#x015F;oreanu</surname> <given-names>M.</given-names></name> <name><surname>Mure&#x015F;an</surname> <given-names>R. C.</given-names></name></person-group> (<year>2020</year>). <article-title>Accuracy of six interpolation methods applied on pupil diameter data</article-title>. <conf-name>In 2020 IEEE international conference on automation, quality and testing, robotics (AQTR)</conf-name>, <fpage>1</fpage>&#x2013;<lpage>5</lpage>. <publisher-loc>IEEE</publisher-loc>. <conf-loc>Cluj-Napoca, Romania</conf-loc></citation>
</ref>
<ref id="ref15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dorman</surname> <given-names>M. F.</given-names></name> <name><surname>Loizou</surname> <given-names>P. C.</given-names></name> <name><surname>Rainey</surname> <given-names>D.</given-names></name></person-group> (<year>1997</year>). <article-title>Speech intelligibility as a function of the number of channels of stimulation for signal processors using sine-wave and noise-band outputs</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>102</volume>, <fpage>2403</fpage>&#x2013;<lpage>2411</lpage>. doi: <pub-id pub-id-type="doi">10.1121/1.419603</pub-id>, PMID: <pub-id pub-id-type="pmid">9348698</pub-id></citation>
</ref>
<ref id="ref16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eckstein</surname> <given-names>M. K.</given-names></name> <name><surname>Guerra-Carrillo</surname> <given-names>B.</given-names></name> <name><surname>Miller Singley</surname> <given-names>A. T.</given-names></name> <name><surname>Bunge</surname> <given-names>S. A.</given-names></name></person-group> (<year>2017</year>). <article-title>Beyond eye gaze: what else can eyetracking reveal about cognition and cognitive development?</article-title> <source>Dev. Cogn. Neurosci.</source> <volume>25</volume>, <fpage>69</fpage>&#x2013;<lpage>91</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.dcn.2016.11.001</pub-id>, PMID: <pub-id pub-id-type="pmid">27908561</pub-id></citation>
</ref>
<ref id="ref17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Elberling</surname> <given-names>C.</given-names></name> <name><surname>Ludvigsen</surname> <given-names>C.</given-names></name> <name><surname>Lyregaard</surname> <given-names>P. E.</given-names></name></person-group> (<year>1989</year>). <article-title>Dantale: a new Danish speech material</article-title>. <source>Scand. Audiol.</source> <volume>18</volume>, <fpage>169</fpage>&#x2013;<lpage>175</lpage>. doi: <pub-id pub-id-type="doi">10.3109/01050398909070742</pub-id>, PMID: <pub-id pub-id-type="pmid">2814331</pub-id></citation>
</ref>
<ref id="ref18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Friesen</surname> <given-names>L. M.</given-names></name> <name><surname>Shannon</surname> <given-names>R. V.</given-names></name> <name><surname>Baskent</surname> <given-names>D.</given-names></name> <name><surname>Wang</surname> <given-names>X.</given-names></name></person-group> (<year>2001</year>). <article-title>Speech recognition in noise as a function of the number of spectral channels: comparison of acoustic hearing and cochlear implants</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>110</volume>, <fpage>1150</fpage>&#x2013;<lpage>1163</lpage>. doi: <pub-id pub-id-type="doi">10.1121/1.1381538</pub-id>, PMID: <pub-id pub-id-type="pmid">11519582</pub-id></citation>
</ref>
<ref id="ref19">
<citation citation-type="journal"><person-group person-group-type="author">
<name><surname>Ganong</surname> <given-names>W. F.</given-names></name>
</person-group> (<year>1980</year>). <article-title>Phonetic categorization in auditory word perception</article-title>. <source>J. Exp. Psychol. Hum. Percept. Perform.</source> <volume>6</volume>, <fpage>110</fpage>&#x2013;<lpage>125</lpage>. doi: <pub-id pub-id-type="doi">10.1037//0096-1523.6.1.110</pub-id></citation>
</ref>
<ref id="ref20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gianakas</surname> <given-names>S. P.</given-names></name> <name><surname>Winn</surname> <given-names>M. B.</given-names></name></person-group> (<year>2019</year>). <article-title>Lexical bias in word recognition by cochlear implant listeners</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>146</volume>, <fpage>3373</fpage>&#x2013;<lpage>3383</lpage>. doi: <pub-id pub-id-type="doi">10.1121/1.5132938</pub-id>, PMID: <pub-id pub-id-type="pmid">31795696</pub-id></citation>
</ref>
<ref id="ref21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goupell</surname> <given-names>M. J.</given-names></name> <name><surname>Draves</surname> <given-names>G. T.</given-names></name> <name><surname>Litovsky</surname> <given-names>R. Y.</given-names></name></person-group> (<year>2020</year>). <article-title>Recognition of vocoded words and sentences in quiet and multi-talker babble with children and adults</article-title>. <source>PLoS One</source> <volume>15</volume>:<fpage>e0244632</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.pone.0244632</pub-id>, PMID: <pub-id pub-id-type="pmid">33373427</pub-id></citation>
</ref>
<ref id="ref22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hervais-Adelman</surname> <given-names>A. G.</given-names></name> <name><surname>Davis</surname> <given-names>M. H.</given-names></name> <name><surname>Johnsrude</surname> <given-names>I. S.</given-names></name> <name><surname>Taylor</surname> <given-names>K. J.</given-names></name> <name><surname>Carlyon</surname> <given-names>R. P.</given-names></name></person-group> (<year>2011</year>). <article-title>Generalization of perceptual learning of vocoded speech</article-title>. <source>J. Exp. Psychol. Hum. Percept. Perform.</source> <volume>37</volume>, <fpage>283</fpage>&#x2013;<lpage>295</lpage>. doi: <pub-id pub-id-type="doi">10.1037/a0020772</pub-id></citation>
</ref>
<ref id="ref23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hopstaken</surname> <given-names>J. F.</given-names></name> <name><surname>van der Linden</surname> <given-names>D.</given-names></name> <name><surname>Bakker</surname> <given-names>A. B.</given-names></name> <name><surname>Kompier</surname> <given-names>M. A. J.</given-names></name></person-group> (<year>2015</year>). <article-title>The window of my eyes: task disengagement and mental fatigue covary with pupil dynamics</article-title>. <source>Biol. Psychol.</source> <volume>110</volume>, <fpage>100</fpage>&#x2013;<lpage>106</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.biopsycho.2015.06.013</pub-id>, PMID: <pub-id pub-id-type="pmid">26196899</pub-id></citation>
</ref>
<ref id="ref24">
<citation citation-type="journal"><person-group person-group-type="author">
<name><surname>Iverson</surname> <given-names>P.</given-names></name>
</person-group> (<year>2003</year>). <article-title>Evaluating the function of phonetic perceptual phenomena within speech recognition: an examination of the perception of /d/&#x2212;/t/ by adult cochlear implant users</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>113</volume>, <fpage>1056</fpage>&#x2013;<lpage>1064</lpage>. doi: <pub-id pub-id-type="doi">10.1121/1.1531985</pub-id>, PMID: <pub-id pub-id-type="pmid">12597198</pub-id></citation>
</ref>
<ref id="ref25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jahn</surname> <given-names>K. N.</given-names></name> <name><surname>DiNino</surname> <given-names>M.</given-names></name> <name><surname>Arenberg</surname> <given-names>J. G.</given-names></name></person-group> (<year>2019</year>). <article-title>Reducing simulated channel interaction reveals differences in phoneme identification between children and adults with Normal hearing</article-title>. <source>Ear Hear.</source> <volume>40</volume>, <fpage>295</fpage>&#x2013;<lpage>311</lpage>. doi: <pub-id pub-id-type="doi">10.1097/AUD.0000000000000615</pub-id>, PMID: <pub-id pub-id-type="pmid">29927780</pub-id></citation>
</ref>
<ref id="ref26">
<citation citation-type="journal"><person-group person-group-type="author">
<name><surname>Jesse</surname> <given-names>A.</given-names></name>
</person-group> (<year>2021</year>). <article-title>Sentence context guides phonetic retuning to speaker idiosyncrasies</article-title>. <source>J. Exp. Psychol. Learn. Mem. Cogn.</source> <volume>47</volume>, <fpage>184</fpage>&#x2013;<lpage>194</lpage>. doi: <pub-id pub-id-type="doi">10.1037/xlm0000805</pub-id>, PMID: <pub-id pub-id-type="pmid">31855000</pub-id></citation>
</ref>
<ref id="ref27">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Johnsrude</surname> <given-names>I. S.</given-names></name> <name><surname>Rodd</surname> <given-names>J. M.</given-names></name></person-group> (<year>2016</year>). &#x201C;<article-title>Chapter 40 - factors that increase processing demands when listening to speech</article-title>&#x201D; in <source>Neurobiology of language</source>. eds. <person-group person-group-type="editor"><name><surname>Hickok</surname> <given-names>G.</given-names></name> <name><surname>Small</surname> <given-names>S. L.</given-names></name></person-group> (<publisher-loc>San Diego</publisher-loc>: <publisher-name>Academic Press</publisher-name>), <fpage>491</fpage>&#x2013;<lpage>502</lpage>.</citation>
</ref>
<ref id="ref28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kadem</surname> <given-names>M.</given-names></name> <name><surname>Herrmann</surname> <given-names>B.</given-names></name> <name><surname>Rodd</surname> <given-names>J. M.</given-names></name> <name><surname>Johnsrude</surname> <given-names>I. S.</given-names></name></person-group> (<year>2020</year>). <article-title>Pupil dilation is sensitive to semantic ambiguity and acoustic degradation</article-title>. <source>Trends Hear.</source> <volume>24</volume>:<fpage>2331216520964068</fpage>. doi: <pub-id pub-id-type="doi">10.1177/2331216520964068</pub-id>, PMID: <pub-id pub-id-type="pmid">33124518</pub-id></citation>
</ref>
<ref id="ref29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kafkas</surname> <given-names>A.</given-names></name> <name><surname>Montaldi</surname> <given-names>D.</given-names></name></person-group> (<year>2018</year>). <article-title>How do memory systems detect and respond to novelty?</article-title> <source>Neurosci. Lett.</source> <volume>680</volume>, <fpage>60</fpage>&#x2013;<lpage>68</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.neulet.2018.01.053</pub-id>, PMID: <pub-id pub-id-type="pmid">29408218</pub-id></citation>
</ref>
<ref id="ref30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kamp</surname> <given-names>S.-M.</given-names></name> <name><surname>Donchin</surname> <given-names>E.</given-names></name></person-group> (<year>2015</year>). <article-title>ERP and pupil responses to deviance in an oddball paradigm</article-title>. <source>Psychophysiology</source> <volume>52</volume>, <fpage>460</fpage>&#x2013;<lpage>471</lpage>. doi: <pub-id pub-id-type="doi">10.1111/psyp.12378</pub-id>, PMID: <pub-id pub-id-type="pmid">25369764</pub-id></citation>
</ref>
<ref id="ref31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kinzuka</surname> <given-names>Y.</given-names></name> <name><surname>Minami</surname> <given-names>T.</given-names></name> <name><surname>Nakauchi</surname> <given-names>S.</given-names></name></person-group> (<year>2020</year>). <article-title>Pupil dilation reflects English /l//r/ discrimination ability for Japanese learners of English: a pilot study</article-title>. <source>Sci. Rep.</source> <volume>10</volume>:<fpage>8052</fpage>. doi: <pub-id pub-id-type="doi">10.1038/s41598-020-65020-1</pub-id>, PMID: <pub-id pub-id-type="pmid">32415182</pub-id></citation>
</ref>
<ref id="ref32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Koelewijn</surname> <given-names>T.</given-names></name> <name><surname>Versfeld</surname> <given-names>N. J.</given-names></name> <name><surname>Kramer</surname> <given-names>S. E.</given-names></name></person-group> (<year>2017</year>). <article-title>Effects of attention on the speech reception threshold and pupil response of people with impaired and normal hearing</article-title>. <source>Hear. Res.</source> <volume>354</volume>, <fpage>56</fpage>&#x2013;<lpage>63</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.heares.2017.08.006</pub-id>, PMID: <pub-id pub-id-type="pmid">28869841</pub-id></citation>
</ref>
<ref id="ref33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kramer</surname> <given-names>S. E.</given-names></name> <name><surname>Lorens</surname> <given-names>A.</given-names></name> <name><surname>Coninx</surname> <given-names>F.</given-names></name> <name><surname>Zekveld</surname> <given-names>A. A.</given-names></name> <name><surname>Piotrowska</surname> <given-names>A.</given-names></name> <name><surname>Skarzynski</surname> <given-names>H.</given-names></name></person-group> (<year>2013</year>). <article-title>Processing load during listening: the influence of task characteristics on the pupil response</article-title>. <source>Lang. Cogn. Process.</source> <volume>28</volume>, <fpage>426</fpage>&#x2013;<lpage>442</lpage>. doi: <pub-id pub-id-type="doi">10.1080/01690965.2011.642267</pub-id></citation>
</ref>
<ref id="ref34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kret</surname> <given-names>M. E.</given-names></name> <name><surname>Sjak-Shie</surname> <given-names>E. E.</given-names></name></person-group> (<year>2019</year>). <article-title>Preprocessing pupil size data: guidelines and code</article-title>. <source>Behav. Res. Methods</source> <volume>51</volume>, <fpage>1336</fpage>&#x2013;<lpage>1342</lpage>. doi: <pub-id pub-id-type="doi">10.3758/s13428-018-1075-y</pub-id></citation>
</ref>
<ref id="ref35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kuhl</surname> <given-names>P. K.</given-names></name> <name><surname>Conboy</surname> <given-names>B. T.</given-names></name> <name><surname>Coffey-Corina</surname> <given-names>S.</given-names></name> <name><surname>Padden</surname> <given-names>D.</given-names></name> <name><surname>Rivera-Gaxiola</surname> <given-names>M.</given-names></name> <name><surname>Nelson</surname> <given-names>T.</given-names></name></person-group> (<year>2008</year>). <article-title>Phonetic learning as a pathway to language: new data and native language magnet theory expanded (NLM-e)</article-title>. <source>Philos. Trans. R. Soc. Lond. B Biol. Sci.</source> <volume>363</volume>, <fpage>979</fpage>&#x2013;<lpage>1000</lpage>. doi: <pub-id pub-id-type="doi">10.1098/rstb.2007.2154</pub-id></citation>
</ref>
<ref id="ref36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kutas</surname> <given-names>M.</given-names></name> <name><surname>Hillyard</surname> <given-names>S. A.</given-names></name></person-group> (<year>1984</year>). <article-title>Brain potentials during reading reflect word expectancy and semantic association</article-title>. <source>Nature</source> <volume>307</volume>, <fpage>161</fpage>&#x2013;<lpage>163</lpage>. doi: <pub-id pub-id-type="doi">10.1038/307161a0</pub-id>, PMID: <pub-id pub-id-type="pmid">6690995</pub-id></citation>
</ref>
<ref id="ref37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lewis</surname> <given-names>G. A.</given-names></name> <name><surname>Bidelman</surname> <given-names>G. M.</given-names></name></person-group> (<year>2020</year>). <article-title>Autonomic nervous system correlates of speech categorization revealed through Pupillometry</article-title>. <source>Front. Neurosci.</source> <volume>13</volume>, <fpage>1</fpage>&#x2013;<lpage>10</lpage>. doi: <pub-id pub-id-type="doi">10.3389/fnins.2019.01418</pub-id>, PMID: <pub-id pub-id-type="pmid">31998068</pub-id></citation>
</ref>
<ref id="ref38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liao</surname> <given-names>H.-I.</given-names></name> <name><surname>Yoneya</surname> <given-names>M.</given-names></name> <name><surname>Kidani</surname> <given-names>S.</given-names></name> <name><surname>Kashino</surname> <given-names>M.</given-names></name> <name><surname>Furukawa</surname> <given-names>S.</given-names></name></person-group> (<year>2016</year>). <article-title>Human pupillary dilation response to deviant auditory stimuli: effects of stimulus properties and voluntary attention</article-title>. <source>Front. Neurosci.</source> <volume>10</volume>:<fpage>43</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fnins.2016.00043</pub-id>, PMID: <pub-id pub-id-type="pmid">26924959</pub-id></citation>
</ref>
<ref id="ref39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liberman</surname> <given-names>A. M.</given-names></name> <name><surname>Cooper</surname> <given-names>F. S.</given-names></name> <name><surname>Shankweiler</surname> <given-names>D. P.</given-names></name> <name><surname>Studdert-Kennedy</surname> <given-names>M.</given-names></name></person-group> (<year>1967</year>). <article-title>Perception of the speech code</article-title>. <source>Psychol. Rev.</source> <volume>74</volume>, <fpage>431</fpage>&#x2013;<lpage>461</lpage>. doi: <pub-id pub-id-type="doi">10.1037/h0020279</pub-id></citation>
</ref>
<ref id="ref40">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Macmillan</surname> <given-names>N. A.</given-names></name> <name><surname>Creelman</surname> <given-names>C. D.</given-names></name></person-group> (<year>2005</year>). <source>Detection theory: a user&#x2019;s guide</source>, <edition>2nd ed</edition>. <publisher-loc>Mahwah, NJ, US</publisher-loc>: <publisher-name>Lawrence Erlbaum Associates Publishers</publisher-name>.</citation>
</ref>
<ref id="ref41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Math&#x00F4;t</surname> <given-names>S.</given-names></name> <name><surname>Schreij</surname> <given-names>D.</given-names></name> <name><surname>Theeuwes</surname> <given-names>J.</given-names></name></person-group> (<year>2012</year>). <article-title>OpenSesame: an open-source, graphical experiment builder for the social sciences</article-title>. <source>Behav. Res. Methods</source> <volume>44</volume>, <fpage>314</fpage>&#x2013;<lpage>324</lpage>. doi: <pub-id pub-id-type="doi">10.3758/s13428-011-0168-7</pub-id>, PMID: <pub-id pub-id-type="pmid">22083660</pub-id></citation>
</ref>
<ref id="ref42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Math&#x00F4;t</surname> <given-names>S.</given-names></name> <name><surname>Vilotijevi&#x0107;</surname> <given-names>A.</given-names></name></person-group> (<year>2022</year>). <article-title>Methods in cognitive pupillometry: design, preprocessing, and statistical analysis</article-title>. <source>Behav. Res. Methods</source> <volume>55</volume>, <fpage>3055</fpage>&#x2013;<lpage>3077</lpage>. doi: <pub-id pub-id-type="doi">10.3758/s13428-022-01957-7</pub-id>, PMID: <pub-id pub-id-type="pmid">36028608</pub-id></citation>
</ref>
<ref id="ref43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>McGarrigle</surname> <given-names>R.</given-names></name> <name><surname>Rakusen</surname> <given-names>L.</given-names></name> <name><surname>Mattys</surname> <given-names>S.</given-names></name></person-group> (<year>2021</year>). <article-title>Effortful listening under the microscope: examining relations between pupillometric and subjective markers of effort and tiredness from listening</article-title>. <source>Psychophysiology</source> <volume>58</volume>:<fpage>e13703</fpage>. doi: <pub-id pub-id-type="doi">10.1111/psyp.13703</pub-id>, PMID: <pub-id pub-id-type="pmid">33031584</pub-id></citation>
</ref>
<ref id="ref44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Micula</surname> <given-names>A.</given-names></name> <name><surname>R&#x00F6;nnberg</surname> <given-names>J.</given-names></name> <name><surname>Ksi&#x0105;&#x017C;ek</surname> <given-names>P.</given-names></name> <name><surname>Murmu Nielsen</surname> <given-names>R.</given-names></name> <name><surname>Wendt</surname> <given-names>D.</given-names></name> <name><surname>Fiedler</surname> <given-names>L.</given-names></name> <etal/></person-group>. (<year>2022</year>). <article-title>A glimpse of memory through the eyes: pupillary responses measured during encoding reflect the likelihood of subsequent memory recall in an auditory free recall test</article-title>. <source>Trends Hear.</source> <volume>26</volume>:<fpage>233121652211305</fpage>. doi: <pub-id pub-id-type="doi">10.1177/23312165221130581</pub-id>, PMID: <pub-id pub-id-type="pmid">36305085</pub-id></citation>
</ref>
<ref id="ref45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>N&#x00E4;&#x00E4;t&#x00E4;nen</surname> <given-names>R.</given-names></name> <name><surname>Paavilainen</surname> <given-names>P.</given-names></name> <name><surname>Rinne</surname> <given-names>T.</given-names></name> <name><surname>Alho</surname> <given-names>K.</given-names></name></person-group> (<year>2007</year>). <article-title>The mismatch negativity (MMN) in basic research of central auditory processing: a review</article-title>. <source>Clin. Neurophysiol.</source> <volume>118</volume>, <fpage>2544</fpage>&#x2013;<lpage>2590</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.clinph.2007.04.026</pub-id>, PMID: <pub-id pub-id-type="pmid">17931964</pub-id></citation>
</ref>
<ref id="ref46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nielsen</surname> <given-names>J. B.</given-names></name> <name><surname>Dau</surname> <given-names>T.</given-names></name></person-group> (<year>2019</year>). <article-title>A Danish nonsense word corpus for phoneme recognition measurements</article-title>. <source>Acta. Acust. United Acust.</source> <volume>105</volume>, <fpage>183</fpage>&#x2013;<lpage>194</lpage>. doi: <pub-id pub-id-type="doi">10.3813/AAA.919299</pub-id></citation>
</ref>
<ref id="ref47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ohlenforst</surname> <given-names>B.</given-names></name> <name><surname>Wendt</surname> <given-names>D.</given-names></name> <name><surname>Kramer</surname> <given-names>S. E.</given-names></name> <name><surname>Naylor</surname> <given-names>G.</given-names></name> <name><surname>Zekveld</surname> <given-names>A. A.</given-names></name> <name><surname>Lunner</surname> <given-names>T.</given-names></name></person-group> (<year>2018</year>). <article-title>Impact of SNR, masker type and noise reduction processing on sentence recognition performance and listening effort as indicated by the pupil dilation response</article-title>. <source>Hear. Res.</source> <volume>365</volume>, <fpage>90</fpage>&#x2013;<lpage>99</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.heares.2018.05.003</pub-id>, PMID: <pub-id pub-id-type="pmid">29779607</pub-id></citation>
</ref>
<ref id="ref48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ohlenforst</surname> <given-names>B.</given-names></name> <name><surname>Zekveld</surname> <given-names>A. A.</given-names></name> <name><surname>Lunner</surname> <given-names>T.</given-names></name> <name><surname>Wendt</surname> <given-names>D.</given-names></name> <name><surname>Naylor</surname> <given-names>G.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>Impact of stimulus-related factors and hearing impairment on listening effort as indicated by pupil dilation</article-title>. <source>Hear. Res.</source> <volume>351</volume>, <fpage>68</fpage>&#x2013;<lpage>79</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.heares.2017.05.012</pub-id>, PMID: <pub-id pub-id-type="pmid">28622894</pub-id></citation>
</ref>
<ref id="ref49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Phatak</surname> <given-names>S. A.</given-names></name> <name><surname>Grant</surname> <given-names>K. W.</given-names></name></person-group> (<year>2014</year>). <article-title>Phoneme recognition in vocoded maskers by normal-hearing and aided hearing-impaired listeners</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>136</volume>, <fpage>859</fpage>&#x2013;<lpage>866</lpage>. doi: <pub-id pub-id-type="doi">10.1121/1.4889863</pub-id>, PMID: <pub-id pub-id-type="pmid">25096119</pub-id></citation>
</ref>
<ref id="ref9001">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pittman</surname> <given-names>A. L.</given-names></name> <name><surname>Schuett</surname> <given-names>B. C.</given-names></name></person-group> (<year>2013</year>). <article-title>Effects of Semantic and Acoustic Context on Nonword Detection in Children With Hearing Loss. Ear &#x0026; Hearing</article-title>. <volume>34</volume>, <fpage>213</fpage>&#x2013;<lpage>220</lpage>. doi: <pub-id pub-id-type="doi">10.1097/AUD.0b013e31826e5006</pub-id></citation>
</ref>
<ref id="ref50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pittman</surname> <given-names>A. L.</given-names></name> <name><surname>Stewart</surname> <given-names>E. C.</given-names></name> <name><surname>Odgear</surname> <given-names>I. S.</given-names></name> <name><surname>Willman</surname> <given-names>A. P.</given-names></name></person-group> (<year>2017</year>). <article-title>Detecting and learning new words: the impact of advancing age and hearing loss</article-title>. <source>Am. J. Audiol.</source> <volume>26</volume>, <fpage>318</fpage>&#x2013;<lpage>327</lpage>. doi: <pub-id pub-id-type="doi">10.1044/2017_AJA-17-0025</pub-id>, PMID: <pub-id pub-id-type="pmid">28834533</pub-id></citation>
</ref>
<ref id="ref51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Reese</surname> <given-names>H.</given-names></name> <name><surname>Reinisch</surname> <given-names>E.</given-names></name></person-group> (<year>2022</year>). <article-title>Cognitive load does not increase reliance on speaker information in phonetic categorization</article-title>. <source>JASA Express Lett.</source> <volume>2</volume>:<fpage>055203</fpage>. doi: <pub-id pub-id-type="doi">10.1121/10.0009895</pub-id></citation>
</ref>
<ref id="ref52">
<citation citation-type="journal"><person-group person-group-type="author">
<name><surname>Repp</surname> <given-names>B. H.</given-names></name>
</person-group> (<year>1982</year>). <article-title>Phonetic trading relations and context effects: new experimental evidence for a speech mode of perception</article-title>. <source>Psychol. Bull.</source> <volume>92</volume>, <fpage>81</fpage>&#x2013;<lpage>110</lpage>. doi: <pub-id pub-id-type="doi">10.1037/0033-2909.92.1.81</pub-id>, PMID: <pub-id pub-id-type="pmid">7134330</pub-id></citation>
</ref>
<ref id="ref53">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Romberg</surname> <given-names>A. R.</given-names></name> <name><surname>Saffran</surname> <given-names>J. R.</given-names></name></person-group> (<year>2010</year>). <article-title>Statistical learning and language acquisition</article-title>. <source>Wiley Interdiscip. Rev. Cogn. Sci.</source> <volume>1</volume>, <fpage>906</fpage>&#x2013;<lpage>914</lpage>. doi: <pub-id pub-id-type="doi">10.1002/wcs.78</pub-id>, PMID: <pub-id pub-id-type="pmid">21666883</pub-id></citation>
</ref>
<ref id="ref54">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Selezneva</surname> <given-names>E.</given-names></name> <name><surname>Brosch</surname> <given-names>M.</given-names></name> <name><surname>Rathi</surname> <given-names>S.</given-names></name> <name><surname>Vighneshvel</surname> <given-names>T.</given-names></name> <name><surname>Wetzel</surname> <given-names>N.</given-names></name></person-group> (<year>2021</year>). <article-title>Comparison of pupil dilation responses to unexpected sounds in monkeys and humans</article-title>. <source>Front. Psychol.</source> <volume>12</volume>:<fpage>754604</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fpsyg.2021.754604</pub-id>, PMID: <pub-id pub-id-type="pmid">35002851</pub-id></citation>
</ref>
<ref id="ref55">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Seropian</surname> <given-names>L.</given-names></name> <name><surname>Ferschneider</surname> <given-names>M.</given-names></name> <name><surname>Cholvy</surname> <given-names>F.</given-names></name> <name><surname>Micheyl</surname> <given-names>C.</given-names></name> <name><surname>Bidet-Caulet</surname> <given-names>A.</given-names></name> <name><surname>Moulin</surname> <given-names>A.</given-names></name></person-group> (<year>2022</year>). <article-title>Comparing methods of analysis in pupillometry: application to the assessment of listening effort in hearing-impaired patients</article-title>. <source>Heliyon</source> <volume>8</volume>:<fpage>e09631</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.heliyon.2022.e09631</pub-id>, PMID: <pub-id pub-id-type="pmid">35734572</pub-id></citation>
</ref>
<ref id="ref56">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stanislaw</surname> <given-names>H.</given-names></name> <name><surname>Todorov</surname> <given-names>N.</given-names></name></person-group> (<year>1999</year>). <article-title>Calculation of signal detection theory measures</article-title>. <source>Behav. Res. Methods Instrum. Comput.</source> <volume>31</volume>, <fpage>137</fpage>&#x2013;<lpage>149</lpage>. doi: <pub-id pub-id-type="doi">10.3758/BF03207704</pub-id></citation>
</ref>
<ref id="ref57">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Steinhauer</surname> <given-names>K.</given-names></name> <name><surname>Connolly</surname> <given-names>J. F.</given-names></name></person-group> (<year>2008</year>). &#x201C;<article-title>Event-related potentials in the study of language</article-title>&#x201D; in <source>Handbook of the Neuroscience of Language</source>. eds. <person-group person-group-type="editor"><name><surname>Stemmer</surname> <given-names>B.</given-names></name> <name><surname>Whitaker</surname> <given-names>H.</given-names></name></person-group> (<publisher-loc>Canada</publisher-loc>: <publisher-name>Elsevier</publisher-name>), <fpage>91</fpage>&#x2013;<lpage>104</lpage>.</citation>
</ref>
<ref id="ref58">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stenfelt</surname> <given-names>S.</given-names></name> <name><surname>R&#x00F6;nnberg</surname> <given-names>J.</given-names></name></person-group> (<year>2009</year>). <article-title>The signal-cognition interface: interactions between degraded auditory signals and cognitive processes</article-title>. <source>Scand. J. Psychol.</source> <volume>50</volume>, <fpage>385</fpage>&#x2013;<lpage>393</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.1467-9450.2009.00748.x</pub-id></citation>
</ref>
<ref id="ref59">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Sulas</surname> <given-names>E.</given-names></name> <name><surname>Hasan</surname> <given-names>P.-Y.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Patou</surname> <given-names>F.</given-names></name></person-group> (<year>2022</year>). <source>Streamlining experiment design in cognitive hearing science using OpenSesame</source>. <source>Behav. Res. Methods.</source>. <volume>55</volume>, <fpage>1965</fpage>&#x2013;<lpage>1979</lpage>, doi: <pub-id pub-id-type="doi">10.3758/s13428-022-01886-5</pub-id>, PMID: <pub-id pub-id-type="pmid">35794416</pub-id></citation>
</ref>
<ref id="ref60">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Trau-Margalit</surname> <given-names>A.</given-names></name> <name><surname>Fostick</surname> <given-names>L.</given-names></name> <name><surname>Harel-Arbeli</surname> <given-names>T.</given-names></name> <name><surname>Nissanholtz Gannot</surname> <given-names>R.</given-names></name> <name><surname>Taitelbaum-Swead</surname> <given-names>R.</given-names></name></person-group> (<year>2023</year>). <article-title>Speech recognition in noise task among children and young-adults: a pupillometry study</article-title>. <source>Front. Psychol.</source> <volume>14</volume>:<fpage>1188485</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fpsyg.2023.1188485</pub-id>, PMID: <pub-id pub-id-type="pmid">37425148</pub-id></citation>
</ref>
<ref id="ref61">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vickery</surname> <given-names>B.</given-names></name> <name><surname>Fogerty</surname> <given-names>D.</given-names></name> <name><surname>Dubno</surname> <given-names>J. R.</given-names></name></person-group> (<year>2022</year>). <article-title>Phonological and semantic similarity of misperceived words in babble: effects of sentence context, age, and hearing loss</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>151</volume>, <fpage>650</fpage>&#x2013;<lpage>662</lpage>. doi: <pub-id pub-id-type="doi">10.1121/10.0009367</pub-id>, PMID: <pub-id pub-id-type="pmid">35105039</pub-id></citation>
</ref>
<ref id="ref62">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Virtala</surname> <given-names>P.</given-names></name> <name><surname>Partanen</surname> <given-names>E.</given-names></name> <name><surname>Tervaniemi</surname> <given-names>M.</given-names></name> <name><surname>Kujala</surname> <given-names>T.</given-names></name></person-group> (<year>2018</year>). <article-title>Neural discrimination of speech sound changes in a variable context occurs irrespective of attention and explicit awareness</article-title>. <source>Biol. Psychol.</source> <volume>132</volume>, <fpage>217</fpage>&#x2013;<lpage>227</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.biopsycho.2018.01.002</pub-id>, PMID: <pub-id pub-id-type="pmid">29305875</pub-id></citation>
</ref>
<ref id="ref63">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wagner</surname> <given-names>A. E.</given-names></name> <name><surname>Toffanin</surname> <given-names>P.</given-names></name> <name><surname>Ba&#x015F;kent</surname> <given-names>D.</given-names></name></person-group> (<year>2016</year>). <article-title>The timing and effort of lexical access in natural and degraded speech</article-title>. <source>Front. Psychol.</source> <volume>7</volume>:<fpage>398</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fpsyg.2016.00398</pub-id>, PMID: <pub-id pub-id-type="pmid">27065901</pub-id></citation>
</ref>
<ref id="ref64">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wendt</surname> <given-names>D.</given-names></name> <name><surname>Dau</surname> <given-names>T.</given-names></name> <name><surname>Hjortkj&#x00E6;r</surname> <given-names>J.</given-names></name></person-group> (<year>2016</year>). <article-title>Impact of background noise and sentence complexity on processing demands during sentence comprehension</article-title>. <source>Front. Psychol.</source> <volume>7</volume>:<fpage>345</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fpsyg.2016.00345</pub-id>, PMID: <pub-id pub-id-type="pmid">27014152</pub-id></citation>
</ref>
<ref id="ref65">
<citation citation-type="journal"><person-group person-group-type="author">
<name><surname>Winn</surname> <given-names>M.</given-names></name>
</person-group> (<year>2016</year>). <article-title>Rapid release from listening effort resulting from semantic context, and effects of spectral degradation and Cochlear implants</article-title>. <source>Trends Hear.</source> <volume>20</volume>:<fpage>2331216516669723</fpage>. doi: <pub-id pub-id-type="doi">10.1177/2331216516669723</pub-id>, PMID: <pub-id pub-id-type="pmid">27698260</pub-id></citation>
</ref>
<ref id="ref66">
<citation citation-type="journal"><person-group person-group-type="author">
<name><surname>Winn</surname> <given-names>M. B.</given-names></name>
</person-group> (<year>2020</year>). <article-title>Accommodation of gender-related phonetic differences by listeners with cochlear implants and in a variety of vocoder simulations</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>147</volume>, <fpage>174</fpage>&#x2013;<lpage>190</lpage>. doi: <pub-id pub-id-type="doi">10.1121/10.0000566</pub-id>, PMID: <pub-id pub-id-type="pmid">32006986</pub-id></citation>
</ref>
<ref id="ref67">
<citation citation-type="other"><person-group person-group-type="author">
<name><surname>Winn</surname> <given-names>M. B.</given-names></name>
</person-group> (<year>2021</year>). <source>Vocoder: vocode all selected sounds in the objects list or all sounds in a specified folder</source>. <comment>Available at: </comment><ext-link xlink:href="http://www.mattwinn.com/praat/vocode_all_selected_v45.txt" ext-link-type="uri">http://www.mattwinn.com/praat/vocode_all_selected_v45.txt</ext-link> (Accessed August 2, 2022).</citation>
</ref>
<ref id="ref68">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Winn</surname> <given-names>M. B.</given-names></name> <name><surname>Chatterjee</surname> <given-names>M.</given-names></name> <name><surname>Idsardi</surname> <given-names>W. J.</given-names></name></person-group> (<year>2012</year>). <article-title>The use of acoustic cues for phonetic identification: effects of spectral degradation and electric hearing</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>131</volume>, <fpage>1465</fpage>&#x2013;<lpage>1479</lpage>. doi: <pub-id pub-id-type="doi">10.1121/1.3672705</pub-id>, PMID: <pub-id pub-id-type="pmid">22352517</pub-id></citation>
</ref>
<ref id="ref69">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Winn</surname> <given-names>M. B.</given-names></name> <name><surname>Edwards</surname> <given-names>J. R.</given-names></name> <name><surname>Litovsky</surname> <given-names>R. Y.</given-names></name></person-group> (<year>2015</year>). <article-title>The impact of auditory spectral resolution on listening effort revealed by pupil dilation</article-title>. <source>Ear Hear.</source> <volume>36</volume>, <fpage>e153</fpage>&#x2013;<lpage>e165</lpage>. doi: <pub-id pub-id-type="doi">10.1097/AUD.0000000000000145</pub-id>, PMID: <pub-id pub-id-type="pmid">25654299</pub-id></citation>
</ref>
<ref id="ref70">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Winn</surname> <given-names>M. B.</given-names></name> <name><surname>Teece</surname> <given-names>K. H.</given-names></name></person-group> (<year>2021</year>). <article-title>Listening effort is not the same as speech intelligibility score</article-title>. <source>Trends Hear.</source> <volume>25</volume>:<fpage>23312165211027688</fpage>. doi: <pub-id pub-id-type="doi">10.1177/23312165211027688</pub-id>, PMID: <pub-id pub-id-type="pmid">34261392</pub-id></citation>
</ref>
<ref id="ref71">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Winn</surname> <given-names>M. B.</given-names></name> <name><surname>Teece</surname> <given-names>K. H.</given-names></name></person-group> (<year>2022</year>). <article-title>Effortful listening despite correct responses: the cost of mental repair in sentence recognition by listeners with Cochlear implants</article-title>. <source>J. Speech Lang. Hear. Res.</source> <volume>65</volume>, <fpage>3966</fpage>&#x2013;<lpage>3980</lpage>. doi: <pub-id pub-id-type="doi">10.1044/2022_JSLHR-21-00631</pub-id>, PMID: <pub-id pub-id-type="pmid">36112516</pub-id></citation>
</ref>
<ref id="ref72">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Winn</surname> <given-names>M. B.</given-names></name> <name><surname>Wendt</surname> <given-names>D.</given-names></name> <name><surname>Koelewijn</surname> <given-names>T.</given-names></name> <name><surname>Kuchinsky</surname> <given-names>S. E.</given-names></name></person-group> (<year>2018</year>). <article-title>Best practices and advice for using pupillometry to measure listening effort: an introduction for those who want to get started</article-title>. <source>Trends Hear.</source> <volume>22</volume>:<fpage>2331216518800869</fpage>. doi: <pub-id pub-id-type="doi">10.1177/2331216518800869</pub-id>, PMID: <pub-id pub-id-type="pmid">30261825</pub-id></citation>
</ref>
<ref id="ref73">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname> <given-names>L.</given-names></name> <name><surname>Thompson</surname> <given-names>C. S.</given-names></name> <name><surname>Pfingst</surname> <given-names>B. E.</given-names></name></person-group> (<year>2005</year>). <article-title>Relative contributions of spectral and temporal cues for phoneme recognition</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>117</volume>, <fpage>3255</fpage>&#x2013;<lpage>3267</lpage>. doi: <pub-id pub-id-type="doi">10.1121/1.1886405</pub-id>, PMID: <pub-id pub-id-type="pmid">15957791</pub-id></citation>
</ref>
<ref id="ref74">
<citation citation-type="journal"><person-group person-group-type="author">
<name><surname>Yu</surname> <given-names>A. C. L.</given-names></name>
</person-group> (<year>2022</year>). <article-title>Perceptual cue weighting is influenced by the listener&#x2019;s gender and subjective evaluations of the speaker: the case of English stop voicing</article-title>. <source>Front. Psychol.</source> <volume>13</volume>:<fpage>840291</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fpsyg.2022.840291</pub-id>, PMID: <pub-id pub-id-type="pmid">35529558</pub-id></citation>
</ref>
<ref id="ref75">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zekveld</surname> <given-names>A. A.</given-names></name> <name><surname>Koelewijn</surname> <given-names>T.</given-names></name> <name><surname>Kramer</surname> <given-names>S. E.</given-names></name></person-group> (<year>2018</year>). <article-title>The pupil dilation response to auditory stimuli: current state of knowledge</article-title>. <source>Trends Hear.</source> <volume>22</volume>:<fpage>2331216518777174</fpage>. doi: <pub-id pub-id-type="doi">10.1177/2331216518777174</pub-id>, PMID: <pub-id pub-id-type="pmid">30249172</pub-id></citation>
</ref>
<ref id="ref76">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Malaval</surname> <given-names>F.</given-names></name> <name><surname>Lehmann</surname> <given-names>A.</given-names></name> <name><surname>Deroche</surname> <given-names>M. L. D.</given-names></name></person-group> (<year>2022</year>). <article-title>Luminance effects on pupil dilation in speech-in-noise recognition</article-title>. <source>PLoS One</source> <volume>17</volume>:<fpage>e0278506</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.pone.0278506</pub-id></citation>
</ref>
<ref id="ref77">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>N.</given-names></name> <name><surname>Xu</surname> <given-names>L.</given-names></name> <name><surname>Lee</surname> <given-names>C.-Y.</given-names></name></person-group> (<year>2010</year>). <article-title>The effects of frequency-place shift on consonant confusion in cochlear implant simulations</article-title>. <source>J. Acoust. Soc. Am.</source> <volume>128</volume>, <fpage>401</fpage>&#x2013;<lpage>409</lpage>. doi: <pub-id pub-id-type="doi">10.1121/1.3436558</pub-id></citation>
</ref>
</ref-list>
</back>
</article>