<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Neurosci.</journal-id>
<journal-title>Frontiers in Neuroscience</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Neurosci.</abbrev-journal-title>
<issn pub-type="epub">1662-453X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fnins.2014.00451</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Psychology</subject>
<subj-group>
<subject>Original Research Article</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Perceptual factors contribute more than acoustical factors to sound localization abilities with virtual sources</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>And&#x000E9;ol</surname> <given-names>Guillaume</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/54617"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Savel</surname> <given-names>Sophie</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/162493"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Guillaume</surname> <given-names>Anne</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/203641"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>D&#x000E9;partement Action et Cognition en Situation Op&#x000E9;rationnelle, Institut de Recherche Biom&#x000E9;dicale des Arm&#x000E9;es</institution> <country>Br&#x000E9;tigny sur Orge, France</country></aff>
<aff id="aff2"><sup>2</sup><institution>Laboratoire de M&#x000E9;canique et d&#x00027;Acoustique, Centre National de la Recherche Scientifique, UPR 7051, Equipe Sons, Aix-Marseille Universit&#x000E9;, Centrale Marseille</institution> <country>Marseille, France</country></aff>
<aff id="aff3"><sup>3</sup><institution>Laboratoire d&#x00027;Accidentologie, de Biom&#x000E9;canique et d&#x00027;&#x000C9;tude du Comportement Humain</institution> <country>Nanterre, France</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Brian Simpson, Air Force Research Laboratory, USA</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Frederick Jerome Gallun, Department of Veterans Affairs, USA; Douglas Brungart, Walter Reed National Military Medical Center, USA</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Guillaume And&#x000E9;ol, D&#x000E9;partement Action et Cognition en Situation Op&#x000E9;rationnelle, Institut de Recherche Biom&#x000E9;dicale des Arm&#x000E9;es, BP 73, 91223 Br&#x000E9;tigny sur Orge, France e-mail: <email>guillaume.andeol&#x00040;irba.fr</email></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Auditory Cognitive Neuroscience, a section of the journal Frontiers in Neuroscience.</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>29</day>
<month>01</month>
<year>2015</year>
</pub-date>
<pub-date pub-type="collection">
<year>2014</year>
</pub-date>
<volume>8</volume>
<elocation-id>451</elocation-id>
<history>
<date date-type="received">
<day>30</day>
<month>04</month>
<year>2014</year>
</date>
<date date-type="accepted">
<day>22</day>
<month>12</month>
<year>2014</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2015 And&#x000E9;ol, Savel and Guillaume.</copyright-statement>
<copyright-year>2015</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract><p>Human sound localization abilities rely on binaural and spectral cues. Spectral cues arise from interactions between the sound wave and the listener&#x00027;s body (head-related transfer function, HRTF). Large individual differences were reported in localization abilities, even in young normal-hearing adults. Several studies have attempted to determine whether localization abilities depend mostly on acoustical cues or on perceptual processes involved in the analysis of these cues. These studies have yielded inconsistent findings, which could result from methodological issues. In this study, we measured sound localization performance with normal and modified acoustical cues (i.e., with individual and non-individual HRTFs, respectively) in 20 na&#x000EF;ve listeners. Test conditions were chosen to address most methodological issues from past studies. Procedural training was provided prior to sound localization tests. The results showed no direct relationship between behavioral results and an acoustical metrics (spectral-shape prominence of individual HRTFs). Despite uncertainties due to technical issues with the normalization of the HRTFs, large acoustical differences between individual and non-individual HRTFs appeared to be needed to produce behavioral effects. A subset of 15 listeners then trained in the sound localization task with individual HRTFs. Training included either visual correct-answer feedback (for the test group) or no feedback (for the control group), and was assumed to elicit perceptual learning for the test group only. Few listeners from the control group, but most listeners from the test group, showed significant training-induced learning. For the test group, learning was related to pre-training performance (i.e., the poorer the pre-training performance, the greater the learning amount) and was retained after 1 month. The results are interpreted as being in favor of a larger contribution of perceptual factors than of acoustical factors to sound localization abilities with virtual sources.</p></abstract>
<kwd-group>
<kwd>sound localization</kwd>
<kwd>perceptual learning</kwd>
<kwd>procedural learning</kwd>
<kwd>head-related transfer function</kwd>
<kwd>individual differences</kwd>
</kwd-group>
<counts>
<fig-count count="11"/>
<table-count count="3"/>
<equation-count count="0"/>
<ref-count count="50"/>
<page-count count="17"/>
<word-count count="10034"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="introduction" id="s1">
<title>Introduction</title>
<p>Individuals receive information about their environment mainly via the visual and auditory sensory modalities. The auditory system has lower spatial resolution than the visual system, but allows perception beyond the visual field and in darkness. However, there is no direct encoding of space in the auditory system. Auditory space perception relies on the processing of binaural cues (i.e., interaural differences in the level and time of arrival of the incoming sound wave) for the left/right dimension, and spectral cues (i.e., filtering of the incoming sound wave by the listener&#x00027;s upper body, which corresponds to the head-related transfer function, HRTF) for the up/down and front/back dimensions. These direction-dependent cues are transformed into a complex audio-spatial map, which depends on anatomical characteristics and develops through experience with sensory&#x02014;mainly visual (King, <xref ref-type="bibr" rid="B22">2009</xref>)&#x02014;feedback. Audio-spatial maps have been found to be highly plastic throughout life (Clifton et al., <xref ref-type="bibr" rid="B14">1988</xref>; Hofman et al., <xref ref-type="bibr" rid="B20">1998</xref>; Otte et al., <xref ref-type="bibr" rid="B32">2013</xref>). Experience-dependent plasticity provides a potential neural basis for training-induced perceptual improvements in performance.</p>
<p>Large individual differences in localization ability have been reported, even in young normal-hearing adults (Wightman and Kistler, <xref ref-type="bibr" rid="B43">1989</xref>; Makous and Middlebrooks, <xref ref-type="bibr" rid="B26">1990</xref>; Wenzel et al., <xref ref-type="bibr" rid="B40">1993</xref>; Populin, <xref ref-type="bibr" rid="B33">2008</xref>; Savel, <xref ref-type="bibr" rid="B37">2009</xref>). These individual differences were mainly observed under experimental conditions that are assumed to involve spectral cues: localization in the up/down and front/back dimensions (Wightman and Kistler, <xref ref-type="bibr" rid="B43">1989</xref>; Wenzel et al., <xref ref-type="bibr" rid="B40">1993</xref>) and in noise (Best et al., <xref ref-type="bibr" rid="B9">2005</xref>). Two main contributing factors to localization abilities have therefore been proposed: spectral cues, and perceptual processes involved in the analysis of these cues. Several studies have assessed the contributions of these two factors separately.</p>
<p>It has been proposed that localization abilities depend mainly on the physical saliency of the acoustical cues carried by HRTFs. According to this hypothesis, the performance of listeners with poorer abilities would be hampered by insufficiently salient spectral cues. This hypothesis was initially supported by the finding that listeners with poor localization performance substantially improved when these listeners used the HRTFs of other listeners who had better performance (Butler and Belendiuk, <xref ref-type="bibr" rid="B11">1977</xref>; Wenzel et al., <xref ref-type="bibr" rid="B41">1988</xref>; Asano et al., <xref ref-type="bibr" rid="B6">1990</xref>). However, the physical saliency of spectral cues was not quantified, and more recent studies, involving more listeners, did not confirm this finding (M&#x000F8;ller et al., <xref ref-type="bibr" rid="B29">1996</xref>; Middlebrooks, <xref ref-type="bibr" rid="B28">1999b</xref>). A recent study assessed the spectral shape prominence of 15 individual HRTFs, and found no relationship between this acoustical metrics and localization performance in noise (And&#x000E9;ol et al., <xref ref-type="bibr" rid="B5">2013</xref>).</p>
<p>Alternatively, it has been proposed that providing listeners with other-than-their-own HRTFs should affect their localization performance regardless of the saliency of spectral cues (Wenzel et al., <xref ref-type="bibr" rid="B40">1993</xref>; M&#x000F8;ller et al., <xref ref-type="bibr" rid="B29">1996</xref>; Middlebrooks, <xref ref-type="bibr" rid="B28">1999b</xref>). Four studies compared the localization performance obtained using the individual&#x00027;s own HRTFs (normal cues) to the performance obtained using non-individual HRTFs (modified cues) in the same listeners. The two studies involving listeners with previous experience in localization tests reported a difference in performance between HRTFs (M&#x000F8;ller et al., <xref ref-type="bibr" rid="B29">1996</xref>; Middlebrooks, <xref ref-type="bibr" rid="B28">1999b</xref>). Conversely, the two studies involving na&#x000EF;ve listeners reported no difference (Bronkhorst, <xref ref-type="bibr" rid="B10">1995</xref>; Begault et al., <xref ref-type="bibr" rid="B8">2001</xref>). The latter negative findings may have been due to the involvement of na&#x000EF;ve listeners, who usually have more variable performance&#x02014;perhaps due to differences in the speed of procedural learning (e.g., handling of the response device, Djelani et al., <xref ref-type="bibr" rid="B15">2000</xref>; Majdak et al., <xref ref-type="bibr" rid="B25">2010</xref>). There were multiple other methodological differences between the four studies<xref ref-type="fn" rid="fn0001"><sup>1</sup></xref>. Reports of a lack of difference in performance could also result from insufficiently large &#x0201C;inter-spectral distance&#x0201D; (ISD) between individual and non-individual HRTFs (as defined by Middlebrooks, <xref ref-type="bibr" rid="B27">1999a</xref>). On the other hand, the reports of large differences might be explained merely by the fact that the listeners did not learn to use the cues provided by the non-individual HRTFs. Perceptual learning produces a recalibration of the audio-spatial map (Hofman et al., <xref ref-type="bibr" rid="B20">1998</xref>; Carlile and Blackman, <xref ref-type="bibr" rid="B12">2013</xref>). By simulating complete recalibration, Majdak et al. (<xref ref-type="bibr" rid="B24">2014</xref>) showed that using non-individual HRTFs should have a moderate impact on sound localization performance. However, they found that non-acoustical factors (attention, perceptual abilities) would be highly relevant for predicting sound localization performance.</p>
<p>Non-acoustical factors, such as perceptual processes, have been proposed to explain the large individual differences reported in studies about discrimination between front and rear sources (Wightman and Kistler, <xref ref-type="bibr" rid="B46">1999</xref>) and about sound localization in noise (And&#x000E9;ol et al., <xref ref-type="bibr" rid="B4">2011</xref>, <xref ref-type="bibr" rid="B5">2013</xref>). The perceptual processes involved in the analysis of spectral cues (Drennan and Watson, <xref ref-type="bibr" rid="B16">2001</xref>; Sabin et al., <xref ref-type="bibr" rid="B36">2012</xref>) and sound localization accuracy with individual HRTFs (Majdak et al., <xref ref-type="bibr" rid="B25">2010</xref>) were both found to improve with training in the auditory task. In the latter study, acoustical cues were kept constant but sensory (visual) feedback was provided during training. The resulting improvement in localization performance was assumed to reflect perceptual learning. However, increased exposure to the experimental environment (e.g., apparatus) and/or procedural learning (i.e., learning of the task contingencies) could have also contributed to the observed improvement.</p>
<p>In the present study, we assessed the contributions of acoustical and perceptual factors to sound localization abilities with virtual sources under experimental conditions that were chosen specifically to address the confounds present in previous studies&#x02014;i.e., factors that could interfere with, or mask, the actual contribution of the factor investigated. Twenty na&#x000EF;ve listeners were given procedural training prior to sound localization tests in &#x0201C;classical&#x0201D; conditions (anechoic environment, constant target/head distance, large range of azimuths and elevations). Acoustical and perceptual factors were separately manipulated, and the resulting effects on localization performance were assessed.</p>
<p>To investigate the role of acoustical cues, sound localization performance was measured with individual and non-individual HRTFs (normal and modified cues). We quantified the &#x0201C;spectral strength,&#x0201D; which is assumed to quantify the amount of spectral detail, of each HRTF (And&#x000E9;ol et al., <xref ref-type="bibr" rid="B5">2013</xref>), and the ISD between individual and non-individual HRTFs. The following observations would be in favor of a substantial contribution of acoustical factors to sound localization abilities with virtual sources: a relationship between performance and spectral strength with individual HRTFs, a difference in performance between individual and non-individual HRTFs, and a relationship between this behavioral difference and the ISD between HRTFs.</p>
<p>The role of perceptual processes was investigated as follows. A subset of 15 listeners performed training to the sound localization task with individual HRTFs. Seven listeners received visual correct-answer feedback during training (test group) and eight received no feedback (control group). The amount of training-induced learning was assessed by comparing pre- and post-test performance. The persistence of learning was assessed by a follow-up post-test. In studies of perceptual training, it is often assumed that the training regimen elicits more efficient perceptual learning if correct-answer feedback is provided (Amitay et al., <xref ref-type="bibr" rid="B2">2010</xref>), particularly for complex tasks (Garcia et al., <xref ref-type="bibr" rid="B17">2013</xref>). For sound localization, it has even been suggested that no perceptual learning can occur if no feedback is provided (Recanzone et al., <xref ref-type="bibr" rid="B34">1998</xref>; Irving and Moore, <xref ref-type="bibr" rid="B21">2011</xref>). We therefore assumed that the training regimen in the present study elicited perceptual learning for the test group only. For this group, significant training-induced improvements in localization performance would indicate that perceptual learning occurred. The finding of a relationship between the amount of learning and the performance as measured prior to training for the test group would therefore reflect the contribution of a common&#x02014;perceptual in this case&#x02014;factor to the two behavioral metrics. Taken together, these results would indicate a large contribution of perceptual factors to sound localization abilities with virtual sources.</p>
</sec>
<sec sec-type="materials and methods" id="s2">
<title>Materials and methods</title>
<sec>
<title>Overview of the study</title>
<p>To test the hypotheses presented in the Introduction, two consecutive experiments were conducted. In the first experiment, the role of acoustical factors was assessed by comparing the localization performance obtained using individual HRTFs (normal acoustical cues) to that obtained using non-individual HRTFs (modified cues). The spectral strength of each HRTF, and the ISD between individual and non-individual HRTFs, were evaluated. Prior to the sound localization tests, each listener performed procedural training with visual targets to reduce the contribution of procedural factors to the results. The second experiment assessed the role of perceptual factors by comparing localization performance prior to and following a 5-day training regimen. A first group received visual feedback (test group) and a second group (control group) received no feedback. An improvement of performance for the first group would be in favor of a contribution of perceptual factors to sound localization abilities with virtual sources, because acoustical factors were constant during training. The control group allowed to assess the potential contribution of other factors (familiarization, procedural learning,&#x02026;) to the observed training-induced improvements.</p>
</sec>
<sec>
<title>Listeners</title>
<p>Twenty-five na&#x000EF;ve listeners participated (11 females, mean age 27 &#x000B1; 5 years; right-handed according to the Edinburgh Handedness Inventory, see Oldfield, <xref ref-type="bibr" rid="B30">1971</xref>). All had normal hearing (thresholds of 15 dB HL or less at octave frequencies from 0.125 to 8 kHz) and normal otoscopy. None had history of auditory pathology. Written informed consent was obtained, in agreement with the guidelines of the Declaration of Helsinki and the Huriet law on biomedical research in humans. Listeners were paid 10 &#x020AC;/h for their participation. After completion of the study, the data from five listeners were excluded due to errors in the processing of their HRTFs (see below).</p>
</sec>
<sec>
<title>Experimental apparatus</title>
<p>The localization experiment was conducted inside a sphere, which was located in a 30-m<sup>2</sup>, light and sound-attenuating (&#x0003C;0.02 Lux and 35 dBA) room. The setup was a black sphere with a radius of 1.4 m that was truncated at its base (1.2 m below center, elevation &#x0003D; &#x02212;60&#x000B0;). This sphere represented the perceptual space of the listener during testing (see Figure <xref ref-type="fig" rid="F1">1</xref>). Three lines of optical fibers were used to visually indicate the medial vertical, medial horizontal, and medial frontal planes on the interior surface of the sphere. A network of 619 optical fibers, each connected to one LED, was distributed on the sphere. The LEDs (color &#x0003D; red, size &#x0003D; 1&#x000B0; of visual angle, luminance &#x0003D; 10 cd/m<sup>2</sup>), when turned on, were used either as visual targets or as feedback signals.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p><bold>Interior view (left) and exterior schematic view (right) of the experimental apparatus</bold>.</p></caption>
<graphic xlink:href="fnins-08-00451-g0001.tif"/>
</fig>
<p>The listener was seated on a stool that was adjusted so as to match the center of the listener&#x00027;s head with that of the sphere. During testing, the matching was verified using an electromagnetic sensor (Polhemus Fastrack) mounted on the headphones (Beyer DT990Pro). Listeners used a &#x0201C;God Eye Localization Pointing&#x0201D; system (GELP, Gilkey et al., <xref ref-type="bibr" rid="B18">1995</xref>) to provide their localization responses. The GELP was composed of a plastic globe (radius &#x0003D; 15 cm) that represented a reduced version of the listener&#x00027;s perceptual space and a stylus. Listeners had to point the stylus on the globe so that the vector &#x0201C;center of the globe to stylus tip&#x0201D; had the same direction as the vector &#x0201C;center of the listener&#x00027;s head to perceived target direction on the sphere.&#x0201D; The position of the stylus tip was recorded using an electromagnetic sensor (Polhemus Fastrack), whose transmitter was mounted on the bar supporting the globe. To help the transfer of representation from perceptual to response spaces, the globe contained a figurine&#x00027;s head that represented the listener&#x00027;s head at the center of the sphere, and white circles that represented the three main planes (medial horizontal, medial vertical, and medial frontal). The position of the LEDs relative to the listener&#x00027;s head varied in azimuth from 0 to 360&#x000B0; and in elevation from &#x02212;60 to 90&#x000B0;. The angular separation between LEDs was 15 or 20&#x000B0;.</p>
</sec>
<sec>
<title>Measurement and spectral characterization of HRTFs</title>
<p>One non-individual (Neumann KU-100 dummy head) and 25 individual (listeners) HRTFs were measured in a semi-anechoic room (Illsonic Sonex Audio) using the procedure described in And&#x000E9;ol et al. (<xref ref-type="bibr" rid="B5">2013</xref>). Directional transfer functions (DTFs) were then derived from each HRTF using the method proposed by Middlebrooks (<xref ref-type="bibr" rid="B27">1999a</xref>). DTFs only contain the directional components of the HRTF, and are independent of the characteristics of the microphone or of its positioning into the ear canal. To compute DTFs, each HRTF has to be divided by the square root of the weighted sum of squared HRTFs that have been measured for each sound source direction. The weights are adjusted to take into account the non-uniform distribution of sound directions. The spectral strength, which corresponds to the ISD between a flat spectrum and the magnitude spectrum of the DTF, was computed for each HRTF using the procedure described in And&#x000E9;ol et al. (<xref ref-type="bibr" rid="B5">2013</xref>). The ISD between individual and non-individual HRTFs was quantified as the difference in DTF.</p>
<p>As a result of an error in DTFs computation (i.e., use of the HRTF measured for the 90&#x000B0; elevation instead of the weighted sum of squared HRTFs), which was detected after collection of the behavioral data, five listeners were excluded from the study. They had ISDs between correctly and incorrectly assessed DTFs greater than the smallest ISD between individual and non-individual HRTFs in the 25-listener cohort (9.5 dB<sup>2</sup>). ISDs between correct and incorrect DTFs ranged from 1.1 to 6.6 dB<sup>2</sup> across the remaining 20 listeners (see Table <xref ref-type="table" rid="T1">1</xref>). These values are below the ISDs between individual and non-individual HRTFs (range &#x0003D; 9.5 to 17.2 dB<sup>2</sup>). However, to verify that the error in DTFs was unlikely to affect the behavioral results reported below, five of the 20 listeners performed an additional localization test with individual HRTFs, using their correct and incorrect DTFs. The results showed little or no effect of the difference in DTF (see Appendix). We therefore refer below to &#x0201C;individual HRTFs&#x0201D; in spite of the small error in DTF presentation.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p><bold>Individual value of the ISD between correct and incorrect DTFs (in dB<sup>2</sup>)</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left"><bold>Listener</bold></th>
<th align="center"><bold>ISD (dB<sup>2</sup>)</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">L8</td>
<td align="center">3.6</td>
</tr>
<tr>
<td align="left">L9</td>
<td align="center">4.4</td>
</tr>
<tr>
<td align="left">L11</td>
<td align="center">1.6</td>
</tr>
<tr>
<td align="left">L12</td>
<td align="center">1.4</td>
</tr>
<tr>
<td align="left">L13</td>
<td align="center">1.8</td>
</tr>
<tr>
<td align="left">L14</td>
<td align="center">2.5</td>
</tr>
<tr>
<td align="left">L15</td>
<td align="center">3.5</td>
</tr>
<tr>
<td align="left">L17</td>
<td align="center">3.9</td>
</tr>
<tr>
<td align="left">L18</td>
<td align="center">3.3</td>
</tr>
<tr>
<td align="left">L21</td>
<td align="center">2.2</td>
</tr>
<tr>
<td align="left">L22</td>
<td align="center">6.6</td>
</tr>
<tr>
<td align="left">L23</td>
<td align="center">1.3</td>
</tr>
<tr>
<td align="left">L24</td>
<td align="center">2.2</td>
</tr>
<tr>
<td align="left">L26</td>
<td align="center">4.9</td>
</tr>
<tr>
<td align="left">L27</td>
<td align="center">1.2</td>
</tr>
<tr>
<td align="left">L28</td>
<td align="center">2.7</td>
</tr>
<tr>
<td align="left">L30</td>
<td align="center">3.8</td>
</tr>
<tr>
<td align="left">L31</td>
<td align="center">4.1</td>
</tr>
<tr>
<td align="left">L33</td>
<td align="center">1.1</td>
</tr>
<tr>
<td align="left">L34</td>
<td align="center">1.3</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec>
<title>Stimuli</title>
<p>Stimuli for sound localization tests were digitally generated at a 48.8-kHz sampling rate, 24-bit resolution using a real-time processor (RX6 Tucker-Davis Technologies), and were converted to the analog domain, routed to a headphone buffer (HB7 Tucker-Davis Technologies) and presented through headphones (Beyer DT990Pro). The stimulus was a 150-ms (including 10-ms on/off cosine-squared ramps) burst of pink noise that was filtered between 0.05 and 14 kHz using sixth-order and seventh-order Butterworth filters, respectively. The overall stimulus level was 60 dB SPL.</p>
</sec>
<sec>
<title>Procedures</title>
<p>Listeners (<italic>N</italic> &#x0003D; 20 after removal of five listeners) performed procedural training with the GELP using visual targets (3 consecutive days) and then completed sound localization pre-tests with individual and non-individual HRTFs in counterbalanced order (2 days). A subset of 15 listeners then performed training to the sound localization task with individual HRTFs (5 days) followed by sound localization &#x0201C;immediate&#x0201D; post-tests with individual and non-individual HRTFs in fixed order (2 days). All except one trained listeners performed a &#x0201C;long-term&#x0201D; post-test with individual HRTFs (1 month after the immediate post-tests).</p>
<p>The directions of the visual or auditory targets were chosen as follows. For sound localization tests, virtual auditory targets were created by interpolating the directions used for the HRTF measurement. The target directions were determined using 119-point meshes mapped onto the surface of the perceptual space (shortened at &#x02212;60&#x000B0; of elevation) using the Hypermesh (Altair, MI, USA) software. Three different meshes were used for the pre-test, immediate post-test, and long-term post-test. A 7&#x000B0; azimuth translation was applied so that the directions tested using individual HRTFs were different from those tested using non-individual HRTFs. For the procedural and auditory trainings, the target directions corresponded to the positions of the optical fibers on the surface of the sphere. The surface of the sphere was divided into eight areas defined by the intersection of the median horizontal, vertical and frontal planes. For a given session of procedural or auditory training, the target directions were randomly but equally chosen among the eight areas. The target directions varied between sessions. Thus, the sets of 119 (sound localization tests) or 120 (auditory training) target directions varied between training sessions, between pre- and post-tests, and between individual and non-individual HRTFs.</p>
<sec>
<title>Procedural training</title>
<p>The setup and response device were the same as those used for auditory tests. The procedural training stage had two goals: (1) familiarize the listener with the experimental environment and (2) reduce experimental noise related to the use of the response device (i.e., pointing errors in the transfer of representation from egocentric perceptual space to allocentric response space). Visual targets were used to prevent auditory learning.</p>
<p>Once the listener was installed in the sphere, a visual cross was turned on to indicate the &#x0201C;straight ahead&#x0201D; direction (azimuth and elevation &#x0003D; 0&#x000B0;). The listener oriented to the straight ahead direction and pressed the stylus button. The cross was turned off and a red visual target was then presented on the sphere by turning on one LED. For trials with no feedback, listeners had to indicate the perceived direction of the visual target using the GELP, and to validate their response by pressing the stylus button. For trials with feedback, listeners pointed to the perceived direction without pressing the stylus button. If the spherical angular error between actual and pointed directions was below the &#x0201C;permissible&#x0201D; error (&#x0003D;8&#x000B0; for day 1; &#x0003D; error measured for the last no-feedback block of the preceding day&#x02014;2&#x000B0; for days 2 and 3), a &#x0201C;hit&#x0201D; sound was emitted. Otherwise, the listener had to modify the pointed direction until they reached permissible error. The trial ended either by the emission of the hit sound or after 30 s. The position of the target changed from trial to trial. The listeners performed three training sessions (duration &#x0003D; 1 h 30 each). For each session, two blocks of 40 trials with correct-answer feedback (15&#x02013;20 min) alternated with three blocks of 32 trials with no feedback (12&#x02013;15 min) in fixed order (no/with/no/with/no feedback).</p>
<p>The spherical angular error averaged across the 20 listeners decreased from 9.2&#x000B0; (&#x000B1;1.6) for the first to 6.6&#x000B0; (&#x000B1;1.3) for the last no-feedback blocks. Individual errors were stable across, at least, the last three no-feedback blocks (repeated measure ANOVA, error at no-feedback blocks as the within-listener factor, post-hoc Tukey-HSD: <italic>p</italic> &#x0003E; 0.50).</p>
</sec>
<sec>
<title>Sound localization tests</title>
<p>Before each presentation of the auditory target, the listener&#x00027;s position relative to the straight ahead direction was verified using the electromagnetic sensor. In case of a deviation above 5&#x000B0;, a message required the listener to rectify their position. Once the listener was correctly positioned, the auditory target was presented over headphones at one of 119 possible virtual directions on the sphere. The listener was free to move after the offset of the auditory target. The listener had to indicate the perceived direction using the GELP. There was no time restriction but listeners were encouraged to respond quickly. No correct-answer feedback was provided. The set of 119 directions was repeated six times (total number of trials &#x0003D; 714). The responses collected at the first repetition were excluded from the analyses. Each pre- and post-test had an overall duration of 1.5&#x02013;2 h, and was divided into three series of four 60-trial blocks (54 for the last one). Listeners had to stay inside the sphere during between-block breaks (1.5 min) but were allowed to leave the setup during between-series breaks (10 min).</p>
</sec>
<sec>
<title>Auditory training</title>
<p>The auditory stimuli used during training had the same characteristics as those used in the sound localization pre- and post-tests except that only individual HRTFs were used. Each of the five training sessions included three 20-min blocks of 40 trials, with 8-min breaks between blocks. For the test group (<italic>N</italic> &#x0003D; 7), training consisted in providing the listener with trial-by-trial visual feedback (red LED turned on during 250 ms after the listener&#x00027;s response) as to the correct auditory target direction. Listeners were instructed to search for the red light, face it, and come back to the straight-ahead position. The auditory target &#x0002B; visual feedback sequence was replayed at least once. Listeners were then allowed to replay the sequence as many times as they wished. Training for the test group was similar to that used in the study by Majdak et al. (<xref ref-type="bibr" rid="B25">2010</xref>), except that their listeners were allowed only one sequence replay. For the control group (<italic>N</italic> &#x0003D; 8), training sessions were identical to pre- and post-tests sessions, except for the number of trials (660 trials instead of 714) that allowed the training duration to be similar for the two groups. The events and listener&#x00027;s actions during testing are listed in Table <xref ref-type="table" rid="T2">2</xref>.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p><bold>Order of events and listener&#x00027;s actions during auditory training</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left"><bold>Events</bold></th>
<th align="left"><bold>Listener&#x00027;s actions</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Straight ahead indicator turned on</td>
<td align="left">Face the straight ahead indicator</td>
</tr>
<tr>
<td align="left">Auditory target presentation</td>
<td align="left">Indicate the target direction using GELP</td>
</tr>
<tr>
<td align="left">Visual feedback (red light) turned on</td>
<td align="left">Face the red light and come back</td>
</tr>
<tr>
<td align="left">Straight ahead indictor turned on</td>
<td align="left">Face the straight ahead indicator</td>
</tr>
<tr>
<td align="left">Visual feedback turned off</td>
<td/>
</tr>
<tr>
<td align="left">Auditory target re-presentation</td>
<td/>
</tr>
<tr>
<td align="left">Visual feedback turned on</td>
<td align="left">Choose to replay the auditory target &#x0002B; visual feedback sequence or to move to the next trial</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
</sec>
<sec>
<title>Data analysis</title>
<p>Localization responses were computed using a three-pole coordinate system (Kistler and Wightman, <xref ref-type="bibr" rid="B23">1992</xref>). In this system, the position of a point is coded by the three following angles: the left/right angle in the medial vertical plane (direction in the left/right dimension), the front/back angle in the medial frontal plane (direction in the front/back dimension), and the up/down angle in the medial horizontal plane (direction in the up/down dimension). This coordinate system has the advantage that a given angular distance corresponds to a constant distance on the sphere for all spatial regions. Conversely, in two-pole&#x02014;lateral/polar (Middlebrooks, <xref ref-type="bibr" rid="B28">1999b</xref>) and azimuth/elevation (Oldfield and Parker, <xref ref-type="bibr" rid="B31">1984</xref>)&#x02014;coordinate systems, a compression of space occurs when points are close to the poles. Another advantage of the three-pole system is the distinction between spatial dimensions that depend on different localization cues or processes: binaural cues for localization in the left/right dimension (Strutt, <xref ref-type="bibr" rid="B38">1907</xref>), spectral-shape analysis (Wightman and Kistler, <xref ref-type="bibr" rid="B44">1993</xref>) or determination of the main spectral-notch position (Butler and Belendiuk, <xref ref-type="bibr" rid="B11">1977</xref>) for localization in the up/down dimension, and comparison of the levels of different bandwidths (Wightman and Kistler, <xref ref-type="bibr" rid="B45">1997</xref>) or more complex cues (Bronkhorst, <xref ref-type="bibr" rid="B10">1995</xref>; Zhang and Hartmann, <xref ref-type="bibr" rid="B49">2010</xref>) for localization in the front/back dimension.</p>
<p>Scatterplots of raw data (i.e., target against response directions) are provided in Figures <xref ref-type="fig" rid="F2">2</xref>&#x02013;<xref ref-type="fig" rid="F4">4</xref> for the up/down, front/back, and left/right dimensions, respectively. Because left/right judgments remain generally accurate with non-individual HRTFs (Wightman and Kistler, <xref ref-type="bibr" rid="B45">1997</xref>), and individual differences in localization abilities were mainly observed for up/down and front/back dimensions, statistical analyzes were performed for the latter two dimensions only.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p><bold>Individual judgment position against target position with individual and non-individual HRTFs (black and gray dots, respectively) at the pre-test in the up/down dimension</bold>. Each panel couple is for a different listener (<italic>N</italic> &#x0003D; 20).</p></caption>
<graphic xlink:href="fnins-08-00451-g0002.tif"/>
</fig>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p><bold>Same as Figure <xref ref-type="fig" rid="F2">2</xref> but for the front/back dimension</bold>. The front/back reversal rate for individual and non-individual HRTFs are indicated in each panel couple.</p></caption>
<graphic xlink:href="fnins-08-00451-g0003.tif"/>
</fig>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p><bold>Same as Figure <xref ref-type="fig" rid="F2">2</xref> but for the left/right dimension</bold>.</p></caption>
<graphic xlink:href="fnins-08-00451-g0004.tif"/>
</fig>
<p>Numerous studies have reported frequent front/back (response pointing to the frontal hemifield for a target presented in the rear or vice versa) and up/down reversals (response pointing to above 0&#x000B0; elevation for a target presented at below 0&#x000B0; elevation or vice versa) in localization responses. Such reversals drastically increase angular errors, unless they are excluded or corrected (e.g., a response at &#x02212;50&#x000B0; elevation is transformed into 50&#x000B0;). We therefore assessed the following localization scores: up/down angular error after correction of up/down reversals (in &#x000B0;), and down &#x02192; up, up &#x02192; down, and front/back reversal rates (in %). Up/down errors were separately assessed for &#x0201C;high,&#x0201D; &#x0201C;middle,&#x0201D; and &#x0201C;low&#x0201D; target elevations (elevation &#x0003D; 25 to 75&#x000B0;, &#x02212;15 to 15&#x000B0;, &#x02212;60 to &#x02212;25&#x000B0;, respectively). Responses at &#x000B1;15&#x000B0; front/back angles and those at &#x000B1;20&#x000B0; up/down angles were not considered as front/back and up/down reversals, respectively.</p>
<p>The within- and across-listener paired comparisons listed below were statistically assessed using Wilcoxon tests. Relationships between two metrics were assessed using Spearman correlation coefficients. Two-tailed <italic>p</italic>-values are reported below.</p>
<p>To examine the role of acoustical factors, we assessed:</p>
<list list-type="order">
<list-item><p>The relationship between spectral strength and pre-test performance with individual HRTFs for the 20-listener cohort.</p></list-item>
<list-item><p>The individual and cohort differences between individual and non-individual HRTFs in pre-test performance.</p></list-item>
<list-item><p>The relationship between this behavioral difference and the ISD between individual and non-individual HRTFs for the cohort.</p></list-item>
</list>
<p>To examine the role of perceptual factors, we first computed individual amounts of training-induced improvement (i.e., pre-test &#x02013; post-test difference in score, referred to below as &#x0201C;learning amount&#x0201D;) with individual HRTFs. Then, we determined for each listener whether learning was significant using a Wilcoxon test (pre-test against post-test scores). Finally, we assessed within each trained group:</p>
<list list-type="order">
<list-item><p>The relationship between learning amount at the immediate post-test and pre-test score.</p></list-item>
<list-item><p>Whether the listeners with significant learning at the immediate post-test had similar immediate and long-term post-test scores.</p></list-item>
</list>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec>
<title>Relationship between spectral strength and pre-test performance with individual HRTFs</title>
<p>With individual HRTFs, no relationship was found between spectral strength and performance at the pre-test (see Figure <xref ref-type="fig" rid="F5">5</xref>), regardless of whether performance was expressed in terms of up/down angular errors (high elevations: <italic>R</italic> &#x0003D; &#x02212;0.21, <italic>p</italic> &#x0003D; 0.37; middle elevations: <italic>R</italic> &#x0003D; 0.32, <italic>p</italic> &#x0003D; 0.16; low elevations: <italic>R</italic> &#x0003D; 0.14, <italic>p</italic> &#x0003D; 0.56), up/down reversals (up &#x02192; down: <italic>R</italic> &#x0003D; &#x02212;0.11, <italic>p</italic> &#x0003D; 0.64; down &#x02192; up: <italic>R</italic> &#x0003D; &#x02212;0.01, <italic>p</italic> &#x0003D; 0.95), or front/back reversals (<italic>R</italic> &#x0003D; &#x02212;0.01, <italic>p</italic> &#x0003D; 0.99). However, the spectral strength of the non-individual HRTFs was weaker than that of all individual HRTFs (12.8 dB<sup>2</sup> vs. 17.6 to 45.0 dB<sup>2</sup>) for the low elevation region, where (down &#x02192; up) reversals were significantly more frequent with non-individual than with individual HRTFs.</p>
<fig id="F5" position="float">
<label>Figure 5</label>
<caption><p><bold>Individual localization scores at the pre-test against spectral strength with individual HRTFs</bold>. <bold>(A&#x02013;C)</bold> Up/down errors (in &#x000B0;) for high, middle, and low target elevations. <bold>(D&#x02013;F)</bold> Up &#x02192; down, down &#x02192; up, and front/back reversal rates (in %).</p></caption>
<graphic xlink:href="fnins-08-00451-g0005.tif"/>
</fig>
</sec>
<sec>
<title>Difference between individual and non-individual HRTFs at the pre-test</title>
<p>For up/down errors (see Figures <xref ref-type="fig" rid="F2">2</xref>, <xref ref-type="fig" rid="F6">6A&#x02013;C</xref>), only a few listeners (1, 6, and 6 for high, middle, and low target elevations, respectively) individually showed significant differences between HRTFs. The lack of difference was observed regardless of whether listeners had large or small errors, and is therefore unlikely to have been due to a floor effect. The difference between HRTFs as assessed for the cohort was significant for high target elevations (median up/down error &#x000B1; 1 inter-quartile range &#x0003D; 18 &#x000B1; 3&#x000B0; with individual HRTFs &#x0003C; 19 &#x000B1; 5&#x000B0; with non-individual HRTFs, <italic>p</italic> &#x0003D; 0.004) but was not significant for middle (24 &#x000B1; 8&#x000B0; vs. 23 &#x000B1; 8&#x000B0;, <italic>p</italic> &#x0003D; 0.52) and low target elevations (23 &#x000B1; 6&#x000B0; vs. 21 &#x000B1; 8&#x000B0;, <italic>p</italic> &#x0003D; 0.99). Up &#x02192; down reversals were infrequent with individual HRTFs (see Figure <xref ref-type="fig" rid="F6">6D</xref>). The difference between HRTFs was small but significant for six listeners and for the cohort (median &#x0003D; 3 &#x000B1; 5% with individual HRTFs vs. 5 &#x000B1; 7% with non-individual HRTFs, <italic>p</italic> &#x0003D; 0.03). Down &#x02192; up reversals were more frequent than up &#x02192; down reversals, and increased with non-individual HRTFs (see Figure <xref ref-type="fig" rid="F6">6E</xref>). The difference between HRTFs was significant for 17 listeners and for the cohort (median &#x0003D; 20 &#x000B1; 14% &#x0003C; 51 &#x000B1; 26%, <italic>p</italic> &#x0003C; 0.001). For front/back reversals (see Figures <xref ref-type="fig" rid="F3">3</xref>, <xref ref-type="fig" rid="F6">6F</xref>), only two listeners individually showed significant difference between HRTFs. The difference for the cohort was not significant (median &#x0003D; 35 &#x000B1; 10% &#x02248; 35 &#x000B1; 11%, <italic>p</italic> &#x0003D; 0.37). Visual inspection of raw data in the left/right dimension indicates no difference between HRTFs (see Figure <xref ref-type="fig" rid="F4">4</xref>).</p>
<fig id="F6" position="float">
<label>Figure 6</label>
<caption><p><bold>Individual localization scores with non-individual against with individual HRTFs at the pre-test. (A&#x02013;C)</bold> Up/down errors (in &#x000B0;) for high, middle, and low target elevations. <bold>(D&#x02013;F)</bold> Up &#x02192; down, down &#x02192; up, and front/back reversal rates (in %). Each symbol is for a different listener. Circles and bars represent the means and 95% confidence intervals averaged across about 30 (up/down error) to 96 (front/back reversals) target positions. Filled circles indicate the listeners with significant difference between individual and non-individual HRTFs according to Wilcoxon tests.</p></caption>
<graphic xlink:href="fnins-08-00451-g0006.tif"/>
</fig>
</sec>
<sec>
<title>Relationship between behavioral difference and ISD between individual and non-individual HRTFs</title>
<p>The ISD values varied across target regions and listeners (Figure <xref ref-type="fig" rid="F7">7</xref>), but were essentially&#x02014;except for high elevations&#x02014;well-above 10 dB<sup>2</sup>, which should be large enough to produce behavioral effects according to the results from a past study (Middlebrooks, <xref ref-type="bibr" rid="B28">1999b</xref>). However, we found no <italic>positive</italic> correlation between the signed difference in localization score and the ISD between non-individual and individual HRTFs (up/down errors: <italic>R</italic> &#x0003D; &#x02212;0.03, <italic>p</italic> &#x0003D; 0.90 for high elevations, <italic>R</italic> &#x0003D; &#x02212;0.07, <italic>p</italic> &#x0003D; 0.77 for middle elevations, <italic>R</italic> &#x0003D; &#x02212;0.42, <italic>p</italic> &#x0003D; 0.037 for low elevations; up &#x02192; down reversals: <italic>R</italic> &#x0003D; 0.32, <italic>p</italic> &#x0003D; 0.16; down &#x02192; up reversals: <italic>R</italic> &#x0003D; 0.37, <italic>p</italic> &#x0003D; 0.11; front/back reversals: <italic>R</italic> &#x0003D; &#x02212;0.02, <italic>p</italic> &#x0003D; 0.93). Note that if the listeners who had <italic>lower</italic> scores with non-individual HRTFs than with individual HRTFs were excluded from analyses, no correlation was significant.</p>
<fig id="F7" position="float">
<label>Figure 7</label>
<caption><p><bold>Individual signed differences in localization score against ISD between non-individual and individual HRTFs</bold>. <bold>(A&#x02013;C)</bold> Up/down errors (in &#x000B0;) for high, middle, and low target elevations. <bold>(D&#x02013;F)</bold> Up &#x02192; down, down &#x02192; up, and front/back reversal rates (in %).</p></caption>
<graphic xlink:href="fnins-08-00451-g0007.tif"/>
</fig>
</sec>
<sec>
<title>Significance of learning with individual HRTFs</title>
<p>Individual raw data collected at the pre-test and the post-test for the two groups are provided for the up/down and front/back dimensions in Figures <xref ref-type="fig" rid="F8">8</xref>, <xref ref-type="fig" rid="F9">9</xref>, respectively. In the up/down dimension, the listeners from the test group mostly showed substantial training-induced improvement in performance (i.e., post-test responses closer to perfect performance than pre-test responses, see left panels in Figure <xref ref-type="fig" rid="F8">8</xref>), but those from the control group showed little or no improvement (see right panels in Figure <xref ref-type="fig" rid="F8">8</xref>). For up/down errors, many listeners from the test group (2, 4, and 4/7 for high, middle, and low target elevations, respectively) but only a few listeners from the control group (2, 1, and 2/8, respectively) showed significant learning (see filled symbols above the dashed lines in Figures <xref ref-type="fig" rid="F10">10A&#x02013;C</xref>). Up &#x02192; down reversals were infrequent prior to training but nonetheless significantly decreased with training for one listener from the test group and for two listeners from the control group (see Figure <xref ref-type="fig" rid="F10">10D</xref>). Down &#x02192; up reversals were frequent prior to training and significantly decreased with training for four listeners from the test group but for no listener from the control group (see filled symbols above the dashed line in Figure <xref ref-type="fig" rid="F10">10E</xref>). In the front/back dimension, post-test responses were similar to pre-test responses for all except one listener (L27) from the control group (see right panels in Figure <xref ref-type="fig" rid="F9">9</xref>), but frequently came closer to perfect performance with training for the test group, particularly for targets presented in front (see left panels in Figure <xref ref-type="fig" rid="F9">9</xref>). Learning as assessed on front/back reversal rates was significant for three listeners from the test group but for no listener from the control group (see filled symbols above the dashed line in Figure <xref ref-type="fig" rid="F10">10F</xref>).</p>
<fig id="F8" position="float">
<label>Figure 8</label>
<caption><p><bold>Individual judgment position against target position with individual HRTFs at the pre- and post-tests (black and gray dots, respectively) for the test and control listeners (left and right columns, respectively) in the up/down dimension</bold>. Each panel couple is for a different listener.</p></caption>
<graphic xlink:href="fnins-08-00451-g0008.tif"/>
</fig>
<fig id="F9" position="float">
<label>Figure 9</label>
<caption><p><bold>Same as in Figure <xref ref-type="fig" rid="F8">8</xref> but for front/back dimension</bold>.</p></caption>
<graphic xlink:href="fnins-08-00451-g0009.tif"/>
</fig>
<fig id="F10" position="float">
<label>Figure 10</label>
<caption><p><bold>Individual learning amounts (pre-test minus post-test localization score) against pre-test scores for the test and control listeners (blue and pink symbols, respectively) with individual HRTFs</bold>. <bold>(A&#x02013;C)</bold> Up/down errors (in &#x000B0;) for high, middle, and low target elevations. <bold>(D&#x02013;F)</bold> Up &#x02192; down, down &#x02192; up, and front/back reversal rates (in %). Filled symbols indicate the listeners with significant difference between pre- and post-tests according to Wilcoxon tests.</p></caption>
<graphic xlink:href="fnins-08-00451-g0010.tif"/>
</fig>
<p>At the pre-test, no significant difference was observed between the test and control groups (up/down errors: 16 &#x000B1; 4&#x000B0; vs. 18 &#x000B1; 2&#x000B0;, <italic>p</italic> &#x0003D; 0.28 for high elevations, 24 &#x000B1; 6&#x000B0; vs. 25 &#x000B1; 10&#x000B0;, <italic>p</italic> &#x0003D; 0.87 for middle elevations, 24 &#x000B1; 7&#x000B0; vs. 22 &#x000B1; 7&#x000B0;, <italic>p</italic> &#x0003D; 0.61 for low elevations; up &#x02192; down reversals: 2 &#x000B1; 4% vs. 3 &#x000B1; 3%, <italic>p</italic> &#x0003D; 0.44; down &#x02192; up reversals: 20 &#x000B1; 22% vs. 19 &#x000B1; 12%, <italic>p</italic> &#x0003D; 0.69; front/back reversals: 38 &#x000B1; 8% vs. 32 &#x000B1; 6%, <italic>p</italic> &#x0003D; 0.19). At the post-test, the test group had significantly smaller up/down errors for middle and low target elevations, and smaller down &#x02192; up reversal rates, than the control group (22 &#x000B1; 6&#x000B0; vs. 27 &#x000B1; 7&#x000B0;, <italic>p</italic> &#x0003D; 0.004, 15 &#x000B1; 3&#x000B0; vs. 21 &#x000B1; 15&#x000B0;, <italic>p</italic> &#x0003D; 0.02, and 12 &#x000B1; 9% vs. 23 &#x000B1; 20%, <italic>p</italic> &#x0003D; 0.01, respectively). However, no significant between-group difference was observed in up/down errors for high target elevations and in up &#x02192; down reversals (15 &#x000B1; 3&#x000B0; vs. 15 &#x000B1; 2&#x000B0;, <italic>p</italic> &#x0003D; 0.54 and 2 &#x000B1; 2% vs. 0.3 &#x000B1; 2%, <italic>p</italic> &#x0003D; 0.17, respectively).</p>
</sec>
<sec>
<title>Relationship between learning amount and pre-test results with individual HRTFs</title>
<p>The correlations between learning amount and pre-test score were assessed for each variable and group. For up/down errors, learning significantly increased with the pre-test score for the test group (<italic>R</italic> &#x0003D; 0.96, <italic>p</italic> &#x0003D; 0.003 for all target elevations), whereas no correlation was found for the control group (<italic>R</italic> &#x0003D; 0.14, <italic>p</italic> &#x0003D; 0.75; <italic>R</italic> &#x0003D; 0.31, <italic>p</italic> &#x0003D; 0.46; <italic>R</italic> &#x0003D; 0.50, <italic>p</italic> &#x0003D; 0.22 for high, middle, and low elevations, respectively). For up/down reversals, the correlations were significant for the test group (up &#x02192; down: <italic>R</italic> &#x0003D; 0.93, <italic>p</italic> &#x0003D; 0.003; down &#x02192; up: <italic>R</italic> &#x0003D; 0.98, <italic>p</italic> &#x0003C; 0.001) but were not for the control group (up &#x02192; down: <italic>R</italic> &#x0003D; 0.55, <italic>p</italic> &#x0003D; 0.17; down &#x02192; up: <italic>R</italic> &#x0003D; 0.49, <italic>p</italic> &#x0003D; 0.22). For front/back reversals, no correlation was significant (test group: <italic>R</italic> &#x0003D; 0.75, <italic>p</italic> &#x0003D; 0.07; control group: <italic>R</italic> &#x0003D; &#x02212;0.02, <italic>p</italic> &#x0003D; 0.98).</p>
<p>Furthermore, to check whether the improvement in performance reflected or not an adaptation to errors in DTF computation (see Section Measurement and Spectral Characterization of HRTFs), the correlations between learning amount and ISD between correct and incorrect DTFs were assessed. No <italic>positive</italic> correlation was found for any variable and group (test group: <italic>R</italic> &#x0003D; 0.07, <italic>p</italic> &#x0003D; 0.91; <italic>R</italic> &#x0003D; &#x02212;0.07, <italic>p</italic> &#x0003D; 0.91; <italic>R</italic> &#x0003D; &#x02212;0.79, <italic>p</italic> &#x0003D; 0.048 for high, middle, and low elevations, respectively. <italic>R</italic> &#x0003D; 0.68, <italic>p</italic> &#x0003D; 0.11; <italic>R</italic> &#x0003D; &#x02212;0.29, <italic>p</italic> &#x0003D; 0.56; <italic>R</italic> &#x0003D; &#x02212;0.07, <italic>p</italic> &#x0003D; 0.91 for up &#x02192; down, down &#x02192; up, and front/back reversals, respectively. Control group: <italic>R</italic> &#x0003D; &#x02212;0.16, <italic>p</italic> &#x0003D; 0.71; <italic>R</italic> &#x0003D; 0.30, <italic>p</italic> &#x0003D; 0.47; <italic>R</italic> &#x0003D; 0.01, <italic>p</italic> &#x0003D; 0.98 for high, middle, and low elevations, respectively. <italic>R</italic> &#x0003D; 0.20, <italic>p</italic> &#x0003D; 0.63; <italic>R</italic> &#x0003D; 0.61, <italic>p</italic> &#x0003D; 0.11; <italic>R</italic> &#x0003D; &#x02212;0.08, <italic>p</italic> &#x0003D; 0.84 for up &#x02192; down, down &#x02192; up, and front/back reversals, respectively).</p>
</sec>
<sec>
<title>Retention of learning with individual HRTFs</title>
<p>All listeners with significant learning at the immediate post-test showed no significant difference in score between immediate and long-term post-tests (3/3 in the test group for down &#x02192; up reversals and 2/2 in the control group for up &#x02192; down reversals; 1/1, 3/3, and 3/3 in the test group and 2/2, 1/1, and 2/2 in the control group for up/down angular errors for high, middle and low elevations, respectively; 2/2 in the test group for front/back reversals).</p>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<sec>
<title>Role of acoustical factors</title>
<p>To examine the contribution of acoustical factors to sound localization abilities with virtual sources, we assessed for 20 na&#x000EF;ve listeners the relationship between the spectral strength and the localization performance with individual HRTFs, the difference in performance between individual and non-individual HRTFs (normal and modified cues), and its relationship with the ISD between HRTFs. Localization performance was measured in terms of up/down angular errors following correction of reversals for three target elevations (high, middle, low), up &#x02192; down reversals, down &#x02192; up reversals, and front/back reversals rates. We found no relationship between spectral strength and performance with individual HRTFs, nor between behavioral difference and ISD between HRTFs. The only sizeable difference in performance between HRTFs appeared in the low elevation region. In that region, where the acoustical differences between HRTFs (in terms of spectral strength and ISD) were the largest, we noted that the target was perceived in the lower (i.e., correct) hemisphere with individual HRTFs but in the upper (i.e., incorrect) hemisphere with non-individual HRTFs. Past studies involving trained listeners found sizeable differences in localization performance between individual and non-individual HRTFs in both front/back and up/down dimensions (M&#x000F8;ller et al., <xref ref-type="bibr" rid="B29">1996</xref>; Middlebrooks, <xref ref-type="bibr" rid="B28">1999b</xref>). Those involving na&#x000EF;ve listeners reported little or no difference in the front/back dimension (Bronkhorst, <xref ref-type="bibr" rid="B10">1995</xref>; Begault et al., <xref ref-type="bibr" rid="B8">2001</xref>), as for the present study, but they also reported no difference in the up/down dimension, contrary to the present study.</p>
<p>Concerning the front/back dimension, the present findings indicate that the lack of difference in past studies was unlikely due to a floor effect in the (poor) performance of listeners with no prior experience in the task (Bronkhorst, <xref ref-type="bibr" rid="B10">1995</xref>), or to an insufficient ISD between individual and non-individual HRTFs (Middlebrooks, <xref ref-type="bibr" rid="B28">1999b</xref>). First, our listeners performed procedural training prior to auditory tests, which prevented exposure to the experimental environment and response device from affecting the results. Second, the lack of behavioral difference between HRTFs in the auditory task was observed regardless of whether the listener had good or poor performance. Third, most values of ISD between individual and non-individual HRTFs were assumed to be sufficiently large to affect behavioral results according to the results from a past study (Middlebrooks, <xref ref-type="bibr" rid="B28">1999b</xref>).</p>
<p>Front/back reversal rates were substantially higher in the present study using individual HRTFs than in free-field past studies (Wightman and Kistler, <xref ref-type="bibr" rid="B43">1989</xref>; Carlile et al., <xref ref-type="bibr" rid="B13">1997</xref>; Martin et al., <xref ref-type="bibr" rid="B27a">2001</xref>). Higher front/back reversal rates for virtual sources presented with individual cues than for real sources have previously been reported (Wightman and Kistler, <xref ref-type="bibr" rid="B43">1989</xref>; Middlebrooks, <xref ref-type="bibr" rid="B28">1999b</xref>). These difference could possibly result from headphone transfer function issues (Wightman and Kistler, <xref ref-type="bibr" rid="B42">2005</xref>), degree of spatial resolution during the HRTF measurement, and/or errors in DTF computation (present study, see Section Measurement and Spectral Characterization of HRTFs). In the present study, the error in DTF computation was present in both individual and non-individual HRTFs, and could therefore have reduced the behavioral differences between HRTFs.</p>
<p>Concerning the up/down dimension, the discrepancy between the present study and Bronkhorst (<xref ref-type="bibr" rid="B10">1995</xref>) and Begault et al. (<xref ref-type="bibr" rid="B8">2001</xref>) studies could arise from methodological issues. Bronkhorst used other listeners&#x00027; HRTFs as non-individual HRTFs. Given our observations, this has probably reduced the differences in spectral strength&#x02014;and therefore the behavioral differences&#x02014;between individual and non-individual HRTFs. In the Begault et al. (<xref ref-type="bibr" rid="B8">2001</xref>) study, the auditory target positions were limited to the horizontal plane, excluding the low elevation region where we observed the strongest difference between individual and non-individual HRTFs.</p>
<p>We also suggested that the discrepancy between the four past studies (Bronkhorst, <xref ref-type="bibr" rid="B10">1995</xref>; M&#x000F8;ller et al., <xref ref-type="bibr" rid="B29">1996</xref>; Middlebrooks, <xref ref-type="bibr" rid="B28">1999b</xref>; Begault et al., <xref ref-type="bibr" rid="B8">2001</xref>) could arise from differences in experimental protocol (see Footnote 1). In the present study, we used a &#x0201C;classical&#x0201D; protocol, which resembles the protocol used in a past study that reported a difference between HRTFs (Middlebrooks, <xref ref-type="bibr" rid="B28">1999b</xref>). Beyond differences in the listener&#x00027;s characteristics (na&#x000EF;ve in the present study but trained in the past study), we explain the discrepancy between the present and Middlebrooks&#x00027;s studies in terms of data analysis. Middlebrooks assessed reversals without distinction between the up/down and front/back dimensions, and angular (polar) errors following correction of reversals using a more conservative criterion than ours.</p>
<p>To sum-up, the lack of correlation between spectral strength and performance with individual HRTFs showed that this acoustical factor is not a good predictor of performance. Another acoustical factor is the degree of matching between the listener&#x00027;s individual localization cues and those provided by the signal to localize. Our results suggest that large mismatch is needed to produce behavioral effects. However, the validity of this statement is limited by the remaining uncertainty in the quality of the HRTFs.</p>
</sec>
<sec>
<title>Role of perceptual factors</title>
<p>To examine the contribution of perceptual factors to sound localization abilities with virtual sources, a subset of 15 listeners performed training to the sound localization task with fixed acoustical cues (individual HRTFs). The listeners were provided with either sensory (visual) or no correct-answer feedback. We expected the training regimen to elicit perceptual learning, that is, an improvement in the perceptual processes involved in the analysis of acoustical cues, for the &#x0201C;test&#x0201D; group who received feedback. Beyond the use of feedback, the perceptual and procedural contributions to training-induced improvements in performance are rarely separated (Robinson and Summerfield, <xref ref-type="bibr" rid="B35">1996</xref>; Wright and Fitzgerald, <xref ref-type="bibr" rid="B47">2001</xref>). In the present study, the improvement observed following auditory training was unlikely to be triggered by procedural learning for several reasons. First, the listeners performed procedural training with non-auditory stimuli over 3 days prior to sound localization tests, which resulted in optimal and steady ability to handle the response device. Second, further exposure to the procedural aspects of the task during auditory training resulted in significant improvements for only a few listeners from the control group. Third, individual differences in learning amount were larger in the present study (see Figure <xref ref-type="fig" rid="F10">10</xref>) than those reported for procedural learning in a past study (training to interaural time and level differences, Wright and Fitzgerald, <xref ref-type="bibr" rid="B47">2001</xref>). In addition, we observed that the training-induced improvements were retained after 1 month. This suggests that the improvement was not due to modification of the listening strategy, or to a temporary increase in the listener&#x00027;s attentional resources (Goldstone, <xref ref-type="bibr" rid="B19">1998</xref>).</p>
<p>It could seem counter-intuitive that an improvement in sound localization performance is still possible despite a lifetime of localization learning. However, training-induced improvements with normal cues and correct-answer feedback have been reported in previous studies, including for the &#x0201C;most robust&#x0201D; localization ability (i.e., localization of real sources in the left/right dimension, see Savel, <xref ref-type="bibr" rid="B37">2009</xref>; Irving and Moore, <xref ref-type="bibr" rid="B21">2011</xref>). Moreover, improvements in the front/back dimension could result from increased weighting of spectral cues but decreased weighting of dynamic cues&#x02014;available in everyday life conditions but unavailable in the present experiment (Wightman and Kistler, <xref ref-type="bibr" rid="B46">1999</xref>)&#x02014;to front/back discrimination following training. Part of the training-induced improvement observed with individual HRTFs could result from exposure to abnormal cues (i.e., incorrect DTFs). In agreement, there are multiple reports of learning of&#x02014;adaptation to&#x02014;abnormal spectral cues with exposure (Hofman et al., <xref ref-type="bibr" rid="B20">1998</xref>; Van Wanrooij and Van Opstal, <xref ref-type="bibr" rid="B39">2005</xref>; Carlile and Blackman, <xref ref-type="bibr" rid="B12">2013</xref>). However, the ISD between normal and abnormal spectral cues (i.e., between correct and incorrect DTFs, see Table <xref ref-type="table" rid="TA1">1</xref> and Appendix) in the present study was probably too small to produce significant improvement (Van Wanrooij and Van Opstal, <xref ref-type="bibr" rid="B39">2005</xref>). Moreover, no positive correlation was found between the amount of improvement and the ISD between correct and incorrect DTFs.</p>
<p>Our findings confirm the results of a previous study that reported substantial improvement in sound localization with individual HRTFs after a similar training protocol (Majdak et al., <xref ref-type="bibr" rid="B25">2010</xref>). Our results indicate furthermore that this improvement might not be explained by procedural learning.</p>
<p>As perceptual learning is often stimulus-specific, findings of a generalization of learning to untrained stimuli or conditions are mostly believed to reflect task or procedural learning (Wright and Zhang, <xref ref-type="bibr" rid="B48">2009</xref>). However, it has been suggested that generalization could also reflect perceptual learning (Ahissar, <xref ref-type="bibr" rid="B1">2001</xref>). In this case, the learning involves&#x02014;often high level&#x02014;sensory processes that are not specific to the task. In the present study, we assessed whether the listeners from the test and control groups who showed significant learning following auditory training in the trained condition (individual HRTFs) also showed significant learning in an untrained condition (non-individual HRTFs). No learning generalization was observed for the localization responses in the front/back dimension, but most listeners from the test group showed generalization for up/down reversals and up/down errors. Because these listeners had received procedural training, we assume that the generalization was perceptual. The generalization observed could mean that the training improved sensory processes that are not specific to sound localization with individual HRTFs. One of these processes could be, for example, the analysis of the spectral shape of the stimulus (And&#x000E9;ol et al., <xref ref-type="bibr" rid="B5">2013</xref>), a process that is involved regardless of the HRTFs set. Overall, the results indicate that training-induced modifications of perceptual processes had substantial effects on localization performance with virtual sources.</p>
<p>Moreover, we found that the training-induced learning amount was related to the pre-training performance (i.e., poorer initial performance led to larger learning amount), a result also observed in several previous studies (Wright and Fitzgerald, <xref ref-type="bibr" rid="B47">2001</xref>; Amitay et al., <xref ref-type="bibr" rid="B3">2005</xref>; Astle et al., <xref ref-type="bibr" rid="B7">2013</xref>). This correlation is in favor of a contribution of common&#x02014;here perceptual&#x02014;factors to the two metrics. In other words, our results suggest that perceptual processes account for individual differences in sound localization abilities with virtual sources in na&#x000EF;ve listeners.</p>
<p>Taken together, these results are consistent with a large contribution of perceptual processes to sound localization abilities with virtual sources. Majdak et al. (<xref ref-type="bibr" rid="B24">2014</xref>) recently reached a similar conclusion using a sound localization model. By modifying model parameters relative to acoustical or non-acoustical factors, they found that non-acoustical factors (such as for example perceptual abilities to process localization cues) were better predictors of performance than acoustical factors (quality of the directional cues in the HRTFs).</p>
</sec>
</sec>
<sec sec-type="conclusion" id="s5">
<title>Conclusion</title>
<p>The study assessed the contributions of acoustical and perceptual factors to the ability to localize virtual sound sources presented in quiet for na&#x000EF;ve normal-hearing young adults. The spectral strength of the HRTFs did not seem to be a relevant acoustical factor to account for localization performance. Only large modifications of acoustical localization cues seemed to produce behavioral effects, although technical issues with the normalization of the HRTFs might have blurred part of the results. Auditory training with visual correct-answer feedback and constant acoustical cues substantially improved performance. These findings are consistent with a greater role of perceptual factors than of acoustical factors in sound localization abilities with virtual sources. Further research is needed to assess whether the present results generalize to the case of localization in free field.</p>
<sec>
<title>Conflict of interest statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p></sec>
</sec>
</body>
<back>
<ack>
<p>This work was supported in part by the French Procurement Agency (Direction G&#x000E9;n&#x000E9;rale de l&#x00027;Armement, DGA). The authors thank Jean Christophe Bouy for software development, Lionel Pellieux for HRTFs measurements and signal processing manipulations, and the two reviewers for many helpful comments.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ahissar</surname> <given-names>M.</given-names></name></person-group> (<year>2001</year>). <article-title>Perceptual training: a tool for both modifying the brain and exploring it</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>98</volume>, <fpage>11842</fpage>&#x02013;<lpage>11843</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.221461598</pub-id><pub-id pub-id-type="pmid">11592994</pub-id></citation>
</ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Amitay</surname> <given-names>S.</given-names></name> <name><surname>Halliday</surname> <given-names>L.</given-names></name> <name><surname>Taylor</surname> <given-names>J.</given-names></name> <name><surname>Sohoglu</surname> <given-names>E.</given-names></name> <name><surname>Moore</surname> <given-names>D. R.</given-names></name></person-group> (<year>2010</year>). <article-title>Motivation and intelligence drive auditory perceptual learning</article-title>. <source>PLoS ONE</source> <volume>5</volume>:<fpage>e9816</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0009816</pub-id><pub-id pub-id-type="pmid">20352121</pub-id></citation>
</ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Amitay</surname> <given-names>S.</given-names></name> <name><surname>Hawkey</surname> <given-names>D. J. C.</given-names></name> <name><surname>Moore</surname> <given-names>D. R.</given-names></name></person-group> (<year>2005</year>). <article-title>Auditory frequency discrimination learning is affected by stimulus variability</article-title>. <source>Percept. Psychophys</source>. <volume>67</volume>, <fpage>691</fpage>&#x02013;<lpage>698</lpage>. <pub-id pub-id-type="doi">10.3758/BF03193525</pub-id><pub-id pub-id-type="pmid">16134462</pub-id></citation>
</ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>And&#x000E9;ol</surname> <given-names>G.</given-names></name> <name><surname>Guillaume</surname> <given-names>A.</given-names></name> <name><surname>Micheyl</surname> <given-names>C.</given-names></name> <name><surname>Savel</surname> <given-names>S.</given-names></name> <name><surname>Pellieux</surname> <given-names>L.</given-names></name> <name><surname>Moulin</surname> <given-names>A.</given-names></name></person-group> (<year>2011</year>). <article-title>Auditory efferents facilitate sound localization in noise in humans</article-title>. <source>J. Neurosci</source>. <volume>31</volume>, <fpage>6759</fpage>&#x02013;<lpage>6763</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.0248-11.2011</pub-id><pub-id pub-id-type="pmid">21543605</pub-id></citation>
</ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>And&#x000E9;ol</surname> <given-names>G.</given-names></name> <name><surname>Macpherson</surname> <given-names>E. A.</given-names></name> <name><surname>Sabin</surname> <given-names>A. T.</given-names></name></person-group> (<year>2013</year>). <article-title>Sound localization in noise and sensitivity to spectral shape</article-title>. <source>Hear. Res</source>. <volume>304</volume>, <fpage>20</fpage>&#x02013;<lpage>27</lpage>. <pub-id pub-id-type="doi">10.1016/j.heares.2013.06.001</pub-id><pub-id pub-id-type="pmid">23769958</pub-id></citation>
</ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Asano</surname> <given-names>F.</given-names></name> <name><surname>Suzuki</surname> <given-names>Y.</given-names></name> <name><surname>Sone</surname> <given-names>T.</given-names></name></person-group> (<year>1990</year>). <article-title>Role of spectral cues in median plane localization</article-title>. <source>J. Acoust. Soc. Am</source>. <volume>88</volume>, <fpage>159</fpage>&#x02013;<lpage>168</lpage>. <pub-id pub-id-type="doi">10.1038/srep01158</pub-id><pub-id pub-id-type="pmid">2380444</pub-id></citation>
</ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Astle</surname> <given-names>A. T.</given-names></name> <name><surname>Li</surname> <given-names>R. W.</given-names></name> <name><surname>Webb</surname> <given-names>B. S.</given-names></name> <name><surname>Levi</surname> <given-names>D. M.</given-names></name> <name><surname>McGraw</surname> <given-names>P. V.</given-names></name></person-group> (<year>2013</year>). <article-title>A Weber-like law for perceptual learning</article-title>. <source>Sci. Rep</source>. <volume>3</volume>:<fpage>1158</fpage>. <pub-id pub-id-type="doi">10.1038/srep01158</pub-id><pub-id pub-id-type="pmid">23362458</pub-id></citation>
</ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Begault</surname> <given-names>D. R.</given-names></name> <name><surname>Wenzel</surname> <given-names>E. M.</given-names></name> <name><surname>Anderson</surname> <given-names>M. R.</given-names></name></person-group> (<year>2001</year>). <article-title>Direct comparison of the impact of head tracking, reverberation, and individualized head-related transfer functions on the spatial perception of a virtual speech source</article-title>. <source>J. Audio Eng. Soc</source>. <volume>49</volume>, <fpage>904</fpage>&#x02013;<lpage>916</lpage>. <pub-id pub-id-type="pmid">11885605</pub-id></citation>
</ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Best</surname> <given-names>V.</given-names></name> <name><surname>van Schaik</surname> <given-names>A.</given-names></name> <name><surname>Jin</surname> <given-names>C.</given-names></name> <name><surname>Carlile</surname> <given-names>S.</given-names></name></person-group> (<year>2005</year>). <article-title>Auditory spatial perception with sources overlapping in frequency and time</article-title>. <source>Acta Acust. United Acust</source>. <volume>91</volume>, <fpage>421</fpage>&#x02013;<lpage>428</lpage>. <pub-id pub-id-type="pmid">23527271</pub-id></citation>
</ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bronkhorst</surname> <given-names>A. W.</given-names></name></person-group> (<year>1995</year>). <article-title>Localization of real and virtual sound sources</article-title>. <source>J. Acoust. Soc. Am</source>. <volume>98</volume>, <fpage>2542</fpage>&#x02013;<lpage>2553</lpage>. <pub-id pub-id-type="doi">10.1121/1.413219</pub-id></citation>
</ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Butler</surname> <given-names>R. A.</given-names></name> <name><surname>Belendiuk</surname> <given-names>K.</given-names></name></person-group> (<year>1977</year>). <article-title>Spectral cues utilized in the localization of sound in the median sagittal plane</article-title>. <source>J. Acoust. Soc. Am</source>. <volume>61</volume>, <fpage>1264</fpage>&#x02013;<lpage>1269</lpage>. <pub-id pub-id-type="doi">10.1121/1.381427</pub-id><pub-id pub-id-type="pmid">881481</pub-id></citation>
</ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carlile</surname> <given-names>S.</given-names></name> <name><surname>Blackman</surname> <given-names>T.</given-names></name></person-group> (<year>2013</year>). <article-title>Relearning auditory spectral cues for locations inside and outside the visual field</article-title>. <source>J. Assoc. Res. Otolaryngol</source>. <volume>15</volume>, <fpage>249</fpage>&#x02013;<lpage>263</lpage>. <pub-id pub-id-type="doi">10.1007/s10162-013-0429-5</pub-id><pub-id pub-id-type="pmid">24306277</pub-id></citation>
</ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carlile</surname> <given-names>S.</given-names></name> <name><surname>Leong</surname> <given-names>P.</given-names></name> <name><surname>Hyams</surname> <given-names>S.</given-names></name></person-group> (<year>1997</year>). <article-title>The nature and distribution of errors in sound localization by human listeners</article-title>. <source>Hear. Res</source>. <volume>114</volume>, <fpage>179</fpage>&#x02013;<lpage>196</lpage>. <pub-id pub-id-type="doi">10.1016/S0378-5955(97)00161-5</pub-id><pub-id pub-id-type="pmid">9447931</pub-id></citation>
</ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clifton</surname> <given-names>R. K.</given-names></name> <name><surname>Gwiazda</surname> <given-names>J.</given-names></name> <name><surname>Bauer</surname> <given-names>J. A.</given-names></name> <name><surname>Clarkson</surname> <given-names>M. G.</given-names></name> <name><surname>Held</surname> <given-names>R. M.</given-names></name></person-group> (<year>1988</year>). <article-title>Growth in head size during infancy: implications for sound localization</article-title>. <source>Dev. Psychol</source>. <volume>24</volume>, <fpage>477</fpage>&#x02013;<lpage>483</lpage>. <pub-id pub-id-type="doi">10.1037/0012-1649.24.4.477</pub-id></citation>
</ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Djelani</surname> <given-names>T.</given-names></name> <name><surname>Porschmann</surname> <given-names>C.</given-names></name> <name><surname>Sahrhage</surname> <given-names>J.</given-names></name> <name><surname>Blauert</surname> <given-names>J.</given-names></name></person-group> (<year>2000</year>). <article-title>An interactive virtual-environment generator for psychoacoustic research II: collection of head-related impulse responses and evaluation of auditory localization</article-title>. <source>Acta Acust. United Acust</source>. <volume>86</volume>, <fpage>1046</fpage>&#x02013;<lpage>1053</lpage>.</citation>
</ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Drennan</surname> <given-names>W. R.</given-names></name> <name><surname>Watson</surname> <given-names>C. S.</given-names></name></person-group> (<year>2001</year>). <article-title>Sources of variation in profile analysis. I. Individual differences and extended training</article-title>. <source>J. Acoust. Soc. Am</source>. <volume>110</volume>, <fpage>2491</fpage>&#x02013;<lpage>2497</lpage>. <pub-id pub-id-type="doi">10.1121/1.1408310</pub-id><pub-id pub-id-type="pmid">11757938</pub-id></citation>
</ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Garcia</surname> <given-names>A.</given-names></name> <name><surname>Kuai</surname> <given-names>S.-G.</given-names></name> <name><surname>Kourtzi</surname> <given-names>Z.</given-names></name></person-group> (<year>2013</year>). <article-title>Differences in the time course of learning for hard compared to easy training</article-title>. <source>Front. Psychol</source>. <volume>4</volume>:<issue>110</issue>. <pub-id pub-id-type="doi">10.3389/fpsyg.2013.00110</pub-id><pub-id pub-id-type="pmid">23471514</pub-id></citation>
</ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gilkey</surname> <given-names>R. H.</given-names></name> <name><surname>Good</surname> <given-names>M. D.</given-names></name> <name><surname>Ericson</surname> <given-names>M. A.</given-names></name> <name><surname>Brinkman</surname> <given-names>J.</given-names></name> <name><surname>Stewart</surname> <given-names>J. M.</given-names></name></person-group> (<year>1995</year>). <article-title>A pointing technique for rapidly collecting localization responses in auditory research</article-title>. <source>Behav. Res. Methods Instrum. Comput</source>. <volume>27</volume>, <fpage>1</fpage>&#x02013;<lpage>11</lpage>. <pub-id pub-id-type="doi">10.3758/BF03203614</pub-id></citation>
</ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goldstone</surname> <given-names>R. L.</given-names></name></person-group> (<year>1998</year>). <article-title>Perceptual learning</article-title>. <source>Annu. Rev. Psychol</source>. <volume>49</volume>, <fpage>585</fpage>&#x02013;<lpage>612</lpage>. <pub-id pub-id-type="doi">10.1146/annurev.psych.49.1.585</pub-id><pub-id pub-id-type="pmid">9496632</pub-id></citation>
</ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hofman</surname> <given-names>P. M.</given-names></name> <name><surname>Van Riswick</surname> <given-names>J. G.</given-names></name> <name><surname>Van Opstal</surname> <given-names>A. J.</given-names></name></person-group> (<year>1998</year>). <article-title>Relearning sound localization with new ears</article-title>. <source>Nat. Neurosci</source>. <volume>1</volume>, <fpage>417</fpage>&#x02013;<lpage>421</lpage>. <pub-id pub-id-type="doi">10.1038/1633</pub-id><pub-id pub-id-type="pmid">10196533</pub-id></citation>
</ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Irving</surname> <given-names>S.</given-names></name> <name><surname>Moore</surname> <given-names>D. R.</given-names></name></person-group> (<year>2011</year>). <article-title>Training sound localization in normal hearing listeners with and without a unilateral ear plug</article-title>. <source>Hear. Res</source>. <volume>280</volume>, <fpage>100</fpage>&#x02013;<lpage>108</lpage>. <pub-id pub-id-type="doi">10.1016/j.heares.2011.04.020</pub-id><pub-id pub-id-type="pmid">21640176</pub-id></citation>
</ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>King</surname> <given-names>A. J.</given-names></name></person-group> (<year>2009</year>). <article-title>Visual influences on auditory spatial learning</article-title>. <source>Philos. Trans. R. Soc. Lond. B Biol. Sci</source>. <volume>364</volume>, <fpage>331</fpage>&#x02013;<lpage>339</lpage>. <pub-id pub-id-type="doi">10.1098/rstb.2008.0230</pub-id><pub-id pub-id-type="pmid">18986967</pub-id></citation>
</ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kistler</surname> <given-names>D. J.</given-names></name> <name><surname>Wightman</surname> <given-names>F. L.</given-names></name></person-group> (<year>1992</year>). <article-title>A model of head-related transfer functions based on principal components analysis and minimum-phase reconstruction</article-title>. <source>J. Acoust. Soc. Am</source>. <volume>91</volume>, <fpage>1637</fpage>&#x02013;<lpage>1647</lpage>. <pub-id pub-id-type="doi">10.1121/1.402444</pub-id><pub-id pub-id-type="pmid">1564200</pub-id></citation>
</ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Majdak</surname> <given-names>P.</given-names></name> <name><surname>Baumgartner</surname> <given-names>R.</given-names></name> <name><surname>Laback</surname> <given-names>B.</given-names></name></person-group> (<year>2014</year>). <article-title>Acoustic and non-acoustic factors in modeling listener-specific performance of sagittal-plane sound localization</article-title>. <source>Front. Psychol</source>. <volume>5</volume>:<issue>319</issue>. <pub-id pub-id-type="doi">10.3389/fpsyg.2014.00319</pub-id><pub-id pub-id-type="pmid">24795672</pub-id></citation>
</ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Majdak</surname> <given-names>P.</given-names></name> <name><surname>Goupell</surname> <given-names>M. J.</given-names></name> <name><surname>Laback</surname> <given-names>B.</given-names></name></person-group> (<year>2010</year>). <article-title>3-D localization of virtual sound sources: effects of visual environment, pointing method, and training</article-title>. <source>Atten. Percept. Psychophys</source>. <volume>72</volume>, <fpage>454</fpage>&#x02013;<lpage>469</lpage>. <pub-id pub-id-type="doi">10.3758/APP.72.2.454</pub-id><pub-id pub-id-type="pmid">20139459</pub-id></citation>
</ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Makous</surname> <given-names>J. C.</given-names></name> <name><surname>Middlebrooks</surname> <given-names>J. C.</given-names></name></person-group> (<year>1990</year>). <article-title>Two-dimensional sound localization by human listeners</article-title>. <source>J. Acoust. Soc. Am</source>. <volume>87</volume>, <fpage>2188</fpage>&#x02013;<lpage>2200</lpage>. <pub-id pub-id-type="doi">10.1121/1.399186</pub-id><pub-id pub-id-type="pmid">2348023</pub-id></citation>
</ref>
<ref id="B27a">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Martin</surname> <given-names>R. L.</given-names></name> <name><surname>McAnally</surname> <given-names>K. I.</given-names></name> <name><surname>Senova</surname> <given-names>M. A.</given-names></name></person-group> (<year>2001</year>). <article-title>Free-field equivalent localization of virtual audio</article-title>. <source>J. Audio Eng. Soc</source>. <volume>49</volume>, <fpage>14</fpage>&#x02013;<lpage>22</lpage>.</citation>
</ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Middlebrooks</surname> <given-names>J. C.</given-names></name></person-group> (<year>1999a</year>). <article-title>Individual differences in external-ear transfer functions reduced by scaling in frequency</article-title>. <source>J. Acoust. Soc. Am</source>. <volume>106</volume>, <fpage>1480</fpage>&#x02013;<lpage>1492</lpage>. <pub-id pub-id-type="doi">10.1121/1.427176</pub-id><pub-id pub-id-type="pmid">10489705</pub-id></citation>
</ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Middlebrooks</surname> <given-names>J. C.</given-names></name></person-group> (<year>1999b</year>). <article-title>Virtual localization improved by scaling nonindividualized external-ear transfer functions in frequency</article-title>. <source>J. Acoust. Soc. Am</source>. <volume>106</volume>, <fpage>1493</fpage>&#x02013;<lpage>1510</lpage>. <pub-id pub-id-type="doi">10.1121/1.427147</pub-id><pub-id pub-id-type="pmid">10489706</pub-id></citation>
</ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>M&#x000F8;ller</surname> <given-names>H.</given-names></name> <name><surname>S&#x000F8;rensen</surname> <given-names>M. F.</given-names></name> <name><surname>Jensen</surname> <given-names>C. B.</given-names></name> <name><surname>Hammersh&#x000F8;i</surname> <given-names>D.</given-names></name></person-group> (<year>1996</year>). <article-title>Binaural technique: do we need individual recordings?</article-title> <source>J. Audio Eng. Soc</source>. <volume>44</volume>, <fpage>451</fpage>&#x02013;<lpage>469</lpage>.</citation>
</ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Oldfield</surname> <given-names>R. C.</given-names></name></person-group> (<year>1971</year>). <article-title>The assessment and analysis of handedness: the Edinburgh inventory</article-title>. <source>Neuropsychologia</source> <volume>9</volume>, <fpage>97</fpage>&#x02013;<lpage>113</lpage>. <pub-id pub-id-type="doi">10.1016/0028-3932(71)90067-4</pub-id><pub-id pub-id-type="pmid">5146491</pub-id></citation>
</ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Oldfield</surname> <given-names>S. R.</given-names></name> <name><surname>Parker</surname> <given-names>S. P.</given-names></name></person-group> (<year>1984</year>). <article-title>Acuity of sound localisation: a topography of auditory space. I. Normal hearing conditions</article-title>. <source>Perception</source> <volume>13</volume>, <fpage>581</fpage>&#x02013;<lpage>600</lpage>. <pub-id pub-id-type="doi">10.1068/p130581</pub-id><pub-id pub-id-type="pmid">6535983</pub-id></citation>
</ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Otte</surname> <given-names>R. J.</given-names></name> <name><surname>Agterberg</surname> <given-names>M. J. H.</given-names></name> <name><surname>Van Wanrooij</surname> <given-names>M. M.</given-names></name> <name><surname>Snik</surname> <given-names>A. F. M.</given-names></name> <name><surname>Van Opstal</surname> <given-names>A. J.</given-names></name></person-group> (<year>2013</year>). <article-title>Age-related hearing loss and ear morphology affect vertical but not horizontal sound-localization performance</article-title>. <source>J. Assoc. Res. Otolaryngol</source>. <volume>14</volume>, <fpage>261</fpage>&#x02013;<lpage>273</lpage>. <pub-id pub-id-type="doi">10.1007/s10162-012-0367-7</pub-id><pub-id pub-id-type="pmid">23319012</pub-id></citation>
</ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Populin</surname> <given-names>L. C.</given-names></name></person-group> (<year>2008</year>). <article-title>Human sound localization: measurements in untrained, head-unrestrained subjects using gaze as a pointer</article-title>. <source>Exp. Brain Res</source>. <volume>190</volume>, <fpage>11</fpage>&#x02013;<lpage>30</lpage>. <pub-id pub-id-type="doi">10.1007/s00221-008-1445-2</pub-id><pub-id pub-id-type="pmid">18575853</pub-id></citation>
</ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Recanzone</surname> <given-names>G. H.</given-names></name> <name><surname>Makhamra</surname> <given-names>S. D. D. R.</given-names></name> <name><surname>Guard</surname> <given-names>D. C.</given-names></name></person-group> (<year>1998</year>). <article-title>Comparison of relative and absolute sound localization ability in humans</article-title>. <source>J. Acoust. Soc. Am</source>. <volume>103</volume>, <fpage>1085</fpage>&#x02013;<lpage>1097</lpage>. <pub-id pub-id-type="doi">10.1121/1.421222</pub-id><pub-id pub-id-type="pmid">9479763</pub-id></citation>
</ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Robinson</surname> <given-names>K.</given-names></name> <name><surname>Summerfield</surname> <given-names>A. Q.</given-names></name></person-group> (<year>1996</year>). <article-title>Adult auditory learning and training</article-title>. <source>Ear Hear</source>. <volume>17</volume>, <fpage>51S</fpage>&#x02013;<lpage>65S</lpage>. <pub-id pub-id-type="doi">10.1097/00003446-199617031-00006</pub-id><pub-id pub-id-type="pmid">8807276</pub-id></citation>
</ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sabin</surname> <given-names>A. T.</given-names></name> <name><surname>Eddins</surname> <given-names>D. A.</given-names></name> <name><surname>Wright</surname> <given-names>B. A.</given-names></name></person-group> (<year>2012</year>). <article-title>Perceptual learning of auditory spectral modulation detection</article-title>. <source>Exp. Brain Res</source>. <volume>218</volume>, <fpage>567</fpage>&#x02013;<lpage>577</lpage>. <pub-id pub-id-type="doi">10.1007/s00221-012-3049-0</pub-id><pub-id pub-id-type="pmid">22418781</pub-id></citation>
</ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Savel</surname> <given-names>S.</given-names></name></person-group> (<year>2009</year>). <article-title>Individual differences and left/right asymmetries in auditory space perception. I. Localization of low-frequency sounds in free field</article-title>. <source>Hear. Res</source>. <volume>255</volume>, <fpage>142</fpage>&#x02013;<lpage>154</lpage>. <pub-id pub-id-type="doi">10.1016/j.heares.2009.06.013</pub-id><pub-id pub-id-type="pmid">19567263</pub-id></citation>
</ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Strutt</surname> <given-names>J. W.</given-names></name></person-group> (<year>1907</year>). <article-title>On our perception of sound direction</article-title>. <source>Philos. Mag</source>. <volume>13</volume>, <fpage>214</fpage>&#x02013;<lpage>232</lpage>. <pub-id pub-id-type="doi">10.1080/14786440709463595</pub-id></citation>
</ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van Wanrooij</surname> <given-names>M. M.</given-names></name> <name><surname>Van Opstal</surname> <given-names>A. J.</given-names></name></person-group> (<year>2005</year>). <article-title>Relearning sound localization with a new ear</article-title>. <source>Nat. Neurosci</source>. <volume>25</volume>, <fpage>5413</fpage>&#x02013;<lpage>5424</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.0850-05.2005</pub-id><pub-id pub-id-type="pmid">15930391</pub-id></citation>
</ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wenzel</surname> <given-names>E. M.</given-names></name> <name><surname>Arruda</surname> <given-names>M.</given-names></name> <name><surname>Kistler</surname> <given-names>D. J.</given-names></name> <name><surname>Wightman</surname> <given-names>F. L.</given-names></name></person-group> (<year>1993</year>). <article-title>Localization using nonindividualized head-related transfer functions</article-title>. <source>J. Acoust. Soc. Am</source>. <volume>94</volume>, <fpage>111</fpage>&#x02013;<lpage>123</lpage>. <pub-id pub-id-type="doi">10.1121/1.407089</pub-id><pub-id pub-id-type="pmid">8354753</pub-id></citation>
</ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wenzel</surname> <given-names>E. M.</given-names></name> <name><surname>Wightman</surname> <given-names>F. L.</given-names></name> <name><surname>Kistler</surname> <given-names>D. J.</given-names></name> <name><surname>Foster</surname> <given-names>S. H.</given-names></name></person-group> (<year>1988</year>). <article-title>Acoustic origins of individual differences in sound localization behavior</article-title>. <source>J. Acoust. Soc. Am</source>. <volume>84</volume>, <fpage>S79</fpage>. <pub-id pub-id-type="doi">10.1121/1.2026486</pub-id></citation>
</ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wightman</surname> <given-names>F.</given-names></name> <name><surname>Kistler</surname> <given-names>D.</given-names></name></person-group> (<year>2005</year>). <article-title>Measurement and validation of human HRTFs for use in hearing research</article-title>. <source>Acta Acust. United Acust</source>. <volume>91</volume>, <fpage>429</fpage>&#x02013;<lpage>439</lpage>.</citation>
</ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wightman</surname> <given-names>F. L.</given-names></name> <name><surname>Kistler</surname> <given-names>D. J.</given-names></name></person-group> (<year>1989</year>). <article-title>Headphone simulation of free-field listening. II: psychophysical validation</article-title>. <source>J. Acoust. Soc. Am</source>. <volume>85</volume>, <fpage>868</fpage>&#x02013;<lpage>878</lpage>. <pub-id pub-id-type="doi">10.1121/1.397558</pub-id><pub-id pub-id-type="pmid">2926001</pub-id></citation>
</ref>
<ref id="B44">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Wightman</surname> <given-names>F. L.</given-names></name> <name><surname>Kistler</surname> <given-names>D. J.</given-names></name></person-group> (<year>1993</year>). <article-title>Sound localization</article-title>, in <source>Human Psychophysics Springer Handbook of Auditory Research</source>, eds <person-group person-group-type="editor"><name><surname>Yost</surname> <given-names>W. A.</given-names></name> <name><surname>Popper</surname> <given-names>A. N.</given-names></name> <name><surname>Fay</surname> <given-names>R. R.</given-names></name></person-group> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>155</fpage>&#x02013;<lpage>192</lpage>.</citation>
</ref>
<ref id="B45">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Wightman</surname> <given-names>F. L.</given-names></name> <name><surname>Kistler</surname> <given-names>D. J.</given-names></name></person-group> (<year>1997</year>). <article-title>Factors affecting the relative salience of sound localization cues</article-title>, in <source>Binaural and Spatial Hearing in Real and Virtual Environments</source>, eds <person-group person-group-type="editor"><name><surname>Gilkey</surname> <given-names>R. H.</given-names></name> <name><surname>Anderson</surname> <given-names>T. H.</given-names></name></person-group> (<publisher-loc>Mahwah, NJ</publisher-loc>: <publisher-name>Lawrence Erlbaum Associates</publisher-name>), <fpage>1</fpage>&#x02013;<lpage>23</lpage>.</citation>
</ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wightman</surname> <given-names>F. L.</given-names></name> <name><surname>Kistler</surname> <given-names>D. J.</given-names></name></person-group> (<year>1999</year>). <article-title>Resolution of front-back ambiguity in spatial hearing by listener and source movement</article-title>. <source>J. Acoust. Soc. Am</source>. <volume>105</volume>, <fpage>2841</fpage>&#x02013;<lpage>2853</lpage>. <pub-id pub-id-type="doi">10.1121/1.426899</pub-id><pub-id pub-id-type="pmid">10335634</pub-id></citation>
</ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wright</surname> <given-names>B. A.</given-names></name> <name><surname>Fitzgerald</surname> <given-names>M. B.</given-names></name></person-group> (<year>2001</year>). <article-title>Different patterns of human discrimination learning for two interaural cues to sound-source location</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>98</volume>, <fpage>12307</fpage>&#x02013;<lpage>12312</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.211220498</pub-id><pub-id pub-id-type="pmid">11593048</pub-id></citation>
</ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wright</surname> <given-names>B. A.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name></person-group> (<year>2009</year>). <article-title>A review of the generalization of auditory learning</article-title>. <source>Philos. Trans. R. Soc. Lond. B Biol. Sci</source>. <volume>364</volume>, <fpage>301</fpage>&#x02013;<lpage>311</lpage>. <pub-id pub-id-type="doi">10.1098/rstb.2008.0262</pub-id><pub-id pub-id-type="pmid">18977731</pub-id></citation>
</ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>P. X.</given-names></name> <name><surname>Hartmann</surname> <given-names>W. M.</given-names></name></person-group> (<year>2010</year>). <article-title>On the ability of human listeners to distinguish between front and back</article-title>. <source>Hear. Res</source>. <volume>260</volume>, <fpage>30</fpage>&#x02013;<lpage>46</lpage>. <pub-id pub-id-type="doi">10.1016/j.heares.2009.11.001</pub-id><pub-id pub-id-type="pmid">19900525</pub-id></citation>
</ref>
</ref-list>
<app-group>
<app id="A1">
<title>Appendix</title>
<p>An error in DTFs computation was detected following collection of behavioral data. To assess whether this error influenced behavioral results, we compared the performance with individual HRTFs obtained using correct DTFs to that obtained using incorrect DTFs in five listeners. The methods were similar to those used to compare individual and non-individual HRTFs (set of 119 target positions, six repetitions) except that the type of DTFs (correct or incorrect) randomly changed from trial to trial. Each listener performed 1428 trials over 2 days. The first 119 trials of each day, which contained approximately the same number of trials with correct and with incorrect DTFs, were excluded from the analyses. Visual inspection of the raw data in the left/right, front/back, and up/down dimensions showed similar results for correct and incorrect DTFs for each listener (Figure <xref ref-type="fig" rid="FA1">A1</xref>), including listener L22 who had the highest ISD between DTFs (6.6 dB<sup>2</sup>). Wilcoxon tests showed better performance with correct than with incorrect DTFs for only one of 30 comparisons (5 listeners &#x000D7; 6 variables, see Table <xref ref-type="table" rid="TA1">A1</xref>): listener (L22) for up/down errors for high elevations (17&#x000B0; <italic>vs</italic>. 13&#x000B0;, <italic>p</italic> &#x0003D; 0.005). The differences between DTFs for the 5-listener group were not significant (up/down error: 17 &#x000B1; 2&#x000B0; <italic>vs</italic>. 15 &#x000B1; 3&#x000B0;, <italic>p</italic> &#x0003D; 0.06; 26 &#x000B1; 7&#x000B0; <italic>vs</italic>. 26 &#x000B1; 4&#x000B0;, <italic>p</italic> &#x0003D; 0.19; 19&#x000B0; &#x000B1; 4 <italic>vs</italic>. 19 &#x000B1; 6&#x000B0;, <italic>p</italic> &#x0003D; 0.63 for high, middle, and low target elevations, respectively; up &#x02192; down reversals: 2 &#x000B1; 01% <italic>vs</italic>. 2 &#x000B1; 3%, <italic>p</italic> &#x0003D; 0.99; down &#x02192; up reversals: 13 &#x000B1; 9% <italic>vs</italic>. 19 &#x000B1; 14%, <italic>p</italic> &#x0003D; 0.58; Front/back reversals: 38 &#x000B1; 8% <italic>vs</italic>. 32 &#x000B1; 6%, <italic>p</italic> &#x0003D; 0.19).</p>
<fig id="FA1" position="float">
<label>Figure A1</label>
<caption><p><bold>Individual judgment position against target position using correct and incorrect DTFs (black and gray dots, respectively) with individual HRTFs in the left/right, up/down, and front/back dimensions</bold>. Each panel couple is for a different listener (<italic>N</italic> &#x0003D; 5).</p></caption>
<graphic xlink:href="fnins-08-00451-a0001.tif"/>
</fig>
<table-wrap position="float" id="TA1">
<label>Table A1</label>
<caption><p><bold>Comparison between correct and incorrect DTFs for each variable and each listener</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" colspan="3"/>
<th align="center"><bold>L22</bold></th>
<th align="center"><bold>L8</bold></th>
<th align="center"><bold>L13</bold></th>
<th align="center"><bold>L12</bold></th>
<th align="center"><bold>L33</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" colspan="2">Spectral strength of the individual HRTFs (dB<sup>2</sup>)</td>
<td align="left">Incorrect DTFs</td>
<td align="center">21.0</td>
<td align="center">18.3</td>
<td align="center">15.6</td>
<td align="center">15.4</td>
<td align="center">12.8</td>
</tr>
<tr>
<td align="center" colspan="2"/>
<td align="left">Correct DTFs</td>
<td align="center">15.6</td>
<td align="center">15.2</td>
<td align="center">13.3</td>
<td align="center">14.0</td>
<td align="center">11.9</td>
</tr>
<tr>
<td align="left" colspan="2">Inter-DTF (Incorrect &#x02212; Correct) ISD (dB<sup>2</sup>)</td>
<td/>
<td align="center">6.6</td>
<td align="center">3.6</td>
<td align="center">1.8</td>
<td align="center">1.4</td>
<td align="center">1.1</td>
</tr>
<tr>
<td align="left">Up/down error (&#x000B0;)</td>
<td align="left">High elevations</td>
<td align="left">Incorrect DTFs</td>
<td align="center">17</td>
<td align="center">22</td>
<td align="center">15</td>
<td align="center">18</td>
<td align="center">16</td>
</tr>
<tr>
<td/>
<td/>
<td align="left">Correct DTFs</td>
<td align="center">13</td>
<td align="center">19</td>
<td align="center">14</td>
<td align="center">18</td>
<td align="center">15</td>
</tr>
<tr>
<td/>
<td/>
<td align="left">Difference</td>
<td align="left">P &#x0003D; 0.005</td>
<td align="center"><italic>ns</italic></td>
<td align="center"><italic>ns</italic></td>
<td align="center"><italic>ns</italic></td>
<td align="center"><italic>ns</italic></td>
</tr>
<tr>
<td/>
<td align="left">Middle elevations</td>
<td align="left">Incorrect DTFs</td>
<td align="center">26</td>
<td align="center">22</td>
<td align="center">29</td>
<td align="center">21</td>
<td align="center">29</td>
</tr>
<tr>
<td/>
<td/>
<td align="left">Correct DTFs</td>
<td align="center">26</td>
<td align="center">22</td>
<td align="center">29</td>
<td align="center">25</td>
<td align="center">29</td>
</tr>
<tr>
<td/>
<td/>
<td align="left">Difference</td>
<td align="center"><italic>ns</italic></td>
<td align="center"><italic>ns</italic></td>
<td align="center"><italic>ns</italic></td>
<td align="center"><italic>ns</italic></td>
<td align="center"><italic>ns</italic></td>
</tr>
<tr>
<td/>
<td align="left">Low elevations</td>
<td align="left">Incorrect DTFs</td>
<td align="center">20</td>
<td align="center">16</td>
<td align="center">15</td>
<td align="center">29</td>
<td align="center">19</td>
</tr>
<tr>
<td/>
<td/>
<td align="left">Correct DTFs</td>
<td align="center">22</td>
<td align="center">15</td>
<td align="center">15</td>
<td align="center">29</td>
<td align="center">19</td>
</tr>
<tr>
<td/>
<td/>
<td align="left">Difference</td>
<td align="center"><italic>ns</italic></td>
<td align="center"><italic>ns</italic></td>
<td align="center"><italic>ns</italic></td>
<td align="center"><italic>ns</italic></td>
<td align="center"><italic>ns</italic></td>
</tr>
<tr>
<td align="left" colspan="2">Up &#x02192; down reversals (%)</td>
<td align="left">Incorrect DTFs</td>
<td align="center">3</td>
<td align="center">9</td>
<td align="center">2</td>
<td align="center">1</td>
<td align="center">2</td>
</tr>
<tr>
<td align="center" colspan="2"/>
<td align="left">Correct DTFs</td>
<td align="center">1</td>
<td align="center">12</td>
<td align="center">2</td>
<td align="center">0</td>
<td align="center">4</td>
</tr>
<tr>
<td align="center" colspan="2"/>
<td align="left">Difference</td>
<td align="center"><italic>ns</italic></td>
<td align="center"><italic>ns</italic></td>
<td align="center"><italic>ns</italic></td>
<td align="center"><italic>ns</italic></td>
<td align="center"><italic>ns</italic></td>
</tr>
<tr>
<td align="left" colspan="2">Down &#x02192; up reversals (%)</td>
<td align="left">Incorrect DTFs</td>
<td align="center">19</td>
<td align="center">3</td>
<td align="center">13</td>
<td align="center">11</td>
<td align="center">35</td>
</tr>
<tr>
<td align="center" colspan="2"/>
<td align="left">Correct DTFs</td>
<td align="center">24</td>
<td align="center">3</td>
<td align="center">10</td>
<td align="center">19</td>
<td align="center">35</td>
</tr>
<tr>
<td align="center" colspan="2"/>
<td align="left">Difference</td>
<td align="center"><italic>ns</italic></td>
<td align="center"><italic>ns</italic></td>
<td align="center"><italic>ns</italic></td>
<td align="center">P &#x0003D; 0.030</td>
<td align="center"><italic>ns</italic></td>
</tr>
<tr>
<td align="left" colspan="2">Front/back reversals (%)</td>
<td align="left">Incorrect DTFs</td>
<td align="center">26</td>
<td align="center">24</td>
<td align="center">30</td>
<td align="center">19</td>
<td align="center">32</td>
</tr>
<tr>
<td align="center" colspan="2"/>
<td align="left">Correct DTFs</td>
<td align="center">30</td>
<td align="center">21</td>
<td align="center">32</td>
<td align="center">24</td>
<td align="center">32</td>
</tr>
<tr>
<td align="center" colspan="2"/>
<td align="left">Difference</td>
<td align="center"><italic>ns</italic></td>
<td align="center"><italic>ns</italic></td>
<td align="center"><italic>ns</italic></td>
<td align="center"><italic>ns</italic></td>
<td align="center"><italic>ns</italic></td>
</tr>
</tbody>
</table>
</table-wrap>
</app>
</app-group>
<fn-group>
<fn id="fn0001"><p><sup>1</sup>Middlebrooks (<xref ref-type="bibr" rid="B28">1999b</xref>) used a &#x0201C;classical&#x0201D; protocol with an absolute localization task, a virtual sound source simulated in an anechoic environment, a large range of source elevations and azimuths, and constant target/listener distance. M&#x000F8;ller et al., (<xref ref-type="bibr" rid="B29">1996</xref>) used a non-anechoic environment and variable target distances. Bronkhorst (<xref ref-type="bibr" rid="B10">1995</xref>) used a forced-choice localization task. Begault et al. (<xref ref-type="bibr" rid="B8">2001</xref>) restrained the target positions to the horizontal plane.</p></fn>
</fn-group>
</back>
</article>
