<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="discussion">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Comput. Neurosci.</journal-id>
<journal-title>Frontiers in Computational Neuroscience</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Comput. Neurosci.</abbrev-journal-title>
<issn pub-type="epub">1662-5188</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fncom.2016.00124</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Neuroscience</subject>
<subj-group>
<subject>Opinion</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Distorted Low-Level Visual Features Affect Saliency-Based Visual Attention</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Bahmani</surname> <given-names>Hamed</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="author-notes" rid="fn001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/38333/overview"/></contrib>
<contrib contrib-type="author">
<name><surname>Wahl</surname> <given-names>Siegfried</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/387134/overview"/></contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>ZEISS Vision Science Lab, Institute for Ophthalmic Research, Eberhard Karls Universit&#x000E4;t T&#x000FC;bingen</institution> <country>T&#x000FC;bingen, Germany</country></aff>
<aff id="aff2"><sup>2</sup><institution>Physiology of Cognitive Processes, Max Planck Institute for Biological Cybernetics</institution> <country>T&#x000FC;bingen, Germany</country></aff>
<aff id="aff3"><sup>3</sup><institution>Bernstein Center for Computational Neuroscience</institution> <country>T&#x000FC;bingen, Germany</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Si Wu, Beijing Normal University, China</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Mingsha Zhang, Beijing Normal University, China</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Hamed Bahmani <email>hamed.bahmani&#x00040;uni-tuebingen.de</email></p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>29</day>
<month>11</month>
<year>2016</year>
</pub-date>
<pub-date pub-type="collection">
<year>2016</year>
</pub-date>
<volume>10</volume>
<elocation-id>124</elocation-id>
<history>
<date date-type="received">
<day>15</day>
<month>08</month>
<year>2016</year>
</date>
<date date-type="accepted">
<day>15</day>
<month>11</month>
<year>2016</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2016 Bahmani and Wahl.</copyright-statement>
<copyright-year>2016</copyright-year>
<copyright-holder>Bahmani and Wahl</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<kwd-group><kwd>visual attention</kwd>
<kwd>saliency</kwd>
<kwd>distortion</kwd>
<kwd>low-level features</kwd>
<kwd>computational modeling</kwd></kwd-group>
<contract-sponsor id="cn001">Eberhard Karls Universit&#x000E4;t T&#x000FC;bingen<named-content content-type="fundref-id">10.13039/501100002345</named-content></contract-sponsor>
<counts>
<fig-count count="1"/>
<table-count count="0"/>
<equation-count count="0"/>
<ref-count count="27"/>
<page-count count="4"/>
<word-count count="2555"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>Image distortions can attract attention away from the natural scene saliency (Redi et al., <xref ref-type="bibr" rid="B24">2011</xref>). Performance of viewers in visual search tasks and their fixation patterns are also affected by different types and amounts of distortions (Vu et al., <xref ref-type="bibr" rid="B25">2008</xref>). In this paper, we have discussed the opinion that distortions could largely affect the performance of predictive models of visual attention, and simulated the effects of distorted low-level visual features on the saliency-based bottom-up visual attention. Saliency is a fast and pre-attentive mechanism for orienting visual attention to intrinsically important objects which pop-out more easily in a cluttered scene. Distortion of the low-level features that contribute to saliency may impair the readiness of the visual system in detection of salient objects, which may have major implications for critical situations like driving or locomotion. These distortions in natural life can be introduced by eye diseases such as cataract, or spectacles which may alter color perception (de Fez et al., <xref ref-type="bibr" rid="B7">2002</xref>) or cause undesired optical effects like blurring, non-uniform magnification, and image displacement (Barbero and Portilla, <xref ref-type="bibr" rid="B3">2016</xref>). The extent to which each of these distorted saliency features may affect the attentional performance is addressed in this paper by employing a biologically-inspired predictive model of visual attention. We briefly summarize the current standing of computational work on visual attention models in the following section and suggest a simple and influential model of saliency to examine the above hypothesis. Furthermore, we demonstrate in an example the hindered performance of the predictive saliency model on distorted images.</p>
</sec>
<sec id="s2">
<title>Models of visual attention</title>
<p>Despite the widely shared belief in general public that we see everything around us, only a small fraction of the information registered by the visual system reaches levels of processing that mediate perception and directly influence behavior (Itti and Koch, <xref ref-type="bibr" rid="B11">2000</xref>). Selective attention is the key to this process which turns looking into seeing (Carrasco, <xref ref-type="bibr" rid="B6">2011</xref>). But how does the visual system select and enhance the representation of one particular feature or spatial location over less relevant features and locations? Much evidence has accumulated in favor of a two-component framework for the control of where in a visual scene attention is deployed: a bottom-up, fast, and image-based mechanism that biases the observer toward selecting stimuli based on their saliency, and a second slower, top-down mechanism, which uses task-dependent cues to direct the spotlight of attention under voluntary control. Koch and Ullman (<xref ref-type="bibr" rid="B16">1985</xref>) introduced the idea of a saliency map to accomplish pre-attentive selection. A saliency map is an explicit two-dimensional map that encodes the saliency of visual objects in the environment purely based on the low-level visual attributes of the object (Itti et al., <xref ref-type="bibr" rid="B14">1998</xref>). Competition among neurons in this map gives rise to a single winning location that corresponds to the most salient object, which constitutes the next target. If this location is subsequently inhibited, the system automatically shifts to the next most salient location. This internal dynamic models the saccadic eye movements in visual search. This purely computational hypothesis received experimental support from many electrophysiological studies including single-cell recordings from lateral intraparietal neurons of macaque monkeys which responded to visual stimuli only when those stimuli were made salient (Gottlieb et al., <xref ref-type="bibr" rid="B10">1998</xref>).</p>
<p>Today, more than fifty quantitative models for saliency and fixation prediction are available which have been accumulated over the past 20 years; some of them tried to also incorporate top-down attention (Borji and Itti, <xref ref-type="bibr" rid="B4">2013</xref>; Borji et al., <xref ref-type="bibr" rid="B5">2013</xref>) or context-aware saliency detection (Goferman et al., <xref ref-type="bibr" rid="B9">2012</xref>). However, not all of them are biologically plausible (Zhang and Sclaroff, <xref ref-type="bibr" rid="B27">2013</xref>) or explain low-level visual features (K&#x000FC;mmerer et al., <xref ref-type="bibr" rid="B17">2014</xref>); and the metrics used to compare the performance of these models are often different and inconsistent with each other (K&#x000FC;mmerer et al., <xref ref-type="bibr" rid="B18">2015</xref>; Gide et al., <xref ref-type="bibr" rid="B8">2016</xref>). For the purpose of this paper, we chose the original model of saliency-based visual attention for rapid scene analysis by Itti et al. (<xref ref-type="bibr" rid="B14">1998</xref>) for its utmost biological plausibility and simple dichotomy between low-level visual features which allows us to look at different distortion effects on each feature map independently, as well as the final saliency map. In the next section, we explain how this model can be used to simulate the effect of distortions on visual attention and visual search.</p>
</sec>
<sec id="s3">
<title>Attention and image distortions</title>
<p>Visual attention models have been used in many computer vision applications (Pal, <xref ref-type="bibr" rid="B23">2016</xref>), including image and video compression and retrieval (Ouerhani et al., <xref ref-type="bibr" rid="B22">2001</xref>; Li et al., <xref ref-type="bibr" rid="B20">2011</xref>), multimedia technologies (Le Callet and Niebur, <xref ref-type="bibr" rid="B19">2013</xref>), and 2D (Akamine and Farias, <xref ref-type="bibr" rid="B1">2014</xref>) and 3D (Jiang et al., <xref ref-type="bibr" rid="B15">2014</xref>) image quality assessment. Moreover, researchers have conducted eye-tracking experiments to judge image quality instead of using a computational model for visual attention, thus making the results independent of the reliability of an attention model (Vu et al., <xref ref-type="bibr" rid="B25">2008</xref>; Liu and Heynderickx, <xref ref-type="bibr" rid="B21">2011</xref>; Redi et al., <xref ref-type="bibr" rid="B24">2011</xref>). A very recent paper (Gide et al., <xref ref-type="bibr" rid="B8">2016</xref>, available online) has evaluated several state-of-the-art visual attention models over multiple databases consisting of distorted images with various types and levels of distortions. However, types of distortions that have been addressed in this paper (e.g., blur, noise and compression) are different from the low-level features taken into account by our selected model (intensity, color, and orientation); and are rather important from image processing points of view such as image acquisition, compression and transmission. In this opinion paper, we suggest a biologically plausible model for a similar purpose, which is applicable to more clinical and behavioral studies like simulating eye diseases or effects of eyewear. It can be also used to study the effect of distortions on visual attention in natural circumstances like driving and sports.</p>
<p>As a demonstration of the proposed opinion, we simulated an attention-demanding detection task while driving in a purely computational environment. We tested the model of bottom-up visual attention for the effects of distortions on a traffic sign search task, using manipulated static images containing different traffic signs in cluttered scenes. If a stimulus like traffic sign is sufficiently salient, it will pop out of the visual scene. The speed of this saliency-based form of attention is on the order of 25&#x02013;50 ms per item (Itti and Koch, <xref ref-type="bibr" rid="B12">2001a</xref>) which is sufficiently fast for tasks like driving. We used a simple performance measure similar to Bahmani et al. (<xref ref-type="bibr" rid="B2">2008</xref>) to quantify the goodness of the model in detecting targets on distorted images. We compared the performance of the model on natural and distorted images in Figure <xref ref-type="fig" rid="F1">1</xref> by illustrating the number of false hits before detecting the target. We modified four original images (left panels) containing one or more traffic signs in a natural cluttered scene from a traffic sign image database (Itti and Koch, <xref ref-type="bibr" rid="B13">2001b</xref>; available on Itti&#x00027;s Lab homepage: <ext-link ext-link-type="uri" xlink:href="http://ilab.usc.edu">ilab.usc.edu</ext-link>) to obtain simulated barrel-lens-aberration distortion, manipulated color contrast, motion-blurred distortion, and distorted contrast/intensity versions (right panels). We then fed these images to the Saliency model (Walther and Koch, <xref ref-type="bibr" rid="B26">2006</xref>; available online at: <ext-link ext-link-type="uri" xlink:href="http://www.saliencytoolbox.net">www.saliencytoolbox.net</ext-link>) and compared the performance of the visual search on original and distorted images. The model does not need a training phase or parameter setting and generates the simulated saccadic path of human observer instantly. As already noticeable in Figure <xref ref-type="fig" rid="F1">1</xref>, the main target (inside the green circle on the left panels) has been either not detected, or detected late after a few saccades on distorted images (see scan paths on the right panels). In all examples the visual search for the target starts with the green circle which indicates the first hit, and continues up to six points on the image creating a scan path which simulates saccades.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p><bold>Natural images consisting of one or more traffic signs in a cluttered scene and their simulated distorted versions are fed to a bottom-up visual attention model predicting the performance of human vision as the number of simulated saccades before detecting the target</bold>. Green circle in all panels indicates the &#x0201C;first hit&#x0201D; predicted as the most salient location or object by the model, while yellow circles, and the path among them show next <italic>salient</italic> spots on the image and the scan path respectively (<italic>simulated saccades</italic>). From the top to the bottom, right panels are the barrel-lens aberration distortion, altered color, motion blurred, and altered intensity/contrast version of the corresponding left panel.</p></caption>
<graphic xlink:href="fncom-10-00124-g0001.tif"/>
</fig>
</sec>
<sec sec-type="conclusions" id="s4">
<title>Conclusion</title>
<p>The ability to direct the gaze as fast as possible toward salient objects in cluttered scenes has evolutionary significance because it allows the animal to detect quickly prey and predators in the visual world. In modern human era, we are not concerned much about predators, but our daily life and social interactions rely heavily on a rapid and reliable visual processing. Impairment of a fast pre-attentive mechanism for visual search has major implications in situations like driving, where rapid reaction to visual stimuli is important. In this paper, we proposed the use of a predictive model to evaluate the effects of distorted early visual features on saliency. Furthermore, we provided examples of distorted natural images with altered colors, orientation and contrast components and tested the performance of a simulated visual search task driven by bottom-up visual attention mechanism. Our results confirm a substantial effect of distorted early visual features on detectability of salient objects in natural cluttered scenes. These effects should be taken into account in situations which high levels of visual attention is required. Moreover, we suggest further experiments to compare the performance of computational models of visual attention with human observers to confirm the results of this simulation, or discover differences in robustness of biological and computational visual systems to distortions.</p>
</sec>
<sec id="s5">
<title>Author contributions</title>
<p>HB and SW conceived the opinion and wrote the paper.</p>
</sec>
<sec id="s6">
<title>Funding</title>
<p>The work was supported by the Eberhard-Karls-University Tuebingen as part of the German Excellence Initiative from the Federal Ministry of Education and Research (BMBF).</p>
<sec>
<title>Conflict of interest statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The reviewer MZ and handling Editor declared their shared affiliation, and the handling Editor states that the process nevertheless met the standards of a fair and objective review.</p></sec>
</sec>
</body>
<back>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Akamine</surname> <given-names>W. Y. L.</given-names></name> <name><surname>Farias</surname> <given-names>M. C. Q.</given-names></name></person-group> (<year>2014</year>). <article-title>Incorporating visual attention models into video quality metrics</article-title>, in <source>90160O</source> (<publisher-loc>San Francisco</publisher-loc>).</citation>
</ref>
<ref id="B2">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Bahmani</surname> <given-names>H.</given-names></name> <name><surname>Nasrabadi</surname> <given-names>A. M.</given-names></name> <name><surname>Gholpayeghani</surname> <given-names>M. R. H.</given-names></name></person-group> (<year>2008</year>). <article-title>Nonlinear data fusion in saliency-based visual attention</article-title>, in <source>2008 4th International IEEE Conference Intelligent Systems, IS 2008</source> (<publisher-loc>Varna</publisher-loc>), <fpage>327</fpage>&#x02013;<lpage>330</lpage>.</citation>
</ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Barbero</surname> <given-names>S.</given-names></name> <name><surname>Portilla</surname> <given-names>J.</given-names></name></person-group> (<year>2016</year>). <article-title>The relationship between dioptric power and magnification in progressive addition lenses</article-title>. <source>Ophthalmic Physiol. Opt.</source> <volume>36</volume>, <fpage>421</fpage>&#x02013;<lpage>427</lpage>. <pub-id pub-id-type="doi">10.1111/opo.12301</pub-id><pub-id pub-id-type="pmid">27146008</pub-id></citation>
</ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Borji</surname> <given-names>A.</given-names></name> <name><surname>Itti</surname> <given-names>L.</given-names></name></person-group> (<year>2013</year>). <article-title>State-of-the-art in visual attention modeling</article-title>. <source>IEEE Trans. Pattern Anal. Mach. Intell.</source> <volume>35</volume>, <fpage>185</fpage>&#x02013;<lpage>207</lpage>. <pub-id pub-id-type="doi">10.1109/TPAMI.2012.89</pub-id><pub-id pub-id-type="pmid">22487985</pub-id></citation>
</ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Borji</surname> <given-names>A.</given-names></name> <name><surname>Sihite</surname> <given-names>D. N.</given-names></name> <name><surname>Itti</surname> <given-names>L.</given-names></name></person-group> (<year>2013</year>). <article-title>Quantitative analysis of human-model agreement in visual saliency modeling: a comparative study</article-title>. <source>IEEE Trans. Image Process.</source> <volume>22</volume>, <fpage>55</fpage>&#x02013;<lpage>69</lpage>. <pub-id pub-id-type="doi">10.1109/TIP.2012.2210727</pub-id><pub-id pub-id-type="pmid">22868572</pub-id></citation>
</ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carrasco</surname> <given-names>M.</given-names></name></person-group> (<year>2011</year>). <article-title>Visual attention: the past 25 years</article-title>. <source>Vision Res.</source> <volume>51</volume>, <fpage>1484</fpage>&#x02013;<lpage>1525</lpage>. <pub-id pub-id-type="doi">10.1016/j.visres.2011.04.012</pub-id><pub-id pub-id-type="pmid">21549742</pub-id></citation>
</ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Fez</surname> <given-names>M. D.</given-names></name> <name><surname>Luque</surname> <given-names>M. J.</given-names></name> <name><surname>Viqueira</surname> <given-names>V.</given-names></name></person-group> (<year>2002</year>). <article-title>Enhancement of contrast sensitivity and losses of chromatic discrimination with tinted lenses</article-title>. <source>Optom. Vis. Sci.</source> <volume>79</volume>, <fpage>590</fpage>&#x02013;<lpage>597</lpage>. <pub-id pub-id-type="doi">10.1097/00006324-200209000-00010</pub-id><pub-id pub-id-type="pmid">12322929</pub-id></citation>
</ref>
<ref id="B8">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Gide</surname> <given-names>M. S.</given-names></name> <name><surname>Dodge</surname> <given-names>S. F.</given-names></name> <name><surname>Karam</surname> <given-names>L. J.</given-names></name></person-group> (<year>2016</year>). <source>The Effect of Distortions on the Prediction of Visual Attention</source>. <volume>Vol. 53</volume>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://arxiv.org/abs/1604.03882">http://arxiv.org/abs/1604.03882</ext-link></citation>
</ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goferman</surname> <given-names>S.</given-names></name> <name><surname>Zelnik-Manor</surname> <given-names>L.</given-names></name> <name><surname>Tal</surname> <given-names>A.</given-names></name></person-group> (<year>2012</year>). <article-title>Context-aware saliency detection</article-title>. <source>IEEE Trans. Pattern Anal. Mach. Intell.</source> <volume>34</volume>, <fpage>1915</fpage>&#x02013;<lpage>1926</lpage>. <pub-id pub-id-type="doi">10.1109/TPAMI.2011.272</pub-id><pub-id pub-id-type="pmid">22201056</pub-id></citation>
</ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gottlieb</surname> <given-names>J. P.</given-names></name> <name><surname>Kusunoki</surname> <given-names>M.</given-names></name> <name><surname>Goldberg</surname> <given-names>M. E.</given-names></name></person-group> (<year>1998</year>). <article-title>The representation of visual salience in monkey parietal cortex</article-title>. <source>Nature</source> <volume>391</volume>, <fpage>481</fpage>&#x02013;<lpage>484</lpage>. <pub-id pub-id-type="doi">10.1038/35135</pub-id><pub-id pub-id-type="pmid">9461214</pub-id></citation>
</ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Itti</surname> <given-names>L.</given-names></name> <name><surname>Koch</surname> <given-names>C.</given-names></name></person-group> (<year>2000</year>). <article-title>A saliency-based search mechanism for overt and covert shifts of visual attention</article-title>. <source>Vision Res.</source> <volume>40</volume>, <fpage>1489</fpage>&#x02013;<lpage>1506</lpage>. <pub-id pub-id-type="doi">10.1016/S0042-6989(99)00163-7</pub-id><pub-id pub-id-type="pmid">10788654</pub-id></citation>
</ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Itti</surname> <given-names>L.</given-names></name> <name><surname>Koch</surname> <given-names>C.</given-names></name></person-group> (<year>2001a</year>). <article-title>Computational modelling of visual attention</article-title>. <source>Nat. Rev. Neurosci.</source> <volume>2</volume>, <fpage>194</fpage>&#x02013;<lpage>203</lpage>. <pub-id pub-id-type="doi">10.1038/35058500</pub-id><pub-id pub-id-type="pmid">11256080</pub-id></citation>
</ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Itti</surname> <given-names>L.</given-names></name> <name><surname>Koch</surname> <given-names>C.</given-names></name></person-group> (<year>2001b</year>). <article-title>Feature combination strategies for saliency-based visual attention systems</article-title>. <source>J. Electron. Imaging</source> <volume>10</volume>, <fpage>161</fpage>&#x02013;<lpage>169</lpage>. <pub-id pub-id-type="doi">10.1117/1.1333677</pub-id></citation>
</ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Itti</surname> <given-names>L.</given-names></name> <name><surname>Koch</surname> <given-names>C.</given-names></name> <name><surname>Niebur</surname> <given-names>E.</given-names></name></person-group> (<year>1998</year>). <article-title>A model of saliency based visual attention for rapid scene analysis</article-title>. <source>IEEE Trans. Pattern Anal. Mach. Intell.</source> <volume>20</volume>, <fpage>1254</fpage>&#x02013;<lpage>1259</lpage>. <pub-id pub-id-type="doi">10.1016/S1053-5357(00)00088-3</pub-id></citation>
</ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jiang</surname> <given-names>Q.</given-names></name> <name><surname>Duan</surname> <given-names>F.</given-names></name> <name><surname>Shao</surname> <given-names>F.</given-names></name></person-group> (<year>2014</year>). <article-title>3D visual attention for stereoscopic image quality assessment</article-title>. <source>J. Softw.</source> <volume>9</volume>, <fpage>1841</fpage>&#x02013;<lpage>1847</lpage>. <pub-id pub-id-type="doi">10.4304/jsw.9.7.1841-1847</pub-id></citation>
</ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Koch</surname> <given-names>C.</given-names></name> <name><surname>Ullman</surname> <given-names>S.</given-names></name></person-group> (<year>1985</year>). <article-title>Shifts in selective visual attention: towards the underlying neural circuitry</article-title>. <source>Hum. Neurobiol.</source> <volume>4</volume>, <fpage>219</fpage>&#x02013;<lpage>227</lpage>. <pub-id pub-id-type="doi">10.1016/j.imavis.2008.02.004</pub-id><pub-id pub-id-type="pmid">3836989</pub-id></citation>
</ref>
<ref id="B17">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>K&#x000FC;mmerer</surname> <given-names>M.</given-names></name> <name><surname>Theis</surname> <given-names>L.</given-names></name> <name><surname>Bethge</surname> <given-names>M.</given-names></name></person-group> (<year>2014</year>). <article-title>Deep gaze I-boosting saliency prediction with feature maps trained on ImageNet</article-title>, in <source>arXiv1411.1045 [cs, q-bio, stat]</source>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://arxiv.org/abs/1411.1045\nfiles/1004/arXiv-Kummerer_et_al-2014-Deep_Gaze_I-Boosting_Saliency_Prediction_with_Feature_Maps_Trained_on_ImageNet.pdf">http://arxiv.org/abs/1411.1045\nfiles/1004/arXiv-Kummerer_et_al-2014-Deep_Gaze_I-Boosting_Saliency_Prediction_with_Feature_Maps_Trained_on_ImageNet.pdf</ext-link></citation>
</ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>K&#x000FC;mmerer</surname> <given-names>M.</given-names></name> <name><surname>Wallis</surname> <given-names>T. S. A.</given-names></name> <name><surname>Bethge</surname> <given-names>M.</given-names></name></person-group> (<year>2015</year>). <article-title>Information-theoretic model comparison unifies saliency metrics</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>112</volume>, <fpage>16054</fpage>&#x02013;<lpage>16059</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1510393112</pub-id><pub-id pub-id-type="pmid">26655340</pub-id></citation>
</ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Le Callet</surname> <given-names>P.</given-names></name> <name><surname>Niebur</surname> <given-names>E.</given-names></name></person-group> (<year>2013</year>). <article-title>Visual attention and applications in multimedia technologies</article-title>. <source>Proc. IEEE Inst. Electr. Electron Eng</source>. <volume>101</volume>, <fpage>2058</fpage>&#x02013;<lpage>2067</lpage>. <pub-id pub-id-type="doi">10.1109/JPROC.2013.2265801</pub-id><pub-id pub-id-type="pmid">24489403</pub-id></citation>
</ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>Z.</given-names></name> <name><surname>Qin</surname> <given-names>S.</given-names></name> <name><surname>Itti</surname> <given-names>L.</given-names></name></person-group> (<year>2011</year>). <article-title>Visual attention guided bit allocation in video compression</article-title>. <source>Image Vis. Comput.</source> <volume>29</volume>, <fpage>1</fpage>&#x02013;<lpage>14</lpage>. <pub-id pub-id-type="doi">10.1016/j.imavis.2010.07.001</pub-id></citation>
</ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>Heynderickx</surname> <given-names>I.</given-names></name></person-group> (<year>2011</year>). <article-title>Visual attention in objective image quality assessment: based on eye-tracking data</article-title>. <source>IEEE Trans. Circuits Syst. Video Technol.</source> <volume>21</volume>, <fpage>971</fpage>&#x02013;<lpage>982</lpage>. <pub-id pub-id-type="doi">10.1109/TCSVT.2011.2133770</pub-id></citation>
</ref>
<ref id="B22">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Ouerhani</surname> <given-names>N.</given-names></name> <name><surname>Bracamonte</surname> <given-names>J.</given-names></name> <name><surname>Hugli</surname> <given-names>H.</given-names></name> <name><surname>Ansorge</surname> <given-names>M.</given-names></name> <name><surname>Pellandini</surname> <given-names>F.</given-names></name></person-group> (<year>2001</year>). <article-title>Adaptive color image compression based on visual attention</article-title>, in <source>Proceedings of the 11th International Conference on Image Analysis and Processing</source> (<publisher-loc>Palermo</publisher-loc>), <fpage>416</fpage>&#x02013;<lpage>421</lpage>.</citation>
</ref>
<ref id="B23">
<citation citation-type="book"><person-group person-group-type="editor"><name><surname>Pal</surname> <given-names>R.</given-names></name></person-group> (ed.). (<year>2016</year>). <source>Innovative Research in Attention Modeling and Computer Vision Applications.</source> <publisher-loc>Hershey, PA</publisher-loc>: <publisher-name>IGI Publishing</publisher-name>. <pub-id pub-id-type="doi">10.4018/978-1-4666-8723-3</pub-id></citation>
</ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Redi</surname> <given-names>J.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>Zunino</surname> <given-names>R.</given-names></name> <name><surname>Heynderickx</surname> <given-names>I.</given-names></name></person-group> (<year>2011</year>). <article-title>Interactions of visual attention and quality perception</article-title>. <source>Int. Soc. Opt. Photonics.</source> <volume>7865</volume>, <fpage>78650S</fpage>&#x02013;<lpage>78650S-11</lpage>. <pub-id pub-id-type="doi">10.1117/12.876712</pub-id></citation>
</ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vu</surname> <given-names>C. T.</given-names></name> <name><surname>Larson</surname> <given-names>E. C.</given-names></name> <name><surname>Chandler</surname> <given-names>D. M.</given-names></name></person-group> (<year>2008</year>). <article-title>Visual fixation patterns when judging image quality: effects of distortion type, amount, and subject experience</article-title>, in <source>2008 IEEE Southwest Symposium on Image Analysis and Interpretation</source> (<publisher-loc>Santa Fe, NM</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>73</fpage>&#x02013;<lpage>76</lpage>. <pub-id pub-id-type="doi">10.1109/SSIAI.2008.4512288</pub-id></citation>
</ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Walther</surname> <given-names>D.</given-names></name> <name><surname>Koch</surname> <given-names>C.</given-names></name></person-group> (<year>2006</year>). <article-title>Modeling attention to salient proto-objects</article-title>. <source>Neural Netw.</source> <volume>19</volume>, <fpage>1395</fpage>&#x02013;<lpage>1407</lpage>. <pub-id pub-id-type="doi">10.1016/j.neunet.2006.10.001</pub-id><pub-id pub-id-type="pmid">17098563</pub-id></citation>
</ref>
<ref id="B27">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>J.</given-names></name> <name><surname>Sclaroff</surname> <given-names>S.</given-names></name></person-group> (<year>2013</year>). <article-title>Saliency detection: a boolean map approach</article-title>, in <source>2013 IEEE International Conference on Computer Vision</source> (<publisher-loc>Sydney, NSW</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>153</fpage>&#x02013;<lpage>160</lpage>. <pub-id pub-id-type="doi">10.1109/ICCV.2013.26</pub-id></citation>
</ref>
</ref-list>
</back>
</article>