<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Comput. Neurosci.</journal-id>
<journal-title>Frontiers in Computational Neuroscience</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Comput. Neurosci.</abbrev-journal-title>
<issn pub-type="epub">1662-5188</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fncom.2022.890447</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Neuroscience</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Spatio-Temporally Efficient Coding Assigns Functions to Hierarchical Structures of the Visual System</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Sihn</surname> <given-names>Duho</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/1457408/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Kim</surname> <given-names>Sung-Phil</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/201210/overview"/>
</contrib>
</contrib-group>
<aff><institution>Department of Biomedical Engineering, Ulsan National Institute of Science and Technology</institution>, <addr-line>Ulsan</addr-line>, <country>South Korea</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Paul Miller, Brandeis University, United States</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Shujaat Khan, Siemens Healthineers, United States; Muhammad Usman, Chosun University, South Korea; Seongyong Park, The Korea Advanced Institute of Science and Technology, South Korea, in collaboration with reviewer SK</p></fn>
<corresp id="c001">&#x002A;Correspondence: Sung-Phil Kim, <email>spkim@unist.ac.kr</email></corresp>
<fn fn-type="other" id="fn004"><p>This article was submitted to Frontiers in Computational Neuroscience, a section of the journal Frontiers in Computational Neuroscience</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>27</day>
<month>05</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>16</volume>
<elocation-id>890447</elocation-id>
<history>
<date date-type="received">
<day>06</day>
<month>03</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>09</day>
<month>05</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2022 Sihn and Kim.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Sihn and Kim</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Hierarchical structures constitute a wide array of brain areas, including the visual system. One of the important questions regarding visual hierarchical structures is to identify computational principles for assigning functions that represent the external world to hierarchical structures of the visual system. Given that visual hierarchical structures contain both bottom-up and top-down pathways, the derived principles should encompass these bidirectional pathways. However, existing principles such as predictive coding do not provide an effective principle for bidirectional pathways. Therefore, we propose a novel computational principle for visual hierarchical structures as spatio-temporally efficient coding underscored by the efficient use of given resources in both neural activity space and processing time. This coding principle optimises bidirectional information transmissions over hierarchical structures by simultaneously minimising temporal differences in neural responses and maximising entropy in neural representations. Simulations demonstrated that the proposed spatio-temporally efficient coding was able to assign the function of appropriate neural representations of natural visual scenes to visual hierarchical structures. Furthermore, spatio-temporally efficient coding was able to predict well-known phenomena, including deviations in neural responses to unlearned inputs and bias in preferred orientations. Our proposed spatio-temporally efficient coding may facilitate deeper mechanistic understanding of the computational processes of hierarchical brain structures.</p>
</abstract>
<kwd-group>
<kwd>efficient coding</kwd>
<kwd>hierarchical structure</kwd>
<kwd>neural response</kwd>
<kwd>neural representation</kwd>
<kwd>receptive field</kwd>
<kwd>selectivity</kwd>
<kwd>visual system</kwd>
</kwd-group>
<contract-num rid="cn001">NRF-2019M3E5D2A01058328</contract-num>
<contract-num rid="cn001">No. 2021M3E5D2A01019542</contract-num>
<contract-sponsor id="cn001">Ministry of Science and ICT, South Korea<named-content content-type="fundref-id">10.13039/501100014188</named-content></contract-sponsor>
<counts>
<fig-count count="9"/>
<table-count count="0"/>
<equation-count count="7"/>
<ref-count count="95"/>
<page-count count="20"/>
<word-count count="13099"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="intro">
<title>Introduction</title>
<p>It is well-established that a wide array of brain areas has a hierarchical structure, including the visual system (<xref ref-type="bibr" rid="B23">Felleman and Van Essen, 1991</xref>; <xref ref-type="bibr" rid="B57">Mesulam, 1998</xref>; <xref ref-type="bibr" rid="B33">Harris et al., 2019</xref>; <xref ref-type="bibr" rid="B35">Hilgetag and Goulas, 2020</xref>). Studies have identified a link between hierarchical structures and gene expression (<xref ref-type="bibr" rid="B12">Burt et al., 2018</xref>; <xref ref-type="bibr" rid="B32">Hansen et al., 2021</xref>), suggesting that hierarchical structures are genetically determined <italic>a priori</italic>. Given that one of the major functions of the brain is to represent the external world (<xref ref-type="bibr" rid="B20">deCharms and Zador, 2000</xref>; <xref ref-type="bibr" rid="B46">Kriegeskorte and Diedrichsen, 2019</xref>), an ensuing question arises: How do <italic>a priori</italic> hierarchical brain structures attain functions to represent the external world? This question can be addressed by identifying a fundamental neural coding principle that assigns representational functions to hierarchical structures.</p>
<p>The traditional view of the neural coding of visual hierarchical structures is bottom-up visual information processing, whereby simple features are processed in a lower visual hierarchy and more complex features created by integrating simple features are processed in a higher visual hierarchy (<xref ref-type="bibr" rid="B39">Hubel and Wiesel, 1962</xref>, <xref ref-type="bibr" rid="B40">1968</xref>; <xref ref-type="bibr" rid="B68">Riesenhuber and Poggio, 1999</xref>, <xref ref-type="bibr" rid="B69">2000</xref>; <xref ref-type="bibr" rid="B73">Serre et al., 2007</xref>; <xref ref-type="bibr" rid="B22">DiCarlo et al., 2012</xref>; <xref ref-type="bibr" rid="B91">Yamins et al., 2014</xref>). However, this view does not consider the role of top-down pathways that are abundant even in the early visual system, such as the lateral geniculate nucleus (<xref ref-type="bibr" rid="B59">Murphy and Sillito, 1987</xref>; <xref ref-type="bibr" rid="B88">Wang et al., 2006</xref>) and primary visual cortex (<xref ref-type="bibr" rid="B94">Zhang et al., 2014</xref>; <xref ref-type="bibr" rid="B58">Muckli et al., 2015</xref>; <xref ref-type="bibr" rid="B41">Huh et al., 2018</xref>).</p>
<p>The role of top-down visual processing is especially prominent in predictive coding (<xref ref-type="bibr" rid="B67">Rao and Ballard, 1999</xref>; <xref ref-type="bibr" rid="B79">Spratling, 2017</xref>). According to predictive coding, a higher hierarchy performs top-down predictions of neural responses in a lower hierarchy. Both inference and learning of predictive coding are based on the minimisation of bottom-up prediction errors. Predictive coding has been used to explain the neural responses corresponding to prediction errors (<xref ref-type="bibr" rid="B24">Friston, 2005</xref>) and extends from the explanations of perception to action (<xref ref-type="bibr" rid="B25">Friston, 2010</xref>; <xref ref-type="bibr" rid="B16">Clark, 2013</xref>). A recent study combined predictive coding with sparse coding (i.e., sparse deep predictive coding) and demonstrate that it could enhance perceptual explanatory power (<xref ref-type="bibr" rid="B9">Boutin et al., 2021</xref>).</p>
<p>Nevertheless, predictive coding has several theoretical shortcomings. Since inference in predictive coding aims to minimise prediction errors, the hierarchical structure would require an additional information processing subsystem to perform this inference. In addition, because bottom-up transmitted information contains only prediction errors, predictive coding requires the presence of error units (biological neurons) in the hierarchical structure to represent this prediction error, yet such error units remain as hypothetical entities and evidence for prediction error responses is limited in some conditions (<xref ref-type="bibr" rid="B78">Solomon et al., 2021</xref>).</p>
<p>In a hierarchical structure in which information is exchanged in both directions, if the information represented by the upper and lower hierarchies at the same time is different, it is difficult to obtain a stable neural response on time domain for an external input. The reason is that if the information represented by the upper and lower hierarchy are different, different information is exchanged, and thus the information represented next time may be also different. These inter-connected structures could also produce chaotic dynamics (<xref ref-type="bibr" rid="B71">Rubinov et al., 2009</xref>; <xref ref-type="bibr" rid="B83">Tomov et al., 2014</xref>). Nevertheless, neural responses in the early visual system can be decoded as external inputs. These decodable neural responses include both neuronal spikes (<xref ref-type="bibr" rid="B5">Berens et al., 2012</xref>; <xref ref-type="bibr" rid="B92">Zavitz et al., 2016</xref>) and blood-oxygen-level-dependent responses (<xref ref-type="bibr" rid="B44">Kamitani and Tong, 2005</xref>; <xref ref-type="bibr" rid="B10">Brouwer and Heeger, 2009</xref>). Therefore, it is important to find neural coding principles that enable decodable stable neural representations in hierarchical structures on time domain.</p>
<p>Existing efficient coding (<xref ref-type="bibr" rid="B2">Attneave, 1954</xref>; <xref ref-type="bibr" rid="B3">Barlow, 1961</xref>; <xref ref-type="bibr" rid="B48">Laughlin, 1981</xref>) does not require additional information processing subsystems or virtual error units, which are shortcomings of predictive coding. Unfortunately, however, existing efficient coding does not consider hierarchical structures on the time domain to deal with the above issue. Many other ideas have been proposed for representation learning (<xref ref-type="bibr" rid="B4">Bengio et al., 2013</xref>). Properties of the visual cortex have been successfully studied using sparse coding (<xref ref-type="bibr" rid="B62">Olshausen and Field, 1996</xref>). Sparse coding has been successfully implemented using artificial neural networks such as restricted Boltzmann machines for representation learning (<xref ref-type="bibr" rid="B30">Goodfellow et al., 2011</xref>). However, these sparse coding studies have the disadvantage that they do not take into account the passage of time, which is an important aspect in the operation of the real brain. The hierarchical structure of the brain, which ascends to the upper hierarchy from the input by the bottom-up pathway and then descends to the input by the top-down pathway again, resembles the structure of autoencoders (<xref ref-type="bibr" rid="B8">Bourlard and Kamp, 1988</xref>; <xref ref-type="bibr" rid="B36">Hinton and Zemel, 1993</xref>) of artificial neural networks. Many autoencoders have also been used for representation learning (<xref ref-type="bibr" rid="B4">Bengio et al., 2013</xref>). Typical examples include sparse autoencoders (<xref ref-type="bibr" rid="B66">Ranzato et al., 2006</xref>), denoising autoencoders (<xref ref-type="bibr" rid="B87">Vincent et al., 2008</xref>), and contractive autoencoders (<xref ref-type="bibr" rid="B70">Rifai et al., 2011</xref>). However, these examples do not take into account the passage of time.</p>
<p>We, therefore, propose a novel computational principle for hierarchical structures as spatio-temporally efficient coding underscored by the efficient use of given resources in both neural activity space and processing time (<xref ref-type="fig" rid="F1">Figure 1</xref>). Spatio-temporally efficient coding minimises temporal differences of neural responses (temporally efficient coding), and maximises activity space of neural responses for different external stimuli (spatially efficient coding). We call the combination of temporally efficient coding and spatially efficient coding as spatiotemporal efficient coding. By spatio-temporally efficient coding, neural responses change smoothly but dynamically. Those dynamical changes of neural responses for the changing external world differs from the slow feature analysis (<xref ref-type="bibr" rid="B90">Wiskott and Sejnowski, 2002</xref>; <xref ref-type="bibr" rid="B6">Berkes and Wiskott, 2005</xref>; <xref ref-type="bibr" rid="B17">Creutzig and Sprekeler, 2008</xref>) which also minimises temporal differences of neural responses. Similar to slow feature analysis, there have been studies on the properties of cells in the visual cortex using temporal coherence to obtain slow representations (<xref ref-type="bibr" rid="B42">Hurri and Hyv&#x00E4;rinen, 2002</xref>; <xref ref-type="bibr" rid="B95">Zou et al., 2011</xref>). However, such studies using temporal coherence lack the aspect of dynamically reacting to changes in external input or neural responses of other hierarchies.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>Spatio-temporally efficient coding. <bold>(A)</bold> This illustration depicts a hierarchical structure of the brain. Open black circles indicate an ensemble of neuronal units in each hierarchy of the brain. Open black square indicates visual input. Back arrows indicate information transmissions of bottom-up (upward arrow), recurrent (loop arrow), and top-down (downward arrow). <bold>(B)</bold> This illustration depicts a hierarchical structure of the brain and learning objectives. The hierarchical structure learns to represent input from the external world, which is depicted as black squares (e.g., visual input). Open black circles indicate an ensemble of neuronal units in each hierarchy of the brain. Inference based on spatio-temporally efficient coding is made by neuronal units as bottom-up, recurrent, and top-down information transmissions over time (black arrows). Learning in spatio-temporally efficient coding consists of two objectives: minimising the temporal differences between present and future neural responses and maximising the informational entropy of neural responses. For example, information transmissions (purple arrows) are optimised to minimise the temporal differences between present neural responses at the corresponding hierarchy (red filled circle) and future neural responses (circle to the right of the red filed circle) while concurrently maximising the informational entropy of neural responses at the corresponding hierarchy (red filled circle).</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fncom-16-890447-g001.tif"/>
</fig>
<p>Temporally efficient coding decreases the conditional entropy of neural response given stimulus, <italic>H</italic>(<italic>X</italic>|<italic>S</italic>) where <italic>X</italic> indicates neural response and <italic>S</italic> indicates stimulus. Spatially efficient coding increases <italic>H</italic>(<italic>X</italic>). Spatio-temporally efficient coding, thus, increases the Shannon mutual information <italic>I</italic>(<italic>X</italic>; <italic>S</italic>) = <italic>H</italic>(<italic>X</italic>)&#x2212;<italic>H</italic>(<italic>X</italic>|<italic>S</italic>) simultaneously in both terms: <italic>H</italic>(<italic>X</italic>) and &#x2212;<italic>H</italic>(<italic>X</italic>|<italic>S</italic>). This is also the definition of another existing efficient coding (<xref ref-type="bibr" rid="B25">Friston, 2010</xref>). In this manner, spatio-temporally efficient coding in hierarchical structures can also be seen as an extension of existing efficient coding into hierarchical structures on time domain.</p>
<p>Spatio-temporally efficient coding enables rapid stabilisation of neural responses, smooth neural representations, and decodable stable neural responses. Through simulations, these properties (smooth temporal trajectory of neural responses, smooth neural representations, decodable stable neural responses; <xref ref-type="fig" rid="F2">Figures 2</xref>&#x2013;<xref ref-type="fig" rid="F4">4</xref>) and predictable phenomena (deviant neural response for unlearned inputs, orientation preference bias; <xref ref-type="fig" rid="F5">Figures 5</xref>&#x2013;<xref ref-type="fig" rid="F7">7</xref>) were confirmed.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>A simulation of spatio-temporally efficient coding. <bold>(A)</bold> Architecture of the simulation model. An input image is divided into four subspaces with overlaps, denoted by different colours. A subset of neuronal units in lower hierarchy receive corresponding parts of the image (matching colours). There is no spatial correspondence between lower and upper hierarchies. Pink, magenta, and purple arrows indicate bottom-up, recurrent, and top-down information transmissions,respectively. <bold>(B)</bold> Learning curves of STEC. Left panel: mean temporal difference. Right panel: mean negative entropy. As the negative entropy of the lower and upper hierarchies decreases, it can be observed that the temporal difference increases and then stabilises or decreases again. Vertical axis has the logarithm scale. <bold>(C)</bold> The magnitude of synaptic weights on lower hierarchy units is compared for different conditions, where STEC, the balanced condition between temporally and spatially efficient coding objectives; TEC, temporally efficient coding alone; SEC, spatially efficient coding alone which is existing efficient coding; Sparse, sparse coding. Each cross indicates the L2 norm of synaptic weights for each unit. Black squares indicate the mean. <bold>(D)</bold> Left: Four different original input images and their reconstructions from the neuronal responses of lower hierarchy. Right: Representative neural responses for different conditions. The neural response is the output of the sigmoid function and is therefore normalised to a range between 0 and 1.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fncom-16-890447-g002.tif"/>
</fig>
</sec>
<sec id="S2" sec-type="materials|methods">
<title>Materials and Methods</title>
<sec id="S2.SS1">
<title>Spatio-Temporally Efficient Coding</title>
<p>A possible approach to overcome the shortcomings of both bottom-up processing and predictive coding in visual hierarchical structures is to make bottom-up information transmissions similarly to top-down information transmissions across hierarchies, instead of transmitting bottom-up prediction errors. For example, context-independent bottom-up predictions and context-dependent top-down predictions (<xref ref-type="bibr" rid="B82">Teufel and Fletcher, 2020</xref>). Such bidirectional information transmissions eliminate the necessity for hypothetical error units, while presumably elucidating the neural responses of hierarchical structures underlying bottom-up feature integration and top-down predictive coding. A neural coding principle underlying bidirectional information transmissions of hierarchical structures can be found in the theory of efficient coding that draws upon the efficient use of given resources (<xref ref-type="bibr" rid="B49">Laughlin, 2001</xref>; <xref ref-type="bibr" rid="B11">Bullmore and Sporns, 2012</xref>), which crucially include limited time resources related to processing speed (<xref ref-type="bibr" rid="B31">Griffiths et al., 2015</xref>; <xref ref-type="bibr" rid="B53">Lieder and Griffiths, 2020</xref>). A possible solution to promote the most efficient use of limited time resources by the bidirectional information transmission system is to render present neural responses similar to future ones before the occurrence of future neural responses. This can be achieved by minimising the temporal differences between present and future neural responses. Accordingly, we consider this temporal difference minimisation as our <italic>learning principle</italic>, referred to as <italic>temporally efficient coding</italic>. Here, <italic>inference</italic> simply refers to a bidirectional information transmission mediated by top-down and bottom-up pathways. Unlike inference in predictive coding, which requires further error minimisation, inference in temporally efficient coding involves simple single-step information transmission.</p>
<p>Temporally efficient coding involves a trivial solution: neural responses do not change to changes in external events. This trivial solution is comparable to the dark-room problem of predictive coding or free-energy principle, where an agent stays and is unchanged in a dark room with no surprise or unpredicted parts (<xref ref-type="bibr" rid="B26">Friston et al., 2012</xref>; <xref ref-type="bibr" rid="B16">Clark, 2013</xref>). We circumvent this issue by adding a complementary neural coding (learning) principle that maximises the informational entropy of neural responses to alter neural responses to changing external events. It maximises the neural response space available to represent the external world under the constraints of both the number of neurons and maximum firing rates. Maximal entropy coding indicates that the system uses spatial resources of neural responses efficiently (<xref ref-type="bibr" rid="B2">Attneave, 1954</xref>; <xref ref-type="bibr" rid="B3">Barlow, 1961</xref>; <xref ref-type="bibr" rid="B48">Laughlin, 1981</xref>), referred to as <italic>spatially efficient coding</italic>. By combining spatially efficient coding and temporally efficient coding, we propose a neural coding principle termed <italic>spatio-temporally efficient coding</italic> (<xref ref-type="fig" rid="F1">Figure 1</xref>).</p>
<p>Spatially efficient coding has the same objective as existing efficient coding (<xref ref-type="bibr" rid="B3">Barlow, 1961</xref>; <xref ref-type="bibr" rid="B48">Laughlin, 1981</xref>), which minimises informational redundancy because it increases the difference between neural responses. Temporally efficient coding, on the other hand, can be regarded to increase informational redundancy, as it reduces the difference between neural responses. Two seemingly opposing coding objectives can be reconciled by isolating mechanisms that decrease the differences between consecutive neural responses on time domain and those that increase the differences between neural responses to the apparently different external world. In the real brain, it can be explained that different mechanisms are applied depending on the degree of difference in neural response. In the implementation of the present study, two coding objectives were applied in different ways. As an implementation of temporally efficient coding, we minimise the difference between consecutive neural responses on the time. In spatially efficient coding, the neural responses to the apparently different external stimuli are implemented in a minibatch method that simultaneously learns different images. We maximised the difference between neural responses to different images within each time step (see section &#x201C;Implementation of Spatio-Temporally Efficient Coding&#x201D;).</p>
<p>Temporally efficient coding trains the present neural response to be similar to the future neural response in order to efficiently use a given time resource. The temporal trajectory of the neural response is smoothed as the difference between the present and future neural responses is minimised. It thereby minimises the size of the space represented, when a single stimulus (stimulus in the external world) is represented on the time domain. In other words, it reduces neuronal noise (also see, <xref ref-type="fig" rid="F2">Figure 2D</xref>) which is defined as the uncertainty of a neural response for given stimulus (<xref ref-type="bibr" rid="B7">Borst and Theunissen, 1999</xref>). This is to decrease the conditional entropy of neural response given stimulus, <italic>H</italic>(<italic>X</italic>|<italic>S</italic>) where <italic>X</italic> indicates neural response and <italic>S</italic> indicates stimulus. Spatially efficient coding increases <italic>H</italic>(<italic>X</italic>). Spatio-temporally efficient coding, thus, increases the Shannon mutual information <italic>I</italic>(<italic>X</italic>; <italic>S</italic>) = <italic>H</italic>(<italic>X</italic>)&#x2212;<italic>H</italic>(<italic>X</italic>|<italic>S</italic>) simultaneously in both terms: <italic>H</italic>(<italic>X</italic>) and &#x2212;<italic>H</italic>(<italic>X</italic>|<italic>S</italic>). This is also the definition of another existing efficient coding (<xref ref-type="bibr" rid="B25">Friston, 2010</xref>). Spatio-temporally efficient coding in hierarchical structures, therefore, can also be seen as an extension of existing efficient coding into hierarchical structures on time domain.</p>
<p>Neural system homeostasis is associated with maximisation of mutual information between neural responses and external stimuli (<xref ref-type="bibr" rid="B84">Toyoizumi et al., 2005</xref>; <xref ref-type="bibr" rid="B81">Sullivan and de Sa, 2006</xref>). Since spatio-temporally efficient coding increases the Shannon mutual information between neural responses and external stimuli, it is related to homeostasis. In particular, temporal difference minimisation of neural responses in temporally efficient coding is reminiscent of homeostasis of energy metabolism. Smoothing the temporal trajectory of a neural responses reduces the variance of the neural response distribution so that the neural response stays within a certain range. This is also a consequence of the homeostatic plasticity (<xref ref-type="bibr" rid="B85">Turrigiano and Nelson, 2004</xref>) of the brain (also see, <xref ref-type="fig" rid="F8">Figure 8</xref>).</p>
<p>As mentioned earlier, smoothing the temporal trajectories of neural responses reduces the spatial extent of neural responses to static stimuli. This leads to a rapid stabilisation of the neural response to the static stimulus (also see, <xref ref-type="fig" rid="F2">Figures 2D</xref>, <xref ref-type="fig" rid="F4">4</xref>); decodable stable neural representations which is the main subject of the present study. The rapid stabilisation of neural responses shows the characteristic of temporally efficient coding, which makes efficient use of given time resources. Another effect of this smoothing the temporal trajectory of neural response is to render smooth neural representations that locally preserves the structure of the external world. If a stimulus is static or changes smoothly, making the temporal trajectory of the neural response smooth is to render a similar neural response to the similar stimuli. This is smooth neural representations that locally preserves the structure of the external world (also see, <xref ref-type="fig" rid="F3">Figure 3</xref>).</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>Smooth neural representations of visual images by spatio-temporally efficient coding. <bold>(A)</bold> Examples of relative Euclidean distances from the reference image (Ref. left) to other images in the natural scene image space are depicted on the horizontal axis. Relative Euclidean distances from the neural representations of the reference image at hierarchy 1 to those of corresponding other images are depicted on the vertical axis. <bold>(B)</bold> Local similarity based on correlations of distances between overall natural scene images (<italic>N</italic> = 42 out of 4212: 1%, <italic>N</italic> = 84 out of 4212: 2%; <italic>N</italic> = 168 out of 4212: 4%) (global feature based distance) with those of corresponding neural responses (Euclidean distance). Error bars indicate standard deviations. <bold>(C)</bold> Discriminability of neural responses for images (<italic>N</italic> = 4170 out of 4212; 99%). Three horizontal lines in each box indicates 25, 50, and 75 % levels of data, respectively.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fncom-16-890447-g003.tif"/>
</fig>
</sec>
<sec id="S2.SS2">
<title>Implementation of Spatio-Temporally Efficient Coding</title>
<p>In the present study, visual information processing in hierarchical structures was established as biologically inspired temporal processing. Specifically, visual information processing is described as a function <italic>f</italic><sub>t</sub> for both image <italic>X</italic><sub><italic>image</italic></sub> and neural responses <italic>X</italic><sub>h</sub> in each hierarchy <italic>h</italic> such that it maps from <italic>X</italic><sub><italic>image</italic></sub> and <italic>X</italic><sub>h</sub> at time <italic>t-1</italic> to those at time <italic>t</italic>:</p>
<disp-formula id="S2.E1"><label>(1)</label><mml:math id="M1"><mml:mtable columnspacing="5pt" displaystyle="true" rowspacing="0pt"><mml:mtr><mml:mtd columnalign="center"><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mtext>t</mml:mtext></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mrow><mml:mrow><mml:mpadded width="+5.6pt"><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>m</mml:mi><mml:mi>a</mml:mi><mml:mi>g</mml:mi><mml:mi>e</mml:mi></mml:mrow><mml:mo rspace="5.3pt">,</mml:mo><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub></mml:mpadded><mml:mo>&#x00D7;</mml:mo><mml:mpadded width="+5.6pt"><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mpadded width="+5.6pt"><mml:mi mathvariant="normal">h</mml:mi></mml:mpadded><mml:mo>=</mml:mo><mml:mrow><mml:mn>1</mml:mn><mml:mo rspace="5.3pt">,</mml:mo><mml:mrow><mml:mi mathvariant="normal">t</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:mrow></mml:msub></mml:mpadded><mml:mo>&#x00D7;</mml:mo><mml:mpadded width="+5.6pt"><mml:mi mathvariant="normal">&#x2026;</mml:mi></mml:mpadded><mml:mo>&#x00D7;</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mpadded width="+5.6pt"><mml:mi mathvariant="normal">h</mml:mi></mml:mpadded><mml:mo>=</mml:mo><mml:mrow><mml:mi mathvariant="normal">H</mml:mi><mml:mo rspace="5.3pt">,</mml:mo><mml:mrow><mml:mi mathvariant="normal">t</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:mrow></mml:msub></mml:mrow><mml:mo>&#x2192;</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>m</mml:mi><mml:mi>a</mml:mi><mml:mi>g</mml:mi><mml:mi>e</mml:mi></mml:mrow><mml:mo rspace="5.3pt">,</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mrow></mml:mtd><mml:mtd/></mml:mtr><mml:mtr><mml:mtd columnalign="center"><mml:mrow><mml:mi/><mml:mo>&#x00D7;</mml:mo><mml:mrow><mml:mpadded width="+5.6pt"><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mpadded width="+5.6pt"><mml:mi mathvariant="normal">h</mml:mi></mml:mpadded><mml:mo>=</mml:mo><mml:mrow><mml:mn>1</mml:mn><mml:mo rspace="5.3pt">,</mml:mo><mml:mi mathvariant="normal">t</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mpadded><mml:mo>&#x00D7;</mml:mo><mml:mpadded width="+5.6pt"><mml:mi mathvariant="normal">&#x2026;</mml:mi></mml:mpadded><mml:mo>&#x00D7;</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mpadded width="+5.6pt"><mml:mi mathvariant="normal">h</mml:mi></mml:mpadded><mml:mo>=</mml:mo><mml:mrow><mml:mi mathvariant="normal">H</mml:mi><mml:mo rspace="5.3pt">,</mml:mo><mml:mi mathvariant="normal">t</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:mrow></mml:mtd><mml:mtd/></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Where <italic>X</italic><sub>h,t&#x2013;1</sub> and <italic>X</italic><sub>h,t</sub> are the neural responses <italic>X</italic><sub>h</sub> at time <italic>t</italic>-1 and <italic>t</italic>, respectively, at hierarchy <italic>h</italic>. For convenience, <italic>X</italic><sub>h =0</sub> denotes <italic>X</italic><sub><italic>image</italic></sub>, in particular <italic>X</italic><sub>h =0, t</sub> denotes the image presented at time <italic>t</italic>. The details of the visual information processing <italic>f</italic><sub>t</sub> are as follows: If <italic>h</italic> &#x003E; 0, then,</p>
<disp-formula id="S2.E2"><label>(2)</label><mml:math id="M2"><mml:mtable columnspacing="5pt" displaystyle="true" rowspacing="0pt"><mml:mtr><mml:mtd columnalign="center"><mml:mrow><mml:mpadded width="+5.6pt"><mml:msub><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mtext>t</mml:mtext></mml:mrow></mml:msub><mml:mo fence="true">|</mml:mo></mml:mrow><mml:msub><mml:mi mathvariant="normal">X</mml:mi><mml:mrow><mml:mi mathvariant="normal">h</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">t</mml:mi></mml:mrow></mml:msub></mml:msub></mml:mpadded><mml:mo>=</mml:mo><mml:mi/></mml:mrow></mml:mtd><mml:mtd/></mml:mtr><mml:mtr><mml:mtd columnalign="center"><mml:mrow><mml:mi mathvariant="normal">&#x03C3;</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mrow><mml:msubsup><mml:mi>W</mml:mi><mml:mrow><mml:mrow><mml:mi mathvariant="normal">h</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo>,</mml:mo><mml:mi mathvariant="normal">h</mml:mi></mml:mrow><mml:mi>T</mml:mi></mml:msubsup><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mrow><mml:mi mathvariant="normal">h</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:mi mathvariant="normal">t</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub></mml:mrow><mml:mo>+</mml:mo><mml:mrow><mml:msubsup><mml:mi>W</mml:mi><mml:mrow><mml:mi mathvariant="normal">h</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">h</mml:mi></mml:mrow><mml:mi>T</mml:mi></mml:msubsup><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi mathvariant="normal">h</mml:mi><mml:mo>,</mml:mo><mml:mrow><mml:mi mathvariant="normal">t</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub></mml:mrow><mml:mo>+</mml:mo><mml:mrow><mml:msubsup><mml:mi>W</mml:mi><mml:mrow><mml:mrow><mml:mi mathvariant="normal">h</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo>,</mml:mo><mml:mi mathvariant="normal">h</mml:mi></mml:mrow><mml:mi>T</mml:mi></mml:msubsup><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mrow><mml:mi mathvariant="normal">h</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:mi mathvariant="normal">t</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub></mml:mrow><mml:mo>+</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mrow><mml:mtext>h</mml:mtext></mml:mrow></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mtd><mml:mtd/></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Where <italic>X</italic><sub>h,t</sub> is an <italic>X</italic><sub>h</sub> value vector at time <italic>t</italic>, <italic>f</italic><sub>t</sub>(&#x22C5;)|<sub>X<sub>h,t</sub></sub> indicates restricting the range of the function value <italic>f</italic><sub>t</sub>(&#x22C5;) to <italic>X</italic><sub>h,t</sub>, <italic>W</italic><sub>h+1,h</sub> is a synaptic weight matrix from hierarchy <italic>h+1</italic> to <italic>h</italic>, <italic>T</italic> is the transpose of a matrix, <italic>b</italic><sub>h</sub> is a bias vector at hierarchy <italic>h</italic>, and &#x03C3;(&#x22C5;) is a sigmoid function. The terms of <inline-formula><mml:math id="INEQ15"><mml:mrow><mml:msubsup><mml:mi>W</mml:mi><mml:mrow><mml:mrow><mml:mi mathvariant="normal">h</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo>,</mml:mo><mml:mi mathvariant="normal">h</mml:mi></mml:mrow><mml:mi>T</mml:mi></mml:msubsup><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mrow><mml:mi mathvariant="normal">h</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:mi mathvariant="normal">t</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="INEQ16"><mml:mrow><mml:msubsup><mml:mi>W</mml:mi><mml:mrow><mml:mi mathvariant="normal">h</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">h</mml:mi></mml:mrow><mml:mi>T</mml:mi></mml:msubsup><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi mathvariant="normal">h</mml:mi><mml:mo>,</mml:mo><mml:mrow><mml:mi mathvariant="normal">t</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>, and <inline-formula><mml:math id="INEQ17"><mml:mrow><mml:msubsup><mml:mi>W</mml:mi><mml:mrow><mml:mrow><mml:mi mathvariant="normal">h</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo>,</mml:mo><mml:mi mathvariant="normal">h</mml:mi></mml:mrow><mml:mi>T</mml:mi></mml:msubsup><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mrow><mml:mi mathvariant="normal">h</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:mi mathvariant="normal">t</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> indicate top-down, recurrent, and bottom-up information transmissions, respectively. In case of <italic>h</italic> + 1 &#x003E; <italic>H</italic> the <inline-formula><mml:math id="INEQ19"><mml:mrow><mml:msubsup><mml:mi>W</mml:mi><mml:mrow><mml:mrow><mml:mi mathvariant="normal">h</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo>,</mml:mo><mml:mi mathvariant="normal">h</mml:mi></mml:mrow><mml:mi>T</mml:mi></mml:msubsup><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mrow><mml:mi mathvariant="normal">h</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:mi mathvariant="normal">t</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> term would be omitted. If <italic>h</italic> = 0, then,</p>
<disp-formula id="S2.E3"><label>(3)</label><mml:math id="M3"><mml:mtable columnspacing="5pt" displaystyle="true"><mml:mtr><mml:mtd columnalign="center"><mml:mrow><mml:mpadded width="+5.6pt"><mml:msub><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mtext>t</mml:mtext></mml:mrow></mml:msub><mml:mo fence="true">|</mml:mo></mml:mrow><mml:msub><mml:mi mathvariant="normal">X</mml:mi><mml:mrow><mml:mi mathvariant="normal">h</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">t</mml:mi></mml:mrow></mml:msub></mml:msub></mml:mpadded><mml:mo>=</mml:mo><mml:mrow><mml:mi mathvariant="normal">&#x03C3;</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mrow><mml:msubsup><mml:mi>W</mml:mi><mml:mrow><mml:mrow><mml:mi mathvariant="normal">h</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo>,</mml:mo><mml:mi mathvariant="normal">h</mml:mi></mml:mrow><mml:mi>T</mml:mi></mml:msubsup><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mrow><mml:mi mathvariant="normal">h</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:mi mathvariant="normal">t</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub></mml:mrow><mml:mo>+</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mrow><mml:mtext>h</mml:mtext></mml:mrow></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mrow></mml:mtd><mml:mtd/></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>If <italic>h</italic> &#x003E; 0, <italic>X</italic><sub>h,t</sub> = <italic>f</italic><sub>t</sub>|<sub>X<sub>h,t</sub></sub>. On the other hand, when <italic>h</italic> = 0, <italic>X</italic><sub>h,t</sub> is the image presented at time <italic>t</italic>, not <italic>f</italic><sub>t</sub>|<sub>X<sub>h,t</sub></sub>. This function <italic>f</italic><sub>t</sub> makes <italic>inferences</italic> using spatio-temporally efficient coding. Unlike inference in predictive coding (<xref ref-type="bibr" rid="B67">Rao and Ballard, 1999</xref>; <xref ref-type="bibr" rid="B79">Spratling, 2017</xref>) that requires additional processes such as minimisation of prediction errors, inference in spatio-temporally efficient coding is a function value <italic>f</italic><sub>t</sub> itself.</p>
<p><italic>Learning</italic> in spatio-temporally efficient coding minimises the ensuing objectives of both temporally and spatially efficient coding. The objective of temporally efficient coding is given by:</p>
<disp-formula id="S2.E4"><label>(4)</label><mml:math id="M4"><mml:mtable columnspacing="5pt" displaystyle="true" rowspacing="0pt"><mml:mtr><mml:mtd columnalign="center"><mml:mrow><mml:mpadded width="+5.6pt"><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mi>T</mml:mi><mml:mi>e</mml:mi><mml:mi>m</mml:mi><mml:mi>p</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi></mml:mrow></mml:msub></mml:mpadded><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:munderover><mml:mo movablelimits="false">&#x2211;</mml:mo><mml:mrow><mml:mpadded width="+5.6pt"><mml:mi>n</mml:mi></mml:mpadded><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:munderover></mml:mstyle></mml:mrow></mml:mtd><mml:mtd/></mml:mtr><mml:mtr><mml:mtd columnalign="center"><mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mrow><mml:mrow><mml:mpadded width="+3.3pt"><mml:msub><mml:mi>f</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mpadded><mml:mo rspace="5.8pt">&#x25E6;</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mrow><mml:mpadded width="+5.6pt"><mml:mi>h</mml:mi></mml:mpadded><mml:mo>=</mml:mo><mml:mrow><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mover accent="true"><mml:mi>t</mml:mi><mml:mo stretchy="false">~</mml:mo></mml:mover></mml:mrow></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>m</mml:mi><mml:mi>a</mml:mi><mml:mi>g</mml:mi><mml:mpadded width="+5.6pt"><mml:mi>e</mml:mi></mml:mpadded></mml:mrow><mml:mo>=</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mo>-</mml:mo><mml:mrow><mml:mrow><mml:mpadded width="+3.3pt"><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mpadded><mml:mo rspace="5.8pt">&#x25E6;</mml:mo><mml:mpadded width="+3.3pt"><mml:msub><mml:mi>f</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mpadded><mml:mo rspace="5.8pt">&#x25E6;</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mrow><mml:mpadded width="+5.6pt"><mml:mi>h</mml:mi></mml:mpadded><mml:mo>=</mml:mo><mml:mrow><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mover accent="true"><mml:mi>t</mml:mi><mml:mo stretchy="false">~</mml:mo></mml:mover></mml:mrow></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>m</mml:mi><mml:mi>a</mml:mi><mml:mi>g</mml:mi><mml:mpadded width="+5.6pt"><mml:mi>e</mml:mi></mml:mpadded></mml:mrow><mml:mo>=</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mn>2</mml:mn></mml:msup></mml:mtd><mml:mtd/></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Where &#x25E6; is the function composition, <italic>x</italic><sub><italic>h</italic> =0,<italic>t&#x0342;</italic>, image =<italic>n</italic></sub> indicates that the image is fixed to the <italic>n</italic>th sample throughout the temporal processing (i.e., <italic>X</italic><sub>h = 0,t&#x2212;2</sub> = <italic>X</italic><sub>h = 0,t&#x2212;1</sub> = <italic>X</italic><sub>h = 0,t</sub> is <italic>n</italic>th image sample), squaring is operated component-wise. In case of <italic>h</italic> = 0, <inline-formula><mml:math id="INEQ29"><mml:mrow><mml:mpadded width="+3.3pt"><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mi mathvariant="normal">t</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mpadded><mml:mo rspace="5.8pt">&#x25E6;</mml:mo><mml:mpadded width="+3.3pt"><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mtext>t</mml:mtext></mml:mrow></mml:msub></mml:mpadded><mml:mo rspace="5.8pt">&#x25E6;</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mi mathvariant="normal">t</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mrow><mml:mpadded width="+5.6pt"><mml:mi>h</mml:mi></mml:mpadded><mml:mo>=</mml:mo><mml:mrow><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mover accent="true"><mml:mi>t</mml:mi><mml:mo stretchy="false">~</mml:mo></mml:mover></mml:mrow></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:mpadded width="+2.8pt"><mml:mrow><mml:mtext>image</mml:mtext></mml:mrow></mml:mpadded><mml:mo>=</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:msub><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="normal">X</mml:mi><mml:mrow><mml:mpadded width="+5.6pt"><mml:mi mathvariant="normal">h</mml:mi></mml:mpadded><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:mrow></mml:msub></mml:msub><mml:mo rspace="8.1pt">:</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mpadded width="+5.6pt"><mml:mi mathvariant="normal">h</mml:mi></mml:mpadded><mml:mo>=</mml:mo><mml:mrow><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mrow><mml:mi mathvariant="normal">t</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> which is the image presented at time <italic>t+1</italic> while <inline-formula><mml:math id="INEQ30"><mml:msub><mml:mrow><mml:mrow><mml:mrow><mml:mpadded width="+3.3pt"><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mtext>t</mml:mtext></mml:mrow></mml:msub></mml:mpadded><mml:mo rspace="5.8pt">&#x25E6;</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mi mathvariant="normal">t</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mrow><mml:mpadded width="+5.6pt"><mml:mi>h</mml:mi></mml:mpadded><mml:mo>=</mml:mo><mml:mrow><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mover accent="true"><mml:mi>t</mml:mi><mml:mo stretchy="false">~</mml:mo></mml:mover></mml:mrow></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:mpadded width="+2.8pt"><mml:mrow><mml:mtext>image</mml:mtext></mml:mrow></mml:mpadded><mml:mo>=</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mo fence="true">|</mml:mo></mml:mrow><mml:msub><mml:mi mathvariant="normal">X</mml:mi><mml:mrow><mml:mpadded width="+5.6pt"><mml:mi mathvariant="normal">h</mml:mi></mml:mpadded><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:mrow></mml:msub></mml:msub></mml:math></inline-formula> is the function value as inference. The function composition <italic>f</italic><sub>t</sub>&#x25E6;<italic>f</italic><sub>t&#x2212;1</sub> of two functions (inferences) in the <italic>L</italic><sub>Temporal</sub> allows the simultaneous learning of all information transmissions across all hierarchies because the depth of hierarchy is <italic>H</italic> = 2 in the present study (bottom-up, recurrent, and top-down information transmission across the hierarchy of depth <italic>H</italic> = 2). The inimisation of the given objective minimises the temporal differences between present and future neural responses. The objective of temporally efficient coding is to render present neural responses similar to future neural responses that has not yet arrived. By doing so, it uses the given time resources efficiently. This minimisation of temporal difference creates a learning effect in which the temporal trajectory of the neural response becomes smooth. The expected effect of this learning effect is to quickly stabilise neural responses when a static external stimulus is given (also see, <xref ref-type="fig" rid="F2">Figures 2D</xref>, <xref ref-type="fig" rid="F3">3</xref>). In that aspect, it is also to use the given time resources efficiently.</p>
<p>The objective of spatially efficient coding is given by,</p>
<disp-formula id="S2.E5"><label>(5)</label><mml:math id="M5"><mml:mtable columnspacing="5pt" displaystyle="true"><mml:mtr><mml:mtd columnalign="center"><mml:mrow><mml:mpadded width="+5.6pt"><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mtext>Spatial</mml:mtext></mml:mrow></mml:msub></mml:mpadded><mml:mo>=</mml:mo><mml:mrow><mml:mstyle displaystyle="false"><mml:munderover><mml:mo movablelimits="false">&#x2211;</mml:mo><mml:mrow><mml:mpadded width="+5.6pt"><mml:mi>n</mml:mi></mml:mpadded><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:munderover></mml:mstyle><mml:mrow><mml:mrow><mml:mi>log</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mrow><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mrow><mml:mpadded width="+5.6pt"><mml:mi>h</mml:mi></mml:mpadded><mml:mo>=</mml:mo><mml:mrow><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mover accent="true"><mml:mi>t</mml:mi><mml:mo stretchy="false">~</mml:mo></mml:mover></mml:mrow></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>m</mml:mi><mml:mi>a</mml:mi><mml:mi>g</mml:mi><mml:mpadded width="+5.6pt"><mml:mi>e</mml:mi></mml:mpadded></mml:mrow><mml:mo>=</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mo fence="true">|</mml:mo></mml:mrow><mml:msub><mml:mi mathvariant="normal">X</mml:mi><mml:mi mathvariant="normal">h</mml:mi></mml:msub></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mrow></mml:mrow></mml:mtd><mml:mtd/></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Where <italic>h</italic> &#x003E; 0 in <italic>X</italic><sub>h</sub> (so, <inline-formula><mml:math id="INEQ35"><mml:mrow><mml:mpadded width="+5.6pt"><mml:msub><mml:mrow><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mtext>t</mml:mtext></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mrow><mml:mpadded width="+5.6pt"><mml:mi>h</mml:mi></mml:mpadded><mml:mo>=</mml:mo><mml:mrow><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mover accent="true"><mml:mi>t</mml:mi><mml:mo stretchy="false">~</mml:mo></mml:mover></mml:mrow></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:mpadded width="+2.8pt"><mml:mrow><mml:mtext>image</mml:mtext></mml:mrow></mml:mpadded><mml:mo>=</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mo fence="true">|</mml:mo></mml:mrow><mml:msub><mml:mi mathvariant="normal">X</mml:mi><mml:mrow><mml:mtext>h</mml:mtext></mml:mrow></mml:msub></mml:msub></mml:mpadded><mml:mo>=</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi mathvariant="normal">h</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">t</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>) and <italic>P</italic>(&#x22C5;) is a probability. Because the term <italic>L</italic><sub>Spatial</sub> is an estimation of negative informational entropy, minimising the objective maximises the informational entropy of <italic>f</italic><sub>t</sub>. This objective increases the entropy of each neuron, and it can be seen that an increase in marginal entropy in each neuron will increase the joint entropy of the entire system. Therefore, spatially efficient coding maximises the entropy of individual neurons as in Laughlin&#x2019;s study (<xref ref-type="bibr" rid="B48">Laughlin, 1981</xref>) and consequently the entropy of the entire system. The objective of spatially efficient coding is to render different neural responses to different inputs from the external world. We, therefore, can overcome the trivial solution of temporally efficient coding, where there is no change in neural response despite changes in the external world. This thereby does not conflict with the learning of temporally efficient coding, which decreases differences in consecutive neural responses in the time domain to inputs of one stream.</p>
<p>To minimise <italic>L</italic><sub>Spatial</sub>, which is the objective of spatially efficient coding, it is necessary to calculate the probability <italic>P</italic>(&#x22C5;) in <italic>L</italic><sub>Spatial</sub> (eq. 5). Instead of calculating the exact probabilities, we obtained pre-normalised densities in the sense of probabilities without a partition function. As the value of the partition function is fixed, it does not affect the minimisation process. Note that, when <italic>h</italic> &#x003E; 0, <italic>X</italic><sub>h,t</sub> is the space of all possible neural responses, i.e., <italic>X</italic><sub>h,t</sub> = [0, 1]<sup>dim<italic>X</italic><sub>h,t</sub></sup>. Let <italic>x</italic> &#x2208; <italic>X</italic><sub>h,t</sub> be a value of <inline-formula><mml:math id="INEQ41"><mml:msub><mml:mrow><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mtext>t</mml:mtext></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mrow><mml:mpadded width="+5.6pt"><mml:mi>h</mml:mi></mml:mpadded><mml:mo>=</mml:mo><mml:mrow><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mover accent="true"><mml:mi>t</mml:mi><mml:mo stretchy="false">~</mml:mo></mml:mover></mml:mrow></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:mpadded width="+2.8pt"><mml:mrow><mml:mtext>image</mml:mtext></mml:mrow></mml:mpadded><mml:mo>=</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mo fence="true">|</mml:mo></mml:mrow><mml:msub><mml:mi mathvariant="normal">X</mml:mi><mml:mrow><mml:mtext>h</mml:mtext></mml:mrow></mml:msub></mml:msub></mml:math></inline-formula>. Kernel density estimation was used to obtain <italic>P</italic>(<italic>x</italic>). Using a Gaussian kernel with width 0.1(dim&#x2061;<italic>X</italic><sub>h,t</sub>)<sup>1/2</sup>, the neural response density <italic>Q</italic>(<italic>x</italic>) and compensation density <italic>Q</italic>&#x2032;(<italic>x</italic>) at <italic>x</italic> &#x2208; <italic>X</italic><sub>h,t</sub> were obtained. Then, the pre-normalised density of interest is,</p>
<disp-formula id="S2.E6"><label>(6)</label><mml:math id="M6"><mml:mtable columnspacing="5pt" displaystyle="true"><mml:mtr><mml:mtd columnalign="center"><mml:mrow><mml:mrow><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo rspace="8.1pt">)</mml:mo></mml:mrow></mml:mrow><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac><mml:mrow><mml:mi>Q</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:msup><mml:mi>Q</mml:mi><mml:mo>&#x2032;</mml:mo></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mtd><mml:mtd/></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>The neural response density <italic>Q</italic>(<italic>x</italic>) is obtained by kernel density estimation of neural responses on <italic>X</italic><sub>h,t</sub>. The compensation density <italic>Q</italic>&#x2032;(<italic>x</italic>) is obtained using pseudo-uniformly generated samples on <italic>X</italic><sub>h,t</sub> instead of the neural responses (For sparseness constraint, i.e., sparse neural responses, <italic>Q</italic>&#x2032;(<italic>x</italic>) is obtained using pseudo-uniformly generated samples on [&#x2212;1, 1]<sup>dim <italic>X</italic><sub>h,t</sub></sup> instead of <italic>X</italic><sub>h,t</sub> = [0, 1]<sup>dim <italic>X</italic><sub>h,t</sub></sup>, so that <italic>Q</italic>&#x2032;(<italic>x</italic>) has a fat distribution around zero. See, <xref ref-type="fig" rid="F5">Figure 5</xref>) The compensation density is necessary to compensate for the non-uniform intrinsic expectation of <italic>Q</italic>(&#x22C5;) resulting from the fact that <italic>X</italic><sub>h,t</sub> is bounded. At the boundary of <italic>X</italic><sub>h,t</sub>, the density of neural responses, <italic>Q</italic>(<italic>x</italic>), measured by kernel density estimation is decrease. This intrinsic decrease corresponds to <italic>Q</italic>&#x2032;(<italic>x</italic>). We compensated for the decrease by dividing <italic>Q</italic>(<italic>x</italic>) by <italic>Q</italic>&#x2032;(<italic>x</italic>).</p>
<p>Finally, the objective of spatio-temporally efficient coding is a linear combination of those two objectives:</p>
<disp-formula id="S2.E7"><label>(7)</label><mml:math id="M7"><mml:mtable columnspacing="5pt" displaystyle="true"><mml:mtr><mml:mtd columnalign="center"><mml:mrow><mml:mpadded width="+5.6pt"><mml:mi>L</mml:mi></mml:mpadded><mml:mo>=</mml:mo><mml:mrow><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mtext>Temporal</mml:mtext></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mrow><mml:mi mathvariant="normal">&#x03BB;</mml:mi><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mtext>Spatial</mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:mrow></mml:mrow></mml:mtd><mml:mtd/></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Where &#x03BB; is a regularisation parameter. A smaller &#x03BB; indicates a greater emphasis on the temporally efficient coding objective, whereas a larger &#x03BB; indicates the opposite. As mentioned previously, temporally efficient coding decreases the conditional entropy of neural response given stimulus, <italic>H</italic>(<italic>X</italic>|<italic>S</italic>) where <italic>X</italic> indicates neural response and <italic>S</italic> indicates stimulus. Spatially efficient coding increases <italic>H</italic>(<italic>X</italic>). Hence, the regularisation parameter &#x03BB; can be seen as controlling the balance between <italic>H</italic>(<italic>X</italic>) and &#x2212;<italic>H</italic>(<italic>X</italic>|<italic>S</italic>). It is a modification of fixed balance of Shannon mutual information <italic>I</italic>(<italic>X</italic>; <italic>S</italic>) = <italic>H</italic>(<italic>X</italic>)&#x2212;<italic>H</italic>(<italic>X</italic>|<italic>S</italic>) of the existing efficient coding (<xref ref-type="bibr" rid="B25">Friston, 2010</xref>).</p>
<p>Temporal trajectories of neural responses are smoothed by temporally efficient coding, but this does not mean just slow neural representations. By spatially efficient coding, different neural responses to different inputs from the external world should be exhibited. Therefore, it should show rapid changes in neural responses to sudden changes in the external world (fast representation). This is the difference from slow feature analysis (<xref ref-type="bibr" rid="B90">Wiskott and Sejnowski, 2002</xref>; <xref ref-type="bibr" rid="B6">Berkes and Wiskott, 2005</xref>; <xref ref-type="bibr" rid="B17">Creutzig and Sprekeler, 2008</xref>) or temporal coherence (<xref ref-type="bibr" rid="B42">Hurri and Hyv&#x00E4;rinen, 2002</xref>; <xref ref-type="bibr" rid="B95">Zou et al., 2011</xref>), which targets slow neural representations. On the one hand, with temporally efficient coding, changes in neural responses should be smoothed out quickly when the external input is not changing (fast stabilisation).</p>
<p>Suppose that Gaussian noise is added to a series of temporally correlated external inputs (e.g., static images or smoothly moved images + Gaussian noise). If these noisy external inputs are still temporally correlated, then smoothing the temporal trajectory of neural responses (by temporally efficient coding objective <italic>L</italic><sub>Temporal</sub>) is rendering temporally similar neural responses to temporally similar inputs. It is also smooth neural representations that locally preserves the structure of the external world. Moreover, if the noise is provided independently of the input, the effect of the noise on neural representations will be dispelled by multiple independent trials. Therefore, temporally efficient coding objective <italic>L</italic><sub>Temporal</sub> increases the fidelity of the neural representation with respect to a Gaussian noise.</p>
<p>Smooth neural representation also means making different neural responses to different inputs. In other words, it increases discriminability for different inputs. This is achieved by spatially efficient coding objective <italic>L</italic><sub>Spatial</sub> that maximises the entropy of neural responses. Assumed that Gaussian noise is added to external inputs. Let <italic>D</italic><sub>true</sub> be a binary-valued random variable for the discrimination between two actually different inputs such that <italic>D</italic><sub>true</sub> = 1 means the discrimination that two noised inputs differ and <italic>D</italic><sub>true</sub> = 0 means the discrimination that two noised inputs are same. <italic>D</italic><sub>true</sub> is probabilistic because of the Gaussian noise mentioned earlier. Let <italic>D</italic><sub>response</sub> be a binary-valued random variable for the discrimination between two neural responses for inputs such that <italic>D</italic><sub>response</sub> = 1 means the discrimination that two neural responses differ and <italic>D</italic><sub>response</sub> = 0 means the discrimination that two neural responses are same. Let <italic>P</italic><sub>true</sub> and <italic>P</italic><sub>response</sub> be the probability mass functions of <italic>D</italic><sub>true</sub> and <italic>D</italic><sub>response</sub>, respectively. Spatially efficient coding objective <italic>L</italic><sub>Spatial</sub> decreases the Kullback&#x2013;Leibler divergence from <italic>P</italic><sub>response</sub> to <italic>P</italic><sub>true</sub>, i.e., <italic>D</italic><sub>KL</sub>(<italic>P</italic><sub>true</sub>||<italic>P</italic><sub>response</sub>). Hence it also decreases the binary cross entropy <italic>H</italic>(<italic>P</italic><sub>true</sub>) + <italic>D</italic><sub>KL</sub>(<italic>P</italic><sub>true</sub>||<italic>P</italic><sub>response</sub>) where <italic>H</italic>(&#x22C5;) is an informational entropy.</p>
<p>In the present study, the depth <italic>H</italic> of hierarchies was set to 2, the minimum depth to realise both bottom-up and top-down pathways in the same hierarchy. Images with 64&#x00D7;96 size were divided into four overlapping 38&#x00D7;58 patches. Each patch was connected to 16 of 64 lower hierarchy units. Further, 64 lower hierarchy units were fully connected to 64 upper hierarchy units. All units in each hierarchy are fully connected to each other (<xref ref-type="fig" rid="F2">Figure 2A</xref>).</p>
<p>Because <italic>L</italic> = <italic>L</italic><sub>Temporal</sub> + &#x03BB;<italic>L</italic><sub>Spatial</sub> is differentiable, the minimisation of the objective in spatio-temporally efficient coding was performed with a gradient descent. The Adam optimiser (<xref ref-type="bibr" rid="B45">Kingma and Ba, 2015</xref>) was used to perform the stochastic gradient descent with momentum. The parameters of the Adam optimiser used in this study were &#x03B1; = 0.001, &#x03B2;<sub>1</sub> = 0.9, &#x03B2;<sub>2</sub> = 0.999, and=10<sup>&#x2212;8</sup>. The optimisation lasted 10<sup>4</sup> iterations for each repetition and restarted with five repetitions. For each iteration, the duration of temporal processing <italic>f</italic><sub>t</sub> was five (i.e., <italic>t</italic> &#x2208; [1,5]), and the minibatch size was 40. For a given image, the duration of temporal processing of <italic>f</italic><sub>t</sub> was given as five (i.e., <italic>t</italic> &#x2208; [1,5]) in both learning and inference, because four time steps are required for the image information to reach the top hierarchy and return over <italic>H</italic> = 2 hierarchies, in addition to one time step to obtain future neural responses. In learning, after temporal processing of five durations was finished, new temporal processing begins, and the initial values of neural responses of new temporal processing were the last neural responses values of the previous temporal processing. In our simulations, we repeatedly exposed the hierarchical structure to natural scene images, which enabled it to learn the bidirectional information transmissions between top-down and bottom-up hierarchies using spatio-temporally efficient coding with a range of the balancing parameter &#x03BB;. Successful learning was confirmed by minimising or stabilising <italic>L</italic> during learning (see, <xref ref-type="fig" rid="F2">Figure 2B</xref>). Further, we verified that the learned hierarchical structure could successfully reconstruct an input image, as shown in <xref ref-type="fig" rid="F2">Figure 2D</xref>.</p>
<p>The appendix of the summary for all mathematical formulations is available at the <xref ref-type="supplementary-material" rid="DS1">Supplementary Material</xref>.</p>
<p>Simulation codes for spatio-temporally efficient coding are available from <ext-link ext-link-type="uri" xlink:href="https://github.com/DuhoSihn/Spatio-temporally">https://github.com/DuhoSihn/Spatio-temporally</ext-link> efficient-coding (<xref ref-type="bibr" rid="B74">Sihn, 2021</xref>).</p>
<p>The flowchart of details of implementation of spatio-temporally efficient coding (<xref ref-type="fig" rid="F9">Figure 9A</xref>) and experimental workflow in the present study (<xref ref-type="fig" rid="F9">Figure 9B</xref>) were summarised in <xref ref-type="fig" rid="F9">Figure 9</xref>.</p>
</sec>
<sec id="S2.SS3">
<title>Datasets</title>
<p>For the simulations, van Hateren&#x2019;s natural scene image dataset (<xref ref-type="bibr" rid="B86">van Hateren and van der Schaaf, 1998</xref>) was used. The dataset was downloaded from <ext-link ext-link-type="uri" xlink:href="https://pirsquared.org/research/#van-hateren-database">https://pirsquared.org/research/#van-hateren-database</ext-link>. The images were downsized to 64&#x00D7;96 pixels. For the comparison tests, the MNIST handwritten digit dataset (<xref ref-type="bibr" rid="B50">Lecun et al., 1998</xref>) was used. The dataset was downloaded from <ext-link ext-link-type="uri" xlink:href="http://yann.lecun.com/exdb/mnist/">http://yann.lecun.com/exdb/mnist/</ext-link>. The images were resized to 64&#x00D7;96 pixels to fit the images used in the simulations. All image data were rescaled between 0 and 1.</p>
</sec>
<sec id="S2.SS4">
<title>Computation of Receptive Fields</title>
<p>Images of 64&#x00D7;96 size with randomly selected 16&#x00D7;16 subregions were created to compute the receptive field. These 64&#x00D7;96 images had a value of one in the 16&#x00D7;16 subregions and zero otherwise. Receptive fields were calculated based on the average neural response when the visual hierarchy (<italic>X</italic><sub>h = 1</sub>&#x00D7;&#x2026;&#x00D7;<italic>X</italic><sub>h = H</sub>) was exposed to these images. Specifically, the value of the receptive field at a pixel was defined as the average neural response to an image with a value of one at this pixel. This averaging method was inspired by the previous study in case of binary neural responses (firing or not) (<xref ref-type="bibr" rid="B61">Ohzawa et al., 1996</xref>). The difference is that the previous study was for image averaging and the present study was for neural response averaging.</p>
</sec>
</sec>
<sec id="S3" sec-type="results">
<title>Results</title>
<sec id="S3.SS1">
<title>Balancing Between Temporally and Spatially Efficient Coding</title>
<p>The temporally efficient coding objective that minimises the temporal difference between present and future neural responses smooths the temporal trajectory of neural responses. The spatially efficient coding objective that maximises the informational entropy of neural responses renders different neural responses to different inputs. We can predict that when the temporally efficient coding objective and the spatially efficient coding objective are properly balanced, different neural responses to different stimuli and temporally smooth neural responses to the same stimuli can be expected. This means that when the external input changes, the neural response changes quickly, and when the external input does not change, the neural response quickly stabilises. If the temporally efficient coding objective is overweighted, it can be predicted to arrive at a trivial solution in which the neural response does not change despite changes in external input. Conversely, if the spatially efficient coding objective is overweighted, it can be predicted that the temporal trajectory of the neural response is not smooth, resulting in neuronal noise. Three regularisation parameters &#x03BB; &#x2208; {0, 10, 1000} were used to confirm these predictions through simulations, and each &#x03BB; corresponds to each simulation condition: the temporally efficient coding overweighted (&#x03BB; = 0), the balanced condition of two coding objectives (&#x03BB; = 10), and the condition of the spatially efficient coding overweighted (&#x03BB; = 1000), respectively. The selection of &#x03BB; &#x2208; {0, 10, 1000} made the comparison between spatio-temporally efficient coding and other neural codings. Putting &#x03BB; = 0 renders that <italic>L</italic> = <italic>L</italic><sub>Temporal</sub> in the eq. (7), consequentially making that the temporally efficient coding objective is overweighted (TEC condition). Putting &#x03BB; = 1000 renders that <italic>L</italic><sub>Temporal</sub> lose its influence such that <italic>L</italic>&#x2248;<italic>L</italic><sub>Spatial</sub> in the eq. (7), consequentially making that the spatially efficient coding objective is overweighted (SEC condition). Putting &#x03BB; = 10 renders the appropriate balance between <italic>L</italic><sub>Temporal</sub> and <italic>L</italic><sub>Spatial</sub>, representing the balanced condition of two objectives (STEC condition). The SEC condition is a condition in which only the spatially efficient coding objective is considered, and represents the existing efficient coding (<xref ref-type="bibr" rid="B3">Barlow, 1961</xref>; <xref ref-type="bibr" rid="B48">Laughlin, 1981</xref>). Sparse coding (<xref ref-type="bibr" rid="B62">Olshausen and Field, 1996</xref>, <xref ref-type="bibr" rid="B63">1997</xref>) was chosen to compare with other neural coding principles where temporal factors were not considered. The sparseness constraint (see section &#x201C;Implementation of Spatio-Temporally Efficient Coding&#x201D;) is applied to &#x03BB; = 1000 to implement sparse coding (Sparse condition).</p>
<p>Based on the simulation, we confirmed this phenomenon by observing that, under STEC condition (&#x03BB; = 10), when the external input changes, the neural response changes rapidly, and when the external input does not change, the neural response quickly stabilises. In addition, we confirmed that under TEC condition (&#x03BB; = 0, control), we arrive at the trivial solution in which the neural response does not change even when the external input changes. We also confirmed that under SEC condition (&#x03BB; = 1000) and Sparse condition (&#x03BB; = 1000 + sparseness constraint), the temporal trajectory of the neural response is not smooth, resulting in neuronal noise (<xref ref-type="fig" rid="F2">Figure 2D</xref>). This also shows the problem of applying existing efficient coding (<xref ref-type="bibr" rid="B3">Barlow, 1961</xref>; <xref ref-type="bibr" rid="B48">Laughlin, 1981</xref>) and sparse coding (<xref ref-type="bibr" rid="B62">Olshausen and Field, 1996</xref>, <xref ref-type="bibr" rid="B63">1997</xref>) directly to the hierarchical structure on time domain.</p>
<p>A change in balance between the two objectives also altered the relative strengths between bottom-up and top-down synaptic connections. In order to find out whether the strength of the bottom-up and top-down synaptic weights is related to the balance between the two objectives, the synaptic weights at various &#x03BB; values were investigated. The synaptic weights were defined as <inline-formula><mml:math id="INEQ102"><mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:munder><mml:mo movablelimits="false">&#x2211;</mml:mo><mml:mi>i</mml:mi></mml:munder><mml:msubsup><mml:mi>w</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow><mml:mn>2</mml:mn></mml:msubsup></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mn>1</mml:mn><mml:mo>/</mml:mo><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> where <italic>w</italic><sub><italic>ij</italic></sub> is an <italic>ij</italic>th component of a synaptic weight matrix <italic>W</italic> which is defined at Implantation of spatio-temporally efficient coding section. Simulation results demonstrated that top-down synaptic strengths from upper to lower hierarchy were increased compared to bottom-up synaptic strengths from images to lower hierarchy as &#x03BB; increased, with an emphasis on spatially efficient coding (<xref ref-type="fig" rid="F2">Figure 2C</xref>, see section &#x201C;Materials and Methods&#x201D; for details on the computation of synaptic strength). This finding suggests that the balance between spatially and temporally efficient coding could be related to the balance between bottom-up and top-down synaptic strengths in the brain.</p>
</sec>
<sec id="S3.SS2">
<title>Smooth Neural Representations of the External World</title>
<p>As a major function of the brain is to represent the external world (<xref ref-type="bibr" rid="B20">deCharms and Zador, 2000</xref>; <xref ref-type="bibr" rid="B46">Kriegeskorte and Diedrichsen, 2019</xref>), we investigated whether spatio-temporally efficient coding creates appropriate neural representations of the external world. Spatio-temporally efficient coding can be expected to smooth the temporal trajectory of neural responses while having different neural responses to different external inputs. This effect may render smooth neural representations that locally preserves the structure of the external world, i.e., similar neural responses for similar external stimuli. To confirm this by simulation, we compared similarities between natural scene images and neural responses to those images. We quantified this by examining the relations between a natural scene image space and a neural response space. Specifically, we first measured the global feature based distance (<xref ref-type="bibr" rid="B21">Di Ges&#x00F9; and Starovoitov, 1999</xref>) from a natural scene image (i.e., reference image) to other natural scene images in the image space and the Euclidean distances from the neural responses for the reference image to those for other compared images in the neural response space (<xref ref-type="fig" rid="F3">Figure 3A</xref>). It has been proven that global feature based distance reflects differences in images in a way that humans actually perceive (<xref ref-type="bibr" rid="B21">Di Ges&#x00F9; and Starovoitov, 1999</xref>). When calculating the distances between neural responses, we used the neural responses at five time steps after receiving a given image based on the assumption that the neural responses became temporally steady after five steps of temporal processing (see <xref ref-type="fig" rid="F1">Figure 1B</xref>).</p>
<p>As a further analysis, we measured local similarity and discriminability based on distances defined in the natural scene image space and the neural response space. Local similarity indicates how well the similarity among the relatively similar images is reflected in the neural responses to those images, representing one of measures that similar neural responses for similar external stimuli. The local similarity was measured the Pearson&#x2019;s linear correlation between the distances in the image space and those in the neural response space. A high correlation indicates that neural responses tend to be similar when the visual system perceives natural scenes close to each other (in the sense of global feature based distance). We set each image sample as a reference and repeated the calculations for the correlation across all image samples. We defined local similarity as the average correlation obtained by using the distances from one image to its neighbouring images only (1, 2, and 4% of all images).</p>
<p>Discriminability is a measure complementary to local similarity, which measures whether neural responses to different images are different. We measured discriminability as the proportion of dissimilar neural responses among dissimilar images. This dissimilarity was defined as 99% of all, based on certain distances (the global feature based distance for images, the Euclidean distance for neural responses). Specifically, the discriminability for an image <italic>s</italic><sub>ref</sub> was defined as |{<italic>s</italic> &#x2208; images |<italic>d</italic><sub>S</sub>(<italic>s</italic>, <italic>s</italic><sub>ref</sub>) &#x003E; &#x03B8;<sub>S</sub> and <italic>d</italic><sub>X</sub>(<italic>f</italic>(<italic>s</italic>), <italic>f</italic>(<italic>s</italic><sub>ref</sub>)) &#x003E; &#x03B8;<sub>X</sub>}| divided by |{<italic>s</italic> &#x2208; images |<italic>d</italic><sub>S</sub>(<italic>s</italic>, <italic>s</italic><sub>ref</sub>) &#x003E; &#x03B8;<sub>S</sub>}| where |{&#x22C5;}| is the set size, <italic>d</italic><sub>S</sub>(<italic>s</italic>, <italic>s</italic><sub>ref</sub>) is the distance on the natural scene image space, <italic>d</italic><sub>X</sub>(<italic>f</italic>(<italic>s</italic>), <italic>f</italic>(<italic>s</italic><sub>ref</sub>)) is the distance on the neural response space, &#x03B8;<sub>S</sub> is the threshold which indicates 99% of all images, and &#x03B8;<sub>X</sub> is the threshold which indicates 99% of all neural responses. A high discriminability indicates that if the image is different, the neural response will also be different.</p>
<p>If the learned neural representations are smooth, i.e., locally preserving the structure of the external world, both local similarity and discriminability will be high. We can predict that when the temporally efficient coding objective and spatially efficient coding objective are balanced (STEC condition), neural representations are smooth, so that local similarity and discriminability are high. From the simulations, we confirmed that local similarity and discriminability do not lower in STEC condition than SEC or Sparse condition (<xref ref-type="fig" rid="F3">Figures 3B,C</xref>).</p>
</sec>
<sec id="S3.SS3">
<title>Decodable Stable Neural Representations</title>
<p>In a hierarchical structure in which information is exchanged in both directions, if the information represented by the upper and lower hierarchies at the same time is different, it is difficult to obtain a stable neural response on time domain for an external input. The reason is that if the information represented by the upper and lower hierarchy are different, different information is exchanged, and thus the information represented next time may be also different. These inter-connected structures could also produce chaotic dynamics (<xref ref-type="bibr" rid="B71">Rubinov et al., 2009</xref>; <xref ref-type="bibr" rid="B83">Tomov et al., 2014</xref>).</p>
<p>The objective of temporally efficient coding is to render present neural responses similar to future neural responses that has not yet arrived. This minimisation of temporal difference creates a learning effect in which the temporal trajectory of the neural response becomes smooth. It thereby minimises the size of the space represented, when a single stimulus is represented on the time domain. In other words, it reduces neuronal noise which is defined as the uncertainty of a neural response for given stimulus (<xref ref-type="bibr" rid="B7">Borst and Theunissen, 1999</xref>). The expected effect of this learning effect is to quickly stabilise neural responses when a static external stimulus is given. The objective of spatially efficient coding is to render different neural responses to different stimuli. The expected effect of spatio-temporally efficient coding is to render decodable stable neural representations which is an appropriate function to hierarchical brain structures on the time domain.</p>
<p>To confirm these expectations, we performed four simulation experiments. The first simulation is to present multiple images for five time step for each image and check the neural responses for those images. When each image was given, it was checked whether the neural response was quickly stabilised (<xref ref-type="fig" rid="F4">Figure 4A</xref>). The second simulation measures the amount of noise in the neural responses when each image is presented during 25 time step. Neuronal noise is defined as the uncertainty of a neural response for given stimulus (<xref ref-type="bibr" rid="B7">Borst and Theunissen, 1999</xref>), so we measured the conditional entropy of neural responses during each five time step as neuronal noise. That is, <inline-formula><mml:math id="INEQ108"><mml:mrow><mml:mpadded width="+2.8pt"><mml:mi>Neuronal</mml:mi></mml:mpadded><mml:mpadded width="+5.6pt"><mml:mi>noise</mml:mi></mml:mpadded><mml:mo>=</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mrow><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>|</mml:mo></mml:mrow></mml:mfrac><mml:munder><mml:mo movablelimits="false">&#x2211;</mml:mo><mml:mrow><mml:mi mathvariant="normal">s</mml:mi><mml:mo>&#x2208;</mml:mo><mml:mi mathvariant="normal">S</mml:mi></mml:mrow></mml:munder><mml:mi>H</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mtext>t</mml:mtext></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi><mml:mo>&#x2208;</mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:msup><mml:mi>t</mml:mi><mml:mo>&#x2032;</mml:mo></mml:msup><mml:mo>+</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:msup><mml:mi>t</mml:mi><mml:mo>&#x2032;</mml:mo></mml:msup><mml:mo>+</mml:mo><mml:mn>5</mml:mn><mml:mo>]</mml:mo></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> for each <italic>t</italic>&#x2032; &#x2208; {0, 5, 10, 15, 20} where <italic>s</italic> is an image, <italic>S</italic> is the set of all images, <italic>H</italic>(&#x22C5;|&#x22C5;) is the conditional entropy, and <italic>f</italic><sub>t</sub> is the neural response at time <italic>t</italic>. Probability was estimated by kernel density estimation without a partition function, and thus shifted conditional entropy was measured. We checked the amount of neuronal noise (<xref ref-type="fig" rid="F4">Figure 4B</xref>). In the third simulation, we checked whether the neural responses to each image stabilised over time. To do this, we measured how confused the neural response over time (9 time step) to one image was with the neural response to a similar image. This is represented by the confusion index. Confusion index for image <italic>s</italic> at time <italic>t</italic> = |<italic>f</italic><sub>t</sub>(<italic>s</italic>)&#x2212;<italic>f</italic><sub>10</sub>(<italic>s</italic>)|/|<italic>f</italic><sub>10</sub>(<italic>s</italic>)&#x2212;<italic>f</italic><sub>10</sub>(<italic>s</italic>&#x2032;)|where <italic>f</italic><sub>t</sub>(<italic>s</italic>) is the neural response for image <italic>s</italic> at time <italic>t</italic> and <italic>s</italic>&#x2032; is the nearest image of <italic>s</italic> based on the global feature based distance (<xref ref-type="bibr" rid="B21">Di Ges&#x00F9; and Starovoitov, 1999</xref>). If the confusion index is less than 1, it can be said that the neural response is well stabilised. We checked whether the neural response was stabilised by reducing the confusion index to less than 1 (<xref ref-type="fig" rid="F4">Figure 4C</xref>). In fourth simulation, we checked how decodable the neural responses was. To measure how decodable the neural response is, the neural responses of time steps 1,&#x2026;, 9 were decoded using the neural responses of time steps 10 and 11 as a training set. The na&#x00EF;ve Bayes classifier (<xref ref-type="bibr" rid="B34">Hastie et al., 2009</xref>) was selected as the decoder, and each of the 4212 natural scene images was defined as one class (<xref ref-type="fig" rid="F4">Figure 4D</xref>).</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p>Decodable stable neural representations via spatio-temporally efficient coding. <bold>(A)</bold> Examples of neural responses in the lower hierarchy. These were neural responses of 3rd units in <xref ref-type="fig" rid="F2">Figure 2D</xref>. Dotted vertical black lines indicate the presentations of new external input. <bold>(B)</bold> Neuronal noise that measured as shifted conditional entropy of neural responses given stimulus. Conditional entropy was measured by collecting neural responses at every five time step. <bold>(C)</bold> Confusion index that a measure of how much it confuses neural responses to one stimulus with neural responses to another similar stimulus. The dotted lines denote the confusion index of 1 which indicates the confusion. Error bars indicate standard deviations. <bold>(D)</bold> Decoding accuracy via the na&#x00EF;ve Bayes classifier. The upper dotted line denotes the maximum decoding accuracy, 1. The lower dotted line denotes the chance level, 1/4212. Error bars indicate standard deviations.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fncom-16-890447-g004.tif"/>
</fig>
<p>As a result of the simulation, different neural responses to different stimuli were shown in the STEC condition, and these neural responses were rapidly stabilised. On the other hand, neural responses were less stabilised under SEC and sparse conditions (<xref ref-type="fig" rid="F4">Figure 4A</xref>). The factor affecting the stabilisation of the neural response was the amount of neuronal noise. To quantify the amount of neural noise, conditional entropy of neural responses given stimulus was measured. Conditional entropy was measured by collecting neural responses at every five time step. The STEC condition showed lower neuronal noise than the SEC and Sparse conditions (<xref ref-type="fig" rid="F4">Figure 4B</xref>).</p>
<p>We checked whether the neural response was well stabilised through the confusion index. Under STEC condition, the confusion index was kept below 1 on average for the majority of time steps at each hierarchy (<xref ref-type="fig" rid="F4">Figure 4C</xref>). This means that the neural representations are rapidly stabilised in STEC condition. On the other hand, the confusion index was not sufficiently reduced in SEC and Sparse condition (<xref ref-type="fig" rid="F4">Figure 4C</xref>). This means that sufficient stabilisation of neural representations is difficult with only existing efficient coding or sparse coding.</p>
<p>Finally, we checked whether the neural response is decodable. In the STEC condition, the decoding accuracy gradually increased over time, suggesting decodable stable neural representations. On the other hand, the decoding accuracy was low in SEC and Sparse conditions.</p>
</sec>
<sec id="S3.SS4">
<title>Relation to Homeostasis</title>
<p>Neural system homeostasis is associated with maximisation of mutual information between neural responses and external stimuli (<xref ref-type="bibr" rid="B84">Toyoizumi et al., 2005</xref>; <xref ref-type="bibr" rid="B81">Sullivan and de Sa, 2006</xref>). Since spatio-temporally efficient coding increases the Shannon mutual information between neural responses and external stimuli, it is related to homeostasis. In particular, temporal difference minimisation of neural responses in temporally efficient coding is reminiscent of homeostasis of energy metabolism. Smoothing the temporal trajectory of a neural responses reduces the variance of the neural response distribution so that the neural response stays within a certain range. We confirmed this through simulation. It was checked whether the neural response had a middle value, not an extreme value such as 0 or 1, which is homeostasis of neural responses. In STEC condition emphasising temporally efficient coding objective, neural responses were concentrated at the middle value than in SEC condition (<xref ref-type="fig" rid="F8">Figure 8</xref>). <xref ref-type="fig" rid="F8">Figure 8B</xref> showed specific cases corresponding to stimuli in <xref ref-type="fig" rid="F8">Figure 8A</xref>. <xref ref-type="fig" rid="F8">Figure 8C</xref> showed the distribution of overall neural responses, indicating that this is general phenomena of STEC. This is also a consequence of the homeostatic plasticity (<xref ref-type="bibr" rid="B85">Turrigiano and Nelson, 2004</xref>) of the brain.</p>
</sec>
<sec id="S3.SS5">
<title>Deviant Neural Responses to Unlearned Inputs</title>
<p>The visual system often responds selectively to sensory inputs (<xref ref-type="bibr" rid="B55">Margoliash, 1983</xref>; <xref ref-type="bibr" rid="B89">Waydo et al., 2006</xref>). Even for the type of sensory inputs to which the visual system is responsive, unfamiliar inputs induce larger neural responses compared to familiar inputs (<xref ref-type="bibr" rid="B38">Huang et al., 2018</xref>; <xref ref-type="bibr" rid="B43">Issa et al., 2018</xref>). These large neural responses to unfamiliar inputs are thought to be due to prediction errors (<xref ref-type="bibr" rid="B43">Issa et al., 2018</xref>). On the one hand, spatio-temporally efficient coding renders smooth neural representations, i.e., locally preserving the structure of the external world. Because unlearned inputs differ from learned inputs, if their neural representations are smooth, their neural representations will also differ. By STEC homeostasis, since the neural response to the learned input has a middle value (<xref ref-type="fig" rid="F8">Figure 8</xref>), the neural responses to the unlearned input (if these differ from the neural responses to the learned input) may be closed to extreme values such as 0 or 1. These are deviant neural responses to unlearned inputs. Accordingly, we can expect that spatio-temporally efficient coding could predict the phenomenon of deviant neural responses to unlearned inputs without the introduction of prediction error responses mediated by error units. We checked whether the distribution of neural responses to unlearned input is close to extreme values such as 0 or 1, unlike the distribution of neural responses to learned input.</p>
<p>In the simulations, the visual hierarchical structure learned to be learned with natural scene images, and novel handwritten digit images were used as unlearned visual inputs (<xref ref-type="fig" rid="F5">Figure 5A</xref>). The simulation results revealed that neural responses were distributed over middle values for learned images and over smaller or larger values for unlearned inputs (<xref ref-type="fig" rid="F5">Figures 5B,C</xref>), suggesting that spatio-temporally efficient coding could predict the phenomenon of deviant neural responses to unlearned inputs. The same conclusion could also be reached if the learned images were handwritten digit images and the unlearned images were natural scene images (<xref ref-type="fig" rid="F5">Figures 5D&#x2013;F</xref>).</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption><p>Neural response distributions for learned and unlearned inputs. <bold>(A)</bold> Natural scene images are used as learned inputs (for learning in the visual hierarchical structure) and handwritten digit images are used as unlearned inputs (not used in learning). <bold>(B)</bold> Examples of neural response distributions of lower hierarchy units for each of the five time steps of bidirectional information transmissions, corresponding to images in <bold>(A)</bold>. The colour scale indicates the proportion. <bold>(C)</bold> Overall neural response distributions in response to all input images (either learned or unlearned) at lower and upper hierarchies, respectively. <bold>(D&#x2013;F)</bold> Are similar to <bold>(A&#x2013;C)</bold>, but handwritten digit images are used as learned inputs and natural scene images are used as unlearned inputs.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fncom-16-890447-g005.tif"/>
</fig>
<p>Simulations in the present study demonstrated that neural responses were distributed around smaller or larger extremes for unlearned inputs and around intermediate values for learned inputs (<xref ref-type="fig" rid="F5">Figures 5B,C</xref>). In a separate analysis, we allowed the neural responses to learned inputs be distributed only around lower values (see section &#x201C;Implementation of Spatio-Temporally Efficient Coding,&#x201D; &#x03BB; = 5), similar to sparse coding (<xref ref-type="bibr" rid="B62">Olshausen and Field, 1996</xref>, <xref ref-type="bibr" rid="B63">1997</xref>), and observed that neural responses of the lower hierarchy to unlearned inputs exhibited higher values (<xref ref-type="fig" rid="F6">Figure 6</xref>). Under this sparseness constraint, when an external stimulus was first given, the neural response surged from zero to a large value and then stabilised again (<xref ref-type="fig" rid="F6">Figures 6C,D</xref>).</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption><p>Neural response distributions for learned and unlearned inputs under sparseness constraint. <bold>(A)</bold> Natural scene images are used as learned inputs (for learning in the visual hierarchical structure) and handwritten digit images are used as unlearned inputs (not used in learning). <bold>(B)</bold> Lower hierarchy neural responses for consecutive external inputs in <bold>(A)</bold>. The neural response is the output of the sigmoid function and is therefore normalised to a range between 0 and 1. <bold>(C)</bold> Examples of neural response distributions of lower hierarchy units for each of the five time steps of bidirectional information transmissions, corresponding to images in <bold>(A)</bold>. Unlike <bold>(B)</bold>, it is a neural response when images are presented separately rather than consecutively. The colour scale indicates the proportion. <bold>(D)</bold> Overall neural response distributions in response to all input images (either learned or unlearned) at lower and upper hierarchies, respectively.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fncom-16-890447-g006.tif"/>
</fig>
</sec>
<sec id="S3.SS6">
<title>Preferred Orientation Biases of Receptive Fields</title>
<p>Neurons in the visual system prefer horizontal and vertical orientations over oblique orientations (<xref ref-type="bibr" rid="B27">Furmanski and Engel, 2000</xref>; <xref ref-type="bibr" rid="B51">Li et al., 2003</xref>). Indeed, orientation discrimination is more sensitive to horizontal and vertical orientations than to oblique orientations (<xref ref-type="bibr" rid="B29">Girshick et al., 2011</xref>). This is a bias toward cardinals (<xref ref-type="bibr" rid="B29">Girshick et al., 2011</xref>). Since smooth neural representations of spatio-temporally efficient coding reflects the structure of the external world well, we predicted that it would also reflect the environmental statistics of natural scenes. We investigated whether units in the visual hierarchical structure that learned by spatio-temporally efficient coding of natural scene images exhibited such biases. We checked whether the neural responses of the units were highest to the horizontal/vertical orientation stimuli. Units in the lower hierarchy had the Gabor-like visual receptive fields, while units in the upper hierarchy had more complex visual receptive fields (<xref ref-type="fig" rid="F7">Figure 7A</xref>) (also see section &#x201C;Computation of Receptive Fields&#x201D;). Because the units had an oriented Gabor-like receptive field, we used oriented bar stimuli to measure the unit&#x2019;s preferred orientation. We presented a static bar oriented in one of eight angles that moved in the direction perpendicular to the orientation angle (<xref ref-type="fig" rid="F7">Figure 7B</xref>) and defined the response of each unit to that orientation by the largest response during presentation. We then defined the preferred orientation of each unit as the orientation that elicited the largest response. Example neural responses were shown in <xref ref-type="fig" rid="F7">Figure 7C</xref>. Simulation results revealed that units clearly prefer horizontal and vertical orientations over oblique orientations (<xref ref-type="fig" rid="F7">Figure 7D</xref>), consistent with the orientation bias of visual cortical neurons and in accordance with context-independent bottom-up prediction (<xref ref-type="bibr" rid="B82">Teufel and Fletcher, 2020</xref>). This means that smooth neural representations when well-balanced between temporally efficient coding objective and spatially efficient coding objective reflect the environmental statistics of natural scenes.</p>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption><p>Orientation preference. <bold>(A)</bold> Examples of visual receptive fields of lower and upper hierarchy units. <bold>(B)</bold> Orientation images used to test the visual orientation preference of neuronal units at hierarchy 1 in the visual hierarchical structure that learned by the spatio-temporally efficient coding of natural scene images. White bars in each of the eight orientations are moved in the perpendicular directions denoted by blue dotted arrows. <bold>(C)</bold> Examples of lower hierarchy neural responses for bar stimuli. Each colour indicates each orientation. The neural responses of each orientation were obtained from the bar positions with the largest neural responses at time step 5. These four example units correspond to 1st, 3rd, 5th, and 7th units in <bold>(A)</bold>. <bold>(D)</bold> Histograms of the orientation preference at time step.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fncom-16-890447-g007.tif"/>
</fig>
</sec>
</sec>
<sec id="S4" sec-type="discussion">
<title>Discussion</title>
<p>The present study aimed to find computational principles that enables visual hierarchical structures to attain the function to represent external visual information. To address the lack of neural coding principles to encompass both bottom-up and top-down pathways, we propose spatio-temporally efficient coding as a novel computational model. As a principled way of efficiently using given resources in both neural activity space and processing time, this coding principle optimises bidirectional information transmissions over hierarchical structures by simultaneously minimising temporally differences in neural responses and maximising entropy in neural representations. Simulation results showed that the proposed spatio-temporally efficient coding assigned the function of appropriate neural representations of natural scenes to a visual hierarchical structure on time domain and that it could predict deviations in neural responses to unlearned inputs and a bias in preferred orientations, which are well known characteristics of the visual system.</p>
<p>Appropriate neural representations in the present study were decodable stable neural representations which is an appropriate function to hierarchical brain structures on the time domain. To demonstrate this, we compared spatio-temporally efficient coding (STEC condition) with existing efficient coding (SEC condition) and sparse coding (Sparse condition) as shown in <xref ref-type="fig" rid="F4">Figure 4</xref>. Only spatio-temporally efficient coding showed decodable stable neural representations. Existing predictive coding is conceptually problematic to apply to such real-time information processing (<xref ref-type="bibr" rid="B37">Hogendoorn and Burkitt, 2019</xref>). It would be interesting to compare the results with the conceptually improved predictive coding as well. Temporal coherence (<xref ref-type="bibr" rid="B42">Hurri and Hyv&#x00E4;rinen, 2002</xref>; <xref ref-type="bibr" rid="B95">Zou et al., 2011</xref>) or slow feature analysis (<xref ref-type="bibr" rid="B90">Wiskott and Sejnowski, 2002</xref>; <xref ref-type="bibr" rid="B6">Berkes and Wiskott, 2005</xref>; <xref ref-type="bibr" rid="B17">Creutzig and Sprekeler, 2008</xref>) elicits smooth changes in neural responses, similar to temporally efficient coding in the present study. However, they were not used as comparative models because they are not intended to elicit different neural responses to different external inputs and are not suitable for direct application to the bidirectional multiple hierarchical structure of in the present study. Nevertheless, they can be substituted to the role of temporally efficient coding in the present study.</p>
<p>Since its initial proposal (<xref ref-type="bibr" rid="B2">Attneave, 1954</xref>; <xref ref-type="bibr" rid="B3">Barlow, 1961</xref>), spatially efficient coding has been validated experimentally (<xref ref-type="bibr" rid="B48">Laughlin, 1981</xref>). However, observed correlations between neurons, which maximise entropy to a lesser extent compared to mere spatially efficient coding assuming no inter-neuronal correlations, have yet to be incorporated into the principle of spatially efficient coding. Empirically observed neuronal correlations may drive computational processes of the brain away from strict spatially efficient coding. Recent studies suggest that biological visual systems are intermediate between strict spatially efficient coding and correlated neural responses (<xref ref-type="bibr" rid="B80">Stringer et al., 2019</xref>). Therefore, to create biologically plausible computational models, it is necessary to mitigate the spatially efficient coding objective by combining firing-rate-dependent correlations (<xref ref-type="bibr" rid="B19">de la Rocha et al., 2007</xref>). This enables more accurate information transmissions of visual perception mediated by visual hierarchical structures. As we focused on integrating spatially efficient coding with temporally efficient coding for computation in hierarchical structures, this study did not incorporate the correlations between neurons in spatially efficient coding, which will be pursued in follow-up studies.</p>
<p>Based on our simulations, we observed that the learning of bidirectional information transmission networks with spatio-temporally efficient coding was hindered when the balancing parameter &#x03BB; was too small or too large (<xref ref-type="fig" rid="F2">Figure 2D</xref>). Therefore, it was necessary to confine &#x03BB; within a certain range, in which the magnitude of &#x03BB; affected neural responses such that a larger &#x03BB; rendered responses more variable (<xref ref-type="fig" rid="F2">Figure 2D</xref>). Such increased variability is likely to originate from recurrent responses via higher hierarchies. This was confirmed by the observation that top-down synaptic weights become larger than bottom-up synaptic weights when &#x03BB; increased (<xref ref-type="fig" rid="F2">Figure 2C</xref>). Although a large &#x03BB; attenuated the appropriateness of neural representations (<xref ref-type="fig" rid="F3">Figure 3</xref>), it rendered stronger top-down synaptic connections in lower hierarchy (<xref ref-type="fig" rid="F2">Figure 2C</xref>), which is consistent with the previous finding that top-down synaptic connections are stronger than bottom-up connections in the lateral geniculate nucleus (<xref ref-type="bibr" rid="B76">Sillito et al., 2006</xref>). As to why top-down synaptic weights increase with a larger &#x03BB; value (<xref ref-type="fig" rid="F2">Figure 2C</xref>), we speculate that learning via spatio-temporally efficient coding may increase the range of neural responses to maximise entropy through top-down pathways. While the bottom-up pathways originating from external inputs are invariant during learning, the top-down pathways originating from higher hierarchy neural responses are more flexible to adjustment to maximise entropy during learning. An increase in these top-down synaptic weights predicts impairment of eye movement tracking partially occluded visual targets in schizophrenic patients (<xref ref-type="bibr" rid="B1">Adams et al., 2012</xref>). This may be due to an increased higher hierarchy&#x2019;s influence in patients with schizophrenia. The increased higher hierarchy&#x2019;s influence in our simulations is that they do not stabilise neural responses sufficiently to distinguish a given input from other inputs (<xref ref-type="fig" rid="F4">Figure 4C</xref>).</p>
<p>The deviant neural responses to unlearned inputs observed in this study (<xref ref-type="fig" rid="F5">Figure 5</xref>) arise from smooth neural representations for learned inputs (distributed over the middle value). As such, neural representations for learned inputs extrude neural responses to unlearned inputs into a range of deviant neural responses. We conjecture that the visual system may generate deviant neural responses via a similar mechanism.</p>
<p>Spatio-temporally efficient coding predicted a bias in preferred orientations (<xref ref-type="fig" rid="F7">Figure 7</xref>). In this regard, spatially efficient coding alone has been reported to predict bias in preferred orientations (<xref ref-type="bibr" rid="B28">Ganguli and Simoncelli, 2014</xref>). Notably, spatio-temporally efficient coding was able to predict this bias well, even when &#x03BB; was low, that is, when the spatially efficient coding objective was less weighted (<xref ref-type="fig" rid="F7">Figure 7D</xref>). Therefore, this bias prediction should be viewed as a result of spatio-temporally efficient coding, not as a result of spatially efficient coding alone.</p>
<p>The trial-to-trial variability of neural responses can be widely observed in the brain (<xref ref-type="bibr" rid="B54">Malins et al., 2018</xref>; <xref ref-type="bibr" rid="B18">Daniel et al., 2019</xref>; <xref ref-type="bibr" rid="B60">Nogueira et al., 2020</xref>; <xref ref-type="bibr" rid="B52">Li et al., 2021</xref>; <xref ref-type="bibr" rid="B93">Zhang et al., 2022</xref>). It is observed as both electrophysiological signals (<xref ref-type="bibr" rid="B18">Daniel et al., 2019</xref>; <xref ref-type="bibr" rid="B52">Li et al., 2021</xref>) and blood-oxygenation-level-dependent signals (<xref ref-type="bibr" rid="B54">Malins et al., 2018</xref>; <xref ref-type="bibr" rid="B93">Zhang et al., 2022</xref>). These trial-to-trial variability of neural responses are neuronal noise. Assuming strict efficient coding, neurons must code different information to minimise redundancy. This makes neural coding susceptible to neuronal noise (<xref ref-type="bibr" rid="B65">Pryluk et al., 2019</xref>). If we allow some redundancy, this coding can be made more robust to neuronal noise (<xref ref-type="bibr" rid="B65">Pryluk et al., 2019</xref>). In the real brain, neurons are known to perform robust coding (not strict efficient coding) against neuronal noise by having smooth tuning curves (<xref ref-type="bibr" rid="B80">Stringer et al., 2019</xref>). These smooth tuning curves are observed from many experimental data (<xref ref-type="bibr" rid="B13">Chen and Hafed, 2018</xref>; <xref ref-type="bibr" rid="B47">Kutter et al., 2018</xref>; <xref ref-type="bibr" rid="B14">Chettih and Harvey, 2019</xref>; <xref ref-type="bibr" rid="B15">Christensen et al., 2019</xref>), supporting the mitigation of strict efficient coding. Spatio-temporally efficient coding makes the neural responses distributed around the middle value by minimising the temporal difference of the neural responses (<xref ref-type="fig" rid="F8">Figure 8</xref>). This is deviant from strict efficient coding as it does not reach entropy maximisation. Therefore, it may be closer to the coding principle of the real brain.</p>
<fig id="F8" position="float">
<label>FIGURE 8</label>
<caption><p>Neural response distributions. <bold>(A)</bold> Examples of natural scene images. <bold>(B)</bold> Examples of neural response distributions of lower hierarchy units for each of the five time steps of bidirectional information transmissions, corresponding to images in <bold>(A)</bold>. The colour scale indicates the proportion. <bold>(C)</bold> Overall neural response distributions in response to all input images (either STEC or SEC) at lower and upper hierarchies, respectively. The colour scale indicates the proportion.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fncom-16-890447-g008.tif"/>
</fig>
<fig id="F9" position="float">
<label>FIGURE 9</label>
<caption><p>Summary. <bold>(A)</bold> Flowchart of details of implementation of spatio-temporally efficient coding. <bold>(B)</bold> Experimental workflow in the present study.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fncom-16-890447-g009.tif"/>
</fig>
<p>The present study has several limitations. First, for simplicity, our simulation model contained only two hierarchies. However, it is necessary to explore how spatio-temporally efficient coding operates in models with more hierarchies. We also modelled 64 neuronal units at each hierarchy, as we assumed that this would be sufficient to represent the natural scene images used in this study. Nevertheless, the interactions between the number of neuronal units, levels of hierarchy, and spatio-temporally efficient coding require further investigation. Second, we demonstrated that the visual hierarchical structure could learn to represent static natural scene images with spatio-temporally efficient coding, but future follow-up studies will investigate whether the visual hierarchical structure learns to represent moving scenes using the same coding principle (also see, <xref ref-type="bibr" rid="B72">Sederberg et al., 2018</xref>). Finally, the scope of the present study was limited to the visual system given that its hierarchical structure is well documented, but spatio-temporally efficient coding may be applied to other systems (e.g., somatosensory system) or to movements and planning. Recent efficient coding researches are expanding their scope to perception generalisation (<xref ref-type="bibr" rid="B77">Sims, 2018</xref>), subjective value (<xref ref-type="bibr" rid="B64">Polan&#x00ED;a et al., 2019</xref>), and memory (<xref ref-type="bibr" rid="B56">McPherson and McDermott, 2020</xref>). It seems that spatio-temporally efficient coding can also broaden its scope.</p>
<sec id="S4.SS1">
<title>Conclusion</title>
<p>In the present study, we proposed spatio-temporally efficient coding, inspired by the efficient use of given resources in neural systems, as a neural coding mechanism to assign representational functions to the hierarchical structures of the visual system. Simulations demonstrated that the visual hierarchical structure could represent the external world (i.e., natural scenes) appropriately using bidirectional information transmissions (<xref ref-type="fig" rid="F2">Figures 2</xref>&#x2013;<xref ref-type="fig" rid="F4">4</xref>). Furthermore, spatio-temporally efficient coding predicted the well-known properties of visual cortical neurons, including deviations in neural responses to unlearned images (<xref ref-type="fig" rid="F5">Figures 5</xref>, <xref ref-type="fig" rid="F6">6</xref>) and bias in preferred orientations (<xref ref-type="fig" rid="F7">Figure 7</xref>). Our proposed spatio-temporally efficient coding may facilitate deeper mechanistic understanding of the computational processes of hierarchical brain structures.</p>
</sec>
</sec>
<sec id="S5" sec-type="data-availability">
<title>Data Availability Statement</title>
<p>The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/<xref ref-type="supplementary-material" rid="DS1">Supplementary Material</xref>.</p>
</sec>
<sec id="S6">
<title>Author Contributions</title>
<p>DS: conceptualisation, methodology, software, validation, formal analysis, investigation, resources, data curation, writing &#x2013; original draft, writing &#x2013; review and editing, visualisation, supervision, and project administration. S-PK: writing &#x2013; original draft, writing &#x2013; review and editing, visualisation, supervision, project administration, and funding acquisition. Both authors contributed to the article and approved the submitted version.</p>
</sec>
<sec id="conf1" sec-type="COI-statement">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="pudiscl1" sec-type="disclaimer">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<sec id="S7" sec-type="funding-information">
<title>Funding</title>
<p>This research was supported by the Brain Convergence Research Programs of the National Research Foundation (NRF), funded by the Korean Government (MSIT) (Nos. NRF-2019M3E5D2A01058328 and 2021M3E5D2A01019542).</p>
</sec>
<ack>
<p>This research was previously uploaded to the preprint server bioRxiv (<xref ref-type="bibr" rid="B75">Sihn and Kim, 2021</xref>). Available at: <ext-link ext-link-type="uri" xlink:href="https://www.biorxiv.org/content/10.1101/2021.08.13.456321v4.abstract">https://www.biorxiv.org/content/10.1101/2021.08.13.456321v4.abstract</ext-link>.</p>
</ack>
<sec id="S9" sec-type="supplementary-material">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fncom.2022.890447/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fncom.2022.890447/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.docx" id="DS1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Adams</surname> <given-names>R. A.</given-names></name> <name><surname>Perrinet</surname> <given-names>L. U.</given-names></name> <name><surname>Friston</surname> <given-names>K.</given-names></name></person-group> (<year>2012</year>). <article-title>Smooth pursuit and visual occlusion: active inference and oculomotor control in schizophrenia.</article-title> <source><italic>PLoS One</italic></source> <volume>7</volume>:<issue>e47502</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0047502</pub-id> <pub-id pub-id-type="pmid">23110076</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Attneave</surname> <given-names>F.</given-names></name></person-group> (<year>1954</year>). <article-title>Some informational aspects of visual perception.</article-title> <source><italic>Psychol. Rev.</italic></source> <volume>61</volume> <fpage>183</fpage>&#x2013;<lpage>193</lpage>. <pub-id pub-id-type="doi">10.1037/h0054663</pub-id> <pub-id pub-id-type="pmid">13167245</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Barlow</surname> <given-names>H. B.</given-names></name></person-group> (<year>1961</year>). &#x201C;<article-title>Possible principles underlying the transformations of sensory messages</article-title>,&#x201D; in <source><italic>Sensory Communication</italic></source>, <role>ed.</role> <person-group person-group-type="editor"><name><surname>Rosenblith</surname> <given-names>W. A.</given-names></name></person-group> (<publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>), <fpage>217</fpage>&#x2013;<lpage>234</lpage>.</citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bengio</surname> <given-names>Y.</given-names></name> <name><surname>Courville</surname> <given-names>A.</given-names></name> <name><surname>Vincent</surname> <given-names>P.</given-names></name></person-group> (<year>2013</year>). <article-title>Representation learning: a review and new perspectives.</article-title> <source><italic>IEEE Trans. Pattern Anal. Mach. Intellig.</italic></source> <volume>35</volume> <fpage>1798</fpage>&#x2013;<lpage>1828</lpage>. <pub-id pub-id-type="doi">10.1109/TPAMI.2013.50</pub-id> <pub-id pub-id-type="pmid">23787338</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Berens</surname> <given-names>P.</given-names></name> <name><surname>Ecker</surname> <given-names>A. S.</given-names></name> <name><surname>Cotton</surname> <given-names>R. J.</given-names></name> <name><surname>Ma</surname> <given-names>W. J.</given-names></name> <name><surname>Bethge</surname> <given-names>M.</given-names></name> <name><surname>Tolias</surname> <given-names>A. S.</given-names></name></person-group> (<year>2012</year>). <article-title>A fast and simple population code for orientation in primate V1.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>32</volume> <fpage>10618</fpage>&#x2013;<lpage>10626</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.1335-12.2012</pub-id> <pub-id pub-id-type="pmid">22855811</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Berkes</surname> <given-names>P.</given-names></name> <name><surname>Wiskott</surname> <given-names>L.</given-names></name></person-group> (<year>2005</year>). <article-title>Slow feature analysis yields a rich repertoire of complex cell properties.</article-title> <source><italic>J. Vis.</italic></source> <volume>5</volume> <fpage>579</fpage>&#x2013;<lpage>602</lpage>. <pub-id pub-id-type="doi">10.1167/5.6.9</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Borst</surname> <given-names>A.</given-names></name> <name><surname>Theunissen</surname> <given-names>F. E.</given-names></name></person-group> (<year>1999</year>). <article-title>Information theory and neural coding.</article-title> <source><italic>Nat. Neurosci.</italic></source> <volume>2</volume> <fpage>947</fpage>&#x2013;<lpage>957</lpage>. <pub-id pub-id-type="doi">10.1038/14731</pub-id> <pub-id pub-id-type="pmid">10526332</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bourlard</surname> <given-names>H.</given-names></name> <name><surname>Kamp</surname> <given-names>Y.</given-names></name></person-group> (<year>1988</year>). <article-title>Auto-association by multilayer perceptrons and singular value decomposition.</article-title> <source><italic>Biol. Cybernet.</italic></source> <volume>59</volume> <fpage>291</fpage>&#x2013;<lpage>294</lpage>. <pub-id pub-id-type="doi">10.1007/BF00332918</pub-id> <pub-id pub-id-type="pmid">3196773</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Boutin</surname> <given-names>V.</given-names></name> <name><surname>Franciosini</surname> <given-names>A.</given-names></name> <name><surname>Chavane</surname> <given-names>F.</given-names></name> <name><surname>Ruffier</surname> <given-names>F.</given-names></name> <name><surname>Perrinet</surname> <given-names>L.</given-names></name></person-group> (<year>2021</year>). <article-title>Sparse deep predictive coding captures contour integration capabilities of the early visual system.</article-title> <source><italic>PLoS Comput. Biol.</italic></source> <volume>17</volume>:<issue>e1008629</issue>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1008629</pub-id> <pub-id pub-id-type="pmid">33497381</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brouwer</surname> <given-names>G. J.</given-names></name> <name><surname>Heeger</surname> <given-names>D. J.</given-names></name></person-group> (<year>2009</year>). <article-title>Decoding and reconstructing color from responses in human visual cortex.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>29</volume> <fpage>13992</fpage>&#x2013;<lpage>14003</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.3577-09.2009</pub-id> <pub-id pub-id-type="pmid">19890009</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bullmore</surname> <given-names>E.</given-names></name> <name><surname>Sporns</surname> <given-names>O.</given-names></name></person-group> (<year>2012</year>). <article-title>The economy of brain network organization.</article-title> <source><italic>Nat. Rev. Neurosci.</italic></source> <volume>13</volume> <fpage>336</fpage>&#x2013;<lpage>349</lpage>. <pub-id pub-id-type="doi">10.1038/nrn3214</pub-id> <pub-id pub-id-type="pmid">22498897</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Burt</surname> <given-names>J. B.</given-names></name> <name><surname>Demirta&#x015F;</surname> <given-names>M.</given-names></name> <name><surname>Eckner</surname> <given-names>W. J.</given-names></name> <name><surname>Navejar</surname> <given-names>N. M.</given-names></name> <name><surname>Ji</surname> <given-names>J. L.</given-names></name> <name><surname>Martin</surname> <given-names>W. J.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Hierarchy of transcriptomic specialization across human cortex captured by structural neuroimaging topography.</article-title> <source><italic>Nat. Neurosci.</italic></source> <volume>21</volume> <fpage>1251</fpage>&#x2013;<lpage>1259</lpage>. <pub-id pub-id-type="doi">10.1038/s41593-018-0195-0</pub-id> <pub-id pub-id-type="pmid">30082915</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>C.-Y.</given-names></name> <name><surname>Hafed</surname> <given-names>Z. M.</given-names></name></person-group> (<year>2018</year>). <article-title>Orientation and contrast tuning properties and temporal flicker fusion characteristics of primate superior colliculus neurons.</article-title> <source><italic>Front. Neural Circ.</italic></source> <volume>12</volume>:<issue>58</issue>. <pub-id pub-id-type="doi">10.3389/fncir.2018.00058</pub-id> <pub-id pub-id-type="pmid">30087598</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chettih</surname> <given-names>S. N.</given-names></name> <name><surname>Harvey</surname> <given-names>C. D.</given-names></name></person-group> (<year>2019</year>). <article-title>Single-neuron perturbations reveal feature-specific competition in V1.</article-title> <source><italic>Nature</italic></source> <volume>567</volume> <fpage>334</fpage>&#x2013;<lpage>340</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-019-0997-6</pub-id> <pub-id pub-id-type="pmid">30842660</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Christensen</surname> <given-names>R. K.</given-names></name> <name><surname>Lind&#x00E9;n</surname> <given-names>H.</given-names></name> <name><surname>Nakamura</surname> <given-names>M.</given-names></name> <name><surname>Barkat</surname> <given-names>T. R.</given-names></name></person-group> (<year>2019</year>). <article-title>White noise background improves tone discrimination by suppressing cortical tuning curves.</article-title> <source><italic>Cell Rep.</italic></source> <volume>29</volume> <fpage>2041.e4</fpage>&#x2013;<lpage>2053.e4</lpage>. <pub-id pub-id-type="doi">10.1016/j.celrep.2019.10.049</pub-id> <pub-id pub-id-type="pmid">31722216</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clark</surname> <given-names>A.</given-names></name></person-group> (<year>2013</year>). <article-title>Whatever next? Predictive brains, situated agents, and the future of cognitive science.</article-title> <source><italic>Behav. Brain Sci.</italic></source> <volume>36</volume> <fpage>181</fpage>&#x2013;<lpage>204</lpage>. <pub-id pub-id-type="doi">10.1017/S0140525X12000477</pub-id> <pub-id pub-id-type="pmid">23663408</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Creutzig</surname> <given-names>F.</given-names></name> <name><surname>Sprekeler</surname> <given-names>H.</given-names></name></person-group> (<year>2008</year>). <article-title>Predictive coding and the slowness principle: an information-theoretic approach.</article-title> <source><italic>Neural Comput.</italic></source> <volume>20</volume> <fpage>1026</fpage>&#x2013;<lpage>1041</lpage>. <pub-id pub-id-type="doi">10.1162/neco.2008.01-07-455</pub-id> <pub-id pub-id-type="pmid">18085988</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Daniel</surname> <given-names>E.</given-names></name> <name><surname>Meindertsma</surname> <given-names>T.</given-names></name> <name><surname>Arazi</surname> <given-names>A.</given-names></name> <name><surname>Donner</surname> <given-names>T. H.</given-names></name> <name><surname>Dinstein</surname> <given-names>I.</given-names></name></person-group> (<year>2019</year>). <article-title>The Relationship between trial-by-trial variability and oscillations of cortical population activity.</article-title> <source><italic>Sci. Rep.</italic></source> <volume>9</volume>:<issue>16901</issue>. <pub-id pub-id-type="doi">10.1038/s41598-019-53270-7</pub-id> <pub-id pub-id-type="pmid">31729426</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de la Rocha</surname> <given-names>J.</given-names></name> <name><surname>Doiron</surname> <given-names>B.</given-names></name> <name><surname>Shea-Brown</surname> <given-names>E.</given-names></name> <name><surname>Josic</surname> <given-names>K.</given-names></name> <name><surname>Reyes</surname> <given-names>A.</given-names></name></person-group> (<year>2007</year>). <article-title>Correlation between neural spike trains increases with firing rate.</article-title> <source><italic>Nature</italic></source> <volume>448</volume> <fpage>802</fpage>&#x2013;<lpage>806</lpage>. <pub-id pub-id-type="doi">10.1038/nature06028</pub-id> <pub-id pub-id-type="pmid">17700699</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>deCharms</surname> <given-names>R. C.</given-names></name> <name><surname>Zador</surname> <given-names>A.</given-names></name></person-group> (<year>2000</year>). <article-title>Neural representation and the cortical code.</article-title> <source><italic>Annu. Rev. Neurosci.</italic></source> <volume>23</volume> <fpage>613</fpage>&#x2013;<lpage>647</lpage>. <pub-id pub-id-type="doi">10.1146/annurev.neuro.23.1.613</pub-id> <pub-id pub-id-type="pmid">10845077</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Di Ges&#x00F9;</surname> <given-names>V.</given-names></name> <name><surname>Starovoitov</surname> <given-names>V.</given-names></name></person-group> (<year>1999</year>). <article-title>Distance-based functions for image comparison.</article-title> <source><italic>Pattern Recogn. Lett.</italic></source> <volume>20</volume> <fpage>207</fpage>&#x2013;<lpage>214</lpage>. <pub-id pub-id-type="doi">10.1016/S0167-8655(98)00115-9</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>DiCarlo</surname> <given-names>J. J.</given-names></name> <name><surname>Zoccolan</surname> <given-names>D.</given-names></name> <name><surname>Rust</surname> <given-names>N. C.</given-names></name></person-group> (<year>2012</year>). <article-title>How does the brain solve visual object recognition?</article-title> <source><italic>Neuron</italic></source> <volume>73</volume> <fpage>415</fpage>&#x2013;<lpage>434</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2012.01.010</pub-id> <pub-id pub-id-type="pmid">22325196</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Felleman</surname> <given-names>D. J.</given-names></name> <name><surname>Van Essen</surname> <given-names>D. C.</given-names></name></person-group> (<year>1991</year>). <article-title>Distributed hierarchical processing in the primate cerebral cortex.</article-title> <source><italic>Cereb. Cortex</italic></source> <volume>1</volume> <fpage>1</fpage>&#x2013;<lpage>47</lpage>. <pub-id pub-id-type="doi">10.1093/cercor/1.1.1</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Friston</surname> <given-names>K.</given-names></name></person-group> (<year>2005</year>). <article-title>A theory of cortical responses.</article-title> <source><italic>Philos. Trans. R. Soc. B</italic></source> <volume>360</volume> <fpage>815</fpage>&#x2013;<lpage>836</lpage>. <pub-id pub-id-type="doi">10.1098/rstb.2005.1622</pub-id> <pub-id pub-id-type="pmid">15937014</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Friston</surname> <given-names>K.</given-names></name></person-group> (<year>2010</year>). <article-title>The free-energy principle: a unified brain theory?</article-title> <source><italic>Nat. Rev. Neurosci.</italic></source> <volume>11</volume> <fpage>127</fpage>&#x2013;<lpage>138</lpage>. <pub-id pub-id-type="doi">10.1038/nrn2787</pub-id> <pub-id pub-id-type="pmid">20068583</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Friston</surname> <given-names>K.</given-names></name> <name><surname>Thornton</surname> <given-names>C.</given-names></name> <name><surname>Clark</surname> <given-names>A.</given-names></name></person-group> (<year>2012</year>). <article-title>Free-energy minimization and the dark-room problem.</article-title> <source><italic>Front. Psychol.</italic></source> <volume>3</volume>:<issue>130</issue>. <pub-id pub-id-type="doi">10.3389/fpsyg.2012.00130</pub-id> <pub-id pub-id-type="pmid">22586414</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Furmanski</surname> <given-names>C. S.</given-names></name> <name><surname>Engel</surname> <given-names>S. A.</given-names></name></person-group> (<year>2000</year>). <article-title>An oblique effect in human primary visual cortex.</article-title> <source><italic>Nat. Neurosci.</italic></source> <volume>3</volume> <fpage>535</fpage>&#x2013;<lpage>536</lpage>. <pub-id pub-id-type="doi">10.1038/75702</pub-id> <pub-id pub-id-type="pmid">10816307</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ganguli</surname> <given-names>D.</given-names></name> <name><surname>Simoncelli</surname> <given-names>E. P.</given-names></name></person-group> (<year>2014</year>). <article-title>Efficient sensory encoding and bayesian inference with heterogeneous neural populations.</article-title> <source><italic>Neural Comput.</italic></source> <volume>26</volume> <fpage>2103</fpage>&#x2013;<lpage>2134</lpage>. <pub-id pub-id-type="doi">10.1162/NECO_a_00638</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Girshick</surname> <given-names>A. R.</given-names></name> <name><surname>Landy</surname> <given-names>M. S.</given-names></name> <name><surname>Simoncelli</surname> <given-names>E. P.</given-names></name></person-group> (<year>2011</year>). <article-title>Cardinal rules: visual orientation perception reflects knowledge of environmental statistics.</article-title> <source><italic>Nat. Neurosci.</italic></source> <volume>14</volume> <fpage>926</fpage>&#x2013;<lpage>932</lpage>. <pub-id pub-id-type="doi">10.1038/nn.2831</pub-id> <pub-id pub-id-type="pmid">21642976</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goodfellow</surname> <given-names>I. J.</given-names></name> <name><surname>Courville</surname> <given-names>A.</given-names></name> <name><surname>Bengio</surname> <given-names>Y.</given-names></name></person-group> (<year>2011</year>). &#x201C;<article-title>Spike-and-slab sparse coding for unsupervised feature discovery</article-title>,&#x201D; in <source><italic>Proceedings of the Conference on Neural Information Processing Systems</italic></source>, <publisher-loc>Granada</publisher-loc>.</citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Griffiths</surname> <given-names>T. L.</given-names></name> <name><surname>Lieder</surname> <given-names>F.</given-names></name> <name><surname>Goodman</surname> <given-names>N. D.</given-names></name></person-group> (<year>2015</year>). <article-title>Rational use of cognitive resources: levels of analysis between the computational and the algorithmic.</article-title> <source><italic>Top. Cogn. Sci.</italic></source> <volume>7</volume> <fpage>217</fpage>&#x2013;<lpage>229</lpage>. <pub-id pub-id-type="doi">10.1111/tops.12142</pub-id> <pub-id pub-id-type="pmid">25898807</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hansen</surname> <given-names>J. Y.</given-names></name> <name><surname>Markello</surname> <given-names>R. D.</given-names></name> <name><surname>Vogel</surname> <given-names>J. W.</given-names></name> <name><surname>Seidlitz</surname> <given-names>J.</given-names></name> <name><surname>Bzdok</surname> <given-names>D.</given-names></name> <name><surname>Misic</surname> <given-names>B.</given-names></name></person-group> (<year>2021</year>). <article-title>Mapping gene transcription and neurocognition across human neocortex.</article-title> <source><italic>Nat. Hum. Behav.</italic></source> <volume>5</volume> <fpage>1240</fpage>&#x2013;<lpage>1250</lpage>. <pub-id pub-id-type="doi">10.1038/s41562-021-01082-z</pub-id> <pub-id pub-id-type="pmid">33767429</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Harris</surname> <given-names>J. A.</given-names></name> <name><surname>Mihalas</surname> <given-names>S.</given-names></name> <name><surname>Hirokawa</surname> <given-names>K. E.</given-names></name> <name><surname>Whitesell</surname> <given-names>J. D.</given-names></name> <name><surname>Choi</surname> <given-names>H.</given-names></name> <name><surname>Bernard</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>Hierarchical organization of cortical and thalamic connectivity.</article-title> <source><italic>Nature</italic></source> <volume>575</volume> <fpage>195</fpage>&#x2013;<lpage>202</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-019-1716-z</pub-id> <pub-id pub-id-type="pmid">31666704</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hastie</surname> <given-names>T.</given-names></name> <name><surname>Tibshirani</surname> <given-names>R.</given-names></name> <name><surname>Friedman</surname> <given-names>J.</given-names></name></person-group> (<year>2009</year>). <source><italic>The Elements of Statistical Learning: Data Mining, Inference, and Prediction</italic></source>, <edition>2nd Edn</edition>. <publisher-loc>Cham</publisher-loc>: <publisher-name>Springer</publisher-name>.</citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hilgetag</surname> <given-names>C. C.</given-names></name> <name><surname>Goulas</surname> <given-names>A.</given-names></name></person-group> (<year>2020</year>). <article-title>&#x2018;Hierarchy&#x2019; in the organization of brain networks.</article-title> <source><italic>Philos. Trans. R. Soc. B</italic></source> <volume>375</volume>:<issue>20190319</issue>. <pub-id pub-id-type="doi">10.1098/rstb.2019.0319</pub-id> <pub-id pub-id-type="pmid">32089116</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hinton</surname> <given-names>G. E.</given-names></name> <name><surname>Zemel</surname> <given-names>R. S.</given-names></name></person-group> (<year>1993</year>). &#x201C;<article-title>Autoencoders, minimum description length and Helmholtz free energy</article-title>,&#x201D; in <source><italic>Proceedings of the Conference on Neural Information Processing Systems</italic></source>, <publisher-loc>Denver, CO</publisher-loc>.</citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hogendoorn</surname> <given-names>H.</given-names></name> <name><surname>Burkitt</surname> <given-names>A. N.</given-names></name></person-group> (<year>2019</year>). <article-title>Predictive coding with neural transmission delays: a real-time temporal alignment hypothesis.</article-title> <source><italic>eNeuro</italic></source> <volume>6</volume>:<issue>ENEURO.0412-18.2019</issue>. <pub-id pub-id-type="doi">10.1523/ENEURO.0412-18.2019</pub-id> <pub-id pub-id-type="pmid">31064839</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>G.</given-names></name> <name><surname>Ramachandran</surname> <given-names>S.</given-names></name> <name><surname>Lee</surname> <given-names>T. S.</given-names></name> <name><surname>Olson</surname> <given-names>C. R.</given-names></name></person-group> (<year>2018</year>). <article-title>Neural correlate of visual familiarity in macaque area V2.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>38</volume> <fpage>8967</fpage>&#x2013;<lpage>8975</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.0664-18.2018</pub-id> <pub-id pub-id-type="pmid">30181138</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hubel</surname> <given-names>D. H.</given-names></name> <name><surname>Wiesel</surname> <given-names>T. N.</given-names></name></person-group> (<year>1962</year>). <article-title>Receptive fields, binocular interaction and functional architecture in the cat&#x2019;s visual cortex.</article-title> <source><italic>J. Physiol.</italic></source> <volume>160</volume> <fpage>106</fpage>&#x2013;<lpage>154</lpage>. <pub-id pub-id-type="doi">10.1113/jphysiol.1962.sp006837</pub-id> <pub-id pub-id-type="pmid">14449617</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hubel</surname> <given-names>D. H.</given-names></name> <name><surname>Wiesel</surname> <given-names>T. N.</given-names></name></person-group> (<year>1968</year>). <article-title>Receptive fields and functional architecture of monkey striate cortex.</article-title> <source><italic>J. Physiol.</italic></source> <volume>195</volume> <fpage>215</fpage>&#x2013;<lpage>243</lpage>. <pub-id pub-id-type="doi">10.1113/jphysiol.1968.sp008455</pub-id> <pub-id pub-id-type="pmid">4966457</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huh</surname> <given-names>C. Y. L.</given-names></name> <name><surname>Peach</surname> <given-names>J. P.</given-names></name> <name><surname>Bennett</surname> <given-names>C.</given-names></name> <name><surname>Vega</surname> <given-names>R. M.</given-names></name> <name><surname>Hestrin</surname> <given-names>S.</given-names></name></person-group> (<year>2018</year>). <article-title>Feature-specific organization of feedback pathways in mouse visual cortex.</article-title> <source><italic>Curr. Biol.</italic></source> <volume>28</volume> <fpage>114.e5</fpage>&#x2013;<lpage>120.e5</lpage>. <pub-id pub-id-type="doi">10.1016/j.cub.2017.11.056</pub-id> <pub-id pub-id-type="pmid">29276127</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hurri</surname> <given-names>J.</given-names></name> <name><surname>Hyv&#x00E4;rinen</surname> <given-names>A.</given-names></name></person-group> (<year>2002</year>). &#x201C;<article-title>Temporal coherence, natural image sequences, and the visual cortex</article-title>,&#x201D; in <source><italic>Proceedings of the Conference on Neural Information Processing Systems</italic></source>, <publisher-loc>Vancouver</publisher-loc>.</citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Issa</surname> <given-names>E. B.</given-names></name> <name><surname>Cadieu</surname> <given-names>C. F.</given-names></name> <name><surname>DiCarlo</surname> <given-names>J. J.</given-names></name></person-group> (<year>2018</year>). <article-title>Neural dynamics at successive stages of the ventral visual stream are consistent with hierarchical error signals.</article-title> <source><italic>eLife</italic></source> <volume>7</volume>:<issue>e42870</issue>. <pub-id pub-id-type="doi">10.7554/eLife.42870</pub-id> <pub-id pub-id-type="pmid">30484773</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kamitani</surname> <given-names>Y.</given-names></name> <name><surname>Tong</surname> <given-names>F.</given-names></name></person-group> (<year>2005</year>). <article-title>Decoding the visual and subjective contents of the human brain.</article-title> <source><italic>Nat. Neurosci.</italic></source> <volume>8</volume> <fpage>679</fpage>&#x2013;<lpage>685</lpage>. <pub-id pub-id-type="doi">10.1038/nn1444</pub-id> <pub-id pub-id-type="pmid">15852014</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kingma</surname> <given-names>D. P.</given-names></name> <name><surname>Ba</surname> <given-names>J.</given-names></name></person-group> (<year>2015</year>). &#x201C;<article-title>Adam: a method for stochastic optimization</article-title>,&#x201D; in <source><italic>Proceedings of the 3th International Conference on Learning Representations</italic></source>, <publisher-loc>San Diego, CA</publisher-loc>.</citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kriegeskorte</surname> <given-names>N.</given-names></name> <name><surname>Diedrichsen</surname> <given-names>J.</given-names></name></person-group> (<year>2019</year>). <article-title>Peeling the onion of brain representations.</article-title> <source><italic>Annu. Rev. Neurosci.</italic></source> <volume>42</volume> <fpage>407</fpage>&#x2013;<lpage>432</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-neuro-080317-061906</pub-id> <pub-id pub-id-type="pmid">31283895</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kutter</surname> <given-names>E. F.</given-names></name> <name><surname>Bostroem</surname> <given-names>J.</given-names></name> <name><surname>Elger</surname> <given-names>C. E.</given-names></name> <name><surname>Mormann</surname> <given-names>F.</given-names></name> <name><surname>Nieder</surname> <given-names>A.</given-names></name></person-group> (<year>2018</year>). <article-title>Single neurons in the human brain encode numbers.</article-title> <source><italic>Neuron</italic></source> <volume>100</volume> <fpage>753.e4</fpage>&#x2013;<lpage>761.e4</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2018.08.036</pub-id> <pub-id pub-id-type="pmid">30244883</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Laughlin</surname> <given-names>S. B.</given-names></name></person-group> (<year>1981</year>). <article-title>A simple coding procedure enhances a neuron&#x2019;s information capacity.</article-title> <source><italic>Z. Naturforsch.</italic></source> <volume>36</volume> <fpage>910</fpage>&#x2013;<lpage>912</lpage>. <pub-id pub-id-type="doi">10.1515/znc-1981-9-1040</pub-id></citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Laughlin</surname> <given-names>S. B.</given-names></name></person-group> (<year>2001</year>). <article-title>Energy as a constraint on the coding and processing of sensory information.</article-title> <source><italic>Curr. Opin. Neurobiol.</italic></source> <volume>11</volume> <fpage>475</fpage>&#x2013;<lpage>480</lpage>. <pub-id pub-id-type="doi">10.1016/S0959-4388(00)00237-3</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lecun</surname> <given-names>Y.</given-names></name> <name><surname>Bottou</surname> <given-names>L.</given-names></name> <name><surname>Bengio</surname> <given-names>Y.</given-names></name> <name><surname>Haffner</surname> <given-names>P.</given-names></name></person-group> (<year>1998</year>). <article-title>Gradient-based learning applied to document recognition.</article-title> <source><italic>Proc. IEEE</italic></source> <volume>86</volume> <fpage>2278</fpage>&#x2013;<lpage>2324</lpage>. <pub-id pub-id-type="doi">10.1109/5.726791</pub-id></citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>B.</given-names></name> <name><surname>Peterson</surname> <given-names>M. R.</given-names></name> <name><surname>Freeman</surname> <given-names>R. D.</given-names></name></person-group> (<year>2003</year>). <article-title>Oblique effect: a neural basis in the visual cortex.</article-title> <source><italic>J. Neurophysiol.</italic></source> <volume>90</volume> <fpage>204</fpage>&#x2013;<lpage>217</lpage>. <pub-id pub-id-type="doi">10.1152/jn.00954.2002</pub-id> <pub-id pub-id-type="pmid">12611956</pub-id></citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>D.</given-names></name> <name><surname>Constantinidis</surname> <given-names>C.</given-names></name> <name><surname>Murray</surname> <given-names>J. D.</given-names></name></person-group> (<year>2021</year>). <article-title>Trial-to-trial variability of spiking delay activity in prefrontal cortex constrains burst-coding models of working memory.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>41</volume> <fpage>8928</fpage>&#x2013;<lpage>8945</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.0167-21.2021</pub-id> <pub-id pub-id-type="pmid">34551937</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lieder</surname> <given-names>F.</given-names></name> <name><surname>Griffiths</surname> <given-names>T. L.</given-names></name></person-group> (<year>2020</year>). <article-title>Resource-rational analysis: understanding human cognition as the optimal use of limited computational resources.</article-title> <source><italic>Behav. Brain Sci.</italic></source> <volume>43</volume>:<issue>E1</issue>. <pub-id pub-id-type="doi">10.1017/S0140525X1900061X</pub-id> <pub-id pub-id-type="pmid">30714890</pub-id></citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Malins</surname> <given-names>J. G.</given-names></name> <name><surname>Pugh</surname> <given-names>K. R.</given-names></name> <name><surname>Buis</surname> <given-names>B.</given-names></name> <name><surname>Frost</surname> <given-names>S. J.</given-names></name> <name><surname>Hoeft</surname> <given-names>F.</given-names></name> <name><surname>Landi</surname> <given-names>N.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Individual differences in reading skill are related to trial-by-trial neural activation variability in the reading network.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>38</volume> <fpage>2981</fpage>&#x2013;<lpage>2989</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.0907-17.2018</pub-id> <pub-id pub-id-type="pmid">29440534</pub-id></citation></ref>
<ref id="B55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Margoliash</surname> <given-names>D.</given-names></name></person-group> (<year>1983</year>). <article-title>Acoustic parameters underlying the responses of song-specific neurons in the white-crowned sparrow.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>3</volume> <fpage>1039</fpage>&#x2013;<lpage>1057</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.03-05-01039.1983</pub-id> <pub-id pub-id-type="pmid">6842281</pub-id></citation></ref>
<ref id="B56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McPherson</surname> <given-names>M. J.</given-names></name> <name><surname>McDermott</surname> <given-names>J. H.</given-names></name></person-group> (<year>2020</year>). <article-title>Time-dependent discrimination advantages for harmonic sounds suggest efficient coding for memory.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>117</volume> <fpage>32169</fpage>&#x2013;<lpage>32180</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.2008956117</pub-id> <pub-id pub-id-type="pmid">33262275</pub-id></citation></ref>
<ref id="B57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mesulam</surname> <given-names>M. M.</given-names></name></person-group> (<year>1998</year>). <article-title>From sensation to cognition.</article-title> <source><italic>Brain</italic></source> <volume>121</volume> <fpage>1013</fpage>&#x2013;<lpage>1052</lpage>. <pub-id pub-id-type="doi">10.1093/brain/121.6.1013</pub-id> <pub-id pub-id-type="pmid">9648540</pub-id></citation></ref>
<ref id="B58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Muckli</surname> <given-names>L.</given-names></name> <name><surname>De Martino</surname> <given-names>F.</given-names></name> <name><surname>Vizioli</surname> <given-names>L.</given-names></name> <name><surname>Petro</surname> <given-names>L. S.</given-names></name> <name><surname>Smith</surname> <given-names>F. W.</given-names></name> <name><surname>Ugurbil</surname> <given-names>K.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Contextual feedback to superficial layers of V1.</article-title> <source><italic>Curr. Biol.</italic></source> <volume>25</volume> <fpage>2690</fpage>&#x2013;<lpage>2695</lpage>. <pub-id pub-id-type="doi">10.1016/j.cub.2015.08.057</pub-id> <pub-id pub-id-type="pmid">26441356</pub-id></citation></ref>
<ref id="B59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Murphy</surname> <given-names>P. C.</given-names></name> <name><surname>Sillito</surname> <given-names>A. M.</given-names></name></person-group> (<year>1987</year>). <article-title>Corticofugal feedback influences the generation of length tuning in the visual pathway.</article-title> <source><italic>Nature</italic></source> <volume>392</volume> <fpage>727</fpage>&#x2013;<lpage>729</lpage>. <pub-id pub-id-type="doi">10.1038/329727a0</pub-id> <pub-id pub-id-type="pmid">3670375</pub-id></citation></ref>
<ref id="B60"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nogueira</surname> <given-names>R.</given-names></name> <name><surname>Peltier</surname> <given-names>N. E.</given-names></name> <name><surname>Anzai</surname> <given-names>A.</given-names></name> <name><surname>DeAngelis</surname> <given-names>G. C.</given-names></name> <name><surname>Mart&#x00ED;nez-Trujillo</surname> <given-names>J.</given-names></name> <name><surname>Moreno-Bote</surname> <given-names>R.</given-names></name></person-group> (<year>2020</year>). <article-title>The effects of population tuning and trial-by-trial variability on information encoding and behavior.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>40</volume> <fpage>1066</fpage>&#x2013;<lpage>1083</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.0859-19.2019</pub-id> <pub-id pub-id-type="pmid">31754013</pub-id></citation></ref>
<ref id="B61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ohzawa</surname> <given-names>I.</given-names></name> <name><surname>DeAngelis</surname> <given-names>G. C.</given-names></name> <name><surname>Freeman</surname> <given-names>R. D.</given-names></name></person-group> (<year>1996</year>). <article-title>Encoding of binocular disparity by simple cells in the cat&#x2019;s visual cortex.</article-title> <source><italic>J. Neurophysiol.</italic></source> <volume>75</volume> <fpage>1779</fpage>&#x2013;<lpage>1805</lpage>. <pub-id pub-id-type="doi">10.1152/jn.1996.75.5.1779</pub-id> <pub-id pub-id-type="pmid">8734580</pub-id></citation></ref>
<ref id="B62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Olshausen</surname> <given-names>B. A.</given-names></name> <name><surname>Field</surname> <given-names>D. J.</given-names></name></person-group> (<year>1996</year>). <article-title>Emergence of simple-cell receptive field properties by learning a sparse code for natural images.</article-title> <source><italic>Nature</italic></source> <volume>381</volume> <fpage>607</fpage>&#x2013;<lpage>609</lpage>. <pub-id pub-id-type="doi">10.1038/381607a0</pub-id> <pub-id pub-id-type="pmid">8637596</pub-id></citation></ref>
<ref id="B63"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Olshausen</surname> <given-names>B. A.</given-names></name> <name><surname>Field</surname> <given-names>D. J.</given-names></name></person-group> (<year>1997</year>). <article-title>Sparse coding with an overcomplete basis set: a strategy employed by V1?</article-title> <source><italic>Vis. Res.</italic></source> <volume>37</volume> <fpage>3311</fpage>&#x2013;<lpage>3325</lpage>. <pub-id pub-id-type="doi">10.1016/S0042-6989(97)00169-7</pub-id></citation></ref>
<ref id="B64"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Polan&#x00ED;a</surname> <given-names>R.</given-names></name> <name><surname>Woodford</surname> <given-names>M.</given-names></name> <name><surname>Ruff</surname> <given-names>C. C.</given-names></name></person-group> (<year>2019</year>). <article-title>Efficient coding of subjective value.</article-title> <source><italic>Nat. Neurosci.</italic></source> <volume>22</volume> <fpage>134</fpage>&#x2013;<lpage>142</lpage>. <pub-id pub-id-type="doi">10.1038/s41593-018-0292-0</pub-id> <pub-id pub-id-type="pmid">30559477</pub-id></citation></ref>
<ref id="B65"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pryluk</surname> <given-names>R.</given-names></name> <name><surname>Kfir</surname> <given-names>Y.</given-names></name> <name><surname>Gelbard-Sagiv</surname> <given-names>H.</given-names></name> <name><surname>Fried</surname> <given-names>I.</given-names></name> <name><surname>Paz</surname> <given-names>R.</given-names></name></person-group> (<year>2019</year>). <article-title>A tradeoff in the neural code across regions and species.</article-title> <source><italic>Cell</italic></source> <volume>176</volume> <fpage>597.e18</fpage>&#x2013;<lpage>609.e18</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2018.12.032</pub-id> <pub-id pub-id-type="pmid">30661754</pub-id></citation></ref>
<ref id="B66"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ranzato</surname> <given-names>M.</given-names></name> <name><surname>Poultney</surname> <given-names>C. S.</given-names></name> <name><surname>Chopra</surname> <given-names>S.</given-names></name> <name><surname>LeCun</surname> <given-names>Y.</given-names></name></person-group> (<year>2006</year>). &#x201C;<article-title>Efficient learning of sparse representations with an energy-based model</article-title>,&#x201D; in <source><italic>Proceedings of the Conference on Neural Information Processing Systems</italic></source>, <publisher-loc>Vancouver</publisher-loc>. <pub-id pub-id-type="doi">10.3390/e20080583</pub-id> <pub-id pub-id-type="pmid">33265672</pub-id></citation></ref>
<ref id="B67"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rao</surname> <given-names>R. P. N.</given-names></name> <name><surname>Ballard</surname> <given-names>D. H.</given-names></name></person-group> (<year>1999</year>). <article-title>Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects.</article-title> <source><italic>Nat. Neurosci.</italic></source> <volume>2</volume> <fpage>79</fpage>&#x2013;<lpage>87</lpage>. <pub-id pub-id-type="doi">10.1038/4580</pub-id> <pub-id pub-id-type="pmid">10195184</pub-id></citation></ref>
<ref id="B68"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Riesenhuber</surname> <given-names>M.</given-names></name> <name><surname>Poggio</surname> <given-names>T.</given-names></name></person-group> (<year>1999</year>). <article-title>Hierarchical models of object recognition in cortex.</article-title> <source><italic>Nat. Neurosci.</italic></source> <volume>2</volume> <fpage>1019</fpage>&#x2013;<lpage>1025</lpage>. <pub-id pub-id-type="doi">10.1038/14819</pub-id> <pub-id pub-id-type="pmid">10526343</pub-id></citation></ref>
<ref id="B69"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Riesenhuber</surname> <given-names>M.</given-names></name> <name><surname>Poggio</surname> <given-names>T.</given-names></name></person-group> (<year>2000</year>). <article-title>Models of object recognition.</article-title> <source><italic>Nat. Neurosci.</italic></source> <volume>3</volume> <fpage>1199</fpage>&#x2013;<lpage>1204</lpage>. <pub-id pub-id-type="doi">10.1038/81479</pub-id> <pub-id pub-id-type="pmid">11127838</pub-id></citation></ref>
<ref id="B70"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rifai</surname> <given-names>S.</given-names></name> <name><surname>Vincent</surname> <given-names>P.</given-names></name> <name><surname>Muller</surname> <given-names>X.</given-names></name> <name><surname>Glorot</surname> <given-names>X.</given-names></name> <name><surname>Bengio</surname> <given-names>Y.</given-names></name></person-group> (<year>2011</year>). &#x201C;<article-title>Contractive auto-encoders: explicit invariance during feature extraction</article-title>,&#x201D; in <source><italic>Proceedings of the International Conference on Machine Learning</italic></source>, <publisher-loc>Bellevue</publisher-loc>.</citation></ref>
<ref id="B71"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rubinov</surname> <given-names>M.</given-names></name> <name><surname>Sporns</surname> <given-names>O.</given-names></name> <name><surname>van Leeuwen</surname> <given-names>C.</given-names></name> <name><surname>Breakspear</surname> <given-names>M.</given-names></name></person-group> (<year>2009</year>). <article-title>Symbiotic relationship between brain structure and dynamics.</article-title> <source><italic>BMC Neurosci.</italic></source> <volume>10</volume>:<issue>55</issue>. <pub-id pub-id-type="doi">10.1186/1471-2202-10-55</pub-id> <pub-id pub-id-type="pmid">19486538</pub-id></citation></ref>
<ref id="B72"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sederberg</surname> <given-names>A. J.</given-names></name> <name><surname>MacLean</surname> <given-names>J. N.</given-names></name> <name><surname>Palmer</surname> <given-names>S. E.</given-names></name></person-group> (<year>2018</year>). <article-title>Learning to make external sensory stimulus predictions using internal correlations in populations of neurons.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>115</volume> <fpage>1105</fpage>&#x2013;<lpage>1110</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1710779115</pub-id> <pub-id pub-id-type="pmid">29348208</pub-id></citation></ref>
<ref id="B73"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Serre</surname> <given-names>T.</given-names></name> <name><surname>Oliva</surname> <given-names>A.</given-names></name> <name><surname>Poggio</surname> <given-names>T.</given-names></name></person-group> (<year>2007</year>). <article-title>A feedforward architecture accounts for rapid categorization.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>104</volume> <fpage>6424</fpage>&#x2013;<lpage>6429</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0700622104</pub-id> <pub-id pub-id-type="pmid">17404214</pub-id></citation></ref>
<ref id="B74"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sihn</surname> <given-names>D.</given-names></name></person-group> (<year>2021</year>). <article-title>Simulation codes for spatio-temporally efficient coding.</article-title> <source><italic>bioRxiv</italic></source> [<comment>Preprint</comment>]. <pub-id pub-id-type="doi">10.5281/zenodo.5298182</pub-id></citation></ref>
<ref id="B75"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sihn</surname> <given-names>D.</given-names></name> <name><surname>Kim</surname> <given-names>S.-P.</given-names></name></person-group> (<year>2021</year>). <article-title>Spatio-temporally efficient coding assigns functions to hierarchical structures of the visual system.</article-title> <source><italic>bioRxiv</italic></source> [<comment>Preprint</comment>]. <pub-id pub-id-type="doi">10.1101/2021.08.13.456321</pub-id></citation></ref>
<ref id="B76"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sillito</surname> <given-names>A. M.</given-names></name> <name><surname>Cudeiro</surname> <given-names>J.</given-names></name> <name><surname>Jones</surname> <given-names>H. E.</given-names></name></person-group> (<year>2006</year>). <article-title>Always returning: feedback and sensory processing in visual cortex and thalamus.</article-title> <source><italic>Trends Neurosci.</italic></source> <volume>29</volume> <fpage>307</fpage>&#x2013;<lpage>316</lpage>. <pub-id pub-id-type="doi">10.1016/j.tins.2006.05.001</pub-id> <pub-id pub-id-type="pmid">16713635</pub-id></citation></ref>
<ref id="B77"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sims</surname> <given-names>C. R.</given-names></name></person-group> (<year>2018</year>). <article-title>Efficient coding explains the universal law of generalization in human perception.</article-title> <source><italic>Science</italic></source> <volume>360</volume> <fpage>652</fpage>&#x2013;<lpage>656</lpage>. <pub-id pub-id-type="doi">10.1126/science.aaq1118</pub-id> <pub-id pub-id-type="pmid">29748284</pub-id></citation></ref>
<ref id="B78"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Solomon</surname> <given-names>S. S.</given-names></name> <name><surname>Tang</surname> <given-names>H.</given-names></name> <name><surname>Sussman</surname> <given-names>E.</given-names></name> <name><surname>Kohn</surname> <given-names>A.</given-names></name></person-group> (<year>2021</year>). <article-title>Limited evidence for sensory prediction error responses in visual cortex of macaques and humans.</article-title> <source><italic>Cereb. Cortex</italic></source> <volume>31</volume> <fpage>3136</fpage>&#x2013;<lpage>3152</lpage>. <pub-id pub-id-type="doi">10.1093/cercor/bhab014</pub-id> <pub-id pub-id-type="pmid">33683317</pub-id></citation></ref>
<ref id="B79"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spratling</surname> <given-names>M. W.</given-names></name></person-group> (<year>2017</year>). <article-title>A review of predictive coding algorithms.</article-title> <source><italic>Brain Cogn.</italic></source> <volume>112</volume> <fpage>92</fpage>&#x2013;<lpage>97</lpage>. <pub-id pub-id-type="doi">10.1016/j.bandc.2015.11.003</pub-id> <pub-id pub-id-type="pmid">26809759</pub-id></citation></ref>
<ref id="B80"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stringer</surname> <given-names>C.</given-names></name> <name><surname>Pachitariu</surname> <given-names>M.</given-names></name> <name><surname>Steinmetz</surname> <given-names>N.</given-names></name> <name><surname>Carandini</surname> <given-names>M.</given-names></name> <name><surname>Harris</surname> <given-names>K. D.</given-names></name></person-group> (<year>2019</year>). <article-title>High-dimensional geometry of population responses in visual cortex.</article-title> <source><italic>Nature</italic></source> <volume>571</volume> <fpage>361</fpage>&#x2013;<lpage>365</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-019-1346-5</pub-id> <pub-id pub-id-type="pmid">31243367</pub-id></citation></ref>
<ref id="B81"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sullivan</surname> <given-names>T. J.</given-names></name> <name><surname>de Sa</surname> <given-names>V. R.</given-names></name></person-group> (<year>2006</year>). <article-title>Homeostatic synaptic scaling in self-organizing maps.</article-title> <source><italic>Neural Netw.</italic></source> <volume>19</volume> <fpage>734</fpage>&#x2013;<lpage>743</lpage>. <pub-id pub-id-type="doi">10.1016/j.neunet.2006.05.006</pub-id> <pub-id pub-id-type="pmid">16782305</pub-id></citation></ref>
<ref id="B82"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Teufel</surname> <given-names>C.</given-names></name> <name><surname>Fletcher</surname> <given-names>P. C.</given-names></name></person-group> (<year>2020</year>). <article-title>Forms of prediction in the nervous system.</article-title> <source><italic>Nat. Rev. Neurosci.</italic></source> <volume>21</volume> <fpage>231</fpage>&#x2013;<lpage>242</lpage>. <pub-id pub-id-type="doi">10.1038/s41583-020-0275-5</pub-id> <pub-id pub-id-type="pmid">32157237</pub-id></citation></ref>
<ref id="B83"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tomov</surname> <given-names>P.</given-names></name> <name><surname>Pena</surname> <given-names>R. F. O.</given-names></name> <name><surname>Zaks</surname> <given-names>M. A.</given-names></name> <name><surname>Roque</surname> <given-names>A. C.</given-names></name></person-group> (<year>2014</year>). <article-title>Sustained oscillations, irregular firing, and chaotic dynamics in hierarchical modular networks with mixtures of electrophysiological cell types.</article-title> <source><italic>Front. Comput. Neurosci.</italic></source> <volume>8</volume>:<issue>103</issue>. <pub-id pub-id-type="doi">10.3389/fncom.2014.00103</pub-id> <pub-id pub-id-type="pmid">25228879</pub-id></citation></ref>
<ref id="B84"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Toyoizumi</surname> <given-names>T.</given-names></name> <name><surname>Pfister</surname> <given-names>J.-P.</given-names></name> <name><surname>Aihara</surname> <given-names>K.</given-names></name> <name><surname>Gerstner</surname> <given-names>W.</given-names></name></person-group> (<year>2005</year>). <article-title>Generalized Bienenstock&#x2013;Cooper&#x2013;Munro rule for spiking neurons that maximizes information transmission.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>102</volume> <fpage>5239</fpage>&#x2013;<lpage>5244</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0500495102</pub-id> <pub-id pub-id-type="pmid">15795376</pub-id></citation></ref>
<ref id="B85"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Turrigiano</surname> <given-names>G. G.</given-names></name> <name><surname>Nelson</surname> <given-names>S. B.</given-names></name></person-group> (<year>2004</year>). <article-title>Homeostatic plasticity in the developing nervous system.</article-title> <source><italic>Nat. Rev. Neurosci.</italic></source> <volume>5</volume> <fpage>97</fpage>&#x2013;<lpage>107</lpage>. <pub-id pub-id-type="doi">10.1038/nrn1327</pub-id> <pub-id pub-id-type="pmid">14735113</pub-id></citation></ref>
<ref id="B86"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>van Hateren</surname> <given-names>J. H.</given-names></name> <name><surname>van der Schaaf</surname> <given-names>A.</given-names></name></person-group> (<year>1998</year>). <article-title>Independent component filters of natural images compared with simple cells in primary visual cortex.</article-title> <source><italic>Proc. R. Soc. B</italic></source> <volume>265</volume> <fpage>359</fpage>&#x2013;<lpage>366</lpage>. <pub-id pub-id-type="doi">10.1098/rspb.1998.0303</pub-id> <pub-id pub-id-type="pmid">9523437</pub-id></citation></ref>
<ref id="B87"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vincent</surname> <given-names>P.</given-names></name> <name><surname>Larochelle</surname> <given-names>H.</given-names></name> <name><surname>Bengio</surname> <given-names>Y.</given-names></name> <name><surname>Manzagol</surname> <given-names>P.-A.</given-names></name></person-group> (<year>2008</year>). &#x201C;<article-title>Extracting and composing robust features with denoising autoencoders</article-title>,&#x201D; in <source><italic>Proceedings of the International Conference on Machine Learning</italic></source>, <publisher-loc>Helsinki</publisher-loc>.</citation></ref>
<ref id="B88"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>W.</given-names></name> <name><surname>Jones</surname> <given-names>H. E.</given-names></name> <name><surname>Andolina</surname> <given-names>I. M.</given-names></name> <name><surname>Salt</surname> <given-names>T. E.</given-names></name> <name><surname>Sillito</surname> <given-names>A. M.</given-names></name></person-group> (<year>2006</year>). <article-title>Functional alignment of feedback effects from visual cortex to thalamus.</article-title> <source><italic>Nat. Neurosci.</italic></source> <volume>9</volume> <fpage>1330</fpage>&#x2013;<lpage>1336</lpage>. <pub-id pub-id-type="doi">10.1038/nn1768</pub-id> <pub-id pub-id-type="pmid">16980966</pub-id></citation></ref>
<ref id="B89"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Waydo</surname> <given-names>S.</given-names></name> <name><surname>Kraskov</surname> <given-names>A.</given-names></name> <name><surname>Quian Quiroga</surname> <given-names>R.</given-names></name> <name><surname>Fried</surname> <given-names>I.</given-names></name> <name><surname>Koch</surname> <given-names>C.</given-names></name></person-group> (<year>2006</year>). <article-title>Sparse representation in the human medial temporal lobe.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>26</volume> <fpage>10232</fpage>&#x2013;<lpage>10234</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.2101-06.2006</pub-id> <pub-id pub-id-type="pmid">17021178</pub-id></citation></ref>
<ref id="B90"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wiskott</surname> <given-names>L.</given-names></name> <name><surname>Sejnowski</surname> <given-names>T. J.</given-names></name></person-group> (<year>2002</year>). <article-title>Slow feature analysis: unsupervised learning of invariances.</article-title> <source><italic>Neural Comput.</italic></source> <volume>14</volume> <fpage>715</fpage>&#x2013;<lpage>770</lpage>. <pub-id pub-id-type="doi">10.1162/089976602317318938</pub-id> <pub-id pub-id-type="pmid">11936959</pub-id></citation></ref>
<ref id="B91"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yamins</surname> <given-names>D. L. K.</given-names></name> <name><surname>Hong</surname> <given-names>H.</given-names></name> <name><surname>Cadieu</surname> <given-names>C. F.</given-names></name> <name><surname>Solomon</surname> <given-names>E. A.</given-names></name> <name><surname>Seibert</surname> <given-names>D.</given-names></name> <name><surname>DiCarlo</surname> <given-names>J. J.</given-names></name></person-group> (<year>2014</year>). <article-title>Performance-optimized hierarchical models predict neural responses in higher visual cortex.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>111</volume> <fpage>8619</fpage>&#x2013;<lpage>8624</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1403112111</pub-id> <pub-id pub-id-type="pmid">24812127</pub-id></citation></ref>
<ref id="B92"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zavitz</surname> <given-names>E.</given-names></name> <name><surname>Yu</surname> <given-names>H.-H.</given-names></name> <name><surname>Rowe</surname> <given-names>E. G.</given-names></name> <name><surname>Rosa</surname> <given-names>M. G. P.</given-names></name> <name><surname>Price</surname> <given-names>N. S. C.</given-names></name></person-group> (<year>2016</year>). <article-title>Rapid adaptation induces persistent biases in population codes for visual motion.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>36</volume> <fpage>4579</fpage>&#x2013;<lpage>4590</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.4563-15.2016</pub-id> <pub-id pub-id-type="pmid">27098699</pub-id></citation></ref>
<ref id="B93"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>Q.</given-names></name> <name><surname>Cramer</surname> <given-names>S. R.</given-names></name> <name><surname>Ma</surname> <given-names>Z.</given-names></name> <name><surname>Turner</surname> <given-names>K. L.</given-names></name> <name><surname>Gheres</surname> <given-names>K. W.</given-names></name> <name><surname>Liu</surname> <given-names>Y.</given-names></name><etal/></person-group> (<year>2022</year>). <article-title>Brain-wide ongoing activity is responsible for significant cross-trial BOLD variability.</article-title> <source><italic>Cerebr. Cortex</italic></source> <comment>[Epub ahead of print]</comment>. <pub-id pub-id-type="doi">10.1093/cercor/bhac016</pub-id> <pub-id pub-id-type="pmid">35179203</pub-id></citation></ref>
<ref id="B94"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>S.</given-names></name> <name><surname>Xu</surname> <given-names>M.</given-names></name> <name><surname>Kamigaki</surname> <given-names>T.</given-names></name> <name><surname>Hoang Do</surname> <given-names>J. P.</given-names></name> <name><surname>Chang</surname> <given-names>W.</given-names></name> <name><surname>Jenvay</surname> <given-names>S.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Long-range and local circuits for top-down modulation of visual cortex processing.</article-title> <source><italic>Science</italic></source> <volume>345</volume> <fpage>660</fpage>&#x2013;<lpage>665</lpage>. <pub-id pub-id-type="doi">10.1126/science.1254126</pub-id> <pub-id pub-id-type="pmid">25104383</pub-id></citation></ref>
<ref id="B95"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zou</surname> <given-names>W. Y.</given-names></name> <name><surname>Ng</surname> <given-names>A. Y.</given-names></name> <name><surname>Yu</surname> <given-names>K.</given-names></name></person-group> (<year>2011</year>). &#x201C;<article-title>Unsupervised learning of visual invariance with temporal coherence</article-title>,&#x201D; in <source><italic>Proceedings of the Conference on Neural Information Processing Systems</italic></source>, <publisher-loc>Granada</publisher-loc>. <pub-id pub-id-type="doi">10.1007/s00422-005-0585-8</pub-id> <pub-id pub-id-type="pmid">16021516</pub-id></citation></ref>
</ref-list>
</back>
</article>