<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="brief-report">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Robot. AI</journal-id>
<journal-title>Frontiers in Robotics and AI</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Robot. AI</abbrev-journal-title>
<issn pub-type="epub">2296-9144</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/frobt.2017.00038</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Robotics and AI</subject>
<subj-group>
<subject>Perspective</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Emergent Intentionality in Perception-Action Subsumption Hierarchies</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Windridge</surname> <given-names>David</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="cor1">&#x0002A;</xref>
<uri xlink:href="http://frontiersin.org/people/u/433093"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Computer Science, Middlesex University</institution>, <addr-line>London</addr-line>, <country>United Kingdom</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Serge Thill, Plymouth University, United Kingdom</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Terrence C. Stewart, University of Waterloo, Canada; Tarek Richard Besold, University of Bremen, Germany</p></fn>
<corresp content-type="corresp" id="cor1">&#x0002A;Correspondence: David Windridge, <email>d.windridge&#x00040;mdx.ac.uk</email></corresp>
<fn fn-type="other" id="fn001"><p>Specialty section: This article was submitted to Computational Intelligence, a section of the journal Frontiers in Robotics and AI</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>15</day>
<month>08</month>
<year>2017</year>
</pub-date>
<pub-date pub-type="collection">
<year>2017</year>
</pub-date>
<volume>4</volume>
<elocation-id>38</elocation-id>
<history>
<date date-type="received">
<day>21</day>
<month>04</month>
<year>2017</year>
</date>
<date date-type="accepted">
<day>25</day>
<month>07</month>
<year>2017</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2017 Windridge.</copyright-statement>
<copyright-year>2017</copyright-year>
<copyright-holder>Windridge</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>A <italic>cognitively autonomous</italic> artificial agent may be defined as one able to modify both its external world-model <italic>and</italic> the framework by which it represents the world, requiring two simultaneous optimization objectives. This presents deep epistemological issues centered on the question of how a framework for representation (as opposed to the entities it represents) may be objectively validated. In this article, formalizing previous work in this field, it is argued that subsumptive perception-action learning has the capacity to resolve these issues by (a) building the perceptual hierarchy from the bottom up so as to ground all proposed representations and (b) maintaining a bijective coupling between proposed percepts and projected action possibilities to ensure empirical falsifiability of these grounded representations. In doing so, we will show that such subsumptive perception-action learners intrinsically incorporate a model for how intentionality emerges from randomized exploratory activity in the form of &#x0201C;motor babbling.&#x0201D; Moreover, such a model of intentionality also naturally translates into a model for human&#x02013;computer interfacing that makes minimal assumptions as to cognitive states.</p>
</abstract>
<kwd-group>
<kwd>perception-action learning</kwd>
<kwd>intention recognition</kwd>
<kwd>embodied cognition</kwd>
<kwd>subsumption hierarchies</kwd>
<kwd>symbol grounding</kwd>
</kwd-group>
<contract-num rid="cn01">731593</contract-num>
<contract-sponsor id="cn01">H2020 Future and Emerging Technologies<named-content content-type="fundref-id">10.13039/100010664</named-content></contract-sponsor>
<counts>
<fig-count count="0"/>
<table-count count="1"/>
<equation-count count="0"/>
<ref-count count="23"/>
<page-count count="5"/>
<word-count count="4102"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="introduction">
<label>1</label> <title>Introduction</title>
<p>Significant deficits have been apparent in traditional approaches to embodied computer vision for some time (Dreyfus, <xref ref-type="bibr" rid="B2">1972</xref>). In the conventional approach to autonomous robotics, a computer vision system is employed to build a model of the agent&#x02019;s environment <italic>prior</italic> to the act of planning the agent&#x02019;s actions within the modeled domain. Visuo-haptic data arising from these actions will then typically be used to further constrain the environment model, either actively or passively (in active learning, the agent&#x02019;s actions are driven by the imperative of reducing ambiguity in the environment model (Koltchinskii, <xref ref-type="bibr" rid="B8">2010</xref>; Settles, <xref ref-type="bibr" rid="B17">2010</xref>)).</p>
<p>However, it is apparent, in this approach, that there exists a very wide disparity between the visual parameterization of the agent&#x02019;s domain and its action capabilities within it (Nehaniv et al., <xref ref-type="bibr" rid="B15">2002</xref>). For instance, the agent&#x02019;s visual parametric freedom will typically encompass the full intensity ranges of the RGB channels of each individual pixel of a camera CCD, such that the range of <italic>possible</italic> images generated per time-frame is of an extremely large order of magnitude, despite the fact that only a minuscule fraction of this representational space would ever be experienced by the agent. (Note that this observation is not limited purely to vision-based approaches&#x02014;alternative modalities such as LIDAR and SONAR would also exhibit the same issues.) On the other hand, the agent&#x02019;s motor capability is likely to be very much more parametrically constrained (perhaps consisting of the possible Euler angle settings of the various actuator motors). This disparity is manifested in classical problems such as <italic>framing</italic> (McCarthy and Hayes, <xref ref-type="bibr" rid="B12">1969</xref>) and <italic>symbol grounding</italic>. (The latter occurs when abstractly manipulated symbolic objects lack an intrinsic connection to the real-world objects that they represent; thus a chess-playing robot typically requires a prior supervised computer vision problem to be solved in order to apply deduced moves to visually presented chess pieces.)</p>
<p>Perception-Action (P-A) learning was proposed in order to overcome these issues, adopting as its informal motto, &#x0201C;action <italic>precedes</italic> perception&#x0201D; (Granlund, <xref ref-type="bibr" rid="B5">2003</xref>; Felsberg et al., <xref ref-type="bibr" rid="B3">2009</xref>). By this it is meant that, in a fully formalizable sense, actions are conceptually prior to perceptions; i.e., perceptual capabilities should depend on action-capabilities and not vice versa. (We thus distinguish <italic>P-A learning</italic> from more generalized forms of learning within a perception/action context (cf., e.g., Mai et al., <xref ref-type="bibr" rid="B9">2013</xref>; Masuta et al., <xref ref-type="bibr" rid="B11">2015</xref>; Millan, <xref ref-type="bibr" rid="B13">2016</xref>), in which the nature of the perceptual domain remains fixed <italic>a priori</italic> [albeit with potential variations in, e.g., visual saliency].)</p>
<p>It will be the argument of this article that perception-action learning, as well as having this capacity to resolve fundamental epistemic questions about emergent representational capacity, also naturally gives a model for emergent intentionality that applies to both human and artificial agents, and may thus be deployed as an effective design-strategy in human&#x02013;computer interfacing.</p>
</sec>
<sec id="S2">
<label>2</label> <title>Perception-Action Learning</title>
<p>Perception-Action learning agents thus proceed by randomly sampling their action space (&#x0201C;motor babbling&#x0201D;). For each motor action that produces a discernible perceptual output in the bootstrap representation space <italic>S</italic> (consisting of, e.g., camera pixels), a percept <italic>p<sub>i</sub></italic>&#x02009;&#x02208;&#x02009;<italic>S</italic> is greedily allocated. The agent thus progressively arrives at a set of novel percepts that relate directly to the agent&#x02019;s action capabilities in relation to the constraints of the environment (i.e., the environment&#x02019;s <italic>affordances</italic>); the agent learns to perceive only that which it can change. More accurately, the agent learns to perceive only that which it <italic>hypothesizes</italic> that it can change&#x02014;thus, the set of experimental data points &#x0222A;<italic><sub>i</sub>p<sub>i</sub></italic>&#x02009;&#x02282;&#x02009;<italic>S</italic> can, in theory, be generalized over so as to create an <italic>affordance-manifold</italic> that can be mapped onto the action space via the injective relation {<italic>actions</italic>}&#x02009;&#x02192;&#x02009;{<italic>percept</italic><sub>initial</sub>}&#x02009;&#x000D7;&#x02009;{<italic>percept</italic><sub>final</sub>} (Windridge and Kittler, <xref ref-type="bibr" rid="B21">2008</xref>, <xref ref-type="bibr" rid="B22">2010</xref>; Windridge et al., <xref ref-type="bibr" rid="B19">2013a</xref>).</p>
<sec id="S2-1">
<label>2.1</label> <title>Subsumptive Perception-Action Learning</title>
<p>Importantly, this approach permits <italic>Cognitive Bootstrapping</italic> (Windridge and Kittler, <xref ref-type="bibr" rid="B22">2010</xref>), the bootstrapping of an autonomous agent&#x02019;s representational framework simultaneously with the world-model represented in terms of that framework. This centers on the fact that the learned manifold embodying the <italic>injective</italic> relation {<italic>actions</italic>}&#x02009;&#x02192;&#x02009;{<italic>percept</italic><sub>initial</sub>}&#x02009;&#x000D7;&#x02009;{<italic>percept</italic><sub>final</sub>} represents a constrained subset of the initial action domain, and as such, is susceptible to parametric compression. Furthermore, this parametric compression in the action domain (corresponding to the bootstrapping of a higher level action) necessarily corresponds to a parametric compression in the perceptual domain (P-A learning enforces a <italic>bijective</italic> relation <inline-formula><mml:math id="M1"><mml:mrow><mml:mtext>&#x0007B;</mml:mtext><mml:mi>a</mml:mi><mml:mi>c</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi><mml:mi>s</mml:mi><mml:mtext>&#x0007D;</mml:mtext><mml:mo>&#x02194;</mml:mo><mml:mtext>&#x0007B;</mml:mtext><mml:mi>p</mml:mi><mml:mi>e</mml:mi><mml:mi>r</mml:mi><mml:mi>c</mml:mi><mml:mi>e</mml:mi><mml:mi>p</mml:mi><mml:msubsup><mml:mi>t</mml:mi><mml:mrow><mml:mtext>initial</mml:mtext></mml:mrow><mml:mrow><mml:mtext>new</mml:mtext></mml:mrow></mml:msubsup><mml:mtext>&#x0007D;</mml:mtext><mml:mo>&#x000D7;</mml:mo><mml:mtext>&#x0007B;</mml:mtext><mml:mi>p</mml:mi><mml:mi>e</mml:mi><mml:mi>r</mml:mi><mml:mi>c</mml:mi><mml:mi>e</mml:mi><mml:mi>p</mml:mi><mml:msubsup><mml:mi>t</mml:mi><mml:mrow><mml:mtext>final</mml:mtext></mml:mrow><mml:mrow><mml:mtext>new</mml:mtext></mml:mrow></mml:msubsup><mml:mtext>&#x0007D;</mml:mtext></mml:mrow></mml:math></inline-formula> such that each hypothesizable action (i.e., intention primitive) has a unique, discriminable 
outcome (Windridge and Kittler, <xref ref-type="bibr" rid="B21">2008</xref>, <xref ref-type="bibr" rid="B22">2010</xref>; Windridge et al., <xref ref-type="bibr" rid="B19">2013a</xref>)).</p>
<p>Each induced higher level action/intention (e.g., <italic>Translate</italic>) is thus created concurrently with a higher level percept domain (e.g., <italic>Object</italic>). The falsifiability of such induced representational concepts arises from actively addressing the question of whether this higher level perception in fact constitutes a useful description of the world, i.e., whether it yields a net compression in the agent&#x02019;s internal representation of its own possible interactions with the world (its affordances). In particular, it is argued in Windridge and Kittler (<xref ref-type="bibr" rid="B21">2008</xref>) that the perception-action bijectivity constraint, applied in such a hierarchical manner, is uniquely sufficient to enable simultaneous empirical falsifiability of the cognitive agent&#x02019;s world model <italic>and</italic> the means by which this world is perceived (by virtue of the implicit grounding of the unique set of higher level percepts so generated).</p>
<p>Very often parametric compressibility will be predicated on the discovery of <italic>invariances</italic> in the existing perceptual space with respect to randomized exploratory actions. Thus, for example, an agent might progress from a pixel-based representation of the world to an object-based representation of the world via the discovery that certain patches of pixels <italic>retain their (relative) identity</italic> under translation, i.e., such that it becomes far more efficient to represent the world in terms of indexed objects rather than pixel intensities (though the latter would, of course, still constitute the base of the representational hierarchy). This particular representational enhancement can represent an enormous compression (Wolff, <xref ref-type="bibr" rid="B23">1987</xref>); a pixel-based representation has a parametric magnitude of <italic>P<sup>n</sup></italic> (with <italic>P</italic> and <italic>n</italic> being the intensity resolution and number of pixels, respectively), while an object-based representation typically has a parametric magnitude of &#x0007E; <italic>n<sup>o</sup>, o</italic>&#x02009;&#x0226A;&#x02009;<italic>n</italic>, where <italic>o</italic> is the number of objects.</p>
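<p>As a rough numerical illustration of the compression described above, the following Python sketch compares the two parametric magnitudes on a logarithmic scale. The specific values (a single 8-bit intensity channel, a 640 by 480 pixel sensor, and ten objects) and the function names are assumptions chosen only for illustration and do not come from the cited work.</p>
<preformat><![CDATA[
```python
import math


def log10_pixel_states(intensity_levels: int, num_pixels: int) -> float:
    """Return log10 of P**n, the size of the pixel-based representation."""
    return num_pixels * math.log10(intensity_levels)


def log10_object_states(num_pixels: int, num_objects: int) -> float:
    """Return log10 of n**o, the approximate size of the object-based one."""
    return num_objects * math.log10(num_pixels)


# Assumed illustrative values (not taken from the article).
P = 256          # intensity resolution of one 8-bit channel
n = 640 * 480    # number of pixels
o = 10           # number of indexed objects, with o << n

print(f"pixel-based:  about 10^{log10_pixel_states(P, n):.0f} states")
print(f"object-based: about 10^{log10_object_states(n, o):.0f} states")
```
]]></preformat>
<p>Under these assumed values, the pixel-based state count has several hundred thousand decimal digits, whereas the object-based count has fewer than sixty.</p>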
<p>When such a high level perceptual manifold is created it permits proactive sampling&#x02014;the agent can propose actions with perceptual outcomes that have not yet been experienced by the agent, but which are consistent with its current representational model (this guarantees falsifiability of both the perceptual model and the generalized affordance model). Perception-Action learning thus constitutes a form of active learning: randomized selection of perceptual goals within the hypothesized perception-action manifold leads more rapidly to the capture of data that might falsify the current hypothesis than would otherwise be the case (i.e., if the agent were performing randomly selected actions within the original motor domain). Thus, while the system is always &#x0201C;motor babbling&#x0201D; in a manner analogous to the learning process of infant humans, the fact of carrying out this motor babbling in a higher level P-A manifold means that the learning system as a whole more rapidly converges on the &#x0201C;correct&#x0201D; model of the world. (Correct in the sense of being a true model of the world&#x02019;s affordances; i.e., every possible instantiation of the induced high-level actions terminates in the anticipated percept, with no possible environmental actions being overlooked.)</p>
<p>This P-A motor-babbling activity can take place in <italic>any</italic> P-A manifold, of whatever level of abstraction; we may thus, by combining the idea of P-A learning with Brooks&#x02019;s notion of task subsumption, conceive of a <italic>hierarchical perception-action learner</italic> (Shevchenko et al., <xref ref-type="bibr" rid="B18">2009</xref>), in which a vertical representation hierarchy is progressively constructed for which randomized exploratory motor activity at the highest level of the corresponding motor hierarchy would rapidly converge on an ideal representation of the agent&#x02019;s world in terms of its affordance potentialities. Such a system would thus converge upon both a model of the world, and an ideal strategy for representation of that world in terms of the learning agent&#x02019;s action capabilities within it. In the example given there, which juxtaposes a simulated camera-equipped robot arm in relation to a child&#x02019;s shape-sorter puzzle, the robotic agent commences by motor babbling in the initial motor-actuator domain, and eventually progresses to motor-babbling in the bootstrapped &#x0201C;move-shape-to-hole&#x0201D; action domain (i.e., placing a randomly chosen object into its corresponding hole). This apparently intentional activity amounts to solving the shape-sorter puzzle, even though the system is still only motor babbling, albeit at a higher level of the induced hierarchy, and has no prior programming as to the &#x0201C;goal&#x0201D; of the environment.<xref ref-type="fn" rid="fn1"><sup>1</sup></xref></p>
<p>Procedurally, this takes place as a recursive loop alternating between <italic>exploration, generalization</italic>, and <italic>representation</italic> as in Algorithm <xref ref-type="table" rid="T1">1</xref>. Note in particular, in Algorithm <xref ref-type="table" rid="T1">1</xref>, that the act of parametrically instantiating a proposed bijective perception-action term <inline-formula><mml:math id="M2"><mml:mrow><mml:mtext>&#x0007B;</mml:mtext><mml:msubsup><mml:mi>P</mml:mi><mml:mi>n</mml:mi><mml:mrow><mml:mtext>initial</mml:mtext></mml:mrow></mml:msubsup><mml:mtext>&#x0007D;</mml:mtext><mml:mo>&#x000D7;</mml:mo><mml:mtext>&#x0007B;</mml:mtext><mml:msubsup><mml:mi>P</mml:mi><mml:mi>n</mml:mi><mml:mrow><mml:mtext>final</mml:mtext></mml:mrow></mml:msubsup><mml:mtext>&#x0007D;</mml:mtext></mml:mrow></mml:math></inline-formula> with respect to an initial perceptual state <inline-formula><mml:math id="M3"><mml:mrow><mml:mo>&#x0007B;</mml:mo><mml:msubsup><mml:mi>P</mml:mi><mml:mi>n</mml:mi><mml:mrow><mml:mtext>initial</mml:mtext></mml:mrow></mml:msubsup><mml:mo>&#x0007D;</mml:mo></mml:mrow></mml:math></inline-formula> and a sought perceptual end-state <inline-formula><mml:math id="M4"><mml:mrow><mml:mtext>&#x0007B;</mml:mtext><mml:msubsup><mml:mi>P</mml:mi><mml:mi>n</mml:mi><mml:mrow><mml:mtext>final</mml:mtext></mml:mrow></mml:msubsup><mml:mtext>&#x0007D;</mml:mtext></mml:mrow></mml:math></inline-formula> is equivalent to formulating an <italic>intention</italic> (which may or may not be achievable in the environment).</p>
<table-wrap position="float" id="T1">
<label>Algorithm 1</label>
<caption><p><italic>Ab Initio</italic> Induction of Perception-Action Hierarchy in Artificial Agents.</p></caption>
<table frame="hsides" rules="groups">
<tbody>
<tr>
<td align="left" valign="top">1:</td>
<td align="left" valign="top"><bold>Initialization</bold> Obtain:</td>
</tr>
<tr>
<td align="left" valign="top">2:</td>
<td align="left" valign="top">Bootstrap percept set {<italic>P</italic><sub>1</sub>} (e.g., camera pixels)</td>
</tr>
<tr>
<td align="left" valign="top">3:</td>
<td align="left" valign="top">Bootstrap action set {<italic>A</italic><sub>1</sub>} (motor primitives)</td>
</tr>
<tr>
<td align="left" valign="top">4:</td>
<td align="left" valign="top">Inference mechanism capable of generalizing exploratory samples from function <italic>M</italic></td>
</tr>
<tr>
<td align="left" valign="top"/>
<td align="left" valign="top"><inline-formula><mml:math id="M5"><mml:mrow><mml:mi>M</mml:mi><mml:mo>:</mml:mo><mml:mtext>&#x0007B;</mml:mtext><mml:msubsup><mml:mi>P</mml:mi><mml:mi>n</mml:mi><mml:mrow><mml:mtext>initial</mml:mtext></mml:mrow></mml:msubsup><mml:mtext>&#x0007D;</mml:mtext><mml:mo>&#x000D7;</mml:mo><mml:mtext>&#x0007B;</mml:mtext><mml:msubsup><mml:mi>P</mml:mi><mml:mi>n</mml:mi><mml:mrow><mml:mtext>final</mml:mtext></mml:mrow></mml:msubsup><mml:mtext>&#x0007D;</mml:mtext><mml:mo>&#x000D7;</mml:mo><mml:mtext>&#x0007B;</mml:mtext><mml:mi>A</mml:mi><mml:mtext>&#x0007D;</mml:mtext><mml:mo>&#x02192;</mml:mo><mml:mtext>&#x0007B;</mml:mtext><mml:mi mathvariant="italic">achieved</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">not&#x0005F;achieved</mml:mi><mml:mtext>&#x0007D;</mml:mtext></mml:mrow></mml:math></inline-formula></td>
</tr>
<tr>
<td align="left" valign="top">5:</td>
<td align="left" valign="top"><bold>while</bold> (prediction accuracy&#x02009;&#x0003C;&#x02009;threshold) <bold>do</bold></td>
</tr>
<tr>
<td align="left" valign="top">6:</td>
<td align="left" valign="top">&#x02003;<bold>A) Carry out randomized exploratory activity on the basis of the representational framework</bold></td>
</tr>
<tr>
<td align="left" valign="top"/>
<td align="left" valign="top">&#x02003;i.e., generate grounded top-down parametric instantiations <italic>A<sub>i</sub></italic><sub>&#x02264;</sub><italic><sub>n</sub></italic>(<italic>P</italic><sup>initial</sup>, <italic>P</italic><sup>final</sup>) by randomly selecting initial and target percepts at the <italic>proposed</italic> top level of the hierarchy, <italic>n</italic></td>
</tr>
<tr>
<td align="left" valign="top">7:</td>
<td align="left" valign="top">&#x02003;<bold>B) Induce rules governing action legitimacy</bold></td>
</tr>
<tr>
<td align="left" valign="top"/>
<td align="left" valign="top">&#x02003;legitimate actions achieve intended perceptual goal (&#x0003D;&#x02009;affordance-based model of world)</td>
</tr>
<tr>
<td align="left" valign="top"/>
<td align="left" valign="top">&#x02003;Generate function <italic>M</italic>: {<italic>P</italic>}&#x02009;&#x000D7;&#x02009;{<italic>P</italic>}&#x02009;&#x000D7;&#x02009;{<italic>A</italic>}&#x02009;&#x02192;&#x02009;{<italic>true, false</italic>} (e.g., via first-order logical induction or stochastic discrimination)</td>
</tr>
<tr>
<td align="left" valign="top">8:</td>
<td align="left" valign="top">&#x02003;<bold>C) Remap perceptual variables to represent the novel high-level action hypothesis in the most efficient manner</bold></td>
</tr>
<tr>
<td align="left" valign="top"/>
<td align="left" valign="top">&#x02003;i.e., form the bijection: {<italic>A<sub>n</sub></italic><sub>&#x0002B;1</sub>} &#x02194; {<italic>P<sub>n</sub></italic><sub>&#x0002B;1</sub>}&#x02009;&#x000D7;&#x02009;{<italic>P<sub>n</sub></italic><sub>&#x0002B;1</sub>}</td>
</tr>
<tr>
<td align="left" valign="top">9:</td>
<td align="left" valign="top"><bold>end while</bold></td>
</tr>
</tbody>
</table>
</table-wrap>
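<p>The exploration, generalization, and representation steps of Algorithm 1 can be sketched for a single hierarchical level as follows. This is a minimal toy under stated assumptions and is not the implementation used in the cited work: the one-dimensional world, the names <italic>world_step</italic> and <italic>induce_level</italic>, the use of a plain lookup table in place of a genuine inference mechanism, and the use of table coverage as a stand-in for prediction accuracy are all hypothetical.</p>
<preformat><![CDATA[
```python
import random


def world_step(position, action, size=5):
    """Hypothetical toy environment: move along a line, blocked at the walls."""
    target = position + action
    return target if 0 <= target < size else position


def induce_level(size=5, actions=(-1, 1), threshold=1.0, seed=0,
                 max_trials=10_000):
    """One pass of the explore / generalize / represent loop (toy version)."""
    rng = random.Random(seed)
    triples = [(p_init, p_final, a)
               for p_init in range(size)
               for p_final in range(size)
               for a in actions]
    legitimacy = {}  # learned table M: (P_initial, P_final, A) -> achieved?
    trials = 0
    # Coverage of the table is used here as a crude proxy for accuracy.
    while len(legitimacy) / len(triples) < threshold and trials < max_trials:
        # A) Exploration: pick a random intention (initial percept, goal, action).
        p_init, p_final, a = rng.choice(triples)
        # B) Generalization: here simply memorize whether the goal was achieved.
        legitimacy[(p_init, p_final, a)] = world_step(p_init, a, size) == p_final
        trials += 1
    # C) Representation: keep one action per achievable percept transition,
    # ignoring attempts that produce no perceptual change.
    higher_level_actions = {}
    for (p_init, p_final, a), achieved in legitimacy.items():
        if achieved and p_init != p_final:
            higher_level_actions.setdefault((p_init, p_final), a)
    return legitimacy, higher_level_actions


legitimacy, higher_level_actions = induce_level()
print(len(higher_level_actions), "grounded percept transitions retained")
```
]]></preformat>
<p>With the default arguments, the eight retained transitions are the single-step moves left and right between adjacent positions; blocked moves at the walls are discarded because they produce no perceptual change.</p>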
<p>Perceptual goals thus exist at all levels of the hierarchy, and the subsumptive nature of the hierarchy means that goals and sub-goals are scheduled with increasingly specific content as the high-level abstract goal is progressively grounded through the hierarchy.<xref ref-type="fn" rid="fn2"><sup>2</sup></xref> (Thus, as humans, we may conceive the high-level intention &#x0201C;drive to work,&#x0201D; which in order to be enacted, involves the execution of a large range of sub-goals with correspondingly lower level perceptual goals, e.g., the intention &#x0201C;stay in the center of the lane,&#x0201D; etc.) (The hierarchical perception-action paradigm at no stage specifies <italic>how</italic> the scheduled sub-task is to achieve the perceptual goal&#x02014;this is free within the framework, and may be achieved by a variety of mechanisms, e.g., optimal control, minimum jerk, etc.)</p>
<p>Moreover, these perceptual goals have no internal content; in a fully achieved perception-action learning agent, the environment effectively &#x0201C;becomes its <italic>own</italic> representation&#x0201D; (Newell and Simon, <xref ref-type="bibr" rid="B16">1972</xref>), representing a significant compression of the information that an agent needs to retain. This relates directly to the issue of symbol grounding, a seminal problem in the conceptual underpinning of the classical approach to machine learning (Harnad, <xref ref-type="bibr" rid="B6">1990</xref>). The problem arises when one attempts to relate an abstract symbol manipulation system (it was a common historical assumption that computational reasoning would center on a system such as first-order logic deduction (McCarthy and Hayes, <xref ref-type="bibr" rid="B12">1969</xref>)) with the stochastic, shifting reality of sensor data. In hierarchical P-A learning the problem is eliminated by virtue of the fact that symbolic representations are <italic>abstracted from the bottom-up</italic> (Marr, <xref ref-type="bibr" rid="B10">1982</xref>; G&#x000E4;rdenfors, <xref ref-type="bibr" rid="B4">1994</xref>; Granlund, <xref ref-type="bibr" rid="B5">2003</xref>; Modayil and Kuipers, <xref ref-type="bibr" rid="B14">2004</xref>). They are thus always intrinsically grounded (for an example of utilization of first-order logic induction within a subsumption hierarchy see Windridge and Kittler (<xref ref-type="bibr" rid="B22">2010</xref>)).</p>
<p>The subsumption hierarchy is thus typically characterized by continuous stochastic relationships on the lower levels with more discrete, symbolic manipulations occurring at the higher levels&#x02014;for this reason, consistent with the findings of Shevchenko et al. (<xref ref-type="bibr" rid="B18">2009</xref>), motor-babbling at the top of the representation hierarchy involves the spontaneous scheduling of perceptual goals and subgoals at the lower levels of the hierarchy in a way that (as the hierarchy becomes progressively deeper) looks increasingly <italic>intentional</italic>. (This phenomenon is readily apparent in the development of motor movement of human infants as schema abstraction takes place&#x02014;for instance, the intuition of a generalized percept category <italic>container</italic> correlates with the attempt to validate this notion via the repeated placing of a variety of objects into a variety of containers; cf., Hintzman (<xref ref-type="bibr" rid="B7">1986</xref>) for an analysis of schema abstraction in infants.)</p>
<p>Such high-level schema-employment in humans can, in principle, be detected via an appropriate classification system, enabling novel forms of intentional interfacing between humans and machines.</p>
</sec>
</sec>
<sec id="S3">
<label>3</label> <title>Human&#x02013;Computer Interfacing</title>
<p>The percept-action relationship may thus be modeled in reverse to characterize human intentional behavior; consider how, as humans, we typically represent our environment when driving a vehicle. At one level, we internally represent the immediate environment in metric-related terms (i.e., we are concerned with our proximity to other road users, to the curb and so on). At a higher level, however, we are concerned primarily with <italic>navigation</italic>-related entities (i.e., how individual roads are <italic>connected</italic>). That the latter constitutes a higher hierarchical level, both mathematically and experientially, is guaranteed by the fact that the topological representation <italic>subsumes</italic>, or supervenes upon, the metric representation; i.e., the metric level adds &#x0201C;fine-grained&#x0201D; information to the road topology: the metric representation can be reduced to the topological representation, but not vice versa.</p>
<p>We can thus adopt the perception-action bijectivity principle as a <italic>design paradigm</italic> in building HCI systems by demanding that intentional acts on the part of the user are correlated maximally efficiently (i.e., bijectively) with perceptual transitions apparent to the user. This permits a user interface that makes minimal assumptions as to underlying cognitive processes, assuming nothing more than the ability to discriminate percept termini. This subsumption architecture paradigm was used in Windridge et al. (<xref ref-type="bibr" rid="B20">2013b</xref>) to demonstrate, in the context of a driver assistance system, induction of the intentional hierarchy for drivers of a vehicle in which action and eye-gaze take place with respect to an external road camera view. The corresponding system constructed for the project demonstrator was thus able to determine the driver&#x02019;s intentional hierarchy in relation to the current road situation and provide assistance accordingly. In principle, such an interface can also be extended to direct mechanical assistance by substituting the computationally modeled perception-action system for the human perception-action system along the lines of the horse&#x02013;rider interaction paradigm.</p>
<p>Such P-A-based interfaces will generally require the ability to adaptively link high-level reasoning processes (modeled by, e.g., first-order logic) with low-level reactive processes (modeled, for example, stochastically). This amounts to a requirement to propagate learning across the symbolic/sub-symbolic divide. However, because the P-A hierarchy does not make an intrinsic distinction between these (there is only progressively grounded P-A abstraction), it is possible to conceive of P-A learning platforms that embody a variety of different learning approaches at different hierarchical levels, but which are all able to learn together by passing derivatives between hierarchical layers in a manner analogous to deep learning approaches.</p>
<p>An example utilizing a two-layer P-A hierarchy is given in Windridge et al. (<xref ref-type="bibr" rid="B19">2013a</xref>), which incorporates a fuzzy first-order logic reasoning process on the top level and an Euler-Lagrange-based trajectory optimization process on the lower level. The fuzzy-reasoning process employs predicates embodying the P-A bijectivity condition to compute the fixed point of the logical operator <italic>T<sub>P</sub></italic>; i.e., <italic>T<sub>P</sub></italic>(<italic>I</italic>)&#x02009;&#x0003D;&#x02009;<italic>I</italic> for each time interval <italic>t</italic>.</p>
<p><italic>I</italic> is thus the <italic>Herbrand</italic> model, the minimal logically consistent &#x0201C;world model&#x0201D; for time <italic>t</italic>, of the logic program <italic>P</italic> (where <italic>P</italic>&#x02009;&#x0003D;&#x02009;<italic>fixed clauses</italic>&#x02009;&#x0002B;&#x02009;<italic>temporalized detections</italic>&#x02009;&#x0002B;&#x02009;<italic>ground atom queries for t</italic>&#x02009;&#x0002B;&#x02009;1; <italic>P</italic> hence embodies a series of first-order logical rules concerning traffic behavior). This functionalization of the logical reasoning enables the predicate-prediction disparity with respect to the lower level to modulate the lower level&#x02019;s Euler-Lagrange optimization via the interlevel Jacobian derivatives. The net result is logically weighted updating of the Euler-Lagrange optimization that allows for on-line (top-down and bottom-up) adaptivity to human inputs. For example, in top-down terms, this allows a logically influenced Bayesian prior for gaze-location at junctions to be derived. It also allows for adaptive symbol tethering; for example, actively associating eye-gaze clusters with specific semantically described road entities (such as stop and give-way signs) via their logical context.</p>
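<p>The fixed-point condition <italic>T<sub>P</sub></italic>(<italic>I</italic>)&#x02009;&#x0003D;&#x02009;<italic>I</italic> can be illustrated with a small Python sketch of the immediate-consequence operator iterated from the empty interpretation. This is a simplification under stated assumptions: it uses crisp, ground (propositional) definite clauses rather than the fuzzy first-order predicates of the cited system, and the clause set, the atom names, and the function names are hypothetical.</p>
<preformat><![CDATA[
```python
def immediate_consequence(program, interpretation):
    """T_P: heads of all clauses whose body atoms hold in the interpretation."""
    return frozenset(head for head, body in program
                     if set(body) <= interpretation)


def least_herbrand_model(program):
    """Iterate T_P from the empty interpretation until T_P(I) == I."""
    interpretation = frozenset()
    while True:
        updated = immediate_consequence(program, interpretation)
        if updated == interpretation:
            return interpretation
        interpretation = updated


# Hypothetical ground clauses as (head, body) pairs; an empty body is a fact,
# standing in for a detection at the current time step.
program = [
    ("at_junction", ()),
    ("give_way_sign_visible", ()),
    ("should_slow", ("at_junction", "give_way_sign_visible")),
    ("gaze_left_expected", ("should_slow",)),
    ("should_stop", ("stop_sign_visible",)),  # never fires: no such detection
]

model = least_herbrand_model(program)
print(sorted(model))
```
]]></preformat>
<p>Because the clauses are definite, repeated application of the operator only ever adds atoms, so the iteration terminates at the least fixed point; in this toy program the unsupported atom <italic>should_stop</italic> is absent from the resulting model.</p>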
<p>In principle, any high-level abstract reasoning or induction process can be incorporated with low-level stochastic learning in this manner; highly flexible human&#x02013;computer interfaces are thus made possible through adopting perception-action bijectivity as a <italic>design principle</italic>.</p>
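<p>As a minimal illustrative sketch (not the implementation of Windridge et al., <xref ref-type="bibr" rid="B19">2013a</xref>), the fixed-point condition <italic>T<sub>P</sub></italic>(<italic>I</italic>)&#x02009;&#x0003D;&#x02009;<italic>I</italic> can be computed for a crisp, ground definite programme by iterating the immediate-consequence operator from the empty interpretation until nothing new is derived. The sketch assumes two-valued rather than fuzzy truth values and omits the coupling to the lower-level optimization; the function names and all clause names (e.g., <monospace>at_junction(t)</monospace>) are hypothetical and chosen only to echo the traffic setting.</p>
<preformat>
```python
from typing import FrozenSet, List, Set, Tuple

# A ground definite clause is a (head, body) pair; a fact has an empty body.
Clause = Tuple[str, FrozenSet[str]]


def t_p(program: List[Clause], interpretation: Set[str]) -> Set[str]:
    """Apply the immediate-consequence operator T_P once.

    Returns every head whose body atoms all belong to the interpretation.
    """
    return {head for head, body in program if body.issubset(interpretation)}


def least_model(program: List[Clause]) -> Set[str]:
    """Iterate T_P from the empty interpretation until T_P(I) == I."""
    interpretation: Set[str] = set()
    while True:
        updated = t_p(program, interpretation)
        if updated == interpretation:
            return interpretation
        interpretation = updated


# Hypothetical ground programme for one time interval t (illustrative names).
program: List[Clause] = [
    # Temporalized detections, entered as facts.
    ("at_junction(t)", frozenset()),
    ("sign_detected(stop, t)", frozenset()),
    # Fixed clauses, already grounded for time t.
    ("must_stop(t)", frozenset({"at_junction(t)", "sign_detected(stop, t)"})),
    ("gaze_expected(stop_sign, t)", frozenset({"must_stop(t)"})),
    ("may_proceed(t)", frozenset({"at_junction(t)", "road_clear(t)"})),
]

model = least_model(program)
print(sorted(model))
```
</preformat>
<p>In this toy programme, <monospace>may_proceed(t)</monospace> is excluded from the resulting model because <monospace>road_clear(t)</monospace> is never derived, which illustrates the sense in which the fixed point is a minimal world model for time <italic>t</italic>.</p>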
</sec>
<sec id="S4">
<label>4</label> <title>Conclusion</title>
<p>We have proposed perception-action hierarchies as a natural solution to the problem of representational induction in artificial agents in a manner that maintains empirical validatability. In such <italic>ab initio</italic> P-A hierarchies (i.e., where cognitive representations are bootstrapped in a bottom-up fashion), exploration is conducted via motor-babbling at progressively higher levels of the hierarchy. This necessarily involves the spontaneous scheduling of perceptual goals and subgoals in the induced lower levels of the hierarchy in such a way that, as the hierarchy becomes deeper, the randomized exploration becomes increasingly &#x0201C;intentional&#x0201D; (a phenomenon that is readily apparent in the development of motor movement in human infants).</p>
<p>This has implications for social robotics; in particular, it becomes possible to envisage communicable actions within collections of agents employing P-A hierarchies. Here, the same bijectivity considerations apply to perceptions and actions as before; however, the induction and grounding of symbols would be conducted through linguistic exchange (we note in passing that the perception-action bijectivity constraint implicitly embodies the notion of mirroring without requiring specific perceptual apparatus such as &#x0201C;mirror neurons&#x0201D;).</p>
<p>P-A subsumption hierarchies also naturally encompass symbolic/subsymbolic integration and permit adaptive learning with respect to existing knowledge bases; in this case, a bijective P-A consistency criterion is imposed on the engineered subsumption hierarchy. Moreover, P-A subsumption hierarchies naturally lend themselves to a &#x0201C;deep&#x0201D; formulation in neural-symbolic terms (d&#x02019;Avila Garcez et al., <xref ref-type="bibr" rid="B1">2009</xref>); this is the subject of ongoing research.</p>
<p>We therefore conclude that perception-action learning, as well as enabling autonomous cognitive bootstrapping architectures, also constitutes a particularly straightforward approach to modeling human intentionality, in that it makes the fewest cognitive assumptions: the existence of perceptual representation is assumed only in so far as it directly relates to an observable high-level action concept (such as &#x0201C;navigating a junction,&#x0201D; &#x0201C;stopping at a red light,&#x0201D; etc.); conversely, the ability to correctly interpret a human agent&#x02019;s action implicitly invokes a necessary and sufficient set of perceptual representations on the part of the agent. This bijectivity of perception and action also gives a natural explanation for wider intention-related phenomena such as <italic>action mirroring</italic>.</p>
</sec>
<sec id="S5" sec-type="author-contributor">
<title>Author Contributions</title>
<p>The author is responsible for all aspects of the work.</p>
</sec>
<sec id="S6">
<title>Conflict of Interest Statement</title>
<p>The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<fn-group>
<fn fn-type="financial-disclosure">
<p><bold>Funding.</bold> The author would like to acknowledge financial support from the Horizon 2020 European Research project DREAMS4CARS (no. 731593).</p></fn>
</fn-group>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="book"><person-group person-group-type="author"><name><surname>d&#x02019;Avila Garcez</surname> <given-names>A. S.</given-names></name> <name><surname>Lamb</surname> <given-names>L. C.</given-names></name> <name><surname>Gabbay</surname> <given-names>D. M.</given-names></name></person-group> (<year>2009</year>). <source>Neural-Symbolic Cognitive Reasoning</source>. <publisher-loc>Berlin, Heidelberg</publisher-loc>: <publisher-name>Springer</publisher-name>.</citation></ref>
<ref id="B2"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Dreyfus</surname> <given-names>H.</given-names></name></person-group> (<year>1972</year>). <source>What Computers Can&#x02019;t Do</source>. <publisher-loc>New York</publisher-loc>: <publisher-name>Harper and Row</publisher-name>.</citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Felsberg</surname> <given-names>M.</given-names></name> <name><surname>Wiklund</surname> <given-names>J.</given-names></name> <name><surname>Granlund</surname> <given-names>G.</given-names></name></person-group> (<year>2009</year>). <article-title>Exploratory learning structures in artificial cognitive systems</article-title>. <source>Image Vis. Comput.</source> <volume>27</volume>, <fpage>1671</fpage>&#x02013;<lpage>1687</lpage>.<pub-id pub-id-type="doi">10.1016/j.imavis.2009.02.012</pub-id></citation></ref>
<ref id="B4"><citation citation-type="book"><person-group person-group-type="author"><name><surname>G&#x000E4;rdenfors</surname> <given-names>P.</given-names></name></person-group> (<year>1994</year>). <article-title>&#x0201C;How logic emerges from the dynamics of information,&#x0201D;</article-title> in <source>Logic and Information Flow</source>, eds <person-group person-group-type="editor"><name><surname>van Eijck</surname> <given-names>J.</given-names></name> <name><surname>Visser</surname> <given-names>A.</given-names></name></person-group> (<publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>), <fpage>49</fpage>&#x02013;<lpage>77</lpage>.</citation></ref>
<ref id="B5"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Granlund</surname> <given-names>G.</given-names></name></person-group> (<year>2003</year>). <article-title>&#x0201C;Organization of architectures for cognitive vision systems,&#x0201D;</article-title> in <conf-name>Proceedings of Workshop on Cognitive Vision</conf-name> (<conf-loc>Germany</conf-loc>: <conf-sponsor>Schloss Dagstuhl</conf-sponsor>).</citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Harnad</surname> <given-names>S.</given-names></name></person-group> (<year>1990</year>). <article-title>The symbol grounding problem</article-title>. <source>Physica D</source> <volume>42</volume>, <fpage>335</fpage>&#x02013;<lpage>346</lpage>.<pub-id pub-id-type="doi">10.1016/0167-2789(90)90087-6</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hintzman</surname> <given-names>D. L.</given-names></name></person-group> (<year>1986</year>). <article-title>Schema abstraction in a multiple-trace memory model</article-title>. <source>Psychol. Rev.</source> <volume>93</volume>, <fpage>411</fpage>&#x02013;<lpage>428</lpage>.<pub-id pub-id-type="doi">10.1037/0033-295X.93.4.411</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Koltchinskii</surname> <given-names>V.</given-names></name></person-group> (<year>2010</year>). <article-title>Rademacher complexities and bounding the excess risk in active learning</article-title>. <source>J. Mach. Learn. Res.</source> <volume>11</volume>, <fpage>2457</fpage>&#x02013;<lpage>2485</lpage>.</citation></ref>
<ref id="B9"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Mai</surname> <given-names>X.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Jin</surname> <given-names>Y.</given-names></name> <name><surname>Yang</surname> <given-names>Y.</given-names></name> <name><surname>Zhang</surname> <given-names>J.</given-names></name></person-group> (<year>2013</year>). <article-title>&#x0201C;Simple perception-action strategy based on hierarchical temporal memory,&#x0201D;</article-title> in <conf-name>IEEE International Conference on Robotics and Biomimetics (ROBIO)</conf-name>, <conf-loc>Shenzhen</conf-loc>, <fpage>1759</fpage>&#x02013;<lpage>1764</lpage>.</citation></ref>
<ref id="B10"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Marr</surname> <given-names>D.</given-names></name></person-group> (<year>1982</year>). <source>Vision: A Computational Approach</source>. <publisher-loc>San Francisco</publisher-loc>: <publisher-name>Freeman &#x00026; Co</publisher-name>.</citation></ref>
<ref id="B11"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Masuta</surname> <given-names>H.</given-names></name> <name><surname>Lim</surname> <given-names>H. O.</given-names></name> <name><surname>Motoyoshi</surname> <given-names>T.</given-names></name> <name><surname>Koyanagi</surname> <given-names>K.</given-names></name> <name><surname>Oshima</surname> <given-names>T.</given-names></name></person-group> (<year>2015</year>). <article-title>&#x0201C;Direct perception and action system for unknown object grasping,&#x0201D;</article-title> in <conf-name>24th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN)</conf-name>, <conf-loc>Kobe</conf-loc>, <fpage>313</fpage>&#x02013;<lpage>318</lpage>.</citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McCarthy</surname> <given-names>J.</given-names></name> <name><surname>Hayes</surname> <given-names>P.</given-names></name></person-group> (<year>1969</year>). <article-title>Some philosophical problems from the standpoint of artificial intelligence</article-title>. <source>Mach. Intell.</source> <volume>4</volume>, <fpage>463</fpage>&#x02013;<lpage>502</lpage>.</citation></ref>
<ref id="B13"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Millan</surname> <given-names>J. D. R.</given-names></name></person-group> (<year>2016</year>). <article-title>&#x0201C;Brain-controlled devices: the perception-action closed loop,&#x0201D;</article-title> in <conf-name>4th International Winter Conference on Brain-Computer Interface (BCI)</conf-name>, <conf-loc>Taebaek</conf-loc>, <fpage>1</fpage>&#x02013;<lpage>2</lpage>.</citation></ref>
<ref id="B14"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Modayil</surname> <given-names>J.</given-names></name> <name><surname>Kuipers</surname> <given-names>B.</given-names></name></person-group> (<year>2004</year>). <article-title>&#x0201C;Bootstrap learning for object discovery,&#x0201D;</article-title> in <conf-name>IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-04)</conf-name> (<conf-loc>Sendai</conf-loc>: <conf-sponsor>IEEE</conf-sponsor>).</citation></ref>
<ref id="B15"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Nehaniv</surname> <given-names>C. L.</given-names></name> <name><surname>Polani</surname> <given-names>D.</given-names></name> <name><surname>Dautenhahn</surname> <given-names>K.</given-names></name> <name><surname>te Boekhorst</surname> <given-names>R.</given-names></name> <name><surname>Canamero</surname> <given-names>L.</given-names></name></person-group> (<year>2002</year>). <article-title>&#x0201C;Meaningful information, sensor evolution, and the temporal horizon of embodied organisms,&#x0201D;</article-title> in <source>Artificial Life VIII</source>, eds <person-group person-group-type="editor"><name><surname>Standish</surname> <given-names>R.</given-names></name> <name><surname>Bedau</surname> <given-names>M. A.</given-names></name> <name><surname>Abbass</surname> <given-names>H. A.</given-names></name></person-group> (<publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>), <fpage>345</fpage>&#x02013;<lpage>349</lpage>.</citation></ref>
<ref id="B16"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Newell</surname> <given-names>A.</given-names></name> <name><surname>Simon</surname> <given-names>H.A.</given-names></name></person-group> (<year>1972</year>). <source>Human Problem Solving</source>, Vol. <volume>104</volume>. <publisher-loc>Englewood Cliffs, NJ</publisher-loc>: <publisher-name>Prentice-Hall</publisher-name>.</citation></ref>
<ref id="B17"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Settles</surname> <given-names>B.</given-names></name></person-group> (<year>2010</year>). <source>Active Learning Literature Survey</source>. <publisher-loc>Madison</publisher-loc>: <publisher-name>University of Wisconsin</publisher-name>.</citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shevchenko</surname> <given-names>M.</given-names></name> <name><surname>Windridge</surname> <given-names>D.</given-names></name> <name><surname>Kittler</surname> <given-names>J.</given-names></name></person-group> (<year>2009</year>). <article-title>A linear-complexity reparameterisation strategy for the hierarchical bootstrapping of capabilities within perception-action architectures</article-title>. <source>Image Vis. Comput.</source> <volume>27</volume>, <fpage>1702</fpage>&#x02013;<lpage>1714</lpage>.<pub-id pub-id-type="doi">10.1016/j.imavis.2008.12.002</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Windridge</surname> <given-names>D.</given-names></name> <name><surname>Felsberg</surname> <given-names>M.</given-names></name> <name><surname>Shaukat</surname> <given-names>A.</given-names></name></person-group> (<year>2013a</year>). <article-title>A framework for hierarchical perception-action learning utilizing fuzzy reasoning</article-title>. <source>IEEE Trans. Cybern.</source> <volume>43</volume>, <fpage>155</fpage>&#x02013;<lpage>169</lpage>.<pub-id pub-id-type="doi">10.1109/TSMCB.2012.2202109</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Windridge</surname> <given-names>D.</given-names></name> <name><surname>Shaukat</surname> <given-names>A.</given-names></name> <name><surname>Hollnagel</surname> <given-names>E.</given-names></name></person-group> (<year>2013b</year>). <article-title>Characterizing driver intention via hierarchical perception-action modeling</article-title>. <source>IEEE Trans. Hum. Mach. Syst.</source> <volume>43</volume>, <fpage>17</fpage>&#x02013;<lpage>31</lpage>.<pub-id pub-id-type="doi">10.1109/TSMCA.2012.2216868</pub-id></citation></ref>
<ref id="B21"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Windridge</surname> <given-names>D.</given-names></name> <name><surname>Kittler</surname> <given-names>J.</given-names></name></person-group> (<year>2008</year>). <article-title>&#x0201C;Epistemic constraints on autonomous symbolic representation in natural and artificial agents,&#x0201D;</article-title> in <source>Applications of Computational Intelligence in Biology. Studies in Computational Intelligence</source>, Vol. <volume>122</volume>, eds <person-group person-group-type="editor"><name><surname>Smolinski</surname> <given-names>T. G.</given-names></name> <name><surname>Milanova</surname> <given-names>M. G.</given-names></name> <name><surname>Hassanien</surname> <given-names>A. E.</given-names></name></person-group> (<publisher-loc>Berlin, Heidelberg</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>395</fpage>&#x02013;<lpage>422</lpage>.</citation></ref>
<ref id="B22"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Windridge</surname> <given-names>D.</given-names></name> <name><surname>Kittler</surname> <given-names>J.</given-names></name></person-group> (<year>2010</year>). <article-title>&#x0201C;Perception-action learning as an epistemologically-consistent model for self-updating cognitive representation,&#x0201D;</article-title> in <source>Brain Inspired Cognitive Systems 2008</source>, Vol. <volume>657</volume>, eds <person-group person-group-type="editor"><name><surname>Hussain</surname> <given-names>A.</given-names></name> <name><surname>Aleksander</surname> <given-names>I.</given-names></name> <name><surname>Smith</surname> <given-names>L.</given-names></name> <name><surname>Barros</surname> <given-names>A.</given-names></name> <name><surname>Chrisley</surname> <given-names>R.</given-names></name> <name><surname>Cutsuridis</surname> <given-names>V.</given-names></name></person-group> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>95</fpage>&#x02013;<lpage>134</lpage>.</citation></ref>
<ref id="B23"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Wolff</surname> <given-names>J. G.</given-names></name></person-group> (<year>1987</year>). <article-title>&#x0201C;Cognitive development as optimisation,&#x0201D;</article-title> in <source>Computational Models of Learning</source>, ed. <person-group person-group-type="editor"><name><surname>Bolc</surname> <given-names>L.</given-names></name></person-group> (<publisher-loc>Heidelberg</publisher-loc>: <publisher-name>Springer-Verlag</publisher-name>), <fpage>161</fpage>&#x02013;<lpage>205</lpage>.</citation></ref>
</ref-list>
<fn-group>
<fn id="fn1"><p><sup>1</sup>In this case, the &#x0201C;move-shape-to-hole&#x0201D; action is induced following the failure of the &#x0201C;move-shape-onto-surface&#x0201D; action to produce the anticipated result (i.e., when, following exploratory &#x0201C;move-shape-onto-surface&#x0201D; actions, the object happens by chance to fall into a hole that matches its shape and orientation). This immediately falsifies the existing rule-base, such that the action domain is necessarily modified, by first-order logical rule-induction, to account for this possibility. In the context of the P-A bijectivity condition, this requires the existence of an action &#x0201C;move-shape-to-hole&#x0201D; perceptually parameterized by a set of labels corresponding to the perceptual representation of holes. Exploratory instantiation of this higher-level rule then corresponds to placing random objects into their corresponding holes, i.e., &#x0201C;solving the shapesorter,&#x0201D; even though no such external goal specification has taken place.</p></fn>
<fn id="fn2"><p><sup>2</sup>The subsumption hierarchy thus acts bidirectionally; the hierarchy is learned bottom-up, while exploratory actions are instantiated top-down.</p></fn>
</fn-group>
</back>
</article>