<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Hum. Neurosci.</journal-id>
<journal-title>Frontiers in Human Neuroscience</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Hum. Neurosci.</abbrev-journal-title>
<issn pub-type="epub">1662-5161</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fnhum.2014.00867</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Neuroscience</subject>
<subj-group>
<subject>Original Research Article</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Analogy, explanation, and proof</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Hummel</surname> <given-names>John E.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/48171"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Licato</surname> <given-names>John</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/174707"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Bringsjord</surname> <given-names>Selmer</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Psychology, University of Illinois</institution> <country>Urbana-Champaign, IL, USA</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Computer Science, Rensselaer Polytechnic Institute</institution> <country>Troy, NY, USA</country></aff>
<aff id="aff3"><sup>3</sup><institution>Department of Cognitive Science, Rensselaer Polytechnic Institute</institution> <country>Troy, NY, USA</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Aron K. Barbey, University of Illinois at Urbana-Champaign, USA</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Paul Thagard, University of Waterloo, Canada; Ken Forbus, Northwestern University, USA</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: John E. Hummel, Department of Psychology, University of Illinois, 603 E. Daniel St., Champaign, IL 61820, USA e-mail: <email>jehummel&#x00040;illinois.edu</email></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to the journal Frontiers in Human Neuroscience.</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>06</day>
<month>11</month>
<year>2014</year>
</pub-date>
<pub-date pub-type="collection">
<year>2014</year>
</pub-date>
<volume>8</volume>
<elocation-id>867</elocation-id>
<history>
<date date-type="received">
<day>15</day>
<month>05</month>
<year>2014</year>
</date>
<date date-type="accepted">
<day>07</day>
<month>10</month>
<year>2014</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2014 Hummel, Licato and Bringsjord.</copyright-statement>
<copyright-year>2014</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract><p>People are habitual explanation generators. At its most mundane, our propensity to explain allows us to infer that we should not drink milk that smells sour; at the other extreme, it allows us to establish facts (e.g., theorems in mathematical logic) whose truth was not even known prior to the existence of the explanation (proof). What do the cognitive operations underlying the inference that the milk is sour have in common with the proof that, say, the square root of two is irrational? Our ability to generate explanations bears striking similarities to our ability to make analogies. Both reflect a capacity to generate inferences and generalizations that go beyond the featural similarities between a novel problem and familiar problems in terms of which the novel problem may be understood. However, a notable difference between analogy-making and explanation-generation is that the former is a process in which a single <italic>source</italic> situation is used to reason about a single <italic>target</italic>, whereas the latter often requires the reasoner to integrate multiple sources of knowledge. This seemingly small difference poses a challenge to the task of marshaling our understanding of analogical reasoning to understanding explanation. We describe a model of explanation, derived from a model of analogy, adapted to permit systematic violations of this one-to-one mapping constraint. Simulation results demonstrate that the resulting model can generate explanations for novel explananda and that, like the explanations generated by human reasoners, these explanations vary in their coherence.</p></abstract>
<kwd-group>
<kwd>explanation</kwd>
<kwd>analogy</kwd>
<kwd>logic</kwd>
<kwd>modeling</kwd>
<kwd>LISA</kwd>
</kwd-group>
<counts>
<fig-count count="2"/>
<table-count count="2"/>
<equation-count count="0"/>
<ref-count count="41"/>
<page-count count="11"/>
<word-count count="9943"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="introduction" id="s1">
<title>Introduction</title>
<p>People constantly seek, generate, and evaluate explanations (Thagard, <xref ref-type="bibr" rid="B35">1989</xref>, <xref ref-type="bibr" rid="B36">2012</xref>; Sloman, <xref ref-type="bibr" rid="B33">2005</xref>; Keil, <xref ref-type="bibr" rid="B21">2006</xref>). At its most mundane, our propensity to explain guides our simplest actions, as when we decide to throw away sour milk: &#x0201C;The milk smells sour because it has gone bad.&#x0201D; At the other extreme, explanation (<italic>aka</italic>, <italic>abduction</italic>; see Josephson and Josephson, <xref ref-type="bibr" rid="B20">1994</xref>; Magnani, <xref ref-type="bibr" rid="B28">2009</xref>) lies at the heart of our most uniquely human endeavors, including science, engineering and mathematics. And in between, it helps us to understand why a street might be closed or why people in Kansas tend to vote Republican. As anyone who has ever given an essay exam knows, the ability to explain is also a powerful index of understanding.</p>
<p>What all these activities have in common is that they are largely inductive exercises<xref ref-type="fn" rid="fn0001"><sup>1</sup></xref>: Our inference that the milk has gone bad is based on our previous experiences with spoiled milk, rather than the fact that sour-smelling milk is logically guaranteed to be spoiled (consider, e.g., buttermilk). Scientific theories are similarly inductive in nature. Observations consistent with a theory add to the evidence that the theory is correct, but they do not logically <italic>prove</italic> it correct (a fact known as the <italic>problem of scientific induction</italic>). Even our species&#x00027; most purely deductive endeavor, logical theorem proving, has an element of induction at its base: A mathematician&#x00027;s proof of a theorem may consist entirely of deductive reasoning from the premises to the conclusion, but the theorem itself likely arose from less formal, more inductive (perhaps even &#x0201C;intuitive&#x0201D;), origins.</p>
<p>These considerations suggest that a common set of inductive mechanisms may underlie all our explanatory behaviors, from throwing away the milk, to discovering and refining the theory of gravity or evolution, to the decision that Poincar&#x000E9;&#x00027;s Conjecture or G&#x000F6;del&#x00027;s Incompleteness Theorems are worth trying to prove.</p>
<p>All animals are capable of inductive inference. Even a rat will freeze at the presentation of a tone that has been paired with a shock, or press a lever in anticipation of a food reward. But the human capacity for inductive inference differs qualitatively from the inductive abilities of other primates in our ability to make inferences that depend, not just on the perceptual features of the objects involved, but on the <italic>relations</italic> between those objects (Penn et al., <xref ref-type="bibr" rid="B31">2008</xref>): The fact that a planet orbits a star does not depend on the perceptual &#x0201C;features&#x0201D; of the planet or star, but on the relations between the bodies&#x00027; masses and distance. And the fact that a log will make a suitable bridge across a stream depends on the relation between the length of the log and the width of the stream, and on the relation between the strength of the log and the weight of the objects we wish to move across the stream.</p>
<sec>
<title>Analogy and explanation: the role of flexible relational knowledge representation</title>
<p>Chief among the manifestations of our ability to reason explicitly about relations is our ability to reason using analogies, schemas, and rules (Gick and Holyoak, <xref ref-type="bibr" rid="B11">1980</xref>, <xref ref-type="bibr" rid="B12">1983</xref>; Gentner, <xref ref-type="bibr" rid="B9">1983</xref>; Holyoak and Thagard, <xref ref-type="bibr" rid="B14">1995</xref>; Hummel and Holyoak, <xref ref-type="bibr" rid="B16">1997</xref>, <xref ref-type="bibr" rid="B17">2003</xref>). Indeed, analogy-making is broadly regarded as a <italic>sine qua non</italic> of relational thought (see Gentner, <xref ref-type="bibr" rid="B9">1983</xref>; Holyoak and Thagard, <xref ref-type="bibr" rid="B14">1995</xref>; Doumas et al., <xref ref-type="bibr" rid="B4">2008</xref>). Accordingly, although a detailed, algorithmic account of explanation remains largely elusive (but see Friedman and Forbus, <xref ref-type="bibr" rid="B8">2008</xref>; Hummel et al., <xref ref-type="bibr" rid="B19">2008</xref>; Hummel and Landy, <xref ref-type="bibr" rid="B18">2009</xref>; Landy and Hummel, <xref ref-type="bibr" rid="B24">2010</xref>, for progress in this direction), accounts of explanation generation, use (Ahn et al., <xref ref-type="bibr" rid="B1">1987</xref>; Vosniadou and Brewer, <xref ref-type="bibr" rid="B38">1987</xref>; VanLehn et al., <xref ref-type="bibr" rid="B37">1992</xref>; Patalano et al., <xref ref-type="bibr" rid="B30">2006</xref>), and evaluation (Keil, <xref ref-type="bibr" rid="B21">2006</xref>; Lombrozo and Carey, <xref ref-type="bibr" rid="B27">2006</xref>) rest on assumptions that are common to explanation-generation and analogy-making (e.g., Falkenhainer, <xref ref-type="bibr" rid="B6">1990</xref>).</p>
<p>This paper presents our early attempts to understand, at a detailed algorithmic level, the cognitive operations that underlie our ability to generate explanations. One of our goals in this work is to understand what our decision to throw away spoiled milk has in common with, say, the insights that led G&#x000F6;del to prove the incompleteness of mathematics (on certain reasonable assumptions). As the empirical literature on explanation generation is comparatively thin, our starting point is one of first principles: What do we know about how people generate explanations, and how can those facts constrain our modeling?</p>
<p>One thing we know about explanation is that it depends on our ability to flexibly access, combine, and apply existing knowledge (Ahn et al., <xref ref-type="bibr" rid="B1">1987</xref>; Vosniadou and Brewer, <xref ref-type="bibr" rid="B38">1987</xref>). This flexibility is illustrated by an experiment by Patalano et al. (<xref ref-type="bibr" rid="B30">2006</xref>). In one condition, Patalano et al. gave subjects a novel explanandum of the form &#x0201C;In the population as a whole, people tend to prefer Pepsi to Coke as often as they prefer Coke to Pepsi. However, ministers tend to prefer Coke over Pepsi,&#x0201D; and asked them to explain this new &#x0201C;fact.&#x0201D; Their research subjects generated many different explanations, but one of the typical ones took the general form: &#x0201C;Ministers tend to be conservative. Perhaps the Coke Corporation supports conservative causes.&#x0201D; This explanation reflects a combination of knowledge about ministers, corporations, and the kinds of factors that can lead a person to prefer one product to another, and reflects tremendous flexibility in the way that knowledge is accessed and combined.</p>
<p>The way we generate explanations suggests at least two kinds of flexibility in the representations and processes underlying those explanations. The first is the kind of <italic>relational</italic> flexibility underlying analogical reasoning. For example, one way to account for the &#x0201C;conservative causes&#x0201D; explanation above is to assume that the subject knows that if a person agrees with the political leanings of a company, then that person will tend to prefer the products of that company. Such a schema needs to be relationally flexible in the sense that it needs to be variabilized so that, in the limit, it can be used to reason about any person, product and company.</p>
<p>Second, explanation requires <italic>semantic</italic> flexibility so that it can exploit partial but imperfect matches between the objects and relations composing an explanandum and those in the relevant schemas or examples in LTM. For example, imagine that our experimental subject did not have a &#x0201C;product preference schema&#x0201D; but did know of a prior case in which her friend preferred to use a particular cell phone company because of its liberal-leaning political activism. The subject could use this prior example as a <italic>source</italic> analog (Holyoak and Thagard, <xref ref-type="bibr" rid="B13">1989</xref>) with which to reason about the situation involving ministers and Coke, but only if her mental representations of the situations allowed her to tolerate the semantic differences between her friend, the cell phone company, and the cell phone service on the one hand, and ministers, the Coca Cola Corporation, and Coke on the other (Hummel and Holyoak, <xref ref-type="bibr" rid="B16">1997</xref>).</p>
<p>These same kinds of flexibility also characterize human reasoning using analogies, schemas, and rules (Holyoak and Thagard, <xref ref-type="bibr" rid="B13">1989</xref>, <xref ref-type="bibr" rid="B14">1995</xref>; Falkenhainer, <xref ref-type="bibr" rid="B6">1990</xref>; Hummel and Holyoak, <xref ref-type="bibr" rid="B16">1997</xref>, <xref ref-type="bibr" rid="B17">2003</xref>).</p>
</sec>
<sec>
<title>Beyond analogy: causal knowledge and flexible knowledge integration</title>
<p>Although our capacity for explanation shares a great deal with our capacity for analogical reasoning, two additional properties of explanation seem to distinguish it from general-purpose analogy-making. The first concerns the role of causal relations (and related higher-order relations, such as logical entailment). Although higher-order relations play an important, even crucial, role in analogy (e.g., Gentner, <xref ref-type="bibr" rid="B9">1983</xref>; Markman, <xref ref-type="bibr" rid="B29">1997</xref>), analogy-making is nonetheless seen as a general-purpose inference engine, equally happy to operate over all kinds of relational structures. By contrast, causal relations enjoy a privileged organizing role in explanatory structures: It is almost impossible to answer the question <italic>why?</italic> without invoking <italic>because</italic>. As elaborated shortly, we conjecture that, as embodied in explanations, causal relations are neither as implicit as simple &#x0201C;associative links&#x0201D; (as embodied, say, in the Rescorla and Wagner, <xref ref-type="bibr" rid="B32">1972</xref>, model of associative learning, or even in extant Bayesian models of causal inference) nor as explicit, and thus working-memory (WM)-demanding, as full-blown variabilized relations (Hummel et al., <xref ref-type="bibr" rid="B19">2008</xref>). Rather, they are explicit structures that organize explanations into meaningful parts and guide explanatory reasoning, but which can be held in WM along with the structures they relate.</p>
<p>A second important difference between analogy and explanation concerns the scope of the knowledge structures brought to bear on the solution of a problem. Analogy is typically construed as a process of reasoning about a novel <italic>target</italic> problem or domain in terms of a familiar <italic>source</italic> or <italic>base</italic> domain (Gentner, <xref ref-type="bibr" rid="B9">1983</xref>; Gick and Holyoak, <xref ref-type="bibr" rid="B12">1983</xref>; Holyoak and Thagard, <xref ref-type="bibr" rid="B13">1989</xref>; Hummel and Holyoak, <xref ref-type="bibr" rid="B16">1997</xref>, <xref ref-type="bibr" rid="B17">2003</xref>), and accordingly, extant models of analogy make inferences from a single source to a single target. Of course, multiple source and target analogs may be used for the purposes of inducing a general schema or rule from multiple examples (Gick and Holyoak, <xref ref-type="bibr" rid="B11">1980</xref>, <xref ref-type="bibr" rid="B12">1983</xref>; Gentner and Medina, <xref ref-type="bibr" rid="B10">1998</xref>; Kuehne et al., <xref ref-type="bibr" rid="B23">2000</xref>; Hummel and Holyoak, <xref ref-type="bibr" rid="B17">2003</xref>) or learning a new relation (Doumas et al., <xref ref-type="bibr" rid="B4">2008</xref>). But within any single reasoning episode, reasoning is conceptualized as being based on the mapping from a single source to a single target, and this convention is not arbitrary.</p>
<p>The key bottleneck in analogical reasoning is the process of <italic>mapping</italic> the target onto the source: Finding a set of correspondences between the elements (objects, relations, and propositions) of the target and source that systematically reflects the relational structures of both (e.g., Gentner, <xref ref-type="bibr" rid="B9">1983</xref>; Falkenhainer et al., <xref ref-type="bibr" rid="B7">1989</xref>; Holyoak and Thagard, <xref ref-type="bibr" rid="B13">1989</xref>, <xref ref-type="bibr" rid="B14">1995</xref>; Hummel and Holyoak, <xref ref-type="bibr" rid="B16">1997</xref>). Only once these mappings have been established can structurally consistent inferences about the target be drawn from the source. However, this mapping problem is fundamentally ill-posed without assuming constraints on its solution. One of the most basic constraints on mapping&#x02014;one that is universally accepted among models of analogical reasoning&#x02014;is the <italic>1:1 mapping constraint</italic>: Each element in the target may correspond to at most one in the source, and vice versa. Even with this constraint, mapping is hard (NP-hard, to be precise), but at least it&#x00027;s solvable; without it, analogical mapping would be hopelessly underconstrained (e.g., Falkenhainer et al., <xref ref-type="bibr" rid="B7">1989</xref>; Holyoak and Thagard, <xref ref-type="bibr" rid="B13">1989</xref>). Consistent with these computational considerations, Markman (<xref ref-type="bibr" rid="B29">1997</xref>) showed that human analogy-making is likewise bound by this 1:1 mapping constraint.</p>
<p>Things are not so tidy in the case of explanation. Generating an explanation often requires integrating information from <italic>multiple</italic> sources in LTM. Returning to our ministers and Coke example, the reasoner may have one set of schemas describing the properties of ministers, another describing the conditions under which one&#x00027;s beliefs might lead to particular product preferences, and yet another describing what it means for one person (e.g., a minister) to agree with another person or entity (e.g., the Coke Corporation). In order to generate the &#x0201C;supports conservative causes&#x0201D; explanation, it is necessary to integrate these diverse sources of knowledge, somehow keeping track of what corresponds to what within and between the explanandum and the various schemas. And in order to integrate information from multiple sources in LTM, it is necessary to violate the 1:1 mapping constraint: The minister in the explanandum will correspond to one object in the <italic>product preference</italic> schema and a different one in the <italic>ministers</italic> schema. As elaborated below, we present a solution to this problem that works by serializing the mapping of the explanandum onto the various knowledge structures in LTM. In contrast to the kind of serialization that goes on in the case of schema-, rule-, or relation-induction from multiple examples, the serialization required for explanation must be performed in the service of making inferences about a single target (the explanandum) during a single reasoning episode.</p>
</sec>
</sec>
<sec>
<title>A process model of explanation</title>
<sec>
<title>Knowledge representation</title>
<p>The point of departure for our effort is Hummel and Holyoak&#x00027;s (<xref ref-type="bibr" rid="B16">1997</xref>, <xref ref-type="bibr" rid="B17">2003</xref>) LISA model of analogical reasoning. LISA is an artificial neural network whose representations and processes are rendered symbolic (i.e., explicitly relational) by virtue of its solution to the problem of dynamically binding relational roles to their fillers. LISA represents propositions [such as <italic>prefer</italic> (ministers, Coke)] using a hierarchy of distributed and progressively more localist codes (see Figure <xref ref-type="fig" rid="F1">1</xref>).</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p><bold>LISA representation of the proposition <italic>prefer</italic> (ministers, Coke)</bold>. <italic>Semantic</italic> units (small circles) represent the semantic features of objects and relational roles. <italic>Object</italic> and <italic>role</italic> units (large circles and triangles, respectively) represent objects, such as ministers (M) and Coke (C), and relational roles, such as <italic>prefer-agent</italic> (<italic>p1</italic>) and <italic>preferred-thing</italic> (<italic>p2</italic>), in a localist fashion. <italic>Sub-proposition</italic> (SP; aka <italic>role-binding</italic>) units (rectangles) represent bindings of arguments (objects or complete propositions) to relational roles and <italic>proposition</italic> (P) units (oval) represent complete propositions. When a proposition becomes active (i.e., enters working memory), role-filler bindings are represented by synchrony of firing: Separate role bindings (SPs, object, role and their associated semantic units) fire out of synchrony with one another, and units representing the same role binding fire in synchrony with one another.</p></caption>
<graphic xlink:href="fnhum-08-00867-g0001.tif"/>
</fig>
<p>At the bottom of the hierarchy, objects and relational roles are represented as patterns of activation distributed over units coding for their semantic features. At the next level, objects and roles are represented by localist <italic>object</italic> and <italic>role</italic> units, which share bidirectional excitatory connections with the semantic units describing them. For example, the object unit <italic>minister</italic> might share connections with semantics such as <italic>human</italic>, <italic>adult</italic>, <italic>religious</italic>, <italic>official</italic>, etc. Role-filler bindings are encoded by <italic>sub-proposition</italic> units (SPs), which share bi-directional excitatory connections with the object and role units they bind together. At the top of the hierarchy, <italic>proposition</italic> (P) units bind individual role bindings (SPs) together into complete propositions.</p>
<p>This hierarchy represents propositions both in LISA&#x00027;s LTM and, when a proposition becomes active, in its WM. In LTM, a proposition&#x00027;s role bindings are represented strictly by the conjunctive SPs. However, this kind of conjunctive code is inadequate as a general solution to the binding problem in WM, because it fails to represent relational roles independently of their arguments (Hummel and Biederman, <xref ref-type="bibr" rid="B15a">1992</xref>; Hummel and Holyoak, <xref ref-type="bibr" rid="B16">1997</xref>; Hummel, <xref ref-type="bibr" rid="B15">2011</xref>). When a proposition becomes active (i.e., enters WM) its role bindings are represented both conjunctively by the SPs, and <italic>dynamically</italic> by synchrony of firing: The separate SPs composing a proposition fire out of synchrony with one another. As a result, relational roles fire in synchrony with the fillers to which they are bound, and separate role-filler bindings fire out of synchrony with one another. On the semantic units, the result is a collection of mutually desynchronized distributed patterns of activation, one for each role-filler binding. These representations have the property that they represent relational roles and their arguments independently of one another (i.e., the same units will represent a given object or relational role, regardless of the role or object to which it happens to be bound at the time) and simultaneously specify how roles are bound to their fillers. They are therefore both distributed and explicitly relational, i.e., symbolic (see Hummel and Holyoak, <xref ref-type="bibr" rid="B16">1997</xref>).</p>
<p>LISA&#x00027;s knowledge representations are compartmentalized into &#x0201C;analogs&#x0201D;: Collections of propositions that together represent complete events, concepts, rules or schemas. Within an analog, a given object or role is represented by a single unit regardless of the number of propositions in which it plays a role. However, separate analogs do not share object, role, SP or P units: A given object or role is represented by one unit in one analog and by a different unit in another analog. As such, object and role units do not represent objects or roles in the abstract; they represent specific instantiations or tokens of those objects or roles in specific analogs. (The same is true of SP and P units.) Accordingly, we will refer to object, role, SP and P units collectively as <italic>token units</italic>. In contrast to the token units, all analogs connect to a common pool of semantic units. The semantic units thus represent the abstract <italic>types</italic> to which the tokens refer.</p>
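<p>The distinction between analog-specific tokens and shared semantic types can be illustrated with a minimal Python sketch. It is not drawn from the LISA source code: the class names <italic>Token</italic> and <italic>Analog</italic>, the second analog, and every semantic feature other than those listed above for <italic>minister</italic> are assumptions made for illustration, and the network dynamics (activation, synchrony) are omitted entirely.</p>
<preformat>
```python
from dataclasses import dataclass, field

# Shared pool of semantic units: every analog connects to the same features.
# The features for "ministers" follow the text; the others are assumed.
SEMANTIC_POOL = {
    "ministers": frozenset({"human", "adult", "religious", "official"}),
    "Coke": frozenset({"product", "beverage"}),
    "prefer-agent": frozenset({"prefer", "agent"}),
    "preferred-thing": frozenset({"prefer", "patient"}),
}


@dataclass(eq=False)  # identity-based equality: each token is a distinct unit
class Token:
    """A localist object or role unit that belongs to exactly one analog."""
    name: str
    analog: str

    @property
    def semantics(self):
        # Tokens in different analogs point at the same semantic units.
        return SEMANTIC_POOL[self.name]


@dataclass
class Analog:
    name: str
    tokens: dict = field(default_factory=dict)
    propositions: list = field(default_factory=list)

    def token(self, name):
        # Within an analog there is one unit per object or role,
        # however many propositions it takes part in.
        if name not in self.tokens:
            self.tokens[name] = Token(name, self.name)
        return self.tokens[name]

    def add_proposition(self, bindings):
        # Each (role, filler) pair stands in for one SP (role-binding) unit;
        # the tuple of pairs stands in for the P unit.
        prop = tuple((self.token(r), self.token(f)) for r, f in bindings)
        self.propositions.append(prop)
        return prop


bindings = [("prefer-agent", "ministers"), ("preferred-thing", "Coke")]
explanandum = Analog("explanandum")
explanandum.add_proposition(bindings)
remembered = Analog("remembered-case")  # hypothetical second analog
remembered.add_proposition(bindings)

# Different token units in different analogs, one shared semantic type.
same_token = explanandum.token("ministers") is remembered.token("ministers")
same_type = (explanandum.token("ministers").semantics
             == remembered.token("ministers").semantics)
```
</preformat>
<p>In this sketch, <italic>same_token</italic> is false and <italic>same_type</italic> is true, mirroring the claim that separate analogs do not share token units but do share semantic units.</p>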
<p>For the purposes of LISA&#x00027;s operation, analogs are divided into three sets: A <italic>driver</italic> and one or more <italic>recipients</italic> are assumed to reside in <italic>active memory</italic> (a primed subset of LTM that is larger than WM; Cowan, <xref ref-type="bibr" rid="B3">2001</xref>); all others are dormant in LTM. LISA&#x00027;s operations are controlled by the driver. One (or at most three) at a time, propositions in the driver become active and enter the <italic>phase set</italic>: The set of active but mutually de-synchronized role bindings. The phase set is LISA&#x00027;s WM, and like human WM (see Cowan, <xref ref-type="bibr" rid="B3">2001</xref>), is limited to at most 4&#x02013;6 role bindings at a time. The patterns of activation that propositions in the phase set generate on the semantic units excite other propositions in LISA&#x00027;s LTM (for memory retrieval) and in its active memory (for mapping, analogical inference and schema induction) and thereby bootstrap all the functions LISA performs (see Hummel and Holyoak, <xref ref-type="bibr" rid="B17">2003</xref>, Supplementary Material, for the full details; source code for the 2003 version of the model can be downloaded free from <ext-link ext-link-type="uri" xlink:href="http://internal.psychology.illinois.edu/&#x0007E;jehummel/models.php">http://internal.psychology.illinois.edu/&#x0007E;jehummel/models.php</ext-link>).</p>
</sec>
<sec>
<title>Processing</title>
<p>Most of the operations performed by the model described here are &#x0201C;standard LISA&#x0201D; and, unless stated otherwise, are performed as described in Hummel and Holyoak (<xref ref-type="bibr" rid="B17">2003</xref>) (exceptions to this generalization are described where they become relevant). LISA performs memory retrieval as a form of guided pattern recognition (Hummel and Holyoak, <xref ref-type="bibr" rid="B16">1997</xref>): Patterns of activation generated on the semantic units by one proposition tend to activate other, similar, propositions in LTM, retrieving them into active memory. For example, the patterns activated by the proposition <italic>prefer</italic> (ministers, Coke) might activate the proposition <italic>prefer</italic> (person, product) in the &#x0201C;product preference&#x0201D; schema.</p>
<p>Augmented with a simple algorithm for learning which structures in the driver tend to activate which structures in the recipient, LISA&#x00027;s retrieval algorithm serves as a basis for analogical mapping: In this trivial analogy, <italic>ministers</italic> bound to <italic>prefer-agent</italic> activates <italic>person</italic> bound to <italic>prefer-agent</italic> in the schema, and <italic>Coke</italic> bound to <italic>preferred-object</italic> activates <italic>product</italic> bound to <italic>preferred-object</italic>; as a result, <italic>ministers</italic> fires at the same time as (in synchrony with) <italic>person</italic> and <italic>Coke</italic> fires with <italic>product</italic>, so LISA maps <italic>ministers</italic> to <italic>person</italic> and <italic>Coke</italic> to <italic>product</italic>. The same is true for corresponding roles of the <italic>prefer</italic> relation, and the SP and P units binding those roles to their fillers.</p>
<p>LISA represents these correspondences as learned <italic>mapping connections</italic> between corresponding structures (e.g., between <italic>ministers</italic> and <italic>person</italic>, etc.). These connections serve both to represent the learned mappings and to constrain future mappings. They also play a central role in LISA&#x00027;s capacity for <italic>self-supervised learning</italic>&#x02014;the core of its algorithm for analogical inference and schema induction (Hummel and Holyoak, <xref ref-type="bibr" rid="B17">2003</xref>).</p>
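<p>The mapping step in the <italic>prefer</italic> example can be caricatured as follows. This sketch deliberately swaps LISA&#x00027;s learned mapping connections and synchrony-based dynamics for a much cruder stand-in: a greedy match on semantic feature overlap, with a 1:1 restriction imposed by hand. The feature sets and the function names <italic>pattern</italic> and <italic>map_bindings</italic> are assumptions introduced here, not part of the model.</p>
<preformat>
```python
# Illustrative semantic features (assumed, not LISA's actual semantics).
SEMANTICS = {
    "ministers": {"human", "adult", "religious", "official"},
    "person": {"human", "adult"},
    "Coke": {"product", "beverage"},
    "product": {"product"},
    "prefer-agent": {"prefer", "agent"},
    "preferred-object": {"prefer", "object"},
}

# Role bindings as (role, filler) pairs.
driver = [("prefer-agent", "ministers"), ("preferred-object", "Coke")]
recipient = [("prefer-agent", "person"), ("preferred-object", "product")]


def pattern(binding):
    """Semantic pattern generated when one role binding fires."""
    role, filler = binding
    return SEMANTICS[role].union(SEMANTICS[filler])


def map_bindings(driver, recipient):
    """Greedy stand-in for mapping: each driver binding, fired one at a
    time, is paired with the unused recipient binding it overlaps most."""
    mapping = {}
    taken = set()
    for d in driver:
        candidates = [r for r in recipient if r not in taken]
        if not candidates:
            break
        best = max(
            candidates,
            key=lambda r: len(pattern(d).intersection(pattern(r))),
        )
        taken.add(best)           # enforce the 1:1 constraint
        mapping[d[0]] = best[0]   # role corresponds to role
        mapping[d[1]] = best[1]   # filler corresponds to filler
    return mapping


mapping = map_bindings(driver, recipient)
```
</preformat>
<p>Under these assumed features the sketch pairs <italic>ministers</italic> with <italic>person</italic> and <italic>Coke</italic> with <italic>product</italic>, as in the text; unlike the model, it keeps no graded connection weights and cannot revise an early commitment.</p>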
<p>One of the main adaptive functions of analogical thinking is that it supports <italic>relational generalization</italic>: Inferences based on the relational roles that objects play, rather than just the literal features of the objects themselves. In the current example, once LISA maps <italic>ministers</italic> to <italic>person</italic> and <italic>Coke</italic> to <italic>product</italic> (along with their roles), it is then prepared to &#x0201C;copy with substitution and generation&#x0201D; (Holyoak and Thagard, <xref ref-type="bibr" rid="B13">1989</xref>) the structure of the entire &#x0201C;product preference schema&#x0201D; over onto the &#x0201C;minister and Coke&#x0201D; situation, effectively filling in a (partial) explanation for why ministers might prefer Coke. Through repeated cycles of retrieval, mapping, and inference (elaborated below), the model is able to violate the 1:1 mapping constraint to integrate multiple sources of knowledge through sequential analogical inference.</p>
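<p>A rough sketch of copy with substitution and generation, applied in two successive passes, may make the idea of sequential inference concrete. It assumes simplified caricatures of a &#x0201C;product preference&#x0201D; schema and an &#x0201C;agreement&#x0201D; schema, supplies the mappings for each pass by hand rather than computing them, and uses a hypothetical helper named <italic>cwsg</italic>; none of this is the model&#x00027;s actual implementation.</p>
<preformat>
```python
def cwsg(source_props, mapping, target_props):
    """Copy source propositions into the target, substituting mapped
    elements and generating placeholders for unmapped ones."""
    mapping = dict(mapping)  # leave the caller's mapping untouched
    inferred = []
    for relation, *args in source_props:
        for a in args:
            # Generation: an unmapped source element gets a new,
            # placeholder element in the target.
            mapping.setdefault(a, "some-" + a)
        # Substitution: rewrite the proposition in target terms.
        prop = (relation, *(mapping[a] for a in args))
        if prop not in target_props and prop not in inferred:
            inferred.append(prop)
    return inferred


# Assumed schema caricatures, written as (relation, arg1, arg2).
preference_schema = [
    ("agree-with", "person", "corporation"),
    ("produce", "corporation", "product"),
    ("prefer", "person", "product"),
]
agreement_schema = [
    ("believe", "entity1", "proposition"),
    ("believe", "entity2", "proposition"),
    ("agree-with", "entity1", "entity2"),
]

target = [("prefer", "ministers", "Coke")]  # the explanandum

# Pass 1: ministers corresponds to "person" in the preference schema.
step1 = cwsg(preference_schema,
             {"person": "ministers", "product": "Coke"}, target)
target = target + step1

# Pass 2: ministers now corresponds to "entity1" in a different schema.
# Each pass is 1:1 on its own; the constraint is relaxed only across passes.
step2 = cwsg(agreement_schema,
             {"entity1": "ministers", "entity2": "some-corporation"}, target)
target = target + step2
```
</preformat>
<p>After both passes the target holds the original explanandum plus four inferred propositions: that ministers agree with some corporation, that the corporation produces Coke, and that both hold some belief in common. Keeping a separate mapping per pass is the design choice that lets <italic>ministers</italic> correspond to different elements in different sources.</p>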
<p>LISA&#x00027;s knowledge representations (&#x0201C;LISAese&#x0201D;) enjoy the flexibility of distributed representations and the relational sophistication of symbolic representations. As such, they are an ideal platform on which to build a model of understanding and explanation.</p>
</sec>
<sec>
<title>Causal relations</title>
<p>Consider a set of propositions that together might form a &#x0201C;product preference&#x0201D; schema:</p>
<list list-type="simple">
<list-item><p>P1: <italic>agree-with</italic> (person, corporation)</p></list-item>
<list-item><p>P2: <italic>produce</italic> (corporation, product)</p></list-item>
<list-item><p>P3: <italic>prefer</italic> (person, product)</p></list-item>
</list>
<p>Another set of propositions that might form an &#x0201C;agreement&#x0201D; schema:</p>
<list list-type="simple">
<list-item><p>P1: <italic>believe</italic> (entity1, proposition)</p></list-item>
<list-item><p>P2: <italic>believe</italic> (entity2, proposition)</p></list-item>
<list-item><p>P3: <italic>agree-with</italic> (entity1, entity2)</p></list-item>
</list>
<p>Assuming that these propositions constitute reasonable caricatures of the preference and agreement schemas, they are clearly causally related to one another. Specifically, P1 and P2 (<italic>agree-with</italic> and <italic>produce</italic>) in the preference schema jointly cause P3 (<italic>prefer</italic>), and P1 and P2 (<italic>believe</italic>) in the agreement schema jointly cause P3 (<italic>agree-with</italic>). How should these causal relations be represented for the purposes of generating explanations?</p>
<p>One straightforward approach is to represent them as explicit propositions, for example:</p>
<list list-type="simple">
<list-item><p>P4: <italic>and</italic> (P1, P2)</p></list-item>
<list-item><p>P5: <italic>cause</italic> (P4, P3)</p></list-item>
</list>
<p>LISAese makes it possible for one proposition to take another as an argument, so this approach to representing causal relations is perfectly plausible; and in some circumstances, people can undoubtedly represent causal relations in this way. However, LISAese assumes that explicit propositions are represented in WM and therefore consume finite WM capacity (specifically, when the propositions become active, all their roles must fire out of synchrony with one another, each role occupying a &#x0201C;slot&#x0201D; in WM). As such, we suggest that this approach is likely to be too demanding of WM capacity to serve as a general solution to the problem of representing causal relations for the purposes of explanation: Note that P4 and P5 collectively introduce four additional role bindings into each schema; across the two schemas, that is eight additional role bindings that would need to occupy slots (although not all at the same time) in our intrinsically capacity-limited WM. It seems intuitive that, although we are aware of the causal relations, and can name them when asked, we do not necessarily think so explicitly about them in the service of generating an explanation.</p>
<p>Alternatively, we could represent causal relations in an entirely implicit fashion, for example as associative links whose weights indicate causal strength (e.g., as in a Bayes net). This approach would solve the WM problem caused by the explicit propositions, but it goes too far in the opposite direction, representing causal relations only as implicit links rather than explicit structures that can be activated, analogically mapped, and ultimately inferred (e.g., by analogical inference) into the emerging explanation.</p>
<p>We propose a third alternative: To represent groups of related propositions by connecting them to <italic>group</italic> units (Hummel et al., <xref ref-type="bibr" rid="B19">2008</xref>). For example, the fact that P1 and P2 in the agreement schema (the <italic>believe</italic> relations) jointly cause something can be represented by connecting P1 and P2 to a single group unit, and tagging that group as a <italic>cause</italic> by connecting it to semantic units representing <italic>cause</italic> (see Figure <xref ref-type="fig" rid="F2">2</xref>). Likewise, the fact that P3 is an effect can be represented by connecting it to a group unit, and connecting that unit to semantic units representing <italic>effect</italic>. Finally, the fact that the P1/P2 group is the cause of P3 can be represented by connecting the cause and effect groups to a higher-level <italic>cause-effect</italic> (CE) group unit. This latter unit represents the strength of the causal relation by connecting to semantic units coding for that strength.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p><bold>LISA representation of the cause-effect relation: Entity 1 believes proposition <italic>p [believe (e1, p)]</italic>, entity 2 believes proposition <italic>p [believe (e2, p)]</italic>, and these facts jointly cause e1 to agree with e2 [<italic>agree</italic> (e1, e2)]</bold>. To represent that <italic>believe</italic> (e1, <italic>p</italic>) and <italic>believe</italic> (e2, <italic>p</italic>) jointly cause something, the units representing these propositions (left-most ovals) share bi-directional excitatory connections to a unit (left-most diamond) representing a <italic>cause</italic> group. To represent that <italic>agree</italic> (e1, e2) is the effect of something, the unit representing that proposition (right-most oval) shares a bi-directional excitatory connection to a unit (right-most diamond) representing an <italic>effect</italic> group. To represent that the cause on the left is the cause of the effect on the right, the corresponding cause and effect groups share bi-directional excitatory connections with a unit (upper-most diamond) representing a <italic>cause-effect</italic> (<italic>CE</italic>) group. Connections between the group units and their respective <italic>cause</italic>, <italic>effect</italic>, and <italic>CE</italic> semantic units are not shown.</p></caption>
<graphic xlink:href="fnhum-08-00867-g0002.tif"/>
</fig>
<p>The resulting representation is more explicit than simply representing causal relations as associative links: causal relations are represented as collections of units that can be activated, mapped and inferred. But at the same time, it is less WM-demanding than explicit propositions: Because group units reside &#x0201C;above&#x0201D; P units in the representational hierarchy (and are effectively different &#x0201C;data types&#x0201D;), they incur no additional WM burden over and above the propositions they link as causally related. (Of course, the notion of units as &#x0201C;data types&#x0201D; is metaphorical. All that matters is that the cognitive architecture respect the units&#x00027; different spots in the representational hierarchy; see Doumas et al., <xref ref-type="bibr" rid="B4">2008</xref>).</p>
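<p>As a purely illustrative sketch (not the LISA implementation), the nesting of propositions under cause, effect, and CE groups can be pictured as a small data structure. The class names, the <italic>strength</italic> field, and the helper <italic>link_cause_effect</italic> are hypothetical and introduced here only for illustration; the sketch captures the hierarchy of the agreement schema, not the semantic units or the synchrony-based dynamics.</p>
<preformat>
```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Proposition:
    label: str
    predicate: str
    arguments: Tuple[str, ...]


@dataclass
class Group:
    kind: str  # "cause", "effect", or "cause-effect"
    members: List[object] = field(default_factory=list)
    # Hypothetical stand-in for the semantic units coding causal strength.
    strength: Optional[float] = None
    # Back-pointer to the parent CE group; excluded from repr/eq to avoid cycles.
    parent: Optional["Group"] = field(default=None, repr=False, compare=False)


def link_cause_effect(causes, effect, strength=None):
    """Group joint causes and one effect under a single cause-effect (CE) group."""
    cause_group = Group("cause", list(causes))
    effect_group = Group("effect", [effect])
    ce_group = Group("cause-effect", [cause_group, effect_group], strength)
    cause_group.parent = ce_group
    effect_group.parent = ce_group
    return ce_group


# The agreement schema: two believe propositions jointly cause agree-with.
p1 = Proposition("P1", "believe", ("entity1", "proposition"))
p2 = Proposition("P2", "believe", ("entity2", "proposition"))
p3 = Proposition("P3", "agree-with", ("entity1", "entity2"))
ce = link_cause_effect([p1, p2], p3)
```
</preformat>
<p>In this sketch the group objects sit above the propositions and add no new role bindings, which mirrors the claim that group units carry no additional WM burden.</p>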
<p>Group units also serve to organize LISA&#x00027;s knowledge into meaningful packages, including but not limited to causes, effects, and cause-effect pairings. As a result, they play a central role in determining which propositions are likely to become active in close temporal proximity (i.e., what LISA is likely to &#x0201C;think about&#x0201D; in what order; see Hummel and Holyoak, <xref ref-type="bibr" rid="B16">1997</xref>, <xref ref-type="bibr" rid="B17">2003</xref>). Specifically, LISA&#x00027;s processing is constrained to activate propositions in group-based sets (in the driver), and it is constrained to retrieve propositions from LTM in group-based sets (a departure from the original LISA; Hummel and Holyoak, <xref ref-type="bibr" rid="B16">1997</xref>, <xref ref-type="bibr" rid="B17">2003</xref>). During memory retrieval, the probability of a group being retrieved into WM at any instant, <italic>t</italic>, is proportional to the group&#x00027;s activation at time <italic>t</italic>. If a cause or effect group gets retrieved, then that event automatically triggers retrieval of the group&#x00027;s parent CE group. As a result, if LISA is reminded of a familiar effect (e.g., some novel explanandum activates a proposition in LTM connected to an effect group), then it will tend to be reminded of the cause as well (via the shared CE group). Thus, group units not only play an important role in LISA&#x00027;s representation of causal relations; they also play a key role in its metacognition, controlling what it &#x0201C;thinks about&#x0201D; together, and controlling what it is reminded of together.</p>
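<p>The group-based retrieval rule can be sketched as activation-proportional sampling followed by automatic retrieval of the parent CE group. This is a minimal sketch under simplifying assumptions: a single group is sampled per call, activations are given as fixed numbers, and the function and variable names are hypothetical rather than taken from the model.</p>
<preformat>
```python
import random


def retrieve_group(activations, parents, rng):
    """Sample one group with probability proportional to its activation.

    If the sampled group has a parent CE group, that parent is retrieved too.
    Returns a list of retrieved group names (empty if nothing is active).
    """
    names = list(activations)
    weights = [activations[name] for name in names]
    if sum(weights) == 0:
        return []  # nothing is active, so retrieval fails
    chosen = rng.choices(names, weights=weights, k=1)[0]
    retrieved = [chosen]
    parent = parents.get(chosen)
    if parent is not None:
        retrieved.append(parent)  # cause/effect groups pull in their CE group
    return retrieved


# Hypothetical activations at one instant: only the prefer effect group is active.
activations = {"effect_prefer": 1.0, "effect_beach": 0.0}
parents = {"effect_prefer": "CE_preference"}
result = retrieve_group(activations, parents, random.Random(0))
```
</preformat>
<p>Because the parent CE group is retrieved together with the sampled effect group, being reminded of an effect brings the associated cause group within reach as well.</p>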
</sec>
<sec>
<title>Flow of control</title>
<p>Armed with group-augmented LISAese, LISA&#x00027;s algorithm for explanation operates according to a retrieve-map-infer cycle that is applied iteratively to construct a causal chain representing an explanation of the explanandum (see Table <xref ref-type="table" rid="T1">1</xref>). (The same retrieve-map-infer process also characterizes reasoning by analogy [e.g., Gentner, <xref ref-type="bibr" rid="B9">1983</xref>; Holyoak and Thagard, <xref ref-type="bibr" rid="B13">1989</xref>; Hummel and Holyoak, <xref ref-type="bibr" rid="B17">2003</xref>]; but in analogical reasoning, it is performed only once, not iteratively.) This process can be initiated by placing the proposition(s) representing the explanandum into the driver, connected to an isolated effect group (i.e., an effect group with no parent CE group and no sibling cause group). All of LISA&#x00027;s other knowledge resides dormant in LTM. In the case of our ministers and Coke example, the driver would contain the proposition <italic>prefer</italic> (ministers, Coke) connected to an effect group; LTM would contain all of LISA&#x00027;s other knowledge, including the preference and agreement schemas. (In general, it is not necessary to connect the propositions of the explanandum to an effect group, as elaborated in the Simulations, but doing so is a convenient way to mark the explanandum as <italic>that which is to be explained</italic>.)</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p><bold>Illustration of the retrieve-map-infer cycle that governs explanation-generation in LISA</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left"><bold>Cycle</bold></th>
<th align="left"><bold>Retrieval Cue</bold></th>
<th align="left"><bold>Retrieved</bold></th>
<th align="left"><bold>Mappings</bold></th>
<th align="left"><bold>Inference</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">0</td>
<td align="left"><italic>prefer</italic> (ministers, Coke)</td>
<td align="left"><italic>prefer</italic> (person, product)</td>
<td align="left"><italic>prefer1</italic> &#x02192; <italic>prefer1</italic></td>
<td align="left"><italic>agree-with</italic> (min., corp.)</td>
</tr>
<tr>
<td/>
<td/>
<td align="left">(from <italic>Product Preference Schema</italic>)</td>
<td align="left"><italic>prefer2</italic> &#x02192; <italic>prefer2</italic></td>
<td align="left"><italic>produce</italic> (corp., Coke)</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td align="left">ministers &#x02192; person</td>
<td/>
</tr>
<tr>
<td/>
<td/>
<td/>
<td align="left">Coke &#x02192; product</td>
<td/>
</tr>
<tr>
<td align="left">1</td>
<td align="left"><italic>agree-with</italic> (min., corp.)</td>
<td align="left"><italic>agree-with</italic> (ent1, ent2)</td>
<td align="left">agree1 &#x02192; agree1</td>
<td align="left"><italic>believe</italic> (min., prop.)</td>
</tr>
<tr>
<td/>
<td/>
<td align="left">(from <italic>Agreement Schema</italic>)</td>
<td align="left">agree2 &#x02192; agree2</td>
<td align="left"><italic>believe</italic> (corp., prop.)</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td align="left">ministers &#x02192; entity1</td>
<td/>
</tr>
<tr>
<td/>
<td/>
<td/>
<td align="left">corporation &#x02192; entity2</td>
<td/>
</tr>
</tbody>
</table>
</table-wrap>
<p>LISA initiates the explanation process by activating the proposition and its effect group in the attempt to retrieve a relevant schema or prior example from LTM. In the case of the current example, the isolated effect group activates the <italic>effect</italic> semantic, and the <italic>prefer</italic> proposition activates the semantics of <italic>ministers&#x0002B;prefer-agent</italic> and <italic>Coke&#x0002B;preferred-object</italic> (the semantics of <italic>ministers</italic> fire in synchrony with those of <italic>prefer-agent</italic> and out of synchrony with those of <italic>Coke</italic> and <italic>preferred-object</italic>, but all these units fire in synchrony with <italic>effect</italic>). The resulting patterns of activation on the semantic units represent the query <italic>Why do ministers prefer Coke?</italic>, and tend to activate effect groups (via the <italic>effect</italic> semantic) connected to semantically similar propositions in LTM (via the semantics connected to the proposition).</p>
<p>In the current example, P3 in the preference schema, <italic>prefer</italic> (person, product), is likely to be retrieved. Since retrieval is group-based, with a bias in favor of retrieving CE groups over isolated cause or effect groups, the activation of P3 in the preference schema is likely to result in the retrieval of the whole product preference schema. (For convenience, we illustrate flow of control in LISA using the preference and agreement schemas, but the logic is exactly the same if, instead of schemas, LISA has analogous specific examples.)</p>
<p>If LISA fails to retrieve a CE group from LTM, then it halts, declaring the explanation complete. (If the resulting explanation is the empty set, then LISA&#x00027;s answer is effectively &#x0201C;I don&#x00027;t know.&#x0201D;) If LISA succeeds in retrieving a CE group, then it places a proxy of that group into a <italic>workspace.</italic> That is, it copies the units composing the group into a new target analog, which we assume to correspond to neurons in frontal cortex with rapidly modifiable synapses (see Knowlton et al., <xref ref-type="bibr" rid="B22">2012</xref>). Retrieval is thus a matter of activation and proxy creation rather than simply activating an existing structure in LTM (another important departure from Hummel and Holyoak, <xref ref-type="bibr" rid="B16">1997</xref>, <xref ref-type="bibr" rid="B17">2003</xref>). LISA then maps the elements of the explanandum onto the proxy of the CE group, for example, mapping <italic>ministers</italic> onto <italic>person</italic>, <italic>Coke</italic> onto <italic>product</italic>, and <italic>prefer</italic> onto <italic>prefer</italic> (along with their SPs, P units and effect groups; see Table <xref ref-type="table" rid="T1">1</xref>).</p>
<p>The model next makes the workspace the driver and the explanandum the recipient and, using analogical inference (i.e., self-supervised learning; Hummel and Holyoak, <xref ref-type="bibr" rid="B17">2003</xref>), infers the missing elements in the explanation. In this case, it would infer:</p>
<list list-type="simple">
<list-item><p>P2: <italic>agree-with</italic> (ministers, corporation)</p></list-item>
<list-item><p>P3: <italic>produce</italic> (corporation, Coke).</p></list-item>
</list>
<p>In making these inferences, it will connect both P2 and P3 to a cause group (also inferred from the workspace), and connect both that cause group and the existing effect group [containing P1: <italic>prefer</italic> (ministers, Coke)] to a CE group. LISA&#x00027;s explanation now consists of the hypothesis &#x0201C;ministers prefer Coke because they agree with the corporation that makes Coke.&#x0201D;</p>
<p>Finally, LISA attaches both P2 (<italic>agree-with</italic>) and P3 (<italic>produce</italic>) to their own effect groups, turns control back over to the explanandum (which is now an emerging explanation) and starts the whole cycle over again. Attaching P2 and P3 to effect groups is LISA&#x00027;s way of seeking new causes to explain these facts: <italic>Why do ministers agree with the Coke corporation?</italic> (P2) and (less sensibly) <italic>Why does the Coke corporation produce Coke?</italic> (P3). When the effect group connected to P2: <italic>agree-with</italic> (ministers, corporation) is used to drive retrieval, the result is likely to be retrieval of the agreement schema (or an analogous specific example), in which case the same processes described above augment the explanation with the statements:</p>
<list list-type="simple">
<list-item><p>P4: <italic>believe</italic> (ministers, some-proposition)</p></list-item>
<list-item><p>P5: <italic>believe</italic> (corporation, some-proposition),</p></list-item>
</list>
<p>connecting both P4 and P5 to a cause group linked via a CE group to the effect group connected to P2 (<italic>agree-with</italic>).</p>
<p>In the current instantiation of the model, these processes are repeated until the retrieval phase fails to retrieve a CE group. This &#x0201C;explanation is done when retrieval fails&#x0201D; approach is a clear limitation of the model in its current state.</p>
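<p>The flow of control described above can be summarized in a toy, purely symbolic sketch. It is not the LISA implementation: retrieval here is a deterministic predicate match rather than activation-based and probabilistic, mapping is a role-by-role pairing rather than synchrony-driven, and all function names are hypothetical. The sketch only illustrates how repeated retrieve-map-infer cycles chain the preference and agreement schemas into one explanation and stop when retrieval fails.</p>
<preformat>
```python
# Propositions are tuples: (predicate, filler1, filler2).
# Each schema pairs a list of cause propositions with one effect proposition.
SCHEMAS = {
    "preference": {
        "causes": [("agree-with", "person", "corporation"),
                   ("produce", "corporation", "product")],
        "effect": ("prefer", "person", "product"),
    },
    "agreement": {
        "causes": [("believe", "entity1", "proposition"),
                   ("believe", "entity2", "proposition")],
        "effect": ("agree-with", "entity1", "entity2"),
    },
}


def retrieve(fact):
    """Return the first schema whose effect shares the fact's predicate, else None."""
    for schema in SCHEMAS.values():
        if schema["effect"][0] == fact[0]:
            return schema
    return None


def map_onto(fact, effect):
    """Pair the schema's fillers with the fact's fillers, role by role."""
    return dict(zip(effect[1:], fact[1:]))


def infer(schema, mapping):
    """Copy the schema's causes, substituting mapped fillers; unmapped fillers are copied as-is."""
    return [(cause[0],) + tuple(mapping.get(arg, arg) for arg in cause[1:])
            for cause in schema["causes"]]


def explain(explanandum):
    """Run retrieve-map-infer cycles until no pending effect retrieves a schema."""
    links = []               # each link: (list of inferred causes, effect)
    pending = [explanandum]  # facts currently treated as effects to be explained
    while pending:
        effect = pending.pop(0)
        schema = retrieve(effect)
        if schema is None:
            continue  # retrieval failed for this effect
        causes = infer(schema, map_onto(effect, schema["effect"]))
        links.append((causes, effect))
        pending.extend(causes)  # inferred causes become new effects to explain
    return links


explanation = explain(("prefer", "ministers", "Coke"))
```
</preformat>
<p>Under these assumptions the first cycle infers <italic>agree-with</italic> (ministers, corporation) and <italic>produce</italic> (corporation, Coke), and the second cycle infers the two <italic>believe</italic> propositions, after which retrieval fails and the loop halts. Note that each mapping is computed afresh within a cycle, so the sketch never needs to decide whether fillers from different schemas share a referent.</p>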
<p>What is important to point out in the preceding description of the flow of control is the model&#x00027;s solution to the 1:1 mapping problem: LISA mapped <italic>ministers</italic> to <italic>person</italic> in the context of the preference schema, and then mapped <italic>ministers</italic> to <italic>person</italic> (a completely different token) in the agreement schema. It then inferred <italic>corporation</italic> from the preference schema into the explanandum and then mapped <italic>corporation</italic> onto <italic>entity</italic> in the agreement schema. How did it &#x0201C;know&#x0201D; that <italic>corporation</italic> in the preference schema had the same referent as <italic>entity</italic> in the agreement schema, or that <italic>person</italic> in the preference schema had the same referent as <italic>person</italic> in the agreement schema? The answer is that it did not know, and it did not have to. Rather than having to make the impossible decision of whether two tokens have the same referent, LISA&#x00027;s iterative retrieve-map-infer algorithm need only decide whether two units <italic>correspond</italic>, that is, map to one another, within the confines of the <italic>current</italic> retrieve-map-infer cycle. In so doing, it side-steps the question of whether the tokens &#x0201C;have the same referent.&#x0201D; In short, LISA replaces the question &#x0201C;are they the same?&#x0201D; with the question &#x0201C;do they correspond?&#x0201D; and in so doing provides an effective solution to one particularly thorny variant of the type-token problem. Its ability to do so is a cornerstone of its ability to integrate multiple diverse sources of knowledge in LTM in the service of explaining a novel explanandum.</p>
</sec>
</sec>
<sec>
<title>Simulations</title>
<p>The model described thus far is still in an early stage of development. In order to test its potential, we ran three sets of simulations. The first two were based on elaborations of the minister/Coke example given previously, and the third was intended as a small first step toward understanding how the processes of analogy and explanation might manifest themselves in the domain of mathematical theorem proving&#x02014;specifically, G&#x000F6;del&#x00027;s First Incompleteness Theorem (GI).</p>
<sec>
<title>Simulation 1: why do ministers prefer Coke?</title>
<p>Our first simulations were designed to explore the model&#x00027;s ability to explain a novel (but nonetheless fairly mundane) explanandum, such as the assertion that ministers prefer Coke to Pepsi. In these simulations, the explanandum was the statement that ministers prefer Coke. We placed several schemas into LISA&#x00027;s LTM (see the Supplementary Material for the simulation details): (1) A (partial) <italic>preference</italic> schema stated that a person may prefer some manufacturer&#x00027;s product because they agree with (e.g., the politics of) that manufacturer. (2) A <italic>minister</italic> schema specified various properties of ministers. And (3) an <italic>agreement</italic> schema specified that if a person supports some (e.g., political) cause and some other entity supports the same cause, then that person and entity agree with one another. We also seeded LISA&#x00027;s LTM with an irrelevant story about a person driving to the beach so that we could evaluate the selectivity of the model&#x00027;s retrieval process.</p>
<p>We ran this simulation several times, and the model produced explanations of varying quality. A typical result was an explanation such as:</p>
<list list-type="simple">
<list-item><p>P1: <italic>prefer</italic> (ministers, Coke)</p></list-item>
<list-item><p>P2: <italic>support</italic> (ministers, some-cause)</p></list-item>
<list-item><p>P3: <italic>support</italic> (corporation, some-cause)</p></list-item>
<list-item><p>P4: <italic>agree-with</italic> (ministers, corporation)</p></list-item>
<list-item><p>P5: <italic>manufacture</italic> (corporation, Coke)</p></list-item>
<list-item><p><italic>cause</italic> (P2, P3) (P4)</p></list-item>
<list-item><p><italic>cause</italic> (P4, P5) (P1)</p></list-item>
</list>
<p>where P1 is the explanandum and &#x0201C;<italic>cause</italic>&#x0201D; is shorthand for a collection of cause, effect, and CE groups; the first pair of parentheses on each line enclose the P units connected to the cause group and the second pair enclose the propositions connected to the effect group. In other words, LISA inferred that: ministers support some cause (P2); the corporation that makes Coke (P5) supports the same cause (P3); these facts together cause the ministers to agree with the corporation [<italic>cause</italic> (P2, P3) (P4)]; and this agreement, along with the fact that the company manufactures Coke, causes the ministers to prefer Coke [<italic>cause</italic> (P4, P5) (P1)].</p>
<p>This explanation represents the most typical result of the pilot runs. The model sometimes also produced a truncated &#x0201C;explanation&#x0201D; in which the ministers are assumed to agree with the Coke corporation, but the model failed to infer why. This explanation obtains when the explanandum, <italic>prefer</italic> (ministers, Coke), retrieves the preference schema on the first retrieve-map-infer cycle, but fails to retrieve anything on the next cycle.</p>
<p>A third result obtained when the explanandum retrieved nothing even on its first retrieval cycle. In this case, LISA halted without generating any explanation at all (effectively saying, &#x0201C;I don&#x00027;t know&#x0201D;).</p>
<p>Finally, the model occasionally retrieved the agreement schema (rather than the preference schema) on the first retrieval attempt. In this case, because analogical mappings are relationally flexible, <italic>ministers</italic> maps to <italic>person</italic>, and <italic>Coke</italic> maps successfully (but nonsensically) to <italic>entity</italic>. In this case, the model generates the nonsensical &#x0201C;explanation&#x0201D;:</p>
<list list-type="simple">
<list-item><p>P1: <italic>prefer</italic> (ministers, Coke)</p></list-item>
<list-item><p>P2: <italic>support</italic> (ministers, some-cause)</p></list-item>
<list-item><p>P3: <italic>support</italic> (Coke, some-cause)</p></list-item>
</list>
<p>In no cases did the model retrieve completely irrelevant information from LTM (e.g., about Bill driving to the beach), illustrating that the algorithm is capable of selectively retrieving and mapping only situation-relevant information. That said, both its successes and its failures can be traced directly to the success or failure of the retrieval stage: LISA&#x00027;s mappings and inferences on a given retrieve-map-infer cycle will follow structurally from whatever it retrieves during the <italic>retrieve</italic> phase of this cycle. If it retrieves something sensible, then its inferences will be sensible; if not, then its inferences will be less sensible or even nonsensical. It is a sharp limitation of the model in its current state that it cannot evaluate, for itself, which of these is the case.</p>
</sec>
<sec>
<title>Simulation 2: why do ministers dislike Coke?</title>
<p>The results of the first simulations were informative for the purposes of exploring the model&#x00027;s properties, but in order to quantify the model&#x00027;s behavior, we ran a suite of 60 additional simulations during which we more carefully tabulated the simulation results. On these simulations, the explanandum was the statement &#x0201C;ministers dislike Coke&#x0201D; (a proxy for &#x0201C;ministers prefer Pepsi&#x0201D;). These simulations replaced the <italic>agreement</italic> schema from the first simulations with a <italic>disagreement</italic> schema and included an additional schema specifying that Coke used to contain cocaine (see the Supplementary Material for details).</p>
<p>Table <xref ref-type="table" rid="T2">2</xref> summarizes the results of Simulation 2. On 16 of the 60 runs, the model produced no explanations (by failing to retrieve anything from LTM on the first cycle), and on an additional 10 runs it made inferences without causally connecting those inferences to the explanandum. In each of the latter 10 cases, the model simply asserted that ministers disagree with some entity, and that that entity supports Coke (the beverage, not the corporation). This response is analogous to LISA declaring, &#x0201C;some people support Coke and ministers disagree with those people.&#x0201D; Inasmuch as this assertion is not an explanation of <italic>why</italic> ministers dislike Coke (or disagree with those people) it is perhaps strangely appropriate that LISA did not embed the constituent propositions inside cause and effect groups. It is also tempting to observe that this general form of &#x0201C;explanation&#x0201D;&#x02014;simply restating the original question in different terms&#x02014;is not uncommon in human interactions (e.g., answering &#x0201C;Because it&#x00027;s true!&#x0201D; when asked, &#x0201C;Why do you believe <italic>x</italic>?&#x0201D;). Part of understanding explanation is explaining the kinds of explanations people are likely to offer, including the bad ones.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p><bold>Summary of the number (<italic>n</italic>) of each category of explanation the model generated in Simulation 2</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left"><bold>Explanation category</bold></th>
<th align="left"><bold><italic>n</italic></bold></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left"><bold>Nothing (&#x0201C;I don&#x00027;t know&#x0201D;)</bold></td>
<td align="left"><bold>16</bold></td>
</tr>
<tr>
<td align="left"><bold>Assertions without causes</bold></td>
<td align="left"><bold>10</bold></td>
</tr>
<tr>
<td align="left">&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x02022; <italic>disagree-with</italic> (minister, entity), <italic>support</italic> (entity, Coke)</td>
<td/>
</tr>
<tr>
<td align="left"><bold>One causal link</bold></td>
<td align="left"><bold>8</bold></td>
</tr>
<tr>
<td align="left">&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x02022; <italic>contained</italic> (Coke, cocaine) &#x02192; <italic>dislike</italic> (minister, Coke)</td>
<td align="left">(5)</td>
</tr>
<tr>
<td align="left">&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x02022; <italic>immoral</italic> (Coke) &#x02192;<italic>dislike</italic> (minister, Coke)</td>
<td align="left">(3)</td>
</tr>
<tr>
<td align="left"><bold>Two causal links</bold></td>
<td align="left"><bold>8</bold></td>
</tr>
<tr>
<td align="left">&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x02022; (<italic>contained</italic> (Coke, cocaine), <italic>illegal</italic> (cocaine))&#x02192; <italic> dislike</italic> (minister, Coke)</td>
<td align="left">(5)</td>
</tr>
<tr>
<td align="left">&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;(<italic>contained</italic> (Coke, cocaine), <italic>manuf</italic>. (Coke-corp., Coke)) &#x02192; <italic>contained</italic>(Coke-corp.)</td>
<td/>
</tr>
<tr>
<td align="left">&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x02022; <italic>immoral</italic> (Coke)&#x02192; <italic>dislike</italic> (minister, Coke)</td>
<td align="left">(1)</td>
</tr>
<tr>
<td align="left">&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;<italic>contained</italic> (Coke, cocaine)&#x02192; <italic>immoral</italic> (Coke)</td>
<td/>
</tr>
<tr>
<td align="left">&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x02022; (<italic>contained</italic> (Coke, cocaine), <italic>illegal</italic> (cocaine))&#x02192; <italic>dislike</italic> (minister, Coke)</td>
<td align="left">(2)</td>
</tr>
<tr>
<td align="left">&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;<italic>contained</italic> (Coke, cocaine)&#x02192; <italic>contained</italic>(Coke-corp.)</td>
<td/>
</tr>
<tr>
<td align="left"><bold>Three causal links</bold></td>
<td align="left"><bold>18</bold></td>
</tr>
<tr>
<td align="left">&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x02022; <italic>immoral</italic> (Coke)&#x02192; <italic>dislike</italic> (minister, Coke)</td>
<td align="left">(15)</td>
</tr>
<tr>
<td align="left">&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;(<italic>contained</italic> (Coke, cocaine), <italic>illegal</italic> (cocaine)) &#x02192; <italic>immoral</italic> (Coke)</td>
<td/>
</tr>
<tr>
<td align="left">&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;(<italic>immoral</italic> (Coke), <italic>manufacture</italic> (Coke-corp., Coke)) &#x02192;<italic>immoral</italic> (Coke-corp.)</td>
<td/>
</tr>
<tr>
<td align="left">&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x02022; <italic>immoral</italic> (Coke)&#x02192; <italic>dislike</italic> (minister, Coke)</td>
<td align="left">(2)</td>
</tr>
<tr>
<td align="left">&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;(<italic>contained</italic> (Coke, cocaine), <italic>illegal</italic> (cocaine)) &#x02192; <italic>immoral</italic> (Coke)</td>
<td/>
</tr>
<tr>
<td align="left">&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;<italic> immoral</italic> (Coke) &#x02192; <italic>immoral</italic> (Coke-corp.)</td>
<td/>
</tr>
<tr>
<td align="left">&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x02022; (<italic>contained</italic> (Coke, cocaine), <italic>illegal</italic> (cocaine)) &#x02192;<italic> dislike</italic> (minister, Coke)</td>
<td align="left">(1)</td>
</tr>
<tr>
<td align="left">&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;<italic> illegal</italic> (cocaine) &#x02192;<italic>dislike</italic> (minister, cocaine)</td>
<td/>
</tr>
<tr>
<td align="left">&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;(<italic>contained</italic> (Coke, cocaine), <italic>manu.</italic> (Coke-corp., Coke))&#x02192; <italic>contained</italic> (Coke-corp.)</td>
<td/>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>Right-facing arrows indicate causal relations. Propositions nested within parentheses act as joint causes.</italic></p>
</table-wrap-foot>
</table-wrap>
<p>On eight runs, the model generated short explanations consisting of a single causal link: either &#x0201C;ministers dislike Coke because Coke used to contain cocaine&#x0201D; (5 runs) or &#x0201C;ministers dislike Coke because Coke is immoral&#x0201D; (3 runs). On an additional eight runs, it generated explanations consisting of two causal links, for example, &#x0201C;ministers dislike Coke because Coke is immoral, and Coke is immoral because it used to contain cocaine.&#x0201D; The remaining 18 runs resulted in explanations consisting of three causal relations. The most common (15 runs) asserted that &#x0201C;Ministers dislike Coke because Coke is immoral. Coke is immoral because it used to contain cocaine. The Coke Corporation is immoral because Coke is immoral and the Coke Corporation manufactures Coke.&#x0201D; This last statement (that the Coke Corporation is immoral) seems unnecessary to the logic of the explanation, and demonstrates that the algorithm is able to pursue causal chains that do not strictly lead to the explanandum. Interpreted colloquially, this behavior resembles the model adding a parenthetical aside (&#x0201C;Oh, by the way, this also implies that the Coke Corporation is immoral.&#x0201D;).</p>
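<p>For readers who wish to check the tallies, the category counts reported for Simulation 2 can be converted to proportions of the 60 runs with a few lines of code. The counts are those reported in Table 2; the variable names are illustrative only.</p>
<preformat>
```python
# Category counts as reported for the 60 runs of Simulation 2.
counts = {
    "no explanation": 16,
    "assertions without causes": 10,
    "one causal link": 8,
    "two causal links": 8,
    "three causal links": 18,
}

total = sum(counts.values())
proportions = {category: n / total for category, n in counts.items()}

# Runs that produced at least one causal link.
causal_runs = sum(n for category, n in counts.items() if "causal" in category)

for category, share in proportions.items():
    print(f"{category}: {share:.2f}")
```
</preformat>
<p>The counts sum to the 60 reported runs, of which 34 produced at least one causal link.</p>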
</sec>
<sec>
<title>Proof generation as explanation</title>
<p>In logic and mathematics, and the formal sciences generally, exquisite, definitive explanations are routinely provided via proofs. (This empirical fact leaves perfectly intact the important observation that in many disciplines, deduction doesn&#x00027;t explain; and indeed leaves intact the specific empirical fact that much deduction has no explanatory value in many contexts, scientifically speaking: A disjunction of <italic>P</italic> or <italic>Q</italic> follows deductively from <italic>P</italic>, and this theorem is crucial in proofs of <italic>P</italic> or <italic>not-P</italic>, but the theorem is manifestly empty in many contexts.) The Incompleteness Theorems are themselves a case in point, for until one sees how the proofs work, one cannot really understand in what senses these results are limitative, and until one understands these senses, one cannot understand what the theorems in question, in broad context, tell us.</p>
<p>The mechanisms used for analogy and explanation generation can also be implicated in the generation of logico-mathematical proofs. One might think of a proof as a type of explanation in which the successive beliefs are connected not by causal relations but by the accepted inference rules of some logical system. (Because causal and logical relations have similar syntactic forms, and because LISA&#x00027;s cause, effect and CE groups are equally suited to represent both, we will, for convenience, simply refer to such relations as &#x0201C;causal,&#x0201D; bearing in mind that causal and logical relations have important semantic differences.) The generation of such proofs, then, can be aided by analogy in at least two ways: First, the mechanism described earlier for retrieving source analogs might be used in retrieving a relevant logical inference rule. Second, analogy may be used to transfer high-level <italic>strategies</italic> from one domain to another.</p>
<p>As an example, consider the highly influential Incompleteness Theorems of Kurt G&#x000F6;del. The First Incompleteness Theorem, which we will refer to as G1, places fundamental limitations on any finitely formal theory that is expressive enough to capture ordinary arithmetic.<xref ref-type="fn" rid="fn0002"><sup>2</sup></xref> How could G&#x000F6;del have devised such a brilliant proof in the first place?</p>
<p>Of course, we can never know precisely what it was that allowed G&#x000F6;del to figure out what he did, except to note that G&#x000F6;del mustered plenty of extra-analogical innovation to accomplish his Incompleteness Theorems. At most, we can speculate based on the mathematical knowledge that was already well-known to G&#x000F6;del at the time [see Ebbinghaus et al. (<xref ref-type="bibr" rid="B5">1994</xref>), which is regarded as a description of G1 that is quite close to G&#x000F6;del&#x00027;s original work], but as we are not interested here in the details of the intellectual history of mathematical logic, we will be very brief. One plausible speculation is that the initial insight that led to G1&#x00027;s conjecturing came from an analogy to a simpler problem. G&#x000F6;del was described as having a thought that moved &#x0201C;from conjecture to conjecture,&#x0201D; even when he was not quite sure &#x0201C;how (or whether it is possible) to bridge the gap between them&#x0201D; (Wang, <xref ref-type="bibr" rid="B39">1995</xref>, p. 184). In fact, there is a suitable source analog, known as the Liar Paradox (LP). The simplest form of LP consists of a single proposition <italic>l</italic>: &#x0201C;This sentence is a lie.&#x0201D; One runs into trouble when attempting to determine the truth value of <italic>l:</italic> If it is true, then it is a lie, and therefore false; but if it is false, then since it says it is false, it&#x00027;s true. What are we to do with a sentence that seemingly is neither true nor false, but is still somehow meaningful?</p>
<p>A careful analysis of <italic>l</italic> will allow us to make some useful inferences. Since <italic>l</italic> is self-referring, we might take this to constitute a simple existence proof of a property of the English language: that it is self-referable, meaning that it allows for the formulation of a self-referring statement such as <italic>l</italic>. Our failed attempt to assign a truth value to <italic>l</italic> also tells us that English has at least one unverifiable statement. The fact that such a statement exists means, by definition, that English is <italic>logically incomplete</italic>. Let us name the three properties we just deduced of English as: <italic>self-referable</italic>, <italic>has-unprovable-statement</italic>, and <italic>incomplete</italic>, respectively.</p>
<p>The analogy that allows a high-level proof-sketch to transfer from LP to G1 can then be sketched as follows. We have a source analog consisting of the knowledge that our analysis of <italic>l</italic> provided:</p>
<list list-type="simple">
<list-item><p>P1: <italic>self-referable</italic> (English)</p></list-item>
<list-item><p>P2: <italic>has-unprovable-statement</italic> (English)</p></list-item>
<list-item><p>P3: <italic>incomplete</italic> (English)</p></list-item>
<list-item><p><italic>cause</italic> (P1) (P2)</p></list-item>
<list-item><p><italic>cause</italic> (P2) (P3)</p></list-item>
</list>
<p>We can then fill out a very minimal target analog, consisting of only a single proposition:</p>
<list list-type="simple">
<list-item><p>Q3: incomplete (&#x003A3;)</p></list-item>
</list>
<p>Here &#x003A3; represents any theory of interest; in this case &#x003A3; is ultimately going to range over all formal theories of arithmetic that have certain basic properties. That, however, is not relevant for this particular analogical match. Rather, all we are asking LISA to do is tell us: If we want to show that some theory &#x003A3; is incomplete, what do we have to do?</p>
<p>The resulting inference retrieves concepts from the domain to which &#x003A3; belongs, which in our hypothetical example is the formal logico-mathematical domain. A successful analogical match, then, will match the predicates used in our source analog to the very rigorously defined formal concepts in the target domain. Finally, the resulting set of analogical inferences gives a high-level description of how to prove Q3.</p>
<p>When we ran LISA with the source (P1&#x02026; P3, along with their causal relations) and explanandum (Q3) described above, it generated the explanation:</p>
<list list-type="simple">
<list-item><p>Q1: self-referable (&#x003A3;)</p></list-item>
<list-item><p>Q2: has-unprovable-statement (&#x003A3;)</p></list-item>
<list-item><p>Q3: incomplete (&#x003A3;)</p></list-item>
<list-item><p><italic>cause</italic> (Q1) (Q2)</p></list-item>
<list-item><p><italic>cause</italic> (Q2) (Q3)</p></list-item>
</list>
<p>It would also be plausible to assume that instead of starting with Q3 in the target, we start with nothing but Q1. We ran this simulation as well, and LISA again generated the correct explanation. Essentially, these analogical inferences would have told G&#x000F6;del that in order to show incompleteness for &#x003A3;, he would have to show that &#x003A3; has an unprovable sentence, which he would in turn be able to show by exploring whether &#x003A3; can be self-referring.<xref ref-type="fn" rid="fn0003"><sup>3</sup></xref></p>
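<p>The transfer from the LP source to the target can be illustrated, very roughly, as copying source structure while substituting the target object for the source object. The following Python is a minimal sketch under stated assumptions, not LISA&#x00027;s mapping or inference mechanism: the <italic>Prop</italic> class, the <italic>infer_target</italic> function, the ASCII name Sigma for &#x003A3;, and the convention that source label P<italic>n</italic> corresponds to target label Q<italic>n</italic> are all hypothetical and introduced only for illustration.</p>
<preformat>
```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Prop:
    """A one-place proposition such as self-referable(English)."""
    predicate: str
    argument: str


# Source analog from the Liar Paradox analysis: P1 to P3 and their causal links.
SOURCE = {
    "P1": Prop("self-referable", "English"),
    "P2": Prop("has-unprovable-statement", "English"),
    "P3": Prop("incomplete", "English"),
}
SOURCE_CAUSES = [("P1", "P2"), ("P2", "P3")]


def infer_target(source, source_causes, seed):
    """Complete the target from one seed proposition by copy with substitution."""
    # Map the seed onto the source proposition that shares its predicate.
    match = next(k for k, p in source.items() if p.predicate == seed.predicate)
    object_map = {source[match].argument: seed.argument}
    # Assumed labelling convention: source Pn corresponds to target Qn.
    label_map = {k: "Q" + k[1:] for k in source}
    target = {
        label_map[k]: Prop(p.predicate, object_map[p.argument])
        for k, p in source.items()
    }
    target_causes = [(label_map[c], label_map[e]) for c, e in source_causes]
    return target, target_causes


# Starting from Q3 alone, or from Q1 alone, yields the same completed target.
from_q3 = infer_target(SOURCE, SOURCE_CAUSES, Prop("incomplete", "Sigma"))
from_q1 = infer_target(SOURCE, SOURCE_CAUSES, Prop("self-referable", "Sigma"))
for label, prop in from_q3[0].items():
    print(label, prop.predicate, prop.argument)
```
</preformat>
<p>In this toy version the seed proposition only fixes the object correspondence (English to Sigma); the remaining propositions and both causal links are inferred from the source, which mirrors the two simulations described above only at the level of input and output.</p>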
<p>What we have just described is an overview of what has been called Analogico-Deductive Reasoning (ADR), or the combination of analogical and hypothetico-deductive reasoning (Bringsjord and Licato, <xref ref-type="bibr" rid="B2">2012</xref>; Licato et al., <xref ref-type="bibr" rid="B25">2012</xref>). In ADR, an analogical inference is used to generate some hypothesis in the target domain, and subsequent deductive reasoning is used to either deductively prove this hypothesis, refute it by deriving a contradiction, or suggest an experiment and an expected outcome of the experiment which would then either support or refute the hypothesis. ADR is just another example of how analogy can be used to generate an understanding of the world, whether through explanation, or through formal proofs, as in the LP-to-G1 example.</p>
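<p>The deductive half of ADR can be sketched in the same spirit. The Python below is an assumption-laden illustration rather than the procedure used in the cited work: it treats the inferred links as implications, closes a set of facts under modus ponens, and classifies an analogically generated hypothesis as proved, refuted, or open. Refutation is simplified here to deriving the string &#x0201C;not &#x0201D; followed by the hypothesis, and all function names are hypothetical.</p>
<preformat>
```python
def forward_chain(facts, rules):
    """Close `facts` under modus ponens for rules of the form (premises, conclusion)."""
    known = set(facts)
    changed = True
    while changed:
        changed = False
        for premises, conclusion in rules:
            if conclusion not in known and all(p in known for p in premises):
                known.add(conclusion)
                changed = True
    return known


def check_hypothesis(hypothesis, facts, rules):
    """Return 'proved', 'refuted', or 'open' for a hypothesis suggested by analogy."""
    known = forward_chain(facts, rules)
    if hypothesis in known:
        return "proved"
    if ("not " + hypothesis) in known:
        return "refuted"
    # Neither derivable: further lemmas or an experiment would be needed.
    return "open"


# The two links inferred for the target, read as implications.
RULES = [
    (("self-referable(Sigma)",), "has-unprovable-statement(Sigma)"),
    (("has-unprovable-statement(Sigma)",), "incomplete(Sigma)"),
]

print(check_hypothesis("incomplete(Sigma)", {"self-referable(Sigma)"}, RULES))
print(check_hypothesis("incomplete(Sigma)", set(), RULES))
```
</preformat>
<p>With self-referability given as a fact the hypothesis is classified as proved; with no facts it stays open, which corresponds to the case in which further work is required before the hypothesis can be settled.</p>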
<p>For a more detailed example modeling the analogy from LP to G1, see Licato et al. (<xref ref-type="bibr" rid="B26">2013</xref>). We conclude this example by noting that although analogy may have been useful in pointing G&#x000F6;del toward the insight that LP may be useful in proving G1, <italic>much</italic> more work was necessary before his proofs could be considered complete, e.g., the creation of G&#x000F6;del numbering, the formalization of effective procedures, etc.</p>
</sec>
</sec>
<sec sec-type="discussion" id="s2">
<title>Discussion</title>
<p>We have described our progress toward a process model of explanation. The model is based on a model of analogy (LISA; Hummel and Holyoak, <xref ref-type="bibr" rid="B16">1997</xref>, <xref ref-type="bibr" rid="B17">2003</xref>), reflecting our assumption that many of the core processes of explanation are also core processes of analogy making.</p>
<p>However, modeling explanation necessitates going beyond modeling analogy in at least two important respects: First, explanation, much more than analogy, depends on causal relations. We model the representation of causal relations using units representing groups of propositions (and other groups). This representational format is more explicit than simple associative links between causes and effects (as in Bayesian models, e.g., Tenenbaum et al., <xref ref-type="bibr" rid="B34">2006</xref>), but less explicit than propositions about cause and effect. It permits the model to use cause, effect, and cause-effect groups as units of both cognitive control and memory retrieval.</p>
<p>Second, explanation, unlike analogy, often requires the reasoner to integrate information from diverse sources in LTM, which in turn requires the reasoner to violate the 1:1 mapping constraint. We resolve this difficulty by serializing the process of incorporating facts from different sources in LTM.</p>
<p>Preliminary simulation results suggest that the approach is a promising one for modeling explanation and, more broadly, for the problem of understanding.</p>
<p>That said, the model is in an early stage of development, and many problems remain to be solved before we have a complete (much less correct) process model of explanation. First, we must address the problem of how a human reasoner knows when an explanation is complete. In the current model, this decision is based strictly on the failure to retrieve additional causes from LTM. This is clearly incomplete, but what is right is harder to say. Second, we must address the problem of explanation evaluation (for progress in this direction, see Thagard, <xref ref-type="bibr" rid="B35a">2001</xref>). One of the hard problems to be solved in this domain is contradiction detection: How does the cognitive architecture know when it has postulated something just plain stupid [e.g., &#x0201C;<italic>believe</italic> (Coke, some-proposition)&#x0201D;]? Third, we must include a role for elaboration in explanation: in our example problem, for instance, the model is given the knowledge that ministers are politically conservative, but the model never suggests that the source of agreement between the Coke Corporation and the ministers is one of conservative values. Such elaboration is not part of the causal-chain approach taken here, but seems to be a central component of explanation generation.</p>
<p>These issues remain serious hurdles in our attempt to understand how people generate explanations. In the meantime, we believe our current work takes us a step closer to understanding the basic processes underlying explanation, from inferences/hypotheses as simple as why ministers might dislike Coke, to conjectures about theorems that might be possible and worthwhile to prove.</p>
<sec>
<title>Conflict of interest statement</title>
<p>The Associate Editor Aron K. Barbey declares that, despite being affiliated to the same institution as author John Hummel, the review process was handled objectively and no conflict of interest exists. The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p></sec>
</sec>
</body>
<back>
<ack>
<p>This research was supported by AFOSR Grant &#x00023; FA9550-07-1-0147. We are grateful to Brian Ross, Eric Taylor, and Derek Devnich for helpful discussions about this research.</p>
</ack>
<sec sec-type="supplementary-material" id="s3">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="http://www.frontiersin.org/journal/10.3389/fnhum.2014.00867/abstract">http://www.frontiersin.org/journal/10.3389/fnhum.2014.00867/abstract</ext-link></p>
<supplementary-material xlink:href="Presentation1.PDF" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Ahn</surname> <given-names>W.-K.</given-names></name> <name><surname>Mooney</surname> <given-names>R. J.</given-names></name> <name><surname>Brewer</surname> <given-names>W. F.</given-names></name> <name><surname>DeJong</surname> <given-names>G. F.</given-names></name></person-group> (<year>1987</year>). <article-title>Schema acquisition from one example: psychological evidence for explanation-based learning</article-title>, in <source>Proceedings of the Ninth Annual Conference of the Cognitive Science Society</source> (<publisher-loc>Mahwah, NJ</publisher-loc>), <fpage>50</fpage>&#x02013;<lpage>57</lpage>.</citation>
</ref>
<ref id="B2">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Bringsjord</surname> <given-names>S.</given-names></name> <name><surname>Licato</surname> <given-names>J.</given-names></name></person-group> (<year>2012</year>). <article-title>Psychometric artificial general intelligence: the Piaget-MacGyver room</article-title>, in <source>Theoretical Foundations of Artificial General Intelligence</source>, eds <person-group person-group-type="editor"><name><surname>Wang</surname> <given-names>P.</given-names></name> <name><surname>Goertzel</surname> <given-names>B.</given-names></name></person-group> (<publisher-loc>Amsterdam</publisher-loc>: <publisher-name>Atlantis Press</publisher-name>), <fpage>25</fpage>&#x02013;<lpage>48</lpage>.</citation>
</ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cowan</surname> <given-names>N.</given-names></name></person-group> (<year>2001</year>). <article-title>The magical number 4 in short-term memory: a reconsideration of mental storage capacity</article-title>. <source>Behav. Brain Sci</source>. <volume>24</volume>, <fpage>87</fpage>&#x02013;<lpage>114</lpage>. <pub-id pub-id-type="doi">10.1017/S0140525X01003922</pub-id><pub-id pub-id-type="pmid">11515286</pub-id></citation>
</ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Doumas</surname> <given-names>L. A. A.</given-names></name> <name><surname>Hummel</surname> <given-names>J. E.</given-names></name> <name><surname>Sandhofer</surname> <given-names>C. M.</given-names></name></person-group> (<year>2008</year>). <article-title>A theory of the discovery and predication of relational concepts</article-title>. <source>Psychol. Rev</source>. <volume>115</volume>, <fpage>1</fpage>&#x02013;<lpage>43</lpage>. <pub-id pub-id-type="doi">10.1037/0033-295X.115.1.1</pub-id><pub-id pub-id-type="pmid">18211183</pub-id></citation>
</ref>
<ref id="B5">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Ebbinghaus</surname> <given-names>H.</given-names></name> <name><surname>Flum</surname> <given-names>J.</given-names></name> <name><surname>Thomas</surname> <given-names>W.</given-names></name></person-group> (<year>1994</year>). <source>Mathematical Logic</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Springer-Verlag</publisher-name>.</citation>
</ref>
<ref id="B6">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Falkenhainer</surname> <given-names>B.</given-names></name></person-group> (<year>1990</year>). <article-title>A unified approach to explanation and theory formation</article-title>, in <source>Computational Models of Scientific Discovery and Theory Formation</source>, eds <person-group person-group-type="editor"><name><surname>Shrager</surname> <given-names>J.</given-names></name> <name><surname>Langley</surname> <given-names>P.</given-names></name></person-group> (<publisher-loc>Atlanta, GA</publisher-loc>: <publisher-name>Morgan Kaufmann</publisher-name>), <fpage>157</fpage>&#x02013;<lpage>196</lpage>.</citation>
</ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Falkenhainer</surname> <given-names>B.</given-names></name> <name><surname>Forbus</surname> <given-names>K. D.</given-names></name> <name><surname>Gentner</surname> <given-names>D.</given-names></name></person-group> (<year>1989</year>). <article-title>The structure-mapping engine: algorithm and examples</article-title>. <source>Artif. Intell</source>. <volume>41</volume>, <fpage>1</fpage>&#x02013;<lpage>63</lpage>. <pub-id pub-id-type="doi">10.1016/0004-3702(89)90077-5</pub-id></citation>
</ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Friedman</surname> <given-names>S.</given-names></name> <name><surname>Forbus</surname> <given-names>K.</given-names></name></person-group> (<year>2008</year>). <article-title>Learning qualitative causal models via generalization and quantity analysis</article-title>. <source>Proc. Cogn. Sci. Soc</source>.</citation>
</ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gentner</surname> <given-names>D.</given-names></name></person-group> (<year>1983</year>). <article-title>Structure-mapping: a theoretical framework for analogy</article-title>. <source>Cogn. Sci</source>. <volume>7</volume>, <fpage>155</fpage>&#x02013;<lpage>170</lpage>. <pub-id pub-id-type="doi">10.1207/s15516709cog0702_3</pub-id></citation>
</ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gentner</surname> <given-names>D.</given-names></name> <name><surname>Medina</surname> <given-names>J.</given-names></name></person-group> (<year>1998</year>). <article-title>Similarity and the development of rules</article-title>. <source>Cognition</source> <volume>65</volume>, <fpage>263</fpage>&#x02013;<lpage>297</lpage>. <pub-id pub-id-type="doi">10.1016/S0010-0277(98)00002-X</pub-id><pub-id pub-id-type="pmid">9557385</pub-id></citation>
</ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gick</surname> <given-names>M. L.</given-names></name> <name><surname>Holyoak</surname> <given-names>K. J.</given-names></name></person-group> (<year>1980</year>). <article-title>Analogical problem solving</article-title>. <source>Cogn. Psychol</source>. <volume>12</volume>, <fpage>306</fpage>&#x02013;<lpage>355</lpage>. <pub-id pub-id-type="doi">10.1016/0010-0285(80)90013-4</pub-id></citation>
</ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gick</surname> <given-names>M. L.</given-names></name> <name><surname>Holyoak</surname> <given-names>K. J.</given-names></name></person-group> (<year>1983</year>). <article-title>Schema induction and analogical transfer</article-title>. <source>Cogn. Psychol</source>. <volume>15</volume>, <fpage>1</fpage>&#x02013;<lpage>38</lpage>. <pub-id pub-id-type="doi">10.1016/0010-0285(83)90002-6</pub-id></citation>
</ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Holyoak</surname> <given-names>K. J.</given-names></name> <name><surname>Thagard</surname> <given-names>P.</given-names></name></person-group> (<year>1989</year>). <article-title>Analogical mapping by constraint satisfaction</article-title>. <source>Cogn. Sci</source>. <volume>13</volume>, <fpage>295</fpage>&#x02013;<lpage>355</lpage>. <pub-id pub-id-type="doi">10.1207/s15516709cog1303_1</pub-id></citation>
</ref>
<ref id="B14">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Holyoak</surname> <given-names>K. J.</given-names></name> <name><surname>Thagard</surname> <given-names>P.</given-names></name></person-group> (<year>1995</year>). <source>Mental Leaps: Analogy in Creative Thought</source>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>.</citation>
</ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hummel</surname> <given-names>J. E.</given-names></name></person-group> (<year>2011</year>). <article-title>Getting symbols out of a neural architecture</article-title>. <source>Connect. Sci</source>. <volume>23</volume>, <fpage>109</fpage>&#x02013;<lpage>118</lpage>. <pub-id pub-id-type="doi">10.1080/09540091.2011.569880</pub-id></citation>
</ref>
<ref id="B15a">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hummel</surname> <given-names>J. E.</given-names></name> <name><surname>Biederman</surname> <given-names>I.</given-names></name></person-group> (<year>1992</year>). <article-title>Dynamic binding in a neural network for shape recognition</article-title>. <source>Psychol. Rev</source>. <volume>99</volume>, <fpage>480</fpage>&#x02013;<lpage>517</lpage>. <pub-id pub-id-type="doi">10.1037/0033-295X.99.3.480</pub-id><pub-id pub-id-type="pmid">1502274</pub-id></citation>
</ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hummel</surname> <given-names>J. E.</given-names></name> <name><surname>Holyoak</surname> <given-names>K. J.</given-names></name></person-group> (<year>1997</year>). <article-title>Distributed representations of structure: a theory of analogical access and mapping</article-title>. <source>Psychol. Rev</source>. <volume>104</volume>, <fpage>427</fpage>&#x02013;<lpage>466</lpage>. <pub-id pub-id-type="doi">10.1037/0033-295X.104.3.427</pub-id></citation>
</ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hummel</surname> <given-names>J. E.</given-names></name> <name><surname>Holyoak</surname> <given-names>K. J.</given-names></name></person-group> (<year>2003</year>). <article-title>A symbolic-connectionist theory of relational inference and generalization</article-title>. <source>Psychol. Rev</source>. <volume>110</volume>, <fpage>220</fpage>&#x02013;<lpage>264</lpage>. <pub-id pub-id-type="doi">10.1037/0033-295X.110.2.220</pub-id><pub-id pub-id-type="pmid">12747523</pub-id></citation>
</ref>
<ref id="B18">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Hummel</surname> <given-names>J. E.</given-names></name> <name><surname>Landy</surname> <given-names>D. H.</given-names></name></person-group> (<year>2009</year>). <article-title>From analogy to explanation: relaxing the 1:1 mapping constraint&#x02026; very carefully</article-title>, in <source>New Frontiers in Analogy Research: Proceedings of the Second International Conference on Analogy</source>, eds <person-group person-group-type="editor"><name><surname>Kokinov</surname> <given-names>B.</given-names></name> <name><surname>Holyoak</surname> <given-names>K.</given-names></name> <name><surname>Gentner</surname> <given-names>D.</given-names></name></person-group> (<publisher-loc>Sofia</publisher-loc>).</citation>
</ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hummel</surname> <given-names>J. E.</given-names></name> <name><surname>Landy</surname> <given-names>D. H.</given-names></name> <name><surname>Devnich</surname> <given-names>D.</given-names></name></person-group> (<year>2008</year>). <article-title>Toward a process model of explanation with implications for the binding problem</article-title>, in <source>Naturally Inspired AI: Papers from the AAAI Fall Symposium</source>. Technical Report FS-08-06, <fpage>79</fpage>&#x02013;<lpage>86</lpage>.</citation>
</ref>
<ref id="B20">
<citation citation-type="book"><person-group person-group-type="editor"><name><surname>Josephson</surname> <given-names>J. R.</given-names></name> <name><surname>Josephson</surname> <given-names>S. G.</given-names></name></person-group> (eds.). (<year>1994</year>). <source>Abductive Inference: Computation, Philosophy, Technology</source>. <publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>.</citation>
</ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Keil</surname> <given-names>F. C.</given-names></name></person-group> (<year>2006</year>). <article-title>Explanation and understanding</article-title>. <source>Annu. Rev. Psychol</source>. <volume>57</volume>, <fpage>227</fpage>&#x02013;<lpage>254</lpage>. <pub-id pub-id-type="doi">10.1146/annurev.psych.57.102904.190100</pub-id><pub-id pub-id-type="pmid">16318595</pub-id></citation>
</ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Knowlton</surname> <given-names>B. J.</given-names></name> <name><surname>Morrison</surname> <given-names>R. G.</given-names></name> <name><surname>Hummel</surname> <given-names>J. E.</given-names></name> <name><surname>Holyoak</surname> <given-names>K. J.</given-names></name></person-group> (<year>2012</year>). <article-title>A neurocomputational system for relational reasoning</article-title>. <source>Trends Cogn. Sci</source>. <volume>16</volume>, <fpage>373</fpage>&#x02013;<lpage>381</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2012.06.002</pub-id><pub-id pub-id-type="pmid">22717468</pub-id></citation>
</ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kuehne</surname> <given-names>S.</given-names></name> <name><surname>Gentner</surname> <given-names>D.</given-names></name> <name><surname>Forbus</surname> <given-names>K.</given-names></name></person-group> (<year>2000</year>). <article-title>Modeling infant learning via symbolic structural alignment</article-title>. <source>Proc. Cogn. Sci. Soc</source>.</citation>
</ref>
<ref id="B24">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Landy</surname> <given-names>D. H.</given-names></name> <name><surname>Hummel</surname> <given-names>J. E.</given-names></name></person-group> (<year>2010</year>). <article-title>Explanatory reasoning for inductive confidence</article-title>, in <source>Proceedings of the 32nd Annual Conference of the Cognitive Science Society</source> (<publisher-loc>Mahwah, NJ</publisher-loc>).</citation>
</ref>
<ref id="B25">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Licato</surname> <given-names>J.</given-names></name> <name><surname>Bringsjord</surname> <given-names>S.</given-names></name> <name><surname>Hummel</surname> <given-names>J. E.</given-names></name></person-group> (<year>2012</year>). <article-title>Exploring the role of analogico-deductive reasoning in the balance-beam task</article-title>, in <source>Rethinking Cognitive Development: Proceedings of the 42nd Annual Meeting of the Jean Piaget Society</source> (<publisher-loc>Toronto</publisher-loc>).</citation>
</ref>
<ref id="B26">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Licato</surname> <given-names>J.</given-names></name> <name><surname>Govindarajulu</surname> <given-names>N. S.</given-names></name> <name><surname>Bringsjord</surname> <given-names>S.</given-names></name> <name><surname>Pomeranz</surname> <given-names>M.</given-names></name> <name><surname>Gittelson</surname> <given-names>L.</given-names></name></person-group> (<year>2013</year>). <article-title>Analogico-deductive generation of G&#x000F6;del&#x00027;s first incompleteness theorem from the liar paradox</article-title>, in <source>Proceedings of the 2013 International Joint Conference on Artificial Intelligence (IJCAI 2013)</source> (<publisher-loc>Beijing</publisher-loc>).</citation>
</ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lombrozo</surname> <given-names>T.</given-names></name> <name><surname>Carey</surname> <given-names>S.</given-names></name></person-group> (<year>2006</year>). <article-title>Functional explanation and the function of explanation</article-title>. <source>Cognition</source> <volume>99</volume>, <fpage>167</fpage>&#x02013;<lpage>204</lpage>. <pub-id pub-id-type="doi">10.1016/j.cognition.2004.12.009</pub-id><pub-id pub-id-type="pmid">15939416</pub-id></citation>
</ref>
<ref id="B28">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Magnani</surname> <given-names>L.</given-names></name></person-group> (<year>2009</year>). <source>Abductive Cognition: The Epistemological and Eco-cognitive Dimensions of Hypothetical Reasoning</source>. <publisher-loc>Berlin</publisher-loc>: <publisher-name>Springer</publisher-name>.</citation>
</ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Markman</surname> <given-names>A. B.</given-names></name></person-group> (<year>1997</year>). <article-title>Constraints on analogical inference</article-title>. <source>Cogn. Sci</source>. <volume>21</volume>, <fpage>373</fpage>&#x02013;<lpage>418</lpage>. <pub-id pub-id-type="doi">10.1207/s15516709cog2104_1</pub-id></citation>
</ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Patalano</surname> <given-names>A. L.</given-names></name> <name><surname>Chin-Parker</surname> <given-names>S.</given-names></name> <name><surname>Ross</surname> <given-names>B. H.</given-names></name></person-group> (<year>2006</year>). <article-title>The importance of being coherent: category coherence, cross-classification, and reasoning</article-title>. <source>J. Mem. Lang</source>. <volume>54</volume>, <fpage>407</fpage>&#x02013;<lpage>424</lpage>. <pub-id pub-id-type="doi">10.1016/j.jml.2005.10.005</pub-id></citation>
</ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Penn</surname> <given-names>D. C.</given-names></name> <name><surname>Holyoak</surname> <given-names>K. J.</given-names></name> <name><surname>Povinelli</surname> <given-names>D. J.</given-names></name></person-group> (<year>2008</year>). <article-title>Darwin&#x00027;s mistake: explaining the discontinuity between human and nonhuman minds</article-title>. <source>Behav. Brain Sci</source>. <volume>31</volume>, <fpage>109</fpage>&#x02013;<lpage>178</lpage>. <pub-id pub-id-type="doi">10.1017/S0140525X08003543</pub-id><pub-id pub-id-type="pmid">18479531</pub-id></citation>
</ref>
<ref id="B32">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Rescorla</surname> <given-names>R. A.</given-names></name> <name><surname>Wagner</surname> <given-names>A. R.</given-names></name></person-group> (<year>1972</year>). <article-title>A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement</article-title>, in <source>Classical Conditioning II: Current Research and Theory</source>, eds <person-group person-group-type="editor"><name><surname>Black</surname> <given-names>A. H.</given-names></name> <name><surname>Prokasy</surname> <given-names>W. F.</given-names></name></person-group> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Appleton-Century-Crofts</publisher-name>), <fpage>64</fpage>&#x02013;<lpage>99</lpage>.</citation>
</ref>
<ref id="B33">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Sloman</surname> <given-names>S.</given-names></name></person-group> (<year>2005</year>). <source>Causal Models: How People Think about the World and its Alternatives</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>.</citation>
</ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tenenbaum</surname> <given-names>J. B.</given-names></name> <name><surname>Griffiths</surname> <given-names>T. L.</given-names></name> <name><surname>Kemp</surname> <given-names>C.</given-names></name></person-group> (<year>2006</year>). <article-title>Theory-based Bayesian models of inductive learning and reasoning</article-title>. <source>Trends Cogn. Sci</source>. <volume>10</volume>, <fpage>309</fpage>&#x02013;<lpage>318</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2006.05.009</pub-id><pub-id pub-id-type="pmid">16797219</pub-id></citation>
</ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thagard</surname> <given-names>P.</given-names></name></person-group> (<year>1989</year>). <article-title>Explanatory coherence</article-title>. <source>Behav. Brain Sci</source>. <volume>12</volume>, <fpage>435</fpage>&#x02013;<lpage>467</lpage>. <pub-id pub-id-type="doi">10.1017/S0140525X00057046</pub-id></citation>
</ref>
<ref id="B35a">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Thagard</surname> <given-names>P.</given-names></name></person-group> (<year>2001</year>). <article-title>How to make decisions: coherence, emotion, and practical inference</article-title>, in <source>Varieties of Practical Inference</source>, ed <person-group person-group-type="editor"><name><surname>Millgram</surname> <given-names>E.</given-names></name></person-group> (<publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>), <fpage>355</fpage>&#x02013;<lpage>371</lpage>.</citation>
</ref>
<ref id="B36">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Thagard</surname> <given-names>P.</given-names></name></person-group> (<year>2012</year>). <source>The Cognitive Science of Science: Explanation, Discovery, and Conceptual Change</source>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>.</citation>
</ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>VanLehn</surname> <given-names>K.</given-names></name> <name><surname>Jones</surname> <given-names>R. M.</given-names></name> <name><surname>Chi</surname> <given-names>M. T. H.</given-names></name></person-group> (<year>1992</year>). <article-title>A model of the self-explanation effect</article-title>. <source>J. Learn. Sci</source>. <volume>2</volume>, <fpage>1</fpage>&#x02013;<lpage>59</lpage>. <pub-id pub-id-type="doi">10.1207/s15327809jls0201_1</pub-id><pub-id pub-id-type="pmid">20078759</pub-id></citation>
</ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vosniadou</surname> <given-names>S.</given-names></name> <name><surname>Brewer</surname> <given-names>W. F.</given-names></name></person-group> (<year>1987</year>). <article-title>Theories of knowledge restructuring in development</article-title>. <source>Rev. Educ. Res</source>. <volume>57</volume>, <fpage>51</fpage>&#x02013;<lpage>67</lpage>. <pub-id pub-id-type="doi">10.3102/00346543057001051</pub-id></citation>
</ref>
<ref id="B39">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>H.</given-names></name></person-group> (<year>1995</year>). <article-title>On &#x02018;computabilism&#x02019; and physicalism: some subproblems</article-title>, in <source>Nature&#x00027;s Imagination: The Frontiers of Scientific Vision</source>, ed <person-group person-group-type="editor"><name><surname>Cornwell</surname> <given-names>J.</given-names></name></person-group> (<publisher-loc>Oxford, UK</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>), <fpage>161</fpage>&#x02013;<lpage>189</lpage>.</citation>
</ref>
</ref-list>
<fn-group>
<fn id="fn0001"><p><sup>1</sup>In practice, it is difficult to distinguish inductive inferences from deductive inferences drawn from faulty axioms. If you misidentify a car in the parking lot as &#x0201C;your car,&#x0201D; is this an inductive inference (which is not guaranteed to be right, even given correct premises and a syntactically valid inferential form) or a (valid) deductive inference from faulty premises? In this paper, we shall refer to inferences of this kind, and of the kind underlying analogy and explanation in general, as <italic>inductive</italic> to emphasize that they are not guaranteed to be right and that they are typically based more on specific experiences than on a priori axioms.</p></fn>
<fn id="fn0002"><p><sup>2</sup>We leave out a more precise description of G&#x000F6;del&#x00027;s Incompleteness Theorems because they are not directly relevant to the purposes of this paper.</p></fn>
<fn id="fn0003"><p><sup>3</sup>Note that self-reference is not the only way to show incompleteness, or even the only way to get a version of the Liar Paradox. For example, suppose one has the two sentences:</p>
<p>The sentence below is false.</p>
<p>The sentence above is true.</p>
<p>These two sentences form a paradox that is similar to LP, without exploiting (direct) self-reference. Here, however, we are only concerned with one route that G&#x000F6;del might well have used to discover incompleteness.</p></fn>
</fn-group>
</back>
</article>
