<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Neurosci.</journal-id>
<journal-title>Frontiers in Neuroscience</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Neurosci.</abbrev-journal-title>
<issn pub-type="epub">1662-453X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fnins.2022.1070645</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Neuroscience</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Robust tactile object recognition in open-set scenarios using Gaussian prototype learning</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Zheng</surname> <given-names>Wendong</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/796071/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Liu</surname> <given-names>Huaping</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/134240/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Guo</surname> <given-names>Di</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Sun</surname> <given-names>Fuchun</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Computer Science and Technology, Tsinghua University</institution>, <addr-line>Beijing</addr-line>, <country>China</country></aff>
<aff id="aff2"><sup>2</sup><institution>State Key Laboratory of Intelligent Technology and Systems, Beijing National Research Center for Information Science and Technology, Tsinghua University</institution>, <addr-line>Beijing</addr-line>, <country>China</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Chenwei Deng, Beijing Institute of Technology, China</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Yunkai Li, Tianjin University, China; Ling Weng, Hebei University of Technology, China</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Huaping Liu <email>hpliu&#x00040;tsinghua.edu.cn</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Perception Science, a section of the journal Frontiers in Neuroscience</p></fn></author-notes>
<pub-date pub-type="epub">
<day>28</day>
<month>12</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>16</volume>
<elocation-id>1070645</elocation-id>
<history>
<date date-type="received">
<day>15</day>
<month>10</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>21</day>
<month>11</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2022 Zheng, Liu, Guo and Sun.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Zheng, Liu, Guo and Sun</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license> </permissions>
<abstract>
<p>Tactile object recognition is crucial for effective grasping and manipulation. Recently, it has started to attract increasing attention in robotic applications. While there are many works on tactile object recognition and they also achieved promising performances in some applications, most of them are usually limited to closed world scenarios, where the object instances to be recognition in deployment are known and the same as that of during training. Since robots usually operate in realistic open-set scenarios, they inevitably encounter unknown objects. If automation systems falsely recognize unknown objects as one of the known classes based on the pre-trained model, it can lead to potentially catastrophic consequences. It motivates us to break the closed world assumption and to study tactile object recognition in realistic open-set conditions. Although several open-set recognition methods have been proposed, they focused on visual tasks and may not be suitable for tactile recognition. It is mainly due to that these methods do not take into account the special characteristic of tactile data in their models. To this end, we develop a novel Gaussian Prototype Learning method for robust tactile object recognition. Particularly, the proposed method converts feature distributions to probabilistic representations, and exploit uncertainty for tactile recognition in open-set scenarios. Experiments on the two tactile recognition benchmarks demonstrate the effectiveness of the proposed method on open-set tasks.</p></abstract>
<kwd-group>
<kwd>tactile perception</kwd>
<kwd>object recognition</kwd>
<kwd>open-set recognition</kwd>
<kwd>Gaussian prototype learning</kwd>
<kwd>tactile object recognition</kwd>
</kwd-group>
<counts>
<fig-count count="9"/>
<table-count count="3"/>
<equation-count count="11"/>
<ref-count count="48"/>
<page-count count="14"/>
<word-count count="8950"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>1. Introduction</title>
<p>Object recognition is a prerequisite for robotic dexterous manipulations, which is the cornerstone of many robotic applications (Li et al., <xref ref-type="bibr" rid="B25">2018</xref>; Qiao et al., <xref ref-type="bibr" rid="B31">2021</xref>). For example, a robot needs to know the category of an object for selecting a suitable interaction pattern or manipulation strategy during exploring the surroundings or performing manipulation (He et al., <xref ref-type="bibr" rid="B19">2020</xref>; Zheng et al., <xref ref-type="bibr" rid="B45">2020a</xref>). Therefore, how to effectively realize object recognition has recently attracted widespread attention in robotic research fields.</p>
<p>Since tactile sensing is an effective way of perceiving some physical properties of the manipulated objects through physical interaction (Luo et al., <xref ref-type="bibr" rid="B28">2017</xref>), it has been extensively used in robotic tasks involving object recognition, material identification (Zheng et al., <xref ref-type="bibr" rid="B46">2019</xref>), texture recognition and robotic grasp detection (Guo et al., <xref ref-type="bibr" rid="B14">2021</xref>). Liu and Sun (<xref ref-type="bibr" rid="B26">2017</xref>) proposed a tactile recognition method for classify material identification. Xu et al. (<xref ref-type="bibr" rid="B39">2013</xref>) proposed a tactile identification method with Bayesian exploration. Kerr et al. (<xref ref-type="bibr" rid="B22">2018</xref>) used tactile data to classify the materials with the BioTAC sensor. In addition, tactile information is used as an effective complement of visual information for robotic tasks. In Liu et al. (<xref ref-type="bibr" rid="B27">2016</xref>), a novel visual-tactile fusion method was proposed for object recognition using joint group kernel sparse coding. Guo et al. (<xref ref-type="bibr" rid="B15">2017</xref>) adopted tactile information as an important complement of visual information for the robotic grasp detection task. These works have shown that tactile perception plays a significant role in robotic recognition tasks.</p>
<p>While there are many works on tactile recognition and they have been demonstrated to be effective for some specific applications (Yi et al., <xref ref-type="bibr" rid="B41">2020</xref>), they have mainly focused on constructing predictive models to classify predefined and fixed object classes in closed-set scenarios, assuming that the classes seen in testing must have appeared in training. In fact, such an assumption is usually violated in actual robotic applications (Zheng et al., <xref ref-type="bibr" rid="B47">2020b</xref>). This is mainly due to that robots are commonly deployed in realistic unconstrained environments, where objects of unknown classes are regularly encountered. When observing an unknown object, these closed-set classification methods incorrectly categorize it as one of the known classes with high confidence. As classifier prediction in robotic applications can trigger some kind of costly robotic action, such misclassification can be catastrophic and is often not acceptable. Thus, it is necessary to investigate robust tactile recognition in open-set scenarios, which is also referred to open-set tactile recognition. The schematic is shown in <xref ref-type="fig" rid="F1">Figure 1</xref>, where robots should have the dual ability of unknown detection and known classification.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>The schematic of open-set tactile object recognition. Some images are from <ext-link ext-link-type="uri" xlink:href="https://sites.gatech.edu/hrl/mr-gan/">https://sites.gatech.edu/hrl/mr-gan/</ext-link>.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnins-16-1070645-g0001.tif"/>
</fig>
<p>To the best of our knowledge, tactile object recognition of open-set scenarios is still unexplored research in the robotic field. Similar to other open-set recognition, open-set tactile recognition also faces the core challenge of how to not only correctly classify samples from the known classes but also effectively detect and flag unknown examples as the novel. Many methods have been proposed to handle this problem in the literature. The mainstream methods attempt to utilize thresholding to reformulate open-set recognition as a closed-set classifier. As feature distribution of training samples is not explicitly considered in their learning objectives, the learned features generally have excessive intra-class variance. The inter-class distance can even be smaller than the intra-class distance in the learned feature space. This makes it difficult to set an appropriate threshold that well separates known from unknown.</p>
<p>In addition, another technical solution aims to collect unknown samples for training a (<italic>K</italic>&#x0002B;1)-class model, where <italic>K</italic> is the number of known categories and all unknowns are treated as an additional category. The strategy is simple and intuitive, but it usually requires large-scale training data to represent the large numbers of unknowns in open scenarios. However, collecting sufficient tactile data is difficult for training due to the complex collection process and constraints of robot-object physical interactions. Hence, constructing an effective model for open-set tactile recognition is still an open question.</p>
<p>As we know, humans can effectively recognize objects in open environments based on template or prototype matching. Motivated by the recognition mechanism, we propose an uncertainty estimation model for open-set tactile object recognition in this work. The framework consists of two main components, which are the feature extractor and the class prototypes. The feature extractor simulates the perception system of humans for transforming the raw sensing data into abstract representations. Moreover, the prototypes for each category serve as abstract memories of the corresponding category in the brain. By matching the tactile features (abstract representation) with prototypes (classes memories), the proposed model performs object recognition. During inference, if the feature of a test tactile sample can not match well with all the prototypes of the known classes, it will be considered as the unknown.</p>
<p>To this end, the learned features of each known class are characterized by a Gaussian distribution in our framework. As known samples follow the prior distributions, those test samples located in low probability regions will be recognized as unknown by the model. Meanwhile, for the test samples from known classes, the model will compute its probabilities over all known classes and classify it as the class with the highest probability. To explicitly enforce training samples following Gaussian distributions, we introduce a likelihood regularization term to the classification discriminant function during training. In addition, we further add a classification margin to make each cluster more compact and further improve the generalization of the model. The main contributions are summarized as follows:</p>
<list list-type="order">
<list-item><p>In this paper, we specifically address tactile object recognition in open-set scenarios. To this end, a novel Gaussian Prototype Learning method is proposed, which is suitable for both unknown detection and known classification.</p></list-item>
<list-item><p>We introduce a likelihood regularization term to explicitly enforce training samples following Gaussian distributions. In addition, we further introduce a classification margin to make each cluster more compact, which is more beneficial for unknown detection.</p></list-item>
<list-item><p>We perform comprehensive experimental evaluations of our proposed method on publicly available tactile datasets. The experimental results demonstrate the effectiveness of the proposed method.</p></list-item>
</list>
<p>Please note that our proposed open-set tactile recognition is not just a matter of the robot filling in gaps in its knowledge base. Instead, we aim to enable robots will be able to continually expand the scope of the knowledge to learn new unknown classes over time in an active learning manner. That is to say, at any particular point in time the model needs to be able to detect and reject unseen data belonging to unknown tasks or classes. These unknown data could be utilized and learned with another algorithm in some human-in-the-loop system at a later stage. We believe that this research will aid in active learning and continual learning in open-set conditions, which can serve as the first step toward building lifelong robot tactile recognition systems.</p>
<p>In the following, related works are briefly reviewed in Section 2. In Section 3, we describe the problem of tactile open-set object recognition. Section 4 details the framework architecture and learning model of the proposed method. The experimental results and analysis are given in Section 5.</p>
</sec>
<sec id="s2">
<title>2. Related work</title>
<p>In this section, the main related works are briefly reviewed from two aspects: <italic>tactile object recognition</italic> and <italic>open-Set Recognition</italic>.</p>
<sec>
<title>2.1. Tactile object recognition</title>
<p>Object recognition is a fundamental perceptual capability for many robot applications (Meyer et al., <xref ref-type="bibr" rid="B29">2020</xref>). While vision enables robots to have excellent visual recognition capabilities (Deng et al., <xref ref-type="bibr" rid="B6">2019</xref>; Han et al., <xref ref-type="bibr" rid="B17">2019</xref>), it is not always effective for object recognition in practical tasks (Yang et al., <xref ref-type="bibr" rid="B40">2019</xref>). This is mainly due to that objects of similar appearance can have very different physical properties, which may not be easily obtained visually (Deng et al., <xref ref-type="bibr" rid="B7">2022</xref>). Tactile sensing is an important perception modality, of which the interactive nature allows it to convey rich and diverse tactile information, such as texture, roughness, or stiffness (Li et al., <xref ref-type="bibr" rid="B24">2020</xref>). It is crucial for robots to explore and learn the mechanical properties of manipulated objects, especially when interacting with unknown objects in practical environments.</p>
<p>Considering its effectiveness in perceiving environments, tactile information has been extensively adopted in a variety of robot recognition tasks. Liu and Sun (<xref ref-type="bibr" rid="B26">2017</xref>) proposed a tactile material recognition model with semantic labels, which improved the identification performance. Kerr et al. (<xref ref-type="bibr" rid="B22">2018</xref>) utilized BioTAC sensor to collect tactile data, and then these data are used to classify the materials. Yuan et al. (<xref ref-type="bibr" rid="B43">2018</xref>) used GelSight tactile sensor to recognize 11 properties of the clothes, which aim to help the robot understand their material properties. Based on a hybrid touch approach, Taunyazov et al. (<xref ref-type="bibr" rid="B36">2019</xref>) developed an effective tactile identification framework for texture classification. More recently, Gu et al. (<xref ref-type="bibr" rid="B13">2020</xref>) proposed an event-based tactile object recognition method with a spiking graph neural network using electronic skins.</p>
<p>Although the mentioned tactile-based recognition methods have been successfully applied in some specific robotic tasks, most of them are deployed under a closed-set condition. Such a closed-set scenario is practically unfeasible in robotic applications. Robots commonly are deployed in open environments, where they will often come across new types of objects. Recently, Abderrahmane et al. (<xref ref-type="bibr" rid="B1">2018</xref>, <xref ref-type="bibr" rid="B2">2019</xref>) proposed a tactile recognition framework, which can recognize both known as well as novel objects. Nevertheless, this framework still did not explicitly consider the nature of open-set. In particular, the set of novel classes that can be recognized must be known in advance in the framework. Moreover, it relied on the hypothesis that attributes learned from the training seen-classes are shared by the testing unseen-classes. Obviously, they are potential drawbacks in practice applications. Consequently, existing methods are not suitable for open-set tactile object recognition.</p>
</sec>
<sec>
<title>2.2. Open-set recognition</title>
<p>Open-set tactile recognition faces the core challenge is how to not only correctly classify samples from the known classes but also effectively detect and flag unknown examples as the novel. Traditional closed-set classification models may not work in open-set problems because they often predict high confidence for inputs that are significantly different from the training classes (Wang et al., <xref ref-type="bibr" rid="B37">2022</xref>). To tackle this challenge, a variety of related methods have been proposed in the literature. An intuitive method is to use closed-set classifier to solve open-set recognition by setting rejection threshold, such as 1-vs-set SVM (Scheirer et al., <xref ref-type="bibr" rid="B32">2012</xref>), SROSR (Zhang and Patel, <xref ref-type="bibr" rid="B44">2016</xref>), NNO (Bendale and Boult, <xref ref-type="bibr" rid="B3">2015</xref>), DOC (Shu et al., <xref ref-type="bibr" rid="B33">2017</xref>), and CROSR (Dhamija et al., <xref ref-type="bibr" rid="B9">2018</xref>). Exploring this idea, Scheirer et al. (<xref ref-type="bibr" rid="B32">2012</xref>) proposed 1-vs-Set model based on SVM to detect unknown samples by adding an extra hyper-line. Bendale and Boult (<xref ref-type="bibr" rid="B3">2015</xref>) extended Nearest Class Mean (NCM) classifier to open-set conditions, establishing a Nearest Non-Outlier (NNO) algorithm. Recently, Bendale and Boult (<xref ref-type="bibr" rid="B4">2016</xref>) proposed to use the Openmax layer to replace the Softmax layer in deep neural networks. This method redistributes the probability distribution of Softmax to obtain the class probability of unknown samples.</p>
<p>As most of these models ignore constructing reasonable feature distribution for different classes, the learned features generally have excessive intra-class variance (Han et al., <xref ref-type="bibr" rid="B16">2017</xref>). The inter-class distance can even be smaller than the intra-class distance in the learned feature space. As a consequence, it is hard to select an appropriate threshold that well separates known from unknowns. Moreover, feature distribution of training samples is not explicitly considered in their learning objectives, which will limit the performance of the model to detect unknown samples.</p>
<p>Another technical route is to collect or synthesize examples of extra classes for representing unknowns. Along this line, G-OpenMax (Ge et al., <xref ref-type="bibr" rid="B11">2017</xref>) proposed to train a generator for synthesizing examples that represent all unknown classes for model training. Neal et al. (<xref ref-type="bibr" rid="B30">2018</xref>) developed counterfactual image generation, which aimed to generate extra class image samples that cannot be classified into any known class. Since the complex collection process and operation constraints, it is difficult to acquire large amounts of tactile data for unknown. Therefore, it is unfeasible to learn an effective model with limited training data for generating sufficient samples to represent unknowns.</p>
</sec>
</sec>
<sec id="s3">
<title>3. Problem formulation</title>
<p>In this work, we aim to realize robotic tactile object recognition in open-set scenarios. The goal is to endow robots with an effective mechanism to detect samples from unknown classes that may be encountered during testing, which are not available to be seen in training. To accomplish this goal, the tactile open-set recognition model is able to (i) correctly classify known tactile inputs (i.e., classes from the training set) and (ii) effectively detect unknown tactile classes (i.e., classes not exposed in the training set).</p>
<p>Let us formalize the problem described above. Given a tactile training dataset <inline-formula><mml:math id="M1"><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mo>}</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>M</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula>, where <inline-formula><mml:math id="M2"><mml:msub><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> denotes a training tactile sample, <italic>y</italic><sub><italic>i</italic></sub>&#x02208;<italic>Y</italic> &#x0003D; {1, 2, &#x02026;., <italic>K</italic>} is the corresponding class label and <italic>M</italic> denotes the number of training samples. The testing dataset <inline-formula><mml:math id="M3"><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mi>e</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mo>}</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula> where <inline-formula><mml:math id="M4"><mml:msub><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>, <inline-formula><mml:math id="M5"><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>,</mml:mo><mml:mi>K</mml:mi><mml:mo>,</mml:mo><mml:mi>K</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>,</mml:mo><mml:msup><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mo>}</mml:mo></mml:mrow></mml:math></inline-formula> (<italic>k</italic>&#x02032;&#x0003E;<italic>K</italic>) and <italic>N</italic> is the number of testing samples. Here, {<italic>k</italic>&#x0002B;1, &#x02026;., <italic>k</italic>&#x02032;} denotes the set of unknown categories, which is referred to as novelty and uniformly denoted as <italic>Y</italic><sub><italic>K</italic>&#x0002B;1</sub> in this paper. Therefore, <inline-formula><mml:math id="M6"><mml:msup><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mi>Y</mml:mi><mml:mo>&#x0222A;</mml:mo><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> and <italic>Y</italic>&#x02229;<italic>Y</italic><sub><italic>K</italic>&#x0002B;1</sub> &#x0003D; &#x02205;. Our task is that the tactile recognition system need to determine whether a tactile observation <inline-formula><mml:math id="M7"><mml:msub><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> is from the known classes <italic>Y</italic> or the unknown classes <italic>Y</italic><sub><italic>K</italic>&#x0002B;1</sub>. If <italic>t</italic><sub><italic>j</italic></sub> is from <italic>Y</italic>, the classifier should predict a class label &#x00177;&#x02208;<italic>Y</italic>, otherwise it can be judged as the novel class <italic>Y</italic><sub><italic>K</italic>&#x0002B;1</sub>.</p>
<p>The primary challenge of solving this problem is how to enable the model to classify tactile examples of seen classes into their respective classes and meantime detect tactile data of unseen classes. Traditional classifiers predict the class of the input instance with the highest Softmax probability. Since the model is impossible to know in advance unknown classes that may be encountered in practice, it tends to predict the lowest probability on the unknown classes. As a consequence, directly using closed-set classifiers for open-set recognition would classify unknown instances into known categories with improperly high confidence, yielding poor performance in open-set recognition. What is more, it is hard to collect sufficient tactile data in practice. These factors make the existing open-set methods unsuitable for tactile recognition. Therefore, it needs to be investigated carefully.</p>
<p>As discussed above, the open-set tactile recognition is a non-trivial task due to the following two major challenges:</p>
<list list-type="order">
<list-item><p>Similar to other open-set recognition problems, open-set tactile recognition also faces the core challenge is how to not only correctly classify samples from the known classes but also effectively detect and flag unknown examples as the novel.</p></list-item>
<list-item><p>Different from other open-set visual recognition tasks, collecting sufficient tactile data is difficult for training due to the complex collection process and constraints of robot-object physical interactions. This makes it difficult to migrate some existing open-set recognition methods with a complicated network to the tactile open-set recognition task.</p></list-item>
<list-item><p>Moreover, the tactile signals for object recognition are commonly high-dimensional dynamic time-series, which exhibit many challenges. Firstly, it is impossible to directly use high-dimensional signals into the existing machine learning methods without any preprocessing techniques. Additionally, there is the nature of misalignment among different tactile measurements. It makes tactile open-set recognition more difficult.</p></list-item>
</list>
</sec>
<sec id="s4">
<title>4. The proposed method</title>
<p>In this section, we first expound the framework architecture of the proposed method, and then we elaborate the details of the Gaussian prototype learning model in the method. Finally, we describe the algorithm optimization of the model.</p>
<sec>
<title>4.1. Framework architecture</title>
<p>The framework of our proposed model is shown in <xref ref-type="fig" rid="F2">Figure 2</xref>, which can be structurally disentangled into two main modules: <italic>feature extraction module</italic> <italic>f</italic>(&#x003B8;, <italic>t</italic>) and <italic>Gaussian prototype learning module</italic>. The feature extraction module is used to transform the raw tactile inputs into abstract feature representations, where <italic>t</italic> is a tactile input and &#x003B8; denotes the parameters of the feature extraction module. Different from the traditional softmax layer for classification on the learned features, we adopt a prototype learning module to learn class prototypes <inline-formula><mml:math id="M8"><mml:msubsup><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula> on the extracted features for each class <italic>y</italic><sub><italic>i</italic></sub>&#x02208;<italic>Y</italic>, where the superscript <italic>l</italic>&#x02208;{1, 2, &#x02026;, <italic>L</italic>} is the number of prototypes in each category. Finally, we apply these prototypes for classification by template matching. When the extracted feature <italic>f</italic>(&#x003B8;, <italic>t</italic>) of an input <italic>t</italic> can not match well with all prototypes of all known classes, it can be viewed as unknown. In this model, a feature extraction module and prototype module are jointly learned from data during training, thus forming a unified end-to-end deep framework, which is beneficial to improve the performance of recognition.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>The framework of our proposed method for open-set recognition.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnins-16-1070645-g0002.tif"/>
</fig>
<p>Previous experiments demonstrated that when the number of prototypes <italic>l</italic> in each class is large, it can not promote the classification accuracy and on the contrary will degrade the performance of the model. In fact, the deep neural network is very powerful for feature representation. Although the initial feature distribution is complex and scattered, the features of each class can be compacted to fit a single class centroid with some appropriate constraints after transformation. As such, we maintain one prototype for each category in our model. For convenience, <inline-formula><mml:math id="M9"><mml:msubsup><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula> is denoted as &#x003BC;<sub><italic>y</italic><sub><italic>i</italic></sub></sub>, of which the superscript is omitted in the following description.</p>
</sec>
<sec>
<title>4.2. Gaussian prototype learning model</title>
<p>Given a tactile input <italic>t</italic><sub><italic>j</italic></sub>, we firstly extract its abstract representation through the feature extraction module <italic>f</italic>(&#x003B8;, <italic>t</italic><sub><italic>j</italic></sub>), and then search the nearest prototype based on the Euclidean distance between the extracted feature with all prototypes in the feature space. Finally, we assign the class label of this prototype to the tactile input. The process can be described as:</p>
<disp-formula id="E1"><label>(1)</label><mml:math id="M10"><mml:mrow><mml:mover accent='true'><mml:mi>y</mml:mi><mml:mo>&#x0005E;</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mtable columnalign='left'><mml:mtr columnalign='left'><mml:mtd columnalign='left'><mml:mrow><mml:mtext>&#x02009;</mml:mtext><mml:mi>arg</mml:mi><mml:munderover><mml:mrow><mml:mi>max</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:munderover><mml:msub><mml:mi>g</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo stretchy='false'>(</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo stretchy='false'>)</mml:mo><mml:mo>,</mml:mo><mml:mtext>&#x02009;</mml:mtext><mml:mi>i</mml:mi><mml:mi>f</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:msub><mml:mi>g</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo stretchy='false'>(</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo stretchy='false'>)</mml:mo><mml:mo>&#x0003E;</mml:mo><mml:mi>&#x003B4;</mml:mi></mml:mrow></mml:mtd></mml:mtr><mml:mtr columnalign='left'><mml:mtd columnalign='left'><mml:mrow><mml:mi>U</mml:mi><mml:mi>n</mml:mi><mml:mi>k</mml:mi><mml:mi>n</mml:mi><mml:mi>o</mml:mi><mml:mi>w</mml:mi><mml:mi>n</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:msub><mml:mi>Y</mml:mi><mml:mrow><mml:mi>K</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mtext>&#x02009;</mml:mtext><mml:mi>i</mml:mi><mml:mi>f</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:msub><mml:mi>g</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo stretchy='false'>(</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo stretchy='false'>)</mml:mo><mml:mo>&#x02264;</mml:mo><mml:mi>&#x003B4;</mml:mi></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:mrow></mml:math></disp-formula>
<p>where <italic>g</italic><sub><italic>i</italic></sub>(<italic>x</italic>) is the class discriminant function that denotes the matching score of tactile sample <italic>t</italic><sub><italic>j</italic></sub> with class <italic>i</italic>, &#x003B4; is a rejection threshold.</p>
<p>To train the framework, we introduce the three optimization objectives, which are <italic>discriminative classification loss, feature distribution loss</italic> and <italic>learning to detect unknowns</italic>.</p>
<sec>
<title>4.2.1. Discriminative classification loss</title>
<p>Intuitively, an ideal class prototype should effectively discriminate and classify samples from different categories. To achieve the goal, we propose a discriminative classification loss. It aims to make the prototype of the corresponding class closer to <italic>f</italic>(&#x003B8;, <italic>t</italic><sub><italic>i</italic></sub>) while the prototypes of other classes stay away from <italic>f</italic>(&#x003B8;, <italic>t</italic><sub><italic>i</italic></sub>), ensuring tactile input is correctly classified.</p>
<p>Essentially, the discriminative classification loss is a novel distance-based cross-entropy loss. Similar to traditional cross-entropy loss, it calculates cross-entropy loss with class probabilities obtained from the distances between samples feature and all prototypes. Specifically, given a sample <italic>t</italic><sub><italic>i</italic></sub> and its class label <italic>y</italic><sub><italic>i</italic></sub>, the probability of belonging to the corresponding prototype can be measured by the distance, and the probabilities are normalized in a similar way of Softmax. With this definition, the loss is defined as:</p>
<disp-formula id="E2"><label>(2)</label><mml:math id="M11"><mml:mrow><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mi>c</mml:mi><mml:mi>l</mml:mi><mml:mi>s</mml:mi></mml:mrow></mml:msub><mml:mo stretchy='false'>(</mml:mo><mml:mi>&#x003B8;</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>&#x003BC;</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo stretchy='false'>)</mml:mo><mml:mo>=</mml:mo><mml:mo>&#x02212;</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mi>N</mml:mi></mml:mfrac><mml:mstyle displaystyle='true'><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:munderover><mml:mrow><mml:mstyle displaystyle='true'><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:munderover><mml:mrow><mml:mi>&#x00393;</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mover accent='true'><mml:mi>y</mml:mi><mml:mo>&#x0005E;</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:mstyle><mml:mi>log</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:msub><mml:mo stretchy='false'>(</mml:mo><mml:mover accent='true'><mml:mi>y</mml:mi><mml:mo>&#x0005E;</mml:mo></mml:mover><mml:mo>|</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo stretchy='false'>)</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:mstyle></mml:mrow></mml:math></disp-formula>
<p>where &#x00393;(&#x000B7;) is symbolic function, and <italic>p</italic><sub><italic>y</italic><sub><italic>i</italic></sub></sub> is class-specific probability, of which the definition can be expressed as:</p>
<disp-formula id="E3"><label>(3)</label><mml:math id="M12"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:msub><mml:mo stretchy='false'>(</mml:mo><mml:mover accent='true'><mml:mi>y</mml:mi><mml:mo>&#x0005E;</mml:mo></mml:mover><mml:mo>|</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo stretchy='false'>)</mml:mo><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msup><mml:mi>e</mml:mi><mml:mrow><mml:mo>&#x02212;</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mi>T</mml:mi></mml:mfrac><mml:msubsup><mml:mrow><mml:mrow><mml:mo>&#x02016;</mml:mo><mml:mrow><mml:mi>f</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>&#x003B8;</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo stretchy='false'>)</mml:mo><mml:mo>&#x02212;</mml:mo><mml:msub><mml:mi>&#x003BC;</mml:mi><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow><mml:mo>&#x02016;</mml:mo></mml:mrow></mml:mrow><mml:mn>2</mml:mn><mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mstyle displaystyle='true'><mml:msub><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:msup><mml:mi>e</mml:mi><mml:mrow><mml:mo>&#x02212;</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mi>T</mml:mi></mml:mfrac><mml:msubsup><mml:mrow><mml:mrow><mml:mo>&#x02016;</mml:mo><mml:mrow><mml:mi>f</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>&#x003B8;</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo stretchy='false'>)</mml:mo><mml:mo>&#x02212;</mml:mo><mml:msub><mml:mi>&#x003BC;</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mo>&#x02016;</mml:mo></mml:mrow></mml:mrow><mml:mn>2</mml:mn><mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:msup></mml:mrow></mml:mstyle></mml:mrow></mml:mfrac></mml:mrow></mml:math></disp-formula>
<p>where T is a temperature coefficient that is used to control the characteristics of the classifier. We set the value of <italic>T</italic> as the variance &#x003C3;<sup>2</sup> in the feature space, in order to normalize the representation space and increase the stability of the system. All classes prototypes &#x003BC;<sub><italic>i</italic></sub> with <italic>i</italic>&#x02208;<italic>Y</italic> and the variance &#x003C3;<sup>2</sup> are updated in an online manner.</p>
<p>By minimizing <italic>L</italic><sub><italic>cls</italic></sub>(&#x003B8;, &#x003BC;<sub><italic>i</italic></sub>), the loss aims to encourage separating the samples from different categories in learned feature space. In particular, this objective is to decrease the distance between samples of the same category and the corresponding prototype, and increase the distance between the sample and all other incorrect prototypes. Since the objective considers all prototypes in each updating step, it can better guarantee the convergence of training.</p>
</sec>
<sec>
<title>4.2.2. Feature distribution loss</title>
<p>For open-set recognition, the learned features need not only to be separable in different classes but also be compact in the same class. However, the above classification loss only makes the features of different categories separable. As a result, a feature <italic>t</italic><sub><italic>i</italic></sub> is far away from the corresponding category centroid &#x003BC;<sub><italic>y</italic><sub><italic>i</italic></sub></sub>, but it still is correctly classified if it is relatively closer to &#x003BC;<sub><italic>y</italic><sub><italic>i</italic></sub></sub> than to the feature centroids of the other classes. To tackle this issue, we further introduce a feature distribution loss to learn discriminative and compact representation, making it more applicable for our task.</p>
<p>The feature distribution loss is essentially the maximum likelihood regularization term on the assumption of Gaussian distribution. Specifically, we assume that the extracted feature on the training set conforms the Gaussian mixture distribution, viewing class prototype &#x003BC;<sub><italic>y</italic><sub><italic>i</italic></sub></sub> as the mean of a Gaussian component, which can be expressed as:</p>
<disp-formula id="E4"><label>(4)</label><mml:math id="M13"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mi>p</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:mi>N</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>f</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>&#x003B8;</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003C3;</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where &#x003C3;<sub><italic>y</italic><sub><italic>i</italic></sub></sub> is covariance of class <italic>y</italic><sub><italic>i</italic></sub> in the feature space, and <italic>p</italic>(<italic>y</italic><sub><italic>i</italic></sub>) is the prior probability of class <italic>y</italic><sub><italic>i</italic></sub>. For the convenience of calculation, the likelihood regularization term is defined as the negative log-likelihood. By reasonably setting constant prior probabilities <italic>p</italic>(<italic>y</italic><sub><italic>i</italic></sub>), the likelihood regularization term <italic>L</italic><sub><italic>lkd</italic></sub> is simplified to Equation (5).</p>
<disp-formula id="E5"><label>(5)</label><mml:math id="M14"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>l</mml:mi><mml:mi>k</mml:mi><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>&#x003B8;</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:mo class="qopname">log</mml:mo><mml:mi>N</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>f</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>&#x003B8;</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003C3;</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>The objective of the regularization term aims to maximize the log-likelihood of sample <italic>t</italic><sub><italic>i</italic></sub> for its corresponding class.</p>
<p>By minimizing <italic>L</italic><sub><italic>lkd</italic></sub>, the model can effectively reduce the within-class variance and constrain the feature distribution of known classes, so it can reserve more feature space for unknown classes and improve the performance of the proposed method for detecting unknowns.</p>
</sec>
<sec>
<title>4.2.3. Learning to detect unknowns</title>
<p>The threshold-based rejection is frequently used in open-set recognition tasks. Most of the existing methods directly adopt the predefined threshold to detect unknowns, which is not suitable in practical applications. In order to make our model effective on the open set tasks, we explicitly consider adopting class-specific rejection criteria. In particular, we use an adaptive strategy by letting the value threshold &#x003B4; to be proportional to maximal distance &#x00394;<sub><italic>y</italic><sub><italic>i</italic></sub></sub> between samples specific class <italic>y</italic><sub><italic>i</italic></sub> and the corresponding class centroid &#x003BC;<sub><italic>y</italic><sub><italic>i</italic></sub></sub>, i.e., &#x003B4; &#x0003D; &#x003B1;&#x00394;<sub><italic>y</italic><sub><italic>i</italic></sub></sub> where &#x003B1; is proportional coefficient. Formally, Equation (1) can be expressed as:</p>
<disp-formula id="E6"><label>(6)</label><mml:math id="M15"><mml:mrow><mml:mover accent='true'><mml:mi>y</mml:mi><mml:mo>&#x0005E;</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mtable columnalign='left'><mml:mtr columnalign='left'><mml:mtd columnalign='left'><mml:mrow><mml:mi>t</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>c</mml:mi><mml:mi>l</mml:mi><mml:mi>a</mml:mi><mml:mi>s</mml:mi><mml:mi>s</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:mi>arg</mml:mi><mml:munderover><mml:mrow><mml:mi>max</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>k</mml:mi></mml:munderover><mml:msub><mml:mi>g</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo stretchy='false'>(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>,</mml:mo><mml:mtext>&#x02009;</mml:mtext><mml:mi>i</mml:mi><mml:mi>f</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:msub><mml:mi>g</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo stretchy='false'>(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>&#x0003E;</mml:mo><mml:mi>&#x003B1;</mml:mi><mml:msub><mml:mi>&#x00394;</mml:mi><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr><mml:mtr columnalign='left'><mml:mtd columnalign='left'><mml:mrow><mml:mi>U</mml:mi><mml:mi>n</mml:mi><mml:mi>k</mml:mi><mml:mi>n</mml:mi><mml:mi>o</mml:mi><mml:mi>w</mml:mi><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mtext>&#x02003;&#x02003;&#x02003;&#x02003;&#x02003;&#x02003;</mml:mtext><mml:mi>i</mml:mi><mml:mi>f</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:msub><mml:mi>g</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo stretchy='false'>(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>&#x02264;</mml:mo><mml:mi>&#x003B1;</mml:mi><mml:msub><mml:mi>&#x00394;</mml:mi><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
<p>where <inline-formula><mml:math id="M16"><mml:msub><mml:mrow><mml:mi>g</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msubsup><mml:mrow><mml:mi>&#x003C3;</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mrow></mml:mfrac><mml:mo>|</mml:mo><mml:mo>|</mml:mo><mml:mi>f</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>&#x003B8;</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub><mml:mo>|</mml:mo><mml:msubsup><mml:mrow><mml:mo>|</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula>. Instead of adopting the pre-defined threshold, we explicitly learn specific threshold of each category by minimizing the following objective:</p>
<disp-formula id="E7"><label>(7)</label><mml:math id="M17"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mi>h</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>&#x003B8;</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:munder class="msub"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>Y</mml:mi></mml:mrow></mml:munder></mml:mstyle><mml:mo class="qopname">max</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mi>m</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>&#x003C3;</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mfrac><mml:msup><mml:mrow><mml:mo>|</mml:mo><mml:mo>|</mml:mo><mml:mi>f</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>&#x003B8;</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub><mml:mo>|</mml:mo><mml:mo>|</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>-</mml:mo><mml:mi>&#x003B1;</mml:mi><mml:msub><mml:mrow><mml:mi>&#x00394;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>m</italic> &#x0003D; &#x02212;1 if <italic>i</italic> &#x0003D; <italic>y</italic><sub><italic>i</italic></sub> and <italic>m</italic> &#x0003D; 1 otherwise.</p>
<p>By minimizing <italic>L</italic><sub><italic>thr</italic></sub>, the model can obtain class-specific rejection thresholds, instead of presetting a global threshold as in prior works. It makes the proposed model effective to detect unknown samples.</p>
</sec>
</sec>
<sec>
<title>4.3. Algorithm optimization</title>
<p>With the above-mentioned analysis, the optimization process of our proposed method is structurally divided into two components: <italic>optimization of feature representation</italic> and <italic>optimization of rejection threshold</italic>.</p>
<p>(1) In this optimization of feature representation, the trainable parameters in the proposed method are composed of two parts, i.e., parameters of encoder network for feature transformation <italic>f</italic>(&#x003B8;, <italic>t</italic><sub><italic>i</italic></sub>) and all classes prototypes &#x003BC;<sub><italic>i</italic></sub>. To this end, we combine discriminative classification loss and feature distribution loss. The formally objective function is expressed as:</p>
<disp-formula id="E8"><label>(8)</label><mml:math id="M18"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mi>L</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>&#x003B8;</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi><mml:mi>l</mml:mi><mml:mi>s</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>&#x003B8;</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x0002B;</mml:mo><mml:mi>&#x003BB;</mml:mi><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>l</mml:mi><mml:mi>k</mml:mi><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>&#x003B8;</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where &#x003BB;&#x02265;0 is weighting coefficients, which controls the trade-off of the two loss terms to optimal performance. For the hybrid optimization objective function <italic>L</italic>(&#x003B8;, &#x003BC;<sub><italic>y</italic><sub><italic>i</italic></sub></sub>) in Equation (5), we can directly calculate the gradients of &#x02202;<italic>L/&#x02202;f</italic> and &#x02202;<italic>L/&#x02202;&#x003BC;</italic><sub><italic>y</italic><sub><italic>i</italic></sub></sub>. According to the error back propagation, we can calculate the gradient of &#x02202;<italic>L/&#x02202;&#x003B8;</italic>. With the gradients of <italic>L</italic> over all parameters, we can jointly optimize both feature extractor and all classes prototypes using a gradient descent (SGD) optimization algorithm in an end-to-end way.</p>
<p>(2) For optimization of rejection threshold, it aims to achieve optimal class-specific thresholds &#x003B1;&#x00394;<sub><italic>i</italic></sub>. In this process, we held-out set of samples from the training set to learn the optimal thresholds. Hence, we split the samples into two parts, one part used for learning the feature extractor <italic>f</italic>(&#x003B8;) and the classes prototypes &#x003BC;<sub><italic>y</italic><sub><italic>i</italic></sub></sub> and the remaining part for learning the value of &#x003B1;&#x00394;<sub><italic>y</italic><sub><italic>i</italic></sub></sub>.</p>
<p>In summary, the optimization process of our proposed model is elaborated in <xref ref-type="fig" rid="F10">Algorithm 1</xref>.</p>
<fig id="F10" position="float">
<label>Algorithm 1</label>
<caption><p>The program flowchart of the proposed method.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnins-16-1070645-g0010.tif"/>
</fig>
</sec>
</sec>
<sec id="s5">
<title>5. Experiments</title>
<p>In this section, the proposed open-set tactile recognition method is comprehensively evaluated on two publicly available datasets. Firstly, the adopted datasets, evaluation metrics, comparison methods, and implementation details are described. Then experiments results and their analysis are provided. Finally, we further analyze the sensitivity of hyperparameters in the model.</p>
<sec>
<title>5.1. Dataset splits</title>
<p>We demonstrate our proposed method on two publicly available data sets, which are Haptic Texture Database (LTM_108) dataset (Strese et al., <xref ref-type="bibr" rid="B34">2016</xref>) and Penn Haptic Adjective Corpus (PHAC-2) (Chu et al., <xref ref-type="bibr" rid="B5">2015</xref>) dataset. They have been used to evaluate a model&#x00027;s ability to recognize objects or textures by tactile modality (Liu and Sun, <xref ref-type="bibr" rid="B26">2017</xref>). In these two data sets, their tactile data, respectively, represent two typical types of tactile information. Different from closed-set recognition, open-set tactile recognition needs a special setup and experiments. The splitting of the dataset is described as follows:</p>
<p><bold>LTM_108:</bold> The LTM_108 dataset consists of 108 different surface material instances, which are divided into 9 categories based on the material properties. These material images of the dataset are shown in <xref ref-type="fig" rid="F3">Figure 3</xref>. In this dataset, it provides multimodal data for each material instance, namely visual images, tactile acceleration traces and sound signals generated from the surface-tool interaction. The dataset provides a training set and a testing set. They both contain 108 material instances and every instance has ten tactile samples. In this experiment, we only use the tactile acceleration traces as tactile data for object recognition.</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p>All material images of the data set. The numbers, respectively, denote the beginning of each category. The original images are from Strese et al. (<xref ref-type="bibr" rid="B34">2016</xref>). It has been reproduced with permission from IEEE, available at <ext-link ext-link-type="uri" xlink:href="https://zeus.lmt.ei.tum.de/downloads/texture/">https://zeus.lmt.ei.tum.de/downloads/texture/</ext-link>.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnins-16-1070645-g0003.tif"/>
</fig>
<p>Although this dataset has been directly used for some closed-set tasks of tactile recognition, we use this dataset to tackle more challenging the open-set tactile recognition task. To provide a suitable test platform, a new dataset split is proposed based on the original dataset. In particular, we randomly select <italic>K</italic> &#x0003C; 9 categories tactile samples from the train set to train our models and use totally 9 categories of tactile samples from the test set for test evaluation. This setting ensures that the test set appears some material categories that are not in the training set. The <xref ref-type="table" rid="T1">Table 1</xref> show a case of the dataset splits when <italic>K</italic> &#x0003D; 6.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>The details of the dataset splits on LTM_108.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Material category</bold></th>
<th valign="top" align="center"><bold>Training samples</bold></th>
<th valign="top" align="center"><bold>Testing samples</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Mesh</td>
<td valign="top" align="center">13 &#x000D7; 10</td>
<td valign="top" align="center">13 &#x000D7; 10</td>
</tr>
<tr>
<td valign="top" align="left">Stones</td>
<td valign="top" align="center">9 &#x000D7; 10</td>
<td valign="top" align="center">9 &#x000D7; 10</td>
</tr>
<tr>
<td valign="top" align="left">Glossy</td>
<td valign="top" align="center">9 &#x000D7; 10</td>
<td valign="top" align="center">9 &#x000D7; 10</td>
</tr>
<tr>
<td valign="top" align="left">Wood</td>
<td valign="top" align="center">13 &#x000D7; 10</td>
<td valign="top" align="center">13 &#x000D7; 10</td>
</tr>
<tr>
<td valign="top" align="left">Rubbers</td>
<td valign="top" align="center">5 &#x000D7; 10</td>
<td valign="top" align="center">5 &#x000D7; 10</td>
</tr>
<tr>
<td valign="top" align="left">Fibers</td>
<td valign="top" align="center">15 &#x000D7; 10</td>
<td valign="top" align="center">15 &#x000D7; 10</td>
</tr>
<tr>
<td valign="top" align="left">Foams</td>
<td valign="top" align="center">-</td>
<td valign="top" align="center">12 &#x000D7; 10</td>
</tr>
<tr>
<td valign="top" align="left">Foils and paper</td>
<td valign="top" align="center">-</td>
<td valign="top" align="center">15 &#x000D7; 10</td>
</tr>
<tr>
<td valign="top" align="left">Textile and fabrics</td>
<td valign="top" align="center">-</td>
<td valign="top" align="center">17 &#x000D7; 10</td>
</tr>
<tr>
<td valign="top" align="left">Total</td>
<td valign="top" align="center">64 &#x000D7; 10</td>
<td valign="top" align="center">108 &#x000D7; 10</td>
</tr>
</tbody>
</table>
</table-wrap>
<p><bold>PHAC-2:</bold> There are 60 objects in the PHAC-2 dataset. The visual images of the dataset are shown in <xref ref-type="fig" rid="F4">Figure 4</xref>. According to the physical properties, these objects are divided into eight categories. In this data set, each object contains tactile signals and visual images. The tactile signals are collected by two SynTouch BioTacs tactile sensors, which are installed to the grippers of a PR2 robot. In order to mimic the process of humans exploring the tactile properties of objects, the robot used four exploratory procedures to acquire five types of tactile data. According to the specified procedures, ten trials are performed on each object, resulting in totally 600 tactile samples. Although the joint data on gripper during exploratory movements are available, we focused on the tactile signals for classification in this experiment. In particular, each tactile sample consists of five components <italic>P</italic><sub><italic>DC</italic></sub>, <italic>P</italic><sub><italic>AC</italic></sub>, <italic>T</italic><sub><italic>DC</italic></sub>, <italic>T</italic><sub><italic>AC</italic></sub> and <italic>E</italic><sub>19</sub>.</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p>The PHAC-2 contains 60 objects, which are organized by their primary material. The original images are from Chu et al. (<xref ref-type="bibr" rid="B5">2015</xref>). It has been reproduced with permission from Elsevier, available at <ext-link ext-link-type="uri" xlink:href="https://hi.is.mpg.de/research_projects/learning-haptic-adjectives-from-tactile-data">https://hi.is.mpg.de/research_projects/learning-haptic-adjectives-from-tactile-data</ext-link>.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnins-16-1070645-g0004.tif"/>
</fig>
<p>Similar to the above dataset setting, the PHAC-2 dataset also needs to be reorganized and split. Firstly, we randomly select an object instance from each material category as test objects and remain other 52 object instances. Then, we randomly select <italic>K</italic> &#x0003C; 8 categories from eight categories from the remaining object instances. When <italic>K</italic> &#x0003D; 5, the details of the dataset splits are shown in <xref ref-type="table" rid="T2">Table 2</xref>. Please note that according to the above setting, not only does the test set contains some categories that are not in the training set, but also the training set and the testing set do not share the same object instance even from the same category. Different from instance-level recognition, this experiment can be referred to as categorization-level open-set recognition. To this need, we need the proposed model to have generalization and robustness for unseen object instances.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>The details of the dataset splits on the PHAC-2.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Material category</bold></th>
<th valign="top" align="center"><bold>Original samples</bold></th>
<th valign="top" align="center"><bold>Training samples</bold></th>
<th valign="top" align="center"><bold>Testing samples</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Foam</td>
<td valign="top" align="center">16 &#x000D7; 10</td>
<td valign="top" align="center">15 &#x000D7; 10</td>
<td valign="top" align="center">1 &#x000D7; 10</td>
</tr>
<tr>
<td valign="top" align="left">Organic</td>
<td valign="top" align="center">5 &#x000D7; 10</td>
<td valign="top" align="center">4 &#x000D7; 10</td>
<td valign="top" align="center">1 &#x000D7; 10</td>
</tr>
<tr>
<td valign="top" align="left">Fabric</td>
<td valign="top" align="center">7 &#x000D7; 10</td>
<td valign="top" align="center">6 &#x000D7; 10</td>
<td valign="top" align="center">1 &#x000D7; 10</td>
</tr>
<tr>
<td valign="top" align="left">Plastic</td>
<td valign="top" align="center">13 &#x000D7; 10</td>
<td valign="top" align="center">12 &#x000D7; 10</td>
<td valign="top" align="center">1 &#x000D7; 10</td>
</tr>
<tr>
<td valign="top" align="left">Paper</td>
<td valign="top" align="center">12 &#x000D7; 10</td>
<td valign="top" align="center">11 &#x000D7; 10</td>
<td valign="top" align="center">1 &#x000D7; 10</td>
</tr>
<tr>
<td valign="top" align="left">Stone</td>
<td valign="top" align="center">2 &#x000D7; 10</td>
<td valign="top" align="center">-</td>
<td valign="top" align="center">1 &#x000D7; 10</td>
</tr>
<tr>
<td valign="top" align="left">Glass</td>
<td valign="top" align="center">2 &#x000D7; 10</td>
<td valign="top" align="center">-</td>
<td valign="top" align="center">1 &#x000D7; 10</td>
</tr>
<tr>
<td valign="top" align="left">Metal</td>
<td valign="top" align="center">3 &#x000D7; 10</td>
<td valign="top" align="center">-</td>
<td valign="top" align="center">1 &#x000D7; 10</td>
</tr>
<tr>
<td valign="top" align="left">Total</td>
<td valign="top" align="center">60 &#x000D7; 10</td>
<td valign="top" align="center">48 &#x000D7; 10</td>
<td valign="top" align="center">8 &#x000D7; 10</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec>
<title>5.2. Data preprocessing and network architecture</title>
<p>Considering the difference between the two types of tactile signals, we adopt two different feature extraction methods and network architectures for classification. The specific details are as follows.</p>
<p><bold>LTM_108:</bold> In the LTM_108 dataset, the recorded tactile signals are three-axis acceleration traces. Firstly, the three-axis acceleration traces are conversed to a one-dimensional signal by the DFT321 algorithm (Kuchenbecker et al., <xref ref-type="bibr" rid="B23">2010</xref>). Considering the effectiveness of short-time Fourier transform (STFT) extracting features of time-series signals, we adopt STFT to convert a one-dimensional DFT321 signal into a spectrogram. These spectrograms are in the log domain, where the length of a frame length is 500 and the increment of frame and frame is 250. By the predefined configuration settings mentioned above, there are 100 spectrogram samples of size 50 x 250 extracted from each tactile acceleration trace.</p>
<p>As convolutional neural network (CNN) has proven to be effective in visual classification, which has achieved good performance on many tasks. Moreover, some CNN models pre-trained on ImageNet (Deng et al., <xref ref-type="bibr" rid="B8">2009</xref>) have shown generalization and discrimination. In this experiment, we use the pre-trained Resnet18 (He et al., <xref ref-type="bibr" rid="B18">2016</xref>) model on ImageNet as the network backbone of the proposed method.</p>
<p><bold>PHAC-2:</bold> As in Abderrahmane et al. (<xref ref-type="bibr" rid="B2">2019</xref>), we firstly normalize the five components (<italic>P</italic><sub><italic>DC</italic></sub>, <italic>P</italic><sub><italic>AC</italic></sub>, <italic>T</italic><sub><italic>DC</italic></sub>, <italic>T</italic><sub><italic>AC</italic></sub>, <italic>E</italic><sub>19</sub>) in each signal sample, respectively. As the sample rate of <italic>P</italic><sub><italic>AC</italic></sub> is higher than other components of a tactile sample, we downsample it to match the other signals&#x00027; sample rate of 100 Hz. For some exploratory movements, the length of tactile signals varies considerably from objects. In order to resolve the length difference of signal, we downsample the signal of each exploratory movement to a fixed length of 150. Principal Component Analysis is used independently on the <italic>E</italic><sub>19</sub> data from each exploratory movement to capture the four most principal components across all objects. Thus, we obtain 64 tactile signals for each object in each trial.</p>
<p>Recently, Ji et al. (<xref ref-type="bibr" rid="B21">2015</xref>) has demonstrated the effectiveness of CNN on temporal signals with limited amounts. In this experiment, we adopt Convolutional neural networks (CNN) to perform tactile object recognition. The specific network structure is the same as the Haptic CNN model in Gao et al. (<xref ref-type="bibr" rid="B10">2016</xref>). Every tactile sample per object has 64 tactile signals. We concatenate the 64 features along the channel axis, which is used as the input of our model.</p>
</sec>
<sec>
<title>5.3. Evaluation metric</title>
<p>In this experiment, we use the three metrics to evaluate the classification performance, including <italic>Accuracy</italic> and <italic>F-measure</italic> and <italic>AUC</italic>.</p>
<list list-type="bullet">
<list-item><p>Accuracy: As a common metric method to evaluate classifiers on a closed set task, recognition accuracy <italic>Acc</italic> is defined as:</p></list-item>
</list>
<disp-formula id="E9"><label>(9)</label><mml:math id="M24"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mi>A</mml:mi><mml:mi>c</mml:mi><mml:mi>c</mml:mi><mml:mi>u</mml:mi><mml:mi>r</mml:mi><mml:mi>a</mml:mi><mml:mi>c</mml:mi><mml:mi>y</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mtext class="textrm" mathvariant="normal">TP &#x0002B; TN</mml:mtext></mml:mrow><mml:mrow><mml:mtext class="textrm" mathvariant="normal">TN &#x0002B; TP &#x0002B; FP &#x0002B; FN</mml:mtext></mml:mrow></mml:mfrac></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<list list-type="simple">
<list-item><p>where TP, TN, FN, and FP, respectively, denote true positive, true negative false negative, and false positive. The sum of the three quantities is equal to the total number of samples.</p></list-item>
<list-item><p>&#x02022; F-measure: F-measure is commonly evaluation metric, which is defined as a harmonic mean of Precision <italic>P</italic> and Recall <italic>R</italic>:</p></list-item>
</list>
<disp-formula id="E10"><label>(10)</label><mml:math id="M25"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mi>F</mml:mi><mml:mo>-</mml:mo><mml:mi>m</mml:mi><mml:mi>e</mml:mi><mml:mi>a</mml:mi><mml:mi>s</mml:mi><mml:mi>u</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mo>=</mml:mo><mml:mn>2</mml:mn><mml:mo>&#x000D7;</mml:mo><mml:mfrac><mml:mrow><mml:mi>P</mml:mi><mml:mo>&#x000D7;</mml:mo><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>P</mml:mi><mml:mo>&#x000D7;</mml:mo><mml:mi>R</mml:mi></mml:mrow></mml:mfrac><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>2</mml:mn><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mi>F</mml:mi><mml:mi>P</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mi>F</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:mfrac></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<list list-type="simple">
<list-item><p>As suggested in Bendale and Boult (<xref ref-type="bibr" rid="B4">2016</xref>) and Geng et al. (<xref ref-type="bibr" rid="B12">2020</xref>), we use macro-averaged F1-score. It is denoted as macro-F1.</p></list-item>
<list-item><p>&#x02022; AUC: It denotes area under the ROC curve (AUC), which measures the performance of detecting unknown between known and unknown data.</p></list-item>
</list>
</sec>
<sec>
<title>5.4. Comparison methods</title>
<p>To validate the advantages of our proposed method, several classical methods were also implemented for comparison. A brief description of the methods is as follows:</p>
<list list-type="bullet">
<list-item><p>Softmax: It used the highest probability from the softmax layer of networks as the confidence score for classification.</p></list-item>
<list-item><p>&#x003C4;-Softmax (Hendrycks and Gimpel, <xref ref-type="bibr" rid="B20">2016</xref>): It aims to use a global threshold on the softmax probability to determine whether an input sample belongs to an unknown class. We refer to this method as &#x003C4;-Softmax.</p></list-item>
<list-item><p>&#x003C4;-Center (Wen et al., <xref ref-type="bibr" rid="B38">2016</xref>): It can be combined with cross-entropy loss to encourage the training data to form better-behaved class structures, which may be easier to model and facilitate greater distinction of open-set inputs. To this end, we also use it to detect unknown classes by a predefined threshold, which is denoted as &#x003C4;-Center loss.</p></list-item>
<list-item><p>OpenMax (Bendale and Boult, <xref ref-type="bibr" rid="B4">2016</xref>): It proposed replacing the softmax layer with OpenMax, which calibrates the confidence score with Weibull distribution. It proposed an inference method for detecting novel classes.</p></list-item>
</list>
<p>We note that some advanced methods, such as Yoshihashi et al. (<xref ref-type="bibr" rid="B42">2019</xref>) and Sun et al. (<xref ref-type="bibr" rid="B35">2020</xref>), have also been proposed to deal with open-set visual recognition. However, we do not take them for comparison, because the networks of these methods are too complex to work on the limited training data of tactile tasks.</p>
</sec>
<sec>
<title>5.5. Implementation details</title>
<p>For open-set recognition, the ratio of seen and unseen is an important factor, which quantifies the openness of the problem. As in Zhou et al. (<xref ref-type="bibr" rid="B48">2021</xref>), the openness is defined as:</p>
<disp-formula id="E11"><label>(11)</label><mml:math id="M26"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mi>o</mml:mi><mml:mi>p</mml:mi><mml:mi>e</mml:mi><mml:mi>n</mml:mi><mml:mi>n</mml:mi><mml:mi>e</mml:mi><mml:mi>s</mml:mi><mml:mi>s</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>-</mml:mo><mml:msqrt><mml:mrow><mml:mfrac><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mi>r</mml:mi><mml:mi>a</mml:mi><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mi>e</mml:mi><mml:mi>s</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:msqrt></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>N</italic><sub><italic>train</italic></sub> and <italic>N</italic><sub><italic>test</italic></sub>, respectively, denote the number of categories in training set and testing set. As we described in the preliminaries, <italic>N</italic><sub><italic>train</italic></sub> &#x0003D; <italic>K</italic>.</p>
<p>In this experiment, we empirically set the likelihood regularization parameter &#x003BB; to 0.01 in experiments. For the margin parameter &#x003B1;, the optimization of the objective function becomes more difficult as the value increases. Therefore, &#x003B1; needs to be smaller when the number of classes gets more. In our experiments, we empirically set &#x003B1; to 0.4 and 0.3 for LTM_108 and PHAC-2, respectively.</p>
</sec>
<sec>
<title>5.6. Experimental results and analysis</title>
<p>Experimental results on the LTM_108 and PHAC-2 datasets are reported in this subsection. In this experiment, we randomly select 6 categories as known classes for LTM_108. Considering the instance imbalance of categories in PHAC-2, the first five categories are used as known classes. Ten trials are performed on each experiment, and the averaged results are used as final metric scores. In this setting, the corresponding experimental results on different methods are shown in <xref ref-type="table" rid="T3">Table 3</xref>.</p>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p>Experimental results of different method.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Model</bold></th>
<th valign="top" align="center" colspan="3" style="border-bottom: thin solid #000000;"><bold>LTM_108</bold></th>
<th valign="top" align="center" colspan="3" style="border-bottom: thin solid #000000;"><bold>PHAC-2</bold></th>
</tr>
<tr>
<th/>
<th valign="top" align="center"><bold>Accuracy</bold></th>
<th valign="top" align="center"><bold>Macro-F1</bold></th>
<th valign="top" align="center"><bold>AUC</bold></th>
<th valign="top" align="center"><bold>Accuracy</bold></th>
<th valign="top" align="center"><bold>Macro-F1</bold></th>
<th valign="top" align="center"><bold>AUC</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Softmax</td>
<td valign="top" align="center">59.1%</td>
<td valign="top" align="center">0.491</td>
<td valign="top" align="center">0.878</td>
<td valign="top" align="center">62.5%</td>
<td valign="top" align="center">0.518</td>
<td valign="top" align="center">0.871</td>
</tr>
<tr>
<td valign="top" align="left">&#x003C4;-Softmax (Hendrycks and Gimpel, <xref ref-type="bibr" rid="B20">2016</xref>)</td>
<td valign="top" align="center">61.7%</td>
<td valign="top" align="center">0.567</td>
<td valign="top" align="center">0.970</td>
<td valign="top" align="center">70.01%</td>
<td valign="top" align="center">0.625</td>
<td valign="top" align="center">0.975</td>
</tr>
<tr>
<td valign="top" align="left">&#x003C4;-Center (Wen et al., <xref ref-type="bibr" rid="B38">2016</xref>)</td>
<td valign="top" align="center">64.9%</td>
<td valign="top" align="center">0.613</td>
<td valign="top" align="center">0.975</td>
<td valign="top" align="center">71.3%</td>
<td valign="top" align="center">0.634</td>
<td valign="top" align="center">0.977</td>
</tr>
<tr>
<td valign="top" align="left">OpenMax (Bendale and Boult, <xref ref-type="bibr" rid="B4">2016</xref>)</td>
<td valign="top" align="center">62.5%</td>
<td valign="top" align="center">0.574</td>
<td valign="top" align="center">0.978</td>
<td valign="top" align="center">58.7%</td>
<td valign="top" align="center">0.536</td>
<td valign="top" align="center">0.928</td>
</tr>
<tr>
<td valign="top" align="left">Proposed method</td>
<td valign="top" align="center">70.76%</td>
<td valign="top" align="center">0.669</td>
<td valign="top" align="center">0.986</td>
<td valign="top" align="center">75.5%</td>
<td valign="top" align="center">0.703</td>
<td valign="top" align="center">0.986</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>From <xref ref-type="table" rid="T3">Table 3</xref>, it can be seen that our proposed method achieves the highest Acc, macro-F1, and AUC on the two datasets. It indicates that the proposed model outperforms the compared methods, which also demonstrates that the proposed method is able to effectively improve the ability to detect unknowns while ensuring the accuracy of the known classification simultaneously.</p>
<p>As mentioned above, the open-set recognition on PHAC-2 is more challenging, as its test set and training set does not share the same object instance. Besides, we do not perform any data augmentation or employ some specific and complex networks in these experiments. Even so, our proposed method still achieves optimal performance. This further verifies the effectiveness of the proposed method.</p>
<p>Additionally, it is clear that &#x003C4;-Center exhibits a better performance among all these compared methods because it explicitly encourages stronger compactness of feature, which is beneficial for open-set recognition. However, it mainly aims at improving the softmax loss and feature distribution is not explicitly modeled. Therefore, it can not achieve optimal performances dealing with the tactile OSR problem. Since integrating the advantages from both classification discrimination with Gaussian Prototype Learning and likelihood estimation of feature distribution, our proposed method performs better in open-set conditions. It highlights the importance of considering the likelihood of feature distribution in the tactile OSR problem.</p>
<p>In particular, we can observe that as a state-of-art open-set recognition method, Openmax shows low performance, especially on the PHAC-2 dataset. It is mainly due to low recall on known classes with a few training instances since test instances from smaller classes are usually projected farther from the mean activation vector of the corresponding class. This demonstrates that Openmax may be also hardly infer the class probability of unknown inputs by the probability distribution of Softmax. Moreover, our experiments indicate that merely thresholding the output probabilities of softmax helps, but is still relatively weak for open set recognition.</p>
</sec>
<sec>
<title>5.7. Effectiveness of different openness</title>
<p>To valid the robustness of our proposed model to different openness, we evaluate performance over multiple openness values in the experiments. In particular, we vary the openness of Equation (11) by varying the number of classes in the training sample, while the number of test classes remains the same. We evaluate the performance by macro F1-scores. The corresponding results are shown in <xref ref-type="fig" rid="F5">Figures 5</xref>, <xref ref-type="fig" rid="F6">6</xref>.</p>
<fig id="F5" position="float">
<label>Figure 5</label>
<caption><p>Macro F1 against varying openness with different methods on the LTM_108 dataset.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnins-16-1070645-g0005.tif"/>
</fig>
<fig id="F6" position="float">
<label>Figure 6</label>
<caption><p>Macro F1 against varying openness with different methods on the PHAC-2 dataset.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnins-16-1070645-g0006.tif"/>
</fig>
<p>As to be expected, when more known classes are available during training, the performances of classifiers are better for all methods in <xref ref-type="fig" rid="F5">Figures 5</xref>, <xref ref-type="fig" rid="F6">6</xref>. We can observe that the proposed approach remains relatively stable over a wide range of openness, which produces better results compared to other methods.</p>
</sec>
<sec>
<title>5.8. Parameter sensitivity analysis</title>
<p>In the proposed model, &#x003B1; and &#x003BB; are important parameters, and their values affect on the model&#x00027;s performance. To obtain optimal values for these parameters, we conduct extensive experiments to perform grid search for &#x003B1; and &#x003BB; within the set {0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0} and {0.0001, 0.001, 0.01, 0.1, 1, 10, 100}. The experimental results show that the model can achieve optimal performance when &#x003B1; = 0.4 and &#x003BB; = 0.01 on the LTM_108 dataset. For the PHAC-2 dataset, the model shows the best performance where &#x003B1; = 0.3 and &#x003BB; = 0.01. For the convenience of explanation, the sensitivity analysis of these two parameters is divided into two parts for illustration.</p>
<p>To analyze the effect of these parameters &#x003B1; on the proposed model&#x00027;s performance, we set the value of &#x003BB; to 0.01 and perform grid search of the parameter &#x003B1; within the set {0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0}. The relationships between Accuracy and the macro-F1 and of the value of &#x003B1; are shown on the two datasets in <xref ref-type="fig" rid="F7">Figures 7</xref>, <xref ref-type="fig" rid="F8">8</xref>, respectively. It can be observed that the performance of the model is very sensitive to the value of the parameter &#x003B1;, and the model performs well when &#x003B1;&#x02208;[0.1, 0.8] on both datasets.</p>
<fig id="F7" position="float">
<label>Figure 7</label>
<caption><p>Acc and macro F1 for different &#x003B1; on the LTM_108 dataset.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnins-16-1070645-g0007.tif"/>
</fig>
<fig id="F8" position="float">
<label>Figure 8</label>
<caption><p>Acc and macro F1 for different &#x003B1; on the PHAC-2 dataset.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnins-16-1070645-g0008.tif"/>
</fig>
<p>Then, we conduct experiments to study the effect of the parameter &#x003BB; on the performance of the model. Fixing parameters &#x003B1; = 0.4 on the LTM_108 dataset and &#x003B1; = 0.3 on the PHAC-2 dataset, we tune the parameter &#x003BB; within the set {0.0001, 0.001, 0.01, 0.1, 1, 10, 100} and the corresponding experimental results are given in <xref ref-type="fig" rid="F9">Figure 9</xref>. It can be observed that our model achieves good performances in the range of &#x003BB;&#x02208;(0, 1]. When &#x003BB;&#x0003E;1, the model&#x00027;s performance on the contrary degrades. This is mainly because the likelihood regularization starts to play a role when the training accuracy is close to saturation, and a strong regularization weakens the discrimination effect of the model. Hence, there is a need to find the optimal balance of the two terms in the optimization process.</p>
<fig id="F9" position="float">
<label>Figure 9</label>
<caption><p>The performance of proposed model in terms of &#x003BB;.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnins-16-1070645-g0009.tif"/>
</fig>
</sec>
</sec>
<sec sec-type="conclusions" id="s6">
<title>6. Conclusion</title>
<p>In this work, we specifically address tactile object recognition in open-set scenarios, which aims to enable robots to exploit tactile explorations in unstructured environments. To this end, we proposed a novel Gaussian prototype learning model, which incorporates classification and novel class detection into a unified framework. In particular, a likelihood regularization term is introduced to explicitly consider the feature distribution of tactile data. In addition, we further develop an adaptive classification margin to improve the performance of the model. Experimental results validate the effectiveness of the proposed method, which has the potential to improve the performance of open-set tactile perception. We believe that it makes the first step to formulate lifelong tactile recognition in the real world. In the future, we will explore the generalization of the proposed method to realize active continual learning in the open world.</p>
</sec>
<sec sec-type="data-availability" id="s7">
<title>Data availability statement</title>
<p>The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.</p>
</sec>
<sec sec-type="author-contributions" id="s8">
<title>Author contributions</title>
<p>WZ and HL proposed the basic idea of this method and completed theoretical modeling. DG performed the experiments analysis and revised the manuscript. FS provided overall supervision of this research. All authors contributed to the article and approved the submitted version.</p>
</sec>
</body>
<back>
<sec sec-type="funding-information" id="s9">
<title>Funding</title>
<p>This work was supported by the National Natural Science Fund for Key International Collaboration (62120106005) and partially funded by the China Postdoctoral Science Foundation under Grant 2022M711825.</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s10">
<title>Publisher&#x00027;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Abderrahmane</surname> <given-names>Z.</given-names></name> <name><surname>Ganesh</surname> <given-names>G.</given-names></name> <name><surname>Crosnier</surname> <given-names>A.</given-names></name> <name><surname>Cherubini</surname> <given-names>A.</given-names></name></person-group> (<year>2018</year>). <article-title>Haptic zero: recognition of objects never touched before</article-title>. <source>Rob. Auton. Syst</source>. <volume>105</volume>, <fpage>11</fpage>&#x02013;<lpage>25</lpage>. <pub-id pub-id-type="doi">10.1016/j.robot.2018.03.002</pub-id></citation>
</ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Abderrahmane</surname> <given-names>Z.</given-names></name> <name><surname>Ganesh</surname> <given-names>G.</given-names></name> <name><surname>Crosnier</surname> <given-names>A.</given-names></name> <name><surname>Cherubini</surname> <given-names>A.</given-names></name></person-group> (<year>2019</year>). <article-title>A deep learning framework for tactile recognition of known as well as novel objects</article-title>. <source>IEEE Trans. Ind. Inform</source>. <volume>16</volume>, <fpage>423</fpage>&#x02013;<lpage>432</lpage>. <pub-id pub-id-type="doi">10.1109/TII.2019.2898264</pub-id></citation>
</ref>
<ref id="B3">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Bendale</surname> <given-names>A.</given-names></name> <name><surname>Boult</surname> <given-names>T.</given-names></name></person-group> (<year>2015</year>). <article-title>&#x0201C;Towards open world recognition,&#x0201D;</article-title> in <source>Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition</source> (<publisher-loc>Boston, MA</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>1893</fpage>&#x02013;<lpage>1902</lpage>.</citation>
</ref>
<ref id="B4">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Bendale</surname> <given-names>A.</given-names></name> <name><surname>Boult</surname> <given-names>T. E.</given-names></name></person-group> (<year>2016</year>). <article-title>&#x0201C;Towards open set deep networks,&#x0201D;</article-title> in <source>Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition</source> (<publisher-loc>Las Vegas, NV</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>1563</fpage>&#x02013;<lpage>1572</lpage>.</citation>
</ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chu</surname> <given-names>V.</given-names></name> <name><surname>McMahon</surname> <given-names>I.</given-names></name> <name><surname>Riano</surname> <given-names>L.</given-names></name> <name><surname>McDonald</surname> <given-names>C. G.</given-names></name> <name><surname>He</surname> <given-names>Q.</given-names></name> <name><surname>Perez-Tejada</surname> <given-names>J. M.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Robotic learning of haptic adjectives through physical interaction</article-title>. <source>Rob. Auton. Syst</source>. <volume>63</volume>, <fpage>279</fpage>&#x02013;<lpage>292</lpage>. <pub-id pub-id-type="doi">10.1016/j.robot.2014.09.021</pub-id></citation>
</ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Deng</surname> <given-names>C.</given-names></name> <name><surname>Han</surname> <given-names>Y.</given-names></name> <name><surname>Zhao</surname> <given-names>B.</given-names></name></person-group> (<year>2019</year>). <article-title>High-performance visual tracking with extreme learning machine framework</article-title>. <source>IEEE Trans. Cybern</source>. <volume>50</volume>, <fpage>2781</fpage>&#x02013;<lpage>2792</lpage>. <pub-id pub-id-type="doi">10.1109/TCYB.2018.2886580</pub-id><pub-id pub-id-type="pmid">30624237</pub-id></citation></ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Deng</surname> <given-names>C.</given-names></name> <name><surname>Jing</surname> <given-names>D.</given-names></name> <name><surname>Han</surname> <given-names>Y.</given-names></name> <name><surname>Wang</surname> <given-names>S.</given-names></name> <name><surname>Wang</surname> <given-names>H.</given-names></name></person-group> (<year>2022</year>). <article-title>Far-net: fast anchor refining for arbitrary-oriented object detection</article-title>. <source>IEEE Geosci. Remote Sens. Lett</source>. <volume>19</volume>, <fpage>1</fpage>&#x02013;<lpage>5</lpage>. <pub-id pub-id-type="doi">10.1109/LGRS.2022.3144513</pub-id></citation>
</ref>
<ref id="B8">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Deng</surname> <given-names>J.</given-names></name> <name><surname>Dong</surname> <given-names>W.</given-names></name> <name><surname>Socher</surname> <given-names>R.</given-names></name> <name><surname>Li</surname> <given-names>L.-J.</given-names></name> <name><surname>Li</surname> <given-names>K.</given-names></name> <name><surname>Fei-Fei</surname> <given-names>L.</given-names></name></person-group> (<year>2009</year>). <article-title>&#x0201C;Imagenet: a large-scale hierarchical image database,&#x0201D;</article-title> in <source>2009 IEEE Conference on Computer Vision and Pattern Recognition</source> (<publisher-loc>Miami, FL</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>248</fpage>&#x02013;<lpage>255</lpage>.</citation>
</ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dhamija</surname> <given-names>A. R.</given-names></name> <name><surname>G&#x000FC;nther</surname> <given-names>M.</given-names></name> <name><surname>Boult</surname> <given-names>T. E.</given-names></name></person-group> (<year>2018</year>). <article-title>Reducing network agnostophobia</article-title>. <source>arXiv preprint arXiv:1811.04110</source>. <pub-id pub-id-type="doi">10.48550/arXiv.1811.04110</pub-id></citation>
</ref>
<ref id="B10">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Gao</surname> <given-names>Y.</given-names></name> <name><surname>Hendricks</surname> <given-names>L. A.</given-names></name> <name><surname>Kuchenbecker</surname> <given-names>K. J.</given-names></name> <name><surname>Darrell</surname> <given-names>T.</given-names></name></person-group> (<year>2016</year>). <article-title>&#x0201C;Deep learning for tactile understanding from visual and haptic data,&#x0201D;</article-title> in <source>2016 IEEE International Conference on Robotics and Automation (ICRA)</source> (<publisher-loc>IEEE</publisher-loc>), <fpage>536</fpage>&#x02013;<lpage>543</lpage>.</citation>
</ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ge</surname> <given-names>Z.</given-names></name> <name><surname>Demyanov</surname> <given-names>S.</given-names></name> <name><surname>Chen</surname> <given-names>Z.</given-names></name> <name><surname>Garnavi</surname> <given-names>R.</given-names></name></person-group> (<year>2017</year>). <article-title>Generative openmax for multi-class open set classification</article-title>. <source>arXiv preprint arXiv:1707.07418</source>. <pub-id pub-id-type="doi">10.5244/C.31.42</pub-id></citation>
</ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Geng</surname> <given-names>C.</given-names></name> <name><surname>Huang</surname> <given-names>S.-J.</given-names></name> <name><surname>Chen</surname> <given-names>S.</given-names></name></person-group> (<year>2020</year>). <article-title>Recent advances in open set recognition: a survey</article-title>. <source>IEEE Trans. Pattern Anal. Mach. Intell</source>. <volume>43</volume>, <fpage>3614</fpage>&#x02013;<lpage>3631</lpage>. <pub-id pub-id-type="doi">10.1109/TPAMI.2020.2981604</pub-id><pub-id pub-id-type="pmid">32191881</pub-id></citation></ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gu</surname> <given-names>F.</given-names></name> <name><surname>Sng</surname> <given-names>W.</given-names></name> <name><surname>Taunyazov</surname> <given-names>T.</given-names></name> <name><surname>Soh</surname> <given-names>H.</given-names></name></person-group> (<year>2020</year>). <article-title>Tactilesgnet: a spiking graph neural network for event-based tactile object recognition</article-title>. <source>arXiv preprint arXiv:2008.08046</source>. <pub-id pub-id-type="doi">10.1109/IROS45743.2020.9341421</pub-id><pub-id pub-id-type="pmid">27295638</pub-id></citation></ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Guo</surname> <given-names>D.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>Fang</surname> <given-names>B.</given-names></name> <name><surname>Sun</surname> <given-names>F.</given-names></name> <name><surname>Yang</surname> <given-names>W.</given-names></name></person-group> (<year>2021</year>). <article-title>Visual affordance guided tactile material recognition for waste recycling</article-title>. <source>IEEE Trans. Autom. Sci. Eng</source>. <volume>19</volume>, <fpage>2656</fpage>&#x02013;<lpage>2664</lpage>. <pub-id pub-id-type="doi">10.1109/TASE.2021.3065991</pub-id></citation>
</ref>
<ref id="B15">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Guo</surname> <given-names>D.</given-names></name> <name><surname>Sun</surname> <given-names>F.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>Kong</surname> <given-names>T.</given-names></name> <name><surname>Fang</surname> <given-names>B.</given-names></name> <name><surname>Xi</surname> <given-names>N.</given-names></name></person-group> (<year>2017</year>). <article-title>&#x0201C;A hybrid deep architecture for robotic grasp detection,&#x0201D;</article-title> in <source>2017 IEEE International Conference on Robotics and Automation (ICRA)</source> (<publisher-loc>Singapore</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>1609</fpage>&#x02013;<lpage>1614</lpage>.</citation>
</ref>
<ref id="B16">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Han</surname> <given-names>Y.</given-names></name> <name><surname>Deng</surname> <given-names>C.</given-names></name> <name><surname>Zhang</surname> <given-names>Z.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Zhao</surname> <given-names>B.</given-names></name></person-group> (<year>2017</year>). <article-title>&#x0201C;Adaptive feature representation for visual tracking,&#x0201D;</article-title> in <source>2017 IEEE International Conference on Image Processing</source> (<publisher-loc>Beijing</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>1867</fpage>&#x02013;<lpage>1870</lpage>.</citation>
</ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Han</surname> <given-names>Y.</given-names></name> <name><surname>Deng</surname> <given-names>C.</given-names></name> <name><surname>Zhao</surname> <given-names>B.</given-names></name> <name><surname>Tao</surname> <given-names>D.</given-names></name></person-group> (<year>2019</year>). <article-title>State-aware anti-drift object tracking</article-title>. <source>IEEE Trans. Image Process</source>. <volume>28</volume>, <fpage>4075</fpage>&#x02013;<lpage>4086</lpage>. <pub-id pub-id-type="doi">10.1109/TIP.2019.2905984</pub-id><pub-id pub-id-type="pmid">30892207</pub-id></citation></ref>
<ref id="B18">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>He</surname> <given-names>K.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Ren</surname> <given-names>S.</given-names></name> <name><surname>Sun</surname> <given-names>J.</given-names></name></person-group> (<year>2016</year>). <article-title>&#x0201C;Deep residual learning for image recognition,&#x0201D;</article-title> in <source>Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition</source> (<publisher-loc>Las Vegas, NV</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>770</fpage>&#x02013;<lpage>778</lpage>.<pub-id pub-id-type="pmid">32166560</pub-id></citation></ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>He</surname> <given-names>W.</given-names></name> <name><surname>Xue</surname> <given-names>C.</given-names></name> <name><surname>Yu</surname> <given-names>X.</given-names></name> <name><surname>Li</surname> <given-names>Z.</given-names></name> <name><surname>Yang</surname> <given-names>C.</given-names></name></person-group> (<year>2020</year>). <article-title>Admittance-based controller design for physical human-robot interaction in the constrained task space</article-title>. <source>IEEE Trans. Autom. Sci. Eng</source>. <volume>17</volume>, <fpage>1937</fpage>&#x02013;<lpage>1949</lpage>. <pub-id pub-id-type="doi">10.1109/TASE.2020.2983225</pub-id></citation>
</ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hendrycks</surname> <given-names>D.</given-names></name> <name><surname>Gimpel</surname> <given-names>K.</given-names></name></person-group> (<year>2016</year>). <article-title>A baseline for detecting misclassified and out-of-distribution examples in neural networks</article-title>. <source>arXiv preprint arXiv:1610.02136</source>. <pub-id pub-id-type="doi">10.48550/arXiv.1610.02136</pub-id></citation>
</ref>
<ref id="B21">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Ji</surname> <given-names>M.</given-names></name> <name><surname>Fang</surname> <given-names>L.</given-names></name> <name><surname>Zheng</surname> <given-names>H.</given-names></name> <name><surname>Strese</surname> <given-names>M.</given-names></name> <name><surname>Steinbach</surname> <given-names>E.</given-names></name></person-group> (<year>2015</year>). <article-title>&#x0201C;Preprocessing-free surface material classification using convolutional neural networks pretrained by sparse autoencoder,&#x0201D;</article-title> in <source>2015 IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP)</source> (<publisher-loc>Boston, MA</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>1</fpage>&#x02013;<lpage>6</lpage>.</citation>
</ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kerr</surname> <given-names>E.</given-names></name> <name><surname>McGinnity</surname> <given-names>T. M.</given-names></name> <name><surname>Coleman</surname> <given-names>S.</given-names></name></person-group> (<year>2018</year>). <article-title>Material recognition using tactile sensing</article-title>. <source>Expert. Syst. Appl</source>. <volume>94</volume>, <fpage>94</fpage>&#x02013;<lpage>111</lpage>. <pub-id pub-id-type="doi">10.1016/j.eswa.2017.10.045</pub-id></citation>
</ref>
<ref id="B23">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Kuchenbecker</surname> <given-names>K. J.</given-names></name> <name><surname>McMahan</surname> <given-names>W.</given-names></name> <name><surname>Landin</surname> <given-names>N.</given-names></name> <name><surname>Romano</surname> <given-names>J. M.</given-names></name></person-group> (<year>2010</year>). <article-title>&#x0201C;Dimensional reduction of high-frequencey accelerations for haptic rendering,&#x0201D;</article-title> in <source>Proceedings of EuroHaptics</source> (<publisher-loc>Amsterdam; Berlin</publisher-loc>: <publisher-name>springer</publisher-name>), <fpage>79</fpage>&#x02013;<lpage>86</lpage>.</citation>
</ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>Q.</given-names></name> <name><surname>Kroemer</surname> <given-names>O.</given-names></name> <name><surname>Su</surname> <given-names>Z.</given-names></name> <name><surname>Veiga</surname> <given-names>F. F.</given-names></name> <name><surname>Kaboli</surname> <given-names>M.</given-names></name> <name><surname>Ritter</surname> <given-names>H. J.</given-names></name></person-group> (<year>2020</year>). <article-title>A review of tactile information: perception and action through touch</article-title>. <source>IEEE Trans. Rob</source>. <volume>36</volume>, <fpage>1619</fpage>&#x02013;<lpage>1634</lpage>. <pub-id pub-id-type="doi">10.1109/TRO.2020.3003230</pub-id><pub-id pub-id-type="pmid">30618706</pub-id></citation></ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>Z.</given-names></name> <name><surname>Huang</surname> <given-names>B.</given-names></name> <name><surname>Ajoudani</surname> <given-names>A.</given-names></name> <name><surname>Yang</surname> <given-names>C.</given-names></name> <name><surname>Su</surname> <given-names>C.-Y.</given-names></name> <name><surname>Bicchi</surname> <given-names>A.</given-names></name></person-group> (<year>2018</year>). <article-title>Asymmetric bimanual control of dual-arm exoskeletons for human-cooperative manipulations</article-title>. <source>IEEE Trans. Rob</source>. <volume>34</volume>, <fpage>264</fpage>&#x02013;<lpage>271</lpage>. <pub-id pub-id-type="doi">10.1109/TRO.2017.2765334</pub-id></citation>
</ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>Sun</surname> <given-names>F.</given-names></name></person-group> (<year>2017</year>). <article-title>Material identification using tactile perception: a semantics-regularized dictionary learning method</article-title>. <source>IEEE/ASME Trans. Mechatron</source>. <volume>23</volume>, <fpage>1050</fpage>&#x02013;<lpage>1058</lpage>. <pub-id pub-id-type="doi">10.1109/TMECH.2017.2775208</pub-id></citation>
</ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>Yu</surname> <given-names>Y.</given-names></name> <name><surname>Sun</surname> <given-names>F.</given-names></name> <name><surname>Gu</surname> <given-names>J.</given-names></name></person-group> (<year>2016</year>). <article-title>Visual-tactile fusion for object recognition</article-title>. <source>IEEE Trans. Autom. Sci. Eng</source>. <volume>14</volume>, <fpage>996</fpage>&#x02013;<lpage>1008</lpage>. <pub-id pub-id-type="doi">10.1109/TASE.2016.2549552</pub-id></citation>
</ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Luo</surname> <given-names>S.</given-names></name> <name><surname>Bimbo</surname> <given-names>J.</given-names></name> <name><surname>Dahiya</surname> <given-names>R.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name></person-group> (<year>2017</year>). <article-title>Robotic tactile perception of object properties: a review</article-title>. <source>Mechatronics</source> <volume>48</volume>, <fpage>54</fpage>&#x02013;<lpage>67</lpage>. <pub-id pub-id-type="doi">10.1016/j.mechatronics.2017.11.002</pub-id><pub-id pub-id-type="pmid">8237454</pub-id></citation></ref>
<ref id="B29">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Meyer</surname> <given-names>J.</given-names></name> <name><surname>Eitel</surname> <given-names>A.</given-names></name> <name><surname>Brox</surname> <given-names>T.</given-names></name> <name><surname>Burgard</surname> <given-names>W.</given-names></name></person-group> (<year>2020</year>). <article-title>&#x0201C;Improving unimodal object recognition with multimodal contrastive learning,&#x0201D;</article-title> in <source>IEEE/RSJ International Conference on Intelligent Robots and Systems, Vol. 2</source> (<publisher-loc>Las Vegas, NV</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>4</fpage>.</citation>
</ref>
<ref id="B30">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Neal</surname> <given-names>L.</given-names></name> <name><surname>Olson</surname> <given-names>M.</given-names></name> <name><surname>Fern</surname> <given-names>X.</given-names></name> <name><surname>Wong</surname> <given-names>W.-K.</given-names></name> <name><surname>Li</surname> <given-names>F.</given-names></name></person-group> (<year>2018</year>). <article-title>&#x0201C;Open set learning with counterfactual images,&#x0201D;</article-title> in <source>Proceedings of the European Conference on Computer Vision</source> (<publisher-loc>Munich; Berlin</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>613</fpage>&#x02013;<lpage>628</lpage>.</citation>
</ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Qiao</surname> <given-names>H.</given-names></name> <name><surname>Chen</surname> <given-names>J.</given-names></name> <name><surname>Huang</surname> <given-names>X.</given-names></name></person-group> (<year>2021</year>). <article-title>A survey of brain-inspired intelligent robots: integration of vision, decision, motion control, and musculoskeletal systems</article-title>. <source>IEEE Trans. Cybern</source>. <volume>52</volume>, <fpage>11267</fpage>&#x02013;<lpage>11280</lpage>. <pub-id pub-id-type="doi">10.1109/TCYB.2021.3071312</pub-id><pub-id pub-id-type="pmid">33909584</pub-id></citation></ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Scheirer</surname> <given-names>W. J.</given-names></name> <name><surname>de Rezende Rocha</surname> <given-names>A.</given-names></name> <name><surname>Sapkota</surname> <given-names>A.</given-names></name> <name><surname>Boult</surname> <given-names>T. E.</given-names></name></person-group> (<year>2012</year>). <article-title>Toward open set recognition</article-title>. <source>IEEE Trans. Pattern Anal. Mach. Intell</source>. <volume>35</volume>, <fpage>1757</fpage>&#x02013;<lpage>1772</lpage>. <pub-id pub-id-type="doi">10.1109/TPAMI.2012.256</pub-id><pub-id pub-id-type="pmid">23682001</pub-id></citation></ref>
<ref id="B33">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Shu</surname> <given-names>L.</given-names></name> <name><surname>Xu</surname> <given-names>H.</given-names></name> <name><surname>Liu</surname> <given-names>B.</given-names></name></person-group> (<year>2017</year>). <article-title>&#x0201C;Doc: deep open classification of text documents,&#x0201D;</article-title> in <source>Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing</source> (<publisher-loc>Copenhagen, Stroudsburg</publisher-loc>: <publisher-name>ACL</publisher-name>), <fpage>2911</fpage>&#x02013;<lpage>2916</lpage>.</citation>
</ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Strese</surname> <given-names>M.</given-names></name> <name><surname>Schuwerk</surname> <given-names>C.</given-names></name> <name><surname>Iepure</surname> <given-names>A.</given-names></name> <name><surname>Steinbach</surname> <given-names>E.</given-names></name></person-group> (<year>2016</year>). <article-title>Multimodal feature-based surface material classification</article-title>. <source>IEEE Trans. Haptics</source>. <volume>10</volume>, <fpage>226</fpage>&#x02013;<lpage>239</lpage>. <pub-id pub-id-type="doi">10.1109/TOH.2016.2625787</pub-id><pub-id pub-id-type="pmid">27845677</pub-id></citation></ref>
<ref id="B35">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Sun</surname> <given-names>X.</given-names></name> <name><surname>Yang</surname> <given-names>Z.</given-names></name> <name><surname>Zhang</surname> <given-names>C.</given-names></name> <name><surname>Ling</surname> <given-names>K. -V.</given-names></name> <name><surname>Peng</surname> <given-names>G.</given-names></name></person-group> (<year>2020</year>). <article-title>&#x0201C;Conditional gaussian distribution learning for open set recognition,&#x0201D;</article-title> in <source>Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition</source> (<publisher-loc>Seattle, NJ</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>13480</fpage>&#x02013;<lpage>13489</lpage>.</citation>
</ref>
<ref id="B36">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Taunyazov</surname> <given-names>T.</given-names></name> <name><surname>Koh</surname> <given-names>H. F.</given-names></name> <name><surname>Wu</surname> <given-names>Y.</given-names></name> <name><surname>Cai</surname> <given-names>C.</given-names></name> <name><surname>Soh</surname> <given-names>H.</given-names></name></person-group> (<year>2019</year>). <article-title>&#x0201C;Towards effective tactile identification of textures using a hybrid touch approach,&#x0201D;</article-title> in <source>2019 International Conference on Robotics and Automation (ICRA)</source> (<publisher-loc>Montreal, QC</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>4269</fpage>&#x02013;<lpage>4275</lpage>.</citation>
</ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>W.</given-names></name> <name><surname>Han</surname> <given-names>Y.</given-names></name> <name><surname>Deng</surname> <given-names>C.</given-names></name> <name><surname>Li</surname> <given-names>Z.</given-names></name></person-group> (<year>2022</year>). <article-title>Hyperspectral image classification via deep structure dictionary learning</article-title>. <source>Remote Sens</source>. 14, 2266. <pub-id pub-id-type="doi">10.3390/rs14092266</pub-id></citation>
</ref>
<ref id="B38">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Wen</surname> <given-names>Y.</given-names></name> <name><surname>Zhang</surname> <given-names>K.</given-names></name> <name><surname>Li</surname> <given-names>Z.</given-names></name> <name><surname>Qiao</surname> <given-names>Y.</given-names></name></person-group> (<year>2016</year>). <article-title>&#x0201C;A discriminative feature learning approach for deep face recognition,&#x0201D;</article-title> in <source>European Conference on Computer Vision</source> (<publisher-loc>Glasgow; Berlin</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>499</fpage>&#x02013;<lpage>515</lpage>.</citation>
</ref>
<ref id="B39">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Xu</surname> <given-names>D.</given-names></name> <name><surname>Loeb</surname> <given-names>G. E.</given-names></name> <name><surname>Fishel</surname> <given-names>J. A.</given-names></name></person-group> (<year>2013</year>). <article-title>&#x0201C;Tactile identification of objects using bayesian exploration,&#x0201D;</article-title> in <source>2013 IEEE International Conference on Robotics and Automation</source> (<publisher-loc>Karlsruhe</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>3056</fpage>&#x02013;<lpage>3061</lpage>.<pub-id pub-id-type="pmid">22783186</pub-id></citation></ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>C.</given-names></name> <name><surname>Luo</surname> <given-names>J.</given-names></name> <name><surname>Liu</surname> <given-names>C.</given-names></name> <name><surname>Li</surname> <given-names>M.</given-names></name> <name><surname>Dai</surname> <given-names>S.-L.</given-names></name></person-group> (<year>2019</year>). <article-title>Haptics electromyography perception and learning enhanced intelligence for teleoperated robot</article-title>. <source>IEEE Trans. Autom. Sci. Eng</source>. <volume>16</volume>, <fpage>1512</fpage>&#x02013;<lpage>1521</lpage>. <pub-id pub-id-type="doi">10.1109/TASE.2018.2874454</pub-id></citation>
</ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yi</surname> <given-names>Z.</given-names></name> <name><surname>Xu</surname> <given-names>T.</given-names></name> <name><surname>Guo</surname> <given-names>S.</given-names></name> <name><surname>Shang</surname> <given-names>W.</given-names></name> <name><surname>Wu</surname> <given-names>X.</given-names></name></person-group> (<year>2020</year>). <article-title>Tactile surface roughness categorization with multineuron spike train distance</article-title>. <source>IEEE Trans. Autom. Sci. Eng</source>. <volume>18</volume>, <fpage>1835</fpage>&#x02013;<lpage>1845</lpage>. <pub-id pub-id-type="doi">10.1109/TASE.2020.3021742</pub-id></citation>
</ref>
<ref id="B42">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Yoshihashi</surname> <given-names>R.</given-names></name> <name><surname>Shao</surname> <given-names>W.</given-names></name> <name><surname>Kawakami</surname> <given-names>R.</given-names></name> <name><surname>You</surname> <given-names>S.</given-names></name> <name><surname>Iida</surname> <given-names>M.</given-names></name> <name><surname>Naemura</surname> <given-names>T.</given-names></name></person-group> (<year>2019</year>). <article-title>&#x0201C;Classification-reconstruction learning for open-set recognition,&#x0201D;</article-title> in <source>Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition</source> (<publisher-loc>Long Beach, CA</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>4016</fpage>&#x02013;<lpage>4025</lpage>.</citation>
</ref>
<ref id="B43">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Yuan</surname> <given-names>W.</given-names></name> <name><surname>Mo</surname> <given-names>Y.</given-names></name> <name><surname>Wang</surname> <given-names>S.</given-names></name> <name><surname>Adelson</surname> <given-names>E. H.</given-names></name></person-group> (<year>2018</year>). <article-title>&#x0201C;Active clothing material perception using tactile sensing and deep learning,&#x0201D;</article-title> in <source>2018 IEEE International Conference on Robotics and Automation (ICRA)</source> (<publisher-loc>Brisbane, QLD</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>4842</fpage>&#x02013;<lpage>4849</lpage>.</citation>
</ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>H.</given-names></name> <name><surname>Patel</surname> <given-names>V. M.</given-names></name></person-group> (<year>2016</year>). <article-title>Sparse representation-based open set recognition</article-title>. <source>IEEE Trans. Pattern. Anal. Mach. Intell</source>. <volume>39</volume>, <fpage>1690</fpage>&#x02013;<lpage>1696</lpage>. <pub-id pub-id-type="doi">10.1109/TPAMI.2016.2613924</pub-id><pub-id pub-id-type="pmid">28114060</pub-id></citation></ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zheng</surname> <given-names>W.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>Sun</surname> <given-names>F.</given-names></name></person-group> (<year>2020a</year>). <article-title>Lifelong visual-tactile cross-modal learning for robotic material perception</article-title>. <source>IEEE Trans. Neural Netw. Learn. Syst</source>. <volume>32</volume>, <fpage>1192</fpage>&#x02013;<lpage>1203</lpage>. <pub-id pub-id-type="doi">10.1109/TNNLS.2020.2980892</pub-id><pub-id pub-id-type="pmid">32275626</pub-id></citation></ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zheng</surname> <given-names>W.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>Wang</surname> <given-names>B.</given-names></name> <name><surname>Sun</surname> <given-names>F.</given-names></name></person-group> (<year>2019</year>). <article-title>Cross-modal surface material retrieval using discriminant adversarial learning</article-title>. <source>IEEE Trans. Ind. Inform</source>. <volume>15</volume>, <fpage>4978</fpage>&#x02013;<lpage>4987</lpage>. <pub-id pub-id-type="doi">10.1109/TII.2019.2895602</pub-id></citation>
</ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zheng</surname> <given-names>W.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>Wang</surname> <given-names>B.</given-names></name> <name><surname>Sun</surname> <given-names>F.</given-names></name></person-group> (<year>2020b</year>). <article-title>Cross-modal material perception for novel objects: a deep adversarial learning method</article-title>. <source>IEEE Trans. Autom. Sci. Eng</source>. <volume>17</volume>, <fpage>697</fpage>&#x02013;<lpage>707</lpage>. <pub-id pub-id-type="doi">10.1109/TASE.2019.2941230</pub-id></citation>
</ref>
<ref id="B48">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>D.-W.</given-names></name> <name><surname>Ye</surname> <given-names>H.-J.</given-names></name> <name><surname>Zhan</surname> <given-names>D.-C.</given-names></name></person-group> (<year>2021</year>). <article-title>&#x0201C;Learning placeholders for open-set recognition,&#x0201D;</article-title> in <source>Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition</source> (<publisher-loc>Nashville, TN</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>4401</fpage>&#x02013;<lpage>4410</lpage>.</citation>
</ref>
</ref-list> 
</back>
</article> 