<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Archiving and Interchange DTD v2.3 20070202//EN" "archivearticle.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="methods-article" dtd-version="2.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Plant Sci.</journal-id>
<journal-title>Frontiers in Plant Science</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Plant Sci.</abbrev-journal-title>
<issn pub-type="epub">1664-462X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpls.2022.897883</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Plant Science</subject>
<subj-group>
<subject>Methods</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>A novel heuristic target-dependent neural architecture search method with small samples</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Fu</surname>
<given-names>Leiyang</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1632911"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Li</surname>
<given-names>Shaowen</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="author-notes" rid="fn001">
<sup>*</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Rao</surname>
<given-names>Yuan</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Liang</surname>
<given-names>Jinxin</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Teng</surname>
<given-names>Jie</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>He</surname>
<given-names>Quanling</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>School of Information and Computer Science, Anhui Agricultural University</institution>, <addr-line>Hefei, Anhui</addr-line>, <country>China</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>Department of Science and Technology, Anhui Provincial Key Laboratory of Smart Agricultural Technology and Equipment</institution>, <addr-line>Hefei, Anhui</addr-line>, <country>China</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>Edited by: Lei Shu, Nanjing Agricultural University, China</p>
</fn>
<fn fn-type="edited-by">
<p>Reviewed by: Jun Meng, Dalian University of Technology, China; Lifu Gao, Hefei Institutes of Physical Science (CAS), China; Chuanlei Zhang, Tianjin University of Science and Technology, China</p>
</fn>
<fn fn-type="corresp" id="fn001">
<p>*Correspondence: Shaowen Li, <email xlink:href="mailto:shwli@ahau.edu.cn">shwli@ahau.edu.cn</email>
</p>
</fn>
<fn fn-type="other" id="fn002">
<p>This article was submitted to Sustainable and Intelligent Phytoprotection, a section of the journal Frontiers in Plant Science</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>07</day>
<month>11</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>13</volume>
<elocation-id>897883</elocation-id>
<history>
<date date-type="received">
<day>16</day>
<month>03</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>11</day>
<month>10</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2022 Fu, Li, Rao, Liang, Teng and He</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Fu, Li, Rao, Liang, Teng and He</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>It is well known that crop classification is essential for genetic resources and phenotype development. Compared with traditional methods, convolutional neural networks can be utilized to identify features automatically. Nevertheless, crops and scenarios are quite complex, which makes it challenging to develop a universal classification method. Furthermore, manual design demands professional knowledge and is time-consuming and labor-intensive. In contrast, auto-search can create network architectures when faced with new species. Using rapeseed images for experiments, we collected eight types to build datasets (rapeseed dataset (RSDS)). In addition, we proposed a novel target-dependent search method based on VGGNet (target-dependent neural architecture search (TD-NAS)). The result shows that test accuracy does not differ significantly between small and large samples. Therefore, the influence of the dataset size on generalization is limited. Moreover, we used two additional open datasets (Pl@ntNet and ICL-Leaf) to test and prove the effectiveness of our method due to three notable features: (a) small sample sizes, (b) stable generalization, and (c) free of unpromising detections.</p>
</abstract>
<kwd-group>
<kwd>crop classification</kwd>
<kwd>target-dependent</kwd>
<kwd>neural architecture search</kwd>
<kwd>small samples</kwd>
<kwd>Bayesian optimization</kwd>
</kwd-group>
<counts>
<fig-count count="8"/>
<table-count count="6"/>
<equation-count count="6"/>
<ref-count count="66"/>
<page-count count="13"/>
<word-count count="5501"/>
</counts>
</article-meta>
</front>
<body>
<sec id="s1" sec-type="intro">
<title>Introduction</title>
<p>Image classification can distinguish objects by color, texture, shape, and spatial relationship. It uses computers to analyze images and classify each pixel or region into several categories without human interpretation (<xref ref-type="bibr" rid="B59">Wang and Wang, 2019</xref>). The following are two agricultural scenarios. (a) Genetic resources: artificial recognition is time-consuming and near-impossible. Automatic species identification is significant for taxonomy. <xref ref-type="bibr" rid="B47">Purohit et&#xa0;al. (2016)</xref> studied a machine-learning method using leaf characteristics to recognize species. In recent years, deep learning, which can automatically extract features from original data, has dramatically improved classification performance (<xref ref-type="bibr" rid="B7">Barr&#xe9; et&#xa0;al., 2017</xref>; <xref ref-type="bibr" rid="B34">Lee and Chang, 2017</xref>; <xref ref-type="bibr" rid="B44">Pawara et&#xa0;al., 2017</xref>). Some studies propose associating machine learning with neural networks (<xref ref-type="bibr" rid="B22">Fu et&#xa0;al., 2016</xref>; <xref ref-type="bibr" rid="B36">Li et&#xa0;al., 2020</xref>). (b) Phenotype development: a phenotype is a characteristic or combination of an organism influenced by the genotype and by the environment. Usually, plants grow in a highly variable environment. More accurate and robust algorithms are needed to deal with complex backgrounds and quantify phenotypic characteristics. They can distinguish different components or even instances. <xref ref-type="bibr" rid="B35">Li et&#xa0;al. (2020)</xref> thoroughly reviewed phenotyping technologies and used machine vision to measure plant stress.</p>
<p>There exist several machine-learning methods such as support vector machines (SVMs) (<xref ref-type="bibr" rid="B50">Rumpf et&#xa0;al., 2010</xref>), k-nearest neighbor (KNN) (<xref ref-type="bibr" rid="B48">Rahaman et&#xa0;al., 2019</xref>), random forest (RF) (<xref ref-type="bibr" rid="B43">Mohana et&#xa0;al., 2021</xref>), and scale-invariant feature transform (SIFT) (<xref ref-type="bibr" rid="B41">Lowe, 2004</xref>). However, they have some shortcomings: (a) Classifiers are simple, and the effect of recognition is underperforming. (b) Manual design requires professional knowledge, so it is hard for researchers. Deep learning has rapidly developed in recent years because it can automatically extract features (<xref ref-type="bibr" rid="B33">Lecun et&#xa0;al., 2015</xref>) and has achieved excellent performance in vision tasks (<xref ref-type="bibr" rid="B2">Amara et&#xa0;al., 2017</xref>). Nevertheless, due to a large number of parameters, designing a good neural network is still a hard task (<xref ref-type="bibr" rid="B53">Suganuma et&#xa0;al., 2017</xref>): (a) People made models manually in the early days, which required a lot of professional knowledge (<xref ref-type="bibr" rid="B55">Sun et&#xa0;al., 2020</xref>). (b) Neural networks are problem-oriented, whereas manually designed architectures are not. Two main factors affect the performance of neural networks: hyperparameters and training parameters. Training parameters can be learned in the training stage. However, hyperparameters must be set before training. Usually, hyperparameters determine the structure of a network, such as the number and type of layers, kinds of nodes, etc. We hope to find the best hyperparameters for a given dataset in a reasonable time. This process is called hyperparametric optimization. Researchers proved that a gradient descent algorithm is significantly effective for calculating training parameters (<xref ref-type="bibr" rid="B49">Rumelhart et&#xa0;al., 1986</xref>). In contrast, there are no explicit strategies for optimizing hyperparameters (<xref ref-type="bibr" rid="B39">Liu et&#xa0;al., 2020</xref>).</p>
<p>Most neural networks can be divided into three types: (a) professional knowledge is required for manual design, such as VGG (<xref ref-type="bibr" rid="B19">Ferentinos, 2018</xref>) and ResNet (<xref ref-type="bibr" rid="B25">He et&#xa0;al., 2016</xref>); (b) semi-automatic design methods like genetic neural networks (<xref ref-type="bibr" rid="B64">Xie and Yuille, 2017</xref>), hierarchical evolution (<xref ref-type="bibr" rid="B38">Liu et&#xa0;al., 2017</xref>), and others; and (c) fully automatic design, such as when Google introduces the neural architecture search (NAS) concept (<xref ref-type="bibr" rid="B66">Zoph and Le, 2016</xref>), which has received considerable attention (<xref ref-type="bibr" rid="B5">Baker et&#xa0;al., 2016</xref>; <xref ref-type="bibr" rid="B42">Lu et&#xa0;al., 2018</xref>). NAS can search for the best hyperparameters to perform better than manual design. In addition, NAS can reduce trials and errors remarkably. Although NAS is attractive, it still lacks interpretability. Furthermore, model training and verification are costly, and early stopping is meaningful for NAS (<xref ref-type="bibr" rid="B6">Baker et&#xa0;al., 2017</xref>; <xref ref-type="bibr" rid="B4">Awad et&#xa0;al., 2021</xref>).</p>
<p>This paper studies the hyperparameter optimization of deep learning, and its organization is as follows: (a) Firstly, we examine pertinent technologies. (b) Secondly, the proposed method, as well as the underlying principles, is explained. (c) We then carry out experiments and discuss the results. (d) Finally, conclusions and future research directions are provided.</p>
</sec>
<sec id="s2">
<title>Related works</title>
<sec id="s2_1">
<title>Deep learning and convolutional neural networks</title>
<p>Deep learning is a branch of machine learning (<xref ref-type="bibr" rid="B15">Deng and Yu, 2014</xref>). It is an automatic feature selection strategy based on neural networks. It can combine low-level features to form abstract high-level features without manual selection. Compared with traditional image recognition and target detection methods, the accuracy and generalization are improved. At present, the main types of neural networks are multilayer perceptrons (MLPs), convolutional neural networks (CNNs), and recurrent neural networks (RNNs), among which CNNs are the most widely used method in classification. Generally, CNNs comprise convolution layers, pooling layers, and fully connected layers. A convolution layer uses correlation information to extract features. A pooling layer (mean pooling or max pooling) compresses the amount of data and parameters, reduces overfitting, and keeps the model invariant to translation, rotation, and scaling. Each neuron (also named a node) in a fully connected layer connects with the previous neurons. Therefore, the multidimensional features are integrated and transformed into several dimensions for classification or detection purposes.</p>
<p>Typical CNN models include AlexNet (<xref ref-type="bibr" rid="B46">Picon et&#xa0;al., 2019</xref>), VGGNet, GoogLeNet (<xref ref-type="bibr" rid="B40">Liu et&#xa0;al., 2017</xref>), ResNet, MobileNet (<xref ref-type="bibr" rid="B26">Howard et&#xa0;al., 2017</xref>), etc. AlexNet is the champion network of the ILSVRC-2012, and it includes five convolution layers and three fully connected layers. According to <xref ref-type="bibr" rid="B9">Bengio et&#xa0;al. (2013)</xref>, deeper CNNs can extract more representative features. Later, researchers found that blindly increasing the number of layers would slow network convergence (<xref ref-type="bibr" rid="B23">Glorot and Bengio, 2010</xref>). Microsoft proposed RESNET with residual blocks and fast connections, which made it possible to build a deeper network (<xref ref-type="bibr" rid="B56">Szegedy et&#xa0;al., 2016</xref>). Google proposed MobileNet for mobile and embedded vision applications.</p>
<p>The rapid development of deep learning is inseparable from the extensive use of GPUs. Implementations of CNNs mostly require GPUs to provide computing support. CNN processes roughly include (a) data preparation and preprocessing; (b) model development, training, and testing; and (c) model deployment. Usually, a dataset can be divided into training, verification, and test sets, with ratios of 7:2:1, 8:1:1, and 6:2:2. Training sets are for learning parameters; verification sets are for optimizing and adjusting hyperparameters; and test sets evaluate performance and generalization. Some public datasets exist, such as PlantVillage (<xref ref-type="bibr" rid="B52">Sm et&#xa0;al., 2020</xref>), Kaggle (<xref ref-type="bibr" rid="B1">Ad et&#xa0;al</xref>), etc. It is worth noting that many researchers collect their own (<xref ref-type="bibr" rid="B37">Lin et&#xa0;al., 2018</xref>; <xref ref-type="bibr" rid="B13">Chen et&#xa0;al., 2020</xref>).</p>
</sec>
<sec id="s2_2">
<title>Data augmentation</title>
<p>The size and diversity of datasets are essential factors affecting the classification effect of CNNs. Data augmentation can expand the number of images, including moving, flipping, zooming, etc. Deep learning can learn features from images regardless of their positions. Therefore, we can expand datasets through augmentation to avoid overfitting. For example, Perez et&#xa0;al. (<xref ref-type="bibr" rid="B45">Perez and Wang, 2017</xref>) developed a new way to use generative adversarial networks (GANs) to make images in different styles.</p>
</sec>
<sec id="s2_3">
<title>Neural architecture search</title>
<p>Grid search (GS) is a simple method to find the optimal parameters. However, an exhaustive search may consume time due to the enormous hyperparameter space. Random search (RS) (<xref ref-type="bibr" rid="B3">Andonie and Florea, 2020</xref>) explores randomly in the hyperparameter space and improves the performance, but the result may be worse sometimes. For example, the result is not stable. Until now, researchers have proposed many NAS methods: (a) NAS methods based on reinforcement learning (RL); (b) NAS methods based on model optimization; and (c) other improved NAS methods.</p>
<sec id="s2_3_1">
<title>(a) NAS methods based on reinforcement learning</title>
<p>Researchers designed a controller to generate strings representing the structures of CNNs, trained each CNN model, and used verification set accuracy as a reward. They optimized the hyperparameters of DCNNs using a novel MARL-based approach (multiagent reinforcement learning) (<xref ref-type="bibr" rid="B29">Iranfar et al., 2022</xref>). They then created a multiobjective reward function and applied it to reinforcement learning in order to find the best network with the least latency (<xref ref-type="bibr" rid="B58">Tran and Bae, 2021</xref>).</p>
</sec>
<sec id="s2_3_2">
<title>(b) NAS methods based on model optimization</title>
<p>To improve the performance of neural architecture search, researchers proposed ENAs based on evolutionary computing (EC) (<xref ref-type="bibr" rid="B57">Thomas et&#xa0;al</xref>) to design CNN architectures. EC is a population-based technology to obtain an optimal global solution. There are some EC-based technologies, such as genetic algorithms (GAs) (<xref ref-type="bibr" rid="B14">Ching-Shih. Deb et&#xa0;al., 2002</xref>), particle swarm optimization (PSO) (<xref ref-type="bibr" rid="B32">Kennedy and Eberhart, 1995</xref>), and artificial ant colony algorithms (<xref ref-type="bibr" rid="B17">Dorigo et&#xa0;al., 2006</xref>). Researchers proposed a two-stage evolutionary search with transfer learning (EvoNAS-TL) (<xref ref-type="bibr" rid="B61">Wen et al., 2021</xref>). Also, EPSOCNN, which stands for efficient particle swarm optimization, is suggested as a way to improve CNN architectures (<xref ref-type="bibr" rid="B60">Wang et&#xa0;al., 2020</xref>).</p>
</sec>
<sec id="s2_3_3">
<title>(c) Other improved NAS methods</title>
<p>To limit the search space, <xref ref-type="bibr" rid="B65">Yu et&#xa0;al. (2021)</xref> and <xref ref-type="bibr" rid="B54">Sun et&#xa0;al. (2019)</xref> proposed block-based methods. However, the results are insufficient and unstable due to the lack of theoretical support. <xref ref-type="bibr" rid="B27">Hu et&#xa0;al. (2021)</xref> proposed a new performance estimation metric named random-weight evaluation (RWE) to quantify the quality of CNNs. <xref ref-type="bibr" rid="B42">Lu et&#xa0;al. (2018)</xref> proposed NSGANet, an evolutionary algorithm that combines prior knowledge from handcrafted architectures with an exploration comprising crossover and mutation. Some software packages provide search functions, such as pyGPGO and Optunity (<xref ref-type="bibr" rid="B10">Bergstra et&#xa0;al., 2011</xref>), Hyperopt&#x2013;Sklearn (<xref ref-type="bibr" rid="B11">Bergstra et&#xa0;al., 2015</xref>), etc.</p>
<p>Many CNN models are challenging to apply on mobile/edge devices due to limited resources such as memory capacity and power consumption. Researchers have carefully designed some lightweight networks. <xref ref-type="bibr" rid="B16">Donegan et&#xa0;al. (2021)</xref> used a differentiable NAS method to find efficient CNNs for the Intel Movidius Vision Processing Unit (VPU), achieving state-of-the-art accuracy on ImageNet. An FPGA-based CNN accelerator (field programmable gate array) was proposed (<xref ref-type="bibr" rid="B18">Fan et&#xa0;al., 2020</xref>) with an accurate performance model of hardware design. Intelligent edge-cloud scenarios are expected to meet diverse requirements.</p>
<p>All the above studies do not effectively record evolutionary information, so they cannot guide the whole search process based on experience. In contrast, Bayesian optimization (<xref ref-type="bibr" rid="B21">Frazier, 2018</xref>) assumes the search space as a Gaussian distribution, learns experience in search processes, and calculates better parameters iteratively (<xref ref-type="bibr" rid="B62">Wistuba</xref>; <xref ref-type="bibr" rid="B24">Gupta et&#xa0;al., 2017</xref>; <xref ref-type="bibr" rid="B30">Ji et&#xa0;al., 2019</xref>; <xref ref-type="bibr" rid="B8">Basha et&#xa0;al., 2021</xref>). However, these methods still cost enormous computing resources. As a result, this paper suggests a heuristic target-dependent method that only needs small samples and is entirely automatic.</p>
</sec>
</sec>
<sec id="s2_4">
<title>Proposed method</title>
<p>From the above, we know that neural architecture search is time-consuming and requires many resources. Therefore, we intend to optimize the hyperparameter exploration process. In short, the main contributions of this paper are as follows:</p>
<list list-type="alpha-lower">
<list-item>
<p>Method: We proposed a target-dependent search method that only needs small samples. Besides accuracy, we use precision and recall to promote generalization. Also, our method can find searches that are not working well and stop them early to save time and resources.</p>
</list-item>
<list-item>
<p>Dataset: We collected eight kinds of rapeseed images and created the dataset RSDS.</p>
</list-item>
<list-item>
<p>Comparison: Horizontally, we compare each TD-NAS based on VGGNets. Vertically, we explore the TD-NAS based on VGGNet-D (VGGNet-16). Furthermore, we test our method on two additional open datasets, Pl@ntNet and ICL-Leaf.</p>
</list-item>
</list>
</sec>
<sec id="s2_5">
<title>Infrastructure and hyperparameters</title>
<p>As for the primary network architecture, we chose VGGNet, a typical convolutional network with six deepening structures labeled A, A-LRN, B, C, D, and E. Here we select A, B, D, and E for experiments, and D is the most famous model named VGGNet-16 (<xref ref-type="fig" rid="f1">
<bold>Figure&#xa0;1</bold>
</xref>).</p>
<fig id="f1" position="float">
<label>Figure&#xa0;1</label>
<caption>
<p>VGGNet-16.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-13-897883-g001.tif"/>
</fig>
<p>We keep the number and position of convolution and pooling layers fixed, while the layer number, the dropout rate, and the neuron number of fully connected layers can be changed. <xref ref-type="table" rid="T1">
<bold>Table&#xa0;1</bold>
</xref> shows the hyperparameter space. It is worth noting that there is at least one fully connected layer, and the number of neurons in the last fully connected layer is eight to produce the output&#x2014;eight classes of rapeseed.</p>
<table-wrap id="T1" position="float">
<label>Table&#xa0;1</label>
<caption>
<p>Hyperparameter space.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top" align="left">Level</th>
<th valign="top" align="center">Layer</th>
<th valign="top" align="center">Name</th>
<th valign="top" align="center">Values</th>
<th valign="top" align="center">Type</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Architectural</td>
<td valign="top" align="center">
<italic>FC</italic>
</td>
<td valign="top" align="center">
<italic>LN</italic>
</td>
<td valign="top" align="center">{1, 2, 3}</td>
<td valign="top" align="left">Integer</td>
</tr>
<tr>
<td valign="top" align="left">Internal</td>
<td valign="top" align="center">
<italic>FC</italic>
</td>
<td valign="top" align="center">
<italic>DR</italic>
</td>
<td valign="top" align="center">{0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9}</td>
<td valign="top" align="left">Float</td>
</tr>
<tr>
<td valign="top" align="left">Internal</td>
<td valign="top" align="center">
<italic>FC</italic>
</td>
<td valign="top" align="center">
<italic>NN</italic>
</td>
<td valign="top" align="center">{512, 1,024, 2,048, 4,096}</td>
<td valign="top" align="left">Integer</td>
</tr>
<tr>
<td valign="top" align="left">External</td>
<td valign="top" align="center">
<italic>OU</italic>
</td>
<td valign="top" align="center">&#x3c9;<sub>2 =</sub> 1- &#x3c9;<sub>1</sub>
</td>
<td valign="top" align="center">{1.0, 0.9, 0.8, 0.7, 0.6, 0.5}</td>
<td valign="top" align="left">Float</td>
</tr>
<tr>
<td valign="top" align="left">External</td>
<td valign="top" align="center">
<italic>OU</italic>
</td>
<td valign="top" align="center">&#x3b2;</td>
<td valign="top" align="center">{0.5, 1.0, 2.0}</td>
<td valign="top" align="left">Float</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>FC, fully connected; LN, layer number; DR, dropout rate; NN, neuron number; OU, output. &#x3c9;, &#x3b2;: parameters in formula (2) (discussed in the next section.)</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="s2_6">
<title>Search principle and Bayesian optimization</title>
<p>The aim is to find the hyperparameters of a model with the best performance on verification sets. Let <italic>T</italic> be the objective function for getting maximum accuracy (<italic>ACC</italic>).</p>
<disp-formula>
<label>(1)</label>
<mml:math display="block" id="M1">
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:msup>
<mml:mi>p</mml:mi>
<mml:mo>*</mml:mo>
</mml:msup>
<mml:mo>=</mml:mo>
<mml:munder>
<mml:mrow>
<mml:mtext>argmax</mml:mtext>
</mml:mrow>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mi>p</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mi>D</mml:mi>
</mml:mrow>
</mml:munder>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mi>p</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>In formula (1), <italic>D</italic> is a hyperparameter space. We can create a model for each <italic>hp</italic> in <italic>D</italic>, train the model, and evaluate its performance on verification sets. This paper separates the RSDS dataset into three parts: the training set, the validation set, and the test set, with a ratio of 7:2:1. We use formula (2) as the judgment criteria (<italic>jc</italic>) for evaluating model qualities:</p>
<disp-formula>
<label>(2)</label>
<mml:math display="block" id="M2">
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mi>c</mml:mi>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>&#x3c9;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>c</mml:mi>
<mml:mi>c</mml:mi>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>&#x3c9;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi>F</mml:mi>
<mml:mi>&#x3b2;</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</disp-formula>
<p>Here, <italic>F<sub>&#x3b2;</sub>
</italic> (balanced <italic>F</italic>-score) is the harmonic average of precision and recall. Usually, the smaller the F-score, the better the generalization. Therefore, we refine formula (1) to formula (3):</p>
<disp-formula>
<label>(3)</label>
<mml:math display="block" id="M3">
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:msup>
<mml:mi>p</mml:mi>
<mml:mo>*</mml:mo>
</mml:msup>
<mml:mo>=</mml:mo>
<mml:munder>
<mml:mrow>
<mml:mtext>argmax</mml:mtext>
</mml:mrow>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mi>p</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mi>D</mml:mi>
</mml:mrow>
</mml:munder>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mi>p</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mi>j</mml:mi>
<mml:mi>c</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>The following <xref ref-type="algorithm" rid="algo1">
<bold>Algorithm 1</bold>
</xref> gives the naive hyperparameter search process:</p>
<statement id="algo1">
<label>Algorithm 1 HPS: Hyperparameters Search.</label>
<p>
<preformat>
<bold>Input:</bold> Space_D, Data_T, Data_VOutput: hp* (1) Get the training set (Data_T) and the verification set (Data_V). (2) Select a judgment criterion: <italic>ACC.</italic> (3) Set hyperparameter search space (Space_D) and initialize one hyperparameter (hp): (4) Generate a CNN model with the hp. (5) Train the model, and verify it. (6) Select the next hyperparameter (hp), repeat (4), or Quit. (7) Output the optimal hyperparameter (hp*).
</preformat>
</p>
</statement>
<p>
<xref ref-type="algorithm" rid="algo1">
<bold>Algorithm 1</bold>
</xref> does not record evolutionary information, so it cannot guide search processes effectively. Even if adopting a random search, uncertainty still exists.</p>
<p>We propose a heuristic target-dependent search method. The so-called heuristic means our approach can evaluate a better location and start a new search. Here, we choose Bayesian optimization, which assumes the superparameter space as a Gaussian distribution and obtains better candidates each time. Our method introduces a stop criterion to reduce the search scale without lowering generalization. The so-called target-dependent means that the explored architecture is not universal and only valid for specific crops. We can quickly rerun the proposed method to search out new architectures when facing new species.</p>
<p>Bayesian optimization has two components: (a) Bayesian statistics for constructing an objective function (typically a Gaussian process); and (b) acquisition function for calculating the following sampling points. After initializing several points, Bayesian optimization can calculate <italic>a posteriori</italic> and iterate reasoning until meeting an exit condition. Algorithm 2 shows our TD-NAS method based on Bayesian optimization, where GP is a Gaussian process, Acq_F is an acquisition function, and Dyn_QF is a dynamic quit function. In Algorithm 2, we focus on steps (6), (4), and (3). Step (4) costs massive resources for VGGNet training and verifying, so step (6) should select hyperparameters elegantly to reduce the number of models. Furthermore, step (4) should stop the training and verifying processes when there are unpromising detections. Step (3) checks the dynamic quit conditions and decides whether to quit or not.</p>
<statement id="algo2">
<label>Algorithm 2 TD-NAS: Target-dependent neural architecture search.</label>
<p>
<preformat>
<bold>Input:</bold> VGGNet, Space_D, Data_T, Data_V, GP, Acq_F, Dyn_QFOutput: x* (1) Init S = {(x<sub>i</sub>, y<sub>i</sub>)}, y<sub>i</sub> = f(x<sub>i</sub>), x<sub>i</sub> &#x3f5; <italic>Space_D</italic>, let f~GP(&#x3bc;, K). (2) Select a judgment criterion: <italic>jc</italic>. (3) While not Dyn_QF() do: (4) Train and verify VGGNet(x, Data_T, Data_V) with unpromising detections. (5) Calculate p(y | x, S). (6) Acq_F(x, p(y | x, S)), get x<sub>new</sub>. (7) y<sub>new</sub> = f(x<sub>new</sub>). (8) S = S U (x<sub>new</sub>, y<sub>new</sub>). (9) Output the optimal hyperparameter (x*).
</preformat>
</p>
</statement>
<p>Step (6): The acquisition function strikes a balance between exploration and exploitation.</p>
<p>In Bayesian optimization, the acquisition function (Acq_F) is critical for generating points according to prior knowledge. Exploitation means evaluating at expected points because global optima are likely to reside there. Exploration means considering uncertain points is helpful because objects tend to be far from where we have measured them. Usually, there are three typical acquisition functions: expected improvement (EI), entropy search (ES), and knowledge gradient (KG). The expected value of EI is easy to figure out, which makes it a popular acquisition function.</p>
<p>Let <inline-formula>
<mml:math display="inline" id="im1">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>n</mml:mi>
<mml:mo>*</mml:mo>
</mml:msubsup>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>max</mml:mtext>
</mml:mrow>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mo>&#x2264;</mml:mo>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi>f</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>m</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> be the max previous value. We have one other position, x, to be evaluated, and then we get f (x). Now, the best-observed point is either <italic>f(x)</italic> or <inline-formula>
<mml:math display="inline" id="im2">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>n</mml:mi>
<mml:mo>*</mml:mo>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>. The improvement is then <inline-formula>
<mml:math display="inline" id="im3">
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>n</mml:mi>
<mml:mo>*</mml:mo>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>; if this quantity is positive, else 0, mark as <inline-formula>
<mml:math display="inline" id="im4">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtext>f</mml:mtext>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mtext>x</mml:mtext>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>n</mml:mi>
<mml:mo>*</mml:mo>
</mml:msubsup>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>+</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> for convenience. Unfortunately, we should train and validate the entire network to get <italic>f(x)</italic>. Instead, we can take the expected value of this improvement and define formula (4):</p>
<disp-formula>
<label>(4)</label>
<mml:math display="block" id="M4">
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>argmaxEI</mml:mtext>
</mml:mrow>
<mml:mi>n</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>Here, <inline-formula>
<mml:math display="inline" id="im5">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>EI</mml:mtext>
</mml:mrow>
<mml:mi>n</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>E</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtext>f</mml:mtext>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mtext>x</mml:mtext>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>n</mml:mi>
<mml:mo>*</mml:mo>
</mml:msubsup>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>+</mml:mo>
</mml:msup>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, and <italic>E<sub>n</sub>
</italic> indicates that the expectation is taken under the posterior distribution, as shown in formula (5): <italic>f(x)</italic> given <italic>x</italic>
<sub>1:</sub>
<italic>
<sub>n</sub>
</italic>, <italic>y</italic>
<sub>1:</sub>
<italic>
<sub>n</sub>
</italic> is normally distributed with mean <italic>&#x3bc;<sub>n</sub>
</italic> (x) and variance <inline-formula>
<mml:math display="inline" id="im6">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>n</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mtext>x</mml:mtext>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
<disp-formula>
<label>(5)</label>
<mml:math display="block" id="M5">
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>|</mml:mo>
<mml:mi>f</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>:</mml:mo>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>~</mml:mo>
<mml:mtext>Normal</mml:mtext>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3bc;</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>n</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>Unlike the <italic>f(x)</italic> in step 4 of Algorithm 2, EI<italic>
<sub>n</sub>
</italic> (<italic>x</italic>) is low cost to observe and allows for easy evaluation of first- and second-order derivatives, as shown in formula (6):</p>
<disp-formula>
<label>(6)</label>
<mml:math display="block" id="M6">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>EI</mml:mtext>
</mml:mrow>
<mml:mi>n</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>&#x394;</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mtext>x</mml:mtext>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi>&#x3a6;</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>&#x394;</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mtext>x</mml:mtext>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mtext>x</mml:mtext>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mtext>x</mml:mtext>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi>&#x3c6;</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>&#x394;</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mtext>x</mml:mtext>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mtext>x</mml:mtext>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>The definitions of <italic>&#x3a6;</italic>(),<italic>&#x3c6;</italic>() can be found in (<xref ref-type="bibr" rid="B31">Jones et al., 1998</xref>). Here, <inline-formula>
<mml:math display="inline" id="im7">
<mml:mrow>
<mml:msub>
<mml:mi>&#x394;</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mtext>x</mml:mtext>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>&#x3bc;</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mtext>x</mml:mtext>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>n</mml:mi>
<mml:mo>*</mml:mo>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> is the expected difference between the proposed point <italic>x</italic> and the previous best. Note that EI<italic>
<sub>n</sub>
</italic> (<italic>x</italic>) balances between high expected quality (&#x394;<italic>
<sub>n</sub>
</italic> (<italic>x</italic>)) versus high uncertainty (&#x3c3;<italic>
<sub>n</sub>
</italic> (<italic>x</italic>)).</p>
<p>
<bold>Step (4):</bold> Training and validating the VGGNet without unpromising detections.</p>
<p>We calculate verification errors when training and verifying. If the error exceeds the average prior value, it is unpromising to go further. <xref ref-type="fig" rid="f2">
<bold>Figure&#xa0;2</bold>
</xref> shows five hyperparameters in search, including two unpromising detections. Stopping these unpromising detections early can save resources and time.</p>
<fig id="f2" position="float">
<label>Figure&#xa0;2</label>
<caption>
<p>Unpromising detections.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-13-897883-g002.tif"/>
</fig>
<p>
<bold>Step (3):</bold> Making dynamic quit decisions.</p>
<p>
<xref ref-type="fig" rid="f3">
<bold>Figure&#xa0;3</bold>
</xref> gives dynamic quit conditions and their generation approach. To control the search process, a dynamic quit function (Dyn_QF) uses these conditions, including whether the queue of the hyperparameter space is empty or the maximum number of iterations has been reached.</p>
<fig id="f3" position="float">
<label>Figure&#xa0;3</label>
<caption>
<p>Dynamic quit conditions. Generation approach.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-13-897883-g003.tif"/>
</fig>
</sec>
</sec>
<sec id="s3">
<title>Experiment and discussion</title>
<sec id="s3_1">
<title>Dataset and experimental condition</title>
<p>We took photos using a Canon EoS6D camera, which has 20.2 million effective pixels and a maximum resolution of 5,472 &#xd7; 3,648. We then resized each image to 224 &#xd7; 224 pixels to improve the processing speed. To run programs, we used an HP-OMEN laptop with an i9-9880H CPU, 32 GB of memory, an NVIDIA RTX2080 graphics card (8 GB), and Python installed.</p>
<p>We collected rapeseed images at an experimental station of Anhui Agricultural University, located at 117.2&#xb0; east longitude and 31.5&#xb0; north latitude, in Sanhe town, Hefei, China. We obtained eight kinds of rapeseed images, at least 1,000 of each class (<italic>C<sub>i</sub>
</italic>|<italic>i</italic> = 1,&#x2026;, 8), as shown in <xref ref-type="fig" rid="f4">
<bold>Figure&#xa0;4</bold>
</xref>, named RSDS. We divided the RSDS into three parts: training set (Tr), verification set (V), and test set (Te), with a ratio of 7:2:1. We randomly obtained <italic>RSDS-0.1K</italic> with (Tr, V, Te = 700, 200, 100) images per class. Using data augmentation, we obtained more sets as follows: <italic>RSDS-0.2K</italic> (Tr, V, Te = 1,400, 400, 200), <italic>RSDS-0.4K</italic> (Tr, V, Te = 2,800, 800, 400), and <italic>RSDS-1.0K</italic> (Tr, V, Te = 7,000, 2,000, 1,000).</p>
<fig id="f4" position="float">
<label>Figure&#xa0;4</label>
<caption>
<p>Rapeseed dataset (RSDS).</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-13-897883-g004.tif"/>
</fig>
</sec>
<sec id="s3_2">
<title>Result discussion</title>
<p>We use VGGNet as the base framework. When training, the initial learning rate is 0.01, and the epoch size is 50. For comparing two algorithms, &#x201c;better&#x201d; means (a) fewer attempts for the same score and (b) a higher score after the same number of tries.</p>
<p>For horizontal comparisons, (a) we set <italic>&#x3c9;</italic>
<sub>1</sub> = 1, which means only accuracy is the evaluation indicator. In <xref ref-type="table" rid="T2">
<bold>Table&#xa0;2</bold>
</xref>, the verification accuracy of TD-NAS based on VGGNet-16 reaches 81.38%. However, the result obtained on VGGNet-E (VGGNet-19) is worse than the original, indicating that Bayesian optimization also has limitations in dealing with deep networks. The number of neurons in the last fully connected layer fixes eight to output the probability values of rapeseed classes through a Softmax function. (b) Do not fix &#x3c9;<sub>1</sub>. Instead, use <italic>jc</italic> as the indicator in formula (2) (<xref ref-type="table" rid="T3">
<bold>Table&#xa0;3</bold>
</xref>).</p>
<table-wrap id="T2" position="float">
<label>Table&#xa0;2</label>
<caption>
<p>TD-NAS and original VGGNet (A, B, D, and E) (<italic>RSDS-0.1K</italic>, <italic>&#x3c9;</italic>
<sub>1</sub> = 1).</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top" align="left"/>
<th valign="top" colspan="2" align="center">VGGNet-A</th>
<th valign="top" colspan="2" align="center">VGGNet-B</th>
<th valign="top" colspan="2" align="center">VGGNet-D</th>
<th valign="top" colspan="2" align="center">VGGNet-E</th>
</tr>
<tr>
<th valign="top" align="left"/>
<th valign="top" align="center">Original</th>
<th valign="top" align="center">TD-NAS</th>
<th valign="top" align="center">Original</th>
<th valign="top" align="center">TD-NAS</th>
<th valign="top" align="center">Original</th>
<th valign="top" align="center">TD-NAS</th>
<th valign="top" align="center">Original</th>
<th valign="top" align="center">TD-NAS</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">
<italic>V-ACC</italic>
</td>
<td valign="top" align="center">74.06%</td>
<td valign="top" align="center">78.81%</td>
<td valign="top" align="center">77.38%</td>
<td valign="top" align="center">77.94%</td>
<td valign="top" align="center">79.19%</td>
<td valign="top" align="center">
<bold>81.38%</bold>
</td>
<td valign="top" align="center">80.75%</td>
<td valign="top" align="center">78.44%</td>
</tr>
<tr>
<td valign="top" colspan="9" align="left">
<italic>FC</italic>
</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>LN</italic>
</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>DR</italic>
</td>
<td valign="top" align="center">{0.7, 0.7, 0.0}</td>
<td valign="top" align="center">{0.6, 0.4, 0.0}</td>
<td valign="top" align="center">{0.7, 0.7, 0.0}</td>
<td valign="top" align="center">{0.7, 0.5, 0.0}</td>
<td valign="top" align="center">{0.7, 0.7, 0.0}</td>
<td valign="top" align="center">{0.6, 0.5, 0.0}</td>
<td valign="top" align="center">{0.7, 0.7, 0.0}</td>
<td valign="top" align="center">{0.5, 0.3, 0.0}</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>NN</italic>
</td>
<td valign="top" align="center">{4,096, 4,096, 8}</td>
<td valign="top" align="center">{2,048, 2,048, 8}</td>
<td valign="top" align="center">{4,096, 4,096, 8}</td>
<td valign="top" align="center">{2,048, 2,048, 8}</td>
<td valign="top" align="center">{4,096, 4,096, 8}</td>
<td valign="top" align="center">{4,096, 2,048, 8}</td>
<td valign="top" align="center">{4,096, 4,096, 8}</td>
<td valign="top" align="center">{2,048, 2,048, 8}</td>
</tr>
<tr>
<td valign="top" colspan="9" align="left">
<italic>OU</italic>
</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>&#x3c9;</italic>
<sub>1</sub>
</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>&#x3b2;</italic>
</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>V-ACC, verification accuracy; FC, fully connected; LN, layer number; DR, dropout rate; NN, neuron number; OU, output. Bold means To highlight the biggest verification accuracy (V-ACC).</p>
</fn>
<fn>
<p>Bold values mean to highlight the biggest verification accuracy (V-ACC).</p>
</fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="T3" position="float">
<label>Table&#xa0;3</label>
<caption>
<p>TD-NAS (<italic>RSDS-0.1K</italic>, <italic>&#x3c9;</italic>
<sub>1</sub> not fixed).</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top" align="left"/>
<th valign="top" colspan="4" align="center">TD-NAS</th>
</tr>
<tr>
<th valign="top" align="left"/>
<th valign="top" align="center">VGGNet-A</th>
<th valign="top" align="center">VGGNet-B</th>
<th valign="top" align="center">VGGNet-D</th>
<th valign="top" align="center">VGGNet-E</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">
<italic>V-ACC</italic>
</td>
<td valign="top" align="center">77.56%</td>
<td valign="top" align="center">75.13%</td>
<td valign="top" align="center">
<bold>81.06%</bold>
</td>
<td valign="top" align="center">79.94%</td>
</tr>
<tr>
<td valign="top" colspan="5" align="left">
<italic>FC</italic>
</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>LN</italic>
</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>DR</italic>
</td>
<td valign="top" align="center">{0.6, 0.4, 0.0}</td>
<td valign="top" align="center">{0.7, 0.5, 0.0}</td>
<td valign="top" align="center">{0.6, 0.5, 0.0}</td>
<td valign="top" align="center">{0.5, 0.3, 0.0}</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>NN</italic>
</td>
<td valign="top" align="center">{2,048, 2,048, 8}</td>
<td valign="top" align="center">{2,048, 2,048, 8}</td>
<td valign="top" align="center">{4,096, 2,048, 8}</td>
<td valign="top" align="center">{2,048, 2,048, 8}</td>
</tr>
<tr>
<td valign="top" colspan="5" align="left">
<italic>OU</italic>
</td>
</tr>
<tr>
<td valign="top" align="left">&#x3c9;<sub>1</sub>
</td>
<td valign="top" align="center">0.9</td>
<td valign="top" align="center">0.9</td>
<td valign="top" align="center">0.8</td>
<td valign="top" align="center">0.9</td>
</tr>
<tr>
<td valign="top" align="left">&#x3b2;</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0.5</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0.5</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>V-ACC, verification accuracy; FC, fully connected; LN, layer number; DR, dropout rate; NN, neuron number; OU, output. Bold means To highlight the biggest verification accuracy (V-ACC).</p>
</fn>
<fn>
<p>Bold values mean to highlight the biggest verification accuracy (V-ACC).</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>Keep the fully connected parameters from <xref ref-type="table" rid="T2">
<bold>Table&#xa0;2</bold>
</xref> unchanged and take into account <italic>&#x3c9;</italic>
<sub>1</sub> and <italic>&#x3b2;</italic>. After searching, we still get the highest verification accuracy (81.06%, <xref ref-type="table" rid="T3">
<bold>Table&#xa0;3</bold>
</xref>) based on VGGNet-D, which is slightly lower than the accuracy of VGGNet-16 (81.38%, <xref ref-type="table" rid="T2">
<bold>Table&#xa0;2</bold>
</xref>). However, the verification accuracy of TD-NAS based on VGGNet-E has increased from 78.44% (<xref ref-type="table" rid="T2">
<bold>Table&#xa0;2</bold>
</xref>) to 79.94% (<xref ref-type="table" rid="T3">
<bold>Table&#xa0;3</bold>
</xref>).</p>
<p>Now, for vertical comparisons, (a) we use four datasets (<xref ref-type="table" rid="T4">
<bold>Table&#xa0;4</bold>
</xref>) to search for TD-NAS on VGGNet-D (VGGNet-16). (b) We choose the model made by <italic>RSDS-0.1K</italic>, figure out how accurate it is on all of the test sets, and compare it to other models (<xref ref-type="table" rid="T5">
<bold>Table&#xa0;5</bold>
</xref>). In <xref ref-type="table" rid="T4">
<bold>Table&#xa0;4</bold>
</xref>, it can be seen that the bigger the dataset size, the higher the verification accuracy (91.23%, <xref ref-type="table" rid="T4">
<bold>Table&#xa0;4</bold>
</xref>). It is worth noting that from <italic>RSDS-0.4K</italic> to <italic>RSDS-1.0K</italic>, the promotion of verification accuracy is only 1.95%, but the amount of training data has increased by 1.5 times. Meanwhile, the time consumptions of <italic>RSDS-0.1K</italic>, <italic>RSDS-0.2K</italic>, <italic>RSDS-0.4K</italic>, and <italic>RSDS-1.0K</italic> are approximately 00:42:32, 01:11:36, 04:20:38, and 07:26:00, respectively. The costs of training and verification vary greatly, but the benefits are limited.</p>
<table-wrap id="T4" position="float">
<label>Table&#xa0;4</label>
<caption>
<p>TD-NAS on VGGNet-D (VGGNet-16) (<italic>&#x3c9;<sub>1</sub> not fixed</italic>).</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top" align="left"/>
<th valign="top" colspan="4" align="center">TD-NAS on VGGNet-D (VGGNet-16)</th>
</tr>
<tr>
<th valign="top" align="left"/>
<th valign="top" align="center">
<italic>RSDS-0.1K</italic>
</th>
<th valign="top" align="center">
<italic>RSDS-0.2K</italic>
</th>
<th valign="top" align="center">
<italic>RSDS-0.4K</italic>
</th>
<th valign="top" align="center">
<italic>RSDS-1.0K</italic>
</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">
<italic>V-ACC</italic>
</td>
<td valign="top" align="center">81.06%</td>
<td valign="top" align="center">86.53%</td>
<td valign="top" align="center">89.28%</td>
<td valign="top" align="center">
<bold>91.23%</bold>
</td>
</tr>
<tr>
<td valign="top" colspan="5" align="left">
<italic>FC</italic>
</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>LN</italic>
</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>DR</italic>
</td>
<td valign="top" align="center">{0.6, 0.5, 0.0}</td>
<td valign="top" align="center">{0.7, 0.3, 0.0}</td>
<td valign="top" align="center">{0.4, 0.6, 0.0}</td>
<td valign="top" align="center">{0.5, 0.4, 0.0}</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>NN</italic>
</td>
<td valign="top" align="center">{4,096, 2,048, 8}</td>
<td valign="top" align="center">{4,096, 2,048, 8}</td>
<td valign="top" align="center">{2,048, 2,048, 8}</td>
<td valign="top" align="center">{2,048, 1,024, 8}</td>
</tr>
<tr>
<td valign="top" colspan="5" align="left">
<italic>OU</italic>
</td>
</tr>
<tr>
<td valign="top" align="left">&#x3c9;<sub>1</sub>
</td>
<td valign="top" align="center">0.8</td>
<td valign="top" align="center">0.9</td>
<td valign="top" align="center">0.9</td>
<td valign="top" align="center">0.8</td>
</tr>
<tr>
<td valign="top" align="left">&#x3b2;</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0.5</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>V-ACC, verification accuracy; FC, fully connected; LN, layer number; DR, dropout rate; NN, neuron number; OU, output. Bold means To highlight the biggest verification accuracy (V-ACC).</p>
</fn>
<fn>
<p>Bold values mean to highlight the biggest verification accuracy (V-ACC).</p>
</fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="T5" position="float">
<label>Table&#xa0;5</label>
<caption>
<p>Test accuracy comparison.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top" align="left">T-ACC</th>
<th valign="top" colspan="4" align="center">TD-NAS on VGGNet-D (VGGNet-16)</th>
</tr>
<tr>
<th/>
<th valign="top" align="center">RSDS-0.1K</th>
<th valign="top" align="center">RSDS-0.2K</th>
<th valign="top" align="center">RSDS-0.4K</th>
<th valign="top" align="center">RSDS-1.0K</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">RSDS-0.1K</td>
<td valign="top" align="center">81.13%</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">RSDS-0.2K</td>
<td valign="top" align="center">79.94%</td>
<td valign="top" align="center">81.44%</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">RSDS-0.4K</td>
<td valign="top" align="center">80.22%</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">83.13%</td>
<td valign="top" align="center">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">RSDS-1.0K</td>
<td valign="top" align="center">81.68%</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">82.73%</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>T-ACC, test accuracy.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>The TD-NAS mentioned above on VGGNet-D (VGGNet-16) searches each dataset to generate models. <xref ref-type="table" rid="T5">
<bold>Table&#xa0;5</bold>
</xref> shows the accuracy of these models on test sets; the diagonal part of the table shows the accuracy of each model on its own test set, whereas the first column is the accuracy of the model generated by <italic>RSDS-0.1K</italic> on all test sets. It is worth noting that the test accuracy of the model trained on small samples is not much different from that trained on large ones.</p>
<p>
<xref ref-type="fig" rid="f5">
<bold>Figure&#xa0;5</bold>
</xref> shows four confusion matrixes generated on each test set separately, and the model is TD-NAS on VGGNet-D (VGGNet-16). Overall, our model performed well across different test sets. It is worth noting that the two pairs [C4, C5] and [C7, C8] are far more likely than others to misjudge each other. When a person looks at the rapeseed images of these two pairs, they look very similar in color and shape.</p>
<fig id="f5" position="float">
<label>Figure&#xa0;5</label>
<caption>
<p>Confusion matrixes generated by TD-NAS on VGGNet-D (VGGNet-16).</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-13-897883-g005.tif"/>
</fig>
</sec>
<sec id="s3_3">
<title>Pl@ntNet-300K and ICL-Leaf datasets</title>
<p>A novel image dataset with high intrinsic ambiguity was presented (<xref ref-type="bibr" rid="B12">Camille et&#xa0;al., 2021</xref>), built explicitly for evaluating and comparing set-valued classifiers. It consists of 306,146 images covering 1,081 species, with two particular features: (a) The dataset has a strong class imbalance, which means that a few species account for most images. (b) Many species are visually similar (<xref ref-type="fig" rid="f6">
<bold>Figure&#xa0;6</bold>
</xref>), making identification difficult even for eye experts.</p>
<fig id="f6" position="float">
<label>Figure&#xa0;6</label>
<caption>
<p>Examples of visually similar images belonging to two different classes.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-13-897883-g006.tif"/>
</fig>
<p>
<xref ref-type="table" rid="T6">
<bold>Table&#xa0;6</bold>
</xref> shows that test accuracy depends strongly on the number of images per class. We selected eight species, including <italic>Cirsium arvense</italic>, and searched for architecture based on VGG-16. Let [s1, s2] be the interval of the image number per class (for example  (<xref ref-type="bibr" rid="B48">Rahaman et&#xa0;al., 2019</xref>; <xref ref-type="bibr" rid="B54">Sun et&#xa0;al., 2019</xref>),), and we searched twice with s1 and s2 per class (for &gt;2,000, set [2,500, 3,000]), then calculated the mean accuracy. From <xref ref-type="table" rid="T6">
<bold>Table&#xa0;6</bold>
</xref>, we found that our method got higher accuracy than Ref  (<xref ref-type="bibr" rid="B12">Camille et&#xa0;al., 2021</xref>). claimed, except for the situation &#x201c;&gt;2,000&#x201d;. The results in Ref (<xref ref-type="bibr" rid="B12">Camille et&#xa0;al., 2021</xref>). were made with ResNet50, which has 49 convolutional layers and one fully connected layer. This architecture is deeper than ours and better at handling large samples.</p>
<table-wrap id="T6" position="float">
<label>Table&#xa0;6</label>
<caption>
<p>Test accuracy comparison.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top" align="left">Number of images per class</th>
<th valign="top" align="center">Mean accuracy claimed in (<xref ref-type="bibr" rid="B12">Camille et&#xa0;al., 2021</xref>)</th>
<th valign="top" align="center">Ours, TD-NAS on VGGNet-D (VGGNet-16)</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">10&#x2013;50</td>
<td valign="top" align="center">35%</td>
<td valign="top" align="center">41.25%</td>
</tr>
<tr>
<td valign="top" align="left">50&#x2013;500</td>
<td valign="top" align="center">59%</td>
<td valign="top" align="center">63.38%</td>
</tr>
<tr>
<td valign="top" align="left">500&#x2013;2,000</td>
<td valign="top" align="center">79%</td>
<td valign="top" align="center">80.97%</td>
</tr>
<tr>
<td valign="top" align="left">&gt;2,000</td>
<td valign="top" align="center">93%</td>
<td valign="top" align="center">86.75%</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Another public leaf dataset called the ICL (<xref ref-type="bibr" rid="B28">Hu et&#xa0;al., 2012</xref>) was built by the Intelligent Computing Laboratory (ICL) at the Institute of Intelligent Machines, Chinese Academy of Sciences. It contains 16,851 samples from 220 species, with 26 to 1,078 samples per species (<xref ref-type="fig" rid="f7">
<bold>Figure&#xa0;7</bold>
</xref>).</p>
<fig id="f7" position="float">
<label>Figure&#xa0;7</label>
<caption>
<p>Typical samples for each species.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-13-897883-g007.tif"/>
</fig>
<p>We selected eight species, including <italic>Amorpha fruticosa</italic>, and searched for architecture based on VGG-16 two times. We got an average test accuracy of 92.31%, a little more than 92.08% in Ref  (<xref ref-type="bibr" rid="B63">Xiao et&#xa0;al., 2010</xref>)., which used a traditional machine-learning method named HOG-MMC (orientation histogram based on dimension reduction of maximum edge criterion).</p>
<p>From the above, we proved that our method could quickly discover new architectures when faced with Pl@ntNet-300K and ICL-Leaf datasets. The contribution of our method includes three features: (a) small sample sizes, (b) stable generalization, and (c) free of unpromising detections. In experiments of Pl@ntNet-300K and ICL-Leaf, as for feature a, we selected [500, 2,000] and [1,000, 1,200] images per class and got average test accuracies represented above. As for feature b, we increased the number of images per class to [2,500, 3,000] and [1,500, 2,000], and test accuracy had risen a little (&lt; 6%), but time cost ascended (&gt; 26%). As for feature c, we set a switch to control NAS with or without unpromising detections, and statistics showed that more than 19% of detections are unpromising. To sum up, due to features a and c, we can quickly find out new architectures when faced with new species but still get feature b&#x2019;s stable generalization.</p>
</sec>
</sec>
<sec id="s4">
<title>Conclusion</title>
<p>This paper aims to design a novel target-dependent neural architecture search method based on VGGNet. This goal was successfully achieved on a self-built dataset with eight kinds of rapeseed images. We select accuracy, precision, and recall as the evaluation indicators. We adopt Bayesian optimization to obtain better candidate parameters and introduce a stop criterion for optimizing the dynamic search process. Results show that the test accuracy of the model trained on small samples is not much different from those trained on large ones. The generalization of the model generated by our method is not sensitive to dataset size, making it meaningful to search out models from small samples when facing new objects. For example, we tested our method on two other open datasets, Pl@ntNet and ICL-Leaf.</p>
<p>Due to the enormous model structure and parameter space, this paper only dealt with fully connected layers and the output layer. We kept the original network structure unchanged, such as the kernel and pooling size in convolutional layers. However, the full network structure search, including convolutional layers, is universal without margin. Some literature has pointed out that Bayesian optimization is only suitable for medium-sized problems (<xref ref-type="bibr" rid="B51">Shahriari et&#xa0;al., 2015</xref>). When faced with new objects, how to quickly search a minimized network structure is still an attractive topic. <xref ref-type="fig" rid="f8">
<bold>Figure&#xa0;8</bold>
</xref> shows our robot platform, with our models running on (4), an onboard computer, and real-time images captured by (1), a Realsense D435i camera. We continue to explore in-depth research questions, such as changing kernels and pooling sizes in convolutional layers (<xref ref-type="bibr" rid="B20">Franchini et&#xa0;al., 2022</xref>). We are thinking about using the proposed method on more CNN frameworks in the future.</p>
<fig id="f8" position="float">
<label>Figure&#xa0;8</label>
<caption>
<p>Our robot platform (capture pictures by (1) and run models on (4)).</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-13-897883-g008.tif"/>
</fig>
</sec>
<sec id="s5" sec-type="data-availability">
<title>Data availability statement</title>
<p>The self-build dataset RSDS will be made available by the authors. Another two open datasets (Pl@ntNet and ICL-Leaf) can be found on the official sites or the Internet, such as: <uri xlink:href="https://zenodo.org/record/5645731">https://zenodo.org/record/5645731</uri>.</p>
</sec>
<sec id="s6" sec-type="author-contributions">
<title>Author contributions</title>
<p>LYF and YR planned and designed the research. JXL, JT, and QLH performed the experiments. LYF wrote the manuscript. SWL supervised this work and reviewed the manuscript. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec id="s7" sec-type="funding-information">
<title>Funding</title>
<p>948 Project of Introduction of International Advanced Agricultural Science and Technology by Ministry of Agriculture, Ministry of Agriculture, China, No.2016-X34.</p>
</sec>
<sec id="s8" sec-type="COI-statement">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="s9" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ad</surname> <given-names>A.</given-names>
</name>
<name>
<surname>De</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Aeh</surname> <given-names>B.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>An optimized model based on convolutional neural networks and orthogonal learning particle swarm optimization algorithm for plant diseases diagnosis</article-title>. <source>Swarm Evol. Computation</source> <volume>52</volume>, <fpage>100616</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.swevo.2019.100616</pub-id>
</citation>
</ref>
<ref id="B2">
<citation citation-type="other">
<person-group person-group-type="author">
<name>
<surname>Amara</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Bouaziz</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Algergawy</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2017</year>). A Deep Learning-based Approach for Banana Leaf Diseases Classification. Datenbanksysteme f&#xfc;r Business, Technologie und Web (BTW 2017), 17 Fachtagung des GI-Fachbereichs,Datenbanken und Informationssysteme&#x201d; (DBIS), 6-10 M&#xe4;rz 2017, Stuttgart, Germany, Workshopband.</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Andonie</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Florea</surname> <given-names>A. C.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Weighted random search for CNN hyperparameter optimization</article-title>. <source>Int. J. Computers Commun. Control (IJCCC)</source> <volume>15</volume> (<issue>2</issue>), <fpage>3868</fpage>. doi: <pub-id pub-id-type="doi">10.15837/ijccc.2020.2.3868</pub-id>
</citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Awad</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Mallik</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Hutter</surname> <given-names>F.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>DEHB: Evolutionary hyberband for scalable, robust and efficient hyperparameter optimization</article-title>. In <conf-name>Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence</conf-name>, <publisher-name>Main Track</publisher-name>, <fpage>2147</fpage>&#x2013;<lpage>2153</lpage>. doi: <pub-id pub-id-type="doi">10.24963/ijcai.2021/296</pub-id>
</citation>
</ref>
<ref id="B5">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Baker</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Gupta</surname> <given-names>O.</given-names>
</name>
<name>
<surname>Naik</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Raskar</surname> <given-names>R.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Designing neural network architectures using reinforcement learning</article-title>. In <conf-name>5th International Conference on Learning Representations, 2017</conf-name>. doi: <pub-id pub-id-type="doi">10.48550/arXiv.1611.02167</pub-id>
</citation>
</ref>
<ref id="B6">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Baker</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Gupta</surname> <given-names>O.</given-names>
</name>
<name>
<surname>Raskar</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Naik</surname> <given-names>N.</given-names>
</name>
<etal/>
</person-group>. (<year>2017</year>). <article-title>Accelerating neural architecture search using performance prediction</article-title>. In <conf-name>International Conference on Learning Representations (ICLR) Workshops 2018</conf-name>. doi: <pub-id pub-id-type="doi">10.48550/arXiv.1705.10823</pub-id>
</citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Barr&#xe9;</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Durand</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Chenu</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Meunier</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Montagne</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Castel</surname> <given-names>G.</given-names>
</name>
<etal/>
</person-group>. (<year>2017</year>). <article-title>Geological control of soil organic carbon and nitrogen stocks at the landscape scale</article-title>. <source>Geoderma</source> <volume>285</volume>, <fpage>50</fpage>&#x2013;<lpage>56</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.geoderma.2016.09.029</pub-id>
</citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Basha</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Vinakota</surname> <given-names>S. K.</given-names>
</name>
<name>
<surname>Dubey</surname> <given-names>S. R.</given-names>
</name>
<name>
<surname>Pulabaigari</surname> <given-names>V.</given-names>
</name>
<name>
<surname>Mukherjee</surname> <given-names>S.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>AutoFCL: automatically tuning fully connected layers for handling small dataset</article-title>. <source>Neural Computing App.</source> <volume>33</volume>, <fpage>8055</fpage>&#x2013;<lpage>8065</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s00521-020-05549-4</pub-id>
</citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bengio</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Courville</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Vincent</surname> <given-names>P.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Representation learning: A review and new perspectives</article-title>. <source>IEEE Trans. Pattern Anal. Mach. Intelligence</source> <volume>35</volume> (<issue>8</issue>), <fpage>1798</fpage>&#x2013;<lpage>1828</lpage>. doi: <pub-id pub-id-type="doi">10.1109/TPAMI.2013.50</pub-id>
</citation>
</ref>
<ref id="B10">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Bergstra</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Bardenet</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Bengio</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>K&#xe9;gl</surname> <given-names>B.</given-names>
</name>
</person-group> (<year>2011</year>). &#x201c;<article-title>Algorithms for hyper-parameter optimization</article-title>,&#x201d; in <conf-name>Proceedings of the 24th International Conference on Neural Information Processing Systems; Granada, Spain: Curran Associates Inc</conf-name>. (<publisher-loc>Red Hook, NY, USA</publisher-loc>: <publisher-name>Curran Associates Inc.</publisher-name>) <fpage>2546</fpage>&#x2013;<lpage>2554</lpage>. doi: <pub-id pub-id-type="doi">10.5555/2986459.2986743</pub-id>
</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bergstra</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Komer</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Eliasmith</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Yamins</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Cox</surname> <given-names>D. D.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Hyperopt: a Python library for model selection and hyperparameter optimization</article-title>. <source>Comput. Sci. Discov</source> <volume>8</volume> (<issue>1</issue>). doi: <pub-id pub-id-type="doi">10.1088/1749-4699/8/1/014008</pub-id>
</citation>
</ref>
<ref id="B12">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Camille</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Alexis</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Pierre</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Antoine</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Jean-Christophe</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Mathias</surname> <given-names>C.</given-names>
</name>
<etal/>
</person-group>. (<year>2021</year>). &#x201c;<article-title>Pl@ntNet-300K: a plant image dataset with high label ambiguity and a long-tailed distribution</article-title>,&#x201d; in <conf-name>Proc. of Thirty-fifth Conference on Neural Information Processing Systems, Datasets and Benchmarks Track</conf-name>. doi: <pub-id pub-id-type="doi">10.5281/zenodo.5645731</pub-id>
</citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Sun</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Nanehkaran</surname> <given-names>Y. A.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Using deep transfer learning for image-based plant disease identification</article-title>. <source>Comput. Electron. Agricult.</source> <volume>173</volume>, <fpage>105393</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.compag.2020.105393</pub-id>
</citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ching-Shih. Deb</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Pratap</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Agarwal</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Meyarivan</surname> <given-names>T.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>A fast and elitist multiobjective genetic algorithm: NSGA-II</article-title>. <source>IEEE Trans. Evol. Comput.</source> <volume>6</volume> (<issue>2</issue>), <fpage>182</fpage>&#x2013;<lpage>197</lpage>. doi: <pub-id pub-id-type="doi">10.1109/4235.996017</pub-id>
</citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Deng</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Yu</surname> <given-names>D.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Deep learning: Methods and applications</article-title>. <source>Foundations Trends Signal Processing</source> <volume>7</volume> (<issue>3</issue>), <fpage>197</fpage>&#x2013;<lpage>387</lpage>. doi: <pub-id pub-id-type="doi">10.1561/2000000039</pub-id>
</citation>
</ref>
<ref id="B16">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Donegan</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Yous</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Sinha</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Byrne</surname> <given-names>J.</given-names>
</name>
</person-group> (<year>2021</year>). &#x201c;<article-title>VPU specific CNNs through neural architecture search</article-title>,&#x201d; in <conf-name>2020 25th International Conference on Pattern Recognition (ICPR)</conf-name>, <volume>2021</volume> pp. <fpage>9772</fpage>&#x2013;<lpage>9779</lpage>. doi: <pub-id pub-id-type="doi">10.1109/ICPR48806.2021.9412794</pub-id>
</citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dorigo</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Birattari</surname> <given-names>M.</given-names>
</name>
<name>
<surname>St&#xfc;tzle</surname> <given-names>T.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Ant colony optimization</article-title>. <source>IEEE Comput. Intell. Magazine</source> <volume>1</volume> (<issue>4</issue>), <fpage>28</fpage>&#x2013;<lpage>39</lpage>. doi: <pub-id pub-id-type="doi">10.1109/MCI.2006.329691</pub-id>
</citation>
</ref>
<ref id="B18">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Fan</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Ferianc</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Que</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Niu</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Luk</surname> <given-names>W.</given-names>
</name>
</person-group> (<year>2020</year>). &#x201c;<article-title>Optimizing FPGA-based CNN accelerator using differentiable neural architecture search</article-title>,&#x201d; in <conf-name>2020 IEEE 38th International Conference on Computer Design (ICCD)</conf-name>, <conf-loc>Hartford, CT</conf-loc>. <fpage>465</fpage>&#x2013;<lpage>468</lpage>. doi: <pub-id pub-id-type="doi">10.1109/ICCD50377.2020.00085</pub-id>
</citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ferentinos</surname> <given-names>K. P.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Deep learning models for plant disease detection and diagnosis</article-title>. <source>Comput. Electron. Agricult.</source> <volume>145</volume>, <fpage>311</fpage>&#x2013;<lpage>318</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.compag.2018.01.009</pub-id>
</citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Franchini</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Ruggiero</surname> <given-names>V.</given-names>
</name>
<name>
<surname>Porta</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Zanni</surname> <given-names>L</given-names>
</name>
</person-group>. (<year>2022</year>). <article-title>Neural architecture search <italic>via</italic> standard machine learning methodologies</article-title>. <source>Mathematics Eng.</source> <volume>5</volume> (<issue>1</issue>), <fpage>1</fpage>&#x2013;<lpage>21</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.3934/mine.2023012</pub-id>
</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Frazier</surname> <given-names>P. I.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>A tutorial on Bayesian optimization</article-title>. <source>INFORMS TutORials in Operations Research</source>, <fpage>255</fpage>&#x2013;<lpage>278</lpage>. doi: <pub-id pub-id-type="doi">10.1287/educ.2018.0188</pub-id>
</citation>
</ref>
<ref id="B22">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Fu</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Gao</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Ping</surname> <given-names>W.</given-names>
</name>
</person-group> (<year>2016</year>). &#x201c;<article-title>Content-based image retrieval based on CNN and SVM</article-title>,&#x201d; in <conf-name>2016 2nd IEEE International Conference on Computer and Communications (ICCC)</conf-name>. <volume>2016</volume>, pp. <fpage>638</fpage>&#x2013;<lpage>642</lpage>. doi: <pub-id pub-id-type="doi">10.1109/CompComm.2016.7924779</pub-id>
</citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Glorot</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Bengio</surname> <given-names>Y.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Understanding the difficulty of training deep feedforward neural networks</article-title>. <source>J. Mach. Learn. Res.</source> <volume>9</volume>, <fpage>249</fpage>&#x2013;<lpage>256</lpage>.</citation>
</ref>
<ref id="B24">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Gupta</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Ong</surname> <given-names>Y.-S.</given-names>
</name>
<name>
<surname>Feng</surname> <given-names>L.</given-names>
</name>
</person-group> (<year>2017</year>). &#x201c;<article-title>Insights on transfer optimization: Because experience is the best teacher</article-title>,&#x201d; in <conf-name>IEEE Transactions on Emerging Topics in Computational Intelligence</conf-name>, Vol. <volume>2</volume>, <fpage>51</fpage>&#x2013;<lpage>64</lpage>. doi: <pub-id pub-id-type="doi">10.1109/TETCI.2017.2769104</pub-id>
</citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>He</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Ren</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Sun</surname> <given-names>J.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Deep residual learning for image recognition</article-title>. <source>IEEE</source> <volume>2016</volume>, <fpage>770</fpage>&#x2013;<lpage>778</lpage>. doi: <pub-id pub-id-type="doi">10.1109/CVPR.2016.90</pub-id>
</citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Howard</surname> <given-names>A. G.</given-names>
</name>
<name>
<surname>Zhu</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Kalenichenko</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Weyand</surname> <given-names>T.</given-names>
</name>
<etal/>
</person-group>. (<year>2017</year>). <article-title>MobileNets: Efficient convolutional neural networks for mobile vision applications</article-title>. <source>arXiv preprint</source>. doi: <pub-id pub-id-type="doi">10.48550/arXiv.1704.04861</pub-id>
</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hu</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Cheng</surname> <given-names>R.</given-names>
</name>
<name>
<surname>He</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Lu</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>M.</given-names>
</name>
<etal/>
</person-group>. (<year>2021</year>). <article-title>Accelerating multi-objective neural architecture search by random-weight evaluation</article-title>. <source>Complex Intelligent Syst</source> <volume>2021</volume>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/s40747-021-00594-5</pub-id>
</citation>
</ref>
<ref id="B28">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Hu</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Jia</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Ling</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Huang</surname> <given-names>D.</given-names>
</name>
</person-group> (<year>2012</year>). &#x201c;<article-title>Multiscale distance matrix for fast plant leaf recognition</article-title>,&#x201d; in <conf-name>IEEE Trans. Image Processing</conf-name>, Vol. <volume>21</volume>. <fpage>4667</fpage>&#x2013;<lpage>4672</lpage>.</citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Iranfar</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Zapater</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Atienza</surname> <given-names>D.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Multiagent reinforcement learning for hyperparameter optimization of convolutional neural networks</article-title>. <source>IEEE Trans. Computer-Aided Design Integrated Circuits Syst.</source> <volume>41</volume>, <fpage>1034</fpage>&#x2013;<lpage>1047</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1109/tcad.2021.3077193</pub-id>
</citation>
</ref>
<ref id="B30">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Ji</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Jiang</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Qian</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>S.</given-names>
</name>
</person-group> (<year>2019</year>). &#x201c;<article-title>A novel doubly reweighting multisource transfer learning framework</article-title>,&#x201d; in <conf-name>IEEE Transactions on Emerging Topics in Computational Intelligence</conf-name>. <volume>3</volume>, <fpage>380</fpage>&#x2013;<lpage>391</lpage>. doi: <pub-id pub-id-type="doi">10.1109/TETCI.2018.286832</pub-id>
</citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jones</surname> <given-names>D. R.</given-names>
</name>
<name>
<surname>Schonlau</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Welch</surname> <given-names>W. J.</given-names>
</name>
</person-group> (<year>1998</year>). <article-title>Efficient global optimization of expensive black-box functions</article-title>. <source>J. Global Optimization</source> <volume>13</volume>, <fpage>455</fpage>&#x2013;<lpage>492</lpage>. doi: <pub-id pub-id-type="doi">10.1023/A:1008306431147</pub-id>
</citation>
</ref>
<ref id="B32">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Kennedy</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Eberhart</surname> <given-names>R.</given-names>
</name>
</person-group> (<year>1995</year>). &#x201c;<article-title>Particle swarm optimization</article-title>,&#x201d; in <conf-name>Icnn95-international Conference on Neural Networks</conf-name>. vol.<volume>4</volume>, pp. <fpage>1942</fpage>&#x2013;<lpage>1948</lpage>. doi: <pub-id pub-id-type="doi">10.1109/ICNN.1995.488968</pub-id>
</citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lecun</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Bengio</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Hinton</surname> <given-names>G.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Deep learning</article-title>. <source>Nature</source> <volume>521</volume> (<issue>7553</issue>), <fpage>436</fpage>. doi: <pub-id pub-id-type="doi">10.1038/nature14539</pub-id>
</citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lee</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Chang</surname> <given-names>J. H.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Deep learning ensemble with asymptotic techniques for oscillometric blood pressure estimation</article-title>. <source>Comput. Methods Programs Biomed.</source> <volume>151</volume>, <fpage>1</fpage>&#x2013;<lpage>13</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.cmpb.2017.08.005</pub-id>
</citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Guo</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>G.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>A review of computer vision technologies for plant phenotyping</article-title>. <source>Comput. Electron. Agricult.</source> <volume>176</volume>, <fpage>105672</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.compag.2020.105672</pub-id>
</citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Nie</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Chao</surname> <given-names>X.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Do we really need deep CNN for plant diseases identification</article-title>? <source>Comput. Electron. Agric.</source> <volume>178</volume> (<issue>3</issue>), <fpage>105803</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.compag.2020.105803</pub-id>
</citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lin</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Mu</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Shi</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Pang</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Sun</surname> <given-names>X.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>A novel method of maize leaf disease image identification based on a multichannel convolutional neural network</article-title>. <source>Transactions of the ASABE</source> <volume>61</volume> (<issue>5</issue>), <fpage>1461</fpage>&#x2013;<lpage>1474</lpage>. doi: <pub-id pub-id-type="doi">10.13031/trans.12440</pub-id>
</citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Simonyan</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Vinyals</surname> <given-names>O.</given-names>
</name>
<name>
<surname>Fernando</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Kavukcuoglu</surname> <given-names>K.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Hierarchical representations for efficient architecture search</article-title>. <source>ICLR 2018</source>. doi: <pub-id pub-id-type="doi">10.48550/arXiv.1711.00436</pub-id>
</citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Sun</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Xue</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Yen</surname> <given-names>G.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>A survey on evolutionary neural architecture search</article-title>. <source>IEEE Trans Neural Netw Learn Syst.</source>, <fpage>1</fpage>&#x2013;<lpage>21</lpage>. doi: <pub-id pub-id-type="doi">10.1109/TNNLS.2021.3100554</pub-id>
</citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>He</surname> <given-names>D. J.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>Y.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Identification of apple leaf diseases based on deep convolutional neural networks</article-title>. <source>Symmetry</source> <volume>10</volume> (<issue>1</issue>), <fpage>11</fpage>. doi: <pub-id pub-id-type="doi">10.3390/sym10010011</pub-id>
</citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lowe</surname> <given-names>D. G.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Distinctive image features from scale-invariant keypoints</article-title>. <source>Int. J. Comput. Vision</source> <volume>60</volume> (<issue>2</issue>), <fpage>91</fpage>&#x2013;<lpage>110</lpage>. doi: <pub-id pub-id-type="doi">10.1023/B:VISI.0000029664.99615.94</pub-id>
</citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lu</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Whalen</surname> <given-names>I.</given-names>
</name>
<name>
<surname>Boddeti</surname> <given-names>V.</given-names>
</name>
<name>
<surname>Dhebar</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Deb</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Goodman</surname> <given-names>E.</given-names>
</name>
<etal/>
</person-group>. (<year>2018</year>). <article-title>NSGA-net: Neural architecture search using multi-objective genetic algorithm</article-title>. <source>GECCO 2019</source>. doi: <pub-id pub-id-type="doi">10.48550/arXiv.1810.03522</pub-id>
</citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mohana</surname> <given-names>R. M.</given-names>
</name>
<name>
<surname>Reddy</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Anisha</surname> <given-names>P. R.</given-names>
</name>
<name>
<surname>Murthy</surname> <given-names>B.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Random forest algorithms for the classification of tree-based ensemble</article-title>. <source>Mater. Today: Proc.</source> <volume>12</volume>, In Press. doi: <pub-id pub-id-type="doi">10.1016/j.matpr.2021.01.788</pub-id>
</citation>
</ref>
<ref id="B44">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Pawara</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Okafor</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Surinta</surname> <given-names>O.</given-names>
</name>
<name>
<surname>Schomaker</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Wiering</surname> <given-names>M.</given-names>
</name>
</person-group> (<year>2017</year>). &#x201c;<article-title>Comparing local descriptors and bags of visual words to deep convolutional neural networks for plant recognition</article-title>,&#x201d; in <conf-name>Proceedings of the 6th International Conference on Pattern Recognition Applications and Methods-ICPRAM</conf-name>, <fpage>479</fpage>&#x2013;<lpage>486</lpage>. doi: <pub-id pub-id-type="doi">10.5220/0006196204790486</pub-id>
</citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Perez</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>J.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>The effectiveness of data augmentation in image classification using deep learning</article-title>.<source>arXiv preprint</source>. doi: <pub-id pub-id-type="doi">10.48550/arXiv.1712.04621</pub-id>
</citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Picon</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Alvarez-Gila</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Seitz</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Ortiz-Barredo</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Echazarra</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Johannes</surname> <given-names>A.</given-names>
</name>
<etal/>
</person-group>. (<year>2019</year>). <article-title>Deep convolutional neural networks for mobile capture device-based crop disease classification in the wild</article-title>. <source>Comput. Electron. Agricult.</source> <volume>161</volume>, <fpage>280</fpage>&#x2013;<lpage>290</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.compag.2018.04.002</pub-id>
</citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Purohit</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Viroja</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Gandhi</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Chaudhary</surname> <given-names>N.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Automatic plant species recognition technique using machine learning approaches</article-title>. <source>Int. Conf. Computing Netw. Commun</source> <volume>2015</volume>, <fpage>710</fpage>&#x2013;<lpage>719</lpage>. doi: <pub-id pub-id-type="doi">10.1109/CoCoNet.2015.7411268</pub-id>
</citation>
</ref>
<ref id="B48">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Rahaman</surname> <given-names>M. A.</given-names>
</name>
<name>
<surname>Hossain</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Hossain</surname> <given-names>M. F.</given-names>
</name>
</person-group> (<year>2019</year>). &#x201c;<article-title>A color and texture based approach for the detection and classification of plant leaf disease using KNN classifier</article-title>,&#x201d; in <conf-name>2019 International Conference on Electrical, Computer and Communication Engineering (ECCE)</conf-name>. pp. <fpage>1</fpage>&#x2013;<lpage>6</lpage>. doi: <pub-id pub-id-type="doi">10.1109/ECACE.2019.8679247</pub-id>
</citation>
</ref>
<ref id="B49">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rumelhart</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Hinton</surname> <given-names>G. E.</given-names>
</name>
<name>
<surname>Williams</surname> <given-names>R. J.</given-names>
</name>
</person-group> (<year>1986</year>). <article-title>Learning representations by back propagating errors</article-title>. <source>Nature</source> <volume>323</volume> (<issue>6088</issue>), <fpage>533</fpage>&#x2013;<lpage>536</lpage>. doi: <pub-id pub-id-type="doi">10.1038/323533a0</pub-id>
</citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rumpf</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Mahlein</surname> <given-names>A. K.</given-names>
</name>
<name>
<surname>Steiner</surname> <given-names>U.</given-names>
</name>
<name>
<surname>Oerke</surname> <given-names>E. C.</given-names>
</name>
<name>
<surname>Dehne</surname> <given-names>H. W.</given-names>
</name>
<name>
<surname>Pl&#xfc;mer</surname> <given-names>L.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Early detection and classification of plant diseases with support vector machines based on hyperspectral reflectance</article-title>. <source>Comput. Electron. Agricult.</source> <volume>74</volume> (<issue>1</issue>), <fpage>91</fpage>&#x2013;<lpage>99</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.compag.2010.06.009</pub-id>
</citation>
</ref>
<ref id="B51">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shahriari</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Swersky</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Adams</surname> <given-names>R. P.</given-names>
</name>
<name>
<surname>Freitas</surname> <given-names>N. D.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Taking the human out of the loop: A review of Bayesian optimization</article-title>. <source>Proc. IEEE</source> <volume>104</volume> (<issue>1</issue>), <fpage>148</fpage>&#x2013;<lpage>175</lpage>. doi: <pub-id pub-id-type="doi">10.1109/JPROC.2015.2494218</pub-id>
</citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sm</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Rs</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Dr</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Deep convolutional neural network based detection system for real-time corn plant disease recognition</article-title>. <source>Proc. Comput. Sci.</source> <volume>167</volume>, <fpage>2003</fpage>&#x2013;<lpage>2010</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.procs.2020.03.236</pub-id>
</citation>
</ref>
<ref id="B53">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Suganuma</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Shirakawa</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Nagao</surname> <given-names>T.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>A genetic programming approach to designing convolutional neural network architectures</article-title>. <conf-name>GECCO '17: Proceedings of the Genetic and Evolutionary Computation Conference</conf-name>, <fpage>497</fpage>&#x2013;<lpage>504</lpage>. doi: <pub-id pub-id-type="doi">10.1145/3071178.3071229</pub-id>
</citation>
</ref>
<ref id="B54">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sun</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Xue</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Yen</surname> <given-names>G. G.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Completely automated CNN architecture design based on blocks</article-title>. <source>IEEE Trans. Neural Networks Learn. Systems</source> <volume>99</volume>, <fpage>1</fpage>&#x2013;<lpage>13</lpage>. doi: <pub-id pub-id-type="doi">10.1109/TNNLS.2019.2919608</pub-id>
</citation>
</ref>
<ref id="B55">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sun</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Xue</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Yen</surname> <given-names>G. G.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Evolving deep convolutional neural networks for image classification</article-title>. <source>IEEE Trans. Evol. Computation</source> <volume>24</volume> (<issue>2</issue>), <fpage>394</fpage>&#x2013;<lpage>407</lpage>. doi: <pub-id pub-id-type="doi">10.1109/TEVC.2019.2916183</pub-id>
</citation>
</ref>
<ref id="B56">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Szegedy</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Ioffe</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Vanhoucke</surname> <given-names>V.</given-names>
</name>
<name>
<surname>Alemi</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Inception-v4, inception-ResNet and the impact of residual connections on learning</article-title>. <conf-name>AAAI'17: Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence</conf-name>, <fpage>4278</fpage>&#x2013;<lpage>4284</lpage>. doi: <pub-id pub-id-type="doi">10.48550/arXiv.1602.07261</pub-id>
</citation>
</ref>
<ref id="B57">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Thomas</surname>
</name>
<name>
<surname>Fogel</surname> <given-names>D. B.</given-names>
</name>
<name>
<surname>Michalewicz</surname> <given-names>Z</given-names>
</name>
</person-group>. (<year>1997</year>). <source>Handbook of evolutionary computation</source>. (<publisher-loc>Boca Raton, Florida, USA</publisher-loc>: <publisher-name>CRC Press</publisher-name>). doi: <pub-id pub-id-type="doi">10.1201/9780367802486</pub-id>
</citation>
</ref>
<ref id="B58">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tran</surname> <given-names>L. T.</given-names>
</name>
<name>
<surname>Bae</surname> <given-names>S.-H.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Training-free hardware-aware neural architecture search with reinforcement learning</article-title>. <source>J. Broadcast Eng.</source> <volume>26</volume>, <fpage>855</fpage>&#x2013;<lpage>861</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.5909/jbe.2020.26.7.855</pub-id>
</citation>
</ref>
<ref id="B59">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>Z.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>A survey of recent work on fine-grained image classification techniques</article-title>. <source>J. Visual Commun. Image Representation</source> <volume>59</volume>, <fpage>210</fpage>&#x2013;<lpage>214</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.jvcir.2018.12.049</pub-id>
</citation>
</ref>
<ref id="B60">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Wang</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Xue</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>M.</given-names>
</name>
</person-group> (<year>2020</year>). &#x201c;<article-title>Particle swarm optimisation for evolving deep neural networks for image classification by evolving and stacking transferable blocks</article-title>,&#x201d; in <conf-name>IEEE Congress on Evolutionary Computation (CEC) as part of the IEEE World Congress on Computational Intelligence (IEEE WCCI) Electr Network</conf-name>. pp. <fpage>1</fpage>&#x2013;<lpage>8</lpage>. doi: <pub-id pub-id-type="doi">10.1109/CEC48606.2020.9185541</pub-id>
</citation>
</ref>
<ref id="B61">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wen</surname> <given-names>Y.-W.</given-names>
</name>
<name>
<surname>Peng</surname> <given-names>S.-H.</given-names>
</name>
<name>
<surname>Ting</surname> <given-names>C.-K.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Two-stage evolutionary neural architecture search for transfer learning</article-title>. <source>IEEE Trans. Evol. Comput.</source> <volume>25</volume>, <fpage>928</fpage>&#x2013;<lpage>940</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1109/tevc.2021.3097937</pub-id>
</citation>
</ref>
<ref id="B62">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Wistuba</surname> <given-names>M</given-names>
</name>
</person-group>. <article-title>AutoML (2017)</article-title>. <source>Automatic Machine Learning Workshop (ICML 2017)</source>. <uri xlink:href="https://research.ibm.com/publications/bayesian-optimization-combined-with-incremental-evaluation-for-neural-network-architecture-optimization">https://research.ibm.com/publications/bayesian-optimization-combined-with-incremental-evaluation-for-neural-network-architecture-optimization</uri>.</citation>
</ref>
<ref id="B63">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Xiao</surname> <given-names>X. Y.</given-names>
</name>
<name>
<surname>Hu</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>X. F.</given-names>
</name>
</person-group> (<year>2010</year>). &#x201c;<article-title>Advanced Intelligent Computing Theories and Applications. With Aspects of Artificial Intelligence. ICIC 2010</article-title>,&#x201d; In <source>Lecture Notes in Computer Science</source> <volume>6216</volume>. <publisher-loc>Berlin, Heidelberg</publisher-loc>: <publisher-name>Springer</publisher-name>. doi: <pub-id pub-id-type="doi">10.1007/978-3-642-14932-0_19</pub-id>
</citation>
</ref>
<ref id="B64">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Xie</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Yuille</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2017</year>). &#x201c;<article-title>Genetic CNN</article-title>,&#x201d; in <conf-name>2017 IEEE International Conference on Computer Vision (ICCV)</conf-name>. <volume>2017</volume>, pp. <fpage>1388</fpage>&#x2013;<lpage>1397</lpage>. doi: <pub-id pub-id-type="doi">10.1109/ICCV.2017.154</pub-id>
</citation>
</ref>
<ref id="B65">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yu</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Yankang</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Jiayu</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Adam</surname> <given-names>S.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>A self-adaptive mutation neural architecture search algorithm based on blocks</article-title>. <source>IEEE Comput. Intell. Magazine</source> <volume>16</volume> (<issue>3</issue>), <fpage>67</fpage>&#x2013;<lpage>78</lpage>. doi: <pub-id pub-id-type="doi">10.1109/MCI.2021.3084435</pub-id>
</citation>
</ref>
<ref id="B66">
<citation citation-type="web">
<person-group person-group-type="author">
<name>
<surname>Zoph</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Le</surname> <given-names>Q. V.</given-names>
</name>
</person-group> (<year>2016</year>) <source>Neural architecture search with reinforcement learning</source>. Available at: <uri xlink:href="https://ui.adsabs.harvard.edu/abs/2016arXiv161101578Z">https://ui.adsabs.harvard.edu/abs/2016arXiv161101578Z</uri>.</citation>
</ref>
</ref-list>
</back>
</article>