<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Energy Res.</journal-id>
<journal-title>Frontiers in Energy Research</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Energy Res.</abbrev-journal-title>
<issn pub-type="epub">2296-598X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">1090033</article-id>
<article-id pub-id-type="doi">10.3389/fenrg.2022.1090033</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Energy Research</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Foreign body intrusion monitoring and recognition approach in a power distribution room based on the improved YOLOv4 deep learning network</article-title>
<alt-title alt-title-type="left-running-head">Chen et al.</alt-title>
<alt-title alt-title-type="right-running-head">
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fenrg.2022.1090033">10.3389/fenrg.2022.1090033</ext-link>
</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Chen</surname>
<given-names>Shenyu</given-names>
</name>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Dai</surname>
<given-names>Xiaofeng</given-names>
</name>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Wang</surname>
<given-names>Zengyu</given-names>
</name>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/2085347/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Zhang</surname>
<given-names>Pan</given-names>
</name>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Chen</surname>
<given-names>Zetao</given-names>
</name>
</contrib>
</contrib-group>
<aff>
<institution>Tianhe Power Supply Bureau of Guangzhou Power Supply Bureau</institution>, <institution>Guangdong Power Co., Ltd.</institution>, <addr-line>Guangzhou</addr-line>, <country>China</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1605897/overview">Amrit Mukherjee</ext-link>, University of South Bohemia in &#x10c;esk&#xe9; Bud&#x11b;jovice, Czechia</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1079570/overview">Gaurav Dhiman</ext-link>, Government Bikram College of Commerce Patiala, India</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1616714/overview">Ramin Ranjbarzadeh</ext-link>, Dublin City University, Ireland</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Zengyu Wang, <email>wangzengyugzps@gmail.com</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Process and Energy Systems Engineering, a section of the journal Frontiers in Energy Research</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>12</day>
<month>01</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>10</volume>
<elocation-id>1090033</elocation-id>
<history>
<date date-type="received">
<day>04</day>
<month>11</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>30</day>
<month>12</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2023 Chen, Dai, Wang, Zhang and Chen.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Chen, Dai, Wang, Zhang and Chen</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>In order to avoid safety problems caused by foreign bodies such as mice that may appear in the power distribution room and by demarcating the electronic fence area for key monitoring in the video surveillance screen, a foreign body intrusion monitoring and recognition approach in a power distribution room based on the improved YOLOv4 deep learning network is proposed. To optimize the detection effects, the YOLOv4 algorithm is improved from the aspects of network structure, frame detection, and loss function. At the same time, the channel pruning algorithm is used to prune the model to simplify the model structure. The experimental results show the effectiveness of the improved YOLOv4 deep learning network, which has high detection accuracy, fast detection speed, and takes up less space after pruning.</p>
</abstract>
<kwd-group>
<kwd>power distribution room</kwd>
<kwd>foreign body intrusion monitoring</kwd>
<kwd>improved YOLOv4</kwd>
<kwd>channel pruning algorithm</kwd>
<kwd>monitoring-highway hazard</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<title>1 Introduction</title>
<p>With the deepening of the reform of the power system and the formation of the new normal of the macro economy, the competition in the energy consumption market has become increasingly fierce, and the external environment has gradually increased the service requirements of power grid operators and power supply companies (<xref ref-type="bibr" rid="B13">Kaur et al., 2020</xref>; <xref ref-type="bibr" rid="B39">Yuan et al., 2022</xref>; <xref ref-type="bibr" rid="B21">Liu et al., 2023</xref>). The existing management and monitoring mode of the power room is based on sensors, video monitoring, manual analysis, robot inspection, etc. (<xref ref-type="bibr" rid="B7">Dhiman and Kaur, 2019</xref>; <xref ref-type="bibr" rid="B16">Kumar and Dhiman, 2021</xref>; <xref ref-type="bibr" rid="B38">Yang et al., 2021</xref>). Artificial intelligence can be used to improve the level of automation management and monitoring of power rooms so as to provide customers with better power supply services (<xref ref-type="bibr" rid="B12">Jacob and Darney, 2021</xref>; <xref ref-type="bibr" rid="B15">Kiruthika et al., 2021</xref>; <xref ref-type="bibr" rid="B20">Liu et al., 2021</xref>). At present, equipment and security monitoring methods of some power distribution rooms are relatively backward, so they may not be able to timely and accurately reflect information about abnormal situations (<xref ref-type="bibr" rid="B10">Gangolells et al., 2010</xref>; <xref ref-type="bibr" rid="B24">Mroszczyk, 2015</xref>). Research studies on the intelligent identification technology of video surveillance based on deep learning can realize the real-time monitoring and identification of foreign body intrusion in the power room.</p>
<p>Target detection technology has been applied in this field for decades (<xref ref-type="bibr" rid="B3">Anastasiadou et al., 2021</xref>; <xref ref-type="bibr" rid="B28">Ren et al., 2021a</xref>; <xref ref-type="bibr" rid="B6">Chen et al., 2022</xref>; <xref ref-type="bibr" rid="B25">Qin et al., 2022</xref>; <xref ref-type="bibr" rid="B37">Yang et al., 2022</xref>). Traditional target detection can be divided into three steps: one is to select some candidate regions on a given image; the second step is to characterize these regions&#x2019; extraction; and the third step is to use the trained classifier for classification, generally selecting Bagging, AdaBoost, and support vector machine (SVM) (<xref ref-type="bibr" rid="B32">Tang et al., 2020</xref>; <xref ref-type="bibr" rid="B34">Wang et al., 2020</xref>; <xref ref-type="bibr" rid="B36">Yang et al., 2020</xref>). When the selected feature descriptor has good robustness, the performance of the traditional object detection algorithm is also acceptable. However, the traditional target detection has the problem that the artificially designed sliding window leads to an unsatisfactory detection effect (<xref ref-type="bibr" rid="B26">Ramcharan et al., 2017</xref>; <xref ref-type="bibr" rid="B18">Li et al., 2019</xref>). Now, deep learning techniques which are a powerful class of methods that can automatically learn feature representations from data, have developed rapidly, (<xref ref-type="bibr" rid="B14">Ker et al., 2017</xref>; <xref ref-type="bibr" rid="B23">Mezgec et al., 2019</xref>). In terms of foreign body detection, according to the low environmental adaptability of the existing foreign body detection and identification model, foreign body intrusion detection and the identification model for urban rail transit lines is constructed in the literature (<xref ref-type="bibr" rid="B4">Bashar, 2019</xref>). <xref ref-type="bibr" rid="B30">Singamaneni et al. (2022)</xref> proposed an improved YOLOv3 foreign body detection method for transmission lines, which has certain advantages in performance. One of the main problems it aimed to solve was that the image is prone to distortion during the insulator detection process and the model is greatly affected by the external environment. The algorithm in the literature (<xref ref-type="bibr" rid="B1">Aggarwal, 2019</xref>) first adopts adaptive gamma transform to automatically adjust the brightness of the insulator image, and YOLOv4 is used to learn the feature representation of different levels of the insulator. In order to simplify the complex structure of YOLOv4 target detection network, many parameters, high configuration required for training, and low number of transmission frames of real-time detection pictures, it is difficult to achieve industrial application popularization. The study by <xref ref-type="bibr" rid="B29">Sharma (2018)</xref> proposes an improved light-weight network based on the YOLOv4 quantization algorithm. The original YOLOv4 network is improved and optimized, the ShuffleNetv2 light-weight network is used to replace the original YOLOv4 backbone network, and the SENet module is integrated into ShuffleNetv2 to make the model converge better.</p>
<p>To bring a new and higher level of foreign body intrusion into the power distribution room, a foreign body intrusion monitoring and recognition approach in the power distribution room based on the improved YOLOv4 deep learning network is proposed. To optimize the detection effects, the YOLOv4 algorithm is improved from the aspects of the network structure, frame detection, and loss function. The main contributions of the study are summarized as follows:<list list-type="simple">
<list-item>
<p>(1) The YOLOv4 algorithm is improved from the aspects of the network structure, frame detection, and loss function.</p>
</list-item>
<list-item>
<p>(2) The channel pruning algorithm is used to prune the model to simplify the model structure and reduce the amount of calculation.</p>
</list-item>
</list>
</p>
<p>This paper is organized as follows: the first section establishes the design framework for foreign body intrusion recognition in power distribution rooms, the second section introduces the identification method of foreign body intrusion in power distribution rooms based on improved YOLOv4, the third section is experimental verification and analysis, and the fourth section is the conclusion.</p>
</sec>
<sec id="s2">
<title>2 Design framework for foreign body intrusion identification in a power distribution room</title>
<p>For the intrusion of foreign objects, the video frame is selected for frame selection, small target detection, classification, and identification. When abnormal intrusion is detected, its shape is captured through images, a general category judgment is made, and then corresponding alarms are sent to the duty personnel This would reduce the safety accidents caused by mice and other small animals entering the electric room by mistake. The detection steps are as follows:</p>
<p>
<statement content-type="step" id="Step_1">
<label>Step 1</label>
<p>Video frame selection and format processing. According to the comprehensive consideration of the scene, demand, and performance, a reasonable video frame selection interval is designed, and the captured single frame picture is converted into a JPG format picture.</p>
</statement>
</p>
<p>
<statement content-type="step" id="Step_2">
<label>Step 2</label>
<p>Image preprocessing. Image preprocessing is performed on the format-converted image to remove some noise interference in the image and ensure that the training and inference data preprocessing operations are consistent.</p>
</statement>
</p>
<p>
<statement content-type="step" id="Step_3">
<label>Step 3</label>
<p>Small mouse detection and identification. According to the preprocessing result, the image is preprocessed before recognition, and then it is input into the foreign body detection model to determine whether a mouse is detected.</p>
</statement>
</p>
<p>
<statement content-type="step" id="Step_4">
<label>Step 4</label>
<p>Judgment results and output alarm information. According to the recognition result, if no mouse is detected, it will directly return to none to continue the detection. If detected, it will output alarm information. The identification process of foreign body intrusion in the power distribution room is shown in <xref ref-type="fig" rid="F1">Figure 1</xref>.</p>
</statement>
</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Flow of the foreign body intrusion identification system in the power distribution room.</p>
</caption>
<graphic xlink:href="fenrg-10-1090033-g001.tif"/>
</fig>
</sec>
<sec id="s3">
<title>3 Identification method of foreign body intrusion in a power distribution room based on improved YOLOv4</title>
<sec id="s3-1">
<title>3.1 Data preprocessing</title>
<p>Based on the samples in the pilot electric room scene, labellum is used in this study to manually label foreign bodies such as mice in each image, and the minimum circumscribed rectangle of the target is used as the real frame during labeling. The number of annotations is shown in <xref ref-type="table" rid="T1">Table 1</xref>.</p>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Labeling of foreign bodies in the power distribution room.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="center">Scene</th>
<th align="center">Main category</th>
<th align="center">Subcategory</th>
<th align="center">Sample size</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="center">Xinghui international electric house</td>
<td align="center">Foreign body detection</td>
<td align="center">mouse</td>
<td align="center">526</td>
</tr>
<tr>
<td align="center">Dongpu training room</td>
<td align="center">Foreign body detection</td>
<td align="center">mouse</td>
<td align="center">471</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s3-2">
<title>3.2 The improvement strategy of the YOLOv4 model</title>
<p>The YOLOv4 target detection algorithm is an improved version of the YOLOv3. Compared with YOLOv3, YOLOv4 implements feature extraction of input images through the backbone feature extraction network CSPDarknet53, and Path aggregation network (PANet) is used to achieve feature fusion between different output layers (<xref ref-type="bibr" rid="B11">Hu, 2022</xref>; <xref ref-type="bibr" rid="B40">Zhang et al., 2022</xref>).</p>
<p>In this study, the foreign body detection and recognition model in the distribution room based on the YOLOv4 algorithm is mainly composed of the following four modules: the center and scale prediction (CSP) module divides the stacking of the original residual blocks into two parts, one part continues the stacking of the residual blocks, the other part is connected to the output only after a small amount of processing (<xref ref-type="bibr" rid="B27">Ren et al., 2021b</xref>; <xref ref-type="bibr" rid="B2">Ahmad et al., 2022</xref>; <xref ref-type="bibr" rid="B19">Liang and Pei, 2022</xref>). The batch normalization and Mish (CBM) module is used to extract input image features. The spatial pyramid pooling (SPP) module uses the maximum pooling of different scales to pool the input feature layers and then stack them, which can greatly increase the receptive field.</p>
<sec id="s3-2-1">
<title>3.2.1 The improvement of the network structure</title>
<p>The input image is scaled to a pixel size of 608 &#xd7; 608 as the input of the network, and local features are extracted by convolution in three different scales. The three output feature maps of different sizes, 19 &#xd7; 19, 38 &#xd7; 38, and 76 &#xd7; 76, correspond to scale 1, scale 2, and scale 3, respectively. Among them, the scale 19 &#xd7; 19 adds some convolution blocks and then convolve the output detection frame information after the auxiliary basic network, the scale 38 &#xd7; 38 introduces a new round of convolution based on the convolution before the end of scale 1 after block and up-sampling, and the CSPDarknet53 network is combined to perform the final convolution. Then, the regression judgment of the category and location is performed. To avoid problems such as label overlap, binary cross-sections perform category prediction, which is used to improve accuracy.</p>
</sec>
<sec id="s3-2-2">
<title>3.2.2 The optimization of border detection</title>
<p>In the process of dividing the input photo into M &#xd7; M blocks, each of the detection boxes contains five parameters, which are the abscissa <italic>x</italic> of the object center position, the ordinate <italic>y</italic>, the confidence score <italic>C</italic>
<sub>
<italic>objec</italic>
</sub>, the height <italic>h</italic>, and width <italic>w</italic> of the detection frame. It effectively reduces the influence of the bounding box regression dislocation caused by the fixed detection bounding box scale. As an improved YOLOv4 algorithm, the difference is that the offset of <italic>x</italic> and <italic>y</italic> is used as the linear expression of the length and width of the detection frame, as shown in Eqs. <xref ref-type="disp-formula" rid="e1">1</xref>&#x2013;<xref ref-type="disp-formula" rid="e4">4</xref>:<disp-formula id="e1">
<mml:math id="m1">
<mml:mrow>
<mml:msub>
<mml:mi>R</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>w</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
<label>(1)</label>
</disp-formula>
<disp-formula id="e2">
<mml:math id="m2">
<mml:mrow>
<mml:msub>
<mml:mi>R</mml:mi>
<mml:mi>y</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>h</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>y</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>y</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
<label>(2)</label>
</disp-formula>
<disp-formula id="e3">
<mml:math id="m3">
<mml:mrow>
<mml:msub>
<mml:mi>R</mml:mi>
<mml:mi>w</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>w</mml:mi>
</mml:msub>
<mml:msup>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>w</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
<label>(3)</label>
</disp-formula>
<disp-formula id="e4">
<mml:math id="m4">
<mml:mrow>
<mml:msub>
<mml:mi>R</mml:mi>
<mml:mi>h</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>h</mml:mi>
</mml:msub>
<mml:msup>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
<label>(4)</label>
</disp-formula>where <italic>R</italic>
<sub>
<italic>x</italic>
</sub>, <italic>R</italic>
<sub>
<italic>y</italic>
</sub>, <italic>R</italic>
<sub>
<italic>w</italic>
</sub>, and <italic>R</italic>
<sub>
<italic>h</italic>
</sub> are the estimated values of the four position coordinates on the feature map, respectively, and <italic>P</italic>
<sub>
<italic>x</italic>
</sub> and <italic>P</italic>
<sub>
<italic>y</italic>
</sub> represent the coordinate values of the center point. Then, the abscissa, ordinate, width, and height of the available frame information are studied, and the formulas for solving the values are as follows:<disp-formula id="e5">
<mml:math id="m5">
<mml:mrow>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>&#x3b4;</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
<label>(5)</label>
</disp-formula>
<disp-formula id="e6">
<mml:math id="m6">
<mml:mrow>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>y</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>&#x3b4;</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>y</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>y</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
<label>(6)</label>
</disp-formula>
<disp-formula id="e7">
<mml:math id="m7">
<mml:mrow>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>w</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>w</mml:mi>
</mml:msub>
<mml:msup>
<mml:mi>e</mml:mi>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>w</mml:mi>
</mml:msub>
</mml:msup>
</mml:mrow>
</mml:math>
<label>(7)</label>
</disp-formula>
<disp-formula id="e8">
<mml:math id="m8">
<mml:mrow>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>h</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>h</mml:mi>
</mml:msub>
<mml:msup>
<mml:mi>e</mml:mi>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:msub>
</mml:msup>
</mml:mrow>
</mml:math>
<label>(8)</label>
</disp-formula>where <italic>b</italic>
<sub>
<italic>x</italic>
</sub>, <italic>b</italic>
<sub>
<italic>y</italic>
</sub>, <italic>b</italic>
<sub>
<italic>w</italic>
</sub>, and <italic>b</italic>
<sub>
<italic>h</italic>
</sub> are the information of the obtained frame, <italic>c</italic>
<sub>
<italic>x</italic>
</sub> and <italic>c</italic>
<sub>
<italic>y</italic>
</sub> indicate the offset of the frame coordinates, and the width and sum of the preset anchor frame height are represented by <italic>P</italic>
<sub>
<italic>w</italic>
</sub> and <italic>P</italic>
<sub>
<italic>h</italic>
</sub>, respectively. On this basis, the mathematical expressions of confidence and intersection ratio are as follows:<disp-formula id="e9">
<mml:math id="m9">
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mrow>
<mml:mi>o</mml:mi>
<mml:mi>b</mml:mi>
<mml:mi>j</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>c</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>o</mml:mi>
<mml:mi>b</mml:mi>
<mml:mi>j</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>c</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>I</mml:mi>
<mml:mi>O</mml:mi>
<mml:msubsup>
<mml:mi>U</mml:mi>
<mml:mrow>
<mml:mi>p</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="normal">t</mml:mi>
<mml:mi mathvariant="normal">r</mml:mi>
<mml:mi mathvariant="normal">u</mml:mi>
<mml:mi mathvariant="normal">t</mml:mi>
<mml:mi mathvariant="normal">h</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:math>
<label>(9)</label>
</disp-formula>
<disp-formula id="e10">
<mml:math id="m10">
<mml:mrow>
<mml:mi>I</mml:mi>
<mml:mi>O</mml:mi>
<mml:msubsup>
<mml:mi>U</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">p</mml:mi>
<mml:mi mathvariant="normal">r</mml:mi>
<mml:mi mathvariant="normal">e</mml:mi>
<mml:mi mathvariant="normal">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="normal">t</mml:mi>
<mml:mi mathvariant="normal">r</mml:mi>
<mml:mi mathvariant="normal">u</mml:mi>
<mml:mi mathvariant="normal">t</mml:mi>
<mml:mi mathvariant="normal">h</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mi mathvariant="normal">r</mml:mi>
<mml:mi mathvariant="normal">e</mml:mi>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mtext>&#x2009;</mml:mtext>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>B</mml:mi>
<mml:msub>
<mml:mi>B</mml:mi>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2229;</mml:mo>
<mml:mi>B</mml:mi>
<mml:msub>
<mml:mi>B</mml:mi>
<mml:mrow>
<mml:mi>g</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mi mathvariant="normal">r</mml:mi>
<mml:mi mathvariant="normal">e</mml:mi>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mtext>&#x2009;</mml:mtext>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>B</mml:mi>
<mml:msub>
<mml:mi>B</mml:mi>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x222a;</mml:mo>
<mml:mi>B</mml:mi>
<mml:msub>
<mml:mi>B</mml:mi>
<mml:mrow>
<mml:mi>g</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
<label>(10)</label>
</disp-formula>where <italic>P</italic>
<sub>
<italic>object</italic>
</sub> represents the probability that the target exists in the cell, <inline-formula id="inf1">
<mml:math id="m11">
<mml:mrow>
<mml:mi>I</mml:mi>
<mml:mi>O</mml:mi>
<mml:msubsup>
<mml:mi>U</mml:mi>
<mml:mrow>
<mml:mi>p</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="normal">t</mml:mi>
<mml:mi mathvariant="normal">r</mml:mi>
<mml:mi mathvariant="normal">u</mml:mi>
<mml:mi mathvariant="normal">t</mml:mi>
<mml:mi mathvariant="normal">h</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> represents the accuracy of the current bounding detection box to predict the target position, area () represents the area, <italic>BB</italic>
<sub>
<italic>dt</italic>
</sub> represents the detection bounding box, and <italic>BB</italic>
<sub>
<italic>gt</italic>
</sub> represents the reference frame based on the training label.</p>
</sec>
<sec id="s3-2-3">
<title>3.2.3 The optimization of loss function</title>
<p>Considering the contribution rate of each loss and the weight in training, the loss function of the improved YOLOv4 algorithm is as follows:<disp-formula id="e11">
<mml:math id="m12">
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mo>&#x2212;</mml:mo>
</mml:msub>
<mml:mi>L</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>s</mml:mi>
<mml:mi>s</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>&#x3bb;</mml:mi>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>d</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:munderover>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:msup>
<mml:mi>M</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:munderover>
<mml:mrow>
<mml:munderover>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:munderover>
<mml:msubsup>
<mml:mi>V</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>o</mml:mi>
<mml:mi>b</mml:mi>
<mml:mi>j</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>c</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mfenced open="[" close="]" separators="|">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mo>&#x2b;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi>&#x3bb;</mml:mi>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>d</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:munderover>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:msup>
<mml:mi>M</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:munderover>
<mml:mrow>
<mml:munderover>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:munderover>
<mml:msubsup>
<mml:mi>V</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>o</mml:mi>
<mml:mi>b</mml:mi>
<mml:mi>j</mml:mi>
<mml:mi>j</mml:mi>
<mml:mi>c</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mfenced open="[" close="]" separators="|">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:msqrt>
<mml:msub>
<mml:mi>w</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:msqrt>
<mml:mo>&#x2212;</mml:mo>
<mml:msqrt>
<mml:msub>
<mml:mover accent="true">
<mml:mi>w</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:msqrt>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mo>&#x2b;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:msqrt>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:msqrt>
<mml:mo>&#x2212;</mml:mo>
<mml:msqrt>
<mml:msub>
<mml:mover accent="true">
<mml:mi>h</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:msqrt>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mo>&#x2b;</mml:mo>
<mml:mrow>
<mml:munderover>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:msup>
<mml:mi>M</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:munderover>
<mml:mrow>
<mml:munderover>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:munderover>
<mml:msubsup>
<mml:mi>V</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>o</mml:mi>
<mml:mi>b</mml:mi>
<mml:mi>j</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>c</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>c</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi>&#x3bb;</mml:mi>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>b</mml:mi>
<mml:mi>j</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>c</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:munderover>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:msup>
<mml:mi>M</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:munderover>
<mml:mrow>
<mml:munderover>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:munderover>
<mml:msubsup>
<mml:mi>V</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>o</mml:mi>
<mml:mi>b</mml:mi>
<mml:mi>j</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>c</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>c</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mo>&#x2b;</mml:mo>
<mml:mrow>
<mml:munderover>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:msup>
<mml:mi>M</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:munderover>
<mml:msubsup>
<mml:mi>V</mml:mi>
<mml:mi>i</mml:mi>
<mml:mrow>
<mml:mi>o</mml:mi>
<mml:mi>b</mml:mi>
<mml:mi>j</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>c</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
<mml:mrow>
<mml:munder>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mi>c</mml:mi>
<mml:mi>l</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>s</mml:mi>
<mml:mi>s</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>s</mml:mi>
<mml:mtext>&#x2009;</mml:mtext>
</mml:mrow>
</mml:munder>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>c</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>p</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>c</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:math>
<label>(11)</label>
</disp-formula>where <italic>V</italic>
<sub>
<italic>i</italic>
</sub> indicates whether there is a foreign body intrusion in the grid <italic>i</italic>; if there is an object, <italic>V</italic>
<sub>
<italic>i</italic>
</sub> &#x3d; 1, otherwise it is 0. <inline-formula id="inf2">
<mml:math id="m13">
<mml:mrow>
<mml:msubsup>
<mml:mi>V</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>o</mml:mi>
<mml:mi>b</mml:mi>
<mml:mi>j</mml:mi>
<mml:mi>j</mml:mi>
<mml:mi>c</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> indicates whether there is a foreign body intrusion in the <italic>i</italic>th detection frame in the <italic>i</italic>th grid; if so, <inline-formula id="inf3">
<mml:math id="m14">
<mml:mrow>
<mml:msubsup>
<mml:mi>V</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>o</mml:mi>
<mml:mi>b</mml:mi>
<mml:mi>j</mml:mi>
<mml:mi>j</mml:mi>
<mml:mi>c</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> &#x3d; 1, otherwise, it is 0. <inline-formula id="inf4">
<mml:math id="m15">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3bb;</mml:mi>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>d</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf5">
<mml:math id="m16">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3bb;</mml:mi>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>b</mml:mi>
<mml:mi>j</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>c</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> are the loss weights when foreign bodies are found and not found in the detection frame, respectively.</p>
</sec>
<sec id="s3-2-4">
<title>3.2.4 YOLOv4 model simplification based on the channel pruning algorithm</title>
<p>To improve the practicability of the detection model and reduce the calculation of the model amount, it is necessary to prune the model. The channel pruning algorithm is used in this study to prune the YOLOv4 model while ensuring accuracy. The channel pruning algorithm realizing the principle is shown in <xref ref-type="fig" rid="F2">Figure 2</xref>.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Pruning schematic of the channel pruning algorithm.</p>
</caption>
<graphic xlink:href="fenrg-10-1090033-g002.tif"/>
</fig>
<p>During model training, the contribution rate of each input channel is scored using the coefficients of batch normalization (BN) layers in the network. The channel with a high contribution rate is retained, the channel with low contribution rate is pruned, and the pruned model structure is more compact.</p>
</sec>
</sec>
<sec id="s3-3">
<title>3.3 Evaluation indicators</title>
<p>To evaluate the recognition effect, the average accuracy index is used to measure the matching accuracy of the detection frame to the target object; it can be expressed as:<disp-formula id="e12">
<mml:math id="m17">
<mml:mrow>
<mml:mi>A</mml:mi>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mo>&#x2211;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi mathvariant="normal">T</mml:mi>
<mml:mi mathvariant="normal">r</mml:mi>
<mml:mi mathvariant="normal">u</mml:mi>
<mml:mi mathvariant="normal">e</mml:mi>
<mml:mi mathvariant="normal">P</mml:mi>
<mml:mi mathvariant="normal">o</mml:mi>
<mml:mi mathvariant="normal">s</mml:mi>
<mml:mi mathvariant="normal">i</mml:mi>
<mml:mi mathvariant="normal">t</mml:mi>
<mml:mi mathvariant="normal">i</mml:mi>
<mml:mi mathvariant="normal">v</mml:mi>
<mml:mi mathvariant="normal">e</mml:mi>
<mml:mi mathvariant="normal">s</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi mathvariant="normal">T</mml:mi>
<mml:mi mathvariant="normal">o</mml:mi>
<mml:mi mathvariant="normal">t</mml:mi>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mi mathvariant="normal">l</mml:mi>
<mml:mi mathvariant="normal">o</mml:mi>
<mml:mi mathvariant="normal">b</mml:mi>
<mml:mi mathvariant="normal">j</mml:mi>
<mml:mi mathvariant="normal">e</mml:mi>
<mml:mi mathvariant="normal">c</mml:mi>
<mml:mi mathvariant="normal">t</mml:mi>
<mml:mi mathvariant="normal">s</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi mathvariant="normal">T</mml:mi>
<mml:mi mathvariant="normal">o</mml:mi>
<mml:mi mathvariant="normal">t</mml:mi>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mi mathvariant="normal">l</mml:mi>
<mml:mi mathvariant="normal">I</mml:mi>
<mml:mi mathvariant="normal">m</mml:mi>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mi mathvariant="normal">g</mml:mi>
<mml:mi mathvariant="normal">e</mml:mi>
<mml:mi mathvariant="normal">s</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
<label>(12)</label>
</disp-formula>where <inline-formula id="inf6">
<mml:math id="m18">
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi mathvariant="normal">T</mml:mi>
<mml:mi mathvariant="normal">r</mml:mi>
<mml:mi mathvariant="normal">u</mml:mi>
<mml:mi mathvariant="normal">e</mml:mi>
<mml:mi mathvariant="normal">P</mml:mi>
<mml:mi mathvariant="normal">o</mml:mi>
<mml:mi mathvariant="normal">s</mml:mi>
<mml:mi mathvariant="normal">i</mml:mi>
<mml:mi mathvariant="normal">t</mml:mi>
<mml:mi mathvariant="normal">i</mml:mi>
<mml:mi mathvariant="normal">v</mml:mi>
<mml:mi mathvariant="normal">e</mml:mi>
<mml:mi mathvariant="normal">s</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the actual quantity of pictures at the <italic>i</italic>th category, <inline-formula id="inf7">
<mml:math id="m19">
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi mathvariant="normal">T</mml:mi>
<mml:mi mathvariant="normal">o</mml:mi>
<mml:mi mathvariant="normal">t</mml:mi>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mi mathvariant="normal">l</mml:mi>
<mml:mi mathvariant="normal">o</mml:mi>
<mml:mi mathvariant="normal">b</mml:mi>
<mml:mi mathvariant="normal">j</mml:mi>
<mml:mi mathvariant="normal">e</mml:mi>
<mml:mi mathvariant="normal">c</mml:mi>
<mml:mi mathvariant="normal">t</mml:mi>
<mml:mi mathvariant="normal">s</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the quantity of all objects at the data set category <italic>i</italic>, and <inline-formula id="inf8">
<mml:math id="m20">
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi mathvariant="normal">T</mml:mi>
<mml:mi mathvariant="normal">o</mml:mi>
<mml:mi mathvariant="normal">t</mml:mi>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mi mathvariant="normal">l</mml:mi>
<mml:mi mathvariant="normal">I</mml:mi>
<mml:mi mathvariant="normal">m</mml:mi>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mi mathvariant="normal">g</mml:mi>
<mml:mi mathvariant="normal">e</mml:mi>
<mml:mi mathvariant="normal">s</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the quantity of images containing objects in the category <italic>i</italic>.</p>
<p>The average precision metric is<disp-formula id="e13">
<mml:math id="m21">
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>A</mml:mi>
<mml:mi>P</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:munderover>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mi>i</mml:mi>
<mml:mi>N</mml:mi>
</mml:munderover>
<mml:mrow>
<mml:mi>A</mml:mi>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mrow>
<mml:mo>/</mml:mo>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:math>
<label>(13)</label>
</disp-formula>
</p>
</sec>
</sec>
<sec id="s4">
<title>4 Experimental verification and analysis</title>
<p>The experimental parameters are shown in <xref ref-type="table" rid="T2">Table 2</xref> and <xref ref-type="table" rid="T3">Table 3</xref>. The identification object is a mouse in the power distribution room. The scene locations are the pilot power room and the Dongpu training room. The site is characterized by no light in the room, and the infrared white-light function of the camera needs to be automatically turned on. The camera deployment requirement is that in the pilot power room, at least one camera will monitor the entire power room. The height of the camera stand is 1&#xa0;m and 2.3&#xa0;m, respectively, and the angle is 15&#xb0; downward. The experimental data can be referred to in the reference (<xref ref-type="bibr" rid="B9">Dhiman and Kumar, 2017</xref>; <xref ref-type="bibr" rid="B17">Li et al., 2021</xref>; <xref ref-type="bibr" rid="B33">Traor&#xe9; and Pauwels, 2021</xref>; <xref ref-type="bibr" rid="B35">Xu et al., 2021</xref>). During the training phase, the base learning rate was .00001, the weight decay was .00005, the batch size was 64, and the momentum was .9. In this parameter setting, the change in the loss value in the training process of traditional YOLOv4 and improved YOLOv4 used is compared, and the results are shown in <xref ref-type="fig" rid="F3">Figure 3</xref>.</p>
<table-wrap id="T2" position="float">
<label>TABLE 2</label>
<caption>
<p>Experimental environment.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="center">Soft and hard items</th>
<th align="center">Detail</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="center">CPU</td>
<td align="center">48 Intel(R) Xeon(R) CPU E7-4830 v3 @ 2.10&#xa0;GHz</td>
</tr>
<tr>
<td align="center">MEM</td>
<td align="center">256G</td>
</tr>
<tr>
<td align="center">Hard disk</td>
<td align="center">&#x2265; 500G</td>
</tr>
<tr>
<td align="center">Operating system</td>
<td align="center">Ubuntu18.04</td>
</tr>
<tr>
<td align="center">Docker version</td>
<td align="center">Docker version 19</td>
</tr>
<tr>
<td align="center">GPU</td>
<td align="center">NVIDIA Tesla P4</td>
</tr>
<tr>
<td align="center">CUDA version</td>
<td align="center">CUDA 10.1</td>
</tr>
<tr>
<td align="center">Deep learning framework</td>
<td align="center">TensorFlow2.x and Keras</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="T3" position="float">
<label>TABLE 3</label>
<caption>
<p>Parameter setting for channel pruning.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="center">Handle mode</th>
<th align="center">Parameter</th>
<th align="center">Value</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="4" align="center">Sparse training</td>
<td align="center">Batch size</td>
<td align="center">9</td>
</tr>
<tr>
<td align="center">Learning rate</td>
<td align="center">.0008</td>
</tr>
<tr>
<td align="center">Number of iterations</td>
<td align="center">500</td>
</tr>
<tr>
<td align="center">Sparsity rate</td>
<td align="center">.001</td>
</tr>
<tr>
<td align="center">Channel pruning</td>
<td align="center">Pruning rate</td>
<td align="center">.85</td>
</tr>
<tr>
<td rowspan="2" align="center">Fine-tuning the model</td>
<td align="center">Number of iterations</td>
<td align="center">10000</td>
</tr>
<tr>
<td align="center">Batch size</td>
<td align="center">8</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Training loss curve of the model.</p>
</caption>
<graphic xlink:href="fenrg-10-1090033-g003.tif"/>
</fig>
<p>In <xref ref-type="fig" rid="F3">Figure 3</xref>, the loss values of the two networks decrease rapidly with the iterations, but it is evident that the loss value of the improved YOLO converges to a lower value of about .2, while the unimproved loss value is significantly higher at .5. Therefore, the loss value can directly reflect that the model of the improved YOLOv4 algorithm is more ideal, and the detection accuracy and speed are more in line with the experimental requirements.</p>
<p>
<xref ref-type="table" rid="T4">Table 4</xref> shows the pruning result. After pruning, the number of parameters of the model is reduced by 91.67%, the storage space of the model is reduced by 213.17 MB, and the forward operation time is shortened by .043&#xa0;s. After fine tuning, the model parameters and occupied space increased slightly, but the accuracy was still similar to that before pruning, and the forward propagation time did not change significantly. The results show that the pruning process can simplify the model while ensuring the original accuracy of the model.</p>
<table-wrap id="T4" position="float">
<label>TABLE 4</label>
<caption>
<p>Parameters after fine tuning.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="center">Parameter</th>
<th align="center">Original model</th>
<th align="center">After pruning</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="center">Number of parameters (piece)</td>
<td align="center">6172143</td>
<td align="center">514242</td>
</tr>
<tr>
<td align="center">
<italic>mAP</italic> (%)</td>
<td align="center">97.48</td>
<td align="center">97.32</td>
</tr>
<tr>
<td align="center">Model storage space (MB)</td>
<td align="center">231.64</td>
<td align="center">18.47</td>
</tr>
<tr>
<td align="center">Operation time (s)</td>
<td align="center">.094</td>
<td align="center">.051</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The number of channels in each layer after pruning is shown in <xref ref-type="fig" rid="F4">Figure 4</xref>. As can be seen from <xref ref-type="fig" rid="F4">Figure 4</xref>, the number of channels in each convolutional layer has been greatly reduced, which proves that the pruning algorithm is effective.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>Channel changes before and after pruning.</p>
</caption>
<graphic xlink:href="fenrg-10-1090033-g004.tif"/>
</fig>
<p>When the network training is performed for about 15,000 times, the loss value stabilizes around .2. At this time, the images in the test set are sent to the trained network for testing and verification, and the performance of foreign body recognition in various scenarios is shown in <xref ref-type="table" rid="T5">Table 5</xref>.</p>
<table-wrap id="T5" position="float">
<label>TABLE 5</label>
<caption>
<p>Performance of the foreign body detection model.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="center"/>
<th align="center">Test scenario</th>
<th align="center">Correct number</th>
<th align="center">Error number</th>
<th align="center">Accuracy rate (%)</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="center">Mouse detection and identification</td>
<td align="center">Video 1</td>
<td align="center">51</td>
<td align="center">4</td>
<td align="center">92.73</td>
</tr>
<tr>
<td align="center">Mouse detection and identification</td>
<td align="center">Video 2</td>
<td align="center">67</td>
<td align="center">6</td>
<td align="center">91.78</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>In <xref ref-type="table" rid="T5">Table 5</xref>, the result shows the good recognition performance, which can accurately identify the foreign body in different scenarios, showing good recognition performance.</p>
<p>To reflect the superiority, the faster R-CNN (<xref ref-type="bibr" rid="B31">Stanford et al., 2017</xref>), YOLOv3 (<xref ref-type="bibr" rid="B5">Chen and Zhao, 2021</xref>), SSD (<xref ref-type="bibr" rid="B8">Dhiman and Kumar, 2018</xref>), and the YOLOv4 model are tested (<xref ref-type="bibr" rid="B22">Ma et al., 2021</xref>). After training, the test set data are used to evaluate the performance of these five algorithms. The evaluation results are shown in <xref ref-type="table" rid="T6">Table 6</xref>.</p>
<table-wrap id="T6" position="float">
<label>TABLE 6</label>
<caption>
<p>Performance comparison of five algorithms.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="center">Model</th>
<th align="center">Accuracy (%)</th>
<th align="center">Recall (%)</th>
<th align="center">
<italic>mAP</italic> (%)</th>
<th align="center">Detection speed (f/s)</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="center">Faster R-CNN</td>
<td align="center">83.47</td>
<td align="center">90.17</td>
<td align="center">84.23</td>
<td align="center">49.12</td>
</tr>
<tr>
<td align="center">YOLOv3</td>
<td align="center">89.17</td>
<td align="center">92.45</td>
<td align="center">90.17</td>
<td align="center">87.37</td>
</tr>
<tr>
<td align="center">SSD</td>
<td align="center">85.64</td>
<td align="center">88.46</td>
<td align="center">87.14</td>
<td align="center">118.24</td>
</tr>
<tr>
<td align="center">YOLOv4</td>
<td align="center">91.76</td>
<td align="center">92.94</td>
<td align="center">97.48</td>
<td align="center">72.96</td>
</tr>
<tr>
<td align="center">Method of this study</td>
<td align="center">93.43</td>
<td align="center">97.24</td>
<td align="center">97.32</td>
<td align="center">80.74</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>In <xref ref-type="table" rid="T5">Table 5</xref>, compared with the other four models, the accuracy of the method in this study is higher. The <italic>mAP</italic> of the YOLOv4 model after pruning is 1.6% lower than that of the YOLOv4 model before pruning, and the accuracy of the model after fine tuning is still similar to that before pruning. In terms of detection speed, although YOLOv3 and SSD are faster than the method proposed, their <italic>mAP</italic> is 7.3% and 10.5% lower than the method proposed in this study, respectively. In summary, compared with the other four models, the method in this study has higher <italic>mAP</italic> and higher detection speed, among which <italic>mAP</italic> is 13.5%, 7.3%, and 10.5% higher than faster R-CNN, YOLOv3, and SSD, respectively. The detection speed is 39.2% and 9.6% higher than faster R-CNN and YOLOv4, respectively. The detection accuracy of the method is high, the detection speed is fast, and the space occupied after pruning processing is small, which proves that the YOLOv4 detection model based on the pruning algorithm has high practicability and can be used for the detection of foreign bodies in power distribution rooms, thereby ensuring power distribution safety.</p>
</sec>
<sec sec-type="conclusion" id="s5">
<title>5 Conclusion</title>
<p>In this paper, a foreign body intrusion monitoring and recognition approach in the power distribution room based on the improved YOLOv4 deep learning network is proposed. To optimize the detection effect, the YOLOv4 algorithm is improved from the aspects of network structure, frame detection, and loss function. The results show that the proposed approach has higher <italic>mAP</italic> and higher detection speed. The <italic>mAP</italic> index is 13.5%, 7.3%, and 10.5% higher than faster R-CNN, YOLOv3, and SSD algorithm, respectively. The detection speed is, respectively, 39.2% and 9.6% higher than the faster R-CNN and YOLOv4 algorithms. The improved YOLOv4 has high detection accuracy, fast detection speed, and takes up less space after pruning. It can be used for the detection of foreign bodies in the power distribution room so as to ensure the safety of power distribution. However, it still needs to be pointed out that the model is based on the results of static image processing, and how to detect dynamic videos will be the main research direction in the future.</p>
</sec>
</body>
<back>
<sec sec-type="data-availability" id="s6">
<title>Data availability statement</title>
<p>The original contributions presented in the study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="s7">
<title>Author contributions</title>
<p>All authors listed have made a substantial, direct, and intellectual contribution to the work and approved it for publication.</p>
</sec>
<sec id="s8">
<title>Funding</title>
<p>I and my fellow co-authors are fully aware of and agree with the payment of the listed article&#x2019;s processing fee should the manuscript be accepted for publication.</p>
</sec>
<sec sec-type="COI-statement" id="s9">
<title>Conflict of interest</title>
<p>SC, XD, ZW, PZ, and ZC were employed by Tianhe Power Supply Bureau of Guangzhou Power Supply Bureau, Guangdong Power Co. Ltd.</p>
</sec>
<sec sec-type="disclaimer" id="s10">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Aggarwal</surname>
<given-names>L. P.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Data augmentation in dermatology image recognition using machine learning</article-title>. <source>Skin Res. Technol.</source> <volume>25</volume> (<issue>6</issue>), <fpage>815</fpage>&#x2013;<lpage>820</lpage>. <pub-id pub-id-type="doi">10.1111/srt.12726</pub-id>
</citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ahmad</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Shahid</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Alam</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Ashraf</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Sajid</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Kotecha</surname>
<given-names>K.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>Levelized multiple workflow allocation strategy under precedence constraints with task merging in IaaS cloud environment</article-title>. <source>IEEE Access</source> <volume>10</volume>, <fpage>92809</fpage>&#x2013;<lpage>92827</lpage>. <pub-id pub-id-type="doi">10.1109/access.2022.3202651</pub-id>
</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Anastasiadou</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Gavanas</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Pitsiava-Latinopoulou</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Bekiaris</surname>
<given-names>E.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Infrastructure planning for autonomous electric vehicles, integrating safety and sustainability aspects: A multi-criteria analysis approach</article-title>. <source>Energies</source> <volume>14</volume> (<issue>17</issue>), <fpage>5269</fpage>&#x2013;<lpage>5274</lpage>. <pub-id pub-id-type="doi">10.3390/en14175269</pub-id>
</citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bashar</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Survey on evolving deep learning neural network architectures</article-title>. <source>J. Artif. Intell.</source> <volume>1</volume> (<issue>2</issue>), <fpage>73</fpage>&#x2013;<lpage>82</lpage>. <pub-id pub-id-type="doi">10.36548/jaicn.2019.2.003</pub-id>
</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>Q.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Mineral exploration targeting by combination of recursive indicator elimination with the &#x2113;2-regularization logistic regression based on geochemical data</article-title>. <source>Ore Geol. Rev.</source> <volume>135</volume> (<issue>2</issue>), <fpage>104213</fpage>. <pub-id pub-id-type="doi">10.1016/j.oregeorev.2021.104213</pub-id>
</citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Bai</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Gao</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Han</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Image recognition method of power equipment based on deep learning</article-title>. <source>Electr. Technol.</source> <volume>9</volume>, <fpage>34</fpage>&#x2013;<lpage>36</lpage>.</citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dhiman</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Kaur</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Stoa: A bio-inspired based optimization algorithm for industrial engineering problems</article-title>. <source>Eng. Appl. Artif. Intell.</source> <volume>82</volume>, <fpage>148</fpage>&#x2013;<lpage>174</lpage>. <pub-id pub-id-type="doi">10.1016/j.engappai.2019.03.021</pub-id>
</citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dhiman</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>V.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Emperor penguin optimizer: A bio-inspired algorithm for engineering problems</article-title>. <source>Knowledge-Based Syst.</source> <volume>159</volume>, <fpage>20</fpage>&#x2013;<lpage>50</lpage>. <pub-id pub-id-type="doi">10.1016/j.knosys.2018.06.001</pub-id>
</citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dhiman</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>V.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Spotted hyena optimizer: A novel bio-inspired based metaheuristic technique for engineering applications</article-title>. <source>Adv. Eng. Softw.</source> <volume>114</volume>, <fpage>48</fpage>&#x2013;<lpage>70</lpage>. <pub-id pub-id-type="doi">10.1016/j.advengsoft.2017.05.014</pub-id>
</citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gangolells</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Casals</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Forcada</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Roca</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Fuertes</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Mitigating construction safety risks using prevention through design</article-title>. <source>J. Saf. Res.</source> <volume>41</volume> (<issue>2</issue>), <fpage>107</fpage>&#x2013;<lpage>122</lpage>. <pub-id pub-id-type="doi">10.1016/j.jsr.2009.10.007</pub-id>
</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hu</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Detection and identification of foreign bodys in urban rail transit lines based on image processing</article-title>. <source>Automation Instrum.</source> <volume>9</volume> (<issue>7</issue>), <fpage>86</fpage>&#x2013;<lpage>90</lpage>.</citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jacob</surname>
<given-names>I. J.</given-names>
</name>
<name>
<surname>Darney</surname>
<given-names>P. E.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Design of deep learning algorithm for IoT application by image based recognition</article-title>. <source>J. ISMAC</source> <volume>3</volume> (<issue>3</issue>), <fpage>276</fpage>&#x2013;<lpage>290</lpage>. <pub-id pub-id-type="doi">10.36548/jismac.2021.3.008</pub-id>
</citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kaur</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Awasthi</surname>
<given-names>L. K.</given-names>
</name>
<name>
<surname>Sangal</surname>
<given-names>A. L.</given-names>
</name>
<name>
<surname>Dhiman</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Tunicate swarm algorithm: A new bio-inspired based metaheuristic paradigm for global optimization</article-title>. <source>Eng. Appl. Artif. Intell.</source> <volume>90</volume>, <fpage>103541</fpage>. <pub-id pub-id-type="doi">10.1016/j.engappai.2020.103541</pub-id>
</citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ker</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Rao</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Lim</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Deep learning applications in medical image analysis</article-title>. <source>IEEE Access</source> <volume>6</volume> (<issue>13</issue>), <fpage>9375</fpage>&#x2013;<lpage>9389</lpage>. <pub-id pub-id-type="doi">10.1109/access.2017.2788044</pub-id>
</citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kiruthika</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Sakthi</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Kaviya</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Vishnupriya</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Blood bank monitoring and blood identification system using IoT device</article-title>. <source>Ann. Romanian Soc. Cell. Biol.</source> <volume>25</volume> (<issue>6</issue>), <fpage>182</fpage>&#x2013;<lpage>192</lpage>.</citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kumar</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Dhiman</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>A comparative study of fuzzy optimization through fuzzy number</article-title>. <source>Int. J. Mod. Res.</source> <volume>1</volume>, <fpage>1</fpage>&#x2013;<lpage>14</lpage>.</citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Couti&#xf1;o</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Giannakis</surname>
<given-names>G. B.</given-names>
</name>
<name>
<surname>Leus</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>A momentum-guided frank-wolfe algorithm</article-title>. <source>IEEE Trans. Signal Process.</source> <volume>69</volume> (<issue>12</issue>), <fpage>3597</fpage>&#x2013;<lpage>3611</lpage>. <pub-id pub-id-type="doi">10.1109/tsp.2021.3087910</pub-id>
</citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Pan</surname>
<given-names>J. S.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Hyperspectral image recognition using SVM combined deep learning</article-title>. <source>J. Internet Technol.</source> <volume>20</volume> (<issue>3</issue>), <fpage>851</fpage>&#x2013;<lpage>859</lpage>.</citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liang</surname>
<given-names>Liming</given-names>
</name>
<name>
<surname>Pei</surname>
<given-names>Zou</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Insulator detection algorithm based on YOLOv4</article-title>. <source>Softw. Guide</source> <volume>21</volume> (<issue>08</issue>), <fpage>132</fpage>&#x2013;<lpage>137</lpage>.</citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Jin</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Mohamed</surname>
<given-names>M. A.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Q.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>A novel three-step classification approach based on time-dependent spectral features for complex power quality disturbances</article-title>. <source>IEEE Trans. Instrum. Meas.</source> <volume>70</volume>, <fpage>1</fpage>&#x2013;<lpage>14</lpage>. <pub-id pub-id-type="doi">10.1109/tim.2021.3050187</pub-id>
</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Yuan</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Gong</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Jin</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Mohamed</surname>
<given-names>M. A.</given-names>
</name>
</person-group> (<year>2023</year>). <article-title>Adaptive spectral trend based optimized EWT for monitoring the parameters of multiple power quality disturbances</article-title>. <source>Int. J. Electr. Power and Energy Syst.</source> <volume>146</volume>, <fpage>108797</fpage>. <pub-id pub-id-type="doi">10.1016/j.ijepes.2022.108797</pub-id>
</citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ma</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Jia</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Luo</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>H.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>An effective risk identification method for power fence operation based on neighborhood correlation network and vector calculation</article-title>. <source>Energy Rep.</source> <volume>7</volume>, <fpage>6995</fpage>&#x2013;<lpage>7003</lpage>. <pub-id pub-id-type="doi">10.1016/j.egyr.2021.10.061</pub-id>
</citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mezgec</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Eftimov</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Bucher</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Korousic Seljak</surname>
<given-names>B.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Mixed deep learning and natural language processing method for fake-food image recognition and standardization to help automated dietary assessment</article-title>. <source>Public health Nutr.</source> <volume>22</volume> (<issue>7</issue>), <fpage>1193</fpage>&#x2013;<lpage>1202</lpage>. <pub-id pub-id-type="doi">10.1017/S1368980018000708</pub-id>
</citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mroszczyk</surname>
<given-names>J. W.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Improving construction safety: A team effort</article-title>. <source>Prof. Saf.</source> <volume>60</volume> (<issue>6</issue>), <fpage>55</fpage>&#x2013;<lpage>68</lpage>.</citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Qin</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Ming</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Song</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>W.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Helmet detection method based on lightweight deep learning model</article-title>. <source>Sci. Technol. Eng.</source> <volume>22</volume> (<issue>14</issue>), <fpage>5659</fpage>&#x2013;<lpage>5665</lpage>.</citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ramcharan</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Baranowski</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>McCloskey</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Ahmed</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Legg</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Hughes</surname>
<given-names>D. P.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Deep learning for image-based cassava disease detection</article-title>. <source>Front. plant Sci.</source> <volume>8</volume> (<issue>14</issue>), <fpage>1852</fpage>&#x2013;<lpage>2201</lpage>. <pub-id pub-id-type="doi">10.3389/fpls.2017.01852</pub-id>
</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ren</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Han</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Detection of foreign bodys in coal mine belt transportation based on Fast_YOLOv3 algorithm</article-title> [J]. <source>Industrial Min. Automation</source> <volume>47</volume> (<issue>12</issue>), <fpage>128</fpage>&#x2013;<lpage>133</lpage>.</citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ren</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Yue</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Ma</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>Y.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Adaptive parameters optimization model with 3D information extraction for infrared small target detection based on particle swarm optimization algorithm</article-title>. <source>Infrared Phys. Technol.</source> <volume>117</volume> (<issue>3</issue>), <fpage>103838</fpage>&#x2013;<lpage>103847</lpage>. <pub-id pub-id-type="doi">10.1016/j.infrared.2021.103838</pub-id>
</citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sharma</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Guided stochastic gradient descent algorithm for inconsistent datasets</article-title>. <source>Appl. Soft Comput.</source> <volume>73</volume> (<issue>5</issue>), <fpage>1068</fpage>&#x2013;<lpage>1080</lpage>. <pub-id pub-id-type="doi">10.1016/j.asoc.2018.09.038</pub-id>
</citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Singamaneni</surname>
<given-names>K. K.</given-names>
</name>
<name>
<surname>Dhiman</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Juneja</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Muhammad</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>AlQahtani</surname>
<given-names>S. A.</given-names>
</name>
<name>
<surname>Zaki</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>A novel QKD approach to enhance IIOT privacy and computational knacks</article-title>. <source>Sensors</source> <volume>22</volume> (<issue>18</issue>), <fpage>6741</fpage>. <pub-id pub-id-type="doi">10.3390/s22186741</pub-id>
</citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Stanford</surname>
<given-names>F. C.</given-names>
</name>
<name>
<surname>Alfaris</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Gomez</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Ricks</surname>
<given-names>E. T.</given-names>
</name>
<name>
<surname>Shukla</surname>
<given-names>A. P.</given-names>
</name>
<name>
<surname>Corey</surname>
<given-names>K. E.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>The utility of weight loss medications after bariatric surgery for weight regain or inadequate weight loss: A multi-center study</article-title>. <source>Surg. Obes. Relat. Dis.</source> <volume>13</volume> (<issue>3</issue>), <fpage>491</fpage>&#x2013;<lpage>500</lpage>. <pub-id pub-id-type="doi">10.1016/j.soard.2016.10.018</pub-id>
</citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Xiao</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Sebe</surname>
<given-names>N.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>When dictionary learning meets deep learning: Deep dictionary learning and coding network for image recognition with limited data</article-title>. <source>IEEE Trans. neural Netw. Learn. Syst.</source> <volume>32</volume> (<issue>5</issue>), <fpage>2129</fpage>&#x2013;<lpage>2141</lpage>. <pub-id pub-id-type="doi">10.1109/tnnls.2020.2997289</pub-id>
</citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Traor&#xe9;</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Pauwels</surname>
<given-names>E.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Sequential convergence of AdaGrad algorithm for smooth convex optimization</article-title>. <source>Operations Res. Lett.</source> <volume>49</volume> (<issue>4</issue>), <fpage>452</fpage>&#x2013;<lpage>458</lpage>. <pub-id pub-id-type="doi">10.1016/j.orl.2021.04.011</pub-id>
</citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Luo</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Feng</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Zou</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Automated diabetic retinopathy grading and lesion detection based on the modified R&#x2010;FCN object&#x2010;detection algorithm</article-title>. <source>IET Comput. Vis.</source> <volume>14</volume> (<issue>1</issue>), <fpage>1</fpage>&#x2013;<lpage>8</lpage>. <pub-id pub-id-type="doi">10.1049/iet-cvi.2018.5508</pub-id>
</citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Mandic</surname>
<given-names>D. P.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Convergence of the RMSProp deep learning method with penalty for nonconvex optimization</article-title>. <source>Neural Netw.</source> <volume>139</volume>, <fpage>17</fpage>&#x2013;<lpage>23</lpage>. <pub-id pub-id-type="doi">10.1016/j.neunet.2021.02.011</pub-id>
</citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Mao</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Deep learning detection method for safety helmets on construction work surfaces</article-title>. <source>Comput. Appl.</source> <volume>40</volume> (<issue>S2</issue>), <fpage>178</fpage>&#x2013;<lpage>182</lpage>. <pub-id pub-id-type="doi">10.1002/cpe.6234</pub-id>
</citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Yan</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Design of safety helmet and mask detection system based on deep learning</article-title>. <source>Comput. Knowl. Technol.</source> <volume>18</volume> (<issue>10</issue>), <fpage>15</fpage>&#x2013;<lpage>18</lpage>.</citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Lv</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Image recognition of wind turbine blade damage based on a deep learning model with transfer learning and an ensemble learning classifier</article-title>. <source>Renew. Energy</source> <volume>163</volume> (<issue>5</issue>), <fpage>386</fpage>&#x2013;<lpage>397</lpage>. <pub-id pub-id-type="doi">10.1016/j.renene.2020.08.125</pub-id>
</citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yuan</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Lan</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Jin</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Mohamed</surname>
<given-names>M. A.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>A novel recognition method for complex power quality disturbances based on visualization trajectory circle and machine vision</article-title>. <source>IEEE Trans. Instrum. Meas.</source> <volume>71</volume>, <fpage>1</fpage>&#x2013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.1109/tim.2022.3204985</pub-id>
</citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Improved YOLOv3 foreign body detection method in transmission lines</article-title>. <source>Laser J.</source> <volume>43</volume> (<issue>05</issue>), <fpage>82</fpage>&#x2013;<lpage>87</lpage>. <pub-id pub-id-type="doi">10.1109/ACCESS.2022.3170696</pub-id>
</citation>
</ref>
</ref-list>
</back>
</article>