<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="review-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Energy Res.</journal-id>
<journal-title>Frontiers in Energy Research</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Energy Res.</abbrev-journal-title>
<issn pub-type="epub">2296-598X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">742993</article-id>
<article-id pub-id-type="doi">10.3389/fenrg.2021.742993</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Energy Research</subject>
<subj-group>
<subject>Methods</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Customer Load Forecasting Method Based on the Industry Electricity Consumption Behavior Portrait</article-title>
<alt-title alt-title-type="left-running-head">Guan et&#x20;al.</alt-title>
<alt-title alt-title-type="right-running-head">Customer Load Potrait</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Guan</surname>
<given-names>Weiling</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Zhang</surname>
<given-names>Daolu</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Yu</surname>
<given-names>Huang</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Peng</surname>
<given-names>Binggang</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Wu</surname>
<given-names>Yufeng</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Yu</surname>
<given-names>Tao</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1266386/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Wang</surname>
<given-names>Keying</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
</contrib-group>
<aff id="aff1">
<label>
<sup>1</sup>
</label>School of Electric Power Engineering South China University of Technology, <addr-line>Guangzhou</addr-line>, <country>China</country>
</aff>
<aff id="aff2">
<label>
<sup>2</sup>
</label>Guangdong Provincial Key Laboratory of Intelligent Measurement and Advanced Metering of Power Grid, <addr-line>Guangzhou</addr-line>, <country>China</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1222566/overview">Xiaoshun Zhang</ext-link>, Shantou University, China</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1415833/overview">Guiyuan Zhang</ext-link>, Shantou University, China</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1416022/overview">Da Xu</ext-link>, China University of Geosciences Wuhan, China</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Tao Yu, <email>taoyu1@scut.edu.cn</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Smart Grids, a section of the journal Frontiers in Energy Research</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>12</day>
<month>10</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>9</volume>
<elocation-id>742993</elocation-id>
<history>
<date date-type="received">
<day>17</day>
<month>07</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>04</day>
<month>08</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2021 Guan, Zhang, Yu, Peng, Wu, Yu and Wang.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Guan, Zhang, Yu, Peng, Wu, Yu and Wang</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these&#x20;terms.</p>
</license>
</permissions>
<abstract>
<p>With the dramatic increase of energy demand and the continuous increase of power system operation pressure, higher requirements are put forward for the development of power grid planning and optimization operation. It is important for the refinement of distribution network planning to deeply extract the characteristics of user load. First, the process of load characteristic analysis method from the user level to the industry level is proposed, which achieves the division of electricity consumption patterns of various industries, thus building a panoramic portrait of industry electricity consumption behavior. Then, by expanding the information filled in by traditional customers, the feature vector of each user is extracted, and the users&#x27; industry electricity consumption patterns are used as the label. Therefore, a method for identifying the electricity consumption pattern of the customer based on the BB-stacking model fusion framework is proposed, which yields the preliminary forecast results of customer load based on the actual load accounting results of the customers. Finally, comparative simulations with different methods verify the effectiveness of the proposed algorithm, which can provide prominent guidance for the actual distribution network planning&#x20;work.</p>
</abstract>
<kwd-group>
<kwd>industry electricity consumption behavior portrait</kwd>
<kwd>cluster analysis</kwd>
<kwd>ensemble learning framework</kwd>
<kwd>multidimensional electricity consumption characteristics</kwd>
<kwd>customer load forecasting</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<sec id="s1-1">
<title>Development of Customer Portrait</title>
<p>More recently, with the gradual transformation of enterprises from product oriented to user oriented in the production, it is significant for formulating marketing strategies and product design to fully understand customers and their needs (<xref ref-type="bibr" rid="B23">Wu et&#x20;al., 2020</xref>). At present, in-depth mining of user data and research on accurate user portrait are gradually deepening. Accurate user portrait technology has been widely used in the Internet, finance, retail, operators, advertising, and other industries (<xref ref-type="bibr" rid="B12">Liu and Du, 2020</xref>; <xref ref-type="bibr" rid="B34">Zhang et&#x20;al., 2020</xref>). In terms of advertising, Google uses big data to locate high-value users, which helps mobile e-commerce app to locate high-value users and push advertisements and adopts flexible data tracking methods for users of different tag categories, so that brands can achieve accurate advertising push for target users (<xref ref-type="bibr" rid="B20">Shan, 2018</xref>). In terms of auxiliary business decision-making, eBay carries out business circle customer group analysis, product marketability analysis, store operation analysis, personalized consumption analysis, and customer loss analysis by integrating online and offline massive data, providing decision support for commercial real estate and comprehensive big data analysis and prediction for project parties and brands (<xref ref-type="bibr" rid="B2">Chen et&#x20;al., 2021</xref>). Obviously, user portrait technology has become an effective method to improve customer service quality and customer experience, which is an important basis for integrating high-quality resources and realizing enterprise and user value (<xref ref-type="bibr" rid="B16">Pitner et&#x20;al., 2012</xref>; <xref ref-type="bibr" rid="B33">Yu et&#x20;al., 2017</xref>).</p>
<p>In the field of electric power research, it has become a rising research direction to build user portraits in different application scenarios according to the actual needs (<xref ref-type="bibr" rid="B17">Qiu et&#x20;al., 2017</xref>). In the early stage of the study, traditional user portraits were mainly used in the marketing portal to build user electricity sensitivity or credit portraits to guide the electricity recovery work (<xref ref-type="bibr" rid="B19">Sanchez et&#x20;al., 2008</xref>; <xref ref-type="bibr" rid="B5">Han et&#x20;al., 2014</xref>; <xref ref-type="bibr" rid="B1">Ampimah et&#x20;al., 2017</xref>). To meet the requirements of demand response, the methods of establishing electricity consumption behavior tag library and realizing the portrait of different types of users&#x27; electricity consumption behavior patterns were proposed in (<xref ref-type="bibr" rid="B17">Qiu et&#x20;al., 2017</xref>) and (<xref ref-type="bibr" rid="B38">Zhong et&#x20;al., 2018</xref>). With the gradual popularization of new energy (<xref ref-type="bibr" rid="B25">Yang et&#x20;al., 2016</xref>; <xref ref-type="bibr" rid="B30">Yang et&#x20;al., 2017</xref>; <xref ref-type="bibr" rid="B28">Yang et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B31">Yang et&#x20;al., 2019a</xref>; <xref ref-type="bibr" rid="B29">Yang et&#x20;al., 2019b</xref>; <xref ref-type="bibr" rid="B27">Yang et&#x20;al., 2020</xref>), the load change and influence mechanism on the user side are becoming more and more complex. Therefore, the user portrait method is becoming more and more important.</p>
<p>The above pieces of literature are all portraits of the users who have been connected to the power grid, so as to formulate appropriate strategies to guide the corresponding users to change their electricity consumption behavior. However, there is rare research on the reported customers who are not connected to the power grid. In the distribution network planning work, reference (<xref ref-type="bibr" rid="B10">Lian et&#x20;al., 2014</xref>) makes use of the complementarity between user loads to optimize the access decision reasonably, which can effectively improve the load distribution and utilization rate of power supply equipment. Moreover, the multidimensional analysis of user load can also provide effective guidance for the optimal scheduling and control of smart grid (<xref ref-type="bibr" rid="B26">Yang et&#x20;al., 2015</xref>; <xref ref-type="bibr" rid="B37">Zhang et&#x20;al., 2015</xref>; <xref ref-type="bibr" rid="B24">Xi et&#x20;al., 2016</xref>; <xref ref-type="bibr" rid="B36">Zhang et&#x20;al., 2016</xref>; <xref ref-type="bibr" rid="B35">Zhang et&#x20;al., 2021</xref>). However, due to the lack of research on the load of customers who are not connected, it is limited to transfer the user load of the existing distribution network. For the more common scenario of business expansion, the data information support and optimization ability are insufficient.</p>
</sec>
<sec id="s1-2">
<title>Research and Contribution of the Paper</title>
<p>In view of the above research results, the contributions of this paper can be summarized as follows:<list list-type="simple">
<list-item>
<p>1) Unlike the traditional single user portrait, this paper digs the power consumption behavior law of massive users, which constructs a more popularized and applied power consumption behavior portrait for industry.</p>
</list-item>
<list-item>
<p>2) A load forecasting method is presented, which is different from the general sense of load forecasting. Aiming at the new customers who are not connected with electricity, this paper extracts the time sequence characteristics of customer load through limited data information, which provides an effective reference for the actual distribution network planning.</p>
</list-item>
<list-item>
<p>3) The software integrating the above functions is developed and has been applied in engineering, which provides an effective tool for the decision-making of distribution network planners.</p>
</list-item>
</list>
</p>
</sec>
</sec>
<sec id="s2">
<title>Portrait of Industry Electricity Consumption Behavior</title>
<p>In this paper, a load characteristic analysis method from user level to industry level is proposed. First, the massive users of various industries in the region are systematically analyzed and integrated. Therefore, a comprehensive and practical industry electricity consumption behavior portrait in the region is profiled. The key steps are shown in <xref ref-type="fig" rid="F1">Figure&#x20;1</xref>.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Construction process of industry electricity consumption behavior portrait.</p>
</caption>
<graphic xlink:href="fenrg-09-742993-g001.tif"/>
</fig>
<sec id="s2-1">
<title>Load Data Preprocessing</title>
<p>For continuous load data, Lagrange interpolation (<xref ref-type="bibr" rid="B3">Criscuolo et&#x20;al., 1984</xref>) is used to fill the vacancy and bad data, which can play a good repair effect. For the data points that need to be filled, <inline-formula id="inf1">
<mml:math id="m1">
<mml:mi>m</mml:mi>
</mml:math>
</inline-formula> normal data points adjacent to the point are selected as samples to calculate the Lagrange interpolation formula:<disp-formula id="e1">
<mml:math id="m2">
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:munderover>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>m</mml:mi>
</mml:munderover>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mstyle displaystyle="true">
<mml:mo>&#x220f;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo>&#x2260;</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
<mml:mi>m</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:msubsup>
<mml:mstyle displaystyle="true">
<mml:mo>&#x220f;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo>&#x2260;</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
<mml:mi>m</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(1)</label>
</disp-formula>where <inline-formula id="inf2">
<mml:math id="m3">
<mml:mrow>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf3">
<mml:math id="m4">
<mml:mrow>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> are the sampling time of the <inline-formula id="inf4">
<mml:math id="m5">
<mml:mi>i</mml:mi>
</mml:math>
</inline-formula> and <inline-formula id="inf5">
<mml:math id="m6">
<mml:mi>j</mml:mi>
</mml:math>
</inline-formula> data points in a day, <inline-formula id="inf6">
<mml:math id="m7">
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> means the load value of the <inline-formula id="inf7">
<mml:math id="m8">
<mml:mi>i</mml:mi>
</mml:math>
</inline-formula> data point, and <inline-formula id="inf8">
<mml:math id="m9">
<mml:mi>m</mml:mi>
</mml:math>
</inline-formula> is the number of sample points used to construct the polynomial. Finally, the approximate value of the missing value can be obtained by substituting the missing time <inline-formula id="inf9">
<mml:math id="m10">
<mml:mi>t</mml:mi>
</mml:math>
</inline-formula> into&#x20;<inline-formula id="inf10">
<mml:math id="m11">
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
<p>To avoid the influence of the actual load value in the follow-up analysis process, the linear proportion normalization method is used to normalize the load data after repair, which can be expressed as<disp-formula id="e2">
<mml:math id="m12">
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(2)</label>
</disp-formula>where <inline-formula id="inf11">
<mml:math id="m13">
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> represents the <inline-formula id="inf12">
<mml:math id="m14">
<mml:mi>i</mml:mi>
</mml:math>
</inline-formula>-th load data point after normalization and <inline-formula id="inf13">
<mml:math id="m15">
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the peak load of the&#x20;user.</p>
</sec>
<sec id="s2-2">
<title>Load Analysis of User Level</title>
<sec id="s2-2-1">
<title>User Load Curve Analysis</title>
<p>The user load curve has strong randomness, so it is difficult to ensure that the load curve of the day is typical for the user. In this paper, the time series mining method in (<xref ref-type="bibr" rid="B11">Lin et&#x20;al., 2017</xref>) is used to extract the typical daily load curve of the user in a period of time <inline-formula id="inf14">
<mml:math id="m16">
<mml:mi>T</mml:mi>
</mml:math>
</inline-formula>; the specific process is as follows:<list list-type="simple">
<list-item>
<p>1) Dimensionality reduction of load data: let <inline-formula id="inf15">
<mml:math id="m18">
<mml:mi>w</mml:mi>
</mml:math>
</inline-formula> be the number of segments of daily load data processed by PAA method, then the daily load time series after dimensionality reduction is <inline-formula id="inf16">
<mml:math id="m19">
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>X</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mo>&#x22ef;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>w</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, and the calculation formula is as follows:</p>
</list-item>
</list>
<disp-formula id="e3">
<mml:math id="m20">
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mi>w</mml:mi>
<mml:mi>n</mml:mi>
</mml:mfrac>
<mml:munderover>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mi>n</mml:mi>
<mml:mi>w</mml:mi>
</mml:mfrac>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mfrac>
<mml:mi>n</mml:mi>
<mml:mi>w</mml:mi>
</mml:mfrac>
</mml:mrow>
</mml:munderover>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(3)</label>
</disp-formula>where <inline-formula id="inf17">
<mml:math id="m21">
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> represents the mean value of the time <inline-formula id="inf18">
<mml:math id="m22">
<mml:mi>i</mml:mi>
</mml:math>
</inline-formula> series segment after dimension reduction.<list list-type="simple">
<list-item>
<p>2) Symbolic representation: the SAX method (<xref ref-type="bibr" rid="B14">Notaristefano et&#x20;al., 2013</xref>) is used to assign symbolic values to each segment of the daily load time series after dimension reduction according to the average value of each segment and then select the optimal value <inline-formula id="inf19">
<mml:math id="m23">
<mml:mi>&#x3b1;</mml:mi>
</mml:math>
</inline-formula> as the size of symbol set; the discretized symbol sequence can be expressed as <inline-formula id="inf20">
<mml:math id="m24">
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>Y</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mo>&#x22ef;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>w</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
</list-item>
<list-item>
<p>3) Frequency of statistical symbol sequence: after the symbolic representation of daily load data is completed, the frequency of various symbol sequences is counted. Then, the most common load curve forms are screened out and the abnormal load is eliminated. Suppose the most symbol sequence appears is&#x20;<inline-formula id="inf21">
<mml:math id="m25">
<mml:mi>s</mml:mi>
</mml:math>
</inline-formula>.</p>
</list-item>
<list-item>
<p>4) Extract typical daily load curve: stack the daily load curve corresponding to the symbol sequence with the highest frequency, and calculate the mean value to obtain the final typical load curve <inline-formula id="inf22">
<mml:math id="m26">
<mml:mrow>
<mml:msup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>d</mml:mi>
</mml:msup>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi>l</mml:mi>
<mml:mn>1</mml:mn>
<mml:mi>d</mml:mi>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:mo>&#x22ef;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mi>l</mml:mi>
<mml:mi>n</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, which can be computed&#x20;by</p>
</list-item>
</list>
<disp-formula id="e4">
<mml:math id="m27">
<mml:mrow>
<mml:msubsup>
<mml:mi>l</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>s</mml:mi>
</mml:mfrac>
<mml:munderover>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>s</mml:mi>
</mml:munderover>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(4)</label>
</disp-formula>where <inline-formula id="inf23">
<mml:math id="m28">
<mml:mrow>
<mml:msubsup>
<mml:mi>l</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> stands for the <inline-formula id="inf24">
<mml:math id="m29">
<mml:mi>i</mml:mi>
</mml:math>
</inline-formula>-th data point of the typical daily load curve <inline-formula id="inf25">
<mml:math id="m30">
<mml:mrow>
<mml:msup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>d</mml:mi>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf26">
<mml:math id="m31">
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> is the <inline-formula id="inf27">
<mml:math id="m32">
<mml:mi>i</mml:mi>
</mml:math>
</inline-formula>-th data point of the <inline-formula id="inf28">
<mml:math id="m33">
<mml:mi>r</mml:mi>
</mml:math>
</inline-formula>-th sample daily load&#x20;curve.</p>
</sec>
<sec id="s2-2-2">
<title>User Load Characteristics Analysis</title>
<p>The user load curve can directly describe the change trend of user load in a day, and the load characteristics can more comprehensively characterize the user&#x27;s electricity behavior from multiple perspectives. In this paper, user load characteristics are selected and calculated from three aspects, i.e.,&#x20;user load level, typical peak-valley characteristics, and load development law. The names and expression of each index are shown in <xref ref-type="table" rid="T1">Table&#x20;1</xref>.</p>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Selected user load characteristics.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Type</th>
<th align="center">User characteristic</th>
<th align="center">Expression</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">Load level</td>
<td align="left">Contract capacity</td>
<td align="left">
<inline-formula id="inf29">
<mml:math id="m34">
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>c</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
</tr>
<tr>
<td align="left"/>
<td align="left">Final annual maximum load</td>
<td align="left">
<inline-formula id="inf30">
<mml:math id="m35">
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>f</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
</tr>
<tr>
<td align="left">Peak-valley characteristics</td>
<td align="left">Daily load rate</td>
<td align="left">
<inline-formula id="inf31">
<mml:math id="m36">
<mml:mrow>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>a</mml:mi>
<mml:mi>v</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>/</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
</tr>
<tr>
<td align="left"/>
<td align="left">Daily minimum load rate</td>
<td align="left">
<inline-formula id="inf32">
<mml:math id="m37">
<mml:mrow>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>/</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
</tr>
<tr>
<td align="left"/>
<td align="left">Peak-valley difference rate</td>
<td align="left">
<inline-formula id="inf33">
<mml:math id="m38">
<mml:mrow>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>3</mml:mn>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>/</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
</tr>
<tr>
<td align="left"/>
<td align="left">Peak load rate</td>
<td align="left">
<inline-formula id="inf34">
<mml:math id="m39">
<mml:mrow>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>4</mml:mn>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>a</mml:mi>
<mml:mi>v</mml:mi>
<mml:mo>.</mml:mo>
<mml:mi>p</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>/</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>a</mml:mi>
<mml:mi>v</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
</tr>
<tr>
<td align="left"/>
<td align="left">Valley load rate</td>
<td align="left">
<inline-formula id="inf35">
<mml:math id="m40">
<mml:mrow>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>5</mml:mn>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>a</mml:mi>
<mml:mi>v</mml:mi>
<mml:mo>.</mml:mo>
<mml:mi>v</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>/</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>a</mml:mi>
<mml:mi>v</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
</tr>
<tr>
<td align="left"/>
<td align="left">Flat load rate</td>
<td align="left">
<inline-formula id="inf36">
<mml:math id="m41">
<mml:mrow>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>6</mml:mn>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>a</mml:mi>
<mml:mi>v</mml:mi>
<mml:mo>.</mml:mo>
<mml:mi>s</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>/</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>a</mml:mi>
<mml:mi>v</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
</tr>
<tr>
<td align="left">Long-term load development law</td>
<td align="left">Saturated load density</td>
<td align="left">
<inline-formula id="inf37">
<mml:math id="m42">
<mml:mrow>
<mml:mi>D</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>f</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>/</mml:mo>
<mml:mi>S</mml:mi>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
</tr>
<tr>
<td align="left"/>
<td align="left">Utility rate</td>
<td align="left">
<inline-formula id="inf38">
<mml:math id="m43">
<mml:mrow>
<mml:mi mathvariant="italic">&#x3f5;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>f</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>/</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>c</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>In <xref ref-type="table" rid="T1">Table&#x20;1</xref>, the load level indicators represent the volume of user power consumption, which is the key information in the distribution network planning. Indicators of typical peak-valley characteristics can reflect the changing trend of user load in a day with less information granularity. The index of long-term load development reflects the rule that a user&#x27;s load gradually increases to saturation after access to electricity and the relationship between saturated load and its area occupied. It is an important basis for reasonable estimation of the user&#x27;s&#x20;load.</p>
</sec>
</sec>
<sec id="s2-3">
<title>Preliminary Classification of Users Based on Industry Classification Standards</title>
<p>To serve distribution network planning, the results of single user load characteristic analysis lack universality. Thus, it is necessary to explore the general law of massive users&#x2019; load changes.</p>
<p>Generally, the users are classified into various industries according to the National Economic Industry Classification national standard, forming a user group in each industry, which has similarity in the long-term development law of load. In this way, the summary of the law and subsequent application is facilitated. After the preliminary classification of users is completed, due to the different characteristics of user peaks and valleys, it is necessary to further carry out industry power mode analysis and feature extraction.</p>
</sec>
<sec id="s2-4">
<title>Load Analysis of Industry Level</title>
<sec id="s2-4-1">
<title>Analysis of Industry Electricity Consumption Pattern</title>
<p>In the same industry, each user&#x27;s electricity consumption behavior is still different. The gray wolf optimized fuzzy mean clustering algorithm (GWO-FCM) with high clustering accuracy and fast computing speed in (<xref ref-type="bibr" rid="B4">Gao et&#x20;al., 2019</xref>) is used to further subdivide the user groups in the industry. The specific process is as follows:<list list-type="simple">
<list-item>
<p>1) Setting parameters: set the minimum number of clusters as <inline-formula id="inf39">
<mml:math id="m44">
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and the maximum number of clusters as <inline-formula id="inf40">
<mml:math id="m45">
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and parameters of GWO-FCM. The number of first clusters is set as&#x20;<inline-formula id="inf41">
<mml:math id="m46">
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
</list-item>
<list-item>
<p>2) Implement clustering algorithm: suppose that there are <inline-formula id="inf42">
<mml:math id="m47">
<mml:mi>N</mml:mi>
</mml:math>
</inline-formula> users in the industry<inline-formula id="inf43">
<mml:math id="m48">
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>; <inline-formula id="inf44">
<mml:math id="m49">
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> denotes the typical daily load curve of user <inline-formula id="inf45">
<mml:math id="m50">
<mml:mi>k</mml:mi>
</mml:math>
</inline-formula>. GWO-FCM algorithm is used to cluster the typical daily load curves of all users in the industry.</p>
</list-item>
<list-item>
<p>3) Evaluate the clustering effect: use silhouette coefficient (SC) to evaluate the compactness and separability of clusters. <inline-formula id="inf46">
<mml:math id="m51">
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mi>C</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> represents the SC of <inline-formula id="inf47">
<mml:math id="m52">
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>, which can be computed&#x20;by</p>
</list-item>
</list>
<disp-formula id="e5">
<mml:math id="m53">
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mi>C</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>a</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>b</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>a</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>b</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(5)</label>
</disp-formula>
<disp-formula id="e6">
<mml:math id="m54">
<mml:mrow>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>a</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mi>k</mml:mi>
<mml:mo>&#x2260;</mml:mo>
<mml:mi>w</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi>d</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>w</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x7c;</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
<mml:mo>&#x7c;</mml:mo>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(6)</label>
</disp-formula>
<disp-formula id="e7">
<mml:math id="m55">
<mml:mrow>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>b</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:munder>
<mml:mrow>
<mml:mi>min</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2264;</mml:mo>
<mml:mi>v</mml:mi>
<mml:mo>&#x2264;</mml:mo>
<mml:mi>c</mml:mi>
</mml:mrow>
</mml:munder>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>v</mml:mi>
</mml:msub>
</mml:mrow>
</mml:msub>
<mml:mi>d</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>w</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x7c;</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>v</mml:mi>
</mml:msub>
<mml:mo>&#x7c;</mml:mo>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>}</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(7)</label>
</disp-formula>where <inline-formula id="inf48">
<mml:math id="m56">
<mml:mrow>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>a</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> is the average distance between <inline-formula id="inf49">
<mml:math id="m57">
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> and other samples in <inline-formula id="inf50">
<mml:math id="m58">
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> cluster; <inline-formula id="inf51">
<mml:math id="m59">
<mml:mrow>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>b</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> is the minimum average distance between <inline-formula id="inf52">
<mml:math id="m60">
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> and the samples not in <inline-formula id="inf53">
<mml:math id="m61">
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. The closer <inline-formula id="inf54">
<mml:math id="m63">
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mi>C</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> is to 1, the better the compactness and separability of samples are and the better the clustering effect&#x20;is.</p>
<p>The mean value of the SC of all samples is calculated to evaluate the effectiveness of the clustering, which can be computed by<disp-formula id="e8">
<mml:math id="m64">
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mi>C</mml:mi>
<mml:mi>M</mml:mi>
<mml:mi>E</mml:mi>
<mml:mi>A</mml:mi>
<mml:mi>N</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>N</mml:mi>
</mml:mfrac>
<mml:munderover>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:munderover>
<mml:mi>S</mml:mi>
<mml:mi>C</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>d</mml:mi>
</mml:msubsup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:math>
<label>(8)</label>
</disp-formula>
<list list-type="simple">
<list-item>
<p>4) Traverse the optional range of cluster number: if the current cluster number <inline-formula id="inf55">
<mml:math id="m65">
<mml:mi>c</mml:mi>
</mml:math>
</inline-formula> is greater than <inline-formula id="inf56">
<mml:math id="m66">
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, go to step 5). Otherwise, if <inline-formula id="inf57">
<mml:math id="m67">
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>c</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, go to step&#x20;2).</p>
</list-item>
<list-item>
<p>5) The number of clusters with the maximum mean value of the contour coefficient is selected as the optimal number of clusters <inline-formula id="inf58">
<mml:math id="m68">
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mrow>
<mml:mi>b</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>s</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. After the clustering centers are normalized, the typical curve of the power consumption pattern of the industry is obtained. The typical curve of industry electricity consumption pattern <inline-formula id="inf59">
<mml:math id="m69">
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> can be expressed as <inline-formula id="inf60">
<mml:math id="m70">
<mml:mrow>
<mml:msubsup>
<mml:mi>L</mml:mi>
<mml:mi>u</mml:mi>
<mml:mi>c</mml:mi>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi>l</mml:mi>
<mml:mrow>
<mml:mi>u</mml:mi>
<mml:mo>,</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>c</mml:mi>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:mo>&#x22ef;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mi>l</mml:mi>
<mml:mrow>
<mml:mi>u</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>n</mml:mi>
</mml:mrow>
<mml:mi>c</mml:mi>
</mml:msubsup>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
</list-item>
</list>
</p>
</sec>
<sec id="s2-4-2">
<title>Characteristic Extraction of Industry Electricity Consumption</title>
<p>After completing the division of industry electricity consumption pattern, it is necessary to extract the corresponding characteristics of each electricity consumption pattern in the industry. Considering the nature of the characteristics, it is not significant to summarize and extract the load level and typical peak-valley characteristics at the industry level. Considering the actual needs of the power grid, it is of great significance to extract the long-term load development law of various electricity consumption patterns in the industry.</p>
<p>Nonparametric kernel density estimation (<xref ref-type="bibr" rid="B7">Lambert et&#x20;al., 1999</xref>) is a data sample-driven method, which can fit the probability distribution of features without prior knowledge. In this paper, a nonparametric kernel density estimation method is used to fit the probability density of long-term load development characteristics, and then typical characteristics are extracted.</p>
<p>Taking the saturated load density as an example, suppose that there are <inline-formula id="inf61">
<mml:math id="m71">
<mml:mrow>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> users belonging to industry electricity consumption pattern <inline-formula id="inf62">
<mml:math id="m72">
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, where the saturated load density of user <inline-formula id="inf63">
<mml:math id="m73">
<mml:mi>k</mml:mi>
</mml:math>
</inline-formula> is <inline-formula id="inf64">
<mml:math id="m74">
<mml:mrow>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>; then the probability density function <inline-formula id="inf65">
<mml:math id="m76">
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>D</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> of <inline-formula id="inf66">
<mml:math id="m77">
<mml:mi>D</mml:mi>
</mml:math>
</inline-formula> can be computed by<disp-formula id="e9">
<mml:math id="m78">
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>D</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:munderover>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:mrow>
</mml:munderover>
<mml:mi>K</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi>D</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mi>h</mml:mi>
</mml:mfrac>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(9)</label>
</disp-formula>where <inline-formula id="inf67">
<mml:math id="m79">
<mml:mi>k</mml:mi>
</mml:math>
</inline-formula> means the kernel function; <inline-formula id="inf68">
<mml:math id="m80">
<mml:mi>h</mml:mi>
</mml:math>
</inline-formula> denotes the bandwidth.</p>
<p>To ensure the continuity of the probability density function, the kernel function needs to be a smooth probability density function. Generally, Gaussian kernel function is often selected, which can be expressed as follows:<disp-formula id="e10">
<mml:math id="m81">
<mml:mrow>
<mml:mi>K</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:msqrt>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mi>&#x3c0;</mml:mi>
</mml:mrow>
</mml:msqrt>
</mml:mrow>
</mml:mfrac>
<mml:msup>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mi>x</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
<mml:mo>/</mml:mo>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mrow>
</mml:msup>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:math>
<label>(10)</label>
</disp-formula>
</p>
<p>The point with the largest value of the probability density fitting function <inline-formula id="inf69">
<mml:math id="m82">
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>D</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> is selected as the typical characteristic value of the electricity consumption pattern <inline-formula id="inf70">
<mml:math id="m83">
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, which can be computed by<disp-formula id="e11">
<mml:math id="m84">
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:munder>
<mml:mrow>
<mml:mtext>argmax</mml:mtext>
</mml:mrow>
<mml:mi>D</mml:mi>
</mml:munder>
<mml:mi>f</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>D</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(11)</label>
</disp-formula>where <inline-formula id="inf71">
<mml:math id="m85">
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> stands for the typical saturated load density value of the electricity consumption pattern&#x20;<inline-formula id="inf72">
<mml:math id="m86">
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
</sec>
</sec>
<sec id="s2-5">
<title>Construction of Industry Electricity Consumption Behavior Portrait</title>
<p>After the load analysis from the user level to the industry level, the load curves and related characteristics of the power users and industry modes are obtained, respectively, at the user level and the industry level, which is the description of the electricity consumption behavior of the industry from different angles and at different levels.</p>
<p>To enable the distribution network planners to grasp the electricity consumption of various industries intuitively, massive analysis results are sorted and visualized. Then, a panoramic portrait of industry electricity consumption behavior in a region is built. The form and content of the portrait are shown in <xref ref-type="fig" rid="F2">Figure&#x20;2</xref>.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Description process of industry electricity consumption behavior.</p>
</caption>
<graphic xlink:href="fenrg-09-742993-g002.tif"/>
</fig>
</sec>
</sec>
<sec id="s3">
<title>Pattern Classification of New Customers&#x27; Electricity Consumption</title>
<p>In the traditional business expansion and installation, for the customers who have not been connected to the power grid, the planners are supposed to calculate the customer&#x27;s load based on the customer&#x27;s reported capacity and business experience, working out the business expansion and access scheme. The processing flow is extensive and lacks consideration for user load time sequence characteristics.</p>
<p>After the construction of a panoramic portrait of industry electricity consumption behavior in the region, if we can reasonably infer the electricity consumption pattern according to the relevant power consumption information provided by customers, it can provide more effective technical support for business decision-making of planners.</p>
<p>The basic flow of the electricity consumption pattern classification in this paper is shown in <xref ref-type="fig" rid="F3">Figure&#x20;3</xref>. The training model obtains the electricity pattern classification ability by using the users&#x2019; characteristics as the training samples. After training the model, the corresponding characteristic vectors are extracted from the electricity consumption information of the customers and input into the classification model to obtain the electricity consumption pattern classification results of the customers.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Basic process of electricity consumption pattern classification for customers.</p>
</caption>
<graphic xlink:href="fenrg-09-742993-g003.tif"/>
</fig>
<sec id="s3-1">
<title>Customer Classification Algorithm Based on BB-Stacking Model Fusion Framework</title>
<p>In the field of the classification problem, ensemble learning (EL) has gained the attention of a huge number of scholars, because it can make up for each single model&#x27;s advantages (<xref ref-type="bibr" rid="B22">Wang et&#x20;al., 2015</xref>). Among them, bagging (<xref ref-type="bibr" rid="B6">Hu et&#x20;al., 2011</xref>) and boosting (<xref ref-type="bibr" rid="B13">Lu et&#x20;al., 2006</xref>) are the most classic and widely used integration methods, which have their own characteristics in model generalization and model accuracy. Specifically, bagging builds multiple datasets in bootstrap way and trains multiple single models in parallel, integrating the output by Voting. The advantages of each model are integrated, preventing the overfitting phenomenon effectively. Boosting framework belongs to the mode of serial integration of models. The output of each model is used as the input of the lower model, and the minimum deviation is used as the loss function to continuously improve the accuracy of each model. Generally, the characteristics of the two integration methods are shown in <xref ref-type="table" rid="T2">Table&#x20;2</xref>.</p>
<table-wrap id="T2" position="float">
<label>TABLE 2</label>
<caption>
<p>Characteristics and typical algorithms of bagging and boosting.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Integration mode</th>
<th align="center">Training mode</th>
<th align="center">Generalization performance</th>
<th align="center">Accuracy performance</th>
<th align="center">Typical algorithm</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">Bagging</td>
<td align="left">Parallel</td>
<td align="left">Strong</td>
<td align="left">Weak</td>
<td align="left">RF</td>
</tr>
<tr>
<td align="left">Boosting</td>
<td align="left">Serial</td>
<td align="left">Weak</td>
<td align="left">Strong</td>
<td align="left">XGBoost</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>To give full play to the advantages of the two integration methods, a model fusion framework of BB-stacking (bagging and boosting in stacking) based on the stacking method is proposed. It combines the two integration methods reasonably to realize the organic trade-off between high-precision and strong generalization performance, yielding the high-precision classification of customers.</p>
<p>Traditional stacking is composed of the base model in the lower layer and the metamodel in the upper layer. Firstly, the initial data is divided into subdatasets by k-fold, and then the subdatasets are input into the base model for training and classification. Then the output of the lower layer is reconstructed and sent to the upper model for training and classification. The BB-stacking model fusion algorithm proposed in this paper changes the lower base model layer into the integration layer on the basis of the two-tier structure of stacking, in which bagging and boosting methods are used to preliminarily integrate the original model and then transferred to the upper metamodel to coordinate the advantages of the two integration modes, obtaining a model with higher accuracy and more stable generalization ability.</p>
</sec>
<sec id="s3-2">
<title>Expansion of New Customers&#x27; Electricity Consumption Information</title>
<p>In the traditional business process, customers provide limited information in the information filling process. To make the pattern classification of electricity consumption more accurate, this paper proposes an appropriate expansion of the electricity consumption information reported by the customers. In addition to the traditional information of electricity consumption type, electricity consumption location, and electricity consumption capacity, the peak electricity consumption time schedule, peak electricity consumption level estimation, and valley electricity consumption level estimation are added as the customer&#x27;s electricity consumption information. <xref ref-type="table" rid="T3">Table&#x20;3</xref> shows the demonstration of extended power consumption information of a customer.</p>
<table-wrap id="T3" position="float">
<label>TABLE 3</label>
<caption>
<p>Extended electricity consumption information of a customer.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Peak electricity consumption schedule</th>
<th align="center">Peak electricity consumption level estimation (kW)</th>
<th align="center">Valley electricity consumption level estimation (kW)</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">8:00&#x2013;12:00</td>
<td rowspan="2" align="center">1,000</td>
<td rowspan="2" align="center">50</td>
</tr>
<tr>
<td align="left">14:00&#x2013;18:00</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>According to the example of extended electricity consumption information of the reported customer in <xref ref-type="table" rid="T3">Table&#x20;3</xref>, the reference load curve of the reported customer is obtained by time simulation. According to the definition of the peak-valley characteristic index in <xref ref-type="table" rid="T1">Table&#x20;1</xref>, the peak-valley characteristic vector of the customer can be calculated as the input of the electricity consumption pattern classification&#x20;model.</p>
</sec>
</sec>
<sec id="s4">
<title>Customer Load Forecasting</title>
<sec id="s4-1">
<title>Calculation of Customer Load Level</title>
<p>The next step is to calculate the load level of the customers. Based on the regional industry electricity consumption behavior portrait, the corresponding method is adopted to calculate the customers&#x2019; load according to their type of electricity consumption (<xref ref-type="bibr" rid="B32">Yaoyao et&#x20;al., 2013</xref>). The specific calculation method is as follows:<list list-type="simple">
<list-item>
<p>1) Commercial, nonindustrial, and residential customers</p>
</list-item>
</list>
</p>
<p>It is recommended to use the load density method to calculate the customers&#x27; load because the load level of these customers is closely related to their building area. Assuming that the classification result of the customer is industry electricity consumption pattern <inline-formula id="inf73">
<mml:math id="m87">
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, the customer accounting load can be computed by<disp-formula id="e12">
<mml:math id="m88">
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mi>s</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>&#x22c5;</mml:mo>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>b</mml:mi>
<mml:mi>z</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(12)</label>
</disp-formula>where <inline-formula id="inf74">
<mml:math id="m89">
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>b</mml:mi>
<mml:mi>z</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> means the building area of the customer.<list list-type="simple">
<list-item>
<p>2) Industrial customers</p>
</list-item>
</list>
</p>
<p>Generally, the load level of industrial customers often depends on the capacity of production equipment and has weak correlation with the building area. It is recommended to use the utility rate method to calculate the customer load. Similarly, assuming that the classification result of the customer is industry electricity consumption pattern <inline-formula id="inf75">
<mml:math id="m90">
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, the customer accounting load can be computed by<disp-formula id="e13">
<mml:math id="m91">
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mi>s</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="italic">&#x3f5;</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>&#x22c5;</mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>b</mml:mi>
<mml:mi>z</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(13)</label>
</disp-formula>where <inline-formula id="inf76">
<mml:math id="m92">
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="italic">&#x3f5;</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> stands for the typical utility rate value of the electricity consumption pattern <inline-formula id="inf77">
<mml:math id="m93">
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>; <inline-formula id="inf78">
<mml:math id="m94">
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>b</mml:mi>
<mml:mi>z</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> means the building area of the customer.</p>
</sec>
<sec id="s4-2">
<title>Forecasting of Customer Load Curve</title>
<p>After the customer load level estimation is completed, the load curve can be forecasted according to the electricity consumption pattern classification results of the customer in the industry. The forecasting load curve of the customer belonging to electricity consumption pattern <inline-formula id="inf79">
<mml:math id="m95">
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is <inline-formula id="inf80">
<mml:math id="m96">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mi>g</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, and the calculation formula is as follows:<disp-formula id="e14">
<mml:math id="m97">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mi>g</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>u</mml:mi>
<mml:mi>c</mml:mi>
</mml:msubsup>
<mml:mo>&#x22c5;</mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mi>s</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(14)</label>
</disp-formula>where <inline-formula id="inf81">
<mml:math id="m98">
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mi>u</mml:mi>
<mml:mi>c</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> is the typical load curve of industry electricity consumption pattern&#x20;<inline-formula id="inf82">
<mml:math id="m99">
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
</sec>
<sec id="s4-3">
<title>Evaluation Index of Customer Load Forecasting Results</title>
<p>The customer load forecasting technology is based on the industry electricity consumption behavior portrait of the region. Through the limited electricity consumption information, customer load after access is estimated. To provide guidance for planners to make customer access decisions, the accuracy of the expected results needs to be evaluated. Assuming that the actual load curve of the customer is <inline-formula id="inf83">
<mml:math id="m100">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, maximum absolute error ratio (MAER) and Euclidean distance (ED) are used to evaluate the accuracy of the forecasting results.<list list-type="simple">
<list-item>
<p>1) To directly reflect the maximum error level between the forecasting load curve and the actual load curve of customer, MAER is used to measure the error, which can be computed&#x20;by</p>
</list-item>
</list>
<disp-formula id="e15">
<mml:math id="m101">
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mi>A</mml:mi>
<mml:mi>E</mml:mi>
<mml:mi>R</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:munder>
<mml:mrow>
<mml:mi>max</mml:mi>
</mml:mrow>
<mml:mi>i</mml:mi>
</mml:munder>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>l</mml:mi>
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mi>j</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>l</mml:mi>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mi>g</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(15)</label>
</disp-formula>where <inline-formula id="inf84">
<mml:math id="m102">
<mml:mrow>
<mml:msub>
<mml:mi>l</mml:mi>
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mi>j</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> means the <inline-formula id="inf85">
<mml:math id="m103">
<mml:mi>i</mml:mi>
</mml:math>
</inline-formula>-th load point of <inline-formula id="inf86">
<mml:math id="m104">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>; <inline-formula id="inf87">
<mml:math id="m105">
<mml:mrow>
<mml:msub>
<mml:mi>l</mml:mi>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mi>g</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> denotes the <inline-formula id="inf88">
<mml:math id="m106">
<mml:mi>i</mml:mi>
</mml:math>
</inline-formula>-th load point of <inline-formula id="inf89">
<mml:math id="m107">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mi>g</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>.<list list-type="simple">
<list-item>
<p>2) To ensure that ED is not affected by the actual load level of the customer, the maximum value of the actual load curve <inline-formula id="inf90">
<mml:math id="m108">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is taken as the benchmark, and then the ED between the actual load curve and the estimated load curve can be calculated&#x20;by</p>
</list-item>
</list>
<disp-formula id="e16">
<mml:math id="m109">
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mi>g</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mi>g</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mi>g</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(16)</label>
</disp-formula>
<disp-formula id="e17">
<mml:math id="m110">
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(17)</label>
</disp-formula>
<disp-formula id="e18">
<mml:math id="m111">
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mi>g</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:msqrt>
<mml:mrow>
<mml:munderover>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>n</mml:mi>
</mml:munderover>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi>l</mml:mi>
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mi>j</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi>l</mml:mi>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mi>g</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:msqrt>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(18)</label>
</disp-formula>where <inline-formula id="inf91">
<mml:math id="m112">
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi>l</mml:mi>
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mi>j</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> means the <inline-formula id="inf92">
<mml:math id="m113">
<mml:mi>i</mml:mi>
</mml:math>
</inline-formula>-th load point of <inline-formula id="inf93">
<mml:math id="m114">
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula>; <inline-formula id="inf94">
<mml:math id="m115">
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi>l</mml:mi>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mi>g</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> denotes the <inline-formula id="inf95">
<mml:math id="m116">
<mml:mi>i</mml:mi>
</mml:math>
</inline-formula>-th load point of <inline-formula id="inf96">
<mml:math id="m117">
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">L</mml:mi>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mi>g</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="true">&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
</sec>
</sec>
<sec id="s5">
<title>Case Studies</title>
<p>The dataset used in this paper is the load data of distribution transformer in an economically developed city from 2015 to 2019. One measurement point data is obtained every 15&#xa0;min, with a total of 96 load data points per day. From the huge dataset, the distribution transformer data with high data quality and load development to saturation are selected. Combined with the corresponding industry information identification of distribution transformer users, the data cleaning and load characteristic analysis from user level to industry level are completed. Finally, the regional industry electricity consumption behavior image of the city is constructed.</p>
<p>In all industries, the metal products industry accounts for the highest proportion of users, which is a typical industry in the region. Taking the metal products industry as an example, this paper analyzes and demonstrates the output results and verifies the effectiveness of the proposed method.</p>
<sec id="s5-1">
<title>Display of Industry Electricity Consumption Behavior Portrait</title>
<sec id="s5-1-1">
<title>Analysis Results of User Level Load Characteristics</title>
<p>
<list list-type="simple">
<list-item>
<p>1) User load curve analysis</p>
</list-item>
</list>
</p>
<p>Taking a user in the metal products industry as an example, the typical daily load curve of the user is extracted. Set the PAA segment number as 6 and the number of elements in symbol set as 4. The symbol sequence with the highest number of users is <inline-formula id="inf97">
<mml:math id="m118">
<mml:mrow>
<mml:mi>a</mml:mi>
<mml:mi>b</mml:mi>
<mml:mi>d</mml:mi>
<mml:mi>d</mml:mi>
<mml:mi>d</mml:mi>
<mml:mi>b</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>. The typical load curve of the user can be obtained through <xref ref-type="disp-formula" rid="e4">Eq. 4</xref>, as shown in <xref ref-type="fig" rid="F4">Figure&#x20;4</xref>.<list list-type="simple">
<list-item>
<p>2) User load characteristic extraction</p>
</list-item>
</list>
</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>Typical load curve of a&#x20;user.</p>
</caption>
<graphic xlink:href="fenrg-09-742993-g004.tif"/>
</fig>
<p>Similarly, take the user as an example, collect the user&#x27;s relevant loading information, and calculate the corresponding user load characteristics, as shown in <xref ref-type="table" rid="T4">Table&#x20;4</xref>. It can be found that the user has large-scale production and the daily peak-valley difference is high. From the long-term development point of view, when the user load develops to saturation, the utility rate is&#x20;low.</p>
<table-wrap id="T4" position="float">
<label>TABLE 4</label>
<caption>
<p>Examples of user load characteristics.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Type</th>
<th align="center">User characteristic</th>
<th align="center">Index value</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="2" align="left">Load level</td>
<td align="left">Contract capacity</td>
<td align="center">800&#xa0;kW</td>
</tr>
<tr>
<td align="left">Final annual maximum load</td>
<td align="center">427.7&#xa0;kW</td>
</tr>
<tr>
<td rowspan="6" align="left">Peak-valley characteristics</td>
<td align="left">Daily load rate</td>
<td align="center">0.6316</td>
</tr>
<tr>
<td align="left">Daily minimum load rate</td>
<td align="center">0.2624</td>
</tr>
<tr>
<td align="left">Peak-valley difference rate</td>
<td align="center">0.7376</td>
</tr>
<tr>
<td align="left">Peak load rate</td>
<td align="center">1.1779</td>
</tr>
<tr>
<td align="left">Valley load rate</td>
<td align="center">0.5147</td>
</tr>
<tr>
<td align="left">Flat load rate</td>
<td align="center">1.2931</td>
</tr>
<tr>
<td rowspan="2" align="left">Long-term load development law</td>
<td align="left">Saturated load density</td>
<td align="center">65.39&#xa0;W/m<sup>2</sup>
</td>
</tr>
<tr>
<td align="left">Utility rate</td>
<td align="center">0.5909</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s5-1-2">
<title>Analysis Results of Industry Level Load Characteristics</title>
<p>
<list list-type="simple">
<list-item>
<p>1) Industry load curve analysis</p>
</list-item>
</list>
</p>
<p>After extracting the typical load curve of 2,478 users in the metal products industry, the load curve of this user group is clustered. The minimum cluster number <inline-formula id="inf98">
<mml:math id="m119">
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is set as two and the maximum cluster number <inline-formula id="inf99">
<mml:math id="m120">
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is set as 8. Finally, the algorithm flow is executed, the average value of SC of each clustering result is calculated through <xref ref-type="disp-formula" rid="e5">Eqs 5</xref>&#x2013;<xref ref-type="disp-formula" rid="e8">8</xref>, and the result is shown in <xref ref-type="fig" rid="F5">Figure&#x20;5</xref>.</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>Change trend of clustering effect evaluation&#x20;index.</p>
</caption>
<graphic xlink:href="fenrg-09-742993-g005.tif"/>
</fig>
<p>It can be seen that the average value of SC of GWO-FCM clustering algorithm reaches the maximum when <inline-formula id="inf100">
<mml:math id="m121">
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, then the optimal clustering number <inline-formula id="inf101">
<mml:math id="m122">
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mrow>
<mml:mi>b</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>s</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> is determined, and the corresponding industry typical electricity consumption pattern division results are shown in <xref ref-type="fig" rid="F6">Figure&#x20;6</xref>. It can be found that the load curves of each electricity consumption pattern are obviously different, and the division effect of electricity consumption pattern is satisfying.<list list-type="simple">
<list-item>
<p>1) Industry load characteristic extraction</p>
</list-item>
</list>
</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>Division result of typical electricity consumption pattern in the industry.</p>
</caption>
<graphic xlink:href="fenrg-09-742993-g006.tif"/>
</fig>
<p>After dividing the typical electricity consumption pattern of the industry, the characteristic extraction of the utility rate of 342 users in electricity consumption pattern one is taken as an example. The bandwidth <inline-formula id="inf102">
<mml:math id="m123">
<mml:mi>h</mml:mi>
</mml:math>
</inline-formula> is set as 0.01. The probability density function of the user utility rate of the group is obtained by fitting <xref ref-type="disp-formula" rid="e9">Eq. 9</xref>, and the results are shown in <xref ref-type="fig" rid="F7">Figure&#x20;7</xref>.</p>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption>
<p>Users&#x2019; utility rate distribution and probability density function.</p>
</caption>
<graphic xlink:href="fenrg-09-742993-g007.tif"/>
</fig>
</sec>
</sec>
<sec id="s5-2">
<title>Customers&#x2019; Electricity Consumption Pattern Classification</title>
<p>To verify the effectiveness of load forecasting technology for new customers, 141 metal products&#x2019; users with complete electricity consumption data are selected as hypothetical customers. Therefore, the corresponding characteristic vectors are extracted and input into the trained BB-stacking model for electricity consumption pattern classification. In this paper, the traditional decision tree model (<xref ref-type="bibr" rid="B18">Safavian and Landgrebe, 1991</xref>), GRNN model (<xref ref-type="bibr" rid="B21">Specht, 1991</xref>), and PNN model (<xref ref-type="bibr" rid="B15">Oh and Pedrycz, 2002</xref>) are used to compare with the BB-stacking model. The classification accuracy under different training sample sizes is shown in <xref ref-type="fig" rid="F8">Figure&#x20;8</xref>.</p>
<fig id="F8" position="float">
<label>FIGURE 8</label>
<caption>
<p>Accuracy rate trend of customer electricity consumption pattern classification.</p>
</caption>
<graphic xlink:href="fenrg-09-742993-g008.tif"/>
</fig>
<p>It can be found that, in the training process, the BB-stacking classification model can achieve a better classification effect than other classification models in a small training sample size. When the training sample size increases, the BB-stacking classification model can maintain a more stable generalization ability. It is obvious that the BB-stacking classification model prevents the phenomenon of classification ability decline caused by underfitting, performing better classification effect and stability.</p>
</sec>
<sec id="s5-3">
<title>Analysis of Customer Load Forecast Results</title>
<p>After the electricity consumption pattern classification of the customers is completed, the forecasting load curves of 141 customers are obtained through <xref ref-type="disp-formula" rid="e12">Eqs 12</xref>&#x2013;<xref ref-type="disp-formula" rid="e14">14</xref> combined with the typical electricity consumption characteristics of the industry. To compare the effectiveness of various methods, the similarity between the load curve forecast results and the actual load curves is evaluated by EM and MAER, and the comparative analysis is carried out from the following two aspects.<list list-type="simple">
<list-item>
<p>1) Comparative analysis of different clustering numbers</p>
</list-item>
</list>
</p>
<p>Considering that the different number of clusters selected by the clustering algorithm will lead to different electricity consumption patterns, which may affect the accuracy of the forecasting results of customers, this paper uses the GWO-FCM algorithm to test the case of cluster number <inline-formula id="inf103">
<mml:math id="m124">
<mml:mi>c</mml:mi>
</mml:math>
</inline-formula> from 2 to 8. The mean value of EM and MAER under each clustering number is compared and shown in <xref ref-type="fig" rid="F9">Figure&#x20;9</xref>.<list list-type="simple">
<list-item>
<p>2) Comparative analysis of different methods</p>
</list-item>
</list>
</p>
<fig id="F9" position="float">
<label>FIGURE 9</label>
<caption>
<p>Mean value of evaluation index on different clustering numbers.</p>
</caption>
<graphic xlink:href="fenrg-09-742993-g009.tif"/>
</fig>
<p>To further study the effectiveness of the proposed customer load forecasting method, this paper compares the traditional load estimation method with the proposed method. In addition, the sectional load curve obtained by the segment simulation method in the process of electricity consumption pattern classification is compared with the proposed method in this&#x20;paper.</p>
<p>The comparison between the load curve forecasting result of a certain customer and other methods is shown in <xref ref-type="fig" rid="F10">Figure&#x20;10</xref>. It can be found that there is little difference between the maximum load level of the estimated load and the actual load curve, but there is no time sequence characteristic. To some extent, the segment simulation method reflects the time sequence characteristics of customers, but the difference is large when the load level changes. The overall trend of the load curve forecasting result is very close to the actual load curve, which can meet the optimal access requirements in the planning field considering the time sequence characteristics of customers&#x2019;&#x20;load.</p>
<fig id="F10" position="float">
<label>FIGURE 10</label>
<caption>
<p>Comparison and analysis of customer load forecasting results.</p>
</caption>
<graphic xlink:href="fenrg-09-742993-g010.tif"/>
</fig>
<p>Comparing the error index of actual load curve and load curve forecasting results under different methods, the mean value comparison of test samples is shown in <xref ref-type="table" rid="T5">Table&#x20;5</xref>. It can be found that the proposed method performs best in both of the two evaluation indexes. Significantly, the accuracy is greatly improved compared with the traditional load estimation method.</p>
<table-wrap id="T5" position="float">
<label>TABLE 5</label>
<caption>
<p>Comparison of the mean value of error indexes of different methods.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Method</th>
<th align="center">ED</th>
<th align="center">MAER</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">Customer load forecasting</td>
<td align="char" char=".">1.3007</td>
<td align="char" char=".">0.2444</td>
</tr>
<tr>
<td align="left">Segment simulation</td>
<td align="char" char=".">1.6246</td>
<td align="char" char=".">0.2787</td>
</tr>
<tr>
<td align="left">Traditional load estimation</td>
<td align="char" char=".">3.3229</td>
<td align="char" char=".">0.5886</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Counting the two evaluation indexes of load estimation results of each hypothetical customer under different methods, two box diagrams are drawn to observe the distribution of sample indexes of each method, which are shown in <xref ref-type="fig" rid="F11">Figures 11</xref>, <xref ref-type="fig" rid="F12">12</xref>. According to the box diagrams, the sample median level of the two error evaluation indexes of the traditional load estimation method is the highest, and the error fluctuation range is the widest; the results show that the segmented simulation method achieves the suboptimal effect in the two evaluation indexes, and the error fluctuation is also controlled in a small range. It is worth noting that the median of the evaluation index samples of the proposed method is the lowest, the falling range of the error index samples is the narrowest, and the fluctuation range of the algorithm performance is smaller, showing the best stability.</p>
<fig id="F11" position="float">
<label>FIGURE 11</label>
<caption>
<p>Euclidean distance (ED) of different methods.</p>
</caption>
<graphic xlink:href="fenrg-09-742993-g011.tif"/>
</fig>
<fig id="F12" position="float">
<label>FIGURE 12</label>
<caption>
<p>Maximum absolute error ratio (MAER) of different methods.</p>
</caption>
<graphic xlink:href="fenrg-09-742993-g012.tif"/>
</fig>
</sec>
</sec>
<sec sec-type="conclusion" id="s6">
<title>Conclusion</title>
<p>Aiming at the problem of distribution network planning refinement, in this paper, the methods for regional industry electricity consumption behavior portrait construction are proposed. Further, the load forecasting technology of new customers is studied. The main conclusions are as follows:<list list-type="simple">
<list-item>
<p>1) The methods for regional industry electricity consumption behavior portrait construction can fully mine the multidimensional characteristics of users in various industries, providing effective data support for distribution network planning. The division of electricity consumption pattern can comprehensively and accurately reflect the power consumption characteristics and general laws of various industries.</p>
</list-item>
<list-item>
<p>2) The proposed load forecasting method has a good effect on the load forecasting of the customers who are not connected to the power grid, providing effective guidance for the optimization of the customers&#x27; access decision and load scheduling (<xref ref-type="bibr" rid="B9">Li et&#x20;al., 2021</xref>; <xref ref-type="bibr" rid="B8">Li and Yu, 2021</xref>). Moreover, the proposed method achieves significant performance in terms of the forecasting effect, algorithm stability, and practical application&#x20;value.</p>
</list-item>
</list>
</p>
</sec>
</body>
<back>
<sec id="s7">
<title>Data Availability Statement</title>
<p>The datasets presented in this article are not readily available because regional power consumption data are subject to confidentiality requirements. Requests to access the datasets should be directed to <email>1085283057@qq.com</email>.</p>
</sec>
<sec id="s8">
<title>Author Contributions</title>
<p>Conceptualization, methodology, and writing&#x2014;original draft, WG; data curation, DZ; investigation, HY; writing&#x2014;review and editing, BP; formal analysis and visualization, YW; resources and funding acquisition, TY; supervision,&#x20;KW.</p>
</sec>
<sec id="s9">
<title>Funding</title>
<p>The authors gratefully acknowledge the support of the Science and Technology Project supported by China Southern Power Grid (GDKJXM20172939).</p>
</sec>
<sec sec-type="COI-statement" id="s10">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s11">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ampimah</surname>
<given-names>B. C.</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Han</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Optimizing Sheddable and Shiftable Residential Electricity Consumption by Incentivized Peak and Off-Peak Credit Function Approach[J]</article-title>. <source>Appl. Energ.</source> <volume>210</volume>, <fpage>S0306261917309716</fpage>. <pub-id pub-id-type="doi">10.1016/j.apenergy.2017.07.097</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Yin</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Peng</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Rong</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Cong</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Monitoring and Recognizing Enterprise Public Opinion from High-Risk Users Based on User Portrait and Random Forest Algorithm[J]</article-title>. <source>Axioms</source> <volume>10</volume> (<issue>2</issue>), <fpage>106</fpage>. <pub-id pub-id-type="doi">10.3390/axioms10020106</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Criscuolo</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Mastroianni</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Occorsio</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>1984</year>). <article-title>Uniform Convergence of Derivatives of Extended Lagrange Interpolation[J]</article-title>. <source>J.&#x20;Comput. Appl. Maths.</source> <volume>282</volume> (<issue>1</issue>), <fpage>669</fpage>&#x2013;<lpage>698</lpage>. </citation>
</ref>
<ref id="B4">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Gao</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Cao</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2019</year>). &#x201c;<article-title>Daily Power Load Curves Analysis Based on Grey Wolf Optimization Clustering Algorithm</article-title>,&#x201d; in <conf-loc>International Seminar of Purple Mountain Forum Academic Forum</conf-loc>, <conf-loc>Beijing, China</conf-loc> (<publisher-name>Springer</publisher-name>). </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Han</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Shen</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Hu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Fan</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Optimizing the Performance of Ice-Storage Systems in Electricity Load Management through a Credit Mechanism: An Analytical Work for Jiangsu, China</article-title>. <source>Energ. Proced.</source> <volume>61</volume>, <fpage>2876</fpage>&#x2013;<lpage>2879</lpage>. <pub-id pub-id-type="doi">10.1016/j.egypro.2014.12.327</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hu</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Mao</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>He</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Hybrid Modeling for the Prediction of Leaching Rate in Leaching Process Based on Negative Correlation Learning Bagging Ensemble Algorithm</article-title>. <source>Comput. Chem. Eng.</source> <volume>35</volume> (<issue>12</issue>), <fpage>2611</fpage>&#x2013;<lpage>2617</lpage>. <pub-id pub-id-type="doi">10.1016/j.compchemeng.2011.02.012</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lambert</surname>
<given-names>C. G.</given-names>
</name>
<name>
<surname>Harrington</surname>
<given-names>S. E.</given-names>
</name>
<name>
<surname>Harvey</surname>
<given-names>C. R.</given-names>
</name>
<name>
<surname>Glodjo</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Efficient On-Line Nonparametric Kernel Density Estimation</article-title>. <source>Algorithmica</source> <volume>25</volume> (<issue>1</issue>), <fpage>37</fpage>&#x2013;<lpage>57</lpage>. <pub-id pub-id-type="doi">10.1007/pl00009282</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>A New Adaptive Controller Based on Distributed Deep Reinforcement Learning for PEMFC Air Supply System</article-title>. <source>Energ. Rep.</source> <volume>7</volume>, <fpage>1267</fpage>&#x2013;<lpage>1279</lpage>. <pub-id pub-id-type="doi">10.1016/j.egyr.2021.02.043</pub-id> </citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Efficient Experience Replay Based Deep Deterministic Policy Gradient for AGC Dispatch in Integrated Energy System</article-title>. <source>Appl. Energ.</source> <volume>285</volume>, <fpage>116386</fpage>. <pub-id pub-id-type="doi">10.1016/j.apenergy.2020.116386</pub-id> </citation>
</ref>
<ref id="B10">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Lian</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Di</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Shen</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Feng</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2014</year>). &#x201c;<article-title>Detailed Power Distribution Network Planning Based on the Description of Load Characteristics</article-title>,&#x201d; in <conf-name>China International Conference on Electricity Distribution</conf-name>, <conf-loc>Shenzhen, China</conf-loc>, <conf-date>September 23&#x2013;26, 2014</conf-date> (<publisher-name>IEEE</publisher-name>), <fpage>1759</fpage>&#x2013;<lpage>1762</lpage>. </citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lin</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Tian</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Fu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Power Load Classification Method Based on Information Entropy Piecewise Aggregate Approximation and Spectral Clustering[J]</article-title>. <source>Proc. CSEE</source> <volume>37</volume> (<issue>8</issue>), <fpage>2242</fpage>&#x2013;<lpage>2252</lpage>. <pub-id pub-id-type="doi">10.13334/j.0258-8013.pcsee.160279</pub-id> </citation>
</ref>
<ref id="B12">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Du</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2020</year>). &#x201c;<article-title>A User Portrait Construction Method for Financial Credit</article-title>,&#x201d; in <conf-name>Proceedings of 2020 Chinese Intelligent Systems Conference Volume II.</conf-name> </citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Plataniotis</surname>
<given-names>K. N.</given-names>
</name>
<name>
<surname>Venetsanopoulos</surname>
<given-names>A. N.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>S. Z.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Ensemble-based Discriminant Learning with Boosting for Face Recognition</article-title>. <source>IEEE Trans. Neural Netw.</source> <volume>17</volume> (<issue>1</issue>), <fpage>166</fpage>&#x2013;<lpage>178</lpage>. <pub-id pub-id-type="doi">10.1109/tnn.2005.860853</pub-id> </citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Notaristefano</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Chicco</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Piglione</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Data Size Reduction with Symbolic Aggregate Approximation for Electrical Load Pattern Grouping</article-title>. <source>IET Generation, Transm. Distribution</source> <volume>7</volume> (<issue>2</issue>), <fpage>108</fpage>&#x2013;<lpage>117</lpage>. <pub-id pub-id-type="doi">10.1049/iet-gtd.2012.0383</pub-id> </citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Oh</surname>
<given-names>S.-K.</given-names>
</name>
<name>
<surname>Pedrycz</surname>
<given-names>W.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>The Design of Self-Organizing Polynomial Neural Networks</article-title>. <source>Inf. Sci.</source> <volume>141</volume> (<issue>3&#x2013;4</issue>), <fpage>237</fpage>&#x2013;<lpage>258</lpage>. <pub-id pub-id-type="doi">10.1016/s0020-0255(02)00175-5</pub-id> </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pitner</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Kriksciuniene</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Sakalauskas</surname>
<given-names>V.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Tracking Customer Portrait by Unsupervised Classification Techniques[J]</article-title>. <source>Transformations Business Econ.</source> <volume>11</volume> (<issue>3</issue>), <fpage>167</fpage>&#x2013;<lpage>189</lpage>. </citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Qiu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Ying</surname>
<given-names>T. U.</given-names>
</name>
<name>
<surname>Ding</surname>
<given-names>Q.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Construction of Power Customer Portrait and its Credit Evaluation and Electricity Fee Risk Control Based on Tag Library System[J]</article-title>. <source>Telecommunications Sci.</source> <volume>1</volume>, <fpage>206</fpage>&#x2013;<lpage>213</lpage>. </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Safavian</surname>
<given-names>S. R.</given-names>
</name>
<name>
<surname>Landgrebe</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>1991</year>). <article-title>A Survey of Decision Tree Classifier Methodology</article-title>. <source>IEEE Trans. Syst. Man. Cybern.</source> <volume>21</volume> (<issue>3</issue>), <fpage>660</fpage>&#x2013;<lpage>674</lpage>. <pub-id pub-id-type="doi">10.1109/21.97458</pub-id> </citation>
</ref>
<ref id="B19">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Sanchez</surname>
<given-names>J.&#x20;J.</given-names>
</name>
<name>
<surname>Barquin</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Centeno</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>L&#xf3;pez-Pe&#xf1;a</surname>
<given-names>&#xc1;.</given-names>
</name>
</person-group> (<year>2008</year>). &#x201c;<article-title>A Multidisciplinary Approach to Model Long-Term Investments in Electricity Generation: Combining System Dynamics, Credit Risk Theory and Game Theory</article-title>,&#x201d; in <conf-name>Power &#x26; Energy Society General Meeting-conversion &#x26; Delivery of Electrical Energy in the Century</conf-name> (<publisher-name>IEEE</publisher-name>). <pub-id pub-id-type="doi">10.1109/pes.2008.4596452</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shan</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Research on User Portrait Based on Online Review: Taking C-Trip Hotel as an Example[J]</article-title>. <source>Inf. Stud. Theor. Appl</source>. </citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Specht</surname>
<given-names>D. F.</given-names>
</name>
</person-group> (<year>1991</year>). <article-title>A General Regression Neural Network</article-title>. <source>IEEE Trans. Neural Netw.</source> <volume>2</volume> (<issue>6</issue>), <fpage>568</fpage>&#x2013;<lpage>576</lpage>. <pub-id pub-id-type="doi">10.1109/72.97934</pub-id> </citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>X.-Z.</given-names>
</name>
<name>
<surname>Xing</surname>
<given-names>H.-J.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Hua</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Dong</surname>
<given-names>C.-R.</given-names>
</name>
<name>
<surname>Pedrycz</surname>
<given-names>W.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>A Study on Relationship Between Generalization Abilities and Fuzziness of Base Classifiers in Ensemble Learning</article-title>. <source>IEEE Trans. Fuzzy Syst.</source> <volume>23</volume> (<issue>5</issue>), <fpage>1638</fpage>&#x2013;<lpage>1654</lpage>. <pub-id pub-id-type="doi">10.1109/tfuzz.2014.2371479</pub-id> </citation>
</ref>
<ref id="B23">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Wu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Wan</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>2020</year>). &#x201c;<article-title>Research on Recommendation System Based on User Portrait</article-title>,&#x201d; in <conf-name>2020 IEEE International Conference on Artificial Intelligence and Information Systems (ICAIIS)</conf-name> (<publisher-name>IEEE</publisher-name>). <pub-id pub-id-type="doi">10.1109/icaiis49377.2020.9194941</pub-id> </citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xi</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Qiu</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>A Wolf Pack Hunting Strategy Based Virtual Tribes Control for Automatic Generation Control of Smart Grid</article-title>. <source>Appl. Energ.</source> <volume>178</volume>, <fpage>198</fpage>&#x2013;<lpage>211</lpage>. <pub-id pub-id-type="doi">10.1016/j.apenergy.2016.06.041</pub-id> </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Yao</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>Q. H.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Nonlinear Maximum Power point Tracking Control and Modal Analysis of DFIG Based Wind Turbine</article-title>. <source>Int. J.&#x20;Electr. Power Energ. Syst.</source> <volume>74</volume>, <fpage>429</fpage>&#x2013;<lpage>436</lpage>. <pub-id pub-id-type="doi">10.1016/j.ijepes.2015.07.036</pub-id> </citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Yao</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>Q. H.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Perturbation Estimation Based Coordinated Adaptive Passive Control for Multimachine Power Systems</article-title>. <source>Control. Eng. Pract.</source> <volume>44</volume>, <fpage>172</fpage>&#x2013;<lpage>192</lpage>. <pub-id pub-id-type="doi">10.1016/j.conengprac.2015.07.012</pub-id> </citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Yao</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Shu</surname>
<given-names>H.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Comprehensive Overview of Meta-Heuristic Algorithm Applications on Pv Cell Parameter Identification</article-title>. <source>Energ. Convers. Manage.</source> <volume>208</volume>, <fpage>112595</fpage>. <pub-id pub-id-type="doi">10.1016/j.enconman.2020.112595</pub-id> </citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Shu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Dong</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Robust Sliding-Mode Control of Wind Energy Conversion Systems for Optimal Power Extraction via Nonlinear Perturbation Observers</article-title>. <source>Appl. Energ.</source> <volume>210</volume>, <fpage>711</fpage>&#x2013;<lpage>723</lpage>. <pub-id pub-id-type="doi">10.1016/j.apenergy.2017.08.027</pub-id> </citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Shu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Sang</surname>
<given-names>Y.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Dynamic Leader Based Collective Intelligence for Maximum Power Point Tracking of PV Systems Affected by Partial Shading Condition</article-title>. <source>Energ. Convers. Manage.</source> <volume>179</volume>, <fpage>286</fpage>&#x2013;<lpage>303</lpage>. <pub-id pub-id-type="doi">10.1016/j.enconman.2018.10.074</pub-id> </citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Shu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Fang</surname>
<given-names>Z.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Grouped Grey Wolf Optimizer for Maximum Power point Tracking of Doubly-Fed Induction Generator Based Wind Turbine</article-title>. <source>Energ. Convers. Manage.</source> <volume>133</volume>, <fpage>427</fpage>&#x2013;<lpage>443</lpage>. <pub-id pub-id-type="doi">10.1016/j.enconman.2016.10.062</pub-id> </citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Zhong</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Shu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>H.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Novel Bio-Inspired Memetic Salp Swarm Algorithm and Application to MPPT for PV Systems Considering Partial Shading Condition</article-title>. <source>J.&#x20;Clean. Prod.</source> <volume>215</volume>, <fpage>1203</fpage>&#x2013;<lpage>1222</lpage>. <pub-id pub-id-type="doi">10.1016/j.jclepro.2019.01.150</pub-id> </citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yaoyao</surname>
<given-names>H. E.</given-names>
</name>
<name>
<surname>Qifa</surname>
<given-names>X. U.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>B.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>A Power Load Probability Density Forecasting Method Based on RBF Neural Network Quantile Regression[J]</article-title>. <source>Proc. CSEE</source> <volume>33</volume> (<issue>1</issue>), <fpage>93</fpage>&#x2013;<lpage>98</lpage>. </citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yu</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Xin</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Z.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Application Research on the Customer&#x27;s Portrait of Electric Power Marketing Informatization[J]</article-title>. <source>Comput. Technol. Automation</source>. </citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Qin</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zheng</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Research on Contextual Recommendation System of Agricultural Science and Technology Resource Based on User Portrait[J]</article-title>. <source>J.&#x20;Phys. Conf. Ser.</source> <volume>1693</volume> (<issue>1</issue>), <fpage>012186</fpage>. <pub-id pub-id-type="doi">10.1088/1742-6596/1693/1/012186</pub-id> </citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Tan</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Adaptive Distributed Auction-Based Algorithm for Optimal Mileage Based AGC Dispatch with High Participation of Renewable Energy</article-title>. <source>Int. J.&#x20;Electr. Power Energ. Syst.</source> <volume>124</volume>, <fpage>106371</fpage>. <pub-id pub-id-type="doi">10.1016/j.ijepes.2020.106371</pub-id> </citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Virtual Generation Tribe Based Robust Collaborative Consensus Algorithm for Dynamic Generation Command Dispatch Optimization of Smart Grid</article-title>. <source>Energy</source> <volume>101</volume>, <fpage>34</fpage>&#x2013;<lpage>51</lpage>. <pub-id pub-id-type="doi">10.1016/j.energy.2016.02.009</pub-id> </citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Zheng</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Approximate Ideal Multi-Objective Solution Q(&#x3bb;) Learning for Optimal Carbon-Energy Combined-Flow in Multi-Energy Power Systems</article-title>. <source>Energ. Convers. Manage.</source> <volume>106</volume>, <fpage>543</fpage>&#x2013;<lpage>556</lpage>. <pub-id pub-id-type="doi">10.1016/j.enconman.2015.09.049</pub-id> </citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhong</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Shao</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Zheng</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Lv</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2018</year>). &#x201c;<article-title>Research on Electricity Consumption Behavior of Electric Power Users Based on Tag Technology and Clustering Algorithm</article-title>,&#x201d; in <conf-name>2018 5th International Conference on Information Science and Control Engineering (ICISCE)</conf-name>. <pub-id pub-id-type="doi">10.1109/icisce.2018.00102</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>