<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Environ. Sci.</journal-id>
<journal-title>Frontiers in Environmental Science</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Environ. Sci.</abbrev-journal-title>
<issn pub-type="epub">2296-665X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">747101</article-id>
<article-id pub-id-type="doi">10.3389/fenvs.2021.747101</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Environmental Science</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>An Outlier-Robust Point and Interval Forecasting System for Daily PM<sub>2.5</sub> Concentration</article-title>
<alt-title alt-title-type="left-running-head">Yin and Fang</alt-title>
<alt-title alt-title-type="right-running-head">Robust Forecasting System for PM<sub>2.5</sub>
</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Yin</surname>
<given-names>Ziqi</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Fang</surname>
<given-names>Xin</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1418293/overview"/>
</contrib>
</contrib-group>
<aff id="aff1">
<label>
<sup>1</sup>
</label>Faculty of Information Technology, Macau University of Science and Technology, <addr-line>Macau</addr-line>, <country>China</country>
</aff>
<aff id="aff2">
<label>
<sup>2</sup>
</label>School of Business, Macau University of Science and Technology, <addr-line>Macau</addr-line>, <country>China</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1354214/overview">Pei Du</ext-link>, Xi&#x2019;an Jiaotong University, China</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1420855/overview">Song Ding</ext-link>, Zhejiang University of Finance and Economics, China</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1307894/overview">Kevin Li</ext-link>, University of Windsor, Canada</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Xin Fang, <email>xfang@must.edu.mo</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Environmental Economics and Management, a section of the journal Frontiers in Environmental Science</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>18</day>
<month>10</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>9</volume>
<elocation-id>747101</elocation-id>
<history>
<date date-type="received">
<day>25</day>
<month>07</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>19</day>
<month>08</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2021 Yin and Fang.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Yin and Fang</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these&#x20;terms.</p>
</license>
</permissions>
<abstract>
<p>Air pollution forecasting, particularly of PM<sub>2.5</sub> levels, can be used not only to deliver effective warning information to the public but also to provide support for decisions regarding the control and treatment of air pollution problems. However, there are still some challenging issues in air pollution forecasting that urgently need to be solved, such as how to handle and model outliers, improve forecasting stability, and correct forecasting results. In this context, this study proposes an outlier-robust forecasting system to attempt to tackle the abovementioned issues and bridge the gap in the current research. Specifically, the system developed consists of two parts that deal with point and interval forecasting, respectively. For point forecasting, a data preprocessing module is proposed based on outlier handling and data decomposition to mitigate the negative influences of outliers and noise, which can also help the model capture the main characteristics of the original time series. Meanwhile, an outlier-robust forecasting module is designed for better modeling of the preprocessed data. For the model to further improve its accuracy, a nonlinear correction module based on an error ensemble strategy is developed that can provide more accurate forecasting results. Finally, the interval forecasting part of the system is based on a newly proposed artificial intelligence&#x2013;based distribution evaluation and the results of the point forecasting part to present the range of future changes. Experimental results and analysis utilizing daily PM<sub>2.5</sub> concentration from two provincial capital cities in China are discussed to verify the superiority and effectiveness of the system developed, which can be considered an effective technique for point and interval forecasting of daily PM<sub>2.5</sub> concentration.</p>
</abstract>
<kwd-group>
<kwd>PM<sub>2.5</sub> concentration</kwd>
<kwd>point forecasting</kwd>
<kwd>interval forecasting</kwd>
<kwd>outlier handling and modeling</kwd>
<kwd>forecasting system</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p>Urbanization, industrialization, and energy consumption have caused the issue of air pollution to become increasingly serious. The air pollution issue is considered a major concern (<xref ref-type="bibr" rid="B1">Andrade et&#x20;al., 2015</xref>) and regarded as the single largest health risk (<xref ref-type="bibr" rid="B35">Wendel, 2014</xref>). It can have adverse effects on human beings and bring great economic losses as well as problems for society, affecting areas such as public health (<xref ref-type="bibr" rid="B15">Li et&#x20;al., 2017</xref>), corporate cash holdings (<xref ref-type="bibr" rid="B14">Li et&#x20;al., 2021</xref>), and the tourism industry (<xref ref-type="bibr" rid="B11">Hao et&#x20;al., 2021</xref>). In this context, to solve air pollution issues and accelerate ecological progress, air pollution forecasting, particularly of PM<sub>2.5</sub> (particulate matter with an aerodynamic diameter of 2.5&#xa0;&#x3bc;m or less) levels, has been acknowledged as a promising technique for air pollution control and treatment (<xref ref-type="bibr" rid="B21">Liu et&#x20;al., 2019a</xref>). However, the performance of air pollution forecasting suffers from many factors, and how to develop a model that can improve forecasting effectiveness is not only a challenging research topic but also a matter of growing concern. As a result, developing a more effective model for modeling PM<sub>2.5</sub> concentration has become an imperative task that cannot be postponed (<xref ref-type="bibr" rid="B41">Yang et&#x20;al., 2019a</xref>).</p>
<p>Over the past few decades, many studies have sought to develop effective air pollution forecasting models. In general, current air pollution forecasting models fall into three categories, namely, the chemical transport method (CTM), the traditional statistical method, and the artificial intelligence method. In the first category, <xref ref-type="bibr" rid="B29">Timmermans et&#x20;al. (2017)</xref> employed a CTM to analyze PM<sub>2.5</sub> in China. <xref ref-type="bibr" rid="B46">Zhang et&#x20;al. (2019)</xref> used the community multiscale air quality (CMAQ) method to analyze PM<sub>2.5</sub> pollution events in Qingdao, China. Similarly, <xref ref-type="bibr" rid="B7">Fan et&#x20;al. (2015)</xref> employed the third-generation air quality modeling system Models-3/CMAQ to analyze air pollution episodes in one region of China. However, the forecasting performance of CTMs is easily influenced by the scale and quality of the emissions data (<xref ref-type="bibr" rid="B8">Feng et&#x20;al., 2015</xref>). Furthermore, <xref ref-type="bibr" rid="B27">Stern et&#x20;al. (2008)</xref> suggested that due to incomplete descriptions of physical and chemical processes and limited knowledge of pollution sources, CTMs may produce poor forecasting results. In the traditional statistical method category, the most widely employed models are the autoregressive integrated moving average (ARIMA) model and regression methods. <xref ref-type="bibr" rid="B30">Vlachogianni et&#x20;al. (2011)</xref> employed a regression model for forecasting nitrogen oxides (NOx) and PM<sub>10</sub> (particulate matter with an aerodynamic diameter of 10&#xa0;&#x3bc;m or less). <xref ref-type="bibr" rid="B44">Zafra et&#x20;al. (2017)</xref> developed an ARIMA model using hourly PM<sub>10</sub> concentration data. 
However, traditional statistical models are unable to extract the complex and nonlinear features of pollutant concentration data, which may lead to poor forecasts (<xref ref-type="bibr" rid="B33">Wang et&#x20;al., 2020a</xref>). Fortunately, with the rapid development of advanced technologies, new methods, including artificial neural networks (ANN), support vector regression (SVR), extreme learning machines (ELM), and other artificial intelligence methods, have been proposed and are being widely employed in different forecasting fields, including air pollution forecasting. These novel methods are acknowledged as promising solutions to air pollution forecasting issues (<xref ref-type="bibr" rid="B42">Yang et&#x20;al., 2019b</xref>).</p>
<p>However, all individual models, including artificial intelligence models, have their advantages and disadvantages. For instance, they may ignore the importance of data preprocessing and optimization and thus be unsuited to meet the requirements of decision-making and management. As a result, to overcome the deficiency of individual artificial intelligence methods and improve air pollution forecasting effectiveness, researchers have started to pay increasing attention to the development of hybrid models by hybridizing multifarious methods from individual forecasting models, mainly including data decomposition techniques (empirical mode decomposition (EMD) (<xref ref-type="bibr" rid="B47">Zhu et&#x20;al., 2017</xref>), ensemble EMD (EEMD) (<xref ref-type="bibr" rid="B3">Bai et&#x20;al., 2019</xref>), complementary EEMD (CEEMD) (<xref ref-type="bibr" rid="B43">Yang and Wang, 2017</xref>), fast EEMD (FEEMD) (<xref ref-type="bibr" rid="B22">Luo et&#x20;al., 2018</xref>), complete EEMD with adaptive noise (CEEMDAN) (<xref ref-type="bibr" rid="B12">Hao and Tian, 2019</xref>), improved CEEMDAN (ICEEMDAN) (<xref ref-type="bibr" rid="B25">Sharma et&#x20;al., 2020</xref>), variational mode decomposition (VMD) (<xref ref-type="bibr" rid="B38">Wu and Lin, 2019a</xref>), wavelet transform (WT) (<xref ref-type="bibr" rid="B4">Cheng et&#x20;al., 2019</xref>), discrete WT (DWT) (<xref ref-type="bibr" rid="B26">Siwek and Osowski, 2012</xref>), stationary WT (SWT) (<xref ref-type="bibr" rid="B2">Bai et&#x20;al., 2016</xref>), maximum overlap DWT (MODWT) (<xref ref-type="bibr" rid="B23">Prakash et&#x20;al., 2011</xref>), wavelet packet decomposition (WPD) (<xref ref-type="bibr" rid="B19">Liu et&#x20;al., 2019b</xref>), and empirical WT (EWT) (<xref ref-type="bibr" rid="B18">Liu and Chen, 2020</xref>), and so on) and artificial intelligence optimization (whale optimization algorithm (WOA) (<xref ref-type="bibr" rid="B40">Xu et&#x20;al., 2017</xref>), bat algorithm (BA) (<xref 
ref-type="bibr" rid="B37">Wu and Lin, 2019b</xref>), modified grey wolf optimization (MGWO) (<xref ref-type="bibr" rid="B39">Xing et&#x20;al., 2019</xref>), cuckoo search (CS) (<xref ref-type="bibr" rid="B28">Sun and Sun, 2017</xref>), multi-objective Harris hawks optimization (MOHHO) (<xref ref-type="bibr" rid="B6">Du et&#x20;al., 2020</xref>), and so on). For example, <xref ref-type="bibr" rid="B13">Jiang et&#x20;al. (2019)</xref> designed a hybrid system based on ICEEMDAN, imperialist competitive algorithm (ICA), and backpropagation neural network (BPNN) for pollutant forecasting. Similarly, <xref ref-type="bibr" rid="B6">Du et&#x20;al. (2020)</xref> devised a hybrid model using ELM, MOHHO, and ICEEMDAN for air quality forecasting. The abovementioned hybrid forecasting studies prove the superiority of hybrid modeling, which has emerged as the most promising research direction in the air pollution forecasting&#x20;field.</p>
<p>Although many hybrid models have been proposed for air pollution forecasting, several challenging issues remain unsolved. First, as far as we are aware, most previous studies have employed data decomposition to improve forecasting performance while ignoring the handling and modeling of outliers in air pollution data, which limits how far a hybrid model can enhance forecasting performance. Second, artificial intelligence optimization algorithms are used to search for the optimal parameters of the methods in a hybrid model, but they improve forecasting ability only to a certain degree. These studies also overlook the computational cost of incorporating such algorithms, which may still fail to overcome a key limitation of the model, i.e.,&#x20;the instability of the final results. Third, most previous studies have emphasized the contribution of advanced data decomposition and optimization algorithms while neglecting to mine the characteristics of the original air pollution time series and to correct the forecasting results, both of which could further improve the model&#x2019;s forecasting performance. Finally, air pollution forecasting, especially daily air pollution forecasting, has mainly focused on point forecasting, which provides only deterministic information; this is insufficient for real applications because it conveys no uncertainty information. As far as we know, interval forecasting can compensate for this shortcoming of point forecasting, but it has received little attention from researchers in this field even though it deserves further study.</p>
<p>In this study, to overcome the abovementioned limitations of most previous studies, a novel outlier-robust point and interval forecasting system composed of two parts is proposed for forecasting daily PM<sub>2.5</sub> concentration. The point forecasting part is built on a data preprocessing module, an outlier-robust forecasting module, and a nonlinear correction module to obtain&#x20;future deterministic information, whereas the interval forecasting part combines the newly proposed artificial intelligence&#x2013;based distribution evaluation with the point forecasting results to obtain future uncertainty information. Specifically, first, a data preprocessing module that combines outlier handling and data decomposition is proposed to mitigate the negative effects of outliers and noise so that the model can capture the main features of the series and achieve better performance. Second, an outlier-robust forecasting module based on outlier-robust ELM (ORELM) is designed to forecast the preprocessed data. Next, a nonlinear correction module based on an error ensemble strategy is developed to mine information in the forecasting results and further improve the model&#x2019;s forecasting performance. Finally, an artificial intelligence&#x2013;based distribution evaluation method is designed in the interval forecasting part, which is combined with the point forecasting results to provide a range of future changes. Case studies utilizing daily PM<sub>2.5</sub> concentrations from two cities in China are designed to validate the developed system.</p>
<p>The main novelty and contribution of this study to current research can be summarized as follows:<list list-type="simple">
<list-item>
<p>(1) <italic>
<bold>The forecasting focus of the system developed is not only on point forecasting but also on interval forecasting.</bold>
</italic> Previous daily PM<sub>2.5</sub> concentration forecasting studies mainly focused on point forecasting, which provides only deterministic information and is insufficient for real applications, while neglecting interval forecasting, which can provide uncertainty information. Therefore, an outlier-robust forecasting system that covers both point and interval forecasting is proposed in this study; it compensates for the shortcomings of point forecasting and is validated on data from two cities in China.</p>
</list-item>
<list-item>
<p>(2) <italic>
<bold>An improved data preprocessing module is designed to solve the outlier data and noise information issues simultaneously.</bold>
</italic> Most previous studies employed only data decomposition for data preprocessing and ignored outlier handling and modeling, which limited further gains in forecasting performance. As a result, this study develops an improved data preprocessing module based on outlier handling and data decomposition that can effectively overcome the limitations caused by outliers and noise.</p>
</list-item>
<list-item>
<p>(3) <italic>
<bold>Point forecasting performance is further enhanced by proposing an outlier-robust forecasting module and a nonlinear correction module.</bold>
</italic> In the daily PM<sub>2.5</sub> concentration forecasting field, outlier modeling and forecasting result correction are of great importance but have been ignored by the relevant researchers. Thus, on the one hand, the outlier-robust forecasting module is designed to further solve the outlier modeling issue; on the other hand, a nonlinear correction module is developed based on an error ensemble strategy to mine information in the forecasting results and further enhance the model&#x2019;s forecasting performance.</p>
</list-item>
<list-item>
<p>(4) <italic>
<bold>Convincing experiments are designed to verify the effectiveness and superiority of the system developed.</bold>
</italic> For system evaluation, five evaluation metrics are employed for point forecasting, whereas two typical metrics are used for interval forecasting. For model comparison, four experiments are designed to demonstrate the effectiveness and superiority of outlier handling and modeling, data decomposition, the nonlinear correction module, and the system developed in daily PM<sub>2.5</sub> concentration forecasting.</p>
</list-item>
</list>
</p>
<p>The remainder of this paper is organized as follows: the methods are presented in the <italic>Methodology</italic> section; the construction of the outlier-robust point and interval forecasting system is discussed next; the <italic>Experimental Analysis</italic> section presents the experiments; and the final section draws the conclusions of this&#x20;study.</p>
</sec>
<sec sec-type="methods" id="s2">
<title>Methodology</title>
<p>This section introduces the detailed methods used in the system developed, which includes a data preprocessing module, an outlier-robust forecasting module, a nonlinear correction module, artificial intelligence&#x2013;based distribution evaluation, and interval forecasting theory.</p>
<sec id="s2-1">
<title>Data Preprocessing Module</title>
<p>The data preprocessing module combines outlier handling and data decomposition to overcome the limitations caused by outliers and noise.</p>
<sec id="s2-1-1">
<title>Outlier Handling</title>
<p>Outliers in the original time series can distort model training and lead to poor forecasting results. Therefore, in this study, an outlier handling algorithm, the Hampel filter (HF), is introduced into the data preprocessing module. HF, developed by <xref ref-type="bibr" rid="B10">Hampel (1974)</xref>, is robust against outliers (<xref ref-type="bibr" rid="B20">Liu et&#x20;al., 2004</xref>). HF can be briefly described as follows (<xref ref-type="bibr" rid="B36">Wu et&#x20;al., 2021</xref>).</p>
<p>Given a sequence <italic>x</italic>
<sub>1</sub>, <italic>x</italic>
<sub>2</sub>, &#x2026;, <italic>x</italic>
<sub>
<italic>n</italic>
</sub>, <inline-formula id="inf1">
<mml:math id="m1">
<mml:mrow>
<mml:msubsup>
<mml:mi>W</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>K</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> is a set of numbers within a moving window and <italic>m</italic>
<sub>
<italic>i</italic>
</sub> is the median value from the moving window, which are defined as follows:<disp-formula id="e1">
<mml:math id="m2">
<mml:mrow>
<mml:msubsup>
<mml:mi>W</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>K</mml:mi>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mn>...</mml:mn>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mn>...</mml:mn>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
<label>(1)</label>
</disp-formula>
<disp-formula id="e2">
<mml:math id="m3">
<mml:mrow>
<mml:msub>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>median</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mn>...</mml:mn>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mn>...</mml:mn>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
<label>(2)</label>
</disp-formula>where <italic>K</italic> is the sliding window&#x2019;s half-width.</p>
<p>The new data obtained after using HF to handle the original sequence can be defined as<disp-formula id="e3">
<mml:math id="m4">
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mtable columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mrow>
<mml:mo>&#x7c;</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>&#x7c;</mml:mo>
</mml:mrow>
<mml:mo>&#x2264;</mml:mo>
<mml:mi>t</mml:mi>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mrow>
<mml:mo>&#x7c;</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>&#x7c;</mml:mo>
</mml:mrow>
<mml:mo>&#x3e;</mml:mo>
<mml:mi>t</mml:mi>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
<label>(3)</label>
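</disp-formula>where <italic>t</italic> denotes a positive integer threshold and <italic>S</italic>
<sub>
<italic>i</italic>
</sub> denotes the scaled median absolute deviation (MAD) of the window, where the factor 1.4826 makes it a consistent estimate of the standard deviation for normally distributed data; it is given by<disp-formula id="e4">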
<mml:math id="m5">
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1.4826</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>median</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mo>&#x7c;</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>&#x7c;</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mn>...</mml:mn>
<mml:mo>,</mml:mo>
<mml:mrow>
<mml:mo>&#x7c;</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>&#x7c;</mml:mo>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:math>
<label>(4)</label>
</disp-formula>
</p>
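<p>As an illustration only, Eqs 1&#x2013;4 can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions rather than the implementation used in this study: the function name <monospace>hampel_filter</monospace>, the default values of <italic>K</italic> and <italic>t</italic>, and the truncation of the window at both ends of the series are all assumptions introduced here, since the text does not specify how boundary points are treated.</p>
<preformat>
```python
from statistics import median


def hampel_filter(x, K=3, t=3):
    """Hampel filter sketch following Eqs 1-4.

    x : list of floats (the original sequence x_1, ..., x_n)
    K : half-width of the sliding window (assumed default)
    t : threshold multiplier applied to S_i (assumed default)
    """
    n = len(x)
    y = list(x)  # y_i starts as a copy of x_i
    for i in range(n):
        # Eq 1: window W_i^K; truncated at the series ends (assumption).
        window = x[max(i - K, 0):min(i + K, n - 1) + 1]
        # Eq 2: median of the window.
        m_i = median(window)
        # Eq 4: scaled median absolute deviation of the window.
        s_i = 1.4826 * median([abs(v - m_i) for v in window])
        # Eq 3: replace x_i by the window median when it deviates too far.
        if abs(x[i] - m_i) > t * s_i:
            y[i] = m_i
    return y


series = [10.0, 11.0, 12.0, 100.0, 13.0, 14.0, 15.0]
filtered = hampel_filter(series, K=2, t=3)
```
</preformat>
<p>In this toy series, only the spike of 100.0 exceeds the threshold; it is replaced by its window median, 13.0, and the remaining values are left unchanged. Windows are always taken from the original sequence, so a replaced value does not influence the treatment of its neighbors.</p>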
</sec>
<sec id="s2-1-2">
<title>Data Decomposition</title>
<p>Data decomposition, which has been acknowledged as a promising data preprocessing technique, has been widely used in forecasting to handle complex, nonstationary, and nonlinear time series. Numerous studies have shown that the choice of data decomposition algorithm has a significant influence on forecasting performance. Thus, a suitable decomposition method should be selected to identify and extract the inner characteristics of the original time series. In previous studies, such as <xref ref-type="bibr" rid="B17">Lin et&#x20;al. (2017)</xref> and <xref ref-type="bibr" rid="B41">Yang et&#x20;al. (2019a)</xref>, VMD, proposed by <xref ref-type="bibr" rid="B5">Dragomiretskiy and Zosso (2014)</xref>, has been shown to be superior to other algorithms, including EMD, EEMD, and CEEMD. Therefore, VMD is introduced into the data preprocessing module to mitigate the negative influence of noise. The main procedure of VMD is as follows.</p>
<p>
<statement>
<p>
<bold>Step 1:</bold> The parameters of VMD are set, and each mode <inline-formula id="inf2">
<mml:math id="m6">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mi>k</mml:mi>
<mml:mn>1</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>, center pulsation <inline-formula id="inf3">
<mml:math id="m7">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>w</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mi>k</mml:mi>
<mml:mn>1</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>, and Lagrangian multipliers &#x3bb; are initialized.</p>
</statement>
</p>
<p>
<statement>
<p>
<bold>Step 2:</bold> <italic>y</italic>
<sub>
<italic>k</italic>
</sub> and <italic>w</italic><sub><italic>k</italic></sub> are updated by<disp-formula id="e5">
<mml:math id="m8">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>f</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mstyle displaystyle="true">
<mml:munder>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2260;</mml:mo>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:munder>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
<mml:mo>&#x2b;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:mfrac>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>2</mml:mn>
<mml:mi>&#x3b1;</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mfrac>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
<label>(5)</label>
</disp-formula>
<disp-formula id="e6">
<mml:math id="m9">
<mml:mrow>
<mml:msubsup>
<mml:mi>w</mml:mi>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:msubsup>
<mml:mo>&#x222b;</mml:mo>
<mml:mn>0</mml:mn>
<mml:mi>&#x221e;</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mi>w</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>&#x7c;</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>&#x7c;</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mtext>d</mml:mtext>
<mml:mi>w</mml:mi>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:msubsup>
<mml:mo>&#x222b;</mml:mo>
<mml:mn>0</mml:mn>
<mml:mi>&#x221e;</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>&#x7c;</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>&#x7c;</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mtext>d</mml:mtext>
<mml:mi>w</mml:mi>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:mfrac>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
<label>(6)</label>
</disp-formula>where <italic>n</italic> is the iteration number, <italic>&#x3b1;</italic> is the quadratic penalty parameter, <italic>f</italic>(<italic>t</italic>) and <italic>y</italic>
<sub>
<italic>k</italic>
</sub> are the original time series and the <italic>k</italic>th component, respectively, whereas <inline-formula id="inf4">
<mml:math id="m10">
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>f</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf5">
<mml:math id="m11">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf6">
<mml:math id="m12">
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, and <inline-formula id="inf7">
<mml:math id="m13">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> denote the Fourier transforms of <italic>f</italic>(<italic>t</italic>), <italic>y</italic>
<sub>
<italic>i</italic>
</sub>(<italic>t</italic>), <italic>&#x3bb;</italic>(<italic>t</italic>), and <inline-formula id="inf8">
<mml:math id="m14">
<mml:mrow>
<mml:msubsup>
<mml:mi>y</mml:mi>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, respectively.</p>
</statement>
</p>
<p>
<statement>
<p>
<bold>Step 3:</bold> &#x3bb; can be updated by<disp-formula id="e7">
<mml:math id="m15">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2190;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mi>n</mml:mi>
</mml:msup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>&#x3c0;</mml:mi>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>f</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mstyle displaystyle="true">
<mml:munder>
<mml:mo>&#x2211;</mml:mo>
<mml:mi>k</mml:mi>
</mml:munder>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:math>
<label>(7)</label>
</disp-formula>
</p>
</statement>
</p>
<p>
<statement>
<p>
<bold>Step 4:</bold> If <inline-formula id="inf9">
<mml:math id="m16">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:msub>
<mml:mo>&#x2211;</mml:mo>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mrow>
<mml:mo>&#x2016;</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:mo>&#x2212;</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mi>k</mml:mi>
<mml:mi>n</mml:mi>
</mml:msubsup>
</mml:mrow>
<mml:mo>&#x2016;</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mn>2</mml:mn>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
<mml:mo>/</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mo>&#x2016;</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mi>k</mml:mi>
<mml:mi>n</mml:mi>
</mml:msubsup>
</mml:mrow>
<mml:mo>&#x2016;</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
<mml:mn>2</mml:mn>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mo>&#x3c;</mml:mo>
<mml:mi>e</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, where <italic>e</italic> is a preset convergence tolerance, the VMD algorithm stops and returns a series of band-limited modes; otherwise, return to Step 2 and repeat the iteration.</p>
</statement>
</p>
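<p>As a minimal sketch of the updates in Eq. 6, Eq. 7, and the stopping rule of Step 4, the Python functions below operate on spectra sampled on a non-negative frequency grid. The function names, the list-based representation of each spectrum, the argument <monospace>step</monospace> (standing in for the coefficient that multiplies the bracket in Eq. 7), and the tolerance <monospace>tol</monospace> are assumptions introduced for illustration; the mode update itself is not reproduced here.</p>
<preformat><![CDATA[
```python
def center_frequency(freqs, mode_hat):
    """Eq. 6: power-weighted mean frequency of one mode over w >= 0."""
    power = [abs(v) ** 2 for v in mode_hat]
    total = sum(power)
    if total == 0.0:
        raise ValueError("mode has zero energy")
    return sum(w * p for w, p in zip(freqs, power)) / total


def dual_ascent(lam_hat, f_hat, modes_hat, step):
    """Eq. 7: move the multiplier along the reconstruction residual."""
    updated = []
    for j, lam in enumerate(lam_hat):
        residual = f_hat[j] - sum(mode[j] for mode in modes_hat)
        updated.append(lam + step * residual)
    return updated


def has_converged(new_modes, old_modes, tol):
    """Step 4: summed relative squared change of all modes is below tol.

    Assumes every previous mode has nonzero energy.
    """
    change = 0.0
    for new, old in zip(new_modes, old_modes):
        num = sum(abs(a - b) ** 2 for a, b in zip(new, old))
        den = sum(abs(b) ** 2 for b in old)
        change += num / den
    return change < tol


# Toy spectrum: equal energy at w = 1 and w = 2.
freqs = [0.0, 1.0, 2.0, 3.0]
mode = [0j, 1 + 0j, 1 + 0j, 0j]
print(center_frequency(freqs, mode))
```
]]></preformat>
<p>In a full decomposition these three functions would be called once per iteration for every mode, after the mode spectra themselves have been refreshed.</p>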
</sec>
</sec>
<sec id="s2-2">
<title>Outlier-Robust Forecasting Module</title>
<p>The base forecasting model is the foundation of a hybrid model, and its choice can make a significant difference in the forecasting results. If outliers are present in the dataset, the performance of the resulting model can degrade markedly. Because the ORELM model is designed to remain robust when the training data contain outliers, it is introduced into daily PM<sub>2.5</sub> concentration forecasting to build an outlier-robust forecasting module. ORELM is derived from the ELM model developed by <xref ref-type="bibr" rid="B9">Huang et&#x20;al. (2004)</xref>, which offers a simple structure, good performance, fast computation, and modest sample requirements. Furthermore, previous studies have shown that ELM methods outperform some typical ANN methods in forecasting tasks (<xref ref-type="bibr" rid="B41">Yang et&#x20;al., 2019a</xref>), and ELM has become one of the most promising approaches.</p>
<p>Given a training dataset with <italic>M</italic> samples, i.e.,&#x20;(<italic>x</italic>
<sub>
<italic>t</italic>
</sub>, <italic>y</italic>
<sub>
<italic>t</italic>
</sub>), <italic>t</italic>&#x20;&#x3d; 1, <inline-formula id="inf10">
<mml:math id="m17">
<mml:mo>&#x22ef;</mml:mo>
</mml:math>
</inline-formula>, <italic>M</italic>, the ELM model for input data <italic>x</italic>
<sub>
<italic>t</italic>
</sub> and output data <italic>y</italic>
<sub>
<italic>t</italic>
</sub> can be presented as<disp-formula id="e8">
<mml:math id="m18">
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>L</mml:mi>
</mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi>g</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
</mml:mrow>
</mml:mstyle>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>L</mml:mi>
</mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mstyle>
<mml:mi>G</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>w</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x22c5;</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
<label>(8)</label>
</disp-formula>where <italic>L</italic> denotes the number of hidden layer nodes, <italic>w</italic>
<sub>
<italic>i</italic>
</sub> and <italic>b</italic>
<sub>
<italic>i</italic>
</sub> denote the input weight and hidden bias of the <italic>i</italic>th hidden node, <italic>G</italic> is the activation function, <inline-formula id="inf11">
<mml:math id="m19">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> represents the output weight connecting the <italic>i</italic>th hidden layer node to the output layer, and <inline-formula id="inf12">
<mml:math id="m20">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> represents the forecasting results.</p>
<p>Define the hidden layer output matrix <italic>H</italic> as<disp-formula id="e9">
<mml:math id="m21">
<mml:mrow>
<mml:mi>H</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mtable columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:mi>h</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mtext>&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;</mml:mtext>
<mml:mo>&#x22ee;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mi>h</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>M</mml:mi>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mtable columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:mi>G</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>w</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x22c5;</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mtext>&#xa0;&#xa0;</mml:mtext>
<mml:mo>&#x22ef;</mml:mo>
<mml:mtext>&#xa0;&#xa0;&#xa0;</mml:mtext>
<mml:mi>G</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>w</mml:mi>
<mml:mi>L</mml:mi>
</mml:msub>
<mml:mo>&#x22c5;</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>L</mml:mi>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mtext>&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;</mml:mtext>
<mml:mo>&#x22ee;</mml:mo>
<mml:mtext>&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;</mml:mtext>
<mml:mo>&#x22ef;</mml:mo>
<mml:mtext>&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;&#xa0;</mml:mtext>
<mml:mo>&#x22ee;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mi>G</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>w</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x22c5;</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>M</mml:mi>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mo>&#x22ef;</mml:mo>
<mml:mtext>&#xa0;&#xa0;</mml:mtext>
<mml:mi>G</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>w</mml:mi>
<mml:mi>L</mml:mi>
</mml:msub>
<mml:mo>&#x22c5;</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>M</mml:mi>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>L</mml:mi>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>L</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:math>
<label>(9)</label>
</disp-formula>
</p>
<p>The ELM model presented in <xref ref-type="disp-formula" rid="e8">Eq. 8</xref> can be rewritten as<disp-formula id="e10">
<mml:math id="m22">
<mml:mrow>
<mml:mi>H</mml:mi>
<mml:mi>&#x3b2;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>Y</mml:mi>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
<label>(10)</label>
</disp-formula>where <inline-formula id="inf13">
<mml:math id="m23">
<mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x22ef;</mml:mo>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mi>L</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mi>T</mml:mi>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:mi>Y</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x22ef;</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>M</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mi>T</mml:mi>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
<p>The optimal solution of <inline-formula id="inf14">
<mml:math id="m24">
<mml:mi>&#x3b2;</mml:mi>
</mml:math>
</inline-formula> can be obtained by solving <inline-formula id="inf15">
<mml:math id="m25">
<mml:mrow>
<mml:munder>
<mml:mrow>
<mml:mi>min</mml:mi>
</mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
</mml:munder>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>&#x2016;</mml:mo>
<mml:mrow>
<mml:mi>H</mml:mi>
<mml:mi>&#x3b2;</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>Y</mml:mi>
</mml:mrow>
<mml:mo>&#x2016;</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>; the corresponding formula is<disp-formula id="e11">
<mml:math id="m26">
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b2;</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:msup>
<mml:mi>H</mml:mi>
<mml:mo>&#x2020;</mml:mo>
</mml:msup>
<mml:mi>Y</mml:mi>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
<label>(11)</label>
</disp-formula>where <inline-formula id="inf16">
<mml:math id="m27">
<mml:mrow>
<mml:msup>
<mml:mi>H</mml:mi>
<mml:mo>&#x2020;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> represents the Moore&#x2013;Penrose generalized inverse matrix of <italic>H</italic>; when <italic>H</italic> has full column rank, it is given by<disp-formula id="e12">
<mml:math id="m28">
<mml:mrow>
<mml:msup>
<mml:mi>H</mml:mi>
<mml:mo>&#x2020;</mml:mo>
</mml:msup>
<mml:mo>&#x3d;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mi>H</mml:mi>
<mml:mi>T</mml:mi>
</mml:msup>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mi>H</mml:mi>
<mml:mi>T</mml:mi>
</mml:msup>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:math>
<label>(12)</label>
</disp-formula>
</p>
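<p>The training procedure in Eqs 8&#x2013;12 can be sketched in a few lines of Python with NumPy. This is an illustrative sketch rather than the implementation used in this study: the sigmoid activation, the uniform range of the random input weights and biases, the function names, and the synthetic data are all assumptions.</p>
<preformat><![CDATA[
```python
import numpy as np


def hidden_output(X, W, b):
    """Eq. 9 with an assumed sigmoid G: one row per sample, one column per node."""
    return 1.0 / (1.0 + np.exp(-(X @ W + b)))


def elm_fit(X, Y, n_hidden, seed=0):
    """Draw random hidden parameters, then solve Eq. 11 for the output weights."""
    rng = np.random.default_rng(seed)
    W = rng.uniform(-1.0, 1.0, size=(X.shape[1], n_hidden))  # input weights w_i
    b = rng.uniform(-1.0, 1.0, size=n_hidden)                # hidden biases b_i
    H = hidden_output(X, W, b)                               # shape (M, L)
    beta = np.linalg.pinv(H) @ Y                             # Moore-Penrose solution
    return W, b, beta


def elm_predict(X, W, b, beta):
    """Eq. 8: forecasts as a weighted sum of hidden-node outputs."""
    return hidden_output(X, W, b) @ beta


# Synthetic regression problem (hypothetical data, for illustration only).
rng = np.random.default_rng(1)
X = rng.uniform(-1.0, 1.0, size=(200, 3))
Y = np.sin(X.sum(axis=1))
W, b, beta = elm_fit(X, Y, n_hidden=20)
rmse = np.sqrt(np.mean((elm_predict(X, W, b, beta) - Y) ** 2))
print(f"training RMSE: {rmse:.4f}")
```
]]></preformat>
<p>Calling <monospace>np.linalg.pinv</monospace> instead of forming the product in Eq. 12 explicitly gives the same output weights when <italic>H</italic> has full column rank and remains usable when it does not.</p>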
<p>As mentioned above, to enhance the robustness of the ELM model when modeling data with outliers, the ORELM model was developed by <xref ref-type="bibr" rid="B45">Zhang and Luo (2015)</xref>. The core idea is to redefine the minimization problem as<disp-formula id="e13">
<mml:math id="m29">
<mml:mrow>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:munder>
<mml:mrow>
<mml:mi>min</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>&#x3b2;</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:munder>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mo>&#x2016;</mml:mo>
<mml:mi>e</mml:mi>
<mml:mo>&#x2016;</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>k</mml:mi>
</mml:mfrac>
<mml:msubsup>
<mml:mrow>
<mml:mrow>
<mml:mo>&#x2016;</mml:mo>
<mml:mi>&#x3b2;</mml:mi>
<mml:mo>&#x2016;</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mn>2</mml:mn>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mtext>s.t.&#xa0;</mml:mtext>
<mml:mi>e</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>Y</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>H</mml:mi>
<mml:mi>&#x3b2;</mml:mi>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
<label>(13)</label>
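</disp-formula>where <italic>e</italic> represents the training error vector and <italic>k</italic> is the regularization parameter. Because the error enters through the <italic>&#x2113;</italic><sub>1</sub> norm rather than the squared <italic>&#x2113;</italic><sub>2</sub> norm, large residuals caused by outliers influence the solution less than they do in the least-squares problem.</p>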
<p>To solve the newly defined problem, the augmented Lagrange multiplier (ALM) algorithm is adopted, and the corresponding iteration process is defined as<disp-formula id="e14">
<mml:math id="m30">
<mml:mrow>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>arg</mml:mi>
<mml:munder>
<mml:mrow>
<mml:mi>min</mml:mi>
</mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
</mml:munder>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mi>&#x3bc;</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b2;</mml:mi>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x3bb;</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>arg</mml:mi>
<mml:munder>
<mml:mrow>
<mml:mi>min</mml:mi>
</mml:mrow>
<mml:mi>e</mml:mi>
</mml:munder>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mi>&#x3bc;</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>e</mml:mi>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x3bb;</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3bb;</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>&#x3bb;</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>&#x3bc;</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>Y</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>H</mml:mi>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
<label>(14)</label>
</disp-formula>where <inline-formula id="inf17">
<mml:math id="m31">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> represents the Lagrange multiplier vector, <italic>&#x3bc;</italic> is the penalty parameter, and <inline-formula id="inf18">
<mml:math id="m32">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf19">
<mml:math id="m33">
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> are defined as<disp-formula id="e15">
<mml:math id="m34">
<mml:mrow>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mi>H</mml:mi>
<mml:mi>T</mml:mi>
</mml:msup>
<mml:mi>H</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mfrac>
<mml:mn>2</mml:mn>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mi>&#x3bc;</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mi>I</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mi>H</mml:mi>
<mml:mi>T</mml:mi>
</mml:msup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>Y</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3bb;</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mi>&#x3bc;</mml:mi>
</mml:mfrac>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>shrink</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>Y</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>H</mml:mi>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3bb;</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mi>&#x3bc;</mml:mi>
</mml:mfrac>
<mml:mo>,</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>&#x3bc;</mml:mi>
</mml:mfrac>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:math>
<label>(15)</label>
</disp-formula>
</p>
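<p>The iteration in Eqs 14 and 15 can be illustrated with the following NumPy sketch. It assumes that the shrink operator is the element-wise soft-thresholding operator usually paired with an <italic>&#x2113;</italic><sub>1</sub> term; the default penalty parameter, the fixed iteration count, the function names, and the toy data (a linear design matrix standing in for <italic>H</italic>) are likewise assumptions and are not taken from this study.</p>
<preformat><![CDATA[
```python
import numpy as np


def shrink(v, kappa):
    """Soft-thresholding: move each entry toward zero by kappa."""
    return np.sign(v) * np.maximum(np.abs(v) - kappa, 0.0)


def orelm_fit(H, Y, k=1e4, mu=None, n_iter=50):
    """Alternate the updates of Eqs 14 and 15 for n_iter >= 1 iterations."""
    M, L = H.shape
    if mu is None:
        mu = 2.0 * M / np.sum(np.abs(Y))   # assumed heuristic, not from the text
    e = np.zeros(M)                        # training error vector
    lam = np.zeros(M)                      # Lagrange multiplier vector
    # (H^T H + 2/(k mu) I)^{-1} H^T does not change, so build it once.
    P = np.linalg.solve(H.T @ H + (2.0 / (k * mu)) * np.eye(L), H.T)
    for _ in range(n_iter):
        beta = P @ (Y - e + lam / mu)
        e = shrink(Y - H @ beta + lam / mu, 1.0 / mu)
        lam = lam + mu * (Y - H @ beta - e)
    return beta, e


# Toy data: a straight line with a few large positive outliers.
x = np.linspace(0.0, 1.0, 100)
H = np.column_stack([x, np.ones_like(x)])
Y = 2.0 * x + 1.0
Y[::10] += 20.0
beta_ls = np.linalg.pinv(H) @ Y
beta_robust, _ = orelm_fit(H, Y)
print("least squares:", beta_ls)
print("ORELM sketch: ", beta_robust)
```
]]></preformat>
<p>Comparing the two printed weight vectors gives a quick impression of how strongly the outliers pull the least-squares solution relative to the <italic>&#x2113;</italic><sub>1</sub>-based one.</p>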
</sec>
<sec id="s2-3">
<title>Nonlinear Correction Module</title>
<p>For a forecasting model, forecasting error is inevitable, but the short-term trend in the variation of the forecasting error can be anticipated by establishing a nonlinear model (<xref ref-type="bibr" rid="B31">Vukicevic, 1991</xref>). To further improve the performance of the system developed, a nonlinear correction module based on an error ensemble strategy is designed to mine the information remaining in the forecasting results. The module consists of three&#x20;steps.</p>
<p>
<statement>
<p>
<italic>
<bold>Step 1: generating the error sequence</bold>
</italic>
</p>
<p>Denoting the actual value of the <italic>t</italic>th datum by <italic>A</italic>(<italic>t</italic>) and its forecast value by <italic>F</italic>(<italic>t</italic>), the forecasting error of the <italic>t</italic>th datum is obtained by<disp-formula id="e16">
<mml:math id="m35">
<mml:mrow>
<mml:mi>E</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>F</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>A</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:math>
<label>(16)</label>
</disp-formula>
</p>
</statement>
</p>
<p>
<statement>
<p>
<italic>
<bold>Step 2: developing the error forecasting&#x20;model</bold>
</italic>
</p>
<p>Let <italic>E</italic>(<italic>t</italic>&#x2212;<italic>d</italic>) denote the error of the (<italic>t</italic>&#x2212;<italic>d</italic>)th datum. An error forecasting model, denoted as <italic>f</italic>, is built from the error sequence, and the forecast error of the <italic>t</italic>th datum, named <italic>EF</italic>(<italic>t</italic>), is obtained from the <italic>d</italic> preceding errors by<disp-formula id="e17">
<mml:math id="m36">
<mml:mrow>
<mml:mi>E</mml:mi>
<mml:mi>F</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>f</mml:mi>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mi>E</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mi>E</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mn>...</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi>E</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>d</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:math>
<label>(17)</label>
</disp-formula>
</p>
</statement>
</p>
<p>
<statement>
<p>
<italic>
<bold>Step 3: correcting the forecasting results</bold>
</italic>
</p>
<p>To obtain the final result from the original forecasting results and the corresponding error forecasting results, an error ensemble strategy based on ORELM is proposed, which inherits the outlier robustness of ORELM. With the outlier-robust ensemble model denoted as <italic>En.f</italic>, the final forecast of the <italic>t</italic>th datum, <italic>FF</italic>(<italic>t</italic>), is<disp-formula id="e18">
<mml:math id="m37">
<mml:mrow>
<mml:mi>F</mml:mi>
<mml:mi>F</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>E</mml:mi>
<mml:mi>n</mml:mi>
<mml:mo>.</mml:mo>
<mml:mi>f</mml:mi>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mi>E</mml:mi>
<mml:mi>F</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mi>F</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:math>
<label>(18)</label>
</disp-formula>
</p>
</statement>
</p>
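<p>The three steps can be traced end to end in the sketch below. Ordinary least squares is used purely as a stand-in for both the error forecasting model <italic>f</italic> and the ensemble model <italic>En.f</italic>, which this study builds with ORELM; the function names, the lag order, the in-sample evaluation, and the synthetic series are assumptions made for illustration.</p>
<preformat><![CDATA[
```python
import numpy as np


def lagged_matrix(series, d):
    """Rows [E(t-1), ..., E(t-d)] for t = d, ..., len(series) - 1."""
    n = len(series)
    return np.column_stack([series[d - j:n - j] for j in range(1, d + 1)])


def linear_fit(X, y):
    """Least-squares weights with an intercept column appended."""
    Xb = np.column_stack([X, np.ones(len(X))])
    return np.linalg.lstsq(Xb, y, rcond=None)[0]


def linear_apply(X, w):
    """Apply weights returned by linear_fit."""
    return np.column_stack([X, np.ones(len(X))]) @ w


def correct_forecast(A, F, d=3):
    """Return corrected forecasts FF(t) for t = d, ..., len(A) - 1."""
    E = F - A                                  # Step 1, Eq. 16
    lags = lagged_matrix(E, d)
    w_err = linear_fit(lags, E[d:])            # stand-in for f
    EF = linear_apply(lags, w_err)             # Step 2, Eq. 17
    inputs = np.column_stack([EF, F[d:]])
    w_ens = linear_fit(inputs, A[d:])          # stand-in for En.f
    return linear_apply(inputs, w_ens)         # Step 3, Eq. 18


# Synthetic series: the forecast carries a structured error plus noise.
rng = np.random.default_rng(0)
t = np.arange(120, dtype=float)
A = 50.0 + 10.0 * np.sin(t / 5.0)
F = A + 3.0 * np.sin(t / 3.0) + 0.5 * rng.standard_normal(t.size)
FF = correct_forecast(A, F, d=3)
rmse_before = np.sqrt(np.mean((F[3:] - A[3:]) ** 2))
rmse_after = np.sqrt(np.mean((FF - A[3:]) ** 2))
print(rmse_before, rmse_after)
```
]]></preformat>
<p>Both stand-in models are fitted and evaluated on the same samples here, so the printed values describe the fit only; an out-of-sample split would be needed to judge forecasting skill.</p>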
</sec>
<sec id="s2-4">
<title>Artificial Intelligence&#x2013;Based Distribution Evaluation</title>
<p>Distribution evaluation plays a vital role in many fields, such as wind energy evaluation, time series analysis, and interval forecasting. In recent years, to further mine data characteristics, researchers have fitted different distribution functions to experimental data to identify a suitable distribution; interval forecasts can then be derived from the interval forecasting theory and the point forecasting results. Although this approach is well validated in many fields, to the best of our knowledge, few studies have applied it to daily PM<sub>2.5</sub> concentration forecasting. In this context, four typical distributions, i.e.,&#x20;Weibull, Gamma, Rayleigh, and Lognormal, are introduced in this study to fit the daily PM<sub>2.5</sub> concentration data. The goodness of fit (0 &#x2264; <italic>R</italic>
<sup>2</sup> &#x2264; 1) is employed to measure the fitting performance of a distribution. Traditionally, the maximum likelihood estimation (MLE) method is used to estimate the parameters of the distribution function. However, the MLE method may not yield the parameters that maximize <italic>R</italic>
<sup>2</sup>. Because a larger <italic>R</italic>
<sup>2</sup> value indicates a better fit, determining the optimal distribution can be converted into a maximization problem. Inspired by <xref ref-type="bibr" rid="B34">Wang et&#x20;al. (2020b)</xref>, <xref ref-type="bibr" rid="B24">Schwarz et&#x20;al. (2020)</xref>, and <xref ref-type="bibr" rid="B16">Lie&#xdf; et&#x20;al. (2021)</xref>, artificial intelligence optimization can be considered a promising technique for searching for the optimal distribution parameters. Based on this idea and considering the limitations of the traditional method, an artificial intelligence&#x2013;based distribution evaluation is proposed in this study. Specifically, an advanced optimization algorithm named grey wolf optimizer (GWO) is adopted to search for the optimal parameters of each candidate distribution by maximizing <italic>R</italic>
<sup>2</sup>; accordingly, the negative of <italic>R</italic>
<sup>2</sup> is defined as the objective function to be minimized. Finally, the distribution with the largest <italic>R</italic>
<sup>2</sup> value among all candidates is selected as the optimal distribution of PM<sub>2.5</sub>, which can be combined with interval forecasting theory to achieve interval forecasting.</p>
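<p>The objective handed to the optimizer can be sketched as follows for the Weibull case. The sketch makes several assumptions that are not stated in this study: <italic>R</italic><sup>2</sup> is computed between the empirical and fitted cumulative distribution functions, the plotting positions are (<italic>i</italic> + 0.5)/<italic>n</italic>, the function names are hypothetical, and an exhaustive grid search stands in for GWO, which is not implemented here. Under this definition, a poorly fitting candidate can produce a negative value.</p>
<preformat><![CDATA[
```python
import math


def weibull_cdf(x, shape, scale):
    """Two-parameter Weibull cumulative distribution function for x >= 0."""
    return 1.0 - math.exp(-((x / scale) ** shape))


def r_squared(data, shape, scale):
    """R^2 between the empirical CDF and the Weibull CDF at the sorted data."""
    xs = sorted(data)
    n = len(xs)
    empirical = [(i + 0.5) / n for i in range(n)]
    fitted = [weibull_cdf(x, shape, scale) for x in xs]
    mean_emp = sum(empirical) / n
    ss_res = sum((e - f) ** 2 for e, f in zip(empirical, fitted))
    ss_tot = sum((e - mean_emp) ** 2 for e in empirical)
    return 1.0 - ss_res / ss_tot


def objective(params, data):
    """Value a minimizer such as GWO would receive: the negative of R^2."""
    shape, scale = params
    return -r_squared(data, shape, scale)


def grid_search(data, shapes, scales):
    """Exhaustive stand-in for the optimizer; returns the best (shape, scale)."""
    candidates = [(c, s) for c in shapes for s in scales]
    return min(candidates, key=lambda p: objective(p, data))


# Synthetic sample placed on the quantiles of a Weibull(shape=2, scale=40).
n = 200
data = [40.0 * (-math.log(1.0 - (i + 0.5) / n)) ** 0.5 for i in range(n)]
best = grid_search(data, [1.0, 1.5, 2.0, 2.5, 3.0], [20.0, 30.0, 40.0, 50.0, 60.0])
print(best, r_squared(data, *best))
```
]]></preformat>
<p>The same objective could be written for the Gamma, Rayleigh, and Lognormal candidates by swapping the cumulative distribution function, after which the candidate with the largest <italic>R</italic><sup>2</sup> would be retained.</p>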
</sec>
<sec id="s2-5">
<title>Interval Forecasting Theory</title>
<p>Given the significance level <inline-formula id="inf20">
<mml:math id="m38">
<mml:mi>&#x3b1;</mml:mi>
</mml:math>
</inline-formula>, actual value <italic>A</italic>
<sub>
<italic>t</italic>
</sub>, and lower and upper limits (<italic>L</italic>, <italic>U</italic>), the coverage probability of the interval is given by<disp-formula id="e19">
<mml:math id="m39">
<mml:mrow>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>L</mml:mi>
<mml:mo>&#x2264;</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>&#x2264;</mml:mo>
<mml:mi>U</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>2</mml:mn>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:math>
<label>(19)</label>
</disp-formula>
</p>
<p>For a random variable time series, <xref ref-type="disp-formula" rid="e19">Eq. 19</xref> can be rewritten as<disp-formula id="e20">
<mml:math id="m40">
<mml:mrow>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>L</mml:mi>
<mml:mo>&#x2264;</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>&#x2264;</mml:mo>
<mml:mi>U</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>L</mml:mi>
<mml:mo>&#x2264;</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>&#x2264;</mml:mo>
<mml:mi>U</mml:mi>
<mml:mrow>
<mml:mo>&#x7c;</mml:mo>
<mml:mrow>
<mml:mi>E</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>a</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>E</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>a</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:math>
<label>(20)</label>
</disp-formula>
</p>
<p>Assuming that the forecast values follow a similar distribution function, the variance can be estimated, and the following conditional probability formula is obtained:<disp-formula id="e21">
<mml:math id="m41">
<mml:mrow>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mtable columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>L</mml:mi>
<mml:mo>&#x2264;</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>&#x2264;</mml:mo>
<mml:mi>U</mml:mi>
<mml:mrow>
<mml:mo>&#x7c;</mml:mo>
<mml:mrow>
<mml:mi>E</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>a</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:munderover>
<mml:mo>&#x222b;</mml:mo>
<mml:mi>L</mml:mi>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>a</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:munderover>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>z</mml:mi>
<mml:mo>&#x7c;</mml:mo>
<mml:mi mathvariant="italic">&#x398;</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>z</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mrow>
<mml:munderover>
<mml:mo>&#x222b;</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>a</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mi>U</mml:mi>
</mml:munderover>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>z</mml:mi>
<mml:mo>&#x7c;</mml:mo>
<mml:mi mathvariant="italic">&#x398;</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>z</mml:mi>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mo>&#x222b;</mml:mo>
<mml:mrow>
<mml:mi>z</mml:mi>
<mml:mi>f</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>z</mml:mi>
<mml:mo>&#x7c;</mml:mo>
<mml:mi mathvariant="italic">&#x398;</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>z</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>a</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mo>&#x222b;</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>z</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>E</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>z</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mi>f</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>z</mml:mi>
<mml:mo>&#x7c;</mml:mo>
<mml:mi mathvariant="italic">&#x398;</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>z</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:msup>
<mml:mi>S</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mrow>
</mml:math>
<label>(21)</label>
</disp-formula>
</p>
<p>The lower and upper limits can be obtained by<disp-formula id="e22">
<mml:math id="m42">
<mml:mrow>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>L</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>U</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x7c;</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>L</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>&#x2264;</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>&#x2264;</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>U</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mrow>
<mml:munderover>
<mml:mo>&#x222b;</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>L</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>U</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:munderover>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>z</mml:mi>
<mml:mo>&#x7c;</mml:mo>
<mml:mi mathvariant="italic">&#x398;</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>z</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mrow>
<mml:munderover>
<mml:mo>&#x222b;</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>L</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>a</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:munderover>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>z</mml:mi>
<mml:mo>&#x7c;</mml:mo>
<mml:mi mathvariant="italic">&#x398;</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>z</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>2</mml:mn>
<mml:mi>&#x3b1;</mml:mi>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mrow>
<mml:mo>}</mml:mo>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:math>
<label>(22)</label>
</disp-formula>
</p>
</sec>
</sec>
<sec id="s3">
<title>Construction of the Outlier-Robust Forecasting System</title>
<p>The outlier-robust point and interval forecasting system is constructed in this section; the details of the system design and system evaluation are as follows.</p>
<sec id="s3-1">
<title>System Design</title>
<p>The system consists of a point forecasting part and an interval forecasting part, which provide deterministic information and uncertainty information about future PM<sub>2.5</sub> concentrations, respectively.</p>
<sec id="s3-1-1">
<title>Point Forecasting</title>
<p>The point forecasting part is developed based on three modules, i.e.,&#x20;a data preprocessing module, an outlier-robust forecasting module, and a nonlinear correction module, and consists of three phases as follows.</p>
<p>
<list list-type="simple">
<list-item>
<p>&#x25c6; <bold>Phase I: data preprocessing</bold>. Because outliers and noise in the original daily PM<sub>2.5</sub> concentration data degrade model performance, a data preprocessing module based on outlier handling and data decomposition is designed to address both issues simultaneously. First, the HF algorithm is employed to detect and correct the outliers in the original data, which removes their negative influence on model development by improving data quality. Second, the VMD data decomposition method is applied to decompose the processed data into a number of modes, which suppresses the effect of noise on the model&#x2019;s performance through signal denoising and helps the model capture the main features of the daily PM<sub>2.5</sub> concentration&#x20;data.</p>
</list-item>
<list-item>
<p>&#x25c6; <bold>Phase II: forecasting preprocessed data</bold>. In this phase, the modes obtained in Phase I are forecasted, so a suitable model must be selected for forecasting the preprocessed data. Most previous studies developed an optimized ANN model for mode forecasting while ignoring the unstable parameter settings of artificial intelligence optimization algorithms, which both undermines the stability of the forecasting model and increases computation time. In this context, the ORELM model is selected to build the outlier-robust forecasting module because of its simple structure, fast calculation speed, higher forecasting performance than some typical ANN models, and better outlier robustness than the original ELM model. The module forecasts the future value of each mode, and the daily PM<sub>2.5</sub> concentration forecast is then obtained by summing the forecasting results of all modes.</p>
</list-item>
<list-item>
<p>&#x25c6; <bold>Phase III: forecasting results correction</bold>. Most previous studies have focused on using data decomposition and optimization to enhance forecasting ability while ignoring the significance of forecasting results correction. Therefore, in this phase, a nonlinear correction module based on an error ensemble strategy is designed to mine information in the forecasting results and thereby enhance the model&#x2019;s performance, which is composed of three steps, i.e.,&#x20;generating an error sequence, developing an error forecasting model, and correcting the forecasting results. Following these three steps, the results of the point forecasting can be obtained; these results can provide deterministic information in the future.</p>
</list-item>
</list>
</p>
</sec>
<sec id="s3-1-2">
<title>Interval Forecasting</title>
<p>The interval forecasting part is developed based on the previous point forecasting results, artificial intelligence&#x2013;based distribution evaluation, and interval forecasting theory and consists of two phases as follows.</p>
<p>
<list list-type="simple">
<list-item>
<p>&#x25c6; <bold>Phase I: distribution evaluation</bold>. Distribution evaluation is the crucial basis of the interval forecasting part, whereas the traditional MLE method may not fit the optimal distribution for a specific PM<sub>2.5</sub> concentration dataset. In order to solve this issue, an artificial intelligence&#x2013;based distribution evaluation is proposed to obtain the optimal distribution. Specifically, the advanced optimization algorithm named GWO is selected to search the parameters of four typical distributions, i.e.,&#x20;Weibull, Gamma, Rayleigh, and Lognormal. Finally, the distribution function with the best <italic>R</italic>
<sup>2</sup> is considered the optimal distribution.</p>
</list-item>
<list-item>
<p>&#x25c6; <bold>Phase II: obtaining interval forecasting results</bold>. According to the point forecasting results, the interval forecasting results are estimated using the optimal distribution determined in Phase I and interval forecasting theory, which can provide uncertainty information in the future.</p>
</list-item>
</list>
</p>
</sec>
</sec>
<sec id="s3-2">
<title>System Evaluation</title>
<p>This section is designed to provide system evaluation metrics, including point forecasting evaluation and interval forecasting evaluation.</p>
<sec id="s3-2-1">
<title>Point Forecasting Evaluation</title>
<p>To evaluate the forecasting performance of the system developed for daily PM<sub>2.5</sub> concentration point forecasting, the three typical metrics listed in <xref ref-type="table" rid="T1">Table&#x20;1</xref> are selected in this&#x20;study.</p>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Three typical metrics.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Abbreviation</th>
<th align="center">Full name</th>
<th align="center">Equation</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">MAE</td>
<td align="left">Mean absolute error</td>
<td align="left">
<inline-formula id="inf21">
<mml:math id="m43">
<mml:mrow>
<mml:mi mathvariant="bold">MAE</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>N</mml:mi>
</mml:mfrac>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:munderover>
<mml:mrow>
<mml:mrow>
<mml:mo>&#x7c;</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>F</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>&#x7c;</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
</tr>
<tr>
<td align="left">RMSE</td>
<td align="left">Root mean square error</td>
<td align="left">
<inline-formula id="inf22">
<mml:math id="m44">
<mml:mrow>
<mml:mi mathvariant="bold">RMSE</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:msqrt>
<mml:mrow>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>N</mml:mi>
</mml:mfrac>
<mml:mo>&#xd7;</mml:mo>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:munderover>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>F</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:msqrt>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
</tr>
<tr>
<td align="left">MAPE</td>
<td align="left">Mean absolute percentage error</td>
<td align="left">
<inline-formula id="inf23">
<mml:math id="m45">
<mml:mrow>
<mml:mi mathvariant="bold">MAPE</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>N</mml:mi>
</mml:mfrac>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:munderover>
<mml:mrow>
<mml:mrow>
<mml:mo>&#x7c;</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>F</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>&#x7c;</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
<mml:mo>&#xd7;</mml:mo>
<mml:mn>100</mml:mn>
<mml:mo>%</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>
<italic>F</italic>
<sub>
<italic>i</italic>
</sub> and <italic>A</italic>
<sub>
<italic>i</italic>
</sub> denote the forecasting value and actual value, respectively, at time <italic>i</italic>, and <italic>N</italic> is the length of the time series.</p>
</fn>
</table-wrap-foot>
</table-wrap>
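<p>As an illustration of the three metrics in <xref ref-type="table" rid="T1">Table&#x20;1</xref>, the following minimal Python sketch computes MAE, RMSE, and MAPE from paired forecast and actual values. The function names and the sample values are hypothetical and are not taken from this study; the MAPE function assumes that no actual value is zero.</p>
<preformat>
```python
import math


def mae(forecast, actual):
    """Mean absolute error: average of |F_i - A_i|."""
    return sum(abs(f - a) for f, a in zip(forecast, actual)) / len(actual)


def rmse(forecast, actual):
    """Root mean square error: square root of the mean squared error."""
    return math.sqrt(sum((f - a) ** 2 for f, a in zip(forecast, actual)) / len(actual))


def mape(forecast, actual):
    """Mean absolute percentage error in percent; actual values must be nonzero."""
    return 100.0 * sum(abs((a - f) / a) for f, a in zip(forecast, actual)) / len(actual)


# Hypothetical values for illustration only (not data from this study).
actual = [40.0, 50.0, 80.0]
forecast = [44.0, 45.0, 80.0]

print(round(mae(forecast, actual), 4))   # 3.0
print(round(rmse(forecast, actual), 4))  # 3.6968
print(round(mape(forecast, actual), 4))  # 6.6667
```
</preformat>
<p>Because MAPE divides by the actual value, it weights errors on low-concentration days more heavily than MAE and RMSE do, which is one reason the three metrics are usually reported together.</p>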
<p>In addition to evaluating the forecasting accuracy using these three typical evaluation metrics, to measure the similarity of the forecasting value curve and actual value curve, grey relational analysis (GRA) (<xref ref-type="bibr" rid="B32">Wang et&#x20;al., 2015</xref>) is introduced into point forecasting evaluation, which provides&#x20;a&#x20;new metric named grey relational degree (GRD) for further analysis. Moreover, forecasting stability is another important metric in practical application. Thus, a metric named variance ratio (VR) (<xref ref-type="bibr" rid="B43">Yang and Wang, 2017</xref>) is introduced into the point forecasting evaluation, which can measure the forecasting stability of different models by considering the variance between the actual and the forecast values.</p>
</sec>
<sec id="s3-2-2">
<title>Interval Forecasting Evaluation</title>
<p>To evaluate the interval forecasting performance, two widely used metrics, the forecasting interval coverage probability (FICP) and the forecasting interval normalized average width (FINAW), are introduced. The FICP measures the probability that the actual observation falls within the forecasting interval, which reflects the reliability of the interval forecasting results: the larger the FICP value, the more reliable the interval forecast. However, a high FICP can be obtained simply by widening the intervals, so FINAW is employed to measure the width of the forecasting intervals; at a comparable FICP, a smaller FINAW indicates a more informative interval. The FICP and FINAW are calculated by<disp-formula id="e23">
<mml:math id="m46">
<mml:mrow>
<mml:mi>F</mml:mi>
<mml:mi>I</mml:mi>
<mml:mi>C</mml:mi>
<mml:mi>P</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>N</mml:mi>
</mml:mfrac>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mstyle>
<mml:mo>&#xd7;</mml:mo>
<mml:mn>100</mml:mn>
<mml:mtext>%</mml:mtext>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
<label>(23)</label>
</disp-formula>where <inline-formula id="inf24">
<mml:math id="m47">
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mtable columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mtext>&#xa0;</mml:mtext>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2208;</mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>U</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:mtext>&#xa0;</mml:mtext>
<mml:msub>
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2209;</mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>U</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> and <italic>N</italic> is the length of the time series,<disp-formula id="e24">
<mml:math id="m48">
<mml:mrow>
<mml:mi>F</mml:mi>
<mml:mi>I</mml:mi>
<mml:mi>N</mml:mi>
<mml:mi>A</mml:mi>
<mml:mi>W</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:mi>R</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:munderover>
</mml:mstyle>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>U</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
<label>(24)</label>
</disp-formula>where <italic>U</italic>
<sub>
<italic>i</italic>
</sub> and <italic>L</italic>
<sub>
<italic>i</italic>
</sub> denote the forecasting interval&#x2019;s upper and lower limits, respectively, at time <italic>i</italic>, and <italic>R</italic> is the range of actual values.</p>
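<p>The two interval metrics in Eqs 23, 24 can be sketched in a few lines of Python. This is a minimal illustration and not the implementation used in this study: the function names and the sample intervals are hypothetical, and <italic>R</italic> is taken as the difference between the maximum and minimum actual values.</p>
<preformat>
```python
def ficp(actual, lower, upper):
    """Forecasting interval coverage probability in percent (Eq. 23)."""
    covered = sum(1 for a, lo, hi in zip(actual, lower, upper) if hi >= a >= lo)
    return 100.0 * covered / len(actual)


def finaw(actual, lower, upper):
    """Forecasting interval normalized average width (Eq. 24)."""
    value_range = max(actual) - min(actual)  # R: range of the actual values
    if value_range == 0:
        raise ValueError("FINAW is undefined when all actual values are equal")
    total_width = sum(hi - lo for lo, hi in zip(lower, upper))
    return total_width / (len(actual) * value_range)


# Hypothetical values for illustration only (not data from this study).
actual = [30.0, 50.0, 70.0, 40.0]
lower = [25.0, 40.0, 72.0, 35.0]
upper = [35.0, 60.0, 90.0, 45.0]

print(ficp(actual, lower, upper))   # 75.0
print(finaw(actual, lower, upper))  # 0.3625
```
</preformat>
<p>In this toy example, three of the four observations fall inside their intervals, and the average interval width is about 36% of the observed range.</p>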
</sec>
</sec>
</sec>
<sec id="s4">
<title>Experimental Analysis</title>
<p>The experiments and analysis are presented in this section, which is mainly composed of data description, experiment design, and four detailed experiments.</p>
<sec id="s4-1">
<title>Data Description</title>
<p>To validate the ability of the developed outlier-robust system to perform point and interval forecasting of daily PM<sub>2.5</sub> pollution, Jinan and Zhengzhou are selected as the study areas, and one dataset collected from each city is used as an empirical case. Jinan, the capital city of Shandong Province, is located in the middle of China. Zhengzhou, the capital city of Henan Province, is located in the&#x20;middle part of the Yellow River. Both daily PM<sub>2.5</sub> concentration datasets cover one year, from July 1, 2017, to June&#x20;30, 2018. In the experiments, the data from July 1, 2017, to May 31, 2018, are used as training data to develop the proposed system, whereas the data from June 1, 2018, to June 30, 2018, are used as testing data to assess its forecasting performance.</p>
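<p>The chronological split described above can be checked with a short Python sketch. It assumes one observation per calendar day with no missing days, which is an assumption made here for illustration; the variable names are hypothetical.</p>
<preformat>
```python
from datetime import date, timedelta

# Period boundaries as stated in the data description.
start = date(2017, 7, 1)
test_start = date(2018, 6, 1)
end = date(2018, 6, 30)

# One entry per calendar day (assumes no missing observations).
days = [start + timedelta(days=k) for k in range((end - start).days + 1)]

# Chronological split: everything before June 1, 2018 is used for training.
train_days = [d for d in days if test_start > d]
test_days = [d for d in days if d >= test_start]

print(len(days), len(train_days), len(test_days))  # 365 335 30
```
</preformat>
<p>Under this assumption, each city contributes 335 training observations and 30 testing observations.</p>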
</sec>
<sec id="s4-2">
<title>Experiment Design</title>
<p>As mentioned above, this study develops an outlier-robust point and interval forecasting system composed of a data preprocessing module, an outlier-robust forecasting module, a nonlinear correction module, artificial intelligence&#x2013;based distribution evaluation, and interval forecasting theory to obtain future deterministic and uncertainty information about daily PM<sub>2.5</sub> pollution. To verify the forecasting superiority of the developed system, it must be compared with other types of forecasting models, and the contribution of each component must also be demonstrated through appropriate comparative studies. For this purpose, four experiments are designed. In Experiment I, the effectiveness of outlier handling and modeling is verified from the perspectives of data preprocessing and model selection. In Experiment II, the effectiveness of data decomposition is verified by comparing the proposed system with models based on other decomposition algorithms and with a model without decomposition. In Experiment III, the effectiveness of the nonlinear correction module is verified by comparing the proposed system with a model without correction and a model with a simple error-addition strategy. Each model in Experiments I&#x2013;III is run 100&#x20;times, and the average of the forecasting results is taken as the final forecasting result for practical application and model comparison, which reduces the influence of random factors and makes the comparison more reliable. In Experiment IV, different distributions of daily PM<sub>2.5</sub> concentration are compared to obtain the optimal distribution, and the interval forecasting results based on point forecasting are obtained and evaluated using two typical metrics.</p>
</sec>
<sec id="s4-3">
<title>Experiment I: The Effectiveness of Outlier Handling and Modeling</title>
<p>To evaluate the effectiveness of outlier handling and modeling, eight models, i.e.,&#x20;ELM, regularized ELM (RELM), weighted RELM (WRELM), ORELM, HF-ELM, HF-RELM, HF-WRELM, and HF-ORELM, are proposed and tested. The MAE, RMSE, MAPE, VR, and GRD values of these eight models are shown in <xref ref-type="table" rid="T2">Table&#x20;2</xref>. Meanwhile, the results of the different models in the two cities are depicted in <xref ref-type="fig" rid="F1">Figures 1</xref>, <xref ref-type="fig" rid="F2">2</xref>, which indicate that the ORELM model is superior to ELM, RELM, and WRELM, whereas the HF-ORELM model is superior to the seven other models. As shown in <xref ref-type="table" rid="T2">Table&#x20;2</xref>, two types of comparison can be designed based on these eight models. Comparison I compares the forecasting results of the ELM (HF-ELM), RELM (HF-RELM), WRELM (HF-WRELM), and ORELM (HF-ORELM) models. Meanwhile, Comparison II compares the forecasting results of the ORELM and HF-ORELM models (or ELM and HF-ELM, or RELM and HF-RELM, or WRELM and HF-WRELM). In other words, transverse comparison and longitudinal comparison can be conducted according to the metric values in <xref ref-type="table" rid="T2">Table&#x20;2</xref>. The detailed comparisons are as follows:<list list-type="simple">
<list-item>
<p>1) In Comparison I, comparing ORELM (HF-ORELM) with ELM (HF-ELM), RELM (HF-RELM), and WRELM (HF-WRELM) shows that the ORELM model is superior to the ELM, RELM, and WRELM models, and that the HF-ORELM model is superior to the HF-ELM, HF-RELM, and HF-WRELM models. For example, for daily PM<sub>2.5</sub> concentration forecasting in Jinan, the ORELM model has a MAPE value of 28.6266%, which is lower than the MAPE values of 31.5740%, 31.8239%, and 30.9697% for the ELM, RELM, and WRELM models, respectively. Furthermore, for daily PM<sub>2.5</sub> concentration forecasting in Zhengzhou, the HF-ORELM model achieves the best MAPE value of 25.9379%, compared with 30.4850%, 30.5535%, and 28.8852% for&#x20;the HF-ELM, HF-RELM, and HF-WRELM models, respectively. These differences illustrate that the ORELM model is more powerful and robust than the other models for daily PM<sub>2.5</sub> concentration forecasting. Therefore, the ORELM model can reasonably be selected as the basic forecasting model for the outlier-robust forecasting system.</p>
</list-item>
<list-item>
<p>2) In Comparison II, the forecasting results of models with and without outlier handling are compared to evaluate the effectiveness of outlier handling in the proposed data preprocessing module. Comparing the ORELM and HF-ORELM models (or ELM and HF-ELM, RELM and HF-RELM, or WRELM and HF-WRELM) shows that each HF-based model is superior to its counterpart without outlier handling. For example, the MAPE values of HF-ELM, HF-RELM, HF-WRELM, and HF-ORELM for daily PM<sub>2.5</sub> concentration forecasting in Jinan are 30.1351%, 29.9125%, 29.0110%, and 26.1079%, respectively, whereas the corresponding models without outlier handling have larger MAPE values, i.e.,&#x20;ELM (31.5740%), RELM (31.8239%), WRELM (30.9697%), and ORELM (28.6266%). These differences reveal that outlier handling is of great importance for daily PM<sub>2.5</sub> concentration forecasting and can be combined with data decomposition in the data preprocessing module to further improve forecasting performance.</p>
</list-item>
</list>
</p>
<table-wrap id="T2" position="float">
<label>TABLE 2</label>
<caption>
<p>Forecasting results of four individual models and HF-based models.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="center">Model</th>
<th align="center">MAE (&#x3bc;g/m<sup>3</sup>)</th>
<th align="center">RMSE (&#x3bc;g/m<sup>3</sup>)</th>
<th align="center">MAPE (%)</th>
<th align="center">VR (&#x2212;)</th>
<th align="center">GRD (&#x2212;)</th>
<th align="center">Model</th>
<th align="center">MAE (&#x3bc;g/m<sup>3</sup>)</th>
<th align="center">RMSE (&#x3bc;g/m<sup>3</sup>)</th>
<th align="center">MAPE (%)</th>
<th align="center">VR (&#x2212;)</th>
<th align="center">GRD (&#x2212;)</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td colspan="12" align="left">
<italic>Site: Jinan</italic>
</td>
</tr>
<tr>
<td align="left">ELM</td>
<td align="char" char=".">7.3582</td>
<td align="char" char=".">10.0165</td>
<td align="char" char=".">31.5740</td>
<td align="char" char=".">0.3709</td>
<td align="char" char=".">0.6769</td>
<td align="center">HF-ELM</td>
<td align="char" char=".">6.9967</td>
<td align="char" char=".">9.4329</td>
<td align="char" char=".">30.1351</td>
<td align="char" char=".">0.3342</td>
<td align="char" char=".">0.6712</td>
</tr>
<tr>
<td align="left">RELM</td>
<td align="char" char=".">7.4260</td>
<td align="char" char=".">10.0446</td>
<td align="char" char=".">31.8239</td>
<td align="char" char=".">0.3649</td>
<td align="char" char=".">0.6729</td>
<td align="center">HF-RELM</td>
<td align="char" char=".">6.9388</td>
<td align="char" char=".">9.3872</td>
<td align="char" char=".">29.9125</td>
<td align="char" char=".">0.3385</td>
<td align="char" char=".">0.6726</td>
</tr>
<tr>
<td align="left">WRELM</td>
<td align="char" char=".">7.2311</td>
<td align="char" char=".">9.8402</td>
<td align="char" char=".">30.9697</td>
<td align="char" char=".">0.3761</td>
<td align="char" char=".">0.6733</td>
<td align="center">HF-WRELM</td>
<td align="char" char=".">6.7626</td>
<td align="char" char=".">9.2276</td>
<td align="char" char=".">29.0110</td>
<td align="char" char=".">0.3529</td>
<td align="char" char=".">0.6693</td>
</tr>
<tr>
<td align="left">ORELM</td>
<td align="char" char=".">6.8231</td>
<td align="char" char=".">9.2232</td>
<td align="char" char=".">28.6266</td>
<td align="char" char=".">0.3514</td>
<td align="char" char=".">0.6560</td>
<td align="center">HF-ORELM</td>
<td align="char" char=".">6.4068</td>
<td align="char" char=".">8.2988</td>
<td align="char" char=".">26.1079</td>
<td align="char" char=".">0.4033</td>
<td align="char" char=".">0.6598</td>
</tr>
</tbody>
</table>
<table>
<thead valign="top">
<tr>
<th align="center">Model</th>
<th align="center">MAE (&#x3bc;g/m<sup>3</sup>)</th>
<th align="center">RMSE (&#x3bc;g/m<sup>3</sup>)</th>
<th align="center">MAPE (%)</th>
<th align="center">VR (&#x2212;)</th>
<th align="center">GRD (&#x2212;)</th>
<th align="center">Model</th>
<th align="center">MAE (&#x3bc;g/m<sup>3</sup>)</th>
<th align="center">RMSE (&#x3bc;g/m<sup>3</sup>)</th>
<th align="center">MAPE (%)</th>
<th align="center">VR (&#x2212;)</th>
<th align="center">GRD (&#x2212;)</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td colspan="12" align="left">
<italic>Site: Zhengzhou</italic>
</td>
</tr>
<tr>
<td align="left">ELM</td>
<td align="char" char=".">8.8045</td>
<td align="char" char=".">11.0005</td>
<td align="char" char=".">33.2161</td>
<td align="char" char=".">0.4607</td>
<td align="char" char=".">0.6804</td>
<td align="center">HF-ELM</td>
<td align="char" char=".">8.2973</td>
<td align="char" char=".">10.6228</td>
<td align="char" char=".">30.4850</td>
<td align="char" char=".">0.5829</td>
<td align="char" char=".">0.6689</td>
</tr>
<tr>
<td align="left">RELM</td>
<td align="char" char=".">8.7766</td>
<td align="char" char=".">10.9252</td>
<td align="char" char=".">33.0157</td>
<td align="char" char=".">0.4571</td>
<td align="char" char=".">0.6778</td>
<td align="center">HF-RELM</td>
<td align="char" char=".">8.3243</td>
<td align="char" char=".">10.6302</td>
<td align="char" char=".">30.5535</td>
<td align="char" char=".">0.5942</td>
<td align="char" char=".">0.6672</td>
</tr>
<tr>
<td align="left">WRELM</td>
<td align="char" char=".">8.6697</td>
<td align="char" char=".">10.7735</td>
<td align="char" char=".">32.3432</td>
<td align="char" char=".">0.5422</td>
<td align="char" char=".">0.6836</td>
<td align="center">HF-WRELM</td>
<td align="char" char=".">7.9918</td>
<td align="char" char=".">10.1581</td>
<td align="char" char=".">28.8852</td>
<td align="char" char=".">0.6194</td>
<td align="char" char=".">0.6714</td>
</tr>
<tr>
<td align="left">ORELM</td>
<td align="char" char=".">8.0171</td>
<td align="char" char=".">9.7210</td>
<td align="char" char=".">28.4607</td>
<td align="char" char=".">0.5481</td>
<td align="char" char=".">0.6977</td>
<td align="center">HF-ORELM</td>
<td align="char" char=".">7.6229</td>
<td align="char" char=".">9.6868</td>
<td align="char" char=".">25.9379</td>
<td align="char" char=".">0.7379</td>
<td align="char" char=".">0.6804</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>(&#x2212;) indicates no measurement&#x20;unit.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Forecasting results of different comparative studies in Jinan.</p>
</caption>
<graphic xlink:href="fenvs-09-747101-g001.tif"/>
</fig>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Forecasting results of different comparative studies in Zhengzhou.</p>
</caption>
<graphic xlink:href="fenvs-09-747101-g002.tif"/>
</fig>
<p>
<italic>
<bold>Summary:</bold>
</italic> Taking Zhengzhou as an example, the percentage improvements in MAPE, i.e.,&#x20;the relative reductions in MAPE with respect to the baseline model, are used to summarize the contribution and effectiveness of outlier handling and modeling in this study. The detailed results are 8.2222% (HF-ELM vs. ELM), 7.4577% (HF-RELM vs. RELM), 10.6916% (HF-WRELM vs. WRELM), 8.8642% (HF-ORELM vs. ORELM), 14.3166% (ORELM vs. ELM), 13.7965% (ORELM vs. RELM), and 12.0041% (ORELM vs. WRELM). It can be concluded that the HF algorithm and the ORELM model are suitable for outlier handling and modeling, respectively, and both contribute substantially to the success of the system developed in this&#x20;study.</p>
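<p>The improvement percentages quoted in the Summary can be recomputed from the Zhengzhou MAPE values in <xref ref-type="table" rid="T2">Table&#x20;2</xref>. The Python sketch below assumes that the improvement is defined as the relative reduction in MAPE with respect to the baseline model, a definition that is consistent with the reported figures; the function and variable names are hypothetical.</p>
<preformat>
```python
def improvement_percent(mape_baseline, mape_improved):
    """Relative reduction in MAPE with respect to the baseline, in percent."""
    return 100.0 * (mape_baseline - mape_improved) / mape_baseline


# MAPE values (%) for Zhengzhou, copied from Table 2.
mape_zhengzhou = {
    "ELM": 33.2161, "RELM": 33.0157, "WRELM": 32.3432, "ORELM": 28.4607,
    "HF-ELM": 30.4850, "HF-RELM": 30.5535, "HF-WRELM": 28.8852, "HF-ORELM": 25.9379,
}

# (improved model, baseline model) pairs listed in the Summary.
pairs = [
    ("HF-ELM", "ELM"), ("HF-RELM", "RELM"), ("HF-WRELM", "WRELM"),
    ("HF-ORELM", "ORELM"), ("ORELM", "ELM"), ("ORELM", "RELM"), ("ORELM", "WRELM"),
]

for improved, baseline in pairs:
    value = improvement_percent(mape_zhengzhou[baseline], mape_zhengzhou[improved])
    print(f"{improved} vs. {baseline}: {value:.4f}%")
```
</preformat>
<p>For example, the first pair gives 100 &#xd7; (33.2161 &#x2212; 30.4850)/33.2161, which rounds to the 8.2222% reported for HF-ELM vs. ELM.</p>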
</sec>
<sec id="s4-4">
<title>Experiment II: The Effectiveness of Data Decomposition</title>
<p>To verify the contribution of data decomposition in the proposed data preprocessing module and the superiority of the developed outlier-robust forecasting module, four models, i.e.,&#x20;HF-ORELM, HF-EMD-ORELM-S, HF-EEMD-ORELM-S, and HF-VMD-ORELM-S, are developed and compared for Jinan and Zhengzhou. The HF-EMD-ORELM-S, HF-EEMD-ORELM-S, and HF-VMD-ORELM-S models use different data decomposition algorithms to decompose the data after outlier handling into several modes, and the forecasting results of all modes are simply summed to obtain the daily PM<sub>2.5</sub> concentration forecasting results. The MAE, RMSE, MAPE, VR, and GRD values of the four models are shown in <xref ref-type="table" rid="T3">Table&#x20;3</xref>. Moreover, the forecasting results of these four models in the two cities are shown in <xref ref-type="fig" rid="F3">Figure&#x20;3</xref>, which indicates that the HF-VMD-ORELM-S model is superior to the original HF-ORELM model and to the EMD- and EEMD-based models. In this experiment, two comparisons are designed as follows:<list list-type="simple">
<list-item>
<p>1) Comparison I validates the choice of data decomposition algorithm by comparing HF-VMD-ORELM-S with the other decomposition-based forecasting models, i.e.,&#x20;HF-EMD-ORELM-S and HF-EEMD-ORELM-S. The HF-EMD-ORELM-S model yields the largest MAE, RMSE, and MAPE of the three models, whereas the VMD-based model yields the smallest. For example, for daily PM<sub>2.5</sub> concentration forecasting in Jinan, the MAE, RMSE, MAPE, VR, and GRD values of HF-VMD-ORELM-S are 1.1259, 1.5228, 3.8169%, 0.9523, and 0.9222, respectively, whereas the corresponding values of HF-EMD-ORELM-S are 4.2089, 5.9923, 14.1646%, 0.9024, and 0.7705, and those of HF-EEMD-ORELM-S are 2.1140, 2.7875, 7.0637%, 0.8161, and 0.8652. The forecasting power of the three models thus differs markedly, which demonstrates the importance of selecting a suitable data decomposition algorithm for the data preprocessing module and the system developed. Therefore, in this study, the VMD algorithm is combined with outlier handling to design the data preprocessing module, which also contributes substantially to the success of the system developed.</p>
</list-item>
<list-item>
<p>2) Comparison II is designed to verify the superiority of the outlier-robust forecasting module in daily PM<sub>2.5</sub> concentration forecasting. Experiment I showed that HF-ORELM outperforms ORELM, which means that outlier handling is an indispensable part of the forecasting system developed. There is therefore no need to compare the proposed outlier-robust forecasting module with the individual ORELM model; instead, in Comparison II, it is compared with the HF-ORELM model to assess the contribution of the VMD algorithm. From <xref ref-type="table" rid="T3">Table&#x20;3</xref>, it can be observed that the HF-VMD-ORELM-S model performs better than the HF-ORELM model in terms of MAE, RMSE, MAPE, VR, and GRD. For example, in Jinan, the MAE, RMSE, MAPE, VR, and GRD values improve from (6.4068, 8.2988, 26.1079%, 0.4033, and 0.6598) for HF-ORELM to (1.1259, 1.5228, 3.8169%, 0.9523, and 0.9222) for HF-VMD-ORELM-S. The results indicate that the forecasting ability of the HF-VMD-ORELM-S model is superior to that of the benchmark model, which provides a better basis for the success of the system developed.</p>
</list-item>
</list>
</p>
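<p>The simple-addition step and the error metrics used in this comparison can be sketched as follows. This is a minimal illustration rather than the implementation used in this study: the function names and all numeric values are hypothetical, the MAE, RMSE, and MAPE formulas are the standard definitions, and VR and GRD are omitted because their definitions are not restated in this section.</p>
<preformat>
```python
import math


def aggregate_by_simple_addition(mode_forecasts):
    """Sum the per-mode forecasts element-wise to rebuild the series forecast."""
    return [sum(values) for values in zip(*mode_forecasts)]


def mae(actual, predicted):
    """Mean absolute error."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean square error."""
    squared = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    return math.sqrt(squared / len(actual))


def mape(actual, predicted):
    """Mean absolute percentage error in percent; assumes no zero in `actual`."""
    ratios = sum(abs((a - p) / a) for a, p in zip(actual, predicted))
    return 100.0 * ratios / len(actual)


# Hypothetical numbers for illustration only (not data from the study).
mode_forecasts = [
    [30.0, 32.0, 35.0],  # forecast of a low-frequency mode
    [4.0, -2.0, 1.0],    # forecast of a mid-frequency mode
    [1.0, 0.0, -1.0],    # forecast of a high-frequency mode
]
actual = [36.0, 30.0, 34.0]

forecast = aggregate_by_simple_addition(mode_forecasts)
print("forecast:", forecast)
print(f"MAE={mae(actual, forecast):.4f}")
print(f"RMSE={rmse(actual, forecast):.4f}")
print(f"MAPE={mape(actual, forecast):.4f}%")
```
</preformat>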
<table-wrap id="T3" position="float">
<label>TABLE 3</label>
<caption>
<p>Forecasting results of HF-ORELM and HF-ORELM with different data decompositions.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Model</th>
<th align="center">MAE (&#x3bc;g/m<sup>3</sup>)</th>
<th align="center">RMSE (&#x3bc;g/m<sup>3</sup>)</th>
<th align="center">MAPE (%)</th>
<th align="center">VR (&#x2212;)</th>
<th align="center">GRD (&#x2212;)</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td colspan="6" align="left">
<italic>Site: Jinan</italic>
</td>
</tr>
<tr>
<td align="left">HF-ORELM</td>
<td align="char" char=".">6.4068</td>
<td align="char" char=".">8.2988</td>
<td align="char" char=".">26.1079</td>
<td align="char" char=".">0.4033</td>
<td align="char" char=".">0.6598</td>
</tr>
<tr>
<td align="left">HF-EMD-ORELM-S</td>
<td align="char" char=".">4.2089</td>
<td align="char" char=".">5.9923</td>
<td align="char" char=".">14.1646</td>
<td align="char" char=".">0.9024</td>
<td align="char" char=".">0.7705</td>
</tr>
<tr>
<td align="left">HF-EEMD-ORELM-S</td>
<td align="char" char=".">2.1140</td>
<td align="char" char=".">2.7875</td>
<td align="char" char=".">7.0637</td>
<td align="char" char=".">0.8161</td>
<td align="char" char=".">0.8652</td>
</tr>
<tr>
<td align="left">HF-VMD-ORELM-S</td>
<td align="char" char=".">1.1259</td>
<td align="char" char=".">1.5228</td>
<td align="char" char=".">3.8169</td>
<td align="char" char=".">0.9523</td>
<td align="char" char=".">0.9222</td>
</tr>
</tbody>
</table>
<table>
<thead valign="top">
<tr>
<th align="left">Model</th>
<th align="center">MAE (&#x3bc;g/m<sup>3</sup>)</th>
<th align="center">RMSE (&#x3bc;g/m<sup>3</sup>)</th>
<th align="center">MAPE (%)</th>
<th align="center">VR (&#x2212;)</th>
<th align="center">GRD (&#x2212;)</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td colspan="6" align="left">
<italic>Site: Zhengzhou</italic>
</td>
</tr>
<tr>
<td align="left">HF-ORELM</td>
<td align="char" char=".">7.6229</td>
<td align="char" char=".">9.6868</td>
<td align="char" char=".">25.9379</td>
<td align="char" char=".">0.7379</td>
<td align="char" char=".">0.6804</td>
</tr>
<tr>
<td align="left">HF-EMD-ORELM-S</td>
<td align="char" char=".">5.5281</td>
<td align="char" char=".">6.4669</td>
<td align="char" char=".">18.1868</td>
<td align="char" char=".">0.9350</td>
<td align="char" char=".">0.7335</td>
</tr>
<tr>
<td align="left">HF-EEMD-ORELM-S</td>
<td align="char" char=".">2.6744</td>
<td align="char" char=".">3.6666</td>
<td align="char" char=".">8.3559</td>
<td align="char" char=".">0.7184</td>
<td align="char" char=".">0.8596</td>
</tr>
<tr>
<td align="left">HF-VMD-ORELM-S</td>
<td align="char" char=".">1.4782</td>
<td align="char" char=".">2.7491</td>
<td align="char" char=".">5.0868</td>
<td align="char" char=".">0.9365</td>
<td align="char" char=".">0.9151</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>(&#x2212;) indicates no measurement&#x20;unit.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Forecasting results of HF-ORELM with/without data decomposition.</p>
</caption>
<graphic xlink:href="fenvs-09-747101-g003.tif"/>
</fig>
<p>
<italic>
<bold>Summary:</bold>
</italic> Taking Jinan as an example, the percentage improvements in MAPE between different models summarize the contribution and effectiveness of data decomposition in this study. The detailed results are 85.3803% (HF-VMD-ORELM-S vs HF-ORELM), 73.0532% (HF-VMD-ORELM-S vs HF-EMD-ORELM-S), 45.9646% (HF-VMD-ORELM-S vs HF-EEMD-ORELM-S), 45.7459% (HF-EMD-ORELM-S vs HF-ORELM), and 72.9442% (HF-EEMD-ORELM-S vs HF-ORELM). It can be concluded that the VMD algorithm is superior to the EMD and EEMD algorithms and is a promising technique for decomposing daily PM<sub>2.5</sub> concentration series, which also contributes substantially to the success of the system developed.</p>
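<p>The improvement percentages in this summary can be reproduced from the Jinan MAPE values in <xref ref-type="table" rid="T3">Table&#x20;3</xref>, assuming that the improvement is defined as the relative reduction in MAPE with respect to the baseline model. The sketch below rests on that assumption, and the function name is illustrative.</p>
<preformat>
```python
# MAPE values (%) for Jinan, copied from Table 3.
mape_jinan = {
    "HF-ORELM": 26.1079,
    "HF-EMD-ORELM-S": 14.1646,
    "HF-EEMD-ORELM-S": 7.0637,
    "HF-VMD-ORELM-S": 3.8169,
}


def improvement_pct(candidate, baseline):
    """Relative MAPE reduction of `candidate` versus `baseline`, in percent.

    Assumed definition: (baseline - candidate) / baseline * 100.
    """
    return (baseline - candidate) / baseline * 100.0


# (candidate, baseline) pairs in the order used in the summary.
pairs = [
    ("HF-VMD-ORELM-S", "HF-ORELM"),
    ("HF-VMD-ORELM-S", "HF-EMD-ORELM-S"),
    ("HF-VMD-ORELM-S", "HF-EEMD-ORELM-S"),
    ("HF-EMD-ORELM-S", "HF-ORELM"),
    ("HF-EEMD-ORELM-S", "HF-ORELM"),
]

improvements = {
    (cand, base): improvement_pct(mape_jinan[cand], mape_jinan[base])
    for cand, base in pairs
}

for (cand, base), value in improvements.items():
    print(f"{cand} vs {base}: {value:.4f}%")
```
</preformat>
<p>Under this assumed definition, the computed values agree with the percentages reported in the summary to four decimal places.</p>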
</sec>
<sec id="s4-5">
<title>Experiment III: The Effectiveness of the Nonlinear Correction Module</title>
<p>As mentioned above, the third module, named the nonlinear correction module, is proposed to correct the results of the outlier-robust forecasting module to further improve the daily PM<sub>2.5</sub> concentration forecasting performance. To prove the superiority and effectiveness of the proposed nonlinear correction module and the system developed for point forecasting, the performance of the point forecasting part developed, i.e.,&#x20;HF-VMD-ORELM&#x2b;EnError, is compared with HF-VMD-ORELM-S and HF-VMD-ORELM&#x2b;Error in this section. In detail, the HF-VMD-ORELM-S model without a correcting process is the best model in Experiment II, which can provide the results of the devised forecasting module, whereas the HF-VMD-ORELM&#x2b;Error model is a model with a simple error-addition strategy. The MAE, RMSE, MAPE, VR, and GRD values of the system developed, HF-VMD-ORELM-S, and HF-VMD-ORELM&#x2b;Error are listed in <xref ref-type="table" rid="T4">Table&#x20;4</xref>; meanwhile, the results of these three models are shown in <xref ref-type="fig" rid="F4">Figure&#x20;4</xref>. Based on Experiment III, the following conclusions can be obtained:<list list-type="simple">
<list-item>
<p>1) The HF-VMD-ORELM&#x2b;Error model performs better than the HF-VMD-ORELM-S model in Zhengzhou but worse in Jinan in terms of MAE and MAPE, which indicates that the simple error-addition strategy cannot guarantee effective error correction. How to correct the forecasting results is therefore a challenging issue in forecasting, and the correction method plays a vital role in the success of the system developed. In this context, a nonlinear correction module based on an error ensemble strategy is presented to further improve the model&#x2019;s forecasting performance.</p>
</list-item>
<list-item>
<p>2) The main difference between the HF-VMD-ORELM&#x2b;EnError model and the HF-VMD-ORELM&#x2b;Error model is the method used to correct the forecasting results. As shown in <xref ref-type="table" rid="T4">Table&#x20;4</xref>, the system developed attains MAE, RMSE, MAPE, VR, and GRD values of (1.0744, 1.5525, 3.5736%, 0.9648, and 0.9228) in Jinan and (1.3841, 2.7134, 4.8328%, 0.9439, and 0.9193) in Zhengzhou. Compared with the HF-VMD-ORELM&#x2b;Error model, it achieves lower MAE and MAPE and higher VR and GRD in both cities, although its RMSE is slightly higher. Therefore, the HF-VMD-ORELM&#x2b;EnError model performs better overall than the HF-VMD-ORELM&#x2b;Error model, which supports the superiority of the nonlinear correction module based on an error ensemble strategy.</p>
</list-item>
<list-item>
<p>3) The contribution of the proposed nonlinear correction module and the superiority of the system developed can be measured by comparing the point forecasting results of the system developed with those of the HF-VMD-ORELM-S model. As shown in <xref ref-type="table" rid="T4">Table&#x20;4</xref>, in comparison with the HF-VMD-ORELM-S model, the proposed system presents an improvement from (1.4782, 2.7491, 5.0868%, 0.9365, and 0.9151) to (1.3841, 2.7134, 4.8328%, 0.9439, and 0.9193) in terms of MAE, RMSE, MAPE, VR, and GRD in Zhengzhou. In the previous experiments, the HF-VMD-ORELM-S model was shown to outperform the other benchmark models. Therefore, given that the system developed performs better than HF-VMD-ORELM-S, we can conclude that the system designed in this study is superior to the compared models and is well suited to daily PM<sub>2.5</sub> concentration forecasting.</p>
</list-item>
</list>
</p>
<table-wrap id="T4" position="float">
<label>TABLE 4</label>
<caption>
<p>Metrics of the system developed and compared models.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Model</th>
<th align="center">MAE (&#x3bc;g/m<sup>3</sup>)</th>
<th align="center">RMSE (&#x3bc;g/m<sup>3</sup>)</th>
<th align="center">MAPE (%)</th>
<th align="center">VR (&#x2212;)</th>
<th align="center">GRD (&#x2212;)</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td colspan="6" align="left">
<italic>Site: Jinan</italic>
</td>
</tr>
<tr>
<td align="left">HF-VMD-ORELM-S</td>
<td align="char" char=".">1.1259</td>
<td align="char" char=".">1.5228</td>
<td align="char" char=".">3.8169</td>
<td align="char" char=".">0.9523</td>
<td align="char" char=".">0.9222</td>
</tr>
<tr>
<td align="left">HF-VMD-ORELM&#x2b;Error</td>
<td align="char" char=".">1.1472</td>
<td align="char" char=".">1.5203</td>
<td align="char" char=".">3.9048</td>
<td align="char" char=".">0.9529</td>
<td align="char" char=".">0.9224</td>
</tr>
<tr>
<td align="left">HF-VMD-ORELM&#x2b;EnError</td>
<td align="char" char=".">1.0744</td>
<td align="char" char=".">1.5525</td>
<td align="char" char=".">3.5736</td>
<td align="char" char=".">0.9648</td>
<td align="char" char=".">0.9228</td>
</tr>
</tbody>
</table>
<table>
<thead valign="top">
<tr>
<th align="left">Model</th>
<th align="center">MAE (&#x3bc;g/m<sup>3</sup>)</th>
<th align="center">RMSE (&#x3bc;g/m<sup>3</sup>)</th>
<th align="center">MAPE (%)</th>
<th align="center">VR (&#x2212;)</th>
<th align="center">GRD (&#x2212;)</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td colspan="6" align="left">
<italic>Site: Zhengzhou</italic>
</td>
</tr>
<tr>
<td align="left">HF-VMD-ORELM-S</td>
<td align="char" char=".">1.4782</td>
<td align="char" char=".">2.7491</td>
<td align="char" char=".">5.0868</td>
<td align="char" char=".">0.9365</td>
<td align="char" char=".">0.9151</td>
</tr>
<tr>
<td align="left">HF-VMD-ORELM&#x2b;Error</td>
<td align="char" char=".">1.4252</td>
<td align="char" char=".">2.6989</td>
<td align="char" char=".">5.0681</td>
<td align="char" char=".">0.9350</td>
<td align="char" char=".">0.9192</td>
</tr>
<tr>
<td align="left">HF-VMD-ORELM&#x2b;EnError</td>
<td align="char" char=".">1.3841</td>
<td align="char" char=".">2.7134</td>
<td align="char" char=".">4.8328</td>
<td align="char" char=".">0.9439</td>
<td align="char" char=".">0.9193</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>(&#x2212;) indicates no measurement&#x20;unit.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>Forecasting results of the system developed and compared models in Jinan and Zhengzhou.</p>
</caption>
<graphic xlink:href="fenvs-09-747101-g004.tif"/>
</fig>
<p>
<italic>
<bold>Summary:</bold>
</italic> Taking the MAPE metric as an example, the percentage improvements between different models summarize the contribution and effectiveness of the nonlinear correction module in this study. The detailed results for Jinan are 6.3743% (HF-VMD-ORELM&#x2b;EnError vs HF-VMD-ORELM-S), 8.4819% (HF-VMD-ORELM&#x2b;EnError vs HF-VMD-ORELM&#x2b;Error), and &#x2212;2.3029% (HF-VMD-ORELM&#x2b;Error vs HF-VMD-ORELM-S), whereas the values for Zhengzhou are 4.9933% (HF-VMD-ORELM&#x2b;EnError vs HF-VMD-ORELM-S), 4.6428% (HF-VMD-ORELM&#x2b;EnError vs HF-VMD-ORELM&#x2b;Error), and 0.3676% (HF-VMD-ORELM&#x2b;Error vs HF-VMD-ORELM-S). It can be concluded that the proposed nonlinear correction module not only improves the final forecasting results but is also superior to the simple error-addition strategy of the HF-VMD-ORELM&#x2b;Error model, which can even perform worse than the HF-VMD-ORELM-S model, as it does in Jinan. In other words, the proposed nonlinear correction module is suitable for correcting forecasting results and contributes to improving the performance of the system developed.
</p>
</sec>
<sec id="s4-6">
<title>Experiment IV: Interval Forecasting</title>
<p>In the system developed, interval forecasting is achieved by the interval forecasting part on the basis of the point forecasting results, artificial intelligence&#x2013;based distribution evaluation, and interval forecasting theory. To perform the interval forecasting, a distribution evaluation of the daily PM<sub>2.5</sub> concentration data is conducted first. As mentioned above, the traditional MLE method may not obtain the optimal distribution function for a specific PM<sub>2.5</sub> concentration dataset, whereas artificial intelligence optimization is a powerful technique for determining the optimal distribution. Therefore, in this study, the GWO algorithm is selected to fit four typical distributions, i.e.,&#x20;Weibull, Gamma, Rayleigh, and Lognormal. To assess the benefit of GWO, each distribution is also fitted by MLE, and the parameters and <italic>R</italic>
<sup>2</sup> values provided by MLE and GWO are presented in <xref ref-type="table" rid="T5">Table&#x20;5</xref>. Furthermore, the comparison is also depicted in <xref ref-type="fig" rid="F5">Figure&#x20;5</xref>.</p>
<table-wrap id="T5" position="float">
<label>TABLE 5</label>
<caption>
<p>Distribution parameters and <italic>R</italic>
<sup>2</sup> provided by MLE and GWO.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th rowspan="2" align="left">Site</th>
<th rowspan="2" align="center">Method</th>
<th colspan="3" align="center">Weibull</th>
<th colspan="3" align="center">Gamma</th>
<th colspan="2" align="center">Rayleigh</th>
<th colspan="3" align="center">Lognormal</th>
</tr>
<tr>
<th align="center">
<italic>&#x3bb;</italic>
</th>
<th align="center">
<italic>k</italic>
</th>
<th align="center">
<italic>R</italic>
<sup>
<italic>2</italic>
</sup>
</th>
<th align="center">
<italic>&#x3b8;</italic>
</th>
<th align="center">
<italic>k</italic>
</th>
<th align="center">
<italic>R</italic>
<sup>
<italic>2</italic>
</sup>
</th>
<th align="center">
<italic>&#x3c3;</italic>
</th>
<th align="center">
<italic>R</italic>
<sup>
<italic>2</italic>
</sup>
</th>
<th align="center">
<italic>&#x3bc;</italic>
</th>
<th align="center">
<italic>&#x3c3;</italic>
</th>
<th align="center">
<italic>R</italic>
<sup>
<italic>2</italic>
</sup>
</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="2" align="left">Jinan</td>
<td align="left">MLE</td>
<td align="char" char=".">64.9598</td>
<td align="char" char=".">1.7146</td>
<td align="char" char=".">0.9708</td>
<td align="char" char=".">17.6411</td>
<td align="char" char=".">3.2567</td>
<td align="char" char=".">0.9839</td>
<td align="char" char=".">48.2826</td>
<td align="char" char=".">0.9432</td>
<td align="char" char=".">3.8896</td>
<td align="char" char=".">0.5554</td>
<td align="char" char=".">0.9956</td>
</tr>
<tr>
<td align="left">GWO</td>
<td align="char" char=".">59.1176</td>
<td align="char" char=".">1.8952</td>
<td align="char" char=".">0.9879</td>
<td align="char" char=".">16.8538</td>
<td align="char" char=".">3.1787</td>
<td align="char" char=".">0.9936</td>
<td align="char" char=".">41.7205</td>
<td align="char" char=".">0.9873</td>
<td align="char" char=".">3.8587</td>
<td align="char" char=".">0.5689</td>
<td align="char" char=".">0.9976</td>
</tr>
</tbody>
</table>
<table>
<thead valign="top">
<tr>
<th rowspan="2" align="left">Site</th>
<th rowspan="2" align="center">Method</th>
<th colspan="3" align="center">Weibull</th>
<th colspan="3" align="center">Gamma</th>
<th colspan="2" align="center">Rayleigh</th>
<th colspan="3" align="center">Lognormal</th>
</tr>
<tr>
<th align="center">
<italic>&#x3bb;</italic>
</th>
<th align="center">
<italic>k</italic>
</th>
<th align="center">
<italic>R</italic>
<sup>
<italic>2</italic>
</sup>
</th>
<th align="center">
<italic>&#x3b8;</italic>
</th>
<th align="center">
<italic>k</italic>
</th>
<th align="center">
<italic>R</italic>
<sup>
<italic>2</italic>
</sup>
</th>
<th align="center">
<italic>&#x3c3;</italic>
</th>
<th align="center">
<italic>R</italic>
<sup>
<italic>2</italic>
</sup>
</th>
<th align="center">
<italic>&#x3bc;</italic>
</th>
<th align="center">
<italic>&#x3c3;</italic>
</th>
<th align="center">
<italic>R</italic>
<sup>
<italic>2</italic>
</sup>
</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="2" align="left">Zhengzhou</td>
<td align="left">MLE</td>
<td align="char" char=".">71.8142</td>
<td align="char" char=".">1.4689</td>
<td align="char" char=".">0.9325</td>
<td align="char" char=".">25.1562</td>
<td align="char" char=".">2.5516</td>
<td align="char" char=".">0.9494</td>
<td align="char" char=".">57.8859</td>
<td align="char" char=".">0.7784</td>
<td align="char" char=".">3.9533</td>
<td align="char" char=".">0.6057</td>
<td align="char" char=".">0.9818</td>
</tr>
<tr>
<td align="left">GWO</td>
<td align="char" char=".">57.3570</td>
<td align="char" char=".">1.5556</td>
<td align="char" char=".">0.9779</td>
<td align="char" char=".">23.2795</td>
<td align="char" char=".">2.2547</td>
<td align="char" char=".">0.9818</td>
<td align="char" char=".">39.7569</td>
<td align="char" char=".">0.9666</td>
<td align="char" char=".">3.8587</td>
<td align="char" char=".">0.5703</td>
<td align="char" char=".">0.9879</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>Distribution results provided by MLE and GWO.</p>
</caption>
<graphic xlink:href="fenvs-09-747101-g005.tif"/>
</fig>
<p>On the basis of <xref ref-type="table" rid="T5">Table&#x20;5</xref> and <xref ref-type="fig" rid="F5">Figure&#x20;5</xref>, the GWO-based distribution evaluation yields a higher <italic>R</italic>
<sup>2</sup> value than MLE for every distribution function in both cities, which indicates that it is superior to the MLE method and suitable for fitting the distributions. The GWO-fitted distributions are therefore compared by their <italic>R</italic>
<sup>2</sup> values. As shown in <xref ref-type="table" rid="T5">Table&#x20;5</xref>, the <italic>R</italic>
<sup>2</sup> values of Weibull, Gamma, Rayleigh, and Lognormal are (0.9879, 0.9936, 0.9873, and 0.9976) in Jinan and (0.9779, 0.9818, 0.9666, and 0.9879) in Zhengzhou. The Lognormal distribution achieves the largest <italic>R</italic>
<sup>2</sup> in both cities, which means that it fits the daily PM<sub>2.5</sub> concentration data in Jinan and Zhengzhou best among the four candidates. Thus, the optimal Lognormal distribution is combined with the point forecasting results and interval forecasting theory to achieve the final interval forecasting.</p>
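<p>The goodness-of-fit comparison can be illustrated with a short sketch. How <italic>R</italic><sup>2</sup> is computed is not restated in this section, so the sketch assumes that it is measured between the empirical cumulative distribution of the sample and the fitted Lognormal cumulative distribution; a population-based optimizer such as GWO would then search for the parameters that maximize this score. The sample is synthetic, the function names are illustrative, and the parameter values 3.8587 and 0.5689 are the GWO Lognormal parameters for Jinan in <xref ref-type="table" rid="T5">Table&#x20;5</xref>.</p>
<preformat>
```python
import math
import random


def lognormal_cdf(x, mu, sigma):
    """CDF of the Lognormal distribution with log-mean mu and log-std sigma."""
    z = (math.log(x) - mu) / (sigma * math.sqrt(2.0))
    return 0.5 * (1.0 + math.erf(z))


def r_squared(observed, fitted):
    """Coefficient of determination between observed and fitted values."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - f) ** 2 for o, f in zip(observed, fitted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


def cdf_fit_r2(sample, mu, sigma):
    """R^2 between the empirical CDF of `sample` and a Lognormal CDF (assumed definition)."""
    xs = sorted(sample)
    n = len(xs)
    empirical = [(i + 0.5) / n for i in range(n)]  # mid-point plotting positions
    fitted = [lognormal_cdf(x, mu, sigma) for x in xs]
    return r_squared(empirical, fitted)


random.seed(0)
# Synthetic stand-in for a daily PM2.5 series; not the data used in the study.
sample = [random.lognormvariate(3.8587, 0.5689) for _ in range(500)]

r2_close = cdf_fit_r2(sample, mu=3.8587, sigma=0.5689)
r2_far = cdf_fit_r2(sample, mu=3.0, sigma=0.5689)
print(f"R^2 at the generating parameters: {r2_close:.4f}")
print(f"R^2 with a shifted mu: {r2_far:.4f}")
```
</preformat>
<p>A candidate parameter set that matches the data scores close to 1, whereas a poorly matched set scores much lower, which is the property an optimizer can exploit when fitting each distribution.</p>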
<p>The interval forecasting results under different significance levels are depicted in <xref ref-type="fig" rid="F6">Figure&#x20;6</xref>, and the corresponding evaluation metric values are listed in <xref ref-type="table" rid="T6">Table&#x20;6</xref>. <xref ref-type="table" rid="T6">Table&#x20;6</xref> shows that the interval forecasting performance differs between Jinan and Zhengzhou at the same significance level. For example, when &#x3b1; &#x3d; 0.30, the FINAW and FICP values are (0.6268, 100.0000%) for Jinan and (0.4695, 96.6667%) for Zhengzhou. The main reason is that the interval forecasting performance largely depends on the point forecasting performance. As the system developed achieves excellent point forecasting performance, it also achieves good interval forecasting results. Moreover, for the same dataset, the FINAW and FICP values vary with the significance level: as &#x3b1; increases, the intervals become narrower and the coverage does not increase. For instance, for the Zhengzhou dataset, the FINAW and FICP values are (0.7711, 100.0000%) for &#x3b1; &#x3d; 0.20 and (0.6097, 96.6667%) for &#x3b1; &#x3d; 0.25. In <xref ref-type="fig" rid="F6">Figure&#x20;6</xref>, the solid lines represent the actual values, and the shaded areas represent the forecasting intervals. As most of the observations fall into the shaded areas, the interval forecasting ability of the system can be considered effective. According to the above analysis, we can reasonably conclude that the system developed is a promising tool for daily PM<sub>2.5</sub> concentration interval forecasting.</p>
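<p>The two interval metrics can be sketched as follows. The definitions are assumptions based on common usage, since they are not restated in this section: FICP is taken as the percentage of observations that fall inside their forecasting interval, and FINAW as the mean interval width divided by the range of the observations. The function names and all numeric values are hypothetical.</p>
<preformat>
```python
def ficp(actual, lower, upper):
    """Coverage probability in percent: share of observations inside the interval."""
    covered = sum(
        1 for y, lo, hi in zip(actual, lower, upper) if hi >= y >= lo
    )
    return 100.0 * covered / len(actual)


def finaw(actual, lower, upper):
    """Mean interval width normalized by the range of the observations."""
    value_range = max(actual) - min(actual)
    mean_width = sum(hi - lo for lo, hi in zip(lower, upper)) / len(actual)
    return mean_width / value_range


# Hypothetical values for illustration only (not data from the study).
actual = [40.0, 55.0, 35.0, 60.0]
lower = [30.0, 45.0, 36.0, 50.0]
upper = [50.0, 65.0, 46.0, 70.0]

coverage = ficp(actual, lower, upper)
width = finaw(actual, lower, upper)
print(f"FICP={coverage:.4f}%  FINAW={width:.4f}")
```
</preformat>
<p>Under these assumed definitions, a narrower interval lowers FINAW but can also lower FICP, which is the trade-off visible across the significance levels in <xref ref-type="table" rid="T6">Table&#x20;6</xref>.</p>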
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>Results of interval forecasting in Jinan and Zhengzhou.</p>
</caption>
<graphic xlink:href="fenvs-09-747101-g006.tif"/>
</fig>
<table-wrap id="T6" position="float">
<label>TABLE 6</label>
<caption>
<p>Results of interval forecasting based on different significance levels.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th rowspan="2" align="left">&#x3b1;</th>
<th colspan="2" align="center">Jinan</th>
<th colspan="2" align="center">Zhengzhou</th>
</tr>
<tr>
<th align="center">FINAW (&#x2212;)</th>
<th align="center">FICP (%)</th>
<th align="center">FINAW (&#x2212;)</th>
<th align="center">FICP (%)</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">0.20</td>
<td align="char" char=".">1.0295</td>
<td align="char" char=".">100.0000</td>
<td align="char" char=".">0.7711</td>
<td align="char" char=".">100.0000</td>
</tr>
<tr>
<td align="left">0.25</td>
<td align="char" char=".">0.8140</td>
<td align="char" char=".">100.0000</td>
<td align="char" char=".">0.6097</td>
<td align="char" char=".">96.6667</td>
</tr>
<tr>
<td align="left">0.30</td>
<td align="char" char=".">0.6268</td>
<td align="char" char=".">100.0000</td>
<td align="char" char=".">0.4695</td>
<td align="char" char=".">96.6667</td>
</tr>
<tr>
<td align="left">0.35</td>
<td align="char" char=".">0.4574</td>
<td align="char" char=".">100.0000</td>
<td align="char" char=".">0.3426</td>
<td align="char" char=".">93.3333</td>
</tr>
<tr>
<td align="left">0.40</td>
<td align="char" char=".">0.2994</td>
<td align="char" char=".">96.6667</td>
<td align="char" char=".">0.2243</td>
<td align="char" char=".">93.3333</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>(&#x2212;) indicates no measurement&#x20;unit.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
</sec>
<sec sec-type="conclusion" id="s5">
<title>Conclusion</title>
<p>Forecasting air pollution is not only a challenging research topic but also a growing concern. To model and forecast the complex PM<sub>2.5</sub> concentration time series, this study develops a novel outlier-robust point and interval forecasting system that attempts to mitigate or solve some of the challenges in current studies. In the system developed, the point forecasting part provides deterministic information on future daily PM<sub>2.5</sub> concentration, whereas the interval forecasting part provides information on its uncertainty. More specifically, the point forecasting part comprises three modules: the data preprocessing module, the outlier-robust forecasting module, and the nonlinear correction module. The data preprocessing module, which accounts for the negative influence of outliers and noise on model development, combines outlier handling and data decomposition to address both issues simultaneously. To obtain a forecasting model with outlier robustness, the ORELM model is selected for the outlier-robust forecasting module to forecast each mode. The nonlinear correction module is developed based on an error ensemble strategy, which can mine information in the forecasting results and further improve the model&#x2019;s forecasting performance. The interval forecasting part is developed based on artificial intelligence&#x2013;based distribution evaluation and interval forecasting theory and is combined with the point forecasting results to obtain the range of future changes. The experimental results illustrate that the system developed not only performs better than the compared models in point forecasting but also provides information on future uncertainty.
Moreover, beyond daily PM<sub>2.5</sub> concentration forecasting, the system developed could also be applied to other challenging problems, including energy forecasting, economic forecasting, and financial forecasting.</p>
<p>Although the system developed performs well in daily PM<sub>2.5</sub> concentration forecasting, some limitations remain to be addressed in future research. For example, features not considered in this study may make the forecasting more reliable and practical and further improve the forecasting performance, which is a promising subject for future studies. Moreover, the system developed conducts only 1-day-ahead forecasting; multi-day-ahead forecasting is of great importance but remains a challenging task. Multi-day-ahead point and interval forecasting for PM<sub>2.5</sub> concentration is therefore an important direction for future research.</p>
</sec>
</body>
<back>
<sec id="s6">
<title>Data Availability Statement</title>
<p>Publicly available datasets were analyzed in this study. This data can be found here: China Air Quality Online Monitoring and Analysis platform (<ext-link ext-link-type="uri" xlink:href="https://www.aqistudy.cn">https://www.aqistudy.cn</ext-link>).</p>
</sec>
<sec id="s7">
<title>Author Contributions</title>
<p>ZY and XF conducted the research.</p>
</sec>
<sec sec-type="COI-statement" id="s8">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s9">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ack>
<p>The authors are grateful to the reviewers and the editor for their constructive comments and suggestions for this&#x20;paper.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Andrade</surname>
<given-names>M. d. F.</given-names>
</name>
<name>
<surname>Ynoue</surname>
<given-names>R. Y.</given-names>
</name>
<name>
<surname>Freitas</surname>
<given-names>E. D.</given-names>
</name>
<name>
<surname>Todesco</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Vara Vela</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Ibarra</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>Air Quality Forecasting System for Southeastern Brazil</article-title>. <source>Front. Environ. Sci.</source> <volume>3</volume>, <fpage>9</fpage>. <pub-id pub-id-type="doi">10.3389/fenvs.2015.00009</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bai</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Xie</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Air Pollutants Concentrations Forecasting Using Back Propagation Neural Network Based on Wavelet Decomposition with Meteorological Conditions</article-title>. <source>Atmos. Pollut. Res.</source> <volume>7</volume>, <fpage>557</fpage>&#x2013;<lpage>566</lpage>. <pub-id pub-id-type="doi">10.1016/j.apr.2016.01.004</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bai</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zeng</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>An Ensemble Long Short-Term Memory Neural Network for Hourly PM<sub>2.5</sub> Concentration Forecasting</article-title>. <source>Chemosphere</source> <volume>222</volume>, <fpage>286</fpage>&#x2013;<lpage>294</lpage>. <pub-id pub-id-type="doi">10.1016/j.chemosphere.2019.01.121</pub-id> </citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cheng</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Hybrid Algorithm for Short-Term Forecasting of PM<sub>2.5</sub> in China</article-title>. <source>Atmos. Environ.</source> <volume>200</volume>, <fpage>264</fpage>&#x2013;<lpage>279</lpage>. <pub-id pub-id-type="doi">10.1016/j.atmosenv.2018.12.025</pub-id> </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dragomiretskiy</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Zosso</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Variational Mode Decomposition</article-title>. <source>IEEE Trans. Signal. Process.</source> <volume>62</volume>, <fpage>531</fpage>&#x2013;<lpage>544</lpage>. <pub-id pub-id-type="doi">10.1109/TSP.2013.2288675</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Du</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Hao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Niu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>W.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>A Novel Hybrid Model Based on Multi-Objective Harris Hawks Optimization Algorithm for Daily PM<sub>2.5</sub> and PM<sub>10</sub> Forecasting</article-title>. <source>Appl. Soft Comput.</source> <volume>96</volume>, <fpage>106620</fpage>. <pub-id pub-id-type="doi">10.1016/j.asoc.2020.106620</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fan</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Lan</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Chan</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Hong</surname>
<given-names>Y.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>Process Analysis of Regional Aerosol Pollution During Spring in the Pearl River Delta Region, China</article-title>. <source>Atmos. Environ.</source> <volume>122</volume>, <fpage>829</fpage>&#x2013;<lpage>838</lpage>. <pub-id pub-id-type="doi">10.1016/j.atmosenv.2015.09.013</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Feng</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Hou</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Jin</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Artificial Neural Networks Forecasting of PM<sub>2.5</sub> Pollution Using Air Mass Trajectory Based Geographic Model and Wavelet Transformation</article-title>. <source>Atmos. Environ.</source> <volume>107</volume>, <fpage>118</fpage>&#x2013;<lpage>128</lpage>. <pub-id pub-id-type="doi">10.1016/j.atmosenv.2015.02.030</pub-id> </citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hampel</surname>
<given-names>F. R.</given-names>
</name>
</person-group> (<year>1974</year>). <article-title>The Influence Curve and its Role in Robust Estimation</article-title>. <source>J.&#x20;Am. Stat. Assoc.</source> <volume>69</volume>, <fpage>383</fpage>&#x2013;<lpage>393</lpage>. <pub-id pub-id-type="doi">10.1080/01621459.1974.10482962</pub-id> </citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Niu</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Impacts of Haze Pollution on China&#x27;s Tourism Industry: A System of Economic Loss Analysis</article-title>. <source>J.&#x20;Environ. Manage.</source> <volume>295</volume>, <fpage>113051</fpage>. <pub-id pub-id-type="doi">10.1016/j.jenvman.2021.113051</pub-id> </citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Tian</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>The Study and Application of a Novel Hybrid System for Air Quality Early-Warning</article-title>. <source>Appl. Soft Comput.</source> <volume>74</volume>, <fpage>729</fpage>&#x2013;<lpage>746</lpage>. <pub-id pub-id-type="doi">10.1016/j.asoc.2018.09.005</pub-id> </citation>
</ref>
<ref id="B9">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Huang</surname>
<given-names>G. B.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>Q. Y.</given-names>
</name>
<name>
<surname>Siew</surname>
<given-names>C. K.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Extreme Learning Machine: A&#x20;New Learning Scheme of Feedforward Neural Networks</article-title>. <conf-name>IEEE International Conference on Neural Networks</conf-name>. <pub-id pub-id-type="doi">10.1109/IJCNN.2004.1380068</pub-id> </citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jiang</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>An Innovative Hybrid Air Pollution Early-Warning System Based on Pollutants Forecasting and Extenics Evaluation</article-title>. <source>Knowledge-Based Syst.</source> <volume>164</volume>, <fpage>174</fpage>&#x2013;<lpage>192</lpage>. <pub-id pub-id-type="doi">10.1016/j.knosys.2018.10.036</pub-id> </citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>He</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Gao</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Zeng</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>The Impact of Air Pollution on Corporate Cash Holdings</article-title>. <source>Borsa Istanbul Rev.</source> <volume>21</volume>, <fpage>S90</fpage>&#x2013;<lpage>S98</lpage>. <pub-id pub-id-type="doi">10.1016/j.bir.2021.04.007</pub-id> </citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Lei</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Yan</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>The Health Economic Loss of Fine Particulate Matter (PM<sub>2.5</sub>) in Beijing</article-title>. <source>J.&#x20;Clean. Prod.</source> <volume>161</volume>, <fpage>1153</fpage>&#x2013;<lpage>1161</lpage>. <pub-id pub-id-type="doi">10.1016/j.jclepro.2017.05.029</pub-id> </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lie&#xdf;</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Gebauer</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Don</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Machine Learning with GA Optimization to Model the Agricultural Soil-Landscape of Germany: An Approach Involving Soil Functional Types with Their Multivariate Parameter Distributions Along the Depth Profile</article-title>. <source>Front. Environ. Sci.</source> <volume>9</volume>, <fpage>692959</fpage>. <pub-id pub-id-type="doi">10.3389/fenvs.2021.692959</pub-id> </citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lin</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Luo</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>An Ensemble Model Based on Machine Learning Methods and Data Preprocessing for Short-Term Electric Load Forecasting</article-title>. <source>Energies</source> <volume>10</volume>, <fpage>1186</fpage>. <pub-id pub-id-type="doi">10.3390/en10081186</pub-id> </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Prediction of Outdoor PM<sub>2.5</sub> Concentrations Based on a Three-Stage Hybrid Neural Network Model</article-title>. <source>Atmos. Pollut. Res.</source> <volume>11</volume>, <fpage>469</fpage>&#x2013;<lpage>481</lpage>. <pub-id pub-id-type="doi">10.1016/j.apr.2019.11.019</pub-id> </citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Jin</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Duan</surname>
<given-names>Z.</given-names>
</name>
</person-group> (<year>2019b</year>). <article-title>Air PM<sub>2.5</sub> Concentration Multi-step Forecasting Using a New Hybrid Modeling Method: Comparing Cases for Four Cities in China</article-title>. <source>Atmos. Pollut. Res.</source> <volume>10</volume>, <fpage>1588</fpage>&#x2013;<lpage>1600</lpage>. <pub-id pub-id-type="doi">10.1016/j.apr.2019.05.007</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Shah</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>W.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>On-line Outlier Detection and Data Cleaning</article-title>. <source>Comput. Chem. Eng.</source> <volume>28</volume>, <fpage>1635</fpage>&#x2013;<lpage>1647</lpage>. <pub-id pub-id-type="doi">10.1016/j.compchemeng.2004.01.009</pub-id> </citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2019a</year>). <article-title>Improved Pollution Forecasting Hybrid Algorithms Based on the Ensemble Method</article-title>. <source>Appl. Math. Model.</source> <volume>73</volume>, <fpage>473</fpage>&#x2013;<lpage>486</lpage>. <pub-id pub-id-type="doi">10.1016/j.apm.2019.04.032</pub-id> </citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Luo</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Yue</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Research and Application of a Novel Hybrid Decomposition-Ensemble Learning Paradigm with Error Correction for Daily PM<sub>10</sub> Forecasting</article-title>. <source>Atmos. Res.</source> <volume>201</volume>, <fpage>34</fpage>&#x2013;<lpage>45</lpage>. <pub-id pub-id-type="doi">10.1016/j.atmosres.2017.10.009</pub-id> </citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Prakash</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>U.</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Jain</surname>
<given-names>V. K.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>A Wavelet-Based Neural Network Model to Predict Ambient Air Pollutants&#x27; Concentration</article-title>. <source>Environ. Model. Assess.</source> <volume>16</volume>, <fpage>503</fpage>&#x2013;<lpage>517</lpage>. <pub-id pub-id-type="doi">10.1007/s10666-011-9270-6</pub-id> </citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Schwarz</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Hoffmann</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Knapp</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Strauch</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Synergies or Trade-Offs? Optimizing a Virtual Urban Region to Foster Plant Species Richness, Climate Regulation, and Compactness Under Varying Landscape Composition</article-title>. <source>Front. Environ. Sci.</source> <volume>8</volume>, <fpage>16</fpage>. <pub-id pub-id-type="doi">10.3389/fenvs.2020.00016</pub-id> </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sharma</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Deo</surname>
<given-names>R. C.</given-names>
</name>
<name>
<surname>Prasad</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Parisi</surname>
<given-names>A. V.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>A Hybrid Air Quality Early-Warning Framework: An Hourly Forecasting Model with Online Sequential Extreme Learning Machines and Empirical Mode Decomposition Algorithms</article-title>. <source>Sci. Total Environ.</source> <volume>709</volume>, <fpage>135934</fpage>. <pub-id pub-id-type="doi">10.1016/j.scitotenv.2019.135934</pub-id> </citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Siwek</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Osowski</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Improving the Accuracy of Prediction of PM<sub>10</sub> Pollution by the Wavelet Transformation and an Ensemble of Neural Predictors</article-title>. <source>Eng. Appl. Artif. Intelligence</source> <volume>25</volume>, <fpage>1246</fpage>&#x2013;<lpage>1258</lpage>. <pub-id pub-id-type="doi">10.1016/j.engappai.2011.10.013</pub-id> </citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Stern</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Builtjes</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Schaap</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Timmermans</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Vautard</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Hodzic</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2008</year>). <article-title>A Model Inter-comparison Study Focussing on Episodes with Elevated PM<sub>10</sub> Concentrations</article-title>. <source>Atmos. Environ.</source> <volume>42</volume>, <fpage>4567</fpage>&#x2013;<lpage>4588</lpage>. <pub-id pub-id-type="doi">10.1016/j.atmosenv.2008.01.068</pub-id> </citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sun</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Daily PM<sub>2.5</sub> Concentration Prediction Based on Principal Component Analysis and LSSVM Optimized by Cuckoo Search Algorithm</article-title>. <source>J.&#x20;Environ. Manage.</source> <volume>188</volume>, <fpage>144</fpage>&#x2013;<lpage>152</lpage>. <pub-id pub-id-type="doi">10.1016/j.jenvman.2016.12.011</pub-id> </citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Timmermans</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Kranenburg</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Manders</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Hendriks</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Segers</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Dammers</surname>
<given-names>E.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Source Apportionment of PM<sub>2.5</sub> Across China Using LOTOS-EUROS</article-title>. <source>Atmos. Environ.</source> <volume>164</volume>, <fpage>370</fpage>&#x2013;<lpage>386</lpage>. <pub-id pub-id-type="doi">10.1016/j.atmosenv.2017.06.003</pub-id> </citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vlachogianni</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Kassomenos</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Karppinen</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Karakitsios</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Kukkonen</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Evaluation of a Multiple Regression Model for the Forecasting of the Concentrations of NO<sub>x</sub> and PM<sub>10</sub> in Athens and Helsinki</article-title>. <source>Sci. Total Environ.</source> <volume>409</volume>, <fpage>1559</fpage>&#x2013;<lpage>1571</lpage>. <pub-id pub-id-type="doi">10.1016/j.scitotenv.2010.12.040</pub-id> </citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vukicevic</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>1991</year>). <article-title>Nonlinear and Linear Evolution of Initial Forecast Errors</article-title>. <source>Mon. Weather Rev.</source> <volume>119</volume>, <fpage>1602</fpage>&#x2013;<lpage>1611</lpage>. <pub-id pub-id-type="doi">10.1175/1520-0493(1991)119&#x3c;1602:NALEOI&#x3e;2.0.CO</pub-id> </citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>J.-Z.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>The Study and Application of a Novel Hybrid Forecasting Model - A Case Study of Wind Speed Forecasting in China</article-title>. <source>Appl. Energ.</source> <volume>143</volume>, <fpage>472</fpage>&#x2013;<lpage>488</lpage>. <pub-id pub-id-type="doi">10.1016/j.apenergy.2015.01.038</pub-id> </citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Du</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Hao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Ma</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Niu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>W.</given-names>
</name>
</person-group> (<year>2020a</year>). <article-title>An Innovative Hybrid Model Based on Outlier Detection and Correction Algorithm and Heuristic Intelligent Optimization Algorithm for Daily Air Quality Index Forecasting</article-title>. <source>J.&#x20;Environ. Manage.</source> <volume>255</volume>, <fpage>109855</fpage>. <pub-id pub-id-type="doi">10.1016/j.jenvman.2019.109855</pub-id> </citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Du</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Niu</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2020b</year>). <article-title>Outlier-robust Hybrid Electricity Price Forecasting Model for Electricity Market Management</article-title>. <source>J.&#x20;Clean. Prod.</source> <volume>249</volume>, <fpage>119318</fpage>. <pub-id pub-id-type="doi">10.1016/j.jclepro.2019.119318</pub-id> </citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wendel</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Air Pollution Ranks as Largest Health Risk</article-title>. <source>Eos Trans. AGU</source> <volume>95</volume>, <fpage>120</fpage>. <pub-id pub-id-type="doi">10.1002/2014eo140007</pub-id> </citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname>
<given-names>D.-C.</given-names>
</name>
<name>
<surname>Bahrami Asl</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Razban</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Air Compressor Load Forecasting Using Artificial Neural Network</article-title>. <source>Expert Syst. Appl.</source> <volume>168</volume>, <fpage>114209</fpage>. <pub-id pub-id-type="doi">10.1016/j.eswa.2020.114209</pub-id> </citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2019b</year>). <article-title>A Novel Optimal-Hybrid Model for Daily Air Quality Index Prediction Considering Air Pollutant Factors</article-title>. <source>Sci. Total Environ.</source> <volume>683</volume>, <fpage>808</fpage>&#x2013;<lpage>821</lpage>. <pub-id pub-id-type="doi">10.1016/j.scitotenv.2019.05.288</pub-id> </citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2019a</year>). <article-title>Daily Urban Air Quality Index Forecasting Based on Variational Mode Decomposition, Sample Entropy and LSTM Neural Network</article-title>. <source>Sustain. Cities Soc.</source> <volume>50</volume>, <fpage>101657</fpage>. <pub-id pub-id-type="doi">10.1016/j.scs.2019.101657</pub-id> </citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xing</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Yue</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Xiang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Shi</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>A Deep Belief Network Combined with Modified Grey Wolf Optimization Algorithm for PM<sub>2.5</sub> Concentration Prediction</article-title>. <source>Appl. Sci.</source> <volume>9</volume>, <fpage>3765</fpage>. <pub-id pub-id-type="doi">10.3390/app9183765</pub-id> </citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Air Quality Early-Warning System for Cities in China</article-title>. <source>Atmos. Environ.</source> <volume>148</volume>, <fpage>239</fpage>&#x2013;<lpage>257</lpage>. <pub-id pub-id-type="doi">10.1016/j.atmosenv.2016.10.046</pub-id> </citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Niu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Du</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2019a</year>). <article-title>Hybrid Wind Energy Forecasting and Analysis System Based on Divide and Conquer Scheme: A Case Study in China</article-title>. <source>J.&#x20;Clean. Prod.</source> <volume>222</volume>, <fpage>942</fpage>&#x2013;<lpage>959</lpage>. <pub-id pub-id-type="doi">10.1016/j.jclepro.2019.03.036</pub-id> </citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Niu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Du</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2019b</year>). <article-title>A Hybrid Forecasting System Based on a Dual Decomposition Strategy and Multi-Objective Optimization for Electricity Price Forecasting</article-title>. <source>Appl. Energ.</source> <volume>235</volume>, <fpage>1205</fpage>&#x2013;<lpage>1225</lpage>. <pub-id pub-id-type="doi">10.1016/j.apenergy.2018.11.034</pub-id> </citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>A New Air Quality Monitoring and Early Warning System: Air Quality Assessment and Air Pollutant Concentration Prediction</article-title>. <source>Environ. Res.</source> <volume>158</volume>, <fpage>105</fpage>&#x2013;<lpage>117</lpage>. <pub-id pub-id-type="doi">10.1016/j.envres.2017.06.002</pub-id> </citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zafra</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>&#xc1;ngel</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Torres</surname>
<given-names>E.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>ARIMA Analysis of the Effect of Land Surface Coverage on PM<sub>10</sub> Concentrations in a High-Altitude Megacity</article-title>. <source>Atmos. Pollut. Res.</source> <volume>8</volume>, <fpage>660</fpage>&#x2013;<lpage>668</lpage>. <pub-id pub-id-type="doi">10.1016/j.apr.2017.01.002</pub-id> </citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Luo</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Outlier-robust Extreme Learning Machine for Regression Problems</article-title>. <source>Neurocomputing</source> <volume>151</volume>, <fpage>1519</fpage>&#x2013;<lpage>1527</lpage>. <pub-id pub-id-type="doi">10.1016/j.neucom.2014.09.022</pub-id> </citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Xue</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Gong</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Gao</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Process Analysis of PM<sub>2.5</sub> Pollution Events in a Coastal City of China Using CMAQ</article-title>. <source>J.&#x20;Environ. Sci.</source> <volume>79</volume>, <fpage>225</fpage>&#x2013;<lpage>238</lpage>. <pub-id pub-id-type="doi">10.1016/j.jes.2018.09.007</pub-id> </citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhu</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Lian</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Hu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Che</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Daily Air Quality Index Forecasting with Hybrid Models: A Case in China</article-title>. <source>Environ. Pollut.</source> <volume>231</volume>, <fpage>1232</fpage>&#x2013;<lpage>1244</lpage>. <pub-id pub-id-type="doi">10.1016/j.envpol.2017.08.069</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>