<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Energy Res.</journal-id>
<journal-title>Frontiers in Energy Research</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Energy Res.</abbrev-journal-title>
<issn pub-type="epub">2296-598X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">852349</article-id>
<article-id pub-id-type="doi">10.3389/fenrg.2022.852349</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Energy Research</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Multivariate Time Series Prediction for Loss of Coolant Accidents With a Zigmoid-Based LSTM</article-title>
<alt-title alt-title-type="left-running-head">Gong et al.</alt-title>
<alt-title alt-title-type="right-running-head">Multivariate LOCA Prediction With zLSTM</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Gong</surname>
<given-names>Shanshan</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/1682893/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Yang</surname>
<given-names>Suyuan</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/1721548/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>She</surname>
<given-names>Jingke</given-names>
</name>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1222596/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Li</surname>
<given-names>Weiqi</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/1721571/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Lu</surname>
<given-names>Shaofei</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/1282139/overview"/>
</contrib>
</contrib-group>
<aff>
<institution>College of Computer Science and Electronic Engineering</institution>, <institution>Hunan University</institution>, <addr-line>Changsha</addr-line>, <country>China</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/525670/overview">Xianping Zhong</ext-link>, University of Pittsburgh, United States</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1320941/overview">Guang Hu</ext-link>, Karlsruhe Institute of Technology (KIT), Germany</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1588103/overview">Sai Zhang</ext-link>, Idaho National Laboratory (DOE), United States</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Jingke She, <email>shejingke@hnu.edu.cn</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Nuclear Energy, a section of the journal Frontiers in Energy Research</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>12</day>
<month>04</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>10</volume>
<elocation-id>852349</elocation-id>
<history>
<date date-type="received">
<day>11</day>
<month>01</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>28</day>
<month>02</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2022 Gong, Yang, She, Li and Lu.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Gong, Yang, She, Li and Lu</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>Post-LOCA prediction is of safety significance to NPP, but requires a processing coverage of non-linearity, both short and long-term memory, and multiple system parameters. To enable an ability promotion of previous LOCA prediction models, a new gate function called zigmoid is introduced and embedded to the traditional long short-term memory (LSTM) model. The newly constructed zigmoid-based LSTM (zLSTM) amplifies the gradient at the far end of the time series, which enhances the long-term memory without weakening the short-term one. Multiple system parameters are integrated into a 12-dimension input vector to the zLSTM for a comprehensive consideration based on which the LOCA prediction can be accurately generated. Experimental results show both accuracy evaluations and LOCA progression produced by the proposed zLSTM, and two baseline methods demonstrating the superiority of applying zLSTM to LCOA predictions.</p>
</abstract>
<kwd-group>
<kwd>LOCA</kwd>
<kwd>prediction</kwd>
<kwd>multivariate time series</kwd>
<kwd>zigmoid</kwd>
<kwd>LSTM</kwd>
</kwd-group>
<contract-num rid="cn001">2020YFB1713400</contract-num>
<contract-num rid="cn002">TC19084DY</contract-num>
<contract-sponsor id="cn001">Ministry of Science and Technology of the People&#x2019;s Republic of China<named-content content-type="fundref-id">10.13039/501100002855</named-content>
</contract-sponsor>
<contract-sponsor id="cn002">Ministry of Industry and Information Technology of the People&#x2019;s Republic of China<named-content content-type="fundref-id">10.13039/501100006579</named-content>
</contract-sponsor>
</article-meta>
</front>
<body>
<sec id="s1">
<title>1 Introduction</title>
<p>Loss of coolant accident (LOCA) is a severe accident that causes safety threat to nuclear power plants (NPPs). Obviously, it is of great importance to systematically analyze, prevent, and predict LOCAs such that effective decision-making support can be offered to the emergency response strategy. The prediction of the LOCA progression trends, as one of the significant emergency measures, provides evaluation of safety threats ahead of their physical occurrence and allows the emergency response strategy to plan accordingly before worse scenarios emerge. However, the non-linearity of LOCAs and associated complex system factors prevent accurate LOCA predictions. As a coupling result influenced by multiple system parameters, the prediction for LOCA progression also faces multivariant processing challenges, which makes the system modeling more complicated.</p>
<p>In the past decades, various attempts have been taken for process predictions in NPPs. A series of assumptions based on statistical methods and mathematical equations are applied for process predictions such as 1) monitoring the real-time condition of LOCA <italic>via</italic> time&#x2013;frequency domain reflectometry (TFDR) (<xref ref-type="bibr" rid="B7">Lee et al., 2017</xref>) and 2) using RELAP5/MOD3.3 code to predict the LOCA of the main stream break on generation III reactor (<xref ref-type="bibr" rid="B25">Yang et al., 2019</xref>). The aforementioned research studies rely on effort-consuming system modeling and have made feasible progress on LOCA prediction, but the challenges of multivariate processing/coupling remain for further investigation.</p>
<p>Using data-based artificial intelligence (AI) approaches has become an effective way to solve the non-linearity problem with the progress of machine learning, especially when enormous simulated NPP data from previous research studies have founded a firm database for AI applications.</p>
<p>A variety of traditional machine learning algorithms have been applied to NPPs. An abnormal operation state detection method of NPP based on an unsupervised deep generative model is established by using variational auto encoders (VAE) and isolation forest (iForest) (<xref ref-type="bibr" rid="B8">Li et al., 2021</xref>). <xref ref-type="bibr" rid="B12">Moshkbar-Bakhshayesh and Ghafari (2022</xref>) used support vector machine (SVM) as a machine learning&#x2013;based method to predict the vessel water level. <xref ref-type="bibr" rid="B24">Xiang et al. (2020</xref>) proposed a clustering algorithm for the transient detection in NPPs. Furthermore, <xref ref-type="bibr" rid="B22">Wang et al. (2021a</xref>) utilized the clustering algorithm together with SVM and principal component analysis (PCA) for the sensor anomalies in NPPs, which is also reviewed in <xref ref-type="bibr" rid="B4">Hu et al. (2021</xref>).</p>
<p>By stacking multiple hidden layers, deep neural network (DNN) has stronger non-linear feature extraction ability. It was utilized to predict the vessel water level (<xref ref-type="bibr" rid="B6">Koo et al., 2018</xref>) as well as to identify the fault diagnosis scheme (<xref ref-type="bibr" rid="B16">Santos et al., 2019</xref>).</p>
<p>Convolutional neural network (CNN) is a variant of DNN and is usually used for image processing. Viewing NPP sensor data as images, CNN was applied to event identification (<xref ref-type="bibr" rid="B10">Lin et al., 2021</xref>; <xref ref-type="bibr" rid="B13">Pantera et al., 2021</xref>) and break size estimation (<xref ref-type="bibr" rid="B9">Lin et al., 2022</xref>). The mentioned traditional machine learning algorithms (SVM) and deep learning methods (DNN and CNN) can deal with non-linearity, while the sequential data-dependency and multiple physical factors are not taken into account.</p>
<p>Recurrent neural network (RNN), as a classical example, has been successfully applied to sequential data modeling in former explorations. Several long short-term memory (LSTM)&#x2013;based models cover both the non-linearity and time correlation of LOCAs. For example, the LSTM-based expert system was adopted to predict LOCA behaviors (<xref ref-type="bibr" rid="B11">Mira et al., 2020</xref>; <xref ref-type="bibr" rid="B15">Santhosh et al., 2010</xref>; <xref ref-type="bibr" rid="B1">Chen et al., 2021</xref>) and to evaluate abnormal operation conditions in NPPs (<xref ref-type="bibr" rid="B17">She et al., 2020</xref>; <xref ref-type="bibr" rid="B23">Wang et al., 2021b</xref>). The coolant flowrate variation was analyzed by <xref ref-type="bibr" rid="B18">She et al. (2021</xref>) using a combination of CNN and LSTM. PCA and LSTM were used to identify the fault diagnosis scheme <xref ref-type="bibr" rid="B14">Saeed et al. (2020</xref>).</p>
<p>It is necessary to consider modeling non-linearity, multivariate processing, and long-term memory for accurate prediction of LOCA. The aforementioned literatures ignored that LSTM cannot model longer time series. To fully cover the non-linearity, time correlation, and multivariate processing for LOCA predictions, this study proposes an improved LSTM model in which a new gate function called &#x2018;zigmoid&#x2019; is constructed. With rigorous experimental verifications conducted on simulated LOCA datasets, the zLSTM is proved to be more accurate and efficient for post-LOCA predictions.</p>
<p>This article starts with <xref ref-type="sec" rid="s1">Section 1</xref> as the introduction and illustrates the zigmoid function in <xref ref-type="sec" rid="s2">Section 2</xref>. After the presentation of the verification experiments in <xref ref-type="sec" rid="s3">Section 3</xref>, this article is then concluded in <xref ref-type="sec" rid="s4">Section 4</xref>.</p>
</sec>
<sec id="s2">
<title>2 Zigmoid Method</title>
<sec id="s2-1">
<title>2.1 Zigmoid for Better Long-Term Memory</title>
<p>Established for the sequential processing problems, RNN obtained preliminary short-term memory. To enable the long-term memory, <xref ref-type="bibr" rid="B3">Hochreiter and Schmidhuber (1997</xref>) made a gate-level innovation on RNN and created LSTM that is capable for both short and long-term processing. Nevertheless, the contribution of <italic>x</italic>
<sub>
<italic>t</italic>
</sub> in LSTM will decay in k timesteps by <inline-formula id="inf1">
<mml:math id="m1">
<mml:msubsup>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula> when <italic>f</italic>
<sub>
<italic>t</italic>
</sub> is a constant (<xref ref-type="bibr" rid="B20">Tallec and Ollivier, 2018</xref>). This gives the unit an effective decay period timescale of <inline-formula id="inf2">
<mml:math id="m2">
<mml:mi>O</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>. In case the LSTM is required to have a memory of 1,000 steps, the forget gate parameter <italic>f</italic>
<sub>
<italic>t</italic>
</sub> has to reach 0.999 since the effective decay period timescale &#x3d; 1/(1 &#x2212; <italic>f</italic>
<sub>
<italic>t</italic>
</sub>). However, it is very challeging to achieve this, according to the following analysis of LSTM.</p>
<p>The standard LSTM process is defined as follows:<disp-formula id="e1">
<mml:math id="m3">
<mml:msub>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>s</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>g</mml:mi>
<mml:mi>m</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>d</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>b</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:math>
<label>(1)</label>
</disp-formula>
<disp-formula id="e2">
<mml:math id="m4">
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="bold-italic">f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="bold-italic">t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi mathvariant="bold-italic">s</mml:mi>
<mml:mi mathvariant="bold-italic">i</mml:mi>
<mml:mi mathvariant="bold-italic">g</mml:mi>
<mml:mi mathvariant="bold-italic">m</mml:mi>
<mml:mi mathvariant="bold-italic">o</mml:mi>
<mml:mi mathvariant="bold-italic">i</mml:mi>
<mml:mi mathvariant="bold-italic">d</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="bold-italic">W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="bold-italic">f</mml:mi>
<mml:mi mathvariant="bold-italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="bold-italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="bold-italic">t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="bold-italic">W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="bold-italic">f</mml:mi>
<mml:mi mathvariant="bold-italic">h</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="bold-italic">h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="bold-italic">t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn mathvariant="bold-italic">1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="bold-italic">b</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="bold-italic">f</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:math>
<label>(2)</label>
</disp-formula>
<disp-formula id="e3">
<mml:math id="m5">
<mml:msub>
<mml:mrow>
<mml:mi>o</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>s</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>g</mml:mi>
<mml:mi>m</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>d</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>o</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>o</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>b</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>o</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:math>
<label>(3)</label>
</disp-formula>
<disp-formula id="e4">
<mml:math id="m6">
<mml:msub>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi>c</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">&#x302;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>tanh</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>b</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>c</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:math>
<label>(4)</label>
</disp-formula>
<disp-formula id="e5">
<mml:math id="m7">
<mml:msub>
<mml:mrow>
<mml:mi>c</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2299;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>c</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2299;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi>c</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">&#x302;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
</mml:math>
<label>(5)</label>
</disp-formula>
<disp-formula id="e6">
<mml:math id="m8">
<mml:msub>
<mml:mrow>
<mml:mi>h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>o</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2299;</mml:mo>
<mml:mi>tanh</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>c</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:math>
<label>(6)</label>
</disp-formula>where <italic>W</italic>
<sub>
<italic>ix</italic>
</sub>, <italic>W</italic>
<sub>
<italic>ih</italic>
</sub>, <italic>b</italic>
<sub>
<italic>i</italic>
</sub>, <italic>W</italic>
<sub>
<italic>fx</italic>
</sub>, <italic>W</italic>
<sub>
<italic>fh</italic>
</sub>, <italic>b</italic>
<sub>
<italic>f</italic>
</sub>, <italic>W</italic>
<sub>
<italic>ox</italic>
</sub>, <italic>W</italic>
<sub>
<italic>oh</italic>
</sub>, <italic>b</italic>
<sub>
<italic>o</italic>
</sub>, <italic>W</italic>
<sub>
<italic>cx</italic>
</sub>, <italic>W</italic>
<sub>
<italic>ch</italic>
</sub>, and <italic>b</italic>
<sub>
<italic>c</italic>
</sub> are trainable parameters.</p>
<p>However, the derivative of sigmoid is 0.000999 when <italic>f</italic>
<sub>
<italic>t</italic>
</sub> &#x3d; 0.999, as shown in <xref ref-type="fig" rid="F1">Figure 1</xref>, which causes LSTM to be untrainable at this stage. In other words, the LSTM&#x2019;s long-term memory ability is weakened at the far end of the time series and cannot guarantee accurate prediction in LOCAs.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Sigmoid properties.</p>
</caption>
<graphic xlink:href="fenrg-10-852349-g001.tif"/>
</fig>
<p>Since the sigmoid function in the forget gate determines the long-term memory of LSTM, a natural idea is to amplify the derivative of the sigmoid function such that model training is still feasible for LSTM even when <italic>f</italic>
<sub>
<italic>t</italic>
</sub> reaches 0.999. For this purpose, zigmoid is constructed by embedding a transfer function within the original sigmoid.<disp-formula id="e7">
<mml:math id="m9">
<mml:mi>z</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>g</mml:mi>
<mml:mi>m</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>d</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:mfenced>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>s</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>g</mml:mi>
<mml:mi>m</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>d</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>n</mml:mi>
<mml:mi>s</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mfenced>
</mml:math>
<label>(7)</label>
</disp-formula>
<disp-formula id="e8">
<mml:math id="m10">
<mml:mi>t</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>n</mml:mi>
<mml:mi>s</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:mfenced>
<mml:mo>&#x3d;</mml:mo>
<mml:mfenced open="{" close="">
<mml:mrow>
<mml:mtable class="cases">
<mml:mtr>
<mml:mtd columnalign="left">
<mml:msup>
<mml:mrow>
<mml:mi>e</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msup>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mspace width="1em"/>
</mml:mtd>
<mml:mtd columnalign="left">
<mml:mi>x</mml:mi>
<mml:mo>&#x2265;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="left">
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi>e</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>&#x3b2;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msup>
<mml:mspace width="1em"/>
</mml:mtd>
<mml:mtd columnalign="left">
<mml:mi>x</mml:mi>
<mml:mo>&#x3c;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mfenced>
</mml:math>
<label>(8)</label>
</disp-formula>where <italic>&#x3b2;</italic> is a hyper-parameter.</p>
<p>The derivative of zigmoid is<disp-formula id="e9">
<mml:math id="m11">
<mml:mfrac>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>z</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>g</mml:mi>
<mml:mi>m</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>d</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>s</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>g</mml:mi>
<mml:mi>m</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>d</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>t</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>n</mml:mi>
<mml:mi>s</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mfrac>
<mml:mo>&#x2a;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>t</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>n</mml:mi>
<mml:mi>s</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:math>
<label>(9)</label>
</disp-formula>
<disp-formula id="e10">
<mml:math id="m12">
<mml:mfrac>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>t</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>n</mml:mi>
<mml:mi>s</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mo>&#x3d;</mml:mo>
<mml:mfenced open="{" close="">
<mml:mrow>
<mml:mtable class="cases">
<mml:mtr>
<mml:mtd columnalign="left">
<mml:mi>&#x3b2;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi>e</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msup>
<mml:mspace width="1em"/>
</mml:mtd>
<mml:mtd columnalign="left">
<mml:mi>x</mml:mi>
<mml:mo>&#x2265;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="left">
<mml:mi>&#x3b2;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi>e</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>&#x3b2;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msup>
<mml:mspace width="1em"/>
</mml:mtd>
<mml:mtd columnalign="left">
<mml:mi>x</mml:mi>
<mml:mo>&#x3c;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mfenced>
</mml:math>
<label>(10)</label>
</disp-formula>
</p>
<p>When <italic>&#x3b2;</italic> is a large value, that is, <inline-formula id="inf3">
<mml:math id="m13">
<mml:mi>max</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>z</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>g</mml:mi>
<mml:mi>m</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>d</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>&#x2265;</mml:mo>
<mml:mn>1.0</mml:mn>
</mml:math>
</inline-formula>, this may lead to gradient explosion. The numerical experiments in PyTorch show that such a case can be avoided when <italic>&#x3b2;</italic> &#x2264; 2.5402. On the other hand, the lower bound of <italic>&#x3b2;</italic> is set to 1.0 because a small value of <italic>&#x3b2;</italic> cannot achieve effective gradient amplification. Therefore, <italic>&#x3b2;</italic> is ranged to [1.0, 2.5402] to ensure the system stability.</p>
<p>As shown in the comparison of <xref ref-type="fig" rid="F1">Figures 1</xref>, <xref ref-type="fig" rid="F2">2</xref>, zigmoid amplifies the derivate at the far end of the time axis where sigmoid failed to do so.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Zigmoid properties.</p>
</caption>
<graphic xlink:href="fenrg-10-852349-g002.tif"/>
</fig>
</sec>
<sec id="s2-2">
<title>2.2 zLSTM</title>
<p>The aforementioned defined zigmoid with ranged <italic>&#x3b2;</italic> is expected to alleviate the gradient problem for long time series. The following attempts are then conducted to build a new variant of LSTM using the proposed zigmoid (zLSTM) as shown in <xref ref-type="fig" rid="F3">Figure 3</xref>:<list list-type="simple">
<list-item>
<p>1. Replace sigmoid function in the forget gate with zigmoid such that the gradient can be effectively amplified (<xref ref-type="fig" rid="F3">Figure 3</xref>).</p>
</list-item>
<list-item>
<p>2. Replace <italic>i</italic>
<sub>
<italic>t</italic>
</sub> with (1 &#x2212; <italic>f</italic>
<sub>
<italic>t</italic>
</sub>) (<xref ref-type="bibr" rid="B2">Cho et al., 2014</xref>) in order to reduce the trainable parameters.</p>
</list-item>
</list>
<disp-formula id="e11">
<mml:math id="m14">
<mml:msub>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi mathvariant="bold-italic">z</mml:mi>
<mml:mi mathvariant="bold-italic">i</mml:mi>
<mml:mi mathvariant="bold-italic">g</mml:mi>
<mml:mi mathvariant="bold-italic">m</mml:mi>
<mml:mi mathvariant="bold-italic">o</mml:mi>
<mml:mi mathvariant="bold-italic">i</mml:mi>
<mml:mi mathvariant="bold-italic">d</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>b</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:math>
<label>(11)</label>
</disp-formula>
<disp-formula id="e12">
<mml:math id="m15">
<mml:msub>
<mml:mrow>
<mml:mi>o</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>s</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>g</mml:mi>
<mml:mi>m</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>d</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>o</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>o</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>b</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>o</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:math>
<label>(12)</label>
</disp-formula>
<disp-formula id="e13">
<mml:math id="m16">
<mml:msub>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi>c</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">&#x302;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>tanh</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>b</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>c</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:math>
<label>(13)</label>
</disp-formula>
<disp-formula id="e14">
<mml:math id="m17">
<mml:msub>
<mml:mrow>
<mml:mi>c</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2299;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>c</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
<mml:mo>&#x2299;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi>c</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">&#x302;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
</mml:math>
<label>(14)</label>
</disp-formula>
<disp-formula id="e15">
<mml:math id="m18">
<mml:msub>
<mml:mrow>
<mml:mi>h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>o</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2299;</mml:mo>
<mml:mi>tanh</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>c</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:math>
<label>(15)</label>
</disp-formula>where <italic>W</italic>
<sub>
<italic>ix</italic>
</sub>, <italic>W</italic>
<sub>
<italic>ih</italic>
</sub>, <italic>b</italic>
<sub>
<italic>i</italic>
</sub>, <italic>W</italic>
<sub>
<italic>fx</italic>
</sub>, <italic>W</italic>
<sub>
<italic>fh</italic>
</sub>, <italic>b</italic>
<sub>
<italic>f</italic>
</sub>, <italic>W</italic>
<sub>
<italic>ox</italic>
</sub>, <italic>W</italic>
<sub>
<italic>oh</italic>
</sub>, <italic>b</italic>
<sub>
<italic>o</italic>
</sub>, <italic>W</italic>
<sub>
<italic>cx</italic>
</sub>, <italic>W</italic>
<sub>
<italic>ch</italic>
</sub>, and <italic>b</italic>
<sub>
<italic>c</italic>
</sub> are trainable parameters.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Structure of zLSTM.</p>
</caption>
<graphic xlink:href="fenrg-10-852349-g003.tif"/>
</fig>
</sec>
<sec id="s2-3">
<title>2.3 Hyper-Parameter <italic>&#x3b2;</italic>
</title>
<p>An appropriate hyper-parameter <italic>&#x3b2;</italic> is of great significance for controlling the intensity of the derivative amplification. <italic>&#x3b2;</italic> should be a value that amplifies the gradient enough for the network to learn long-term information.</p>
<p>Further information to be learned is that the smaller gradient is in zigmoid. For an input sequence with length <italic>L</italic>:<disp-formula id="e16">
<mml:math id="m19">
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfrac>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>L</mml:mi>
</mml:math>
<label>(16)</label>
</disp-formula>
<disp-formula id="e17">
<mml:math id="m20">
<mml:msub>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>z</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>g</mml:mi>
<mml:mi>m</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>d</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:math>
<label>(17)</label>
</disp-formula>
</p>
<p>Therefore,<disp-formula id="e18">
<mml:math id="m21">
<mml:mfrac>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>z</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>g</mml:mi>
<mml:mi>m</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>d</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:mfrac>
<mml:mi>&#x3b2;</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>l</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>g</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mfenced>
</mml:math>
<label>(18)</label>
</disp-formula>
</p>
<p>The gradient has to be greater than a certain value <italic>g</italic>
<sub>
<italic>min</italic>
</sub> such that the networks can continue the learning. The <italic>g</italic>
<sub>
<italic>min</italic>
</sub> is observed and suggested in this study as 0.01.<disp-formula id="e19">
<mml:math id="m22">
<mml:mfrac>
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:mfrac>
<mml:mi>&#x3b2;</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>l</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>g</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mfenced>
<mml:mo>&#x2265;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>g</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
</mml:math>
<label>(19)</label>
</disp-formula>
</p>
<p>yields<disp-formula id="e20">
<mml:math id="m23">
<mml:mi>&#x3b2;</mml:mi>
<mml:mo>&#x2265;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>g</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msup>
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
<mml:mrow>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:mfenced>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>l</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>g</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mfrac>
</mml:math>
<label>(20)</label>
</disp-formula>
</p>
<p>Thus, the <italic>&#x3b2;</italic> value can be calculated as<disp-formula id="e21">
<mml:math id="m24">
<mml:mi>&#x3b2;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfenced open="{" close="">
<mml:mrow>
<mml:mtable class="cases">
<mml:mtr>
<mml:mtd columnalign="left">
<mml:mn>1.0</mml:mn>
<mml:mspace width="1em"/>
</mml:mtd>
<mml:mtd columnalign="left">
<mml:msub>
<mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>l</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>m</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3c;</mml:mo>
<mml:mn>1.0</mml:mn>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="left">
<mml:msub>
<mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mspace width="1em"/>
</mml:mtd>
<mml:mtd columnalign="left">
<mml:mn>1.0</mml:mn>
<mml:mo>&#x2264;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>l</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>m</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2264;</mml:mo>
<mml:mn>2.5402</mml:mn>
<mml:mspace width="0.17em"/>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="left">
<mml:mn>2.5402</mml:mn>
<mml:mspace width="0.17em"/>
<mml:mspace width="1em"/>
</mml:mtd>
<mml:mtd columnalign="left">
<mml:msub>
<mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>l</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>m</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3e;</mml:mo>
<mml:mn>2.5402</mml:mn>
<mml:mspace width="0.17em"/>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mfenced>
</mml:math>
<label>(21)</label>
</disp-formula>where <inline-formula id="inf4">
<mml:math id="m25">
<mml:msub>
<mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>l</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>m</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>g</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi>L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>l</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>g</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mfrac>
</mml:math>
</inline-formula>.</p>
</sec>
<sec id="s2-4">
<title>2.4 Capabilities of zLSTM</title>
<p>zLSTM obtained by the aforementioned procedure has the following capabilities:<list list-type="simple">
<list-item>
<p>1. zLSTM contains basic properties of LSTM:</p>
<list list-type="simple">
<list-item>
<p>a. Non-linearity: As a variant of RNN, zLSTM inherits the non-linearity processing ability.</p>
</list-item>
<list-item>
<p>b. Short-term memory: Zigmoid maintains LSTM&#x2019;s short-term memory ability.</p>
</list-item>
</list>
</list-item>
<list-item>
<p>2. As an improved version of LSTM, zLSTM has advantages such as</p>
<list list-type="simple">
<list-item>
<p>a. Long-term memory: Compared with sigmoid, zigmoid has a greater gradient with the same output. This enhances the long-term performance by allowing the model to conduct learning over the full length of the time series. Longer inputs are then allowed to be fed into the model, enriching the information used for predictions.</p>
</list-item>
<list-item>
<p>b. Reduced trainable parameters: Given that the sum of forget gate and input gate is 1, the input gate can be omitted to deduct parameters. Gate reduction brings reduced trainable parameters such that the computing time and resource are less than traditional models.</p>
</list-item>
</list>
</list-item>
</list>
</p>
<p>With the properties and advantages mentioned previously, this work proposes zLSTM as a better solution for LOCA prediction due to the following considerations:<list list-type="simple">
<list-item>
<p>1. Compared to traditional machine learning algorithms, such as the ones mentioned in <xref ref-type="bibr" rid="B12">Moshkbar-Bakhshayesh and Ghafari (2022</xref>), and feed-forward neural network (<xref ref-type="bibr" rid="B16">Santos et al., 2019</xref>), zLSTM can achieve better feature extraction with its LSTM kernel that performs the calculation along timesteps. It is then suggested for the non-linear LOCA process, whose variation features are hard to capture.</p>
</list-item>
<list-item>
<p>2. Long term features can be captured by a combination of existing models, such as CNN &#x2b; LSTM (<xref ref-type="bibr" rid="B21">Wang et al., 2021c</xref>), with a sacrifice of more hyper-parameters and more tuning tricks, which burdens the model hyper-parameter processing and deployment. zLSTM, on the contrary, avoids such process by using an improved structure without additional hyper-parameters.</p>
</list-item>
<list-item>
<p>3. The training process of zLSTM is more executable due to reduced gate and parameters, allowing it to generate training/predicting results with less time and efforts.</p>
</list-item>
<list-item>
<p>4. Compared to those baseline models, the LOCA prediction from zLSTM has better credibility and enhanced generalization performance due to zLSTM&#x2019;s lower overfitting probability and fewer trainable parameters.</p>
</list-item>
</list>
</p>
</sec>
</sec>
<sec id="s3">
<title>3 Experiments</title>
<sec id="s3-1">
<title>3.1 Datasets</title>
<p>The datasets are obtained from LOCA simulations using an industry-grade NPP simulation platform (<xref ref-type="bibr" rid="B19">Sun et al., 2017</xref>). The simulations are carried out at 100% reactor power for LOCA cases, that is, break sizes of 0.9, 0.95, 1.0, 1.5, and 2.0&#xa0;<italic>cm</italic>
<sup>2</sup>.</p>
<p>There are a total of twelve crucial system parameters selected as the modeling features:<list list-type="simple">
<list-item>
<p>1. pressurizer water level;</p>
</list-item>
<list-item>
<p>2. coolant average temperature;</p>
</list-item>
<list-item>
<p>3. steam generator No. 1 water level;</p>
</list-item>
<list-item>
<p>4. steam generator No. 2 water level;</p>
</list-item>
<list-item>
<p>5. loop 1 coolant flowrate;</p>
</list-item>
<list-item>
<p>6. loop 2 coolant flowrate;</p>
</list-item>
<list-item>
<p>7. pressurizer pressure;</p>
</list-item>
<list-item>
<p>8. stream generator No. 1 output pressure;</p>
</list-item>
<list-item>
<p>9. stream generator No. 2 output pressure;</p>
</list-item>
<list-item>
<p>10. reactor power;</p>
</list-item>
<list-item>
<p>11. cold leg temperature;</p>
</list-item>
<list-item>
<p>12. hot leg temperature.</p>
</list-item>
</list>
</p>
</sec>
<sec id="s3-2">
<title>3.2 Data Preprocessing</title>
<p>To reduce the influence of multiple dimensions, the dataset is preprocessed using the z-score method such that fast convergency can be achieved during the model training process.<disp-formula id="e22">
<mml:math id="m26">
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mi>c</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>l</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>d</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>x</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mo>&#x304;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mi>&#x3c3;</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:math>
<label>(22)</label>
</disp-formula>where <inline-formula id="inf5">
<mml:math id="m27">
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mo>&#x304;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> and <italic>&#x3c3;</italic> denote mean and variance of x, respectively.</p>
</sec>
<sec id="s3-3">
<title>3.3 Metrics</title>
<p>As common metrics for regression task evaluation, mean squared error (MSE) and mean absolute error (MAE) are chosen as the performance judgment for the proposed zLSTM model.<disp-formula id="e23">
<mml:math id="m28">
<mml:mi>M</mml:mi>
<mml:mi>S</mml:mi>
<mml:mi>E</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:munderover accentunder="false" accent="false">
<mml:mrow>
<mml:mo>&#x2211;</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:munderover>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi>y</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">&#x302;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
</mml:math>
<label>(23)</label>
</disp-formula>
<disp-formula id="e24">
<mml:math id="m29">
<mml:mi>M</mml:mi>
<mml:mi>A</mml:mi>
<mml:mi>E</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:munderover accentunder="false" accent="false">
<mml:mrow>
<mml:mo>&#x2211;</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:munderover>
<mml:mo stretchy="false">&#x7c;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi>y</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">&#x302;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">&#x7c;</mml:mo>
</mml:math>
<label>(24)</label>
</disp-formula>where <italic>y</italic>
<sub>
<italic>i</italic>
</sub> and <inline-formula id="inf6">
<mml:math id="m30">
<mml:msub>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi>y</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">&#x302;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:math>
</inline-formula> are the original value and predicted value, respectively.</p>
</sec>
<sec id="s3-4">
<title>3.4 Hyper-Parameter Setting</title>
<p>The zLSTM structure consists of one input layer, two hidden layers, and one output layer. Details are provided in <xref ref-type="table" rid="T1">Table 1</xref>.</p>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Hyper parameters of zLSTM model.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Item</th>
<th align="center">Value</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">Units of input layer</td>
<td align="center">1<xref ref-type="table-fn" rid="Tfn1">
<sup>a</sup>
</xref> or 12<xref ref-type="table-fn" rid="Tfn2">
<sup>b</sup>
</xref>
</td>
</tr>
<tr>
<td align="left">Units of zLSTM of 1st hidden layer</td>
<td align="center">128</td>
</tr>
<tr>
<td align="left">Units of zLSTM of 2nd hidden layer</td>
<td align="center">64</td>
</tr>
<tr>
<td align="left">Units of output layer</td>
<td align="center">1</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="Tfn1">
<label>a</label>
<p>1 for zLSTM univariate (zLSTM-U).</p>
</fn>
<fn id="Tfn2">
<label>b</label>
<p>12 for zLSTM multivariate (zLSTM-M).</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="s3-5">
<title>3.5 Baseline Methods</title>
<p>There have been two similar investigations performed by <xref ref-type="bibr" rid="B17">She et al. (2020</xref>) and <xref ref-type="bibr" rid="B18">She et al. (2021</xref>). However, their major purpose was to verify the feasibility and effects of applying deep learning methods to the LOCA predictions. Neither of them covers the multivariate processing performance that requires theoretical innovation on the NN itself, such as defining a new zigmoid function for the LSTM model. To demonstrate the superiority of using the zigmoid method for multivariate processing, these two previous cases are selected as the baseline, and the prediction accuracy represented by MSE and MAE is compared among all the three methods. The mentioned two previous works, LSTM and CNN-LSTM, are compared with zLSTM to demonstrate its superiority on post-LOCA predictions.</p>
</sec>
<sec id="s3-6">
<title>3.6 Model Training</title>
<p>All datasets are randomly split into three subsets, that is, a training set (60%), a validation set (20%), and a test set (20%). Multivariate time series data needed for model training are derived by applying the rolling update method. Following the previous work, the window sizes for LSTM and CNN-LSTM remain 5 and 50. zLSTM uses the same window size as the compared baseline, which means that it uses window size 5 when comparing with LSTM and 50 for CNN-LSTM. The training parameters are optimized using Adam algorithm (<xref ref-type="bibr" rid="B5">Kingma and Adam, 2014</xref>) with a learning rate 10<sup>&#x2013;3</sup> for all models. When the training starts, a sliding window moves from the first row of the training dataset and provides a series of training input data describing the parameter variation during the period limited by the window size. The model learns and memorizes the variations such that it can reproduce similar ones once the test data is fed to it. The model is trained during such iterations until desired loss value is reached. More training process details are provided in <xref ref-type="bibr" rid="B17">She et al. (2020</xref>).</p>
</sec>
<sec id="s3-7">
<title>3.7 Model Verification Experiments</title>
<p>The performance of the proposed zLSTM is verified through experiments designed to predict crucial parameters of LOCA, in which both univariate scenario and multivariate scenario are tested using zLSTM and the two baseline methods.</p>
<p>The first crucial parameter chosen as the prediction feature is loop 1 flowrate since it is the most impacted parameter during a LOCA. Flowrate data from the test dataset are the so called &#x201c;single input&#x201d; for the univariate scenario. As for the multivariate scenario, all the twelve system parameters are integrated into a vector <italic>x</italic>
<sub>
<italic>t</italic>
</sub> and fed into the zLSTM for a coupled prediction processing. A single output (loop 1 flowrate prediction) is generated by zLSTM&#x2019;s single-cell output layer that merges the processing results of the 12-dimension vector. The multivariate experiment is only for zLSTM since both baseline methods are originally single-input models without parameter-coupling capability. With a diversity consideration, similar univariate and multivariate experiments are conducted to predict the pressurizer water level as well.</p>
<p>The univariate test is necessary since the two baseline methods are oriented to only one system parameter prediction. During this experiment, the memorizing performance of the models for long- and short-term information is tested, allowing the zLSTM to present its long-term memory advantage with the amplified gradient. For a fair play, zLSTM used for this experiment takes the same single input as the baseline methods. Such univariate-input zLSTM is named zLSTM-univariate (zLSTM-U).</p>
<p>The multivariate test, on the other hand, is to confirm a lower loss value when the prediction is generated with an algorithm (zLSTM) that takes all associated parameters into account. In this case, system parameters associated to the predicted feature are fed to zLSTM as multivariate inputs, naming it zLSTM-multivariate (zLSTM-M).</p>
<sec id="s3-7-1">
<title>3.7.1 Prediction of Loop 1 Coolant Flowrate</title>
<p>As mentioned in the verification process introduction, the univariate experiment uses the single input value for LSTM, CNN-LSTM, and zLSTM-U. The multivariate experiment, which is for zLSTM-M only, yields a single predicted feature (flowrate or water level) using a 12-dimension vector containing all the key system parameters. The experiments in this subsection focus on the variation of the loop 1 coolant flowrate during LOCAs of five different break sizes. Regarding the discussion in <xref ref-type="sec" rid="s3-6">Section 3.6</xref>, both zLSTM-U and zLSTM-M select window size 5 to run the univariate and multivariate tests against LSTM, which forms a test group {LSTM, zLSTM-U5, zLSTM-M5}. When comparing to CNN-LSTM that has window size 50, the test group becomes {CNN-LSTM, zLSTM-U50, zLSTM-M50}. Model performance is evaluated using MSE and MAE for each of the six models, providing twelve accuracy evaluation results for each of the five LOCA cases. <xref ref-type="table" rid="T2">Table 2</xref> shows all these 60 results for the loop 1 flowrate predictions. The predicted LOCA trends are plotted in <xref ref-type="fig" rid="F4">Figures 4</xref>, <xref ref-type="fig" rid="F5">5</xref>.</p>
<table-wrap id="T2" position="float">
<label>TABLE 2</label>
<caption>
<p>Prediction accuracy evaluations for loop 1 flowrate.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left"/>
<th align="center">Metric</th>
<th align="center">Model</th>
<th align="center">0.9</th>
<th align="center">0.95</th>
<th align="center">1.0</th>
<th align="center">1.5</th>
<th align="center">2.0</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="3" align="left">LSTMvs.zLSTM</td>
<td rowspan="3" align="left">MAE</td>
<td align="left">LSTM</td>
<td align="center">1.751 &#xd7; 10<sup>0</sup>
</td>
<td align="center">1.841 &#xd7; 10<sup>0</sup>
</td>
<td align="center">3.023 &#xd7; 10<sup>&#x2013;3</sup>
</td>
<td align="center">1.114 &#xd7; 10<sup>0</sup>
</td>
<td align="center">4.800 &#xd7; 10<sup>&#x2013;3</sup>
</td>
</tr>
<tr>
<td align="left">zLSTM-U5</td>
<td align="center">
<underline>1.598 &#xd7; 10<sup>0</sup>
</underline>
</td>
<td align="center">
<underline>1.605 &#xd7; 10<sup>0</sup>
</underline>
</td>
<td align="center">
<underline>2.732 &#xd7; 10<sup>&#x2212;3</sup>
</underline>
</td>
<td align="center">
<underline>1.040 &#xd7; 10<sup>0</sup>
</underline>
</td>
<td align="center">
<underline>3.928 &#xd7; 10<sup>&#x2212;3</sup>
</underline>
</td>
</tr>
<tr>
<td align="left">zLSTM-M5</td>
<td align="center">
<bold>
<italic>9.235 &#xd7; 10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;1</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>9.243 &#xd7; 10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;1</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>2.008 &#xd7; 10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;3</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>9.603 &#xd7; 10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;1</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>3.443 &#xd7; 10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;3</italic>
</bold>
</sup>
</td>
</tr>
<tr>
<td align="left"/>
<td align="left">MSE</td>
<td align="left">LSTM</td>
<td align="center">1.039 &#xd7; 10<sup>1</sup>
</td>
<td align="center">1.048 &#xd7; 10<sup>1</sup>
</td>
<td align="center">2.700 &#xd7; 10<sup>&#x2013;5</sup>
</td>
<td align="center">7.702 &#xd7; 10<sup>0</sup>
</td>
<td align="center">8.000 &#xd7; 10<sup>&#x2013;5</sup>
</td>
</tr>
<tr>
<td align="left"/>
<td align="left"/>
<td align="left">zLSTM-U5</td>
<td align="center">
<underline>9.835 &#xd7; 10<sup>0</sup>
</underline>
</td>
<td align="center">
<underline>9.850 &#xd7; 10<sup>0</sup>
</underline>
</td>
<td align="center">
<underline>2.300 &#xd7; 10<sup>&#x2212;5</sup>
</underline>
</td>
<td align="center">
<underline>4.659 &#xd7; 10<sup>0</sup>
</underline>
</td>
<td align="center">
<underline>5.700 &#xd7; 10<sup>&#x2212;5</sup>
</underline>
</td>
</tr>
<tr>
<td align="left"/>
<td align="left"/>
<td align="left">zLSTM-M5</td>
<td align="center">
<bold>
<italic>3.458 &#xd7; 10</italic>
</bold>
<sup>
<bold>
<italic>0</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>3.444 &#xd7; 10</italic>
</bold>
<sup>
<bold>
<italic>0</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>1.800 &#xd7; </italic>
</bold>
<bold>
<italic>10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;5</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>3.984 &#xd7; 10</italic>
</bold>
<sup>
<bold>
<italic>0</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>5.000 &#xd7;</italic>
</bold> <bold>
<italic>10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;5</italic>
</bold>
</sup>
</td>
</tr>
<tr>
<td rowspan="3" align="left">CNN-LSTMvs.zLSTM</td>
<td rowspan="3" align="left">MAE</td>
<td align="left">CNN-LSTM</td>
<td align="center">2.353 &#xd7; 10<sup>0</sup>
</td>
<td align="center">2.270 &#xd7; 10<sup>0</sup>
</td>
<td align="center">2.615 &#xd7; 10<sup>&#x2013;3</sup>
</td>
<td align="center">1.223 &#xd7; 10<sup>0</sup>
</td>
<td align="center">3.848 &#xd7; 10<sup>&#x2013;3</sup>
</td>
</tr>
<tr>
<td align="left">zLSTM-U50</td>
<td align="center">
<underline>9.006 &#xd7; 10<sup>&#x2212;1</sup>
</underline>
</td>
<td align="center">
<underline>8.550 &#xd7; 10<sup>&#x2212;1</sup>
</underline>
</td>
<td align="center">
<underline>2.003 &#xd7; 10<sup>&#x2212;3</sup>
</underline>
</td>
<td align="center">
<underline>1.030 &#xd7; 10<sup>0</sup>
</underline>
</td>
<td align="center">
<underline>3.187 &#xd7; 10<sup>&#x2212;3</sup>
</underline>
</td>
</tr>
<tr>
<td align="left">zLSTM-M50</td>
<td align="center">
<bold>
<italic>7.217 &#xd7; 10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;1</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>7.241 &#xd7; 10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;1</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>1.630 &#xd7;</italic>
</bold> <bold>
<italic>10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;3</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>1.025 &#xd7; 10</italic>
</bold>
<sup>
<bold>
<italic>0</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>2.330 &#xd7;</italic>
</bold> <bold>
<italic>10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;3</italic>
</bold>
</sup>
</td>
</tr>
<tr>
<td align="left"/>
<td align="left">MSE</td>
<td align="left">CNN-LSTM</td>
<td align="center">2.140 &#xd7; 10<sup>1</sup>
</td>
<td align="center">2.322 &#xd7; 10<sup>1</sup>
</td>
<td align="center">2.700 &#xd7; 10<sup>&#x2013;5</sup>
</td>
<td align="center">6.878 &#xd7; 10<sup>0</sup>
</td>
<td align="center">5.700 &#xd7; 10<sup>&#x2013;5</sup>
</td>
</tr>
<tr>
<td align="left"/>
<td align="left"/>
<td align="left">zLSTM-U50</td>
<td align="center">
<underline>4.054 &#xd7; 10<sup>0</sup>
</underline>
</td>
<td align="center">
<underline>4.030 &#xd7; 10<sup>0</sup>
</underline>
</td>
<td align="center">
<underline>1.500 &#xd7; 10<sup>&#x2212;5</sup>
</underline>
</td>
<td align="center">
<underline>5.231 &#xd7; 10<sup>0</sup>
</underline>
</td>
<td align="center">
<underline>5.300 &#xd7; 10<sup>&#x2212;5</sup>
</underline>
</td>
</tr>
<tr>
<td align="left"/>
<td align="left"/>
<td align="left">zLSTM-M50</td>
<td align="center">
<bold>
<italic>2.126 &#xd7; 10</italic>
</bold>
<sup>
<bold>
<italic>0</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>2.121 &#xd7; 10</italic>
</bold>
<sup>
<bold>
<italic>0</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>1.300 &#xd7;</italic>
</bold> <bold>
<italic>10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;5</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>4.141 &#xd7; 10</italic>
</bold>
<sup>
<bold>
<italic>0</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>4.400 &#xd7;</italic>
</bold> <bold>
<italic>10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;5</italic>
</bold>
</sup>
</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>Underline data, the best result of univariate experiments; bold data, the best result of multivariate and univariate experiments.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>Prediction of loop 1 coolant flowrate on break size 0.9&#xa0;<italic>cm</italic>
<sup>2</sup>.</p>
</caption>
<graphic xlink:href="fenrg-10-852349-g004.tif"/>
</fig>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>Prediction of loop 1 coolant flowrate on break size 1.0&#xa0;<italic>cm</italic>
<sup>2</sup>.</p>
</caption>
<graphic xlink:href="fenrg-10-852349-g005.tif"/>
</fig>
</sec>
<sec id="s3-7-2">
<title>3.7.2 Prediction of Pressurizer Water Level</title>
<p>As another crucial system parameter describing the LOCA behavior, the pressurizer water level is predicted in this subsection by experiments same as in <xref ref-type="sec" rid="s3-7-1">Section 3.7.1</xref>. <xref ref-type="table" rid="T3">Table 3</xref> presents the 60 MSE/MAE data as the accuracy evaluations of the two test groups. Meanwhile, the water level variation illustrated by all the models is presented in <xref ref-type="fig" rid="F6">Figures 6</xref>, <xref ref-type="fig" rid="F7">7</xref>.</p>
<table-wrap id="T3" position="float">
<label>TABLE 3</label>
<caption>
<p>Prediction accuracy evaluations for pressurizer water level.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left"/>
<th align="center">Metric</th>
<th align="center">Model</th>
<th align="center">0.9</th>
<th align="center">0.95</th>
<th align="center">1.0</th>
<th align="center">1.5</th>
<th align="center">2.0</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="3" align="left">LSTMvs.zLSTM</td>
<td rowspan="3" align="left">MAE</td>
<td align="left">LSTM</td>
<td align="center">1.505 &#xd7; 10<sup>&#x2013;1</sup>
</td>
<td align="center">1.519 &#xd7; 10<sup>&#x2013;1</sup>
</td>
<td align="center">4.167 &#xd7; 10<sup>&#x2013;2</sup>
</td>
<td align="center">4.791 &#xd7; 10<sup>&#x2013;1</sup>
</td>
<td align="center">8.124 &#xd7; 10<sup>&#x2013;2</sup>
</td>
</tr>
<tr>
<td align="left">zLSTM-U5</td>
<td align="center">
<underline>1.402 &#xd7; 10<sup>&#x2212;2</sup>
</underline>
</td>
<td align="center">
<underline>1.393 &#xd7; 10<sup>&#x2212;2</sup>
</underline>
</td>
<td align="center">
<underline>3.855 &#xd7; 10<sup>&#x2212;2</sup>
</underline>
</td>
<td align="center">
<underline>4.698 &#xd7; 10<sup>&#x2212;1</sup>
</underline>
</td>
<td align="center">
<underline>7.919 &#xd7; 10<sup>&#x2212;2</sup>
</underline>
</td>
</tr>
<tr>
<td align="left">zLSTM-M5</td>
<td align="center">
<bold>
<italic>8.800 &#xd7;</italic>
</bold> <bold>
<italic>10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;4</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>8.990 &#xd7;</italic>
</bold> <bold>
<italic>10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;4</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>1.214 &#xd7; 10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;6</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>1.346 &#xd7; 10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;3</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>5.000 &#xd7;</italic>
</bold> <bold>
<italic>10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;6</italic>
</bold>
</sup>
</td>
</tr>
<tr>
<td align="left"/>
<td align="left">MSE</td>
<td align="left">LSTM</td>
<td align="center">7.775 &#xd7; 10<sup>&#x2013;2</sup>
</td>
<td align="center">7.898 &#xd7; 10<sup>&#x2013;2</sup>
</td>
<td align="center">2.423 &#xd7; 10<sup>&#x2013;3</sup>
</td>
<td align="center">3.794 &#xd7; 10<sup>&#x2013;1</sup>
</td>
<td align="center">9.233 &#xd7; 10<sup>&#x2013;3</sup>
</td>
</tr>
<tr>
<td align="left"/>
<td align="left"/>
<td align="left">zLSTM-U5</td>
<td align="center">
<underline>7.723 &#xd7; 10<sup>&#x2212;2</sup>
</underline>
</td>
<td align="center">
<underline>7.723 &#xd7; 10<sup>&#x2212;2</sup>
</underline>
</td>
<td align="center">
<underline>2.126 &#xd7; 10<sup>&#x2212;3</sup>
</underline>
</td>
<td align="center">
<underline>3.453 &#xd7; 10<sup>&#x2212;1</sup>
</underline>
</td>
<td align="center">
<underline>8.841 &#xd7; 10<sup>&#x2212;3</sup>
</underline>
</td>
</tr>
<tr>
<td align="left"/>
<td align="left"/>
<td align="left">zLSTM-M5</td>
<td align="center">
<bold>
<italic>1.618 &#xd7; 10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;2</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>1.894 &#xd7; 10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;2</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>9.000 &#xd7;</italic>
</bold> <bold>
<italic>10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;4</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>2.939 &#xd7;</italic>
</bold> <bold>
<italic>10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;2</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>1.761 &#xd7; 10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;3</italic>
</bold>
</sup>
</td>
</tr>
<tr>
<td rowspan="3" align="left">CNN-LSTMvs.zLSTM</td>
<td rowspan="3" align="left">MAE</td>
<td align="left">CNN-LSTM</td>
<td align="center">9.192 &#xd7; 10<sup>&#x2013;2</sup>
</td>
<td align="center">9.238 &#xd7; 10<sup>&#x2013;2</sup>
</td>
<td align="center">3.150 &#xd7; 10<sup>&#x2013;2</sup>
</td>
<td align="center">4.221 &#xd7; 10<sup>&#x2013;1</sup>
</td>
<td align="center">3.595 &#xd7; 10<sup>&#x2013;2</sup>
</td>
</tr>
<tr>
<td align="left">zLSTM-U50</td>
<td align="center">
<underline>5.561 &#xd7; 10<sup>&#x2212;2</sup>
</underline>
</td>
<td align="center">
<underline>4.991 &#xd7; 10<sup>&#x2212;2</sup>
</underline>
</td>
<td align="center">
<underline>2.770 &#xd7; 10<sup>&#x2212;2</sup>
</underline>
</td>
<td align="center">
<underline>3.777 &#xd7; 10<sup>&#x2212;1</sup>
</underline>
</td>
<td align="center">
<underline>3.373 &#xd7; 10<sup>&#x2212;2</sup>
</underline>
</td>
</tr>
<tr>
<td align="left">zLSTM-M50</td>
<td align="center">
<bold>
<italic>1.264 &#xd7; 10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;2</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>1.410 &#xd7;</italic>
</bold> <bold>
<italic>10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;2</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>5.940 &#xd7;</italic>
</bold> <bold>
<italic>10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;4</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>2.448 &#xd7; 10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;2</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>9.870 &#xd7;</italic>
</bold> <bold>
<italic>10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;4</italic>
</bold>
</sup>
</td>
</tr>
<tr>
<td align="left"/>
<td align="left">MSE</td>
<td align="left">CNN-LSTM</td>
<td align="center">3.231 &#xd7; 10<sup>&#x2013;2</sup>
</td>
<td align="center">3.234 &#xd7; 10<sup>&#x2013;2</sup>
</td>
<td align="center">1.359 &#xd7; 10<sup>&#x2013;3</sup>
</td>
<td align="center">3.216 &#xd7; 10<sup>&#x2013;1</sup>
</td>
<td align="center">2.872 &#xd7; 10<sup>&#x2013;3</sup>
</td>
</tr>
<tr>
<td align="left"/>
<td align="left"/>
<td align="left">zLSTM-U50</td>
<td align="center">
<underline>1.227 &#xd7; 10<sup>&#x2212;2</sup>
</underline>
</td>
<td align="center">
<underline>1.131 &#xd7; 10<sup>&#x2212;2</sup>
</underline>
</td>
<td align="center">
<underline>1.196 &#xd7; 10<sup>&#x2212;3</sup>
</underline>
</td>
<td align="center">
<underline>2.416 &#xd7; 10<sup>&#x2212;1</sup>
</underline>
</td>
<td align="center">
<underline>2.029 &#xd7; 10<sup>&#x2212;3</sup>
</underline>
</td>
</tr>
<tr>
<td align="left"/>
<td align="left"/>
<td align="left">zLSTM-M50</td>
<td align="center">
<bold>
<italic>4.600 &#xd7;</italic>
</bold> <bold>
<italic>10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;4</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>4.860 &#xd7;</italic>
</bold> <bold>
<italic>10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;4</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>6.000 &#xd7;</italic>
</bold> <bold>
<italic>10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;6</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>8.740 &#xd7;</italic>
</bold> <bold>
<italic>10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;4</italic>
</bold>
</sup>
</td>
<td align="center">
<bold>
<italic>2.00 &#xd7;</italic>
</bold> <bold>
<italic>10</italic>
</bold>
<sup>
<bold>
<italic>&#x2013;5</italic>
</bold>
</sup>
</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>Underline data, the best result of univariate experiments; bold data, the best result of multivariate and univariate experiments.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>Prediction of pressurizer water level on break size 0.9&#xa0;<italic>cm</italic>
<sup>2</sup>.</p>
</caption>
<graphic xlink:href="fenrg-10-852349-g006.tif"/>
</fig>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption>
<p>Prediction of pressurizer water level on break size 0.95&#xa0;<italic>cm.</italic>
<sup>2</sup>
</p>
</caption>
<graphic xlink:href="fenrg-10-852349-g007.tif"/>
</fig>
</sec>
<sec id="s3-7-3">
<title>3.7.3 Result Analysis</title>
<p>The prediction accuracy metrics listed in <xref ref-type="table" rid="T2">Tables 2</xref>, <xref ref-type="table" rid="T3">3</xref> describe the prediction performance of the tested models. The upper half of each table presents the prediction errors from zLSTM (both U and M) against those from LSTM. It can be seen that zLSTM achieves lower errors than LSTM on both MAE and MSE. For instance, the average MAE difference between LSTM and zLSTM-U5 for flowrate prediction is 0.26 in <xref ref-type="table" rid="T2">Table 2</xref>, giving a 28.7% improvement on prediction accuracy. Additionally, zLSTM working on multivariate mode also beats its univariate mode due to the advantages brought by the 12-dimension parameter vector. The average MAE difference between zLSTM-U5 and zLSTM-M5 is 0.24, giving an accuracy improvement of 28.8%. Similar comparison is reflected by the lower half of each table, where zLSTM once again proves its superiority over CNN-LSTM with accuracy improvements such as 29.52% for pressurizer water level prediction (MSE in <xref ref-type="table" rid="T3">Table 3</xref>, CNN-LSTM vs. zLSTM-U50).</p>
<p>
<xref ref-type="fig" rid="F4">Figure 4</xref> to <xref ref-type="fig" rid="F7">Figure 7</xref> visually illustrate the predicted LOCA trends in different LOCA cases. The mini graphs within the figures amplify chosen segments of the trends, offering a better view to the model performance. After the LOCA occurs at <italic>t</italic> &#x3d; 10<italic>s</italic>, the loop 1 flowrate and the pressurizer water level experience dramatical variations, and then approach a stable state with help from the emergency response system. During the entire process, the zLSTM group {U5, M5, U50, M50} represents a more precise prediction performance. At the beginning of the LOCA, it is the zLSTM that grasps the suddenly inserted non-linear variation using its efficient short-term memory, producing a prediction close to the actual trend. How the zigmoid function can enhance the model&#x2019;s long-term memory is well verified when the zLSTM models generate better predictions at the far end of the time axis. zLSTM-M50 is the one that grasps the progression trends most accurately and persistently, which demonstrate the importance and effect of using multivariate processing (12-dimension vector) and wide data window (size 50).</p>
<p>The analysis conducted to investigate further explanations is presented as follows:<list list-type="simple">
<list-item>
<p>1. The multivariate mode of zLSTM (zLSTM-M) allows the prediction to be generated based on the coupling of system parameters, that is, the prediction comprehensively considers all the 12 critical system parameters relevant to LOCA progression. Sufficient information provided by such a 12-variable input vector guarantees improved prediction accuracy.</p>
</list-item>
<list-item>
<p>2. LOCA predictions for small breaks received higher loss values than those for big ones. They confirm the difficulties of learning and simulating a process with dramatical variations, for example, a small LOCA. Inflect points shorten the time period necessary for information gathering, preventing the model from sufficient evaluation of the progression process. The loss values rise along with the number of inflect points, showing that more inflect points cause more missing information during learning and prediction.</p>
</list-item>
<list-item>
<p>3. It is observed that a larger input window size gives the zLSTM model a better performance since the window size determines the coverage of critical information. More accurate results are generated when the model is capable of learning comprehensively by capturing more useful information from the predicted process.</p>
</list-item>
</list>
</p>
<p>To summarize, the application of zigmoid function to LSTM has enhanced the short and long-term memory of the model. With the input vector integrated using 12 system parameters, the zLSTM-M model can be even more comprehensive to the multivariate environment of LOCA, allowing the predicted feature to be more accurate.</p>
</sec>
</sec>
</sec>
<sec id="s4">
<title>4 Conclusion</title>
<p>A new gate function zigmoid is raised as a solution to the far-end gradient problem of RNN class models, which is proposed to cover the non-linearity, time correlation, and multivariate processing for LOCA predictions. Proved through theoretical analysis, the zigmoid function is embedded into traditional LSTM to form zLSTM that is capable of effectively memorizing both short- and long-term information. Its multivariate processing is enabled by using a 12-dimension input vector that integrates 12 system parameters. The multivariate mode gathers all-sided system information that eliminates blind spots during the prediction process. The verification experiments successfully demonstrate the aforementioned advantages of the zLSTM model. The accuracy metrics (MAE/MSE) of zLSTM is kept lower than traditional models for both univariate and multivariate scenarios. During the LOCA progression, the parameter trends are followed by zLSTM&#x2019;s prediction, with the smallest deviation according to the experiment figures. All these findings confirm zLSTM to be a better method for LOCA predictions.</p>
<p>In addition to the achievements, there are a few issues remaining for future investigation. First, zLSTM is constructed by replacing only the forget gate in LSTM. Possible further enhancement could be obtained with more applications of the zigmoid function. The next is the model training process that may be improved using more actual NPP data. Last but not least, inflect points in the LOCA trend cannot be well followed by the prediction curve generated from the deep learning models, which implies alternative solutions in future explorations.</p>
</sec>
</body>
<back>
<sec id="s5">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/<xref ref-type="sec" rid="s11">Supplementary Material</xref>, further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="s6">
<title>Author Contributions</title>
<p>SG proposed the idea of using zLSTM methods for LOCA prediction and established the main structure of the deep learning models used in this work. SY performed the data preprocessing and model training task. JS drafted most parts of the article and coordinated the cooperation of all the co-authors. WL was responsible for the experiment results analysis as well as preparing the tables and figures. SL provided key instructions to the group members to ensure accurate and efficient research methodologies. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec id="s7">
<title>Funding</title>
<p>The authors would like to acknowledge the financial and technical support received from the following research projects and institutions, including but not limited to, National Key Research and Development Project (2020YFB1713400), and The Industrial Internet Innovation and Development Project of China (TC19084DY).</p>
</sec>
<sec sec-type="COI-statement" id="s8">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s9">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="s11">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://www.frontiersin.org/articles/10.3389/fenrg.2022.852349/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fenrg.2022.852349/full&#x23;supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="DataSheet1.ZIP" id="SM1" mimetype="application/ZIP" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Ren</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Research on Simulation and State Prediction of Nuclear Power System Based on Lstm Neural Network</article-title>. <source>Sci. Technology Nucl. Installations</source> <volume>2021</volume>, <fpage>8839867</fpage>. <pub-id pub-id-type="doi">10.1155/2021/8839867</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Cho</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>van Merri&#xeb;nboer</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Gulcehre</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Bahdanau</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Bougares</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Schwenk</surname>
<given-names>H.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). &#x201c;<article-title>Learning Phrase Representations Using RNN Encoder&#x2013;Decoder for Statistical Machine Translation</article-title>,&#x201d; in <conf-name>Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)</conf-name>, <conf-loc>Doha, Qatar</conf-loc>, <conf-date>October 25 and 29, 2014</conf-date> (<publisher-loc>Doha, Qatar</publisher-loc>: <publisher-name>Association for Computational Linguistics</publisher-name>), <fpage>1724</fpage>&#x2013;<lpage>1734</lpage>. </citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hochreiter</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Schmidhuber</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>1997</year>). <article-title>Long Short-Term Memory</article-title>. <source>Neural Comput.</source> <volume>9</volume> (<issue>8</issue>), <fpage>1735</fpage>&#x2013;<lpage>1780</lpage>. <pub-id pub-id-type="doi">10.1162/neco.1997.9.8.1735</pub-id> </citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hu</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Q.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Data-driven Machine Learning for Fault Detection and Diagnosis in Nuclear Power Plants: A Review</article-title>. <source>Front. Energ. Res.</source> <volume>9</volume>, <fpage>663296</fpage>. </citation>
</ref>
<ref id="B5">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Kingma</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Adam</surname>
<given-names>J. B.</given-names>
</name>
</person-group> (<year>2014</year>). <source>A Method for Stochastic Optimization</source>. <publisher-loc>San Diego, CA USA</publisher-loc>: <publisher-name>Computer Science</publisher-name>. </citation>
</ref>
<ref id="B6">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Koo</surname>
<given-names>Y. D.</given-names>
</name>
<name>
<surname>Man</surname>
<given-names>G. N.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>K. S.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>C. H.</given-names>
</name>
</person-group> (<year>2018</year>). &#x201c;<article-title>Prediction of Nuclear Reactor Vessel Water Level Using Deep Neural Networks</article-title>,&#x201d; in <conf-name>2018 International Conference on Electronics, Information, and Communication (ICEIC)</conf-name>, <conf-loc>Honolulu, HI, USA</conf-loc>, <conf-date>24-27 Jan. 2018</conf-date> (<publisher-name>IEEE</publisher-name>). <pub-id pub-id-type="doi">10.23919/elinfocom.2018.8330616</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lee</surname>
<given-names>C.-K.</given-names>
</name>
<name>
<surname>Kwon</surname>
<given-names>G.-Y.</given-names>
</name>
<name>
<surname>Chang</surname>
<given-names>S. J.</given-names>
</name>
<name>
<surname>Jung</surname>
<given-names>M. K.</given-names>
</name>
<name>
<surname>Park</surname>
<given-names>J. B.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>H.-S.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Real-Time Condition Monitoring of LOCA via Time-Frequency Domain Reflectometry</article-title>. <source>IEEE Trans. Instrum. Meas.</source> <volume>66</volume>, <fpage>1864</fpage>&#x2013;<lpage>1873</lpage>. <pub-id pub-id-type="doi">10.1109/tim.2017.2664578</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Cheng</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Qiu</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Tan</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Research on Anomaly Detection Method of Nuclear Power Plant Operation State Based on Unsupervised Deep Generative Model</article-title>. <source>Ann. Nucl. Energ.</source> <volume>167</volume>, <fpage>108785</fpage>. </citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lin</surname>
<given-names>T.-H.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>S.-C.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>T.-C.</given-names>
</name>
<name>
<surname>Ferng</surname>
<given-names>Y.-M.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Localization and Size Estimation for Breaks in Nuclear Power Plants</article-title>. <source>Nucl. Eng. Technology</source> <volume>54</volume> (<issue>1</issue>), <fpage>193</fpage>&#x2013;<lpage>206</lpage>. <pub-id pub-id-type="doi">10.1016/j.net.2021.07.007</pub-id> </citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lin</surname>
<given-names>T.-H.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>T.-C.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>S.-C.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Deep Learning Schemes for Event Identification and Signal Reconstruction in Nuclear Power Plants with Sensor Faults</article-title>. <source>Ann. Nucl. Energ.</source> <volume>154</volume>, <fpage>108113</fpage>. <pub-id pub-id-type="doi">10.1016/j.anucene.2020.108113</pub-id> </citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mira</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Pigg</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Kozlowski</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Deng</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Qu</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Neural-based Time Series Forecasting of Loss of Coolant Accidents in Nuclear Power Plants - Sciencedirect</article-title>. <source>Expert Syst. Appl.</source> <volume>160</volume>. </citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Moshkbar-Bakhshayesh</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Ghafari</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Prediction of Steam/water Stratified Flow Characteristics in Npps Transients Using Svm Learning Algorithm with Combination of thermal-hydraulic Model and New Data Mapping Technique</article-title>. <source>Ann. Nucl. Energ.</source> <volume>166</volume>, <fpage>108699</fpage>. </citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pantera</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Stul&#xed;k</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Vidal-Ferr&#xe0;ndiz</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Carre&#xf1;o</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Ginestar</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>George</surname>
<given-names>I.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Localizing Perturbations in Pressurized Water Reactors Using One-Dimensional Deep Convolutional Neural Networks</article-title>. <source>Sensors</source> <volume>22</volume> (<issue>1</issue>), <fpage>113</fpage>.<pub-id pub-id-type="doi">10.3390/s22010113</pub-id> </citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Saeed</surname>
<given-names>H. A.</given-names>
</name>
<name>
<surname>Peng</surname>
<given-names>M.-j.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>B.-w.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Novel Fault Diagnosis Scheme Utilizing Deep Learning Networks</article-title>. <source>Prog. Nucl. Energ.</source> <volume>118</volume>, <fpage>103066</fpage>. <pub-id pub-id-type="doi">10.1016/j.pnucene.2019.103066</pub-id> </citation>
</ref>
<ref id="B15">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Santhosh</surname>
<given-names>M. K.</given-names>
</name>
<name>
<surname>Thangamani</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Mukhopadhyay</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Verma</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Rao</surname>
<given-names>V. V. S. S.</given-names>
</name>
<name>
<surname>Vaze</surname>
<given-names>K. K.</given-names>
</name>
<etal/>
</person-group> (<year>2010</year>). &#x201c;<article-title>Neural Network Based Diagnostic System for Accident Management in Nuclear Power Plants</article-title>,&#x201d; in <conf-name>2010 2nd International Conference on Reliability, Safety and Hazard - Risk-Based Technologies and Physics-of-Failure Methods (ICRESH)</conf-name>, <conf-loc>Mumbai, India</conf-loc>, <conf-date>14-16 Dec. 2010</conf-date> (<publisher-name>IEEE</publisher-name>), <fpage>572</fpage>&#x2013;<lpage>578</lpage>. <pub-id pub-id-type="doi">10.1109/icresh.2010.5779613</pub-id> </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Santos</surname>
<given-names>M. C. d.</given-names>
</name>
<name>
<surname>Pinheiro</surname>
<given-names>V. H. C.</given-names>
</name>
<name>
<surname>Desterro</surname>
<given-names>F. S. M. d.</given-names>
</name>
<name>
<surname>Avellar</surname>
<given-names>R. K. d.</given-names>
</name>
<name>
<surname>Schirru</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Santos Nicolau</surname>
<given-names>A. d.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Deep Rectifier Neural Network Applied to the Accident Identification Problem in a Pwr Nuclear Power Plant</article-title>. <source>Ann. Nucl. Energ.</source> <volume>133</volume>, <fpage>400</fpage>&#x2013;<lpage>408</lpage>. <pub-id pub-id-type="doi">10.1016/j.anucene.2019.05.039</pub-id> </citation>
</ref>
<ref id="B17">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>She</surname>
<given-names>J. K.</given-names>
</name>
<name>
<surname>Xue</surname>
<given-names>S. Y.</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>P. W.</given-names>
</name>
<name>
<surname>Cao</surname>
<given-names>H. S.</given-names>
</name>
</person-group> (<year>2020</year>). &#x201c;<article-title>The Application of LSTM Model to the Prediction of Abnormal Condition in Nuclear Power Plants</article-title>,&#x201d; in <source>Nuclear Power Plants: Innovative Technologies for Instrumentation and Control Systems, the Fourth International Symposium on Software Reliability, Industrial Safety, Cyber Security and Physical Protection of Nuclear Power Plant (ISNPP)</source>. <pub-id pub-id-type="doi">10.1007/978-981-15-1876-8_46</pub-id> </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>She</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Shi</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Xue</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>P.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Diagnosis and Prediction for Loss of Coolant Accidents in Nuclear Power Plants Using Deep Learning Methods</article-title>. <source>Front. Energ. Res.</source> <volume>9</volume>, <fpage>665262</fpage>. <pub-id pub-id-type="doi">10.3389/fenrg.2021.665262</pub-id> </citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sun</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Liao</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Su</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Control System Design and Validation Platform Development for Small Pressurized Water Reactors (Spwr) by Coupling an Engineering Simulator and Matlab/simulink</article-title>. <source>Ann. Nucl. Energ.</source> <volume>102</volume> (<issue>APR</issue>), <fpage>309</fpage>&#x2013;<lpage>316</lpage>. <pub-id pub-id-type="doi">10.1016/j.anucene.2016.12.034</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Tallec</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Ollivier</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2018</year>). <source>Can Recurrent Neural Networks Warp Time?</source> <comment>Arxiv</comment>. </citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Peng</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Ayodeji</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Xia</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>X. K.</given-names>
</name>
<name>
<surname>kang</surname>
<given-names>Z.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Advanced Fault Diagnosis Method for Nuclear Power Plant Based on Convolutional Gated Recurrent Network and Enhanced Particle Swarm Optimization</article-title>. <source>Ann. Nucl. Energ.</source> <volume>151</volume>, <fpage>107934</fpage>. <pub-id pub-id-type="doi">10.1016/j.anucene.2020.107934</pub-id> </citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Peng</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Yue</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Saeed</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Cheng</surname>
<given-names>M. H.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y. K.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Fault Identification and Diagnosis Based on Kpca and Similarity Clustering for Nuclear Power Plants</article-title>. <source>Ann. Nucl. Energ.</source> <volume>150</volume>, <fpage>107786</fpage>. </citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>M-D.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>T-H.</given-names>
</name>
<name>
<surname>Jhan</surname>
<given-names>K-C.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>S-C.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Abnormal Event Detection, Identification and Isolation in Nuclear Power Plants Using Lstm Networks</article-title>. <source>Prog. Nucl. Energ.</source> <volume>140</volume>, <fpage>103928</fpage>. <pub-id pub-id-type="doi">10.1016/j.pnucene.2021.103928</pub-id> </citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xiang</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Fu</surname>
<given-names>X. M.</given-names>
</name>
<name>
<surname>Xiong</surname>
<given-names>F. R.</given-names>
</name>
<name>
<surname>Bai</surname>
<given-names>X. M.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Deep Learning-Based Unsupervised Representation Clustering Methodology for Automatic Nuclear Reactor Operating Transient Identification</article-title>. <source>Knowledge-Based Syst.</source> <volume>204</volume>, <fpage>1061782020</fpage>. </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Deng</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Ishii</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Best Estimate Plus Uncertainty Analysis of a Large Break Loca on Generation Iii Reactor with Relap5</article-title>. <source>Ann. Nucl. Energ.</source> <volume>127</volume> (<issue>MAY</issue>), <fpage>326</fpage>&#x2013;<lpage>340</lpage>. <pub-id pub-id-type="doi">10.1016/j.anucene.2018.12.019</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>