<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Mar. Sci.</journal-id>
<journal-title>Frontiers in Marine Science</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Mar. Sci.</abbrev-journal-title>
<issn pub-type="epub">2296-7745</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmars.2022.1113788</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Marine Science</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Significant wave height prediction based on deep learning in the South China Sea</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Hao</surname>
<given-names>Peng</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/1714609"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Li</surname>
<given-names>Shuang</given-names>
</name>
<xref ref-type="author-notes" rid="fn001">
<sup>*</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Gao</surname>
<given-names>Yu</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/2123232"/>
</contrib>
</contrib-group>
<aff id="aff1">
<institution>Institute of Physical Oceanography and Remote Sensing, Ocean College, Zhejiang University</institution>, <addr-line>Zhoushan</addr-line>, <country>China</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>Edited by: Lichuan Wu, Uppsala University, Sweden</p>
</fn>
<fn fn-type="edited-by">
<p>Reviewed by: Carlos P&#xe9;rez-Collazo, University of Vigo, Spain; Junliang Gao, Jiangsu University of Science and Technology, China</p>
</fn>
<fn fn-type="corresp" id="fn001">
<p>*Correspondence: Shuang Li, <email xlink:href="mailto:lshuang@zju.edu.cn">lshuang@zju.edu.cn</email>
</p>
</fn>
<fn fn-type="other" id="fn002">
<p>This article was submitted to Physical Oceanography, a section of the journal Frontiers in Marine Science</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>23</day>
<month>02</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>9</volume>
<elocation-id>1113788</elocation-id>
<history>
<date date-type="received">
<day>01</day>
<month>12</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>21</day>
<month>12</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2023 Hao, Li and Gao</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Hao, Li and Gao</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>Significant wave height (SWH) prediction can effectively improve the safety of marine activities and reduce the occurrence of maritime accidents, which is of great significance to national security and the development of the marine economy. In this study, we comprehensively analyzed the SWH prediction performance of the recurrent neural network (RNN), long short-term memory network (LSTM), and gated recurrent unit network (GRU) by considering different input lengths, prediction lengths, and model complexity. The experimental results show that (1) the input length impacts the prediction results of SWH, but it does not mean that the longer the input length, the better the prediction performance. When the input length is 24h, the prediction performance of RNN, LSTM, and GRU models is better. (2) The prediction length influences the SWH prediction results. As the prediction length increases, the prediction performance gradually decreases. Among them, RNN is not suitable for 48h long-term SWH prediction. (3) The more layers of the model, the better the SWH prediction performance is not necessarily. When the number of layers is set to 3 or 4, the model&#x2019;s prediction performance is better.</p>
</abstract>
<kwd-group>
<kwd>significant wave height</kwd>
<kwd>South China Sea</kwd>
<kwd>deep learning</kwd>
<kwd>RNN</kwd>
<kwd>LSTM</kwd>
<kwd>GRU</kwd>
</kwd-group>
<contract-sponsor id="cn001">National Natural Science Foundation of China<named-content content-type="fundref-id">10.13039/501100001809</named-content>
</contract-sponsor>
<counts>
<fig-count count="9"/>
<table-count count="4"/>
<equation-count count="16"/>
<ref-count count="36"/>
<page-count count="12"/>
<word-count count="5114"/>
</counts>
</article-meta>
</front>
<body>
<sec id="s1" sec-type="intro">
<label>1</label>
<title>Introduction</title>
<p>Wave disaster is the most common marine disaster in the world, and the possible disasters mainly include the following: (1) Billow can cause ships at sea to capsize, destroy offshore platforms, and bring great disasters to marine transportation and construction, fishing, and marine military activities. (2) Billow can destroy coastal embankments, seawalls, docks, marine aquaculture facilities, and other marine structures. (3) Billow sometimes carries a large amount of sediment into the harbor and waterway, causing disasters such as silt up. In recent years, due to abnormal climate change, sea wave disasters have occurred frequently in coastal cities in the South China Sea, seriously threatening the safety of life and property of people in coastal cities. Therefore, accurate SWH prediction can effectively improve the safety of marine activities and reduce the occurrence of marine accidents, which is of great significance to the development of national security and the marine economy (<xref ref-type="bibr" rid="B31">Wang and Oey, 2008</xref>; <xref ref-type="bibr" rid="B32">Young and Ribal, 2019</xref>; <xref ref-type="bibr" rid="B8">Fan et&#xa0;al., 2020</xref>; <xref ref-type="bibr" rid="B12">Gao et&#xa0;al., 2021b</xref>; <xref ref-type="bibr" rid="B20">Li et&#xa0;al., 2021</xref>).</p>
<p>Because of the importance of accurately predicting SWH, researchers have conducted extensive research on SWH prediction methods. Traditional methods mainly include statistical methods and numerical simulation methods and have been widely used in global sea state prediction (<xref ref-type="bibr" rid="B14">Group, 1988</xref>; <xref ref-type="bibr" rid="B30">Vanem, 2016</xref>; <xref ref-type="bibr" rid="B17">Kazeminezhad and Siadatmousavi, 2017</xref>; <xref ref-type="bibr" rid="B29">Umesh and Swain, 2018</xref>; <xref ref-type="bibr" rid="B21">Liang et&#xa0;al., 2019</xref>; <xref ref-type="bibr" rid="B22">Liu et&#xa0;al., 2019</xref>; <xref ref-type="bibr" rid="B28">Swain et&#xa0;al., 2019</xref>; <xref ref-type="bibr" rid="B7">Emmanouil et&#xa0;al., 2020</xref>; <xref ref-type="bibr" rid="B11">Gao et&#xa0;al., 2020</xref>; <xref ref-type="bibr" rid="B19">Li et&#xa0;al., 2020</xref>; <xref ref-type="bibr" rid="B10">Gao et&#xa0;al., 2021a</xref>). Both statistical methods and numerical simulation methods attempt to predict SWH through approximate mathematical relational models. Due to the strong nonlinearity of ocean waves&#x2019; physical processes and mechanisms, especially in extreme cases (such as typhoons), this method may largely fail to achieve SWH prediction. In addition, such methods usually require high-performance equipment for real-time computing, which is computationally expensive (<xref ref-type="bibr" rid="B15">Huang and Dong, 2021</xref>; <xref ref-type="bibr" rid="B36">Zhou et&#xa0;al., 2021</xref>).</p>
<p>With the development of artificial intelligence, machine learning methods such as Artificial Neural Networks, Random Forests, and Support Vector Machine are gradually applied to SWH prediction work (<xref ref-type="bibr" rid="B26">Peres et&#xa0;al., 2015</xref>; <xref ref-type="bibr" rid="B5">Deshmukh et&#xa0;al., 2016</xref>; <xref ref-type="bibr" rid="B13">Gopinath and Dwarakish, 2016</xref>; <xref ref-type="bibr" rid="B1">Berbi&#x107; et&#xa0;al., 2017</xref>; <xref ref-type="bibr" rid="B9">Gao et&#xa0;al., 2018</xref>; <xref ref-type="bibr" rid="B3">Callens et&#xa0;al., 2020</xref>). Compared with traditional methods, this kind of method has the advantages of generality and fast calculation speed. However, it also has its own shortcomings. The machine learning method is only suitable for short-term prediction under good sea conditions, and the prediction under extreme conditions is not ideal. Furthermore, as the size of the input data increases, the model cannot extract enough features, and the accuracy of machine learning methods in predicting SWH may drop sharply (<xref ref-type="bibr" rid="B24">Ni and Ma, 2020</xref>).</p>
<p>Deep learning is a new research direction in the field of machine learning. Compared with machine learning methods, deep learning methods extract feature information by training deep neural networks, which have stronger feature expression capabilities and can achieve end-to-end training. It is worth mentioning that the emergence of recurrent neural networks has greatly improved the ability to solve time series prediction problems (<xref ref-type="bibr" rid="B34">Zaremba et&#xa0;al., 2014</xref>). In addition, there are two variant structures of RNN, one is LSTM (<xref ref-type="bibr" rid="B33">Yu et&#xa0;al., 2019</xref>)and the other is GRU (<xref ref-type="bibr" rid="B6">Dey and Salem, 2017</xref>). It is precisely because of the powerful ability of the recurrent neural network in solving time series problems that recurrent neural network has received extensive attention from researchers in predicting SWH (<xref ref-type="bibr" rid="B27">Sadeghifar et&#xa0;al., 2017</xref>; <xref ref-type="bibr" rid="B8">Fan et&#xa0;al., 2020</xref>; <xref ref-type="bibr" rid="B4">Chen and Huang, 2021</xref>; <xref ref-type="bibr" rid="B16">J&#xf6;rges et&#xa0;al., 2021</xref>; <xref ref-type="bibr" rid="B23">Miky et&#xa0;al., 2021</xref>; <xref ref-type="bibr" rid="B35">Zhang et&#xa0;al., 2021</xref>; <xref ref-type="bibr" rid="B2">Bethel et&#xa0;al., 2022</xref>).</p>
<p>However, when using recurrent neural networks for SWH prediction, there is still a major research challenge: how to better build a model to capture the dynamic dependencies between variables, especially in extreme environments. Specifically, SWH prediction models are often a mixture of short-term and long-term dependencies. A successful SWH prediction model should capture these two dependencies to make accurate predictions. Long-term dependence considers the difference between different seasons, and short-term dependence considers the fluctuation of wave height caused by wind direction and short-term wind direction changes. How to design the model structure, different layer settings, model update rules, learning rate, input length, and output length settings, etc., the results may be quite different. Understanding the effect of different recurrent neural network parameter settings on the SWH prediction ability is crucial to accurately predict the SWH ability.</p>
<p>This paper explores the effect of different parameter settings on the performance of RNN, LSTM, and GRU models for predicting SWH. Based on the wind speed and SWH data of the three stations, by setting different input lengths, prediction lengths, and network layers, we comprehensively measure the ability of different methods and parameters to predict SWH at different stations. The experimental results show that: (1) It is not that the longer the input length, the better the prediction effect. As the input length increases, the SWH prediction performance of the model gradually decreases. (2) As the prediction time increases, the SWH prediction performance of the model gradually decreases. (3) The blind increase of the number of layers will not only not improve the SWH prediction performance of the model, but will have the opposite effect.</p>
<p>The rest of the paper is organized as follows. In section 2, we describe the research area, research data, research method, etc. In section 3, we present the experimental results and make a detailed discussion. Finally, in section 4, we summarize our findings and provide an outlook for future work.</p>
</sec>
<sec id="s2" sec-type="materials|methods">
<label>2</label>
<title>Materials and methods</title>
<p>In this section, we first introduce the data sources and research area; then introduce three recurrent neural network methods; and finally describe the experimental environment, experimental procedures, and metrics in detail.</p>
<sec id="s2_1">
<label>2.1</label>
<title>Materials</title>
<p>ERA5 is the fifth generation ECMWF reanalysis for the global climate and weather for the past 4 to 7 decades. It combines model data with observations from across the world into a globally complete and consistent dataset using the laws of physics.</p>
<p>As shown in <xref ref-type="fig" rid="f1">
<bold>Figure&#xa0;1</bold>
</xref>, we select three stations S1 (19.75&#xb0;N, 114.75&#xb0;E), S2 (14.75&#xb0;N, 110.75&#xb0;E), S3 (11.75&#xb0;N, 113.75&#xb0;E) to analyze the ability of the model to predict SWH. This area is affected by monsoons all year round and is one of the main areas of global typhoon activity. The temporal resolution is hours, the horizontal resolution of wind speed is 0.25&#xb0;&#xd7;0.25&#xb0;, and the horizontal resolution of SWH is 0.5&#xb0;&#xd7;0.5&#xb0;. More information can be found on the website: <uri xlink:href="https://cds.climate.copernicusc.eu/">https://cds.climate.copernicusc.eu/</uri>.</p>
<fig id="f1" position="float">
<label>Figure&#xa0;1</label>
<caption>
<p>Location distribution of stations S1, S2, and S3, the green curve represents the path of Typhoon Rai.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-09-1113788-g001.tif"/>
</fig>
</sec>
<sec id="s2_2">
<label>2.2</label>
<title>Description of significant wave height prediction</title>
<p>For the SWH prediction of a certain station, it is essentially a time series prediction problem that takes past time series data as input and a certain amount of future time series data as output.</p>
<p>From the perspective of a time dimension, the observations at time length <italic>t</italic> form a tensor sequence <italic>X</italic>
<sub>1</sub>
<italic>, X</italic>
<sub>2</sub>
<italic>,&#x2026;, X<sub>t</sub>
</italic>. Therefore, the SWH prediction problem can be defined as a tensor sequence of <italic>J</italic> time lengths in the past, to predict the tensor sequence of the next <italic>K</italic> time lengths:</p>
<disp-formula>
<label>(1)</label>
<mml:math display="block" id="M1">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>X</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>X</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>+</mml:mo>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:munder>
<mml:mrow>
<mml:mi>a</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>g</mml:mi>
<mml:mo>&#xa0;</mml:mo>
<mml:mi>m</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>X</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>X</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>+</mml:mo>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:munder>
<mml:mo>&#xa0;</mml:mo>
<mml:mi>p</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>X</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>X</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>+</mml:mo>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>|</mml:mo>
</mml:mrow>
<mml:msub>
<mml:mi>X</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>J</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>X</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<p>In this study, we used the u10, v10, and SWH information of the three stations S1, S2, and S3 from 2018 to 2021 to predict SWH. Among them, we use the data from 2018-2020 as the training set and the data from 2021 as the validation set. To ensure the relative independence of training and validation data sets, the validation data is excluded from model training.</p>
</sec>
<sec id="s2_3">
<label>2.3</label>
<title>Methods</title>
<sec id="s2_3_1">
<label>2.3.1</label>
<title>Recurrent neural network</title>
<p>RNN is a type of neural network for processing time series data. <xref ref-type="fig" rid="f2">
<bold>Figure&#xa0;2</bold>
</xref> shows the RNN network structure. Compared with the traditional neural network whose inputs are independent of each other and cannot model sequence data with contextual relationships, RNN can retain the state of the previous input data. When the neuron calculates the current output, the previous neuron state is used as joint inputs are computed together. In this way, it has a great advantage in processing time series data.</p>
<fig id="f2" position="float">
<label>Figure&#xa0;2</label>
<caption>
<p>RNN module architecture.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-09-1113788-g002.tif"/>
</fig>
<p>The information state transfer formula of the unit at time <italic>t</italic> in RNN is as follows,</p>
<disp-formula>
<label>(2)</label>
<mml:math display="block" id="M2">
<mml:mrow>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mtext>tanh</mml:mtext>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>U</mml:mi>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:mi>W</mml:mi>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>h</mml:mi>
</mml:msub>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula>
<label>(3)</label>
<mml:math display="block" id="M3">
<mml:mrow>
<mml:msub>
<mml:mi>o</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mi>V</mml:mi>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>o</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</disp-formula>
<p>Where <italic>U</italic> is the weight matrix from the input layer to the hidden layer, <italic>h<sub>t</sub>
</italic> is the state, <italic>o<sub>t</sub>
</italic> is the output, <italic>b<sub>h</sub>
</italic> and <italic>b<sub>o</sub>
</italic> are the bias, <italic>W</italic> is the weight matrix from the state to the hidden layer, and <italic>V</italic> is the weight matrix from the hidden layer to the output layer. It can be seen from <xref ref-type="fig" rid="f2">
<bold>Figure&#xa0;2</bold>
</xref> that the <italic>W</italic>, <italic>U</italic>, and <italic>V</italic> corresponding to each time node are unchanged. By this method, parameter sharing is achieved, and the number of parameters is greatly reduced.</p>
</sec>
<sec id="s2_3_2">
<label>2.3.2</label>
<title>Long short-term memory</title>
<p>LSTM is a special kind of RNN. As shown in <xref ref-type="fig" rid="f3">
<bold>Figure&#xa0;3</bold>
</xref>, LSTM uses two gates to control the content of the cell state <italic>c</italic>: one is the forget gate, which determines how much the unit state <italic>c<sub>t-</sub>
</italic>
<sub>1</sub> from the previous moment is retained to the current moment <italic>c<sub>t</sub>
</italic>; the other is the input gate, which determines how much of the input <italic>x<sub>t</sub>
</italic> of the network at the current moment is saved in the unit state <italic>c<sub>t</sub>
</italic>. The LSTM uses an output gate to control how much of the unit state <italic>c<sub>t</sub>
</italic> is input to the current output value <italic>h<sub>t</sub>
</italic> of the LSTM.</p>
<p>The information state transfer formula of the unit at time <italic>t</italic> in LSTM is as follows,</p>
<disp-formula>
<label>(4)</label>
<mml:math display="block" id="M4">
<mml:mrow>
<mml:msub>
<mml:mi>f</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mi>&#x3c3;</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>f</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>[</mml:mo> <mml:mrow>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>f</mml:mi>
</mml:msub>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula>
<label>(5)</label>
<mml:math display="block" id="M5">
<mml:mrow>
<mml:msub>
<mml:mi>i</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mi>&#x3c3;</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula>
<label>(6)</label>
<mml:math display="block" id="M6">
<mml:mrow>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mi>tanh</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>z</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>z</mml:mi>
</mml:msub>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula>
<label>(7)</label>
<mml:math display="block" id="M7">
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>f</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>&#xb7;</mml:mo>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>i</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>&#xb7;</mml:mo>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula>
<label>(8)</label>
<mml:math display="block" id="M8">
<mml:mrow>
<mml:msub>
<mml:mi>o</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mi>&#x3c3;</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>o</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>o</mml:mi>
</mml:msub>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula>
<label>(9)</label>
<mml:math display="block" id="M9">
<mml:mrow>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>o</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>&#xb7;</mml:mo>
<mml:mi>tanh</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where <italic>f<sub>t</sub>
</italic> represents the processing formula of the forget gate, <italic>i<sub>t</sub>
</italic> represents the processing formula of the input gate, <italic>z<sub>t</sub>
</italic> is the unit state candidate vector at the current time, <italic>o<sub>t</sub>
</italic> represents the processing formula of the output gate, <italic>W</italic> represents the given weight matrix, &#x3c3; represents the sigmoid function, and &#xb7; represents the element-wise product.</p>
<fig id="f3" position="float">
<label>Figure&#xa0;3</label>
<caption>
<p>LSTM module architecture.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-09-1113788-g003.tif"/>
</fig>
</sec>
<sec id="s2_3_3">
<label>2.3.3</label>
<title>Gate recurrent unit</title>
<p>GRU is a variant of LSTM. GRU further simplifies the LSTM network, using only two gated structures, an update gate, and a reset gate, to update and transfer information. The update gate is responsible for controlling the influence of the state information of the previous moment on the state of the current moment. The larger the value of the update gate, the more the state information of the previous moment is brought in, and the reset gate is responsible for controlling the degree of ignoring the status information of the previous moment. The smaller the value of the reset gate, the more is ignored. Its unit structure is shown in <xref ref-type="fig" rid="f4">
<bold>Figure&#xa0;4</bold>
</xref>.</p>
<fig id="f4" position="float">
<label>Figure&#xa0;4</label>
<caption>
<p>GRU module architecture.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-09-1113788-g004.tif"/>
</fig>
<p>The information state transfer formula of the unit at time <italic>t</italic> in GRU is as follows,</p>
<disp-formula>
<label>(10)</label>
<mml:math display="block" id="M10">
<mml:mrow>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mi>&#x3c3;</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>z</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>z</mml:mi>
</mml:msub>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula>
<label>(11)</label>
<mml:math display="block" id="M11">
<mml:mrow>
<mml:msub>
<mml:mi>r</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mi>&#x3c3;</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>r</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>r</mml:mi>
</mml:msub>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula>
<label>(12)</label>
<mml:math display="block" id="M12">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>h</mml:mi>
<mml:mo>&#x2dc;</mml:mo>
</mml:mover>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mi>tanh</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>h</mml:mi>
<mml:mo>&#x2dc;</mml:mo>
</mml:mover>
</mml:msub>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>r</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>&#xb7;</mml:mo>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mover accent="true">
<mml:mi>h</mml:mi>
<mml:mo>&#x2dc;</mml:mo>
</mml:mover>
</mml:msub>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula>
<label>(13)</label>
<mml:math display="block" id="M13">
<mml:mrow>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>&#xb7;</mml:mo>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>&#xb7;</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>h</mml:mi>
<mml:mo>&#x2dc;</mml:mo>
</mml:mover>
<mml:mi>t</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where <italic>z<sub>t</sub>
</italic> is the processing formula of the update gate, <italic>r<sub>t</sub>
</italic> is the processing formula of the reset gate, <inline-formula>
<mml:math display="inline" id="im1">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>h</mml:mi>
<mml:mo>&#x2dc;</mml:mo>
</mml:mover>
<mml:mi>t</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>is the unit state candidate vector at the current time, <italic>h<sub>t</sub>
</italic> is the output value, <italic>W</italic> represents the given weight matrix, <italic>&#x3c3;</italic> represents the sigmoid function, and &#xb7; represents the element-wise product.</p>
</sec>
</sec>
<sec id="s2_4">
<label>2.4</label>
<title>Experimental design</title>
<sec id="s2_4_1">
<label>2.4.1</label>
<title>Experimental environment</title>
<p>All models are trained using the Adam optimizer (<xref ref-type="bibr" rid="B18">Kingma and Ba, 2014</xref>) with a starting learning rate of 0.001. The training process is stopped after 30 epochs. All experiments are implemented in Pytorch (<xref ref-type="bibr" rid="B25">Paszke et&#xa0;al., 2019</xref>) and conducted on NVIDIA 3070 GPU. Other detailed parameter information in the experiment is listed in <xref ref-type="table" rid="T1">
<bold>Table&#xa0;1</bold>
</xref>.</p>
<table-wrap id="T1" position="float">
<label>Table&#xa0;1</label>
<caption>
<p>Parameters setting.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top" align="center">Parameter</th>
<th valign="top" align="center">Setting</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="center">Input length</td>
<td valign="top" align="center">24/168/360/720</td>
</tr>
<tr>
<td valign="top" align="center">Output length</td>
<td valign="top" align="center">1/6/12/24/48</td>
</tr>
<tr>
<td valign="top" align="center">Layers</td>
<td valign="top" align="center">1/2/3/4/5</td>
</tr>
<tr>
<td valign="top" align="center">Hidden size</td>
<td valign="top" align="center">128</td>
</tr>
<tr>
<td valign="top" align="center">Stride</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="center">Batch size</td>
<td valign="top" align="center">64</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s2_4_2">
<label>2.4.2</label>
<title>Experimental procedures</title>
<p>In this study, all methods can achieve end-to-end training, and the entire calculation process does not require manual processing but is completely handed over to the deep learning model, from learning the input data feature to obtaining the result. The advantage of end-to-end training is that it reduces the complexity of computational processing. The overall flow of the experimental design is shown in <xref ref-type="fig" rid="f5">
<bold>Figure&#xa0;5</bold>
</xref>.</p>
<fig id="f5" position="float">
<label>Figure&#xa0;5</label>
<caption>
<p>Experimental flow chart.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-09-1113788-g005.tif"/>
</fig>
<p>The detailed steps of the SWH prediction experiment are as follows.</p>
<list list-type="simple">
<list-item>
<p>(1) Data preprocessing, using the MinmaxScaler function of the sklearn library to normalize the input data.</p>
</list-item>
<list-item>
<p>(2) Divide the data, using the data from 2018-2020 as the training set and the data in 2021 as the validation set.</p>
</list-item>
<list-item>
<p>(3) Set a fixed random seed to ensure that each experiment can be reproduced.</p>
</list-item>
<list-item>
<p>(4) Model training, using the Mean Square Error (MSE) loss function and the Adam optimization function to iteratively train the model, and automatically save the optimal weight by recording the loss function value. During the training process, the learning rate is dynamically adjusted every 10 epochs to prevent the model from not converging.</p>
</list-item>
<list-item>
<p>(5) Visualization, visualize the experimental results and intuitively compare the SWH prediction ability of different methods.</p>
</list-item>
</list>
</sec>
<sec id="s2_4_3">
<label>2.4.3</label>
<title>Metrics</title>
<p>We use the following three measures to assess the model&#x2019;s performance: Root Mean Square Error (<italic>RMSE</italic>), Mean Absolute Error (<italic>MAE</italic>), and Coefficient of Determination (<italic>R<sup>2</sup>
</italic>). The following are the calculation algorithms for the above-mentioned three metrics,</p>
<disp-formula>
<label>(14)</label>
<mml:math display="block" id="M14">
<mml:mrow>
<mml:mi>R</mml:mi>
<mml:mi>M</mml:mi>
<mml:mi>S</mml:mi>
<mml:mi>E</mml:mi>
<mml:mo>=</mml:mo>
<mml:msqrt>
<mml:mrow>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>n</mml:mi>
</mml:mfrac>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>n</mml:mi>
</mml:munderover>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:msqrt>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula>
<label>(15)</label>
<mml:math display="block" id="M15">
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mi>A</mml:mi>
<mml:mi>E</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>n</mml:mi>
</mml:mfrac>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>n</mml:mi>
</mml:munderover>
<mml:mrow>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>|</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula>
<label>(16)</label>
<mml:math display="block" id="M16">
<mml:mrow>
<mml:msup>
<mml:mi>R</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>n</mml:mi>
</mml:msubsup>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
<mml:mrow>
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>n</mml:mi>
</mml:msubsup>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where <italic>n</italic> is the total number of test samples, <italic>y<sub>i</sub>
</italic>, <inline-formula>
<mml:math display="inline" id="im2">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula>
<mml:math display="inline" id="im3">
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula> are the true value, the predicted value, and the arithmetic mean of <italic>y<sub>i</sub>
</italic>, respectively. Note that the lower the <italic>RMSE</italic> and <italic>MAE</italic> values, the better the consistency between the measurement and the prediction, but the higher the <italic>R</italic>
<sup>2</sup> value, the more accurate the prediction.</p>
</sec>
</sec>
</sec>
<sec id="s3" sec-type="results">
<label>3</label>
<title>Results</title>
<sec id="s3_1">
<label>3.1</label>
<title>Effect of input length on SWH prediction performance</title>
<p>To verify the influence of the input length on the SWH prediction results, when the initial learning rate is 0.001, the number of hidden layers is 3, the hidden size is 128, and the prediction length is 24h, the input lengths are set to 24h, 168h, 360h, 720h, discuss the effect of input length on SWH prediction performance. The influence of input length on the prediction of SWH is shown in <xref ref-type="table" rid="T2">
<bold>Table&#xa0;2</bold>
</xref>, in which the bold font is the optimal result of this group of experiments.</p>
<table-wrap id="T2" position="float">
<label>Table&#xa0;2</label>
<caption>
<p>The prediction result of significant wave height.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="middle" colspan="2" align="center" rowspan="2">Input Length</th>
<th valign="middle" colspan="3" align="center">RNN</th>
<th valign="middle" colspan="3" align="center">LSTM</th>
<th valign="middle" colspan="3" align="center">GRU</th>
</tr>
<tr>
<th valign="middle" align="center">
<italic>RMSE</italic>
</th>
<th valign="middle" align="center">
<italic>MAE</italic>
</th>
<th valign="middle" align="center">
<italic>R<sup>2</sup>
</italic>
</th>
<th valign="middle" align="center">
<italic>RMSE</italic>
</th>
<th valign="middle" align="center">
<italic>MAE</italic>
</th>
<th valign="middle" align="center">
<italic>R<sup>2</sup>
</italic>
</th>
<th valign="middle" align="center">
<italic>RMSE</italic>
</th>
<th valign="middle" align="center">
<italic>MAE</italic>
</th>
<th valign="middle" align="center">
<italic>R<sup>2</sup>
</italic>
</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="middle" align="center">S1</td>
<td valign="middle" align="center">24h<break/>168h<break/>360h<break/>720h</td>
<td valign="middle" align="center">0.116<break/>
<bold>0.113</bold>
<break/>0.118<break/>0.125</td>
<td valign="middle" align="center">0.084<break/>
<bold>0.083</bold>
<break/>0.085<break/>0.095</td>
<td valign="middle" align="center">0.779<break/>
<bold>0.786</bold>
<break/>0.763<break/>0.731</td>
<td valign="middle" align="center">
<bold>0.118</bold>
<break/>0.119<break/>0.118<break/>0.122</td>
<td valign="middle" align="center">
<bold>0.085</bold>
<break/>0.087<break/>0.087<break/>0.092</td>
<td valign="middle" align="center">
<bold>0.771</bold>
<break/>0.765<break/>0.762<break/>0.743</td>
<td valign="middle" align="center">
<bold>0.116</bold>
<break/>0.118<break/>0.119<break/>0.126</td>
<td valign="middle" align="center">
<bold>0.084</bold>
<break/>0.087<break/>0.089<break/>0.095</td>
<td valign="middle" align="center">
<bold>0.778</bold>
<break/>0.766<break/>0.758<break/>0.728</td>
</tr>
<tr>
<td valign="middle" align="center">S2</td>
<td valign="middle" align="center">24h<break/>168h<break/>360h<break/>720h</td>
<td valign="middle" align="center">
<bold>0.428</bold>
<break/>0.887<break/>0.472<break/>0.515</td>
<td valign="middle" align="center">
<bold>0.295</bold>
<break/>0.773<break/>0.345<break/>0.375</td>
<td valign="middle" align="center">
<bold>0.790</bold>
<break/>0.044<break/>0.721<break/>0.668</td>
<td valign="middle" align="center">
<bold>0.431</bold>
<break/>0.447<break/>0.469<break/>0.499</td>
<td valign="middle" align="center">
<bold>0.295</bold>
<break/>0.311<break/>0.331<break/>0.362</td>
<td valign="middle" align="center">
<bold>0.787</bold>
<break/>0.757<break/>0.725<break/>0.686</td>
<td valign="middle" align="center">
<bold>0.422</bold>
<break/>0.449<break/>0.484<break/>0.530</td>
<td valign="middle" align="center">
<bold>0.293</bold>
<break/>0.316<break/>0.345<break/>0.389</td>
<td valign="middle" align="center">
<bold>0.796</bold>
<break/>0.754<break/>0.707<break/>0.646</td>
</tr>
<tr>
<td valign="middle" align="center">S3</td>
<td valign="middle" align="center">24h<break/>168h<break/>360h<break/>720h</td>
<td valign="middle" align="center">
<bold>0.436</bold>
<break/>0.591<break/>0.489<break/>0.589</td>
<td valign="middle" align="center">
<bold>0.297</bold>
<break/>0.459<break/>0.353<break/>0.469</td>
<td valign="middle" align="center">
<bold>0.788</bold>
<break/>0.589<break/>0.714<break/>0.582</td>
<td valign="middle" align="center">
<bold>0.423</bold>
<break/>0.458<break/>0.482<break/>0.515</td>
<td valign="middle" align="center">
<bold>0.289</bold>
<break/>0.312<break/>0.331<break/>0.365</td>
<td valign="middle" align="center">
<bold>0.800</bold>
<break/>0.753<break/>0.722<break/>0.680</td>
<td valign="middle" align="center">
<bold>0.420</bold>
<break/>0.458<break/>0.496<break/>0.544</td>
<td valign="middle" align="center">
<bold>0.289</bold>
<break/>0.315<break/>0.348<break/>0.395</td>
<td valign="middle" align="center">
<bold>0.803</bold>
<break/>0.752<break/>0.705<break/>0.644</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>Bold: Black font is the optimal result.</p>
</table-wrap-foot>
</table-wrap>
<p>It can be seen from the experimental results that it is not that the longer the input length, the better the prediction effect. As the input length increases, the model&#x2019;s SWH prediction performance tends to be poor. Although using the RNN model to predict SWH at S1, the best result is obtained when the input length is 168h, but from the overall trend, the SWH prediction effect still shows a downward trend. At S1, the prediction effect of RNN is the best, and the prediction effect of GRU is slightly better than that of LSTM. The possible reason is that the SWH of the S1 is lower throughout the year, and the complex model has the opposite effect. At S2 and S3, the GRU model had the best SWH prediction effect, and the values of <italic>RMSE</italic> and <italic>MAE</italic> were slightly higher than those at S1. The possible reason was that S1 and S2 had higher annual SWH and greater fluctuations. Overall, considering the SWH prediction performance of different models at S1, S2, and S3, when the input length is 24h, GRU predicts SWH the best, which also benefits from the unique design of the GRU model.</p>
</sec>
<sec id="s3_2">
<label>3.2</label>
<title>Effect of prediction length on SWH prediction performance</title>
<p>To verify the influence of prediction length on SWH prediction results, according to the analysis of experimental results in Section 3.1, the initial learning rate was set as 0.001, the number of hidden layers as 3, the hidden size as 128, the input length as 24h, and the output length as 1h, 6h, 12h, 24h, and 48h, respectively, to discuss the influence of output length on SWH prediction performance. The influence of prediction length on the prediction of SWH is shown in <xref ref-type="table" rid="T3">
<bold>Table&#xa0;3</bold>
</xref>, in which the bold font is the optimal result of this group of experiments.</p>
<table-wrap id="T3" position="float">
<label>Table&#xa0;3</label>
<caption>
<p>The prediction result of significant wave height.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="middle" colspan="2" align="center" rowspan="2">Output Length</th>
<th valign="middle" colspan="3" align="center">RNN</th>
<th valign="middle" colspan="3" align="center">LSTM</th>
<th valign="middle" colspan="3" align="center">GRU</th>
</tr>
<tr>
<th valign="middle" align="center">
<italic>RMSE</italic>
</th>
<th valign="middle" align="center">
<italic>MAE</italic>
</th>
<th valign="middle" align="center">
<italic>R<sup>2</sup>
</italic>
</th>
<th valign="middle" align="center">
<italic>RMSE</italic>
</th>
<th valign="middle" align="center">
<italic>MAE</italic>
</th>
<th valign="middle" align="center">
<italic>R<sup>2</sup>
</italic>
</th>
<th valign="middle" align="center">
<italic>RMSE</italic>
</th>
<th valign="middle" align="center">
<italic>MAE</italic>
</th>
<th valign="middle" align="center">
<italic>R<sup>2</sup>
</italic>
</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="middle" align="center">S1</td>
<td valign="middle" align="center">1h<break/>6h<break/>12h<break/>24h<break/>48h</td>
<td valign="middle" align="center">
<bold>0.007</bold>
<break/>0.041<break/>0.084<break/>0.116<break/>0.163</td>
<td valign="middle" align="center">
<bold>0.004</bold>
<break/>0.030<break/>0.064<break/>0.084<break/>0.123</td>
<td valign="middle" align="center">
<bold>0.999</bold>
<break/>0.971<break/>0.884<break/>0.779<break/>0.566</td>
<td valign="middle" align="center">
<bold>0.009</bold>
<break/>0.043<break/>0.076<break/>0.118<break/>0.164</td>
<td valign="middle" align="center">
<bold>0.006</bold>
<break/>0.032<break/>0.054<break/>0.085<break/>0.124</td>
<td valign="middle" align="center">
<bold>0.998</bold>
<break/>0.968<break/>0.906<break/>0.771<break/>0.560</td>
<td valign="middle" align="center">
<bold>0.008</bold>
<break/>0.040<break/>0.074<break/>0.116<break/>0.163</td>
<td valign="middle" align="center">
<bold>0.005</bold>
<break/>0.029<break/>0.053<break/>0.084<break/>0.123</td>
<td valign="middle" align="center">
<bold>0.998</bold>
<break/>0.972<break/>0.910<break/>0.778<break/>0.567</td>
</tr>
<tr>
<td valign="middle" align="center">S2</td>
<td valign="middle" align="center">1h<break/>6h<break/>12h<break/>24h<break/>48h</td>
<td valign="middle" align="center">
<bold>0.025</bold>
<break/>0.162<break/>0.252<break/>0.428<break/>0.942</td>
<td valign="middle" align="center">
<bold>0.017</bold>
<break/>0.112<break/>0.170<break/>0.295<break/>0.810</td>
<td valign="middle" align="center">
<bold>0.999</bold>
<break/>0.969<break/>0.926<break/>0.790<break/>0.003</td>
<td valign="middle" align="center">
<bold>0.041</bold>
<break/>0.153<break/>0.263<break/>0.431<break/>0.659</td>
<td valign="middle" align="center">
<bold>0.028</bold>
<break/>0.103<break/>0.175<break/>0.295<break/>0.479</td>
<td valign="middle" align="center">
<bold>0.997</bold>
<break/>0.972<break/>0.919<break/>0.787<break/>0.512</td>
<td valign="middle" align="center">
<bold>0.032</bold>
<break/>0.142<break/>0.255<break/>0.422<break/>0.654</td>
<td valign="middle" align="center">
<bold>0.020</bold>
<break/>0.095<break/>0.172<break/>0.293<break/>0.481</td>
<td valign="middle" align="center">
<bold>0.998</bold>
<break/>0.976<break/>0.924<break/>0.796<break/>0.519</td>
</tr>
<tr>
<td valign="middle" align="center">S3</td>
<td valign="middle" align="center">1h<break/>6h<break/>12h<break/>24h<break/>48h</td>
<td valign="middle" align="center">
<bold>0.021</bold>
<break/>0.186<break/>0.272<break/>0.436<break/>0.870</td>
<td valign="middle" align="center">
<bold>0.014</bold>
<break/>0.155<break/>0.214<break/>0.297<break/>0.747</td>
<td valign="middle" align="center">
<bold>0.999</bold>
<break/>0.960<break/>0.916<break/>0.788<break/>0.176</td>
<td valign="middle" align="center">
<bold>0.037</bold>
<break/>0.135<break/>0.250<break/>0.423<break/>0.669</td>
<td valign="middle" align="center">
<bold>0.025</bold>
<break/>0.095<break/>0.175<break/>0.289<break/>0.470</td>
<td valign="middle" align="center">
<bold>0.998</bold>
<break/>0.979<break/>0.929<break/>0.800<break/>0.513</td>
<td valign="middle" align="center">
<bold>0.032</bold>
<break/>0.131<break/>0.237<break/>0.420<break/>0.659</td>
<td valign="middle" align="center">
<bold>0.022</bold>
<break/>0.093<break/>0.166<break/>0.289<break/>0.466</td>
<td valign="middle" align="center">
<bold>0.998</bold>
<break/>0.980<break/>0.936<break/>0.803<break/>0.526</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>Bold: Black font is the optimal result.</p>
</table-wrap-foot>
</table-wrap>
<p>It can be seen from the results that with the increase in the prediction length, the prediction effect gradually decreases. In S1, S2, and S3, the prediction results of the RNN method in 1h are all the best, the possible reason is that the RNN model is relatively simple and the prediction length is short. However, the prediction results at 48h show that the SWH prediction effect of the RNN model is poor and may not be used for long-term SWH prediction. As the prediction length increases, the advantages of the unique design of the LSTM and GRU models emerge. Overall, the SWH prediction performance of the GRU model is slightly better than that of the LSTM model. The RNN model has certain advantages in short-term SWH prediction, but as the prediction length gradually increases to 24h later, it loses its effect.</p>
</sec>
<sec id="s3_3">
<label>3.3</label>
<title>Effect of hidden layers on SWH prediction performance</title>
<p>To verify the influence of the hidden layers on the SWH prediction results, when the initial learning rate is 0.001, the hidden size is 128, the input length is 24h, and the prediction length is 24h, discuss the effect of layers on SWH prediction performance. The influence of hidden layers on the prediction of SWH is shown in <xref ref-type="table" rid="T4">
<bold>Table&#xa0;4</bold>
</xref>, in which the bold font is the optimal result of this group of experiments.</p>
<table-wrap id="T4" position="float">
<label>Table&#xa0;4</label>
<caption>
<p>The prediction result of significant wave height.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="middle" colspan="2" align="center" rowspan="2">Layers</th>
<th valign="middle" colspan="3" align="center">RNN</th>
<th valign="middle" colspan="3" align="center">LSTM</th>
<th valign="middle" colspan="3" align="center">GRU</th>
</tr>
<tr>
<th valign="middle" align="center">
<italic>RMSE</italic>
</th>
<th valign="middle" align="center">
<italic>MAE</italic>
</th>
<th valign="middle" align="center">
<italic>R<sup>2</sup>
</italic>
</th>
<th valign="middle" align="center">
<italic>RMSE</italic>
</th>
<th valign="middle" align="center">
<italic>MAE</italic>
</th>
<th valign="middle" align="center">
<italic>R<sup>2</sup>
</italic>
</th>
<th valign="middle" align="center">
<italic>RMSE</italic>
</th>
<th valign="middle" align="center">
<italic>MAE</italic>
</th>
<th valign="middle" align="center">
<italic>R<sup>2</sup>
</italic>
</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="middle" align="center">S1</td>
<td valign="middle" align="center">1<break/>2<break/>3<break/>4<break/>5<break/>6</td>
<td valign="middle" align="center">0.117<break/>
<bold>0.116</bold>
<break/>0.116<break/>0.116<break/>0.116<break/>0.116</td>
<td valign="middle" align="center">0.085<break/>
<bold>0.084</bold>
<break/>0.084<break/>0.084<break/>0.084<break/>0.085</td>
<td valign="middle" align="center">0.776<break/>
<bold>0.779</bold>
<break/>0.779<break/>0.779<break/>0.779<break/>0.779</td>
<td valign="middle" align="center">0.119<break/>
<bold>0.118</bold>
<break/>0.118<break/>0.119<break/>0.132<break/>0.141</td>
<td valign="middle" align="center">0.086<break/>
<bold>0.085</bold>
<break/>0.085<break/>0.085<break/>0.096<break/>0.103</td>
<td valign="middle" align="center">0.769<break/>
<bold>0.772</bold>
<break/>0.771<break/>0.770<break/>0.716<break/>0.677</td>
<td valign="middle" align="center">0.118<break/>0.117<break/>0.116<break/>
<bold>0.116</bold>
<break/>0.116<break/>0.117</td>
<td valign="middle" align="center">0.085<break/>0.084<break/>0.084<break/>
<bold>0.084</bold>
<break/>0.084<break/>0.084</td>
<td valign="middle" align="center">0.773<break/>0.777<break/>0.778<break/>
<bold>0.779</bold>
<break/>0.779<break/>0.777</td>
</tr>
<tr>
<td valign="middle" align="center">S2</td>
<td valign="middle" align="center">1<break/>2<break/>3<break/>4<break/>5<break/>6</td>
<td valign="middle" align="center">0.441<break/>0.429<break/>0.428<break/>
<bold>0.427</bold>
<break/>0.616<break/>0.792</td>
<td valign="middle" align="center">0.306<break/>0.295<break/>0.295<break/>
<bold>0.292</bold>
<break/>0.518<break/>0.638</td>
<td valign="middle" align="center">0.777<break/>0.790<break/>0.790<break/>
<bold>0.791</bold>
<break/>0.567<break/>0.283</td>
<td valign="middle" align="center">0.437<break/>0.431<break/>0.431<break/>
<bold>0.431</bold>
<break/>0.937<break/>0.937</td>
<td valign="middle" align="center">0.304<break/>0.298<break/>0.295<break/>
<bold>0.293</bold>
<break/>0.737<break/>0.736</td>
<td valign="middle" align="center">0.781<break/>0.787<break/>0.787<break/>
<bold>0.787</bold>
<break/>-0.001<break/>-0.001</td>
<td valign="middle" align="center">0.435<break/>0.427<break/>
<bold>0.422</bold>
<break/>0.426<break/>0.424<break/>0.429</td>
<td valign="middle" align="center">0.302<break/>0.295<break/>
<bold>0.293</bold>
<break/>0.295<break/>0.294<break/>0.298</td>
<td valign="middle" align="center">0.783<break/>0.792<break/>
<bold>0.796</bold>
<break/>0.792<break/>0.794<break/>0.789</td>
</tr>
<tr>
<td valign="middle" align="center">S3</td>
<td valign="middle" align="center">1<break/>2<break/>3<break/>4<break/>5<break/>6</td>
<td valign="middle" align="center">0.447<break/>0.487<break/>0.436<break/>
<bold>0.421</bold>
<break/>0.427<break/>0.584</td>
<td valign="middle" align="center">0.308<break/>0.379<break/>0.297<break/>
<bold>0.288</bold>
<break/>0.291<break/>0.434</td>
<td valign="middle" align="center">0.777<break/>0.736<break/>0.788<break/>
<bold>0.802</bold>
<break/>0.797<break/>0.620</td>
<td valign="middle" align="center">0.442<break/>0.430<break/>
<bold>0.423</bold>
<break/>0.431<break/>0.943<break/>0.932</td>
<td valign="middle" align="center">0.305<break/>0.295<break/>
<bold>0.289</bold>
<break/>0.291<break/>0.739<break/>0.734</td>
<td valign="middle" align="center">0.782<break/>0.793<break/>
<bold>0.800</bold>
<break/>0.793<break/>0.010<break/>0.032</td>
<td valign="middle" align="center">0.439<break/>0.423<break/>
<bold>0.420</bold>
<break/>0.421<break/>0.422<break/>0.421</td>
<td valign="middle" align="center">0.302<break/>0.291<break/>0.289<break/>
<bold>0.288</bold>
<break/>0.289<break/>0.290</td>
<td valign="middle" align="center">0.785<break/>0.800<break/>
<bold>0.803</bold>
<break/>0.802<break/>0.801<break/>0.802</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>Bold: Black font is the optimal result.</p>
</table-wrap-foot>
</table-wrap>
<p>It can be seen from the results that it is not that the more layers of the model, the better the prediction performance. In S1, RNN and LSTM have the best SWH prediction performance when the number of layers is set to 2, but GRU can also achieve the same prediction performance when the number of layers is set to 4. In S2, RNN and LSTM have the best SWH prediction performance when the number of layers is set to 4, and the SWH prediction performance of RNN is slightly better than that of LSTM; GRU can achieve the best SWH performance when the number of layers is set to 3 and is better than RNN, LSTM. In S3, the prediction effect of GRU is the best when the number of layers is set to 3. To sum up, when building a model to predict SWH, it is necessary to set a reasonable number of layers, to achieve the goal of low calculation amount and good prediction effect.</p>
</sec>
<sec id="s3_4">
<label>3.4</label>
<title>Experiments on typhoon cases to test the performance of SWH prediction</title>
<p>Typhoon Rai (No. 2122) formed on December 13, 2021, in the low latitudes of the Northwest Pacific Ocean. It developed slowly at the beginning, and then experienced rapid intensification twice in the coastal waters east of the Philippines and after crossing the Philippines and entering the South China Sea, reaching the level of a super typhoon identified by the Central Meteorological Observatory. On December 21, it gradually weakened and dissipated. Typhoon &#x201c;Rai&#x201d; became the strongest super typhoon affecting the South China Sea since meteorological records began.</p>
<p>To better study the prediction ability of SWH under extreme environments, we selected the period from December 1, 2021, to December 31, 2021, at S1, S2, and S3 for the experiment. When the initial learning rate is 0.001, the hidden size is 128, the number of layers is 3, the input length is 24h, and the prediction length is 1h, 6h, 12h, and 24h respectively, we comprehensively analyze the advantages and disadvantages of SWH prediction by different methods in the extreme environment by analyzing the prediction and real comparison curve and the error value curve.</p>
<p>It can be seen from <xref ref-type="fig" rid="f6">
<bold>Figure&#xa0;6</bold>
</xref> that the three methods of RNN, LSTM, and GRU all showed better prediction performance in the 1h short-term SWH prediction. Specifically, at the peak of SWH, the RNN method can fit well, while LSTM and GRU will predict too high or too low. In this respect, the SWH prediction effect of RNN is slightly better than that of LSTM and GRU. As the prediction time increases, the prediction performance of the three methods starts to decrease. It can be seen from <xref ref-type="fig" rid="f7">
<bold>Figure&#xa0;7</bold>
</xref> that during the 6h SWH prediction process, in S1, the prediction of LSTM at the peak is greater than the actual value. In S2, a similar situation also occurs, but this situation does not exist in S3. The possible reason is data features that lack similar variation features in the training set. As can be seen from <xref ref-type="fig" rid="f8">
<bold>Figure&#xa0;8</bold>
</xref>, in the 12h SWH prediction, LSTM still has the situation mentioned above. We have reason to think that this is caused by the lack of data with similar changing characteristics in the training set. With the further increase of the forecast time, it can be seen from <xref ref-type="fig" rid="f9">
<bold>Figure&#xa0;9</bold>
</xref> that the SWH prediction results of the three methods are compared with the true value, and the 24h prediction results have an obvious trend of lagging, and there will be a certain prediction error at the peak. However, the overall trend is still roughly the same, indicating that RNN, LSTM, and GRU still have certain reference values for the prediction of long-term SWH in extreme environments.</p>
<fig id="f6" position="float">
<label>Figure&#xa0;6</label>
<caption>
<p>1h SWH prediction performance of RNN, LSTM, and GRU at S1, S2, and S3. <bold>(A)</bold> The 1h SWH prediction performance of RNN, LSTM, and GRU at the S1 station. <bold>(B)</bold> The 1h SWH prediction performance of RNN, LSTM, and GRU at the S2 station. <bold>(C)</bold> The 1h SWH prediction performance of RNN, LSTM, and GRU at the S3 station.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-09-1113788-g006.tif"/>
</fig>
<fig id="f7" position="float">
<label>Figure&#xa0;7</label>
<caption>
<p>6h SWH prediction performance of RNN, LSTM, and GRU at S1, S2, and S3. <bold>(A)</bold> The 6h SWH prediction performance of RNN, LSTM, and GRU at the S1 station. <bold>(B)</bold> The 6h SWH prediction performance of RNN, LSTM, and GRU at the S2 station. <bold>(C)</bold> The 6h SWH prediction performance of RNN, LSTM, and GRU at the S3 station.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-09-1113788-g007.tif"/>
</fig>
<fig id="f8" position="float">
<label>Figure&#xa0;8</label>
<caption>
<p>12h SWH prediction performance of RNN, LSTM, and GRU at S1, S2, and S3. <bold>(A)</bold> The 12h SWH prediction performance of RNN, LSTM, and GRU at the S1 station. <bold>(B)</bold> The 12h SWH prediction performance of RNN, LSTM, and GRU at the S2 station. <bold>(C)</bold> The 12h SWH prediction performance of RNN, LSTM, and GRU at the S3 station.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-09-1113788-g008.tif"/>
</fig>
<fig id="f9" position="float">
<label>Figure&#xa0;9</label>
<caption>
<p>24h SWH prediction performance of RNN, LSTM, and GRU at S1, S2, and S3. <bold>(A)</bold> The 24h SWH prediction performance of RNN, LSTM, and GRU at the S1 station. <bold>(B)</bold> The 24h SWH prediction performance of RNN, LSTM, and GRU at the S2 station. <bold>(C)</bold> The 24h SWH prediction performance of RNN, LSTM, and GRU at the S3 station.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-09-1113788-g009.tif"/>
</fig>
</sec>
</sec>
<sec id="s4" sec-type="conclusions">
<label>4</label>
<title>Conclusions</title>
<p>Predicting SWH is conducive to understanding the sea wave conditions in advance, effectively improving the safety of marine activities, and reducing the occurrence of marine accidents, which is of great significance to the development of national security and the marine economy. This paper comprehensively analyzes and discusses the SWH prediction performance of RNN, LSTM, and GRU. The main findings of this paper are as follows:</p>
<list list-type="simple">
<list-item>
<p>(1) The input length has an influence on the prediction results, but it does not mean that the longer the input length, the better the prediction performance. As the input length increases, the SWH prediction performance of the model tends to decrease.</p>
</list-item>
<list-item>
<p>(2) The prediction length has an influence on the prediction results. As the prediction length increases, the prediction performance gradually decreases. But in the 48h SWH prediction, the prediction performance of RNN is the worst, which is not suitable for long-term SWH prediction.</p>
</list-item>
<list-item>
<p>(3) The more layers of the model, the better the SWH prediction performance is not necessarily. When establishing an SWH prediction model, it is necessary to set the number of layers reasonably to achieve the purpose of less calculation and better prediction performance.</p>
</list-item>
</list>
<p>Deep learning methods have achieved good results in predicting SWH experiments. But there are also certain defects: (1) It is like a &#x201c;black box&#x201d;. The inference mechanism between input and output is unclear. (2) These methods focus on time series research the response to spatial changes is not obvious. In future work, we will focus on the interpretability of the model and generalize the research scope from a single point to a region.</p>
</sec>
<sec id="s5" sec-type="data-availability">
<title>Data availability statement</title>
<p>The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="s6" sec-type="author-contributions">
<title>Author contributions</title>
<p>PH designed the experiment. YG collected and analyzed the data. SL provided a critical revision of the article. All authors contributed to the article and approved the submitted version.</p>
</sec>
</body>
<back>
<sec id="s7" sec-type="funding-information">
<title>Funding</title>
<p>This work was supported by the National Natural Science Foundation of China (Grant Number 41876003).</p>
</sec>
<ack>
<title>Acknowledgments</title>
<p>We would like to acknowledge the organizations that provided the sources of the data used in this work - namely, the European Centre for Medium-Range Weather Forecasts. We also thank the HPC Center of Zhejiang University (Zhoushan Campus) for computational support.</p>
</ack>
<sec id="s8" sec-type="COI-statement">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="s9" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Berbi&#x107;</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Ocvirk</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Carevi&#x107;</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Lon&#x10d;ar</surname> <given-names>G.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Application of neural networks and support vector machine for significant wave height prediction</article-title>. <source>Oceanologia</source> <volume>59</volume> (<issue>3</issue>), <fpage>331</fpage>&#x2013;<lpage>349</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.oceano.2017.03.007</pub-id>
</citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bethel</surname> <given-names>B. J.</given-names>
</name>
<name>
<surname>Sun</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Dong</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>D.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Forecasting hurricane-forced significant wave heights using a long short-term memory network in the Caribbean Sea</article-title>. <source>Ocean Sci.</source> <volume>18</volume> (<issue>2</issue>), <fpage>419</fpage>&#x2013;<lpage>436</lpage>. doi: <pub-id pub-id-type="doi">10.5194/os-18-419-2022</pub-id>
</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Callens</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Morichon</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Abadie</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Delpey</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Liquet</surname> <given-names>B.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Using random forest and gradient boosting trees to improve wave forecast at a specific location</article-title>. <source>Appl. Ocean Res.</source> <volume>104</volume>, <fpage>102339</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.apor.2020.102339</pub-id>
</citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Huang</surname> <given-names>W.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Spatial&#x2013;temporal convolutional gated recurrent unit network for significant wave height estimation from shipborne marine radar data</article-title>. <source>IEEE Trans. Geosci. Remote Sens.</source> <volume>60</volume>, <fpage>1</fpage>&#x2013;<lpage>11</lpage>. doi: <pub-id pub-id-type="doi">10.1109/TGRS.2020.3034752</pub-id>
</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Deshmukh</surname> <given-names>A. N.</given-names>
</name>
<name>
<surname>Deo</surname> <given-names>M. C.</given-names>
</name>
<name>
<surname>Bhaskaran</surname> <given-names>P. K.</given-names>
</name>
<name>
<surname>Nair</surname> <given-names>T. M. B.</given-names>
</name>
<name>
<surname>Sandhya</surname> <given-names>K. G.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Neural-Network-Based data assimilation to improve numerical ocean wave forecast</article-title>. <source>IEEE J. Ocean. Eng.</source> <volume>41</volume> (<issue>4</issue>), <fpage>944</fpage>&#x2013;<lpage>953</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1109/joe.2016.2521222</pub-id>
</citation>
</ref>
<ref id="B6">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Dey</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Salem</surname> <given-names>F. M.</given-names>
</name>
</person-group> (<year>2017</year>). &#x201c;<article-title>Gate-variants of gated recurrent unit (GRU) neural networks</article-title>,&#x201d; in <conf-name>2017 IEEE 60th international midwest symposium on circuits and systems (MWSCAS)</conf-name>. <fpage>1597</fpage>&#x2013;<lpage>1600</lpage> (IEEE).</citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Emmanouil</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Aguilar</surname> <given-names>S. G.</given-names>
</name>
<name>
<surname>Nane</surname> <given-names>G. F.</given-names>
</name>
<name>
<surname>Schouten</surname> <given-names>J.-J.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Statistical models for improving significant wave height predictions in offshore operations</article-title>. <source>Ocean Eng.</source> <volume>206</volume>, <fpage>107249</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.oceaneng.2020.107249</pub-id>
</citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fan</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Xiao</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Dong</surname> <given-names>S.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>A novel model to predict significant wave height based on long short-term memory network</article-title>. <source>Ocean Eng.</source> <volume>205</volume>, <elocation-id>107298</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.oceaneng.2020.107298</pub-id>
</citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gao</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Huang</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Bi</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Bai</surname> <given-names>Z.</given-names>
</name>
</person-group> (<year>2021</year>b). <article-title>A forecasting model for wave heights based on a long short-term memory neural network</article-title>. <source>Acta Oceanol. Sin.</source> <volume>40</volume> (<issue>1</issue>), <fpage>62</fpage>&#x2013;<lpage>69</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/s13131-020-1680-3</pub-id>
</citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gao</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Meng</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Jia</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Fan</surname> <given-names>C.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Estimating significant wave height from SAR imagery based on an SVM regression model</article-title>. <source>Acta Oceanol. Sin.</source> <volume>37</volume> (<issue>3</issue>), <fpage>103</fpage>&#x2013;<lpage>110</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s13131-018-1203-7</pub-id>
</citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gao</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Ma</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Dong</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>Q.</given-names>
</name>
<name>
<surname>Zang</surname> <given-names>J.</given-names>
</name>
</person-group> (<year>2021</year>a). <article-title>Investigation on the effects of Bragg reflection on harbor oscillations</article-title>. <source>Coast. Eng.</source> <volume>170</volume>, <fpage>103977</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.coastaleng.2021.103977</pub-id>
</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gao</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Ma</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Zang</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Dong</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Ma</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Zhu</surname> <given-names>Y.</given-names>
</name>
<etal/>
</person-group>. (<year>2020</year>). <article-title>Numerical investigation of harbor oscillations induced by focused transient wave groups</article-title>. <source>Coast. Eng.</source> <volume>158</volume>, <fpage>103670</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.coastaleng.2020.103670</pub-id>
</citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gopinath</surname> <given-names>D. I.</given-names>
</name>
<name>
<surname>Dwarakish</surname> <given-names>G. S.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Real-time prediction of waves using neural networks trained by particle swarm optimization</article-title>. <source>Int. J. Ocean Climate Syst.</source> <volume>7</volume> (<issue>2</issue>), <fpage>70</fpage>&#x2013;<lpage>79</lpage>. doi: <pub-id pub-id-type="doi">10.1177/1759313116642896</pub-id>
</citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Group</surname> <given-names>T. W.</given-names>
</name>
</person-group> (<year>1988</year>). <article-title>The WAM model&#x2013;a third generation ocean wave prediction model</article-title>. <source>J. Phys. Oceanogr.</source> <volume>18</volume> (<issue>12</issue>), <fpage>1775</fpage>&#x2013;<lpage>1810</lpage>. doi: <pub-id pub-id-type="doi">10.1175/1520-0485(1988)018&lt;1775:TWMTGO&gt;2.0.CO;2</pub-id>
</citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Huang</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Dong</surname> <given-names>S.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Improved short-term prediction of significant wave height by decomposing deterministic and stochastic components</article-title>. <source>Renewable Energy</source> <volume>177</volume>, <fpage>743</fpage>&#x2013;<lpage>758</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.renene.2021.06.008</pub-id>
</citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>J&#xf6;rges</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Berkenbrink</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Stumpe</surname> <given-names>B.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Prediction and reconstruction of ocean wave heights based on bathymetric data using LSTM neural networks</article-title>. <source>Ocean Eng.</source> <volume>232</volume>, <fpage>109046</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.oceaneng.2021.109046</pub-id>
</citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kazeminezhad</surname> <given-names>M. H.</given-names>
</name>
<name>
<surname>Siadatmousavi</surname> <given-names>S. M.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Performance evaluation of WAVEWATCH III model in the Persian gulf using different wind resources</article-title>. <source>Ocean Dyn.</source> <volume>67</volume> (<issue>7</issue>), <fpage>839</fpage>&#x2013;<lpage>855</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/s10236-017-1063-2</pub-id>
</citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kingma</surname> <given-names>D. P.</given-names>
</name>
<name>
<surname>Ba</surname> <given-names>J.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Adam: A method for stochastic optimization</article-title>. <source>arXiv preprint arXiv</source> <volume>1412</volume>, <fpage>6980</fpage>.</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liang</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Gao</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Shao</surname> <given-names>Z.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Characteristics of global waves based on the third-generation wave model SWAN</article-title>. <source>Mar. Structures</source> <volume>64</volume>, <fpage>35</fpage>&#x2013;<lpage>53</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.marstruc.2018.10.011</pub-id>
</citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Cheung</surname> <given-names>K. F.</given-names>
</name>
<name>
<surname>Cross</surname> <given-names>P.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Numerical wave modeling for operational and survival analyses of wave energy converters at the US navy wave energy test site in Hawaii</article-title>. <source>Renewable Energy</source> <volume>161</volume>, <fpage>240</fpage>&#x2013;<lpage>256</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.renene.2020.06.089</pub-id>
</citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Hao</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Yu</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Wu</surname> <given-names>G.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>CLTS-net: A more accurate and universal method for the long-term prediction of significant wave height</article-title>. <source>J. Mar. Sci. Eng.</source> <volume>9</volume> (<issue>12</issue>), <fpage>1464</fpage>. doi: <pub-id pub-id-type="doi">10.3390/jmse9121464</pub-id>
</citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname> <given-names>Q.</given-names>
</name>
<name>
<surname>Rogers</surname> <given-names>W. E.</given-names>
</name>
<name>
<surname>Babanin</surname> <given-names>A. V.</given-names>
</name>
<name>
<surname>Young</surname> <given-names>I. R.</given-names>
</name>
<name>
<surname>Romero</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Zieger</surname> <given-names>S.</given-names>
</name>
<etal/>
</person-group>. (<year>2019</year>). <article-title>Observation-based source terms in the third-generation wave model WAVEWATCH III: Updates and verification</article-title>. <source>J. Phys. Oceanogr.</source> <volume>49</volume> (<issue>2</issue>), <fpage>489</fpage>&#x2013;<lpage>517</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1175/jpo-d-18-0137.1</pub-id>
</citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Miky</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Kaloop</surname> <given-names>M. R.</given-names>
</name>
<name>
<surname>Elnabwy</surname> <given-names>M. T.</given-names>
</name>
<name>
<surname>Baik</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Alshouny</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>A recurrent-Cascade-Neural network- nonlinear autoregressive networks with exogenous inputs (NARX) approach for long-term time-series prediction of wave height based on wave characteristics measurements</article-title>. <source>Ocean Eng.</source> <volume>240</volume>, <elocation-id>109958</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.oceaneng.2021.109958</pub-id>
</citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ni</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Ma</surname> <given-names>X.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>An integrated long-short term memory algorithm for predicting polar westerlies wave height</article-title>. <source>Ocean Eng.</source> <volume>215</volume>, <elocation-id>107715</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.oceaneng.2020.107715</pub-id>
</citation>
</ref>
<ref id="B25">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Paszke</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Gross</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Massa</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Lerer</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Bradbury</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Chanan</surname> <given-names>G.</given-names>
</name>
<etal/>
</person-group>. (<year>2019</year>). &#x201c;<article-title>Pytorch: An imperative style, high-performance deep learning library</article-title>,&#x201d; in <source>Advances in neural information processing systems</source>, <fpage>32</fpage>.</citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Peres</surname> <given-names>D. J.</given-names>
</name>
<name>
<surname>Iuppa</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Cavallaro</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Cancelliere</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Foti</surname> <given-names>E.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Significant wave height record extension by neural networks and reanalysis wind data</article-title>. <source>Ocean Model.</source> <volume>94</volume>, <fpage>128</fpage>&#x2013;<lpage>140</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.ocemod.2015.08.002</pub-id>
</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sadeghifar</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Nouri Motlagh</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Torabi Azad</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Mohammad Mahdizadeh</surname> <given-names>M.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Coastal wave height prediction using recurrent neural networks (RNNs) in the south Caspian Sea</article-title>. <source>Mar. Geodesy</source> <volume>40</volume> (<issue>6</issue>), <fpage>454</fpage>&#x2013;<lpage>465</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1080/01490419.2017.1359220</pub-id>
</citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Swain</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Umesh</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Balchand</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>WAM and WAVEWATCH-III intercomparison studies in the north Indian ocean using oceansat-2 scatterometer winds</article-title>. <source>J. Ocean Climate</source> <volume>9</volume>, <fpage>2516019219866569</fpage>. doi: <pub-id pub-id-type="doi">10.1177/2516019219866569</pub-id>
</citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Umesh</surname> <given-names>P. A.</given-names>
</name>
<name>
<surname>Swain</surname> <given-names>J.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Inter-comparisons of SWAN hindcasts using boundary conditions from WAM and WWIII for northwest and northeast coasts of India</article-title>. <source>Ocean Eng.</source> <volume>156</volume>, <fpage>523</fpage>&#x2013;<lpage>549</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.oceaneng.2018.03.029</pub-id>
</citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vanem</surname> <given-names>E.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Joint statistical models for significant wave height and wave period in a changing climate</article-title>. <source>Mar. Structures</source> <volume>49</volume>, <fpage>180</fpage>&#x2013;<lpage>205</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.marstruc.2016.06.001</pub-id>
</citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname> <given-names>D.-P.</given-names>
</name>
<name>
<surname>Oey</surname> <given-names>L.-Y.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Hindcast of waves and currents in hurricane Katrina</article-title>. <source>Bull. Am. Meteorol. Soc.</source> <volume>89</volume> (<issue>4</issue>), <fpage>487</fpage>&#x2013;<lpage>496</lpage>. doi: <pub-id pub-id-type="doi">10.1175/BAMS-89-4-487</pub-id>
</citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Young</surname> <given-names>I. R.</given-names>
</name>
<name>
<surname>Ribal</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Multiplatform evaluation of global trends in wind speed and wave height</article-title>. <source>Science</source> <volume>364</volume> (<issue>6440</issue>), <fpage>548</fpage>&#x2013;<lpage>552</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1126/science.aav9527</pub-id>
</citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yu</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Si</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Hu</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>J.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>A review of recurrent neural networks: LSTM cells and network architectures</article-title>. <source>Neural Comput.</source> <volume>31</volume> (<issue>7</issue>), <fpage>1235</fpage>&#x2013;<lpage>1270</lpage>. doi: <pub-id pub-id-type="doi">10.1162/neco_a_01199</pub-id>
</citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zaremba</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Sutskever</surname> <given-names>I.</given-names>
</name>
<name>
<surname>Vinyals</surname> <given-names>O.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Recurrent neural network regularization</article-title>. <source>arXiv preprint arXiv</source> <volume>2329</volume> (<issue>2014</issue>), <fpage>1409</fpage>.</citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Gao</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Ren</surname> <given-names>P.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Ocean wave height series prediction with numerical long short-term memory</article-title>. <source>J. Mar. Sci. Eng.</source> <volume>9</volume> (<issue>5</issue>), <elocation-id>514</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.3390/jmse9050514</pub-id>
</citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhou</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Xie</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Lu</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Zhou</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Hui</surname> <given-names>N.</given-names>
</name>
<etal/>
</person-group>. (<year>2021</year>). <article-title>ConvLSTM-based wave forecasts in the south and East China seas</article-title>. <source>Front. Mar. Sci.</source> <volume>8</volume>. doi:&#xa0;<pub-id pub-id-type="doi">10.3389/fmars.2021.680079</pub-id>
</citation>
</ref>
</ref-list>
</back>
</article>