<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Water</journal-id>
<journal-title>Frontiers in Water</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Water</abbrev-journal-title>
<issn pub-type="epub">2624-9375</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/frwa.2023.1237592</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Water</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Enhancing water use efficiency in precision irrigation: data-driven approaches for addressing data gaps in time series</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Zeynoddin</surname> <given-names>Mohammad</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/2347736/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Gumiere</surname> <given-names>Silvio Jos&#x000E9;</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/751981/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Bonakdari</surname> <given-names>Hossein</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Soils and Agri-Food Engineering, Universit&#x000E9; Laval</institution>, <addr-line>Qu&#x000E9;bec City, QC</addr-line>, <country>Canada</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Civil Engineering, University of Ottawa</institution>, <addr-line>Ottawa, ON</addr-line>, <country>Canada</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Georgia A. Papacharalampous, National Technical University of Athens, Greece</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Francesco Granata, University of Cassino, Italy; Stelian Curceac, Karlsruhe Institute of Technology (KIT), Germany</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Silvio Jos&#x000E9; Gumiere <email>Silvio-Jose.Gumiere&#x00040;fsaa.ulaval.ca</email></corresp>
</author-notes>
<pub-date pub-type="epub">
<day>22</day>
<month>08</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>5</volume>
<elocation-id>1237592</elocation-id>
<history>
<date date-type="received">
<day>09</day>
<month>06</month>
<year>2023</year>
</date>
<date date-type="accepted">
<day>31</day>
<month>07</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2023 Zeynoddin, Gumiere and Bonakdari.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Zeynoddin, Gumiere and Bonakdari</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license> </permissions>
<abstract>
<p>Real-time soil matric potential (SMP) measurements are currently used in precision irrigation to determine water availability in potato production. Managing irrigation based on SMP is well known to increase water use efficiency and reduce the environmental impact of crop production. Yet SMP monitoring presents challenges and sometimes leaves gaps in the collected data. This research sought to address these gaps in SMP time series. Using meteorological and field measurements, we developed a filtering and imputation algorithm that implements three prominent predictive models to estimate missing values. Over 2 months, we gathered hourly SMP values from a field north of the P&#x000E9;ribonka River in Lac-Saint-Jean, Qu&#x000E9;bec, Canada. Our study evaluated various input combinations: meteorological data only, SMP measurements only, or a mix of both. The Extreme Learning Machine (ELM) model proved the most effective of the tested models, outperforming the <italic>k</italic>-Nearest Neighbors (<italic>k</italic>NN) model and the Evolutionary Optimized Inverse Distance Method (<italic>ga</italic>IDW). The ELM model, with five inputs comprising SMP measurements, achieved a correlation coefficient of 0.992, a root-mean-square error of 0.164 cm, a mean absolute error of 0.122 cm, and a Nash-Sutcliffe efficiency of 0.983. The ELM model requires at least five inputs to achieve the best results in the study context. These can be meteorological inputs such as relative humidity and dew temperature, land inputs, or a combination of both. The results were within 5% of the best-performing input combination we identified earlier. To mitigate the computational demands of these models, a quicker baseline model can be used for initial input filtering. With this method, we expect the output from simpler models such as <italic>ga</italic>IDW and <italic>k</italic>NN to vary by no more than 20%. Nevertheless, this discrepancy can be efficiently managed by leveraging more sophisticated models.</p></abstract>
<kwd-group>
<kwd>imputation</kwd>
<kwd>machine learning</kwd>
<kwd>modeling</kwd>
<kwd>hydro-informatics</kwd>
<kwd>soil matric potential</kwd>
<kwd>water management</kwd>
</kwd-group>
<counts>
<fig-count count="9"/>
<table-count count="0"/>
<equation-count count="8"/>
<ref-count count="54"/>
<page-count count="13"/>
<word-count count="8088"/>
</counts>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Water and Hydrocomplexity</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>1. Introduction</title>
<p>Water scarcity continues to be a significant barrier to agricultural productivity. Enhancing the efficiency of water use in agriculture is a crucial challenge to achieve higher crop yields (Molden et al., <xref ref-type="bibr" rid="B40">2010</xref>; Chen et al., <xref ref-type="bibr" rid="B15">2018</xref>; Matteau et al., <xref ref-type="bibr" rid="B37">2021</xref>). Despite agriculture being the primary consumer of the Earth&#x00027;s freshwater, competition from other sectors is intensifying. Governments frequently advocate for improved irrigation efficiency to balance this demand (FAO, <xref ref-type="bibr" rid="B25">2008</xref>).</p>
<p>Real-time Soil Matric Potential (SMP) measurements are essential in this context. Irrigation management strategies based on SMP can enhance water productivity and increase yield (Matteau et al., <xref ref-type="bibr" rid="B37">2021</xref>, <xref ref-type="bibr" rid="B38">2022b</xref>). However, optimal SMP-based irrigation thresholds vary among crops (Rekika et al., <xref ref-type="bibr" rid="B43">2014</xref>; L&#x000E9;tourneau et al., <xref ref-type="bibr" rid="B33">2015</xref>; P&#x000E9;riard et al., <xref ref-type="bibr" rid="B42">2015</xref>). Consequently, irrigation strategies tailored to crop-specific SMP ranges can help prevent over-irrigation or water deficiency (Matteau et al., <xref ref-type="bibr" rid="B39">2022a</xref>). Yet real-time SMP measurements can generate unstructured and messy data that impair decision-making and predictive modeling performance (Kamilaris and Prenafeta-Bold&#x000FA;, <xref ref-type="bibr" rid="B31">2018</xref>). These unstructured and missing data (UMD) impede the effective implementation of data-driven methods, like precision agriculture and machine learning, leading to less-than-optimal farming strategies and inefficient resource utilization. This issue is especially pressing given the need for sustainable farming practices to meet increasing global food demand while minimizing environmental impacts (Godfray et al., <xref ref-type="bibr" rid="B28">2010</xref>).</p>
<p>UMD, often a result of equipment failure, data entry errors, or data loss, is another pressing issue, particularly given that measurements are taken under conditions heavily influenced by natural environmental circumstances (Di Piazza, <xref ref-type="bibr" rid="B18">2011</xref>; Fountas et al., <xref ref-type="bibr" rid="B26">2015</xref>; Wolfert et al., <xref ref-type="bibr" rid="B48">2017</xref>; Bleidorn et al., <xref ref-type="bibr" rid="B9">2022</xref>). Dealing with missing data effectively reduces estimation bias and improves parameter accuracy (Rouzinov and Berchtold, <xref ref-type="bibr" rid="B44">2022</xref>).</p>
<p>Several strategies exist to address UMD. The primary strategy is deletion when data are missing at random and unrelated to the variable itself (Allison, <xref ref-type="bibr" rid="B1">2003</xref>) or the missing rate is &#x0003C; 5% (Dong and Peng, <xref ref-type="bibr" rid="B19">2013</xref>). However, when the missing rate surpasses 10%, biased results are likely (Bennett, <xref ref-type="bibr" rid="B6">2001</xref>). In such situations, statistical and artificial estimation methods prove useful. For instance, in environmental research, it is often assumed that neighboring sites can significantly contribute to data reconstruction (Cheng and Lu, <xref ref-type="bibr" rid="B16">2017</xref>). Therefore, spatial and temporal interpolation methods are commonly used (Tonini et al., <xref ref-type="bibr" rid="B46">2016</xref>; Tipton et al., <xref ref-type="bibr" rid="B45">2017</xref>).</p>
<p>Notably, Inverse Distance Weighting (IDW), kriging, and cokriging methods, along with their variants, are widely adopted for this purpose (Eskelson et al., <xref ref-type="bibr" rid="B23">2009</xref>; Bhattacharjee et al., <xref ref-type="bibr" rid="B7">2014</xref>). Among them, optimized IDW variants such as harmony search-IDW, genetic algorithm (<italic>ga</italic>)-IDW, and particle swarm-IDW are the most reliable for estimating missing environmental data (Chang et al., <xref ref-type="bibr" rid="B14">2005</xref>; Gholipour et al., <xref ref-type="bibr" rid="B27">2013</xref>; Li and Wang, <xref ref-type="bibr" rid="B34">2013</xref>; Barbulescu et al., <xref ref-type="bibr" rid="B4">2020</xref>). For example, B&#x000E1;rbulescu et al. (<xref ref-type="bibr" rid="B5">2021</xref>) affirmed the reliability of <italic>ga</italic>IDW, demonstrating that its accuracy surpassed that of other methods in 70% of study cases.</p>
<p>Other statistical methods for imputation include seasonal and nonseasonal autoregressive integrated moving average models (Yozgatligil et al., <xref ref-type="bibr" rid="B50">2013</xref>), autoregressive models with exogenous variables (Bidwell, <xref ref-type="bibr" rid="B8">2005</xref>), principal component regression, and maximum likelihood methods (Enders and Bandalos, <xref ref-type="bibr" rid="B22">2001</xref>). The <italic>k</italic>-Nearest Neighbors (<italic>k</italic>NN) model, a popular statistical space-based model, has proven robust, reliable, and structurally simple, and it surpasses many average-based methods in dealing with missing values (Troyanskaya et al., <xref ref-type="bibr" rid="B47">2001</xref>; Kim et al., <xref ref-type="bibr" rid="B32">2004</xref>; Cordeiro et al., <xref ref-type="bibr" rid="B17">2022</xref>).</p>
<p>In artificial intelligence (AI), many methods have been developed for imputing missing data. Artificial Neural Networks (ANNs), evolutionary polynomial regression, vector autoregressive imputation methods, and complex deep learning models like recurrent neural networks, convolutional neural networks, and long short-term memory models (Zhou and Zhang, <xref ref-type="bibr" rid="B54">2022</xref>) are standard AI methods used in this field. Among these, the Extreme Learning Machine (ELM) stands out for its simple structure, ease of parameter tuning, fast training process, and better scalability and generalizability than other AI methods, such as support vector machines. It can generate accurate results with minimal data (Huang et al., <xref ref-type="bibr" rid="B30">2012</xref>; Huang, <xref ref-type="bibr" rid="B29">2015</xref>; Evans et al., <xref ref-type="bibr" rid="B24">2020</xref>).</p>
<p>Given the significance of SMP in agricultural practices and the imperative of managing UMD, this research seeks to develop and compare models and an algorithm that chooses optimal inputs for modeling UMD. We intend to review different inputs used in deterministic and AI methods and interpret the results of our algorithm, which assesses and selects the best input-model combination. Considering the extensive literature review, features of the models, and comprehensive research on optimal input selection, <italic>ga</italic>IDW, <italic>k</italic>NN, and ELM were chosen for imputing the SMP dataset in this study. These models are fast, widely used, and have demonstrated superior accuracy. Notably, apart from <italic>k</italic>NN, which was used in Cordeiro et al. (<xref ref-type="bibr" rid="B17">2022</xref>), these methods have not previously been employed for imputing SMPs or compared against each other.</p>
</sec>
<sec sec-type="materials and methods" id="s2">
<title>2. Materials and methods</title>
<sec>
<title>2.1. Extreme learning machine</title>
<p>The feed-forward neural network (FFNN) with the integrated backpropagation (BP) training method is a popular neural network used in research due to its impressive ability to solve complex non-linear problems. This combination allows for the effective optimization of network weights and biases, non-linear mapping between input and output parameters, and the creation of flexible models, which is not achievable with traditional regression approaches. However, this method is known for its time-consuming training process, its tendency to become trapped in local minima, and its numerous configurable model parameters. These drawbacks were noted in previous studies (Bonakdari et al., <xref ref-type="bibr" rid="B10">2020a</xref>,<xref ref-type="bibr" rid="B11">b</xref>).</p>
<p>To address the limitations of the FFNN approach, the extreme learning machine (ELM) was introduced. The ELM is a training approach for single-hidden-layer FFNNs in which the hidden-neuron biases and input weights are chosen randomly and the output weights are determined by solving a linear problem. Replacing a non-linear system with a linear one accelerates ELM training. Additionally, the only tunable variable in this strategy is the number of hidden neurons (Zeynoddin et al., <xref ref-type="bibr" rid="B53">2018</xref>). The single-layer FFNN ELM formula is as follows:</p>
<disp-formula id="E1"><label>(1)</label><mml:math id="M1"><mml:mrow><mml:mi>T</mml:mi><mml:msub><mml:mi>V</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle='true'><mml:msubsup><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>h</mml:mi></mml:msubsup><mml:mrow><mml:mi>O</mml:mi><mml:mi>p</mml:mi><mml:msub><mml:mi>W</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mtext>&#x02009;</mml:mtext><mml:mi>f</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>I</mml:mi><mml:mi>p</mml:mi><mml:msub><mml:mi>W</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>.</mml:mo><mml:mi>I</mml:mi><mml:mi>N</mml:mi><mml:msub><mml:mi>V</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:mstyle><mml:mo>,</mml:mo><mml:mtext>&#x02009;</mml:mtext><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mtext>&#x02009;</mml:mtext><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mtext>&#x02009;</mml:mtext><mml:mn>3</mml:mn><mml:mo>,</mml:mo><mml:mtext>&#x02009;</mml:mtext><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:mtext>&#x02009;</mml:mtext><mml:mi>s</mml:mi></mml:mrow></mml:math></disp-formula>
<p>Here, the activation function is denoted by <italic>f(.)</italic>, <italic>OpW</italic><sub><italic>j</italic></sub> is the output weight matrix, <italic>h</italic> is the number of hidden neurons, <italic>IpW</italic><sub><italic>j</italic></sub> is the input weight matrix, <italic>b</italic> is the hidden-neuron bias, <italic>TV</italic><sub><italic>i</italic></sub> refers to the target parameter, <italic>s</italic> represents the number of input variables, and <italic>INV</italic><sub><italic>i</italic></sub> denotes the input variables. The sigmoid activation function is chosen in this investigation based on its strong performance in previous studies. The function is described as follows (Azimi et al., <xref ref-type="bibr" rid="B3">2017</xref>; Yaseen et al., <xref ref-type="bibr" rid="B49">2018</xref>; Ebtehaj et al., <xref ref-type="bibr" rid="B20">2019</xref>):</p>
<disp-formula id="E2"><label>(2)</label><mml:math id="M2"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>f</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>T</mml:mi><mml:msub><mml:mrow><mml:mi>V</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>1</mml:mn><mml:mo>&#x0002B;</mml:mo><mml:msup><mml:mrow><mml:mi>e</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mo>-</mml:mo><mml:mi>T</mml:mi><mml:mi>V</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msup></mml:mrow></mml:mfrac></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>To enhance the generalizability of the ELM model and minimize the impact of the randomly selected input weights and biases, the iterative procedure outlined by Ebtehaj et al. (<xref ref-type="bibr" rid="B21">2021</xref>) is employed in this study. One thousand iterations were used to find the optimal weights.</p>
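<p>The training scheme described above can be sketched in a few lines of Python. This is a minimal illustration, not the authors&#x00027; MATLAB implementation: the function names, the uniform range used for the random weights, the small ridge term added to the linear solve, and the rule of keeping the random draw with the lowest training RMSE are all assumptions made for the example, and the repeated-draw loop only stands in for the iterative procedure of Ebtehaj et al. (2021).</p>
<preformat><![CDATA[
```python
import math
import random


def sigmoid(x):
    """Sigmoid activation, as in Eq. (2)."""
    return 1.0 / (1.0 + math.exp(-x))


def solve_linear(a, b):
    """Solve the square system a x = b by Gaussian elimination with partial pivoting."""
    n = len(a)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        known = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - known) / m[r][r]
    return x


def hidden_outputs(inputs, in_w, bias):
    """Hidden-layer matrix: one row per sample, one column per hidden neuron."""
    return [[sigmoid(sum(w * v for w, v in zip(in_w[j], row)) + bias[j])
             for j in range(len(bias))] for row in inputs]


def train_elm(inputs, targets, n_hidden, rng, ridge=1e-6):
    """Draw input weights and biases at random, then solve for the output weights."""
    n_in = len(inputs[0])
    in_w = [[rng.uniform(-1.0, 1.0) for _ in range(n_in)] for _ in range(n_hidden)]
    bias = [rng.uniform(-1.0, 1.0) for _ in range(n_hidden)]
    h = hidden_outputs(inputs, in_w, bias)
    # Linear step (assumed form): (H^T H + ridge * I) out_w = H^T targets.
    hth = [[sum(row[p] * row[q] for row in h) + (ridge if p == q else 0.0)
            for q in range(n_hidden)] for p in range(n_hidden)]
    htt = [sum(row[p] * t for row, t in zip(h, targets)) for p in range(n_hidden)]
    return in_w, bias, solve_linear(hth, htt)


def predict_elm(model, inputs):
    """Eq. (1): weighted sum of the hidden-neuron activations."""
    in_w, bias, out_w = model
    return [sum(w * a for w, a in zip(out_w, row))
            for row in hidden_outputs(inputs, in_w, bias)]


def rmse(obs, sim):
    """Root-mean-square error between observed and simulated values."""
    return math.sqrt(sum((o - s) ** 2 for o, s in zip(obs, sim)) / len(obs))


def best_elm(inputs, targets, n_hidden, n_iter, seed=0):
    """Repeat the random draw n_iter times and keep the lowest-RMSE model."""
    rng = random.Random(seed)
    models = (train_elm(inputs, targets, n_hidden, rng) for _ in range(n_iter))
    return min(models, key=lambda m: rmse(targets, predict_elm(m, inputs)))
```
]]></preformat>
<p>Because only the output weights are fitted, each draw costs a single linear solve, which is why repeating the draw many times remains affordable. In practice the draws would more plausibly be ranked on held-out data rather than on the training samples used here.</p>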
</sec>
<sec>
<title>2.2. Optimized inverse distance weighted method</title>
<sec>
<title>2.2.1. Inverse distance weighted method</title>
<p>Missing observations most commonly result from problems with data measurement. These problems include insufficient measurement tools, site access limitations, systematic or operator-sourced errors, and costs. Researchers have developed mathematical and statistical strategies to address these flaws and to represent hydrological events with interpretable models; these strategies fall into two broad categories, deterministic and geostatistical. Deterministic techniques, such as inverse distance weighting (IDW), splines, local polynomial interpolation, radial basis functions, natural neighbors, and the Thiessen technique, are based on mathematical equations and measured points, whereas geostatistical approaches are based on statistical notions (Azari et al., <xref ref-type="bibr" rid="B2">2021</xref>). The IDW approach was used in the present investigation to impute values at measurement points with missing data. The relationship used in this method is as follows:</p>
<disp-formula id="E3"><label>(3)</label><mml:math id="M3"><mml:mrow><mml:mi>T</mml:mi><mml:msub><mml:mi>V</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mtext>&#x02009;</mml:mtext><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mstyle displaystyle='true'><mml:msubsup><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:msubsup><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mfrac><mml:mrow><mml:mi>I</mml:mi><mml:mi>N</mml:mi><mml:msub><mml:mi>V</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mtext>&#x02009;</mml:mtext><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msubsup><mml:mi>D</mml:mi><mml:mi>i</mml:mi><mml:mi>&#x003B1;</mml:mi></mml:msubsup></mml:mrow></mml:mfrac></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle displaystyle='true'><mml:msubsup><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:msubsup><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mfrac><mml:mn>1</mml:mn><mml:mrow><mml:msubsup><mml:mi>D</mml:mi><mml:mi>i</mml:mi><mml:mi>&#x003B1;</mml:mi></mml:msubsup></mml:mrow></mml:mfrac></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mstyle></mml:mrow></mml:mfrac></mml:mrow></mml:math></disp-formula>
<p>where <italic>TV</italic><sub><italic>i, j</italic></sub> is the imputed value, <italic>INV</italic><sub><italic>i, j</italic></sub> denotes the observed value of the variable at point <italic>i</italic> at time step <italic>j</italic>, <italic>D</italic><sub><italic>i</italic></sub> is the distance between the target point and point <italic>i</italic>, which has available data at the same time step, <italic>N</italic> is the length of each sample, and &#x003B1; is the weighting (power) parameter. The parameter &#x003B1; controls how strongly the available data are weighted according to their distance from the location with missing data: for any &#x003B1; greater than zero, closer sites receive greater weights than faraway sites, and the contrast grows as &#x003B1; increases. The normal range for &#x003B1; is [0, 2], where 0 yields a simple average that ignores distance and 2 gives markedly greater weights to close sites (Ly et al., <xref ref-type="bibr" rid="B36">2011</xref>; Azari et al., <xref ref-type="bibr" rid="B2">2021</xref>). Its value is generally determined by trial and error using evaluation criteria such as cross-validation. This method is limited to land data; other inputs, such as meteorological variables, cannot be used.</p>
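<p>As a small worked illustration of Eq. (3), the following Python sketch imputes one missing value from its neighbors. The function name and the convention of marking a missing neighbor with <italic>None</italic> are assumptions made for this example only.</p>
<preformat><![CDATA[
```python
def idw_impute(neighbor_values, distances, alpha):
    """Estimate a missing value as the inverse-distance-weighted mean of Eq. (3)."""
    # Keep only neighbors that have data at this time step (None marks a gap).
    pairs = [(v, d) for v, d in zip(neighbor_values, distances) if v is not None]
    if not pairs:
        raise ValueError("no neighboring point has data at this time step")
    weights = [1.0 / d ** alpha for _, d in pairs]
    weighted_sum = sum(w * v for w, (v, _) in zip(weights, pairs))
    return weighted_sum / sum(weights)
```
]]></preformat>
<p>For two neighbors with values 10 and 20 at distances 1 and 2, &#x003B1; = 2 gives weights of 1 and 0.25, so the estimate is (10 + 5)/1.25 = 12, whereas &#x003B1; = 0 reduces to the simple average of 15.</p>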
</sec>
<sec>
<title>2.2.2. Optimization process</title>
<p>In this study, a reliable evolutionary optimization approach is used to approximate the most accurate &#x003B1; by taking historical values into account. For this purpose, a genetic algorithm (<italic>ga</italic>) is used to optimize &#x003B1;. The genetic algorithm is inspired by Darwin&#x00027;s idea of evolution, in which survival is enhanced via reproduction, crossover, and gene mutation. The algorithm starts from a population of candidate solutions, analogous to natural chromosomes. Gene encoding, the initial stage of using a <italic>ga</italic>, is a technique for building decision variables equivalent to the genes in chromosomes. The <italic>ga</italic> imitates reproduction, crossover, and mutation to sustain superior solutions and create better offspring that move closer to the optimum of the objective function. This technique helps select and create a new population. The objective function ranks the generated population. Gene evolution eliminates the worst individuals and selects the individuals that better fit the objective function. Accordingly, the <italic>ga</italic> simplifies tuning of the power parameter. Different <italic>ga</italic> settings have been used to optimize the IDW method. For instance, Chang et al. (<xref ref-type="bibr" rid="B14">2005</xref>) used a population size of 20, an &#x003B1; search space of [0, 10], and 150 maximum generations. B&#x000E1;rbulescu et al. (<xref ref-type="bibr" rid="B5">2021</xref>) investigated different values of the population size [10, 80], number of generations (maximum 10), mutation rate (&#x0003C; 0.1), and crossover rate [0.6, 1]. They reported that large population sizes do not affect the model outcomes significantly. The fewest errors were generated with a population size of [35, 45], between 5 and 9 generations, a mutation rate within [0.04, 0.08], and a crossover rate within [0.6, 0.8] for different crossover/mutation methods. Based on the reported values, and with the aim of enlarging the search space, &#x003B1; is restricted in this study to positive values (lower bound 10<sup>&#x02212;5</sup>), with feasible adaptive mutation, a maximum of 100 generations, a population of 50, and a crossover rate of 0.8. The mean absolute error (MAE), mean error (ME), and root-mean-square error (RMSE) are the conventional objective functions used (Chang et al., <xref ref-type="bibr" rid="B14">2005</xref>). Accordingly, the RMSE is set as the objective function in this study.</p>
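<p>The optimization loop can be sketched as follows, using the population size, generation limit, crossover rate, and lower bound stated above. Everything else is an assumption made for illustration: the upper bound of 10, tournament selection, blend crossover, two elite survivors, and a plain Gaussian mutation that stands in for the feasible adaptive mutation operator. The function names are hypothetical, and the sketch is not the MATLAB routine used in this study.</p>
<preformat><![CDATA[
```python
import math
import random


def idw(values, distances, alpha):
    """Eq. (3): inverse-distance-weighted mean of the neighbor values."""
    weights = [1.0 / d ** alpha for d in distances]
    return sum(w * v for w, v in zip(weights, values)) / sum(weights)


def idw_rmse(alpha, history, targets, distances):
    """Objective: RMSE between observed targets and IDW estimates over past steps."""
    errors = [(t - idw(row, distances, alpha)) ** 2 for row, t in zip(history, targets)]
    return math.sqrt(sum(errors) / len(errors))


def ga_optimize_alpha(objective, lower=1e-5, upper=10.0, pop_size=50,
                      generations=100, crossover_rate=0.8, mutation_rate=0.2,
                      sigma=0.3, seed=0):
    """Search for the alpha that minimizes the objective with a simple real-coded GA."""
    rng = random.Random(seed)
    pop = [rng.uniform(lower, upper) for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(pop, key=objective)  # best individual first
        children = ranked[:2]                # elitism: keep the two best unchanged
        while len(children) < pop_size:
            # Binary tournament: the lower rank index is the fitter parent.
            p1 = ranked[min(rng.randrange(pop_size), rng.randrange(pop_size))]
            p2 = ranked[min(rng.randrange(pop_size), rng.randrange(pop_size))]
            child = p1
            if rng.random() < crossover_rate:
                child = p1 + rng.random() * (p2 - p1)  # blend crossover
            if rng.random() < mutation_rate:
                child += rng.gauss(0.0, sigma)         # Gaussian mutation
            children.append(min(max(child, lower), upper))  # stay in the bounds
        pop = children
    return min(pop, key=objective)
```
]]></preformat>
<p>A call such as <italic>ga_optimize_alpha(lambda a: idw_rmse(a, history, targets, distances))</italic> then returns the &#x003B1; with the lowest RMSE over the time steps at which the target point was observed.</p>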
</sec>
</sec>
<sec>
<title>2.3. <italic>k</italic>-nearest neighbors</title>
<p><italic>k</italic>-nearest neighbors (<italic>k</italic>NN) is a widely used and reliable data imputation technique (Troyanskaya et al., <xref ref-type="bibr" rid="B47">2001</xref>; Kim et al., <xref ref-type="bibr" rid="B32">2004</xref>; Cordeiro et al., <xref ref-type="bibr" rid="B17">2022</xref>). Its basic ideas are similar to those of IDW. In the <italic>k</italic>NN technique, missing time steps are filled in using data from the nearest neighboring site/column datasets. The Euclidean distance is used to measure how close two datasets are to one another. The values compared for each neighbor must have the same dimension and time step. The missing value is imputed using the weighted average of the <italic>k</italic>-nearest values in the relevant column. The weights are calculated by the following equation:</p>
<disp-formula id="E4"><label>(4)</label><mml:math id="M4"><mml:mrow><mml:mi>k</mml:mi><mml:msub><mml:mi>W</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mfrac><mml:mrow><mml:msub><mml:mn>1</mml:mn><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mtext>&#x02009;</mml:mtext><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>E</mml:mi><mml:msub><mml:mi>D</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mrow><mml:mrow><mml:mstyle displaystyle='true'><mml:msubsup><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:msubsup><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mfrac><mml:mn>1</mml:mn><mml:mrow><mml:mi>E</mml:mi><mml:msub><mml:mi>D</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mstyle></mml:mrow></mml:mfrac></mml:mrow></mml:math></disp-formula>
<p>where <italic>kW</italic><sub><italic>i</italic></sub> denotes the weights applied to the <italic>k</italic>-nearest column to impute the data, and <italic>ED</italic><sub><italic>i</italic></sub> is the Euclidean distance between the target datasets and the other datasets with the same time stamp. A limitation of this method is that there should be at least one row of datasets with accurate values.</p>
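<p>The weighting of Eq. (4) can be illustrated with the short Python sketch below. It assumes that each site is stored as a column of equal length with <italic>None</italic> marking a gap, and that the Euclidean distance between two columns is computed over the time steps at which both are observed; these conventions and the function name are assumptions made for the example.</p>
<preformat><![CDATA[
```python
import math


def knn_impute(target, neighbors, t, k):
    """Impute target[t] from the k nearest neighbor columns using Eq. (4) weights."""
    candidates = []
    for column in neighbors:
        if column[t] is None:
            continue  # this neighbor cannot contribute at time step t
        shared = [(a, b) for a, b in zip(target, column)
                  if a is not None and b is not None]
        if not shared:
            continue
        distance = math.sqrt(sum((a - b) ** 2 for a, b in shared))
        candidates.append((distance, column[t]))
    if not candidates:
        raise ValueError("no neighbor has a value at the requested time step")
    nearest = sorted(candidates, key=lambda c: c[0])[:k]
    exact = [value for distance, value in nearest if distance == 0.0]
    if exact:
        return sum(exact) / len(exact)  # avoid dividing by a zero distance
    inverse = [1.0 / distance for distance, _ in nearest]
    total = sum(inverse)
    return sum((w / total) * value for w, (_, value) in zip(inverse, nearest))
```
]]></preformat>
<p>For example, if two neighbors lie at Euclidean distances in the ratio 1:3 from the target column, their weights are 0.75 and 0.25, so neighbor values of 4 and 9 yield an imputed value of 5.25.</p>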
</sec>
</sec>
<sec id="s3">
<title>3. Method analysis and workflow</title>
<p>The advantage of the <italic>k</italic>NN method over the other two methods is its simplicity and the few degrees of freedom of the model. However, the main problem with the IDW and <italic>k</italic>NN methods is the uncertainty in choosing the number of neighbors or, in other words, the number of model inputs. These methods are deterministic and are less flexible than other methods when receiving inputs of different natures and establishing a meaningful relationship between these inputs and the target variable. In addition to the uncertainty of input selection, the IDW model faces uncertainty in the &#x003B1; tuning parameter. Although this problem can be solved by adding an optimization technique, the parameters of the optimizer add further complexity.</p>
<p>The ELM model is more complex than the other two models in terms of structure and applicability. This characteristic makes it much better than the other two methods at establishing a relationship between the inputs and target values. However, it also suffers from uncertainty in input selection and in the model tuning parameters, and it is computationally expensive compared to the other two methods.</p>
<p>Therefore, this research aims to compare the models and to develop an algorithm that tests different inputs to the models and, after modeling, filters out the best ones based on evaluation criteria. The possibility of using different inputs when reconstructing UMD with the deterministic methods introduced is reviewed. Finally, the results of the developed algorithm, which produces all possible combinations of inputs for the models and selects the best input and model, are interpreted and evaluated. All the models and the search algorithm are coded in the MATLAB environment. <xref ref-type="fig" rid="F1">Figure 1</xref> shows the workflow of this research.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>Flowchart of the studied methodology.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="frwa-05-1237592-g0001.tif"/>
</fig>
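<p>The exhaustive input search described in this section can be sketched in Python as follows. The sketch is only an outline of the idea, not the MATLAB code used in this study: the function names are hypothetical, the two toy models merely stand in for <italic>ga</italic>IDW, <italic>k</italic>NN, and ELM, and ranking by RMSE on the same samples is an assumption made to keep the example short.</p>
<preformat><![CDATA[
```python
import itertools
import math


def rmse(observed, simulated):
    """Root-mean-square error between observed and simulated values."""
    return math.sqrt(sum((o - s) ** 2 for o, s in zip(observed, simulated)) / len(observed))


def search_inputs(candidates, target, models):
    """Try every non-empty input subset with every model; return the best triple."""
    names = sorted(candidates)
    best = None
    for size in range(1, len(names) + 1):
        for subset in itertools.combinations(names, size):
            columns = [candidates[name] for name in subset]
            for model_name, predict in models.items():
                score = rmse(target, predict(columns))
                if best is None or score < best[0]:
                    best = (score, model_name, subset)
    return best  # (lowest RMSE, model name, tuple of selected input names)


def row_mean(columns):
    """Toy model: average the selected inputs at each time step."""
    return [sum(row) / len(row) for row in zip(*columns)]


def first_column(columns):
    """Toy model: copy the first selected input."""
    return list(columns[0])
```
]]></preformat>
<p>With <italic>n</italic> candidate inputs there are 2<sup><italic>n</italic></sup> &#x02212; 1 non-empty subsets to evaluate per model, so the cost of the search grows quickly with the number of candidates.</p>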
</sec>
<sec id="s4">
<title>4. Evaluation criteria</title>
<p>To assess the effectiveness of the models and compare their results, the models are evaluated using correlation, absolute and relative error, complexity indices, and visual metrics. The root-mean-square error (RMSE), mean absolute error (MAE), and correlation coefficient (R) are calculated accordingly. Another measure of model performance is the Nash-Sutcliffe efficiency (NSE). This normalized index assesses the variance of the residuals relative to that of a reference sample. It ranges from &#x02212;&#x0221E; to 1, with 1 being the ideal efficiency/fit. The NSE is, in general, comparable to the R index. However, it assesses the model&#x00027;s effectiveness and performance quality and reflects more precisely the desirable and undesirable aspects of the model under discussion. As a result, it is a useful metric for assessing how well a model performs in comparison to a benchmark model.</p>
<disp-formula id="E5"><label>(5)</label><mml:math id="M5"><mml:mrow><mml:mtable columnalign='left'><mml:mtr columnalign='left'><mml:mtd columnalign='left'><mml:mrow><mml:mi>R</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mo stretchy='false'>(</mml:mo><mml:mstyle displaystyle='true'><mml:msubsup><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:msubsup><mml:mrow><mml:mo stretchy='false'>(</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mi>o</mml:mi></mml:msub><mml:msub><mml:mrow></mml:mrow><mml:mi>i</mml:mi></mml:msub><mml:mo>&#x02212;</mml:mo><mml:mover accent='true'><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mi>o</mml:mi></mml:msub><mml:msub><mml:mrow></mml:mrow><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mo stretchy='true'>&#x000AF;</mml:mo></mml:mover><mml:mo stretchy='false'>)</mml:mo><mml:mo stretchy='false'>(</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:msub><mml:mrow></mml:mrow><mml:mi>i</mml:mi></mml:msub><mml:mo>&#x02212;</mml:mo><mml:mover accent='true'><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:msub><mml:mrow></mml:mrow><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mo stretchy='true'>&#x000AF;</mml:mo></mml:mover><mml:mo stretchy='false'>)</mml:mo><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:mstyle></mml:mrow><mml:mrow><mml:msqrt><mml:mrow><mml:mstyle displaystyle='true'><mml:msubsup><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:msubsup><mml:mrow><mml:msup><mml:mrow><mml:mo stretchy='false'>(</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mi>o</mml:mi></mml:msub><mml:msub><mml:mrow></mml:mrow><mml:mi>i</mml:mi></mml:msub><mml:mo>&#x02212;</mml:mo><mml:mover accent='true'><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mi>o</mml:mi></mml:msub><mml:msub><mml:mrow></mml:mrow><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mo 
stretchy='true'>&#x000AF;</mml:mo></mml:mover><mml:mo stretchy='false'>)</mml:mo></mml:mrow><mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mstyle><mml:mstyle displaystyle='true'><mml:msubsup><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:msubsup><mml:mrow><mml:msup><mml:mrow><mml:mo stretchy='false'>(</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:msub><mml:mrow></mml:mrow><mml:mi>i</mml:mi></mml:msub><mml:mo>&#x02212;</mml:mo><mml:mover accent='true'><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:msub><mml:mrow></mml:mrow><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mo stretchy='true'>&#x000AF;</mml:mo></mml:mover><mml:mo stretchy='false'>)</mml:mo></mml:mrow><mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mstyle></mml:mrow></mml:msqrt></mml:mrow></mml:mfrac></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:math></disp-formula>
<disp-formula id="E6"><label>(6)</label><mml:math id="M6"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mi>R</mml:mi><mml:mi>M</mml:mi><mml:mi>S</mml:mi><mml:mi>E</mml:mi><mml:mo>=</mml:mo><mml:msqrt><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msubsup><mml:msup><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>o</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>/</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:msqrt></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E7"><label>(7)</label><mml:math id="M7"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mi>M</mml:mi><mml:mi>A</mml:mi><mml:mi>E</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:mfrac><mml:msubsup><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msubsup><mml:mo>|</mml:mo><mml:msub><mml:mrow><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>o</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E8"><label>(8)</label><mml:math id="M8"><mml:mrow><mml:mi>N</mml:mi><mml:mi>S</mml:mi><mml:mi>E</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>&#x02212;</mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mfrac><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mstyle displaystyle='true'><mml:msubsup><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:msubsup><mml:mrow><mml:msup><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mi>o</mml:mi></mml:msub><mml:msub><mml:mrow></mml:mrow><mml:mi>i</mml:mi></mml:msub><mml:mo>&#x02212;</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:msub><mml:mrow></mml:mrow><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mstyle></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mstyle displaystyle='true'><mml:msubsup><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:msubsup><mml:mrow><mml:msup><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mi>o</mml:mi></mml:msub><mml:msub><mml:mrow></mml:mrow><mml:mi>i</mml:mi></mml:msub><mml:mo>&#x02212;</mml:mo><mml:msub><mml:mrow><mml:mover accent='true'><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mi>o</mml:mi></mml:msub></mml:mrow><mml:mo stretchy='true'>&#x000AF;</mml:mo></mml:mover></mml:mrow><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mstyle></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mfrac></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula>
<p>where <italic>T</italic><sub><italic>oi</italic></sub> is the i<sup>th</sup> observed value of the target variable, <italic>T</italic><sub><italic>mi</italic></sub> is the i<sup>th</sup> modeled value, an overbar denotes the mean of the series, and <italic>n</italic> (written <italic>N</italic> in Equation 8) is the number of observations.</p>
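<p>As a minimal sketch of Equations (5)&#x02013;(8), the indices can be computed as follows, assuming NumPy is available. The function name <monospace>fit_indices</monospace> and the toy series are illustrative assumptions and are not part of the study.</p>
<preformat><![CDATA[
```python
import numpy as np


def fit_indices(t_obs, t_mod):
    """Return R, RMSE, MAE and NSE for an observed and a modeled series."""
    t_obs = np.asarray(t_obs, dtype=float)
    t_mod = np.asarray(t_mod, dtype=float)
    resid = t_obs - t_mod
    dev_obs = t_obs - t_obs.mean()   # deviations from the observed mean
    dev_mod = t_mod - t_mod.mean()   # deviations from the modeled mean
    r = np.sum(dev_obs * dev_mod) / np.sqrt(np.sum(dev_obs**2) * np.sum(dev_mod**2))
    rmse = np.sqrt(np.mean(resid**2))
    mae = np.mean(np.abs(resid))
    nse = 1.0 - np.sum(resid**2) / np.sum(dev_obs**2)
    return {"R": r, "RMSE": rmse, "MAE": mae, "NSE": nse}


# Toy example (not study data): a model that is perfectly correlated with
# the observations but shifted upward by a constant bias of 1.
obs = np.array([1.0, 2.0, 3.0, 4.0])
biased = fit_indices(obs, obs + 1.0)
print(biased)
```
]]></preformat>
<p>In this toy case R equals 1 while the NSE is approximately 0.2, which illustrates why the NSE is reported alongside R: a constant bias leaves the correlation untouched but lowers the efficiency.</p>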
</sec>
<sec id="s5">
<title>5. Study field and measurement method</title>
<p>The study field is located north of the P&#x000E9;ribonka River, Saint-Jean Lake, Quebec Province, Canada. The measurement sites extend from 71.9992&#x000B0; to 71.9908&#x000B0; west and from 48.7437&#x000B0; to 48.7540&#x000B0; north. There are a total of 12 measurement sites in the study field. The field and measurement site locations are shown in <xref ref-type="fig" rid="F2">Figure 2</xref>. Because the records of one sensor contained a large number of missing values, that sensor was removed from the study datasets so that the models could be trained and tested consistently, leaving 11 sites (the target and ten land inputs). Along with the land measurements, four meteorological (meteo) variables were measured for the study field: wind speed (WS), relative humidity (RH), dew temperature (DT), and 2 m air temperature (AT). These values were measured on an hourly basis from July to August 2022. The statistical features of the records are shown in <xref ref-type="fig" rid="F3">Figure 3</xref>. The soil matric potential (SMP) was measured continuously with commercial tensiometers (HXM-80, Hortau Inc., L&#x000E9;vis, Qu&#x000E9;bec, Canada) connected to the same ST-4 datalogger. The tensiometers were located at the positions displayed in <xref ref-type="fig" rid="F3">Figure 3</xref> at a depth of 15 cm below the ground.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>The study field location.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="frwa-05-1237592-g0002.tif"/>
</fig>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p>Dataset statistics and representation.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="frwa-05-1237592-g0003.tif"/>
</fig>
<p>To test the methods presented in this study, an interval in which all time series had reliable values was chosen, and the time steps with missing values were removed from all time series. As a result, 678 data points remained for each time series. Seventy percent of each dataset was used for training, and the remaining thirty percent was used for model evaluation. The size of the test portion was based on the maximum rate of missing data, and the test points were selected randomly to simulate real conditions. The site with the lowest correlation with the other sites was selected as the target (T) (<xref ref-type="fig" rid="F3">Figure 3</xref>). The inputs were standardized before being used in the modeling process.</p>
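<p>The random 70/30 partition and the standardization step could be sketched as below. This is only an illustration under stated assumptions: the arrays are synthetic placeholders rather than the field records, the seed is arbitrary, and standardizing with training statistics only is our assumption, since the text does not specify which statistics were used.</p>
<preformat><![CDATA[
```python
import numpy as np

rng = np.random.default_rng(seed=0)  # arbitrary seed, for reproducibility only

n_points, n_inputs = 678, 14
X = rng.normal(size=(n_points, n_inputs))  # synthetic placeholder inputs
y = rng.normal(size=n_points)              # synthetic placeholder target

# Randomly mark 30% of the time steps as "missing" (test set);
# the remaining 70% are kept for training.
n_test = int(round(0.30 * n_points))
test_idx = rng.choice(n_points, size=n_test, replace=False)
train_mask = np.ones(n_points, dtype=bool)
train_mask[test_idx] = False

# Standardize every input column with the training mean and standard
# deviation (assumption: statistics come from the training part only).
mu = X[train_mask].mean(axis=0)
sigma = X[train_mask].std(axis=0)
X_std = (X - mu) / sigma

X_train, y_train = X_std[train_mask], y[train_mask]
X_test, y_test = X_std[~train_mask], y[~train_mask]
print(X_train.shape, X_test.shape)
```
]]></preformat>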
</sec>
<sec sec-type="results" id="s6">
<title>6. Results</title>
<p>Two general scenarios are defined for imputation, as shown in <xref ref-type="fig" rid="F1">Figure 1</xref>. In the first scenario, the models generate imputations based on all inputs, and the input combinations are filtered after modeling based on the indices. The inputs differ based on the model&#x00027;s structure and its capability of processing inputs of different natures. Therefore, <italic>k</italic>NN and the ELM modeled the target with 14 inputs, including all meteo and land measurements. On the other hand, <italic>ga</italic>IDW modeled the target only with the 10 land measurements. All non-empty combinations of these inputs were analyzed and filtered, resulting in 16,383 models (2<sup>14</sup> &#x02212; 1) for the ELM and <italic>k</italic>NN and 1,023 models (2<sup>10</sup> &#x02212; 1) for <italic>ga</italic>IDW. These tasks were performed on a computer with an Intel Core i7 processor and 16 gigabytes of RAM, resulting in 10.83 h of processing time for the ELM, 0.05 h for <italic>k</italic>NN and 8.42 h for <italic>ga</italic>IDW.</p>
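<p>The size of this search space can be checked with a short enumeration. The sketch below uses only the Python standard library; the input labels (1&#x02013;4 for meteo and 5&#x02013;14 for land inputs) are an assumed numbering used for illustration.</p>
<preformat><![CDATA[
```python
from itertools import combinations


def nonempty_subsets(items):
    """Yield every non-empty combination of the given input labels."""
    for size in range(1, len(items) + 1):
        yield from combinations(items, size)


all_inputs = list(range(1, 15))   # 14 inputs (assumed labels 1..14)
land_inputs = list(range(5, 15))  # 10 land inputs (assumed labels 5..14)

n_all = sum(1 for _ in nonempty_subsets(all_inputs))
n_land = sum(1 for _ in nonempty_subsets(land_inputs))
print(n_all, n_land)  # 16383 1023
```
]]></preformat>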
<p>A graphical representation of the performance of all generated models is shown in <xref ref-type="fig" rid="F4">Figure 4</xref>. The violin plots show the distribution and other statistical features of the calculated performance indices (R, RMSE, etc.) for the models with independent input data. A violin plot combines aspects of a box plot and a kernel density plot to display the distribution and summary statistics of the model results. The width of the violin at different points represents the density or frequency of data values. The high medians of the ELM in terms of R and NSE and its low values of RMSE and MAE suggest better performance for this model, followed by <italic>k</italic>NN and <italic>ga</italic>IDW. The ELM was successful in creating relationships among all inputs in all combinations. The bimodal distribution of indices for the ELM and <italic>k</italic>NN shows their higher sensitivity to the inputs compared to that of <italic>ga</italic>IDW. The uniform and narrower violin body of <italic>ga</italic>IDW (in RMSE, MAE, and NSE) indicates less variability in the model results. None of the models generated outlier results, indicating the procedure&#x00027;s reliability. According to these plots, the ELM model was, in general, the most accurate, followed by <italic>k</italic>NN and <italic>ga</italic>IDW.</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p>The index ranges for applied models with independent input filtering.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="frwa-05-1237592-g0004.tif"/>
</fig>
<p>As mentioned earlier, this study aims to assess the possibility of reconstructing values at different locations in the field with a minimum number of inputs. Therefore, three subscenarios are defined: modeling with meteo&#x0002B;land inputs, with land inputs, and with meteo inputs. After assessing all models for the different input groups, the best combination of each group size (for instance, 1-input, 2-inputs, &#x02026;, 14-inputs) is selected based on the accuracy indices for each model, and the results of the superior combinations are shown in <xref ref-type="fig" rid="F5">Figure 5</xref>. This figure shows the overall performance of the three models with different input combinations. The detailed results are shown in <xref ref-type="supplementary-material" rid="SM1">Tables A1</xref>&#x02013;<xref ref-type="supplementary-material" rid="SM1">A3</xref>. In all three subscenarios, the ELM performed better than <italic>k</italic>NN and <italic>ga</italic>IDW. The relative improvements of the models in the different subscenarios are as follows. In the chosen 14 superior combinations, the ELM, compared to <italic>k</italic>NN, is on average more accurate, with R = 37%, RMSE = &#x02212;74%, MAE = &#x02212;76%, and NSE = 85%, in the meteo&#x0002B;land subscenario. In the meteo subscenario, the ELM, compared to <italic>k</italic>NN, is, on average, more accurate with R = 37%, RMSE = &#x02212;37%, MAE = &#x02212;45%, and NSE = &#x02212;130%. In the last subscenario (land inputs), the ELM, compared to <italic>k</italic>NN, is on average more accurate, with R = 33%, RMSE = &#x02212;73%, MAE = &#x02212;76%, and NSE = 79%, and compared to <italic>ga</italic>IDW, it is more accurate, with R = 54%, RMSE = &#x02212;77%, MAE = &#x02212;80%, and NSE = 175%. <italic>k</italic>NN, compared to <italic>ga</italic>IDW, is also, on average, more accurate with R = 16%, RMSE = &#x02212;16%, MAE = &#x02212;17%, and NSE = 54%.</p>
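<p>Selecting the superior combination for each group size can be sketched as follows. Ranking by RMSE alone is a simplifying assumption (the study filters on several accuracy indices), and the function name <monospace>best_per_size</monospace> and the scores are hypothetical values for illustration, not results from the study.</p>
<preformat><![CDATA[
```python
def best_per_size(scores):
    """Return {size: (combination, rmse)} keeping the lowest RMSE per size."""
    best = {}
    for combo, rmse in scores.items():
        size = len(combo)
        # Keep this combination if it is the first of its size or beats
        # the RMSE stored so far for that size.
        if size not in best or best[size][1] > rmse:
            best[size] = (combo, rmse)
    return best


# Hypothetical RMSE values for a handful of input combinations.
toy_scores = {
    (8,): 0.46,
    (9,): 0.97,
    (7, 12): 0.60,
    (7, 8): 0.30,
    (3, 7, 12): 0.42,
}

superior = best_per_size(toy_scores)
for size in sorted(superior):
    print(size, superior[size])
```
]]></preformat>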
<fig id="F5" position="float">
<label>Figure 5</label>
<caption><p>Model results for independent input scenarios, C represents the combinations presented in <xref ref-type="supplementary-material" rid="SM1">Tables A1</xref>&#x02013;<xref ref-type="supplementary-material" rid="SM1">A3</xref>.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="frwa-05-1237592-g0005.tif"/>
</fig>
<p>The accuracy of the ELM estimations increased as the number of inputs rose to 5 and then decreased slightly as more inputs were added. The ELM estimated the target variable most accurately with 5 inputs, having R = 0.992, RMSE = 0.164, MAE = 0.122, and NSE = 0.983. In the first 5 combinations (<inline-formula><mml:math id="M9"><mml:msubsup><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mi>E</mml:mi></mml:mrow><mml:mrow><mml:mi>M</mml:mi><mml:mi>L</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula>1-<inline-formula><mml:math id="M10"><mml:msubsup><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mi>E</mml:mi></mml:mrow><mml:mrow><mml:mi>M</mml:mi><mml:mi>L</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula>5), only land measurements were involved. Beyond that, meteo&#x0002B;land combinations yielded slightly more accurate results than land inputs alone, so the two groups can be used interchangeably, and both are considerably more accurate than meteo inputs alone. RH, AT, and DT appear in the meteo&#x0002B;land combinations more often than the remaining meteo input, WS (<xref ref-type="supplementary-material" rid="SM1">Table A1</xref>).</p>
<p>Conversely, the accuracy of the <italic>k</italic>NN estimations decreased as the number of inputs increased in both the meteo&#x0002B;land and land subscenarios. The best <italic>k</italic>NN results were obtained using a single land input, with a maximum accuracy of R = 0.937, RMSE = 0.455, MAE = 0.336, and NSE = 0.873. The meteo inputs are involved in combinations of 3 inputs and above. With the addition of meteo inputs, <italic>k</italic>NN generated more accurate estimations compared to only land or only meteo inputs. Among the meteo inputs, RH and WS were more important than the other two variables in <italic>k</italic>NN modeling (<xref ref-type="supplementary-material" rid="SM1">Table A2</xref>). <italic>ga</italic>IDW followed the same pattern of decreasing accuracy as <italic>k</italic>NN. The best result for this model was obtained with one input, and using all neighbor records roughly halved the accuracy, as demonstrated in the results of <xref ref-type="supplementary-material" rid="SM1">Table A3</xref>.</p>
<p>In the second scenario, the inputs are first filtered by one model, and the selected combinations are then used in the other models. The processing times of the different models were reported above, and it was noted that a model such as <italic>ga</italic>IDW or the ELM can be computationally expensive. Therefore, it is reasonable to use a powerful model to determine the best input combinations and to reuse them in other modeling methods for future applications. The ELM model proved to be robust in finding connections among different inputs in the previous step, while the other models generated relatively na&#x000EF;ve results. In this step, the ELM is used as a benchmark model to obtain the primary estimations and filter the combinations. These combinations are then used in the other two models to see how much their results deviate from the benchmark model and from their own results in the previous step. The results depicted in <xref ref-type="fig" rid="F6">Figure 6</xref> reveal that a similar pattern of accuracy changes can be observed in the fixed-input scenario as the number of inputs increases. The gap between the three models is almost the same as in the previous scenario.</p>
<fig id="F6" position="float">
<label>Figure 6</label>
<caption><p>Model results for fixed-input scenarios, C represents the combinations presented in <xref ref-type="supplementary-material" rid="SM1">Tables A1</xref>&#x02013;<xref ref-type="supplementary-material" rid="SM1">A3</xref>.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="frwa-05-1237592-g0006.tif"/>
</fig>
<p>The improvement or degradation in the accuracy of the models relative to each other is presented below as percentage changes in the indices. In the filtered combinations, <italic>k</italic>NN compared to the ELM changed, on average, by R = 53%, RMSE = &#x02212;76%, MAE = &#x02212;79%, and NSE = 133% in the meteo&#x0002B;land subscenario. In the meteo subscenario, <italic>k</italic>NN compared to the ELM changed, on average, by R = 26%, RMSE = &#x02212;41%, MAE = &#x02212;49%, and NSE = &#x02212;123%. In the last subscenario (land inputs), <italic>k</italic>NN changed compared to the ELM, on average, by R = 38%, RMSE = &#x02212;74%, MAE = &#x02212;77%, and NSE = 95%. <italic>ga</italic>IDW compared to the ELM changed, on average, by R = 72%, RMSE = &#x02212;78%, MAE = &#x02212;80%, and NSE = 217%. <italic>k</italic>NN compared to <italic>ga</italic>IDW also changed, on average, by R = 24%, RMSE = &#x02212;24%, MAE = &#x02212;12%, and NSE = 63%. The RMSE and MAE changes in this modeling scenario are close to those in the previous one. However, R and NSE change drastically in some cases. <xref ref-type="fig" rid="F7">Figure 7</xref> shows the change rates in the indices when a fixed-input scenario is used. It can be seen that the results of both the <italic>k</italic>NN and <italic>ga</italic>IDW models differ by a maximum of 20% in R, RMSE, and MAE, except for models with only meteo inputs. However, the efficiency (NSE) of these models changes considerably, as it can deviate by up to 40% from the base model in the meteo&#x0002B;land subscenario.</p>
<fig id="F7" position="float">
<label>Figure 7</label>
<caption><p>Changes in the indices of models with filtered inputs compared to the same models with independent inputs. Fxd, fixed input models; idpt, independent input models. No column, no changes.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="frwa-05-1237592-g0007.tif"/>
</fig>
<p><xref ref-type="fig" rid="F8">Figure 8</xref> shows the statistical features of the superior models for each modeling method in both scenarios. A powerful model should also be able to reproduce the statistical characteristics of the target series (Zeynoddin and Bonakdari, <xref ref-type="bibr" rid="B51">2022</xref>, <xref ref-type="bibr" rid="B52">2023</xref>). It should be able to estimate the series&#x00027; mean, median, and distribution, as well as regenerate any outliers. Therefore, box-density plots of the target and models are presented in <xref ref-type="fig" rid="F8">Figure 8</xref>. It can be observed that all models estimated the core features of the target with a high degree of accuracy. The interquartile range and extremes were reproduced with very good accuracy. However, only the ELM model was able to estimate all the outliers of the target series. The scatter plots also show that the chosen models produced a majority of the target estimations within the 95% intervals.</p>
<fig id="F8" position="float">
<label>Figure 8</label>
<caption><p>Model statistics for the best results. ELM: Combination (C5): (5,7,8,12,14), <italic>k</italic>NN (C1): (8), <italic>ga</italic>IDW (C1): (8).</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="frwa-05-1237592-g0008.tif"/>
</fig>
<p>In all three subscenarios, input 8, which is one of the land inputs, is the most important input. This input also has the greatest similarity to the target compared to the others, as shown in <xref ref-type="fig" rid="F3">Figure 3</xref>. If this input were excluded from the input combinations, the best result of the ELM model would be the [1, 3, 7, 11, 13] combination with R = 0.975, RMSE = 0.287, MAE = 0.229, and NSE = 0.949, which is very close to the best result in <xref ref-type="supplementary-material" rid="SM1">Table A1</xref>. In this combination, two meteo inputs (RH and DT) and three land measurements are involved. With one input, the best result would be [9], with R = 0.655, RMSE = 0.972, MAE = 0.808, and NSE = 0.420, which is considerably less accurate than the others. With two inputs, the best ELM result would be [7, 12], with R = 0.885, RMSE = 0.603, MAE = 0.482, and NSE = 0.777, and with three inputs, the best ELM result would be [3, 7, 12], with R = 0.945, RMSE = 0.418, MAE = 0.328, and NSE = 0.893.</p>
<p>The RH and DT meteo inputs and the records at sites 7, 11, 12, and 13 are the most important input variables for estimating the target. For the ELM model to provide results within 5% of those of the best input combination, it needs at least 5 inputs, which can be different combinations of the meteo and land inputs, as shown earlier. With the same exclusion assumption (removing input 8), <italic>k</italic>NN&#x00027;s best result would be the [4, 5, 9] combination, with R = 0.497, RMSE = 1.151, MAE = 0.930, and NSE = 0.187. Removing input 8 from the combinations therefore greatly affects the results of <italic>k</italic>NN. Similar to <italic>k</italic>NN, the <italic>ga</italic>IDW results would be impacted significantly by removing input 8. The outcome of this model after removing input 8 is R = 0.553, RMSE = 1.219, MAE = 1.032, and NSE = 0.087. This model generates almost identical results for inputs 5, 9, 11, and 13, regardless of the number of inputs or their combination.</p>
</sec>
<sec sec-type="discussion" id="s7">
<title>7. Discussion</title>
<p>Using the nearest adjacent sites for data reconstruction is a common approach, and it can be effective in certain cases. The general rule of thumb is based on the assumption that adjacent sites are more likely to have similar characteristics or behavior, which makes them potentially suitable for imputing missing data. However, it is important to note that this rule may not always hold true, and there are several factors to consider when deciding whether to use adjacent sites for data reconstruction, such as spatial relationships, the homogeneity of the study area, temporal relationships, and data quality and consistency (Eskelson et al., <xref ref-type="bibr" rid="B23">2009</xref>; Carvalho et al., <xref ref-type="bibr" rid="B13">2016</xref>; Cheng and Lu, <xref ref-type="bibr" rid="B16">2017</xref>; Liu et al., <xref ref-type="bibr" rid="B35">2020</xref>). All these conditions apply to the data used. However, the adjacent sites were less effective than the others in imputing the missing values in this case study. <xref ref-type="fig" rid="F9">Figure 9</xref> shows the most influential inputs for all three models with and without input 8, together with the positions of these inputs relative to the target.</p>
<fig id="F9" position="float">
<label>Figure 9</label>
<caption><p>The connections of the most important affecting inputs (main and alternatives) on the target variable.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="frwa-05-1237592-g0009.tif"/>
</fig>
<p>SMP is applied extensively in precision agriculture, water management, and hydrological studies, but using this parameter and handling UMD pose significant challenges. A few studies that encountered UMD are discussed below to place our findings in context, since proper imputation of SMP by the proposed methods could enhance water use efficiency in precision irrigation and hydrological assessments. Borken et al. (<xref ref-type="bibr" rid="B12">2000</xref>), while studying the influence of rainfall on distribution in the CH<sub>4</sub> oxidation in an ecosystem, found a strong correlation between the variability of this CH<sub>4</sub> and SMP. They handled the challenge of UMD by replacing the missing SMP with estimations of the soil water balance model. Using RH and DT meteorological variables or even the rainfall time series and some SMP measurements in the neighborhood, as discussed here, could have provided better insight for their study, considering a correlation of 0.89 between parameters. Similarly, when Nzokou et al. (<xref ref-type="bibr" rid="B41">2010</xref>) faced the problem of missing and erroneous data while logging SMP for automated irrigation and management of trees, they compensated for the UMD by using data logged by other wired sensors. The results of AI methods depend on the inputs and their inherent errors, so the number of inputs gains importance when a soil parameter is modeled with these methods. Accordingly, Cordeiro et al. (<xref ref-type="bibr" rid="B17">2022</xref>) were able to decrease the number of inputs in their model by increasing the accuracy of the features imputed by <italic>k</italic>NN.</p>
<p>Based on the results of the fixed-input scenario, changes of up to 20% in model outcomes such as R, RMSE, and MAE can be expected. As <italic>k</italic>NN is considerably faster than the ELM and <italic>ga</italic>IDW, it can be used as a filtering model to find the best inputs, and the ELM or other more complex non-linear models can then be used to impute the datasets. With this approach, the computational problem can be handled. The limitations of this study are as follows. The models used in this study are data-driven and are influenced by the quality of the data, the number of inputs, and hyperparameter adjustment, as in <italic>ga</italic>IDW. Accordingly, if the structure of the time series varies between timeframes, the results will be affected, specifically in simple-structured models like IDW or <italic>k</italic>NN. Nevertheless, the proposed filtering algorithm, which investigates all possible input combinations, filters the superior ones with a fast model, and uses them in a more complex model such as the ELM, other machine learning methods, or even deep learning methods, is applicable to all types of time series and different structures. Spatial models like <italic>ga</italic>IDW are also limited to <italic>in-situ</italic> inputs. This research studied a wide range of combinations of inputs of different natures. These inputs were data that have been commonly examined in various studies and are readily available. However, there might be other relevant variables that were not considered. Based on these limitations, it is suggested that future studies consider support vector machines, group methods of data handling, or even convolutional neural networks, which are more complicated machine learning methods, as other AI methods for SMP imputation. A comparison of different AI methods for SMP estimation in different climatic conditions could then also be provided.
Adding different inputs to the input set can also expand the search space for the best input combinations, which increases the chance of finding other estimation possibilities. However, based on the findings of this study, exclusively using variables such as meteorological data, which have a different nature than the target variable, as input for imputation does not yield accurate results, as was observed in the meteo subscenario; instead, they need to be combined with land data to improve accuracy. This combined approach generates more accurate outcomes than taking only land measurements as the inputs.</p>
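<p>The filter-then-impute idea discussed above can be sketched as follows, assuming NumPy. This is an illustration only: the <italic>k</italic>NN and ELM functions are bare-bones versions written for this example rather than the implementations used in the study, the data are synthetic, and all names and hyperparameters (<monospace>k</monospace>, <monospace>n_hidden</monospace>, the seeds) are assumptions.</p>
<preformat><![CDATA[
```python
import numpy as np
from itertools import combinations


def knn_predict(x_train, y_train, x_test, k=3):
    """Average the targets of the k nearest training rows (Euclidean)."""
    d = np.linalg.norm(x_test[:, None, :] - x_train[None, :, :], axis=2)
    nearest = np.argsort(d, axis=1)[:, :k]
    return y_train[nearest].mean(axis=1)


def elm_fit_predict(x_train, y_train, x_test, n_hidden=20, seed=0):
    """Basic ELM: random tanh hidden layer, least-squares output weights."""
    rng = np.random.default_rng(seed)
    w = rng.normal(size=(x_train.shape[1], n_hidden))  # fixed random weights
    b = rng.normal(size=n_hidden)                      # fixed random biases
    h_train = np.tanh(x_train @ w + b)
    beta = np.linalg.pinv(h_train) @ y_train           # output weights
    return np.tanh(x_test @ w + b) @ beta


def rmse(a, b):
    return float(np.sqrt(np.mean((a - b) ** 2)))


# Synthetic data: the target depends on columns 0 and 2 only.
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 4))
y = 2.0 * X[:, 0] - X[:, 2] + 0.05 * rng.normal(size=200)
X_tr, X_te, y_tr, y_te = X[:140], X[140:], y[:140], y[140:]

# Step 1: screen every non-empty input combination with the fast model.
screen = {}
for size in range(1, X.shape[1] + 1):
    for combo in combinations(range(X.shape[1]), size):
        cols = list(combo)
        pred = knn_predict(X_tr[:, cols], y_tr, X_te[:, cols])
        screen[combo] = rmse(pred, y_te)
best_combo = min(screen, key=screen.get)

# Step 2: fit the slower model only on the selected combination.
cols = list(best_combo)
elm_rmse = rmse(elm_fit_predict(X_tr[:, cols], y_tr, X_te[:, cols]), y_te)
print(best_combo, screen[best_combo], elm_rmse)
```
]]></preformat>
<p>In this arrangement the expensive model is trained once instead of once per combination, which is the source of the computational saving described above.</p>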
</sec>
<sec sec-type="conclusions" id="s8">
<title>8. Conclusion</title>
<p>Water management is vital for precise irrigation guidelines, enhancing potato crop productivity, and optimizing water consumption. Real-time soil matric potential (SMP) can improve water use efficiency. However, the ability to record this variable is constrained, leading to UMD in the associated time series. In this study, three models were compared and an algorithm was developed to analyze the inputs thoroughly and to determine whether missing values in the datasets can be imputed from meteorological or field measurements. Four meteorological variables and ten field measurements, constituting 16,383 distinct combinations, were used to reconstruct the missing values. Meteorological inputs alone, land inputs alone, and combinations of both types of variables were investigated. The results of applying the ELM, <italic>k</italic>NN, and <italic>ga</italic>IDW in two different scenarios and three subscenarios showed that the ELM model outperformed <italic>k</italic>NN and <italic>ga</italic>IDW, achieving its best results with 5 inputs consisting of land measurements. Based on a search of the models&#x00027; results for input alternatives, it was determined that the ELM model requires a minimum of 5 inputs, which can be combinations of RH and DT meteorological variables and land inputs, to achieve results within 5% of the best input combination found earlier. The best <italic>k</italic>NN outcome was obtained for one land input. Combining meteorological variables with land inputs (meteo&#x0002B;land) enhanced the model outputs. The <italic>ga</italic>IDW method produced its best results with the same land input as that of <italic>k</italic>NN and almost identical indices. It was observed that the adjacent sites were not as effective as the others in imputing the missing values, and other input combination possibilities should be investigated. Computational cost, as noted earlier, is a problem for AI models.
To address this problem, a fast base model can be used to filter the inputs. With this approach, a maximum 20% difference in the results of simple-structured models such as <italic>ga</italic>IDW and <italic>k</italic>NN could be expected. However, this issue can be addressed with more complex models, such as stochastic models or other AI models, like group methods of data handling. Exclusively using variables such as meteorological data, which have a different nature than the target variable, as input for imputation does not yield accurate results, as was observed in the meteo subscenario; instead, they need to be combined with land data to improve accuracy.</p>
</sec>
<sec sec-type="data-availability" id="s9">
<title>Data availability statement</title>
<p>The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.</p>
</sec>
<sec sec-type="author-contributions" id="s10">
<title>Author contributions</title>
<p>Methodology: HB and MZ. Validation: SG, HB, and MZ. Formal analysis, software, visualization, and investigation: MZ. Resources, conceptualization, project administration, funding acquisition, and data curation: SG. Writing&#x02014;original draft preparation: SG and MZ. Writing&#x02014;review and editing and supervision: SG and HB. All authors contributed to the article and approved the submitted version.</p>
</sec>
</body>
<back>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s11">
<title>Publisher&#x00027;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec sec-type="supplementary-material" id="s12">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/frwa.2023.1237592/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/frwa.2023.1237592/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table_1.docx" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Allison</surname> <given-names>P. D.</given-names></name></person-group> (<year>2003</year>). <article-title>Missing data techniques for structural equation modeling</article-title>. <source>J. Abnorm. Psychol.</source> <volume>112</volume>, <fpage>545</fpage>&#x02013;<lpage>557</lpage>. <pub-id pub-id-type="doi">10.1037/0021-843X.112.4.545</pub-id><pub-id pub-id-type="pmid">27176912</pub-id></citation></ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Azari</surname> <given-names>A.</given-names></name> <name><surname>Zeynoddin</surname> <given-names>M.</given-names></name> <name><surname>Ebtehaj</surname> <given-names>I.</given-names></name> <name><surname>Sattar</surname> <given-names>A. M. A.</given-names></name> <name><surname>Gharabaghi</surname> <given-names>B.</given-names></name> <name><surname>Bonakdari</surname> <given-names>H.</given-names></name></person-group> (<year>2021</year>). <article-title>Integrated preprocessing techniques with linear stochastic approaches in groundwater level forecasting</article-title>. <source>Acta Geophys.</source> <volume>6</volume>, <fpage>472</fpage>. <pub-id pub-id-type="doi">10.1007/s11600-021-00617-2</pub-id></citation>
</ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Azimi</surname> <given-names>H.</given-names></name> <name><surname>Bonakdari</surname> <given-names>H.</given-names></name> <name><surname>Ebtehaj</surname> <given-names>I.</given-names></name></person-group> (<year>2017</year>). <article-title>Sensitivity analysis of the factors affecting the discharge capacity of side weirs in trapezoidal channels using extreme learning machines</article-title>. <source>Flow Meas. Instr.</source> <volume>54</volume>, <fpage>216</fpage>&#x02013;<lpage>223</lpage>. <pub-id pub-id-type="doi">10.1016/j.flowmeasinst.2017.02.005</pub-id></citation>
</ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Barbulescu</surname> <given-names>A.</given-names></name> <name><surname>Bautu</surname> <given-names>A.</given-names></name> <name><surname>Bautu</surname> <given-names>E.</given-names></name></person-group> (<year>2020</year>). <article-title>Optimizing inverse distance weighting with particle swarm optimization</article-title>. <source>Appl. Sci.</source> <volume>10</volume>, <fpage>2054</fpage>. <pub-id pub-id-type="doi">10.3390/app10062054</pub-id><pub-id pub-id-type="pmid">29714339</pub-id></citation></ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>B&#x000E1;rbulescu</surname> <given-names>A.</given-names></name> <name><surname>&#x0015E;erban</surname> <given-names>C.</given-names></name> <name><surname>Indrecan</surname> <given-names>M.-L.</given-names></name></person-group> (<year>2021</year>). <article-title>Computing the beta parameter in IDW interpolation by using a genetic algorithm</article-title>. <source>Water</source> <volume>13</volume>, <fpage>863</fpage>. <pub-id pub-id-type="doi">10.3390/w13060863</pub-id></citation>
</ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bennett</surname> <given-names>D. A.</given-names></name></person-group> (<year>2001</year>). <article-title>How can I deal with missing data in my study?</article-title> <source>Austr. J. Pub. Health</source> <volume>25</volume>, <fpage>464</fpage>&#x02013;<lpage>469</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-842X.2001.tb00294.x</pub-id></citation>
</ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bhattacharjee</surname> <given-names>S.</given-names></name> <name><surname>Mitra</surname> <given-names>P.</given-names></name> <name><surname>Ghosh</surname> <given-names>S. K.</given-names></name></person-group> (<year>2014</year>). <article-title>Spatial interpolation to predict missing attributes in GIS using semantic kriging</article-title>. <source>IEEE Trans. Geosci. Remote Sensing</source> <volume>52</volume>, <fpage>4771</fpage>&#x02013;<lpage>4780</lpage>. <pub-id pub-id-type="doi">10.1109/TGRS.2013.2284489</pub-id></citation>
</ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bidwell</surname> <given-names>V. J.</given-names></name></person-group> (<year>2005</year>). <article-title>Realistic forecasting of groundwater level, based on the eigenstructure of aquifer dynamics</article-title>. <source>Mathematic. Comput. Simulat.</source> <volume>69</volume>, <fpage>12</fpage>&#x02013;<lpage>20</lpage>. <pub-id pub-id-type="doi">10.1016/j.matcom.2005.02.023</pub-id></citation>
</ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bleidorn</surname> <given-names>M. T.</given-names></name> <name><surname>Pinto</surname> <given-names>W. P.</given-names></name> <name><surname>Schmidt</surname> <given-names>I. M.</given-names></name> <name><surname>Mendon&#x000E7;a</surname> <given-names>A. S. F.</given-names></name> <name><surname>Reis</surname> <given-names>J. A. T. d.</given-names></name></person-group> (<year>2022</year>). <article-title>Methodological approaches for imputing missing data into monthly flows series</article-title>. <source>Rev. Ambiente &#x000C1;gua</source> <volume>17</volume>, <fpage>1</fpage>&#x02013;<lpage>27</lpage>. <pub-id pub-id-type="doi">10.4136/ambi-agua.2795</pub-id></citation>
</ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bonakdari</surname> <given-names>H.</given-names></name> <name><surname>Moradi</surname> <given-names>F.</given-names></name> <name><surname>Ebtehaj</surname> <given-names>I.</given-names></name> <name><surname>Gharabaghi</surname> <given-names>B.</given-names></name> <name><surname>Sattar</surname> <given-names>A. A.</given-names></name> <name><surname>Azimi</surname> <given-names>A. H.</given-names></name> <etal/></person-group>. (<year>2020a</year>). <article-title>A non-tuned machine learning technique for abutment scour depth in clear water condition</article-title>. <source>Water</source> <volume>12</volume>, <fpage>301</fpage>. <pub-id pub-id-type="doi">10.3390/w12010301</pub-id></citation>
</ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bonakdari</surname> <given-names>H.</given-names></name> <name><surname>Qasem</surname> <given-names>S. N.</given-names></name> <name><surname>Ebtehaj</surname> <given-names>I.</given-names></name> <name><surname>Zaji</surname> <given-names>A. H.</given-names></name> <name><surname>Gharabaghi</surname> <given-names>B.</given-names></name> <name><surname>Moazamnia</surname> <given-names>M.</given-names></name></person-group> (<year>2020b</year>). <article-title>An expert system for predicting the velocity field in narrow open channel flows using self-adaptive extreme learning machines</article-title>. <source>Measurement</source> <volume>151</volume>, <fpage>107202</fpage>. <pub-id pub-id-type="doi">10.1016/j.measurement.2019.107202</pub-id></citation>
</ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Borken</surname> <given-names>W.</given-names></name> <name><surname>Brumme</surname> <given-names>R.</given-names></name> <name><surname>Xu</surname> <given-names>Y.-J.</given-names></name></person-group> (<year>2000</year>). <article-title>Effects of prolonged soil drought on CH<sub>4</sub> oxidation in a temperate spruce forest</article-title>. <source>J. Geophys. Res.</source> <volume>105</volume>, <fpage>7079</fpage>&#x02013;<lpage>7088</lpage>. <pub-id pub-id-type="doi">10.1029/1999JD901170</pub-id></citation>
</ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carvalho</surname> <given-names>J. R. P.</given-names></name> <name><surname>Nakai</surname> <given-names>A. M.</given-names></name> <name><surname>Monteiro</surname> <given-names>J. E. B.</given-names></name></person-group> (<year>2016</year>). <article-title>Spatio-temporal modeling of data imputation for daily rainfall series in homogeneous zones</article-title>. <source>Rev. Bras. Meteorol.</source> <volume>31</volume>, <fpage>196</fpage>&#x02013;<lpage>201</lpage>. <pub-id pub-id-type="doi">10.1590/0102-778631220150025</pub-id></citation>
</ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chang</surname> <given-names>C. L.</given-names></name> <name><surname>Lo</surname> <given-names>S. L.</given-names></name> <name><surname>Yu</surname> <given-names>S. L.</given-names></name></person-group> (<year>2005</year>). <article-title>Applying fuzzy theory and genetic algorithm to interpolate precipitation</article-title>. <source>J. Hydrol.</source> <volume>314</volume>, <fpage>92</fpage>&#x02013;<lpage>104</lpage>. <pub-id pub-id-type="doi">10.1016/j.jhydrol.2005.03.034</pub-id></citation>
</ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>B.</given-names></name> <name><surname>Han</surname> <given-names>M. Y.</given-names></name> <name><surname>Peng</surname> <given-names>K.</given-names></name> <name><surname>Zhou</surname> <given-names>S. L.</given-names></name> <name><surname>Shao</surname> <given-names>L.</given-names></name> <name><surname>Wu</surname> <given-names>X. F.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Global land-water nexus: Agricultural land and freshwater use embodied in worldwide supply chains</article-title>. <source>Sci. Total Environ.</source> <volume>614</volume>, <fpage>931</fpage>&#x02013;<lpage>943</lpage>. <pub-id pub-id-type="doi">10.1016/j.scitotenv.2017.09.138</pub-id><pub-id pub-id-type="pmid">28946381</pub-id></citation></ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cheng</surname> <given-names>S.</given-names></name> <name><surname>Lu</surname> <given-names>F.</given-names></name></person-group> (<year>2017</year>). <article-title>A two-step method for missing spatio-temporal data reconstruction</article-title>. <source>ISPRS Int. J. Geo-Inf.</source> <volume>6</volume>, <fpage>187</fpage>. <pub-id pub-id-type="doi">10.3390/ijgi6070187</pub-id><pub-id pub-id-type="pmid">20729163</pub-id></citation></ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cordeiro</surname> <given-names>M.</given-names></name> <name><surname>Markert</surname> <given-names>C.</given-names></name> <name><surname>Ara&#x000FA;jo</surname> <given-names>S. S.</given-names></name> <name><surname>Campos</surname> <given-names>N. G.</given-names></name> <name><surname>Gondim</surname> <given-names>R. S.</given-names></name> <name><surname>Da Silva</surname> <given-names>T. L. C.</given-names></name> <etal/></person-group>. (<year>2022</year>). <article-title>Towards smart farming: fog-enabled intelligent irrigation system using deep neural networks</article-title>. <source>Future Gen. Comp. Syst.</source> <volume>129</volume>, <fpage>115</fpage>&#x02013;<lpage>124</lpage>. <pub-id pub-id-type="doi">10.1016/j.future.2021.11.013</pub-id></citation>
</ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Di Piazza</surname> <given-names>A.</given-names></name></person-group> (<year>2011</year>). <source>The Problem of Missing Data in Hydroclimatic Time Series. Application of Spatial Interpolation Techniques to Construct a Comprehensive of Hydroclimatic Data</source> [Doctoral thesis]. Sicily: IRIS, University of Palermo.</citation>
</ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dong</surname> <given-names>Y.</given-names></name> <name><surname>Peng</surname> <given-names>C.-Y. J.</given-names></name></person-group> (<year>2013</year>). <article-title>Principled missing data methods for researchers</article-title>. <source>Springerplus</source> <volume>2</volume>, <fpage>222</fpage>. <pub-id pub-id-type="doi">10.1186/2193-1801-2-222</pub-id><pub-id pub-id-type="pmid">23853744</pub-id></citation></ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ebtehaj</surname> <given-names>I.</given-names></name> <name><surname>Bonakdari</surname> <given-names>H.</given-names></name> <name><surname>Zaji</surname> <given-names>A. H.</given-names></name> <name><surname>Sharafi</surname> <given-names>H.</given-names></name></person-group> (<year>2019</year>). <article-title>Sensitivity analysis of parameters affecting scour depth around bridge piers based on the non-tuned, rapid extreme learning machine method</article-title>. <source>Neural Comput. Appl.</source> <volume>31</volume>, <fpage>9145</fpage>&#x02013;<lpage>9156</lpage>. <pub-id pub-id-type="doi">10.1007/s00521-018-3696-6</pub-id></citation>
</ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ebtehaj</surname> <given-names>I.</given-names></name> <name><surname>Soltani</surname> <given-names>K.</given-names></name> <name><surname>Amiri</surname> <given-names>A.</given-names></name> <name><surname>Faramarzi</surname> <given-names>M.</given-names></name> <name><surname>Madramootoo</surname> <given-names>C. A.</given-names></name> <name><surname>Bonakdari</surname> <given-names>H.</given-names></name></person-group> (<year>2021</year>). <article-title>Prognostication of shortwave radiation using an improved no-tuned fast machine learning</article-title>. <source>Sustainability</source> <volume>13</volume>, <fpage>8009</fpage>. <pub-id pub-id-type="doi">10.3390/su13148009</pub-id></citation>
</ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Enders</surname> <given-names>C.</given-names></name> <name><surname>Bandalos</surname> <given-names>D.</given-names></name></person-group> (<year>2001</year>). <article-title>The relative performance of full information maximum likelihood estimation for missing data in structural equation models</article-title>. <source>Struc. Eq. Model. Multidiscip. J.</source> <volume>8</volume>, <fpage>430</fpage>&#x02013;<lpage>457</lpage>. <pub-id pub-id-type="doi">10.1207/S15328007SEM0803_5</pub-id></citation>
</ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eskelson</surname> <given-names>B. N. I.</given-names></name> <name><surname>Temesgen</surname> <given-names>H.</given-names></name> <name><surname>Lemay</surname> <given-names>V.</given-names></name> <name><surname>Barrett</surname> <given-names>T. M.</given-names></name> <name><surname>Crookston</surname> <given-names>N. L.</given-names></name> <name><surname>Hudak</surname> <given-names>A. T.</given-names></name></person-group> (<year>2009</year>). <article-title>The roles of nearest neighbor methods in imputing missing data in forest inventory and monitoring databases</article-title>. <source>Scand. J. Forest Res.</source> <volume>24</volume>, <fpage>235</fpage>&#x02013;<lpage>246</lpage>. <pub-id pub-id-type="doi">10.1080/02827580902870490</pub-id></citation>
</ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Evans</surname> <given-names>S.</given-names></name> <name><surname>Williams</surname> <given-names>G. P.</given-names></name> <name><surname>Jones</surname> <given-names>N. L.</given-names></name> <name><surname>Ames</surname> <given-names>D. P.</given-names></name> <name><surname>Nelson</surname> <given-names>E. J.</given-names></name></person-group> (<year>2020</year>). <article-title>Exploiting earth observation data to impute groundwater level measurements with an extreme learning machine</article-title>. <source>Remote Sensing</source> <volume>12</volume>, <fpage>2044</fpage>. <pub-id pub-id-type="doi">10.3390/rs12122044</pub-id></citation>
</ref>
<ref id="B25">
<citation citation-type="book"><person-group person-group-type="author"><collab>FAO</collab></person-group>. (<year>2008</year>). <source>International Year of the Potato 2008 New Light on a Hidden Treasure</source>. <publisher-loc>Rome</publisher-loc>: <publisher-name>FAO</publisher-name>.</citation>
</ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fountas</surname> <given-names>S.</given-names></name> <name><surname>Carli</surname> <given-names>G.</given-names></name> <name><surname>S&#x000F8;rensen</surname> <given-names>C. G.</given-names></name> <name><surname>Tsiropoulos</surname> <given-names>Z.</given-names></name> <name><surname>Cavalaris</surname> <given-names>C.</given-names></name> <name><surname>Vatsanidou</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Farm management information systems: current situation and future perspectives</article-title>. <source>Comp. Electr. Agric.</source> <volume>115</volume>, <fpage>40</fpage>&#x02013;<lpage>50</lpage>. <pub-id pub-id-type="doi">10.1016/j.compag.2015.05.011</pub-id></citation>
</ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gholipour</surname> <given-names>Y.</given-names></name> <name><surname>Shahbazi</surname> <given-names>M. M.</given-names></name> <name><surname>Behnia</surname> <given-names>A.</given-names></name></person-group> (<year>2013</year>). <article-title>An improved version of inverse distance weighting metamodel assisted harmony search algorithm for truss design optimization</article-title>. <source>Lat. Am. J. Solids Struct.</source> <volume>10</volume>, <fpage>283</fpage>&#x02013;<lpage>300</lpage>. <pub-id pub-id-type="doi">10.1590/S1679-78252013000200004</pub-id></citation>
</ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Godfray</surname> <given-names>H. C. J.</given-names></name> <name><surname>Beddington</surname> <given-names>J. R.</given-names></name> <name><surname>Crute</surname> <given-names>I. R.</given-names></name> <name><surname>Haddad</surname> <given-names>L.</given-names></name> <name><surname>Lawrence</surname> <given-names>D.</given-names></name> <name><surname>Muir</surname> <given-names>J. F.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Food security: the challenge of feeding 9 billion people</article-title>. <source>Science</source> <volume>327</volume>, <fpage>812</fpage>&#x02013;<lpage>818</lpage>. <pub-id pub-id-type="doi">10.1126/science.1185383</pub-id><pub-id pub-id-type="pmid">20110467</pub-id></citation></ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>G.-B.</given-names></name></person-group> (<year>2015</year>). <article-title>What are extreme learning machines? Filling the gap between Frank Rosenblatt&#x00027;s dream and John von Neumann&#x00027;s puzzle</article-title>. <source>Cogn. Comput.</source> <volume>7</volume>, <fpage>263</fpage>&#x02013;<lpage>278</lpage>. <pub-id pub-id-type="doi">10.1007/s12559-015-9333-0</pub-id></citation>
</ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>G.-B.</given-names></name> <name><surname>Zhou</surname> <given-names>H.</given-names></name> <name><surname>Ding</surname> <given-names>X.</given-names></name> <name><surname>Zhang</surname> <given-names>R.</given-names></name></person-group> (<year>2012</year>). <article-title>Extreme learning machine for regression and multiclass classification</article-title>. <source>IEEE Trans. Syst. Man. Cybern. B. Cybern.</source> <volume>42</volume>, <fpage>513</fpage>&#x02013;<lpage>529</lpage>. <pub-id pub-id-type="doi">10.1109/TSMCB.2011.2168604</pub-id><pub-id pub-id-type="pmid">21984515</pub-id></citation></ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kamilaris</surname> <given-names>A.</given-names></name> <name><surname>Prenafeta-Bold&#x000FA;</surname> <given-names>F. X.</given-names></name></person-group> (<year>2018</year>). <article-title>Deep learning in agriculture: a survey</article-title>. <source>Comput. Electr. Agric.</source> <volume>147</volume>, <fpage>70</fpage>&#x02013;<lpage>90</lpage>. <pub-id pub-id-type="doi">10.1016/j.compag.2018.02.016</pub-id></citation>
</ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>K.-Y.</given-names></name> <name><surname>Kim</surname> <given-names>B.-J.</given-names></name> <name><surname>Yi</surname> <given-names>G.-S.</given-names></name></person-group> (<year>2004</year>). <article-title>Reuse of imputed data in microarray analysis increases imputation efficiency</article-title>. <source>BMC Bioinf.</source> <volume>5</volume>, <fpage>160</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-5-160</pub-id><pub-id pub-id-type="pmid">15504240</pub-id></citation></ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>L&#x000E9;tourneau</surname> <given-names>G.</given-names></name> <name><surname>Caron</surname> <given-names>J.</given-names></name> <name><surname>Anderson</surname> <given-names>L.</given-names></name> <name><surname>Cormier</surname> <given-names>J.</given-names></name></person-group> (<year>2015</year>). <article-title>Matric potential-based irrigation management of field-grown strawberry: Effects on yield and water use efficiency</article-title>. <source>Agric. Water Manage.</source> <volume>161</volume>, <fpage>102</fpage>&#x02013;<lpage>113</lpage>. <pub-id pub-id-type="doi">10.1016/j.agwat.2015.07.005</pub-id></citation>
</ref>
<ref id="B34">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>Z.</given-names></name> <name><surname>Wang</surname> <given-names>P.</given-names></name></person-group> (<year>2013</year>). <article-title>&#x0201C;Intelligent optimization on power values for inverse distance weighting,&#x0201D;</article-title> in <source>2013 International Conference on Information Science and Cloud Computing Companion (IEEE)</source>, <publisher-loc>Manhattan, NY</publisher-loc>, <fpage>370</fpage>&#x02013;<lpage>375</lpage>.</citation>
</ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Chen</surname> <given-names>W.</given-names></name></person-group> (<year>2020</year>). <article-title>Three-step imputation of missing values in condition monitoring datasets</article-title>. <source>IET Gen. Trans. Distrib.</source> <volume>14</volume>, <fpage>3288</fpage>&#x02013;<lpage>3300</lpage>. <pub-id pub-id-type="doi">10.1049/iet-gtd.2019.1446</pub-id></citation>
</ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ly</surname> <given-names>S.</given-names></name> <name><surname>Charles</surname> <given-names>C.</given-names></name> <name><surname>Degr&#x000E9;</surname> <given-names>A.</given-names></name></person-group> (<year>2011</year>). <article-title>Geostatistical interpolation of daily rainfall at catchment scale: the use of several variogram models in the Ourthe and Ambleve catchments, Belgium</article-title>. <source>Hydrol. Earth Syst. Sci.</source> <volume>15</volume>, <fpage>2259</fpage>&#x02013;<lpage>2274</lpage>. <pub-id pub-id-type="doi">10.5194/hess-15-2259-2011</pub-id></citation>
</ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Matteau</surname> <given-names>J.-P.</given-names></name> <name><surname>C&#x000E9;licourt</surname> <given-names>P.</given-names></name> <name><surname>L&#x000E9;tourneau</surname> <given-names>G.</given-names></name> <name><surname>Gumiere</surname> <given-names>T.</given-names></name> <name><surname>Gumiere</surname> <given-names>S. J.</given-names></name></person-group> (<year>2021</year>). <article-title>Potato varieties response to soil matric potential based irrigation</article-title>. <source>Agronomy</source> <volume>11</volume>, <fpage>352</fpage>. <pub-id pub-id-type="doi">10.3390/agronomy11020352</pub-id></citation>
</ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Matteau</surname> <given-names>J.-P.</given-names></name> <name><surname>C&#x000E9;licourt</surname> <given-names>P.</given-names></name> <name><surname>L&#x000E9;tourneau</surname> <given-names>G.</given-names></name> <name><surname>Gumiere</surname> <given-names>T.</given-names></name> <name><surname>Gumiere</surname> <given-names>S. J.</given-names></name></person-group> (<year>2022b</year>). <article-title>Effects of irrigation thresholds and temporal distribution on potato yield and water productivity in sandy soil</article-title>. <source>Agric. Water Manage.</source> <volume>264</volume>, <fpage>107483</fpage>. <pub-id pub-id-type="doi">10.1016/j.agwat.2022.107483</pub-id></citation>
</ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Matteau</surname> <given-names>J.-P.</given-names></name> <name><surname>Celicourt</surname> <given-names>P.</given-names></name> <name><surname>Shahriarina</surname> <given-names>E.</given-names></name> <name><surname>Letellier</surname> <given-names>P.</given-names></name> <name><surname>Gumiere</surname> <given-names>T.</given-names></name> <name><surname>Gumiere</surname> <given-names>S. J.</given-names></name></person-group> (<year>2022a</year>). <article-title>Relationship between irrigation thresholds and potato tuber depth in sandy soil</article-title>. <source>Front. Soil Sci.</source> <volume>2</volume>. <pub-id pub-id-type="doi">10.3389/fsoil.2022.898618</pub-id></citation>
</ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Molden</surname> <given-names>D.</given-names></name> <name><surname>Oweis</surname> <given-names>T.</given-names></name> <name><surname>Steduto</surname> <given-names>P.</given-names></name> <name><surname>Bindraban</surname> <given-names>P.</given-names></name> <name><surname>Hanjra</surname> <given-names>M. A.</given-names></name> <name><surname>Kijne</surname> <given-names>J.</given-names></name></person-group> (<year>2010</year>). <article-title>Improving agricultural water productivity: between optimism and caution</article-title>. <source>Agric. Water Manage.</source> <volume>97</volume>, <fpage>528</fpage>&#x02013;<lpage>535</lpage>. <pub-id pub-id-type="doi">10.1016/j.agwat.2009.03.023</pub-id></citation>
</ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nzokou</surname> <given-names>P.</given-names></name> <name><surname>Gooch</surname> <given-names>N. J.</given-names></name> <name><surname>Cregg</surname> <given-names>B. M.</given-names></name></person-group> (<year>2010</year>). <article-title>Design and implementation of a soil matric potential-based automated irrigation system for drip irrigating Fraser fir</article-title>. <source>HortTechnology</source> <volume>20</volume>, <fpage>1030</fpage>&#x02013;<lpage>1036</lpage>. <pub-id pub-id-type="doi">10.21273/HORTSCI.20.6.1030</pub-id></citation>
</ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>P&#x000E9;riard</surname> <given-names>Y.</given-names></name> <name><surname>Caron</surname> <given-names>J.</given-names></name> <name><surname>Lafond</surname> <given-names>J. A.</given-names></name> <name><surname>Jutras</surname> <given-names>S.</given-names></name></person-group> (<year>2015</year>). <article-title>Root water uptake by romaine lettuce in a muck soil: linking tip burn to hydric deficit</article-title>. <source>Vadose Zone J.</source> <volume>14</volume>, vzj2014.10.0139. <pub-id pub-id-type="doi">10.2136/vzj2014.10.0139</pub-id></citation>
</ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rekika</surname> <given-names>D.</given-names></name> <name><surname>Caron</surname> <given-names>J.</given-names></name> <name><surname>Rancourt</surname> <given-names>G. T.</given-names></name> <name><surname>Lafond</surname> <given-names>J. A.</given-names></name> <name><surname>Gumiere</surname> <given-names>S. J.</given-names></name> <name><surname>Jenni</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Optimal irrigation for onion and celery production and spinach seed germination in histosols</article-title>. <source>Agronomy J.</source> <volume>106</volume>, <fpage>981</fpage>&#x02013;<lpage>994</lpage>. <pub-id pub-id-type="doi">10.2134/agronj2013.0235</pub-id></citation>
</ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rouzinov</surname> <given-names>S.</given-names></name> <name><surname>Berchtold</surname> <given-names>A.</given-names></name></person-group> (<year>2022</year>). <article-title>Regression-based approach to test missing data mechanisms</article-title>. <source>Data</source> <volume>7</volume>, <fpage>16</fpage>. <pub-id pub-id-type="doi">10.3390/data7020016</pub-id></citation>
</ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tipton</surname> <given-names>J.</given-names></name> <name><surname>Hooten</surname> <given-names>M.</given-names></name> <name><surname>Goring</surname> <given-names>S.</given-names></name></person-group> (<year>2017</year>). <article-title>Reconstruction of spatio-temporal temperature from sparse historical records using robust probabilistic principal component regression</article-title>. <source>Adv. Stat. Clim. Meteorol. Oceanogr.</source> <volume>3</volume>, <fpage>1</fpage>&#x02013;<lpage>16</lpage>. <pub-id pub-id-type="doi">10.5194/ascmo-3-1-2017</pub-id></citation>
</ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tonini</surname> <given-names>F.</given-names></name> <name><surname>Dillon</surname> <given-names>W. W.</given-names></name> <name><surname>Money</surname> <given-names>E. S.</given-names></name> <name><surname>Meentemeyer</surname> <given-names>R. K.</given-names></name></person-group> (<year>2016</year>). <article-title>Spatio-temporal reconstruction of missing forest microclimate measurements</article-title>. <source>Agric. Forest Meteorol.</source> <volume>218&#x02013;219</volume>, <fpage>1</fpage>&#x02013;<lpage>10</lpage>. <pub-id pub-id-type="doi">10.1016/j.agrformet.2015.11.004</pub-id></citation>
</ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Troyanskaya</surname> <given-names>O.</given-names></name> <name><surname>Cantor</surname> <given-names>M.</given-names></name> <name><surname>Sherlock</surname> <given-names>G.</given-names></name> <name><surname>Brown</surname> <given-names>P.</given-names></name> <name><surname>Hastie</surname> <given-names>T.</given-names></name> <name><surname>Tibshirani</surname> <given-names>R.</given-names></name> <etal/></person-group>. (<year>2001</year>). <article-title>Missing value estimation methods for DNA microarrays</article-title>. <source>Bioinformatics</source> <volume>17</volume>, <fpage>520</fpage>&#x02013;<lpage>525</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/17.6.520</pub-id><pub-id pub-id-type="pmid">11395428</pub-id></citation></ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wolfert</surname> <given-names>S.</given-names></name> <name><surname>Ge</surname> <given-names>L.</given-names></name> <name><surname>Verdouw</surname> <given-names>C.</given-names></name> <name><surname>Bogaardt</surname> <given-names>M.-J.</given-names></name></person-group> (<year>2017</year>). <article-title>Big data in smart farming &#x02013; a review</article-title>. <source>Agric. Syst.</source> <volume>153</volume>, <fpage>69</fpage>&#x02013;<lpage>80</lpage>. <pub-id pub-id-type="doi">10.1016/j.agsy.2017.01.023</pub-id></citation>
</ref>
<ref id="B49">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Yaseen</surname> <given-names>Z. M.</given-names></name> <name><surname>Deo</surname> <given-names>R. C.</given-names></name> <name><surname>Ebtehaj</surname> <given-names>I.</given-names></name> <name><surname>Bonakdari</surname> <given-names>H.</given-names></name></person-group> (<year>2018</year>). <article-title>&#x0201C;Hybrid data intelligent models and applications for water level prediction,&#x0201D;</article-title> in <source>Handbook of Research on Predictive Modeling and Optimization Methods in Science and Engineering</source>, eds. I. Giannoccaro, D. Kim, S. Sekhar Roy, T. L&#x000E4;nsivaara, R. Deo, and P. Samui (<publisher-loc>London</publisher-loc>: <publisher-name>IGI Global</publisher-name>), <fpage>121</fpage>&#x02013;<lpage>139</lpage>.</citation>
</ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yozgatligil</surname> <given-names>C.</given-names></name> <name><surname>Aslan</surname> <given-names>S.</given-names></name> <name><surname>Iyigun</surname> <given-names>C.</given-names></name> <name><surname>Batmaz</surname> <given-names>I.</given-names></name></person-group> (<year>2013</year>). <article-title>Comparison of missing value imputation methods in time series: the case of Turkish meteorological data</article-title>. <source>Theor. Appl. Climatol.</source> <volume>112</volume>, <fpage>143</fpage>&#x02013;<lpage>167</lpage>. <pub-id pub-id-type="doi">10.1007/s00704-012-0723-x</pub-id></citation>
</ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zeynoddin</surname> <given-names>M.</given-names></name> <name><surname>Bonakdari</surname> <given-names>H.</given-names></name></person-group> (<year>2022</year>). <article-title>Structural-optimized sequential deep learning methods for surface soil moisture forecasting, case study Quebec, Canada</article-title>. <source>Neural Comput. Applic.</source> <volume>34</volume>, <fpage>19895</fpage>&#x02013;<lpage>19921</lpage>. <pub-id pub-id-type="doi">10.1007/s00521-022-07529-2</pub-id></citation>
</ref>
<ref id="B52">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zeynoddin</surname> <given-names>M.</given-names></name> <name><surname>Bonakdari</surname> <given-names>H.</given-names></name></person-group> (<year>2023</year>). <article-title>A comparative analysis of SMAP-derived soil moisture modeling by optimized machine learning methods: a case study of the Quebec province</article-title>. <source>ECWS-7 2023</source> <volume>37</volume>, <fpage>1</fpage>&#x02013;<lpage>4</lpage>. <pub-id pub-id-type="doi">10.3390/ECWS-7-14183</pub-id></citation>
</ref>
<ref id="B53">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zeynoddin</surname> <given-names>M.</given-names></name> <name><surname>Bonakdari</surname> <given-names>H.</given-names></name> <name><surname>Azari</surname> <given-names>A.</given-names></name> <name><surname>Ebtehaj</surname> <given-names>I.</given-names></name> <name><surname>Gharabaghi</surname> <given-names>B.</given-names></name> <name><surname>Madavar</surname> <given-names>H. R.</given-names></name></person-group> (<year>2018</year>). <article-title>Novel hybrid linear stochastic with non-linear extreme learning machine methods for forecasting monthly rainfall a tropical climate</article-title>. <source>J. Environ. Manage.</source> <volume>222</volume>, <fpage>190</fpage>&#x02013;<lpage>206</lpage>. <pub-id pub-id-type="doi">10.1016/j.jenvman.2018.05.072</pub-id><pub-id pub-id-type="pmid">29843092</pub-id></citation></ref>
<ref id="B54">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>R.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name></person-group> (<year>2022</year>). <article-title>Reconstruction of missing spring discharge by using deep learning models with ensemble empirical mode decomposition of precipitation</article-title>. <source>Environ. Sci. Pollut. Res. Int.</source> <volume>29</volume>, <fpage>82451</fpage>&#x02013;<lpage>82466</lpage>. <pub-id pub-id-type="doi">10.1007/s11356-022-21597-w</pub-id><pub-id pub-id-type="pmid">35751724</pub-id></citation></ref>
</ref-list> 
</back>
</article> 