<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="editorial" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Appl. Math. Stat.</journal-id>
<journal-title>Frontiers in Applied Mathematics and Statistics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Appl. Math. Stat.</abbrev-journal-title>
<issn pub-type="epub">2297-4687</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">756949</article-id>
<article-id pub-id-type="doi">10.3389/fams.2021.756949</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Applied Mathematics and Statistics</subject>
<subj-group>
<subject>Editorial</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Editorial: Data Science Applications to Inverse and Optimization Problems in Earth Science</article-title>
<alt-title alt-title-type="left-running-head">Leeuwenburgh et&#x20;al.</alt-title>
<alt-title alt-title-type="right-running-head">Editorial: Data Science Applications</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Leeuwenburgh</surname>
<given-names>Olwijn</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1046123/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Emerick</surname>
<given-names>Alexandre A.</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/776129/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Jafarpour</surname>
<given-names>Behnam</given-names>
</name>
<xref ref-type="aff" rid="aff4">
<sup>4</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/884612/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Zhang</surname>
<given-names>Dongxiao</given-names>
</name>
<xref ref-type="aff" rid="aff5">
<sup>5</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1047000/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Luo</surname>
<given-names>Xiaodong</given-names>
</name>
<xref ref-type="aff" rid="aff6">
<sup>6</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/885756/overview"/>
</contrib>
</contrib-group>
<aff id="aff1">
<label>
<sup>1</sup>
</label>Netherlands Organisation for Applied Scientific Research (TNO), <addr-line>Utrecht</addr-line>, <country>Netherlands</country>
</aff>
<aff id="aff2">
<label>
<sup>2</sup>
</label>Department of Geoscience and Engineering, Delft University of Technology, <addr-line>Delft</addr-line>, <country>Netherlands</country>
</aff>
<aff id="aff3">
<label>
<sup>3</sup>
</label>Petrobras (Brazil), <addr-line>Rio de Janeiro</addr-line>, <country>Brazil</country>
</aff>
<aff id="aff4">
<label>
<sup>4</sup>
</label>Department of Electrical and Computer Engineering, and Department of Civil and Environmental Engineering, University of Southern California, <addr-line>Los Angeles</addr-line>, <addr-line>CA</addr-line>, <country>United&#x20;States</country>
</aff>
<aff id="aff5">
<label>
<sup>5</sup>
</label>Southern University of Science and Technology, <addr-line>Shenzhen</addr-line>, <country>China</country>
</aff>
<aff id="aff6">
<label>
<sup>6</sup>
</label>Norwegian Research Centre (NORCE), <addr-line>Bergen</addr-line>, <country>Norway</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited and reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/184945/overview">Hong-Kun Xu</ext-link>, Hangzhou Dianzi University, China</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Xiaodong Luo, <email>xluo@norceresearch.no</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Optimization, a section of the journal Frontiers in Applied Mathematics and Statistics</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>03</day>
<month>09</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>7</volume>
<elocation-id>756949</elocation-id>
<history>
<date date-type="received">
<day>11</day>
<month>08</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>24</day>
<month>08</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2021 Leeuwenburgh, Emerick, Jafarpour, Zhang and Luo.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Leeuwenburgh, Emerick, Jafarpour, Zhang and Luo</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these&#x20;terms.</p>
</license>
</permissions>
<kwd-group>
<kwd>data science</kwd>
<kwd>data assimilation</kwd>
<kwd>optimization</kwd>
<kwd>inversion</kwd>
<kwd>earth sciences</kwd>
<kwd>machine learning</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<p>
<bold>Editorial on the Research Topic</bold>
</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/researchtopic/15751">
<bold>Data Science Applications to Inverse and Optimization Problems in Earth Science</bold>
</ext-link>
</p>
<p>Solving inverse and optimization problems that are encountered in the earth sciences is often challenging because of the computational cost of simulating models, the nonlinearity of forward models, the frequently large number of uncertain parameters or decision options and the limited information provided by data. These challenges have motivated a significant investment of effort into the development of efficient methods to improve the efficacy and reduce the overall computational costs of inversion and optimization workflows.</p>
<p>In recent years, developments in data science (including machine learning) have attracted increased attention from researchers and practitioners in both academia and industry, owing to their proven ability to construct useful predictive models from large amounts of data. In comparison to data science, inverse and optimization theories have a longer history within earth science. While various inverse and optimization methods have been well established and successfully applied to real-world problems, there is still room to further improve and strengthen their performance and applicability in terms of, e.g., accuracy, computational efficiency, and uncertainty representation. The papers in this topic address various challenges in earth sciences by combining data science with inverse and/or optimization theories.</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fams.2021.673412">Gao et&#x20;al.</ext-link> present an extension of the distributed Gauss-Newton method to optimization problems that allows for large numbers of controls by use of a limited-memory BFGS scheme. In the distributed optimization approach, multiple parameter or control solutions are simultaneously updated in an iterative manner. The updates exploit information gathered in a growing database of intermediate solutions that allows for learning from distributed data. Because the data used in each update are selected based on distance, the solutions can, in contrast to ensemble methods, converge towards different modes, resulting in multiple distinct solutions.</p>
<p>The presence of multiple modes in the posterior distribution is addressed in the context of inverse problems by <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fams.2021.636524">Conjard and Omre</ext-link>. They define a selection Kalman model (SKM) as an extension of the traditional Kalman model for Gaussian distributions towards (spatially) multimodal distributions for linear-Gaussian forward models. In synthetic experiments, the SKM is found to outperform the traditional Kalman model, which tends to produce blurred distributions because of its tendency towards Gaussianity. The new approach could be the initial step towards an ensemble version that supports nonlinear forward models.</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fams.2021.651178">Coutinho et&#x20;al.</ext-link> consider the reduction of computational cost for expensive model-based workflows by use of proxy models. The idea here is to replace the large online cost associated with applying iterative procedures to many model realizations by a single offline training stage in which fast models are trained using machine learning techniques. The authors consider an extension of the so-called Embed to Control (E2C) approach, introduced into the geosciences for the purpose of simulating reservoir flow by <xref ref-type="bibr" rid="B1">Jin et al. [1]</xref>. In particular, various options for prediction and conditioning to well data are compared and the authors demonstrate that improved predictions can be obtained relative to those obtained with previously proposed reduced-order model approaches.</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fams.2021.689934">Nasir et&#x20;al.</ext-link> combine fast proxy models based on convolutional neural networks with deep reinforcement learning techniques to find improved solutions for optimization problems that involve decisions on where to place wells to develop subsurface reservoir systems. By considering a very large training database constructed from randomly sampled model parameters, operational constraints and economic conditions, it is expected that valid optimized results can be generated almost instantly for new scenarios within the range of training&#x20;data.</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fams.2021.673077">Nezhadali et&#x20;al.</ext-link> consider the use of reduced models obtained by multiple levels of domain coarsening in ensemble inversion workflows. The loss of accuracy associated with the lower-fidelity models can be balanced by an increased ensemble size, which leads to lower Monte Carlo errors. A scheme is introduced to estimate and (approximately) account for the multilevel modelling error. The resulting workflow is applied to experiments with synthetic reservoir flow models, which suggest that the multilevel approach with error correction outperforms the conventional approach.</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fams.2021.655224">Fablet et&#x20;al.</ext-link> consider the challenges associated with the sparse sampling of observation data by earth-orbiting satellites. In particular, they investigate whether it is possible to learn a representation of the processes underlying the observed data that could be used to interpolate to times that are not directly observed. The interpolation problem can be formulated as an optimization problem in which the parameters of a model are estimated such that some measure of the variance (or energy) in the interpolated model state is minimized. The authors consider neural network representations of an energy state and apply their proposed methodology to interpolate sea surface temperature and height&#x20;data.</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fams.2021.686754">Jiang et&#x20;al.</ext-link> present a study in which the recently developed data-space inversion (DSI) method is extended with a parameterization technique based on a recurrent autoencoder (RAE). The DSI method enables fast updates of predictions based on new data without the need for explicit forward modelling. Instead, prior forecasts are added to the state vector and updated directly. The proposed parameterization enables a low-dimensional representation of the time series forecast data that can be seen as an alternative to more traditional PCA representations. The methodology is applied to a complex fractured reservoir system, which is operated with a detailed management logic that results in frequent changes to the wells in the reservoir.</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fams.2021.687743">Lin et&#x20;al.</ext-link> investigate the impact of ensemble-based inversion on forecast degradation caused by the introduction of shocks through the update of the dynamic model state. They propose an incremental update solution, as also adopted in stochastic ensemble smoothers that are frequently applied to parameter estimation problems, but in this case for the class of deterministic filter methods such as the Ensemble Transform Kalman Filter (ETKF). The new scheme is tested with a shallow-water&#x20;model.</p>
<p>The papers in this research topic cover a diverse range of application domains, fundamental and applied research, workflows (interpolation, inversion, optimization), and methods. They demonstrate that the earth sciences remain a fertile ground for exciting and promising new developments in advanced computational methods.</p>
</body>
<back>
<sec id="s1">
<title>Author Contributions</title>
<p>The research topic was initiated by XL. All topic editors have contributed to the formulation of the scope and the editorial process.</p>
</sec>
<sec sec-type="COI-statement" id="s2">
<title>Conflict of Interest</title>
<p>AE was employed by Petrobras (Brazil).</p>
<p>The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="s3" sec-type="disclaimer">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<label>1.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jin</surname>
<given-names>ZL.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Durlofsky</surname>
<given-names>LJ.</given-names>
</name>
</person-group> <article-title>Deep-Learning-Based Surrogate Model for Reservoir Simulation With Time-Varying Well Controls</article-title>. <source>J Petrol Sci Eng</source> (<year>2020</year>) 192:107273. <pub-id pub-id-type="doi">10.1016/j.petrol.2020.107273</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>