<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Artif. Intell.</journal-id>
<journal-title>Frontiers in Artificial Intelligence</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Artif. Intell.</abbrev-journal-title>
<issn pub-type="epub">2624-8212</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/frai.2025.1628029</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Artificial Intelligence</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Utilizing XGBoosts to correct arcjet contamination in magnetic field measurements from GOES missions</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Inceoglu</surname> <given-names>Fadil</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1040703/overview"/>
<role content-type="https://credit.niso.org/contributor-roles/conceptualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/data-curation/"/>
<role content-type="https://credit.niso.org/contributor-roles/formal-analysis/"/>
<role content-type="https://credit.niso.org/contributor-roles/investigation/"/>
<role content-type="https://credit.niso.org/contributor-roles/methodology/"/>
<role content-type="https://credit.niso.org/contributor-roles/project-administration/"/>
<role content-type="https://credit.niso.org/contributor-roles/supervision/"/>
<role content-type="https://credit.niso.org/contributor-roles/validation/"/>
<role content-type="https://credit.niso.org/contributor-roles/visualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-original-draft/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Loto&#x00027;aniu</surname> <given-names>Paul T. M.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Cooperative Institute for Research in Environmental Sciences, University of Colorado Boulder</institution>, <addr-line>Boulder, CO</addr-line>, <country>United States</country></aff>
<aff id="aff2"><sup>2</sup><institution>National Centers for Environmental Information, National Oceanic and Atmospheric Administration</institution>, <addr-line>Boulder, CO</addr-line>, <country>United States</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Daniel Okoh, The National Space Research and Development Agency (NASRDA), Nigeria</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Sreelakshmi Jayaraman, University of Michigan, United States</p>
<p>Ahmad Bin Ahmarofi, Universiti Teknologi MARA Cawangan Kedah, Malaysia</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Fadil Inceoglu <email>fadil.inceoglu&#x00040;colorado.edu</email></corresp>
</author-notes>
<pub-date pub-type="epub">
<day>16</day>
<month>09</month>
<year>2025</year>
</pub-date>
<pub-date pub-type="collection">
<year>2025</year>
</pub-date>
<volume>8</volume>
<elocation-id>1628029</elocation-id>
<history>
<date date-type="received">
<day>13</day>
<month>05</month>
<year>2025</year>
</date>
<date date-type="accepted">
<day>27</day>
<month>08</month>
<year>2025</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2025 Inceoglu and Loto&#x00027;aniu.</copyright-statement>
<copyright-year>2025</copyright-year>
<copyright-holder>Inceoglu and Loto&#x00027;aniu</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>The magnetometers onboard the Geostationary Operational Environmental Satellites (GOES) provide crucial measurements for space weather monitoring and scientific research. However, periodic arcjet thruster firings introduce contamination in the measured magnetic field, affecting data accuracy. The currently used correction matrix approach mitigates these effects but struggles with transient variations and residual errors. In this study, we present an alternative correction method using XGBoost, a machine learning algorithm, to correct arcjet-induced contamination in the GOES-17 magnetometer data using GOES-18 as ground truth. Using cross-satellite comparisons and supervised learning techniques, our model is effective in reducing artificial disturbances, especially non-linear variations. We found that the XGBoost method works better than the existing correction matrix approach for E and P components, while the correction matrix performs better for the N component. Although some limitations remain due to training data constraints, our results highlight the importance of machine learning to improve magnetometer data quality by recognizing and correcting complex satellite-driven artifacts. The collocation of GOES-17 and GOES-18 provided a unique opportunity for cross-satellite calibration and validation, and with a longer collocation period, the XGBoost method shows significant promise for better correction of operational data, emphasizing the need for such configurations in future satellite missions.</p></abstract>
<kwd-group>
<kwd>machine learning</kwd>
<kwd>XGBoost</kwd>
<kwd>arcjet</kwd>
<kwd>GOES</kwd>
<kwd>magnetic field</kwd>
</kwd-group>
<counts>
<fig-count count="6"/>
<table-count count="3"/>
<equation-count count="0"/>
<ref-count count="8"/>
<page-count count="9"/>
<word-count count="5424"/>
</counts>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Machine Learning and Artificial Intelligence</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>1 Introduction</title>
<p>Geostationary Operational Environmental Satellites (GOES) have continuously measured Earth&#x00027;s magnetic field from geostationary orbit for more than 40 years, providing critical data for space weather monitoring and scientific research. The magnetometer (MAG) instruments onboard these satellites are operationally used by NOAA&#x00027;s Space Weather Prediction Center (SWPC) for real-time space weather forecasting and event detection, including geomagnetic storms, radiation belt dynamics, and magnetospheric processes (<xref ref-type="bibr" rid="B6">Loto&#x00027;aniu et al., 2023</xref>). Beyond operational use, GOES magnetometer data serve as a fundamental resource for space physics research, playing a key role in empirical magnetospheric modeling and the study of ultralow-frequency (ULF) waves (<xref ref-type="bibr" rid="B7">Loto&#x00027;aniu and Inceoglu, 2024</xref>).</p>
<p>The GOES-R series magnetometer measurements are periodically contaminated by arcjet thruster firings, which are performed to maintain spacecraft position&#x02013;specifically for North-South station keeping&#x02013;in geostationary orbit. Arcjet thrusters generate thrust by electrically heating hydrazine propellant, producing a partially ionized plasma plume whose elevated density and plasma pressure cause local magnetic field disturbances near the spacecraft. These firings introduce magnetic field perturbations, primarily due to the diamagnetic effect of the thruster plume and pressure gradients in the surrounding plasma environment (<xref ref-type="bibr" rid="B1">Califf et al., 2020a</xref>,<xref ref-type="bibr" rid="B2">b</xref>). Previous studies have highlighted the need for correction methods, as arcjet disturbances can introduce biases in the measured magnetic field data (<xref ref-type="bibr" rid="B6">Loto&#x00027;aniu et al., 2023</xref>). <xref ref-type="bibr" rid="B2">Califf et al. (2020b)</xref> developed the currently used correction matrix approach that applies linear adjustments to mitigate the effects of arcjet contamination, but this method has limitations in handling transient variations and residual errors in corrected data.</p>
<p>Machine learning methods have recently been explored to improve the quality of magnetometer data. <xref ref-type="bibr" rid="B5">Inceoglu and Loto&#x00027;aniu (2021)</xref> applied both supervised and unsupervised learning to correct offset anomalies stemming from thermal and seasonal effects in the GOES-16 magnetometer data, demonstrating that machine learning models can adapt to complex and non-linear variations in spacecraft magnetic field measurements. Building on this broader interest in ML-based correction methods, we explore the use of XGBoost (<xref ref-type="bibr" rid="B3">Chen and Guestrin, 2016</xref>), a tree-based gradient-boosting machine learning model, to correct arcjet-induced contamination in GOES-17 data using collocated GOES-18 magnetometer data as ground truth. Using supervised learning techniques and cross-satellite comparisons, we aim to develop a robust correction model that adapts to the dynamic characteristics of arcjet disturbances. GOES-18, which benefits from improved thermal stability and instrument design compared to GOES-16 and GOES-17 (<xref ref-type="bibr" rid="B6">Loto&#x00027;aniu et al., 2023</xref>), provides an ideal dataset to be used as the ground truth for training and validating the correction algorithm.</p>
</sec>
<sec id="s2">
<title>2 Data and methods</title>
<sec>
<title>2.1 Overview of the GOES-R series and magnetometer design</title>
<p>The GOES-R series (GOES-16 to GOES-19) represents the most recent generation of NOAA&#x00027;s geostationary satellites, designed for continuous weather and space weather monitoring. GOES-16 (launched in November 2016, earlier GOES-East) is located at 75.2&#x000B0;W. GOES-17 (launched in March 2018) served as GOES-West at 137.2&#x000B0; W from February 2019 until January 2023 (<xref ref-type="bibr" rid="B6">Loto&#x00027;aniu et al., 2023</xref>), after which it was relocated to on-orbit storage at 105&#x000B0; W.<xref ref-type="fn" rid="fn0001"><sup>1</sup></xref> GOES-18 (launched in March 2022) transitioned to 137.0&#x000B0; W in mid-2022 and fully assumed operational GOES-West status by early January 2023 (<xref ref-type="bibr" rid="B6">Loto&#x00027;aniu et al., 2023</xref>). Each spacecraft carries a pair of fluxgate magnetometers mounted on a boom, with the inboard (IB) sensor positioned 6.3 m and the outboard (OB) sensor 8.5 m from the spacecraft. The IB sensor is more susceptible to thermal influences from the spacecraft bus, whereas the OB sensor generally provides greater long-term stability (<xref ref-type="bibr" rid="B6">Loto&#x00027;aniu et al., 2023</xref>).</p>
<p>Following its launch in March 2022, GOES-18 was positioned close to GOES-17 for 2.5 months, with a longitudinal separation of only 0.4 degree (136.8&#x000B0;W vs. 137.2&#x000B0;W). This first of its kind collocation provided a unique opportunity for direct cross-satellite calibration, as both spacecraft observed nearly identical geomagnetic conditions. These comparisons demonstrated that GOES-18, equipped with the new Goddard magnetometers (GMAG), exhibits diurnal and long-term stability within &#x000B1;1 nT, compared to variations of &#x0007E;2 nT on GOES-17 and &#x0003E;10 nT on GOES-16 (<xref ref-type="bibr" rid="B6">Loto&#x00027;aniu et al., 2023</xref>). This improvement reflects both the change in vendor (NASA Goddard vs. MEDA for GOES-16/17) and significant engineering upgrades, including redesigned sensor and electronics units, added heaters and thermal isolation spacers, enhanced blanketing, and extensive ground and on-orbit thermal testing, all of which mitigated the thermal instabilities seen in earlier MAG instruments (<xref ref-type="bibr" rid="B6">Loto&#x00027;aniu et al., 2023</xref>).</p>
</sec>
<sec>
<title>2.2 Contamination in the magnetic field measurements</title>
<p>The arcjet thrusters aboard the GOES-R series satellites&#x02014;GOES-16 to GOES-19&#x02014;are periodically fired to maintain the spacecraft&#x00027;s geostationary orbit. Each satellite is equipped with four thrusters, numbered 13, 14, 15, and 16 (<xref ref-type="fig" rid="F1">Figure 1</xref>), nominally fire in alternating pairs (13&#x02013;15 or 14&#x02013;16) approximately every four days for around 90 minutes per maneuver (<xref ref-type="bibr" rid="B1">Califf et al., 2020a</xref>), although the exact pairing may occasionally vary. During these firings, the GOES-R magnetometers detect a significant artificial disturbance in the local magnetic field, introducing deviations of up to &#x0007E;20 nT, 20% of the typical geomagnetic field strength at geostationary orbit (<xref ref-type="bibr" rid="B2">Califf et al., 2020b</xref>). This contamination occurs due to two primary physical mechanisms: (i) the diamagnetic effect of the thruster plume, which reduces the local ambient magnetic field strength, and (ii) plasma pressure gradients within the thruster exhaust that introduce additional localized magnetic perturbations (<xref ref-type="bibr" rid="B2">Califf et al., 2020b</xref>,<xref ref-type="bibr" rid="B1">a</xref>). The disturbance is most pronounced along the thrust axis, primarily affecting the poleward (P) component of the measured magnetic field, and remains relatively stable throughout the burn period. Although rapid recovery of the magnetic field occurs once the thrusters are deactivated, small residual offsets (&#x0007E;1-2 nT) may persist for hours due to thermal effects on the magnetometer electronics (<xref ref-type="bibr" rid="B1">Califf et al., 2020a</xref>).</p>
<fig position="float" id="F1">
<label>Figure 1</label>
<caption><p>Illustration of the inboard and outboard GOES magnetometers mounted on a boom and arcjet thrusters. The axes show the relationship between earthward-poleward-normal (EPN) coordinate frame (<xref ref-type="bibr" rid="B1">Califf et al., 2020a</xref>).</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="frai-08-1628029-g0001.tif">
<alt-text>Diagram showing a cube with numbered thrusters (13-16) on one face indicating nozzle direction. Two thrusters point to the left, two upwards. Axes are labeled E (&#x0002B;Z), N (&#x0002B;X), and P (&#x02212;Y). Below, &#x0201C;Inboard MAG&#x0201D; and &#x0201C;Outboard MAG&#x0201D; are marked with two ellipses, connected by dotted lines.</alt-text>
</graphic>
</fig>
<p>Similar contamination in magnetic field measurements can also be observed in GOES-17 and GOES-18 (<xref ref-type="fig" rid="F2">Figure 2</xref>). During the GOES-17/18 collocation described above, both satellites measured the Earth&#x00027;s magnetic field under nearly identical conditions, enabling direct cross-satellite comparisons. Outside this overlap, the longitudinal separation between the satellites leads to differences in the ambient field, preventing one from being used as ground truth for the other.</p>
<fig position="float" id="F2">
<label>Figure 2</label>
<caption><p>The top panels show the magnetic field measurements from 26-27 July 2022 from GOES-17 (red) and GOES-18 (orange) in the E <bold>(a)</bold>, P <bold>(b)</bold>, and N <bold>(c)</bold> components of the EPN frame. The lower panels <bold>(d, e)</bold> show the current and voltage values corresponding to the arcjet firings on GOES-17.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="frai-08-1628029-g0002.tif">
<alt-text>Five graphs display various measurements over time from 14:00 to 02:00. Graph (a) shows magnetic field strength B for G17-E and G18-E, ranging from 10 to 25 nT. Graph (b) presents B for G17-P and G18-P, between 80 and 100 nT. Graph (c) depicts B for G17-N and G18-N, ranging from 10 to 20 nT. Graph (d) illustrates current (I1, I2, I3), showing a peak for I2 around 22:00. Graph (e) displays voltage (V13, V14, V15, V16), where four lines start around zero and rise sharply at specific times.</alt-text>
</graphic>
</fig>
<p>During the collocation, we identified 21 GOES-17 arcjet firing events during which GOES-18 did not fire its arcjets simultaneously. This situation provided us with a great opportunity to directly compare the effects of arcjet contamination in GOES-17 measurements. To avoid ambiguity, only events that occurred during geomagnetically quiet to moderate conditions (generally Kp &#x02264; 4, with no major disturbances such as magnetopause crossings) were selected. Any background wave activity present during the collocation would have been observed nearly identically by both satellites (given their 0.4 degree longitudinal separation) and therefore does not introduce bias in the cross-satellite correction. All magnetic field data are analyzed in the Earth-Pointing Normal (EPN) coordinate system, where the X-axis points toward Earth&#x00027;s center, the Y-axis is aligned opposite the solar panel axis (approximately anti-sunward), and the Z-axis completes the right-handed system, generally pointing northward along the spacecraft&#x00027;s orbital normal. This coordinate system is spacecraft-fixed and provides a stable reference frame for interpreting arcjet-related disturbances, which are closely aligned with specific spacecraft structures and thruster directions. When the arcjets are activated, their current (<xref ref-type="fig" rid="F2">Figure 2c</xref>) and voltage (<xref ref-type="fig" rid="F2">Figure 2d</xref>) levels increase and remain elevated while firing. Simultaneously, there is a clear reduction in the magnetic field measurements in the E and P components of the EPN coordinate frame, which persists for the full firing duration (<xref ref-type="fig" rid="F2">Figures 2a</xref>, <xref ref-type="fig" rid="F2">b</xref>).</p>
<p>Given the significance of GOES magnetometer data for scientific research, mitigation of arcjet-induced contamination is crucial. A correction algorithm was developed for both the OB and IB magnetometers on each GOES-R series spacecraft, based on an observed linear relationship between the arcjet disturbance and the ambient magnetic field (<xref ref-type="bibr" rid="B1">Califf et al., 2020a</xref>). While both sensors are corrected independently, OB data are generally prioritized due to its improved thermal and bias stability. This least-squares regression correction was initially found to reduce residual errors to below 1.5 nT during steady-state firings in GOES-16 (<xref ref-type="bibr" rid="B1">Califf et al., 2020a</xref>). However, in cases involving non-nominal firing configurations, such as single-thruster firings or evolving thrust levels, residuals from the matrix correction can be substantially larger, particularly on GOES-17 (<xref ref-type="bibr" rid="B1">Califf et al., 2020a</xref>). These abnormal cases are relatively uncommon compared to the standard paired-thruster maneuvers but illustrate the need for more robust correction techniques. Our study focused on well-defined paired thruster firings during the GOES-17/18 collocation; therefore, the performance of XGBoost in these rarer abnormal cases remains to be evaluated. Importantly, the existing correction matrix does not account for the short-lived transient effects at the start and end of arcjet burns, which are flagged as invalid in operational data. In designing our approach, we developed the XGBoost model to better capture these transient shoulders, providing a framework that can in principle address such short-duration features.</p>
</sec>
<sec>
<title>2.3 XGBoost: training, validation, and test</title>
<p>XGBoost (Extreme Gradient Boosting) is an advanced, scalable tree boosting system designed for efficiency, accuracy, and performance in large-scale machine learning applications (<xref ref-type="bibr" rid="B3">Chen and Guestrin, 2016</xref>). It is an implementation of gradient boosted decision trees (<xref ref-type="bibr" rid="B4">Friedman, 2001</xref>) that introduces several optimizations to enhance speed and scalability. XGBoost iteratively constructs an ensemble of decision trees, where each new tree corrects errors made by previous trees, optimizing an objective function through gradient descent (<xref ref-type="bibr" rid="B3">Chen and Guestrin, 2016</xref>). The system incorporates a regularized learning objective to control model complexity and mitigate overfitting, ensuring generalization to unseen data.</p>
<p>To develop XGBoost models for each EPN coordinate frame component, we used data from the GOES-17/18 collocation period. We selected 21 GOES-17 arcjet firing events during which GOES-18 had no simultaneous firings, ensuring a clean reference for training and validation. GOES-18 magnetic field measurements were used as the ground truth, and data from both IB and OB magnetometers were combined to increase the training volume and enhance generalizability.</p>
<p>Before training the XGBoost models, we preprocessed the data to make sure that the GOES-18 data, which is used as the ground truth, and the GOES-17 data, which will be corrected, overlap without any longer-term effects due to the slightly higher thermal sensitivity of the GOES-17 magnetometers (<xref ref-type="fig" rid="F3">Figure 3</xref>). To achieve this objective, we first subtracted the GOES-18 measurements in each component of the EPN coordinate frame from the GOES-17 measurements during the periods when there are no arcjet firings (<xref ref-type="fig" rid="F3">Figures 3a</xref>, <xref ref-type="fig" rid="F3">d</xref>) for each of the 21 days from the IB and OB magnetic field data. Subsequently, we fitted a piecewise linear regression function to each difference to determine the longer-term difference between the two satellites (<xref ref-type="fig" rid="F3">Figures 3b</xref>, <xref ref-type="fig" rid="F3">e</xref>). We then subtracted this longer-term trend in magnetic field measurements from the GOES-18 data for each component in the EPN frame (<xref ref-type="fig" rid="F3">Figures 3c</xref>, <xref ref-type="fig" rid="F3">f</xref>) to create &#x0201C;adjusted GOES-18&#x0201D; values that effectively represent what GOES-17 would have measured in the absence of arcjet contamination. These adjusted values are no longer the raw GOES-18 measurements but serve as a proxy ground truth for training and validation.</p>
<fig position="float" id="F3">
<label>Figure 3</label>
<caption><p>The left panels <bold>(a, d, g)</bold> show the magnetic field measurements from GOES-17 (red) and GOES-18 (orange) in the E <bold>(a)</bold>, P <bold>(d)</bold>, and N <bold>(g)</bold> components of the EPN coordinate frame. The middle panels <bold>(b, e, h)</bold> show the differences between the GOES-18 and GOES-17 magnetic field measurements (black dots) for the E <bold>(b)</bold>, P <bold>(e)</bold>, and N <bold>(h)</bold> components, together with the piecewise linear regression fits (red lines). The right panels <bold>(c, f, i)</bold> show the adjusted GOES-18 values (green) after subtracting the piecewise linear regression fits for the E <bold>(c)</bold>, P <bold>(f)</bold>, and N <bold>(i)</bold> components.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="frai-08-1628029-g0003.tif">
<alt-text>Nine-panel grid of line graphs and scatter plots shows magnetic field variations over time. Panels (a), (d), and (g) display double line graphs comparing two datasets (G17 and G18) in orange and red. Panels (b), (e), and (h) show scatter plots of differences with linear fit lines in black and red. Panels (c), (f), and (i) present adjusted datasets in red and green. Time ranges on the x-axis are from 14:00 to 02:00, and magnetic field strength in nanoteslas (nT) is indicated on the y-axis. Each panel is labeled from (a) to (i).</alt-text>
</graphic>
</fig>
<p>Training, validation, and test sets were created from 21 GOES-17 arcjet events during the collocation period, using adjusted GOES-18 data as ground truth and GOES-17 current and voltage as inputs. We restricted the inputs to spacecraft-intrinsic parameters (current and voltage), as these directly drive the contamination mechanism and keep the correction algorithm self-contained. External variables such as the ambient magnetic field or interplanetary conditions were not included, because GOES satellites are normally located at different longitudes and the 2.5-month collocation period did not provide enough variability in interplanetary and ambient magnetic fields to train a generalized model that incorporates broader space weather effects.</p>
<p>Each &#x0201C;dataset&#x0201D; refers to a time window centered on a single arcjet firing, typically spanning the duration of the maneuver with a buffer before and after. This process yielded 30 arcjet events across 15 days for training, 4 events from 2 days for validation, and 4 events from 4 days for testing. Only IB magnetometer data were used for testing, as IB and OB arcjet responses are highly similar. To ensure test-day independence, all OB data from the test days were excluded from the training and validation sets.</p>
<p>Hyperparameter optimization was explored using a Bayesian search over a range of values (<xref ref-type="table" rid="T1">Table 1</xref>). However, optimization did not yield significant improvements compared to the default values, largely due to the limited size and variability of the training dataset. Therefore, we retained the default values in all further analyses.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Default XGBoost hyperparameter values and the parameter ranges explored using Bayesian optimization.</p></caption>
<table frame="box" rules="all">
<thead>
<tr>
<th valign="top" align="left"><bold>Parameter</bold></th>
<th valign="top" align="center"><bold>Default value</bold></th>
<th valign="top" align="center"><bold>Search range</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">booster</td>
<td valign="top" align="center">gbtree</td>
<td valign="top" align="center">gbtree</td>
</tr>
<tr>
<td valign="top" align="left">learning_rate</td>
<td valign="top" align="center">0.3</td>
<td valign="top" align="center">{0.0001, 0.001}</td>
</tr>
<tr>
<td valign="top" align="left">max_depth</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">{64, 128, 256}</td>
</tr>
<tr>
<td valign="top" align="left">max_leaves</td>
<td valign="top" align="center">-</td>
<td valign="top" align="center">{128, 256, 512}</td>
</tr>
<tr>
<td valign="top" align="left">n_estimators</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">{30, 100, 500, 1,000, 1,500}</td>
</tr>
<tr>
<td valign="top" align="left">max_bin</td>
<td valign="top" align="center">-</td>
<td valign="top" align="center">{10, 100, 1,000}</td>
</tr>
<tr>
<td valign="top" align="left">min_child_weight</td>
<td valign="top" align="center">-</td>
<td valign="top" align="center">[0.01, 10.0]</td>
</tr>
<tr>
<td valign="top" align="left">subsample</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">-</td>
</tr>
<tr>
<td valign="top" align="left">colsample_bytree</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">-</td>
</tr>
<tr>
<td valign="top" align="left">gamma</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">-</td>
</tr>
<tr>
<td valign="top" align="left">reg_alpha</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">-</td>
</tr>
<tr>
<td valign="top" align="left">reg_lambda</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">-</td>
</tr></tbody>
</table>
<table-wrap-foot>
<p>Parameters without a search range were kept fixed.</p>
</table-wrap-foot>
</table-wrap>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>3 Results</title>
<p>The R<sup>2</sup> and mean squared error (MSE) values for each component of the EPN coordinate frame for the training and validation sets display very high and low values, respectively (<xref ref-type="table" rid="T2">Table 2</xref>). These values primarily reflect the fact that the GOES-17 data and the adjusted GOES-18 ground truth are nearly identical outside the arcjet firing periods, which make up roughly 10% of the total data length. Since the XGBoost correction algorithm is applied across the full time window&#x02013;including before, during, and after arcjet events&#x02013;the evaluation metrics are computed over the entire period to reflect overall model performance. Visual inspection of the corrected time series during arcjet activity is used in tandem to assess the model&#x00027;s effectiveness in addressing the contamination.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>XGBoost performance metrics for training and validation sets.</p></caption>
<table frame="box" rules="all">
<thead>
<tr>
<th/>
<th valign="top" align="center" colspan="2"><bold>Training</bold></th>
<th valign="top" align="center" colspan="2"><bold>Validation</bold></th>
</tr>
<tr>
<th/>
<th valign="top" align="center"><bold>R</bold><sup>2</sup></th>
<th valign="top" align="center"><bold>MSE</bold></th>
<th valign="top" align="center"><bold>R</bold><sup>2</sup></th>
<th valign="top" align="center"><bold>MSE</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><bold>E</bold></td>
<td valign="top" align="center">0.9983</td>
<td valign="top" align="center">0.1066</td>
<td valign="top" align="center">0.9897</td>
<td valign="top" align="center">0.1734</td>
</tr>
<tr>
<td valign="top" align="left"><bold>P</bold></td>
<td valign="top" align="center">0.9999</td>
<td valign="top" align="center">0.0598</td>
<td valign="top" align="center">0.9977</td>
<td valign="top" align="center">0.4944</td>
</tr>
<tr>
<td valign="top" align="left"><bold>N</bold></td>
<td valign="top" align="center">0.9993</td>
<td valign="top" align="center">0.0317</td>
<td valign="top" align="center">0.9903</td>
<td valign="top" align="center">0.1101</td>
</tr></tbody>
</table>
</table-wrap>
<p>We chose four dates as our test days where GOES-17 fired its arcjets to maneuver while GOES-18 continued to measure the undisturbed magnetic field; 22, 26, and 29 July and 2 August 2022. Using the corrected IB magnetic field data from GOES-17 and the adjusted magnetic field data from GOES-18 we calculated the Pearson correlations and MSE values. The results show that our correction algorithm for each component of the EPN coordinate frame has very strong correlations, except for the component P on August 2nd, and component E on July 29th when the R<sup>2</sup> is lower and hence the MSE value is higher compared with other days and components (<xref ref-type="table" rid="T3">Table 3</xref>).</p>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p>Correlation coefficients and mean squared errors (MSE) for E, P, and N components calculated using the corrected GOES-17 magnetic field data and adjusted GOES-18 data for the test set.</p></caption>
<table frame="box" rules="all">
<thead>
<tr>
<th valign="top" align="left"><bold>Date</bold></th>
<th valign="top" align="center" colspan="2"><bold>E</bold></th>
<th valign="top" align="center" colspan="2"><bold>P</bold></th>
<th valign="top" align="center" colspan="2"><bold>N</bold></th>
</tr>
<tr>
<th/>
<th valign="top" align="center"><bold>R</bold><sup>2</sup></th>
<th valign="top" align="center"><bold>MSE</bold></th>
<th valign="top" align="center"><bold>R</bold><sup>2</sup></th>
<th valign="top" align="center"><bold>MSE</bold></th>
<th valign="top" align="center"><bold>R</bold><sup>2</sup></th>
<th valign="top" align="center"><bold>MSE</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">22 July</td>
<td valign="top" align="center">0.9713</td>
<td valign="top" align="center">0.3612</td>
<td valign="top" align="center">0.9976</td>
<td valign="top" align="center">0.2419</td>
<td valign="top" align="center">0.9973</td>
<td valign="top" align="center">0.1058</td>
</tr>
<tr>
<td valign="top" align="left">26 July</td>
<td valign="top" align="center">0.9887</td>
<td valign="top" align="center">0.0991</td>
<td valign="top" align="center">0.9985</td>
<td valign="top" align="center">0.0711</td>
<td valign="top" align="center">0.9903</td>
<td valign="top" align="center">0.2333</td>
</tr>
<tr>
<td valign="top" align="left">29 July</td>
<td valign="top" align="center">0.9559</td>
<td valign="top" align="center">0.2746</td>
<td valign="top" align="center">0.9980</td>
<td valign="top" align="center">0.2217</td>
<td valign="top" align="center">0.9970</td>
<td valign="top" align="center">0.0424</td>
</tr>
<tr>
<td valign="top" align="left">02 Aug</td>
<td valign="top" align="center">0.9938</td>
<td valign="top" align="center">0.1258</td>
<td valign="top" align="center">0.9774</td>
<td valign="top" align="center">2.7067</td>
<td valign="top" align="center">0.9966</td>
<td valign="top" align="center">0.1766</td>
</tr></tbody>
</table>
</table-wrap>
<p>In general, the corrected GOES-17 magnetic field measurements in the E component, obtained using the XGBoost algorithm (orange lines in <xref ref-type="fig" rid="F4">Figures 4a</xref>, <xref ref-type="fig" rid="F4">d</xref>, <xref ref-type="fig" rid="F4">g</xref>, <xref ref-type="fig" rid="F4">j</xref>), closely follow the adjusted GOES-18 values (green lines in <xref ref-type="fig" rid="F4">Figures 4a</xref>, <xref ref-type="fig" rid="F4">d</xref>, <xref ref-type="fig" rid="F4">g</xref>, <xref ref-type="fig" rid="F4">j</xref>). Additionally, the existing correction algorithm, which is based on a correction matrix, exhibits slight differences in long-term trends (blue lines in <xref ref-type="fig" rid="F4">Figures 4a</xref>, <xref ref-type="fig" rid="F4">d</xref>, <xref ref-type="fig" rid="F4">g</xref>, <xref ref-type="fig" rid="F4">j</xref>).</p>
<fig position="float" id="F4">
<label>Figure 4</label>
<caption><p>The left panel shows 4 dates we tested the XGBoost developed to correct the E component of the EPN coordinate frame from GOES-17 (red), adjusted GOES-18 (green), the corrected GOES-17 using the current (blue) and XGBoost (orange) methods. The middle panel shows the same but focused around the arcjet firing periods, while the right panel shows the histograms of the differences between the adjusted GOES-18 values and the XGBoost (orange) and the current (blue) models. The left panel <bold>(a, d, g, j)</bold> shows 4 dates we tested the XGBoost developed to correct the E component of the EPN coordinate frame from GOES-17 (red), adjusted GOES-18 (green), the corrected GOES-17 using the current (blue) and XGBoost (orange) methods. The middle panel <bold>(b, e, h, k)</bold> shows the same but focused around the arcjet firing periods, while the right panel <bold>(c, f, i, l)</bold> shows the histograms of the differences between the adjusted GOES-18 values and the XGBoost (orange) and the current (blue) models.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="frai-08-1628029-g0004.tif">
<alt-text>Series of graphs showing magnetic field strength (B in nanoteslas) over specific dates in July and August 2022. Multiple lines represent different models: G17-E, G18-E adj., current, and XGB. Each date includes a main line graph, with associated bar graphs illustrating data distribution for each model. Dates shown are July 22, 23, 26, 29, and August 2. Each graph illustrates fluctuations and comparisons across models.</alt-text>
</graphic>
</fig>
<p>When zooming in on the arcjet firing periods each day, the XGBoost-based correction is observed to perform significantly better, particularly on July 26 (<xref ref-type="fig" rid="F4">Figure 4e</xref>) and August 2, 2022 (<xref ref-type="fig" rid="F4">Figure 4k</xref>). On July 22, while the existing correction algorithm removes the overall reduction in the magnetic field measurements during the arcjet firing period, a shoulder remains visible at 19:30 (<xref ref-type="fig" rid="F4">Figure 4b</xref>). Conversely, the XGBoost-based correction algorithm eliminates the shoulder at the beginning of the arcjet firing but introduces a dip in the measurements (<xref ref-type="fig" rid="F4">Figure 4b</xref>). On July 29, the XGBoost-based correction algorithm erroneously produces reduced values toward the end of the arcjet firing period, starting around 20:15 (<xref ref-type="fig" rid="F4">Figure 4h</xref>).</p>
<p>We then compared the distributions of the differences between the adjusted GOES-18 magnetic field values in the E component and the corrections obtained using the XGBoost- and matrix-based algorithms (<xref ref-type="fig" rid="F4">Figures 4c</xref>, <xref ref-type="fig" rid="F4">f</xref>, <xref ref-type="fig" rid="F4">i</xref>, <xref ref-type="fig" rid="F4">l</xref>). In general, the XGBoost-based correction yields differences within 2 nT, whereas the existing correction algorithm can exceed 4 nT in some cases. This shows that the XGBoost algorithm provides better corrections overall.</p>
<p>Both the XGBoost-based algorithm and the existing correction method produce results without any deviation in the long-term trend of the P component (<xref ref-type="fig" rid="F5">Figures 5a</xref>, <xref ref-type="fig" rid="F5">d</xref>, <xref ref-type="fig" rid="F5">g</xref>, <xref ref-type="fig" rid="F5">j</xref>). However, sudden spikes at the beginning of nearly every arcjet firing event can be observed in the existing correction method (<xref ref-type="fig" rid="F5">Figures 5a</xref>, <xref ref-type="fig" rid="F5">d</xref>, <xref ref-type="fig" rid="F5">g</xref>, <xref ref-type="fig" rid="F5">j</xref>). When zooming in on the arcjet firing periods, these structures become more pronounced, whereas the XGBoost method does not generate spikes of comparable magnitude (<xref ref-type="fig" rid="F5">Figures 5b</xref>, <xref ref-type="fig" rid="F5">e</xref>, <xref ref-type="fig" rid="F5">h</xref>, <xref ref-type="fig" rid="F5">k</xref>). Additionally, the XGBoost correction algorithm fails to properly correct the arcjet contamination on August 2 (<xref ref-type="fig" rid="F5">Figure 5k</xref>), which is also evident from the R<sup>2</sup> and MSE values (<xref ref-type="table" rid="T3">Table 3</xref>).</p>
<fig position="float" id="F5">
<label>Figure 5</label>
<caption><p>The same as <xref ref-type="fig" rid="F4">Figure 4</xref>, but for the P component of the EPN coordinate frame. The left panel <bold>(a, d, g, j)</bold> shows 4 dates we tested the XGBoost developed to correct the P component of the EPN coordinate frame from GOES-17 (red), adjusted GOES-18 (green), the corrected GOES-17 using the current (blue) and XGBoost (orange) methods. The middle panel <bold>(b, e, h, k)</bold> shows the same but focused around the arcjet firing periods, while the right panel <bold>(c, f, i, l)</bold> shows the histograms of the differences between the adjusted GOES-18 values and the XGBoost (orange) and the current (blue) models.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="frai-08-1628029-g0005.tif">
<alt-text>Graphs showing magnetic field data (B in nanoTesla) over five days: July 22, 26, 29, and August 2, 2022. Each row represents a different day with line plots and corresponding bar charts. The lines represent G17-P, G18-P adjusted, current, and XGB data with some variations in trends between 14:00 and 21:30. Bar charts highlight data differences for each day.</alt-text>
</graphic>
</fig>
<p>Overall, the XGBoost correction algorithm generally performs slightly better than the existing correction algorithm for the P component (<xref ref-type="fig" rid="F5">Figures 5c</xref>, <xref ref-type="fig" rid="F5">f</xref>, <xref ref-type="fig" rid="F5">i</xref>, <xref ref-type="fig" rid="F5">l</xref>).</p>
<p>For the N component, results from the test data indicate that the XGBoost correction algorithm slightly underperforms compared to the existing correction algorithm (<xref ref-type="fig" rid="F6">Figure 6</xref>). Although the XGBoost algorithm provides corrections without any deviation from the long-term trend (<xref ref-type="fig" rid="F6">Figures 6a</xref>, <xref ref-type="fig" rid="F6">d</xref>, <xref ref-type="fig" rid="F6">g</xref>, <xref ref-type="fig" rid="F6">j</xref>), it does not correct the arcjet contamination as effectively as the existing correction algorithm when zoomed in <xref ref-type="fig" rid="F6">Figures 6b</xref>, <xref ref-type="fig" rid="F6">e</xref>, <xref ref-type="fig" rid="F6">h</xref>, <xref ref-type="fig" rid="F6">k</xref>.</p>
<fig position="float" id="F6">
<label>Figure 6</label>
<caption><p>The same as <xref ref-type="fig" rid="F4">Figure 4</xref>, but for the N component of the EPN coordinate frame. The left panel <bold>(a, d, g, j)</bold> shows 4 dates we tested the XGBoost developed to correct the N component of the EPN coordinate frame from GOES-17 (red), adjusted GOES-18 (green), the corrected GOES-17 using the current (blue) and XGBoost (orange) methods. The middle panel <bold>(b, e, h, k)</bold> shows the same but focused around the arcjet firing periods, while the right panel <bold>(c, f, i, l)</bold> shows the histograms of the differences between the adjusted GOES-18 values and the XGBoost (orange) and the current (blue) models.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="frai-08-1628029-g0006.tif">
<alt-text>Graphs displaying magnetic field data (B in nT) over specific dates in July and August 2022. Line charts and bar graphs compare different datasets: G17-N, current data, G18-N adjusted, and XGB predictions. Dates include July 22, 26, 29, and August 2, with distinct data trends and comparisons for each day.</alt-text>
</graphic>
</fig>
<p>This is also evident from the distributions of the differences between the adjusted GOES-18 magnetic field values in the N component and the corrections produced by the XGBoost and current methods (<xref ref-type="fig" rid="F6">Figures 6c</xref>, <xref ref-type="fig" rid="F6">f</xref>, <xref ref-type="fig" rid="F6">i</xref>, <xref ref-type="fig" rid="F6">l</xref>). The primary reason for this limitation is the insufficient amount of training data, particularly in the N component, which had fewer contaminated magnetic field measurements compared to the others. Unfortunately, the limited data availability is a direct consequence of the short collocation period.</p>
<sec>
<title>3.1 Uncertainties and potential bias</title>
<p>In addition to the performance of the model, it is important to consider uncertainties at the instrument and platform levels that may influence the corrected results.</p>
<p>First, daily and seasonal changes in the thermal environment in geostationary orbit can cause diurnal structure and long-term drifts on the raw measurements, especially for the earlier R-series MAG sensors. Intersatellite analyses show that GOES-16 OB data exhibit artificial diurnal variations of order &#x000B1;3 nT (1&#x003C3; &#x02248; &#x000B1;1.5 nT), whereas GOES-17 OB exhibits minimal daily variation and long-term stability within &#x0007E;&#x000B1;2 nT (<xref ref-type="bibr" rid="B8">Rich et al., 2024</xref>). By contrast, the GOES-18 GMAG demonstrates much improved thermal stability: on-orbit OB-IB differences remain &#x02272;&#x000B1;0.2 nT under diurnal cycling, and overall accuracy meets the NOAA &#x000B1;1 nT requirement (excluding arcjet periods) (<xref ref-type="bibr" rid="B6">Loto&#x00027;aniu et al., 2023</xref>).</p>
<p>Second, zero-level (offset) calibration and its refinement can introduce step-like biases if not updated. For GOES-17, yaw-flip maneuvers (180&#x000B0; rotations) were used to refine the OB zero offsets (e.g., &#x02212;0.25 nT in P and &#x0002B;1.82 nT in N) and reduce long-term OB differences with other GOES spacecraft (<xref ref-type="bibr" rid="B8">Rich et al., 2024</xref>). Related analyses also indicate component-dependent biases on GOES-15 that manifest as pre/post-yaw-flip shifts (<xref ref-type="bibr" rid="B8">Rich et al., 2024</xref>). During the GOES-17/18 collocation, IB-OB statistics further show a mean offset in the GOES-17 N component (IB&#x02212;OB &#x02248; &#x02212;2.1 &#x000B1; 0.62 nT), while GMAG (GOES-18) IB-OB means are near zero (<xref ref-type="bibr" rid="B6">Loto&#x00027;aniu et al., 2023</xref>). Additionally, GOES-17 experienced a 2021 safehold event after which a small residual bias shift persisted (<xref ref-type="bibr" rid="B6">Loto&#x00027;aniu et al., 2023</xref>).</p>
<p>Third, sensor placement and thermal coupling lead to IB/OB differences. Prior work showed the GOES-16 IB sensor is more thermally susceptible than OB, producing time-varying offsets of several nT tied to eclipse seasons and diurnal heating; ML corrections reduced these variations from &#x0007E;3-5 nT to &#x0007E;0-2 nT (E component) but some residual, MLT-dependent offsets remained (<xref ref-type="bibr" rid="B5">Inceoglu and Loto&#x00027;aniu, 2021</xref>). These findings motivate our use of GOES-18 as the reference and our emphasis on spacecraft-intrinsic inputs for the correction.</p>
<p>In the context of our study, these factors imply that (i) residual uncertainty in the corrected GOES-17 series is bounded below by the stability/accuracy of the GOES-18 GMAG (order &#x0007E;1 nT) and by any residual, component-dependent GOES-17 biases during the collocation; and (ii) IB and OB arcjet responses are highly similar in our data, but long-term IB/OB thermal susceptibilities differ. Practically, this is why we (a) trained against adjusted GOES-18 (collocation-aligned) values, (b) withheld OB data from test days, and (c) evaluate performance by component and day. We recommend users treat start/end-of-burn transients and days with known configuration changes or post-anomaly bias shifts with additional caution, and we view uncertainty quantification (e.g., confidence intervals on corrections or QC flags) as a natural extension for future operational use.</p>
<p>It should be noted that the XGBoost correction was designed to address plume-related arcjet contamination during firing periods and does not explicitly correct for the small residual thermal offsets (&#x0007E;1-2 nT) that can persist for hours after thruster shutdown (<xref ref-type="bibr" rid="B1">Califf et al., 2020a</xref>). These long-lived effects were removed during preprocessing and remain a separate calibration challenge.</p>
</sec>
</sec>
<sec sec-type="conclusions" id="s4">
<title>4 Conclusions</title>
<p>The GOES magnetometers provide critical measurements for space weather monitoring and scientific research. However, the magnetic field data measured by the GOES missions are periodically contaminated by arcjet thruster firings due to attitude and maneuver corrections, introducing artificial disturbances that can impact both operational and research applications. The existing correction method, the correction matrix approach, mitigates these effects, but struggles with transient variations and residual errors, necessitating a more adaptive solution.</p>
<p>In this study, we developed an XGBoost-based machine learning model for each component of the EPN coordinate frame to correct arcjet-induced contamination in the GOES-17 magnetometer data using GOES-18 as a ground truth, after adjusting for longer-term trends. Our results demonstrate that the XGBoost shows great promise in reducing artificial disturbances, particularly in that it outperforms the existing correction in mitigating the non-linear features observed in the P component at the start and end of thruster firings. Although the model shows strong performance, some limitations remain due to training data constraints, particularly in certain magnetic field components in the EPN frame.</p>
<p>Although our model was developed and evaluated retrospectively, the underlying architecture and speed of XGBoost inference suggest that, with appropriate data preprocessing pipelines, the approach could be adapted for near-real-time implementation. This opens the possibility of future operational integration, enabling better correction of thruster-related contamination as part of the NOAA space weather monitoring workflow.</p>
<p>From an operational standpoint, interpretability remains a critical factor when assessing correction methods. Although the existing matrix approach is deterministic and familiar to operators, it can leave residual artifacts that resemble genuine geophysical events, such as magnetopause crossings, potentially leading to false positives. In contrast, ML-corrected data more effectively suppresses such artifacts but introduces a level of complexity in traceability and confidence, especially when corrections are based on patterns learned from other satellites. For operational users, a hybrid approach, in which ML corrections are applied but are accompanied by quality flags or confidence intervals, may offer the best balance between accuracy and interpretability.</p>
<p>We emphasize the importance of satellite collocation, such as the overlapping orbits of GOES-17 and GOES-18, as it provides a unique opportunity for cross-satellite calibration and validation, which is essential for improving magnetometer data accuracy and enhancing space weather monitoring capabilities.</p>
</sec>
</body>
<back>
<sec sec-type="data-availability" id="s5">
<title>Data availability statement</title>
<p>The training, validation, and test data, which were used for our models to correct arcjet-related anomalies in the magnetic field data, contain several different components. These include 1) magnetic field data in the EPN coordinate frame from two GOES-R series magnetometers, GOES-17 and GOES-18. And 2) arcjet specific data, including current and voltage supplied to the arcjets, and a binary arcjet flag derived from these values. Different components of these training data are subject to different export controlled restrictions. GOES-R Level 1b (L1b) data are generally publicly available on the NCEI product site (<ext-link ext-link-type="uri" xlink:href="https://www.ncei.noaa.gov/products/goes-r-magnetometer">https://www.ncei.noaa.gov/products/goes-r-magnetometer</ext-link>). The L1b files contain the magnetic field data and the arcjet flag. However, the GOES-18 data from the time period used for this study were produced prior to provisional maturity validation of the data product, and as such are not available in the public archive. The GOES-17 and GOES-18 magnetic field data and the associated arcjet flag for days corresponding to the training, validation and test set will be published in a Zenodo repository (doi: <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.15721786">10.5281/zenodo.15721786</ext-link>) alongside the trained models. These data have been retroactively reprocessed, and have the same calibrations applied as the publicly archived GOES-R L1b data (temperature, zero-level offset and alignment). The arcjet flag is a binary flag indicating no arcjet firing (flag value 0) or arcjet firing (1). Thus, it is possible to identify the periods of arcjet firings in the magnetic field data. The arcjet currents and voltages, however, cannot be made available publicly due to International Traffic in Arms (ITAR) restrictions. The raw telemetry corresponding to currents and voltages is available in the Level 0 (L0, <ext-link ext-link-type="uri" xlink:href="https://doi.org/doi:10.25921/sv2f-sm24">https://doi.org/doi:10.25921/sv2f-sm24</ext-link>) GOES-R MAG files which can be requested and evaluated on a case-by-case basis through NCEI customer support (<email>ncei.info&#x00040;noaa.gov</email>). Please note that the specific process to convert L0 telemetry values into scientific units is not part of the L0 metadata, and the relevant documentation to do so cannot be released due to ITAR restrictions. Users may be able to create synthetic current and voltage estimates based on the information provided both in this paper, and the previous paper on the arcjet correction (<xref ref-type="bibr" rid="B1">Califf et al., 2020a</xref>). For further information or questions about the GOES-R data products, users can email <email>swx.mag&#x00040;noaa.gov</email>.</p>
</sec>
<sec sec-type="author-contributions" id="s6">
<title>Author contributions</title>
<p>FI: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Supervision, Validation, Visualization, Writing &#x02013; original draft, Writing &#x02013; review &#x00026; editing. PL: Writing &#x02013; review &#x00026; editing.</p>
</sec>
<sec sec-type="funding-information" id="s7">
<title>Funding</title>
<p>The author(s) declare that financial support was received for the research and/or publication of this article. This research was supported by the NOAA cooperative agreements NA17OAR4320101 and NA22OAR4320151.</p>
</sec>
<ack><p>The views, opinions, and findings contained in this report are those of the authors and should not be construed as an official National Oceanic and Atmospheric Administration, National Aeronautics and Space Administration, or other U.S. Government position, policy, or decision. FI and PL would like to thank Alison Jarvis, Aspen Unkeles, and William Rowland for their contributions to data provision, data availability statement and language review, and Laurel Rachmeler for her helpful suggestions.</p>
</ack>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="ai-statement" id="s8">
<title>Generative AI statement</title>
<p>The author(s) declare that no Gen AI was used in the creation of this manuscript.</p>
<p>Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.</p>
</sec>
<sec sec-type="disclaimer" id="s9">
<title>Publisher&#x00027;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<fn-group>
<fn id="fn0001"><p><sup>1</sup>As noted in the NOAA Transition to Operations notice: <ext-link ext-link-type="uri" xlink:href="https://www.goes-r.gov/users/transitionToOperations17.html">https://www.goes-r.gov/users/transitionToOperations17.html</ext-link>.</p></fn>
</fn-group>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Califf</surname> <given-names>S.</given-names></name> <name><surname>Early</surname> <given-names>D.</given-names></name> <name><surname>Grotenhuis</surname> <given-names>M.</given-names></name> <name><surname>Loto&#x00027;aniu</surname> <given-names>T. M.</given-names></name> <name><surname>Kronenwetter</surname> <given-names>J.</given-names></name></person-group> (<year>2020a</year>). <article-title>Correcting the arcjet thruster disturbance in GOES-16 magnetometer data</article-title>. <source>Space Weather</source> <volume>18</volume>:<fpage>e02347</fpage>. <pub-id pub-id-type="doi">10.1029/2019SW002347</pub-id></citation>
</ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Califf</surname> <given-names>S.</given-names></name> <name><surname>Loto&#x00027;aniu</surname> <given-names>T. M.</given-names></name> <name><surname>Early</surname> <given-names>D.</given-names></name> <name><surname>Grotenhuis</surname> <given-names>M.</given-names></name></person-group> (<year>2020b</year>). <article-title>Arcjet thruster influence on local magnetic field measurements from a geostationary satellite</article-title>. <source>J. Spacecr. Rockets</source> <volume>57</volume>, <fpage>177</fpage>&#x02013;<lpage>186</lpage>. <pub-id pub-id-type="doi">10.2514/1.A34546</pub-id></citation>
</ref>
<ref id="B3">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>T.</given-names></name> <name><surname>Guestrin</surname> <given-names>C.</given-names></name></person-group> (<year>2016</year>). <article-title>&#x0201C;Xgboost: a scalable tree boosting system,&#x0201D;</article-title> in <source>Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD &#x00027;16</source> (<publisher-loc>New York, NY, USA</publisher-loc>: <publisher-name>Association for Computing Machinery</publisher-name>), <fpage>785</fpage>&#x02013;<lpage>794</lpage>. <pub-id pub-id-type="doi">10.1145/2939672.2939785</pub-id></citation>
</ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Friedman</surname> <given-names>J. H.</given-names></name></person-group> (<year>2001</year>). <article-title>Greedy function approximation: a gradient boosting machine</article-title>. <source>Ann. Stat</source>. <volume>29</volume>, <fpage>1189</fpage>&#x02013;<lpage>1232</lpage>. <pub-id pub-id-type="doi">10.1214/aos/1013203451</pub-id><pub-id pub-id-type="pmid">38281721</pub-id></citation></ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Inceoglu</surname> <given-names>F.</given-names></name> <name><surname>Loto&#x00027;aniu</surname> <given-names>P. T. M.</given-names></name></person-group> (<year>2021</year>). <article-title>Using unsupervised and supervised machine learning methods to correct offset anomalies in the GOES-16 magnetometer data</article-title>. <source>Space Weather</source> <volume>19</volume>:<fpage>e2021S</fpage>W002892. <pub-id pub-id-type="doi">10.1029/2021SW002892</pub-id></citation>
</ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Loto&#x00027;aniu</surname> <given-names>P. T. M.</given-names></name> <name><surname>Davis</surname> <given-names>A.</given-names></name> <name><surname>Jarvis</surname> <given-names>A.</given-names></name> <name><surname>Grotenhuis</surname> <given-names>M.</given-names></name> <name><surname>Rich</surname> <given-names>F. J.</given-names></name> <name><surname>Califf</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2023</year>). <article-title>Initial on-orbit results from the GOES-18 spacecraft science magnetometer</article-title>. <source>Space Sci. Rev</source>. <volume>219</volume>:<fpage>84</fpage>. <pub-id pub-id-type="doi">10.1007/s11214-023-01032-3</pub-id></citation>
</ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Loto&#x00027;aniu</surname> <given-names>P. T. M.</given-names></name> <name><surname>Inceoglu</surname> <given-names>F.</given-names></name></person-group> (<year>2024</year>). <article-title>The distribution of PC5 ultralow-frequency waves at geostationary orbit</article-title>. <source>Astrophys. J</source>. <volume>969</volume>:<fpage>91</fpage>. <pub-id pub-id-type="doi">10.3847/1538-4357/ad41dd</pub-id></citation>
</ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rich</surname> <given-names>F. J.</given-names></name> <name><surname>Califf</surname> <given-names>S.</given-names></name> <name><surname>Loto&#x00027;aniu</surname> <given-names>P. T. M.</given-names></name> <name><surname>Coakley</surname> <given-names>M.</given-names></name> <name><surname>Krimchansky</surname> <given-names>A.</given-names></name> <name><surname>Singer</surname> <given-names>H. J.</given-names></name></person-group> (<year>2024</year>). <article-title>Intersatellite comparisons of GOES magnetic field measurements</article-title>. <source>Space Weather</source> <volume>22</volume>:<fpage>e2023S</fpage>W003736. <pub-id pub-id-type="doi">10.1029/2023SW003736</pub-id></citation>
</ref>
</ref-list>
</back>
</article>