<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="en" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Therm. Eng.</journal-id>
<journal-title>Frontiers in Thermal Engineering</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Therm. Eng.</abbrev-journal-title>
<issn pub-type="epub">2813-0456</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">1594443</article-id>
<article-id pub-id-type="doi">10.3389/fther.2025.1594443</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Thermal Engineering</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>
<italic>Phy-ChemNODE</italic>: an end-to-end physics-constrained autoencoder-NeuralODE framework for learning stiff chemical kinetics of hydrocarbon fuels</article-title>
<alt-title alt-title-type="left-running-head">Kumar et al.</alt-title>
<alt-title alt-title-type="right-running-head">
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fther.2025.1594443">10.3389/fther.2025.1594443</ext-link>
</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Kumar</surname>
<given-names>Tadbhagya</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<role content-type="https://credit.niso.org/contributor-roles/data-curation/"/>
<role content-type="https://credit.niso.org/contributor-roles/methodology/"/>
<role content-type="https://credit.niso.org/contributor-roles/conceptualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/validation/"/>
<role content-type="https://credit.niso.org/contributor-roles/investigation/"/>
<role content-type="https://credit.niso.org/contributor-roles/software/"/>
<role content-type="https://credit.niso.org/contributor-roles/formal-analysis/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-original-draft/"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Kumar</surname>
<given-names>Anuj</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/2819722/overview"/>
<role content-type="https://credit.niso.org/contributor-roles/software/"/>
<role content-type="https://credit.niso.org/contributor-roles/investigation/"/>
<role content-type="https://credit.niso.org/contributor-roles/methodology/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-original-draft/"/>
<role content-type="https://credit.niso.org/contributor-roles/formal-analysis/"/>
<role content-type="https://credit.niso.org/contributor-roles/data-curation/"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Pal</surname>
<given-names>Pinaki</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1160335/overview"/>
<role content-type="https://credit.niso.org/contributor-roles/methodology/"/>
<role content-type="https://credit.niso.org/contributor-roles/supervision/"/>
<role content-type="https://credit.niso.org/contributor-roles/conceptualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/investigation/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-original-draft/"/>
<role content-type="https://credit.niso.org/contributor-roles/resources/"/>
<role content-type="https://credit.niso.org/contributor-roles/funding-acquisition/"/>
<role content-type="https://credit.niso.org/contributor-roles/project-administration/"/>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>Transportation and Power Systems Division, Argonne National Laboratory (DOE)</institution>, <addr-line>Lemont</addr-line>, <addr-line>IL</addr-line>, <country>United States</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>Department of Mechanical and Aerospace Engineering, North Carolina State University</institution>, <addr-line>Raleigh</addr-line>, <addr-line>NC</addr-line>, <country>United States</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1515900/overview">Fang Wang</ext-link>, Zhongyuan University of Technology, China</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/2071721/overview">Yuanqing Wu</ext-link>, Dongguan University of Technology, China</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/3028622/overview">Tommaso Grassi</ext-link>, Max Planck Institute for Extraterrestrial Physics, Germany</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/3031222/overview">Ahmed Almeldein</ext-link>, Oak Ridge National Laboratory (DOE), United States</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Pinaki Pal, <email>pal@anl.gov</email>
</corresp>
</author-notes>
<pub-date pub-type="epub">
<day>15</day>
<month>08</month>
<year>2025</year>
</pub-date>
<pub-date pub-type="collection">
<year>2025</year>
</pub-date>
<volume>5</volume>
<elocation-id>1594443</elocation-id>
<history>
<date date-type="received">
<day>16</day>
<month>03</month>
<year>2025</year>
</date>
<date date-type="accepted">
<day>30</day>
<month>07</month>
<year>2025</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright This work is authored by Tadbhagya Kumar, Anuj Kumar and Pinaki Pal, &#xa9; 2025 UChicago Argonne, LLC.</copyright-statement>
<copyright-year>2025</copyright-year>
<copyright-holder>This work is authored by Tadbhagya Kumar, Anuj Kumar and Pinaki Pal, &#xa9; 2025 UChicago Argonne, LLC</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>Predictive computational fluid dynamics (CFD) simulations of reacting flows in energy conversion systems are accompanied by a major computational bottleneck of solving a stiff system of coupled ordinary differential equations (ODEs) associated with detailed fuel chemistry. This issue is exacerbated with the complexity of fuel chemistry as the number of reactive scalars and chemical reactions increases. In this work, a physics-constrained Autoencoder (AE)-NeuralODE framework, termed as Phy-ChemNODE, is developed for data-driven modeling and temporal emulation of stiff chemical kinetics for complex hydrocarbon fuels, wherein a non-linear AE is employed for dimensionality reduction of the thermochemical state and the NODE learns temporal dynamics of the system in the low-dimensional latent space obtained from the AE. Both the AE and NODE are trained together in an end-to-end manner. We further enhance the approach by incorporating elemental mass conservation constraints directly into the loss function during model training. This ensures that total mass as well as individual elemental species masses are conserved in an <italic>a-posteriori</italic> manner. Demonstration studies are performed for methane combustion kinetics (32 species, 266 chemical reactions) over a wide thermodynamic and composition space at high pressure. Effects of various model hyperparameters, such as relative weighting of different terms in the loss function and dimensionality of the AE latent space, on the accuracy of Phy-ChemNODE are assessed. The physics-based constraints are shown to improve both training efficiency and physical consistency of the data-driven model. 
Further, <italic>a-posteriori</italic> autoregressive inference tests demonstrate that Phy-ChemNODE leads to reduced temporal stiffness in the latent space, and achieves 1-3 orders of magnitude speedup relative to the detailed kinetic mechanism depending on the type of ODE solver (implicit or explicit) used for numerical integration, while ensuring prediction fidelity.</p>
</abstract>
<kwd-group>
<kwd>stiff chemical kinetics</kwd>
<kwd>fuel combustion</kwd>
<kwd>autoencoders</kwd>
<kwd>neural ordinary differential equations</kwd>
<kwd>physics-informed neural network</kwd>
<kwd>chemistry acceleration</kwd>
<kwd>computational fluid dynamics</kwd>
<kwd>numerical modeling</kwd>
</kwd-group>
<contract-sponsor id="cn001">Office of Fossil Energy and Carbon Management<named-content content-type="fundref-id">10.13039/100020312</named-content>
</contract-sponsor>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Heat Engines</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec id="s1">
<title>1 Introduction</title>
<p>Computational fluid dynamics (CFD) modeling of reacting flows, such as those encountered in gas turbine and internal combustion engines, is computationally demanding owing to the complex interactions among multiple physico-chemical phenomena and the need to resolve a wide range of spatiotemporal scales governing the evolution of a large number of reactive scalars (chemical species). In particular, modeling of detailed chemical kinetics presents a major bottleneck, which is governed by a stiff system of coupled ordinary differential equations (ODEs) and characterized by high condition number of the corresponding chemical Jacobian matrix (<xref ref-type="bibr" rid="B27">Shampine, 1993</xref>). In addition, as the complexity of fuel chemistry increases, the dimensionality and stiffness of the ODE system also become more prohibitive (<xref ref-type="bibr" rid="B18">Lu and Law, 2009</xref>). To address these computational challenges, kinetic model reduction is typically performed (<xref ref-type="bibr" rid="B17">Lu and Law, 2005</xref>; <xref ref-type="bibr" rid="B10">Jones and Rigopoulos, 2005</xref>; <xref ref-type="bibr" rid="B30">Valorani et al., 2006</xref>; <xref ref-type="bibr" rid="B19">Maas and Pope, 1992</xref>), but it often leads to less reliable description of chemical kinetics.</p>
<p>In this context, data-driven machine learning (ML) techniques (<xref ref-type="bibr" rid="B6">Dana, 1987</xref>; <xref ref-type="bibr" rid="B34">Wold et al., 1987</xref>) have been extensively explored in an effort to emulate chemical kinetics and accelerate detailed finite-rate chemistry computations. Some of these approaches reduce the dimensionality of the reaction system via linear projection onto an appropriate basis typically identified through Principal Component Analysis (PCA). Artificial neural networks (ANNs) are then employed for regression of the PC source terms and transport coefficients (<xref ref-type="bibr" rid="B24">Owoyele et al., 2017</xref>; <xref ref-type="bibr" rid="B13">Kumar et al., 2023</xref>). On the other hand, ANN-based approaches have also been used for predicting chemical source terms directly from the thermochemical state (<xref ref-type="bibr" rid="B5">Christo et al., 1996</xref>; <xref ref-type="bibr" rid="B3">Blasco et al., 1998</xref>; <xref ref-type="bibr" rid="B2">Sen et al., 2010</xref>; <xref ref-type="bibr" rid="B26">Ranade et al., 2019</xref>; <xref ref-type="bibr" rid="B32">Wan et al., 2020</xref>). More recently, novel deep learning architectures have been explored to capture the temporal evolution of chemical kinetics and combustion simulations, such as Deep Operator Network (DeepONet) (<xref ref-type="bibr" rid="B12">Kumar and Echekki, 2024</xref>) and Fourier Neural Operator (FNO) (<xref ref-type="bibr" rid="B35">Zhang et al., 2024</xref>).</p>
<p>Despite the potential benefits of employing ML methods, when they are trained in an <italic>a-priori</italic> or offline setting and then coupled with a numerical solver, predicted solutions may diverge or become unstable. Due to the non-linearity of the combustion process, even minor predictive errors can escalate into significant discrepancies in the temporal evolution of thermochemical state due to error accumulation. To handle this issue, an alternative and more robust data-driven technique for chemical source term computations based on neural ordinary differential equations (NeuralODEs, NODEs) (<xref ref-type="bibr" rid="B4">Chen et al., 2018</xref>) was developed by the authors, known as ChemNODE (<xref ref-type="bibr" rid="B23">Owoyele and Pal, 2021</xref>). It combines the chemical source term predictions with ODE integration in an <italic>a-posteriori</italic> learning paradigm, where the source terms predicted by the neural network are passed to the ODE solver for time integration, and the neural network weights are optimized to minimize the loss computed between the predicted and ground truth thermochemical states (comprised of species mass fractions and thermodynamic variables). A key advantage of this approach is that NODEs learn continuous-time dynamics which can be integrated using existing ODE solvers. This ensures that the predicted thermochemical state, even after a long-time horizon, remains adherent to the ground truth solution trajectory. For relatively large chemical kinetic mechanisms, the coupling of a non-linear autoencoder (AE) to perform dimensionality reduction and a NODE to evolve the dynamics in the lower-dimensional latent space has shown promise (<xref ref-type="bibr" rid="B31">Vijayarangan et al., 2024</xref>). A similar approach has also been pursued in the field of astro-chemistry (<xref ref-type="bibr" rid="B29">Sun Tang and Turk, 2022</xref>; <xref ref-type="bibr" rid="B20">Maes et al., 2024</xref>).</p>
<p>Another major limitation of traditional black-box ML techniques applied to the modeling of chemical kinetics is that they do not inherently incorporate conservation laws, which can adversely impact simulation accuracy and hinder the integration of ML surrogate models with multidimensional CFD solvers. With the advent of scientific machine learning, the practice of combining domain science-specific constraints to embed physics into neural network training has emerged lately (<xref ref-type="bibr" rid="B25">Raissi et al., 2019</xref>). This is achieved through regularization of the loss function, in which physical laws are embedded in the learning process as soft constraints (<xref ref-type="bibr" rid="B14">Kumar Tadbhagya et al., 2023</xref>; <xref ref-type="bibr" rid="B1">Almeldein and Van Dam, 2023</xref>; <xref ref-type="bibr" rid="B16">Kumar et al., 2025</xref>; <xref ref-type="bibr" rid="B33">Weng et al., 2025</xref>; <xref ref-type="bibr" rid="B11">Kercher and Votsmeier, 2025</xref>). This has been shown to improve model accuracy and generalization, especially in scenarios where traditional methods struggle with noisy data and high-dimensional problems governed by parameterized differential equations. Another way to enforce conservation laws is by enforcing a hard constraint in the neural network architecture through constraint layers (<xref ref-type="bibr" rid="B28">Sturm and Wexler, 2022</xref>; <xref ref-type="bibr" rid="B21">Mohan et al., 2023</xref>).</p>
<p>In light of the above discussion, the overarching goal of the present work is to demonstrate an end-to-end physics-constrained AE-NODE framework called Phy-ChemNODE for data-driven modeling and temporal emulation of stiff chemical kinetics for hydrocarbon fuels. Unlike previous work (<xref ref-type="bibr" rid="B31">Vijayarangan et al., 2024</xref>; <xref ref-type="bibr" rid="B29">Sun Tang and Turk, 2022</xref>; <xref ref-type="bibr" rid="B20">Maes et al., 2024</xref>), the present study demonstrates that this integrated approach improves physical consistency of the resulting data-driven model with mass conservation laws and accelerates training convergence, while simultaneously reducing temporal stiffness of the chemical kinetic system. The remainder of the paper is organized as follows: the Phy-ChemNODE framework is first outlined along with details of the physics-constrained formulation and training methodology in <xref ref-type="sec" rid="s2">Sections 2</xref>, <xref ref-type="sec" rid="s3">3</xref>, respectively. Subsequently, results from <italic>a-posteriori</italic> proof-of-concept studies are discussed in <xref ref-type="sec" rid="s4">Section 4</xref>. Finally, the Conclusions section summarizes the major findings and directions for future work.</p>
</sec>
<sec id="s2">
<title>2 Physics-constrained autoencoder-NODE (Phy-ChemNODE) framework for stiff chemical kinetics emulation</title>
<p>In combustion CFD simulations, it is a common numerical approach to decouple finite-rate chemistry from transport using operator splitting. Chemistry is solved (independently from advective and diffusive transport) within each computational grid cell considered as a homogeneous reactor, which is equivalent to solving a system of stiff ODEs. The temporal evolution of <inline-formula id="inf1">
<mml:math id="m1">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>s</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> reactive scalars (chemical species) can be defined by:<disp-formula id="e1">
<mml:math id="m2">
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>Y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3c9;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>&#x307;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mi>&#x3c1;</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mo>,</mml:mo>
<mml:mspace width="1em"/>
<mml:mi>k</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1,2,3</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>s</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
<label>(1)</label>
</disp-formula>where <inline-formula id="inf2">
<mml:math id="m3">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>Y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the mass fraction of <inline-formula id="inf3">
<mml:math id="m4">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> species (<inline-formula id="inf4">
<mml:math id="m5">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>s</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> being the total number of species), <inline-formula id="inf5">
<mml:math id="m6">
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3c9;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>&#x307;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> is the corresponding chemical source term computed using law of mass action, and <inline-formula id="inf6">
<mml:math id="m7">
<mml:mrow>
<mml:mi>&#x3c1;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> refers to mixture density. The temporal evolution of temperature (<italic>T</italic>) is governed by a similar ODE defined as follows:<disp-formula id="e2">
<mml:math id="m8">
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mo>&#x3d;</mml:mo>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mo>&#x2211;</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>s</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi>h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3c9;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>&#x307;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mi>&#x3c1;</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">c</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>p</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfrac>
<mml:mo>,</mml:mo>
<mml:mspace width="1em"/>
<mml:mi>k</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1,2,3</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>s</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
<label>(2)</label>
</disp-formula>where <inline-formula id="inf7">
<mml:math id="m9">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf8">
<mml:math id="m10">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">c</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>p</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> refer to the enthalpy of <inline-formula id="inf9">
<mml:math id="m11">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> species and mixture-averaged constant pressure specific heat, respectively. To calculate the source terms, one needs to account for several elementary reactions involving production and consumption of multiple species. As the chemical mechanism becomes larger, the number of chemical species and elementary reactions also increase (<xref ref-type="bibr" rid="B18">Lu and Law, 2009</xref>). This leads to prohibitive computational costs since all chemical time scales must be fully resolved. In the NODE-based data-driven framework (<xref ref-type="bibr" rid="B23">Owoyele and Pal, 2021</xref>), the expensive physics-based computation of chemical source terms is replaced by a neural network, which can be described as <xref ref-type="disp-formula" rid="e3">Equation 3</xref> below:<disp-formula id="e3">
<mml:math id="m12">
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi mathvariant="normal">&#x3a6;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>f</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi mathvariant="normal">&#x3a6;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>;</mml:mo>
<mml:mi mathvariant="normal">&#x398;</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:math>
<label>(3)</label>
</disp-formula>where <inline-formula id="inf10">
<mml:math id="m13">
<mml:mrow>
<mml:mi mathvariant="normal">&#x3a6;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>Y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>Y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>Y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:mi>s</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> is the vector of thermochemical state (temperature and species mass fractions), and <inline-formula id="inf11">
<mml:math id="m14">
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="normal">&#x3a6;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>;</mml:mo>
<mml:mi mathvariant="normal">&#x398;</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> is a feedforward neural network parameterized by weights <inline-formula id="inf12">
<mml:math id="m15">
<mml:mrow>
<mml:mi mathvariant="normal">&#x398;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>. For larger chemical mechanisms, <inline-formula id="inf13">
<mml:math id="m16">
<mml:mrow>
<mml:mi mathvariant="normal">&#x3a6;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> increases in dimensionality and stiffness. To address the high dimensionality, a non-linear AE is coupled with the NODE for dimensionality reduction, so that the NODE learns the temporal evolution of the dynamical system in a reduced-order latent space obtained from the non-linear projection of the AE. A schematic of the coupled AE-NODE data-driven modeling framework is shown in <xref ref-type="fig" rid="F1">Figure 1</xref>. The model training process is posed as an optimization problem of determining the optimal parameters of the encoder <inline-formula id="inf14">
<mml:math id="m17">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>&#x3c6;</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>, NODE <inline-formula id="inf15">
<mml:math id="m18">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>z</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>, and the decoder <inline-formula id="inf16">
<mml:math id="m19">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>&#x3c8;</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> networks in an end-to-end manner, that minimizes the loss function defined as:<disp-formula id="e4">
<mml:math id="m20">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">Phy&#x2212;ChemNODE</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">rec</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">rec</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">data</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>z</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>z</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mrow>
<mml:mo>&#x2211;</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>e</mml:mi>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:munderover>
</mml:mstyle>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>e</mml:mi>
<mml:mi>l</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>e</mml:mi>
<mml:mi>l</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
<label>(4)</label>
</disp-formula>where the reconstruction loss <inline-formula id="inf17">
<mml:math id="m21">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">rec</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mtext>L</mml:mtext>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="normal">&#x3a6;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="normal">&#x3a6;</mml:mi>
</mml:mrow>
<mml:mo>&#x303;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> measures the loss between ground truth <inline-formula id="inf18">
<mml:math id="m22">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="normal">&#x3a6;</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> and corresponding encoder-decoder mapping (<inline-formula id="inf19">
<mml:math id="m23">
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="normal">&#x3a6;</mml:mi>
</mml:mrow>
<mml:mo>&#x303;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>&#x3c8;</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>&#x3c6;</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="normal">&#x3a6;</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, the data loss <inline-formula id="inf20">
<mml:math id="m24">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">data</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mtext>L</mml:mtext>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="normal">&#x3a6;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="normal">&#x3a6;</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">&#x302;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> measures the loss between ground truth and encoder &#x2b; NODE &#x2b; decoder prediction <inline-formula id="inf21">
<mml:math id="m25">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="normal">&#x3a6;</mml:mi>
</mml:mrow>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>, and the latent loss <inline-formula id="inf22">
<mml:math id="m26">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>z</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>L</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi>z</mml:mi>
</mml:mrow>
<mml:mo>&#x304;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mi>z</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> measures the loss between encoder mapping of ground truth (<inline-formula id="inf23">
<mml:math id="m27">
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi>z</mml:mi>
</mml:mrow>
<mml:mo>&#x304;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> &#x3d; <inline-formula id="inf24">
<mml:math id="m28">
<mml:mrow>
<mml:mi>&#x3c6;</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="normal">&#x3a6;</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>) and encoder &#x2b; NODE prediction. Each loss term is chosen to be in mean absolute error (MAE) form. It is noted that the loss terms <inline-formula id="inf25">
<mml:math id="m29">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">rec</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf26">
<mml:math id="m30">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>z</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> ensure that the encoder and decoder mappings are bijective or unique. The loss function also contains elemental mass conservation constraints (<xref ref-type="bibr" rid="B14">Kumar Tadbhagya et al., 2023</xref>; <xref ref-type="bibr" rid="B16">Kumar et al., 2025</xref>), defined in <xref ref-type="disp-formula" rid="e5">Equation 5</xref> below:<disp-formula id="e5">
<mml:math id="m31">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>e</mml:mi>
<mml:mi>l</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mrow>
<mml:mo>&#x2211;</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:munderover>
</mml:mstyle>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mrow>
<mml:mo>&#x2211;</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>s</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:munderover>
</mml:mstyle>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>j</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">&#x7c;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>Y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi>Y</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">&#x302;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">&#x7c;</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
<label>(5)</label>
</disp-formula>where <inline-formula id="inf27">
<mml:math id="m32">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>e</mml:mi>
<mml:mi>l</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> refers to the loss associated with mass conservation of element <inline-formula id="inf28">
<mml:math id="m33">
<mml:mrow>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> (in the chemical system with a total of <inline-formula id="inf29">
<mml:math id="m34">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>e</mml:mi>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> elements). <inline-formula id="inf30">
<mml:math id="m35">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi>Y</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">&#x302;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf31">
<mml:math id="m36">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>Y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> correspond to the AE &#x2b; NODE predicted and ground truth mass fractions of <inline-formula id="inf32">
<mml:math id="m37">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> species, respectively. <inline-formula id="inf33">
<mml:math id="m38">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the atomic mass of element <inline-formula id="inf34">
<mml:math id="m39">
<mml:mrow>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf35">
<mml:math id="m40">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>j</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> is the number of atoms of element <inline-formula id="inf36">
<mml:math id="m41">
<mml:mrow>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> in <inline-formula id="inf37">
<mml:math id="m42">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> species, <inline-formula id="inf38">
<mml:math id="m43">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the molecular weight of <inline-formula id="inf39">
<mml:math id="m44">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> species, and <inline-formula id="inf40">
<mml:math id="m45">
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the number of training data points. Lastly, the weights <inline-formula id="inf41">
<mml:math id="m46">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">rec</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf42">
<mml:math id="m47">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>z</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, and <inline-formula id="inf43">
<mml:math id="m48">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>e</mml:mi>
<mml:mi>l</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> in <xref ref-type="disp-formula" rid="e4">Equation 4</xref> balance the contributions from the different loss terms.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Schematic of the coupled AE-NODE framework.</p>
</caption>
<graphic xlink:href="fther-05-1594443-g001.tif">
<alt-text content-type="machine-generated">Diagram illustrating a Neural ODE model. It shows data entering from &#x22;Physical Space&#x22; at time \( t = t^n \) into an Encoder \(\varphi(\Phi)\), transitioning into &#x22;Latent Space&#x22; represented by neural networks \( h(z) \), and then passing through a Decoder \(\psi(z)\) to reach &#x22;Physical Space&#x22; at time \( t = t^{n+1} \). Equations for transformation are depicted.</alt-text>
</graphic>
</fig>
</sec>
<sec id="s3">
<title>3 Proof-of-concept study</title>
<p>For proof-of-concept demonstration of Phy-ChemNODE, an autoigniting methane-oxygen (<inline-formula id="inf44">
<mml:math id="m49">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>CH</mml:mtext>
</mml:mrow>
<mml:mrow>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>-<inline-formula id="inf45">
<mml:math id="m50">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>O</mml:mtext>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>) zero-dimensional (0D) homogeneous reactor is considered, at a constant pressure of 20&#xa0;atm that corresponds to practical high-pressure isobaric operating conditions for stationary gas turbine engines. The detailed chemical mechanism (<xref ref-type="bibr" rid="B8">Gregory et al., 2016</xref>) consists of 32 species and 266 chemical reactions. The ground truth data for model training is generated using Cantera (<xref ref-type="bibr" rid="B7">Goodwin et al., 2009</xref>), which solves the coupled ODE system (<xref ref-type="disp-formula" rid="e1">Equations 1</xref>, <xref ref-type="disp-formula" rid="e2">2</xref>) with detailed chemistry. The thermodynamic and composition space chosen for data generation comprises 9 equispaced initial temperatures in the range <inline-formula id="inf46">
<mml:math id="m51">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> &#x3d; [1600&#xa0;K, 2000&#xa0;K] and 11 equivalence ratios within <inline-formula id="inf47">
<mml:math id="m52">
<mml:mrow>
<mml:mi>&#x3d5;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:mrow>
<mml:mn>1.0</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>1.5</mml:mn>
</mml:mrow>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> resulting in a total of 99 initial conditions. Each of these initial conditions is integrated to chemical equilibrium. Ground truth data is generated by selecting time instants such that the change in any of the thermochemical scalars (species mass fractions or temperature) between two successive time instants is greater than 1% of their corresponding overall ranges of variation, and then downsampling 200 points from the selected time instants. A <inline-formula id="inf48">
<mml:math id="m53">
<mml:mrow>
<mml:mn>70</mml:mn>
<mml:mi>%</mml:mi>
<mml:mo>,</mml:mo>
<mml:mn>20</mml:mn>
<mml:mi>%</mml:mi>
<mml:mo>,</mml:mo>
<mml:mn>10</mml:mn>
<mml:mi>%</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> random split (based on initial conditions) is used to obtain the training, validation, and test datasets, respectively. The AE-NODE model is initialized with the same initial conditions (during training) as the physics-based simulations. The input to the encoder is the vector <inline-formula id="inf49">
<mml:math id="m54">
<mml:mrow>
<mml:mi mathvariant="normal">&#x3a6;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> containing the temperature and species mass fractions, which is scaled using the maximum and minimum of the training data, and the output is a vector in the latent space <inline-formula id="inf52">
<mml:math id="m57">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>z</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>. The decoder has the same dense architecture as the encoder, with an input size equal to the latent dimension <inline-formula id="inf53">
<mml:math id="m58">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>m</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>z</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>4</mml:mn>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> (chosen based on a sensitivity study discussed in <xref ref-type="sec" rid="s4">Section 4</xref>) and the output size equal to the physical space vector <inline-formula id="inf54">
<mml:math id="m59">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>m</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="normal">&#x3a6;</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>33</mml:mn>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>. Both the encoder and decoder have 5 hidden layers with 64 neurons each and Exponential Linear Unit (ELU) activation function. The NODE has the same input and output dimensions as the latent space <inline-formula id="inf55">
<mml:math id="m60">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>m</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>z</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>4</mml:mn>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>, and a 4 hidden-layer dense network with 64 neurons in each hidden layer and ELU activation function, to model the chemical source terms in the lower-dimensional latent space. The output layers for the encoder, NODE, and the decoder are considered to be linear. The above discussed AE-NODE architecture was finalized based on multiple ablation studies, targeting a good balance between dimensionality reduction and overall model accuracy. Although more exhaustive hyperparameter optimization can be performed to further optimize the number of latent variables, it is out of scope of the current work.</p>
<p>The forward pass through the NODE requires time integration, for which a 4th order explicit Runge-Kutta (RK) solver is used. Once the time integration is completed, the temporal trajectories are mapped back to physical space and the loss is computed using <xref ref-type="disp-formula" rid="e4">Equation 4</xref>. The gradients for updating the neural network parameters are calculated using backward adjoint automatic differentiation, and the ADAM optimizer with exponential learning rate decay (every 200 epochs) is used. The model is trained for 10000 epochs. To ensure that all the loss terms are of similar magnitude, <inline-formula id="inf56">
<mml:math id="m61">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">rec</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>5.0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> is used. Further, <inline-formula id="inf57">
<mml:math id="m62">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>z</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0.05</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf58">
<mml:math id="m63">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>e</mml:mi>
<mml:mi>l</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>H</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>e</mml:mi>
<mml:mi>l</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>C</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>e</mml:mi>
<mml:mi>l</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>O</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0.5</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> are chosen based on a hyperparameter sweep (discussed in <xref ref-type="sec" rid="s4">Section 4</xref>). The training framework was implemented in Julia programming language using Flux.jl (<xref ref-type="bibr" rid="B9">Innes et al., 2018</xref>) library and the model was trained on 2 AMD EPYC 7713 64-core processors for a walltime of 96&#xa0;h.</p>
</sec>
<sec sec-type="results|discussion" id="s4">
<title>4 Results and discussion</title>
<p>To determine the optimal weighting of terms in the loss function and the latent space dimensionality, hyperparameter studies were carried out. <xref ref-type="fig" rid="F2">Figure 2a</xref> compares the loss terms computed on the validation set (post training) corresponding to different <inline-formula id="inf59">
<mml:math id="m64">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>z</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> values for <inline-formula id="inf60">
<mml:math id="m65">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">rec</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>5</mml:mn>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>e</mml:mi>
<mml:mi>l</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0.5</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, and fixed size of the latent space (dim<inline-formula id="inf61">
<mml:math id="m66">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>z</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>4</mml:mn>
</mml:math>
</inline-formula>), and <xref ref-type="fig" rid="F2">Figure 2b</xref> shows a similar comparison for varying size of the latent space (dim<inline-formula id="inf62">
<mml:math id="m67">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>z</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>) with <inline-formula id="inf63">
<mml:math id="m68">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">rec</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>5</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf64">
<mml:math id="m69">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>z</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0.05</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, and <inline-formula id="inf65">
<mml:math id="m70">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>e</mml:mi>
<mml:mi>l</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0.5</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. Moreover, to assess the impact of adding elemental mass conservation constraints to the training loss function, <xref ref-type="fig" rid="F2">Figure 2c</xref> compares the decay of data loss (during training) between the cases trained with <inline-formula id="inf66">
<mml:math id="m71">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>e</mml:mi>
<mml:mi>l</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0.5</mml:mn>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> and without elemental mass constraints <inline-formula id="inf67">
<mml:math id="m72">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>e</mml:mi>
<mml:mi>l</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> on the validation set. It can be clearly seen that incorporating the soft constraints in the loss function results in lower loss for the same number of epochs, thereby enabling more efficient model training and faster training convergence.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Comparison of loss terms (computed on validation set) across hyperparameter experiments: <bold>(a)</bold> varying <inline-formula id="inf68">
<mml:math id="m73">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>z</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, <bold>(b)</bold> varying latent space size <inline-formula id="inf69">
<mml:math id="m74">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>m</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>z</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>, and <bold>(c)</bold> data loss <inline-formula id="inf70">
<mml:math id="m75">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">data</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> evolution with <inline-formula id="inf71">
<mml:math id="m76">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>e</mml:mi>
<mml:mi>l</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0.5</mml:mn>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> and without <inline-formula id="inf72">
<mml:math id="m77">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>e</mml:mi>
<mml:mi>l</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> elemental mass constraints.</p>
</caption>
<graphic xlink:href="fther-05-1594443-g002.tif">
<alt-text content-type="machine-generated">Three line graphs show different loss terms. Graph (a) depicts loss terms \(L_{rec}\), \(L_{data}\), and \(L_{latent}\) against \(\lambda_z\). Graph (b) shows \(10 \times L_{rec}\), \(L_{data}\), and \(L_{latent}\) against \(dim(z)\). Graph (c) illustrates \(L_{data}\) over epochs with \(\lambda_{el-j} = 0\) and \(\lambda_{el-j} = 0.5\).</alt-text>
</graphic>
</fig>
<p>The trained Phy-ChemNODE model is then deployed for predicting the temporal evolution of thermochemical scalars in <italic>a-posteriori</italic> autoregressive tests. <xref ref-type="fig" rid="F3">Figure 3</xref> plots the temporal evolution of temperature and a subset of species mass fractions (<inline-formula id="inf74">
<mml:math id="m79">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>CH</mml:mtext>
</mml:mrow>
<mml:mrow>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, CO, <inline-formula id="inf75">
<mml:math id="m80">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>CO</mml:mtext>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, OH, and <inline-formula id="inf76">
<mml:math id="m81">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>O</mml:mtext>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>) for an initial condition (corresponding to <inline-formula id="inf77">
<mml:math id="m82">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1600</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>K) in the training set <inline-formula id="inf78">
<mml:math id="m83">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>&#x3d5;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1.0</mml:mn>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> and test set <inline-formula id="inf79">
<mml:math id="m84">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>&#x3d5;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1.1</mml:mn>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>, where ground truth data is indicated by solid lines and the predicted Phy-ChemNODE solutions are shown in markers. Overall, great agreement can be observed between the predictions and ground truth data. <xref ref-type="fig" rid="F4">Figure 4</xref> shows the temporal evolution of a few intermediate species for another set of initial conditions in the training set (<inline-formula id="inf80">
<mml:math id="m85">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1650</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>K, <inline-formula id="inf81">
<mml:math id="m86">
<mml:mrow>
<mml:mi>&#x3d5;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1.0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>) and the test set (<inline-formula id="inf82">
<mml:math id="m87">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1700</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>K<inline-formula id="inf83">
<mml:math id="m88">
<mml:mrow>
<mml:mo>,</mml:mo>
<mml:mi>&#x3d5;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1.05</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>), again demonstrating high accuracy. As further quantification of the accuracy of Phy-ChemNODE framework, <xref ref-type="fig" rid="F5">Figure 5</xref> shows the test set MAEs for a subset of the thermochemical scalars including both major and minor species, scaled by their corresponding data ranges, indicating that Phy-ChemNODE performs extremely well in capturing the temporal dynamics.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Temporal evolution of temperature <inline-formula id="inf84">
<mml:math id="m89">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> and mass fractions of <inline-formula id="inf85">
<mml:math id="m90">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>CH</mml:mtext>
</mml:mrow>
<mml:mrow>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, CO, <inline-formula id="inf86">
<mml:math id="m91">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>CO</mml:mtext>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, OH, and <inline-formula id="inf87">
<mml:math id="m92">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>O</mml:mtext>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> corresponding to initial conditions in <bold>(a)</bold> training set (<inline-formula id="inf88">
<mml:math id="m93">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1600</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>K, <inline-formula id="inf89">
<mml:math id="m94">
<mml:mrow>
<mml:mi>&#x3d5;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1.0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>) and <bold>(b)</bold> test set (<inline-formula id="inf90">
<mml:math id="m95">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1600</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>K, <inline-formula id="inf91">
<mml:math id="m96">
<mml:mrow>
<mml:mi>&#x3d5;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1.1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>). The mass fractions of <inline-formula id="inf92">
<mml:math id="m97">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>CH</mml:mtext>
</mml:mrow>
<mml:mrow>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf93">
<mml:math id="m98">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>CO</mml:mtext>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and OH are scaled by 4, and that of CO by 3 for ease of plotting. Solid lines denote ground truth and markers denote Phy-ChemNODE predictions.</p>
</caption>
<graphic xlink:href="fther-05-1594443-g003.tif">
<alt-text content-type="machine-generated">Two line graphs labeled (a) and (b) show molecule fractions and temperature over time in milliseconds. Graphs track O2, OH, CO, CO2, CH4, and temperature (T) with distinct colors. Both graphs display similar trends, with sharp changes around 0.01 milliseconds. Temperature is measured on the right y-axis, reaching approximately 3500 Kelvin, while molecule fractions are plotted on the left, ranging from 0 to 0.8.</alt-text>
</graphic>
</fig>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>Temporal evolution of intermediate species (<inline-formula id="inf94">
<mml:math id="m99">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>CH</mml:mtext>
</mml:mrow>
<mml:mrow>
<mml:mn>3</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf95">
<mml:math id="m100">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>CH</mml:mtext>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>O and <inline-formula id="inf96">
<mml:math id="m101">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>C</mml:mtext>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mtext>H</mml:mtext>
</mml:mrow>
<mml:mrow>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>) corresponding to initial conditions in <bold>(a)</bold> training set <inline-formula id="inf97">
<mml:math id="m102">
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1650</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>K, <inline-formula id="inf98">
<mml:math id="m103">
<mml:mrow>
<mml:mi>&#x3d5;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1.0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>) and <bold>(b)</bold> test set <inline-formula id="inf99">
<mml:math id="m104">
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1700</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>K, <inline-formula id="inf100">
<mml:math id="m105">
<mml:mrow>
<mml:mi>&#x3d5;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1.05</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>). Solid lines denote ground truth and markers denote Phy-ChemNODE predictions.</p>
</caption>
<graphic xlink:href="fther-05-1594443-g004.tif">
<alt-text content-type="machine-generated">Two line graphs (a) and (b) show the concentration of CH3, CH2O, and C2H4 over time in milliseconds on the x-axis, with Y_i on the y-axis. Both graphs have similar trend lines with CH3 in red, CH2O in green, and C2H4 in brown. The lines increase sharply, peak, and drop quickly.</alt-text>
</graphic>
</fig>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>Scaled test set MAEs for prediction of temperature <inline-formula id="inf101">
<mml:math id="m106">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> and a few species mass fractions.</p>
</caption>
<graphic xlink:href="fther-05-1594443-g005.tif">
<alt-text content-type="machine-generated">Bar chart displaying the normalized error ratio \(|y - \hat{y}| / (y_{max} - y_{min})\) for various substances: \(T\), \(H_2\), \(H\), \(O\), \(O_2\), \(OH\), \(H_2O\), \(HO_2\), \(H_2O_2\), \(CO\), \(CO_2\), \(C\), \(CH\), \(CH_2\), \(CH_3\), and \(CH_4\). Heights vary, with prominent peaks for \(H\) and \(H_2O_2\).</alt-text>
</graphic>
</fig>
<p>Based on inference on an Intel i7-1165G7 workstation with 16 cores, Phy-ChemNODE yields speedups of 6x and 860x over the full chemical mechanism in terms of overall simulation walltime, when deployed with implicit (backward differentiation formula (BDF)) and explicit (RK45) solvers, respectively. <xref ref-type="fig" rid="F6">Figure 6</xref> plots the latent space temporal dynamics corresponding to the initial conditions of <inline-formula id="inf102">
<mml:math id="m107">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1650</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>K, <inline-formula id="inf103">
<mml:math id="m108">
<mml:mrow>
<mml:mi>&#x3d5;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1.0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. It is evident that the evolution of latent variables is much smoother than that of thermochemical scalars in the physical space earlier shown in <xref ref-type="fig" rid="F4">Figure 4a</xref>. This indicates that the coupled AE-NODE approach significantly reduces the temporal stiffness of the chemical kinetic system. Similar stiffness reduction in the latent space was also observed by <xref ref-type="bibr" rid="B22">Nair et al. (2025)</xref> in case of AE-NODE models of advection-dominated dynamical systems. Lastly, <xref ref-type="fig" rid="F7">Figure 7</xref> shows the predicted temporal evolution of C, H, and O mass fractions from autoregressive tests corresponding to certain initial conditions from the training and test sets. Evidently, the model trained with elemental mass constraints in the loss function (Phy-ChemNODE/PCNODE) conserves the elemental mass fractions during deployment much better than the MAE-trained case without constraints.</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>Temporal evolution of latent space variables corresponding to <inline-formula id="inf104">
<mml:math id="m109">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1650</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>K, <inline-formula id="inf105">
<mml:math id="m110">
<mml:mrow>
<mml:mi>&#x3d5;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1.0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
</caption>
<graphic xlink:href="fther-05-1594443-g006.tif">
<alt-text content-type="machine-generated">Line graph showing latent variables over time in milliseconds. Latent-1 (blue) decreases steadily, Latent-2 (black) quickly rises then stabilizes, Latent-3 (red) peaks then drops to zero, and Latent-4 (green) remains stable.</alt-text>
</graphic>
</fig>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption>
<p>Temporal evolution of mass fractions of: <bold>(a)</bold> C, <bold>(b)</bold> H, and <bold>(c)</bold> O elements corresponding to initial conditions in the training (<inline-formula id="inf106">
<mml:math id="m111">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1600</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>K, <inline-formula id="inf107">
<mml:math id="m112">
<mml:mrow>
<mml:mi>&#x3d5;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1.0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>) and test (<inline-formula id="inf108">
<mml:math id="m113">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1600</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>K, <inline-formula id="inf109">
<mml:math id="m114">
<mml:mrow>
<mml:mi>&#x3d5;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1.1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>) sets.</p>
</caption>
<graphic xlink:href="fther-05-1594443-g007.tif">
<alt-text content-type="machine-generated">Three graphs show elemental mass fractions over time in milliseconds for carbon (C), hydrogen (H), and oxygen (O). Each graph compares Ground Truth (black line), MAE (blue circles), and PCNODE (red diamonds). The top graph (a) illustrates carbon with slight fluctuations; the middle graph (b) shows hydrogen with minimal variation; the bottom graph (c) depicts oxygen maintaining a stable pattern. Each graph has a different equivalence ratio (\(\phi\)) around 1.0 and 1.1.</alt-text>
</graphic>
</fig>
<p>In future studies, multiple avenues for further extension of the Phy-ChemNODE framework will be pursued. These include: (a) efficient scaling of the training workflow to wider ranges of thermodynamic conditions (including multiple pressures), larger kinetic mechanisms, and multicomponent fuels; (b) integration of Phy-ChemNODE with CFD solvers and demonstration of accelerated multidimensional reacting flow simulations on modern high-performance computing (HPC) platforms; (c) demonstration for constant-volume combustion; (d) incorporation of uncertainty quantification; and (e) exploration of training methodologies to enhance model out-of-distribution generalizability.</p>
</sec>
<sec sec-type="conclusion" id="s5">
<title>5 Conclusion</title>
<p>In this work, an end-to-end physics-constrained AE-NODE framework (Phy-ChemNODE) was introduced for accelerated temporal emulation of stiff chemical kinetics targeting complex hydrocarbon fuel combustion. The deep learning approach employed a non-linear AE for dimensionality reduction of the thermochemical state and utilized a NODE to learn temporal dynamics of the kinetic system in the low-dimensional latent space obtained from the AE. In addition, elemental mass conservation constraints were included in the loss function during training of the data-driven model to ensure that total mass and mass of each elemental species are conserved. Proof-of-concept studies were performed for homogeneous autoignition of a methane-oxygen mixture over a range of composition and thermodynamic conditions at high pressure. The results showed that the physics-based constraints not only improve physical consistency of the resulting data-driven model, but also enhance training efficiency. In addition, Phy-ChemNODE achieved 6&#x2013;860<inline-formula id="inf110">
<mml:math id="m115">
<mml:mrow>
<mml:mo>&#xd7;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> speedup relative to the detailed chemical mechanism depending on the type of ODE solver (implicit or explicit) used for numerical integration during autoregressive inference tests. Temporal evolution of the latent variables was visualized and it was found that the coupled AE-NODE approach leads to reduced temporal stiffness of the chemical kinetic system. In future work, Phy-ChemNODE will be further scaled to larger kinetic mechanisms and wider ranges of thermodynamic conditions (including pressure), and demonstrated for multidimensional combustion CFD simulations. In addition, uncertainty quantification techniques will be incorporated and training methodologies will be explored to enhance out-of-distribution generalizability.</p>
</sec>
</body>
<back>
<sec sec-type="data-availability" id="s6">
<title>Data availability statement</title>
<p>The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.</p>
</sec>
<sec sec-type="author-contributions" id="s7">
<title>Author contributions</title>
<p>TK: Data curation, Methodology, Conceptualization, Validation, Investigation, Software, Formal Analysis, Writing &#x2013; original draft. AK: Software, Investigation, Methodology, Writing &#x2013; original draft, Formal Analysis, Data curation. PP: Methodology, Supervision, Conceptualization, Investigation, Writing &#x2013; original draft, Resources, Funding acquisition, Project administration.</p>
</sec>
<sec sec-type="funding-information" id="s8">
<title>Funding</title>
<p>The author(s) declare that financial support was received for the research and/or publication of this article. The research work was funded by the U.S. DOE Fossil Energy and Carbon Management (FECM) office through the Technology Commercialization Fund (TCF) program.</p>
</sec>
<ack>
<p>The authors would like to acknowledge the computing core hours available through the Improv cluster provided by the Laboratory Computing Resource Center (LCRC) at Argonne National Laboratory. The contents of this manuscript have been presented at the Machine Learning and the Physical Sciences Workshop organized as part of the 38th conference on Neural Information Processing Systems (NeurIPS) in 2024 (<xref ref-type="bibr" rid="B15">Kumar et al., 2024</xref>).</p>
</ack>
<sec sec-type="COI-statement" id="s9">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
<p>The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.</p>
</sec>
<sec sec-type="ai-statement" id="s10">
<title>Generative AI statement</title>
<p>The author(s) declare that no Generative AI was used in the creation of this manuscript.</p>
</sec>
<sec sec-type="disclaimer" id="s11">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec sec-type="disclaimer" id="s12">
<title>Licenses and Permissions</title>
<p>The submitted manuscript has been created by UChicago Argonne, LLC, Operator of Argonne National Laboratory (Argonne). Argonne, a U.S. Department of Energy (DOE) Office of Science laboratory, is operated under Contract No. DEAC02-06CH11357. The U.S. Government retains for itself, and others acting on its behalf, a paid-up nonexclusive, irrevocable worldwide license in said article to reproduce, prepare derivative works, distribute copies to the public, and perform publicly and display publicly, by or on behalf of the Government.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Almeldein</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Van Dam</surname>
<given-names>N.</given-names>
</name>
</person-group> (<year>2023</year>). <article-title>Accelerating chemical kinetics calculations with physics informed neural networks</article-title>. <source>J. Eng. Gas Turbines Power</source> <volume>145</volume> (<issue>9</issue>), <fpage>091008</fpage>. <pub-id pub-id-type="doi">10.1115/1.4062654</pub-id>
</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Blasco</surname>
<given-names>J. A.</given-names>
</name>
<name>
<surname>Fueyo</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Dopazo</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Ballester</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>1998</year>). <article-title>Modelling the temporal evolution of a reduced combustion chemical system with an artificial neural network</article-title>. <source>Combust. Flame</source> <volume>113</volume> (<issue>1</issue>), <fpage>38</fpage>&#x2013;<lpage>52</lpage>. <pub-id pub-id-type="doi">10.1016/s0010-2180(97)00211-3</pub-id>
</citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>R. T. Q.</given-names>
</name>
<name>
<surname>Rubanova</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Bettencourt</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Duvenaud</surname>
<given-names>D. K.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Neural ordinary differential equations</article-title>. <source>Adv. neural Inf. Process. Syst.</source> <volume>31</volume>.</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Christo</surname>
<given-names>F. C.</given-names>
</name>
<name>
<surname>Masri</surname>
<given-names>A. R.</given-names>
</name>
<name>
<surname>Nebot</surname>
<given-names>E. M.</given-names>
</name>
<name>
<surname>Pope</surname>
<given-names>S. B.</given-names>
</name>
</person-group> (<year>1996</year>). <article-title>An integrated PDF/neural network approach for simulating turbulent reacting systems</article-title>. <source>Symposium Int. Combust.</source> <volume>26</volume> (<issue>1</issue>), <fpage>43</fpage>&#x2013;<lpage>48</lpage>. <pub-id pub-id-type="doi">10.1016/s0082-0784(96)80198-6</pub-id>
</citation>
</ref>
<ref id="B6">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Dana</surname>
<given-names>H. B.</given-names>
</name>
</person-group> (<year>1987</year>). &#x201c;<article-title>Modular learning in neural networks</article-title>,&#x201d; in <source>
<italic>Proceedings of the sixth national conference on artificial intelligence - volume 1</italic>, AAAI&#x2019;87</source> (<publisher-name>AAAI Press</publisher-name>), <fpage>279</fpage>&#x2013;<lpage>284</lpage>.</citation>
</ref>
<ref id="B7">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Goodwin</surname>
<given-names>D. G.</given-names>
</name>
<name>
<surname>Moffat</surname>
<given-names>H. K.</given-names>
</name>
<name>
<surname>Speth</surname>
<given-names>R. L.</given-names>
</name>
</person-group> (<year>2009</year>). <source>Cantera: an object-oriented software toolkit for chemical kinetics, thermodynamics, and transport processes</source>, <volume>124</volume>. <publisher-loc>Pasadena, CA</publisher-loc>: <publisher-name>Caltech</publisher-name>.</citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gregory</surname>
<given-names>P. S.</given-names>
</name>
<name>
<surname>Tao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Foundational fuel chemistry model version 1.0 (ffcm-1)</article-title>.</citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Innes</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Saba</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Fischer</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Gandhi</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Rudilosso</surname>
<given-names>M. C.</given-names>
</name>
<name>
<surname>Joy</surname>
<given-names>N. M.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Fashionable modelling with flux</article-title>. <source>CoRR</source>, <fpage>01457</fpage>. <comment>abs/1811</comment>.</citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jones</surname>
<given-names>W. P.</given-names>
</name>
<name>
<surname>Rigopoulos</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Rate-controlled constrained equilibrium: formulation and application to nonpremixed laminar flames</article-title>. <source>Combust. Flame</source> <volume>142</volume> (<issue>3</issue>), <fpage>223</fpage>&#x2013;<lpage>234</lpage>. <pub-id pub-id-type="doi">10.1016/j.combustflame.2005.03.008</pub-id>
</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kercher</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Votsmeier</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2025</year>). <article-title>Machine learning surrogate models for mechanistic kinetics: embedding atomic balance and positivity</article-title>. <source>J. Phys. Chem. Lett.</source> <volume>16</volume> (<issue>19</issue>). <pub-id pub-id-type="doi">10.1021/acs.jpclett.5c00602</pub-id>
</citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kumar</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Echekki</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2024</year>). <article-title>Combustion chemistry acceleration with DeepONets</article-title>. <source>Fuel</source> <volume>365</volume>, <fpage>131212</fpage>. <pub-id pub-id-type="doi">10.1016/j.fuel.2024.131212</pub-id>
</citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kumar</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Rieth</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Owoyele</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>J. H.</given-names>
</name>
<name>
<surname>Echekki</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2023a</year>). <article-title>Acceleration of turbulent combustion dns <italic>via</italic> principal component transport</article-title>. <source>Combust. Flame</source> <volume>255</volume>, <fpage>112903</fpage>. <pub-id pub-id-type="doi">10.1016/j.combustflame.2023.112903</pub-id>
</citation>
</ref>
<ref id="B14">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Kumar</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Pal</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2023b</year>). &#x201c;<article-title>A physics-constrained neuralODE approach for robust learning of stiff chemical kinetics</article-title>,&#x201d; in <source>NeurIPS machine learning and the physical sciences workshop</source>.</citation>
</ref>
<ref id="B15">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Kumar</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Pal</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2024</year>). &#x201c;<article-title>A physics-informed autoencoder-neuralODE framework (Phy-ChemNODE) for learning complex fuel combustion kinetics</article-title>,&#x201d; in <source>NeurIPS machine learning and the physical sciences workshop</source>.</citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kumar</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Pal</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2025</year>). <article-title>A physics-constrained neural ordinary differential equations approach for robust learning of stiff chemical kinetics</article-title>. <source>Combust. Theory Model.</source> <volume>29</volume> (<issue>2</issue>), <fpage>332</fpage>&#x2013;<lpage>347</lpage>. <pub-id pub-id-type="doi">10.1080/13647830.2025.2478266</pub-id>
</citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Law</surname>
<given-names>C. K.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>A directed relation graph method for mechanism reduction</article-title>. <source>Proc. Combust. Inst.</source> <volume>30</volume> (<issue>1</issue>), <fpage>1333</fpage>&#x2013;<lpage>1341</lpage>. <pub-id pub-id-type="doi">10.1016/j.proci.2004.08.145</pub-id>
</citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Law</surname>
<given-names>C. K.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Toward accommodating realistic fuel chemistry in large-scale computations</article-title>. <source>Prog. Energy Combust. Sci.</source> <volume>35</volume> (<issue>2</issue>), <fpage>192</fpage>&#x2013;<lpage>215</lpage>. <pub-id pub-id-type="doi">10.1016/j.pecs.2008.10.002</pub-id>
</citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Maas</surname>
<given-names>U.</given-names>
</name>
<name>
<surname>Pope</surname>
<given-names>S. B.</given-names>
</name>
</person-group> (<year>1992</year>). <article-title>Simplifying chemical kinetics: intrinsic low-dimensional manifolds in composition space</article-title>. <source>Combust. Flame</source> <volume>88</volume> (<issue>3</issue>), <fpage>239</fpage>&#x2013;<lpage>264</lpage>. <pub-id pub-id-type="doi">10.1016/0010-2180(92)90034-m</pub-id>
</citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Maes</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>De Ceuster</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Sande</surname>
<given-names>M. V. de</given-names>
</name>
<name>
<surname>Decin</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2024</year>). <article-title>MACE: a machine-learning approach to chemistry emulation</article-title>. <source>Astrophysical J.</source> <volume>969</volume> (<issue>79</issue>), <fpage>79</fpage>. <pub-id pub-id-type="doi">10.3847/1538-4357/ad47a1</pub-id>
</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mohan</surname>
<given-names>A. T.</given-names>
</name>
<name>
<surname>Lubbers</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Chertkov</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Livescu</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2023</year>). <article-title>Embedding hard physical constraints in neural network coarse-graining of three-dimensional turbulence</article-title>. <source>Phys. Rev. Fluids</source> <volume>8</volume>, <fpage>014604</fpage>. <pub-id pub-id-type="doi">10.1103/physrevfluids.8.014604</pub-id>
</citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nair</surname>
<given-names>A. S.</given-names>
</name>
<name>
<surname>Barwey</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Pal</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>MacArt</surname>
<given-names>J. F.</given-names>
</name>
<name>
<surname>Arcomano</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Maulik</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2025</year>). <article-title>Understanding latent timescales in neural ordinary differential equation models of advection-dominated dynamical systems</article-title>. <source>Phys. D: Nonlinear Phenom.</source> <volume>476</volume>, <fpage>134650</fpage>. <pub-id pub-id-type="doi">10.1016/j.physd.2025.134650</pub-id>
</citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Owoyele</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Pal</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>ChemNODE: a neural ordinary differential equations framework for efficient chemical kinetic solvers</article-title>. <source>Energy AI</source> <volume>7</volume>, <fpage>100118</fpage>. <pub-id pub-id-type="doi">10.1016/j.egyai.2021.100118</pub-id>
</citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Owoyele</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Echekki</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Toward computationally efficient combustion DNS with complex fuels via principal component transport</article-title>. <source>Combust. Theory Model.</source> <volume>21</volume> (<issue>4</issue>), <fpage>770</fpage>&#x2013;<lpage>798</lpage>. <pub-id pub-id-type="doi">10.1080/13647830.2017.1296976</pub-id>
</citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Raissi</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Perdikaris</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Karniadakis</surname>
<given-names>G. E.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Physics-informed neural networks: a deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations</article-title>. <source>J. Comput. Phys.</source> <volume>378</volume>, <fpage>686</fpage>&#x2013;<lpage>707</lpage>. <pub-id pub-id-type="doi">10.1016/j.jcp.2018.10.045</pub-id>
</citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ranade</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Alqahtani</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Farooq</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Echekki</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>An ann based hybrid chemistry framework for complex fuels</article-title>. <source>Fuel</source> <volume>241</volume>, <fpage>625</fpage>&#x2013;<lpage>636</lpage>. <pub-id pub-id-type="doi">10.1016/j.fuel.2018.12.082</pub-id>
</citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sen</surname>
<given-names>B. A.</given-names>
</name>
<name>
<surname>Hawkes</surname>
<given-names>E. R.</given-names>
</name>
<name>
<surname>Menon</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Large eddy simulation of extinction and reignition with artificial neural networks based chemical kinetics</article-title>. <source>Combust. Flame</source> <volume>157</volume> (<issue>3</issue>), <fpage>566</fpage>&#x2013;<lpage>578</lpage>. <pub-id pub-id-type="doi">10.1016/j.combustflame.2009.11.006</pub-id>
</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shampine</surname>
<given-names>L. F.</given-names>
</name>
</person-group> (<year>1993</year>). <article-title>Ill-conditioned matrices and the integration of stiff ODEs</article-title>. <source>J. Comput. Appl. Math.</source> <volume>48</volume> (<issue>3</issue>), <fpage>279</fpage>&#x2013;<lpage>292</lpage>. <pub-id pub-id-type="doi">10.1016/0377-0427(93)90025-7</pub-id>
</citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sturm</surname>
<given-names>P. O.</given-names>
</name>
<name>
<surname>Wexler</surname>
<given-names>A. S.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Conservation laws in a neural network architecture: enforcing the atom balance of a julia-based photochemical model (v0.2.0)</article-title>. <source>Geosci. Model Dev.</source> <volume>15</volume> (<issue>8</issue>), <fpage>3417</fpage>&#x2013;<lpage>3431</lpage>. <pub-id pub-id-type="doi">10.5194/gmd-15-3417-2022</pub-id>
</citation>
</ref>
<ref id="B29">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Sun Tang</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Turk</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2022</year>). &#x201c;<article-title>Reduced order model for chemical kinetics: a case study with primordial chemical network</article-title>,&#x201d; in <source>ICML workshop on machine learning for astrophysics</source>.</citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Valorani</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Creta</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Goussis</surname>
<given-names>D. A.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>J. C.</given-names>
</name>
<name>
<surname>Habib</surname>
<given-names>N. N.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>An automatic procedure for the simplification of chemical kinetic mechanisms based on CSP</article-title>. <source>Combust. Flame</source> <volume>146</volume> (<issue>1</issue>), <fpage>29</fpage>&#x2013;<lpage>51</lpage>. <pub-id pub-id-type="doi">10.1016/j.combustflame.2006.03.011</pub-id>
</citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vijayarangan</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Uranakara</surname>
<given-names>H. A.</given-names>
</name>
<name>
<surname>Barwey</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Malpica Galassi</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Rafi Malik</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Valorani</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2024</year>). <article-title>A data-driven reduced-order model for stiff chemical kinetics using dynamics-informed training</article-title>. <source>Energy AI</source> <volume>15</volume>, <fpage>100325</fpage>. <pub-id pub-id-type="doi">10.1016/j.egyai.2023.100325</pub-id>
</citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wan</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Barnaud</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Vervisch</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Domingo</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Chemistry reduction using machine learning trained from non-premixed micro-mixing modeling: application to dns of a syngas turbulent oxy-flame with side-wall effects</article-title>. <source>Combust. Flame</source> <volume>220</volume>, <fpage>119</fpage>&#x2013;<lpage>129</lpage>. <pub-id pub-id-type="doi">10.1016/j.combustflame.2020.06.008</pub-id>
</citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Weng</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Han</surname>
<given-names>Li</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Z. X.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2025</year>). <article-title>Extended fourier neural operators to learn stiff kinetics under unseen conditions</article-title>. <source>Combust. Flame</source> <volume>272</volume>, <fpage>113847</fpage>. <pub-id pub-id-type="doi">10.1016/j.combustflame.2024.113847</pub-id>
</citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wold</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Esbensen</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Paul</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>1987</year>). <article-title>Principal component analysis</article-title>. <source>Chemom. Intelligent Laboratory Syst.</source> <volume>2</volume> (<issue>1</issue>), <fpage>37</fpage>&#x2013;<lpage>52</lpage>. <pub-id pub-id-type="doi">10.1016/0169-7439(87)80084-9</pub-id>
</citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Weng</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2024</year>). <article-title>Learning transient evolution of multidimensional reacting flows by multiscale fourier neural operators</article-title>. <source>Proc. Combust. Inst.</source> <volume>40</volume>, <fpage>105714</fpage>. <pub-id pub-id-type="doi">10.1016/j.proci.2024.105714</pub-id>
</citation>
</ref>
</ref-list>
</back>
</article>