AUTHOR=Mudunuru Maruti K. , Son Kyongho , Jiang Peishi , Hammond Glenn , Chen Xingyuan TITLE=Scalable deep learning for watershed model calibration JOURNAL=Frontiers in Earth Science VOLUME=Volume 10 - 2022 YEAR=2022 URL=https://www.frontiersin.org/journals/earth-science/articles/10.3389/feart.2022.1026479 DOI=10.3389/feart.2022.1026479 ISSN=2296-6463 ABSTRACT=Watershed models such as the Soil and Water Assessment Tool (SWAT) consist of high-dimensional physical and empirical parameters. These parameters often need to be estimated/calibrated through inverse modeling to produce reliable predictions on hydrological fluxes and states. Existing parameter estimation methods can be time consuming, inefficient, and computationally expensive for high-dimensional problems. In this paper, we present a fast, accurate, and robust method to calibrate the SWAT model (i.e., 20 parameters) using scalable deep learning (DL). We developed DL-enabled inverse models based on convolutional neural networks to assimilate observed streamflow data and estimate the SWAT model parameters. Scalable hyperparameter tuning is performed using high-performance computing resources to identify the top 50 optimal neural network architectures. We used ensemble SWAT simulations to train, validate, and test the DL models. We estimated the parameters of the SWAT model using observed streamflow data and assessed the impact of measurement errors on SWAT model calibration. We tested and validated the proposed scalable DL methodology on the American River Watershed, located in the Pacific Northwest-based Yakima River basin. Our results show that the DL model-based calibration is better than two popular parameter estimation methods (i.e., the generalized likelihood uncertainty estimation [GLUE] and the dynamically dimensioned search [DDS], which is a global optimization algorithm). For the set of parameters that are sensitive to the observations, our DL method yield narrower ranges than the GLUE method but broader ranges than values produced using the DDS method within the sampling range even under high relative observational errors. The SWAT model calibration performance using the DL, GLUE, and DDS methods are compared using R2 and a set of efficiency metrics, including Nash-Sutcliffe, logarithmic Nash-Sutcliffe, Kling-Gupta, modified Kling-Gupta, and non-parametric Kling-Gupta scores, computed on the observed and simulated watershed responses. Our results show that the DL calibration leads to more accurate low and high streamflow predictions than the GLUE and DDS sets. Our research demonstrates that the proposed DL method has high potential to improve our current practice in calibrating large-scale integrated hydrologic models.