AUTHOR=Zhang Tianhong , Tian Tian , Zhang Yifan , Teng Qiliang , Zhang Yujie , Pang Jun , Wang Ya , Yang Chao TITLE=Clinical decision system for renal cell carcinoma integrating interpretable machine learning algorithms JOURNAL=Frontiers in Surgery VOLUME=Volume 12 - 2025 YEAR=2025 URL=https://www.frontiersin.org/journals/surgery/articles/10.3389/fsurg.2025.1588208 DOI=10.3389/fsurg.2025.1588208 ISSN=2296-875X ABSTRACT=BackgroundKidney cancer is a highly heterogeneous oncologic disease with historically poor prognosis. Precise assessment of the risk of distal metastasis can facilitate risk stratification and improve prognosis for kidney cancer patients.MethodsData from the Surveillance, Epidemiology, and End Results (SEER) database, we identified 40,527 kidney cancer patients diagnosed between 2010 and 2017 were obtained. LASSO, univariate and multivariate logistic regression analyses were employed to screen independent risk factors for distal metastasis. Six machine learning (ML) algorithms including logistic regression (LR), Naïve Bayes Classifier (NBC), Decision Tree (DT), Random Forest (RF), Gradient boosting machine (GBM) and Extreme gradient boosting (XGB), were further applied to build the predictive models. After testing with ten-fold cross-validation and receiver operating characteristic (ROC) analysis, the model with the highest area under curve (AUC) was selected as the best performing model to establish the risk predictive nomogram and web calculator.ResultsIn distal metastasis risk prediction, the XGB model had the best performance in both training (AUC = 0.91) and testing (AUC = 0.851) datasets among the six ML algorithms. Variables including marital status, sequence number, primary site, grade, pathological type, T-stage, N-stage, the calculated risk of XGB, surgical and radiation treatment were incorporated to establish a nomogram to predict the 1-, 3-, and 5-years survival probability. The calibration plots, decision curve analysis (DCA), ROC curves and Kaplan–Meier (KM) curves all verified the predictive utility of the nomogram.ConclusionsWe established a favorable prediction for the occurrence of distal metastasis with the ML model. The nomogram based on XGB algorithm can contribute to identify high-risk patients and provide optimal clinical strategies.