AUTHOR=Zhang Yanji , Tian Yingying , Jian Yang , Wei Zairong , Cai Silang , Zhang Gaofengi , Deng Chengliang TITLE=Development and external validation of machine-learning based models to predict diabetic foot ulcer in diabetes population JOURNAL=Frontiers in Endocrinology VOLUME=Volume 16 - 2025 YEAR=2025 URL=https://www.frontiersin.org/journals/endocrinology/articles/10.3389/fendo.2025.1692917 DOI=10.3389/fendo.2025.1692917 ISSN=1664-2392 ABSTRACT=BackgroundDiabetic foot ulcer (DFU) is a common and serious complication in patients with diabetes, which affects the quality of life greatly as well as brings high risk for mortality. Identification of high-risk individuals, as early as possible is important for efficient intervention and prevention. This study systematically evaluates and summarizes the diagnostic accuracy of machine learning approaches for predicting DFU risk in diabetic patients.MethodsThis study adhered to the TRIPOD+AI (Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis, Extended for Artificial Intelligence) guidelines. Using data from the National Health and Nutrition Examination Survey (NHANES) 1999–2004 to determine diagnosis of DFU related clinical characteristics, laboratory indicators and lifestyle-related variables. The diagnostic performance of its models trained using Logistic Regression (LR), K-Nearest Neighbors (KNN), Random Forest (RF), Extreme Gradient Boosting (XGBoost) and Support Vector Machine (SVM) classifiers were compared. An independent testing dataset collected from the Second Affiliated Hospital of Zunyi Medical University was used to conduct external validation.ResultsThis study included 1, 857 participants from NHANES and 807 individuals recruited at the testing dataset. Key predictors identified in NHANES were numbness in extremities, direct HDL cholesterol, lymphocyte, white blood cell, segmented neutrophils, and BMI. Among them, the RF was identified as having the highest area under receiver operating characteristic curve (AUC) for NHANES at 0.81. The RF model also had the highest discriminative performance in external validation (as measured by an AUC of 0.79). Other models also provided good results in external validation: XGBoost had an AUC of 0.76, SVM reached 0.72, KNN reached 0.70, and LR received a score of 0.69.ConclusionThe ability of machine learning models to predict DFU risk was good in a combined population cohort when measured using common metrics but varied across distinct regions. These results support future clinical evaluation of these models and underscore the need to select algorithms a priori based on the target patient population.