AUTHOR=Yang Yuan-Jiao , Yan Han-Bing , Liu Wen-Tao , Yang Zhi-Chao , Wang Xiao-Hui , Liu Chen , Zhang Ya-Nan , Wang Jun , Yao Jin-Peng , He Hui TITLE=Application of machine learning to predict the occurrence of venous thromboembolism in patients hospitalized for coronary artery disease: a single-center retrospective study JOURNAL=Frontiers in Cardiovascular Medicine VOLUME=Volume 12 - 2025 YEAR=2025 URL=https://www.frontiersin.org/journals/cardiovascular-medicine/articles/10.3389/fcvm.2025.1610938 DOI=10.3389/fcvm.2025.1610938 ISSN=2297-055X ABSTRACT=BackgroundThis study aimed to construct a prediction model for the occurrence of venous thromboembolism (VTE) in patients hospitalized with coronary heart disease (CHD) using machine learning algorithms.MethodsClinical data were from the medical records of CHD patients admitted to tertiary hospitals in eastern Liaoning Province between 2019 and 2024. Five machine learning algorithms—random forest (RF), classification and regression tree (CART), logistic regression (LR), logistic regression + least absolute shrinkage and selection operator (LR + LASSO), and extreme gradient boosting (XGBoost)—were used to construct predictive models. The area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and accuracy were comparison metrics between different models.ResultsA total of 3113 CHD inpatients were included in the study. In the internal validation set, XGBoost had the highest AUC (0.704), sensitivity (0.708), and accuracy (0.692), and RF had the highest specificity (0.706). In the time external validation set, LR + LASSO had the highest AUC (0.649), the highest specificity (0.683) for RF, and the highest sensitivity (0.682) and accuracy (0.656) for XGBoost. D-dimer, Age, and Neutrophil Count (NEUT) were the three most important relevant indicators.ConclusionThe prediction model based on machine learning algorithms for the occurrence of VTE in CHD inpatients has a specific diagnostic value. The prediction model constructed by LR + LASSO and XGBoost is more effective than the models constructed by other methods. The results of this study can provide research ideas for the clinical prevention and treatment of VTE events occurring in CHD inpatients.