AUTHOR=Ruan Ting , Wu Wenjun , Piao Mingji , Sun Yihan , Ju Xingai , Liu Mengyou , Lu Li , Zhang Bo , Zeng Yifei , Zhang Dongxiao , Li Yongxin , Cui Jianchun 

TITLE=Machine learning-assisted tongue image analysis for the diagnosis of Hashimoto’s thyroiditis

JOURNAL=Frontiers in Medicine

VOLUME=Volume 12 - 2025

YEAR=2025

URL=https://www.frontiersin.org/journals/medicine/articles/10.3389/fmed.2025.1673891

DOI=10.3389/fmed.2025.1673891

ISSN=2296-858X

ABSTRACT=ObjectiveThis study aims to evaluate the value of a machine learning model based on tongue features in the adjunctive diagnosis of Hashimoto’s thyroiditis (HT) and its concomitant hypothyroidism.MethodsTongue images and related clinical data were retrospectively collected from 120 HT patients (60 each from the euthyroid group and the hypothyroidism group), and the tongue region was segmented by preprocessing, and the image feature dimensions were extracted with 1,125 dimensions. Therefore, four methods, namely, random forest (RF), logistic regression (LR), support vector machine (SVM), and decision tree (DT), were utilized for model training, and 80 tongue images of 40 patients from Lixin County People’s Hospital in Anhui Province were utilized for external validation. The model performance evaluation indexes included AUC (Area Under the Curve), Sensitivity, Specificity, Positive Predictive Value (PPV), and Negative Predictive Value (NPV).Resultst-Distributed Stochastic Neighbor Embedding (t-SNE) visualization based on the test set revealed a distinguishable clustering trend between the two groups. The key classification features included tongue texture uniformity, body morphological features, and color depth. The AUC of the four models was higher than 0.82, confirming that the tongue image features have significant predictive value for HT, and the lower limit of 95% CI for all models was higher than 0.75, indicating that the models had stable differentiation ability. The AUC of SVM (0.894) was the best, significantly higher than the other models (RF: 0.857, LR: 0.876, and DT:0.828), indicating that the SVM possesses the strongest ability to classify patients with and without HT and the highest stability. The SVM exhibited balanced performance, with a sensitivity of 0.804 and specificity of 0.936. Consequently, it represents the optimal model for achieving an equilibrium between recall and precision. In external validation, the efficacy of the four models is notable, and the trend is consistent with the test set. SVM still demonstrates notable performance and possesses the best generalization ability among the four models.ConclusionThe tongue image-based machine learning model can effectively assist in distinguishing euthyroid from hypothyroidism in HT, offering a non-invasive, low-cost, and intelligent tool for auxiliary diagnosis and disease risk monitoring in primary care settings.