AUTHOR=Zhou Yumei , Fan Pengbei , Zhang Haiyun , Han Shuai , Bai Minghua , Wang Ji , Wang Qi TITLE=Multi-omics and machine learning identify novel biomarkers and therapeutic targets of COVID-19 JOURNAL=Frontiers in Immunology VOLUME=Volume 16 - 2025 YEAR=2025 URL=https://www.frontiersin.org/journals/immunology/articles/10.3389/fimmu.2025.1671936 DOI=10.3389/fimmu.2025.1671936 ISSN=1664-3224 ABSTRACT=IntroductionCOVID-19 has caused over 7 million deaths worldwide since its onset in 2019, and the virus remains a significant health threat. Identifying sensitive and specific biomarkers, along with elucidating immune-mediated mechanisms, is essential for improving the diagnosis, treatment, and prevention of COVID-19. To predict key molecular markers of COVID-19 using an established multi-omics framework combined with machine learning models.MethodsWe conducted an integrated analysis of single-cell RNA sequencing (scRNA-seq), bulk RNA sequencing, and proteomics data to identify critical biomarkers associated with COVID-19. The multi-omics approach enabled the characterization of gene expression dynamics and alterations in immune cell subsets in COVID-19 patients. Machine learning techniques and molecular docking analyses were employed to identify biomarkers and therapeutic targets within the disease’s pathophysiological network.ResultsPrincipal component analysis effectively grouped samples based on clinical characteristics. Using random forest and SVM-RFE models, we identified clinical indicators capable of accurately distinguishing COVID-19 patients. Transcriptomic analysis, including scRNA-seq, highlighted the pivotal role of CD8+ T cells, and WGCNA identified related module genes. Proteomic analysis, integrated with machine learning, revealed 36 DEPs. Further investigation identified several genes associated with monocyte proportions. Correlation analysis showed that BTD, CFL1, PIGR, and SERPINA3 were strongly linked to CD8+ T cell abundance in COVID-19 patients. ROC curve analysis demonstrated that these genes could effectively distinguish between COVID-19 patients and healthy individuals. Concordant findings from both transcriptomic and proteomic levels support BTD, CFL1, PIGR, and SERPINA3 as potential auxiliary diagnostic markers. Finally, AlphaFold-based molecular docking analysis suggested these biomarkers may also serve as candidate therapeutic targets.DiscussionPreliminary findings indicate that BTD, CFL1, PIGR, and SERPINA3 are vital molecular biomarkers related of CD8+ T cell, providing new insights into the molecular mechanisms and long-term prevention of COVID-19.