AUTHOR=Wang Jia-Wen, Meng Meng, Dai Mu-Wei, Liang Ping, Hou Juan
TITLE=Correlation does not equal causation: the imperative of causal inference in machine learning models for immunotherapy
JOURNAL=Frontiers in Immunology
VOLUME=Volume 16 - 2025
YEAR=2025
URL=https://www.frontiersin.org/journals/immunology/articles/10.3389/fimmu.2025.1630781
DOI=10.3389/fimmu.2025.1630781
ISSN=1664-3224
ABSTRACT=Machine learning (ML) has played a crucial role in advancing precision immunotherapy by integrating multi-omics data to identify biomarkers and predict therapeutic responses. However, a prevalent methodological flaw persists in immunological studies: an overreliance on correlation-based analysis and a neglect of causal inference. Traditional ML models struggle to capture the intricate dynamics of immune interactions and often function as “black boxes.” A systematic review of 90 studies on immune checkpoint inhibitors revealed that, despite employing ML or deep learning techniques, none incorporated causal inference. Similarly, all 36 retrospective studies modeling melanoma exhibited the same limitation. This “knowledge–practice gap” highlights a disconnect: although researchers acknowledge that correlation does not imply causation, causal inference is often omitted in practice. Recent advances in causal ML, such as Targeted-BEHRT, CIMLA, and CURE, offer promising solutions. These models can distinguish genuine causal relationships from spurious correlations, integrate multimodal data (including imaging, genomics, and clinical records), and control for unmeasured confounders, thereby enhancing model interpretability and clinical applicability. Nevertheless, practical implementation still faces major challenges, including poor data quality, algorithmic opacity, methodological complexity, and interdisciplinary communication barriers.
To bridge these gaps, future efforts must focus on advancing research in causal ML, developing platforms such as the Perturbation Cell Atlas and federated causal learning frameworks, and fostering interdisciplinary training programs. These efforts will be essential to translating causal ML from theoretical innovation to clinical reality in the next 5 to 10 years, which would represent not only a methodological upgrade but also a paradigm shift in immunotherapy research and clinical decision-making.