AUTHOR=Wang Angela, Xiao Elena, Cheng Jason, Shen Xiaoxi
TITLE=Genetic risk predictions using deep learning models with summary data
JOURNAL=Frontiers in Bioinformatics
VOLUME=Volume 5 - 2025
YEAR=2026
URL=https://www.frontiersin.org/journals/bioinformatics/articles/10.3389/fbinf.2025.1657021
DOI=10.3389/fbinf.2025.1657021
ISSN=2673-7647
ABSTRACT=Background: As a driving force of the Fourth Industrial Revolution, deep learning methods have achieved significant success across various fields, including genetic and genomic studies. While individual-level genetic data are ideal for deep learning models, privacy concerns and data-sharing restrictions often limit their availability to researchers. Methods: In this paper, we investigated the potential applications of deep learning models (including deep neural networks, convolutional neural networks, recurrent neural networks, and transformers) when only genetic summary data, such as linkage disequilibrium matrices, are available. The bootstrap method was used to approximate the test error. Simulation studies and real data analyses were conducted to compare the performance of deep learning methods in genetic risk prediction using individual-level genetic data versus genetic summary data. Results: The test mean squared errors (MSEs) of most applied deep learning models are comparable when using individual-level data versus summary data. Conclusion: Our results suggest that suitable deep learning methods could also serve as an alternative approach to predict disease-related traits when only linkage disequilibrium matrices are available as input.
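
The abstract mentions two ingredients that a small example may help make concrete: a linkage disequilibrium (LD) matrix as summary data, and a bootstrap approximation of the test error. The Python sketch below is illustrative only and is not the authors' procedure. It assumes simulated genotypes, uses closed-form ridge regression as a stand-in for a deep learning model, and uses a generic out-of-bag bootstrap; the names `ridge_fit` and `bootstrap_test_mse` and all parameter values are hypothetical. The LD matrix is computed here from individual-level genotypes only to show what such a summary looks like.

```python
import numpy as np

def ridge_fit(X, y, lam=1.0):
    """Closed-form ridge regression (assumed stand-in for any predictive model)."""
    p = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ y)

def bootstrap_test_mse(X, y, n_boot=200, lam=1.0, seed=0):
    """Generic out-of-bag bootstrap estimate of the test MSE."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    errors = []
    for _ in range(n_boot):
        idx = rng.integers(0, n, size=n)        # resample rows with replacement
        oob = np.setdiff1d(np.arange(n), idx)   # rows not drawn act as a test set
        if oob.size == 0:
            continue
        beta = ridge_fit(X[idx], y[idx], lam)
        errors.append(np.mean((y[oob] - X[oob] @ beta) ** 2))
    return float(np.mean(errors))

# Simulated data (assumption): 300 individuals, 20 SNPs coded 0/1/2.
rng = np.random.default_rng(1)
n, p = 300, 20
X = rng.binomial(2, 0.3, size=(n, p)).astype(float)
X = (X - X.mean(axis=0)) / X.std(axis=0)        # standardize each SNP
beta_true = rng.normal(0.0, 0.2, size=p)
y = X @ beta_true + rng.normal(0.0, 1.0, size=n)

# LD matrix: pairwise correlations between SNP genotypes.
ld = np.corrcoef(X, rowvar=False)

mse = bootstrap_test_mse(X, y)
print(f"LD matrix shape: {ld.shape}, bootstrap test MSE: {mse:.3f}")
```

Each bootstrap replicate fits the model on a resample and scores it on the individuals left out of that resample, so the averaged error approximates performance on unseen data without a separate hold-out set.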