<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3-mathml3.dtd">
<article xmlns:ali="http://www.niso.org/schemas/ali/1.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="EN" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Aging Neurosci.</journal-id>
<journal-title-group>
<journal-title>Frontiers in Aging Neuroscience</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Aging Neurosci.</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub">1663-4365</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fnagi.2025.1597511</article-id>
<article-version article-version-type="Version of Record" vocab="NISO-RP-8-2008"/>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Original Research</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Integrated analysis of N-glycosylation and Alzheimer&#x2019;s disease: identifying key biomarkers and mechanisms</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Zhang</surname> <given-names>Hao</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; review &amp; editing" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-review-editing/">Writing &#x2013; review &amp; editing</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Formal analysis" vocab-term-identifier="https://credit.niso.org/contributor-roles/formal-analysis/">Formal analysis</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Software" vocab-term-identifier="https://credit.niso.org/contributor-roles/software/">Software</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; original draft" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-original-draft/">Writing &#x2013; original draft</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Data curation" vocab-term-identifier="https://credit.niso.org/contributor-roles/data-curation/">Data curation</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Methodology" vocab-term-identifier="https://credit.niso.org/contributor-roles/methodology/">Methodology</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Visualization" vocab-term-identifier="https://credit.niso.org/contributor-roles/visualization/">Visualization</role>
</contrib>
<contrib contrib-type="author">
<name><surname>Chen</surname> <given-names>WenJun</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Data curation" vocab-term-identifier="https://credit.niso.org/contributor-roles/data-curation/">Data curation</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Visualization" vocab-term-identifier="https://credit.niso.org/contributor-roles/visualization/">Visualization</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Funding acquisition" vocab-term-identifier="https://credit.niso.org/contributor-roles/funding-acquisition/">Funding acquisition</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Software" vocab-term-identifier="https://credit.niso.org/contributor-roles/software/">Software</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Methodology" vocab-term-identifier="https://credit.niso.org/contributor-roles/methodology/">Methodology</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; review &amp; editing" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-review-editing/">Writing &#x2013; review &amp; editing</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Formal analysis" vocab-term-identifier="https://credit.niso.org/contributor-roles/formal-analysis/">Formal analysis</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; original draft" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-original-draft/">Writing &#x2013; original draft</role>
</contrib>
<contrib contrib-type="author">
<name><surname>Yuan</surname> <given-names>ShuYou</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Visualization" vocab-term-identifier="https://credit.niso.org/contributor-roles/visualization/">Visualization</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Methodology" vocab-term-identifier="https://credit.niso.org/contributor-roles/methodology/">Methodology</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; original draft" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-original-draft/">Writing &#x2013; original draft</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; review &amp; editing" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-review-editing/">Writing &#x2013; review &amp; editing</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Formal analysis" vocab-term-identifier="https://credit.niso.org/contributor-roles/formal-analysis/">Formal analysis</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Software" vocab-term-identifier="https://credit.niso.org/contributor-roles/software/">Software</role>
</contrib>
<contrib contrib-type="author">
<name><surname>Cai</surname> <given-names>Bo</given-names></name>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Software" vocab-term-identifier="https://credit.niso.org/contributor-roles/software/">Software</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Data curation" vocab-term-identifier="https://credit.niso.org/contributor-roles/data-curation/">Data curation</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; review &amp; editing" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-review-editing/">Writing &#x2013; review &amp; editing</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Visualization" vocab-term-identifier="https://credit.niso.org/contributor-roles/visualization/">Visualization</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; original draft" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-original-draft/">Writing &#x2013; original draft</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Methodology" vocab-term-identifier="https://credit.niso.org/contributor-roles/methodology/">Methodology</role>
</contrib>
<contrib contrib-type="author">
<name><surname>Ding</surname> <given-names>ShaoXiang</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; original draft" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-original-draft/">Writing &#x2013; original draft</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Visualization" vocab-term-identifier="https://credit.niso.org/contributor-roles/visualization/">Visualization</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Data curation" vocab-term-identifier="https://credit.niso.org/contributor-roles/data-curation/">Data curation</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Software" vocab-term-identifier="https://credit.niso.org/contributor-roles/software/">Software</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; review &amp; editing" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-review-editing/">Writing &#x2013; review &amp; editing</role>
</contrib>
<contrib contrib-type="author">
<name><surname>Bao</surname> <given-names>HongXia</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; original draft" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-original-draft/">Writing &#x2013; original draft</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Visualization" vocab-term-identifier="https://credit.niso.org/contributor-roles/visualization/">Visualization</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; review &amp; editing" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-review-editing/">Writing &#x2013; review &amp; editing</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Software" vocab-term-identifier="https://credit.niso.org/contributor-roles/software/">Software</role>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Sun</surname> <given-names>JunKai</given-names></name>
<xref ref-type="aff" rid="aff5"><sup>5</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Supervision" vocab-term-identifier="https://credit.niso.org/contributor-roles/supervision/">Supervision</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; review &amp; editing" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-review-editing/">Writing &#x2013; review &amp; editing</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Conceptualization" vocab-term-identifier="https://credit.niso.org/contributor-roles/conceptualization/">Conceptualization</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Project administration" vocab-term-identifier="https://credit.niso.org/contributor-roles/project-administration/">Project administration</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Resources" vocab-term-identifier="https://credit.niso.org/contributor-roles/resources/">Resources</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; original draft" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-original-draft/">Writing &#x2013; original draft</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Investigation" vocab-term-identifier="https://credit.niso.org/contributor-roles/investigation/">Investigation</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Funding acquisition" vocab-term-identifier="https://credit.niso.org/contributor-roles/funding-acquisition/">Funding acquisition</role>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Lu</surname> <given-names>Wei</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c002"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/2896489/overview"/>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Supervision" vocab-term-identifier="https://credit.niso.org/contributor-roles/supervision/">Supervision</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Conceptualization" vocab-term-identifier="https://credit.niso.org/contributor-roles/conceptualization/">Conceptualization</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Resources" vocab-term-identifier="https://credit.niso.org/contributor-roles/resources/">Resources</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; original draft" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-original-draft/">Writing &#x2013; original draft</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; review &amp; editing" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-review-editing/">Writing &#x2013; review &amp; editing</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Project administration" vocab-term-identifier="https://credit.niso.org/contributor-roles/project-administration/">Project administration</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Investigation" vocab-term-identifier="https://credit.niso.org/contributor-roles/investigation/">Investigation</role>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Zhu</surname> <given-names>HaoGang</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c003"><sup>&#x002A;</sup></xref>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; review &amp; editing" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-review-editing/">Writing &#x2013; review &amp; editing</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Conceptualization" vocab-term-identifier="https://credit.niso.org/contributor-roles/conceptualization/">Conceptualization</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Investigation" vocab-term-identifier="https://credit.niso.org/contributor-roles/investigation/">Investigation</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Resources" vocab-term-identifier="https://credit.niso.org/contributor-roles/resources/">Resources</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Data curation" vocab-term-identifier="https://credit.niso.org/contributor-roles/data-curation/">Data curation</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Project administration" vocab-term-identifier="https://credit.niso.org/contributor-roles/project-administration/">Project administration</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Supervision" vocab-term-identifier="https://credit.niso.org/contributor-roles/supervision/">Supervision</role>
</contrib>
</contrib-group>
<aff id="aff1"><label>1</label><institution>Geriatrics Center, Wuxi Second Geriatric Hospital</institution>, <city>Wuxi</city>, <country country="cn">China</country></aff>
<aff id="aff2"><label>2</label><institution>Department of Neurology, Wuxi Second Geriatric Hospital</institution>, <city>Wuxi</city>, <country country="cn">China</country></aff>
<aff id="aff3"><label>3</label><institution>Department of Laboratory, Wuxi Second Geriatric Hospital</institution>, <city>Wuxi</city>, <country country="cn">China</country></aff>
<aff id="aff4"><label>4</label><institution>Department of Pathology, Wuxi Second Geriatric Hospital</institution>, <city>Wuxi</city>, <country country="cn">China</country></aff>
<aff id="aff5"><label>5</label><institution>Department of Interventional Radiology, Wuxi No. 5 People&#x2019;s Hospital</institution>, <city>Wuxi</city>, <country country="cn">China</country></aff>
<author-notes>
<corresp id="c001"><label>&#x002A;</label>Correspondence: JunKai Sun, <email xlink:href="mailto:854437824@qq.com">854437824@qq.com</email></corresp>
<corresp id="c002">Wei Lu, <email xlink:href="mailto:luwei5166@gmail.com">luwei5166@gmail.com</email></corresp>
<corresp id="c003">HaoGang Zhu, <email xlink:href="mailto:911207258@qq.com">911207258@qq.com</email></corresp>
</author-notes>
<pub-date publication-format="electronic" date-type="pub" iso-8601-date="2025-11-13">
<day>13</day>
<month>11</month>
<year>2025</year>
</pub-date>
<pub-date publication-format="electronic" date-type="collection">
<year>2025</year>
</pub-date>
<volume>17</volume>
<elocation-id>1597511</elocation-id>
<history>
<date date-type="received">
<day>21</day>
<month>03</month>
<year>2025</year>
</date>
<date date-type="accepted">
<day>24</day>
<month>10</month>
<year>2025</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2025 Zhang, Chen, Yuan, Cai, Ding, Bao, Sun, Lu and Zhu.</copyright-statement>
<copyright-year>2025</copyright-year>
<copyright-holder>Zhang, Chen, Yuan, Cai, Ding, Bao, Sun, Lu and Zhu</copyright-holder>
<license>
<ali:license_ref start_date="2025-11-13">https://creativecommons.org/licenses/by/4.0/</ali:license_ref>
<license-p>This is an open-access article distributed under the terms of the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution License (CC BY)</ext-link>. The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</license-p>
</license>
</permissions>
<abstract>
<sec>
<title>Background</title>
<p>Alzheimer&#x2019;s disease (AD) is the most prevalent cause of dementia in the elderly, imposing a significant societal burden. Current therapeutic approaches primarily address symptoms, underscoring the critical need to elucidate its pathogenesis and identify robust early biomarkers. N-glycosylation, a critical post-translational modification, is dysregulated in neurodegenerative disorders, yet its role in AD and diagnostic potential remain underexplored.</p>
</sec>
<sec>
<title>Objective</title>
<p>This investigation aimed to characterize the interplay between N-glycosylation and AD through multi-dimensional bioinformatics analysis, identify core differentially expressed genes (DEGs) associated with this crosstalk, and evaluate their diagnostic efficacy in early AD detection.</p>
</sec>
<sec>
<title>Methods</title>
<p>A bibliometric analysis of Web of Science literature spanning 2001&#x2013;2025 was performed using VOSviewer, CiteSpace, and R. Transcriptomic data were analyzed with LIMMA to identify DEGs. Feature prioritization and molecular interaction decoding were achieved through Lasso, Random Forest, XGBoost, and SHAP analysis.</p>
</sec>
<sec>
<title>Results</title>
<p>Bibliometric analysis highlighted a shift toward granular molecular mechanisms, with &#x201C;bisecting GlcNAc&#x201D; and &#x201C;GNT-III (MGAT3)&#x201D; emerging as key research topics. Differential expression profiling identified 6,845 DEGs, including TMEM59, MLEC, and MAX. Machine learning algorithms consistently prioritized these three genes as core N-glycosylation-related biomarkers, alongside APP as a key associated molecule. Among transcription factors, MAX was identified as a central regulator, with a subset of 8 factors (including MAX and BRD9) pinpointed as critical modulators of N-glycosylation and glial activation in AD. Diagnostic models demonstrated strong performance: logistic regression achieved an AUC of 0.947 with MAX, APP, and MLEC; Random Forest and XGBoost attained perfect AUC = 1.0 in primary analyses; and a clinical nomogram integrating core genes yielded an AUC of 0.899. SHAP analysis confirmed MAX, APP, MLEC, and TMEM59 as top predictors, revealing significant positive interactions between MLEC and TMEM59 (<italic>p</italic> = 0.00019) and a negative interaction between MAX and MGAT3 (<italic>p</italic> = 0.0288). Notably, MAX alone served as a impactful single-gene biomarker, with AUC values ranging from 0.644 to 0.898 across external validation.</p>
</sec>
<sec>
<title>Conclusion</title>
<p>MAX, MLEC, and TMEM59 represent key N-glycosylation-linked diagnostic biomarkers for AD. This integrative framework provides novel insights into AD pathogenesis and lays the foundation for personalized diagnostic tools and therapies, warranting experimental validation.</p>
</sec>
</abstract>
<kwd-group>
<kwd>Alzheimer&#x2019;s disease</kwd>
<kwd>N-glycosylation</kwd>
<kwd>diagnostic biomarkers</kwd>
<kwd>machine learning</kwd>
<kwd>transcription factors</kwd>
</kwd-group>
<funding-group>
<funding-statement>The author(s) declare financial support was received for the research and/or publication of this article. This work was supported by four grants awarded to Hao Zhang: the Wuxi Science and Technology Development Fund (Award No. Y20232005), the 2025 Open Project of the Provincial Key Laboratory for Integrative Chinese-Western Prevention and Treatment of Geriatric Diseases at Yangzhou University (Award No. 202528), the 2025 Wuxi Aging Research Project (Award No. WXLN25-A-24), and the 2025 Annual Scientific Research Project of the Jiangsu Provincial Association of Geriatrics (Award No. JGS2025ZDM010).</funding-statement>
</funding-group>
<counts>
<fig-count count="11"/>
<table-count count="0"/>
<equation-count count="0"/>
<ref-count count="35"/>
<page-count count="19"/>
<word-count count="10327"/>
</counts>
<custom-meta-group>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Alzheimer&#x2019;s Disease and Related Dementias</meta-value>
</custom-meta>
</custom-meta-group>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="intro">
<label>1</label>
<title>Introduction</title>
<p>Alzheimer&#x2019;s disease (AD) is a progressive neurodegenerative disorder and the leading cause of dementia in the elderly population. It is characterized by a gradual decline in cognitive function and impairment of daily living activities, imposing a significant burden on affected individuals, their families, and society (<xref ref-type="bibr" rid="B17">Kawaharada et al., 2019</xref>). The rising prevalence of AD aligns with global demographic shifts toward an aging population, making it a critical public health priority. Projections indicate that by 2,050, dementia incidence will double in Europe and triple globally, with AD accounting for the majority of cases (<xref ref-type="bibr" rid="B33">You, 2025</xref>). Beyond its direct impact on patients, AD also affects caregivers, who often face psychological, physical, and economic strain, as well as healthcare systems, which must address the growing demand for long-term care and support services (<xref ref-type="bibr" rid="B8">Fang et al., 2024</xref>). Despite extensive research and resource allocation, current therapeutic approaches&#x2014;both pharmacological (e.g., cholinesterase inhibitors and NMDA receptor antagonists) and non-pharmacological&#x2014;remain largely symptomatic and do not halt or reverse disease progression (<xref ref-type="bibr" rid="B16">Kaye et al., 2024</xref>). This limitation underscores the urgent need to elucidate AD pathogenesis and identify reliable biomarkers for early diagnosis and effective management.</p>
<p>Significant progress has been made in characterizing the molecular hallmarks of AD, including amyloid-&#x03B2; plaques and neurofibrillary tangles composed of hyperphosphorylated tau protein (<xref ref-type="bibr" rid="B15">Huang and Jiang, 2009</xref>). However, emerging research highlights the complexity of AD, implicating a diverse array of genetic, epigenetic, metabolic, and environmental factors that contribute to its heterogeneity and clinical variability (<xref ref-type="bibr" rid="B26">Migliore and Copped&#x00E8;, 2022</xref>). Among these, post-translational protein modifications, particularly N-glycosylation, have garnered increasing attention due to their critical roles in protein folding, trafficking, and cell signaling (<xref ref-type="bibr" rid="B12">Hou et al., 2023</xref>). N-glycosylation, a highly conserved enzymatic process involving the attachment of oligosaccharides to asparagine residues, is essential for proper protein conformation and function (<xref ref-type="bibr" rid="B7">D&#x00FC;nser and Schoberer, 2025</xref>). Dysregulation of N-glycosylation pathways has been implicated in the pathogenesis of various diseases, including neurodegenerative disorders, suggesting that alterations in glycosylation may contribute to protein aggregation, synaptic dysfunction, and neuroinflammation in AD (<xref ref-type="bibr" rid="B31">Xu et al., 2025a</xref>).</p>
<p>Despite these insights, the precise role of N-glycosylation in AD pathogenesis remains incompletely understood. While studies have documented the importance of N-glycans in neuronal protein homeostasis and synaptic plasticity, direct evidence linking specific N-glycosylation-related genes to AD onset and progression is limited (<xref ref-type="bibr" rid="B20">Lee et al., 2017</xref>). Furthermore, few investigations have systematically evaluated the diagnostic utility of N-glycosylation profiles or associated genetic markers in AD, in contrast to established biomarkers such as amyloid-&#x03B2;, tau, and neuroimaging modalities. This knowledge gap represents a critical obstacle to the development of novel, non-invasive diagnostic strategies for early AD detection and risk stratification (<xref ref-type="bibr" rid="B18">Khan, 2018</xref>). Additionally, existing studies often rely on single-omic or unidimensional analytic approaches, lacking the integrative perspective necessary to disentangle the complex molecular networks underlying AD (<xref ref-type="bibr" rid="B32">Xu et al., 2025b</xref>).</p>
<p>To address these challenges, this study integrates multi-dimensional bioinformatic and statistical methods to systematically explore the intersection of N-glycosylation and AD. By combining bibliometric analysis, gene expression profiling, and machine learning, this work seeks to overcome previous limitations. Bibliometric approaches map thematic evolution and identify research frontiers within the N-glycosylation&#x2013;AD field, while differential gene expression analysis provides insights into molecular alterations associated with disease status. The incorporation of machine learning algorithms facilitates robust feature selection and prioritization of candidate biomarkers, offering the potential to enhance diagnostic accuracy beyond traditional approaches.</p>
<p>The research design encompasses a comprehensive workflow: first, systematic literature retrieval and bibliometric mapping to delineate key trends and knowledge gaps; second, integration of transcriptomic data from multiple publicly available databases to identify differentially expressed N-glycosylation-related genes in AD; third, employment of advanced machine learning techniques&#x2014;including regularized regression and ensemble methods&#x2014;to screen and validate potential biomarkers; and finally, network and regulatory analyses to elucidate interactions among identified genes, transcription factors (TFs), and molecular pathways. This multi-layered strategy aims to maximize the discovery of clinically relevant targets and provide a holistic understanding of the N-glycosylation landscape in AD.</p>
</sec>
<sec id="S2" sec-type="materials|methods">
<label>2</label>
<title>Materials and methods</title>
<sec id="S2.SS1">
<label>2.1</label>
<title>Data sources and retrieval strategy for N-glycosylation and AD bibliometrics</title>
<p>To investigate the intersection of N-glycosylation and AD through bibliometric analysis, data were retrieved from the Web of Science Core Collection, a gold-standard database for such research. English-language research articles and reviews published between 2001 and 2025 were included using the optimized search query:</p>
<p>&#x201C;TS = ((&#x201C;N-glycosylat&#x002A;&#x201D; OR &#x201C;N-linked glycosylat&#x002A;&#x201D; OR &#x201C;N-glycan&#x002A;&#x201D; OR &#x201C;N-glycoprotein&#x002A;&#x201D; OR &#x201C;protein N-glycosylat&#x002A;&#x201D; OR &#x201C;n-glycoform&#x002A;&#x201D;) AND (Alzheimer&#x002A; OR &#x201C;Alzheimer&#x2019;s disease&#x201D; OR &#x201C;Alzheimer&#x2019;s&#x201D; OR &#x201C;Alzheimer disease&#x201D; OR &#x201C;Alzheimer&#x2019;s-like&#x201D; OR &#x201C;Alzheimer pathology&#x201D; OR &#x201C;Alzheimer pathogenesis&#x201D; OR &#x201C;tau protein&#x201D; OR &#x201C;beta-amyloid&#x201D;)).&#x201D;</p>
<p>The initial search yielded 239 documents, which were filtered to retain only research articles and reviews. The complete list of these documents and detailed search outcomes are provided in <xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref>.</p>
</sec>
<sec id="S2.SS2">
<label>2.2</label>
<title>Analytical methods and tools for bibliometric analysis</title>
<p>Three analytical tools were employed: VOSviewer (v1.6.20), CiteSpace (v6.4 R1), and R (v4.3.3). VOSviewer was used to generate network visualizations, including thematic evolution mappings (to track shifts in research focus over time) and thematic quadrant mappings (to classify themes by centrality and density) (<xref ref-type="bibr" rid="B35">Zhou et al., 2025</xref>). CiteSpace constructed circular thematic landscapes (to identify clusters using silhouette coefficients) and performed targeted cluster analyses (for in-depth exploration of specific clusters) (<xref ref-type="bibr" rid="B28">Sebastian et al., 2020</xref>). Additionally, keyword burst detection was applied to characterize dynamic changes in research themes. R was utilized for quantitative analysis of publication data, including thematic trend quantification and keyword correlation calculations, to ensure rigorous examination of the literature.</p>
</sec>
<sec id="S2.SS3">
<label>2.3</label>
<title>Identification of N-glycosylation-related genes in AD via differential expression and machine learning</title>
<p>The GSE5281 dataset was analyzed to investigate N-glycosylation-related genes in AD. This dataset comprises gene expression profiles from 150 brain samples collected across three Alzheimer&#x2019;s Disease Centers (ADCs): Arizona ADC, Duke University ADC, and Washington University ADC (<xref ref-type="bibr" rid="B34">Zhang et al., 2022</xref>). The dataset covers six brain regions relevant to AD and aging, including the entorhinal cortex, hippocampus, medial temporal gyrus, posterior cingulate, superior frontal gyrus, and primary visual cortex. Samples were stratified by diagnostic group, age group, and APOE genotype to ensure comprehensive coverage of molecular mechanisms in AD and normal aging.</p>
<p>Differentially expressed genes (DEGs) were identified using the LIMMA package in R, a widely used tool for microarray data analysis. The workflow included data preprocessing to normalize expression values and reduce technical variability, followed by fitting linear models to assess differential expression between AD-affected and control samples. DEGs were defined using stringent criteria: an absolute log2 fold change (logFc) &#x003E; 0.4 and an adjusted <italic>p</italic>-value (Benjamini-Hochberg correction) &#x003C; 0.05.</p>
<p>To investigate the overlap between DEGs and N-glycosylation-related genes, a list of N-glycosylation-associated genes was retrieved from the GeneCards database (<xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref>). The intersection between DEGs and this list was computed to identify genes implicated in both AD pathogenesis and N-glycosylation processes. All analyses, including data preprocessing, statistical modeling, and visualization, were performed using R.</p>
<p>Machine learning techniques were employed to identify genes associated with AD from an initial set of 39 genes derived from the intersection of DEGs and N-glycosylation-related genes in the GSE5281 and GSE122063 datasets. Three regularization methods were applied: Lasso, Elastic-Net, and Adaptive Lasso. Lasso regression was configured with an alpha parameter (&#x03B1;) set to 1, Elastic-Net with &#x03B1; = 0.5, and Adaptive Lasso without a predefined &#x03B1;, allowing adaptive adjustment of regularization strength. These methods were implemented using the glmnet package in R, with cross-validation to determine the optimal regularization parameter (&#x03BB;) that minimized deviance. Genes with non-zero coefficients at the optimal &#x03BB; were considered significant. Results were visualized using coefficient path plots and feature impact bar charts, and a Venn diagram was constructed to illustrate the overlap of selected genes across the three methods.</p>
</sec>
<sec id="S2.SS4">
<label>2.4</label>
<title>Identification of diagnostic biomarkers in DEGs via machine learning analysis</title>
<p>The relationship between five specific genes and AD was investigated using the GSE48350 and GSE5281 datasets. The workflow began with data preprocessing, including data normalization and removal of features with zero variance. Feature selection was performed using the Random Forest algorithm, configured to grow 100 trees to assess feature importance. Three machine learning models were constructed: logistic regression, Random Forest, and XGBoost. The logistic regression model was implemented using the &#x201C;lrm&#x201D; function from the &#x201C;rms&#x201D; package, the Random Forest model with 100 trees and importance parameter set to TRUE, and the XGBoost model trained for 50 rounds using the &#x201C;xgb.train&#x201D; function with the objective parameter set to &#x201C;binary:logistic.&#x201D; Model performance was evaluated using the Area Under the Curve (AUC) metric.</p>
</sec>
<sec id="S2.SS5">
<label>2.5</label>
<title>Identification of N-glycosylation-related genes associated with TMEM59 in AD</title>
<p>The GSE5281 and GSE122063 datasets were utilized to investigate molecular interactions associated with TMEM59, focusing on 11 genes implicated in N-glycosylation and AD. These genes&#x2014;STT3A, MLEC, CALR, CANX, CTSD, MGAT1, MGAT2, MGAT3, MGAT5, APP, and STT3B&#x2014;were selected based on their roles in glycosylation processes, calcium signaling, and protein quality control. For example, STT3A and STT3B are involved in glycosylation initiation (<xref ref-type="bibr" rid="B22">Lu et al., 2018</xref>), MGAT family members in glycan chain modification (<xref ref-type="bibr" rid="B4">Cha et al., 2017</xref>), and MLEC in glycan recognition (<xref ref-type="bibr" rid="B6">Dos Santos Silva et al., 2019</xref>). CALR and CANX act as chaperones in protein folding (<xref ref-type="bibr" rid="B19">Knee et al., 2003</xref>), while CTSD is linked to protein degradation (<xref ref-type="bibr" rid="B23">Marques et al., 2020</xref>). Eight feature selection algorithms were employed: LASSO, Elastic Net, Random Forest, XGBoost, Boruta, Stepwise Regression, Genetic Algorithm, and Decision Tree. Data preprocessing included normalization and removal of samples with missing values. The analysis was implemented in R with parallel computing for efficiency. Core genes selected by at least four algorithms were aggregated as the final output.</p>
</sec>
<sec id="S2.SS6">
<label>2.6</label>
<title>Discovering transcription factors linked to MLEC, TMEM59, and glial activation in AD</title>
<p>To identify pivotal TFs associated with AD, the ChIPBase database was leveraged to predict 44 TFs regulating the expression of four genes of interest: GFAP, FOS, MLEC, and TMEM59. These genes were selected based on their relevance to glial cell differentiation, a process intricately linked to AD pathogenesis (<xref ref-type="bibr" rid="B9">Giesen et al., 2003</xref>). The VennDiagram package in R was used to perform intersection analysis, identifying common TFs among GFAP, FOS, MLEC, and TMEM59.</p>
<p>Subsequently, the GSE5281 and GSE48350 datasets were analyzed, providing comprehensive gene expression profiles from brain regions relevant to AD and aging. Seven machine learning algorithms&#x2014;Lasso, Elastic Net, Adaptive Lasso, XGBoost, Boruta, Genetic Algorithm, and Linear Discriminant Analysis (LDA)&#x2014;were applied to assess the significance of the predicted TFs. Each algorithm was optimized for feature selection, balancing model complexity and predictive accuracy. The performance of these algorithms was quantified by the selection rate, defined as the ratio of selected genes to the total number of genes considered. Core biomarkers consistently recognized by at least four algorithms were identified as key contributors to AD pathogenesis. The R programming language was used for these analyses, with specific packages such as &#x201C;glmnet,&#x201D; &#x201C;xgboost,&#x201D; &#x201C;Boruta,&#x201D; &#x201C;GA,&#x201D; and &#x201C;MASS&#x201D; playing critical roles.</p>
</sec>
<sec id="S2.SS7">
<label>2.7</label>
<title>Investigating the diagnostic value of transcription factors associated with MLEC, TMEM59, and microglial activation in AD</title>
<p>The diagnostic performance of machine learning algorithms in AD was evaluated using a panel of genes associated with N-glycosylation and AD pathogenesis. The analysis focused on 13 molecular features: APP, CALR, CANX, CTSD, MAX, MGAT1, MGAT2, MGAT3, MGAT5, MLEC, STT3A, STT3B, and TMEM59. Machine learning algorithms, including logistic regression, support vector machines (linear and radial kernels), random forest, XGBoost, k-nearest neighbors (KNN), LDA, naive Bayes, and decision trees, were implemented to assess diagnostic accuracy. Feature selection was performed using random forest-based importance scoring, and models were trained and validated using five-fold repeated cross-validation. Key performance metrics, including AUC, accuracy, sensitivity, specificity, and F1 score, were calculated for each algorithm and gene combination. The analysis was conducted in R, leveraging packages such as &#x201C;caret,&#x201D; &#x201C;pROC,&#x201D; &#x201C;xgboost,&#x201D; and &#x201C;randomForest.&#x201D;</p>
</sec>
<sec id="S2.SS8">
<label>2.8</label>
<title>Machine learning and SHAP analysis of N-glycosylation-related molecular features in AD</title>
<p>The GSE5281 and GSE48350 datasets were used to analyze 13 molecular features selected based on their roles in N-glycosylation pathways and potential involvement in AD pathogenesis. These features included APP, CALR, CANX, CTSD, MAX, MGAT1, MGAT2, MGAT3, MGAT5, MLEC, STT3A, STT3B, and TMEM59. Data preprocessing involved normalization, median imputation for missing values, and removal of zero-variance features. Feature selection was conducted using a hybrid approach combining random forest importance scoring (ntree = 100) and logistic regression coefficients. Predictive models were built using random forest (ntree = 100), XGBoost (nrounds = 50), and logistic regression with interaction terms. Model performance was evaluated using AUC-ROC, with random forest and XGBoost achieving perfect AUC values of 1.0. SHAP values were calculated to interpret feature contributions, and molecular interactions were analyzed using heatmaps and interaction plots. All analyses were implemented in R, with visualization supported by packages such as &#x201C;ggplot2,&#x201D; &#x201C;pheatmap,&#x201D; and SHAP tools.</p>
<p>The workflow implemented in our study is depicted in <xref ref-type="fig" rid="F1">Figure 1</xref>, detailed accession numbers and information of the GEO datasets we used can be found in <xref ref-type="supplementary-material" rid="FS1">Supplementary Figure 1</xref>.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>Workflow of the integrated bibliometric and bioinformatics analysis for identifying N-glycosylation-related genes in Alzheimer&#x2019;s disease (AD).</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnagi-17-1597511-g001.tif">
<alt-text content-type="machine-generated">Flowchart depicting the research process for identifying biomarkers in Alzheimer&#x2019;s Disease (AD). It begins with bibliometric analysis and thematic evolution, then integrates multi-omic data and feature selection. Differential expression reveals significant genes. Model development and interpretation involve logistic regression and SHAP analysis. Transcription factor analysis identifies MAX as a regulator. Molecular interactions highlight TMEM106-MLEC correlation. The process emphasizes TMEM59 and MLEC as biomarkers, with potential for diagnostic validation and insights into N-glycosylation pathway dysregulation in AD pathogenesis.</alt-text>
</graphic>
</fig>
</sec>
</sec>
<sec id="S3" sec-type="results">
<label>3</label>
<title>Results</title>
<sec id="S3.SS1">
<label>3.1</label>
<title>Thematic evolution and research trends</title>
<p>Thematic evolution analysis of the N-glycosylation&#x2013;AD field from 2001 to 2025 revealed sustained focus on AD&#x2019;s interactions with core biological and biochemical processes, with N-glycosylation serving as a central connecting theme (<xref ref-type="fig" rid="F2">Figure 2A</xref>). Between 2021 and 2024, research narrowed its focus to granular molecular and biochemical topics, including &#x201C;cleavage,&#x201D; &#x201C;protein,&#x201D; &#x201C;mechanisms,&#x201D; &#x201C;mass-spectrometry,&#x201D; and &#x201C;peptide,&#x201D; reflecting a shift toward dissecting AD pathogenesis at the molecular level to advance diagnostics and therapeutics. Thematic quadrant analysis (2021&#x2013;2024) further clarified research priorities (<xref ref-type="fig" rid="F2">Figure 2B</xref>): the upper-right quadrant (&#x201C;Motor Themes&#x201D;) included high-centrality and high-density topics such as &#x201C;Alzheimer&#x2019;s disease expression&#x201D; and &#x201C;glycosylation,&#x201D; which bridge research directions; the upper-left quadrant (&#x201C;Niche Themes&#x201D;) featured specialized subfields like &#x201C;nervous-system&#x201D; and &#x201C;receptor subunits&#x201D;; the lower-right quadrant (&#x201C;Basic Themes&#x201D;) highlighted foundational concepts such as &#x201C;biomarker&#x201D; and &#x201C;dementia&#x201D;; and the lower-left quadrant (&#x201C;Emerging/Declining Themes&#x201D;) included evolving interests like &#x201C;tau-protein&#x201D; and &#x201C;peptide.&#x201D;</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>Thematic evolution and research trends. <bold>(A)</bold> The temporal progression of research themes in N-glycosylation and Alzheimer&#x2019;s disease (AD) from 2001 to 2025, underscoring N-glycosylation as a pivotal theme in AD research. <bold>(B)</bold> Categorizes research priorities from 2021 to 2024 into four distinct categories: motor themes (high centrality and density), niche themes (specialized subfields), basic themes (foundational concepts), and emerging/declining themes. <bold>(C)</bold> Identifies key research clusters, including glycomic analysis of AD clinical samples and the role of GNT-III in glycan maturation. <bold>(D)</bold> Focuses on Cluster #3, highlighting keywords such as &#x201C;glycosylation,&#x201D; &#x201C;glycan,&#x201D; and &#x201C;Alzheimer&#x2019;s beta secretase,&#x201D; and emphasizing the relevance of GNT-III to AD pathology.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnagi-17-1597511-g002.tif">
<alt-text content-type="machine-generated">A collage of four graphical visualizations. A: Sankey diagram shows research topics and their transitions across five time periods from 2001 to 2025. B: Quadrant chart categorizes ideas based on exploration and exploitation axes. C: Circular clustering of text elements, color-coded by years 2015 to 2024, highlighting topic evolution. D: Heatmap with a timeline from 2015 to 2025, intensity indicating degree level, and topics listed along the horizontal axis.</alt-text>
</graphic>
</fig>
<p>CiteSpace-generated circular thematic landscapes identified high-confidence clusters (<xref ref-type="fig" rid="F2">Figure 2C</xref>): Cluster #0 (&#x201C;cerebrospinal fluid,&#x201D; &#x201C;N-glycan profile,&#x201D; silhouette coefficient: 0.726) emphasized glycomic analysis of AD clinical samples; Cluster #1 (&#x201C;neurodegenerative diseases,&#x201D; silhouette coefficient: 0.712) contextualized N-glycosylation within broader neurodegeneration; and Cluster #3 (&#x201C;GNT-III activity,&#x201D; silhouette coefficient: 0.909) focused on enzymatic regulation of glycan maturation. Nodes such as &#x201C;cerebrospinal fluid&#x201D; and &#x201C;N-glycosylation&#x201D; exhibited high centrality, underscoring their role in connecting clusters. Deep analysis of Cluster #3 revealed high-degree keywords&#x2014;&#x201C;glycosylation,&#x201D; &#x201C;glycan,&#x201D; and &#x201C;Alzheimer&#x2019;s beta secretase&#x201D;&#x2014;highlighting their relevance to GNT-III&#x2013;mediated AD pathways (<xref ref-type="fig" rid="F2">Figure 2D</xref>). Temporal tracking (2015&#x2013;2025) showed sustained interest in keywords such as &#x201C;oxidative stress&#x201D; and &#x201C;cell adhesion,&#x201D; indicating growing focus on GNT-III&#x2019;s role in AD-related pathology.</p>
</sec>
<sec id="S3.SS2">
<label>3.2</label>
<title>Keyword analysis and research focus</title>
<p>The three-field plot visualized the interconnections between key literature, institutions, and keywords in the N-glycosylation&#x2013;AD field (<xref ref-type="fig" rid="F3">Figure 3A</xref>). Influential studies, including &#x201C;Vassar R (1999) Science,&#x201D; &#x201C;Kizuka Y (2016) Biochem J,&#x201D; &#x201C;Ohtsubo K (2006) Cell,&#x201D; &#x201C;Gizaw ST (2016) BBA-Gen Subjects,&#x201D; and &#x201C;Kizuka Y (2015) EMBO Mol Med,&#x201D; formed the backbone of current understanding of N-glycosylation&#x2019;s role in AD pathogenesis. These works were closely linked to leading institutions such as Karolinska Institutet, the University of California System, and Fukushima Medical University, underscoring their dominant contributions to the field. Core keywords included &#x201C;glycosylation,&#x201D; &#x201C;Alzheimer&#x2019;s disease,&#x201D; and &#x201C;amyloid-beta,&#x201D; while &#x201C;bisecting GlcNAc&#x201D; emerged as a prominent focus, signaling growing interest in this specific glycan modification&#x2019;s role in AD.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>Keyword analysis and research focus. <bold>(A)</bold> Presents a three-field plot linking key studies, institutions, and keywords such as &#x201C;glycosylation,&#x201D; &#x201C;Alzheimer&#x2019;s disease,&#x201D; and &#x201C;amyloid-beta.&#x201D; <bold>(B)</bold> Tracks keyword bursts, revealing early interest in &#x201C;N-linked glycosylation&#x201D; and recent focus on &#x201C;pathology&#x201D; and &#x201C;tau.&#x201D; <bold>(C)</bold> Displays a network visualization with &#x201C;Alzheimer&#x2019;s disease&#x201D; as the central node. <bold>(D)</bold> Highlights recent thematic priorities via warm red node coloring, emphasizing &#x201C;bisecting GlcNAc&#x201D; and &#x201C;GNT-III (MGAT3)&#x201D; as emerging research focuses.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnagi-17-1597511-g003.tif">
<alt-text content-type="machine-generated">A four-panel composite image depicting Alzheimer&#x2019;s disease research trends. Panel A shows a complex flow of research topics connected by grey lines, highlighting shifts in focus over time. Panel B is a table listing keywords related to Alzheimer&#x2019;s research, displaying their prominence from 2001 to 2025 via offset colored bars. Panel C is a network map with nodes representing related terms, densely connected by lines, with &#x201C;Alzheimer&#x2019;s disease&#x201D; as a central node. Panel D also shows a network map similar to Panel C, color-coded by time to illustrate the evolution of research interactions.</alt-text>
</graphic>
</fig>
<p>The keyword burst detection chart (<xref ref-type="fig" rid="F3">Figure 3B</xref>) tracked high-impact temporal trends (2001&#x2013;2025): early bursts included &#x201C;N-linked glycosylation&#x201D; (strength = 3.65) and &#x201C;protein glycosylation&#x201D; (strength = 3.85), reflecting foundational interest in glycan biology. More recently, &#x201C;pathology&#x201D; (strength = 3.63) and &#x201C;tau&#x201D; (strength = 3.72) have sustained bursts since 2020 (projected through 2025), indicating current focus on disease mechanisms. &#x201C;Bisecting GlcNAc&#x201D; also showed a notable burst (strength = 2.74) from 2016 to 2018, underscoring its historical relevance as a specialized research topic. Network visualizations further delineated thematic connections in N-glycosylation&#x2013;AD research (<xref ref-type="fig" rid="F3">Figures 3C,D</xref>). In <xref ref-type="fig" rid="F3">Figure 3C</xref>, &#x201C;Alzheimer&#x2019;s disease&#x201D; was the most central node (57 occurrences, total link strength = 134), reflecting its status as the field&#x2019;s core focus, while &#x201C;N-glycosylation&#x201D; (24 occurrences, total link strength = 58) confirmed its pivotal role in AD pathological processes. <xref ref-type="fig" rid="F3">Figure 3D</xref> highlighted recent thematic priorities via warm red node coloring: &#x201C;N-glycosylation&#x201D; (24 occurrences, link strength = 58) remained central, &#x201C;Bisecting GlcNAc&#x201D; (8 occurrences, link strength = 27) showed sustained relevance in recent studies, and &#x201C;GNT-III (MGAT3)&#x201D; (2 occurrences, link strength = 7) emerged as a novel focus, consistent with its role in regulating bisecting GlcNAc formation.</p>
</sec>
<sec id="S3.SS3">
<label>3.3</label>
<title>Key differentially expressed and N-glycosylation-related genes in Alzheimer&#x2019;s disease identified via machine learning</title>
<p>Differential expression analysis of the GSE5281 dataset identified 6,845 differentially expressed genes (DEGs) (<xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref>), visualized in the volcano plot (<xref ref-type="fig" rid="F4">Figure 4A</xref>). The plot highlights significant upregulated genes (red) and downregulated genes (blue), with gray dots representing non-significant genes. Notably, TMEM59 was identified as a significantly downregulated gene, while MLEC and MAX were significantly upregulated in AD-affected samples compared to controls. The intersection of DEGs with N-glycosylation-related genes from the GeneCards database yielded 39 overlapping genes.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p>Key differentially expressed and N-glycosylation-related genes. <bold>(A&#x2013;F)</bold> Focus on the GSE5281 dataset: a volcano plot <bold>(A)</bold> highlights significant upregulated (red) and downregulated (blue) differentially expressed genes (DEGs); regression coefficient paths and feature impact bar charts from Lasso <bold>(B,D)</bold>, Elastic-Net <bold>(C)</bold>, and Adaptive Lasso <bold>(E)</bold> reveal genes such as UNG, MLEC, and MAN1C1 as top contributors; a Venn diagram <bold>(F)</bold> underscores overlapping genes across these methods, identifying robust biomarker candidates. <bold>(G&#x2013;L)</bold> Shift to the GSE122063 dataset: regression coefficient paths and feature impact bar charts <bold>(G&#x2013;J)</bold> highlight DAD1, MLEC, and TMEM59 as key genes; a Venn diagram <bold>(K)</bold> illustrates overlapping genes across methods; comparative analysis <bold>(L)</bold> reveals consistent candidates such as MLEC and TMEM59 between the two datasets.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnagi-17-1597511-g004.tif">
<alt-text content-type="machine-generated">Panel A shows a volcano plot with colored dots indicating differential gene expression. Panel B and G show Lasso regression plots, while C and H display Elastic-Net regression plots. Panels D, E, I, and J compare feature impacts using standardized coefficients, highlighting directionality. Panels F and K are Venn diagrams illustrating gene intersections across methods. Panel L is another Venn diagram showing overlap between GSE5281 and GSE122063 datasets.</alt-text>
</graphic>
</fig>
<p>In <xref ref-type="fig" rid="F4">Figure 4B</xref>, the Lasso regression coefficient path reveals the influence of various genes on the model, with genes like UNG and MLEC showing the most significant impact as indicated by their steep paths. <xref ref-type="fig" rid="F4">Figure 4C</xref> presents the Elastic-Net regression coefficient path, where genes such as UNG, MLEC, and MAN1C1 are prominently featured, demonstrating their importance in the model with considerable standardized coefficients. <xref ref-type="fig" rid="F4">Figure 4D</xref> showcases the Lasso feature impact, highlighting the top genes selected by absolute coefficient. UNG, MLEC, and MAN1C1 are at the forefront, with UNG having the highest positive coefficient, suggesting its strong association with the disease. <xref ref-type="fig" rid="F4">Figure 4E</xref> illustrates the Adaptive Lasso feature impact, where UNG again leads with the highest positive coefficient, followed by MAN1C1 and MLEC. This figure emphasizes the genes&#x2019; significance in the context of AD, with a clear distinction between positive and negative contributions. Lastly, <xref ref-type="fig" rid="F4">Figure 4F</xref> provides a Venn diagram of the key gene intersections across the three methods, showing a core set of genes that are consistently selected: ADPRH, ALG12, ALG8, ARSG, CXCR4, DNPH1, MAGT1, MAN1C1, MLEC, NOX4, NTRK1, ORAI1, SLC3A2, SRD5A3, TMEM59, and UNG. This overlap underscores the robustness of these genes as potential biomarkers for AD, as they are identified across different regularization techniques.</p>
<p>The analysis of the GSE122063 dataset using machine learning algorithms identified a set of genes with potential significance in AD. <xref ref-type="fig" rid="F4">Figure 4G</xref> illustrates the Lasso regression results, where genes such as DAD1 and MLEC exhibit the highest absolute standardized coefficients, indicating their strong influence on the model. <xref ref-type="fig" rid="F4">Figure 4H</xref> presents the Elastic-Net regression outcomes, showing a similar pattern with DAD1 and MLEC being prominent, alongside other genes like ALG8 and MOGS. <xref ref-type="fig" rid="F4">Figure 4I</xref> highlights the top features selected by Lasso regression, with DAD1, MLEC, and ALG8 having the most substantial impact. The Adaptive Lasso results in <xref ref-type="fig" rid="F4">Figure 4J</xref> show DAD1 as the most influential gene, followed by TMEM59 and MLEC, pointing to their crucial role in the disease pathology. The Venn diagram in <xref ref-type="fig" rid="F4">Figure 4K</xref> reveals the intersection of genes selected by all three methods, with ALG1, ALG3, ALG8, DAD1, MLEC, MOGS, ORAI1, PRKCSH, RPN1, SLC3A2, TMEM59, and TMEM59L being consistently identified across Lasso, Elastic-Net, and Adaptive Lasso, underscoring their potential as biomarkers. Finally, <xref ref-type="fig" rid="F4">Figure 4L</xref> compares the gene selection between the GSE5281 and GSE122063 datasets, showing an overlap in ALG8, MLEC, ORAI1, SLC3A2, and TMEM59, which further validates their importance in AD research.</p>
</sec>
<sec id="S3.SS4">
<label>3.4</label>
<title>Validation of gene biomarkers in DEGs and nomogram development for Alzheimer&#x2019;s disease diagnosis</title>
<p>The analysis of the GSE48350 dataset focusing on five genes yielded several significant findings. The logistic regression model achieved a test AUC of 0.594, indicating moderate predictive accuracy. <xref ref-type="fig" rid="F5">Figure 5A</xref> presents the model performance comparison across various machine learning algorithms, showcasing the logistic regression model&#x2019;s moderate predictive power. <xref ref-type="fig" rid="F5">Figure 5B</xref> illustrates the ranking of the five genes by their importance across different models, with TMEM59 showing the highest mean importance. <xref ref-type="fig" rid="F5">Figure 5C</xref> depicts the molecular signature expression patterns, highlighting a clear distinction between control and disease groups, particularly for TMEM59, which exhibited the most pronounced differential expression. <xref ref-type="fig" rid="F5">Figure 5D</xref> presents the clinical prediction nomogram, which achieved a model performance AUC of 0.699, indicating a reasonable ability to discriminate between control and disease states. This nomogram assigns points to each gene based on its standardized coefficient, with TMEM59 receiving the highest point allocation, significantly contributing to the total points and thus the risk probability of AD. These results underscore the potential utility of these genes as diagnostic biomarkers for AD, with TMEM59 emerging as a particularly promising candidate due to its significant differential expression and high feature importance across models.</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption><p>Validation of gene biomarkers and diagnostic models. <bold>(A&#x2013;D)</bold> Analyze the GSE48350 dataset: model performance is compared across machine learning algorithms <bold>(A)</bold>; TMEM59 emerges as the most important gene through rankings <bold>(B)</bold>; molecular signature expression patterns distinguish control and disease groups <bold>(C)</bold>; a clinical prediction nomogram <bold>(D)</bold> validates TMEM59&#x2019;s role. <bold>(E&#x2013;H)</bold> Focus on the GSE5281 dataset: logistic regression and Random Forest models achieve AUC scores of 0.8566 and 0.8505 <bold>(E)</bold>; a nomogram <bold>(F)</bold> achieves an AUC of 0.899, emphasizing TMEM59&#x2019;s contribution; feature importance analysis <bold>(G)</bold> and expression patterns <bold>(H)</bold> highlight TMEM59 and MLEC as critical drivers. &#x002A;<italic>p</italic> &#x003C; 0.05, &#x002A;&#x002A;<italic>p</italic> &#x003C; 0.01, &#x002A;&#x002A;&#x002A;<italic>p</italic> &#x003C; 0.001.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnagi-17-1597511-g005.tif">
<alt-text content-type="machine-generated">A multi-panel figure showcases various data visualizations related to medical research: A: A line graph comparing model performance with different algorithms, showing true positive rates versus false positive rates. B: A bar chart presenting average feature importances across models. C: A scatter plot of molecular signature expression patterns, divided by experimental groups. D and F: Clinical prediction nomograms outlining scores and risk probabilities. E: A second line graph for model performance, similar to panel A. G: Another bar chart on feature importances, similar to panel B. H: A comparative scatter plot of molecular signature expression levels between control and disease groups.</alt-text>
</graphic>
</fig>
<p>The in-depth analysis of the GSE5281 dataset shed light on the roles of five key genes&#x2014;ALG8, MLEC, ORAI1, SLC3A2, and TMEM59&#x2014;in AD. The performance of various machine learning models was evaluated in <xref ref-type="fig" rid="F5">Figure 5E</xref>, with logistic regression and Random Forest standing out, achieving test AUC scores of 0.8566 and 0.8505, respectively. XGBoost also performed well, with a test AUC of 0.8304. The clinical prediction nomogram in <xref ref-type="fig" rid="F5">Figure 5F</xref>, which incorporates these genes, shows a high model performance AUC of 0.899, indicating its strong predictive power. This nomogram assigns points to each gene based on its expression level, with TMEM59 receiving the most points, suggesting it plays a crucial role in determining the risk probability of AD. <xref ref-type="fig" rid="F5">Figure 5G</xref> highlights the average feature importance across models, with TMEM59 being identified as the most important gene, followed by ALG8 and MLEC. This underscores the significant impact of these genes on the model&#x2019;s predictive capabilities. <xref ref-type="fig" rid="F5">Figure 5H</xref>, which includes MLEC and TMEM59, shows molecular signature expression patterns.</p>
</sec>
<sec id="S3.SS5">
<label>3.5</label>
<title>Characterization of N-glycosylation-related genes correlated with TMEM59 in Alzheimer&#x2019;s disease pathology</title>
<p>The heatmap (<xref ref-type="fig" rid="F6">Figure 6A</xref>) resulting from the analysis of the GSE5281 dataset illustrates the pairwise correlations among the nine selected genes. A significant negative correlation of &#x2013;0.46 was observed between TMEM59 and MLEC, as indicated by the deep blue color in the heatmap. This finding suggests an inverse relationship between the expression levels of these two genes. The hierarchical clustering dendrogram positioned TMEM59 and MLEC in separate clusters, reinforcing the distinctness of their expression profiles among the analyzed genes. In the analysis of the GSE122063 dataset (<xref ref-type="fig" rid="F6">Figure 6B</xref>), a notable correlation was identified between TMEM59 and MLEC, with a correlation coefficient of 0.52, indicating a moderate positive relationship. This finding suggests that the expression levels of TMEM59 and MLEC may be associated in the samples analyzed from individuals with AD. Analyzing the GSE122063 dataset, <xref ref-type="fig" rid="F6">Figure 6C</xref> shows that MLEC expression levels are increased in the disease group compared to controls.</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption><p>Molecular interactions of TMEM59. <bold>(A)</bold> (GSE5281) shows a heatmap revealing a strong negative correlation between TMEM59 and MLEC, with hierarchical clustering distinguishing their expression profiles. <bold>(B)</bold> (GSE122063) highlights a moderate positive correlation between these genes. Panel <bold>(C)</bold> displays elevated MLEC expression in disease groups within GSE122063. <bold>(D&#x2013;F)</bold> (GSE122063) detail gene selection rates, noting the Genetic Algorithm&#x2019;s 100% selection rate, and illustrate the importance of genes like STT3A, CALR, and MLEC across algorithms. <bold>(G&#x2013;I)</bold> (GSE5281) reveal similar trends, with the Genetic Algorithm again achieving a 100% selection rate and underscoring genes like APP, MLEC, and STT3B. Overall, MLEC consistently ranks as a key gene associated with TMEM59 across both datasets. &#x002A;<italic>p</italic> &#x003C; 0.05, &#x002A;&#x002A;<italic>p</italic> &#x003C; 0.01, &#x002A;&#x002A;&#x002A;<italic>p</italic> &#x003C; 0.001.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnagi-17-1597511-g006.tif">
<alt-text content-type="machine-generated">Nine-panel image displaying various scientific charts and graphs. Panels A and B show molecular interaction networks with heatmaps and dendrograms. Panel C illustrates box plots of molecular signature expression patterns for ALGB, MLEC, and ORA1. Panels D and G present bar charts of the number of genes selected by different algorithms. Panels E and H depict heatmaps of gene importance across algorithms. Panels F and I show bar charts of core genes, highlighting their normalized average importance and selection frequency by algorithms.</alt-text>
</graphic>
</fig>
<p>In the GSE122063 dataset, the feature selection algorithms identified STT3A, CALR, and MLEC as the top three genes associated with TMEM59. <xref ref-type="fig" rid="F6">Figure 6D</xref> shows that the Genetic Algorithm selected all genes, achieving a 100% selection rate, while XGBoost selected 9 genes, indicating an 81.8% selection rate. The heatmap in <xref ref-type="fig" rid="F6">Figure 6E</xref> demonstrates the normalized importance of each gene across algorithms, with STT3A, CALR, and MLEC showing the highest importance. <xref ref-type="fig" rid="F6">Figure 6F</xref> summarizes the core genes, highlighting STT3A, CALR, and MLEC as frequently selected by multiple algorithms. In the GSE5281 dataset, the top three genes associated with TMEM59 were APP, MLEC, and STT3B. <xref ref-type="fig" rid="F6">Figure 6G</xref> details the number of genes selected by each algorithm, with the Genetic Algorithm achieving a 100% selection rate and XGBoost selecting 9 genes, indicating an 81.8% selection rate. The heatmap in <xref ref-type="fig" rid="F6">Figure 6H</xref> illustrates the normalized importance of each gene across algorithms, with APP, MLEC, and STT3B consistently showing high importance. <xref ref-type="fig" rid="F6">Figure 6I</xref> confirms these findings, emphasizing APP, MLEC, and STT3B as core genes frequently selected by multiple algorithms. Across both datasets, MLEC is consistently identified as one of the top three genes associated with TMEM59.</p>
</sec>
<sec id="S3.SS6">
<label>3.6</label>
<title>Uncovering transcription factors that regulate MLEC, TMEM59, and glial activation in Alzheimer&#x2019;s disease</title>
<p>The Venn diagram analysis identified 44 shared TFs among the genes GFAP, FOS, MLEC, and TMEM59 (<xref ref-type="fig" rid="F7">Figure 7A</xref>), as detailed in <xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref>. This set of TFs is indicative of a regulatory network that may be involved in the biological processes associated with these genes. The rigorous application of machine learning algorithms to the GSE5281 dataset has yielded a refined list of 17 key TFs from an initial pool of 44 predicted TFs. This selection was based on the consistent recognition by multiple algorithms, indicating their potential significance in AD. The 17 TFs identified as critical include MAX, SMC3, ZBTB7A, TCF12, SMARCC1, EP300, SMARCA4, BRD1, EGR1, BRD9, RXRA, MBD4, ARID2, FLI1, TRIM24, SRF, and CTCF. As depicted in <xref ref-type="fig" rid="F7">Figure 7B</xref>, the Elastic Net and Lasso algorithms demonstrated the highest gene selection rates, suggesting their effectiveness in this context. <xref ref-type="fig" rid="F7">Figure 7C</xref> provides a comparative view of the number of genes selected by each method, highlighting the variability in gene selection across different algorithms. <xref ref-type="fig" rid="F7">Figure 7D</xref> presents a heatmap that ranks gene importance across various methods, offering a visual representation of the relative significance of each TF. <xref ref-type="fig" rid="F7">Figure 7E</xref> specifically emphasizes the core biomarkers (MAX ranked top), selected by at least four methods, underscoring their potential role as key regulators in AD.</p>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption><p>Transcription factors regulating N-glycosylation-related genes. <bold>(A)</bold> Identifies 44 shared transcription factors (TFs) among GFAP, FOS, MLEC, and TMEM59. <bold>(B&#x2013;E)</bold> Focus on GSE5281: <bold>(B)</bold> Elastic Net and Lasso have the highest selection rates. <bold>(C)</bold> Compares selection rates, <bold>(D)</bold> Ranks TF importance via a heatmap, and <bold>(E)</bold> highlights core biomarkers like MAX. <bold>(F&#x2013;I)</bold> Analyze GSE48350: <bold>(F)</bold> Indicates Elastic Net selected the most genes (37), <bold>(G)</bold> Contrasts selection rates, <bold>(H)</bold> ranks TF importance, and Panel I shows selection frequency. <bold>(J)</bold> Presents a Venn diagram of key TFs from both datasets, identifying eight common factors: MAX, ZBTB7A, BRD9, RXRA, MBD4, ARID2, FLI1, and TRIM24.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnagi-17-1597511-g007.tif">
<alt-text content-type="machine-generated">Multiple panels displaying various data visualizations related to gene analysis. Panel A shows a Venn diagram with overlap among GFAP, FOS, TMEAM50, and MLEC. Panels B and F depict bar charts of feature selection by algorithm with varying importance scores. Panel C and G present bar graphs on data interaction showcasing comparisons across different models. Panels D and H offer heatmaps ranking gene importance. Panels E and I display core biomarkers selected by methods with frequency counts. Panel J features a Venn diagram comparing datasets GSE2561 and GSE48350 with overlapping genes.</alt-text>
</graphic>
</fig>
<p>The analysis leveraging the GSE48350 dataset, as depicted in <xref ref-type="fig" rid="F7">Figure 7F</xref>, demonstrates that the Elastic Net algorithm selected the highest number of genes (37), followed by Lasso (32), and Genetic Algorithm and Adaptive Lasso (20 each). In contrast, the Random Forest and XGBoost algorithms selected fewer genes, indicating a lower frequency of gene importance attribution by these methods. <xref ref-type="fig" rid="F7">Figure 7G</xref> provides a comparative overview of the number of genes selected by each method, with the bar chart visually representing the variability in gene selection across different algorithms. <xref ref-type="fig" rid="F7">Figure 7H</xref> presents a heatmap that ranks gene importance across various methods, offering a visual representation of the relative significance of each TF. Notably, genes such as STAG1, MAZ, BRD9, and FLI1 were consistently identified as important across multiple methods, as highlighted in <xref ref-type="fig" rid="F7">Figure 7I</xref>, which also shows the frequency of gene selection by the algorithms. The Venn diagram in <xref ref-type="fig" rid="F7">Figure 7J</xref> illustrates the intersection of key TFs identified from two distinct datasets, GSE5281 and GSE48350. The intersection, indicating the common TFs significant to both datasets, consists of 8 factors: MAX, ZBTB7A, BRD9, RXRA, MBD4, ARID2, FLI1, and TRIM24.</p>
</sec>
<sec id="S3.SS7">
<label>3.7</label>
<title>Evaluating the diagnostic potential of transcription factors linked to MLEC, TMEM59, and microglial activation in Alzheimer&#x2019;s disease</title>
<p>The analysis of the GSE5281 dataset (<xref ref-type="fig" rid="F8">Figures 8A&#x2013;D</xref>) has been conducted with a focus on eight TFs, utilizing machine learning models to predict Alzheimer&#x2019;s disease risk. The results are summarized as follows: <xref ref-type="fig" rid="F8">Figure 8A</xref> presents the ROC curves for three predictive models. The logistic regression model achieved a test AUC of 0.8444, indicating good model performance. The random forest model demonstrated a slightly higher discriminative ability with a test AUC of 0.8715, while the XGBoost model showed a test AUC of 0.8601. These AUC values suggest that all models are effective in distinguishing between disease and control samples, with the random forest model performing the best among the three. In <xref ref-type="fig" rid="F8">Figure 8B</xref>, the boxplot analysis reveals significant differences in the expression levels of TFs between control and disease groups. MAX, in particular, shows a notable higher median expression in control samples compared to disease samples, suggesting its potential role in the disease&#x2019;s pathogenesis.</p>
<fig id="F8" position="float">
<label>FIGURE 8</label>
<caption><p>Diagnostic potential of transcription factors. <bold>(A&#x2013;D)</bold> Analyze the GSE5281 dataset: ROC curves compare predictive models <bold>(A)</bold>, with Random Forest achieving the highest AUC of 0.8715; boxplots reveal significant differences in TF expression between control and disease groups <bold>(B)</bold>; a clinical prediction nomogram <bold>(C)</bold> achieves an AUC of 0.985; feature importance analysis <bold>(D)</bold> ranks MAX as the most influential TF. <bold>(E&#x2013;H)</bold> Focus on the GSE48350 dataset: logistic regression demonstrates the highest AUC of 0.772 <bold>(E)</bold>; differential expression analysis supports these findings <bold>(F)</bold>; a nomogram achieves an AUC of 0.74 <bold>(G)</bold>; feature importance analysis highlights significance in BRD9 and MAX <bold>(H)</bold>. &#x002A;<italic>p</italic> &#x003C; 0.05, &#x002A;&#x002A;<italic>p</italic> &#x003C; 0.01, &#x002A;&#x002A;&#x002A;<italic>p</italic> &#x003C; 0.001.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnagi-17-1597511-g008.tif">
<alt-text content-type="machine-generated">Grouped visuals illustrating model performance, molecular signature expression patterns, and clinical prediction nomograms. Panels A and E show ROC curves comparing model performance. Panels B and F present box plots of molecular signature expression between control and disease groups. Panels C and G are clinical prediction nomograms detailing individual feature contributions to risk prediction. Panels D and H are bar charts summarizing average feature importance across models.</alt-text>
</graphic>
</fig>
<p><xref ref-type="fig" rid="F8">Figure 8C</xref> features a clinical prediction nomogram that integrates the influence of various TFs to estimate disease risk. The nomogram is based on a logistic regression model with an AUC of 0.985, indicating excellent predictive power. For instance, a patient with a MAX expression level of 6.5 would receive 30 points, which, when combined with points from other factors, contributes to a total score that predicts disease risk. <xref ref-type="fig" rid="F8">Figure 8D</xref> illustrates the average feature importance across the three models, with MAX being the most influential, followed by ZBTB7A, BRD9, and RXRA. The normalized mean importance scores highlight MAX&#x2019;s predominant role in the model&#x2019;s predictive accuracy, with a score of 1.000, significantly higher than the other factors.</p>
<p>The current study (<xref ref-type="fig" rid="F8">Figures 8E&#x2013;H</xref>) analyzed the expression patterns and diagnostic potential of eight TFs (MAX, ZBTB7A, BRD9, RXRA, MBD4, ARID2, FLI1, and TRIM24) in AD using the GSE48350 dataset, which comprises 253 samples from postmortem brain tissues. <xref ref-type="fig" rid="F8">Figure 8E</xref> illustrates the receiver operating characteristic (ROC) curves for the three machine learning models. The logistic regression model demonstrated the highest area under the curve (AUC) of 0.772, followed by RF (AUC = 0.734) and XGBoost (AUC = 0.707). The performance gap between training and testing datasets was minimal for all models, indicating robust generalizability. <xref ref-type="fig" rid="F8">Figure 8F</xref> presents the differential expression analysis of five TFs (ARID2, BRD9, FLI1, MAX, and RXRA) between control and disease groups. Significant differences in expression levels were observed for all analyzed TFs. Notably, MAX exhibited elevated expression in the disease group compared to the control group. <xref ref-type="fig" rid="F8">Figure 8G</xref> displays a clinical prediction nomogram based on the logistic regression model, with an AUC of 0.74. The nomogram assigns points to each TF based on its expression level, and the total points are converted into a risk probability. MAX contributes significantly to the risk score, alongside BRD9 and ARID2. Specifically, higher expression levels of MAX are associated with an increased risk probability, underscoring its role as an important predictor in the model. <xref ref-type="fig" rid="F8">Figure 8H</xref> highlights the average feature importance of the eight TFs across the three machine learning models. BRD9 ranked highest with a normalized importance of 1.000, followed by ARID2 (0.860) and MAX (0.633). The integrated importance scores reflect the relative contribution of each TF to the diagnostic model. MAX emerges as one of the most critical features, with its importance supported by both its differential expression and its substantial contribution to the model&#x2019;s predictive power.</p>
</sec>
<sec id="S3.SS8">
<label>3.8</label>
<title>Optimizing diagnostic accuracy with key molecules (MAX, MLEC, TMEM59) and N-glycosylation-related genes across different algorithms</title>
<p>The diagnostic performance of machine learning algorithms was evaluated using the GSE5281 dataset, with a focus on achieving optimal diagnostic efficacy with fewer gene combinations. <xref ref-type="fig" rid="F9">Figure 9A</xref> illustrates a radar plot comparing key performance metrics across algorithms. Logistic regression demonstrated superior performance in accuracy, specificity, and F1 score, while XGBoost and SVM linear showed strong AUC values. <xref ref-type="fig" rid="F9">Figure 9B</xref> highlights algorithm stability through boxplots of AUC distributions. Logistic regression and SVM linear exhibited the highest median AUC values with minimal variability, indicating robustness. <xref ref-type="fig" rid="F9">Figure 9C</xref> displays a performance matrix heatmap for different gene combinations and algorithms. Notably, logistic regression achieved an AUC of 0.947 using only three genes (MAX, APP, MLEC), and an AUC of 0.948 with five genes (MAX, APP, MLEC, TMEM59, MGAT3). Even with a single gene (MAX), logistic regression demonstrated an AUC of 0.898, underscoring its diagnostic utility. The findings highlight the importance of MAX as a critical feature in gene panels, with logistic regression emerging as the most stable and accurate algorithm for minimizing the number of genes while maximizing diagnostic efficacy.</p>
<fig id="F9" position="float">
<label>FIGURE 9</label>
<caption><p>Optimization of diagnostic models. <bold>(A&#x2013;C)</bold> Analyze the GSE5281 dataset: a radar plot compares key performance metrics across algorithms <bold>(A)</bold>, with logistic regression demonstrating superior accuracy; AUC stability is assessed through boxplots <bold>(B)</bold>, further validating logistic regression and SVM linear; a performance matrix heatmap <bold>(C)</bold> underscores logistic regression&#x2019;s efficiency, achieving an AUC of 0.947 with three genes (MAX, APP, MLEC). <bold>(D&#x2013;F)</bold> Shift to the GSE48350 dataset: Naive Bayes achieves the highest mean AUC of 0.723 using five genes <bold>(D)</bold>; algorithm stability is highlighted <bold>(E)</bold>; performance across gene combinations is shown in heatmap matrices <bold>(F)</bold>.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnagi-17-1597511-g009.tif">
<alt-text content-type="machine-generated">Image consisting of multiple data visualizations related to machine learning and gene analysis. Panel A and D show multi-metric performance radar plots with various color-coded lines representing different parameters. Panels B and E display algorithm stability analysis using box plots for different machine learning algorithms. Panels C and F present heatmaps titled &#x201C;Performance Matrix: Gene Combination vs Algorithms,&#x201D; displaying AUC values with a gradient color scheme. Each visualization compares different gene and algorithm combinations, highlighting performance variations.</alt-text>
</graphic>
</fig>
<p>The analysis of the GSE48350 dataset, comprising 253 samples, evaluated the diagnostic performance of machine learning algorithms across different gene combinations related to N-glycosylation and AD. <xref ref-type="fig" rid="F9">Figure 9D</xref> illustrates the multi-metric performance of algorithms, with Naive Bayes achieving the highest mean AUC of 0.723 using the five-gene combination of MAX, MGAT3, APP, STT3A, and CTSD, supported by an accuracy of 0.702, sensitivity of 0.873, specificity of 0.331, and an F1 score of 0.8. <xref ref-type="fig" rid="F9">Figure 9E</xref> shows the stability of algorithms through boxplots of AUC distributions, with Naive Bayes demonstrating robust stability with a median AUC above 0.70, while algorithms such as Decision Tree and SVM Linear exhibited greater variability. <xref ref-type="fig" rid="F9">Figure 9F</xref> provides a heat map matrix comparing algorithm performance across gene combinations, indicating that Naive Bayes achieved the highest AUC of 0.723 with five genes (MAX, MGAT3, APP, STT3A, CTSD). Even with fewer genes, Naive Bayes maintained strong diagnostic efficacy, achieving an AUC of 0.704 with three genes (MAX, MGAT3, APP) and an AUC of 0.644 with a single gene (MAX). While the five-gene combination yielded the highest diagnostic performance, the three-gene combination offers a balance between diagnostic accuracy and clinical practicality. The MAX gene was identified as the most critical single gene in this analysis.</p>
</sec>
<sec id="S3.SS9">
<label>3.9</label>
<title>Key molecular features and interactions in Alzheimer&#x2019;s disease revealed by machine learning and SHAP analysis</title>
<p>The analysis of <xref ref-type="fig" rid="F10">Figures 10A&#x2013;K</xref> provides a comprehensive overview of the key molecular features and interactions driving AD classification in the GSE5281 dataset. The random forest and XGBoost models demonstrated exceptional performance with perfect AUC values of 1.0, outperforming other algorithms such as SVM Radial (AUC = 0.976) and Elastic Net (AUC = 0.953) (<xref ref-type="fig" rid="F10">Figure 10A</xref>). Feature importance analysis identified MAX, APP, MLEC, and TMEM59 as the most impactful predictors, with MAX showing the highest mean absolute SHAP value of 0.195 (<xref ref-type="fig" rid="F10">Figures 10B,C</xref>). The SHAP value distribution (<xref ref-type="fig" rid="F10">Figure 10B</xref>) and feature value vs. SHAP impact analysis (<xref ref-type="fig" rid="F10">Figure 10E</xref>) revealed nonlinear relationships between feature expression levels and their contributions to predictions. MAX and APP exhibited strong positive impacts, with their contributions increasing sharply at higher expression levels. MLEC and TMEM59 also showed significant contributions, though their dependencies on expression levels were weaker. The force plot (<xref ref-type="fig" rid="F10">Figure 10D</xref>) and individual prediction decomposition (<xref ref-type="fig" rid="F10">Figure 10F</xref>) further highlighted the dominant roles of MAX, MLEC, APP, and TMEM59 in driving prediction scores, with MAX contributing the most significantly. Interaction analysis uncovered several key relationships. The interaction between MLEC and TMEM59 was highly significant (<italic>p</italic> = 0.00019, &#x03B2; = 2.469), with higher expression of both genes correlating with increased predicted probabilities (<xref ref-type="fig" rid="F10">Figure 10I</xref>). Conversely, the interaction between MAX and MGAT3 (<italic>p</italic> = 0.0288, &#x03B2; = &#x2013;4.491) showed a strong negative effect, where higher MAX expression reduced predicted probabilities when MGAT3 expression was low (<xref ref-type="fig" rid="F10">Figure 10H</xref>). The interaction between APP and MLEC (<italic>p</italic> = 0.03068, &#x03B2; = &#x2013;1.473) also demonstrated a significant negative effect (<xref ref-type="fig" rid="F10">Figure 10G</xref>). While some interactions, such as MAX &#x00D7; MLEC (<italic>p</italic> = 0.3604, &#x03B2; = &#x2013;1.439) and APP &#x00D7; MGAT3 (<italic>p</italic> = 0.3216, &#x03B2; = &#x2013;0.827), did not reach statistical significance, they suggested potential regulatory trends worth further investigation (<xref ref-type="fig" rid="F10">Figures 10J,K</xref>).</p>
<fig id="F10" position="float">
<label>FIGURE 10</label>
<caption><p>Key molecular features and interactions. <bold>(A&#x2013;K)</bold> Focus on the GSE5281 dataset: ROC curves demonstrate Random Forest and XGBoost achieving perfect AUC values of 1.0 <bold>(A)</bold>; SHAP value distribution <bold>(B)</bold> and feature importance analysis <bold>(C)</bold> identify MAX, APP, MLEC, and TMEM59 as top predictors; a force plot <bold>(D)</bold> emphasizes their dominant roles in driving prediction scores. <bold>(E&#x2013;K)</bold> Explore nonlinear relationships and significant interactions: interactions include a positive effect between MLEC and TMEM59 (<italic>p</italic> = 0.00019) and a negative effect between MAX and MGAT3 (<italic>p</italic> = 0.0288).</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnagi-17-1597511-g010.tif">
<alt-text content-type="machine-generated">Composite image displaying multiple charts related to feature analysis and predictive modeling. Panel A shows Receiver Operating Characteristics (ROC) curves. Panel B presents a feature importance plot. Panels C and D contain bar charts of feature significance and predictive feature coefficients. Panel E illustrates SHAP impact on non-linear feature-value relationships. Panel F features individualized prediction decomposition. Panels G to K depict interaction plots with color gradients indicating expression levels. The overall focus is on analyzing predictive features and model performance.</alt-text>
</graphic>
</fig>
<p>The analysis of N-glycosylation-related molecular features in AD using the GSE48350 dataset provided a comprehensive understanding of their roles and interactions through 11 key visualizations. The Random Forest model demonstrated exceptional classification performance with an AUC of 1.0 (<xref ref-type="fig" rid="F11">Figure 11A</xref>), outperforming other algorithms such as XGBoost (AUC = 0.976) and SVM Radial (AUC = 0.852). This result was supported by SHAP value distributions (<xref ref-type="fig" rid="F11">Figure 11B</xref>), which identified APP and MAX as the most impactful features due to their high SHAP values. The feature importance analysis (<xref ref-type="fig" rid="F11">Figure 11C</xref>) further confirmed APP and MAX as the top predictors, followed by MGAT3 and STT3A. Force plots (<xref ref-type="fig" rid="F11">Figure 11D</xref>) and waterfall plots (<xref ref-type="fig" rid="F11">Figure 11F</xref>) detailed the directional contributions of these features to prediction scores, with MAX showing a strong positive impact and CTSD a significant negative impact. The non-linear relationships between feature expression values and SHAP impacts (<xref ref-type="fig" rid="F11">Figure 11E</xref>) revealed that APP and MAX had positive correlations with AD classification, while STT3A demonstrated a negative correlation. MGAT3 exhibited a non-linear trend, highlighting its complex role in AD pathology. Interaction analyses uncovered significant synergies between molecular pairs. The interaction between MAX and MGAT1 (<xref ref-type="fig" rid="F11">Figure 11G</xref>) showed a strong positive effect (<italic>P</italic> = 0.0016, &#x03B2; = 7.7), while the interaction between MGAT3 and STT3A (<xref ref-type="fig" rid="F11">Figure 11I</xref>) demonstrated a highly significant negative effect (<italic>P</italic> &#x003C; 0.001, &#x03B2; = &#x2013;6.864). The interaction between MGAT3 and MGAT1 (<xref ref-type="fig" rid="F11">Figure 11H</xref>) also showed a near-significant effect (<italic>P</italic> = 0.0775, &#x03B2; = 2.767). Other interactions, such as APP &#x00D7; MGAT3 (<xref ref-type="fig" rid="F11">Figure 11K</xref>) and MAX &#x00D7; STT3A (<xref ref-type="fig" rid="F11">Figure 11J</xref>), showed limited or non-significant effects (<italic>P</italic> = 0.1743, &#x03B2; = &#x2013;2.298 and <italic>P</italic> = 0.4841, &#x03B2; = &#x2013;1.745, respectively).</p>
<fig id="F11" position="float">
<label>FIGURE 11</label>
<caption><p>N-glycosylation-related molecular features. <bold>(A&#x2013;K)</bold> Analyze the GSE48350 dataset: ROC curves highlight Random Forest&#x2019;s perfect AUC of 1.0 <bold>(A)</bold>; SHAP value distribution <bold>(B)</bold> and feature importance analysis <bold>(C)</bold> underscore APP and MAX as top predictors. <bold>(D&#x2013;F)</bold> Detail feature contributions: a force plot <bold>(D)</bold> and waterfall plot <bold>(F)</bold> detail feature contributions to prediction scores. <bold>(G&#x2013;K)</bold> Explore significant interactions: relationships include a positive effect between MAX and MGAT1 (<italic>p</italic> = 0.0016) and a negative effect between MGAT3 and STT3A (<italic>p</italic> &#x003C; 0.001).</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnagi-17-1597511-g011.tif">
<alt-text content-type="machine-generated">Panel of data visualizations. A: ROC curves comparing different models. B: Dot plot of data distributions. C: Feature importance bar chart ranking predictive features. D: Partial feature dependence plot. E: Scatter plots showing SHAP impact versus feature value for top predictive features. F: Waterfall chart for individual prediction explanations. G-K: Heatmaps showing feature interactions on predictive models with color gradients indicating predicted values.</alt-text>
</graphic>
</fig>
</sec>
</sec>
<sec id="S4" sec-type="discussion">
<label>4</label>
<title>Discussion</title>
<p>AD is a devastating neurodegenerative disorder characterized by progressive cognitive decline and memory impairment, predominantly affecting the elderly population (<xref ref-type="bibr" rid="B27">Rampa et al., 2013</xref>). Its pathophysiology is marked by the accumulation of amyloid-beta plaques and neurofibrillary tangles, which lead to synaptic dysfunction and neuronal loss (<xref ref-type="bibr" rid="B10">Han and Shi, 2016</xref>). As the leading cause of dementia globally, AD poses significant challenges to affected individuals, their families, and healthcare systems (<xref ref-type="bibr" rid="B21">Lozupone et al., 2025</xref>). With its prevalence on the rise, understanding the underlying mechanisms of AD and identifying reliable biomarkers for early diagnosis have become critical priorities in the development of effective therapeutic strategies (<xref ref-type="bibr" rid="B30">Wu et al., 2021</xref>).</p>
<p>This investigation explores the role of N-glycosylation in AD pathology, highlighting its potential as a key factor in disease progression. By integrating gene expression data from the multiple datasets with advanced machine learning techniques, we identified several N-glycosylation-associated genes that may serve as promising biomarkers for early detection. These findings underscore the importance of these biomarkers in improving diagnostic accuracy and guiding future therapeutic interventions, thereby advancing our understanding of AD mechanisms and paving the way for targeted treatment approaches.</p>
<p>In-depth analysis of gene expression profiles in the GSE5281 dataset revealed a substantial set of DEGs. Notably, TMEM59 was markedly downregulated, while MLEC and MAX were significantly upregulated. This observation aligns with transcriptomic studies indicating widespread alterations in glycosylation-related genes in the brains of individuals with AD, where nearly 80% of such genes were reported to be differentially expressed in at least one brain region (<xref ref-type="bibr" rid="B29">Tang et al., 2023</xref>). The downregulation of TMEM59 may disrupt glycoprotein maturation and trafficking, impairing protein-folding quality control mechanisms and exacerbating proteostatic stress&#x2014;a phenomenon previously linked to AD pathology (<xref ref-type="bibr" rid="B25">Meng et al., 2020</xref>). Conversely, the upregulation of MLEC, which encodes malectin, an ER-resident lectin involved in the recognition of misfolded glycoproteins, could reflect a compensatory response to the increased burden of unfolded proteins or aberrant N-glycan structures in the AD brain (<xref ref-type="bibr" rid="B5">Chen et al., 2011</xref>). MAX, a transcriptional regulator implicated in cellular responses to metabolic stress, may be associated with the metabolic alterations observed in AD neurons (<xref ref-type="bibr" rid="B13">Hsieh and Dang, 2016</xref>). These expression patterns diverge from those seen in other neurodegenerative conditions, underscoring a potential disease-specific adaptation in N-glycosylation pathways. Thus, the coordinated dysregulation of TMEM59, MLEC, and MAX highlights a mechanistic nexus linking glycoprotein homeostasis, ER stress, and metabolic reprogramming in AD.</p>
<p>The application of machine learning algorithms enabled the identification of 5 N-glycosylation-associated genes with potential diagnostic relevance for AD. Feature selection methods such as Lasso, Elastic-Net, and Adaptive Lasso were employed to address concerns raised in previous literature regarding the limitations of single-method feature selection, particularly in high-dimensional, collinear &#x201C;omics&#x201D; data. Recent biomarker studies have emphasized the importance of integrating multiple selection strategies to enhance the robustness and interpretability of candidate biomarkers, especially when dataset shift and cohort heterogeneity pose significant translational challenges. Unlike the reliance on univariate or standard regression-based selections, the present methodology combines penalized regression with ensemble learning, thereby mitigating overfitting and improving out-of-sample predictive performance. This integrative analytic framework surpasses previous approaches by systematically accounting for correlation structures among glycosylation genes and accommodating the biological complexity inherent to AD. Importantly, the genes prioritized by this approach show minimal overlap with those identified by conventional differential expression analysis alone, demonstrating the capacity of machine learning to uncover non-obvious biomarker candidates in the context of multifactorial disease etiology (<xref ref-type="bibr" rid="B1">Abbas and El-Manzalawy, 2020</xref>).</p>
<p>The observed correlation between MLEC and TMEM59 suggests a functional interplay within the N-glycosylation pathway, which may be pivotal for maintaining glycoprotein quality control in neural tissue. Previous glycomic investigations have identified enhanced or aberrant glycan biosynthesis as a hallmark of AD, wherein genetic or dietary modulation of glycosylation enzymes directly impacts cognitive and behavioral phenotypes (<xref ref-type="bibr" rid="B11">Hawkinson et al., 2025</xref>). The concurrent upregulation of MLEC alongside the downregulation of TMEM59 could indicate an adaptive but ultimately insufficient cellular response to glycoprotein misfolding and aggregation. Mechanistically, MLEC may facilitate the recognition and retention of misfolded glycoproteins in the endoplasmic reticulum, whereas TMEM59 is implicated in the trafficking and processing of these substrates (<xref ref-type="bibr" rid="B2">Boada-Romero et al., 2013</xref>). The linkage between MLEC and TMEM59 thus provides a molecular rationale for the dysfunction in protein quality control and aberrant glycosylation characterizing AD neuropathology.</p>
<p>The identification of eight TFs&#x2014;including MAX, ZBTB7A, BRD9, RXRA, MBD4, ARID2, FLI1, and TRIM24&#x2014;as regulators of MLEC, TMEM59, GFAP, and FOS expands our understanding of the gene regulatory networks that orchestrate glycosylation and neuroinflammatory responses in AD. Prior computational studies have highlighted the centrality of specific TFs, such as STAT1 and HSF5, in controlling glycosyltransferase gene expression in the AD brain (<xref ref-type="bibr" rid="B29">Tang et al., 2023</xref>), and these regulatory axes are predicted to alter glycan biosynthesis in a cell-type and region-specific manner. The selection of MAX as a pivotal regulator is particularly noteworthy, as its role extends beyond classic metabolic regulation, implicating it in neural stress responses and possibly in the modulation of microglial and astrocytic activation states (<xref ref-type="bibr" rid="B24">McFerrin and Atchley, 2011</xref>). Notably, this regulatory constellation diverges from previously established transcriptional landscapes in other neurodegenerative disorders, suggesting a unique transcriptional architecture underpinning AD-specific glycosylation and glial responses. These findings not only corroborate the significance of transcriptional modulation in AD pathogenesis but also reveal novel candidate regulators that may serve as intervention points for modulating disease trajectory.</p>
<p>The comparative evaluation of diagnostic algorithms demonstrated that logistic regression exhibited superior accuracy and stability for classifying AD status based on gene expression signatures, outperforming ensemble methods such as random forest and XGBoost in this context. This result stands in partial contrast to recent meta-analyses of machine learning in medical diagnostics, which often report that ensemble models provide enhanced performance, especially when handling high-dimensional or nonlinear data. However, the relatively modest sample sizes and stringent feature selection employed in this study may favor the generalizability and interpretability of linear models, aligning with reports that logistic regression can outperform more complex algorithms when the number of predictors is tightly controlled and the risk of overfitting is minimized (<xref ref-type="bibr" rid="B14">Hu and Hu, 2014</xref>). Furthermore, the consistent performance of logistic regression across external validation datasets underscores its practical utility in clinical biomarker development for AD, particularly when transparency and reproducibility are paramount. This methodological insight supports a nuanced approach to model selection, emphasizing context-dependent optimization over blanket preference for algorithmic complexity.</p>
<p>The limitations of this study are primarily reflected in the absence of wet lab validation and the relatively small sample size, which restricts the generalizability of the findings. The reliance on publicly available datasets may introduce batch effects that could skew the results, potentially impacting the robustness of the identified biomarkers (<xref ref-type="bibr" rid="B3">Carry et al., 2023</xref>). Additionally, while machine learning techniques were employed to enhance the identification of N-glycosylation-related genes, the interpretability of complex models remains a challenge, necessitating further validation to establish clinical relevance. These aspects highlight the need for larger, multi-center studies that integrate both computational and experimental approaches to bolster the reliability of the findings.</p>
<p>In summary, this investigation identified several genes associated with N-glycosylation and AD, demonstrating their potential as biomarkers for early diagnosis. The innovative application of machine learning methods underscores the significance of these findings in advancing our understanding of AD pathology and facilitating early intervention strategies. Future research should prioritize larger cohorts and clinical validation of these biomarkers, as well as exploring their mechanistic roles in disease progression. The integration of these insights into clinical practice may pave the way for improved diagnostic and therapeutic frameworks in AD management.</p>
</sec>
<sec id="S5" sec-type="conclusion">
<label>5</label>
<title>Conclusion</title>
<p>This study bridges bibliometric and bioinformatics approaches to elucidate the role of N-glycosylation in AD. Bibliometric analysis highlights a growing focus on molecular mechanisms in AD research, with N-glycosylation emerging as a key area of interest. Bioinformatics identified critical genes such as TMEM59, MLEC, and MAX, which are implicated in glycoprotein homeostasis and neuroinflammatory processes. Machine learning models, particularly logistic regression, demonstrated strong diagnostic potential using minimal gene panels. Additionally, TFs like MAX and BRD9 were linked to glycosylation regulation and glial activation. These findings advance our understanding of N-glycosylation&#x2019;s role in AD pathogenesis and provide promising targets for early diagnosis and therapeutic intervention.</p>
</sec>
</body>
<back>
<sec id="S6" sec-type="data-availability">
<title>Data availability statement</title>
<p>The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in this article/<xref ref-type="supplementary-material" rid="TS1">Supplementary material</xref>.</p>
</sec>
<sec id="S7" sec-type="ethics-statement">
<title>Ethics statement</title>
<p>This study is based on published experimental research and is not currently applicable to medical ethics.</p>
</sec>
<sec id="S8" sec-type="author-contributions">
<title>Author contributions</title>
<p>HZ: Writing &#x2013; review &#x0026; editing, Formal analysis, Software, Writing &#x2013; original draft, Data curation, Methodology, Visualization. WC: Data curation, Visualization, Funding acquisition, Software, Methodology, Writing &#x2013; review &#x0026; editing, Formal analysis, Writing &#x2013; original draft. SY: Visualization, Methodology, Writing &#x2013; original draft, Writing &#x2013; review &#x0026; editing, Formal analysis, Software. BC: Software, Data curation, Writing &#x2013; review &#x0026; editing, Visualization, Writing &#x2013; original draft, Methodology. SD: Writing &#x2013; original draft, Visualization, Data curation, Software, Writing &#x2013; review &#x0026; editing. HB: Writing &#x2013; original draft, Visualization, Writing &#x2013; review &#x0026; editing, Software. JS: Supervision, Writing &#x2013; review &#x0026; editing, Conceptualization, Project administration, Resources, Writing &#x2013; original draft, Investigation, Funding acquisition. WL: Supervision, Conceptualization, Resources, Writing &#x2013; original draft, Writing &#x2013; review &#x0026; editing, Project administration, Investigation. HGZ: Writing &#x2013; review &#x0026; editing, Conceptualization, Investigation, Resources, Writing &#x2013; original draft, Project administration, Supervision.</p>
</sec>
<ack>
<title>Acknowledgments</title>
<p>We thank HaoGang Zhu and Wei Lu for discussions and generous support.</p>
</ack>
<sec id="S10" sec-type="COI-statement">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="S11" sec-type="ai-statement">
<title>Generative AI statement</title>
<p>The authors declare that no Generative AI was used in the creation of this manuscript.</p>
<p>Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.</p>
</sec>
<sec id="S12" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="S13" sec-type="supplementary-material">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fnagi.2025.1597511/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fnagi.2025.1597511/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Image_1.tif" id="FS1" mimetype="image/tiff"/>
<supplementary-material xlink:href="Table_1.xls" id="TS1" mimetype="application/vnd.ms-excel"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Abbas</surname> <given-names>M.</given-names></name> <name><surname>El-Manzalawy</surname> <given-names>Y.</given-names></name></person-group> (<year>2020</year>). <article-title>Machine learning based refined differential gene expression analysis of pediatric sepsis.</article-title> <source><italic>BMC Med. Genom.</italic></source> <volume>13</volume>:<fpage>122</fpage>. <pub-id pub-id-type="doi">10.1186/s12920-020-00771-4</pub-id> <pub-id pub-id-type="pmid">32859206</pub-id></mixed-citation></ref>
<ref id="B2"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Boada-Romero</surname> <given-names>E.</given-names></name> <name><surname>Letek</surname> <given-names>M.</given-names></name> <name><surname>Fleischer</surname> <given-names>A.</given-names></name> <name><surname>Pallauf</surname> <given-names>K.</given-names></name> <name><surname>Ram&#x00F3;n-Barros</surname> <given-names>C.</given-names></name> <name><surname>Pimentel-Mui&#x00F1;os</surname> <given-names>F. X.</given-names></name></person-group> (<year>2013</year>). <article-title>TMEM59 defines a novel ATG16L1-binding motif that promotes local activation of LC3.</article-title> <source><italic>EMBO J.</italic></source> <volume>32</volume> <fpage>566</fpage>&#x2013;<lpage>582</lpage>. <pub-id pub-id-type="doi">10.1038/emboj.2013.8</pub-id> <pub-id pub-id-type="pmid">23376921</pub-id></mixed-citation></ref>
<ref id="B3"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Carry</surname> <given-names>P. M.</given-names></name> <name><surname>Vigers</surname> <given-names>T.</given-names></name> <name><surname>Vanderlinden</surname> <given-names>L. A.</given-names></name> <name><surname>Keeter</surname> <given-names>C.</given-names></name> <name><surname>Dong</surname> <given-names>F.</given-names></name> <name><surname>Buckner</surname> <given-names>T.</given-names></name><etal/></person-group> (<year>2023</year>). <article-title>Propensity scores as a novel method to guide sample allocation and minimize batch effects during the design of high throughput experiments.</article-title> <source><italic>BMC Bioinform.</italic></source> <volume>24</volume>:<fpage>86</fpage>. <pub-id pub-id-type="doi">10.1186/s12859-023-05202-6</pub-id> <pub-id pub-id-type="pmid">36882691</pub-id></mixed-citation></ref>
<ref id="B4"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Cha</surname> <given-names>H. M.</given-names></name> <name><surname>Lim</surname> <given-names>J. H.</given-names></name> <name><surname>Yeon</surname> <given-names>J. H.</given-names></name> <name><surname>Hwang</surname> <given-names>J. M.</given-names></name> <name><surname>Kim</surname> <given-names>D. I.</given-names></name></person-group> (<year>2017</year>). <article-title>Co-overexpression of Mgat1 and Mgat4 in CHO cells for production of highly sialylated albumin-erythropoietin.</article-title> <source><italic>Enzyme Microb. Technol.</italic></source> <volume>103</volume> <fpage>53</fpage>&#x2013;<lpage>58</lpage>. <pub-id pub-id-type="doi">10.1016/j.enzmictec.2017.04.010</pub-id> <pub-id pub-id-type="pmid">28554385</pub-id></mixed-citation></ref>
<ref id="B5"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>Y.</given-names></name> <name><surname>Hu</surname> <given-names>D.</given-names></name> <name><surname>Yabe</surname> <given-names>R.</given-names></name> <name><surname>Tateno</surname> <given-names>H.</given-names></name> <name><surname>Qin</surname> <given-names>S. Y.</given-names></name> <name><surname>Matsumoto</surname> <given-names>N.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>Role of malectin in Glc(2)Man(9)GlcNAc(2)-dependent quality control of &#x03B1;1-antitrypsin.</article-title> <source><italic>Mol. Biol. Cell</italic></source> <volume>22</volume> <fpage>3559</fpage>&#x2013;<lpage>3570</lpage>. <pub-id pub-id-type="doi">10.1091/mbc.E11-03-0201</pub-id> <pub-id pub-id-type="pmid">21813736</pub-id></mixed-citation></ref>
<ref id="B6"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Dos Santos Silva</surname> <given-names>P. M.</given-names></name> <name><surname>de Oliveira</surname> <given-names>W. F.</given-names></name> <name><surname>Albuquerque</surname> <given-names>P. B. S.</given-names></name> <name><surname>Dos Santos Correia</surname> <given-names>M. T.</given-names></name> <name><surname>Coelho</surname> <given-names>L. C. B. B.</given-names></name></person-group> (<year>2019</year>). <article-title>Insights into anti-pathogenic activities of mannose lectins.</article-title> <source><italic>Int. J. Biol. Macromol.</italic></source> <volume>140</volume> <fpage>234</fpage>&#x2013;<lpage>244</lpage>. <pub-id pub-id-type="doi">10.1016/j.ijbiomac.2019.08.059</pub-id> <pub-id pub-id-type="pmid">31400430</pub-id></mixed-citation></ref>
<ref id="B7"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>D&#x00FC;nser</surname> <given-names>K.</given-names></name> <name><surname>Schoberer</surname> <given-names>J.</given-names></name></person-group> (<year>2025</year>). <article-title>Trafficking and localization of Golgi-resident N-glycan processing enzymes in plants.</article-title> <source><italic>Front. Plant Sci.</italic></source> <volume>16</volume>:<fpage>1624949</fpage>. <pub-id pub-id-type="doi">10.3389/fpls.2025.1624949</pub-id> <pub-id pub-id-type="pmid">40786940</pub-id></mixed-citation></ref>
<ref id="B8"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Fang</surname> <given-names>S.</given-names></name> <name><surname>Zhang</surname> <given-names>K.</given-names></name> <name><surname>Liu</surname> <given-names>D.</given-names></name> <name><surname>Yang</surname> <given-names>Y.</given-names></name> <name><surname>Xi</surname> <given-names>H.</given-names></name> <name><surname>Xie</surname> <given-names>W.</given-names></name><etal/></person-group> (<year>2024</year>). <article-title>Polyphenol-based polymer nanoparticles for inhibiting amyloid protein aggregation: Recent advances and perspectives.</article-title> <source><italic>Front. Nutr.</italic></source> <volume>11</volume>:<fpage>1408620</fpage>. <pub-id pub-id-type="doi">10.3389/fnut.2024.1408620</pub-id> <pub-id pub-id-type="pmid">39135555</pub-id></mixed-citation></ref>
<ref id="B9"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Giesen</surname> <given-names>K.</given-names></name> <name><surname>Lammel</surname> <given-names>U.</given-names></name> <name><surname>Langehans</surname> <given-names>D.</given-names></name> <name><surname>Krukkert</surname> <given-names>K.</given-names></name> <name><surname>Bunse</surname> <given-names>I.</given-names></name> <name><surname>Kl&#x00E4;mbt</surname> <given-names>C.</given-names></name></person-group> (<year>2003</year>). <article-title>Regulation of glial cell number and differentiation by ecdysone and Fos signaling.</article-title> <source><italic>Mech. Dev.</italic></source> <volume>120</volume> <fpage>401</fpage>&#x2013;<lpage>413</lpage>. <pub-id pub-id-type="doi">10.1016/s0925-4773(03)00009-1</pub-id> <pub-id pub-id-type="pmid">12676319</pub-id></mixed-citation></ref>
<ref id="B10"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Han</surname> <given-names>P.</given-names></name> <name><surname>Shi</surname> <given-names>J. A.</given-names></name></person-group> (<year>2016</year>). <article-title>Theoretical analysis of the synergy of amyloid and tau in Alzheimer&#x2019;s disease.</article-title> <source><italic>J. Alzheimers Dis.</italic></source> <volume>52</volume> <fpage>1461</fpage>&#x2013;<lpage>1470</lpage>. <pub-id pub-id-type="doi">10.3233/JAD-151206</pub-id> <pub-id pub-id-type="pmid">27104897</pub-id></mixed-citation></ref>
<ref id="B11"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hawkinson</surname> <given-names>T. R.</given-names></name> <name><surname>Liu</surname> <given-names>Z.</given-names></name> <name><surname>Ribas</surname> <given-names>R. A.</given-names></name> <name><surname>Medina</surname> <given-names>T.</given-names></name> <name><surname>Nielsen</surname> <given-names>R. S.</given-names></name> <name><surname>Clarke</surname> <given-names>H. A.</given-names></name><etal/></person-group> (<year>2025</year>). <article-title>Hyper-Glycosylation as a central metabolic driver of Alzheimer&#x2019;s disease.</article-title> <source><italic>bioRxiv [Preprint]</italic></source> <pub-id pub-id-type="doi">10.1101/2025.04.30.651461</pub-id> <pub-id pub-id-type="pmid">40654813</pub-id></mixed-citation></ref>
<ref id="B12"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hou</surname> <given-names>X.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Bu</surname> <given-names>D.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Sun</surname> <given-names>S.</given-names></name></person-group> (<year>2023</year>). <article-title>EMNGly: Predicting N-linked glycosylation sites using the language models for feature extraction.</article-title> <source><italic>Bioinformatics</italic></source> <volume>39</volume>:<fpage>btad650</fpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btad650</pub-id> <pub-id pub-id-type="pmid">37930896</pub-id></mixed-citation></ref>
<ref id="B13"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hsieh</surname> <given-names>A. L.</given-names></name> <name><surname>Dang</surname> <given-names>C. V.</given-names></name></person-group> (<year>2016</year>). <article-title>MYC, metabolic synthetic lethality, and cancer.</article-title> <source><italic>Recent Results Cancer Res.</italic></source> <volume>207</volume> <fpage>73</fpage>&#x2013;<lpage>91</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-319-42118-6_4</pub-id> <pub-id pub-id-type="pmid">27557535</pub-id></mixed-citation></ref>
<ref id="B14"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hu</surname> <given-names>W.</given-names></name> <name><surname>Hu</surname> <given-names>J.</given-names></name></person-group> (<year>2014</year>). <article-title>Prediction of siRNA potency using sparse logistic regression.</article-title> <source><italic>J. Comput. Biol.</italic></source> <volume>21</volume> <fpage>420</fpage>&#x2013;<lpage>427</lpage>. <pub-id pub-id-type="doi">10.1089/cmb.2009.0115</pub-id> <pub-id pub-id-type="pmid">21091052</pub-id></mixed-citation></ref>
<ref id="B15"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>H. C.</given-names></name> <name><surname>Jiang</surname> <given-names>Z. F.</given-names></name></person-group> (<year>2009</year>). <article-title>Accumulated amyloid-beta peptide and hyperphosphorylated tau protein: Relationship and links in Alzheimer&#x2019;s disease.</article-title> <source><italic>J. Alzheimers Dis.</italic></source> <volume>16</volume> <fpage>15</fpage>&#x2013;<lpage>27</lpage>. <pub-id pub-id-type="doi">10.3233/JAD-2009-0960</pub-id> <pub-id pub-id-type="pmid">19158417</pub-id></mixed-citation></ref>
<ref id="B16"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kaye</surname> <given-names>A. D.</given-names></name> <name><surname>Islam</surname> <given-names>R. K.</given-names></name> <name><surname>Tong</surname> <given-names>V. T.</given-names></name> <name><surname>Tynes</surname> <given-names>B. E.</given-names></name> <name><surname>Sala</surname> <given-names>K. R.</given-names></name> <name><surname>Abbott</surname> <given-names>B.</given-names></name><etal/></person-group> (<year>2024</year>). <article-title>Transcutaneous electrical nerve stimulation for prevention and treatment of post-herpetic neuralgia: A narrative review</article-title>. <source><italic>Cureus</italic></source>. <volume>16</volume>:<fpage>e74416</fpage>. <pub-id pub-id-type="doi">10.7759/cureus.74416</pub-id> <pub-id pub-id-type="pmid">39723328</pub-id></mixed-citation></ref>
<ref id="B17"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kawaharada</surname> <given-names>R.</given-names></name> <name><surname>Sugimoto</surname> <given-names>T.</given-names></name> <name><surname>Matsuda</surname> <given-names>N.</given-names></name> <name><surname>Tsuboi</surname> <given-names>Y.</given-names></name> <name><surname>Sakurai</surname> <given-names>T.</given-names></name> <name><surname>Ono</surname> <given-names>R.</given-names></name></person-group> (<year>2019</year>). <article-title>Impact of loss of independence in basic activities of daily living on caregiver burden in patients with Alzheimer&#x2019;s disease: A retrospective cohort study</article-title>. <source><italic>Geriatr Gerontol Int</italic></source>. <volume>19</volume>, <fpage>1243</fpage>&#x2013;<lpage>1247</lpage>. <pub-id pub-id-type="doi">10.1111/ggi.13803</pub-id> <pub-id pub-id-type="pmid">31638736</pub-id></mixed-citation></ref>
<ref id="B18"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Khan</surname> <given-names>T. K.</given-names></name></person-group> (<year>2018</year>). <article-title>An algorithm for preclinical diagnosis of Alzheimer&#x2019;s disease.</article-title> <source><italic>Front. Neurosci.</italic></source> <volume>12</volume>:<fpage>275</fpage>. <pub-id pub-id-type="doi">10.3389/fnins.2018.00275</pub-id> <pub-id pub-id-type="pmid">29760644</pub-id></mixed-citation></ref>
<ref id="B19"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Knee</surname> <given-names>R.</given-names></name> <name><surname>Ahsan</surname> <given-names>I.</given-names></name> <name><surname>Mesaeli</surname> <given-names>N.</given-names></name> <name><surname>Kaufman</surname> <given-names>R. J.</given-names></name> <name><surname>Michalak</surname> <given-names>M.</given-names></name></person-group> (<year>2003</year>). <article-title>Compromised calnexin function in calreticulin-deficient cells.</article-title> <source><italic>Biochem. Biophys. Res. Commun.</italic></source> <volume>304</volume> <fpage>661</fpage>&#x2013;<lpage>666</lpage>. <pub-id pub-id-type="doi">10.1016/s0006-291x(03)00643-0</pub-id> <pub-id pub-id-type="pmid">12727205</pub-id></mixed-citation></ref>
<ref id="B20"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>J. H.</given-names></name> <name><surname>Jeong</surname> <given-names>Y. R.</given-names></name> <name><surname>Kim</surname> <given-names>Y. G.</given-names></name> <name><surname>Lee</surname> <given-names>G. M.</given-names></name></person-group> (<year>2017</year>). <article-title>Understanding of decreased sialylation of Fc-fusion protein in hyperosmotic recombinant Chinese hamster ovary cell culture: N-glycosylation gene expression and N-linked glycan antennary profile.</article-title> <source><italic>Biotechnol. Bioeng.</italic></source> <volume>114</volume> <fpage>1721</fpage>&#x2013;<lpage>1732</lpage>. <pub-id pub-id-type="doi">10.1002/bit.26284</pub-id> <pub-id pub-id-type="pmid">28266015</pub-id></mixed-citation></ref>
<ref id="B21"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lozupone</surname> <given-names>M.</given-names></name> <name><surname>Dibello</surname> <given-names>V.</given-names></name> <name><surname>Resta</surname> <given-names>E.</given-names></name> <name><surname>Sardone</surname> <given-names>R.</given-names></name> <name><surname>Castellana</surname> <given-names>F.</given-names></name> <name><surname>Zupo</surname> <given-names>R.</given-names></name><etal/></person-group> (<year>2025</year>). <article-title>Uncertainties in anti-amyloid monoclonal antibody therapy for Alzheimer&#x2019;s disease: The challenges ahead.</article-title> <source><italic>Exp. Rev. Neurother.</italic></source> <volume>25</volume> <fpage>649</fpage>&#x2013;<lpage>659</lpage>. <pub-id pub-id-type="doi">10.1080/14737175.2025.2500752</pub-id> <pub-id pub-id-type="pmid">40314575</pub-id></mixed-citation></ref>
<ref id="B22"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Fermaintt</surname> <given-names>C. S.</given-names></name> <name><surname>Cherepanova</surname> <given-names>N. A.</given-names></name> <name><surname>Gilmore</surname> <given-names>R.</given-names></name> <name><surname>Yan</surname> <given-names>N.</given-names></name> <name><surname>Lehrman</surname> <given-names>M. A.</given-names></name></person-group> (<year>2018</year>). <article-title>Mammalian STT3A/B oligosaccharyltransferases segregate N-glycosylation at the translocon from lipid-linked oligosaccharide hydrolysis.</article-title> <source><italic>Proc. Natl. Acad. Sci. U. S. A.</italic></source> <volume>115</volume> <fpage>9557</fpage>&#x2013;<lpage>9562</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1806034115</pub-id> <pub-id pub-id-type="pmid">30181269</pub-id></mixed-citation></ref>
<ref id="B23"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Marques</surname> <given-names>A. R. A.</given-names></name> <name><surname>Di Spiezio</surname> <given-names>A.</given-names></name> <name><surname>Thie&#x00DF;en</surname> <given-names>N.</given-names></name> <name><surname>Schmidt</surname> <given-names>L.</given-names></name> <name><surname>Gr&#x00F6;tzinger</surname> <given-names>J.</given-names></name> <name><surname>L&#x00FC;llmann-Rauch</surname> <given-names>R.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Enzyme replacement therapy with recombinant pro-CTSD (cathepsin D) corrects defective proteolysis and autophagy in neuronal ceroid lipofuscinosis.</article-title> <source><italic>Autophagy</italic></source> <volume>16</volume> <fpage>811</fpage>&#x2013;<lpage>825</lpage>. <pub-id pub-id-type="doi">10.1080/15548627.2019.1637200</pub-id> <pub-id pub-id-type="pmid">31282275</pub-id></mixed-citation></ref>
<ref id="B24"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>McFerrin</surname> <given-names>L. G.</given-names></name> <name><surname>Atchley</surname> <given-names>W. R.</given-names></name></person-group> (<year>2011</year>). <article-title>Evolution of the Max and Mlx networks in animals.</article-title> <source><italic>Genome Biol. Evol.</italic></source> <volume>3</volume> <fpage>915</fpage>&#x2013;<lpage>937</lpage>. <pub-id pub-id-type="doi">10.1093/gbe/evr082</pub-id> <pub-id pub-id-type="pmid">21859806</pub-id></mixed-citation></ref>
<ref id="B25"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Meng</surname> <given-names>J.</given-names></name> <name><surname>Han</surname> <given-names>L.</given-names></name> <name><surname>Zheng</surname> <given-names>N.</given-names></name> <name><surname>Xu</surname> <given-names>H.</given-names></name> <name><surname>Liu</surname> <given-names>Z.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>TMEM59 haploinsufficiency ameliorates the pathology and cognitive impairment in the 5xFAD mouse model of Alzheimer&#x2019;s disease.</article-title> <source><italic>Front. Cell Dev. Biol.</italic></source> <volume>8</volume>:<fpage>596030</fpage>. <pub-id pub-id-type="doi">10.3389/fcell.2020.596030</pub-id> <pub-id pub-id-type="pmid">33195275</pub-id></mixed-citation></ref>
<ref id="B26"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Migliore</surname> <given-names>L.</given-names></name> <name><surname>Copped&#x00E8;</surname> <given-names>F.</given-names></name></person-group> (<year>2022</year>). <article-title>Gene-environment interactions in Alzheimer disease: The emerging role of epigenetics.</article-title> <source><italic>Nat. Rev. Neurol.</italic></source> <volume>18</volume> <fpage>643</fpage>&#x2013;<lpage>660</lpage>. <pub-id pub-id-type="doi">10.1038/s41582-022-00714-w</pub-id> <pub-id pub-id-type="pmid">36180553</pub-id></mixed-citation></ref>
<ref id="B27"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Rampa</surname> <given-names>A.</given-names></name> <name><surname>Gobbi</surname> <given-names>S.</given-names></name> <name><surname>Belluti</surname> <given-names>F.</given-names></name> <name><surname>Bisi</surname> <given-names>A.</given-names></name></person-group> (<year>2013</year>). <article-title>Emerging targets in neurodegeneration: New opportunities for Alzheimer&#x2019;s disease treatment?</article-title> <source><italic>Curr. Top. Med. Chem.</italic></source> <volume>13</volume> <fpage>1879</fpage>&#x2013;<lpage>1904</lpage>. <pub-id pub-id-type="doi">10.2174/15680266113139990143</pub-id> <pub-id pub-id-type="pmid">23931436</pub-id></mixed-citation></ref>
<ref id="B28"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sebastian</surname> <given-names>A.</given-names></name> <name><surname>Cistulli</surname> <given-names>P. A.</given-names></name> <name><surname>Cohen</surname> <given-names>G.</given-names></name> <name><surname>Chazal</surname> <given-names>P.</given-names></name></person-group> (<year>2020</year>). <article-title>Characterisation of upper airway collapse in OSA patients using snore signals: A cluster analysis approach.</article-title> <source><italic>Annu. Int. Conf. IEEE Eng. Med. Biol. Soc.</italic></source> <volume>2020</volume> <fpage>5124</fpage>&#x2013;<lpage>5127</lpage>. <pub-id pub-id-type="doi">10.1109/EMBC44109.2020.9175591</pub-id> <pub-id pub-id-type="pmid">33019139</pub-id></mixed-citation></ref>
<ref id="B29"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Tang</surname> <given-names>X.</given-names></name> <name><surname>Tena</surname> <given-names>J.</given-names></name> <name><surname>Di Lucente</surname> <given-names>J.</given-names></name> <name><surname>Maezawa</surname> <given-names>I.</given-names></name> <name><surname>Harvey</surname> <given-names>D. J.</given-names></name> <name><surname>Jin</surname> <given-names>L. W.</given-names></name><etal/></person-group> (<year>2023</year>). <article-title>Transcriptomic and glycomic analyses highlight pathway-specific glycosylation alterations unique to Alzheimer&#x2019;s disease.</article-title> <source><italic>Sci. Rep.</italic></source> <volume>13</volume>:<fpage>7816</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-023-34787-4</pub-id> <pub-id pub-id-type="pmid">37188790</pub-id></mixed-citation></ref>
<ref id="B30"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>W.</given-names></name> <name><surname>Lee</surname> <given-names>I.</given-names></name> <name><surname>Spratt</surname> <given-names>H.</given-names></name> <name><surname>Fang</surname> <given-names>X.</given-names></name> <name><surname>Bao</surname> <given-names>X.</given-names></name></person-group> (<year>2021</year>). <article-title>tRNA-Derived fragments in Alzheimer&#x2019;s Disease: Implications for new disease biomarkers and neuropathological mechanisms</article-title>. <source><italic>J. Alzheimers Dis</italic></source>. <volume>79</volume>, <fpage>793</fpage>&#x2013;<lpage>806</lpage>. <pub-id pub-id-type="doi">10.3233/JAD-200917</pub-id> <pub-id pub-id-type="pmid">33337366</pub-id></mixed-citation></ref>
<ref id="B31"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname> <given-names>X.</given-names></name> <name><surname>Tan</surname> <given-names>H.</given-names></name> <name><surname>Yin</surname> <given-names>K.</given-names></name> <name><surname>Xu</surname> <given-names>S.</given-names></name> <name><surname>Wang</surname> <given-names>Z.</given-names></name> <name><surname>Serrano</surname> <given-names>G. E.</given-names></name><etal/></person-group> (<year>2025a</year>). <article-title>Comprehensive and site-specific characterization of protein N-Glycosylation in AD samples reveals its potential roles in protein aggregation and synaptic dysfunction.</article-title> <source><italic>Anal. Chem.</italic></source> <volume>97</volume> <fpage>21873</fpage>&#x2013;<lpage>21882</lpage>. <pub-id pub-id-type="doi">10.1021/acs.analchem.5c02455</pub-id> <pub-id pub-id-type="pmid">41042700</pub-id></mixed-citation></ref>
<ref id="B32"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname> <given-names>Y.</given-names></name> <name><surname>Liu</surname> <given-names>S.</given-names></name> <name><surname>Zhou</surname> <given-names>Z.</given-names></name> <name><surname>Qin</surname> <given-names>H.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Zhang</surname> <given-names>G.</given-names></name><etal/></person-group> (<year>2025b</year>). <article-title>Integrated multi-omics analysis revealed the molecular networks and potential targets of cellular senescence in Alzheimer&#x2019;s disease.</article-title> <source><italic>Hum. Mol. Genet.</italic></source> <volume>34</volume> <fpage>381</fpage>&#x2013;<lpage>391</lpage>. <pub-id pub-id-type="doi">10.1093/hmg/ddae189</pub-id> <pub-id pub-id-type="pmid">39690817</pub-id></mixed-citation></ref>
<ref id="B33"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>You</surname> <given-names>W.</given-names></name></person-group> (<year>2025</year>). <article-title>Global patterns linking total meat supply to dementia incidence: A population-based ecological study.</article-title> <source><italic>AIMS Neurosci.</italic></source> <volume>12</volume> <fpage>203</fpage>&#x2013;<lpage>221</lpage>. <pub-id pub-id-type="doi">10.3934/Neuroscience.2025012</pub-id> <pub-id pub-id-type="pmid">40717728</pub-id></mixed-citation></ref>
<ref id="B34"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>Q.</given-names></name> <name><surname>Chen</surname> <given-names>B.</given-names></name> <name><surname>Yang</surname> <given-names>P.</given-names></name> <name><surname>Wu</surname> <given-names>J.</given-names></name> <name><surname>Pang</surname> <given-names>X.</given-names></name> <name><surname>Pang</surname> <given-names>C.</given-names></name></person-group> (<year>2022</year>). <article-title>Bioinformatics-based study reveals that AP2M1 is regulated by the circRNA-miRNA-mRNA interaction network and affects Alzheimer&#x2019;s disease.</article-title> <source><italic>Front. Genet.</italic></source> <volume>13</volume>:<fpage>1049786</fpage>. <pub-id pub-id-type="doi">10.3389/fgene.2022.1049786</pub-id> <pub-id pub-id-type="pmid">36468008</pub-id></mixed-citation></ref>
<ref id="B35"><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>S. C.</given-names></name> <name><surname>Dong</surname> <given-names>Y. X.</given-names></name> <name><surname>Tian</surname> <given-names>J.</given-names></name> <name><surname>Che</surname> <given-names>G. W.</given-names></name> <name><surname>Lai</surname> <given-names>Y.</given-names></name></person-group> (<year>2025</year>). <article-title>Advancing the understanding of alveolar regeneration: Global research trends, thematic evolution, and emerging frontiers.</article-title> <source><italic>Regen. Ther.</italic></source> <volume>30</volume> <fpage>778</fpage>&#x2013;<lpage>794</lpage>. <pub-id pub-id-type="doi">10.1016/j.reth.2025.09.005</pub-id> <pub-id pub-id-type="pmid">41079451</pub-id></mixed-citation></ref>
</ref-list>
<fn-group>
<fn id="n1" fn-type="custom" custom-type="edited-by"><p>Edited by: <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/180146/overview">Francisco Navarrete</ext-link>, Miguel Hern&#x00E1;ndez University of Elche, Spain</p></fn>
<fn id="n2" fn-type="custom" custom-type="reviewed-by"><p>Reviewed by: <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1506878/overview">Cristian D. Gutierrez Reyes</ext-link>, Texas Tech University, United States</p>
<p><ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1863872/overview">Tian Zhao Ma</ext-link>, Affiliated Hospital of Jining Medical University, China</p></fn>
</fn-group>
</back>
</article>