<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="brief-report" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">1231415</article-id>
<article-id pub-id-type="doi">10.3389/fgene.2023.1231415</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Brief Research Report</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>DNA cytosine deamination is associated with recurrent Somatic Copy Number Alterations in stomach adenocarcinoma</article-title>
<alt-title alt-title-type="left-running-head">Shi and Shen</alt-title>
<alt-title alt-title-type="right-running-head">
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fgene.2023.1231415">10.3389/fgene.2023.1231415</ext-link>
</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Shi</surname>
<given-names>Yilin</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/2069834/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Shen</surname>
<given-names>Huangxuan</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1296424/overview"/>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>The College of Letters &#x26; Science, University of Wisconsin&#x2013;Madison</institution>, <addr-line>Madison</addr-line>, <addr-line>WI</addr-line>, <country>United States</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>State Key Laboratory of Ophthalmology</institution>, <institution>Zhongshan Ophthalmic Center</institution>, <institution>Sun Yat-sen University</institution>, <addr-line>Guangzhou</addr-line>, <country>China</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1064044/overview">Qiyuan Li</ext-link>, Xiamen University, China</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/2340410/overview">Hongbo Ling</ext-link>, George Washington University, United States</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/785402/overview">Rong Na</ext-link>, The University of Hong Kong, Hong Kong SAR, China</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Huangxuan Shen, <email>shenhx@mail.sysu.edu.cn</email>
</corresp>
</author-notes>
<pub-date pub-type="epub">
<day>06</day>
<month>10</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>14</volume>
<elocation-id>1231415</elocation-id>
<history>
<date date-type="received">
<day>30</day>
<month>05</month>
<year>2023</year>
</date>
<date date-type="accepted">
<day>05</day>
<month>09</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2023 Shi and Shen.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Shi and Shen</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>Stomach Adenocarcinoma (STAD) is a leading cause of death worldwide. Somatic Copy Number Alterations (SCNAs), which result in Homologous recombination (HR) deficiency in double-strand break repair, are associated with the progression of STAD. However, the landscape of frequent breakpoints of SCNAs (hotspots) and their functional impacts remain poorly understood. In this study, we aimed to explore the frequency and impact of these hotspots in 332 STAD patients and 1,043 cancer cells using data from the Cancer Genome Atlas (TCGA) and Cancer Cell Line Encyclopedia (CCLE). We studied the rates of DSB (Double-Strand Breaks) loci in STAD patients by employing the Non-Homogeneous Poisson Distribution (&#x3bb;), based on which we identified 145 DSB-hotspots with genes affected. We further verified DNA cytosine deamination as a critical process underlying the burden of DSB in STAD. Finally, we illustrated the clinical impact of the significant biological processes. Our findings highlighted the relationship between DNA cytosine deamination and SCNA in cancer was associated with recurrent Somatic Copy Number Alterations in STAD.</p>
</abstract>
<kwd-group>
<kwd>stomach adenocarcinoma</kwd>
<kwd>DNA cytosine deamination</kwd>
<kwd>hotspots</kwd>
<kwd>Somatic Copy Number Alterations</kwd>
<kwd>CCLE database</kwd>
<kwd>TCGA database</kwd>
</kwd-group>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Computational Genomics</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p>STAD is ranked as the fifth most commonly diagnosed cancer in 2020 and is the third leading cause of cancer-related deaths worldwide (<xref ref-type="bibr" rid="B32">Zhou et al., 2020</xref>). China has the highest incidence rate of STAD, accounting for 49.9% of global cases, with approximately 498,000 deaths in 2019 (<xref ref-type="bibr" rid="B33">Zhu et al., 2019</xref>). In general, external mutagens, such as smoking, and alterations of certain biological functions, such as deficiencies in DSB repair mechanism and <italic>APOBEC</italic> enzymatic activities, would result in specific mutational signatures (<xref ref-type="bibr" rid="B13">Guo et al., 2021</xref>). Recent studies also highlighted the involvement of Microsatellite Instability, Tumor Mutation Burden, and SCNAs in the progression of Gastric Cancer (<xref ref-type="bibr" rid="B7">Chen et al., 2022</xref>). Filaggrin (<italic>FLG</italic>) mutation led to increased gastric cancer sensitivity to 24 chemotherapeutic drugs, suggesting a potential protective factor (<xref ref-type="bibr" rid="B31">Yicheng et al., 2022</xref>). GLP2R knockdown was shown to significantly inhibit the proliferation and migration of gastric cancer cells <italic>in virto</italic> (<xref ref-type="bibr" rid="B10">Fu et al., 2022</xref>). Despite extensive research in STAD, the underlying pathogenesis and etiology of the disease remained elusive. Mutagenic processes only partially accounted for observed mutational signatures in cancer patients (<xref ref-type="bibr" rid="B30">Watkins et al., 2020</xref>). Therefore, further investigation into frequent SCNAs in STADs was crucial.</p>
<p>SCNAs resulted in multiple copy gains or losses of specific DNA fragments on homologous chromosomes (<xref ref-type="bibr" rid="B17">Hovhannisyan et al., 2019</xref>). They arose from inter-related processes of replication stress, spindle multipolarity, mitotic errors, and breakage&#x2013;fusion&#x2013;bridge cycles (<xref ref-type="bibr" rid="B28">Steele et al., 2022</xref>). Genome doubling and ongoing dynamic chromosomal instability resulted in the evolution of driver SCNAs (<xref ref-type="bibr" rid="B19">Jamal-Hanjani et al., 2017</xref>). HR, an evolutionary-conserved mechanism, playing a role in a subtle balance between genome stability and diversity, was a DNA repair pathway (<xref ref-type="bibr" rid="B4">Carr and Lambert, 2013</xref>). HR deficiency underlying the DSB-repair mechanism has been identified as a major cause of SCNAs in cancer cells (<xref ref-type="bibr" rid="B15">Hastings et al., 2009</xref>). Recent Studies demonstrated a strong correlation between large Somatic Copy Number Alterations (SCNAs) and the development of developmental disorders and cancer (<xref ref-type="bibr" rid="B22">Mac&#xe9; et al., 2018</xref>). In particular, SCNAs in STAD have been found to exhibit various signatures involving cancer-related genes such as <italic>TP53, PIK3CA</italic>, and <italic>ARID1A</italic>. These signatures include Diploid with zero whole-genome doublings, chromothripsis amplification, and loss of heterozygosity. The drivers of some signatures of SCNAs in STAD have been identified as <italic>MDM2, EGFR, CCNE1, MYC, and ERBB2</italic> (<xref ref-type="bibr" rid="B17">Hovhannisyan et al., 2019</xref>). A computational method applied to characterize aneuploidy in samples of tumors according to coordinated aberrations in the expression of genes in each chromosomal region has been developed (<xref ref-type="bibr" rid="B6">Carter et al., 2006</xref>). However, the specific genes and their functions within SCNA hotspots remain unknown. Therefore, it is crucial to identify relevant genes within hotspots of SCNAs to investigate their functions.</p>
<p>Here, we analyzed the distribution of the DSB points of 179166 SNCA events across 332 STADs from TCGA and 872216 SCNA events across 1,043 cancer cells from CCLE. We identified 145 hotspots of recurrent DSB in STAD and found that DNA Cytosine Deamination was associated with the load of SCNAs in STADs and the corresponding clinical outcome. The schematic view of this study and methods of calculating &#x3bb; were provided (<xref ref-type="fig" rid="F1">Figures 1A, B</xref>).</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Flowcharts of the procedure. <bold>(A)</bold> The schematic view of this study. In this study, we classified SCNAs in terms of width. Excluding chromosome X and Y, we divided genome into parts and simulated the Poisson Distribution to calculate &#x3bb; in each window to find out hotspots. Then relevant genes were found, GO analysis was done after modification of gene list and top 10 terms were listed. At last relationship between some relevant genes and SCNA burden was illustrated by T-test and survival analysis was made. <bold>(B)</bold> Flowchart of calculating &#x3bb; in different positions of genome. We set a window on genome and calculate the &#x3bb; in this window. The calculation lasts until the end of the genome.</p>
</caption>
<graphic xlink:href="fgene-14-1231415-g001.tif"/>
</fig>
</sec>
<sec sec-type="materials|methods" id="s2">
<title>Material and methods</title>
<sec id="s2-1">
<title>Raw data</title>
<p>The data on SCNAs in STAD patients, overall survival of STAD patients, gene expression in STAD patients, and locations of genes were obtained from TCGA (<ext-link ext-link-type="uri" xlink:href="https://tcga-data-secure.nci.nih.gov/tcgafiles/tcga4yeo/tumor/prad/cgcc/broad.mit.edu/genome_wide_snp_6/snp/">https://tcga-data-secure.nci.nih.gov/tcgafiles/tcga4yeo/tumor/prad/cgcc/broad.mit.edu/genome_wide_snp_6/snp/</ext-link>). The data was available in appendices. The version of the genome was hg19 (<ext-link ext-link-type="uri" xlink:href="https://genome.ucsc.edu/cgi-bin/hgTracks?db=hg19&amp;lastVirtModeType=default&amp;lastVirtModeExtraState=&amp;virtModeType=default&amp;virtMode=0&amp;nonVirtPosition=&amp;position=chr2%3A25383722%2D25391559&amp;hgsid=1653852260_7skZ6eFcyA94KTIXSNz9IEKYP60J">https://genome.ucsc.edu/cgi-bin/hgTracks?db&#x3d;hg19&#x26;lastVirtModeType&#x3d;default&#x26;lastVirtModeExtraState&#x3d;&#x26;virtModeType&#x3d;default&#x26;virtMode&#x3d;0&#x26;nonVirtPosition&#x3d;&#x26;position&#x3d;chr2%3A25383722%2D25391559&#x26;hgsid&#x3d;1653852260_7skZ6eFcyA94KTIXSNz9IEKYP60J</ext-link>). The data of SCNAs in STAD cell lines and expressions of relevant genes were obtained from CCLE (<ext-link ext-link-type="uri" xlink:href="https://depmap.org/portal/download/all/">https://depmap.org/portal/download/all/</ext-link>). The two sets of data were analyzed separately.</p>
</sec>
<sec id="s2-2">
<title>Classification of SCNAs</title>
<p>The data was imported into RStudio 4.2.2 and classified into two sets: large-scale SCNAs with a width &#x3e;100&#xa0;kb and SCNAs with a width&#x2264;100&#xa0;kb (<xref ref-type="bibr" rid="B23">McCarroll et al., 2008</xref>). They were analyzed separately to better understand their characteristics and implications. For ease of reference, SCNAs with a width &#x3e;100&#xa0;kb and SCNAs with a width&#x2264;100&#xa0;kb were called Set1 and Set2, respectively (<xref ref-type="table" rid="T1">Table 1</xref>). See supporting material.</p>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Convenient code for two sets of experiments. Convenient code for two sets of experiments.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Set of experiment</th>
<th align="left">Procedure of experiment</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">Set1</td>
<td align="left">Analyzing SCNAs with width&#x3e; 100&#xa0;kb</td>
</tr>
<tr>
<td align="left">Set2</td>
<td align="left">Analyzing SCNAs with width &#x3d;&#x3c; 100&#xa0;kb</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s2-3">
<title>Calculation of &#x3bb;</title>
<p>We hypothesized that the rate of DSB follows a Poisson Distribution with the parameter &#x3bb;. To obtain &#x3bb; based on the number of DSB events in each locus, a log-likelihood function was utilized to estimate &#x3bb; values, taking into account the deviation from a perfect Poisson Distribution observed in the data (<xref ref-type="bibr" rid="B29">van Opheusden et al., 2020</xref>).</p>
<p>The study analyzed two datasets to determine the burden of SCNAs in specific genomic regions. The SCNA burden, defined as the number of SCNAs, was calculated for each patient within 1&#xa0;Mb windows along the chromosomes.</p>
<p>To determine whether an SCNA fell within a window, the following criteria were applied: if the start point of the SCNA was smaller than the endpoint of the window, and the endpoint was larger than the start point of the window, then the SCNA was counted within the window.</p>
<p>Using RStudio, &#x3bb; was estimated for each window. The parameter &#x3bb;, representing the average occurrence of SCNAs within a window, was calculated using log-likelihood functions. It was observed that the distribution of &#x3bb; within the windows did not perfectly match the Standard Poisson Distribution models.</p>
<p>To visually represent the distribution of &#x3bb;, histograms of log2(&#x3bb;) were plotted. Additionally, Manhattan plots were generated, using a threshold value approximately equal to the minimum value of &#x3bb; observed in the hotspots, to illustrate the hotspots.</p>
</sec>
<sec id="s2-4">
<title>Identification of hotspots</title>
<p>Hotspots were determined based on the top 5% of the &#x3bb; values, as this threshold was considered statistically significant. The information regarding the hotspots in both sets of experiments can be found in the appendices, which include details such as the location of the hotspot and the corresponding &#x3bb; value. We also identified the relevant genes located within these hotspots.</p>
</sec>
<sec id="s2-5">
<title>Gene lists modification</title>
<p>Before doing the GO analysis, hyper-polymorphic genes such as genes in <italic>OR, TAS, IGH, IGK, IGL, HLA</italic> family, and genes in the list of <italic>HLA</italic> (See appendix) were excluded from the analysis. Odor genes and taste receptor (TAS) (<xref ref-type="bibr" rid="B20">Keller et al., 2007</xref>; <xref ref-type="bibr" rid="B5">Carrai et al., 2017</xref>), HLA molecules (<xref ref-type="bibr" rid="B8">Dendrou et al., 2018</xref>), IGK rearrangements (<xref ref-type="bibr" rid="B18">Jackson et al., 2012</xref>), IgH class switching (<xref ref-type="bibr" rid="B27">Stavnezer and Schrader, 2014</xref>), Immunoglobulin light (IgL) chains (<xref ref-type="bibr" rid="B9">Edholm et al., 2011</xref>) were proven to be highly variable in all individuals. Their variability could potentially interfere with the GO analysis. By excluding these genes, we aimed to focus on identifying less variable but significant genes associated with the progression of stomach adenocarcinoma (STAD) and determining corresponding significant functional terms.</p>
</sec>
<sec id="s2-6">
<title>Enrichment analysis of differentially expressed genes</title>
<p>After modification, GO analysis was performed on GProfiler (<xref ref-type="bibr" rid="B24">Raudvere et al., 2019</xref>). The website was available at: <ext-link ext-link-type="uri" xlink:href="https://biit.cs.ut.ee/gprofiler/gost">https://biit.cs.ut.ee/gprofiler/gost</ext-link>. We tested the statistical enrichment of expression genes related to hotspots in the KEGG path. The ratio of intersection size to term size was calculated. Terms with the top 10 ratio value were selected to be graphed in a bubble graph. The adjusted <italic>p</italic>-value of all terms was smaller than 0.05, which illustrated that the terms were significantly enriched by expressed genes.</p>
</sec>
<sec id="s2-7">
<title>Statistical analysis</title>
<p>Violin plots visualized the relationships between the expression of significant genes identified in GO analysis and SCNA burden. The patients were divided into two groups, namely, &#x201c;High&#x201d; and &#x201c;Low&#x201d;, based on their median SCNA burdens. Both consisted of an equal number of patients. In addition, T-tests demonstrated the relationship between the APOBEC family copy number statuses and SCNA burden. If the start point of the SCNA was smaller than the endpoint of the analyzed <italic>APOBEC</italic> gene, and the endpoint of the SCNA exceeded the start of the <italic>APOBEC</italic> gene, patients with this SCNA were classified as &#x201c;Mutant&#x201d;, while the other patients were classified as &#x2018;Wild&#x2019;. T-tests with <italic>p</italic>-values less than 0.05 were considered statistically significant, and corresponding genes were deemed significant.</p>
</sec>
<sec id="s2-8">
<title>Survival analysis</title>
<p>To explore the relationship between SCNA burden and overall survival, we utilized the R packages survival (version 3.4-0) and survminer (version 0.4.9) to generate Kaplan-Meier curves. Furthermore, patients were categorized based on the expression of <italic>APOBEC</italic> genes for survival analysis. Each survival analysis comprised three groups: Low, Median, and High. Results with <italic>p</italic>-values smaller than 0.05 were deemed statistically significant.</p>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Result</title>
<sec id="s3-1">
<title>Calculation of &#x3bb;</title>
<p>According to data from TCGA and CCLE, the majority of the log2(&#x3bb;) values in Set1 (Width &#x3e;100&#xa0;kb) ranged from 0 to 0.5, while most of the log2(&#x3bb;) values in Set2 (Width&#x2264;100&#xa0;kb) were close to 0 (<xref ref-type="fig" rid="F2">Figures 2A&#x2013;D</xref>). Estimation revealed that the minimum value of &#x3bb; in hotspots of TCGA Set1 was around 1.4, while the CCLE Set1 was around 1.6. In TCGA Set2 and CCLE Set2, the minimum value may be 1. Since the minimum value of Set2 was close to 1, which represents the overall minimum value of &#x3bb;, the Manhattan plot of Set2 won&#x2019;t be displayed. The Manhattan plot of Set1 with a threshold value of approximately 1.4 and 1.6, respectively, was plotted (<xref ref-type="fig" rid="F2">Figures 2E, F</xref>).</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Histogram of log2(&#x3bb;), Manhattan plot of &#x3bb; in Set1 and Top 10 terms of GO analysis in two sets of experiments. <bold>(A)</bold> Histogram of log2(&#x3bb;) in Set1 for TCGA data set. <bold>(B)</bold> Histogram of log2(&#x3bb;) in Set2 for TCGA data set. <bold>(C)</bold> Histogram of log2(&#x3bb;) in Set1 for CCLE data set. <bold>(D)</bold> Histogram of log2(&#x3bb;) in Set2 for CCLE data set. <bold>(E)</bold> Manhattan plot of &#x3bb; in Set1 for TCGA data set. The red points which are larger than other colorful points below are &#x3bb; values that are considered as hotspots in each chromosome. The corresponding hotspots containing <italic>APOBEC</italic> genes have a relative low value of &#x3bb; than other &#x3bb; of hotspots. The red arrows represent the values of &#x3bb; of hotspots which contain genes relative to DNA cytosine deamination found in GO analysis. <bold>(F)</bold> Manhattan plot of &#x3bb; in Set1 for TCGA data set. The red points which are larger than other colorful points below are &#x3bb; values that are considered as hotspots in each chromosome. The corresponding hotspots containing <italic>APOBEC</italic> genes have a relative low value of &#x3bb; than other &#x3bb; of hotspots. The red arrows represent the values of &#x3bb; of hotspots which contain genes relative to DNA cytosine deamination found in GO analysis. <bold>(G)</bold> Top 10 terms of GO analysis in Set1 and Set2 for TCGA data set. <bold>(H)</bold> Top 10 terms of GO analysis in Set1 and Set2 for CCLE data set.</p>
</caption>
<graphic xlink:href="fgene-14-1231415-g002.tif"/>
</fig>
</sec>
<sec id="s3-2">
<title>145 hotspots and relevant genes</title>
<p>A total of 145 hotspots were identified from the analysis of 2,897 windows in Set1 and Set2. In TCGA Set1, 1,437 genes were found within these hotspots, and after modification, 1,230 genes remained. Similarly, in TCGA Set2, 452 genes were located within the hotspots, and 395 genes remained after modification. In CCLE-Set1, 1,462 genes were found within these hotspots, and after filtering for genes whose variability could potentially interfere with the GO analysis, 1,265 genes remained. In CCLE Set2, 1,458 genes were found within these hotspots, and 1,336 genes were left after the filtering (<xref ref-type="table" rid="T2">Table 2</xref>). The complete list of genes before and after modification for both sets of experiments can be found in the appendices, along with additional information on the hotspots.</p>
<table-wrap id="T2" position="float">
<label>TABLE 2</label>
<caption>
<p>Minimum &#x3bb; of hotspots, number of relevant genes before and after modification in two sets of experiments. Minimum &#x3bb; of hotspots, number of relevant genes in two sets of experiments.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Experiment</th>
<th align="left">Minimum &#x3bb; of hotspots</th>
<th align="left">Number of associated genes (before modification)</th>
<th align="left">Number of associated genes (after modification)</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">Set1</td>
<td align="left">1.342</td>
<td align="left">1,437</td>
<td align="left">1,230</td>
</tr>
<tr>
<td align="left">Set2</td>
<td align="left">1</td>
<td align="left">1,337</td>
<td align="left">1,277</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s3-3">
<title>
<italic>APOBEC</italic>-related DNA cytosine deamination was overrepresented in genes affected by DSBs</title>
<p>The GO analysis revealed that DNA Cytosine Deamination appeared among the top 10 terms in Set1. However, the top 10 terms in Set2 did not show any significant functions, leading to the exclusion of Set2 from further analysis. Interestingly, the GO analysis also highlighted the presence of genes belonging to the <italic>APOBEC</italic> family in both TCGA-Set1 and CCLE-Set1. These <italic>APOBEC</italic> genes were found to be associated with hotspots that exhibited a relatively low value of &#x3bb; compared to other hotspots. The analysis revealed that these genes were predominantly located in hotspots with &#x3bb; values around 1.4 and 1.6 in two sets of data, respectively (<xref ref-type="fig" rid="F2">Figures 2G, H</xref>).</p>
</sec>
<sec id="s3-4">
<title>Association between the SCNA burden and APOBEC family copy number statuses, expression of <italic>APOBEC</italic> gene</title>
<p>In TCGA data, the results indicated a significant association between the expression levels of <italic>APOBEC3C, APOBEC3D, APOBEC3F, APOBEC3G, APOBEC3H</italic>, and SCNA burden (<xref ref-type="fig" rid="F3">Figures 3A&#x2013;E</xref>). The findings also revealed a significant association between DSB points affecting <italic>APOBEC3D</italic> and SCNA burdens (<xref ref-type="fig" rid="F3">Figure 3F</xref>). In CCLE data, the results showed a significant association between the expression levels of <italic>APOBEC3D, APOBEC3F, APOBEC3G, APOBEC3H,</italic> and SCNA burdens (<xref ref-type="fig" rid="F3">Figures 3G&#x2013;J</xref>).</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Significant violin plots that illustrate relationship between expression of <italic>APOBEC</italic> family genes and SCNA burden, DSB points affecting <italic>APOBEC</italic> genes and SCNA burden. The expression of <italic>APOBEC3C, APOBEC3D, APOBEC3F, APOBEC3G, APOBEC3H</italic> have relationship with SCNA burden. The DSB points affecting <italic>APOBEC3D</italic> have relationship with SCNA burden. <bold>(A)</bold> The violin plot of <italic>APOBEC3C</italic> for TCGA data set. <bold>(B)</bold> The violin plot of <italic>APOBEC3D</italic> for TCGA data set. <bold>(C)</bold> The violin plot of <italic>APOBEC3F</italic> for TCGA data set. <bold>(D)</bold> The violin plot of <italic>APOBEC3G</italic> for TCGA data set. <bold>(E)</bold> The violin plot of <italic>APOBEC3H</italic> for TCGA data set. <bold>(F)</bold> The violin plot of SCNA burden <italic>versus</italic> whether DSB points happened in <italic>APOBEC3D</italic> for TCGA data set. <bold>(G)</bold> The violin plot of <italic>APOBEC3D</italic> for CCLE data set. <bold>(H)</bold> The violin plot of <italic>APOBEC3F</italic> for CCLE data set. <bold>(I)</bold> The violin plot of <italic>APOBEC3G</italic> for CCLE data set. <bold>(J)</bold> The violin plot of <italic>APOBEC3H</italic> for CCLE data set.</p>
</caption>
<graphic xlink:href="fgene-14-1231415-g003.tif"/>
</fig>
</sec>
<sec id="s3-5">
<title>Significance of SCNA burden and <italic>APOBEC3C</italic> expression on the prognosis of STAD patients</title>
<p>The cohort of 332 patients from TCGA was divided into three groups based on their SCNA burden. The survival analysis demonstrated that patients with higher SCNA burden tended to better overall survival compared to those with lower SCNA burden (<xref ref-type="fig" rid="F4">Figures 4A, B</xref>). Additionally, we investigated the correlation between the expression of specific <italic>APOBEC</italic> genes and the overall survival of STAD patients. Similarly, the cohort of 332 patients was divided into three groups based on SCNA burden. The results indicated that patients with median expression of <italic>APOBEC3C</italic> tended to better overall survival, as opposed to those with high or low expression (<xref ref-type="fig" rid="F4">Figures 4C, D</xref>).</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>The Kaplan&#x2013;Meier curve of patients of three groups and histogram of SCNA burden, histogram of log2 (expression of <italic>APOBEC3C</italic>). The patients were grouped in &#x201c;SCNA burden low&#x201d;, &#x201c;SCNA burden median&#x201d;, &#x201c;SCNA burden high&#x201d; and &#x201c;Expression low&#x201d;, &#x201c;Expression median&#x201d;, &#x201c;Expression high&#x201d; according to the expression of <italic>APOBEC</italic> genes, respectively. The result showed that patients with more SCNA tended to have better overall survival. Patients with median expression of <italic>APOBEC3C</italic> tended to have better overall survival. <bold>(A)</bold> The Kaplan&#x2013;Meier curve of patients grouped in &#x201c;SCNA burden low&#x201d;, &#x201c;SCNA burden median&#x201d;, &#x201c;SCNA burden high&#x201d;. <bold>(B)</bold> Histogram of SCNA burden. <bold>(C)</bold> The Kaplan&#x2013;Meier curve of patients grouped in &#x201c;Expression low&#x201d;, &#x201c;Expression median&#x201d;, &#x201c;Expression high&#x201d;. <bold>(D)</bold> Histogram of log2 (expression of <italic>APOBEC3C</italic>).</p>
</caption>
<graphic xlink:href="fgene-14-1231415-g004.tif"/>
</fig>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>The distribution of SCNAs has been extensively studied and found to be associated with various abnormal conditions, including autism spectrum disorder and Adrenocortical Carcinomas (<xref ref-type="bibr" rid="B11">Girirajan et al., 2013</xref>; <xref ref-type="bibr" rid="B14">Gupta et al., 2022</xref>). However, the relationship between the prognosis of STAD patients and hotspots of SCNAs remained unclear. Identifying genomic regions undergoing frequent alteration in human cancers was a powerful way to discover genes playing significant roles in oncogenesis (<xref ref-type="bibr" rid="B2">Beroukhim et al., 2010</xref>). By identifying genes located in the hotspots of SCNAs in STAD patients and understanding their functions, we may uncover new mechanisms of STAD development, novel cell signaling pathways, and targeted therapeutic approaches that can improve the precision of treatment for patients. It is worth noting that many SCNAs identified in one cancer type are present in multiple other cancer types (<xref ref-type="bibr" rid="B2">Beroukhim et al., 2010</xref>). In the case of <italic>APOBEC</italic> genes, it was proven that a common deletion in the <italic>APOBEC3</italic> gene was strongly associated with Breast Cancer risk (<xref ref-type="bibr" rid="B21">Long et al., 2013</xref>). Notably a signature attributed to the <italic>APOBEC</italic> family of cytidine deaminases was present in many cancer types (<xref ref-type="bibr" rid="B1">Alexandrov et al., 2013</xref>; <xref ref-type="bibr" rid="B12">Guo et al., 2018</xref>). Therefore, further research on SCNAs in STAD and <italic>APOBEC</italic> genes may yield valuable insights into the variations observed in other cancer types, ultimately advancing the overall treatment of various cancers. While the methods employed in this study may appear straightforward, there are still certain aspects that warrant discussion.</p>
<p>First, the result of GO analysis in Set1 indicated that genes related to DNA Cytosine Deamination may be highly variable in STAD patients. Previous research has demonstrated the beneficial role of DNA Cytosine Deamination in immunity and its detrimental effects on cancer (<xref ref-type="bibr" rid="B26">Shi et al., 2017</xref>). <italic>APOBEC3</italic>-mediated mutagenesis has been observed in various cancers (<xref ref-type="bibr" rid="B3">Butler and Banday, 2023</xref>). However, the specific mechanism underlying this association remains poorly understood. The DSB points that affect <italic>APOBEC3D</italic> appear to be linked to the SCNA burden, although the precise pattern of this relationship is still unknown. Future research should focus on unraveling the mechanisms through which DNA Cytosine Deamination impacts STAD patients.</p>
<p>Second, the findings of the study suggested a phenomenon that a higher burden of SCNAs in STAD patients was associated with improved clinical performance. This indicated that SCNAs may confer a survival advantage in STAD patients. Previous research showed that in the high tumor mutational burden group, nivolumab plus ipilimumab seemed to provide better clinical performance than nivolumab monotherapy (<xref ref-type="bibr" rid="B16">Hellmann et al., 2018</xref>). It was also shown that higher nonsynonymous mutation burden in tumors had a relationship with durable clinical benefit (<xref ref-type="bibr" rid="B25">Rizvi et al., 2015</xref>). However, the specific mechanism underlying the association requires further investigation. Interestingly, the study also observed that a median expression of <italic>APOBEC3C</italic> was associated with better clinical outcomes in STAD patients. This suggests that the expression of <italic>APOBEC3C</italic> may play a role in influencing the prognosis of STAD patients. Future research should focus on exploring the expression patterns of <italic>APOBEC3C</italic> and investigating its mechanism of action in affecting the prognosis of STAD patients.</p>
<p>Finally, the results of the analysis with TCGA data and CCLE data seemed to be the same. In the analysis for TCGA data, there was a significant association between the expression levels of <italic>APOBEC3C, APOBEC3D, APOBEC3F, APOBEC3G, APOBEC3H,</italic> and SCNA burden while the results showed a significant association between the expression levels of <italic>APOBEC3D, APOBEC3F, APOBEC3G, APOBEC3H,</italic> and SCNA burden with CCLE data. Both the result of GO Analysis for TCGA data and CCLE data revealed the significance of DNA Cytosine Deamination, which may suggest the endogeneity of DNA Cytosine Deamination. Future research should focus on the mechanism of DNA Cytosine Deamination.</p>
</sec>
<sec sec-type="conclusion" id="s5">
<title>Conclusion</title>
<p>This research study identified a total of 145 hotspots associated with <italic>APOBEC</italic>-related DNA Cytosine Deamination in patients with STAD. These hotspots were found to be linked to recurrent SCNAs, suggesting a potential role in the progression of STAD. As a result, the <italic>APOBEC</italic> genes have emerged as potential targets for the treatment of STAD.</p>
</sec>
</body>
<back>
<sec sec-type="data-availability" id="s6">
<title>Data availability statement</title>
<p>The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="s7">
<title>Ethics statement</title>
<p>Ethical approval was not required for the study involving humans in accordance with the local legislation and institutional requirements. Written informed consent to participate in this study was not required from the participants or the participants&#x2019; legal guardians/next of kin in accordance with the national legislation and the institutional requirements. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.</p>
</sec>
<sec id="s8">
<title>Author contributions</title>
<p>YS contributed to conception and design of the study. He organized the database, performed the statistical analysis, wrote the first draft of the manuscript. HS proposed the idea, reviewed the manuscript and supervised the Study. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec sec-type="COI-statement" id="s9">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s10">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Alexandrov</surname>
<given-names>L. B.</given-names>
</name>
<name>
<surname>Nik-Zainal</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Wedge</surname>
<given-names>D. C.</given-names>
</name>
<name>
<surname>Aparicio</surname>
<given-names>S. A. J. R.</given-names>
</name>
<name>
<surname>Behjati</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Biankin</surname>
<given-names>A. V.</given-names>
</name>
<etal/>
</person-group> (<year>2013</year>). <article-title>Signatures of mutational processes in human cancer</article-title>. <source>Nature</source> <volume>500</volume> (<issue>7463</issue>), <fpage>415</fpage>&#x2013;<lpage>421</lpage>. <pub-id pub-id-type="doi">10.1038/nature12477</pub-id>
</citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Beroukhim</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Mermel</surname>
<given-names>C. H.</given-names>
</name>
<name>
<surname>Porter</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Wei</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Raychaudhuri</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Donovan</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2010</year>). <article-title>The landscape of somatic copy-number alteration across human cancers</article-title>. <source>Nature</source> <volume>463</volume> (<issue>7283</issue>), <fpage>899</fpage>&#x2013;<lpage>905</lpage>. <pub-id pub-id-type="doi">10.1038/nature08822</pub-id>
</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Butler</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Banday</surname>
<given-names>A. R.</given-names>
</name>
</person-group> (<year>2023</year>). <article-title>APOBEC3-mediated mutagenesis in cancer: causes, clinical significance and therapeutic potential</article-title>. <source>J. Hematol. Oncol.</source> <volume>16</volume> (<issue>1</issue>), <fpage>31</fpage>. <pub-id pub-id-type="doi">10.1186/s13045-023-01425-5</pub-id>
</citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Carr</surname>
<given-names>A. M.</given-names>
</name>
<name>
<surname>Lambert</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Replication stress-induced genome instability: the dark side of replication maintenance by homologous recombination</article-title>. <source>J. Mol. Biol.</source> <volume>425</volume> (<issue>23</issue>), <fpage>4733</fpage>&#x2013;<lpage>4744</lpage>. <pub-id pub-id-type="doi">10.1016/j.jmb.2013.04.023</pub-id>
</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Carrai</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Campa</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Vodicka</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Flamini</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Martelli</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Slyskova</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Association between taste receptor (TAS) genes and the perception of wine characteristics</article-title>. <source>Sci. Rep.</source> <volume>7</volume> (<issue>1</issue>), <fpage>9239</fpage>&#x2013;<lpage>9247</lpage>. <pub-id pub-id-type="doi">10.1038/s41598-017-08946-3</pub-id>
</citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Carter</surname>
<given-names>S. L.</given-names>
</name>
<name>
<surname>Eklund</surname>
<given-names>A. C.</given-names>
</name>
<name>
<surname>Kohane</surname>
<given-names>I. S.</given-names>
</name>
<name>
<surname>Harris</surname>
<given-names>L. N.</given-names>
</name>
<name>
<surname>Szallasi</surname>
<given-names>Z.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>A signature of chromosomal instability inferred from gene expression profiles predicts clinical outcome in multiple human cancers</article-title>. <source>Nat. Genet.</source> <volume>38</volume> (<issue>9</issue>), <fpage>1043</fpage>&#x2013;<lpage>1048</lpage>. <pub-id pub-id-type="doi">10.1038/ng1861</pub-id>
</citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Jin</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Ding</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>H.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>Identification of tumor mutation burden, microsatellite instability, and somatic copy number alteration derived nine gene signatures to predict clinical outcomes in STAD</article-title>. <source>Front. Mol. Biosci.</source> <volume>9</volume>, <fpage>793403</fpage>. <pub-id pub-id-type="doi">10.3389/fmolb.2022.793403</pub-id>
</citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dendrou</surname>
<given-names>C. A.</given-names>
</name>
<name>
<surname>Petersen</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Rossjohn</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Fugger</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>HLA variation and disease</article-title>. <source>Nat. Rev. Immunol.</source> <volume>18</volume> (<issue>5</issue>), <fpage>325</fpage>&#x2013;<lpage>339</lpage>. <pub-id pub-id-type="doi">10.1038/nri.2017.143</pub-id>
</citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Edholm</surname>
<given-names>E. S.</given-names>
</name>
<name>
<surname>Wilson</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Bengten</surname>
<given-names>E.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Immunoglobulin light (IgL) chains in ectothermic vertebrates</article-title>. <source>Dev. Comp. Immunol.</source> <volume>35</volume> (<issue>9</issue>), <fpage>906</fpage>&#x2013;<lpage>915</lpage>. <pub-id pub-id-type="doi">10.1016/j.dci.2011.01.012</pub-id>
</citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fu</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Peng</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Luo</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>W.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>Development of tumor mutation burden-related prognostic model and novel biomarker identification in stomach adenocarcinoma</article-title>. <source>Front. Cell Dev. Biol.</source> <volume>10</volume>, <fpage>790920</fpage>. <pub-id pub-id-type="doi">10.3389/fcell.2022.790920</pub-id>
</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Girirajan</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Dennis</surname>
<given-names>M. Y.</given-names>
</name>
<name>
<surname>Baker</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Malig</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Coe</surname>
<given-names>B. P.</given-names>
</name>
<name>
<surname>Campbell</surname>
<given-names>C. D.</given-names>
</name>
<etal/>
</person-group> (<year>2013</year>). <article-title>Refinement and discovery of new hotspots of copy-number variation associated with autism spectrum disorder</article-title>. <source>Am. J. Hum. Genet.</source> <volume>92</volume> (<issue>2</issue>), <fpage>221</fpage>&#x2013;<lpage>237</lpage>. <pub-id pub-id-type="doi">10.1016/j.ajhg.2012.12.016</pub-id>
</citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Guo</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>H.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Germline and somatic variations influence the somatic mutational signatures of esophageal squamous cell carcinomas in a Chinese population</article-title>. <source>BMC Genomics</source> <volume>19</volume> (<issue>1</issue>), <fpage>538</fpage>. <pub-id pub-id-type="doi">10.1186/s12864-018-4906-4</pub-id>
</citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Guo</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Sztupinszki</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Borcsok</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Genetic determinants of somatic selection of mutational processes in 3,566 human cancers</article-title>. <source>Cancer Res.</source> <volume>81</volume> (<issue>16</issue>), <fpage>4205</fpage>&#x2013;<lpage>4217</lpage>. <pub-id pub-id-type="doi">10.1158/0008-5472.CAN-21-0086</pub-id>
</citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gupta</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Won</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Chadalavada</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Nanjangud</surname>
<given-names>G. J.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Y. B.</given-names>
</name>
<name>
<surname>Al-Ahmadie</surname>
<given-names>H. A.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>TERT copy number alterations, promoter mutations and rearrangements in adrenocortical carcinomas</article-title>. <source>Endocr. Pathol.</source> <volume>33</volume> (<issue>2</issue>), <fpage>304</fpage>&#x2013;<lpage>314</lpage>. <pub-id pub-id-type="doi">10.1007/s12022-021-09691-0</pub-id>
</citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hastings</surname>
<given-names>P. J.</given-names>
</name>
<name>
<surname>Lupski</surname>
<given-names>J. R.</given-names>
</name>
<name>
<surname>Rosenberg</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Ira</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Mechanisms of change in gene copy number</article-title>. <source>Nat. Rev. Genet.</source> <volume>10</volume> (<issue>8</issue>), <fpage>551</fpage>&#x2013;<lpage>564</lpage>. <pub-id pub-id-type="doi">10.1038/nrg2593</pub-id>
</citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hellmann</surname>
<given-names>M. D.</given-names>
</name>
<name>
<surname>Callahan</surname>
<given-names>M. K.</given-names>
</name>
<name>
<surname>Awad</surname>
<given-names>M. M.</given-names>
</name>
<name>
<surname>Calvo</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Ascierto</surname>
<given-names>P. A.</given-names>
</name>
<name>
<surname>Atmaca</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Tumor mutational burden and efficacy of nivolumab monotherapy and in combination with ipilimumab in small-cell lung cancer</article-title>. <source>Cancer Cell</source> <volume>33</volume> (<issue>5</issue>), <fpage>853</fpage>&#x2013;<lpage>861.e4</lpage>. <pub-id pub-id-type="doi">10.1016/j.ccell.2018.04.001</pub-id>
</citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hovhannisyan</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Harutyunyan</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Aroutiounian</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Liehr</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>DNA copy number variations as markers of mutagenic impact</article-title>. <source>Int. J. Mol. Sci.</source> <volume>20</volume> (<issue>19</issue>), <fpage>4723</fpage>. <pub-id pub-id-type="doi">10.3390/ijms20194723</pub-id>
</citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jackson</surname>
<given-names>K. J. L.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Gaeta</surname>
<given-names>B. A.</given-names>
</name>
<name>
<surname>Pomat</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Siba</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Rimmer</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>Divergent human populations show extensive shared IGK rearrangements in peripheral blood B cells</article-title>. <source>Immunogenetics</source> <volume>64</volume> (<issue>1</issue>), <fpage>3</fpage>&#x2013;<lpage>14</lpage>. <pub-id pub-id-type="doi">10.1007/s00251-011-0559-z</pub-id>
</citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jamal-Hanjani</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Wilson</surname>
<given-names>G. A.</given-names>
</name>
<name>
<surname>McGranahan</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Birkbak</surname>
<given-names>N. J.</given-names>
</name>
<name>
<surname>Watkins</surname>
<given-names>T. B. K.</given-names>
</name>
<name>
<surname>Veeriah</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Tracking the evolution of non-small-cell lung cancer</article-title>. <source>N. Engl. J. Med.</source> <volume>376</volume> (<issue>22</issue>), <fpage>2109</fpage>&#x2013;<lpage>2121</lpage>. <pub-id pub-id-type="doi">10.1056/NEJMoa1616288</pub-id>
</citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Keller</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Zhuang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Chi</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Vosshall</surname>
<given-names>L. B.</given-names>
</name>
<name>
<surname>Matsunami</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Genetic variation in a human odorant receptor alters odour perception</article-title>. <source>Nature</source> <volume>449</volume> (<issue>7161</issue>), <fpage>468</fpage>&#x2013;<lpage>472</lpage>. <pub-id pub-id-type="doi">10.1038/nature06162</pub-id>
</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Long</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Delahanty</surname>
<given-names>R. J.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Gao</surname>
<given-names>Y. T.</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Cai</surname>
<given-names>Q.</given-names>
</name>
<etal/>
</person-group> (<year>2013</year>). <article-title>A common deletion in the APOBEC3 genes and breast cancer risk</article-title>. <source>J. Natl. Cancer Inst.</source> <volume>105</volume> (<issue>8</issue>), <fpage>573</fpage>&#x2013;<lpage>579</lpage>. <pub-id pub-id-type="doi">10.1093/jnci/djt018</pub-id>
</citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mac&#xe9;</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Kutalik</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Valsesia</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Copy number variation</article-title>. <source>Methods Mol. Biol.</source> <volume>1793</volume>, <fpage>231</fpage>&#x2013;<lpage>258</lpage>. <pub-id pub-id-type="doi">10.1007/978-1-4939-7868-7_14</pub-id>
</citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>McCarroll</surname>
<given-names>S. A.</given-names>
</name>
<name>
<surname>Kuruvilla</surname>
<given-names>F. G.</given-names>
</name>
<name>
<surname>Korn</surname>
<given-names>J. M.</given-names>
</name>
<name>
<surname>Cawley</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Nemesh</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wysoker</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2008</year>). <article-title>Integrated detection and population-genetic analysis of SNPs and copy number variation</article-title>. <source>Nat. Genet.</source> <volume>40</volume> (<issue>10</issue>), <fpage>1166</fpage>&#x2013;<lpage>1174</lpage>. <pub-id pub-id-type="doi">10.1038/ng.238</pub-id>
</citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Raudvere</surname>
<given-names>U.</given-names>
</name>
<name>
<surname>Kolberg</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Kuzmin</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Arak</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Adler</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Peterson</surname>
<given-names>H.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>g:Profiler: a web server for functional enrichment analysis and conversions of gene lists (2019 update)</article-title>. <source>Nucleic acids Res.</source> <volume>47</volume> (<issue>W1</issue>), <fpage>W191</fpage>&#x2013;<lpage>W198</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkz369</pub-id>
</citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rizvi</surname>
<given-names>N. A.</given-names>
</name>
<name>
<surname>Hellmann</surname>
<given-names>M. D.</given-names>
</name>
<name>
<surname>Snyder</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Kvistborg</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Makarov</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Havel</surname>
<given-names>J. J.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>Cancer immunology. Mutational landscape determines sensitivity to PD-1 blockade in non-small cell lung cancer</article-title>. <source>Science</source> <volume>348</volume> (<issue>6230</issue>), <fpage>124</fpage>&#x2013;<lpage>128</lpage>. <pub-id pub-id-type="doi">10.1126/science.aaa1348</pub-id>
</citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shi</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Carpenter</surname>
<given-names>M. A.</given-names>
</name>
<name>
<surname>Banerjee</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Shaban</surname>
<given-names>N. M.</given-names>
</name>
<name>
<surname>Kurahashi</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Salamango</surname>
<given-names>D. J.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Structural basis for targeted DNA cytosine deamination and mutagenesis by APOBEC3A and APOBEC3B</article-title>. <source>Nat. Struct. Mol. Biol.</source> <volume>24</volume> (<issue>2</issue>), <fpage>131</fpage>&#x2013;<lpage>139</lpage>. <pub-id pub-id-type="doi">10.1038/nsmb.3344</pub-id>
</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Stavnezer</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Schrader</surname>
<given-names>C. E.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>IgH chain class switch recombination: mechanism and regulation</article-title>. <source>J. Immunol.</source> <volume>193</volume> (<issue>11</issue>), <fpage>5370</fpage>&#x2013;<lpage>5378</lpage>. <pub-id pub-id-type="doi">10.4049/jimmunol.1401849</pub-id>
</citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Steele</surname>
<given-names>C. D.</given-names>
</name>
<name>
<surname>Abbasi</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Islam</surname>
<given-names>S. M. A.</given-names>
</name>
<name>
<surname>Bowes</surname>
<given-names>A. L.</given-names>
</name>
<name>
<surname>Khandekar</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Haase</surname>
<given-names>K.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>Signatures of copy number alterations in human cancer</article-title>. <source>Nature</source> <volume>606</volume> (<issue>7916</issue>), <fpage>984</fpage>&#x2013;<lpage>991</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-022-04738-6</pub-id>
</citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>van Opheusden</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Acerbi</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Ma</surname>
<given-names>W. J.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Unbiased and efficient log-likelihood estimation with inverse binomial sampling</article-title>. <source>PLoS Comput. Biol.</source> <volume>16</volume> (<issue>12</issue>), <fpage>e1008483</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1008483</pub-id>
</citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Watkins</surname>
<given-names>T. B. K.</given-names>
</name>
<name>
<surname>Lim</surname>
<given-names>E. L.</given-names>
</name>
<name>
<surname>Petkovic</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Elizalde</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Birkbak</surname>
<given-names>N. J.</given-names>
</name>
<name>
<surname>Wilson</surname>
<given-names>G. A.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Pervasive chromosomal instability and karyotype order in tumour evolution</article-title>. <source>Nature</source> <volume>587</volume> (<issue>7832</issue>), <fpage>126</fpage>&#x2013;<lpage>132</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-020-2698-6</pub-id>
</citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yicheng</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Xin</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Tian</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Huilin</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Association of FLG mutation with tumor mutation load and clinical outcomes in patients with gastric cancer</article-title>. <source>Front. Genet.</source> <volume>13</volume>, <fpage>808542</fpage>. <pub-id pub-id-type="doi">10.3389/fgene.2022.808542</pub-id>
</citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhou</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>H-F.</given-names>
</name>
<name>
<surname>Feng</surname>
<given-names>Y-J.</given-names>
</name>
<name>
<surname>Teng</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Exploring TCGA database for identification of potential prognostic genes in stomach adenocarcinoma</article-title>. <source>Cancer Cell Int.</source> <volume>20</volume> (<issue>1</issue>), <fpage>264</fpage>. <pub-id pub-id-type="doi">10.1186/s12935-020-01351-3</pub-id>
</citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhu</surname>
<given-names>Y. H.</given-names>
</name>
<name>
<surname>Jeong</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Jin</surname>
<given-names>Z. Y.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>J. Y.</given-names>
</name>
<name>
<surname>Han</surname>
<given-names>R. Q.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Dietary intake of fatty acids, total cholesterol, and stomach cancer in a Chinese population</article-title>. <source>Nutrients</source> <volume>11</volume> (<issue>8</issue>), <fpage>1730</fpage>. <pub-id pub-id-type="doi">10.3390/nu11081730</pub-id>
</citation>
</ref>
</ref-list>
</back>
</article>