<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" dtd-version="2.3" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fgene.2021.642759</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>The Analysis of Gene Expression Data Incorporating Tumor Purity Information</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Ahn</surname> <given-names>Seungjun</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/1134434/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Grimes</surname> <given-names>Tyler</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/1200747/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Datta</surname> <given-names>Somnath</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1147229/overview"/>
</contrib>
</contrib-group>
<aff><institution>Department of Biostatistics, University of Florida</institution>, <addr-line>Gainesville, FL</addr-line>, <country>United States</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: D. P. Kreil, Boku University, Vienna, Austria</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Binbin Wang, National Cancer Institute, National Institutes of Health (NIH), United States; Helen Piontkivska, Kent State University, United States</p></fn>
<corresp id="c001">&#x002A;Correspondence: Somnath Datta, <email>somnath.datta@ufl.edu</email></corresp>
<fn fn-type="other" id="fn004"><p>This article was submitted to Computational Genomics, a section of the journal Frontiers in Genetics</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>23</day>
<month>08</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>12</volume>
<elocation-id>642759</elocation-id>
<history>
<date date-type="received">
<day>16</day>
<month>12</month>
<year>2020</year>
</date>
<date date-type="accepted">
<day>30</day>
<month>07</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2021 Ahn, Grimes and Datta.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Ahn, Grimes and Datta</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>The tumor microenvironment is composed of tumor cells, stroma cells, immune cells, blood vessels, and other associated non-cancerous cells. Gene expression measurements on tumor samples are an average over cells in the microenvironment. However, research questions often seek answers about tumor cells rather than the surrounding non-tumor tissue. Previous studies have suggested that the tumor purity (TP)&#x2014;the proportion of tumor cells in a solid tumor sample&#x2014;has a confounding effect on differential expression (DE) analysis of high vs. low survival groups. We investigate three ways incorporating the TP information in the two statistical methods used for analyzing gene expression data, namely, differential network (DN) analysis and DE analysis. Analysis 1 ignores the TP information completely, Analysis 2 uses a truncated sample by removing the low TP samples, and Analysis 3 uses TP as a covariate in the underlying statistical models. We use three gene expression data sets related to three different cancers from the Cancer Genome Atlas (TCGA) for our investigation. The networks from Analysis 2 have greater amount of differential connectivity in the two networks than that from Analysis 1 in all three cancer datasets. Similarly, Analysis 1 identified more differentially expressed genes than Analysis 2. Results of DN and DE analyses using Analysis 3 were mostly consistent with those of Analysis 1 across three cancers. However, Analysis 3 identified additional cancer-related genes in both DN and DE analyses. Our findings suggest that using TP as a covariate in a linear model is appropriate for DE analysis, but a more robust model is needed for DN analysis. However, because true DN or DE patterns are not known for the empirical datasets, simulated datasets can be used to study the statistical properties of these methods in future studies.</p>
</abstract>
<kwd-group>
<kwd>tumor purity</kwd>
<kwd>RNA-seq data</kwd>
<kwd>differential network analysis</kwd>
<kwd>differential gene expression analysis</kwd>
<kwd>gene expression data</kwd>
<kwd>confounding effects</kwd>
</kwd-group>
<counts>
<fig-count count="6"/>
<table-count count="6"/>
<equation-count count="0"/>
<ref-count count="52"/>
<page-count count="10"/>
<word-count count="0"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1">
<title>Introduction</title>
<p>The tumor microenvironment (TME) is composed of tumor cells, stroma cells, immune cells, blood vessels, and other associated non-cancerous cells. It is recognized that TME is a key contributor to tumor growth, progression, and metastasis (<xref ref-type="bibr" rid="B31">Quail and Joyce, 2013</xref>; <xref ref-type="bibr" rid="B43">Turley et al., 2015</xref>). Advances in high-throughput sequencing technologies have enabled a comprehensive view of this heterogeneous collection of cells. The tumor purity (TP) is defined as the proportion of tumor cells in a solid tumor sample. TP is important to know because it contributes to a better prediction of prognosis and clinical management (<xref ref-type="bibr" rid="B25">Mao et al., 2018</xref>; <xref ref-type="bibr" rid="B10">Gong et al., 2020</xref>). It also plays a crucial role in classifying cancer subtypes (<xref ref-type="bibr" rid="B51">Zhang et al., 2017</xref>).</p>
<p>Conventionally, the TP is estimated through a visual inspection of tumor specimens between trained pathologists (<xref ref-type="bibr" rid="B32">Rajan et al., 2004</xref>), which can cause a poor interrater agreement and be time-consuming for large studies (<xref ref-type="bibr" rid="B49">Yuan et al., 2012</xref>; <xref ref-type="bibr" rid="B13">Haider et al., 2020</xref>). Researchers have been investigating the estimation of TP directly from data. Several studies have proposed methods of estimating the TP in DNA methylation data (updated version of InfiniumPurify; <xref ref-type="bibr" rid="B52">Zheng et al., 2017</xref>), DNA somatic copy number data (ABSOLUTE algorithm; <xref ref-type="bibr" rid="B6">Carter et al., 2012</xref>), high-throughput DNA-sequencing data (Tumor Heterogeneity Analysis algorithm; <xref ref-type="bibr" rid="B28">Oesper et al., 2013</xref>), and whole-exome sequencing data (AbsSN-Seq algorithm; <xref ref-type="bibr" rid="B3">Bao et al., 2014</xref>). Lastly, <xref ref-type="bibr" rid="B48">Yoshihara et al. (2013)</xref> developed the ESTIMATE (Estimation of STromal and Immune cells in MAlignant Tumor tissues using Expression data) algorithm for TP estimation in microarray data, which is based on a scoring system using the proportion of stromal and immune cells in tumor samples.</p>
<p>In this study, our interest lies in RNA-seq data. Research involving the estimation of TP from RNA-sequencing (RNA-seq) data was presented with the eXtreme Gradient Boosting (XGBoost) ensemble learning algorithm (<xref ref-type="bibr" rid="B23">Li et al., 2019</xref>) and with the gene co-expression network-based TSNet model (<xref ref-type="bibr" rid="B29">Petralia et al., 2018</xref>).</p>
<p>Beyond the estimation of TP, <xref ref-type="bibr" rid="B2">Aran et al. (2015)</xref> analyzed RNA-seq data across 21 cancer types from The Cancer Genome Atlas (TCGA; <xref ref-type="bibr" rid="B4">Cancer Genome Atlas Research Network et al., 2013</xref>) using the TP in their analyses. They examined the association between TP and clinical variables and differences in TP across different subtypes of cancer. Evidence from their studies indicates that the TP confounds the association between gene expression and overall survival (OS) in the differential expression (DE) analysis. They conducted the DE analysis across 13 types of cancer, then compared it to a similar analysis with the inclusion of purity estimates as an additional covariate. Genes that were initially DE between tumor and normal samples before adding TP as a covariate turn out not to be DE, and a set of new genes were introduced as DE after adding TP into the analysis (<xref ref-type="bibr" rid="B2">Aran et al., 2015</xref>). In another recent study, <xref ref-type="bibr" rid="B35">Rhee et al. (2018)</xref> performed the gene cluster analysis using a partial correlation to identify the relationship between the gene expression and mutation abundance while adjusting for TP.</p>
<p>However, there are a limited number of studies that assess the effect of TP on other statistical methods (<xref ref-type="bibr" rid="B51">Zhang et al., 2017</xref>; <xref ref-type="bibr" rid="B29">Petralia et al., 2018</xref>) that are widely used for analyzing gene expression data, such as differential network (DN) analysis. In this article, we have two main objectives. These will contrast results from three different analyses: analyzing the complete dataset without TP information (Analysis 1); analyzing the dataset after dichotomizing TP and removing the low-purity samples (Analysis 2); and analyzing the complete dataset with TP included as a continuous covariate (Analysis 3).</p>
<p>In the first objective, we compare results of Analysis 1 to Analysis 2. In the second objective, we compare results between Analysis 1 and Analysis 3. In both objectives, we analyzed breast invasive carcinoma (BRCA), head and neck squamous cell carcinoma (HNSC), and lung squamous cell carcinoma (LUSC) datasets from the TCGA (<xref ref-type="bibr" rid="B4">Cancer Genome Atlas Research Network et al., 2013</xref>). <xref ref-type="fig" rid="F1">Figure 1</xref> summarizes the analysis plans and objectives of the study. The approach described in this paper provides a general strategy for assessing the effect of TP on gene expression data analyses.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>Flowchart of analysis plans and objectives of the study.</p></caption>
<graphic xlink:href="fgene-12-642759-g001.tif"/>
</fig>
</sec>
<sec id="S2" sec-type="materials|methods">
<title>Materials and Methods</title>
<sec id="S2.SS1">
<title>Clinical Data</title>
<p>An initial sample of 1,093 patients were obtained from the BRCA dataset. After exclusion of patients with incomplete data on age at diagnosis, OS, and TP, 1,029 patients remained eligible for the study. Similarly, 509 and 474 patients were used for analysis after excluding 11 and 27 patients from the HNSC and LUSC datasets, respectively. The primary endpoint was OS, calculated as the time from diagnosis to the time of death. Patients who were alive at the last follow-up were considered censored. The rate of censoring was 85.6, 58, and 57.8% for BRCA, HNSC, and LUSC.</p>
</sec>
<sec id="S2.SS2">
<title>RNA-Seq Data</title>
<p>The normalized RNA-seq data consisting of 20,155 genes from TCGA for the breast cancer samples were obtained from LinkedOmics (<xref ref-type="bibr" rid="B44">Vasaikar et al., 2017</xref>), a publicly available portal that contains multi-omics data and clinical data across 32 cancer types from TCGA. For all analyses, genes without an Entrez gene ID were removed. A total of 16,485 genes were mapped to its Entrez gene ID. It was further reduced to 6,963 genes which were also found in 7,618 unique genes from Reactome pathways (<xref ref-type="bibr" rid="B18">Jassal et al., 2020</xref>). The Reactome database is an open-source and peer-reviewed database of biological pathways. To filter out lowly expressed genes, genes with zero Reads Per Kilobase of transcript per Million reads mapped (RPKM) expression in more than 80% of 1,029 samples were removed, leaving 6,747 genes. Upon applying the same data processing scheme, 6,698 out of original 20,165 genes from 509 samples and 6,712 out of original 20,104 genes from 474 patients were available for the analysis of HNSC and LUSC, respectively. We considered genes that are within 649 pathways (<xref ref-type="supplementary-material" rid="FS1">Supplementary Files</xref> for complete list) that have more than 20 or less than 100 genes for the analysis of DN.</p>
</sec>
<sec id="S2.SS3">
<title>Statistical Methods</title>
<p>Our objective is to assess the effect of TP on DN analysis, which has not been studied previously, and on DE analysis by comparing Analysis 1 vs. Analysis 2 and Analysis 1 vs. Analysis 3. The methods to the analyses of DN and DE are described below. Study samples are dichotomized into high-survival (HS) and low-survival (LS) groups based on the median value of OS. All statistical analyses were performed in R version 4.0.2 (R Foundation for Statistical Computing, Vienna, Austria).</p>
<sec id="S2.SS3.SSS1">
<title>Differential Network Analysis</title>
<p>The DN analysis is a method for identifying changes among gene&#x2013;gene associations. These changes are indicative of dysfunctional regulation that is affecting the ability of genes to interact with one another (either through their mRNA or protein products; <xref ref-type="bibr" rid="B7">de la Fuente, 2010</xref>). Genes do not work alone; in other words, they interact with each other in complicated ways. However, the DE analysis assumes that the gene expression is independent of each other, which lacks in identifying the dynamics of physical and genetic networks directly (<xref ref-type="bibr" rid="B16">Ideker and Krogan, 2012</xref>; <xref ref-type="bibr" rid="B20">Kim et al., 2018</xref>). DN analysis is different from DE analysis in that it compares a weighted network from study samples with different clinical characteristics to identify a set of genes involved in a specific cancer-related pathway or to find a hub gene that regulates its neighbor genes. The HS and LS groups are compared to identify gene pathways that have differentially connected (DC) co-expression networks. The <italic>dnapath</italic> package (<xref ref-type="bibr" rid="B12">Grimes et al., 2019</xref>) was used to perform the DN analysis using 649 different Reactome pathways, using partial correlations to infer the individual gene networks. The <italic>p</italic>-value of the differential connectivity score is computed from a permutation test (20 random permutations).</p>
</sec>
<sec id="S2.SS3.SSS2">
<title>Differential Expression Analysis</title>
<p>The DE analysis was performed to identify the number of differentially expressed genes (DEGs) between HS and LS groups. The <italic>edgeR</italic> package (<xref ref-type="bibr" rid="B37">Robinson et al., 2010</xref>) was utilized to obtain the count matrix of gene counts. Subsequently, the gene-wise linear model is fitted to the data, followed by estimating contrasts of each gene using the <italic>limma</italic> package (<xref ref-type="bibr" rid="B36">Ritchie et al., 2015</xref>). Empirical Bayes smoothing was also applied to obtain the unadjusted gene-wise <italic>p</italic>-value. The Benjamini&#x2013;Hochberg correction was then applied to control the false discovery rate for multiple-hypothesis testing.</p>
</sec>
<sec id="S2.SS3.SSS3">
<title>Tumor Purity-Adjusted Analysis: Plans for Analysis 3</title>
<p>Tumor purity-adjusted analysis (Analysis 3) is compared to Analysis 1 to assess the confounding effect of TP on the association between gene expression and OS. We fit the simple linear regression model for each gene as a function of TP. The residual of each separate linear model is then utilized as TP-adjusted gene expression level for the TP-adjusted DN analysis. For the TP-adjusted DE analysis, TP is introduced as an additional covariate into the design matrix, as performed by <xref ref-type="bibr" rid="B2">Aran et al. (2015)</xref>.</p>
</sec>
</sec>
</sec>
<sec id="S3">
<title>Results</title>
<sec id="S3.SS1">
<title>Define High Tumor Purity and Survival Groups</title>
<p>In order to compare results of Analysis 1 to Analysis 2 in later sections, we firstly need to define a cutoff value for &#x201C;high&#x201D; purity. The median TP from each three datasets is about 0.7 when rounding to the nearest 10th. Specifically, median (Q1, Q3) TPs for BRCA, HNSC, and LUSC are 0.747 (0.656, 0.825), 0.688 (0.613, 0.767), and 0.684 (0.590, 0.789), respectively. Therefore, it makes sense to treat TP greater than or equal to 0.7 as high purity. For DN and DE analyses, survival groups are dichotomized based on the median OS. <xref ref-type="fig" rid="F2">Figure 2</xref> displays boxplots of TP for two survival groups for the three cancer datasets.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>Boxplots of tumor purity (TP) between high and low overall survival (OS) groups, dichotomized based on the median OS using breast invasive carcinoma (BRCA; <bold>left</bold>), head and neck squamous cell carcinoma (HNSC; <bold>center</bold>), and lung squamous cell carcinoma (LUSC; <bold>right</bold>); HS, high overall survival; LS, low overall survival.</p></caption>
<graphic xlink:href="fgene-12-642759-g002.tif"/>
</fig>
</sec>
<sec id="S3.SS2">
<title>Analysis Without Tumor Purity Adjustment: Analysis 1 vs. 2</title>
<sec id="S3.SS2.SSS1">
<title>Differential Network Analysis on Three Cancer Types</title>
<p>The DN analysis was performed on the following study samples: full BRCA containing 1,029 samples (509 and 474 samples for full HNSC and full LUSC), and on a high-purity subset, which contained 659 samples (240 and 225 samples for HNSC and LUSC) after removing those with TP less than 0.70. The top five significant pathways from the DN analysis on BRCA are shown in <xref ref-type="table" rid="T1">Tables 1</xref>, <xref ref-type="table" rid="T2">2</xref> for Analysis 1 and Analysis 2, respectively. The top 20 significant pathways for Analyses 1 and 2 on BRCA are presented as <xref ref-type="supplementary-material" rid="FS1">Supplementary Tables 1</xref>, <xref ref-type="supplementary-material" rid="FS1">2</xref>, respectively.</p>
<table-wrap position="float" id="T1">
<label>TABLE 1</label>
<caption><p>Five most significant pathways from DN analysis using BRCA without subsetting.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left"><bold>Pathway</bold></td>
<td valign="top" align="center"><bold>DC score</bold></td>
<td valign="top" align="center"><bold>No. of genes</bold></td>
<td valign="top" align="center"><bold>No. of DC genes</bold></td>
<td valign="top" align="center"><bold>Avg. expr. in low-risk</bold></td>
<td valign="top" align="center"><bold>Avg. expr. in high-risk</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Inflammasomes</td>
<td valign="top" align="center">0.077</td>
<td valign="top" align="center">23</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">7.83</td>
<td valign="top" align="center">7.82</td>
</tr>
<tr>
<td valign="top" align="left">MET activates PTK2 signaling</td>
<td valign="top" align="center">0.075</td>
<td valign="top" align="center">30</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">10.2</td>
<td valign="top" align="center">10.2</td>
</tr>
<tr>
<td valign="top" align="left">Intrinsic pathway of fibrin clot formation</td>
<td valign="top" align="center">0.072</td>
<td valign="top" align="center">22</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">5.03</td>
<td valign="top" align="center">5.03</td>
</tr>
<tr>
<td valign="top" align="left">PD-1 signaling</td>
<td valign="top" align="center">0.072</td>
<td valign="top" align="center">23</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">7.12</td>
<td valign="top" align="center">7.09</td>
</tr>
<tr>
<td valign="top" align="left">Antigen activates B cell receptor (BCR) leading to generation of second messengers</td>
<td valign="top" align="center">0.072</td>
<td valign="top" align="center">32</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">8.98</td>
<td valign="top" align="center">8.93</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<attrib><italic>Columns include Reactome pathway names, differentially connectivity (DC) score, number of genes in the pathway, number of significant DC genes, and average expression level of genes in the pathway.</italic></attrib>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="T2">
<label>TABLE 2</label>
<caption><p>Five most significant pathways from DN analysis using BRCA subsetting on samples with tumor purity (TP) above 70%.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left"><bold>Pathway</bold></td>
<td valign="top" align="center"><bold>DC score</bold></td>
<td valign="top" align="center"><bold>No. of genes</bold></td>
<td valign="top" align="center"><bold>No. of DC genes</bold></td>
<td valign="top" align="center"><bold>Avg. expr. in low-risk</bold></td>
<td valign="top" align="center"><bold>Avg. expr. in high-risk</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">G0 and early G1</td>
<td valign="top" align="center">0.086</td>
<td valign="top" align="center">27</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">8.97</td>
<td valign="top" align="center">8.89</td>
</tr>
<tr>
<td valign="top" align="left">Transcription of E2F targets under negative control by DREAM complex</td>
<td valign="top" align="center">0.085</td>
<td valign="top" align="center">19</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">9.31</td>
<td valign="top" align="center">9.24</td>
</tr>
<tr>
<td valign="top" align="left">Degradation of AXIN</td>
<td valign="top" align="center">0.081</td>
<td valign="top" align="center">55</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">10.3</td>
<td valign="top" align="center">10.3</td>
</tr>
<tr>
<td valign="top" align="left">SCF (Skp2)-mediated degradation of p27/p21</td>
<td valign="top" align="center">0.081</td>
<td valign="top" align="center">60</td>
<td valign="top" align="center">9</td>
<td valign="top" align="center">10.5</td>
<td valign="top" align="center">10.5</td>
</tr>
<tr>
<td valign="top" align="left">Cross-presentation of soluble exogenous antigens (endosomes)</td>
<td valign="top" align="center">0.08</td>
<td valign="top" align="center">50</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">9.79</td>
<td valign="top" align="center">9.8</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<attrib><italic>Columns include Reactome pathway names, DC score, number of genes in the pathway, number of significant DC genes, and average expression level of genes in the pathway.</italic></attrib>
</table-wrap-foot>
</table-wrap>
<p>Among the top five results from Analysis 1 on BRCA (<xref ref-type="table" rid="T1">Table 1</xref>), four are non-tumor-related pathways: &#x201C;Inflammasomes,&#x201D; &#x201C;PD-1 signaling,&#x201D; and &#x201C;antigen&#x201D; are related to immune cells and &#x201C;Fibrin clot formation&#x201D; pathways are related to blood, except the &#x201C;MET activates PTK2 signaling&#x201D; pathway, which is related to the cell cycle. On the other hand, the top pathways from Analysis 2 (<xref ref-type="table" rid="T2">Table 2</xref>) are cancer-progression-related pathways, including cell cycle and transcription factor.</p>
<p>&#x201C;Degradation of AXIN&#x201D; is identified as one of the DC pathways in both analyses; in particular, it was the top 11th and 3rd in the full dataset and in the high-purity subset, respectively. AXIN is a protein that is related to a cytoskeletal regulation and a molecular controller of cerebral cortical development (<xref ref-type="bibr" rid="B47">Ye et al., 2015</xref>).</p>
<p>Incidentally, the mean expression of the &#x201C;Degradation of AXIN&#x201D; pathway is the same in both Analyses 1 and 2 (10.3 vs. 10.3), which we would not expect since the full dataset will contain more immune cells. However, the signal in the DN is stronger in Analysis 2 (<xref ref-type="fig" rid="F3">Figure 3</xref>). Some of the edges (differential connections) are more prominent in Analysis 2 results. This suggests that the associations among genes in this pathway may be a result of dysregulation in the tumor cells rather than in the immune cells of the TME.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>Differential network (DN) analysis results for the degradation of AXIN pathway using BRCA. On the left is the DN estimated from the full dataset, and on the right shows the estimated DN from the high TP subsample. The edge width and opacity are scaled based on (1) the <italic>p</italic>-value of the differential connectivity score and (2) the relative magnitude of the change in association. Blue edges indicate stronger association in the LS group, and red edges are stronger in the HS group. No connected edge between genes means that there is no statistical evidence of a gene&#x2013;gene association. The edge color represents the relative mean gene expression for a specific grouping factor (HS and LS). The network will contain more disconnected components if the hub genes are no longer hubs, which potentially alter the network structure.</p></caption>
<graphic xlink:href="fgene-12-642759-g003.tif"/>
</fig>
<p>The &#x201C;G0 and Early G1&#x201D; pathway is significantly DC in Analysis 2, but not in Analysis 1. Upon inspection (<xref ref-type="fig" rid="F4">Figure 4</xref>), we find that the two estimated DN show a greater difference compared to the previous comparison in <xref ref-type="fig" rid="F3">Figure 3</xref>. This pathway is related to cell proliferation and may not be an active process within non-tumor cells in the TME. This would explain why the signal is weak in the full dataset. By subsetting on high-purity samples, the noise from the non-tumor cell in the TME is reduced.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p>Differential network analysis results for the G0 and early G1 pathway using BRCA. On the left is the DN estimated from the full dataset, and on the right shows the estimated DN from the high TP subsample.</p></caption>
<graphic xlink:href="fgene-12-642759-g004.tif"/>
</fig>
<p><xref ref-type="supplementary-material" rid="FS1">Supplementary Table 3</xref> summarizes the top 20 results of Analysis 1 on HNSC; of the top five pathways, three pathways are relevant to non-tumor cells in TME. Similar with BRCA, more cancer-progression-related pathways are found as top pathways in Analysis 2 on HNSC (<xref ref-type="supplementary-material" rid="FS1">Supplementary Table 4</xref>). However, based on the top five results of Analyses 1 and 2 on HNSC (<xref ref-type="supplementary-material" rid="FS1">Supplementary Tables 5</xref>, <xref ref-type="supplementary-material" rid="FS1">6</xref>, respectively), there are four cancer-related pathways in Analysis 1 and three in Analysis 2. In all cancer datasets, the network plots (<xref ref-type="supplementary-material" rid="FS1">Supplementary Figures 1</xref>&#x2013;<xref ref-type="supplementary-material" rid="FS1">3</xref>) for Analysis 2 shows greater amount of differential connectivity than Analysis 1.</p>
</sec>
<sec id="S3.SS2.SSS2">
<title>Differential Expression Analysis on Three Cancer Types</title>
<p>A total of 6,747 genes are analyzed to identify DEGs between two survival groups in BRCA. One hundred seventy-seven genes are selected as DEGs between HS and LS groups in Analysis 1 (<italic>n</italic> = 1,029). Of these, 84 genes are upregulated and 93 genes are downregulated. Among the top five DEGs in <xref ref-type="table" rid="T3">Table 3</xref>, the Fc fragment of IgG receptor IIIa (FCGR3A) is linked to rheumatoid arthritis (<xref ref-type="bibr" rid="B40">Shimizu et al., 2019</xref>) and is associated with HIV infection (<xref ref-type="bibr" rid="B30">Poonia et al., 2010</xref>). Ribosomal protein (RPL22) plays a critical role in regulating lymphoma development (<xref ref-type="bibr" rid="B33">Rao et al., 2012</xref>). On the other hand, there is no DEG found (at the 0.05 significance level) between two survival groups in Analysis 2 (<italic>n</italic> = 659; <xref ref-type="table" rid="T4">Table 4</xref>).</p>
<table-wrap position="float" id="T3">
<label>TABLE 3</label>
<caption><p>Five most significant differentially expressed genes (DEGs) from differential expression (DE) analysis using BRCA without subsetting.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left"><bold>Gene</bold></td>
<td valign="top" align="center"><bold>logFC</bold></td>
<td valign="top" align="center"><bold>Avg. expr.</bold></td>
<td valign="top" align="center"><bold>BH adj. <italic>p</italic>-value</bold></td>
</tr>
<tr>
<td valign="top" align="left">FCGR3A</td>
<td valign="top" align="center">0.33</td>
<td valign="top" align="center">10.5</td>
<td valign="top" align="center">0.006</td>
</tr>
<tr>
<td valign="top" align="left">RPL22</td>
<td valign="top" align="center">&#x2212;0.16</td>
<td valign="top" align="center">12.7</td>
<td valign="top" align="center">0.006</td>
</tr>
<tr>
<td valign="top" align="left">SLCO2B1</td>
<td valign="top" align="center">0.30</td>
<td valign="top" align="center">9.6</td>
<td valign="top" align="center">0.006</td>
</tr>
<tr>
<td valign="top" align="left">RPS25</td>
<td valign="top" align="center">&#x2212;0.19</td>
<td valign="top" align="center">12.7</td>
<td valign="top" align="center">0.006</td>
</tr>
<tr>
<td valign="top" align="left">SMPD1</td>
<td valign="top" align="center">0.18</td>
<td valign="top" align="center">9.5</td>
<td valign="top" align="center">0.006</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"></td>
<td/>
<td/>
<td/>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap position="float" id="T4">
<label>TABLE 4</label>
<caption><p>Results from DE analysis using BRCA subset on samples with TP above 70%.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left"><bold>Gene</bold></td>
<td valign="top" align="center"><bold>logFC</bold></td>
<td valign="top" align="center"><bold>Avg. expr.</bold></td>
<td valign="top" align="center"><bold>BH adj. <italic>p</italic>-value</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">STAB1</td>
<td valign="top" align="center">0.29</td>
<td valign="top" align="center">9.4</td>
<td valign="top" align="center">0.062</td>
</tr>
<tr>
<td valign="top" align="left">SLCO2B1</td>
<td valign="top" align="center">0.31</td>
<td valign="top" align="center">9.1</td>
<td valign="top" align="center">0.102</td>
</tr>
<tr>
<td valign="top" align="left">RPS24</td>
<td valign="top" align="center">&#x2212;0.24</td>
<td valign="top" align="center">14.1</td>
<td valign="top" align="center">0.102</td>
</tr>
<tr>
<td valign="top" align="left">RPL15</td>
<td valign="top" align="center">&#x2212;0.16</td>
<td valign="top" align="center">14.0</td>
<td valign="top" align="center">0.111</td>
</tr>
<tr>
<td valign="top" align="left">HMGB1</td>
<td valign="top" align="center">&#x2212;0.16</td>
<td valign="top" align="center">12.4</td>
<td valign="top" align="center">0.111</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>For HNSC, 755 out of 6,698 genes are DE between HS and LS groups in Analysis 1 (<italic>n</italic> = 509) whereas nine genes are DE in Analysis 2 (<italic>n</italic> = 240). The top five DEGs are summarized in <xref ref-type="supplementary-material" rid="FS1">Supplementary Tables 7</xref>, <xref ref-type="supplementary-material" rid="FS1">8</xref> for Analyses 1 and 2, respectively. For LUSC, there are 3 out of 6,712 genes identified as DEGs in Analysis 1 (<italic>n</italic> = 474), but no DEG is found in Analysis 2 (<italic>n</italic> = 225). <xref ref-type="supplementary-material" rid="FS1">Supplementary Tables 9</xref>, <xref ref-type="supplementary-material" rid="FS1">10</xref> list the top five DEGs from Analyses 1 and 2, respectively. Similar with BRCA, cancer-related genes are found among DEGs in HNSC and LUSC, which are shown in <xref ref-type="supplementary-material" rid="FS1">Supplementary Summary</xref>.</p>
</sec>
</sec>
<sec id="S3.SS3">
<title>Analysis With Tumor Purity Adjustment: Analysis 1 vs. 3</title>
<sec id="S3.SS3.SSS1">
<title>Tumor Purity-Adjusted Differential Network Analysis on Three Cancer Types</title>
<p>We further investigated the effect of TP by modeling it as a covariate. Previous studies have suggested that TP has a confounding effect on gene expression and conducted their analyses with TP adjustment (<xref ref-type="bibr" rid="B2">Aran et al., 2015</xref>; <xref ref-type="bibr" rid="B35">Rhee et al., 2018</xref>). Here, we perform TP-adjusted analyses of DN and DE (Analysis 3), and compare results with Analysis 1 in earlier sections.</p>
<p>The top five pathways from Analysis 3 on BRCA (<xref ref-type="table" rid="T5">Table 5</xref>) resulted in a similar list of significant pathways compared to the ones from Analysis 1 (<xref ref-type="table" rid="T1">Table 1</xref>). The top 20 results are summarized in <xref ref-type="supplementary-material" rid="FS1">Supplementary Table 11</xref>; of these, &#x201C;Listeria monocytogenes&#x201D; is a pathogenic bacterium that has been studied for its use as cancer vaccines (<xref ref-type="bibr" rid="B8">Flickinger et al., 2018</xref>). ODC is an enzyme, whose overexpression is associated with the poorer OS in endometrial cancer (<xref ref-type="bibr" rid="B19">Kim et al., 2017</xref>). These two pathways are found as top 7th and 8th in Analysis 1 as well. Upon inspection of the first two significant pathways (<xref ref-type="fig" rid="F5">Figures 5</xref>, <xref ref-type="fig" rid="F6">6</xref>), both analyses have similar network structures. However, some changes in differential connectivity are observed when adjusting for TP. For example, two edges that were not detected in Analysis 1 but appear in Analysis 3 include COL27A1-PTK2 and NFKB1-TXNIP in <xref ref-type="fig" rid="F5">Figures 5</xref>, <xref ref-type="fig" rid="F6">6</xref>, respectively. PTK2 is linked to worse OS in ovarian and invasive breast cancer (<xref ref-type="bibr" rid="B42">Sulzmaier et al., 2014</xref>). Low expression in TXNIP is observed in different types of cancers including breast and stomach cancers (<xref ref-type="bibr" rid="B27">Nagaraj et al., 2018</xref>). These cancer-related DC genes may be useful for therapeutic development for cancer treatment, but should be carefully interpreted as these findings are estimates, not representing the true gene&#x2013;gene association.</p>
<table-wrap position="float" id="T5">
<label>TABLE 5</label>
<caption><p>Five most significant pathways from TP-adjusted DN analysis on BRCA.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left"><bold>Pathway</bold></td>
<td valign="top" align="center"><bold>DC score</bold></td>
<td valign="top" align="center"><bold>No. of genes</bold></td>
<td valign="top" align="center"><bold>No. of DC genes</bold></td>
<td valign="top" align="center"><bold>Avg. expr. in low-risk</bold></td>
<td valign="top" align="center"><bold>Avg. expr. in high-risk</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">MET activates PTK2 signaling</td>
<td valign="top" align="center">0.076</td>
<td valign="top" align="center">30</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">&#x2212;0.0311</td>
<td valign="top" align="center">0.0312</td>
</tr>
<tr>
<td valign="top" align="left">Inflammasomes</td>
<td valign="top" align="center">0.073</td>
<td valign="top" align="center">23</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">&#x2212;0.00432</td>
<td valign="top" align="center">0.00434</td>
</tr>
<tr>
<td valign="top" align="left">PD-1 signaling</td>
<td valign="top" align="center">0.072</td>
<td valign="top" align="center">23</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">&#x2212;0.00249</td>
<td valign="top" align="center">0.0025</td>
</tr>
<tr>
<td valign="top" align="left">Listeria monocytogenes entry into host cells</td>
<td valign="top" align="center">0.072</td>
<td valign="top" align="center">21</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0.0235</td>
<td valign="top" align="center">&#x2212;0.0237</td>
</tr>
<tr>
<td valign="top" align="left">Regulation of ornithine decarboxylase (ODC)</td>
<td valign="top" align="center">0.071</td>
<td valign="top" align="center">51</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">&#x2212;0.00403</td>
<td valign="top" align="center">0.00405</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<attrib><italic>Columns include Reactome pathway names, DC score, number of genes in the pathway, number of significant DC genes, and average expression level of genes in the pathway.</italic></attrib>
</table-wrap-foot>
</table-wrap>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption><p>MET activates the PTK2 signaling pathway from DN analysis results using BRCA. On the left is the DN estimated from the full dataset not adjusted by TP, and on the right shows the estimated DN from the full dataset adjusted by TP.</p></caption>
<graphic xlink:href="fgene-12-642759-g005.tif"/>
</fig>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption><p>Inflammasome pathway from DN analysis results using BRCA. On the left is the DN estimated from the full dataset not adjusted by TP, and on the right shows the estimated DN from the full dataset adjusted by TP.</p></caption>
<graphic xlink:href="fgene-12-642759-g006.tif"/>
</fig>
<p><xref ref-type="supplementary-material" rid="FS1">Supplementary Tables 12</xref>, <xref ref-type="supplementary-material" rid="FS1">13</xref> display the top 20 results of Analysis 3 on HNSC and LUSC, respectively. As shown in BRCA, Analysis 3 resulted in a similar list of pathways with Analysis 1. Upon the inspection of <xref ref-type="supplementary-material" rid="FS1">Supplementary Figures 5</xref>&#x2013;<xref ref-type="supplementary-material" rid="FS1">7</xref>, the networks in Analyses 1 and 3 maintain a homogeneous structure with some minor differences, which we also observed in BRCA. <xref ref-type="supplementary-material" rid="FS1">Supplementary Summary</xref> further discusses findings about new edges detected on HNSC and LUSC.</p>
</sec>
<sec id="S3.SS3.SSS2">
<title>Tumor Purity-Adjusted Differential Gene Expression Analysis on Three Cancer Types</title>
<p><xref ref-type="table" rid="T6">Table 6</xref> summarizes the top five DEGs found from Analysis 3 on BRCA. Two hundred forty-three out of 6,747 genes are DE between HS and LS groups. Among 243 DEGs, 125 genes are upregulated and 118 genes are downregulated. One hundred seventy-seven DEGs from Analysis 1 on BRCA are overlapped with these 243 DEGs found in Analysis 3. In addition, 66 DEGs are introduced in Analysis 3. Of 66 new DEGs, cytohesin 4 (CYTH4) is linked to bipolar disorder (<xref ref-type="bibr" rid="B34">Rezazadeh et al., 2015</xref>). Neutrophil cytosolic factor 4 (NCF4) is associated with the risk of colorectal cancer (<xref ref-type="bibr" rid="B38">Ryan et al., 2014</xref>). Triggering receptor expressed on myeloid cells 2 (TREM2) is related to Alzheimer&#x2019;s disease development (<xref ref-type="bibr" rid="B11">Gratuze et al., 2018</xref>). Cyclin T2 (CCNT2) and acyl-CoA synthetase long-chain family member 5 (ACSL5) are involved with breast cancer (<xref ref-type="bibr" rid="B41">Stelzer et al., 2016</xref>). These findings about additional genes from Analysis 3 will facilitate research in understanding underlying mechanism of breast cancer.</p>
<table-wrap position="float" id="T6">
<label>TABLE 6</label>
<caption><p>Five most significant DEGs from TP-adjusted DE analysis using BRCA.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left"><bold>Gene</bold></td>
<td valign="top" align="center"><bold>logFC</bold></td>
<td valign="top" align="center"><bold>Avg. expr.</bold></td>
<td valign="top" align="center"><bold>BH adj. <italic>p</italic>-value</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">SLCO2B1</td>
<td valign="top" align="center">0.33</td>
<td valign="top" align="center">9.6</td>
<td valign="top" align="center">1.04e-06</td>
</tr>
<tr>
<td valign="top" align="left">FCGR3A</td>
<td valign="top" align="center">0.35</td>
<td valign="top" align="center">10.5</td>
<td valign="top" align="center">3.57e-05</td>
</tr>
<tr>
<td valign="top" align="left">C3AR1</td>
<td valign="top" align="center">0.27</td>
<td valign="top" align="center">8.3</td>
<td valign="top" align="center">3.57e-05</td>
</tr>
<tr>
<td valign="top" align="left">STAB1</td>
<td valign="top" align="center">0.28</td>
<td valign="top" align="center">9.7</td>
<td valign="top" align="center">3.57e-05</td>
</tr>
<tr>
<td valign="top" align="left">C1QC</td>
<td valign="top" align="center">0.29</td>
<td valign="top" align="center">10.6</td>
<td valign="top" align="center">3.58e-05</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>With HNSC, 615 out of 6,698 genes are found DE between HS and LS groups in Analysis 3. Six hundred two out of 615 DEGs overlap with DEGs from Analysis 1, and the remainder of 13 DEGs are detected in Analysis 3 only. The top five DEGs are summarized in <xref ref-type="supplementary-material" rid="FS1">Supplementary Table 14</xref>. For LUSC, 8 out of 6,712 genes are identified DE in Analysis 3; of these eight DEGs, five are found additionally and three overlap with DEGs from Analysis 1. <xref ref-type="supplementary-material" rid="FS1">Supplementary Table 15</xref> displays the top five DEGs from Analysis 3. A cancer-related gene such as PLK3 is found DE. More details about HNSC and LUSC are discussed in <xref ref-type="supplementary-material" rid="FS1">Supplementary Summary</xref>. We have also included a complete list of genes and pathways that are identified from DE and DN analyses as <xref ref-type="supplementary-material" rid="FS1">Supplementary Files</xref> for each cancer types.</p>
</sec>
</sec>
</sec>
<sec id="S4">
<title>Discussion</title>
<p>In this study, we assessed the effect of TP on DN and DE analyses by analyzing three RNA-seq datasets from TCGA. In both cases, qualitatively different results were obtained when filtering samples based on the TP or by including TP as a covariate.</p>
<p>For DN analysis, pathways related to immune and blood cells in TME were found in Analysis 1, while more cancer-related pathways were obtained in Analysis 2 except for LUSC. The same was not true for Analysis 3, which identified the same list of pathways as Analysis 1 in all three cancer datasets. This suggests that using TP as a covariate may not be sufficient for controlling its confounding effects on the association between gene expression and OS. Analysis 2 does not rely on any model assumptions, so it is more robust and may be able to identify the effect of TP. However, one limitation of Analysis 2 is that the decrease in sample size after removing low TP samples may influence the differences in results found when compared to Analysis 1.</p>
<p>For DE analysis, Analysis 1 revealed DEGs between HS and LS groups, while no or a few DEGs were identified in Analysis 2 in BRCA and HNSC. In LUSC, no or a few DEGs were found in either Analysis 1 or 2. When comparing Analysis 1 with Analysis 3, we observed similar results as in previous studies: adding TP as a covariate causes some DEGs to be removed while others are added. The linear model for the effect of TP on gene expression is reasonable for DE analysis, because we expect the aggregate gene expression of tumor-related genes to increase linearly as the ratio of tumor cells increases. Hence, Analysis 3 would have more power to detect the effect of TP on gene expression compared to the more robust approach of Analysis 2. By removing low TP samples, Analysis 2 is unable to utilize the full information provided by TP. However, results for DN analysis suggest that the linear model for TP is not the best choice in general. When comparing DEGs identified in our study to Aran et al., two genes are found in both studies using BRCA: TCF7 and MSR1. Sixteen DEGs are identified in both studies using HNSC: KCNA3, ABCD2, AQP1, FOXP1, C2orf49, PIK3CG, KDR, INPP5D, NFATC2, TNFAIP8L1, AVPR1A, MYO9B, F5, ARHGEF6, FBLN5, and ABCA6. However, there was no DEG overlapped with their studies using HNSC. This may be due to a different data processing scheme applied in each study.</p>
<p>We anticipate that our findings will lead to the improvement in understanding how to incorporate the TP when using two statistical methods: DN and DE analyses.</p>
<p>Future research could extend the current findings to examine how the TP-adjusted analysis affects the sensitivity and specificity compared to the unadjusted analysis. For example, we obtained more DEGs in BRCA and LUSC, but fewer DEGs in HNSC from the TP-adjusted DE analysis. In this paper, we did not include a simulation experiment on DN and DE analyses. It requires complex sampling methodology, which is beyond the scope of this paper. A possible simulation scenario is to set different model assumptions for gene expressions. For example, we consider a linear combination of gene expression level that is weighted by TP, and we also consider the null case when the gene expression level is independent from TP in which the linear combination assumption is not applied. DN and DE analyses can be performed using these simulated samples. Future studies are warranted focusing more on the effect of TP in a simulation-based study to validate our findings.</p>
</sec>
<sec id="S5">
<title>Data Availability Statement</title>
<p>The datasets analyzed for this study can be available at <ext-link ext-link-type="uri" xlink:href="http://www.linkedomics.org">www.linkedomics.org</ext-link>, further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="S6">
<title>Author Contributions</title>
<p>SA, TG, and SD designed the study. SA and TG involved with the data processing and statistical analyses of the study. SA drafted the manuscript. TG and SD provided suggestions when writing the manuscript. All the authors have reviewed and edited the manuscript, contributed to the article and approved the submitted version.</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="pudiscl1">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<ack>
<p>We would like to thank the editor for inviting us to submit a revision of our work. We thank the two reviewers for their valuable comments and suggestions on the previous submission which led to a better manuscript. We also thank other members of our research team (S. Guha, T. Kang, A. Sachdeva, and S. Anyaso) for useful discussions during this research.</p>
</ack>
<sec id="S8" sec-type="supplementary material">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fgene.2021.642759/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fgene.2021.642759/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.CSV" id="TS3" mimetype="text/csv" xmlns:xlink="http://www.w3.org/1999/xlink"></supplementary-material>
<supplementary-material xlink:href="Data_Sheet_2.xlsx" id="TS1" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"></supplementary-material>
<supplementary-material xlink:href="Data_Sheet_3.xlsx" id="TS2" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"></supplementary-material>
<supplementary-material xlink:href="Data_Sheet_4.docx" id="FS1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"></supplementary-material>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Afratis</surname> <given-names>N. A.</given-names></name> <name><surname>Nikitovic</surname> <given-names>D.</given-names></name> <name><surname>Multhaupt</surname> <given-names>H. A.</given-names></name> <name><surname>Theocharis</surname> <given-names>A. D.</given-names></name> <name><surname>Couchman</surname> <given-names>J. R.</given-names></name> <name><surname>Karamanos</surname> <given-names>N. K.</given-names></name></person-group> (<year>2017</year>). <article-title>Syndecans - key regulators of cell signaling and biological functions.</article-title> <source><italic>FEBS J.</italic></source> <volume>284</volume> <fpage>27</fpage>&#x2013;<lpage>41</lpage>. <pub-id pub-id-type="doi">10.1111/febs.13940</pub-id> <pub-id pub-id-type="pmid">27790852</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Aran</surname> <given-names>D.</given-names></name> <name><surname>Sirota</surname> <given-names>M.</given-names></name> <name><surname>Butte</surname> <given-names>A. J.</given-names></name></person-group> (<year>2015</year>). <article-title>Systematic pan-cancer analysis of tumour purity.</article-title> <source><italic>Nat. Commun</italic>.</source> <volume>6</volume>:<fpage>8971</fpage>.</citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bao</surname> <given-names>L.</given-names></name> <name><surname>Pu</surname> <given-names>M.</given-names></name> <name><surname>Messer</surname> <given-names>K.</given-names></name></person-group> (<year>2014</year>). <article-title>AbsCN-seq: a statistical method to estimate tumor purity, ploidy and absolute copy numbers from next-generation sequencing data.</article-title> <source><italic>Bioinformatics</italic></source> <volume>30</volume> <fpage>1056</fpage>&#x2013;<lpage>1063</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btt759</pub-id> <pub-id pub-id-type="pmid">24389661</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><collab>Cancer Genome Atlas Research Network,</collab> <person-group person-group-type="author"><name><surname>Weinstein</surname> <given-names>J. N.</given-names></name> <name><surname>Collisson</surname> <given-names>E. A.</given-names></name> <name><surname>Mills</surname> <given-names>G. B.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>The cancer genome atlas pan-cancer analysis project.</article-title> <source><italic>Nat. Genet.</italic></source> <volume>45</volume> <fpage>1113</fpage>&#x2013;<lpage>1120</lpage>.</citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cao</surname> <given-names>Q.</given-names></name> <name><surname>Zhang</surname> <given-names>J.</given-names></name> <name><surname>Zhang</surname> <given-names>T.</given-names></name></person-group> (<year>2018</year>). <article-title>AIMP2-DX2 promotes the proliferation, migration, and invasion of nasopharyngeal carcinoma cells.</article-title> <source><italic>Biomed. Res. Int.</italic></source> <volume>2018</volume>:<fpage>9253036</fpage>. <pub-id pub-id-type="doi">10.1155/2018/9253036</pub-id> <pub-id pub-id-type="pmid">29854811</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carter</surname> <given-names>S. L.</given-names></name> <name><surname>Cibulskis</surname> <given-names>K.</given-names></name> <name><surname>Helman</surname> <given-names>E.</given-names></name> <name><surname>McKenna</surname> <given-names>A.</given-names></name> <name><surname>Shen</surname> <given-names>H.</given-names></name> <name><surname>Zack</surname> <given-names>T.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Absolute quantification of somatic DNA alterations in human cancer.</article-title> <source><italic>Nat. Biotechnol.</italic></source> <volume>30</volume> <fpage>413</fpage>&#x2013;<lpage>421</lpage>. <pub-id pub-id-type="doi">10.1038/nbt.2203</pub-id> <pub-id pub-id-type="pmid">22544022</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de la Fuente</surname> <given-names>A.</given-names></name></person-group> (<year>2010</year>). <article-title>From &#x2018;differential expression&#x2019; to &#x2018;differential networking&#x2019; &#x2013; Identification of dysfunctional regulatory networks in diseases.</article-title> <source><italic>Trends Genet.</italic></source> <volume>26</volume> <fpage>326</fpage>&#x2013;<lpage>333</lpage>.</citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Flickinger</surname> <given-names>J. C.</given-names> <suffix>Jr.</suffix></name> <name><surname>Rodeck</surname> <given-names>U.</given-names></name> <name><surname>Snook</surname> <given-names>A. E.</given-names></name></person-group> (<year>2018</year>). <article-title><italic>Listeria monocytogenes</italic> as a vector for cancer immunotherapy: current understanding and progress.</article-title> <source><italic>Vaccines</italic></source> <volume>6</volume>:<fpage>48</fpage>. <pub-id pub-id-type="doi">10.3390/vaccines6030048</pub-id> <pub-id pub-id-type="pmid">30044426</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gabriel</surname> <given-names>L. A.</given-names></name> <name><surname>Wang</surname> <given-names>L. W.</given-names></name> <name><surname>Bader</surname> <given-names>H.</given-names></name> <name><surname>Ho</surname> <given-names>J. C.</given-names></name> <name><surname>Majors</surname> <given-names>A. K.</given-names></name> <name><surname>Hollyfield</surname> <given-names>J. G.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>ADAMTSL4, a secreted glycoprotein widely distributed in the eye, binds fibrillin-1 microfibrils and accelerates microfibril biogenesis.</article-title> <source><italic>Invest. Ophthalmol. Vis. Sci.</italic></source> <volume>53</volume> <fpage>461</fpage>&#x2013;<lpage>469</lpage>. <pub-id pub-id-type="doi">10.1167/iovs.10-5955</pub-id> <pub-id pub-id-type="pmid">21989719</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gong</surname> <given-names>Z.</given-names></name> <name><surname>Zhang</surname> <given-names>J.</given-names></name> <name><surname>Guo</surname> <given-names>W.</given-names></name></person-group> (<year>2020</year>). <article-title>Tumor purity as a prognosis and immunotherapy relevant feature in gastric cancer.</article-title> <source><italic>Cancer Med</italic>.</source> <volume>9</volume> <fpage>9052</fpage>&#x2013;<lpage>9063</lpage>. <pub-id pub-id-type="doi">10.1002/cam4.3505</pub-id> <pub-id pub-id-type="pmid">33030278</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gratuze</surname> <given-names>M.</given-names></name> <name><surname>Leyns</surname> <given-names>C. E. G.</given-names></name> <name><surname>Holtzman</surname> <given-names>D. M.</given-names></name></person-group> (<year>2018</year>). <article-title>New insights into the role of TREM2 in Alzheimer&#x2019;s disease.</article-title> <source><italic>Mol. Neurodegener.</italic></source> <volume>13</volume>:<fpage>66</fpage>. <pub-id pub-id-type="doi">10.1186/s13024-018-0298-9</pub-id> <pub-id pub-id-type="pmid">30572908</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Grimes</surname> <given-names>T.</given-names></name> <name><surname>Potter</surname> <given-names>S. S.</given-names></name> <name><surname>Datta</surname> <given-names>S.</given-names></name></person-group> (<year>2019</year>). <article-title>Integrating gene regulatory pathways into differential network analysis of gene expression data.</article-title> <source><italic>Sci. Rep.</italic></source> <volume>9</volume>:<fpage>5479</fpage>.</citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Haider</surname> <given-names>S.</given-names></name> <name><surname>Tyekucheva</surname> <given-names>S.</given-names></name> <name><surname>Prandi</surname> <given-names>D.</given-names></name> <name><surname>Fox</surname> <given-names>N. S.</given-names></name> <name><surname>Ahn</surname> <given-names>J.</given-names></name> <name><surname>Xu</surname> <given-names>A. W.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Systematic assessment of tumor purity and its clinical implications.</article-title> <source><italic>JCO Precis. Oncol.</italic></source> <volume>4</volume> <fpage>995</fpage>&#x2013;<lpage>1005</lpage>. <pub-id pub-id-type="doi">10.1200/PO.20.00016</pub-id> <pub-id pub-id-type="pmid">33015524</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Helmke</surname> <given-names>C.</given-names></name> <name><surname>Becker</surname> <given-names>S.</given-names></name> <name><surname>Strebhardt</surname> <given-names>K.</given-names></name></person-group> (<year>2016</year>). <article-title>The role of Plk3 in oncogenesis.</article-title> <source><italic>Oncogene</italic></source> <volume>35</volume> <fpage>135</fpage>&#x2013;<lpage>147</lpage>. <pub-id pub-id-type="doi">10.1038/onc.2015.105</pub-id> <pub-id pub-id-type="pmid">25915845</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Holroyd</surname> <given-names>A. K.</given-names></name> <name><surname>Michie</surname> <given-names>A. M.</given-names></name></person-group> (<year>2018</year>). <article-title>The role of mTOR-mediated signaling in the regulation of cellular migration.</article-title> <source><italic>Immunol. Lett.</italic></source> <volume>196</volume> <fpage>74</fpage>&#x2013;<lpage>79</lpage>. <pub-id pub-id-type="doi">10.1016/j.imlet.2018.01.015</pub-id> <pub-id pub-id-type="pmid">29408410</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ideker</surname> <given-names>T.</given-names></name> <name><surname>Krogan</surname> <given-names>N. J.</given-names></name></person-group> (<year>2012</year>). <article-title>Differential network biology.</article-title> <source><italic>Mol. Syst. Biol</italic>.</source> <volume>8</volume>:<fpage>565</fpage>. <pub-id pub-id-type="doi">10.1038/msb.2011.99</pub-id> <pub-id pub-id-type="pmid">22252388</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Iwaya</surname> <given-names>T.</given-names></name> <name><surname>Fukagawa</surname> <given-names>T.</given-names></name> <name><surname>Suzuki</surname> <given-names>Y.</given-names></name> <name><surname>Takahashi</surname> <given-names>Y.</given-names></name> <name><surname>Sawada</surname> <given-names>G.</given-names></name> <name><surname>Ishibashi</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Contrasting expression patterns of histone mRNA and microRNA 760 in patients with gastric cancer.</article-title> <source><italic>Clin. Cancer Res.</italic></source> <volume>19</volume> <fpage>6438</fpage>&#x2013;<lpage>6449</lpage>. <pub-id pub-id-type="doi">10.1158/1078-0432.CCR-12-3186</pub-id> <pub-id pub-id-type="pmid">24097871</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jassal</surname> <given-names>B.</given-names></name> <name><surname>Matthews</surname> <given-names>L.</given-names></name> <name><surname>Viteri</surname> <given-names>G.</given-names></name> <name><surname>Gong</surname> <given-names>C.</given-names></name> <name><surname>Lorente</surname> <given-names>P.</given-names></name> <name><surname>Fabregat</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>The reactome pathway knowledgebase.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>48</volume> <fpage>D498</fpage>&#x2013;<lpage>D503</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkz1031</pub-id> <pub-id pub-id-type="pmid">31691815</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>H. I.</given-names></name> <name><surname>Schultz</surname> <given-names>C. R.</given-names></name> <name><surname>Buras</surname> <given-names>A. L.</given-names></name> <name><surname>Friedman</surname> <given-names>E.</given-names></name> <name><surname>Fedorko</surname> <given-names>A.</given-names></name> <name><surname>Seamon</surname> <given-names>L.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>Ornithine decarboxylase as a therapeutic target for endometrial cancer.</article-title> <source><italic>PLoS One</italic></source> <volume>12</volume>:<fpage>e0189044</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0189044</pub-id> <pub-id pub-id-type="pmid">29240775</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>Y.</given-names></name> <name><surname>Hao</surname> <given-names>J.</given-names></name> <name><surname>Gautam</surname> <given-names>Y.</given-names></name> <name><surname>Mersha</surname> <given-names>T. B.</given-names></name> <name><surname>Kang</surname> <given-names>M.</given-names></name></person-group> (<year>2018</year>). <article-title>DiffGRN: differential gene regulatory network analysis.</article-title> <source><italic>Int. J. Data Min. Bioinform</italic>.</source> <volume>20</volume> <fpage>362</fpage>&#x2013;<lpage>379</lpage>. <pub-id pub-id-type="doi">10.1504/IJDMB.2018.094891</pub-id> <pub-id pub-id-type="pmid">31114627</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Laugesen</surname> <given-names>A.</given-names></name> <name><surname>H&#x00F8;jfeldt</surname> <given-names>J. W.</given-names></name> <name><surname>Helin</surname> <given-names>K.</given-names></name></person-group> (<year>2016</year>). <article-title>Role of the polycomb repressive complex 2 (PRC2) in transcriptional regulation and cancer.</article-title> <source><italic>Cold Spring Harb. Perspect. Med.</italic></source> <volume>6</volume>:<fpage>a026575</fpage>. <pub-id pub-id-type="doi">10.1101/cshperspect.a026575</pub-id> <pub-id pub-id-type="pmid">27449971</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>Y.</given-names></name> <name><surname>Guo</surname> <given-names>X. B.</given-names></name> <name><surname>Wang</surname> <given-names>J. S.</given-names></name> <name><surname>Wang</surname> <given-names>H. C.</given-names></name> <name><surname>Li</surname> <given-names>L. P.</given-names></name></person-group> (<year>2020</year>). <article-title>Function of fibroblast growth factor 2 in gastric cancer occurrence and prognosis.</article-title> <source><italic>Mol. Med. Rep.</italic></source> <volume>21</volume> <fpage>575</fpage>&#x2013;<lpage>582</lpage>. <pub-id pub-id-type="doi">10.3892/mmr.2019.10850</pub-id> <pub-id pub-id-type="pmid">31789423</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>Y.</given-names></name> <name><surname>Umbach</surname> <given-names>D. M.</given-names></name> <name><surname>Bingham</surname> <given-names>A.</given-names></name> <name><surname>Li</surname> <given-names>Q.-J.</given-names></name> <name><surname>Zhuang</surname> <given-names>Y.</given-names></name> <name><surname>Li</surname> <given-names>L.</given-names></name></person-group> (<year>2019</year>). <article-title>Putative biomarkers for predicting tumor sample purity based on gene expression data.</article-title> <source><italic>BMC Genomics</italic></source> <volume>20</volume>:<fpage>1021</fpage>. <pub-id pub-id-type="doi">10.1186/s12864-019-6412-8</pub-id> <pub-id pub-id-type="pmid">31881847</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li&#x00E8;vre</surname> <given-names>A.</given-names></name> <name><surname>Blons</surname> <given-names>H.</given-names></name> <name><surname>Houllier</surname> <given-names>A. M.</given-names></name> <name><surname>Laccourreye</surname> <given-names>O.</given-names></name> <name><surname>Brasnu</surname> <given-names>D.</given-names></name> <name><surname>Beaune</surname> <given-names>P.</given-names></name><etal/></person-group> (<year>2006</year>). <article-title>Clinicopathological significance of mitochondrial D-Loop mutations in head and neck carcinoma.</article-title> <source><italic>Br. J. Cancer</italic></source> <volume>94</volume> <fpage>692</fpage>&#x2013;<lpage>697</lpage>. <pub-id pub-id-type="doi">10.1038/sj.bjc.6602993</pub-id> <pub-id pub-id-type="pmid">16495928</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mao</surname> <given-names>Y.</given-names></name> <name><surname>Feng</surname> <given-names>Q.</given-names></name> <name><surname>Zheng</surname> <given-names>P.</given-names></name> <name><surname>Yang</surname> <given-names>L.</given-names></name> <name><surname>Liu</surname> <given-names>T.</given-names></name> <name><surname>Xu</surname> <given-names>Y.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Low tumor purity is associated with poor prognosis, heavy mutation burden, and intense immune phenotype in colon cancer.</article-title> <source><italic>Cancer Manag. Res</italic>.</source> <volume>10</volume> <fpage>3569</fpage>&#x2013;<lpage>3577</lpage>. <pub-id pub-id-type="doi">10.2147/CMAR.S171855</pub-id> <pub-id pub-id-type="pmid">30271205</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Morrow</surname> <given-names>J. K.</given-names></name> <name><surname>Lin</surname> <given-names>H. K.</given-names></name> <name><surname>Sun</surname> <given-names>S. C.</given-names></name> <name><surname>Zhang</surname> <given-names>S.</given-names></name></person-group> (<year>2015</year>). <article-title>Targeting ubiquitination for cancer therapies.</article-title> <source><italic>Future Med. Chem.</italic></source> <volume>7</volume> <fpage>2333</fpage>&#x2013;<lpage>2350</lpage>. <pub-id pub-id-type="doi">10.4155/fmc.15.148</pub-id> <pub-id pub-id-type="pmid">26630263</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nagaraj</surname> <given-names>K.</given-names></name> <name><surname>Lapkina-Gendler</surname> <given-names>L.</given-names></name> <name><surname>Sarfstein</surname> <given-names>R.</given-names></name> <name><surname>Gurwitz</surname> <given-names>D.</given-names></name> <name><surname>Pasmanik-Chor</surname> <given-names>M.</given-names></name> <name><surname>Laron</surname> <given-names>Z.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Identification of thioredoxin-interacting protein (TXNIP) as a downstream target for IGF1 action.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>115</volume> <fpage>1045</fpage>&#x2013;<lpage>1050</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1715930115</pub-id> <pub-id pub-id-type="pmid">29339473</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Oesper</surname> <given-names>L.</given-names></name> <name><surname>Mahmoody</surname> <given-names>A.</given-names></name> <name><surname>Raphael</surname> <given-names>B. J.</given-names></name></person-group> (<year>2013</year>). <article-title>THetA: inferring intra-tumor heterogeneity from high-throughput DNA sequencing data.</article-title> <source><italic>Genome Biol.</italic></source> <volume>14</volume>:<fpage>R80</fpage>. <pub-id pub-id-type="doi">10.1186/gb-2013-14-7-r80</pub-id> <pub-id pub-id-type="pmid">23895164</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Petralia</surname> <given-names>F.</given-names></name> <name><surname>Wang</surname> <given-names>L.</given-names></name> <name><surname>Peng</surname> <given-names>J.</given-names></name> <name><surname>Yan</surname> <given-names>A.</given-names></name> <name><surname>Zhu</surname> <given-names>J.</given-names></name> <name><surname>Wang</surname> <given-names>P.</given-names></name></person-group> (<year>2018</year>). <article-title>A new method for constructing tumor specific gene co-expression networks based on samples with tumor purity heterogeneity.</article-title> <source><italic>Bioinformatics</italic></source> <volume>34</volume> <fpage>i528</fpage>&#x2013;<lpage>i536</lpage>.</citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Poonia</surname> <given-names>B.</given-names></name> <name><surname>Kijak</surname> <given-names>G. H.</given-names></name> <name><surname>Pauza</surname> <given-names>C. D.</given-names></name></person-group> (<year>2010</year>). <article-title>High affinity allele for the gene of FCGR3A is risk factor for HIV infection and progression.</article-title> <source><italic>PLoS One</italic></source> <volume>5</volume>:<fpage>e15562</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0015562</pub-id> <pub-id pub-id-type="pmid">21187939</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Quail</surname> <given-names>D. F.</given-names></name> <name><surname>Joyce</surname> <given-names>J. A.</given-names></name></person-group> (<year>2013</year>). <article-title>Microenvironmental regulation of tumor progression and metastasis.</article-title> <source><italic>Nat. Med.</italic></source> <volume>19</volume> <fpage>1423</fpage>&#x2013;<lpage>1437</lpage>. <pub-id pub-id-type="doi">10.1038/nm.3394</pub-id> <pub-id pub-id-type="pmid">24202395</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rajan</surname> <given-names>R.</given-names></name> <name><surname>Poniecka</surname> <given-names>A.</given-names></name> <name><surname>Smith</surname> <given-names>T. L.</given-names></name> <name><surname>Yang</surname> <given-names>Y.</given-names></name> <name><surname>Frye</surname> <given-names>D.</given-names></name> <name><surname>Pusztai</surname> <given-names>L.</given-names></name><etal/></person-group> (<year>2004</year>). <article-title>Change in tumor cellularity of breast carcinoma after neoadjuvant chemotherapy as a variable in the pathologic assessment of response.</article-title> <source><italic>Cancer</italic></source> <volume>100</volume> <fpage>1365</fpage>&#x2013;<lpage>1373</lpage>. <pub-id pub-id-type="doi">10.1002/cncr.20134</pub-id> <pub-id pub-id-type="pmid">15042669</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rao</surname> <given-names>S.</given-names></name> <name><surname>Lee</surname> <given-names>S. Y.</given-names></name> <name><surname>Gutierrez</surname> <given-names>A.</given-names></name> <name><surname>Perrigoue</surname> <given-names>J.</given-names></name> <name><surname>Thapa</surname> <given-names>R. J.</given-names></name> <name><surname>Tu</surname> <given-names>Z.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Inactivation of ribosomal protein L22 promotes transformation by induction of the stemness factor, Lin28B.</article-title> <source><italic>Blood</italic></source> <volume>120</volume> <fpage>3764</fpage>&#x2013;<lpage>3773</lpage>. <pub-id pub-id-type="doi">10.1182/blood-2012-03-415349</pub-id> <pub-id pub-id-type="pmid">22976955</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rezazadeh</surname> <given-names>M.</given-names></name> <name><surname>Gharesouran</surname> <given-names>J.</given-names></name> <name><surname>Mirabzadeh</surname> <given-names>A.</given-names></name> <name><surname>Khorram Khorshid</surname> <given-names>H. R.</given-names></name> <name><surname>Biglarian</surname> <given-names>A.</given-names></name> <name><surname>Ohadi</surname> <given-names>M.</given-names></name></person-group> (<year>2015</year>). <article-title>A primate-specific functional GTTT-repeat in the core promoter of CYTH4 is linked to bipolar disorder in human.</article-title> <source><italic>Prog. Neuropsychopharmacol. Biol. Psychiatry</italic></source> <volume>56</volume> <fpage>161</fpage>&#x2013;<lpage>167</lpage>. <pub-id pub-id-type="doi">10.1016/j.pnpbp.2014.09.001</pub-id> <pub-id pub-id-type="pmid">25240857</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rhee</surname> <given-names>J. K.</given-names></name> <name><surname>Jung</surname> <given-names>Y. C.</given-names></name> <name><surname>Kim</surname> <given-names>K. R.</given-names></name> <name><surname>Yoo</surname> <given-names>J.</given-names></name> <name><surname>Kim</surname> <given-names>J.</given-names></name> <name><surname>Lee</surname> <given-names>Y. J.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Impact of tumor purity on immune gene expression and clustering analyses across multiple cancer types.</article-title> <source><italic>Cancer Immunol. Res.</italic></source> <volume>6</volume> <fpage>87</fpage>&#x2013;<lpage>97</lpage>. <pub-id pub-id-type="doi">10.1158/2326-6066.CIR-17-0201</pub-id> <pub-id pub-id-type="pmid">29141981</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ritchie</surname> <given-names>M. E.</given-names></name> <name><surname>Phipson</surname> <given-names>B.</given-names></name> <name><surname>Wu</surname> <given-names>D.</given-names></name> <name><surname>Hu</surname> <given-names>Y.</given-names></name> <name><surname>Law</surname> <given-names>C. W.</given-names></name> <name><surname>Shi</surname> <given-names>W.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>limma powers differential expression analyses for RNA-sequencing and microarray studies.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>43</volume>:<fpage>e47</fpage>.</citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Robinson</surname> <given-names>M. D.</given-names></name> <name><surname>McCarthy</surname> <given-names>D. J.</given-names></name> <name><surname>Smyth</surname> <given-names>G. K.</given-names></name></person-group> (<year>2010</year>). <article-title>edgeR: a bioconductor package for differential expression analysis of digital gene expression data.</article-title> <source><italic>Bioinformatics</italic></source> <volume>26</volume> <fpage>139</fpage>&#x2013;<lpage>140</lpage>.</citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ryan</surname> <given-names>B. M.</given-names></name> <name><surname>Zanetti</surname> <given-names>K. A.</given-names></name> <name><surname>Robles</surname> <given-names>A. I.</given-names></name> <name><surname>Schetter</surname> <given-names>A. J.</given-names></name> <name><surname>Goodman</surname> <given-names>J.</given-names></name> <name><surname>Hayes</surname> <given-names>R. B.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Germline variation in NCF4, an innate immunity gene, is associated with an increased risk of colorectal cancer.</article-title> <source><italic>Int. J. Cancer</italic></source> <volume>134</volume> <fpage>1399</fpage>&#x2013;<lpage>1407</lpage>. <pub-id pub-id-type="doi">10.1002/ijc.28457</pub-id> <pub-id pub-id-type="pmid">23982929</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ryu</surname> <given-names>J.</given-names></name> <name><surname>Koh</surname> <given-names>Y.</given-names></name> <name><surname>Park</surname> <given-names>H.</given-names></name> <name><surname>Kim</surname> <given-names>D. Y.</given-names></name> <name><surname>Kim</surname> <given-names>D. C.</given-names></name> <name><surname>Byun</surname> <given-names>J. M.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Highly expressed integrin-&#x03B1;8 induces epithelial to mesenchymal transition-like features in multiple myeloma with early relapse.</article-title> <source><italic>Mol. Cells</italic></source> <volume>39</volume> <fpage>898</fpage>&#x2013;<lpage>908</lpage>. <pub-id pub-id-type="doi">10.14348/molcells.2016.0210</pub-id> <pub-id pub-id-type="pmid">28008160</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shimizu</surname> <given-names>Y.</given-names></name> <name><surname>Kohyama</surname> <given-names>M.</given-names></name> <name><surname>Yorifuji</surname> <given-names>H.</given-names></name> <name><surname>Jin</surname> <given-names>H.</given-names></name> <name><surname>Arase</surname> <given-names>N.</given-names></name> <name><surname>Suenaga</surname> <given-names>T.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>Fc&#x03B3;RIIIA-mediated activation of NK cells by IgG heavy chain complexed with MHC class II molecules.</article-title> <source><italic>Int. Immunol.</italic></source> <volume>31</volume> <fpage>303</fpage>&#x2013;<lpage>314</lpage>. <pub-id pub-id-type="doi">10.1093/intimm/dxz010</pub-id> <pub-id pub-id-type="pmid">30721990</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stelzer</surname> <given-names>G.</given-names></name> <name><surname>Rosen</surname> <given-names>N.</given-names></name> <name><surname>Plaschkes</surname> <given-names>I.</given-names></name> <name><surname>Zimmerman</surname> <given-names>S.</given-names></name> <name><surname>Twik</surname> <given-names>M.</given-names></name> <name><surname>Fishilevich</surname> <given-names>S.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>The genecards suite: from gene data mining to disease genome sequence analyses.</article-title> <source><italic>Curr. Protoc. Bioinformatics</italic></source> <volume>54</volume> <fpage>1.30.1</fpage>&#x2013;<lpage>1.30.33</lpage>. <pub-id pub-id-type="doi">10.1002/cpbi.5</pub-id> <pub-id pub-id-type="pmid">27322403</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sulzmaier</surname> <given-names>F. J.</given-names></name> <name><surname>Jean</surname> <given-names>C.</given-names></name> <name><surname>Schlaepfer</surname> <given-names>D. D.</given-names></name></person-group> (<year>2014</year>). <article-title>FAK in cancer: mechanistic findings and clinical applications.</article-title> <source><italic>Nat. Rev. Cancer</italic></source> <volume>14</volume> <fpage>598</fpage>&#x2013;<lpage>610</lpage>. <pub-id pub-id-type="doi">10.1038/nrc3792</pub-id> <pub-id pub-id-type="pmid">25098269</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Turley</surname> <given-names>S. J.</given-names></name> <name><surname>Cremasco</surname> <given-names>V.</given-names></name> <name><surname>Astarita</surname> <given-names>J. L.</given-names></name></person-group> (<year>2015</year>). <article-title>Immunological hallmarks of stromal cells in the tumour microenvironment.</article-title> <source><italic>Nat. Rev. Immunol.</italic></source> <volume>15</volume> <fpage>669</fpage>&#x2013;<lpage>682</lpage>. <pub-id pub-id-type="doi">10.1038/nri3902</pub-id> <pub-id pub-id-type="pmid">26471778</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vasaikar</surname> <given-names>S. V.</given-names></name> <name><surname>Straub</surname> <given-names>P.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name> <name><surname>Zhang</surname> <given-names>B.</given-names></name></person-group> (<year>2017</year>). <article-title>LinkedOmics: analyzing multi-omics data within and across 32 cancer types.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>46</volume>, <fpage>D956</fpage>&#x2013;<lpage>D963</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkx1090</pub-id> <pub-id pub-id-type="pmid">29136207</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Whitfield</surname> <given-names>M. L.</given-names></name> <name><surname>Zheng</surname> <given-names>L. X.</given-names></name> <name><surname>Baldwin</surname> <given-names>A.</given-names></name> <name><surname>Ohta</surname> <given-names>T.</given-names></name> <name><surname>Hurt</surname> <given-names>M. M.</given-names></name> <name><surname>Marzluff</surname> <given-names>W. F.</given-names></name></person-group> (<year>2000</year>). <article-title>Stem-loop binding protein, the protein that binds the 3&#x2019; end of histone mRNA, is cell cycle regulated by both translational and posttranslational mechanisms.</article-title> <source><italic>Mol. Cell Biol.</italic></source> <volume>20</volume> <fpage>4188</fpage>&#x2013;<lpage>4198</lpage>. <pub-id pub-id-type="doi">10.1128/mcb.20.12.4188-4198.2000</pub-id> <pub-id pub-id-type="pmid">10825184</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xiao</surname> <given-names>H.</given-names></name> <name><surname>Gulen</surname> <given-names>M. F.</given-names></name> <name><surname>Qin</surname> <given-names>J.</given-names></name> <name><surname>Yao</surname> <given-names>J.</given-names></name> <name><surname>Bulek</surname> <given-names>K.</given-names></name> <name><surname>Kish</surname> <given-names>D.</given-names></name><etal/></person-group> (<year>2007</year>). <article-title>The Toll-interleukin-1 receptor member SIGIRR regulates colonic epithelial homeostasis, inflammation, and tumorigenesis.</article-title> <source><italic>Immunity</italic></source> <volume>26</volume> <fpage>461</fpage>&#x2013;<lpage>475</lpage>. <pub-id pub-id-type="doi">10.1016/j.immuni.2007.02.012</pub-id> <pub-id pub-id-type="pmid">17398123</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ye</surname> <given-names>T.</given-names></name> <name><surname>Fu</surname> <given-names>A. K.</given-names></name> <name><surname>Ip</surname> <given-names>N. Y.</given-names></name></person-group> (<year>2015</year>). <article-title>Emerging roles of Axin in cerebral cortical development.</article-title> <source><italic>Front. Cell Neurosci</italic>.</source> <volume>9</volume>:<fpage>217</fpage>. <pub-id pub-id-type="doi">10.3389/fncel.2015.00217</pub-id> <pub-id pub-id-type="pmid">26106297</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yoshihara</surname> <given-names>K.</given-names></name> <name><surname>Shahmoradgoli</surname> <given-names>M.</given-names></name> <name><surname>Mart&#x00ED;nez</surname> <given-names>E.</given-names></name> <name><surname>Vegesna</surname> <given-names>R.</given-names></name> <name><surname>Kim</surname> <given-names>H.</given-names></name> <name><surname>Torres-Garcia</surname> <given-names>W.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Inferring tumour purity and stromal and immune cell admixture from expression data.</article-title> <source><italic>Nat. Commun.</italic></source> <volume>4</volume>:<fpage>2612</fpage>. <pub-id pub-id-type="doi">10.1038/ncomms3612</pub-id> <pub-id pub-id-type="pmid">24113773</pub-id></citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yuan</surname> <given-names>Y.</given-names></name> <name><surname>Failmezger</surname> <given-names>H.</given-names></name> <name><surname>Rueda</surname> <given-names>O. M.</given-names></name> <name><surname>Ali</surname> <given-names>H. R.</given-names></name> <name><surname>Gr&#x00E4;f</surname> <given-names>S.</given-names></name> <name><surname>Chin</surname> <given-names>S. F.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Quantitative image analysis of cellular heterogeneity in breast tumors complements genomic profiling.</article-title> <source><italic>Sci. Transl. Med.</italic></source> <volume>4</volume>:<fpage>157ra143</fpage>. <pub-id pub-id-type="doi">10.1126/scitranslmed.3004330</pub-id> <pub-id pub-id-type="pmid">23100629</pub-id> <comment>Erratum in: Sci. Transl. Med. 4:161er6</comment></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Zhang</surname> <given-names>S.</given-names></name> <name><surname>Li</surname> <given-names>A.</given-names></name> <name><surname>Zhang</surname> <given-names>A.</given-names></name> <name><surname>Zhang</surname> <given-names>S.</given-names></name> <name><surname>Chen</surname> <given-names>L.</given-names></name></person-group> (<year>2018</year>). <article-title>DPY30 is required for the enhanced proliferation, motility and epithelial-mesenchymal transition of epithelial ovarian cancer cells.</article-title> <source><italic>Int. J. Mol. Med.</italic></source> <volume>42</volume> <fpage>3065</fpage>&#x2013;<lpage>3072</lpage>. <pub-id pub-id-type="doi">10.3892/ijmm.2018.3869</pub-id> <pub-id pub-id-type="pmid">30221689</pub-id></citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>W.</given-names></name> <name><surname>Feng</surname> <given-names>H.</given-names></name> <name><surname>Wu</surname> <given-names>H.</given-names></name> <name><surname>Zheng</surname> <given-names>X.</given-names></name></person-group> (<year>2017</year>). <article-title>Accounting for tumor purity improves cancer subtype classification from DNA methylation data.</article-title> <source><italic>Bioinformatics</italic></source> <volume>33</volume> <fpage>2651</fpage>&#x2013;<lpage>2657</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btx303</pub-id> <pub-id pub-id-type="pmid">28472248</pub-id></citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zheng</surname> <given-names>X.</given-names></name> <name><surname>Zhang</surname> <given-names>N.</given-names></name> <name><surname>Wu</surname> <given-names>H. J.</given-names></name> <name><surname>Wu</surname> <given-names>H.</given-names></name></person-group> (<year>2017</year>). <article-title>Estimating and accounting for tumor purity in the analysis of DNA methylation data from cancer studies.</article-title> <source><italic>Genome Biol</italic>.</source> <volume>18</volume>:<fpage>17</fpage>. <pub-id pub-id-type="doi">10.1186/s13059-016-1143-5</pub-id> <pub-id pub-id-type="pmid">28122605</pub-id></citation></ref>
</ref-list>
</back>
</article>
