<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Oncol.</journal-id>
<journal-title>Frontiers in Oncology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Oncol.</abbrev-journal-title>
<issn pub-type="epub">2234-943X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fonc.2021.626654</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Oncology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Identification of a Novel Four-Gene Signature Correlated With the Prognosis of Patients With Hepatocellular Carcinoma: A Comprehensive Analysis</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Zhu</surname>
<given-names>Weihua</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Ru</surname>
<given-names>Lixin</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Ma</surname>
<given-names>Zhenchao</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
<xref ref-type="author-notes" rid="fn001">
<sup>*</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1117252"/>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>Department of Gastroenterology, Affiliated Huzhou Hospital, Zhejiang University School of Medicine, Huzhou Central Hospital, Affiliated Central Hospital Huzhou University</institution>, <addr-line>Huzhou</addr-line>, <country>China</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>Department of Radiation Oncology, Affiliated Huzhou Hospital, Zhejiang University School of Medicine, Huzhou Central Hospital, Affiliated Central Hospital Huzhou University</institution>, <addr-line>Huzhou</addr-line>, <country>China</country>
</aff>
<aff id="aff3">
<sup>3</sup>
<institution>Department of Radiation Oncology, The Second Affiliated Hospital of Soochow University</institution>, <addr-line>Suzhou</addr-line>, <country>China</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>Edited by: Bubu Ama Banini, Virginia Commonwealth University, United States</p>
</fn>
<fn fn-type="edited-by">
<p>Reviewed by: Wafaa M. Rashed, Children&#x2019;s Cancer Hospital, Egypt; Huaichao Luo, Sichuan Cancer Hospital, China</p>
</fn>
<fn fn-type="corresp" id="fn001">
<p>*Correspondence: Zhenchao Ma, <email xlink:href="mailto:mazhenchao1986@163.com">mazhenchao1986@163.com</email>
</p>
</fn>
<fn fn-type="other" id="fn002">
<p>This article was submitted to Gastrointestinal Cancers, a section of the journal Frontiers in Oncology</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>12</day>
<month>03</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>11</volume>
<elocation-id>626654</elocation-id>
<history>
<date date-type="received">
<day>06</day>
<month>11</month>
<year>2020</year>
</date>
<date date-type="accepted">
<day>21</day>
<month>01</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2021 Zhu, Ru and Ma</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Zhu, Ru and Ma</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<sec>
<title>Purpose</title>
<p>Hepatocellular carcinoma (HCC) is a common solid-tumor malignancy with high heterogeneity, and accurate prognostic prediction in HCC remains difficult. This analysis was performed to find a novel prognostic multigene signature.</p>
</sec>
<sec>
<title>Methods</title>
<p>The TCGA-LIHC dataset was analyzed for differentially coexpressed genes through weighted gene coexpression network analysis (WGCNA) and differential gene expression analysis. A protein-protein interaction (PPI) network and univariate Cox regression analysis of overall survival (OS) were utilized to identify their prognostic value. Next, we used least absolute shrinkage and selection operator (LASSO) Cox regression to establish a prognostic module. Subsequently, the ICGC-LIRI-JP dataset was applied for further validation. Based on this module, HCC cases were stratified into high-risk and low-risk groups, and differentially expressed genes (DEGs) were identified. Functional enrichment analyses of these DEGs were conducted. Finally, single-sample gene set enrichment analysis (ssGSEA) was performed to explore the correlation between the prognostic signature and immune status.</p>
</sec>
<sec>
<title>Results</title>
<p>A total of 393 differentially coexpressed genes were obtained. Forty differentially coexpressed hub genes were identified using the CytoHubba plugin, and 38 of them were closely correlated with OS. Afterward, we established the four-gene prognostic signature with an acceptable accuracy (area under the curve [AUC] of 1-year survival: 0.739). The ICGC-LIRI-JP dataset also supported the acceptable accuracy (AUC of 1-year survival:0.752). Compared with low-risk cohort, HCC cases in the high-risk cohort had shorter OS, higher tumor grades, and higher T stages. The risk scores of this signature still act as independent predictors of OS (P&lt;0.001). Functional enrichment analyses suggest that it was mainly organelle fission and nuclear division that were enriched. Finally, ssGSEA revealed that this signature is strongly associated with the immune status of HCC patients.</p>
</sec>
<sec>
<title>Conclusions</title>
<p>The proposed prognostic signature of four differentially coexpressed hub genes has satisfactory prognostic ability, providing important insight into the prediction of HCC prognosis.</p>
</sec>
</abstract>
<kwd-group>
<kwd>hepatocellular carcinoma</kwd>
<kwd>least absolute shrinkage and selection operator Cox regression</kwd>
<kwd>weighted gene coexpression network analysis</kwd>
<kwd>prognostic signature</kwd>
<kwd>single-sample gene set enrichment analysis</kwd>
<kwd>immune status</kwd>
</kwd-group>
<counts>
<fig-count count="11"/>
<table-count count="1"/>
<equation-count count="0"/>
<ref-count count="40"/>
<page-count count="17"/>
<word-count count="5314"/>
</counts>
</article-meta>
</front>
<body>
<sec id="s1" sec-type="intro">
<title>Introduction</title>
<p>It is estimated that nearly 42,810 new cases and 30,160 estimated deaths of hepatocellular carcinoma (HCC) will occur in 2020, leading to enormous socioeconomic pressure for HCC patients and their families (<xref ref-type="bibr" rid="B1">1</xref>). HCC accounts for 85%&#x2013;90% of all primary liver cancer patients, and its occurrence is strongly associated with chronic hepatitis B virus (HBV) or hepatitis C virus (HCV) infection, alcohol consumption, and nonalcoholic steatohepatitis (<xref ref-type="bibr" rid="B2">2</xref>). HCC has high interpatient, intertumoral and intratumoral heterogeneity (<xref ref-type="bibr" rid="B3">3</xref>). Patients with localized HCC usually have poor survival (with a 5-year overall survival [OS] rate of 30%), and this rate is less than 5% for HCC patients with distant metastasis (<xref ref-type="bibr" rid="B4">4</xref>). Currently, due to the complicated etiologic factors and the high heterogeneity of HCC, it remains difficult to accurately predict the prognosis of HCC patients. Although there were some similar studies published previously, they usually required many genes in their gene signatures, which may cause some difficulties in real-world practice (<xref ref-type="bibr" rid="B5">5</xref>, <xref ref-type="bibr" rid="B6">6</xref>). Therefore, it is urgent to find the gene signature involved with less genes for the convenience of real-world practice.</p>
<p>With the rapid development of genome technology, bioinformatics analysis has been adopted for microarray datasets to further explore the underlying molecular mechanisms of diseases and detect disease-specific biomarkers (<xref ref-type="bibr" rid="B7">7</xref>). Weighted gene coexpression network analysis (WGCNA) is utilized to further understand gene coexpression networks and gene functions (<xref ref-type="bibr" rid="B8">8</xref>). WGCNA detects modules of closely correlated genes among samples to relate modules to external traits, providing significant insights into predicting possible functions of coexpressed genes (<xref ref-type="bibr" rid="B9">9</xref>). Additionally, differential gene expression analysis is often utilized in transcriptomic datasets to investigate potential biological and molecular mechanisms and quantify differences between the gene expression levels of experimental and control cohorts (<xref ref-type="bibr" rid="B10">10</xref>).</p>
<p>To increase the reliability of screening highly related genes, both methods mentioned above were used in our analysis. First, the RNA-Seq dataset and HCC clinical information were downloaded from The Cancer Genome Atlas (TCGA) database. Second, WGCNA and differential gene expression analysis were performed to obtain differentially coexpressed genes. Then, a protein-protein interaction (PPI) network was constructed, and 38 differential coexpression hub genes with prognostic value were detected. Afterward, we built a prognostic four-gene signature and verified it in the International Cancer Genome Consortium (ICGC) database. Ultimately, functional enrichment analysis was conducted to investigate the underlying biological mechanisms.</p>
</sec>
<sec id="s2" sec-type="materials|methods">
<title>Materials and Methods</title>
<p>The detailed process of data downloading, prognostic signature construction and external validation is presented in <xref ref-type="fig" rid="f1">
<bold>Figure 1</bold>
</xref>. The details of each step are illustrated in the following subsections.</p>
<fig id="f1" position="float">
<label>Figure 1</label>
<caption>
<p>Study design and workflow of this study.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fonc-11-626654-g001.tif"/>
</fig>
<sec id="s2_1">
<title>Datasets Downloaded From the TCGA and ICGC Databases</title>
<p>First, RNA-Seq and corresponding clinical data for liver hepatocellular carcinoma (LIHC) were obtained from the TCGA database (<uri xlink:href="https://portal.gdc.cancer.gov/">https://portal.gdc.cancer.gov/</uri>). A list of 424 samples was obtained, including 374 LIHC and 50 normal liver tissues, and RNA-seq count data on 19645 genes were obtained. The Illumina HiSeq 2000 platform was used to generate and annotate all data to a reference transcript set of the human hg38 gene standard track. The edgeR package tutorial suggested that genes with low read counts do not merit further analysis (<xref ref-type="bibr" rid="B11">11</xref>). Hence, genes with a count per million (CPM) &lt;1 were omitted from this analysis. Next, the function rpkm in the edgeR package was adapted for further filtering. Consequently, 13,924 genes were acquired for subsequent analysis. Second, the RNA-Seq data and clinical data of HCC patients were acquired from the ICGC database (<uri xlink:href="https://dcc.icgc.org/">https://dcc.icgc.org/</uri>). A total of 260 HCC samples, which mainly originated from the Japanese population with HBV or HCV infection, were acquired (<xref ref-type="bibr" rid="B12">12</xref>). We chose the normalized read count values of the ICGC-LIRI-JP cohort. As a result, 22,913 genes were obtained for the next analysis.</p>
</sec>
<sec id="s2_2">
<title>Identification of Key Coexpression Modules Using WGCNA</title>
<p>The gene coexpression network of the TCGA-LIHC dataset was built through the WGCNA package (<xref ref-type="bibr" rid="B8">8</xref>). To build a scale-free network, a soft-power &#x3b2; = 7 (<xref ref-type="fig" rid="f2">
<bold>Figures 2A, B</bold>
</xref>) was used in the TCGA-LIHC dataset. Next, the adjacency matrix was created according to the formula aij = |Sij|<sup>&#x3b2;</sup> (aij: adjacency matrix between gene i and gene j, Sij: similarity matrix made by Pearson&#x2019;s correlation coefficient of all gene pairs, as well as &#x3b2;: soft-power value). Subsequently, we converted this matrix into a topological overlap matrix (TOM) and the corresponding dissimilarity (1-TOM). The hierarchical clustering dendrogram of the 1-TOM matrix was established to aggregate the genes with similar expression patterns into the same coexpression module. Afterward, the module-trait relations between modules and external traits were analyzed to identify functional modules from the coexpression network. Hence, the modules with the largest correlation coefficients were regarded as modules that highly correlated with clinical traits. We chose the module that was positively associated with LIHC for our subsequent analysis.</p>
<fig id="f2" position="float">
<label>Figure 2</label>
<caption>
<p>Identification of modules related to the clinical traits in the TCGA-LIHC dataset. <bold>(A)</bold> Sample dendrogram and trait heatmap. <bold>(B)</bold> Scale independence and Mean connectivity. <bold>(C)</bold> The cluster dendrogram of co-expression network modules is ordered by a hierarchical clustering of genes based on the 1-TOM matrix. Different colors represent different modules. <bold>(D)</bold> Module-trait relationships. Each row represents a color module and every column represents a clinical trait (normal and tumor). Each cell contains the corresponding correlation and P-value.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fonc-11-626654-g002.tif"/>
</fig>
</sec>
<sec id="s2_3">
<title>Identification of Differentially Coexpressed Genes</title>
<p>The limma package is often used to perform differential gene expression analysis of gene expression profiles and RNA-Seq datasets (<xref ref-type="bibr" rid="B13">13</xref>). Here, we applied the limma package in the differential expression analysis of the TCGA-LIHC dataset to identify differentially expressed genes (DEGs) between LIHC and nontumorous tissues. To minimize the false discovery rate (FDR) to the greatest extent possible, we adjusted the P-value with the Benjamini&#x2013;Hochberg (BH) method. The filtering criteria for DEGs were |logFC|&gt;1 and adj. P &lt;0.05. Afterward, we took the intersection of genes between DEGs and coexpressed genes to improve the reliability of screening closely related genes, and these differentially coexpressed genes were used for the next analysis.</p>
</sec>
<sec id="s2_4">
<title>PPI Network Construction and Hub Gene Identification</title>
<p>The PPI network of differentially coexpressed genes was built through the Search Tool for the Retrieval of Interacting Genes (STRING) database (<xref ref-type="bibr" rid="B14">14</xref>). Then, we established a visual network of molecular interactions with combined scores &#x2265;0.7 using Cytoscape (<xref ref-type="bibr" rid="B15">15</xref>). In addition, the degree values of all nodes in the PPI network were calculated using the CytoHubba plugin (<xref ref-type="bibr" rid="B16">16</xref>). The top 40 nodes with the highest degree scores were selected and regarded as hub genes associated with LIHC. The forty hub genes related to LIHC were displayed using the CytoHubba plug-in. In addition, we conducted gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analyses of the 40 hub genes to explore their biological functions. Adj. P values &lt;0.05 were considered significant.</p>
</sec>
<sec id="s2_5">
<title>Survival Analysis of Hub Genes and the Correlation Network</title>
<p>To analyze the prognostic roles of the differentially coexpressed hub genes in LIHC, we performed univariate Cox regression analysis of OS using the survival package based on the TCGA-LIHC dataset. LIHC patients without follow-up information or a survival time=0 days were excluded from our analysis, and the other patients in the TCGA-LIHC dataset were classified into two groups considering the median expression levels of the differentially coexpressed hub genes. Log-rank P&lt;0.01 was considered significant. Additionally, the correlation network of these differentially coexpressed hub genes was established through the igraph package. The filtering criterion was a cutoff &gt;0.75.</p>
</sec>
<sec id="s2_6">
<title>Construction of the Gene Signature in the TCGA Database</title>
<p>To decrease the risk of overfitting to the greatest extent possible, we used least absolute shrinkage and selection operator (LASSO) Cox regression analysis to build the prognostic module of LIHC (<xref ref-type="bibr" rid="B17">17</xref>, <xref ref-type="bibr" rid="B18">18</xref>). The LASSO algorithm is widely utilized to select and shrink variables using the glmnet package. We used the expression matrix of the differentially coexpressed hub genes with prognostic value as the independent variable, while the OS and status of patients in the TCGA-LIHC dataset were used as the response variables. Then, we determined the penalty parameter (&#x3bb;) of this module using tenfold cross-validation following the minimum criteria, namely, the &#x3bb; value corresponding to the minimum partial likelihood deviance.</p>
</sec>
<sec id="s2_7">
<title>Nomogram and Validation of the Expression Patterns of the Gene Signature</title>
<p>We calculated the risk scores of all LIHC patients using the expression level of every gene and the corresponding regression coefficient. The following formula was used: score= e<sup>sum (every gene&#x2019;s expression &#xd7; corresponding coefficient)</sup>. Then, LIHC patients were divided into high- and low-risk cohorts based on the median value of the risk score. Subsequently, we constructed a nomogram of the prognostic signature to predict the survival of LIHC patients. Furthermore, we built calibration curves and time-dependent receiver operating characteristic (ROC) curves to evaluate the discrimination and accuracy of the prognostic multigene signature. The GSE112790 dataset was used to validate the expression patterns of the genes in the signature between LIHC and nontumorous tissues.</p>
</sec>
<sec id="s2_8">
<title>Distribution and Prognostic Value of the Gene Signature</title>
<p>To analyze the prognostic value of the gene signature, we performed Kaplan-Meier survival analysis between the low- and high-risk groups using the survminer package based on the TCGA-LIHC and ICGC-LIRI-JP datasets. Additionally, to explore distribution in the low- and high-risk cohorts, we performed principal component analysis (PCA) and t-distributed stochastic neighbor embedding (t-SNE) on the TCGA-LIHC and ICGC-LIRI-JP datasets using the stats and Rtsne packages, respectively. To determine whether the risk score acts as an independent indicator of the prognosis of LIHC patients, we performed univariate and multivariate Cox regression analyses among all available variables using the TCGA-LIHC and ICGC-LIRI-JP datasets.</p>
</sec>
<sec id="s2_9">
<title>Differential Gene Expression Analysis and Functional Enrichment Analysis</title>
<p>To acquire the DEGs between the low- and high-risk groups, we performed differential gene expression analysis using the limma package in the TCGA-LIHC and ICGC-LIRI-JP datasets. The P-value was adjusted using the BH method. The filtering criteria for DEGs were |logFC|&gt;2 and adj. P &lt;0.05. Afterward, we conducted GO and KEGG pathway analyses of the DEGs between the low- and high-risk groups in the TCGA-LIHC and ICGC-LIRI-JP datasets. To further analyze the relationship between the risk score and immune status, we calculated the infiltrating scores of 16 immune cells and 13 immune-related functions or pathways using single-sample gene set enrichment analysis (ssGSEA) (<xref ref-type="bibr" rid="B19">19</xref>).</p>
</sec>
</sec>
<sec id="s3" sec-type="results">
<title>Results</title>
<sec id="s3_1">
<title>Identification of Key Coexpression Modules Using WGCNA</title>
<p>To find the pivotal module in LIHC, the gene coexpression network was established in the TCGA-LIHC dataset. A list of 11 modules was generated (<xref ref-type="fig" rid="f2">
<bold>Figure 2C</bold>
</xref>). Next, the heatmap revealed the correlations between the modules and clinical traits (normal and LIHC) in the TCGA-LIHC dataset (<xref ref-type="fig" rid="f2">
<bold>Figure 2D</bold>
</xref>). Furthermore, the yellow module of the TCGA-LIHC dataset positively correlated with LIHC tissues (r=0.57, P=1e-37) and was used for our next analysis.</p>
</sec>
<sec id="s3_2">
<title>Selection of Differentially Coexpressed Genes</title>
<p>The heatmap displayed the expression patterns of fifty upregulated and fifty downregulated genes in the TCGA-LIHC dataset (<xref ref-type="fig" rid="f3">
<bold>Figure 3A</bold>
</xref>). The volcano plot indicated that 2708 DEGs had a conspicuous dysregulation between LIHC and nontumorous tissues in the TCGA-LIHC dataset (<xref ref-type="fig" rid="f3">
<bold>Figure 3B</bold>
</xref>). The Venn diagram showed the intersection of coexpressed genes (<xref ref-type="supplementary-material" rid="ST1">
<bold>Table S1</bold>
</xref>) and DEGs (<xref ref-type="supplementary-material" rid="ST2">
<bold>Table S2</bold>
</xref>); namely, 393 differentially coexpressed genes were identified (<xref ref-type="fig" rid="f3">
<bold>Figure 3C</bold>
</xref>).</p>
<fig id="f3" position="float">
<label>Figure 3</label>
<caption>
<p>Identification of differentially expressed genes (DEGs) in TCGA-LIHC dataset with the cut-off criteria of |logFC|&gt;1 and adj.P &lt;0.05. <bold>(A)</bold> Heatmap of top 50 upregulated and 50 downregulated DEGs of TCGA-LIHC dataset. <bold>(B)</bold> Volcano plot of DEGs in the TCGA-LIHC dataset. <bold>(C)</bold> The Venn diagram of genes between DEGs and co-expression genes. A total of 393 overlapping differential co-expression genes are detected.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fonc-11-626654-g003.tif"/>
</fig>
</sec>
<sec id="s3_3">
<title>PPI Network Construction and Hub Gene Analysis</title>
<p>
<xref ref-type="fig" rid="f4">
<bold>Figure 4A</bold>
</xref> displays the PPI network of the differentially coexpressed genes with 241 nodes and 4792 edges. Subsequently, we quantified the degree scores of all nodes in this PPI network through the CytoHubba plugin (<xref ref-type="supplementary-material" rid="ST3">
<bold>Table S3</bold>
</xref>) and chose the top 40 nodes as hub genes that are closely correlated with LIHC (<xref ref-type="fig" rid="f4">
<bold>Figure 4B</bold>
</xref>). In addition, GO analysis showed significant enrichment in the mitotic nuclear division, organelle fission and spindles terms (<xref ref-type="supplementary-material" rid="SF1">
<bold>Figure S1A</bold>
</xref>). KEGG pathway analysis showed enrichment in the cell cycle and oocyte meiosis pathways (<xref ref-type="supplementary-material" rid="SF1">
<bold>Figure S1B</bold>
</xref>).</p>
<fig id="f4" position="float">
<label>Figure 4</label>
<caption>
<p>Visualization of the protein-protein interaction (PPI) network and hub genes. <bold>(A)</bold> PPI network of differential co-expression genes. <bold>(B)</bold> The identification of 40 differential co-expressed hub genes using the degree algorithm.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fonc-11-626654-g004.tif"/>
</fig>
</sec>
<sec id="s3_4">
<title>Survival Analysis and Correlation Network of the Differentially Coexpressed Hub Genes</title>
<p>Univariate Cox regression analysis of the differentially coexpressed hub genes demonstrated that 38 hub genes were closely associated with the survival of LIHC patients (<xref ref-type="fig" rid="f5">
<bold>Figure 5A</bold>
</xref>). The heatmap revealed that the 38 hub genes with prognostic value were significantly overexpressed in LIHC tissues (<xref ref-type="fig" rid="f5">
<bold>Figure 5B</bold>
</xref>). Additionally, the correlation network suggested that the differentially coexpressed hub genes closely interact with each other (<xref ref-type="fig" rid="f5">
<bold>Figure 5C</bold>
</xref>).</p>
<fig id="f5" position="float">
<label>Figure 5</label>
<caption>
<p>Identification differential co-expressed hub genes with prognostic values. <bold>(A)</bold> Univariate Cox analysis for overall survival (OS) of 38 differential co-expressed hub genes with prognostic values. <bold>(B)</bold> 38 differential co-expressed hub genes with prognostic values are significantly upregulated in HCC tissues. <bold>(C)</bold> The correlation network of candidate genes. The correlation coefficients are represented by different colors.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fonc-11-626654-g005.tif"/>
</fig>
</sec>
<sec id="s3_5">
<title>Construction of the Gene Signature and Nomogram in the TCGA Database</title>
<p>We used the LASSO Cox regression module to build a prognostic signature based on the expression matrix of the 38 differentially coexpressed hub genes. Consequently, we identified a four-gene signature module according to the optimal &#x3bb; value (<xref ref-type="fig" rid="f6">
<bold>Figures 6A, B</bold>
</xref>). In addition, we calculated the risk scores of LIHC patients using the following formula: score= e <sup>(0.225*expression value of CDCA8+0.124*expression value of KIF20A+0.012*expression value of KIF2C+0.144*expression value of CEP55)</sup>. Then, we established a nomogram to predict the 1-, 2-, and 3-year OS probability of LIHC patients (<xref ref-type="fig" rid="f6">
<bold>Figure 6C</bold>
</xref>). The calibration curves of 1-, 2-, and 3-year OS probability showed satisfactory calibration of this nomogram (<xref ref-type="fig" rid="f6">
<bold>Figures 6D&#x2013;F</bold>
</xref>). Moreover, based on GSE112790, we confirmed that CDCA8, KIF20A, KIF2C and CEP55 were significantly overexpressed in LIHC tissues compared with nontumorous tissues (<xref ref-type="supplementary-material" rid="SF2">
<bold>Figure S2</bold>
</xref>). Furthermore, the ROC curves suggested acceptable accuracy of this nomogram (area under the curve [AUC] of 1-year survival: 0.739; AUC of 2-year survival: 0.714; and AUC of 3-year survival: 0.673) (<xref ref-type="fig" rid="f7">
<bold>Figure 7A</bold>
</xref>). Afterward, all LIHC patients were divided into a low-risk cohort (n=183) and a high-risk cohort (n=182) based on the median risk score (<xref ref-type="fig" rid="f7">
<bold>Figure 7B</bold>
</xref>). The high-risk cohort in the TCGA-LIHC dataset had more deaths (<xref ref-type="fig" rid="f7">
<bold>Figure 7C</bold>
</xref>), a poorer tumor grade, a higher clinical stage and a higher T stage (<xref ref-type="table" rid="T1">
<bold>Table 1</bold>
</xref>). Consistently, Kaplan-Meier survival analysis showed that LIHC patients in the high-risk cohort experienced shorter survival than those in the low-risk cohort (<xref ref-type="fig" rid="f7">
<bold>Figure 7D</bold>
</xref>, P=1.14e-4). In the PCA of the TCGA-LIHC dataset, the first principal component (PC1) could explain 88.6% of total variance, and the PC1 scores were negatively correlated with the risk scores of patients (<xref ref-type="fig" rid="f7">
<bold>Figure 7E</bold>
</xref>), while the second principal component (PC2) could explain 5.4% total variance (<xref ref-type="supplementary-material" rid="SF3">
<bold>Figure S3A</bold>
</xref>). Moreover, PCA and t-SNE analysis revealed that most LIHC patients in the high- and low-risk cohorts were distributed in two different directions (<xref ref-type="fig" rid="f7">
<bold>Figure 7F</bold>
</xref>).</p>
<fig id="f6" position="float">
<label>Figure 6</label>
<caption>
<p>|Construction of the gene signature and nomogram in TCGA-LIHC dataset. <bold>(A, B)</bold> The construction of the four-gene signature module. <bold>(C)</bold> The construction of the nomogram of this module. <bold>(D&#x2013;F)</bold> The calibration curves of 1-, 2-, and 3-year overall survival probability.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fonc-11-626654-g006.tif"/>
</fig>
<fig id="f7" position="float">
<label>Figure 7</label>
<caption>
<p>Prognostic analysis of the four-gene signature model in TCGA-LIHC dataset. <bold>(A)</bold> AUC of time-dependent ROC curves verifies the prognostic performance of the risk score in TCGA-LIHC dataset. <bold>(B)</bold> The distribution and the median value of the risk scores in TCGA-LIHC dataset. <bold>(C)</bold> The distributions of OS status, OS and risk score in the TCGA-LIHC dataset. <bold>(D)</bold> Kaplan-Meier curves for the OS of patients in the high-risk group and low-risk group in TCGA-LIHC dataset. <bold>(E)</bold> PCA plot of TCGA-LIHC dataset. <bold>(F)</bold> t-SNE analysis of TCGA-LIHC dataset.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fonc-11-626654-g007.tif"/>
</fig>
<table-wrap id="T1" position="float">
<label>Table 1</label>
<caption>
<p>Baseline characteristics of LIHC patients in high-risk and low-risk cohorts.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top" rowspan="2" colspan="2" align="left">Baseline characteristics</th>
<th valign="top" colspan="3" align="center">TCGA-LIHC dataset</th>
<th valign="top" colspan="3" align="center">ICGC-LIRP-JI dataset</th>
</tr>
<tr>
<th valign="top" align="center">High-risk</th>
<th valign="top" align="center">Low-risk</th>
<th valign="top" align="center">P-value</th>
<th valign="top" align="center">High-risk</th>
<th valign="top" align="center">Low-risk</th>
<th valign="top" align="center">P-value</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" rowspan="2" align="left">Age (%)</td>
<td valign="top" align="left">&#x2264;60 year</td>
<td valign="top" align="left">97 (53.3)</td>
<td valign="top" align="left">76 (41.5)</td>
<td valign="top" rowspan="2" align="center">
<bold>0.024</bold>
</td>
<td valign="top" align="left">35 (21.5)</td>
<td valign="top" align="left">15 (21.7)</td>
<td valign="top" rowspan="2" align="center">0.964</td>
</tr>
<tr>
<td valign="top" align="left">&gt;60 year</td>
<td valign="top" align="left">85 (46.7)</td>
<td valign="top" align="left">105 (58.5)</td>
<td valign="top" align="left">128 (78.5)</td>
<td valign="top" align="left">54 (78.3)</td>
</tr>
<tr>
<td valign="top" rowspan="2" align="left">Gender (%)</td>
<td valign="top" align="left">Female</td>
<td valign="top" align="left">66 (36.3)</td>
<td valign="top" align="left">53 (29.0)</td>
<td valign="top" rowspan="2" align="center">0.137</td>
<td valign="top" align="left">44 (27.2)</td>
<td valign="top" align="left">17 (24.6)</td>
<td valign="top" rowspan="2" align="center">0.691</td>
</tr>
<tr>
<td valign="top" align="left">Male</td>
<td valign="top" align="left">116 (63.7)</td>
<td valign="top" align="left">130 (71.0)</td>
<td valign="top" align="left">118 (72.8)</td>
<td valign="top" align="left">52 (75.4)</td>
</tr>
<tr>
<td valign="top" rowspan="3" align="left">Tumor grade (%)</td>
<td valign="top" align="left">G1 + G2</td>
<td valign="top" align="left">94 (51.6)</td>
<td valign="top" align="left">136 (74.3)</td>
<td valign="top" rowspan="3" align="center">
<bold>&lt;0.001</bold>
</td>
<td valign="top" align="left">&#x2013;</td>
<td valign="top" align="left">&#x2013;</td>
<td valign="top" rowspan="3" align="left">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">G3 + G4</td>
<td valign="top" align="left">85 (46.7)</td>
<td valign="top" align="left">45 (24.6)</td>
<td valign="top" align="left">&#x2013;</td>
<td valign="top" align="left">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">unknown</td>
<td valign="top" align="left">3 (1.6)</td>
<td valign="top" align="left">2 (1.1)</td>
<td valign="top" align="left">&#x2013;</td>
<td valign="top" align="left">&#x2013;</td>
</tr>
<tr>
<td valign="top" rowspan="3" align="left">Clinical stage (%)</td>
<td valign="top" align="left">I + II</td>
<td valign="top" align="left">116 (63.7)</td>
<td valign="top" align="left">138 (75.4)</td>
<td valign="top" rowspan="3" align="center">
<bold>0.008</bold>
</td>
<td valign="top" align="left">&#x2013;</td>
<td valign="top" align="left">&#x2013;</td>
<td valign="top" rowspan="3" align="left">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">III + IV</td>
<td valign="top" align="left">54 (29.7)</td>
<td valign="top" align="left">33 (18.0)</td>
<td valign="top" align="left">&#x2013;</td>
<td valign="top" align="left">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">unknown</td>
<td valign="top" align="left">12 (6.6)</td>
<td valign="top" align="left">12 (6.6)</td>
<td valign="top" align="left">&#x2013;</td>
<td valign="top" align="left">&#x2013;</td>
</tr>
<tr>
<td valign="top" rowspan="3" align="left">T stage (%)</td>
<td valign="top" align="left">T1 + T2</td>
<td valign="top" align="left">124 (68.1)</td>
<td valign="top" align="left">147 (80.3)</td>
<td valign="top" rowspan="3" align="center">
<bold>0.003</bold>
</td>
<td valign="top" align="left">96 (59.3)</td>
<td valign="top" align="left">45 (65.2)</td>
<td valign="top" rowspan="2" align="center">0.395</td>
</tr>
<tr>
<td valign="top" align="left">T3 + T4</td>
<td valign="top" align="left">58 (31.9)</td>
<td valign="top" align="left">33 (18.0)</td>
<td valign="top" align="left">66 (40.7)</td>
<td valign="top" align="left">24 (34.8)</td>
</tr>
<tr>
<td valign="top" align="left">unknown</td>
<td valign="top" align="left">0 (0.0)</td>
<td valign="top" align="left">3 (1.7)</td>
<td valign="top" align="left">0 (0.0)</td>
<td valign="top" align="left">0 (0.0)</td>
<td valign="top" align="left"/>
</tr>
<tr>
<td valign="top" rowspan="3" align="left">N stage (%)</td>
<td valign="top" align="left">N0</td>
<td valign="top" align="left">127 (69.8)</td>
<td valign="top" align="left">121 (66.1)</td>
<td valign="top" rowspan="3" align="center">0.962</td>
<td valign="top" align="left">&#x2013;</td>
<td valign="top" align="left">&#x2013;</td>
<td valign="top" rowspan="3" align="left">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">N1 + N2 + N3</td>
<td valign="top" align="left">2 (1.1)</td>
<td valign="top" align="left">3 (1.6)</td>
<td valign="top" align="left">&#x2013;</td>
<td valign="top" align="left">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">unknown</td>
<td valign="top" align="left">53 (29.1)</td>
<td valign="top" align="left">53 (29.0)</td>
<td valign="top" align="left">&#x2013;</td>
<td valign="top" align="left">&#x2013;</td>
</tr>
<tr>
<td valign="top" rowspan="3" align="left">M stage (%)</td>
<td valign="top" align="left">M0</td>
<td valign="top" align="left">136 (74.7)</td>
<td valign="top" align="left">127 (69.4)</td>
<td valign="top" rowspan="3" align="center">0.230</td>
<td valign="top" align="left">&#x2013;</td>
<td valign="top" align="left">&#x2013;</td>
<td valign="top" rowspan="3" align="left">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">M1</td>
<td valign="top" align="left">0 (0.0)</td>
<td valign="top" align="left">3 (1.6)</td>
<td valign="top" align="left">&#x2013;</td>
<td valign="top" align="left">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">unknown</td>
<td valign="top" align="left">46 (25.3)</td>
<td valign="top" align="left">53 (29.0)</td>
<td valign="top" align="left">&#x2013;</td>
<td valign="top" align="left">&#x2013;</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>LIHC, liver hepatocellular carcinoma; TCGA, The Cancer Genome Atlas; ICGC, International Cancer Genome Consortium.</p>
<p>The bold P values means P &lt; 0.05.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="s3_6">
<title>Verification of the Four-Gene Signature Module in the ICGC Database</title>
<p>To validate the robustness of the four-gene signature module from the TCGA-LIHC dataset, we chose the ICGC-LIRI-JP dataset for further verification. First, we stratified LIHC patients from the ICGC-LIRI-JP dataset into high-risk and low-risk cohorts according to the median value of the risk score, which was calculated using the formula mentioned above. Consistent with the outcomes from the TCGA-LIHC dataset, the four-gene signature had an excellent AUC (<xref ref-type="fig" rid="f8">
<bold>Figure 8A</bold>
</xref>, 1-year survival: 0.752; 2-year survival: 0.751; and 3-year survival: 0.782). Moreover, the high-risk group correlated with a higher rate of mortality (<xref ref-type="fig" rid="f8">
<bold>Figures 8B, C</bold>
</xref>). Additionally, patients from the high-risk cohort experienced significantly shorter survival than those in the low-risk cohort (<xref ref-type="fig" rid="f8">
<bold>Figure 8D</bold>
</xref>, P=1.24e-3). In the PCA of the ICGC-LIRI-JP dataset, the PC1 could explain 79% of total variance, and the PC1 scores were positively correlated with the risk scores of patients (<xref ref-type="fig" rid="f8">
<bold>Figure 8E</bold>
</xref>), whereas the PC2 could explain 13% total variance (<xref ref-type="supplementary-material" rid="SF3">
<bold>Figure S3B</bold>
</xref>). In addition, t-SNE analysis validated that most patients in the high- and low-risk cohorts were distributed in two different directions (<xref ref-type="fig" rid="f8">
<bold>Figure 8F</bold>
</xref>). In general, these outcomes in the ICGC-LIRI-JP dataset were similar to those in the TCGA-LIHC dataset.</p>
<fig id="f8" position="float">
<label>Figure 8</label>
<caption>
<p>Validation of the 10-gene signature in ICGC-LIRI-JP dataset. <bold>(A)</bold> AUC of time-dependent ROC curves verifies the prognostic performance of the risk score in ICGC-LIRI-JP dataset. <bold>(B)</bold> The distribution and the median value of the risk scores in ICGC-LIRI-JP dataset. <bold>(C)</bold> The distributions of OS status, OS and risk scores in ICGC-LIRI-JP dataset. <bold>(D)</bold> Kaplan-Meier curves for the OS of patients in the high-risk group and low-risk group in ICGC-LIRI-JP dataset. <bold>(E)</bold> PCA plot of ICGC-LIRI-JP dataset. <bold>(F)</bold> t-SNE analysis of ICGC-LIRI-JP dataset.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fonc-11-626654-g008.tif"/>
</fig>
</sec>
<sec id="s3_7">
<title>Independent Prognostic Role of the Four-Gene Signature</title>
<p>To determine whether the risk score plays an independent prognostic role, we performed univariate and multivariate Cox regression analyses of the survival of LIHC patients. The univariate Cox regression analysis indicated that a higher risk score was closely correlated with worse survival in LIHC patients using the TCGA-LIHC (<xref ref-type="fig" rid="f9">
<bold>Figure 9A</bold>
</xref>, hazard ratio [HR]=3.324, 95% confidence interval [CI]: 2.181&#x2013;5.066, P&lt;0.001) and ICGC-LIRI-JP (<xref ref-type="fig" rid="f9">
<bold>Figure 9B</bold>
</xref>, HR=1.413, 95% CI: 1.243&#x2013;1.607, P&lt;0.001) datasets. Similar to the results of the univariate Cox regression analysis, the multivariate Cox regression analysis still suggested the risk score as an independent indicator for the survival of LIHC patients using the TCGA-LIHC (<xref ref-type="fig" rid="f9">
<bold>Figure 9C</bold>
</xref>, HR=3.041, 95% CI: 1.930&#x2013;4.790, P&lt;0.001) and ICGC-LIRI-JP (<xref ref-type="fig" rid="f9">
<bold>Figure 9D</bold>
</xref>, HR=1.378, 95% CI: 1.210&#x2013;1.569, P&lt;0.001) datasets.</p>
<fig id="f9" position="float">
<label>Figure 9</label>
<caption>
<p>Independent prognostic role of the four-gene signature. <bold>(A)</bold>The univariate Cox regression analysis in TCGA-LIHC dataset. <bold>(B)</bold>The univariate Cox regression analysis in ICGC-RI-JP dataset. <bold>(C)</bold> The multivariate Cox regression analysis in TCGA-LIHC dataset. <bold>(D)</bold> The multivariate Cox regression analysis in ICGC-LIRI-JP dataset.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fonc-11-626654-g009.tif"/>
</fig>
</sec>
<sec id="s3_8">
<title>Differential Gene Expression Analysis and Functional Enrichment Analysis</title>
<p>Differential gene expression analyses were conducted in the TCGA-LIHC and ICGC-LIRI-JP datasets, and 499 and 185 DEGs between the high- and low-risk groups were obtained (<xref ref-type="supplementary-material" rid="ST4">
<bold>Tables S4</bold>
</xref>, <xref ref-type="supplementary-material" rid="ST5">
<bold>S5</bold>
</xref>), respectively. To explore the biological functions of the DEGs in the high- and low-risk groups, we again performed GO enrichment and KEGG pathway analyses. In the TCGA-LIHC dataset, GO enrichment analysis indicated significant enrichment in the organelle fission, nuclear division, chromosomal region and ATPase activity terms (<xref ref-type="fig" rid="f10">
<bold>Figure 10A</bold>
</xref>). GO enrichment analysis of the ICGC-LIRI-JP dataset showed similar outcomes to the TCGA-LIHC dataset (<xref ref-type="fig" rid="f10">
<bold>Figure 10B</bold>
</xref>). Additionally, KEGG pathway analysis of the TCGA-LIHC dataset showed significant enrichment in the cell cycle, oocyte meiosis and progesterone-medicated oocyte maturation pathways (<xref ref-type="fig" rid="f10">
<bold>Figure 10C</bold>
</xref>). In the ICGC-LIRI-JP dataset, KEGG pathway analysis also demonstrated the analogical outcomes of the TCGA-LIHC dataset (<xref ref-type="fig" rid="f10">
<bold>Figure 10D</bold>
</xref>).</p>
<fig id="f10" position="float">
<label>Figure 10</label>
<caption>
<p>Functional enrichment analysis of differentially expressed genes (DEGs) between high-risk and low-risk groups. <bold>(A)</bold> Gene ontology (GO) enrichment analysis of DEGs of TCGA-LIHC dataset. <bold>(B)</bold> Gene ontology (GO) enrichment analysis of DEGs of ICGC-RI-JP dataset. <bold>(C)</bold> Kyoto encyclopedia of genes and genomes pathway analysis of DEGs of TCGA-LIHC dataset. <bold>(D)</bold> KEGG pathway analysis of DEGs of ICGC-RI-JP dataset.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fonc-11-626654-g010.tif"/>
</fig>
<p>To explore the correlation between the risk score and immune status, we calculated the infiltrating scores of 16 immune cells and 13 immune-related functions or pathways using ssGSEA. The scores of activated dendritic cells (aDCs), mast cells and follicular helper cells (Tfhs) were notably different between the high- and low-risk groups in the TCGA-LIHC dataset (all adj. P&lt;0.001, <xref ref-type="fig" rid="f11">
<bold>Figure 11A</bold>
</xref>). In the TCGA-LIHC dataset, the scores of cytolytic activity, type I interferon (IFN) response and type II IFN response were obviously higher in the low-risk group, while the score of MHC class I was lower in the low-risk group (all adj. P&lt;0.01, <xref ref-type="fig" rid="f11">
<bold>Figure 11B</bold>
</xref>). Moreover, the ICGC-LIRI-JP dataset showed that aDCs, mast cells, MHC class I and type II IFN responses were significantly different between the two risk cohorts (<xref ref-type="fig" rid="f11">
<bold>Figures 11C, D</bold>
</xref>), which is consistent with the results of the TCGA-LIHC dataset.</p>
<fig id="f11" position="float">
<label>Figure 11</label>
<caption>
<p>Comparison of single-sample gene set enrichment (ssGSEA) scores between high-risk and low-risk groups in TCGA-LIHC and ICGC-LIRI-JP datasets. <bold>(A, B)</bold> The scores of 16 immune cells and 13 immune-related functions are displayed in boxplots in TCGA-LIHC dataset. <bold>(C, D)</bold> The scores of 16 immune cells and 13 immune-related functions are displayed in boxplots in ICGC-RI-JP dataset. Adjusted P values are showed as: ns, not significant; *P &lt; 0.05; **P &lt; 0.01; ***P&#xa0;&lt;&#xa0;0.001.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fonc-11-626654-g011.tif"/>
</fig>
</sec>
</sec>
<sec id="s4" sec-type="discussion">
<title>Discussion</title>
<p>As a common solid-tumor malignancy with high mortality, HCC has brought great socioeconomic pressure to HCC patients and their families. Owing to the complex etiological factors and high heterogeneity of HCC, it remains difficult to accurately predict the survival of HCC patients. Thus, it is urgent to detect effective prognostic biomarkers to monitor the progression and predict the prognosis of HCC patients. In this study, 393 differentially coexpressed genes were obtained through WGCNA and differential gene expression analysis. Then, these genes were used to construct a PPI network, and 38 hub genes were observed to be closely correlated with OS. Subsequently, we established a novel four-gene prognostic signature in the TCGA-LIHC dataset and built a nomogram based on this novel module, which showed acceptable accuracy and calibration. Afterward, the four-gene signature module was verified in the TCGA-LIHC dataset using the LASSO algorithm. To improve the robustness of the signature, we used the ICGC-LIRI-JP dataset for further validation. The four-gene signature was still found to have independent prognostic value. Finally, ssGSEA revealed significant differences in aDCs, mast cells, MHC class I and type II IFN responses between the two risk cohorts.</p>
<p>Several prior analyses have also shown that certain gene signatures may predict patient survival (<xref ref-type="bibr" rid="B20">20</xref>&#x2013;<xref ref-type="bibr" rid="B26">26</xref>); however, our study has some differences and/or advantages compared with similar analyses. First, the gene signatures built in previous studies require many genes (<xref ref-type="bibr" rid="B20">20</xref>&#x2013;<xref ref-type="bibr" rid="B23">23</xref>), which possibly leads to some difficulties in real-world practice. Our novel signature requires only 4 genes, and the predictive ability of our signature is acceptable, which increases the feasibility of the use of our signature in real-world practice. Second, in our study, we simultaneously used WGCNA, differential gene expression analysis, PPI network construction, univariate Cox regression analysis and LASSO Cox regression analysis, and these methods were rarely used together in one study for the construction of a prognostic module of HCC, which is a novel point of our study. Third, some previous studies did not verify their gene signature (<xref ref-type="bibr" rid="B24">24</xref>&#x2013;<xref ref-type="bibr" rid="B26">26</xref>) using other datasets; however, we used two datasets (the ICGC-LIRI-JP dataset and GSE112790) for external validation, which is helpful to enhance the reliability of our findings. Interestingly, we observed that most differentially coexpressed hub genes (38/40) were significantly associated with survival time according to the results of the univariate Cox regression analysis. This finding suggests the possibility of establishing a prognostic signature using these differentially coexpressed hub genes.</p>
<p>The prognostic module proposed in our analysis was composed of CDCA8, KIF20A, KIF2C and CEP55, all of which are often reported as being dysregulated in HCC tissues (<xref ref-type="bibr" rid="B27">27</xref>&#x2013;<xref ref-type="bibr" rid="B30">30</xref>). First, cell division cycle associated 8 (CDCA8) is regarded as a significant oncogene that is involved in the pathological development of various cancers, including HCC (<xref ref-type="bibr" rid="B27">27</xref>) and pancreatic ductal adenocarcinoma (<xref ref-type="bibr" rid="B31">31</xref>). Wu et&#xa0;al. reported that CDCA8 is obviously overexpressed at the mRNA and protein levels in HCC tissues, and the authors validated this finding at the mRNA level using real-time&#xa0;quantitative PCR (RT-qPCR) (<xref ref-type="bibr" rid="B32">32</xref>).Similarly, CDCA8 is closely correlated with cell division and growth in HCC, and CDCA8 is strongly associated with the pathological grades and T stages of HCC (<xref ref-type="bibr" rid="B33">33</xref>). Second, kinesin family member 20A (KIF20A) and KIF2C are the members of the kinesin superfamily proteins, both of which are closely regulated by E2F1. The depletion of KIF20A or KIF2C results in deforming microtubule structures, influencing cell motility and inhibiting cancer metastasis (<xref ref-type="bibr" rid="B34">34</xref>). A recent study suggested that KIF20A and KIF2C are obviously upregulated in HCC tissues, and higher expression of KIF20A and KIF2C correlates with worse survival (including OS and disease-free survival [DFS]), higher tumor stages and poorer pathological grades (<xref ref-type="bibr" rid="B35">35</xref>). Moreover, by conducting basic experiments, this study also showed that the downregulation of KIF20A and KIF2C can effectively inhibit the proliferation of HCC cells and increase G1 arrest in HCC cells (<xref ref-type="bibr" rid="B35">35</xref>). In addition, Lu et&#xa0;al. observed that high KIF20A expression was associated with more high-grade HCC (52.3% vs. 32.5%, P=0.003), more advanced HCC (45.9% vs. 21.1%, P&lt;0.0001), and more deaths (65.7% vs. 28.9%, P&lt;0.0001) than low KIF20A expression, and the authors also reported that KIF20A could act as an independent prognostic indicator for poor OS (HR=1.30, 95% CI: 1.16&#x2013;1.47, P&lt;0.001) and recurrence-free survival (RFS) (HR=1.14, 95% CI: 1.03&#x2013;1.27, P &lt; 0.001) (<xref ref-type="bibr" rid="B36">36</xref>). KIF2C contributed to cell proliferation, adverse invasion, and metastasis <italic>in vitro</italic> and <italic>in vivo</italic> by performing both gain- and loss-of-function assays, and the authors further suggested that KIF2C plays an important role in mediating the crosstalk between Wnt/&#x3b2;-catenin and mammalian target of rapamycin complex 1 (mTORC1) signaling in the pathogenesis of HCC (<xref ref-type="bibr" rid="B37">37</xref>). Third, centrosomal protein 55 (CEP55) contributes to the carcinogenesis of many cancers and regulates PI3K/AKT signaling (<xref ref-type="bibr" rid="B38">38</xref>). Yang et&#xa0;al. showed that CEP55 is upregulated in HCC tissues, and CEP55 overexpression correlates with poor tumor grades and high T stages; the authors also showed that CEP55 acts as an independent predictor of the OS of HCC patients using multivariate analysis (<xref ref-type="bibr" rid="B39">39</xref>). In addition, CEP55 was found to promote cell migration and adverse invasion <italic>via</italic> the regulation of the JAK2-STAT3-MMP signaling pathway in HCC, and the knockdown of CEP55 strongly suppressed HCC cell migration and invasion (<xref ref-type="bibr" rid="B40">40</xref>).</p>
<p>Several limitations to our analysis exist. 1) The TCGA-LIHC dataset provides multiple HCC tissue samples, and the ICGC-LIRI-JP dataset and GSE112790 were applied for external validation. However, these datasets were obtained from public databases, and additional real-world datasets are required to validate the clinical utility of the four-gene prognostic signature. 2) Although we utilized comprehensive bioinformatics approaches to construct and validate this prognostic signature in HCC, it may not be very accurate for HCC patients with different grades and stages. 3) We did not verify the correlation between the risk score and immune status by conducting basic experiments, which is a significant issue that deserves further investigation in the future.</p>
</sec>
<sec id="s5">
<title>Conclusion</title>
<p>This comprehensive analysis proposes a novel prognostic signature of four differentially coexpressed hub genes that has satisfactory prognostic value. This model was an independent predictor of OS in the TCGA-LIHC and ICGC-LIRI-JP datasets, providing insight into the prediction of HCC prognosis. Nevertheless, additional studies are required to further explore the underlying mechanisms of these differentially coexpressed hub genes and tumor immunity.</p>
</sec>
<sec id="s6">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/<xref ref-type="supplementary-material" rid="ST1">
<bold>Supplementary Material</bold>
</xref>. Further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="s7">
<title>Author Contributions</title>
<p>ZM had full access to all of the data in the manuscript and takes responsibility for the integrity of the data and the accuracy of the data analysis. Concept and design: All authors. Acquisition, analysis, and interpretation of data: ZM. Drafting of the manuscript: ZM. Critical revision of the manuscript for important intellectual content: All authors. Statistical analysis: All authors. Supervision: All authors. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec id="s8" sec-type="funding-information">
<title>Funding</title>
<p>This study is supported by the Public Welfare Technology Application Research Program of Huzhou (No. 2019GY35, 2019GY01) and Young Talents Project of Huzhou Central Hospital (NO. 2020YC09), without the involvement of commercial entities. The funder had no role in the design or performance of the study; the collection, management, analysis, and interpretation of the data; the preparation, review, and approval of the manuscript; or the decision to submit the manuscript for publication.</p>
</sec>
<sec id="s9" sec-type="COI-statement">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<ack>
<title>Acknowledgments</title>
<p>The authors gratefully acknowledge contributions from TCGA, ICGC, and GEO databases.</p>
</ack>
<sec id="s10" sec-type="supplementary-material">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fonc.2021.626654/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fonc.2021.626654/full#supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="Image_1.tif" id="SF1" mimetype="image/tiff">
<label>Supplementary Figure 1</label>
<caption>
<p>Functional enrichment analysis of 40 differentially co-expressed genes. <bold>(A)</bold> Gene ontology (GO) enrichment analysis of 40 differentially co-expressed genes. <bold>(B)</bold> Kyoto encyclopedia of genes and genomes (KEGG) pathway analysis of 40 differentially co-expressed genes.</p>
</caption>
</supplementary-material>
<supplementary-material xlink:href="Image_2.tif" id="SF2" mimetype="image/tiff">
<label>Supplementary Figure 2</label>
<caption>
<p>Validation of the expression patterns of the four genes in the prognostic signature between LIHC and non-tumorous tissues based on GSE112790. The expression pattern of <bold>(A)</bold> CDCA8, <bold>(B)</bold> KIF20A, <bold>(C)</bold> KIF2C, and <bold>(D)</bold> CEP55 between LIHC and non-tumorous tissues.</p>
</caption>
</supplementary-material>
<supplementary-material xlink:href="Image_3.tif" id="SF3" mimetype="image/tiff">
<label>Supplementary Figure 3</label>
<caption>
<p>Scree plot of the principal component analysis (PCA) in TCGA-LIHC and ICGC-LIRI-JP datasets. <bold>(A)</bold> In the PCA of the TCGA-LIHC dataset, the first principal component (PC1) could explain 88.6% of total variance, while the second principal component (PC2) could explain 5.4% total variance. <bold>(B)</bold> In the PCA of the ICGC-LIRI-JP dataset, the PC1 could explain 79% of total variance, whereas the PC2 could explain 13% total variance.</p>
</caption>
</supplementary-material>
<supplementary-material xlink:href="Table_1.xlsx" id="ST1" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet">
<label>Supplementary Table 1</label>
<caption>
<p>The co-expression genes in the yellow module of TCGA-LIHC dataset.</p>
</caption>
</supplementary-material>
<supplementary-material xlink:href="Table_2.xlsx" id="ST2" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet">
<label>Supplementary Table 2</label>
<caption>
<p>The differentially expressed genes (DEGs) in TCGA-LIHC dataset.</p>
</caption>
</supplementary-material>
<supplementary-material xlink:href="Table_3.xlsx" id="ST3" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet">
<label>Supplementary Table 3</label>
<caption>
<p>Degree scores of all differential co-expressed genes.</p>
</caption>
</supplementary-material>
<supplementary-material xlink:href="Table_4.xlsx" id="ST4" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet">
<label>Supplementary Table 4</label>
<caption>
<p>DEGs between high-risk and low-risk cohorts in TCGA-LIHC dataset.</p>
</caption>
</supplementary-material>
<supplementary-material xlink:href="Table_5.xlsx" id="ST5" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet">
<label>Supplementary Table 5</label>
<caption>
<p>DEGs between high-risk and low-risk cohorts in ICGC-LIRI-JP dataset.</p>
</caption>
</supplementary-material>
</sec>
<sec id="s11">
<title>Abbreviations</title>
<p>HCC, hepatocellular carcinoma; HBV, hepatitis B virus; HCV, hepatitis C virus; WGCNA, Weighed Gene Co-expression Network Analysis; DEGs, differentially expressed genes; TCGA, The Cancer Genome Atlas; ICGC, International Cancer Genome Consortium; ssGSEA, single-sample Gene Set Enrichment Analysis; LIHC, liver hepatocellular carcinoma; PPI, protein-protein interaction network; FDR, false discovery Rate; BH, Benjamini&#x2013;Hochberg; TOM, topological overlap matrix; CPM, count per million; GO, gene ontology; KEGG, kyoto encyclopedia of genes and genomes pathway; STRING, Search Tool for the Retrieval of Interacting Genes; OS, overall survival; HR, hazard ratio; CI, confidence interval; LASSO, the least absolute shrinkage and selection operator analysis; ROC, receiver operating characteristic curves; PCA, principal component analysis; PC1, the first principal component; PC2, the second principal component; t-SNE, t-distributed stochastic neighbor embedding; tSNE1, the first component of t-SNE; tSNE2, the second component of t-SNE; AUC, Area Under Curve; aDCs, activated dendritic cells; IFN, interferon; DCs, dendritic cells; iDCs, immature Dendritic Cells; pDCs,&#xa0;plasmacytoid dendritic cells; NK cells, natural killer cells; Tfhs, follicular helper cells; Th1 cells, helper one T-cells; Th2 cells, helper two T-cells; TIL, tumor infiltrating lymphocytes; Treg, regulatory T cells; APC, antigen presenting cell; CCR, chemokine receptor type; HLA, human leukocyte antigen; MHC, major histocompatibility complex; RT-qPCR, real-time&#xa0;quantitative PCR; DFS, disease-free survival; RFS, recurrence-free survival; mTORC1, mammalian target of rapamycin complex1 signaling.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<label>1</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Siegel</surname> <given-names>RL</given-names>
</name>
<name>
<surname>Miller</surname> <given-names>KD</given-names>
</name>
<name>
<surname>Jemal</surname> <given-names>A</given-names>
</name>
</person-group>. <article-title>Cancer statistics, 2020</article-title>. <source>CA Cancer J Clin</source> (<year>2020</year>) <volume>70</volume>(<issue>1</issue>):<fpage>7</fpage>&#x2013;<lpage>30</lpage>. doi: <pub-id pub-id-type="doi">10.3322/caac.21590</pub-id>
</citation>
</ref>
<ref id="B2">
<label>2</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vogel</surname> <given-names>A</given-names>
</name>
<name>
<surname>Saborowski</surname> <given-names>A</given-names>
</name>
</person-group>. <article-title>Current strategies for the treatment of intermediate and advanced hepatocellular carcinoma</article-title>. <source>Cancer Treat</source> (<year>2020</year>) <volume>82</volume>:<fpage>101946</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.ctrv.2019.101946</pub-id>
</citation>
</ref>
<ref id="B3">
<label>3</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kole</surname> <given-names>C</given-names>
</name>
<name>
<surname>Charalampakis</surname> <given-names>N</given-names>
</name>
<name>
<surname>Tsakatikas</surname> <given-names>S</given-names>
</name>
<name>
<surname>Vailas</surname> <given-names>M</given-names>
</name>
<name>
<surname>Moris</surname> <given-names>D</given-names>
</name>
<name>
<surname>Gkotsis</surname> <given-names>E</given-names>
</name>
<etal/>
</person-group>. <article-title>Immunotherapy for Hepatocellular Carcinoma: A 2021 Update</article-title>. <source>Cancers (Basel)</source> (<year>2020</year>) <volume>12</volume>(<issue>10</issue>):<fpage>E2859</fpage>. doi: <pub-id pub-id-type="doi">10.3390/cancers12102859</pub-id>
</citation>
</ref>
<ref id="B4">
<label>4</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Caines</surname> <given-names>A</given-names>
</name>
<name>
<surname>Selim</surname> <given-names>R</given-names>
</name>
<name>
<surname>Salgia</surname> <given-names>R</given-names>
</name>
</person-group>. <article-title>The Changing Global Epidemiology of Hepatocellular Carcinoma</article-title>. <source>Clin Liver Dis</source> (<year>2020</year>) <volume>24</volume>(<issue>4</issue>):<page-range>535&#x2013;47</page-range>. doi: <pub-id pub-id-type="doi">10.1016/j.cld.2020.06.001</pub-id>
</citation>
</ref>
<ref id="B5">
<label>5</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Huo</surname> <given-names>J</given-names>
</name>
<name>
<surname>Wu</surname> <given-names>L</given-names>
</name>
<name>
<surname>Zang</surname> <given-names>Y</given-names>
</name>
</person-group>. <article-title>A robust nine-gene prognostic signature associated with tumour doubling time for hepatocellular carcinoma</article-title>. <source>Life Sci</source> (<year>2020</year>) <volume>260</volume>:<fpage>118396</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.lfs.2020.118396</pub-id>
</citation>
</ref>
<ref id="B6">
<label>6</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname> <given-names>Z</given-names>
</name>
<name>
<surname>Zhu</surname> <given-names>J</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>C</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>W</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>F</given-names>
</name>
<etal/>
</person-group>. <article-title>Development and validation of a novel immune-related prognostic model in hepatocellular carcinoma</article-title>. <source>J Transl Med</source> (<year>2020</year>) <volume>18</volume>(<issue>1</issue>):<fpage>67</fpage>. doi: <pub-id pub-id-type="doi">10.1186/s12967-020-02255-6</pub-id>
</citation>
</ref>
<ref id="B7">
<label>7</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Morganti</surname> <given-names>S</given-names>
</name>
<name>
<surname>Tarantino</surname> <given-names>P</given-names>
</name>
<name>
<surname>Ferraro</surname> <given-names>E</given-names>
</name>
<name>
<surname>D'Amico</surname> <given-names>P</given-names>
</name>
<name>
<surname>Duso</surname> <given-names>BA</given-names>
</name>
<name>
<surname>Curigliano</surname> <given-names>G</given-names>
</name>
</person-group>. <article-title>Next Generation Sequencing (NGS): A Revolutionary Technology in Pharmacogenomics and Personalized Medicine in Cancer</article-title>. <source>Adv Exp Med Biol</source> (<year>2019</year>) <volume>1168</volume>:<fpage>9</fpage>&#x2013;<lpage>30</lpage> . doi: <pub-id pub-id-type="doi">10.1007/978-3-030-24100-1_2</pub-id>
</citation>
</ref>
<ref id="B8">
<label>8</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nangraj</surname> <given-names>AS</given-names>
</name>
<name>
<surname>Selvaraj</surname> <given-names>G</given-names>
</name>
<name>
<surname>Kaliamurthi</surname> <given-names>S</given-names>
</name>
<name>
<surname>Kaushik</surname> <given-names>AC</given-names>
</name>
<name>
<surname>Cho</surname> <given-names>WC</given-names>
</name>
<name>
<surname>Wei</surname> <given-names>DQ</given-names>
</name>
<etal/>
</person-group>. <article-title>Integrated PPI- and WGCNA-Retrieval of Hub Gene Signatures Shared Between Barrett&#x2019;s Esophagus and Esophageal Adenocarcinoma</article-title>. <source>Front Pharmacol</source> (<year>2020</year>) <volume>11</volume>:<elocation-id>881</elocation-id>. doi: <pub-id pub-id-type="doi">10.3389/fphar.2020.00881</pub-id>
</citation>
</ref>
<ref id="B9">
<label>9</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhou</surname> <given-names>Q</given-names>
</name>
<name>
<surname>Zhou</surname> <given-names>LQ</given-names>
</name>
<name>
<surname>Li</surname> <given-names>SH</given-names>
</name>
<name>
<surname>Yuan</surname> <given-names>YW</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>L</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>JL</given-names>
</name>
<etal/>
</person-group>. <article-title>Identification of subtype-specific genes signature by WGCNA for prognostic prediction in diffuse type gastric cancer</article-title>. <source>Aging (Albany NY)</source> (<year>2020</year>) <volume>12</volume>(<issue>17</issue>):<page-range>17418&#x2013;35</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.18632/aging.103743</pub-id>
</citation>
</ref>
<ref id="B10">
<label>10</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Reddy</surname> <given-names>RRS</given-names>
</name>
<name>
<surname>Ramanujam</surname> <given-names>MV</given-names>
</name>
</person-group>. <article-title>High Throughput Sequencing-Based Approaches for Gene Expression Analysis</article-title>. <source>Methods Mol Biol</source> (<year>2018</year>) <volume>1783</volume>:<fpage>299</fpage>&#x2013;<lpage>323</lpage>. doi: <pub-id pub-id-type="doi">10.1007/978-1-4939-7834-2_15</pub-id>
</citation>
</ref>
<ref id="B11">
<label>11</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Robinson</surname> <given-names>MD</given-names>
</name>
<name>
<surname>McCarthy</surname> <given-names>DJ</given-names>
</name>
<name>
<surname>Smyth</surname> <given-names>GK</given-names>
</name>
</person-group>. <article-title>edgeR: a Bioconductor package for differential expression analysis of digital gene expression data</article-title>. <source>Bioinformatics</source> (<year>2010</year>) <volume>26</volume>(<issue>1</issue>):<page-range>139&#x2013;40</page-range>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btp616</pub-id>
</citation>
</ref>
<ref id="B12">
<label>12</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fujimoto</surname> <given-names>A</given-names>
</name>
<name>
<surname>Furuta</surname> <given-names>M</given-names>
</name>
<name>
<surname>Totoki</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Tsunoda</surname> <given-names>T</given-names>
</name>
<name>
<surname>Kato</surname> <given-names>M</given-names>
</name>
<name>
<surname>Shiraishi</surname> <given-names>Y</given-names>
</name>
<etal/>
</person-group>. <article-title>Whole-genome mutational landscape and characterization of noncoding and structural mutations in liver cancer</article-title>. <source>Nat Genet</source> (<year>2016</year>) <volume>48</volume>(<issue>5</issue>):<page-range>500&#x2013;9</page-range>. doi: <pub-id pub-id-type="doi">10.1038/ng.3547</pub-id>
</citation>
</ref>
<ref id="B13">
<label>13</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ritchie</surname> <given-names>ME</given-names>
</name>
<name>
<surname>Phipson</surname> <given-names>B</given-names>
</name>
<name>
<surname>Wu</surname> <given-names>D</given-names>
</name>
<name>
<surname>Hu</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Law</surname> <given-names>CW</given-names>
</name>
<name>
<surname>Shi</surname> <given-names>W</given-names>
</name>
<etal/>
</person-group>. <article-title>limma powers differential expression analyses for RNA-sequencing and microarray studies</article-title>. <source>Nucleic Acids Res</source> (<year>2015</year>) <volume>43</volume>(<issue>7</issue>):<fpage>e47</fpage>. doi: <pub-id pub-id-type="doi">10.1093/nar/gkv007</pub-id>
</citation>
</ref>
<ref id="B14">
<label>14</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Franceschini</surname> <given-names>A</given-names>
</name>
<name>
<surname>Szklarczyk</surname> <given-names>D</given-names>
</name>
<name>
<surname>Frankild</surname> <given-names>S</given-names>
</name>
<name>
<surname>Kuhn</surname> <given-names>M</given-names>
</name>
<name>
<surname>Simonovic</surname> <given-names>M</given-names>
</name>
<name>
<surname>Roth</surname> <given-names>A</given-names>
</name>
<etal/>
</person-group>. <article-title>STRING v9.1: Protein-protein interaction networks, with increased coverage and integration</article-title>. <source>Nucleic Acids Res</source> (<year>2013</year>) <volume>41</volume>(<issue>Database issue</issue>):<page-range>D808&#x2013;15</page-range>. doi: <pub-id pub-id-type="doi">10.1093/nar/gks1094</pub-id>
</citation>
</ref>
<ref id="B15">
<label>15</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Smoot</surname> <given-names>ME</given-names>
</name>
<name>
<surname>Ono</surname> <given-names>K</given-names>
</name>
<name>
<surname>Ruscheinski</surname> <given-names>J</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>PL</given-names>
</name>
<name>
<surname>Ideker</surname> <given-names>T</given-names>
</name>
</person-group>. <article-title>Cytoscape 2.8: New features for data integration and network visualization</article-title>. <source>Bioinformatics</source> (<year>2011</year>) <volume>27</volume>(<issue>3</issue>):<page-range>431&#x2013;2</page-range>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btq675</pub-id>
</citation>
</ref>
<ref id="B16">
<label>16</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chin</surname> <given-names>CH</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>SH</given-names>
</name>
<name>
<surname>Wu</surname> <given-names>HH</given-names>
</name>
<name>
<surname>Ho</surname> <given-names>CW</given-names>
</name>
<name>
<surname>Ko</surname> <given-names>MT</given-names>
</name>
<name>
<surname>Lin</surname> <given-names>CY</given-names>
</name>
<etal/>
</person-group>. <article-title>cytoHubba: identifying hub objects and sub-networks from complex interactome</article-title>. <source>BMC Syst Biol</source> (<year>2014</year>) <volume>8 Suppl 4</volume>(<supplement>Suppl 4</supplement>):<fpage>S11</fpage>. doi: <pub-id pub-id-type="doi">10.1186/1752-0509-8-S4-S11</pub-id>
</citation>
</ref>
<ref id="B17">
<label>17</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Simon</surname> <given-names>N</given-names>
</name>
<name>
<surname>Friedman</surname> <given-names>J</given-names>
</name>
<name>
<surname>Hastie</surname> <given-names>T</given-names>
</name>
<name>
<surname>Tibshirani</surname> <given-names>R</given-names>
</name>
</person-group>. <article-title>Regularization Paths for Cox&#x2019;s Proportional Hazards Model via Coordinate Descent</article-title>. <source>J Stat Software</source> (<year>2011</year>) <volume>39</volume>(<issue>5</issue>):<fpage>1</fpage>&#x2013;<lpage>13</lpage>. doi: <pub-id pub-id-type="doi">10.18637/jss.v039.i05</pub-id>
</citation>
</ref>
<ref id="B18">
<label>18</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Belhechmi</surname> <given-names>S</given-names>
</name>
<name>
<surname>Bin</surname> <given-names>R</given-names>
</name>
<name>
<surname>Rotolo</surname> <given-names>F</given-names>
</name>
<name>
<surname>Michiels</surname> <given-names>S</given-names>
</name>
</person-group>. <article-title>Accounting for grouped predictor variables or pathways in high-dimensional penalized Cox regression models</article-title>. <source>BMC Bioinf</source> (<year>2020</year>) <volume>21</volume>(<issue>1</issue>):<fpage>277</fpage>. doi: <pub-id pub-id-type="doi">10.1186/s12859-020-03618-y</pub-id>
</citation>
</ref>
<ref id="B19">
<label>19</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yi</surname> <given-names>M</given-names>
</name>
<name>
<surname>Nissley</surname> <given-names>DV</given-names>
</name>
<name>
<surname>McCormick</surname> <given-names>F</given-names>
</name>
<name>
<surname>Stephens</surname> <given-names>RM</given-names>
</name>
</person-group>. <article-title>ssGSEA score-based Ras dependency indexes derived from gene expression data reveal potential Ras addiction mechanisms with possible clinical implications</article-title>. <source>Sci Rep</source> (<year>2020</year>) <volume>10</volume>(<issue>1</issue>):<fpage>10258</fpage>. doi: <pub-id pub-id-type="doi">10.1038/s41598-020-66986-8</pub-id>
</citation>
</ref>
<ref id="B20">
<label>20</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Huo</surname> <given-names>J</given-names>
</name>
<name>
<surname>Wu</surname> <given-names>L</given-names>
</name>
<name>
<surname>Zang</surname> <given-names>Y</given-names>
</name>
</person-group>. <article-title>A Prognostic Model of 15 Immune-Related Gene Pairs Associated With Tumor Mutation Burden for Hepatocellular Carcinoma</article-title>. <source>Front Mol Biosci</source> (<year>2020</year>) <volume>7</volume>:<elocation-id>581354</elocation-id>. doi: <pub-id pub-id-type="doi">10.3389/fmolb.2020.581354</pub-id>
</citation>
</ref>
<ref id="B21">
<label>21</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhu</surname> <given-names>Z</given-names>
</name>
<name>
<surname>Li</surname> <given-names>L</given-names>
</name>
<name>
<surname>Xu</surname> <given-names>J</given-names>
</name>
<name>
<surname>Ye</surname> <given-names>W</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>B</given-names>
</name>
<name>
<surname>Zeng</surname> <given-names>J</given-names>
</name>
<etal/>
</person-group>. <article-title>Comprehensive analysis reveals a metabolic ten-gene signature in hepatocellular carcinoma</article-title>. <source> PeerJ</source> (<year>2020</year>) <volume>8</volume>:<fpage>e9201</fpage>. doi: <pub-id pub-id-type="doi">10.7717/peerj.9201</pub-id>
</citation>
</ref>
<ref id="B22">
<label>22</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ouyang</surname> <given-names>G</given-names>
</name>
<name>
<surname>Yi</surname> <given-names>B</given-names>
</name>
<name>
<surname>Pan</surname> <given-names>G</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>X</given-names>
</name>
</person-group>. <article-title>A robust twelve-gene signature for prognosis prediction of hepatocellular carcinoma</article-title>. <source>Cancer Cell Int</source> (<year>2020</year>) <volume>20</volume>:<fpage>207</fpage>. doi: <pub-id pub-id-type="doi">10.1186/s12935-020-01294-9</pub-id>
</citation>
</ref>
<ref id="B23">
<label>23</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xie</surname> <given-names>H</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>S</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>Z</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>P</given-names>
</name>
<name>
<surname>Tao</surname> <given-names>Y</given-names>
</name>
</person-group>. <article-title>A novel seven-gene signature as Prognostic Biomarker in Hepatocellular Carcinoma</article-title>. <source>J Cancer</source> (<year>2020</year>) <volume>11</volume>(<issue>19</issue>):<page-range>5768&#x2013;81</page-range>. doi: <pub-id pub-id-type="doi">10.7150/jca.44573</pub-id>
</citation>
</ref>
<ref id="B24">
<label>24</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dong</surname> <given-names>S</given-names>
</name>
<name>
<surname>Lu</surname> <given-names>LJ</given-names>
</name>
</person-group>. <article-title>An alternative splicing signature model for predicting hepatocellular carcinoma-specific survival</article-title>. <source>J Gastrointest Oncol</source> (<year>2020</year>) <volume>11</volume>(<issue>5</issue>):<page-range>1054&#x2013;64</page-range>. doi: <pub-id pub-id-type="doi">10.21037/jgo-20-377</pub-id>
</citation>
</ref>
<ref id="B25">
<label>25</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname> <given-names>J</given-names>
</name>
<name>
<surname>Miao</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Ran</surname> <given-names>J</given-names>
</name>
<name>
<surname>Yang</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Guan</surname> <given-names>Q</given-names>
</name>
<name>
<surname>Mi</surname> <given-names>D</given-names>
</name>
</person-group>. <article-title>Construction prognosis model based on autophagy-related gene signatures in hepatocellular carcinoma</article-title>. <source>Biomark Med</source> (<year>2020</year>) <volume>14</volume>(<issue>13</issue>):<page-range>1229&#x2013;42</page-range>. doi: <pub-id pub-id-type="doi">10.2217/bmm-2020-0170</pub-id>
</citation>
</ref>
<ref id="B26">
<label>26</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname> <given-names>ZH</given-names>
</name>
<name>
<surname>Yang</surname> <given-names>DL</given-names>
</name>
</person-group>. <article-title>Identification of a protein signature for predicting overall survival of hepatocellular carcinoma: a study based on data mining</article-title>. <source>BMC Cancer</source> (<year>2020</year>) <volume>20</volume>(<issue>1</issue>):<fpage>720</fpage>. doi: <pub-id pub-id-type="doi">10.1186/s12885-020-07229-x</pub-id>
</citation>
</ref>
<ref id="B27">
<label>27</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname> <given-names>B</given-names>
</name>
<name>
<surname>Tang</surname> <given-names>B</given-names>
</name>
<name>
<surname>Gao</surname> <given-names>J</given-names>
</name>
<name>
<surname>Li</surname> <given-names>J</given-names>
</name>
<name>
<surname>Kong</surname> <given-names>L</given-names>
</name>
<name>
<surname>Qin</surname> <given-names>L</given-names>
</name>
</person-group>. <article-title>A hypoxia-related signature for clinically predicting diagnosis, prognosis and immune microenvironment of hepatocellular carcinoma patients</article-title>. <source>J Transl Med</source> (<year>2020</year>) <volume>18</volume>(<issue>1</issue>):<fpage>342</fpage>. doi: <pub-id pub-id-type="doi">10.1186/s12967-020-02492-9</pub-id>
</citation>
</ref>
<ref id="B28">
<label>28</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname> <given-names>J</given-names>
</name>
<name>
<surname>Lu</surname> <given-names>J</given-names>
</name>
<name>
<surname>Ma</surname> <given-names>Z</given-names>
</name>
<name>
<surname>Li</surname> <given-names>W</given-names>
</name>
</person-group>. <article-title>A Nomogram Based on a Three-Gene Signature Derived from AATF Coexpressed Genes Predicts Overall Survival of Hepatocellular Carcinoma Patients</article-title>. <source>BioMed Res Int</source> (<year>2020</year>) <volume>2020</volume>:<fpage>7310768</fpage>. doi: <pub-id pub-id-type="doi">10.1155/2020/7310768</pub-id>
</citation>
</ref>
<ref id="B29">
<label>29</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ji</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Yin</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>W</given-names>
</name>
</person-group>. <article-title>Integrated Bioinformatic Analysis Identifies Networks and Promising Biomarkers for Hepatitis B Virus-Related Hepatocellular Carcinoma</article-title>. <source>Int J Genomics</source> (<year>2020</year>) <volume>2020</volume>:<fpage>2061024</fpage>. doi: <pub-id pub-id-type="doi">10.1155/2020/2061024</pub-id>
</citation>
</ref>
<ref id="B30">
<label>30</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xiang</surname> <given-names>XH</given-names>
</name>
<name>
<surname>Yang</surname> <given-names>L</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>X</given-names>
</name>
<etal/>
</person-group>. <article-title>Seven-senescence-associated gene signature predicts overall survival for Asian patients with hepatocellular carcinoma</article-title>. <source>World J Gastroenterol</source> (<year>2019</year>) <volume>25</volume>(<issue>14</issue>):<page-range>1715&#x2013;28</page-range>. doi: <pub-id pub-id-type="doi">10.3748/wjg.v25.i14.1715</pub-id>
</citation>
</ref>
<ref id="B31">
<label>31</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname> <given-names>B</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>B</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>X</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>H</given-names>
</name>
<name>
<surname>He</surname> <given-names>L</given-names>
</name>
</person-group>. <article-title>KIF18B promotes the proliferation of pancreatic ductal adenocarcinoma via activating the expression of CDCA8</article-title>. <source>J&#xa0;Cell Physiol</source> (<year>2020</year>) <volume>235</volume>(<issue>5</issue>):<page-range>4227&#x2013;38</page-range>. doi: <pub-id pub-id-type="doi">10.1002/jcp.29201</pub-id>
</citation>
</ref>
<ref id="B32">
<label>32</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname> <given-names>B</given-names>
</name>
<name>
<surname>Huang</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Luo</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Ma</surname> <given-names>A</given-names>
</name>
<name>
<surname>Wu</surname> <given-names>Z</given-names>
</name>
<name>
<surname>Gan</surname> <given-names>Y</given-names>
</name>
<etal/>
</person-group>. <article-title>The diagnostic and prognostic value of cell division cycle associated gene family in Hepatocellular Carcinoma</article-title>. <source>J Cancer</source> (<year>2020</year>) <volume>11</volume>(<issue>19</issue>):<page-range>5727&#x2013;37</page-range>. doi: <pub-id pub-id-type="doi">10.7150/jca.46554</pub-id>
</citation>
</ref>
<ref id="B33">
<label>33</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname> <given-names>D</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>X</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Zhou</surname> <given-names>K</given-names>
</name>
<name>
<surname>Wu</surname> <given-names>J</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>JC</given-names>
</name>
<etal/>
</person-group>. <article-title>Identification of immune subtypes and prognosis of hepatocellular carcinoma based on immune checkpoint gene expression profile</article-title>. <source>Biomed Pharmacother</source> (<year>2020</year>) <volume>126</volume>:<fpage>109903</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.biopha.2020.109903</pub-id>
</citation>
</ref>
<ref id="B34">
<label>34</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jung</surname> <given-names>YD</given-names>
</name>
<name>
<surname>Cho</surname> <given-names>JH</given-names>
</name>
<name>
<surname>Park</surname> <given-names>S</given-names>
</name>
<name>
<surname>Kang</surname> <given-names>M</given-names>
</name>
<name>
<surname>Park</surname> <given-names>SJ</given-names>
</name>
<name>
<surname>Choi</surname> <given-names>DH</given-names>
</name>
<etal/>
</person-group>. <article-title>Lactate Activates the E2F Pathway to Promote Cell Motility by Up-Regulating Microtubule Modulating Genes</article-title>. <source>Cancers (Basel)</source> (<year>2019</year>) <volume>11</volume>(<issue>3</issue>):<fpage>274</fpage>. doi: <pub-id pub-id-type="doi">10.3390/cancers11030274</pub-id>
</citation>
</ref>
<ref id="B35">
<label>35</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname> <given-names>J</given-names>
</name>
<name>
<surname>Li</surname> <given-names>S</given-names>
</name>
<name>
<surname>Zhou</surname> <given-names>S</given-names>
</name>
<name>
<surname>Cao</surname> <given-names>S</given-names>
</name>
<name>
<surname>Lou</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Shen</surname> <given-names>H</given-names>
</name>
<etal/>
</person-group>. <article-title>Kinesin superfamily protein expression and its association with progression and prognosis in hepatocellular carcinoma</article-title>. <source>J Cancer Res Ther</source> (<year>2017</year>) <volume>13</volume>(<issue>4</issue>):<page-range>651&#x2013;9</page-range>. doi: <pub-id pub-id-type="doi">10.4103/jcrt.JCRT_491_17</pub-id>
</citation>
</ref>
<ref id="B36">
<label>36</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lu</surname> <given-names>M</given-names>
</name>
<name>
<surname>Huang</surname> <given-names>X</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Fu</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Xu</surname> <given-names>C</given-names>
</name>
<name>
<surname>Xiang</surname> <given-names>W</given-names>
</name>
<etal/>
</person-group>. <article-title>Aberrant KIF20A expression might independently predict poor overall survival and recurrence-free survival of hepatocellular carcinoma</article-title>. <source>IUBMB Life</source> (<year>2018</year>) <volume>70</volume>(<issue>4</issue>):<page-range>328&#x2013;35</page-range>. doi: <pub-id pub-id-type="doi">10.1002/iub.1726</pub-id>
</citation>
</ref>
<ref id="B37">
<label>37</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname> <given-names>GP</given-names>
</name>
<name>
<surname>Shen</surname> <given-names>SL</given-names>
</name>
<name>
<surname>Yu</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Yue</surname> <given-names>X</given-names>
</name>
<name>
<surname>Hu</surname> <given-names>WJ</given-names>
</name>
<name>
<surname>Li</surname> <given-names>SQ</given-names>
</name>
</person-group>. <article-title>Kinesin family member 2C aggravates the progression of hepatocellular carcinoma and interacts with competing endogenous RNA</article-title>. <source>J Cell Biochem</source> (<year>2020</year>) <volume>121</volume>(<issue>11</issue>):<page-range>4419&#x2013;30</page-range>. doi: <pub-id pub-id-type="doi">10.1002/jcb.29665</pub-id>
</citation>
</ref>
<ref id="B38">
<label>38</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jeffery</surname> <given-names>J</given-names>
</name>
<name>
<surname>Sinha</surname> <given-names>D</given-names>
</name>
<name>
<surname>Srihari</surname> <given-names>S</given-names>
</name>
<name>
<surname>Kalimutho</surname> <given-names>M</given-names>
</name>
<name>
<surname>Khanna</surname> <given-names>KK</given-names>
</name>
</person-group>. <article-title>Beyond cytokinesis: the emerging roles of CEP55 in tumorigenesis</article-title>. <source>Oncogene</source> (<year>2016</year>) <volume>35</volume>(<issue>6</issue>):<page-range>683&#x2013;90</page-range>. doi: <pub-id pub-id-type="doi">10.1038/onc.2015.128</pub-id>
</citation>
</ref>
<ref id="B39">
<label>39</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname> <given-names>L</given-names>
</name>
<name>
<surname>He</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>Z</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>W</given-names>
</name>
</person-group>. <article-title>Upregulation of CEP55 Predicts Dismal Prognosis in Patients with Liver Cancer</article-title>. <source>BioMed Res Int</source> (<year>2020</year>) <volume>2020</volume>:<fpage>4139320</fpage>. doi: <pub-id pub-id-type="doi">10.1155/2020/4139320</pub-id>
</citation>
</ref>
<ref id="B40">
<label>40</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname> <given-names>M</given-names>
</name>
<name>
<surname>Gao</surname> <given-names>J</given-names>
</name>
<name>
<surname>Li</surname> <given-names>D</given-names>
</name>
<name>
<surname>Yin</surname> <given-names>Y</given-names>
</name>
</person-group>. <article-title>CEP55 Promotes Cell Motility via JAK2-STAT3-MMPs Cascade in Hepatocellular Carcinoma</article-title>. <source>Cells</source> (<year>2018</year>) <volume>7</volume>(<issue>8</issue>):<fpage>99</fpage>. doi: <pub-id pub-id-type="doi">10.3390/cells7080099</pub-id>
</citation>
</ref>
</ref-list>
</back>
</article>