<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">785330</article-id>
<article-id pub-id-type="doi">10.3389/fgene.2021.785330</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>A Five-Gene Risk Score Model for Predicting the Prognosis of Multiple Myeloma Patients Based on Gene Expression Profiles</article-title>
<alt-title alt-title-type="left-running-head">Chen et&#x20;al.</alt-title>
<alt-title alt-title-type="right-running-head">Prognostic Model for Multiple Myeloma</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Chen</surname>
<given-names>Xiaotong</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/1465745/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Liu</surname>
<given-names>Lintao</given-names>
</name>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Chen</surname>
<given-names>Mengping</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/797793/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Xiang</surname>
<given-names>Jing</given-names>
</name>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Wan</surname>
<given-names>Yike</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/1497584/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Li</surname>
<given-names>Xin</given-names>
</name>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Jiang</surname>
<given-names>Jinxing</given-names>
</name>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Hou</surname>
<given-names>Jian</given-names>
</name>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1244044/overview"/>
</contrib>
</contrib-group>
<aff>Department of Hematology, Renji Hospital, Shanghai Jiao Tong University School of Medicine, <addr-line>Shanghai</addr-line>, <country>China</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/672784/overview">Min Tang</ext-link>, Jiangsu University, China</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/594339/overview">Lenka Besse</ext-link>, Kantonsspital St. Gallen, Switzerland</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1519074/overview">Yingying Chen</ext-link>, University of Nevada, Reno, United&#x20;States</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Jian Hou, <email>houjian@medmail.com.cn</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Computational Genomics, a section of the journal Frontiers in Genetics</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>30</day>
<month>11</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>12</volume>
<elocation-id>785330</elocation-id>
<history>
<date date-type="received">
<day>29</day>
<month>09</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>04</day>
<month>11</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2021 Chen, Liu, Chen, Xiang, Wan, Li, Jiang and Hou.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Chen, Liu, Chen, Xiang, Wan, Li, Jiang and Hou</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these&#x20;terms.</p>
</license>
</permissions>
<abstract>
<p>Multiple myeloma is a heterogeneous plasma cell malignancy that remains incurable because of the tendency of relapse for most patients. Survival outcomes may vary widely due to patient and disease variables; therefore, it is necessary to establish a more accurate prognostic model to improve prognostic precision and guide clinical therapy. Here, we developed a risk score model based on myeloma gene expression profiles from three independent datasets: GSE6477, GSE13591, and GSE24080. In this model, highly survival-associated five genes, including <italic>EPAS1</italic>, <italic>ERC2</italic>, <italic>PRC1</italic>, <italic>CSGALNACT1</italic>, and <italic>CCND1</italic>, are selected by using the least absolute shrinkage and selection operator (Lasso) regression and univariate and multivariate Cox regression analyses. At last, we analyzed three validation datasets (including GSE2658, GSE136337, and MMRF datasets) to examine the prognostic efficacy of this model by dividing patients into high-risk and low-risk groups based on the median risk score. The results indicated that the survival of patients in low-risk group was greatly prolonged compared with their counterparts in the high-risk group. Therefore, the five-gene risk score model could increase the accuracy of risk stratification and provide effective prediction for the prognosis of patients and instruction for individualized clinical treatment.</p>
</abstract>
<kwd-group>
<kwd>multiple myeloma</kwd>
<kwd>prognosis</kwd>
<kwd>risk score model</kwd>
<kwd>overall survival</kwd>
<kwd>prediction</kwd>
</kwd-group>
<contract-sponsor id="cn001">National Natural Science Foundation of China<named-content content-type="fundref-id">10.13039/501100001809</named-content>
</contract-sponsor>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p>Multiple myeloma (MM) is a heterogeneous plasma cell malignancy, which is the second most common hematological malignancy in the world (<xref ref-type="bibr" rid="B11">Kazandjian, 2016</xref>). There were 16,500 new cases and 10,300 deaths of MM in China in 2016; the morbidity and mortality rates increased with age, and older people were at higher risk of MM (<xref ref-type="bibr" rid="B7">Gerecke et&#x20;al., 2016</xref>; <xref ref-type="bibr" rid="B16">Liu et&#x20;al., 2019</xref>). The median survival in MM is approximately 6&#xa0;years, with survival duration ranging from a few months to more than 10&#xa0;years (<xref ref-type="bibr" rid="B20">Rajkumar, 2020</xref>). The International Staging System (ISS) is a widely used system for the stratification of MM patients based on easy-to-apply variables (serum beta2-microglobulin and serum albumin) (<xref ref-type="bibr" rid="B8">Greipp et&#x20;al., 2005</xref>). In 2015, the International Myeloma Working Group (IMWG) developed the revised international staging system (R-ISS); it classifies patients into three risk groups by combining the ISS with high-risk cytogenetic abnormalities (CA) [del (17p), t (4; 14) (p16; q32), or t (14; 16) (q32; q23)] and serum lactate dehydrogenase (LDH) (<xref ref-type="bibr" rid="B18">Palumbo et&#x20;al., 2015</xref>). For cytogenetic changes, t (4; 14), t (14; 16), t (14; 20), del (17p), and hypodiploidy have been found to be associated with high-risk diseases (<xref ref-type="bibr" rid="B19">Rajkumar et&#x20;al., 2013</xref>; <xref ref-type="bibr" rid="B15">Lakshman et&#x20;al., 2018</xref>).</p>
<p>Over the past 15&#xa0;years, the survival rate of MM has improved significantly (<xref ref-type="bibr" rid="B13">Kumar et&#x20;al., 2014</xref>). Bortezomib, lenalidomide, and dexamethasone (VRd) is the current standard of treatment for newly diagnosed MM (<xref ref-type="bibr" rid="B20">Rajkumar, 2020</xref>). For the treatment of recurrent MM, proteasome inhibitors, immunomodulatory substances, and classical chemotherapy agents are the main therapeutic measures (<xref ref-type="bibr" rid="B7">Gerecke et&#x20;al., 2016</xref>). But even under these treatments, almost all patients with MM eventually relapse; the survival outcomes of MM are highly heterogeneous. Therefore, it is important to perform risk stratification for patients with MM and to find reliable prognostic biomarkers, which can better improve prognostic accuracy and guide clinical treatment (<xref ref-type="bibr" rid="B21">Shaughnessy et&#x20;al., 2007</xref>). Comprehensive clinical information and gene expression data in public biological databases can provide opportunities to identify the prognostic signature for MM, and the biomarkers which are associated with prognostic and survival outcomes can be identified based on the gene expression of myeloma patient tissues. Studies have demonstrated that prognostic models based on the gene expression signature can predict survival outcomes in multiple independent datasets, and the discriminatory ability was also better than other combinations of traditional risk scores <underline>(</underline>
<xref ref-type="bibr" rid="B10">Heuck et&#x20;al., 2014</xref>
<underline>).</underline> In addition, further validation and analysis can also be performed in combination with other clinical information from more cohorts, to provide new insights for clinical application (<xref ref-type="bibr" rid="B5">Cai et&#x20;al., 2020</xref>).</p>
<p>The gene expression profiles can be used to calculate differentially expressed genes (DEGs) between myeloma patients and healthy individuals, and the DEGs may associate with the prognosis of MM patients (<xref ref-type="bibr" rid="B1">Alizadeh et&#x20;al., 2000</xref>). Some studies have already investigated the use of gene expression profiles alone or in combination with clinical factors as an improvement to estimate patient survival risk (<xref ref-type="bibr" rid="B6">Fernandez-Teijeiro et&#x20;al., 2004</xref>; <xref ref-type="bibr" rid="B9">Habermann et&#x20;al., 2008</xref>). The least absolute shrinkage and selection operator (Lasso) regression is a method for variable selection; it reduces the number of variables and only retains the most influential variables by using dimensionality reduction techniques. Univariate and multivariate Cox regression analyses were performed to obtain the genes correlated with prognosis in order to produce an accurate and refined&#x20;model.</p>
<p>In this study, we integrated multiple datasets to develop and validate an effective prognostic risk model for MM patients, which was successfully validated in additional three independent datasets to demonstrate the stability and reliability of the risk model. This model contributes to risk stratification, providing important implications for the prognosis of MM and may offer the prospect of personalized therapeutic.</p>
</sec>
<sec sec-type="materials|methods" id="s2">
<title>Materials and Methods</title>
<sec id="s2-1">
<title>MM Dataset</title>
<p>We systematically searched for MM datasets that were publicly available and provided prognosis information, and the datasets must have complete survival data, including survival status and OS/PFS/EFS time. For this study, GSE6477, GSE13591, GSE24080, GSE2658, and GSE136337 were downloaded from the Gene Expression Omnibus (GEO) database (<ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/geo/">https://www.ncbi.nlm.nih.gov/geo/</ext-link>) and normalized between different arrays. Detailed clinic-cytogenetic information including t (4; 14), t (14; 16), del (17p), ISS, and R-ISS stage and survival data were included in the datasets. The MMRF CoMMpass study is a clinical trial of newly diagnosed MM patients sponsored by the Multiple Myeloma Research Foundation; the project provides clinical information (including survival data) and the expression profile data of MM patients. Because both GSE6477 and GSE13591 contained healthy individuals and MM patients, the microarray data from these two databases were used to obtain DEGs between the two groups. GSE24080, as a training dataset, contained 559&#xa0;MM patients, while the testing datasets GSE2658, GSE136337, and MMRF contained 559, 256, and 559&#xa0;MM patients, respectively. The study workflow is shown in <xref ref-type="fig" rid="F1">Figure&#x20;1</xref>.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Study flowchart of prognostic model building and validation.</p>
</caption>
<graphic xlink:href="fgene-12-785330-g001.tif"/>
</fig>
</sec>
<sec id="s2-2">
<title>Acquisition of Prognostic Associated Genes</title>
<p>From GSE6477 and GSE13591 dataset, healthy individuals and MM patients were screened out, and the data were combined to create an expression matrix including 20 healthy individuals and 206&#xa0;MM patients. Then, we acquired the DEGs between the two groups, genes with &#x7c;log fold change&#x7c; &#x3e; 1 and Benjamini&#x2013;Hochberg&#x2013;adjusted <italic>p</italic>&#x20;&#x3c; 0.05 were considered as significant DEGs. At last, the DEGs were overlapped with 20,174 genes in the GSE24080 dataset to generate the matrix of prognostic model&#x2013;associated&#x20;genes.</p>
</sec>
<sec id="s2-3">
<title>Lasso Regression Analysis</title>
<p>The GSE24080 dataset was used as the training dataset to construct the risk score model. In order to improve the prediction accuracy and interpretability of the prognostic model, Lasso regression was used to select potential prognostic genes. In this result, all genes have their own relative coefficients, and with the continuous selection and simulation of significant features, we can acquire an optimal model with the parameter lambda.min which contains the top features for constructing the risk score model. In the receiver operating characteristic (ROC) curve, the survival outcome was predicted by the patients&#x2019; risk score, and the area under curve (AUC) of the model was used to demonstrate the prediction ability of the Lasso&#x20;model.</p>
</sec>
<sec id="s2-4">
<title>Construction and Validation of Prognostic Risk Score Model</title>
<p>We used the univariate Cox regression to select genes correlated with prognosis (<italic>p</italic>&#x20;&#x3c; 0.05). Next, the Kaplan&#x2013;Meier analysis was performed to screen for the genes significantly associated with survival outcome. Then, multivariate Cox regression was used to analyze these key genes (<italic>p</italic>&#x20;&#x3c; 0.05) prior to establishing the prognostic risk score model. The risk scores of all samples were calculated according to the equation: risk score &#x3d; &#x2211;coefficient value &#x2217; expression&#x20;level.</p>
<p>The GSE2658, GSE136337 and MMRF datasets, were used for validation. With the median risk scores as the cut-off value to classify the high-risk and low-risk groups, we used the log-rank test to compare the difference between the two groups in both training and testing datasets.</p>
</sec>
<sec id="s2-5">
<title>Statistical Analysis</title>
<p>The R software &#x201c;sva&#x201d; package was used to perform correction and acquire the integrated expression matrix by removing batch effects between different datasets. The R software &#x201c;Limma&#x201d; package was used to obtain DEGs, and the &#x201c;Glmnet&#x201d; package was used to further construct the model. Survival was compared using the Kaplan&#x2013;Meier analysis with log-rank tests. Univariate and multivariate Cox regression analyses were performed for subsequent analyses. The R software &#x201c;ggpubr&#x201d; package was used to visualize the risk scores of patients in different survival states with the Wilcoxon tests. The R software &#x201c;survival&#x201d; and &#x201c;survminer&#x201d; packages were used to divide the patients into high-risk and low-risk groups by the median risk score. The R software vision 3.6.3 was used for statistical analyses. A two-sided <italic>p</italic>&#x20;&#x3c; 0.05 was considered statistically significant.</p>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec id="s3-1">
<title>Identification of Prognostic Model&#x2013;Associated Genes</title>
<p>To develop the prognostic model for MM, GSE6477 dataset and GSE13591 dataset were used to acquire the DEGs between MM patients and healthy individuals. A total of 304 DEGs were identified with the cutoffs of &#x7c;log fold change&#x7c; &#x3e; 1 and Benjamini&#x2013;Hochberg&#x2013;adjusted <italic>p</italic>&#x20;&#x3c; 0.05, among which, 90 genes were upregulated and 214 genes were downregulated, as shown in the volcano plot (<xref ref-type="sec" rid="s10">Supplementary Figure S1</xref>). Next, the GSE24080 dataset was used as the training dataset and to construct the prognostic model, 20,174 genes from 559&#xa0;MM patients were obtained from it. By overlapping these genes with the 304 DEGs obtained above, we acquired a matrix containing 304 genes as the prognostic model&#x2013;associated genes (<xref ref-type="sec" rid="s10">Supplementary Table&#x20;S1</xref>).</p>
</sec>
<sec id="s3-2">
<title>Construction of Lasso Regression Model</title>
<p>We used Lasso regression to select the prognostic-related genes. In this regression, the contributions of all the genes were weighted by their relative coefficients. Cross validation was used to get the best performance of the model; the left dashed line represents lambda.min, which was utilized to generate the most accurate model by minimizing the prediction error (<xref ref-type="fig" rid="F2">Figure&#x20;2A</xref>). Finally, 304 genes were narrowed down to 38 potential predictor variables (<xref ref-type="sec" rid="s10">Supplementary Table S2</xref>) with nonzero coefficients in the Lasso regression model. The final risk score can be acquired by multiplying the expression of each gene with its corresponding coefficient and adding them together; then, we used the median of the risk score as the cut-off value to divide the high-risk and low-risk groups. By comparing the two groups, we found the 38-gene predictive model could distinguish the survival and death events effectively (Wilcoxon test <italic>p</italic>&#x20;&#x3c; 2.2e-16, <xref ref-type="fig" rid="F2">Figure&#x20;2B</xref>). In the ROC curve, the area under curve (AUC) of the predictive model is 0.785, indicating that the predictive ability of the model is favorable (<xref ref-type="fig" rid="F2">Figure&#x20;2C</xref>).</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Screen prognostic genes by Lasso regression analysis. <bold>(A)</bold> Acquisition of the best Lambda value. The left dashed line represents lambda.min, the right dashed line represents lambda.1se. <bold>(B)</bold> To distinguish the survival and death events by the model based on lambda.min. 0: alive, 1: death. <bold>(C)</bold> ROC curve is used to evaluate the predictive performance of the&#x20;model.</p>
</caption>
<graphic xlink:href="fgene-12-785330-g002.tif"/>
</fig>
</sec>
<sec id="s3-3">
<title>Construction of the Prognostic Model</title>
<p>Next, the univariate Cox regression analysis and Kaplan&#x2013;Meier survival analysis were used to filter the target genes, and the results showed 20 genes (<xref ref-type="sec" rid="s10">Supplementary Table S3</xref>) in the GSE24080 dataset were significantly associated with prognosis. To further screen the key genes, we performed multivariate Cox regression analysis of the 20 genes, and finally we obtained a 11-gene prognostic model which was significantly associated with the prognosis in MM patients (<xref ref-type="sec" rid="s10">Supplementary Table S4</xref>). We further screened the 11 genes and ordered them according to the <italic>p</italic> value. The data indicated that the log-rank <italic>p</italic> values of all genes in the model were minimal (log-rank <italic>p</italic>&#x20;&#x3c; 0.01), when containing the top four or top five genes. To obtain the optimal model, we compared the prediction results of the four-gene model and the five-gene model for the prognosis of MM. Results showed that when applying the four-gene model in the testing dataset (GSE136337), Kaplan&#x2013;Meier curves of PFS showed no difference between the high-risk and low-risk groups divided by the median risk score (log-rank p &#x3d; 0.092, <xref ref-type="sec" rid="s10">Supplementary Figure S1B</xref>), which indicated that the four-gene model was not as valid as the five-gene model; the five-gene model can predict the prognostic outcome more effectively. Therefore, the five genes were used to build a risk score model, including <italic>EPAS1, ERC2, PRC1, CSGALNACT1,</italic> and <italic>CCND1,</italic> the multivariate Cox regression analysis showed them significantly associated with prognosis in MM patients (<xref ref-type="fig" rid="F3">Figure&#x20;3A</xref>). Among them, <italic>EPAS1, ERC2, CSGALNACT1</italic>, and <italic>CCND</italic> (hazard ratio &#x3c;1) were protective genes, while <italic>PRC1</italic> was a harmful gene (hazard ratio &#x3e;1). Then MM patients were divided into the high-risk and low-risk groups by the median risk score, and the KM analysis was used to compare the overall survival (OS) difference between the two groups. As shown in <xref ref-type="fig" rid="F3">Figures 3B&#x2013;F</xref>, we found that the differences between each gene&#x2019;s two groups were highly significant. The high expressions of <italic>PRC1</italic> were infaust for survival outcome in MM patients, but other genes are beneficial.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Construction of the five-gene risk score model. <bold>(A)</bold> Multivariate Cox regression analysis of the five genes (&#x2a;&#x2a;<italic>p</italic>&#x20;&#x3c; 0.01 and &#x2a;&#x2a;&#x2a;<italic>p</italic>&#x20;&#x3c; 0.001). The figure also showed Hazard ratio, Global log-rank p, C-index, and AIC. <bold>(B&#x2013;F)</bold> Kaplan&#x2013;Meier survival of five prognostic genes: <italic>EPAS1</italic>, <italic>ERC2</italic>, <italic>PRC1</italic>, <italic>CSGALNACT1</italic>, and <italic>CCND1</italic>.</p>
</caption>
<graphic xlink:href="fgene-12-785330-g003.tif"/>
</fig>
<p>Next, the dot plots were used to compare the survival of patients in the high-risk and low-risk groups and found that the survival of the low-risk group was higher than the survival of the high-risk group (<xref ref-type="fig" rid="F4">Figures 4A,B</xref>). The gene expression levels in the heat map showed that four genes were decreased in the high-risk group (<xref ref-type="fig" rid="F4">Figure&#x20;4C</xref>) and consistent with their hazard ratio (HR) values (HR &#x3c; 1). The time-dependent ROC analysis was used to assess the predictive ability of the model, the AUC values for 1-year, 3-year, and 5-year survival were 0.685, 0.735, and 0.676, respectively (<xref ref-type="fig" rid="F4">Figure&#x20;4D</xref>). In the GSE24080 dataset, to verify the predictive ability of the five-gene risk score model in the training dataset, the patients were divided into high-risk and low-risk groups by the median risk score. We found the difference between the two groups was highly significant in event-free survival (EFS) and OS (log-rank <italic>p</italic>&#x20;&#x3c; 0.0001; <xref ref-type="fig" rid="F4">Figures 4E,F</xref>). The results demonstrated that the five-gene prognostic model was significantly associated with prognosis in MM patients.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>Risk score model based on five-gene signature in the training dataset. <bold>(A)</bold> MM patients were divided into high-risk and low-risk groups based on the median risk score. <bold>(B)</bold> Scatter plot of survival time and status in the high-risk and low-risk groups. <bold>(C)</bold> Gene expression heat map for five prognostic genes in the high-risk and low-risk groups. <bold>(D)</bold> Assessment of the predictive ability of the model by time-dependent ROC analysis. <bold>(E,F)</bold> Kaplan&#x2013;Meier curves of overall survival and event-free survival for the high-risk group and low-risk group in MM patients (log-rank test, <italic>p</italic>&#x20;&#x3c; 0.0001).</p>
</caption>
<graphic xlink:href="fgene-12-785330-g004.tif"/>
</fig>
</sec>
<sec id="s3-4">
<title>Validation of the Five-Gene Risk Score Model</title>
<p>In order to validate the predictive ability of the five-gene risk score model, we analyzed three testing datasets: the MMRF dataset, GSE2658, and GSE136337 datasets. In these testing datasets, the median risk score was taken as the cut-off value to divide high-risk and low-risk groups; then Kaplan&#x2013;Meier survival curves were used to distinguish the differences between the two groups in MM patients. In the MMRF dataset, the survival information contains OS and progress-free survival (PFS), both survival information could be used to verify the five-gene risk score model. The results showed that the differences between the two groups were highly significant in OS (log-rank <italic>p</italic>&#x20;&#x3c; 0.001) as well as PFS (log-rank <italic>p</italic>&#x20;&#x3c; 0.001), and the high-risk group predicted poor survival outcome, in line with the training dataset (<xref ref-type="fig" rid="F5">Figures 5A,B</xref>). Similarly, both in the GSE2658 (<xref ref-type="fig" rid="F5">Figure&#x20;5C</xref>) GSE136337 datasets (<xref ref-type="fig" rid="F5">Figures 5D,E</xref>), the high-risk groups showed significantly shorter OS than the low-risk groups and the log-rank <italic>p</italic> values were &#x3c;0.0001, 0.00017, and 0.012, respectively. Taken together, our five-gene risk score model was confirmed to be an independent prognostic factor in three testing datasets and can effectively predict the prognostic risk of MM patients.</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>Validation of the five-gene risk score model in the testing datasets by Kaplan&#x2013;Meier curves. <bold>(A)</bold> OS in the MMRF dataset (<italic>p</italic>&#x20;&#x3d; 0.00082). <bold>(B)</bold> PFS in the MMRF dataset (<italic>p</italic>&#x20;&#x3d; 0.00021). <bold>(C)</bold> OS in GSE2658 (<italic>p</italic>&#x20;&#x3c; 0.0001). <bold>(D)</bold> OS in GSE136337 (<italic>p</italic>&#x20;&#x3d; 0.00017). <bold>(E)</bold> PFS in GSE136337 (<italic>p</italic>&#x20;&#x3d; 0.012). MM patients were divided into high-risk and low-risk groups by the median risk score. The difference between the two groups was tested by the log-rank&#x20;test.</p>
</caption>
<graphic xlink:href="fgene-12-785330-g005.tif"/>
</fig>
</sec>
<sec id="s3-5">
<title>Identification of Genetic Risk Indicators</title>
<p>Del (17p), t (4; 14), and t (14; 16) are defined as high-risk genetic factors by the IMWG. To verify the predictive ability of the five-gene risk score model in patients with or without these genetic factors, we analyzed the GSE136337 dataset which contains these indicators. First, we divided MM patients into two subgroups based on absence/presence del (17p), and the number of these patients were 411 and 15, respectively. Then, the patients were further subdivided into high-risk and low-risk groups by the median risk score. Kaplan&#x2013;Meier curves of PFS showed no difference between the del (17p) FALSE and del (17p) TRUE group (log-rank <italic>p</italic>&#x20;&#x3d; 0.86), which indicated that del (17p) was not an effective indicator for prognostic outcome (<xref ref-type="sec" rid="s10">Supplementary Figure S2A</xref>). However, in patients without del (17p), the difference between two groups was statistically significant (log-rank <italic>p</italic>&#x20;&#x3c; 0.0001, <xref ref-type="fig" rid="F6">Figure&#x20;6A</xref>), with the high-risk group showing shorter overall survival than the low-risk group, whereas in patients with del (17p), the difference was not significant (log-rank <italic>p</italic>&#x20;&#x3d; 0.49; <xref ref-type="sec" rid="s10">Supplementary Figure&#x20;S2B</xref>).</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>Validation of the five-gene risk score model in patients without genetic risk indicators by Kaplan&#x2013;Meier curves. <bold>(A)</bold> MM patients without del (17p) (<italic>p</italic>&#x20;&#x3d;&#x20;0.0002). <bold>(B)</bold> MM patients without t (4,14) (<italic>p</italic>&#x20;&#x3d; 0.00039). <bold>(C)</bold> MM patients without t (14,16) (<italic>p</italic>&#x20;&#x3d; 0.00015). MM patients were divided into high-risk and low-risk groups by the median risk&#x20;score.</p>
</caption>
<graphic xlink:href="fgene-12-785330-g006.tif"/>
</fig>
<p>Similarly, the results showed that there was no difference between the t (4,14) FALSE and t (4,14) TRUE group, indicating t (4,14) was not an effective indicator for prognostic outcomes (log-rank <italic>p</italic>&#x20;&#x3d; 0.98, <xref ref-type="sec" rid="s10">Supplementary Figure S2C</xref>). In patients without t (4,14), the low-risk group showed higher overall survival than the high-risk group (log-rank <italic>p</italic>&#x20;&#x3c; 0.0001; <xref ref-type="fig" rid="F6">Figure&#x20;6B</xref>). But for patients with t (4,14), the difference between the two groups was not significant (log-rank <italic>p</italic>&#x20;&#x3d; 0.1; <xref ref-type="sec" rid="s10">Supplementary Figure S2D</xref>). Since there was only 1&#xa0;MM patient with t (14,16), t (14,16) also cannot be used as a predictor of prognosis (log-rank <italic>p</italic>&#x20;&#x3d; 0.48, <xref ref-type="sec" rid="s10">Supplementary Figure S2E</xref>). For 425&#xa0;MM patients without t (14,16), the result showed that the survival was longer in the low-risk group than in the high-risk group (log-rank <italic>p</italic>&#x20;&#x3c; 0.0001; <xref ref-type="fig" rid="F6">Figure&#x20;6C</xref>). In conclusion, our five-gene risk score model could effectively predict the prognosis of MM patients without high-risk genetic factors.</p>
</sec>
<sec id="s3-6">
<title>Identification of Clinical Risk Indicators</title>
<p>R_ISS and ISS were the widely used systems for the stratification of MM patients. In the MMRF dataset, the difference between different stages were highly significant (log-rank <italic>p</italic>&#x20;&#x3c; 0.0001; <xref ref-type="sec" rid="s10">Supplementary Figures S3A,B</xref>). The ISS and R_ISS systems divided MM patients into three stages, and stage II and stage III were considered as progressive stages for MM patients. Thus, we verified the model&#x2019;s predictive ability in the patients with these two stages, and the results showed that by combining the data from the ISS stage II and III, the difference in OS between the high-risk and low-risk groups was highly significant (log-rank <italic>p</italic>&#x20;&#x3d; 0.0049; <xref ref-type="fig" rid="F7">Figure&#x20;7A</xref>), the survival of the low-risk group was longer than the survival of high-risk group. However, for the ISS stage I, the difference between the two groups was not discernible (log-rank <italic>p</italic>&#x20;&#x3d; 0.16; <xref ref-type="sec" rid="s10">Supplementary Figure S3C</xref>). For R-ISS, we performed the same analysis, broadly consistent with the above. For stage II and III, the difference between the high-risk and low-risk groups was significant (log-rank <italic>p</italic>&#x20;&#x3d; 0.01; <xref ref-type="fig" rid="F7">Figure&#x20;7B</xref>); stage I was not significant (log-rank <italic>p</italic>&#x20;&#x3d; 0.7; <xref ref-type="sec" rid="s10">Supplementary Figure S3D</xref>). As the disease progressed, the risk scores became higher. The risk scores were significantly higher in stage II and III patients than in stage I patients in the ISS (Wilcoxon test <italic>p</italic>&#x20;&#x3d; 0.00045, <xref ref-type="fig" rid="F7">Figure&#x20;7C</xref>) and R-ISS (Wilcoxon test <italic>p</italic>&#x20;&#x3d; 0.00076, <xref ref-type="fig" rid="F7">Figure&#x20;7D</xref>). In addition, for ISS, when comparing stage II and stage III patients with stage I patients separately, the difference of risk scores between two stages were significant (Wilcoxon test p &#x3c; 0.05). But the difference was not significant between stage II and stage III patients (Wilcoxon test p &#x3d; 0.1). For R-ISS, when comparing stage I patients, stage II patients, and stage III patients separately, the risk scores between these stages were highly significant (Wilcoxon test p &#x3c; 0.05, <xref ref-type="sec" rid="s10">Supplementary Figure S3E,F</xref>). In conclusion, for high-risk ISS and R_ISS stage patients, the five-gene risk score model was confirmed to be an independent prognostic factor and can be further used to predict the prognostic outcome more accurately.</p>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption>
<p>Validation of the five-gene risk score model in ISS and R_ISS. <bold>(A)</bold> Kaplan&#x2013;Meier curves of MM patients in stage II and III of ISS (<italic>p</italic>&#x20;&#x3d; 0.0049). <bold>(B)</bold> Kaplan&#x2013;Meier curves of MM patients in stage II and III of R-ISS (<italic>p</italic>&#x20;&#x3d; 0.01). <bold>(C)</bold> Survival differences between stage I and stage II and III of ISS (<italic>p</italic>&#x20;&#x3d; 0.00045). <bold>(D)</bold> Survival differences between stage I and stage II and III of R_ISS (<italic>p</italic>&#x20;&#x3d; 0.00076). MM patients were divided into high-risk and low-risk groups by the median risk&#x20;score.</p>
</caption>
<graphic xlink:href="fgene-12-785330-g007.tif"/>
</fig>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>MM is a malignancy of terminally differentiated plasma cells, which in most cases remain incurable, the MM cells are mainly resident in the bone marrow (<xref ref-type="bibr" rid="B14">Kumar et&#x20;al., 2017</xref>). Patients suffering from MM often display heterogeneous clinical outcomes, and MM remains a challenge due to the tendency to relapses for most patients (<xref ref-type="bibr" rid="B23">Sonneveld and Broijl, 2016</xref>). Among patients receiving the same treatments, survival outcomes can vary widely. In this regard, treatment options for individualized treatment are lacking.</p>
<p>Therefore, a prognostic signature beyond the current staging system is needed to establish to improve prognostic precision and guide clinical therapy. Many studies have found that gene transcription levels are closely related to tumor prognosis and biomarkers have been studied to improve prediction accuracy. In this study, we integrated public MM datasets and constructed a five-gene prognostic risk score model. The results demonstrated its validity in three independent datasets. The risk score model was confirmed to be an independent prognostic factor in multiple analyses that included genetic factors and clinical factors. Compared with other prognostic models, our model predicted survival outcomes effectively and were applied to predict the prognosis of patients with high-risk ISS/R-ISS stage or patients without high-risk genetic factors innovatively. As shown in <xref ref-type="fig" rid="F6">Figure&#x20;6</xref>, for patients without del (17p), t (4,14), and t (14,16), the difference between two groups was statistically significant, the high-risk group showed shorter overall survival than the low-risk&#x20;group.</p>
<p>The five genes in the risk score model: Endothelial PAS domain-containing protein 1 (<italic>EPAS1</italic>), often known as HIF2&#x3b1;, is a type of hypoxia-inducible factor (<xref ref-type="bibr" rid="B24">Tian et&#x20;al., 1997</xref>). In colorectal carcinoma, the <italic>EPAS1</italic> protein expression inversely correlated with higher tumor grade and is associated with poor prognosis (<xref ref-type="bibr" rid="B3">Baba et&#x20;al., 2010</xref>). <italic>ERC2</italic> (ELKS/RAB6-Interacting/CAST Family Member 2) is a protein-coding gene located in presynaptic active zones (<xref ref-type="bibr" rid="B12">Ko et&#x20;al., 2006</xref>). In renal-cell carcinomas (<xref ref-type="bibr" rid="B2">Arai et&#x20;al., 2014</xref>), frequent genetic and transcriptional inactivation of <italic>ERC2</italic> occurs, suggesting that <italic>ERC2</italic> may be involved in cancer progression. <italic>PRC1</italic> (protein regulator of cytokinesis-1) belongs to the microtubule-associated protein family and is involved in cytokinesis. <italic>PRC1</italic> was significantly overexpressed in breast cancer and lung adenocarcinoma (<xref ref-type="bibr" rid="B22">Shimo et&#x20;al., 2007</xref>; <xref ref-type="bibr" rid="B25">Zhan et&#x20;al., 2017</xref>), despite the possible molecular mechanisms have not been fully elucidated. <italic>CSGALNACT1</italic> (chondroitin sulfate N-acetylgalactosaminyltransferase 1) encodes a protein involved in glycos&#x2013;aminoglycan chain synthesis and modifications. Studies showed that in myeloma, <italic>CSGALNACT1</italic> was under-expressed in MM cells compared to normal bone marrow plasma cells, which suggest that the overexpression of <italic>CSGALNACT1</italic> is associated with a good prognosis (<xref ref-type="bibr" rid="B4">Bret et&#x20;al., 2009</xref>). <italic>CCND1</italic> (cyclin D1) is involved in regulating cell cycle and transcriptional processes. Previous studies have found that in myeloma, the dysregulation of <italic>CCND1</italic> is associated with oncogenic event in patients (<xref ref-type="bibr" rid="B17">Padhi et&#x20;al., 2013</xref>), but further functional studies are needed to validate.</p>
<p>The advantage of our research is large sample sizes, five GEO datasets, and the MMRF dataset are used for system analysis; and a variety of algorithms are used to explore the optimal prognostic model. For survival information (OS, PFS, and EFS) in three testing datasets, the five-gene risk score model also acquired effective prognostic predictions; the survival of low-risk group was higher than the survival in the high-risk group. For genetic factors [del (17p), t (4; 14), and t (14; 16)] and clinical factors (ISS and R-ISS), adding the model can increase the prediction accuracy significantly. All these results demonstrate the stability and reliability of the model, which is an independent predictor of survival, can identify high-/low-risk patients and provide effective treatment recommendations. More importantly, in previous studies, these five genes have been confirmed to be associated with tumor, so further functional studies are needed to evaluate the roles of five genes in myeloma.</p>
<p>Despite the model is effective in predicting prognosis, there were still some shortcomings. First, we used multiple datasets, but there are biases between different platforms which may cause differences in results. Second, this is a retrospective study; further prospective studies are needed to confirm the results. Third, the contribution of each gene in the five-gene risk score model is unknown, further functional studies are needed to be validated.</p>
</sec>
</body>
<back>
<sec id="s5">
<title>Data Availability Statement</title>
<p>Publicly available datasets were analyzed in this study. These data can be found here: <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/geo/">https://www.ncbi.nlm.nih.gov/geo/</ext-link> <ext-link ext-link-type="uri" xlink:href="https://research.themmrf.org/">https://research.themmrf.org/</ext-link>.</p>
</sec>
<sec id="s6">
<title>Author Contributions</title>
<p>XC and JH conceived, designed, and managed the study. XC, LL, and MC drafted the manuscript. JX, YW, XL, and JJ provided technical assistance. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec id="s7">
<title>Funding</title>
<p>This work was supported by the National Natural Science Foundation of China (81974006).</p>
</sec>
<sec sec-type="COI-statement" id="s8">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s9">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="s10">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fgene.2021.785330/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fgene.2021.785330/full&#x23;supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="DataSheet1.ZIP" id="SM1" mimetype="application/ZIP" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Alizadeh</surname>
<given-names>A. A.</given-names>
</name>
<name>
<surname>Eisen</surname>
<given-names>M. B.</given-names>
</name>
<name>
<surname>Davis</surname>
<given-names>R. E.</given-names>
</name>
<name>
<surname>Ma</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Lossos</surname>
<given-names>I. S.</given-names>
</name>
<name>
<surname>Rosenwald</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2000</year>). <article-title>Distinct Types of Diffuse Large B-Cell Lymphoma Identified by Gene Expression Profiling</article-title>. <source>Nature</source> <volume>403</volume> (<issue>6769</issue>), <fpage>503</fpage>&#x2013;<lpage>511</lpage>. <pub-id pub-id-type="doi">10.1038/35000501</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Arai</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Sakamoto</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Ichikawa</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Totsuka</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Chiku</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Gotoh</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>Multilayer&#x2010;omics Analysis of Renal Cell Carcinoma, Including the Whole Exome, Methylome and Transcriptome</article-title>. <source>Int. J.&#x20;Cancer</source> <volume>135</volume> (<issue>6</issue>), <fpage>1330</fpage>&#x2013;<lpage>1342</lpage>. <pub-id pub-id-type="doi">10.1002/ijc.28768</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Baba</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Nosho</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Shima</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Irahara</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Chan</surname>
<given-names>A. T.</given-names>
</name>
<name>
<surname>Meyerhardt</surname>
<given-names>J.&#x20;A.</given-names>
</name>
<etal/>
</person-group> (<year>2010</year>). <article-title>HIF1A Overexpression Is Associated with Poor Prognosis in a Cohort of 731 Colorectal Cancers</article-title>. <source>Am. J.&#x20;Pathol.</source> <volume>176</volume> (<issue>5</issue>), <fpage>2292</fpage>&#x2013;<lpage>2301</lpage>. <pub-id pub-id-type="doi">10.2353/ajpath.2010.090972</pub-id> </citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bret</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Hose</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Reme</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Sprynski</surname>
<given-names>A.-C.</given-names>
</name>
<name>
<surname>Mahtouk</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Schved</surname>
<given-names>J.-F.</given-names>
</name>
<etal/>
</person-group> (<year>2009</year>). <article-title>Expression of Genes Encoding for Proteins Involved in Heparan Sulphate and Chondroitin Sulphate Chain Synthesis and Modification in normal and Malignant Plasma Cells</article-title>. <source>Br. J.&#x20;Haematol.</source> <volume>145</volume> (<issue>3</issue>), <fpage>350</fpage>&#x2013;<lpage>368</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-2141.2009.07633.x</pub-id> </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cai</surname>
<given-names>W.-Y.</given-names>
</name>
<name>
<surname>Dong</surname>
<given-names>Z.-N.</given-names>
</name>
<name>
<surname>Fu</surname>
<given-names>X.-T.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>L.-Y.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Ye</surname>
<given-names>G.-D.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Identification of a Tumor Microenvironment-Relevant Gene Set-Based Prognostic Signature and Related Therapy Targets in Gastric Cancer</article-title>. <source>Theranostics</source> <volume>10</volume> (<issue>19</issue>), <fpage>8633</fpage>&#x2013;<lpage>8647</lpage>. <pub-id pub-id-type="doi">10.7150/thno.47938</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fernandez-Teijeiro</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Betensky</surname>
<given-names>R. A.</given-names>
</name>
<name>
<surname>Sturla</surname>
<given-names>L. M.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>J.&#x20;Y. H.</given-names>
</name>
<name>
<surname>Tamayo</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Pomeroy</surname>
<given-names>S. L.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Combining Gene Expression Profiles and Clinical Parameters for Risk Stratification in Medulloblastomas</article-title>. <source>Jco</source> <volume>22</volume> (<issue>6</issue>), <fpage>994</fpage>&#x2013;<lpage>998</lpage>. <pub-id pub-id-type="doi">10.1200/JCO.2004.03.036</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gerecke</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Fuhrmann</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Strifler</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Schmidt-Hieber</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Einsele</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Knop</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>The Diagnosis and Treatment of Multiple Myeloma</article-title>. <source>Dtsch Arztebl Int.</source> <volume>113</volume> (<issue>27-28</issue>), <fpage>470</fpage>&#x2013;<lpage>476</lpage>. <pub-id pub-id-type="doi">10.3238/arztebl.2016.0470</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Greipp</surname>
<given-names>P. R.</given-names>
</name>
<name>
<surname>Miguel</surname>
<given-names>J.&#x20;S.</given-names>
</name>
<name>
<surname>Durie</surname>
<given-names>B. G. M.</given-names>
</name>
<name>
<surname>Crowley</surname>
<given-names>J.&#x20;J.</given-names>
</name>
<name>
<surname>Barlogie</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Blade</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>International Staging System for Multiple Myeloma</article-title>. <source>J.&#x20;Clin. Oncol.</source> <volume>23</volume> (<issue>15</issue>), <fpage>3412</fpage>&#x2013;<lpage>3420</lpage>. <pub-id pub-id-type="doi">10.1200/JCO.2005.04.242</pub-id> </citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Habermann</surname>
<given-names>T. M.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>S. S.</given-names>
</name>
<name>
<surname>Maurer</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Morton</surname>
<given-names>L. M.</given-names>
</name>
<name>
<surname>Lynch</surname>
<given-names>C. F.</given-names>
</name>
<name>
<surname>Ansell</surname>
<given-names>S. M.</given-names>
</name>
<etal/>
</person-group> (<year>2008</year>). <article-title>Host Immune Gene Polymorphisms in Combination with Clinical and Demographic Factors Predict Late Survival in Diffuse Large B-Cell Lymphoma Patients in the Pre-rituximab Era</article-title>. <source>Blood</source> <volume>112</volume> (<issue>7</issue>), <fpage>2694</fpage>&#x2013;<lpage>2702</lpage>. <pub-id pub-id-type="doi">10.1182/blood-2007-09-111658</pub-id> </citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Heuck</surname>
<given-names>C. J.</given-names>
</name>
<name>
<surname>Qu</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>van Rhee</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Waheed</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Usmani</surname>
<given-names>S. Z.</given-names>
</name>
<name>
<surname>Epstein</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>Five Gene Probes Carry Most of the Discriminatory Power of the 70-gene Risk Model in Multiple Myeloma</article-title>. <source>Leukemia</source> <volume>28</volume> (<issue>12</issue>), <fpage>2410</fpage>&#x2013;<lpage>2413</lpage>. <pub-id pub-id-type="doi">10.1038/leu.2014.232</pub-id> </citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kazandjian</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Multiple Myeloma Epidemiology and Survival: A Unique Malignancy</article-title>. <source>Semin. Oncol.</source> <volume>43</volume> (<issue>6</issue>), <fpage>676</fpage>&#x2013;<lpage>681</lpage>. <pub-id pub-id-type="doi">10.1053/j.seminoncol.2016.11.004</pub-id> </citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ko</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Yoon</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Piccoli</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Chung</surname>
<given-names>H. S.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>J.&#x20;R.</given-names>
</name>
<etal/>
</person-group> (<year>2006</year>). <article-title>Organization of the Presynaptic Active Zone by ERC2/CAST1-dependent Clustering of the Tandem PDZ Protein Syntenin-1</article-title>. <source>J.&#x20;Neurosci.</source> <volume>26</volume> (<issue>3</issue>), <fpage>963</fpage>&#x2013;<lpage>970</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.4475-05.2006</pub-id> </citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kumar</surname>
<given-names>S. K.</given-names>
</name>
<name>
<surname>Dispenzieri</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Lacy</surname>
<given-names>M. Q.</given-names>
</name>
<name>
<surname>Gertz</surname>
<given-names>M. A.</given-names>
</name>
<name>
<surname>Buadi</surname>
<given-names>F. K.</given-names>
</name>
<name>
<surname>Pandey</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>Continued Improvement in Survival in Multiple Myeloma: Changes in Early Mortality and Outcomes in Older Patients</article-title>. <source>Leukemia</source> <volume>28</volume> (<issue>5</issue>), <fpage>1122</fpage>&#x2013;<lpage>1128</lpage>. <pub-id pub-id-type="doi">10.1038/leu.2013.313</pub-id> </citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kumar</surname>
<given-names>S. K.</given-names>
</name>
<name>
<surname>Rajkumar</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Kyle</surname>
<given-names>R. A.</given-names>
</name>
<name>
<surname>van Duin</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Sonneveld</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Mateos</surname>
<given-names>M. V.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Multiple Myeloma</article-title>. <source>Nat. Rev. Dis. Primers</source> <volume>3</volume>, <fpage>17046</fpage>. <pub-id pub-id-type="doi">10.1038/nrdp.2017.46</pub-id> </citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lakshman</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Paul</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Rajkumar</surname>
<given-names>S. V.</given-names>
</name>
<name>
<surname>Ketterling</surname>
<given-names>R. P.</given-names>
</name>
<name>
<surname>Greipp</surname>
<given-names>P. T.</given-names>
</name>
<name>
<surname>Dispenzieri</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Prognostic Significance of Interphase FISH in Monoclonal Gammopathy of Undetermined Significance</article-title>. <source>Leukemia</source> <volume>32</volume> (<issue>8</issue>), <fpage>1811</fpage>&#x2013;<lpage>1815</lpage>. <pub-id pub-id-type="doi">10.1038/s41375-018-0030-3</pub-id> </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Mi</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Zeng</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Cai</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Ma</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Incidence and Mortality of Multiple Myeloma in China, 2006-2016: an Analysis of the Global Burden of Disease Study 2016</article-title>. <source>J.&#x20;Hematol. Oncol.</source> <volume>12</volume> (<issue>1</issue>), <fpage>136</fpage>. <pub-id pub-id-type="doi">10.1186/s13045-019-0807-5</pub-id> </citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Padhi</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Varghese</surname>
<given-names>R. G.</given-names>
</name>
<name>
<surname>Ramdas</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Cyclin D1 Expression in Multiple Myeloma by Immunohistochemistry: Case Series of 14 Patients and Literature Review</article-title>. <source>Indian J.&#x20;Med. Paediatr. Oncol.</source> <volume>34</volume> (<issue>4</issue>), <fpage>283</fpage>&#x2013;<lpage>291</lpage>. <pub-id pub-id-type="doi">10.4103/0971-5851.125246</pub-id> </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Palumbo</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Avet-Loiseau</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Oliva</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Lokhorst</surname>
<given-names>H. M.</given-names>
</name>
<name>
<surname>Goldschmidt</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Rosinol</surname>
<given-names>L.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>Revised International Staging System for Multiple Myeloma: A Report from International Myeloma Working Group</article-title>. <source>J.&#x20;Clin. Oncol.</source> <volume>33</volume> (<issue>26</issue>), <fpage>2863</fpage>&#x2013;<lpage>2869</lpage>. <pub-id pub-id-type="doi">10.1200/JCO.2015.61.2267</pub-id> </citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rajkumar</surname>
<given-names>S. V.</given-names>
</name>
<name>
<surname>Gupta</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Fonseca</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Dispenzieri</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Gonsalves</surname>
<given-names>W. I.</given-names>
</name>
<name>
<surname>Larson</surname>
<given-names>D.</given-names>
</name>
<etal/>
</person-group> (<year>2013</year>). <article-title>Impact of Primary Molecular Cytogenetic Abnormalities and Risk of Progression in Smoldering Multiple Myeloma</article-title>. <source>Leukemia</source> <volume>27</volume> (<issue>8</issue>), <fpage>1738</fpage>&#x2013;<lpage>1744</lpage>. <pub-id pub-id-type="doi">10.1038/leu.2013.86</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rajkumar</surname>
<given-names>S. V.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Multiple Myeloma: 2020 Update on Diagnosis, Risk-Stratification and Management</article-title>. <source>Am. J.&#x20;Hematol.</source> <volume>95</volume> (<issue>5</issue>), <fpage>548</fpage>&#x2013;<lpage>567</lpage>. <pub-id pub-id-type="doi">10.1002/ajh.25791</pub-id> </citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shaughnessy</surname>
<given-names>J.&#x20;D.</given-names>
<suffix>Jr.</suffix>
</name>
<name>
<surname>Zhan</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Burington</surname>
<given-names>B. E.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Colla</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Hanamura</surname>
<given-names>I.</given-names>
</name>
<etal/>
</person-group> (<year>2007</year>). <article-title>A Validated Gene Expression Model of High-Risk Multiple Myeloma Is Defined by Deregulated Expression of Genes Mapping to Chromosome 1</article-title>. <source>Blood</source> <volume>109</volume> (<issue>6</issue>), <fpage>2276</fpage>&#x2013;<lpage>2284</lpage>. <pub-id pub-id-type="doi">10.1182/blood-2006-07-038430</pub-id> </citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shimo</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Nishidate</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Ohta</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Fukuda</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Nakamura</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Katagiri</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Elevated Expression of Protein Regulator of Cytokinesis 1, Involved in the Growth of Breast Cancer Cells</article-title>. <source>Cancer Sci.</source> <volume>98</volume> (<issue>2</issue>), <fpage>174</fpage>&#x2013;<lpage>181</lpage>. <pub-id pub-id-type="doi">10.1111/j.1349-7006.2006.00381.x</pub-id> </citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sonneveld</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Broijl</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Treatment of Relapsed and Refractory Multiple Myeloma</article-title>. <source>Haematologica</source> <volume>101</volume> (<issue>4</issue>), <fpage>396</fpage>&#x2013;<lpage>406</lpage>. <pub-id pub-id-type="doi">10.3324/haematol.2015.129189</pub-id> </citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tian</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>McKnight</surname>
<given-names>S. L.</given-names>
</name>
<name>
<surname>Russell</surname>
<given-names>D. W.</given-names>
</name>
</person-group> (<year>1997</year>). <article-title>Endothelial PAS Domain Protein 1 (EPAS1), a Transcription Factor Selectively Expressed in Endothelial Cells</article-title>. <source>Genes Dev.</source> <volume>11</volume> (<issue>1</issue>), <fpage>72</fpage>&#x2013;<lpage>82</lpage>. <pub-id pub-id-type="doi">10.1101/gad.11.1.72</pub-id> </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhan</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Xi</surname>
<given-names>G. M.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>H. B.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y. F.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>PRC1 Contributes to Tumorigenesis of Lung Adenocarcinoma in Association with the Wnt/beta-Catenin Signaling Pathway</article-title>. <source>Mol. Cancer</source> <volume>16</volume> (<issue>1</issue>), <fpage>108</fpage>. <pub-id pub-id-type="doi">10.1186/s12943-017-0682-z</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>