<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Nutr.</journal-id>
<journal-title>Frontiers in Nutrition</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Nutr.</abbrev-journal-title>
<issn pub-type="epub">2296-861X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fnut.2023.1066749</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Nutrition</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Contribution of macro- and micronutrients intake to gastrointestinal cancer mortality in the ONCONUT cohort: Classical vs. modern approaches</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Donghia</surname> <given-names>Rossella</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/2045986/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Guerra</surname> <given-names>Vito</given-names></name>
</contrib>
<contrib contrib-type="author">
<name><surname>Pesole</surname> <given-names>Pasqua Letizia</given-names></name>
</contrib>
<contrib contrib-type="author">
<name><surname>Liso</surname> <given-names>Marina</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/2139853/overview"/>
</contrib>
</contrib-group>
<aff><institution>National Institute of Gastroenterology, IRCCS &#x201C;S. de Bellis,&#x201D; Research Hospital</institution>, <addr-line>Bari</addr-line>, <country>Italy</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Angela M. Zivkovic, University of California, Davis, United States</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Salvatore Vaccaro, IRCCS Local Health Authority of Reggio Emilia, Italy; Omid Sadeghi, Isfahan University of Medical Sciences, Iran</p></fn>
<corresp id="c001">&#x002A;Correspondence: Rossella Donghia, <email>rossella.donghia@irccsdebellis.it</email></corresp>
<fn fn-type="other" id="fn004"><p>This article was submitted to Nutrition Methodology, a section of the journal Frontiers in Nutrition</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>23</day>
<month>01</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>10</volume>
<elocation-id>1066749</elocation-id>
<history>
<date date-type="received">
<day>13</day>
<month>10</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>09</day>
<month>01</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2023 Donghia, Guerra, Pesole and Liso.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Donghia, Guerra, Pesole and Liso</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>The aim of this study was to evaluate the contribution of macro- and micronutrients intake to mortality in patients with gastrointestinal cancer, comparing the classical statistical approaches with a new generation algorithm. In 1992, the ONCONUT project was started with the aim of evaluating the relationship between diet and cancer development in a Southern Italian elderly population. Patients who died of specific death causes (ICD-10 from 150.0 to 159.9) were included in the study (<italic>n</italic> = 3,505) and survival analysis was applied. This cohort was used to test the performance of different techniques, namely Cox proportional-hazards model, random survival forest (RSF), Survival Support Vector Machine (SSVM), and C-index, applied to quantify the performance. Lastly, the new prediction mode, denominated Shapley Additive Explanation (SHAP), was adopted. RSF had the best performance (0.7653711 and 0.7725246, for macro- and micronutrients, respectively), while SSVM had the worst C-index (0.5667753 and 0.545222). SHAP was helpful to understand the role of single patient features on mortality. Using SHAP together with RSF and classical CPH was most helpful, and shows promise for future clinical applications.</p>
</abstract>
<kwd-group>
<kwd>gastrointestinal cancer</kwd>
<kwd>machine learning</kwd>
<kwd>biostatistics</kwd>
<kwd>cancers</kwd>
<kwd>nutrition</kwd>
</kwd-group>
<counts>
<fig-count count="9"/>
<table-count count="3"/>
<equation-count count="3"/>
<ref-count count="52"/>
<page-count count="11"/>
<word-count count="6209"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="intro">
<title>Introduction</title>
<p>Cancer is one of the deadliest diseases in the world. Biologically, it includes a collection of diseases in which the cells of the body start dividing uncontrollably without stopping. Normally, human cells grow and duplicate according to physiological needs and their number is controlled by the apoptotic process. In pathological conditions such as cancer, however, cells grow abnormally, old and damaged cells are not eliminated, and new cells duplicate uncontrollably. Malignant tumors invade surrounding tissues and compete with normal cells for nutrients. Additionally, these types of tumors can metastasize to new sites, often distant from the primary tumor site, through the blood or lymphatic system (<xref ref-type="bibr" rid="B1">1</xref>, <xref ref-type="bibr" rid="B2">2</xref>). According to the American Cancer Society annual report, 1,918,030 new cases of cancer were recorded in 2022 resulting in 609,360 deaths (<xref ref-type="bibr" rid="B3">3</xref>). The most common site of cancer development is the digestive system, and 343,040 new cases were discovered in that same year. The 5-year survival rate for all cancers has grown dramatically since the early 1960s, reaching nearly double, from 39 to 70% among white individuals and triple, from 27 to 63% among black patients. In particular, for gastrointestinal tumors, interest is now focused not only on the tumor growth rate but also on the cellular microenvironment, which is responsible for tumor initiation, progression and metastasis (<xref ref-type="bibr" rid="B4">4</xref>, <xref ref-type="bibr" rid="B5">5</xref>). One of the many causes of this dysregulation is certainly an incorrect diet (<xref ref-type="fig" rid="F1">Figure 1</xref>), lifestyle, and genetic background (<xref ref-type="bibr" rid="B6">6</xref>) which leads to an ever increasing rate of obesity and continues to remain the &#x201C;elephant in the room&#x201D; (<xref ref-type="bibr" rid="B7">7</xref>, <xref ref-type="bibr" rid="B8">8</xref>).</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>Component of diet.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnut-10-1066749-g001.tif"/>
</fig>
<p>Dietary intakes of macro- and micronutrients have been implicated in the etiology of chronic disease as gastrointestinal cancer (<xref ref-type="bibr" rid="B9">9</xref>, <xref ref-type="bibr" rid="B10">10</xref>). Several studies have shown that high dietary intakes of macronutrients (as fat, protein and carbohydrate) may have a positive association with the risk of developing of cancer (<xref ref-type="bibr" rid="B11">11</xref>). Additionally, fruits and vegetables as sources of dietary fiber, folate, vitamin C, and a lot of phenols, and flavonoids could protect, because are involved on trapping free radicals and reactive oxygen molecules at the cellular level [induced lipid peroxidation has been implicated in malignant transformation (<xref ref-type="bibr" rid="B12">12</xref>)], thus acting as a protective mechanism against the oxidative damage during digestion process (<xref ref-type="bibr" rid="B13">13</xref>, <xref ref-type="bibr" rid="B14">14</xref>). However, micronutrients with a significantly reduced intake also are involved in other kind of cancer, as the breast cancer risk in obese women (<xref ref-type="bibr" rid="B15">15</xref>).</p>
<p>The obesity, which is a slow and progressive condition, is mainly based on an ever increasing fatty acid intake vs. low amounts of fiber, vitamins and minerals, widely present in fruit and vegetables (<xref ref-type="bibr" rid="B16">16</xref>, <xref ref-type="bibr" rid="B17">17</xref>). For example nuts contain magnesium, unsaturated fatty acids, potassium, fiber, and vitamin E (<xref ref-type="bibr" rid="B18">18</xref>), while dairy products and eggs (<xref ref-type="bibr" rid="B19">19</xref>) contain a well-balanced composition of micronutrients that could help for the prevention of hypertension (<xref ref-type="bibr" rid="B20">20</xref>). However, also consumption of particular type of vegetable food as N. sativa could reduce oxidative stress and inflammation with various mechanisms including a reduction of lipid peroxidation via its antioxidant properties; agonist of PPAR-&#x03B3; in adipose tissue; activation of AMPK, increased antioxidants inhibition of NF-&#x03BA;B pathway (<xref ref-type="bibr" rid="B21">21</xref>, <xref ref-type="bibr" rid="B22">22</xref>).</p>
<p>The obesity rate associated with the development of gastrointestinal cancer (GC) is estimated to rise by about 18% overall by 2025, affecting 18% of men and 21% of women. Studies have shown that weight disorders such as overweight and obesity are usually associated also with sleep problems, and some studies have shown that adherence to DASH diet has helped to correct this trend (<xref ref-type="bibr" rid="B23">23</xref>). Furthermore, these disorders could be associated with the Non-alcoholic Fatty Liver Disease (NAFLD) dyslipidemia, and insulin resistance (<xref ref-type="bibr" rid="B24">24</xref>).</p>
<p>One of the many opportunities available to study this increase, as well as the classical clinical approaches, is to apply machine learning (ML) as a basis on which to predict cancer survival. A huge amount of research has been conducted across a wide range of statistical approaches. Classical survival analysis is a well-established methodology for estimating a patient&#x2019;s probability of survival. Although this is now validated, research is also focused on identifying new methods for prediction and calculating their accuracy. In this work we aim to evaluate survival of GC patients using the classical statistical method, Cox Proportional Hazards (CPH) and compare it with new generation algorithms such as Random Survival Forest (RSF), and Survival Support Vector Machine (SSVM), together with a new approach to quantify the power of parameters&#x2019; prediction on GC, denominated Shapley Additive Explanation (SHAP) (<xref ref-type="table" rid="T1">Table 1</xref>).</p>
<table-wrap position="float" id="T1">
<label>TABLE 1</label>
<caption><p>General view of machine learning techniques.</p></caption>
<table cellspacing="5" cellpadding="5" frame="box" rules="all">
<thead>
<tr>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;">Methods</td>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;">Assumptions</td>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;">Advantages</td>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;">Disadvantages</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">CPH</td>
<td valign="top" align="left">Proportion hazard ratios are constant over time</td>
<td valign="top" align="left">Distribution knowledge of survival times not required</td>
<td valign="top" align="left">Not easy to interpret</td>
</tr>
<tr>
<td valign="top" align="left">RSF</td>
<td valign="top" align="left">Adapts tree structured for survival analysis</td>
<td valign="top" align="left">On single survival tree</td>
<td valign="top" align="left">Not easy to interpret for categorical variables</td>
</tr>
<tr>
<td valign="top" align="left">SSVM</td>
<td valign="top" align="left">Adapts support vector regression for survival analysis</td>
<td valign="top" align="left">Works well for unstructured data and it is good for scaling</td>
<td valign="top" align="left">Not easy to interpret</td>
</tr>
<tr>
<td valign="top" align="left">SHAP</td>
<td valign="top" align="left">Features independence</td>
<td valign="top" align="left">The prediction is fairly distributed among the feature values</td>
<td valign="top" align="left">It is possible to create intentionally misleading interpretations</td>
</tr>
</tbody>
</table></table-wrap>
<p>The use of ML approaches proved to enhance the classical statistical methods, improving cancer diagnosis, prognosis, detection and prediction (<xref ref-type="bibr" rid="B25">25</xref>), thus providing researchers with powerful tools. The use of classical approaches, ML methods and SHAP is helpful to solve the problem of the ML algorithm, because it is based on the &#x201C;black-box&#x201D; concept (<xref ref-type="bibr" rid="B26">26</xref>).</p>
<sec id="S1.SS1">
<title>Models</title>
<sec id="S1.SS1.SSS1">
<title>Cox proportional hazard</title>
<p>Survival analysis is an integral part of inferential statistics, in which a time variable describes when an event will occur (<xref ref-type="bibr" rid="B27">27</xref>). In fact, if we consider studies recruiting patients, it must be possible to describe each of them based on two values, the duration of the condition (t) and the status (alive/event). The time between the initial moment and the terminal event (not necessarily death) is represented by a random variable <italic>T</italic> (<italic>T</italic> &#x2265; 0), defined as the &#x201C;survival time.&#x201D; Mathematically, survival analysis is linked to conditional probability, that represents whether a patient will survive after a certain time t from his entry into the study, and is therefore conditioned by the fact that the patient survived the previous days. Thus, we also define survival analysis as cumulative probability or cumulative survival. Supposing that this random variable <italic>T</italic> confers a certain probability of surviving, we define F(<italic>t</italic>) as the probability function with a certain density. This distribution function of <italic>T</italic> is of the type:</p>
<disp-formula id="S1.Ex1">
<mml:math id="M1">
<mml:mrow>
<mml:mi>F</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>T</mml:mi>
<mml:mo>&lt;</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mo largeop="true" symmetric="true">&#x222B;</mml:mo>
<mml:mn>0</mml:mn>
<mml:mi>t</mml:mi>
</mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>u</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>u</mml:mi>
</mml:mrow>
</mml:math>
</disp-formula>
<p>This represents the probability that the survival time will be less than the value of <italic>t</italic>. The survival function S(<italic>t</italic>) is defined as the probability that the survival time will be greater than or equal to <italic>t</italic>, according to this following complementary equation with distribution function:</p>
<disp-formula id="S1.Ex2">
<mml:math id="M2">
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>T</mml:mi>
<mml:mo rspace="5.8pt">&#x2265;</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>-</mml:mo>
<mml:mi>F</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>The survival function can be used to represent the probability that an individual will survive beyond the initial time for a certain time <italic>t</italic>. Survival analysis methods are classified as non-parametric, semi-parametric and parametric, based on the assumptions made on the distribution of <italic>T</italic>.</p>
<p>The CPH model is a form of multivariate survival analysis that can control other factors. The dependent variable is time to event (or survival time), which can be death or a clinical event (e.g., myocardial infarction) and is generally used to study the association between survival time and potential predictor variables (<xref ref-type="bibr" rid="B28">28</xref>). Specifically, the goal of this model is to evaluate the effect of multiple factors on survival, as well as to examine the occurrence rate of a given event at a particular moment of time. This rate is also called the &#x201C;Hazard.&#x201D; The predictor variables are called covariates. The Cox model is defined with the Hazard function which can be defined as follows:</p>
<disp-formula id="S1.Ex3">
<mml:math id="M3">
<mml:mrow>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo rspace="5.8pt">&#x00D7;</mml:mo>
<mml:mpadded width="+3.3pt">
<mml:mtext>exp</mml:mtext>
</mml:mpadded>
</mml:mrow>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="normal">&#x22EF;</mml:mi>
<mml:mo>+</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>p</mml:mi>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>p</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where <italic>t</italic> represents the survival time, while h<sub>0</sub> is the basis of the hazard, exp (b<sub>1</sub>) is the Hazard Ratio. In this way, values equal to 1 can be taken to mean the absence of risk, values &#x003E; 1 an increased risk, while when &#x003C; 1 the protection from the risk itself. The applicability of this model must be preceded by an analysis to demonstrate the assumption of proportionality, or to demonstrate what effect each covariate has on the constant hazard function for each time. When this assumption is not proven, then a stratified Cox model should be employed. Generally this condition is rarely unproven, so the Cox model is widely used. It can also be shown graphically by means of a graph of the Schoenfeld residuals, which must be non-random to demonstrate the proportionality of the previously fitted model (<xref ref-type="bibr" rid="B29">29</xref>). As in the case of the regression model, the Cox model can also be performed &#x201C;net&#x201D; of certain covariates, in order to purify the effect of the association under study.</p>
</sec>
<sec id="S1.SS1.SSS2">
<title>Survival random forest</title>
<p>Random Survival Forest is an adaptation of classical Random Forest (RF) because it includes survival parameters such as status and time. In general, these techniques are classified as an ensemble tree method, i.e., based on classification trees.</p>
<p>An RSF is computed by an ensemble of binary decision trees which can be used to select the most important variables linked with time to event in terms of mortality. Bootstrapping and random node splitting are applied to obtain an ensemble of independent decision trees.</p>
<p>Variable predictiveness can be assessed using variable importance measures for both single and grouped variables.</p>
<p>In a selection process the error rate is approximated by the Out-Of-Bag error (OOB) during the training process. In each tree of the random forest, the OOB error is calculated based on predictions of observations (<xref ref-type="bibr" rid="B30">30</xref>).</p>
<p>The variable selection is implemented using the minimal depth of a respective variable, determined in each decision tree of a RSF as the distance from the root node to the closest node (<xref ref-type="bibr" rid="B31">31</xref>).</p>
</sec>
<sec id="S1.SS1.SSS3">
<title>Survival support vector machines</title>
<p>The SSVM methodology was developed by Cortes and Vapnik (<xref ref-type="bibr" rid="B31">31</xref>) and Kiang et al. (<xref ref-type="bibr" rid="B32">32</xref>). These models are based on discriminating two classes of observations by a linear decision surface (defined as the hyperplane) maximizing the distance between the hyperplane and single observations. If the classes are not separable by a linear surface, a non-linear transformation can be obtained through mapping the data on a different dimension space (feature space). By using a kernel function, it is possible to construct the separating hyperplane without explicitly carrying the map into feature space (<xref ref-type="bibr" rid="B33">33</xref>).</p>
<p>These models are a powerful tool to analyze this type of data because of their performance in analyzing sparse data, i.e., data with many or more predictors than observations. SSVMs have been widely applied to analyze binary outcomes or for datasets with a survival outcome.</p>
</sec>
<sec id="S1.SS1.SSS4">
<title>Shapley Additive Explanations (SHAP)</title>
<p>This was proposed for the first time by Molan (<xref ref-type="bibr" rid="B34">34</xref>). It is a particular type of model based on tree features importance, but with greater accuracy and consistency properties. A Sharpley value is defined as the average marginal contribution of feature values across all possible feature coalitions. Each value can be interpreted as the difference between the actual prediction and the average prediction on the whole dataset (<xref ref-type="bibr" rid="B35">35</xref>).</p>
<p>In general, the SHAP values test each combination of predictors to assess the effect of each single predictor, based on the game theory and conditional assumptions (<xref ref-type="bibr" rid="B36">36</xref>).</p>
</sec>
<sec id="S1.SS1.SSS5">
<title>Concordance index</title>
<p>This index measures how well models predict time to death of patients. It is easy to interpret because a value of c = 0.5 represents the average performance, i.e., no predictive discrimination, while a model with c &#x003E; 0.5, predicts well and shows a good capability to distinguish patients with events, whereas c &#x003C; 0.5 is defined as the worst model (<xref ref-type="bibr" rid="B37">37</xref>).</p>
</sec>
</sec>
</sec>
<sec id="S2" sec-type="materials|methods">
<title>Materials and methods</title>
<sec id="S2.SS1">
<title>Ethical considerations</title>
<p>The studies were conducted in accordance with the principles of the Declaration of Helsinki and approved by the local Ethics Committee. All enrolled subjects provided written informed consent.</p>
</sec>
<sec id="S2.SS2">
<title>Participants</title>
<p>In 1992, the ONCONUT prospective cohort was started with the goal of evaluating the relationship between diet and cancer development in a Southern Italian elderly population (<italic>n</italic> = 35,000). The study was sponsored by the Italian National Institute of Health and carried out by the Epidemiology and Biostatistics Laboratory of the National Institute of Gastroenterology &#x201C;Saverio De Bellis,&#x201D; Research Hospital, Castellana Grotte (Bari), Italy. From April 1992 to July 1993, patients referred to the Clinical Pathology Laboratories of the three Unit&#x00E0; Sanitaria Locale (USL) Bari 16 areas (Municipalities of Monopoli and Polignano a Mare (Bari), Italy), BA 17 (Municipalities of Gioia del Colle and Santeramo in Colle (Bari), Italy) and Bari 18 (Municipalities of Castellana Grotte, Turi, Putignano, Noci, Alberobello and Locorotondo (Bari), Italy) were estimated to amount to 11,622, but only 5,632 subjects (48.46%) (ONCONUT 1) completed about 90% of the semi-quantitative food frequency questionnaire (FFQ). After 5 years, 4,563 patients returned to the Clinical Pathology Laboratories, and compose the ONCONUT 2. After excluding cases other than those with gastrointestinal disease (other types of cancers), 3,505 (76.81%) presented complete data for survival analysis.</p>
<p>The survival rate with GC (ICD-10, codes from 150.0 to 159.9) during these years 1992&#x2013;1993 (ONCONUT 1) was considered as the main outcome. Food conversion into nutrients (macro- and micronutrients) and calories was performed using the Italian National Institute of Nutrition Food Composition. Tables were integrated with data from Fidanza (<xref ref-type="bibr" rid="B38">38</xref>), using a validated semi-quantitative FFQ administered to participants. The glycemic index (GI) derived from each food (<xref ref-type="bibr" rid="B39">39</xref>) was calculated using tables and the glycemic load (GL), as suggested by Foster-Powell et al. (<xref ref-type="bibr" rid="B40">40</xref>). The present investigation was conducted following the &#x201C;Standards for Reporting Diagnostic Accuracy Studies&#x201D; (STARD) guidelines, and the manuscript was organized following the &#x201C;Strengthening the Reporting of Observational Studies in Epidemiology&#x2014;Nutritional Epidemiology&#x201D; (STROBE-nut) guidelines (<xref ref-type="bibr" rid="B41">41</xref>).</p>
</sec>
<sec id="S2.SS3">
<title>Statistical analysis</title>
<p>Patients characteristics are reported as Mean &#x00B1; Standard Deviation (M &#x00B1; SD) for continuous variables, and as frequencies and percentages (%) for categorical variables.</p>
<p>To test the associations between groups (Dead <italic>vs</italic> Alive), Chi-square test or Fisher&#x2019;s exact test for categorical variables were applied, as necessary, while the Wilcoxon rank sum (Mann-Whitney) test was applied for continuous variables.</p>
<p>For studying the risk on mortality, the Cox model was used. The CPH model was fitted to the data, and the proportional hazard assumption was evaluated by means of Schoenfeld residuals (SRT).</p>
<p>We randomly split the data into the training and testing subgroups for CPH, RSF and SSVM.</p>
<p>The training data included 75% of the sample (<italic>n</italic> = 2,629) while the remaining data, the test data, accounting for 25% (<italic>n</italic> = 876), were used to test the model.</p>
<p>To ensure repeatability in terms of estimation, the same seed was used.</p>
<p>After running these models the C-Index was used to evaluate the performance of the model on the test subset. Finally, to compare differences between C-Index values from the models values, paired Student&#x2019;s <italic>t</italic>-test was used.</p>
<p>When testing the null hypothesis of no association, the probability level of error, at two-tailed, was 0.05. All statistical computations were made using StataCorp. (<xref ref-type="bibr" rid="B42">42</xref>) and RStudio software (&#x201C;Prairie Trillium&#x201D; Release).</p>
</sec>
</sec>
<sec id="S3" sec-type="results">
<title>Results</title>
<p>We randomly subdivided patients and allocated 75% to the training set, and the remaining 25% to the test set.</p>
<p>The demographic and food habits are reported in <xref ref-type="table" rid="T2">Table 2</xref> for the total cohort. Mean age was 65.01 &#x00B1; 8.76 years and 38.26% of patients were male.</p>
<table-wrap position="float" id="T2">
<label>TABLE 2</label>
<caption><p>Baseline characteristics of patients and macro- and micronutrients intake.</p></caption>
<table cellspacing="5" cellpadding="5" frame="box" rules="all">
<thead>
<tr>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;">Parameters<xref ref-type="table-fn" rid="t2fns1">&#x002A;</xref></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">M &#x00B1; SD or %</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Age (years)</td>
<td valign="top" align="center">65.01 &#x00B1; 8.76</td>
</tr>
<tr>
<td valign="top" align="left">Gender (M) (%)</td>
<td valign="top" align="center">1,341 (38.26)</td>
</tr>
<tr>
<td valign="top" align="left" colspan="2" style="background-color: #dcdcdc;"><bold>Educational level (%)</bold></td>
</tr>
<tr>
<td valign="top" align="left">None</td>
<td valign="top" align="center">956 (20.45)</td>
</tr>
<tr>
<td valign="top" align="left">Primary school diploma</td>
<td valign="top" align="center">2,045 (58.78)</td>
</tr>
<tr>
<td valign="top" align="left">Middle school diploma</td>
<td valign="top" align="center">324 (9.31)</td>
</tr>
<tr>
<td valign="top" align="left">Diploma</td>
<td valign="top" align="center">127 (3.65)</td>
</tr>
<tr>
<td valign="top" align="left">University degree</td>
<td valign="top" align="center">27 (0.78)</td>
</tr>
<tr>
<td valign="top" align="left">Smoking habits (yes) (%)</td>
<td valign="top" align="center">377 (10.90)</td>
</tr>
<tr>
<td valign="top" align="left" colspan="2" style="background-color: #dcdcdc;"><bold>Marital status (%)</bold></td>
</tr>
<tr>
<td valign="top" align="left">Single</td>
<td valign="top" align="center">168 (4.90)</td>
</tr>
<tr>
<td valign="top" align="left">Married or cohabiting</td>
<td valign="top" align="center">2,622 (76.51)</td>
</tr>
<tr>
<td valign="top" align="left">Separated or divorced</td>
<td valign="top" align="center">32 (0.93)</td>
</tr>
<tr>
<td valign="top" align="left">Widower</td>
<td valign="top" align="center">605 (17.65)</td>
</tr>
<tr>
<td valign="top" align="left">BMI (kg/cm<sup>2</sup>)</td>
<td valign="top" align="center">26.53 &#x00B1; 4.27</td>
</tr>
<tr>
<td valign="top" align="left">Glycemic index</td>
<td valign="top" align="center">56.15 &#x00B1; 4.67</td>
</tr>
<tr>
<td valign="top" align="left">Glycemic load</td>
<td valign="top" align="center">135.83 &#x00B1; 71.57</td>
</tr>
<tr>
<td valign="top" align="left">Diabetes (yes) (%)</td>
<td valign="top" align="center">766 (23.00)</td>
</tr>
<tr>
<td valign="top" align="left">Myocardial infarction (yes) (%)</td>
<td valign="top" align="center">195 (5.99)</td>
</tr>
<tr>
<td valign="top" align="left" colspan="2" style="background-color: #dcdcdc;"><bold>Macronutrients<xref ref-type="table-fn" rid="t2fns2"><sup>&#x03C8;</sup></xref></bold></td>
</tr>
<tr>
<td valign="top" align="left">Water (g)</td>
<td valign="top" align="center">1790.75 &#x00B1; 731.44</td>
</tr>
<tr>
<td valign="top" align="left">Proteins (g)</td>
<td valign="top" align="center">69.05 &#x00B1; 29.49</td>
</tr>
<tr>
<td valign="top" align="left">Lipids (g)</td>
<td valign="top" align="center">76.22 &#x00B1; 27.94</td>
</tr>
<tr>
<td valign="top" align="left">Available carbohydrates (g)</td>
<td valign="top" align="center">250.89 &#x00B1; 121.05</td>
</tr>
<tr>
<td valign="top" align="left">Fatty acids (g)</td>
<td valign="top" align="center">131.72 &#x00B1; 74.22</td>
</tr>
<tr>
<td valign="top" align="left">Soluble carbohydrates (g)</td>
<td valign="top" align="center">101.86 &#x00B1; 63.87</td>
</tr>
<tr>
<td valign="top" align="left">Total fiber (g)</td>
<td valign="top" align="center">26.28 &#x00B1; 13.89</td>
</tr>
<tr>
<td valign="top" align="left">Saturated fatty acids (g)</td>
<td valign="top" align="center">20.36 &#x00B1; 9.25</td>
</tr>
<tr>
<td valign="top" align="left">Monounsaturated fatty acids (g)</td>
<td valign="top" align="center">40.52 &#x00B1; 15.07</td>
</tr>
<tr>
<td valign="top" align="left">Polyunsaturated fatty acids (g)</td>
<td valign="top" align="center">8.31 &#x00B1; 3.21</td>
</tr>
<tr>
<td valign="top" align="left">Cholesterol (mg)</td>
<td valign="top" align="center">183.55 &#x00B1; 105.68</td>
</tr>
<tr>
<td valign="top" align="left">Alcohol (mg)</td>
<td valign="top" align="center">15.41 &#x00B1; 19.76</td>
</tr>
<tr>
<td valign="top" align="left" colspan="2" style="background-color: #dcdcdc;"><bold>Micronutrients<xref ref-type="table-fn" rid="t2fns2"><sup>&#x03C8;</sup></xref></bold></td>
</tr>
<tr>
<td valign="top" align="left">Sodium (mg)</td>
<td valign="top" align="center">1450.37 &#x00B1; 837.52</td>
</tr>
<tr>
<td valign="top" align="left">Potassium (mg)</td>
<td valign="top" align="center">3330.30 &#x00B1; 1658.44</td>
</tr>
<tr>
<td valign="top" align="left">Iron (mg)</td>
<td valign="top" align="center">11.19 &#x00B1; 4.92</td>
</tr>
<tr>
<td valign="top" align="left">Calcium (mg)</td>
<td valign="top" align="center">851.19 &#x00B1; 469.13</td>
</tr>
<tr>
<td valign="top" align="left">Phosphorus (mg)</td>
<td valign="top" align="center">1144.59 &#x00B1; 483.75</td>
</tr>
<tr>
<td valign="top" align="left">Thiamin (mg)</td>
<td valign="top" align="center">0.78 &#x00B1; 0.35</td>
</tr>
<tr>
<td valign="top" align="left">Riboflavin (mg)</td>
<td valign="top" align="center">1.41 &#x00B1; 0.62</td>
</tr>
<tr>
<td valign="top" align="left">Vitamin A (&#x03BC;g)</td>
<td valign="top" align="center">1146.33 &#x00B1; 942.73</td>
</tr>
<tr>
<td valign="top" align="left">Vitamin C (mg)</td>
<td valign="top" align="center">170.74 &#x00B1; 122.68</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="t2fns1"><p>ONCONUT study (n = 3,505), total cohort. &#x002A;As mean and standard deviation (M &#x00B1; SD) for continuous variables and percentage (%) for categorical variables.</p></fn>
<fn><p>BMI, body mass index.</p></fn>
<fn id="t2fns2"><p><sup>&#x03C8;</sup>Calculated on quantity of daily consumption.</p></fn>
</table-wrap-foot>
</table-wrap>
<p><xref ref-type="table" rid="T3">Table 3</xref> shows the difference between alive and dead GC patients. Patients who were older, male, separated, divorced or widower, and with lower education levels were more represented than the others, likewise patients with diabetes and myocardial infarction (<italic>p</italic> &#x003C; 0.001 respectively). Lower intakes of both macro- and micronutrients were confirmed in dead patients, where all comparisons resulted statistically significant (<italic>p</italic> &#x003C; 0.05).</p>
<table-wrap position="float" id="T3">
<label>TABLE 3</label>
<caption><p>Comparison between patients with and without mortality event in the total cohort.</p></caption>
<table cellspacing="5" cellpadding="5" frame="box" rules="all">
<thead>
<tr>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;">Parameters<xref ref-type="table-fn" rid="t3fns1">&#x002A;</xref></td>
<td valign="top" align="center" colspan="3" style="color:#ffffff;background-color: #7f8080;">Status</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;"></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>Alive (<italic>n</italic> = 2,847)</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>Died (<italic>n</italic> = 658)</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><italic><bold>p</bold></italic><bold><xref ref-type="table-fn" rid="t3fns2"><sup>&#x00A7;</sup></xref></bold></td>
</tr>
<tr>
<td valign="top" align="left">Age (years)</td>
<td valign="top" align="center">63.49 &#x00B1; 8.05</td>
<td valign="top" align="center">71.57 &#x00B1; 8.67</td>
<td valign="top" align="center">&#x003C;0.0001</td>
</tr>
<tr>
<td valign="top" align="left">Gender (M) (%)</td>
<td valign="top" align="center">1,001 (35.16)</td>
<td valign="top" align="center">340 (51.67)</td>
<td valign="top" align="center">&#x003C;0.001<sup><xref ref-type="table-fn" rid="t3fns2">&#x2227;</xref></sup></td>
</tr>
<tr>
<td valign="top" align="left">Educational level (%)</td>
<td/>
<td/>
<td valign="top" align="center">0.01<sup><xref ref-type="table-fn" rid="t3fns2">&#x2227;</xref></sup></td>
</tr>
<tr>
<td valign="top" align="left">None</td>
<td valign="top" align="center">744 (26.32)</td>
<td valign="top" align="center">212 (32.52)</td>
<td/>
</tr>
<tr>
<td valign="top" align="left">Primary school diploma</td>
<td valign="top" align="center">1,676 (59.29)</td>
<td valign="top" align="center">369 (56.60)</td>
<td/>
</tr>
<tr>
<td valign="top" align="left">Middle school diploma</td>
<td valign="top" align="center">274 (9.69)</td>
<td valign="top" align="center">50 (7.67)</td>
<td/>
</tr>
<tr>
<td valign="top" align="left">Diploma</td>
<td valign="top" align="center">111 (3.93)</td>
<td valign="top" align="center">16 (2.45)</td>
<td/>
</tr>
<tr>
<td valign="top" align="left">University degree</td>
<td valign="top" align="center">22 (0.78)</td>
<td valign="top" align="center">5 (0.77)</td>
<td/>
</tr>
<tr>
<td valign="top" align="left">Smoking habit (yes) (%)</td>
<td valign="top" align="center">310 (11.03)</td>
<td valign="top" align="center">67 (10.34)</td>
<td valign="top" align="center">0.61<sup><xref ref-type="table-fn" rid="t3fns2">&#x2227;</xref></sup></td>
</tr>
<tr>
<td valign="top" align="left">Marital status (%)</td>
<td/>
<td/>
<td valign="top" align="center">&#x003C;0.001<sup><xref ref-type="table-fn" rid="t3fns2">&#x2227;</xref></sup></td>
</tr>
<tr>
<td valign="top" align="left">Single</td>
<td valign="top" align="center">137 (4.91)</td>
<td valign="top" align="center">31 (4.87)</td>
<td/>
</tr>
<tr>
<td valign="top" align="left">Married or cohabiting</td>
<td valign="top" align="center">2,192 (78.57)</td>
<td valign="top" align="center">430 (67.50)</td>
<td/>
</tr>
<tr>
<td valign="top" align="left">Separated or divorced</td>
<td valign="top" align="center">26 (0.93)</td>
<td valign="top" align="center">6 (0.94)</td>
<td/>
</tr>
<tr>
<td valign="top" align="left">Widower</td>
<td valign="top" align="center">435 (15.59)</td>
<td valign="top" align="center">170 (26.69)</td>
<td/>
</tr>
<tr>
<td valign="top" align="left">BMI (kg/cm<sup>2</sup>)</td>
<td valign="top" align="center">26.65 &#x00B1; 4.29</td>
<td valign="top" align="center">26.02 &#x00B1; 4.17</td>
<td valign="top" align="center">0.002</td>
</tr>
<tr>
<td valign="top" align="left">Glycemic index</td>
<td valign="top" align="center">56.21 &#x00B1; 4.63</td>
<td valign="top" align="center">55.90 &#x00B1; 4.83</td>
<td valign="top" align="center">0.18</td>
</tr>
<tr>
<td valign="top" align="left">Glycemic load</td>
<td valign="top" align="center">138.06 &#x00B1; 71.80</td>
<td valign="top" align="center">126.17 &#x00B1; 69.80</td>
<td valign="top" align="center">&#x003C;0.0001</td>
</tr>
<tr>
<td valign="top" align="left">Diabetes (yes) (%)</td>
<td valign="top" align="center">563 (20.83)</td>
<td valign="top" align="center">203 (32.32)</td>
<td valign="top" align="center">&#x003C;0.001<sup><xref ref-type="table-fn" rid="t3fns2">&#x2227;</xref></sup></td>
</tr>
<tr>
<td valign="top" align="left">Myocardial infarction (yes) (%)</td>
<td valign="top" align="center">120 (4.54)</td>
<td valign="top" align="center">75 (12.20)</td>
<td valign="top" align="center">&#x003C;0.001<sup><xref ref-type="table-fn" rid="t3fns2">&#x2227;</xref></sup></td>
</tr>
<tr>
<td valign="top" align="left" colspan="4" style="background-color: #dcdcdc;"><bold>Macronutrients<xref ref-type="table-fn" rid="t3fns3"><sup>&#x03C8;</sup></xref></bold></td>
</tr>
<tr>
<td valign="top" align="left">Water (g)</td>
<td valign="top" align="center">1806.86 &#x00B1; 725.56</td>
<td valign="top" align="center">1721.02 &#x00B1; 752.93</td>
<td valign="top" align="center">0.0002</td>
</tr>
<tr>
<td valign="top" align="left">Proteins (g)</td>
<td valign="top" align="center">69.98 &#x00B1; 30.08</td>
<td valign="top" align="center">65.00 &#x00B1; 26.47</td>
<td valign="top" align="center">0.0001</td>
</tr>
<tr>
<td valign="top" align="left">Lipids (g)</td>
<td valign="top" align="center">77.07 &#x00B1; 28.10</td>
<td valign="top" align="center">72.56 &#x00B1; 26.97</td>
<td valign="top" align="center">&#x003C;0.0001</td>
</tr>
<tr>
<td valign="top" align="left">Available carbohydrates (g)</td>
<td valign="top" align="center">254.54 &#x00B1; 121.36</td>
<td valign="top" align="center">235.10 &#x00B1; 118.49</td>
<td valign="top" align="center">&#x003C;0.0001</td>
</tr>
<tr>
<td valign="top" align="left">Fatty acids (g)</td>
<td valign="top" align="center">133.68 &#x00B1; 74.60</td>
<td valign="top" align="center">123.24 &#x00B1; 71.96</td>
<td valign="top" align="center">0.0004</td>
</tr>
<tr>
<td valign="top" align="left">Soluble carbohydrates (g)</td>
<td valign="top" align="center">103.25 &#x00B1; 63.80</td>
<td valign="top" align="center">95.86 &#x00B1; 63.87</td>
<td valign="top" align="center">0.0002</td>
</tr>
<tr>
<td valign="top" align="left">Total fiber (g)</td>
<td valign="top" align="center">26.63 &#x00B1; 13.80</td>
<td valign="top" align="center">24.76 &#x00B1; 14.15</td>
<td valign="top" align="center">&#x003C;0.0001</td>
</tr>
<tr>
<td valign="top" align="left">Saturated fatty acids (g)</td>
<td valign="top" align="center">20.62 &#x00B1; 9.46</td>
<td valign="top" align="center">19.26 &#x00B1; 8.24</td>
<td valign="top" align="center">0.0005</td>
</tr>
<tr>
<td valign="top" align="left">Monounsaturated fatty acids (g)</td>
<td valign="top" align="center">40.84 &#x00B1; 14.92</td>
<td valign="top" align="center">39.14 &#x00B1; 15.63</td>
<td valign="top" align="center">0.0006</td>
</tr>
<tr>
<td valign="top" align="left">Polyunsaturated fatty acids (g)</td>
<td valign="top" align="center">8.41 &#x00B1; 3.22</td>
<td valign="top" align="center">7.88 &#x00B1; 3.15</td>
<td valign="top" align="center">&#x003C;0.0001</td>
</tr>
<tr>
<td valign="top" align="left">Cholesterol (mg)</td>
<td valign="top" align="center">186.21 &#x00B1; 107.93</td>
<td valign="top" align="center">172.04 &#x00B1; 94.55</td>
<td valign="top" align="center">0.007</td>
</tr>
<tr>
<td valign="top" align="left">Alcohol (mg)</td>
<td valign="top" align="center">15.55 &#x00B1; 19.80</td>
<td valign="top" align="center">14.79 &#x00B1; 19.55</td>
<td valign="top" align="center">0.26</td>
</tr>
<tr>
<td valign="top" align="left" colspan="4" style="background-color: #dcdcdc;"><bold>Micronutrients<xref ref-type="table-fn" rid="t3fns3"><sup>&#x03C8;</sup></xref></bold></td>
</tr>
<tr>
<td valign="top" align="left">Sodium (mg)</td>
<td valign="top" align="center">1480.34 &#x00B1; 842.00</td>
<td valign="top" align="center">1320.69 &#x00B1; 805.72</td>
<td valign="top" align="center">&#x003C;0.0001</td>
</tr>
<tr>
<td valign="top" align="left">Potassium (mg)</td>
<td valign="top" align="center">3378.20 &#x00B1; 1644.73</td>
<td valign="top" align="center">3123.04 &#x00B1; 1702.33</td>
<td valign="top" align="center">&#x003C;0.0001</td>
</tr>
<tr>
<td valign="top" align="left">Iron (mg)</td>
<td valign="top" align="center">11.35 &#x00B1; 4.88</td>
<td valign="top" align="center">10.51 &#x00B1; 5.05</td>
<td valign="top" align="center">&#x003C;0.0001</td>
</tr>
<tr>
<td valign="top" align="left">Calcium (mg)</td>
<td valign="top" align="center">860.72 &#x00B1; 472.95</td>
<td valign="top" align="center">810.57 &#x00B1; 450.34</td>
<td valign="top" align="center">0.003</td>
</tr>
<tr>
<td valign="top" align="left">Phosphorus (mg)</td>
<td valign="top" align="center">1157.39 &#x00B1; 487.70</td>
<td valign="top" align="center">1089.20 &#x00B1; 462.54</td>
<td valign="top" align="center">0.0004</td>
</tr>
<tr>
<td valign="top" align="left">Thiamin (mg)</td>
<td valign="top" align="center">0.80 &#x00B1; 0.35</td>
<td valign="top" align="center">0.73 &#x00B1; 0.34</td>
<td valign="top" align="center">&#x003C;0.0001</td>
</tr>
<tr>
<td valign="top" align="left">Riboflavin (mg)</td>
<td valign="top" align="center">1.42 &#x00B1; 0.62</td>
<td valign="top" align="center">1.35 &#x00B1; 0.59</td>
<td valign="top" align="center">0.002</td>
</tr>
<tr>
<td valign="top" align="left">Vitamin A (&#x03BC;g)</td>
<td valign="top" align="center">1150.60 &#x00B1; 914.78</td>
<td valign="top" align="center">1127.83 &#x00B1; 1055.71</td>
<td valign="top" align="center">0.003</td>
</tr>
<tr>
<td valign="top" align="left">Vitamin C (mg)</td>
<td valign="top" align="center">173.76 &#x00B1; 121.13</td>
<td valign="top" align="center">157.66 &#x00B1; 128.47</td>
<td valign="top" align="center">&#x003C;0.0001</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="t3fns1"><p>&#x002A;As mean and standard deviation (M &#x00B1; SD) for continuous variables and percentage (%) for categorical variables.</p></fn>
<fn><p>BMI, body mass index.</p></fn>
<fn id="t3fns2"><p><sup>&#x00A7;</sup> Wilcoxon rank-sum test (Mann-Whitney), <sup>&#x2227;</sup>Chi-square or Fisher&#x2019;s test, where necessary.</p></fn>
<fn id="t3fns3"><p><sup>&#x03C8;</sup>Calculated on quantity of daily consumption.</p></fn>
</table-wrap-foot>
</table-wrap>
<p><xref ref-type="fig" rid="F2">Figures 2</xref>, <xref ref-type="fig" rid="F3">3</xref> describe the traditional Cox model adjusted for age and gender in the training set. For macronutrients (<xref ref-type="fig" rid="F2">Figure 2</xref>) only alcohol intake was associated with risk of mortality, with a low protective role (HR = 0.99, <italic>p</italic> = 0.008, 0.98&#x2013;1.0, 95% C.I.), while for micronutrients intake (<xref ref-type="fig" rid="F3">Figure 3</xref>) iron intake had a protective role against mortality (HR = 0.88, <italic>p</italic> = 0.002, 0.81&#x2013;0.95, 95% C.I.).</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>Multivariate Cox regression for macronutrients intake in training set.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnut-10-1066749-g002.tif"/>
</fig>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>Multivariate Cox regression for micronutrients intake in training set.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnut-10-1066749-g003.tif"/>
</fig>
<p><xref ref-type="fig" rid="F4">Figures 4</xref>, <xref ref-type="fig" rid="F5">5</xref> describe the importance plot for the RSF for the training set; in both models (<xref ref-type="fig" rid="F4">Figures 4</xref>, <xref ref-type="fig" rid="F5">5</xref>), alcohol and iron intake, as also the Cox model, had the most important values, contributing to the good prediction of mortality. To identify the features that influenced the prediction, the SHAP summary plot was used (<xref ref-type="fig" rid="F6">Figures 6</xref>, <xref ref-type="fig" rid="F7">7</xref>). Each point corresponds to a single patient in the testing set. The position on the x-axis, i.e., SHAP value, was the impact of the variable on the model output. Statistically, this represents the logarithm of the mortality risk. Patients with higher SHAP values had a higher risk of death, while lower SHAP values had lower risk. For macro- and micronutrients (<xref ref-type="fig" rid="F6">Figures 6</xref>, <xref ref-type="fig" rid="F7">7</xref>, respectively) except the time and age variables, all patients had SHAP value near to 0, except for fatty acids (for macronutrients) and Sodium intake (for micronutrients), when more patients had higher SHAP values.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p>Importance values for macronutrients in training set.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnut-10-1066749-g004.tif"/>
</fig>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption><p>Importance values for micronutrients in training set.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnut-10-1066749-g005.tif"/>
</fig>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption><p>Shapley Additive Explanation summary plot for macronutrients in training set.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnut-10-1066749-g006.tif"/>
</fig>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption><p>Shapley Additive Explanation summary plot for micronutrients in training set.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnut-10-1066749-g007.tif"/>
</fig>
<p><xref ref-type="fig" rid="F8">Figures 8</xref>, <xref ref-type="fig" rid="F9">9</xref> depict the quantification of the C-Index in the graph bar. Among macro- and micronutrients CPH and RSF had similar values (0.7649284 and 0.7653711, 0.764206 and 0.7725246, respectively), but the values for SSVM were statistically different for each statistical technique (0.5667753 and 0.545222, respectively, <italic>p</italic> &#x003C; 0.0001).</p>
<fig id="F8" position="float">
<label>FIGURE 8</label>
<caption><p>Index in the different models for macronutrients in test set. &#x002A;&#x002A;&#x002A;<italic>p</italic>-value significant. Other comparisons were not significant with <italic>p</italic>-value &#x003E; 0.05.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnut-10-1066749-g008.tif"/>
</fig>
<fig id="F9" position="float">
<label>FIGURE 9</label>
<caption><p>Index in the different models for micronutrients in test set. &#x002A;&#x002A;&#x002A;<italic>p</italic>-value significant. Other comparisons were not significant with <italic>p</italic>-value &#x003E; 0.05.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnut-10-1066749-g009.tif"/>
</fig>
</sec>
<sec id="S4" sec-type="discussion">
<title>Discussion</title>
<p>In this study we enhanced predictive performances using different survival statistical techniques in an elderly cohort.</p>
<p>Alcohol consumption had a protective role on mortality in our cohort, unlike in the literature (<xref ref-type="bibr" rid="B43">43</xref>), probably due to wine consumption rather than spirits. Red wine is a source of polyphenols, such as resveratrol and anthocyanin, with demonstrated antioxidant properties and protective role against cardiovascular diseases (<xref ref-type="bibr" rid="B44">44</xref>&#x2013;<xref ref-type="bibr" rid="B46">46</xref>).</p>
<p>In the same way, iron, like alcohol, had a protective role and a high impact on mortality both in the CPH model and RSF (<xref ref-type="bibr" rid="B47">47</xref>). Among these methods, RSF demonstrated the best performance followed by CPH, as reported in the literature (<xref ref-type="bibr" rid="B48">48</xref>) while the SVM showed the worst performance because generally used for a wide array of applications for its high potential in data transformation and geometric structure (<xref ref-type="bibr" rid="B49">49</xref>, <xref ref-type="bibr" rid="B50">50</xref>). The results reveal a promising ML potential for predicting mortality risk in clinical practice. SHAP values on the summary plot were an interesting way to illustrate key features and a good method for describing the power of prediction of outcome. RSF followed by CPH are known to be the best for predicting mortality, while the use of SHAP is still rarely used. The power of SHAP was to allow a description of the single patients variability and also of how individual patients were distributed in terms of SHAP values. The advantage of using SHAP was to overcome the problem of interpreting the classical ML techniques, widely described in the literature regarding the prediction of macro- and micronutrients, and to couple it with other techniques to improve the prediction (<xref ref-type="bibr" rid="B51">51</xref>).</p>
<p>The main limitation of this study is the use of an elderly cohort. Mathematically, the use of larger cohorts will surely lead to different results and therefore external validation could be necessary.</p>
<p>Our report can be seen as a first step toward evaluating personalized nutrition in an elderly cohort, and studying the relationship between macro- and micronutrients intake and mortality with different approaches.</p>
<p>Understanding eating habits and therefore the correct intake of macro and micronutrients will be useful for improving the lifestyle of people with obesity who also suffer from other disabling pathologies (<xref ref-type="bibr" rid="B18">18</xref>, <xref ref-type="bibr" rid="B24">24</xref>).</p>
<p>This suggests that combining different techniques could be useful in order to personalize the ideal dietary intake. Furthermore, a future follow-up project will be to use these tools to predict changes in food habits in the Southern Italian population. The promise of using ML tools to achieve nutritional phenotyping needs to be explored further in order to set up standard paradigms based on different epidemiological parameters.</p>
<p>Precision nutrition directly addresses metabolic heterogeneity and may serve as a treatment for obesity and other metabolic diseases. Future interventions should examine ways to increase dietary self-monitoring adherence and intervention exposure and consider the development and testing of a specific predictive algorithm (<xref ref-type="bibr" rid="B52">52</xref>).</p>
</sec>
<sec id="S5" sec-type="conclusion">
<title>Conclusion</title>
<p>The results of this study demonstrate that both the CPH and RSV methods performed well, but the combination with SHAP was best. It is difficult to draw conclusions about preferring one model over the other based on this study and the others in the literature, because each model had advantages and disadvantages. In clinical practice none of the mathematical models described could replace another, but all should be used together to make future decisions. In the last years, Artificial intelligence offers major opportunities to improve public health management.</p>
</sec>
<sec id="S6" sec-type="data-availability">
<title>Data availability statement</title>
<p>The datasets presented in this article are not readily available because IRCCS &#x201C;S. de Bellis&#x201D; Hospital property. Requests to access the datasets should be directed to corresponding author.</p>
</sec>
<sec id="S7" sec-type="ethics-statement">
<title>Ethics statement</title>
<p>The studies involving human participants were reviewed and approved by the National Institute of Gastroenterology &#x201C;S. de Bellis,&#x201D; Research Hospital, Castellana Grotte (BA), Italy. The patients/participants provided their written informed consent to participate in this study.</p>
</sec>
<sec id="S8" sec-type="author-contributions">
<title>Author contributions</title>
<p>RD and ML: conceptualization and writing&#x2014;original draft preparation. RD: methodology, software, formal analysis, investigation, and data curation. RD, ML, and PP: writing&#x2014;review and editing. LP: supervision. VG: methodology. All authors have read and agreed to the published version of the manuscript.</p>
</sec>
</body>
<back>
<sec id="S9" sec-type="funding-information">
<title>Funding</title>
<p>This study was supported by the Italian Ministry of Health with the &#x201C;Ricerca Corrente 2022&#x201D; Istituto di Ricovero e Cura a Carattere Scientifico (IRCCS) &#x201C;S. de Bellis,&#x201D; funded by research projects.</p>
</sec>
<sec id="S10" sec-type="COI-statement">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="S11" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<fn-group>
<title>Abbreviations</title>
<fn fn-type="abbr">
<p>Abbreviations: GC, gastrointestinal cancer; CPH, Cox proportional hazards; RSF, random survival forest; SSVM, survival support vector machine; SHAP, Shapley Additive Explanation; ML, machine learning; OOB, Out-Of-Bag error; RF, random forest; HR, hazards ratio; SD, standard deviations.</p></fn>
</fn-group>
<ref-list>
<title>References</title>
<ref id="B1"><label>1.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Aunan</surname> <given-names>J</given-names></name> <name><surname>Cho</surname> <given-names>W</given-names></name> <name><surname>S&#x00F8;reide</surname> <given-names>K</given-names></name></person-group>. <article-title>The biology of aging and cancer: a brief overview of shared and divergent molecular hallmarks.</article-title> <source><italic>Aging Dis.</italic></source> (<year>2017</year>) <volume>8</volume>:<fpage>628</fpage>&#x2013;<lpage>42</lpage>. <pub-id pub-id-type="doi">10.14336/AD.2017.0103</pub-id> <pub-id pub-id-type="pmid">28966806</pub-id></citation></ref>
<ref id="B2"><label>2.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marongiu</surname> <given-names>F</given-names></name> <name><surname>Serra</surname> <given-names>M</given-names></name> <name><surname>Laconi</surname> <given-names>E</given-names></name></person-group>. <article-title>Development versus evolution in cancer biology.</article-title> <source><italic>Trends Cancer.</italic></source> (<year>2018</year>) <volume>4</volume>:<fpage>342</fpage>&#x2013;<lpage>8</lpage>.</citation></ref>
<ref id="B3"><label>3.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Siegel</surname> <given-names>R</given-names></name> <name><surname>Miller</surname> <given-names>K</given-names></name> <name><surname>Fuchs</surname> <given-names>H</given-names></name> <name><surname>Jemal</surname> <given-names>A</given-names></name></person-group>. <article-title>Cancer statistics, 2022.</article-title> <source><italic>CA Cancer J Clin.</italic></source> (<year>2022</year>) <volume>72</volume>:<fpage>7</fpage>&#x2013;<lpage>33</lpage>.</citation></ref>
<ref id="B4"><label>4.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Quante</surname> <given-names>M</given-names></name> <name><surname>Varga</surname> <given-names>J</given-names></name> <name><surname>Wang</surname> <given-names>T</given-names></name> <name><surname>Greten</surname> <given-names>F</given-names></name></person-group>. <article-title>The gastrointestinal tumor microenvironment.</article-title> <source><italic>Gastroenterology.</italic></source> (<year>2013</year>) <volume>145</volume>:<fpage>63</fpage>&#x2013;<lpage>78</lpage>.</citation></ref>
<ref id="B5"><label>5.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sukka</surname> <given-names>S</given-names></name></person-group> <collab>The Onconut <sup>&#x00AE;</sup> Project Group.</collab> <article-title>The impact of clinical nutrition on cancer therapy: a frequently underestimated perspective. A complementary approach to cancer patients.</article-title> <source><italic>Med J Nutr Metab.</italic></source> (<year>2012</year>) <volume>5</volume>:<fpage>75</fpage>&#x2013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1007/s12349-012-0105-z</pub-id> <pub-id pub-id-type="pmid">22962628</pub-id></citation></ref>
<ref id="B6"><label>6.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Movahed</surname> <given-names>S</given-names></name> <name><surname>Tabrizi</surname> <given-names>F</given-names></name> <name><surname>Pahlavani</surname> <given-names>N</given-names></name> <name><surname>Toussi</surname> <given-names>M</given-names></name> <name><surname>Motlah</surname> <given-names>A</given-names></name> <name><surname>Eslami</surname> <given-names>S</given-names></name><etal/></person-group> <article-title>Comprehensive assessment of nutritional status and nutritional-related complications in newly diagnosed esophageal cancer patients: a cross-sectional study.</article-title> <source><italic>Clin Nutr.</italic></source> (<year>2021</year>) <volume>40</volume>:<fpage>4449</fpage>&#x2013;<lpage>55</lpage>.</citation></ref>
<ref id="B7"><label>7.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Walsh</surname> <given-names>D</given-names></name> <name><surname>Szafranski</surname> <given-names>M</given-names></name> <name><surname>Aktas</surname> <given-names>A</given-names></name> <name><surname>Kadakia</surname> <given-names>K</given-names></name></person-group>. <article-title>Malnutrition in cancer care: time to address the elephant in the room.</article-title> <source><italic>J. Oncol. Pract.</italic></source> (<year>2019</year>) <volume>15</volume>:<fpage>357</fpage>&#x2013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1200/JOP.19.00165</pub-id> <pub-id pub-id-type="pmid">31188710</pub-id></citation></ref>
<ref id="B8"><label>8.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arensberg</surname> <given-names>M</given-names></name> <name><surname>Richards</surname> <given-names>J</given-names></name> <name><surname>Benjamin</surname> <given-names>J</given-names></name> <name><surname>Kerr</surname> <given-names>K</given-names></name> <name><surname>Hegazi</surname> <given-names>R</given-names></name></person-group>. <article-title>Opportunities for quality improvement programs (QIPs) in the nutrition support of patients with cancer.</article-title> <source><italic>Healthcare.</italic></source> (<year>2020</year>) <volume>8</volume>:<issue>227</issue>. <pub-id pub-id-type="doi">10.3390/healthcare8030227</pub-id> <pub-id pub-id-type="pmid">32722026</pub-id></citation></ref>
<ref id="B9"><label>9.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tayyem</surname> <given-names>R</given-names></name> <name><surname>Bawadi</surname> <given-names>H</given-names></name> <name><surname>Shehadah</surname> <given-names>I</given-names></name> <name><surname>Abu-Mweis</surname> <given-names>S</given-names></name> <name><surname>Agraib</surname> <given-names>L</given-names></name> <name><surname>Bani-Hani</surname> <given-names>K</given-names></name><etal/></person-group> <article-title>Macro- and micronutrients consumption and the risk for colorectal cancer among Jordanians.</article-title> <source><italic>Nutrients.</italic></source> (<year>2015</year>) <volume>7</volume>:<fpage>1769</fpage>&#x2013;<lpage>86</lpage>. <pub-id pub-id-type="doi">10.3390/nu7031769</pub-id> <pub-id pub-id-type="pmid">25763533</pub-id></citation></ref>
<ref id="B10"><label>10.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Harshman</surname> <given-names>M</given-names></name> <name><surname>Aldoori</surname> <given-names>W</given-names></name></person-group>. <article-title>Diet and colorectal cancer: review of the evidence.</article-title> <source><italic>Can Fam Physician.</italic></source> (<year>2007</year>) <volume>53</volume>:<fpage>1913</fpage>&#x2013;<lpage>20</lpage>.</citation></ref>
<ref id="B11"><label>11.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sun</surname> <given-names>Z</given-names></name> <name><surname>Liu</surname> <given-names>L</given-names></name> <name><surname>Wang</surname> <given-names>P</given-names></name> <name><surname>Roebothan</surname> <given-names>B</given-names></name> <name><surname>Zhano</surname> <given-names>J</given-names></name> <name><surname>Cotterchio</surname> <given-names>M</given-names></name><etal/></person-group> <article-title>Association of total energy intake and macronutient consumption with colorectal cancer risk: results from a large population-based case-control study in newfoundland and labrador and Ontario.</article-title> <source><italic>Canda. Nutr J.</italic></source> (<year>2012</year>) <volume>11</volume>:<issue>18</issue>. <pub-id pub-id-type="doi">10.1186/1475-2891-11-18</pub-id> <pub-id pub-id-type="pmid">22449145</pub-id></citation></ref>
<ref id="B12"><label>12.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gonzalez</surname> <given-names>R</given-names></name></person-group>. <article-title>Free radiclas, oxidative stress and DNA metabolism in human cancer.</article-title> <source><italic>Cancer Investig.</italic></source> (<year>1999</year>) <volume>17</volume>:<fpage>376</fpage>&#x2013;<lpage>7</lpage>.</citation></ref>
<ref id="B13"><label>13.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Millen</surname> <given-names>A</given-names></name> <name><surname>Subar</surname> <given-names>A</given-names></name> <name><surname>Graubard</surname> <given-names>B</given-names></name> <name><surname>Peters</surname> <given-names>U</given-names></name> <name><surname>Hayes</surname> <given-names>R</given-names></name> <name><surname>Weissfeld</surname> <given-names>J</given-names></name><etal/></person-group> <article-title>PLCO, cancer screening trial project team. Fruit and vegetale intake and prevalene of colorectal adenom in a cancer screening trial.</article-title> <source><italic>Am J Clin Nutr.</italic></source> (<year>2007</year>) <volume>86</volume>:<fpage>1754</fpage>&#x2013;<lpage>64</lpage>. <pub-id pub-id-type="doi">10.1093/ajcn/86.5.1754</pub-id> <pub-id pub-id-type="pmid">18065596</pub-id></citation></ref>
<ref id="B14"><label>14.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sato</surname> <given-names>Y</given-names></name> <name><surname>Tsubono</surname> <given-names>Y</given-names></name> <name><surname>Kakaya</surname> <given-names>N</given-names></name> <name><surname>Ogawa</surname> <given-names>K</given-names></name> <name><surname>Kurashima</surname> <given-names>K</given-names></name> <name><surname>Kuriyama</surname> <given-names>S</given-names></name><etal/></person-group> <article-title>Fruits and vegetable consumption and risk of colorectal cancer in Japan: the miyagi cohort study.</article-title> <source><italic>Public Health Nutr.</italic></source> (<year>2005</year>) <volume>8</volume>:<fpage>309</fpage>&#x2013;<lpage>14</lpage>. <pub-id pub-id-type="doi">10.1079/phn2004681</pub-id> <pub-id pub-id-type="pmid">15918928</pub-id></citation></ref>
<ref id="B15"><label>15.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Song</surname> <given-names>H</given-names></name> <name><surname>Jeong</surname> <given-names>A</given-names></name> <name><surname>Mai Tran</surname> <given-names>T</given-names></name> <name><surname>lee</surname> <given-names>J</given-names></name> <name><surname>Kim</surname> <given-names>M</given-names></name> <name><surname>Park</surname> <given-names>B</given-names></name></person-group>. <article-title>Association between micronutrient intake and breast cancer risk according to body mass index in South Korean Adult women: a cohort study.</article-title> <source><italic>Nutrients.</italic></source> (<year>2022</year>) <volume>14</volume>:<issue>2644</issue>. <pub-id pub-id-type="doi">10.3390/nu14132644</pub-id> <pub-id pub-id-type="pmid">35807825</pub-id></citation></ref>
<ref id="B16"><label>16.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tong</surname> <given-names>Y</given-names></name> <name><surname>Gao</surname> <given-names>H</given-names></name> <name><surname>Qi</surname> <given-names>Q</given-names></name> <name><surname>Liu</surname> <given-names>X</given-names></name> <name><surname>Li</surname> <given-names>J</given-names></name> <name><surname>Gao</surname> <given-names>J</given-names></name><etal/></person-group> <article-title>High fat diet, gut microbiome and gastrointestinal cancer.</article-title> <source><italic>Theranostics.</italic></source> (<year>2021</year>) <volume>11</volume>:<fpage>5889</fpage>&#x2013;<lpage>910</lpage>.</citation></ref>
<ref id="B17"><label>17.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Murphy</surname> <given-names>N</given-names></name> <name><surname>Jenab</surname> <given-names>M</given-names></name> <name><surname>Gunter</surname> <given-names>M</given-names></name></person-group>. <article-title>Adiposity and gastrointestinal cancers: epidemiology, mechanisms and future directions.</article-title> <source><italic>Nat Rev Gastroenterol Hepatol.</italic></source> (<year>2018</year>) <volume>15</volume>:<fpage>659</fpage>&#x2013;<lpage>70</lpage>. <pub-id pub-id-type="doi">10.1038/s41575-018-0038-1</pub-id> <pub-id pub-id-type="pmid">29970888</pub-id></citation></ref>
<ref id="B18"><label>18.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pahlavani</surname> <given-names>N</given-names></name> <name><surname>Rostami</surname> <given-names>D</given-names></name> <name><surname>Ebrahimi</surname> <given-names>F</given-names></name> <name><surname>Azizi-Soleiman</surname> <given-names>F</given-names></name></person-group>. <article-title>Nuts effects in chronic disease and relationship between walnuts and satiety: review on the available evidence.</article-title> <source><italic>Obes Med.</italic></source> (<year>2020</year>) <volume>17</volume>:<issue>1001736</issue>.</citation></ref>
<ref id="B19"><label>19.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kolahdouz-Mohammadi</surname> <given-names>R</given-names></name> <name><surname>Malekahmadi</surname> <given-names>M</given-names></name> <name><surname>Calyton</surname> <given-names>Z</given-names></name> <name><surname>Sadata</surname> <given-names>S</given-names></name> <name><surname>Pahlavani</surname> <given-names>N</given-names></name> <name><surname>Sikaroudi</surname> <given-names>M</given-names></name><etal/></person-group> <article-title>Effect of egg consumption on blood pressure: a systematic review and meta-analysis of randomized clinical trials.</article-title> <source><italic>Curr Hypertens Rep.</italic></source> (<year>2020</year>) <volume>22</volume>:<issue>24</issue>.</citation></ref>
<ref id="B20"><label>20.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mansouri</surname> <given-names>M</given-names></name> <name><surname>Pahlavani</surname> <given-names>N</given-names></name> <name><surname>Sharifi</surname> <given-names>F</given-names></name> <name><surname>Varmaghani</surname> <given-names>M</given-names></name> <name><surname>Shokri</surname> <given-names>A</given-names></name> <name><surname>Yaghubi</surname> <given-names>H</given-names></name><etal/></person-group> <article-title>Dairy consumption in relation to hypertension among a large population of university students: the MEPHASOUS Study.</article-title> <source><italic>Diabetes Metab Syndr Obes.</italic></source> (<year>2020</year>) <volume>13</volume>:<fpage>1633</fpage>&#x2013;<lpage>42</lpage>. <pub-id pub-id-type="doi">10.2147/DMSO.S248592</pub-id> <pub-id pub-id-type="pmid">32523363</pub-id></citation></ref>
<ref id="B21"><label>21.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hadi</surname> <given-names>V</given-names></name> <name><surname>Pahlavani</surname> <given-names>N</given-names></name> <name><surname>Malekahmadi</surname> <given-names>M</given-names></name> <name><surname>Nattagh-Eshtivani</surname> <given-names>E</given-names></name> <name><surname>Navashenaq</surname> <given-names>J</given-names></name> <name><surname>Hadi</surname> <given-names>S</given-names></name><etal/></person-group> <article-title>Nigella sativa in controlling type 2 diabetes, cardiovascular, and rheumatoid arthritis diseases: molecular aspects.</article-title> <source><italic>J Res Med Sci.</italic></source> (<year>2021</year>) <volume>26</volume>:<issue>20</issue>. <pub-id pub-id-type="doi">10.4103/jrms.JRMS_236_20</pub-id> <pub-id pub-id-type="pmid">34221050</pub-id></citation></ref>
<ref id="B22"><label>22.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nattagh-Eshtivani</surname> <given-names>E</given-names></name> <name><surname>Barghchi</surname> <given-names>H</given-names></name> <name><surname>Pahlavani</surname> <given-names>N</given-names></name> <name><surname>Barati</surname> <given-names>M</given-names></name> <name><surname>Amiri</surname> <given-names>Y</given-names></name> <name><surname>Fadel</surname> <given-names>A</given-names></name><etal/></person-group> <article-title>Biological and pharmacological effects and nutritional impact of phytosterols: a comprehensive review.</article-title> <source><italic>Phytother Res.</italic></source> (<year>2022</year>) <volume>36</volume>:<fpage>299</fpage>&#x2013;<lpage>322</lpage>. <pub-id pub-id-type="doi">10.1002/ptr.7312</pub-id> <pub-id pub-id-type="pmid">34729825</pub-id></citation></ref>
<ref id="B23"><label>23.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pahlavani</surname> <given-names>N</given-names></name> <name><surname>Khayyatzadeh</surname> <given-names>S</given-names></name> <name><surname>Banazadeh</surname> <given-names>V</given-names></name> <name><surname>Bagherniya</surname> <given-names>M</given-names></name> <name><surname>Tayefi</surname> <given-names>M</given-names></name> <name><surname>Eslami</surname> <given-names>S</given-names></name><etal/></person-group> <article-title>Adherence to a dietary approach to stop hypertension (DASH)-style in relation to daytime sleepiness.</article-title> <source><italic>Nat Sci Sleep.</italic></source> (<year>2020</year>) <volume>12</volume>:<fpage>325</fpage>&#x2013;<lpage>32</lpage>. <pub-id pub-id-type="doi">10.2147/NSS.S246991</pub-id> <pub-id pub-id-type="pmid">32607032</pub-id></citation></ref>
<ref id="B24"><label>24.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shabgah</surname> <given-names>A</given-names></name> <name><surname>Norouzi</surname> <given-names>F</given-names></name> <name><surname>Hedayati-Moghadam</surname> <given-names>M</given-names></name> <name><surname>Soleimani</surname> <given-names>D</given-names></name> <name><surname>Pahlavani</surname> <given-names>N</given-names></name> <name><surname>Navashenaq</surname> <given-names>J</given-names></name></person-group>. <article-title>A comprehensive review of long non-coding RNAs in the pathogenesis and development of non-alcoholic fatty liver disease.</article-title> <source><italic>Nutr Metab.</italic></source> (<year>2021</year>) <volume>18</volume>:<issue>22</issue>. <pub-id pub-id-type="doi">10.1186/s12986-021-00552-5</pub-id> <pub-id pub-id-type="pmid">33622377</pub-id></citation></ref>
<ref id="B25"><label>25.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kourou</surname> <given-names>K</given-names></name> <name><surname>Exarchos</surname> <given-names>T</given-names></name> <name><surname>Exarchos</surname> <given-names>K</given-names></name> <name><surname>Karamouzis</surname> <given-names>M</given-names></name> <name><surname>Fotiadias</surname> <given-names>D</given-names></name></person-group>. <article-title>Machine learning applications in cancer progrosis and prediction.</article-title> <source><italic>Comput. Struct. Biotechno. J.</italic></source> (<year>2015</year>) <volume>13</volume>:<fpage>8</fpage>&#x2013;<lpage>17</lpage>.</citation></ref>
<ref id="B26"><label>26.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Athanasiou</surname> <given-names>M</given-names></name> <name><surname>Sfrintzeri</surname> <given-names>K</given-names></name> <name><surname>Zarkogianni</surname> <given-names>K</given-names></name> <name><surname>Thanopoulou</surname> <given-names>AC</given-names></name> <name><surname>Nikit</surname> <given-names>KS</given-names></name></person-group>. <article-title>An explainable XGBoost-Based approach towards assessing the risk of cardiovascular diasease in patients in patients with type 2 diabetes mellitus.</article-title> <source><italic>Proceedings of the 2020 IEE 20th International Conference on Bioinformatics and Bioengineering (BIBE).</italic></source> <publisher-loc>Cincinnati, OH</publisher-loc>: <publisher-name>IEEE</publisher-name> (<year>2020</year>).</citation></ref>
<ref id="B27"><label>27.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tibshirani</surname> <given-names>R</given-names></name></person-group>. <article-title>Regression shrinkage and selection via the Lasso.</article-title> <source><italic>J R Stat Soc.</italic></source> (<year>1996</year>) <volume>58</volume>:<fpage>267</fpage>&#x2013;<lpage>88</lpage>.</citation></ref>
<ref id="B28"><label>28.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dietrich</surname> <given-names>S</given-names></name> <name><surname>Floegel</surname> <given-names>A</given-names></name> <name><surname>Troll</surname> <given-names>M</given-names></name> <name><surname>Kuhn</surname> <given-names>T</given-names></name> <name><surname>Rathmann</surname> <given-names>W</given-names></name> <name><surname>Peters</surname> <given-names>A</given-names></name><etal/></person-group> <article-title>Random survival forest in practice: a method for modelling complex metabolomics data in time to event analysis.</article-title> <source><italic>Int J Epidemiol.</italic></source> (<year>2016</year>) <volume>45</volume>:<fpage>1406</fpage>&#x2013;<lpage>20</lpage>. <pub-id pub-id-type="doi">10.1093/ije/dyw145</pub-id> <pub-id pub-id-type="pmid">27591264</pub-id></citation></ref>
<ref id="B29"><label>29.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cox</surname> <given-names>D</given-names></name></person-group>. <article-title>Regression models and life-tables.</article-title> <source><italic>J R Stat Soc.</italic></source> (<year>1972</year>) <volume>34</volume>:<fpage>187</fpage>&#x2013;<lpage>202</lpage>.</citation></ref>
<ref id="B30"><label>30.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ishwaran</surname> <given-names>H</given-names></name> <name><surname>Kogalur</surname> <given-names>UB</given-names></name> <name><surname>Blackstone</surname> <given-names>E</given-names></name> <name><surname>Lauer</surname> <given-names>MS</given-names></name></person-group>. <article-title>Random survival forests.</article-title> <source><italic>Ann Appl Stat.</italic></source> (<year>2008</year>) <volume>2</volume>:<fpage>841</fpage>&#x2013;<lpage>60</lpage>.</citation></ref>
<ref id="B31"><label>31.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cortes</surname> <given-names>C</given-names></name> <name><surname>Vapnik</surname> <given-names>V</given-names></name></person-group>. <article-title>Support-vector networks.</article-title> <source><italic>Mach Learn.</italic></source> (<year>1995</year>) <volume>20</volume>:<fpage>273</fpage>&#x2013;<lpage>97</lpage>.</citation></ref>
<ref id="B32"><label>32.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kiang</surname> <given-names>Y</given-names></name> <name><surname>Xie</surname> <given-names>J</given-names></name> <name><surname>Liu</surname> <given-names>W</given-names></name> <name><surname>Xi</surname> <given-names>S</given-names></name> <name><surname>Huang</surname> <given-names>L</given-names></name> <name><surname>Huang</surname> <given-names>W</given-names></name><etal/></person-group> <article-title>Immunomarker support vector machine classifier for prediction of gastric cancer survival and adjuvant chemotherpautic benefit.</article-title> <source><italic>Clin Cancer Res.</italic></source> (<year>2018</year>) <volume>24</volume>:<fpage>5574</fpage>&#x2013;<lpage>84</lpage>. <pub-id pub-id-type="doi">10.1158/1078-0432.CCR-18-0848</pub-id> <pub-id pub-id-type="pmid">30042208</pub-id></citation></ref>
<ref id="B33"><label>33.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lungberg</surname> <given-names>S</given-names></name> <name><surname>Lee</surname> <given-names>S</given-names></name></person-group>. <article-title>A unified approach to interpreting model predictions.</article-title> <source><italic>Adv Neural Inf Process Syst.</italic></source> (<year>2017</year>) <volume>20</volume>:<fpage>4765</fpage>&#x2013;<lpage>74</lpage>.</citation></ref>
<ref id="B34"><label>34.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Molan</surname> <given-names>C.</given-names></name></person-group> <source><italic>Interpretable machine learning</italic>. Self Published</source>. (<year>2020</year>).</citation></ref>
<ref id="B35"><label>35.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lim</surname> <given-names>S</given-names></name> <name><surname>Chi</surname> <given-names>S</given-names></name></person-group>. <article-title>Xgboost application on bridge management systems for proactive damage estimation.</article-title> <source><italic>Adv Eng Inf.</italic></source> (<year>2019</year>) <volume>41</volume>:<issue>100922</issue>.</citation></ref>
<ref id="B36"><label>36.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Harrell</surname> <given-names>J</given-names></name> <name><surname>Lee</surname> <given-names>K</given-names></name> <name><surname>Mark</surname> <given-names>D</given-names></name></person-group>. <article-title>Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors.</article-title> <source><italic>Stat Med.</italic></source> (<year>1996</year>) <volume>15</volume>:<fpage>361</fpage>&#x2013;<lpage>87</lpage>. <pub-id pub-id-type="doi">10.1002/(SICI)1097-0258(19960229)15:4&#x003C;361::AID-SIM168&#x003E;3.0.CO;2-4</pub-id> <pub-id pub-id-type="pmid">8668867</pub-id></citation></ref>
<ref id="B37"><label>37.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pierucci</surname> <given-names>P</given-names></name> <name><surname>Misciagna</surname> <given-names>G</given-names></name> <name><surname>Ventura</surname> <given-names>M</given-names></name> <name><surname>Inguaggiato</surname> <given-names>R</given-names></name> <name><surname>Cisternino</surname> <given-names>A</given-names></name> <name><surname>Guerra</surname> <given-names>V</given-names></name><etal/></person-group> <article-title>Diet and myocardial infarction: a nested case-control study in a cohort of elderly subjects in a Mediterranean area of souther Italy.</article-title> <source><italic>Nutr Metab Cardiovasc Dis.</italic></source> (<year>2012</year>) <volume>22</volume>:<fpage>727</fpage>&#x2013;<lpage>33</lpage>. <pub-id pub-id-type="doi">10.1016/j.numecd.2010.12.002</pub-id> <pub-id pub-id-type="pmid">21482083</pub-id></citation></ref>
<ref id="B38"><label>38.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fidanza</surname> <given-names>F.</given-names></name></person-group> <source><italic>Nutrizione Umana.</italic></source> <publisher-loc>Idelson</publisher-loc>: <publisher-name>Napoli</publisher-name> (<year>1984</year>).</citation></ref>
<ref id="B39"><label>39.</label><citation citation-type="journal"><collab>FAO/WHO.</collab> <source><italic>Carbohydrate and Human Nutrition.</italic></source> <publisher-loc>Rome</publisher-loc>: <publisher-name>FAO</publisher-name> (<year>1998</year>).</citation></ref>
<ref id="B40"><label>40.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Foster-Powell</surname> <given-names>K</given-names></name> <name><surname>Holt</surname> <given-names>S</given-names></name> <name><surname>Brand-Miller</surname> <given-names>J</given-names></name></person-group>. <article-title>International tables of glycemic index and glycemic load values.</article-title> <source><italic>Am J Clin Nutr.</italic></source> (<year>2002</year>) <volume>76</volume>:<fpage>5</fpage>&#x2013;<lpage>56</lpage>.</citation></ref>
<ref id="B41"><label>41.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lachat</surname> <given-names>C</given-names></name> <name><surname>Hawwash</surname> <given-names>D</given-names></name> <name><surname>Ock&#x00E9;</surname> <given-names>M</given-names></name> <name><surname>Berg</surname> <given-names>C</given-names></name> <name><surname>Forsum</surname> <given-names>E</given-names></name> <name><surname>H&#x00F6;rnell</surname> <given-names>A</given-names></name><etal/></person-group> <article-title>Strengthening the reporting of observational studies in epidemiology &#x2013; nutritional epidemiology (STROBE-nut): an extension of the STROBE statement.</article-title> <source><italic>PLoS Med.</italic></source> (<year>2016</year>) <volume>13</volume>:<issue>e1002036</issue>. <pub-id pub-id-type="doi">10.1371/journal.pmed.1002036</pub-id></citation></ref>
<ref id="B42"><label>42.</label><citation citation-type="journal"><collab>StataCorp.</collab> <source><italic>Stata Statistical Software: Release 17.</italic></source> <publisher-loc>College Station, TX</publisher-loc>: <publisher-name>StataCorp LLC</publisher-name> (<year>2021</year>).</citation></ref>
<ref id="B43"><label>43.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Belle</surname> <given-names>V</given-names></name> <name><surname>Papantonis</surname> <given-names>I</given-names></name></person-group>. <article-title>Principles and practive of explainable machine learning.</article-title> <source><italic>Front Big Data.</italic></source> (<year>2021</year>) <volume>4</volume>:<issue>688969</issue>. <pub-id pub-id-type="doi">10.3389/fdata.2021.688969</pub-id> <pub-id pub-id-type="pmid">34278297</pub-id></citation></ref>
<ref id="B44"><label>44.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fan</surname> <given-names>S</given-names></name> <name><surname>Hu</surname> <given-names>Y</given-names></name> <name><surname>You</surname> <given-names>Y</given-names></name> <name><surname>Xue</surname> <given-names>W</given-names></name> <name><surname>Chai</surname> <given-names>R</given-names></name> <name><surname>Zhang</surname> <given-names>X</given-names></name><etal/></person-group> <article-title>Role of resveratrol in inhibiting pathological cardiac remodeling.</article-title> <source><italic>Front Pharmacol.</italic></source> (<year>2022</year>) <volume>13</volume>:<issue>924473</issue>. <pub-id pub-id-type="doi">10.3389/fphar.2022.924473</pub-id> <pub-id pub-id-type="pmid">36120366</pub-id></citation></ref>
<ref id="B45"><label>45.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bohara</surname> <given-names>R</given-names></name> <name><surname>Tabassum</surname> <given-names>N</given-names></name> <name><surname>Singh</surname> <given-names>M</given-names></name> <name><surname>Gigli</surname> <given-names>G</given-names></name> <name><surname>Ragusa</surname> <given-names>A</given-names></name> <name><surname>Leporatti</surname> <given-names>S</given-names></name></person-group>. <article-title>Recent overview of resveratrol&#x2019;s beneficial effects and its nano-delivery systems.</article-title> <source><italic>Molecules.</italic></source> (<year>2022</year>) <volume>27</volume>:<issue>5154</issue>.</citation></ref>
<ref id="B46"><label>46.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Herrera-Bravo</surname> <given-names>J</given-names></name> <name><surname>Beltr&#x00E1;n</surname> <given-names>J</given-names></name> <name><surname>Huard</surname> <given-names>N</given-names></name> <name><surname>Saavedra</surname> <given-names>K</given-names></name> <name><surname>Saavedra</surname> <given-names>N</given-names></name> <name><surname>Alvear</surname> <given-names>M</given-names></name><etal/></person-group> <article-title>Anthocyanins found in pinot noir waste induce target genes related to the Nrf2 signalling in endothelial cells.</article-title> <source><italic>Antioxidants.</italic></source> (<year>2022</year>) <volume>11</volume>:<issue>1239</issue>. <pub-id pub-id-type="doi">10.3390/antiox11071239</pub-id> <pub-id pub-id-type="pmid">35883728</pub-id></citation></ref>
<ref id="B47"><label>47.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jin</surname> <given-names>M</given-names></name> <name><surname>Cai</surname> <given-names>S</given-names></name> <name><surname>Guo</surname> <given-names>J</given-names></name> <name><surname>Zhu</surname> <given-names>Y</given-names></name> <name><surname>Li</surname> <given-names>M</given-names></name> <name><surname>Yu</surname> <given-names>Y</given-names></name><etal/></person-group> <article-title>Alcohol drinking and all cancer mortality: a meta-analysis.</article-title> <source><italic>Ann Oncol.</italic></source> (<year>2013</year>) <volume>24</volume>:<fpage>807</fpage>&#x2013;<lpage>16</lpage>.</citation></ref>
<ref id="B48"><label>48.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fonseca-Nunes</surname> <given-names>A</given-names></name> <name><surname>Jakszyn</surname> <given-names>P</given-names></name> <name><surname>Agudo</surname> <given-names>A</given-names></name></person-group>. <article-title>Iron and cancer risk &#x2013; A systematic review and meta-analysis of the epidemiological evidence.</article-title> <source><italic>Cancer Epidemiol Biomarkers Prev.</italic></source> (<year>2014</year>) <volume>23</volume>:<fpage>12</fpage>&#x2013;<lpage>31</lpage>. <pub-id pub-id-type="doi">10.1158/1055-9965.EPI-13-0733</pub-id> <pub-id pub-id-type="pmid">24243555</pub-id></citation></ref>
<ref id="B49"><label>49.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Statnikov</surname> <given-names>A</given-names></name> <name><surname>Aliferis</surname> <given-names>C</given-names></name></person-group>. <article-title>Are random forests better than support vector machines for microarray-based cancer classification.</article-title> <source><italic>AMIA Annu Symp Proc.</italic></source> (<year>2007</year>) <volume>2007</volume>:<fpage>686</fpage>&#x2013;<lpage>90</lpage>.</citation></ref>
<ref id="B50"><label>50.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ben-Hur</surname> <given-names>A</given-names></name> <name><surname>Horn</surname> <given-names>D</given-names></name> <name><surname>Siegelmann</surname> <given-names>H</given-names></name> <name><surname>Vapnik</surname> <given-names>V.</given-names></name></person-group> <source><italic>Support Vector Clustering.</italic></source> <publisher-loc>Brookline, MA</publisher-loc>: <publisher-name>Microtome Publishing</publisher-name> (<year>2001</year>).</citation></ref>
<ref id="B51"><label>51.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kumar</surname> <given-names>I</given-names></name> <name><surname>Scheidegger</surname> <given-names>C</given-names></name> <name><surname>Venkatasubramanian</surname> <given-names>S</given-names></name> <name><surname>Friedler</surname> <given-names>S</given-names></name></person-group>. <article-title>Shapley residuals: quantifying the limits of the shapley value for explanations.</article-title> <source><italic>Proceedings of the ICML Workshop on Workshop on Human Interpretability in Machine Learning, July 2020.</italic></source> <publisher-loc>New York, NY</publisher-loc>: <publisher-name>WHI</publisher-name> (<year>2020</year>).</citation></ref>
<ref id="B52"><label>52.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Popp</surname> <given-names>C</given-names></name> <name><surname>Hu</surname> <given-names>L</given-names></name> <name><surname>Kharmats</surname> <given-names>A</given-names></name> <name><surname>Curran</surname> <given-names>M</given-names></name> <name><surname>Berube</surname> <given-names>L</given-names></name> <name><surname>Wang</surname> <given-names>C</given-names></name><etal/></person-group> <article-title>Effect of a personalized diet to reduce postprandial glycemic response vs a low-fat diet on weight loss in adults with abnormal glucose metabolism and obesity.</article-title> <source><italic>JAMA Netw.</italic></source> (<year>2022</year>) <volume>5</volume>:<issue>e2233760</issue>. <pub-id pub-id-type="doi">10.1001/jamanetworkopen.2022.33760</pub-id> <pub-id pub-id-type="pmid">36169954</pub-id></citation></ref>
</ref-list>
</back>
</article>
