<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fgene.2015.00075</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Original Research Article</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Integrated genomic and BMI analysis for type 2 diabetes risk assessment</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Lebr&#x000F3;n-Aldea</surname> <given-names>Dayanara</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/154905"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Dhurandhar</surname> <given-names>Emily J.</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/56287"/>
</contrib>
<contrib contrib-type="author">
<name><surname>P&#x000E9;rez-Rodr&#x000ED;guez</surname> <given-names>Paulino</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Klimentidis</surname> <given-names>Yann C.</given-names></name>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/30341"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Tiwari</surname> <given-names>Hemant K.</given-names></name>
<xref ref-type="aff" rid="aff5"><sup>5</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/22042"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Vazquez</surname> <given-names>Ana I.</given-names></name>
<xref ref-type="aff" rid="aff5"><sup>5</sup></xref>
<xref ref-type="author-notes" rid="fn001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/37593"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Institute of Mathematics, School of Science and Technology, Universidad Metropolitana</institution> <country>San Juan, Puerto Rico</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Health Behavior, School of Public Health, University of Alabama at Birmingham</institution> <country>Birmingham, AL, USA</country></aff>
<aff id="aff3"><sup>3</sup><institution>Department of Statistics, Colegio de Postgraduados</institution> <country>Texcoco, M&#x000E9;xico</country></aff>
<aff id="aff4"><sup>4</sup><institution>Division of Epidemiology and Biostatistics, Mel and Enid Zuckerman College of Public Health, University of Arizona</institution> <country>Tucson, AZ, USA</country></aff>
<aff id="aff5"><sup>5</sup><institution>Department of Biostatistics, School of Public Health, University of Alabama at Birmingham</institution> <country>Birmingham, AL, USA</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Eduardo Manfredi, Institut National de la Recherche Agronomique, France</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Li Zhang, University of California, San Francisco, USA; Alexandre Bureau, Universit&#x000E9; Laval, Canada</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Ana I. Vazquez, School of Public Health, University of Alabama at Birmingham, Ryals Public Health Building, 1665 University Boulevard, Birmingham, AL 35294-0022, USA e-mail: <email>avazquez&#x00040;uab.edu</email></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Statistical Genetics and Methodology, a section of the journal Frontiers in Genetics.</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>17</day>
<month>03</month>
<year>2015</year>
</pub-date>
<pub-date pub-type="collection">
<year>2015</year>
</pub-date>
<volume>6</volume>
<elocation-id>75</elocation-id>
<history>
<date date-type="received">
<day>31</day>
<month>10</month>
<year>2014</year>
</date>
<date date-type="accepted">
<day>12</day>
<month>02</month>
<year>2015</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2015 Lebr&#x000F3;n-Aldea, Dhurandhar, P&#x000E9;rez-Rodr&#x000ED;guez, Klimentidis, Tiwari and Vazquez.</copyright-statement>
<copyright-year>2015</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract><p>Type 2 Diabetes (T2D) is a chronic disease arising from the development of insulin absence or resistance within the body, and a complex interplay of environmental and genetic factors. The incidence of T2D has increased throughout the last few decades, together with the occurrence of the obesity epidemic. The consideration of variants identified by Genome Wide Association Studies (GWAS) into risk assessment models for T2D could aid in the identification of at-risk patients who could benefit from preventive medicine. In this study, we build several risk assessment models, evaluated with two different classification approaches (Logistic Regression and Neural Networks), to measure the effect of including genetic information in the prediction of T2D. We used data from to the Original and the Offspring cohorts of the Framingham Heart Study, which provides phenotypic and genetic information for 5245 subjects (4306 controls and 939 cases). Models were built by using several covariates: gender, exposure time, cohort, body mass index (BMI), and 65 SNPs associated to T2D. We fitted Logistic Regressions and Bayesian Regularized Neural Networks and then assessed their predictive ability by using a ten-fold cross validation. We found that the inclusion of genetic information into the risk assessment models increased the predictive ability by 2%, when compared to the baseline model. Furthermore, the models that included BMI at the onset of diabetes as a possible effector, gave an improvement of 6% in the area under the curve derived from the ROC analysis. The highest AUC achieved (0.75) belonged to the model that included BMI, and a genetic score based on the 65 established T2D-associated SNPs. Finally, the inclusion of SNPs and BMI raised predictive ability in all models as expected; however, results from the AUC in Neural Networks and Logistic Regression did not differ significantly in their prediction accuracy.</p></abstract>
<kwd-group>
<kwd>type 2 diabetes</kwd>
<kwd>Logistic Regression</kwd>
<kwd>Neural Network</kwd>
<kwd>risk assessment</kwd>
<kwd>genetic score</kwd>
</kwd-group>
<counts>
<fig-count count="0"/>
<table-count count="7"/>
<equation-count count="3"/>
<ref-count count="49"/>
<page-count count="8"/>
<word-count count="7266"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="introduction" id="s1">
<title>Introduction</title>
<p>Type 2 Diabetes (T2D) is one of the fastest growing diseases in the United States and other developed nations (Nugent, <xref ref-type="bibr" rid="B29">2008</xref>; Hu, <xref ref-type="bibr" rid="B14a">2011</xref>). In the last three decades, the number of Americans diagnosed with diabetes has tripled (from 5.6 to 20.9 million), making this a public health concern (CDC (Center for Disease Control), <xref ref-type="bibr" rid="B5">2013</xref>). T2D is a chronic metabolic disease, characterized by high levels of glucose in the blood, and frequently caused by a deficiency of insulin secretion and/or the development of insulin resistance (the inability of cells to respond to the insulin). If not treated properly, it can produce kidney failure, blindness, and circulatory problems. (Manzella, <xref ref-type="bibr" rid="B21">2007</xref>; Buijsse et al., <xref ref-type="bibr" rid="B3">2011</xref>; Hu, <xref ref-type="bibr" rid="B14a">2011</xref>; Sanghera and Blackett, <xref ref-type="bibr" rid="B33">2012</xref>). The interplay of environmental (i.e., sedentary life, obesity, lack of exercise, poor diet) and genetic factors (i.e., familial contribution), contribute to the etiology and epidemy of T2D, in addition to an estimated heritability of 26% (Poulsen et al., <xref ref-type="bibr" rid="B31">1999</xref>). Since 2007, Genome Wide Association Studies known as GWAS, have identified and confirmed more than 50 loci associated with the development of T2D (Steinthorsdottir et al., <xref ref-type="bibr" rid="B39">2007</xref>; Lindgren et al., <xref ref-type="bibr" rid="B17">2009</xref>; Shu et al., <xref ref-type="bibr" rid="B36">2010</xref>; Voight et al., <xref ref-type="bibr" rid="B43">2010</xref>; Morris et al., <xref ref-type="bibr" rid="B24">2012</xref>). Several genes identified so far are involved in encoding proteins necessary for insulin secretion, glucose metabolism, and beta-cell function, which are components that enable insulin production and insulin receptor activation in the body (Sladek et al., <xref ref-type="bibr" rid="B37">2007</xref>; Steinthorsdottir et al., <xref ref-type="bibr" rid="B39">2007</xref>; Yasuda et al., <xref ref-type="bibr" rid="B44">2008</xref>).</p>
<p>Previous studies that have included genetic profiling and scores in T2D preventive models, have shown only a slight increase in predictive ability. Generally, the use of genetic variants provides a small contribution in terms of prediction accuracy due to their small effects, especially if compared to the use of age and clinically measured variables, such as BMI, and triglyceride levels and known risk factors for this disease (Saxena et al., <xref ref-type="bibr" rid="B34">2007</xref>; Lyssenko et al., <xref ref-type="bibr" rid="B19">2008</xref>; Voight et al., <xref ref-type="bibr" rid="B43">2010</xref>; Vazquez et al., <xref ref-type="bibr" rid="B42">2012</xref>). As of today, while there is excitement with the possibility of a more personalized medicine, medical professionals do not consider genotypic information as a variable in assessing patients&#x00027; risk of developing T2D (Katsios, <xref ref-type="bibr" rid="B15">2010</xref>; Lyssenko and Laakso, <xref ref-type="bibr" rid="B20">2013</xref>). In several studies where risk assessment models have been built and tested, a few deficiencies have been noticed that could possibly have influenced their models&#x00027; predictive ability. Such deficiency may arise due to the use of a model that so far does not capture the complexity of polygenic signals and their interaction with covariates. In addition, an ideal risk assessment model would incorporate the interplay of a substantial number of small-effect genes and several phenotypic variables (e.g., BMI) related to the development of T2D in order to get a more realistic and precise prediction (Lindstrom and Tuomilehto, <xref ref-type="bibr" rid="B18">2003</xref>). However, by incorporating other phenotypes (also heritable) into the risk assessment models, pleiotropic genetic effects shared by both traits could be explained. BMI is an easy to measure phenotype, highly associated to diabetes and obesity and shown to be a strong predictor of diabetes (Lyssenko et al., <xref ref-type="bibr" rid="B19">2008</xref>; Meigs et al., <xref ref-type="bibr" rid="B23">2008</xref>). Nevertheless, it is possible that after accounting for BMI, the inclusion of SNP variants associated to T2D, may not improve prediction accuracy any further. However, this is an unanswered question.</p>
<p>To address these problems, we applied two statistical models (logistic regression, and a neural network) to data from the Framingham Heart Study, and incorporated 65 SNPs that are confirmed to be associated with T2D (Morris et al., <xref ref-type="bibr" rid="B24">2012</xref>) to estimate genetic and non-genetic effects in the prediction of T2D. Since non-genetic factors play a predominant role in whether genetically predisposed individuals progress on to T2D (Poulsen et al., <xref ref-type="bibr" rid="B31">1999</xref>), we considered including BMI information at the onset of T2D, and importantly including genetic by BMI interactions in the predictions of T2D.</p>
</sec>
<sec sec-type="materials and methods" id="s2">
<title>Materials and methods</title>
<sec>
<title>Data</title>
<p>Our data set (<italic>n</italic> &#x0003D; 5239) came from the Framingham Heart Study which followed participants over seven decades and collected information from bi-yearly physical and blood examinations. Our sample was composed of 2378 females and 2861 males from the Original and Offspring cohorts; where 4300 are controls and 939 subjects are cases. Diagnosis of T2D for subjects varied by cohort. In the Original cohort, the presence of T2D was diagnosed with a blood glucose level greater than or equal to 200 mg/dL; however, for the offspring cohort, diabetes was diagnosed if fasting glucose levels were equal or greater to 125 mg/dL (NCBI, <xref ref-type="bibr" rid="B25">2006</xref>, <xref ref-type="bibr" rid="B26">2008</xref>).</p>
<p>We also examined 65 SNPs that were found to be associated with T2D as listed in Morris et al. (<xref ref-type="bibr" rid="B24">2012</xref>). Since only 20 of the 65 SNPs were genotyped by the Affymetrix 500K chip in our sample, genotype imputation was performed for the missing genotypes of the SNPs by using the IMPUTE2 software (Howie et al., <xref ref-type="bibr" rid="B13">2011</xref>). Missing information per SNP was imputed with a mean accuracy of 0.94. The imputation accuracy for all the imputed SNPs can be seen in Table <xref ref-type="supplementary-material" rid="SM1">A</xref> in Supplementary Materials.</p>
</sec>
<sec>
<title>Models</title>
<p>In this section we will present the response variable, the set of predictors, and the genetic covariates used to build the T2D models. Subsequently, the parametric and non-parametric methods, Logistic Regression (LR) and Neural Network (NN), respectively, will be introduced and finally, we will detail a series of nested models that incorporate BMI and genetic components consisting of the 65 SNPs (Morris et al., <xref ref-type="bibr" rid="B24">2012</xref>).</p>
<sec>
<title>Set of response and predictor variables</title>
<p>Disease status of the participants was coded with a binary response variable <italic>y</italic>(<italic>y</italic><sub><italic>i</italic></sub> &#x0003D; 0 for absence and <italic>y</italic><sub><italic>i</italic></sub> &#x0003D; 1 for presence of T2D in the <italic>i<sup>th</sup></italic> subject). A group of covariates was selected based on the association with T2D (<italic>P</italic> &#x0003C; 0.01) and these were: cohort (<italic>c</italic><sub><italic>i</italic></sub>), a dummy variable indicating whether the subject <italic>i</italic> belongs to the Original or Offspring cohort; age at last contact (<italic>l</italic><sub><italic>i</italic></sub>) 73.91 &#x000B1; 11.74 (mean &#x000B1; s.d.), was included to control for different exposure time or observational period; the first two principal components (<italic>PC</italic><sub>1</sub>, <italic>PC</italic><sub>2</sub>) derived from a set of 1000 European ethnicity-informative SNPs (Drineas et al., <xref ref-type="bibr" rid="B8">2010</xref>), and gender (<italic>s</italic><sub><italic>i</italic></sub>), also coded with an indicator variable, with this set of co-variables we generated a baseline model that is not influenced by genetic effects. Each one of the risk assessment models was extended by incorporating the body mass index (BMI, <italic>b</italic><sub><italic>i</italic></sub>) at diabetes onset in the case of diabetics and the last observed BMI for non-diabetics, which served as a measure of obesity [<italic>b</italic><sub><italic>i</italic></sub> (<italic>mean</italic> &#x000B1; <italic>s.d.</italic>) &#x0003D; 27.75 &#x000B1; 5.38]. In some models, the SNPs were incorporated either by directly including the 65 SNPs or indirectly by a genetic score (GS) calculated as the count of risk alleles presents on each subject per SNP <inline-formula><mml:math id="M4"><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mi>G</mml:mi><mml:msub><mml:mi>S</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle='true'><mml:msubsup><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>65</mml:mn></mml:mrow></mml:msubsup><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mstyle><mml:mo>;</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo>&#x0007B;</mml:mo><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mn>2</mml:mn><mml:mo>&#x0007D;</mml:mo></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>. Where <italic>x</italic><sub><italic>ij</italic></sub> are the count of risk alleles in the <italic>j<sup>th</sup></italic> SNP for the <italic>i<sup>th</sup></italic> subject. Risk alleles for the inputted SNPs were given by the expected allele count <italic>x</italic><sub><italic>ij</italic></sub> being this a continuous number ranging from [0, 2].</p>
</sec>
<sec>
<title>Logistic regression</title>
<p>The probability of diabetes peculiar to subject <italic>i<sup>th</sup></italic> was given by a linear predictor with a logit link (Dobson, <xref ref-type="bibr" rid="B7">2002</xref>) in the following form:
<disp-formula id="E1"><label>(1)</label><mml:math id="M1"><mml:mrow><mml:msub><mml:mi>p</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mi>E</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>&#x0007C;</mml:mo><mml:mo>&#x000B7;</mml:mo></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>exp</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>&#x003B7;</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mn>1</mml:mn><mml:mo>+</mml:mo><mml:mtext>exp</mml:mtext><mml:mo stretchy='false'>(</mml:mo><mml:msub><mml:mi>&#x003B7;</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:mfrac></mml:mrow></mml:math></disp-formula>
where <italic>E</italic>(<italic>y</italic><sub><italic>i</italic></sub>|&#x000B7;) is the expected value for the diabetes status (<italic>y</italic><sub><italic>i</italic></sub>); <italic>p<sub><italic>i</italic></sub> is</italic> the subject-specific probability of developing T2D given a set of covariates for subject <italic>i</italic> and <italic>exp</italic>(&#x000B7;) is the exponential function. The linear predictor (&#x003B7;<sub><italic>i</italic></sub>) for a model built with only the non-genetic predictor variables is described in equation (2) and obtained as follows:
<disp-formula id="E2"><label>(2)</label><mml:math id="M2"><mml:mrow><mml:msub><mml:mi>&#x003B7;</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>&#x003B1;</mml:mi><mml:mn>0</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>&#x003B1;</mml:mi><mml:mn>1</mml:mn></mml:msub><mml:msub><mml:mi>c</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>&#x003B1;</mml:mi><mml:mn>2</mml:mn></mml:msub><mml:msub><mml:mi>s</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>&#x003B1;</mml:mi><mml:mn>3</mml:mn></mml:msub><mml:msub><mml:mi>b</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>&#x003B1;</mml:mi><mml:mn>4</mml:mn></mml:msub><mml:msub><mml:mi>l</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></disp-formula>
where &#x003B1;<sub>0</sub> is an intercept common to all observations, plus a regression on the &#x0201C;fixed effects&#x0201D;; and &#x003B1;<sub>1</sub> to &#x003B1;<sub>4</sub> are the corresponding regression coefficients or effects, for each one of the included variables.</p>
</sec>
<sec>
<title>Neural network</title>
<p>Bayesian Regularized Neural Network is a machine learning algorithm that is suited for classification problems (Shekhar and Amin, <xref ref-type="bibr" rid="B35">1992</xref>; Neal, <xref ref-type="bibr" rid="B27">1996</xref>; Gianola et al., <xref ref-type="bibr" rid="B11">2011</xref>; P&#x000E9;rez-Rodr&#x000ED;guez et al., <xref ref-type="bibr" rid="B30">2012</xref>). The Neural network aims to reduce the errors in the training set, adjust the parameters and to respond properly to novel inputs. One of the simplest neural networks is composed of three layers: the input layer which consists of the input of all the covariates for each one of the subject&#x00027;s <italic>x</italic><sub><italic>ij</italic></sub> (<italic>i</italic> &#x0003D; 1&#x02026; 5245; <italic>j is the quantity of covariates included per model)</italic> the hidden layer that contains <italic>s</italic> neurons; and the output layer. Each input connects to each one of the neurons creating an unknown weight <italic>w<sub><italic>i</italic></sub></italic> for each input. This inner product between the weights and the input vector in each neuron of the hidden layer is given by equation:
<disp-formula id="E3"><label>(3)</label><mml:math id="M3"><mml:mrow><mml:msub><mml:mi>u</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mn>0</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:mstyle displaystyle='true'><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mtext>&#x0200A;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x0200A;</mml:mtext><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>65</mml:mn></mml:mrow></mml:munderover><mml:mrow><mml:msub><mml:mi>&#x003B2;</mml:mi><mml:mrow><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mstyle><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:mi>s</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mtext>neurons</mml:mtext></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
where <italic>u</italic><sub><italic>ki</italic></sub> in the hidden layer is transformed by applying an activation function. We used the tangent hyperbolic function: <inline-formula><mml:math id="M5"><mml:mrow><mml:mi>g</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>a</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>exp</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mn>2</mml:mn><mml:mi>a</mml:mi></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x02212;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>exp</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mn>2</mml:mn><mml:mi>a</mml:mi></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mfrac></mml:mrow></mml:math></inline-formula>, which maps the inputs into the closed interval [&#x02212;1, 1]. The output from each of the neurons is combined linearly <inline-formula><mml:math id="M6"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mi>&#x003BC;</mml:mi><mml:mo>+</mml:mo><mml:mstyle displaystyle='true'><mml:msubsup><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>s</mml:mi></mml:msubsup><mml:mrow><mml:msub><mml:mi>w</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mi>g</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:msub><mml:mi>u</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:mstyle></mml:mrow></mml:math></inline-formula> and finally transformed by applying the function <inline-formula><mml:math id="M7"><mml:mrow><mml:mi>h</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>a</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mrow><mml:mn>1</mml:mn><mml:mo>+</mml:mo><mml:mi>e</mml:mi><mml:mi>x</mml:mi><mml:mi>p</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mo>&#x02212;</mml:mo><mml:mi>a</mml:mi></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mfrac></mml:mrow></mml:math></inline-formula>, which maps the inputs into an open interval (0, 1), so that the output can be interpreted as a probability, that is <italic>y</italic><sub><italic>i</italic></sub> &#x0003D; <italic>h</italic>(<italic>z</italic><sub><italic>i</italic></sub>). Since the activation function can be a nonlinear function, it allows the classifier to capture non-linear effects.</p>
<p>Neural network models were fitted using the Bayesian approach (MacKay, <xref ref-type="bibr" rid="B20a">1992</xref>) implemented in the Software for Flexible Bayesian Modeling (FBM) written by Neal (<xref ref-type="bibr" rid="B27">1996</xref>) which is available freely at <ext-link ext-link-type="uri" xlink:href="http://www.cs.toronto.edu/~radford/fbm.software.html">www.cs.toronto.edu/&#x0007E;radford/fbm.software.html</ext-link>. For our analyses, a total of 6 neurons were included in the hidden layer to reduce the computational burden, since the results with 9 neurons yielded almost identical results.</p>
</sec>
<sec>
<title>Sequence of models</title>
<p>Six models were built, with the aim of evaluating the genetic effects of the 65 variants associated to T2D as risk factors. Our starting point was a Baseline model (BASE), which is composed of only the non-genetic covariates or fixed effects: cohort, age at last contact, gender and principal components. BASE<sub>BMI</sub> extends model BASE by incorporating BMI in the set of predictors. Since BMI co-varies with T2D, is reasonable to think that pleiotropic effects may exist. Subsequently, we generated clinical models that included genetic information. GEN65 extends BASE by incorporating the 65 SNPs associated to T2D; each SNP contains the count of risk alleles {0, 1, 2}. The GENS extends BASE model by adding the Genetic Risk Score (GS) consisting of the sum of all variants that increase diabetes risk. To test whether there are genetic effects on T2D after accounting for BMI, models GENS<sub>BMI</sub> and GEN<sub>BMI</sub> are extensions of the model of GENS and GEN65, respectively, including BMI. Finally, GEN<sub>BMI</sub> was also extended accommodating SNPs by BMI interactions, into a model called GENB<sub>SNPs &#x000D7; BMI</sub>. Table <xref ref-type="table" rid="T1">1</xref>, shows the components inside of each one of the models tested.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p><bold>Description of the model&#x00027;s components</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" colspan="5"><bold>Model components</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left"><bold>Model name</bold></td>
<td align="center"><bold>Covariates (age, gender, PCs, cohort and exposure time)</bold></td>
<td align="center"><bold>BMI</bold></td>
<td align="center"><bold>65 SNPs</bold></td>
<td align="center"><bold>Genetic score</bold></td>
</tr>
<tr>
<td align="left">BASE</td>
<td align="center">&#x02713;</td>
<td/>
<td/>
<td/>
</tr>
<tr>
<td align="left">BASE<sub>BMI</sub></td>
<td align="center">&#x02713;</td>
<td align="center">&#x02713;</td>
<td/>
<td/>
</tr>
<tr>
<td align="left">GEN65</td>
<td align="center">&#x02713;</td>
<td/>
<td align="center">&#x02713;</td>
<td/>
</tr>
<tr>
<td align="left">GEN65<sub>BMI</sub></td>
<td align="center">&#x02713;</td>
<td align="center">&#x02713;</td>
<td align="center">&#x02713;</td>
<td/>
</tr>
<tr>
<td align="left">GENS</td>
<td align="center">&#x02713;</td>
<td/>
<td/>
<td align="center">&#x02713;</td>
</tr>
<tr>
<td align="left">GENS<sub>BMI</sub></td>
<td align="center">&#x02713;</td>
<td align="center">&#x02713;</td>
<td/>
<td align="center">&#x02713;</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec>
<title>Estimated effects and confidence intervals</title>
<p>The estimated effects of gene markers and other covariates for the risk of T2D were calculated and displayed in terms of Odds Ratio (OR). The BASE model was used to estimate the effects for all the non-genetic covariates. In addition <italic>P</italic>-values were used to discriminate SNPs association to T2D and a 95% Confidence Interval of the OR was built to determine the statistical significance of the association between the response and the predictors.</p>
</sec>
</sec>
<sec>
<title>Predictive ability</title>
<p>To evaluate the risk assessment models, a 10-fold cross-validation was used to compare the accuracy of their respective predictions. Each of the subjects within the data was assigned randomly to the 10 folds. The testing sample consisted of a subset of 1/10th of the data, and training would take the rest of the sample in order to achieve an optimal predictive model. Predictive ability of the models was assessed with the Receiver Operating Characteristic Curve (Fawcett, <xref ref-type="bibr" rid="B10">2006</xref>), using the R package &#x0201C;pROC&#x0201D; (Robin et al., <xref ref-type="bibr" rid="B31a">2013</xref>), in order to obtain their Area Under a Curve (AUC), also referred as C-Statistic.</p>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec>
<title>Descriptive statistics</title>
<p>The characteristics of the 5245 subjects are described and summarized in Table <xref ref-type="table" rid="T2">2</xref>. More than half of the sample were females (<italic>n</italic> &#x0003D; 2864), and only 18% of the overall subjects were diabetic. Within the data set, BMI (mean &#x000B1; standard deviation) for diabetics was 29.9 &#x000B1; 6.0, and healthy subjects 27.3 &#x000B1; 5.1. According to the subjects BMI indexes, 28.2% of the observed subjects demonstrated to be obese (<italic>n</italic> &#x0003D; 1482) and 67.4% of the sample were overweight, while the rest were classified as normal. The mean observed age at which sample subjects acquired T2D was 63 years old. A reduction in the proportion of incidences of T2D can be seen in the Offspring cohort since the subjects of the Original cohort were observed during a longer time when compared to the Offspring cohort.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p><bold>Descriptive statistics of the sample (<italic>n</italic> &#x0003D; 5245)<xref ref-type="table-fn" rid="TN2s"><sup>&#x0002A;</sup></xref></bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left"><bold>Covariates</bold></th>
<th align="center"><bold>Diabetics</bold></th>
<th align="center"><bold>Non-diabetics</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Original Cohort (<italic>n</italic> &#x0003D; 1497)</td>
<td align="center">30.2% (452)</td>
<td align="center">69.8% (1045)</td>
</tr>
<tr>
<td align="left">Offspring Cohort (<italic>n</italic> &#x0003D; 3742)</td>
<td align="center">13.0% (487)</td>
<td align="center">87% (3255)</td>
</tr>
<tr>
<td align="left">Males</td>
<td align="center">20.6% (489)</td>
<td align="center">79.5% (1892)</td>
</tr>
<tr>
<td align="left">Females</td>
<td align="center">15.7% (450)</td>
<td align="center">84.3% (2414)</td>
</tr>
<tr>
<td align="left">BMI (mean &#x000B1; s.d.)</td>
<td align="center">29.9 &#x000B1; 5.9</td>
<td align="center">27.3 &#x000B1; 5.1</td>
</tr>
<tr>
<td align="left">Exposure Time (mean &#x000B1; s.d.)</td>
<td align="center">78.8 &#x000B1; 10.6</td>
<td align="center">72.9 &#x000B1; 11.8</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="TN2s"><label>&#x0002A;</label><p><italic>Frequency of subjects per division are enclosed between parenthesis (n).</italic></p></fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec>
<title>Genetic score</title>
<p>GS is a subject specific count of all the risk alleles in each one of the SNPs reported to be associated with risk of T2D. Table <xref ref-type="table" rid="T3">3</xref> shows a summary of the GS for both control and cases. GS ranged from 52 to 86, which indicates that each individual had at least one risk allele for T2D in almost every SNP. Individuals with a high genetic score presented a greater cumulative incidence of T2D, in comparison to subjects with a low risk score.</p>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p><bold>Genetic score frequencies per quartile</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left"><bold>Genetic Score</bold></th>
<th align="center" colspan="2"><bold>Frequencies by diabetes status</bold></th>
</tr>
<tr>
<th align="left"><bold>Quartiles</bold></th>
<th align="center"><bold>Non-diabetic, percentage (<italic>n</italic>)</bold></th>
<th align="center"><bold>Diabetics, percentage (<italic>n</italic>)</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">&#x0003C; 66.32</td>
<td align="center">86% (1132)</td>
<td align="center">14% (182)</td>
</tr>
<tr>
<td align="left">66.32 &#x02264; GS &#x0003C; 69.55</td>
<td align="center">85% (1108)</td>
<td align="center">15% (199)</td>
</tr>
<tr>
<td align="left">69.55&#x02264; GS &#x0003C; 72.75</td>
<td align="center">82% (1072)</td>
<td align="center">18% (236)</td>
</tr>
<tr>
<td align="left">&#x02265;72.75</td>
<td align="center">75% (992)</td>
<td align="center">25% (322)</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec>
<title>Estimated effects</title>
<p>NN is a classifier that yields multiple estimated effects (depending on the number of neurons), which complicates the interpretation of the results. For that reason, estimates shown in this section are results from the Logistic Regression model.</p>
<p>Table <xref ref-type="table" rid="T4">4</xref> shows the estimated Odds Ratio for the significant covariates in all models. If these covariates are not augmenting T2D risk, we would expect an OR estimate and both limits of the 95% confidence interval to include 1.0. All covariates except the Principal Components were significantly associated to diabetes (<italic>P</italic> &#x0003C; 0.01). Fixed effects estimates across the models were consistent for each of the covariates (i.e., the inclusion or exclusion of effects in the model produced very little variation of the estimated effects in the remaining effects in the model). Therefore, describing one model (GENS<sub>BMI</sub>) suffices to understand the effect of the covariates in the prediction of diabetes. For GENS<sub>BMI</sub>, gender had an OR &#x0003D; 0.60 which implies a much lower risk of developing T2D in women when compared to men. The Cohort&#x00027;s odds ratio (OR &#x0003D; 0.45), implies a lower risk of T2D in Offspring members in comparison to the Original Cohort. Exposure time had an OR of 1.03, resulting in a 3% increase in risk of development for every year of exposure. The OR for the Genetic Score is approximated to 1.1, which implies an increase in risk of developing T2D, with the increase in value of the genetic score. The OR for BMI was 1.13 in the models that included BMI. This value demonstrates there is a 13% increment in risk of T2D when increasing 1 kg/m<sup>2</sup> in BMI.</p>
<table-wrap position="float" id="T4">
<label>Table 4</label>
<caption><p><bold>Estimated odd ratios (95% C.I) for covariates in risk assessment models<xref ref-type="table-fn" rid="TN4ss"><sup>&#x0002A;&#x0002A;</sup></xref></bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left"><bold>Covariates</bold></th>
<th align="center"><bold>BASE</bold></th>
<th align="center"><bold>BASE<sub><bold>BMI</bold></sub></bold></th>
<th align="center"><bold>GEN65</bold></th>
<th align="center"><bold>GEN65<sub><bold>BMI</bold></sub></bold></th>
<th align="center"><bold>GENS</bold></th>
<th align="center"><bold>GENS<sub><bold>BMI</bold></sub></bold></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Gender</td>
<td align="center">0.63 (0.54&#x02013;0.73)</td>
<td align="center">0.61 (0.52&#x02013;0.71)</td>
<td align="center">0.61 (0.53&#x02013;0.72)</td>
<td align="center">0.59 (0.51&#x02013;0.70)</td>
<td align="center">0.62 (0.53&#x02013;0.72)</td>
<td align="center">0.60 (0.51&#x02013;0.70)</td>
</tr>
<tr>
<td align="left">Cohort</td>
<td align="center">0.52 (0.42&#x02013;0.64)</td>
<td align="center">0.45 (0.36&#x02013;0.56)</td>
<td align="center">0.51 (0.40&#x02013;0.64)</td>
<td align="center">0.45 (0.35&#x02013;0.57)</td>
<td align="center">0.52 (0.42&#x02013;0.65)</td>
<td align="center">0.45 (0.36&#x02013;0.57)</td>
</tr>
<tr>
<td align="left">Exposure Time</td>
<td align="center">1.03 (1.02&#x02013;1.04)</td>
<td align="center">1.04 (1.03&#x02013;1.05)</td>
<td align="center">1.03 (1.02&#x02013;1.04)</td>
<td align="center">1.04 (1.03&#x02013;1.05)</td>
<td align="center">1.03 (1.02&#x02013;1.04)</td>
<td align="center">1.04 (1.03&#x02013;1.05)</td>
</tr>
<tr>
<td align="left">GS</td>
<td align="center">&#x02013;</td>
<td align="center">&#x02013;</td>
<td align="center">&#x02013;</td>
<td align="center">&#x02013;</td>
<td align="center">1.07 (1.05&#x02013;1.08)</td>
<td align="center">1.07 (1.05&#x02013;1.09)</td>
</tr>
<tr>
<td align="left">BMI</td>
<td align="center">&#x02013;</td>
<td align="center">1.12 (1.11&#x02013;1.14)</td>
<td align="center">&#x02013;</td>
<td align="center">1.13 (1.11&#x02013;1.15)</td>
<td align="center">&#x02013;</td>
<td align="center">1.13 (1.11&#x02013;1.14)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="TN4ss"><label>&#x0002A;&#x0002A;</label><p><italic>Odds Ratio for the genetic score are only reported for the only two models where it was included.</italic></p></fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec>
<title>SNP estimated effects</title>
<p>Table <xref ref-type="table" rid="T5">5</xref> provides the <italic>P</italic>-value of the 21 SNPs that gave a statistical association with T2D in our study; we also present the <italic>P</italic>-value of those SNPs, in association to BMI and WHR as reported in the Giant Consortium (Heid et al., <xref ref-type="bibr" rid="B12">2010</xref>; Speliotes et al., <xref ref-type="bibr" rid="B38">2010</xref>). Only four SNPs found in the genes GLIS3, PTPRD, TCF7L2, and TSPAN8; had an association with a <italic>P</italic>-value less than 0.001. The SNPs: rs11717195, rs17301514, rs4299828, rs11063069, and rs10842994 have a <italic>P</italic>-value less than 0.1, therefore suggested as possible risk genetic variants. A total of three SNPs, each pertaining to a different gene, were found to be associated to WHR. These genes were: <italic>GCKR</italic> (Glucokinase Regulatory Protein), <italic>IGF2BP2</italic> (Insulin-Like Growth Factor 2 MRNA Binding Protein 2), and <italic>PTPRD</italic> (protein tyrosine phosphatase receptor D). In addition, two SNPs strongly associated to BMI, were located in the genes <italic>IRS1</italic> (Insulin Receptor Substrate 1) and <italic>TCF7L2</italic> (Transcription Factor 7-Like 2).</p>
<table-wrap position="float" id="T5">
<label>Table 5</label>
<caption><p><bold><italic>P</italic>-value for the evaluated SNPs and their reported <italic>P</italic>-values for association to WHR and BMI in the giant consortium</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left"><bold>SNP</bold></th>
<th align="left"><bold>Gene</bold></th>
<th align="center"><bold><italic>P</italic>-value</bold></th>
<th align="center"><bold>BMI <italic>P</italic>-value<xref ref-type="table-fn" rid="TN5sss"><sup>&#x0002A;&#x0002A;&#x0002A;</sup></xref></bold></th>
<th align="center"><bold>WHR <italic>P</italic>-value<xref ref-type="table-fn" rid="TN5sss"><sup>&#x0002A;&#x0002A;&#x0002A;</sup></xref></bold></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">rs780094</td>
<td align="left"><italic>GCKR</italic></td>
<td align="center">0.0029</td>
<td align="center">0.093</td>
<td align="center">0.00026</td>
</tr>
<tr>
<td align="left">rs2943640</td>
<td align="left"><italic>IRS1</italic></td>
<td align="center">0.0418</td>
<td align="center">0.006</td>
<td align="center">0.60</td>
</tr>
<tr>
<td align="left">rs11717195</td>
<td align="left"><italic>ADCY5</italic></td>
<td align="center">0.0508</td>
<td align="center">0.049</td>
<td align="center">0.10</td>
</tr>
<tr>
<td align="left">rs4402960</td>
<td align="left"><italic>IGF2BP2</italic></td>
<td align="center">0.0131</td>
<td align="center">0.020</td>
<td align="center">0.003</td>
</tr>
<tr>
<td align="left">rs17301514</td>
<td align="left"><italic>ADIPOQ</italic></td>
<td align="center">0.0609</td>
<td align="center">0.155</td>
<td align="center">0.450</td>
</tr>
<tr>
<td align="left">rs7756992</td>
<td align="left"><italic>CDKAL1</italic></td>
<td align="center">0.0337</td>
<td align="center">0.070</td>
<td align="center">0.230</td>
</tr>
<tr>
<td align="left">rs4299828</td>
<td align="left"><italic>IRS4</italic></td>
<td align="center">0.0991</td>
<td align="center">0.474</td>
<td align="center">0.530</td>
</tr>
<tr>
<td align="left">rs3734621</td>
<td align="left"><italic>KIF6</italic></td>
<td align="center">0.0378</td>
<td align="center">0.082</td>
<td align="center">0.190</td>
</tr>
<tr>
<td align="left">rs849135</td>
<td align="left"><italic>JAZF1</italic></td>
<td align="center">0.0418</td>
<td align="center">0.057</td>
<td align="center">0.120</td>
</tr>
<tr>
<td align="left">rs10758593</td>
<td align="left"><italic>GLIS3</italic></td>
<td align="center">0.000532</td>
<td align="center">0.790</td>
<td align="center">0.190</td>
</tr>
<tr>
<td align="left">rs16927668</td>
<td align="left"><italic>PTPRD</italic></td>
<td align="center">0.0012</td>
<td align="center">0.999</td>
<td align="center">0.006</td>
</tr>
<tr>
<td align="left">rs10811661</td>
<td align="left"><italic>CDKN2B</italic></td>
<td align="center">0.0050</td>
<td align="center">0.891</td>
<td align="center">0.110</td>
</tr>
<tr>
<td align="left">rs7903146</td>
<td align="left"><italic>TCF7L2</italic></td>
<td align="center">1.23E-06</td>
<td align="center">0.00024</td>
<td align="center">0.310</td>
</tr>
<tr>
<td align="left">rs163184</td>
<td align="left"><italic>KCNQ1</italic></td>
<td align="center">0.0264</td>
<td align="center">0.887</td>
<td align="center">0.590</td>
</tr>
<tr>
<td align="left">rs10830963</td>
<td align="left"><italic>MTNR1B</italic></td>
<td align="center">0.02918</td>
<td align="center">0.211</td>
<td align="center">0.42</td>
</tr>
<tr>
<td align="left">rs11063069</td>
<td align="left"><italic>CCND2</italic></td>
<td align="center">0.066935</td>
<td align="center">0.127</td>
<td align="center">0.49</td>
</tr>
<tr>
<td align="left">rs10842994</td>
<td align="left"><italic>KLHDC5</italic></td>
<td align="center">0.065763</td>
<td align="center">0.367</td>
<td align="center">0.53</td>
</tr>
<tr>
<td align="left">rs7955901</td>
<td align="left"><italic>TSPAN8/ LGR5</italic></td>
<td align="center">0.000192</td>
<td align="center">0.836</td>
<td align="center">0.18</td>
</tr>
<tr>
<td align="left">rs12427353</td>
<td align="left"><italic>HNF1A</italic></td>
<td align="center">0.02744</td>
<td align="center">0.746</td>
<td align="center">0.61</td>
</tr>
<tr>
<td align="left">rs7177055</td>
<td align="left"><italic>HMG20A</italic></td>
<td align="center">0.014363</td>
<td align="center">0.051</td>
<td align="center">0.23</td>
</tr>
<tr>
<td align="left">rs11651052</td>
<td align="left"><italic>TCFL4</italic></td>
<td align="center">0.008092</td>
<td align="center">&#x02013;</td>
<td align="center">&#x02013;</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="TN5sss"><label>&#x0002A;&#x0002A;&#x0002A;</label><p><italic>P-values of BMI and waist-to-hip ratio (WHR) as reported by GIANT consortium. (Lindgren et al., <xref ref-type="bibr" rid="B17">2009</xref>).</italic></p></fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec>
<title>Interaction with BMI</title>
<p>Our results suggest SNP by BMI interaction with five SNPs at a <italic>P</italic> &#x0003C; 0.05, and 8 genes SNPs with <italic>P</italic> &#x0003C; 0.1. These results along with the estimated OR are provided in Table <xref ref-type="table" rid="T6">6</xref>, for all SNPs. The location of the interacting SNPs are in/near the following genes: the Transcription Factor 7 like 2 (<italic>TCFL2</italic>), Gastric Inhibitory Polypeptide Receptor (<italic>GIPR</italic>), Growth Factor Receptor-Bound Protein (<italic>GRB14</italic>), G1/S-Specific Cyclin D2 (<italic>CCND2</italic>), Transducin-Like Enhancer of Split 1 (<italic>TLE1</italic>), Cartilage Intermediate Layer Protein 2 (<italic>CILP2</italic>) and HNF1 homeobox B (<italic>HNF1B</italic>). Genes <italic>CILP2</italic>, <italic>HNF1B</italic>, and <italic>HMGA2</italic>, were confirmed to have an association with BMI (<italic>P</italic> &#x0003C; 0.001). We did not detect any significant interaction in the model where genetic effects were incorporated as a Genetic Score (i.e., GENS<sub>BMI</sub>).</p>
<table-wrap position="float" id="T6">
<label>Table 6</label>
<caption><p><bold>Odds Ratio of SNP by BMI interactions of highest significance</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left"><bold>SNP</bold></th>
<th align="left"><bold>Gene</bold></th>
<th align="center"><bold>Odds Ratio (95%C.I)</bold></th>
<th align="center"><bold><italic>P</italic>-value</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">rs8108269</td>
<td align="left"><italic>GIPR</italic></td>
<td align="center">1.02 (1.0&#x02013;1.05)</td>
<td align="center">0.0896</td>
</tr>
<tr>
<td align="left">rs13389219</td>
<td align="left"><italic>GRB14</italic></td>
<td align="center">1.02 (1.00&#x02013;1.04)</td>
<td align="center">0.0421</td>
</tr>
<tr>
<td align="left">rs11063069</td>
<td align="left"><italic>CCND2</italic></td>
<td align="center">1.02 (0.99&#x02013;1.05)</td>
<td align="center">0.0870</td>
</tr>
<tr>
<td align="left">rs7903146</td>
<td align="left"><italic>TCF7L2</italic></td>
<td align="center">1.02 (1.00&#x02013;1.04)</td>
<td align="center">0.0404</td>
</tr>
<tr>
<td align="left">rs2796441</td>
<td align="left"><italic>TLE1</italic></td>
<td align="center">0.97 (0.95&#x02013;1.00)</td>
<td align="center">0.0231</td>
</tr>
<tr>
<td align="left">rs10401969</td>
<td align="left"><italic>CILP2</italic></td>
<td align="center">1.08 (1.03&#x02013;1.13)</td>
<td align="center">0.001906</td>
</tr>
<tr>
<td align="left">rs11651052</td>
<td align="left"><italic>HNF1B</italic></td>
<td align="center">0.95 (1.03&#x02013;1.13)</td>
<td align="center">0.000124</td>
</tr>
<tr>
<td align="left">rs2261181</td>
<td align="left"><italic>HMGA2</italic></td>
<td align="center">0.96 (0.93&#x02013;0.99)</td>
<td align="center">0.005184</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec>
<title>Predictive ability</title>
<p>Predictive ability of the models was evaluated with a ten-fold cross validation and measured in terms of AUC. Values of the AUC in cross validation, for all risk assessment models in the Logistic Regression and Neural Networks, are reported in Table <xref ref-type="table" rid="T7">7</xref>. In addition, ROC Curves for each risk assessment model tested with the Neural Networks, can be found in Table <xref ref-type="supplementary-material" rid="SM1">B</xref> the Supplementary Material.</p>
<table-wrap position="float" id="T7">
<label>Table 7</label>
<caption><p><bold>Predictive ability of the models evaluated with the area under the receiver operating curve (AUC)</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left"><bold>Risk assessment models</bold></th>
<th align="center"><bold>LR</bold></th>
<th align="center"><bold>NN</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">BASE</td>
<td align="center">0.6658</td>
<td align="center">0.6666</td>
</tr>
<tr>
<td align="left">BASE<sub>BMI</sub></td>
<td align="center">0.7393</td>
<td align="center">0.7354</td>
</tr>
<tr>
<td align="left">GEN65</td>
<td align="center">0.6785</td>
<td align="center">0.6786</td>
</tr>
<tr>
<td align="left">GEN65<sub>BMI</sub></td>
<td align="center">0.7452</td>
<td align="center">0.7411</td>
</tr>
<tr>
<td align="left">GENS</td>
<td align="center">0.6858</td>
<td align="center">0.6857</td>
</tr>
<tr>
<td align="left">GENS<sub>BMI</sub></td>
<td align="center">0.7495</td>
<td align="center">0.7496</td>
</tr>
<tr>
<td align="left">GENB<sub>SNPxBMI</sub></td>
<td align="center">0.7362</td>
<td align="center">0.7432</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The AUC of the logistic regression in the BASE model was 0.6658 and 0.666, in the LR and NN models respectively. The incorporation of BMI (BASE<sub>BMI</sub>), increased the AUC to 0.739 and 0.735 for LR and NN, respectively. Also, accounting for genetic markers in GEN65, increased the predictive ability of the models by approximately 2%, when compared to the baseline factors alone. We further analyzed the extent to which the predictive accuracy could be improved by adding BMI to the GEN65 model and achieved a discriminative value of 0.745 and 0.741 (LR and NN, respectively), resulting in an increase of approximately 7%. Previous studies have shown a correlation between the increases in weight and body mass with an increase in probabilities of developing T2D. The incorporation of the genetic score after accounting for BMI further increased AUC to 0.750 (i.e., the GENS<sub>BMI</sub> model, for both LR and NN). A difference of approximately 8% in predictive ability was observed in the GEN65<sub>BMI</sub> model, when compared with the baseline model (see Table <xref ref-type="table" rid="T7">7</xref>). The inclusion of the interaction of the SNPs with BMI in T2D, gave an AUC of 0.7362 in the GENB<sub>SNPxBMI</sub> model; with a 0.7% increase when modeled in the Neural Network. Both statistical methods yielded approximately the same AUC. Predictive values show that when strong genetic variants related to T2D are chosen, they substantially improve prediction of risk for T2D.</p>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>In this paper we investigated the effects of including genetic information in preventive risk assessments for T2D, while using different modeling approaches (LR and NN). The effect of including genetic information was examined by adding 65 candidate SNPs for T2D and computing a genetic score based on these SNPs.</p>
<p>Of the 65 SNPs analyzed, 7 SNPs that are located in 4 genes (<italic>GLIS3</italic>, <italic>TCF7L2, LGR5</italic>, and <italic>PTPRD)</italic>, showed a strong association with Type 2 Diabetes. In addition, <italic>IGF2BP2</italic> and <italic>GCKR</italic> have been identified by several meta-analyses (Dupuis et al., <xref ref-type="bibr" rid="B9">2010</xref>; Heid et al., <xref ref-type="bibr" rid="B12">2010</xref>; Speliotes et al., <xref ref-type="bibr" rid="B38">2010</xref>; Morris et al., <xref ref-type="bibr" rid="B24">2012</xref>) as risk genetic variants for Type 2 Diabetes with effects in WHR. The SNPs: rs780094, rs7756992, rs7955901 are in the <italic>GCKR</italic>, <italic>CDKAL1</italic>, and <italic>LGR5</italic> gene regions; with annotated functions of insulin production, pancreatic cell growth, and glucose homeostasis, respectively. <italic>GLIS3</italic> has been listed as a diabetes susceptibility gene due to its role in the generation of pancreatic beta cells; an alteration in the expression of this gene could repress the generation of beta cells, and may be involved in pancreatic dysfunction (Dupuis et al., <xref ref-type="bibr" rid="B9">2010</xref>; Nogueira et al., <xref ref-type="bibr" rid="B28">2013</xref>). <italic>TCF7L2</italic> was observed to have a relationship with BMI in both the DIAGRAM and GIANT consortiums (Lindgren et al., <xref ref-type="bibr" rid="B17">2009</xref>; Morris et al., <xref ref-type="bibr" rid="B24">2012</xref>). It has demonstrated to lower insulin secretion by affecting &#x003B2;-cell responsiveness to insulin; it is also found in chromatin regions in islets (Kiessling and Ehrhart-Bornstein, <xref ref-type="bibr" rid="B16">2006</xref>; Sladek et al., <xref ref-type="bibr" rid="B37">2007</xref>; Lyssenko et al., <xref ref-type="bibr" rid="B19">2008</xref>; Mccarthy and Zeggini, <xref ref-type="bibr" rid="B22">2009</xref>). The gene <italic>PTPRD</italic> (protein tyrosine phosphatase receptor type D) provides a component needed to trigger the reactions for the linkage of the insulin receptor to tissue. However, it was excluded as a risk gene for Type 2 Diabetes by Bektas et al. (<xref ref-type="bibr" rid="B2a">2001</xref>) since none of the mutations did segregate with diabetes. <italic>IRS1</italic> showed an association with BMI through SNP-by-BMI interaction. This genetic variant, with an increased interaction with multiple proteins, has been associated with T2D and obesity, and could lead to the development of insulin resistance (Rung et al., <xref ref-type="bibr" rid="B32">2009</xref>; Caruso et al., <xref ref-type="bibr" rid="B4">2014</xref>).</p>
<p>When analyzing the effects of the inclusion of genetic variants in the prediction of this disease, our results suggest that a vast number of SNPs provide a modest enhancement in the predictive ability of the models. Improvement of these discriminative values, show that the added SNPs capture genetic risk. However, when the interaction of the SNPs by environment (BMI) was included in the model, no further increase was seen. The consistency of AUC throughout the models, with the use of both Neural Network and Logistic Regression, suggests that the use of different statistical approaches neither aided nor reduced the predictive ability of the models. The limitation in predictive accuracy seems to be associated to factors other than the statistical model, such as: the size of the training sample, the number of SNPs included in the model, missing heritability issues and low heritability of the trait. A few concerns about SNPs information, were observed. The first pertains to the imputation uncertainty of the SNPs, since it was not fully taken into account in our analyses. Nevertheless, an alternative methods that consider imputation uncertainty are proposed by Marchini and Howie (<xref ref-type="bibr" rid="B20b">2010</xref>). Secondly, biases could have been produced in the SNPs estimates due to family structure; nevertheless, since the number of families within our sample is large, it is considered to be of minor importance. In our sample of 5245 subjects, 2073 subjects were aggregated from 495 families, (these families contained subjects with at least one relative in the sample), moreover, the size of these families was 4.19 &#x000B1; 6.40 (mean &#x000B1; s.d) members per family.</p>
<p>The most commonly identified covariates used in assessment analyses that provide a high AUC (0.60&#x02013;0.80) as a clinical baseline model have been: age, high blood pressure, and glucose levels between other covariates (Hu et al., <xref ref-type="bibr" rid="B14">2001</xref>; Lyssenko et al., <xref ref-type="bibr" rid="B19">2008</xref>; Meigs et al., <xref ref-type="bibr" rid="B23">2008</xref>; Cooke et al., <xref ref-type="bibr" rid="B6">2012</xref>). Due to the small effects and marginal change that genotyped data provides in risk prediction, they have been used in only a few models to quantify individual disease risk and thus to facilitate personalized management of T2D risk. The ability and the effects of including genetic information into risk prediction, have been widely studied but are still limited. Previous risk assessments were SNPs associated to T2D were included, slightly improved their predictive ability when compared to baseline clinical covariates (Lyssenko et al., <xref ref-type="bibr" rid="B19">2008</xref>; Meigs et al., <xref ref-type="bibr" rid="B23">2008</xref>; Van Hoek et al., <xref ref-type="bibr" rid="B41">2008</xref>; Katsios, <xref ref-type="bibr" rid="B15">2010</xref>; Bao et al., <xref ref-type="bibr" rid="B1">2013</xref>; Lyssenko and Laakso, <xref ref-type="bibr" rid="B20">2013</xref>; Talmud et al., <xref ref-type="bibr" rid="B40">2014</xref>). In her study, Van Hoek et al. (<xref ref-type="bibr" rid="B41">2008</xref>), incorporated 18 SNPs, together with age, sex, and BMI and achieved an AUC of 0.68, yielding only a approximately 2% increase when compared to the baseline model. Furthermore, Lyssenko et al. (<xref ref-type="bibr" rid="B19">2008</xref>), evaluated the inclusion of a genetic score built with 16 SNPs; in addition to, multiple clinical covariates and achieved a discriminative value of 0.74. The addition of a modest amount of SNPs into risk prediction was lately studied by Talmud et al. (<xref ref-type="bibr" rid="B40">2014</xref>), with the use of 65 SNPs found by the DIAGRAM consortium, which were the same used in this study. A genetic score and clinical covariates such as: BMI, triglyceride levels and fasting glucose, altogether with a large data set, resulted in an AUC of 0.75. This last result is consistent with our results in the model GENS<sub>BMI</sub>. A limitation of our study is that we did not take into account other clinical variables that have shown some degree of association with diabetes, such as triglyceride levels, high blood pressure, LDL or HDL, which could have enhanced our results. The Framingham Heart Study provides these variables, but there are missing values in many exams and subjects. To avoid reducing sample size, we only included BMI longitudinally (i.e., account for BMI at the first diabetes record), and we found that genetic signal from the SNPs is captured beyond what could be explained by the BMI. BMI estimated effect on diabetes may result biased since we incorporated BMI as the BMI at first diabetes diagnosis for diabetic subjects and last BMI on record for healthy subjects. However, preliminary analysis (not included in the paper) show us that the effect and their significance, for BMI and other covariables in the models, are insensitive to alternative ways to account for BMI, such as, BMI at the first exam, or maximum BMI of the subjects observed period. Despite our limitations, our study can provide important remarks. The effect of genetic information in the improvement of the prediction accuracy, was evaluated in our models by incorporating 65 SNPs both directly and into a genetic score. In addition, we looked at the inclusion of gene-environment (BMI) and gene-gene interaction into risk prediction. Also, a classical logistic regression and a Neural Network (a non-parametric classification algorithm) were explored.</p>
<p>Prevalence of T2D is highest among individuals with a BMI &#x02265; 40 kg/m<sup>2</sup> (Bays et al., <xref ref-type="bibr" rid="B2">2007</xref>). The increase in central adiposity and percent body fat is associated with an increased risk of T2D; however, not all obese or overweight patients develop T2D, and of those who do, just a proportion is genetically predisposed. Our results show, in agreement with the literature, that BMI serves as a prediction enhancer for T2D. Predictive accuracy yielded better estimates in the baseline model that included BMI; and this was further improved when the genetic effect was also incorporated, giving an AUC difference of a approximately 8% when compared to baseline. Interaction between BMI and the genes: CILP2, HNF1B, and HMGA2 in relation to T2D, was found and reported in Table <xref ref-type="table" rid="T6">6</xref>. HNF1B is a homodimer in charge of the nephron and pancreas development. Mutations in this gene region could result in the development of diabetes. In addition, HMGA2 has transcriptional regulating factors which play a role in adipogenesis and fat storage, inducing obesity.</p>
<p>In summary, this study confirmed the association of 21 genetic variants with T2D. It was observed that individuals who have a high genetic score may have increased probabilities of developing Type 2 Diabetes. Also, accounting for genetic information, either by including SNPs or a Genetic Score in the regression, led to an improvement in prediction accuracy (<italic>AUC</italic>) of approximately 2%. However, modeling strategies such as Neural Network or Logistic Regression did not yield differences in terms of prediction. We also showed that the inclusion of BMI into the risk assessment models, improved the predictive accuracy by approximately 8%. Furthermore, the risk assessment model yielded a modest increment in prediction accuracy when including genetic risk score, even after accounting for BMI. This small improvement suggests that there is still genetic signal involved in the development of T2D, yet to be captured, that could produce effects beyond the increase in BMI. In summary, marker information in addition to commonly used baseline covariates such as BMI, could lead to an overall modest improvement of predictive performance.</p>
</sec>
<sec>
<title>Author contributions</title>
<p>All individuals that helped in the writing process of this manuscript are listed as authors and co-authors, and were part of: the formation of the research, recompilation and management of the data, data analysis and interpretation as well as the redaction and edition of this manuscript.</p>
<sec>
<title>Conflict of interest statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p></sec>
</sec>
</body>
<back>
<ack>
<p>The Framingham Heart Study is conducted and supported by the National Heart, Lung, and Blood Institute (NHLBI) in collaboration with Boston University (Contract No. N01-HC-25195). This manuscript was not prepared in collaboration with investigators of the Framingham Heart Study and does not necessarily reflect the opinions or views of the Framingham Heart Study, Boston University, or NHLBI. Funding for SHARe Affymetrix genotyping was provided by NHLBI Contract N02-HL-64278. SHARe Illumina genotyping was provided under an agreement between Illumina and Boston University. AIV acknowledge support from NIH, National Institute of Diabetes and Digestive and Kidney, grant 7-R01-DK-062148-10-S1. YCK acknowledges support from NIH grant K01DK095032. PPR received financial support from NIH grants: R01GM099992 and R01GM101219. DLA acknowledges support from the National Science Foundation, grant EPS-1158862 and she is grateful to Dr. Juan Arratia, Principal Investigator and Director of the Student Research Development Center at the Universidad Metropolitana.</p>
</ack>
<sec sec-type="supplementary-material" id="s5">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="http://www.frontiersin.org/journal/10.3389/fgene.2015.00075/abstract">http://www.frontiersin.org/journal/10.3389/fgene.2015.00075/abstract</ext-link></p>
<supplementary-material xlink:href="Table1.DOCX" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bao</surname> <given-names>W.</given-names></name> <name><surname>Hu</surname> <given-names>F. B.</given-names></name> <name><surname>Rong</surname> <given-names>S.</given-names></name> <name><surname>Rong</surname> <given-names>Y.</given-names></name> <name><surname>Bowers</surname> <given-names>K.</given-names></name> <name><surname>Schisterman</surname> <given-names>E. F.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Predicting risk of type 2 diabetes mellitus with genetic risk models on the basis of established genome-wide association markers: a systematic review</article-title>. <source>Am. J. Epidemiol</source>. <volume>178</volume>, <fpage>1197</fpage>&#x02013;<lpage>1207</lpage>. <pub-id pub-id-type="doi">10.1093/aje/kwt123</pub-id><pub-id pub-id-type="pmid">24008910</pub-id></citation>
</ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bays</surname> <given-names>H. E.</given-names></name> <name><surname>Chapman</surname> <given-names>R. H.</given-names></name> <name><surname>Grandy</surname> <given-names>S.</given-names></name></person-group> (<year>2007</year>). <article-title>The relationship of body mass index to diabetes mellitus, hypertension and dyslipidaemia: comparison of data from two national surveys</article-title>. <source>Int. J. Clin. Pract</source>. <volume>61</volume>, <fpage>737</fpage>&#x02013;<lpage>747</lpage>. <pub-id pub-id-type="doi">10.1111/j.1742-1241.2007.01336.x</pub-id><pub-id pub-id-type="pmid">17493087</pub-id></citation>
</ref>
<ref id="B2a">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bektas</surname> <given-names>A.</given-names></name> <name><surname>Hughes</surname> <given-names>J. N.</given-names></name> <name><surname>Warram</surname> <given-names>J. H.</given-names></name> <name><surname>Krolewski</surname> <given-names>A. S.</given-names></name> <name><surname>Doria</surname> <given-names>A.</given-names></name></person-group> (<year>2001</year>). <article-title>Type 2 diabetes locus on 12q15 further mapping and mutation screening of two candidate genes</article-title>. <source>Diabetes</source> <volume>50</volume>, <fpage>204</fpage>&#x02013;<lpage>208</lpage>. <pub-id pub-id-type="doi">10.2337/diabetes.50.1.204</pub-id><pub-id pub-id-type="pmid">11147789</pub-id></citation>
</ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Buijsse</surname> <given-names>B.</given-names></name> <name><surname>Simmons</surname> <given-names>R. K.</given-names></name> <name><surname>Griffin</surname> <given-names>S. J.</given-names></name> <name><surname>Schulze</surname> <given-names>M. B.</given-names></name></person-group> (<year>2011</year>). <article-title>Risk assessment tools for identifying individuals at risk of developing type 2 diabetes</article-title>. <source>Epidemiol. Rev</source>. <volume>33</volume>, <fpage>46</fpage>&#x02013;<lpage>62</lpage>. <pub-id pub-id-type="doi">10.1093/epirev/mxq019</pub-id><pub-id pub-id-type="pmid">21622851</pub-id></citation>
</ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Caruso</surname> <given-names>M.</given-names></name> <name><surname>Ma</surname> <given-names>D.</given-names></name> <name><surname>Msallaty</surname> <given-names>Z.</given-names></name> <name><surname>Lewis</surname> <given-names>M.</given-names></name> <name><surname>Seyoum</surname> <given-names>B.</given-names></name> <name><surname>Al-janabi</surname> <given-names>W.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Increased interaction with insulin receptor substrate 1, a novel abnormality in insulin resistance and type 2 diabetes</article-title>. <source>Diabetes</source> <volume>63</volume>, <fpage>1933</fpage>&#x02013;<lpage>1947</lpage>. <pub-id pub-id-type="doi">10.2337/db13-1872</pub-id><pub-id pub-id-type="pmid">24584551</pub-id></citation>
</ref>
<ref id="B5">
<citation citation-type="web"><person-group person-group-type="author"><collab>CDC (Center for Disease Control).</collab></person-group> (<year>2013</year>). <source>Diabetes Data and Trend</source>. Avaliable online at: <ext-link ext-link-type="uri" xlink:href="http://www.genetichealth.com/DBTS_Genetics_of_Type_2_Diabetes.shtml">www.genetichealth.com/DBTS_Genetics_of_Type_2_Diabetes.shtml</ext-link>.</citation>
</ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cooke</surname> <given-names>J. N.</given-names></name> <name><surname>Ng</surname> <given-names>M. C. Y.</given-names></name> <name><surname>Palmer</surname> <given-names>N. D.</given-names></name> <name><surname>An</surname> <given-names>S. S.</given-names></name> <name><surname>Hester</surname> <given-names>J. M.</given-names></name> <name><surname>Freedman</surname> <given-names>B. I.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Genetic risk assessment of type 2 diabetes-associated polymorphisms in African Americans</article-title>. <source>Diabetes Care</source> <volume>35</volume>, <fpage>287</fpage>&#x02013;<lpage>292</lpage>. <pub-id pub-id-type="doi">10.2337/dc11-0957</pub-id><pub-id pub-id-type="pmid">22275441</pub-id></citation>
</ref>
<ref id="B7">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Dobson</surname> <given-names>A.</given-names></name></person-group> (<year>2002</year>). <article-title>Binary variables and logistic regression</article-title>, in <source>An Introduction to Generalized Linear Models, 2nd Edn</source>. eds <person-group person-group-type="editor"><name><surname>Charfield</surname> <given-names>C.</given-names></name> <name><surname>Zidek</surname> <given-names>J.</given-names></name></person-group> (<publisher-loc>Boca Raton, FL</publisher-loc>: <publisher-name>Chapman and Hall/CRC</publisher-name>), <fpage>120</fpage>&#x02013;<lpage>126</lpage>.</citation>
</ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Drineas</surname> <given-names>P.</given-names></name> <name><surname>Lewis</surname> <given-names>J.</given-names></name> <name><surname>Paschou</surname> <given-names>P.</given-names></name></person-group> (<year>2010</year>). <article-title>Inferring geographic coordinates of origin for Europeans using small panels of ancestry informative markers</article-title>. <source>PLoS ONE</source> <volume>5</volume>: <fpage>e11892</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0011892</pub-id><pub-id pub-id-type="pmid">20805874</pub-id></citation>
</ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dupuis</surname> <given-names>J.</given-names></name> <name><surname>Langenberg</surname> <given-names>C.</given-names></name> <name><surname>Prokopenko</surname> <given-names>I.</given-names></name> <name><surname>Saxena</surname> <given-names>R.</given-names></name> <name><surname>Soranzo</surname> <given-names>N.</given-names></name> <name><surname>Jackson</surname> <given-names>A. U.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>New genetic loci implicated in fasting glucose homeostasis and their impact on type 2 diabetes risk</article-title>. <source>Nat. Genet</source>. <volume>42</volume>, <fpage>105</fpage>&#x02013;<lpage>116</lpage>. <pub-id pub-id-type="doi">10.1038/ng.520</pub-id><pub-id pub-id-type="pmid">20081858</pub-id></citation>
</ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fawcett</surname> <given-names>T.</given-names></name></person-group> (<year>2006</year>). <article-title>An introduction to ROC analysis</article-title>. <source>Pattern Recognit. Lett</source>. <volume>27</volume>, <fpage>861</fpage>&#x02013;<lpage>874</lpage>. <pub-id pub-id-type="doi">10.1016/j.patrec.2005.10.010</pub-id></citation>
</ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gianola</surname> <given-names>D.</given-names></name> <name><surname>Okut</surname> <given-names>H.</given-names></name> <name><surname>Weigel</surname> <given-names>K.</given-names></name> <name><surname>Rosa</surname> <given-names>G.</given-names></name></person-group> (<year>2011</year>). <article-title>Predicting complex quantitative traits with bayesian neural networks: a case study with jersey cows and wheat</article-title>. <source>BMC Genet</source>. <volume>12</volume>:<fpage>87</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2156-12-87</pub-id><pub-id pub-id-type="pmid">21981731</pub-id></citation>
</ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Heid</surname> <given-names>I. M.</given-names></name> <name><surname>Jackson</surname> <given-names>A. U.</given-names></name> <name><surname>Randall</surname> <given-names>J. C.</given-names></name> <name><surname>Winkler</surname> <given-names>T. W.</given-names></name> <name><surname>Qi</surname> <given-names>L.</given-names></name> <name><surname>Steinthorsdottir</surname> <given-names>V.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Meta-analysis identifies 13 new loci associated with waist-hip ratio and reveals sexual dimorphism in the genetic basis of fat distribution</article-title>. <source>Nat. Genet</source>. <volume>42</volume>, <fpage>949</fpage>&#x02013;<lpage>960</lpage>. <pub-id pub-id-type="doi">10.1038/ng.685</pub-id><pub-id pub-id-type="pmid">20935629</pub-id></citation>
</ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Howie</surname> <given-names>B.</given-names></name> <name><surname>Marchini</surname> <given-names>J.</given-names></name> <name><surname>Stephens</surname> <given-names>M.</given-names></name></person-group> (<year>2011</year>). <article-title>Genotype imputation with thousands of genomes</article-title>. <source>G3 (Bethesda)</source> <volume>1</volume>, <fpage>457</fpage>&#x02013;<lpage>470</lpage>. <pub-id pub-id-type="doi">10.1534/g3.111.001198</pub-id><pub-id pub-id-type="pmid">22384356</pub-id></citation>
</ref>
<ref id="B14a">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hu</surname> <given-names>F. B.</given-names></name></person-group> (<year>2011</year>). <article-title>Globalization of diabetes: the role of diet, lifestyle, and genes</article-title>. <source>Diabetes Care</source> <volume>34</volume>, <fpage>1249</fpage>&#x02013;<lpage>1257</lpage>. <pub-id pub-id-type="doi">10.2337/dc11-0442</pub-id><pub-id pub-id-type="pmid">21617109</pub-id></citation>
</ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hu</surname> <given-names>F. B.</given-names></name> <name><surname>Manson</surname> <given-names>J. E.</given-names></name> <name><surname>Stampfer</surname> <given-names>M. J.</given-names></name> <name><surname>Colditz</surname> <given-names>G.</given-names></name> <name><surname>Liu</surname> <given-names>S.</given-names></name> <name><surname>Solomon</surname> <given-names>C. G.</given-names></name> <etal/></person-group>. (<year>2001</year>). <article-title>Diet, lifestyle, and the risk of type 2 diabetes mellitus in women</article-title>. <source>N. Engl. J. Med</source>. <volume>345</volume>, <fpage>790</fpage>&#x02013;<lpage>797</lpage>. <pub-id pub-id-type="doi">10.1056/NEJMoa010492</pub-id><pub-id pub-id-type="pmid">11556298</pub-id></citation>
</ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Katsios</surname> <given-names>C.</given-names></name></person-group> (<year>2010</year>). <article-title>Individual genomes and personalized medicine: life diversity and complexity editorial</article-title>. <source>Pers. Med</source>. <volume>7</volume>, <fpage>347</fpage>&#x02013;<lpage>350</lpage>. <pub-id pub-id-type="doi">10.2217/pme.10.30</pub-id></citation>
</ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kiessling</surname> <given-names>A</given-names></name> <name><surname>Ehrhart-Bornstein</surname> <given-names>M.</given-names></name></person-group> (<year>2006</year>). <article-title>Transcription factor 7-like 2 (TCFL2) - a novel factor involved in pathogenesis of type 2 diabetes. Comment on: Grant et al., Nature Genetics 2006, Published online 15 January 2006</article-title>. <source>Horm. Metab. Res</source>. <volume>38</volume>, <fpage>137</fpage>&#x02013;<lpage>138</lpage>. <pub-id pub-id-type="doi">10.1055/s-2006-925137</pub-id><pub-id pub-id-type="pmid">16523417</pub-id></citation>
</ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lindgren</surname> <given-names>C. M.</given-names></name> <name><surname>Heid</surname> <given-names>I. M.</given-names></name> <name><surname>Randall</surname> <given-names>J. C.</given-names></name> <name><surname>Lamina</surname> <given-names>C.</given-names></name> <name><surname>Steinthorsdottir</surname> <given-names>V.</given-names></name> <name><surname>Qi</surname> <given-names>L.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>Genome-wide association scan meta-analysis identifies three loci influencing adiposity and fat distribution</article-title>. <source>PLoS Genet</source>. <volume>5</volume>:<fpage>e1000508</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1000508</pub-id><pub-id pub-id-type="pmid">19557161</pub-id></citation>
</ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lindstrom</surname> <given-names>J.</given-names></name> <name><surname>Tuomilehto</surname> <given-names>J.</given-names></name></person-group> (<year>2003</year>). <article-title>The diabetes risk score</article-title>. <source>Diabetes Care</source> <volume>26</volume>, <fpage>725</fpage>&#x02013;<lpage>731</lpage>. <pub-id pub-id-type="doi">10.2337/diacare.26.3.725</pub-id><pub-id pub-id-type="pmid">12610029</pub-id></citation>
</ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lyssenko</surname> <given-names>V.</given-names></name> <name><surname>Jonsson</surname> <given-names>A.</given-names></name> <name><surname>Almgren</surname> <given-names>P.</given-names></name> <name><surname>Pulizzi</surname> <given-names>N.</given-names></name> <name><surname>Isomaa</surname> <given-names>B.</given-names></name> <name><surname>Tuomi</surname> <given-names>T.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>Clinical risk factors, DNA variants, and the development of type 2 diabetes</article-title>. <source>N. Engl. J. Med</source>. <volume>359</volume>, <fpage>2220</fpage>&#x02013;<lpage>2232</lpage>. <pub-id pub-id-type="doi">10.1056/NEJMoa0801869</pub-id><pub-id pub-id-type="pmid">19020324</pub-id></citation>
</ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lyssenko</surname> <given-names>V.</given-names></name> <name><surname>Laakso</surname> <given-names>M.</given-names></name></person-group> (<year>2013</year>). <article-title>Genetic screening for the risk of type 2 diabetes: worthless or valuable?</article-title> <source>Diabetes Care</source> <volume>36</volume>(<supplement>Suppl. 2</supplement>), <fpage>S120</fpage>&#x02013;<lpage>S126</lpage>. <pub-id pub-id-type="doi">10.2337/dcS13-2009</pub-id><pub-id pub-id-type="pmid">23882036</pub-id></citation>
</ref>
<ref id="B20a">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>MacKay</surname> <given-names>D. J. C.</given-names></name></person-group> (<year>1992</year>). <article-title>A practical Bayesian framework for backpropagation networks</article-title>. <source>Neural Comput</source>. <volume>4</volume>, <fpage>448</fpage>&#x02013;<lpage>472</lpage>. <pub-id pub-id-type="pmid">18255558</pub-id></citation>
</ref>
<ref id="B21">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Manzella</surname> <given-names>D.</given-names></name></person-group> (<year>2007</year>). <source>Insulin and Diabetes</source>. Available online at: About.com. <ext-link ext-link-type="uri" xlink:href="http://diabetes.about.com/od/whatisdiabetes/p/insulin.htm">http://diabetes.about.com/od/whatisdiabetes/p/insulin.htm</ext-link>.</citation>
</ref>
<ref id="B20b">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marchini</surname> <given-names>J.</given-names></name> <name><surname>Howie</surname> <given-names>B.</given-names></name></person-group> (<year>2010</year>). <article-title>Genotype imputation for genome-wide association studies</article-title>. <source>Nat. Rev. Genet</source>. <volume>11</volume>, <fpage>499</fpage>&#x02013;<lpage>511</lpage>. <pub-id pub-id-type="doi">10.1038/nrg2796</pub-id><pub-id pub-id-type="pmid">20517342</pub-id></citation>
</ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mccarthy</surname> <given-names>M. I.</given-names></name> <name><surname>Zeggini</surname> <given-names>E.</given-names></name></person-group> (<year>2009</year>). <article-title>Genome-wide association studies in type 2 diabetes</article-title>. <source>Curr. Diab. Rep</source>. <volume>9</volume>, <fpage>164</fpage>&#x02013;<lpage>171</lpage>. <pub-id pub-id-type="doi">10.1007/s11892-009-0027-4</pub-id><pub-id pub-id-type="pmid">19323962</pub-id></citation>
</ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meigs</surname> <given-names>J. B.</given-names></name> <name><surname>Shrader</surname> <given-names>P.</given-names></name> <name><surname>Sullivan</surname> <given-names>L. M.</given-names></name> <name><surname>McAteer</surname> <given-names>J. B.</given-names></name> <name><surname>Fox</surname> <given-names>C. S.</given-names></name> <name><surname>Dupuis</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>Genotype score in addition to common risk factors for prediction of type 2 diabetes</article-title>. <source>N. Engl. J. Med</source>. <volume>359</volume>, <fpage>2208</fpage>&#x02013;<lpage>2219</lpage>. <pub-id pub-id-type="doi">10.1056/NEJMoa0804742</pub-id><pub-id pub-id-type="pmid">19020323</pub-id></citation>
</ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Morris</surname> <given-names>A. P.</given-names></name> <name><surname>Voight</surname> <given-names>B. F.</given-names></name> <name><surname>Teslovich</surname> <given-names>T. M.</given-names></name> <name><surname>Ferreira</surname> <given-names>T.</given-names></name> <name><surname>Segr&#x000E8;</surname> <given-names>A. V.</given-names></name> <name><surname>Steinthorsdottir</surname> <given-names>V.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes</article-title>. <source>Nat. Genet</source>. <volume>44</volume>, <fpage>981</fpage>&#x02013;<lpage>990</lpage>. <pub-id pub-id-type="doi">10.1038/ng.2383</pub-id><pub-id pub-id-type="pmid">22885922</pub-id></citation>
</ref>
<ref id="B25">
<citation citation-type="web"><person-group person-group-type="author"><collab>NCBI.</collab></person-group> (<year>2006</year>). <source>Diabetic Status, Original Cohort Exams 1 - 25: Coding Manual</source>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/variable.cgi?study_id=phs000011.v7.p4&#x00026;phv=10779&#x00026;phd=430&#x00026;pha=&#x00026;pht=40&#x00026;phvf=&#x00026;phdf=&#x00026;phaf=&#x00026;phtf=&#x00026;dssp=1&#x00026;consent=&#x00026;temp=1">http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/variable.cgi?study_id&#x0003D;phs000011.v7.p4&#x00026;phv&#x0003D;10779&#x00026;phd&#x0003D;430&#x00026;pha&#x0003D;&#x00026;pht&#x0003D;40&#x00026;phvf&#x0003D;&#x00026;phdf&#x0003D;&#x00026;phaf&#x0003D;&#x00026;phtf&#x0003D;&#x00026;dssp&#x0003D;1&#x00026;consent&#x0003D;&#x00026;temp&#x0003D;1</ext-link></citation>
</ref>
<ref id="B26">
<citation citation-type="web"><person-group person-group-type="author"><collab>NCBI.</collab></person-group> (<year>2008</year>). <source>Diabetic Status, Offspring Cohort Exams 1 - 7: Coding Manual</source>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/document.cgi?study_id=phs000011.v7.p4&#x00026;phv=10797&#x00026;phd=431&#x00026;pha=&#x00026;pht=41&#x00026;phvf=&#x00026;phdf=&#x00026;phaf=&#x00026;phtf=&#x00026;dssp=1&#x00026;consent=&#x00026;temp=1#v13">http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/document.cgi?study_id&#x0003D;phs000011.v7.p4&#x00026;phv&#x0003D;10797&#x00026;phd&#x0003D;431&#x00026;pha&#x0003D;&#x00026;pht&#x0003D;41&#x00026;phvf&#x0003D;&#x00026;phdf&#x0003D;&#x00026;phaf&#x0003D;&#x00026;phtf&#x0003D;&#x00026;dssp&#x0003D;1&#x00026;consent&#x0003D;&#x00026;temp&#x0003D;1&#x00023;v13</ext-link></citation>
</ref>
<ref id="B27">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Neal</surname> <given-names>R. M.</given-names></name></person-group> (<year>1996</year>). <source>Bayesian Learning for Neural Networks Volumen 118 Lecture Notes in Statistics</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Springer</publisher-name>. Ilustrated.</citation>
</ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nogueira</surname> <given-names>T. C.</given-names></name> <name><surname>Paula</surname> <given-names>F. M.</given-names></name> <name><surname>Villate</surname> <given-names>O.</given-names></name> <name><surname>Colli</surname> <given-names>M. L.</given-names></name> <name><surname>Moura</surname> <given-names>R. F.</given-names></name> <name><surname>Cunha</surname> <given-names>D. A.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>GLIS3, a susceptibility gene for type 1 and type 2 diabetes, modulates pancreatic beta cell apoptosis via regulation of a splice variant of the BH3-only protein bim</article-title>. <source>PLoS Genet</source>. <volume>9</volume>:<fpage>e1003532</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1003532</pub-id><pub-id pub-id-type="pmid">23737756</pub-id></citation>
</ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nugent</surname> <given-names>R.</given-names></name></person-group> (<year>2008</year>). <article-title>Chronic diseases in developing countries: health and economic burdens</article-title>. <source>Ann. N.Y. Acad. Sci</source>. <volume>1136</volume>, <fpage>70</fpage>&#x02013;<lpage>79</lpage>. <pub-id pub-id-type="doi">10.1196/annals.1425.027</pub-id><pub-id pub-id-type="pmid">18579877</pub-id></citation>
</ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>P&#x000E9;rez-Rodr&#x000ED;guez</surname> <given-names>P.</given-names></name> <name><surname>Gianola</surname> <given-names>D.</given-names></name> <name><surname>Gonz&#x000E1;lez-Camacho</surname> <given-names>J. M.</given-names></name> <name><surname>Crossa</surname> <given-names>J.</given-names></name> <name><surname>Man&#x000E8;s</surname> <given-names>Y.</given-names></name> <name><surname>Dreisigacker</surname> <given-names>S.</given-names></name></person-group> (<year>2012</year>). <article-title>Comparison between linear and non-parametric regression models for genome-enabled prediction in wheat</article-title>. <source>G3 (Bethesda)</source> <volume>2</volume>, <fpage>1595</fpage>&#x02013;<lpage>1605</lpage>. <pub-id pub-id-type="doi">10.1534/g3.112.003665</pub-id><pub-id pub-id-type="pmid">23275882</pub-id></citation>
</ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Poulsen</surname> <given-names>P.</given-names></name> <name><surname>Kyvik</surname> <given-names>K. O.</given-names></name> <name><surname>Vaag</surname> <given-names>A.</given-names></name> <name><surname>Beck-Nielsen</surname> <given-names>H.</given-names></name></person-group> (<year>1999</year>). <article-title>Heritability of type II (non-insulin-dependent) diabetes mellitus and abnormal glucose tolerance&#x02013;a population-based twin study</article-title>. <source>Diabetologia</source> <volume>42</volume>, <fpage>139</fpage>&#x02013;<lpage>145</lpage>. <pub-id pub-id-type="pmid">10064092</pub-id></citation>
</ref>
<ref id="B31a">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Robin</surname> <given-names>A. X.</given-names></name> <name><surname>Turck</surname> <given-names>N.</given-names></name> <name><surname>Hainard</surname> <given-names>A.</given-names></name> <name><surname>Lisacek</surname> <given-names>F.</given-names></name> <name><surname>Sanchez</surname> <given-names>J.</given-names></name> <name><surname>M&#x000FC;ller</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2013</year>). <source>Package &#x0201C;pROC&#x0201D;</source>. <fpage>1</fpage>&#x02013;<lpage>71</lpage>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://expasy.org/tools/pROC/">http://expasy.org/tools/pROC/</ext-link></citation>
</ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rung</surname> <given-names>J.</given-names></name> <name><surname>Cauchi</surname> <given-names>S.</given-names></name> <name><surname>Albrechtsen</surname> <given-names>A.</given-names></name> <name><surname>Shen</surname> <given-names>L.</given-names></name> <name><surname>Rocheleau</surname> <given-names>G.</given-names></name> <name><surname>Cavalcanti-Proen&#x000E7;a</surname> <given-names>C.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>Genetic variant near IRS1 is associated with type 2 diabetes, insulin resistance and hyperinsulinemia</article-title>. <source>Nat. Genet</source>. <volume>41</volume>, <fpage>1110</fpage>&#x02013;<lpage>1115</lpage>. <pub-id pub-id-type="doi">10.1038/ng.443</pub-id><pub-id pub-id-type="pmid">19734900</pub-id></citation>
</ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sanghera</surname> <given-names>D. K.</given-names></name> <name><surname>Blackett</surname> <given-names>P. R.</given-names></name></person-group> (<year>2012</year>). <article-title>Type 2 diabetes genetics: beyond GWAS</article-title>. <source>J Diabetes Metab</source>. <volume>3</volume>:<fpage>6948</fpage>. <pub-id pub-id-type="doi">10.4172/2155-6156.1000198</pub-id><pub-id pub-id-type="pmid">23243555</pub-id></citation>
</ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Saxena</surname> <given-names>R.</given-names></name> <name><surname>Voight</surname> <given-names>B. F.</given-names></name> <name><surname>Lyssenko</surname> <given-names>V.</given-names></name> <name><surname>Burtt</surname> <given-names>N. P.</given-names></name> <name><surname>de Bakker</surname> <given-names>P. I.</given-names></name> <name><surname>Chen</surname> <given-names>H.</given-names></name> <etal/></person-group>. (<year>2007</year>). <article-title>Genome-wide association analysis identifies loci for type 2 diabetes and triglyceride levels</article-title>. <source>Science</source> <volume>316</volume>, <fpage>1331</fpage>&#x02013;<lpage>1336</lpage>. <pub-id pub-id-type="doi">10.1126/science.1142358</pub-id><pub-id pub-id-type="pmid">17463246</pub-id></citation>
</ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shekhar</surname> <given-names>S.</given-names></name> <name><surname>Amin</surname> <given-names>M. B.</given-names></name></person-group> (<year>1992</year>). <article-title>Generalization by neural networks</article-title>. <source>IEEE Trans. Knowl. Data Eng</source>. <volume>4</volume>, <fpage>177</fpage>&#x02013;<lpage>185</lpage>. <pub-id pub-id-type="doi">10.1109/69.134256</pub-id></citation>
</ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shu</surname> <given-names>X. O.</given-names></name> <name><surname>Long</surname> <given-names>J.</given-names></name> <name><surname>Cai</surname> <given-names>Q.</given-names></name> <name><surname>Qi</surname> <given-names>L.</given-names></name> <name><surname>Xiang</surname> <given-names>Y.-B.</given-names></name> <name><surname>Cho</surname> <given-names>Y. S.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Identification of new genetic risk variants for type 2 diabetes</article-title>. <source>PLoS Genet</source>. <volume>6</volume>:<fpage>e1001127</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1001127</pub-id><pub-id pub-id-type="pmid">20862305</pub-id></citation>
</ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sladek</surname> <given-names>R.</given-names></name> <name><surname>Rocheleau</surname> <given-names>G.</given-names></name> <name><surname>Rung</surname> <given-names>J.</given-names></name> <name><surname>Dina</surname> <given-names>C.</given-names></name> <name><surname>Shen</surname> <given-names>L.</given-names></name> <name><surname>Serre</surname> <given-names>D.</given-names></name> <etal/></person-group>. (<year>2007</year>). <article-title>A genome-wide association study identifies novel risk loci for type 2 diabetes</article-title>. <source>Nature</source> <volume>445</volume>, <fpage>881</fpage>&#x02013;<lpage>885</lpage>. <pub-id pub-id-type="doi">10.1038/nature05616</pub-id><pub-id pub-id-type="pmid">17293876</pub-id></citation>
</ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Speliotes</surname> <given-names>E. K.</given-names></name> <name><surname>Willer</surname> <given-names>C. J.</given-names></name> <name><surname>Berndt</surname> <given-names>S. I.</given-names></name> <name><surname>Monda</surname> <given-names>K. L.</given-names></name> <name><surname>Thorleifsson</surname> <given-names>G.</given-names></name> <name><surname>Jackson</surname> <given-names>A. U.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index</article-title>. <source>Nat. Genet</source>. <volume>42</volume>:<fpage>937</fpage>&#x02013;<lpage>948</lpage>. <pub-id pub-id-type="doi">10.1038/ng.686</pub-id><pub-id pub-id-type="pmid">20935630</pub-id></citation>
</ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Steinthorsdottir</surname> <given-names>V.</given-names></name> <name><surname>Thorleifsson</surname> <given-names>G.</given-names></name> <name><surname>Reynisdottir</surname> <given-names>I.</given-names></name> <name><surname>Benediktsson</surname> <given-names>R.</given-names></name> <name><surname>Jonsdottir</surname> <given-names>T.</given-names></name> <name><surname>Walters</surname> <given-names>G. B.</given-names></name> <etal/></person-group>. (<year>2007</year>). <article-title>A variant in CDKAL1 influences insulin response and risk of type 2 diabetes</article-title>. <source>Nat. Genet</source>. <volume>39</volume>, <fpage>770</fpage>&#x02013;<lpage>775</lpage>. <pub-id pub-id-type="doi">10.1038/ng2043</pub-id><pub-id pub-id-type="pmid">17460697</pub-id></citation>
</ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Talmud</surname> <given-names>P. J.</given-names></name> <name><surname>Cooper</surname> <given-names>J. A.</given-names></name> <name><surname>Morris</surname> <given-names>R. W.</given-names></name> <name><surname>Dudbridge</surname> <given-names>F.</given-names></name> <name><surname>Shah</surname> <given-names>T.</given-names></name> <name><surname>Engmann</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Sixty-five common genetic variants and prediction of type 2 diabetes</article-title>. <source>Diabetes</source>. [Epub ahead of print]. <pub-id pub-id-type="doi">10.2337/db14-1504</pub-id><pub-id pub-id-type="pmid">25475436</pub-id></citation>
</ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van Hoek</surname> <given-names>M.</given-names></name> <name><surname>Dehghan</surname> <given-names>A.</given-names></name> <name><surname>Witteman</surname> <given-names>J. C. M.</given-names></name> <name><surname>van Duijn</surname> <given-names>C. M.</given-names></name> <name><surname>Uitterlinden</surname> <given-names>A. G.</given-names></name> <name><surname>Oostra</surname> <given-names>B. A.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>Predicting type 2 diabetes based on polymorphisms from genome-wide association studies: a population-based study</article-title>. <source>Diabetes</source> <volume>57</volume>, <fpage>3122</fpage>&#x02013;<lpage>3128</lpage>. <pub-id pub-id-type="doi">10.2337/db08-0425</pub-id><pub-id pub-id-type="pmid">18694974</pub-id></citation>
</ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vazquez</surname> <given-names>A. I.</given-names></name> <name><surname>de los Campos</surname> <given-names>G.</given-names></name> <name><surname>Klimentidis</surname> <given-names>Y. C.</given-names></name> <name><surname>Rosa</surname> <given-names>G. J. M.</given-names></name> <name><surname>Gianola</surname> <given-names>D.</given-names></name> <name><surname>Yi</surname> <given-names>N.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>A comprehensive genetic approach for improving prediction of skin cancer risk in humans</article-title>. <source>Genetics</source> <volume>192</volume>, <fpage>1493</fpage>&#x02013;<lpage>1502</lpage>. <pub-id pub-id-type="doi">10.1534/genetics.112.141705</pub-id><pub-id pub-id-type="pmid">23051645</pub-id></citation>
</ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Voight</surname> <given-names>B. F.</given-names></name> <name><surname>Scott</surname> <given-names>L. J.</given-names></name> <name><surname>Steinthorsdottir</surname> <given-names>V.</given-names></name> <name><surname>Morris</surname> <given-names>A. P.</given-names></name> <name><surname>Dina</surname> <given-names>C.</given-names></name> <name><surname>Welch</surname> <given-names>R. P.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Twelve type 2 diabetes susceptibility loci identified through large-scale association analysis</article-title>. <source>Nat. Genet</source>. <volume>42</volume>, <fpage>579</fpage>&#x02013;<lpage>589</lpage>. <pub-id pub-id-type="doi">10.1038/ng.609</pub-id><pub-id pub-id-type="pmid">20581827</pub-id></citation>
</ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yasuda</surname> <given-names>K.</given-names></name> <name><surname>Miyake</surname> <given-names>K.</given-names></name> <name><surname>Horikawa</surname> <given-names>Y.</given-names></name> <name><surname>Hara</surname> <given-names>K.</given-names></name> <name><surname>Osawa</surname> <given-names>H.</given-names></name> <name><surname>Furuta</surname> <given-names>H.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>Variants in KCNQ1 are associated with susceptibility to type 2 diabetes mellitus</article-title>. <source>Nat. Genet</source>. <volume>40</volume>, <fpage>1092</fpage>&#x02013;<lpage>1097</lpage>. <pub-id pub-id-type="doi">10.1038/ng.207</pub-id><pub-id pub-id-type="pmid">18711367</pub-id></citation>
</ref>
</ref-list>
<glossary>
<def-list>
<title>Abbreviations</title>
<def-item><term>T2D</term>
<def><p>Type 2 Diabetes</p></def></def-item>
<def-item><term>AUC</term>
<def><p>Area Under the receiver operating Curve</p></def></def-item>
<def-item><term>GS</term>
<def><p>Genetic Score</p></def></def-item>
<def-item><term>BMI</term>
<def><p>Body Mass Index</p></def></def-item>
<def-item><term>LR</term>
<def><p>Logistic Regression</p></def></def-item>
<def-item><term>NN</term>
<def><p>Neural Network</p></def></def-item>
<def-item><term>OR</term>
<def><p>Odds Ratio.</p></def></def-item>
</def-list>
</glossary>
</back>
</article>