<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="review-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Physio.</journal-id>
<journal-title>Frontiers in Physiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Physio.</abbrev-journal-title>
<issn pub-type="epub">1664-042X</issn>
<publisher>
<publisher-name>Frontiers Research Foundation</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fphys.2012.00323</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Physiology</subject>
<subj-group>
<subject>Review Article</subject>
</subj-group>
</subj-group>
<series-title content-type="Research Topic">Research Topic: From structural to molecular systems biology: experimental and computational approaches to unravel mechanisms of kinase activity regulation in cancer and neurodegeneration</series-title>
</article-categories>
<title-group>
<article-title>Interpretation of the Consequences of Mutations in Protein Kinases: Combined Use of Bioinformatics and Text Mining</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Izarzugaza</surname> <given-names>Jose M. G.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn001">&#x0002A;</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Krallinger</surname> <given-names>Martin</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Valencia</surname> <given-names>Alfonso</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn001">&#x0002A;</xref>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Structural Computational Biology Group, Structural Biology and BioComputing Programme, Spanish National Cancer Research Centre</institution> <country>Madrid, Spain</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Matteo Barberis, Humboldt University Berlin, Germany; Max Planck Institute for Molecular Genetics, Berlin, Germany</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Jonathan Christopher Fuller, Heidelberg Institute for Theoretical Studies, Germany; Xavier De La Cruz Montserrat, Vall d&#x02019;Hebron Research Institute, Spain</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Jose M. G. Izarzugaza and Alfonso Valencia, Structural Computational Biology Group, Structural Biology and BioComputing Programme, Spanish National Cancer Research Centre, C/Melchor Fernandez Almagro, 3, 28029 Madrid, Spain. e-mail: <email>jmgonzalez&#x00040;cnio.es</email>; <email>avalencia&#x00040;cnio.es</email></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Frontiers in Systems Biology, a specialty of Frontiers in Physiology.</p></fn>
</author-notes>
<pub-date pub-type="epreprint">
<day>05</day>
<month>06</month>
<year>2012</year>
</pub-date>
<pub-date pub-type="epub">
<day>22</day>
<month>08</month>
<year>2012</year>
</pub-date>
<pub-date pub-type="collection">
<year>2012</year>
</pub-date>
<volume>3</volume>
<elocation-id>323</elocation-id>
<history>
<date date-type="received">
<day>23</day>
<month>05</month>
<year>2012</year>
</date>
<date date-type="accepted">
<day>23</day>
<month>07</month>
<year>2012</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2012 Izarzugaza, Krallinger and Valencia.</copyright-statement>
<copyright-year>2012</copyright-year>
<license license-type="open-access" xlink:href="http://www.frontiersin.org/licenseagreement"><p>This is an open-access article distributed under the terms of the <uri xlink:href="http://creativecommons.org/licenses/by/3.0/">Creative Commons Attribution License</uri>, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.</p></license>
</permissions>
<abstract>
<p>Protein kinases play a crucial role in a plethora of significant physiological functions and a number of mutations in this superfamily have been reported in the literature to disrupt protein structure and/or function. Computational and experimental research aims to discover the mechanistic connection between mutations in protein kinases and disease with the final aim of predicting the consequences of mutations on protein function and the subsequent phenotypic alterations. In this article, we will review the possibilities and limitations of current computational methods for the prediction of the pathogenicity of mutations in the protein kinase superfamily. In particular we will focus on the problem of benchmarking the predictions with independent gold standard datasets. We will propose a pipeline for the curation of mutations automatically extracted from the literature. Since many of these mutations are not included in the databases that are commonly used to train the computational methods to predict the pathogenicity of protein kinase mutations we propose them to build a valuable gold standard dataset in the benchmarking of a number of these predictors. Finally, we will discuss how text mining approaches constitute a powerful tool for the interpretation of the consequences of mutations in the context of disease genome analysis with particular focus on cancer.</p>
</abstract>
<kwd-group>
<kwd>disease</kwd>
<kwd>kinase</kwd>
<kwd>literature mining</kwd>
<kwd>mutation</kwd>
<kwd>pathogenicity prediction</kwd>
<kwd>protein kinase</kwd>
<kwd>text mining</kwd>
<kwd>variation</kwd>
</kwd-group>
<counts>
<fig-count count="1"/>
<table-count count="3"/>
<equation-count count="0"/>
<ref-count count="84"/>
<page-count count="10"/>
<word-count count="9209"/>
</counts>
</article-meta>
</front>
<body>
<sec>
<title>The Human Kinome</title>
<p>Protein kinases are a family of enzymes that catalyze the transfer of a phosphate from ATP to a serine, threonine, or tyrosine hydroxyl group in the target protein. Phosphorylation often implies enzyme activation or inhibition, alteration of interaction surfaces, and conformational changes, among the most common consequences. It is due to the importance of the processes regulated, that protein kinases generally do not act alone but rather, they form part of a finely tuned signaling cascade that is strictly controlled spatiotemporally. Therefore, protein kinases are metaphorically referred to as the metabolic switches of the cell.</p>
<p>Protein kinases are one of the most ubiquitous families of signaling molecules in the human cell. The total number of genes encoding kinases has been a matter of discussion in the last decade and, for instance, in Wang (<xref ref-type="bibr" rid="B77">1998</xref>) estimated between 1000 and 2000 different human kinase genes. With the completion of the human genome, the current estimate is that 518 genes encode protein kinases, corresponding to more than 2% of the total number of genes in the human genome (Manning et al., <xref ref-type="bibr" rid="B52">2002b</xref>).</p>
<p>All members of the superfamily share a characteristic domain &#x02013; the protein kinase domain &#x02013; that confers them the ability to phosphorylate other proteins. Empirical studies suggest that the residues conforming the ATP binding site tend to be conserved and that phosphotransfer is carried out by a shared set of amino acids (Schee and Bourne, <xref ref-type="bibr" rid="B65">2005</xref>; Knight et al., <xref ref-type="bibr" rid="B38">2007</xref>; L&#x000F3;pez et al., <xref ref-type="bibr" rid="B49">2007</xref>; Kinnings and Jackson, <xref ref-type="bibr" rid="B37">2009</xref>; Tanramluk et al., <xref ref-type="bibr" rid="B71">2009</xref>).</p>
<p>In spite of these similarities, experiments in yeast models (Manning et al., <xref ref-type="bibr" rid="B51">2002a</xref>; Ubersax et al., <xref ref-type="bibr" rid="B74">2003</xref>) suggest that although protein kinases individually present a remarkable substrate specificity, the superfamily as a whole is very promiscuous, phosphorylating a wide range of protein substrates. This observation, may be attributed to the different domain architectures present in the protein kinase superfamily. In addition to the aforementioned protein kinase domain committed to the general function of phosphorylation, a number of modular domains are combined to, for example, confer substrate specificity, to tightly control the activity of the enzyme or anchor the kinase to the membrane (Finn et al., <xref ref-type="bibr" rid="B23">2010</xref>).</p>
<p>These differences in terms of functionality and domain architecture can be used to classify members of the protein kinase superfamily into different categories. Indeed, there are several different classifications of kinases from the main model organisms: yeast (Hunter and Plowman, <xref ref-type="bibr" rid="B30">1997</xref>), worm (Manning, <xref ref-type="bibr" rid="B50">2005</xref>), fruit fly (Manning et al., <xref ref-type="bibr" rid="B51">2002a</xref>), and mouse (Caenepeel et al., <xref ref-type="bibr" rid="B10">2004</xref>). The reference classification in humans is KinBase (Manning et al., <xref ref-type="bibr" rid="B52">2002b</xref>; Miranda-Saavedra and Barton, <xref ref-type="bibr" rid="B54">2007</xref>), which has also been incorporated into UniProt (Bairoch et al., <xref ref-type="bibr" rid="B3">2005</xref>), albeit with minor modifications.</p>
</sec>
<sec>
<title>Mutations in the Protein Kinase Superfamily</title>
<p>Due to their important regulatory function, a number of mutations in protein kinases have been associated with different human diseases (Shchemelinin et al., <xref ref-type="bibr" rid="B66">2006</xref>), including cancer. For example, Greenman et al. (<xref ref-type="bibr" rid="B27">2007</xref>) carried out the first large scale study of the variation of 518 human kinases in 210 samples of cancer tissues and cell-lines. Moreover, other high-throughput studies (Sj&#x000F6;blom et al., <xref ref-type="bibr" rid="B68">2006</xref>; Wood et al., <xref ref-type="bibr" rid="B80">2007</xref>) also yielding interesting information about the role that variation of human protein kinases plays in cancer. For a detailed review, refer to Baudot et al. (<xref ref-type="bibr" rid="B6">2009</xref>).</p>
<p>The results from these high-throughput resequencing projects is often available through research publications. However, in order to make the information more easily accessible, several efforts are devoted to compile, store, annotate, and characterize mutations, including mutations in the protein kinase superfamily. Some examples are UniProt (Yip et al., <xref ref-type="bibr" rid="B81">2008</xref>), COSMIC (Bamford et al., <xref ref-type="bibr" rid="B5">2004</xref>), SAAPdb (Hurst et al., <xref ref-type="bibr" rid="B31">2009</xref>), MoKCa (Richardson et al., <xref ref-type="bibr" rid="B64">2009</xref>), and KinMutBase (Ortutay et al., <xref ref-type="bibr" rid="B58">2005</xref>). Together they constitute a powerful resource to understand disease association and the functional/structural properties of the mutations that affect human protein kinases.</p>
<p>Unfortunately, database curators are not able to store and annotate the vast amount of information provided by large-scale variation studies at the same pace it is generated. Mainly, because the process generally involves the manual inspection and curation of specific variation studies, which requires considerable resources. As a consequence, although growing in number, the mutations totally characterized, and well-understood only represent a small fraction of all the human variome.</p>
</sec>
<sec>
<title>Methods to Predict Pathogenic Mutations</title>
<p>In the section Mutations in the protein kinase superfamily we mentioned that high-throughput resequencing screenings represent a powerful set of techniques to discover large numbers of mutations. Of these, only a small fraction are causally implicated in disease onset and therefore, separating the wheat from the chaff is still a major challenge (Baudot et al., <xref ref-type="bibr" rid="B6">2009</xref>). For a small subset of the new mutations discovered, experimental information is available regarding the relationship between the mutation and disease, and for a smaller number of cases the underlying biochemical mechanism is known. Little information is available for the remaining mutations. The requirement of a lot of investment, both in terms of time and money, means that it is not feasible to experimentally test the association of all these mutations to disease, and to characterize their functional effects. Nevertheless, this problem is very amenable to <italic>in silico</italic> predictors.</p>
<p>Cline and Karchin (<xref ref-type="bibr" rid="B15">2011</xref>) wisely summarized the two different approaches as follows: &#x0201C;<italic>A bench biologist interested in whether a mutation of interest impacts the transcription of a gene might perform site-directed mutagenesis on genomic DNA, transfect mutated DNA into cell culture, and use readouts of the gene&#x02019;s transcriptional activity to measure changes with respect to wild type. In contrast, a bioinformatics approach typically involves computational analysis of the DNA sequence surrounding the mutation, possibly supplemented with information from published bench experiments</italic>.&#x0201D;</p>
<p>This is just one example of the very different methods available to predict <italic>in silico</italic> the probability of a newly discovered mutation being implicated in disease. Different approaches have been developed in the last decade (Table <xref ref-type="table" rid="T1">1</xref>) and several detailed reviews on this subject have been published (Baudot et al., <xref ref-type="bibr" rid="B6">2009</xref>; Karchin, <xref ref-type="bibr" rid="B35">2009</xref>; Cline and Karchin, <xref ref-type="bibr" rid="B15">2011</xref>).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p><bold>Summary of methods to predict the pathogenicity of mutations</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Method</th>
<th align="left">Main features</th>
<th align="left">Further information</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">SIFT (Ng and Henikoff, <xref ref-type="bibr" rid="B57">2001</xref>)</td>
<td align="left">Threshold-based, conservation</td>
<td align="left"><monospace><uri xlink:href="http://sift.jcvi.org">http://sift.jcvi.org</uri></monospace></td>
</tr>
<tr>
<td align="left">PMUT (Ferrer-Costa et al., <xref ref-type="bibr" rid="B21">2005</xref>)</td>
<td align="left">Neural Network, sequence-, and structure-based features</td>
<td align="left"><monospace><uri xlink:href="http://mmb.pcb.ub.es/PMut">http://mmb.pcb.ub.es/PMut</uri></monospace></td>
</tr>
<tr>
<td align="left">SNPs3D (Yue et al., <xref ref-type="bibr" rid="B84">2006</xref>)</td>
<td align="left">Support Vector Machine, structure-based features</td>
<td align="left"><monospace><uri xlink:href="http://www.snps3d.org">http://www.snps3d.org</uri></monospace></td>
</tr>
<tr>
<td align="left">PANTHER (Thomas et al., <xref ref-type="bibr" rid="B72">2003</xref>)</td>
<td align="left">Threshold-based, conservation (PSEC)</td>
<td align="left"><monospace><uri xlink:href="http://www.pantherdb.org/tools/csnpScoreForm.jsp">http://www.pantherdb.org/tools/csnpScoreForm.jsp</uri></monospace></td>
</tr>
<tr>
<td align="left">Pfam LogRE (Clifford et al., <xref ref-type="bibr" rid="B14">2004</xref>)</td>
<td align="left">Threshold-based, probability of a PFAM domain to be pathogenic using a log-odds ratio</td>
<td align="left"/>
</tr>
<tr>
<td align="left">LS-SNP (Karchin, <xref ref-type="bibr" rid="B35">2009</xref>)</td>
<td align="left">Support Vector Machine, sequence-, and structure-based features</td>
<td align="left"><monospace><uri xlink:href="http://ls-snp.icm.jhu.edu/ls-snp-pdb">http://ls-snp.icm.jhu.edu/ls-snp-pdb</uri></monospace></td>
</tr>
<tr>
<td align="left">CanPredict (Kaminker et al., <xref ref-type="bibr" rid="B33">2007a</xref>)</td>
<td align="left">Combines SIFT, Pfam LogRE, and Gene Ontology terms in a single prediction</td>
<td align="left"><monospace><uri xlink:href="http://research-public.gene.com/Research/genentech/canpredict">http://research-public.gene.com/Research/genentech/canpredict</uri></monospace></td>
</tr>
<tr>
<td align="left">SNAP (Bromberg and Rost, <xref ref-type="bibr" rid="B9">2007</xref>)</td>
<td align="left">Neural Network, sequence-, and structure-based features</td>
<td align="left"><monospace><uri xlink:href="http://cubic.bioc.columbia.edu/services/snap">http://cubic.bioc.columbia.edu/services/snap</uri></monospace></td>
</tr>
<tr>
<td align="left">Torkamani (Torkamani and Schork, <xref ref-type="bibr" rid="B73">2007</xref>)</td>
<td align="left">Support Vector Machine, sequence-, and structure-based features, kinase-specific</td>
<td align="left"/>
</tr>
<tr>
<td align="left">MutaGeneSys (Stoyanovich and Pe&#x02019;er, <xref ref-type="bibr" rid="B70">2008</xref>)</td>
<td align="left">Whole-genome marker correlation dataset to identify association to causal SNPs in OMIM</td>
<td align="left"><monospace><uri xlink:href="http://www.cs.columbia.edu/&#x0007E;jds1/MutaGeneSys">http://www.cs.columbia.edu/&#x0007E;jds1/MutaGeneSys</uri></monospace></td>
</tr>
<tr>
<td align="left">stSNP (Uzun et al., <xref ref-type="bibr" rid="B75">2007</xref>)</td>
<td align="left">Integrates non-synonymous SNPs from dbSNP, structural models from Modeler and KEGG pathways. Comparative native/mutant analysis</td>
<td align="left"><monospace><uri xlink:href="http://ilyinlab.org/StSNP">http://ilyinlab.org/StSNP</uri></monospace></td>
</tr>
<tr>
<td align="left">F-SNP (Lee and Shatkay, <xref ref-type="bibr" rid="B47">2008</xref>)</td>
<td align="left">Metaserver, combines PolyPhen, SNPeffect2.0, SNPs3D, LS-SNP</td>
<td align="left"><monospace><uri xlink:href="http://compbio.cs.queensu.ca/F-SNP">http://compbio.cs.queensu.ca/F-SNP</uri></monospace></td>
</tr>
<tr>
<td align="left">SNP &#x00026; GO (Calabrese et al., <xref ref-type="bibr" rid="B11">2009</xref>)</td>
<td align="left">Support Vector Machine, several sequence-derived features, and information from Gene Ontology terms</td>
<td align="left"><monospace><uri xlink:href="http://snps-and-go.biocomp.unibo.it/snps-and-go/">http://snps-and-go.biocomp.unibo.it/snps-and-go/</uri></monospace></td>
</tr>
<tr>
<td align="left">PolyPhen-2 (Adzhubei et al., <xref ref-type="bibr" rid="B1">2010</xref>)</td>
<td align="left">Bayesian classifier, sequence-, and structure-based features</td>
<td align="left"><monospace><uri xlink:href="http://genetics.bwh.harvard.edu/pph2">http://genetics.bwh.harvard.edu/pph2</uri></monospace></td>
</tr>
<tr>
<td align="left">MuD (Wainreb et al., <xref ref-type="bibr" rid="B76">2010</xref>)</td>
<td align="left">Random forest, sequence-, and structure-based features</td>
<td align="left"><monospace><uri xlink:href="http://mud.tau.ac.il">http://mud.tau.ac.il</uri></monospace></td>
</tr>
<tr>
<td align="left">CHASM (Wong et al., <xref ref-type="bibr" rid="B79">2011</xref>)</td>
<td align="left">Random forest, sequence-based features</td>
<td align="left"><monospace><uri xlink:href="http://wiki.chasmsoftware.org/index.php">http://wiki.chasmsoftware.org/index.php</uri></monospace></td>
</tr>
<tr>
<td align="left">Mutation Assessor (Reva et al., <xref ref-type="bibr" rid="B63">2011</xref>)</td>
<td align="left">Threshold-based, differential evolutionary conservation in subfamilies</td>
<td align="left"><monospace><uri xlink:href="http://mutationassessor.org">http://mutationassessor.org</uri></monospace></td>
</tr>
<tr>
<td align="left">Condel (Gonz&#x000E1;lez-P&#x000E9;rez and L&#x000F3;pez-Bigas, <xref ref-type="bibr" rid="B26">2011</xref>)</td>
<td align="left">Metaserver, combines the output of other predictors</td>
<td align="left"><monospace><uri xlink:href="http://bg.upf.edu/condel/">http://bg.upf.edu/condel/</uri></monospace></td>
</tr>
<tr>
<td align="left">wKinMut (Izarzugaza et al., <xref ref-type="bibr" rid="B32">2012</xref>, submitted)</td>
<td align="left">Framework for the analysis of kinase mutations. Integrates annotations, predictions, and information from the literature</td>
<td align="left"><monospace><uri xlink:href="http://wkinmut.bioinfo.cnio.es">http://wkinmut.bioinfo.cnio.es</uri></monospace></td>
</tr>
</tbody>
</table>
</table-wrap>
<p>These methodologies can be classified according to their underlying principles: Some methods make use of several features to identify relevant positions in a given protein, and hence, rules are derived to predict the pathogenicity of mutations. Another group of implementations assumes that evolutionarily conserved protein residues are important for protein structure, folding, and function, whereby mutations in these residues are considered deleterious (Ng and Henikoff, <xref ref-type="bibr" rid="B57">2001</xref>). Variations on this principle lead to methods that predict deleterious mutations by assessing the changes in evolutionarily conserved PFAM motifs (Clifford et al., <xref ref-type="bibr" rid="B14">2004</xref>). Furthermore, a group of methodologies use protein structures to characterize substitutions that significantly destabilize the folded state. A growing number of systems integrate prior knowledge in the form of both sequence-based and structure-based features from a set of mutations (for which their characterization as pathogenic or neutral exists) to train an automatic machine learning system. Once trained, the system can infer the pathogenicity of new mutations automatically. Different machine learning methods can be implemented depending on their individual needs. Among them, probably the most popular ones are: rule-based systems (Wang and Moult, <xref ref-type="bibr" rid="B78">2001</xref>; Ramensky et al., <xref ref-type="bibr" rid="B60">2002</xref>; Reva et al., <xref ref-type="bibr" rid="B63">2011</xref>), decision trees (Krishnan and Westhead, <xref ref-type="bibr" rid="B44">2003</xref>), random forests (Kaminker et al., <xref ref-type="bibr" rid="B34">2007b</xref>; Wainreb et al., <xref ref-type="bibr" rid="B76">2010</xref>), neural networks (Ferrer-Costa et al., <xref ref-type="bibr" rid="B22">2002</xref>; Bromberg and Rost, <xref ref-type="bibr" rid="B9">2007</xref>), Bayesian methods (Adzhubei et al., <xref ref-type="bibr" rid="B1">2010</xref>), and SVMs (Karchin et al., <xref ref-type="bibr" rid="B36">2005</xref>; Yue et al., <xref ref-type="bibr" rid="B83">2005</xref>; Torkamani and Schork, <xref ref-type="bibr" rid="B73">2007</xref>; Calabrese et al., <xref ref-type="bibr" rid="B11">2009</xref>; Wainreb et al., <xref ref-type="bibr" rid="B76">2010</xref>). In addition, some meta approaches have been implemented recently (Lee and Shatkay, <xref ref-type="bibr" rid="B47">2008</xref>), for instance, Condel (Gonz&#x000E1;lez-P&#x000E9;rez and L&#x000F3;pez-Bigas, <xref ref-type="bibr" rid="B26">2011</xref>) integrates five of the most widely employed computational tools for sorting missense single nucleotide variations.</p>
<p>Methods also differ in the nature of the protein properties used to determine the pathogenicity of new mutations. Some of the predictors require sequence-oriented features that are easily applicable to any polymorphism. Recurrent examples of this category are: amino acid type, sequence conservation, domain type, functional annotations, post-translational modifications, and so on. A second set of predictors calculate features that require a protein structure. Common examples to illustrate these are: secondary structure, solvent accessibility, flexibility, etc. The major drawback of these methodologies is that although they may increase the accuracy, the need for either an experimentally solved or a precisely modeled protein structure implies a loss of coverage. The number of features and their combinations is infinite. Moreover, features can also either be general or apply only to a defined subset of proteins, as is the membership to a kinase group (Torkamani and Schork, <xref ref-type="bibr" rid="B73">2007</xref>; Izarzugaza et al., <xref ref-type="bibr" rid="B32">2012</xref>).</p>
</sec>
<sec>
<title>Benchmarking Prediction Methods</title>
<p>In the previous section we discussed the differences between the various methods, both in terms of implementation and prediction features. Equally important are the differences found in the composition of the datasets used to train the methods. This is particularly relevant in the case of machine learning approaches. Machine learning approaches are developed in two independent consecutive steps: during the initial development phase, the developers aim to optimize the combination of features, internal parameters, and prediction algorithms to obtain a trained classifier. In a later phase, blind tests are conducted to evaluate the performance simulating a more realistic scenario. Consequently, three separate datasets are needed: (i) a training dataset to allow the classifier to learn, (ii) a validation dataset to optimize the selection of parameters, and (iii) an evaluation dataset to conduct blind tests to assess the expected performance of the classifier.</p>
<p>Consequently, the datasets used highly influence the overall performance of the prediction and, if not pondered cautiously might become a source of evaluation errors. Probably, the most common of them being overtraining as a result from the evaluation of the methodologies with mutations that have also been considered in the training dataset. In other words, if a predictor were evaluated using a test set whose correct answers the method had previously been provided with, this may yield unfair over-estimation of the prediction capability. An extension of this problem, especially if the features considered predict at the protein level, is that mutations occurring in the same protein or closely related homologs should not span two different datasets.</p>
<p>The selection of a benchmark dataset that is fair and does not lead to artifacts is not a trivial task (Care et al., <xref ref-type="bibr" rid="B13">2007</xref>) and clean datasets that were not used in the development of any of the methods are required. Following a similar approach to those in the detection of bio-entities from the literature (BioCreative), protein structure (CASP), and protein interaction prediction (CAPRI), a successful recent example is CAGI<xref ref-type="fn" rid="fn1"><sup>1</sup></xref>. In summary, CAGI is intended to assess a battery of computational methods for predicting the phenotypic impacts of genome variation. Participants are provided a number of different sets of genetic variants and are expected to make predictions of resulting, molecular, cellular, or organismal phenotype. These predictions are later on evaluated by independent assessors against experimental characterizations.</p>
<p>Although CAGI constitutes an undoubtedly powerful tool to provide insights on the performance of state-of-the-art methodologies, the major drawback is that provided datasets are gathered from very specialized projects, and consequently are seldom universally applicably to all methodologies, which consequently, limits the benchmark. An example of the previous would be the intrinsic limitation to predict mutations outside the protein kinase superfamily for kinase-specific methodologies.</p>
<p>Complementary to the CAGI experiment, current text mining methodologies enable the generation of clean sets of experimentally validated mutation mentions from the literature. Those mutations that were not recorded in the databases used to provide the training and evaluation datasets are of special interest. Here we propose a pipeline for the curation of mutations automatically extracted from the literature and their use as a gold standard in the benchmarking of pathogenicity predictors. We will describe this approach thoroughly in the following sections.</p>
</sec>
<sec>
<title>Mining Kinase Mutations from the Literature</title>
<p>Previously, we discussed how the efforts of database curators to store and annotate mutations (Table <xref ref-type="table" rid="T2">2</xref>) can hardly keep the pace of the vast amount of information generated by current large-scale variation studies. To bridge this growing gap, automatic extraction of entities and their relationships from the existing literature can be applied. This includes text mining techniques such as regular expressions, pattern recognition, and natural language processing, among others. Indeed, these approaches have been successfully applied to other fields of research, for instance for the automatic extraction of protein&#x02013;protein interactions (Blaschke and Valencia, <xref ref-type="bibr" rid="B8">2001</xref>; Krallinger et al., <xref ref-type="bibr" rid="B42">2008c</xref>) and in the annotation of genes and proteins (Krallinger et al., <xref ref-type="bibr" rid="B40">2008a</xref>, <xref ref-type="bibr" rid="B43">2010</xref>). Despite the success of these methods, it must be born in mind that this technology does not aim to replace manual curation and validation. Rather, text mining approaches are better understood as systematic tools to assist the efforts of human curators by helping them to find information, prioritize documents, and highlight potentially relevant items (Krallinger et al., <xref ref-type="bibr" rid="B40">2008a</xref>,<xref ref-type="bibr" rid="B41">b</xref>; Leitner et al., <xref ref-type="bibr" rid="B48">2010</xref>).</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p><bold>Summary of resources providing information about kinases and mutations</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Method</th>
<th align="left">Description</th>
<th align="left">Further information</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">UniProt (Consortium, <xref ref-type="bibr" rid="B19">2007</xref>)</td>
<td align="left">General information about proteins, including human protein kinases</td>
<td align="left"><monospace><uri xlink:href="http://www.uniprot.org/">http://www.uniprot.org/</uri></monospace></td>
</tr>
<tr>
<td align="left">PDB (Berman et al., <xref ref-type="bibr" rid="B7">2000</xref>)</td>
<td align="left">Catalog of protein structures, protein kinases widely represented</td>
<td align="left"><monospace><uri xlink:href="http://www.rcsb.org/">http://www.rcsb.org/</uri></monospace></td>
</tr>
<tr>
<td align="left">PDBsum (Laskowski et al., <xref ref-type="bibr" rid="B45">2005</xref>)</td>
<td align="left">Annotation on protein structures</td>
<td align="left"><monospace><uri xlink:href="http://www.ebi.ac.uk/pdbsum">http://www.ebi.ac.uk/pdbsum</uri></monospace></td>
</tr>
<tr>
<td align="left">KinBase (Manning et al., <xref ref-type="bibr" rid="B52">2002b</xref>; Miranda-Saavedra and Barton, <xref ref-type="bibr" rid="B54">2007</xref>)</td>
<td align="left">Hierarchical classification of protein kinases</td>
<td align="left"><monospace><uri xlink:href="http://kinase.com/kinbase/">http://kinase.com/kinbase/</uri></monospace></td>
</tr>
<tr>
<td align="left">SwissVar (Yip et al., <xref ref-type="bibr" rid="B82">2007</xref>)</td>
<td align="left">Detailed information about mutations present in UniProt</td>
<td align="left"><monospace><uri xlink:href="http://swissvar.expasy.org/">http://swissvar.expasy.org/</uri></monospace></td>
</tr>
<tr>
<td align="left">COSMIC (Bamford et al., <xref ref-type="bibr" rid="B5">2004</xref>)</td>
<td align="left">Catalog of somatic mutations in cancer</td>
<td align="left"><monospace><uri xlink:href="http://www.sanger.ac.uk/perl/genetics/CGP/cosmic">http://www.sanger.ac.uk/perl/genetics/CGP/cosmic</uri></monospace></td>
</tr>
<tr>
<td align="left">Ensembl (Flicek et al., <xref ref-type="bibr" rid="B24">2011</xref>)</td>
<td align="left">Infrastructure for the integrated annotation on chordate and selected eukaryotic genomes</td>
<td align="left"><monospace><uri xlink:href="http://www.ensembl.org">http://www.ensembl.org</uri></monospace></td>
</tr>
<tr>
<td align="left">dbSNP (Sherry et al., <xref ref-type="bibr" rid="B67">2001</xref>)</td>
<td align="left">Annotated catalog of SNPs</td>
<td align="left"><monospace><uri xlink:href="http://www.ncbi.nlm.nih.gov/projects/SNP">http://www.ncbi.nlm.nih.gov/projects/SNP</uri></monospace></td>
</tr>
<tr>
<td align="left">HapMap (Consortium et al., <xref ref-type="bibr" rid="B17">2010b</xref>)</td>
<td align="left">Catalog of common genetic variants in the human genome</td>
<td align="left"><monospace><uri xlink:href="http://www.hapmap.org">www.hapmap.org</uri></monospace></td>
</tr>
<tr>
<td align="left">1000 Genomes (Consortium et al., <xref ref-type="bibr" rid="B18">2010c</xref>)</td>
<td align="left">Deep catalog of human variations derived from the next-generation sequencing of 1000 people</td>
<td align="left"><monospace><uri xlink:href="http://www.1000genomes.org/">http://www.1000genomes.org/</uri></monospace></td>
</tr>
<tr>
<td align="left">TCGA (Network, <xref ref-type="bibr" rid="B56">2011</xref>)</td>
<td align="left">The Cancer Genome Atlas is a collection of genetic variations found in 20 different cancers</td>
<td align="left"><monospace><uri xlink:href="http://cancergenome.nih.gov/">http://cancergenome.nih.gov/</uri></monospace></td>
</tr>
<tr>
<td align="left">ICGC (Consortium et al., <xref ref-type="bibr" rid="B16">2010a</xref>)</td>
<td align="left">The International Cancer Genome Consortium project aims to a comprehensive description of genomic, transcriptomic, and epigenomic changes in 50 tumor types and sub-types</td>
<td align="left"><monospace><uri xlink:href="http://www.icgc.org">http://www.icgc.org</uri></monospace></td>
</tr>
<tr>
<td align="left">OMIM (Amberger et al., <xref ref-type="bibr" rid="B2">2011</xref>)</td>
<td align="left">Catalog of Mendelian mutations known to cause disease</td>
<td align="left"><monospace><uri xlink:href="http://www.ncbi.nlm.nih.gov/omim">http://www.ncbi.nlm.nih.gov/omim</uri></monospace></td>
</tr>
<tr>
<td align="left">SAAPdb (Hurst et al., <xref ref-type="bibr" rid="B31">2009</xref>)</td>
<td align="left">Calculation of the structural consequences of mutations</td>
<td align="left"><monospace><uri xlink:href="http://www.bioinf.org.uk/saap/db/">http://www.bioinf.org.uk/saap/db/</uri></monospace></td>
</tr>
<tr>
<td align="left">SNPeffect 2.0 (Reumers et al., <xref ref-type="bibr" rid="B62">2006</xref>)</td>
<td align="left">A database mapping molecular phenotypic effects of human non-synonymous coding SNPs</td>
<td align="left"><monospace><uri xlink:href="http://snpeffect.switchlab.org">http://snpeffect.switchlab.org</uri></monospace></td>
</tr>
<tr>
<td align="left">ModBase (Pieper et al., <xref ref-type="bibr" rid="B59">2006</xref>)</td>
<td align="left">Structural models of mutant proteins</td>
<td align="left"><monospace><uri xlink:href="http://salilab.org/modbase">http://salilab.org/modbase</uri></monospace></td>
</tr>
<tr>
<td align="left">TopoSNP (Stitziel et al., <xref ref-type="bibr" rid="B69">2004</xref>)</td>
<td align="left">TopoSNP: a topographic database of non-synonymous single nucleotide polymorphisms with and without known disease association</td>
<td align="left"><monospace><uri xlink:href="http://gila.bioengr.uic.edu/snp/toposnp/">http://gila.bioengr.uic.edu/snp/toposnp/</uri></monospace></td>
</tr>
<tr>
<td align="left">MoKCa (Richardson et al., <xref ref-type="bibr" rid="B64">2009</xref>)</td>
<td align="left">Annotated catalog of cancer-associated mutations in protein kinases</td>
<td align="left"><monospace><uri xlink:href="http://strubiol.icr.ac.uk/extra/mokca/">http://strubiol.icr.ac.uk/extra/mokca/</uri></monospace></td>
</tr>
<tr>
<td align="left">KinMutBase (Ortutay et al., <xref ref-type="bibr" rid="B58">2005</xref>)</td>
<td align="left">Registry of disease-causing mutations in protein kinase domains</td>
<td align="left"><monospace><uri xlink:href="http://bioinf.uta.fi/KinMutBase">http://bioinf.uta.fi/KinMutBase</uri></monospace></td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Here we will use our recently published pipeline for extracting mutation mentions in protein kinases from the literature, SNP2L (Krallinger et al., <xref ref-type="bibr" rid="B39">2009</xref>), as an example of a typical text mining workflow. The pipeline (Figure <xref ref-type="fig" rid="F1">1</xref>) integrates article retrieval, detection of mutations, and proteins in the corresponding article, correct mutation-protein association and, finally, validation of the results. To the best of our knowledge there is currently no pipeline similar to the one presented here. Two main aspects make our pipeline unique. First, our system is specifically designed to extract mutations occurring in the protein kinase superfamily. Second, we perform an additional filtering step to ensure the quality of the extracted mutations as we will disclose in the following sections.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p><bold>SNP2L Pipeline as an example of a typical automatic method to extract mutation mentions from the literature</bold>. The pipeline integrates article retrieval, detection of mutations and proteins in the corresponding article, correct mutation-protein association and, finally, validation of the results.</p></caption>
<graphic xlink:href="fphys-03-00323-g001.tif"/>
</fig>
<sec>
<title>Article selection (triage): Constructing a text mining corpus</title>
<p>Following a common approach in text mining, we tested SNP2L with two different datasets: One constituted by the whole collection of PubMed abstracts and the other by a collection of either manually or automatically selected full-text articles. In order to construct the <italic>corpus</italic>, full-text articles were automatically downloaded using an in-house retrieval system (Krallinger et al., <xref ref-type="bibr" rid="B40">2008a</xref>) prioritized under three different criteria:</p>
<list list-type="order">
<list-item><p>Relevance of the abstract: information contained in the corresponding abstracts such as the mention of mutations, mention of human kinases, and a combination of keywords (including &#x0201C;human kinase mutation&#x0201D;).</p></list-item>
<list-item><p><italic>A priori</italic> relevance of the full-text articles: extracting all references in PubMed for human kinases contained in multiple databases (e.g., SwissProt, MINT, and IntAct).</p></list-item>
<list-item><p>Relevance of the journal: based on analyzing a fraction of mutation-mentioning abstracts of each journal and prioritizing a set of journals (and thus their articles) to retrieve their full-text articles. This set consisted of the following journals: <italic>American Journal of Human Genetics, European Journal of Human Genetics, Human Genetics, Human Mutation, and Human Molecular Genetics</italic>.</p></list-item>
</list>
<p>Before proceeding to the next step, all articles should be split in sentences using a sentence boundary detection system (Krallinger et al., <xref ref-type="bibr" rid="B40">2008a</xref>).</p>
</sec>
<sec>
<title>Entity recognition: Mutations and protein kinases</title>
<p>The consistent nomenclature used to describe mutations in the literature makes these entities especially amenable to this type of approach and accordingly, a growing number of such methods have been described in the literature over the years. A summary of several of these literature mining tools to extract information on mutations is presented in Table <xref ref-type="table" rid="T3">3</xref>. In the example discussed here, we used MutationFinder (Caporaso et al., <xref ref-type="bibr" rid="B12">2007</xref>) for the initial extraction of single aminoacid substitutions. MutationFinder constitutes a valuable tool to detect the mention of mutations in a given set of manuscripts and it relies on language expressions used to describe mutation events. MutationFinder is very competitive for recall and precision when compared to other strategies (Yip et al., <xref ref-type="bibr" rid="B82">2007</xref>), and it has been evaluated using a manually generated gold standard collection of abstracts.</p>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p><bold>Summary of text mining implementations for mutation extraction</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Method</th>
<th align="left">Main features</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">MEMA (Rebholz-Schuhmann et al., <xref ref-type="bibr" rid="B61">2004</xref>)</td>
<td align="left">Regular expressions, gene and protein mentions, co-mention proximity, OMIM validation</td>
</tr>
<tr>
<td align="left">MuteXt (Horn et al., <xref ref-type="bibr" rid="B29">2004</xref>)</td>
<td align="left">Regular expressions, GPCR and NR mentions detection, co-mention proximity, sequence check</td>
</tr>
<tr>
<td align="left">Yip (Yip et al., <xref ref-type="bibr" rid="B82">2007</xref>)</td>
<td align="left">Regular expressions, protein mentions detection, SwissProt validation, sequence check</td>
</tr>
<tr>
<td align="left">Mutation GraB (Lee et al., <xref ref-type="bibr" rid="B46">2007</xref>)</td>
<td align="left">Regular expressions, protein mentions detection, graph shorted distance, sequence check</td>
</tr>
<tr>
<td align="left">Mutation Miner (Baker and Rene, <xref ref-type="bibr" rid="B4">2006</xref>)</td>
<td align="left">Regular expressions, protein mentions detection, sentence co-mention</td>
</tr>
<tr>
<td align="left">MuGeX (Erdogmus and Sezerman, <xref ref-type="bibr" rid="B20">2007</xref>)</td>
<td align="left">Regular expressions, protein mentions, protein, and DNA mutation disambiguation</td>
</tr>
<tr>
<td align="left">VTag (McDonald et al., <xref ref-type="bibr" rid="B53">2004</xref>)</td>
<td align="left">Machine learning detection of acquired sequence variation mentions detection (mutations, translocations, and deletions)</td>
</tr>
<tr>
<td align="left">OSIRIS (Furlong et al., <xref ref-type="bibr" rid="B25">2008</xref>)</td>
<td align="left">Detection of human gene variations corresponding to SNPs</td>
</tr>
<tr>
<td align="left">MutationFinder (Caporaso et al., <xref ref-type="bibr" rid="B12">2007</xref>)</td>
<td align="left">Regular expressions and patterns, protein mutations mentions detection, complex language expressions</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>After recognizing all the mutations mentioned in the text, we attempted to identify all human protein kinases co-mentioned with them in the same document. Existing systems that try to link mentions of genes and proteins to database identifiers generally rely on approaches that compare the names appearing in the text to gene names or aliases contained in database records. The actual task of determining the exact database record for a gene/protein mention is commonly referred to as gene mention grounding or normalization, and has been evaluated in the second BioCreative community challenge, illustrating that dictionary look-up approaches can obtain competitive results for this purpose (Morgan et al., <xref ref-type="bibr" rid="B55">2008</xref>).</p>
<p>Following this line, we constructed a lexicon specifically for human protein kinases, derived from gene and protein symbols, names, and aliases contained in the UniProt database (see Figure <xref ref-type="fig" rid="F1">1</xref>, Get names, symbols, and aliases). Because this gene/protein lexicon did not capture all representative typographical variants of a given name, we used a rule-based approach and heuristics for generating typographical variants for the kinase lexicon entries. With this respect, the alternative use of hyphens, capitalization (upper-case and capitalized names), and different word order variants were captured. The gene/protein lexicon was filtered to eliminate highly ambiguous names through comparison with a stop word list and by, after an initial look-up step, checking manually potential outlier names that show a very high mention frequency. The extended and pruned human kinase lexicon was then used for the detection of corresponding mentions in our document collections containing mutation mentions. As a given name can correspond to different records (ambiguity), both at the level of human genes as well as in case of genes from different species sharing the same name, we calculated for each article, two different scores reflecting (a) the contextual similarity of the article to the reference (UniProt) protein record and (b) the overall association of the article to human species terms from the total set of tagged species terms. A conceivable alternative would be to simply apply very strict protein-organism co-mention criteria based on relative textual distances, which is rather problematic in case of human proteins were often the organism source is not explicitly stated.</p>
</sec>
<sec>
<title>Mutation-sequence linking</title>
<p>The next step is to link mutation mentions with their corresponding human kinases. This step would be trivial if a single protein was mentioned per article, however, for most of the articles this is not the case and more than one protein is mentioned per article. A reasonable solution would be to check the existence of the amino acid at the specified position for each mutation mention-protein combination. In addition to this basic sequence look-up validation method additional mutation mapping strategies could be implemented. They should consider errors resulting from the wrong detection of the directionality of the extracted mutation mention (using the wild type as mutant residue and <italic>vice versa</italic>) and inconsistencies and alternative sequence counting between the article and the kinase sequence. For example:</p>
<list list-type="simple">
<list-item><p>&#x02013; Sliding window algorithms that look for relative positions of mutations (pattern) rather than exact position co-occurrences. With this approach, mutation mentions would be scanned looking for positions relative to the starting one attending to the distance between all the mutations in the same abstract. The strength of this approach is that it is able to deal with alternative sequence coordinates. There are many examples in the literature: Mutations F175P, R178L, and Y530L in the proto-oncogene tyrosine-protein kinase Src, are mentioned in the considered article (PMID 2108315) as F172P, R175L, and Y527F respectively. Since the probability of finding simple patterns by chance can be high in some trivial cases, it is reasonable to consider only those cases where a minimum number of mutated positions (3 in our example) could be detected.</p></list-item>
<list-item><p>&#x02013; Bidirectional mutation to sequence position mapping. Either the wild type or the mutant residue of an extracted mutation mention might be accepted in the corresponding sequence position.</p></list-item>
<list-item><p>&#x02013; Pro-peptides and mature protein mutation mapping. In order to allow alternative residue counting due to the presence of a signal peptide, a displacement equal to the length of the corresponding signal peptide might be allowed.</p></list-item>
<list-item><p>&#x02013; Methionine cleavage: the mutation mapping might be carried out taking into consideration the possibility of neglecting the N-terminal methionine.</p></list-item>
</list>
</sec>
<sec>
<title>Using the literature to generate a benchmark dataset</title>
<p>The main focus of this article has been the construction of a gold standard dataset to benchmark prediction methods. Following this thread of reasoning, mutations already present in common databases are discarded, while new ones form the benchmark dataset. This procedure will ensure a dataset that enables fair comparison and is less prone to over-estimation of the classifiers&#x02019; performance as we discussed previously in the <italic>Benchmarking prediction methods</italic> section.</p>
<p>In spite of constituting a powerful tool for the extraction of knowledge from the literature, text mining approaches to recover kinase mutations still have some limitations in terms of recall and a number mutations escape detection by even the most accurate state-of-the-art algorithms. Among the challenging aspects in this respect are the detection of mutations that are described in additional materials or contained in tables and figures. This is because they can not easily be converted efficiently to plain text. Another key issue is the appropriate detection of the kinase mentions, which can be referred to through a range of different typographical variations and aliases, of which text mining approaches can only cover some. To this issue one also needs to add the underlying limitations in terms of recall of the mutation extraction process (Caporaso et al., <xref ref-type="bibr" rid="B12">2007</xref>) and inconsistencies of sequence descriptions in reference databases as compared to those examined in scientific articles.</p>
</sec>
<sec>
<title>Using the literature to understand the consequences of mutation</title>
<p>From a parallel perspective, text mining approaches can be used to enhance our understanding of both new and existing mutations. Text mining approaches output mutations extracted from the literature along with all their contextual information. Pointers to the relevant literature are provided, these include: experimental conditions, organism, or population sub-types, information regarding observed phenotypes including association to disease, or in a best case scenario, the underlying biochemical mechanisms.</p>
<p>This information can help to interpret the consequences of mutations and is often complementary to the valuable clues provided by the methods to predict the pathogenicity of mutations. Indeed, the emerging trend in the field is to integrate information from diverse sources (Lee and Shatkay, <xref ref-type="bibr" rid="B47">2008</xref>; Gonz&#x000E1;lez-P&#x000E9;rez and L&#x000F3;pez-Bigas, <xref ref-type="bibr" rid="B26">2011</xref>), as we have done recently with the development of wKinMut<xref ref-type="fn" rid="fn2"><sup>2</sup></xref> to help in the interpretation of mutations in the protein kinase superfamily.</p>
<p>In addition to the predictions of pathogenicity directly from our in-house classifier (Izarzugaza et al., <xref ref-type="bibr" rid="B32">2012</xref>) and the values of the features used in the classification, wKinMut combines information from different external sources to help in the interpretation of the prediction. These include the results from other classifiers focusing on different aspects of mutation pathogenicity (SIFT; Ng and Henikoff, <xref ref-type="bibr" rid="B57">2001</xref>; MutationAssessor; Reva et al., <xref ref-type="bibr" rid="B63">2011</xref>), the representation of the mutation in the context of its three-dimensional structure and records of the mutation in other databases such as SAAPdb (Hurst et al., <xref ref-type="bibr" rid="B31">2009</xref>), UniProt (Yip et al., <xref ref-type="bibr" rid="B82">2007</xref>), COSMIC (Bamford et al., <xref ref-type="bibr" rid="B5">2004</xref>), and KinMutBase (Ortutay et al., <xref ref-type="bibr" rid="B58">2005</xref>). Two text mining resources complement the framework: iHop (Hoffmann and Valencia, <xref ref-type="bibr" rid="B28">2005</xref>) a literature mining system to extract gene&#x02013;gene and protein&#x02013;protein interactions and SNP2L (Krallinger et al., <xref ref-type="bibr" rid="B39">2009</xref>) whose capabilities to detect mutation mentions from the literature have been described thoroughly here.</p>
<p>In summary, wKinMut can be useful to predict the pathogenicity of novel mutations and to interpret the biochemical mechanisms leading to pathogenicity and it can be applied to the interpretation of genomes from cancer patients.</p>
</sec>
</sec>
<sec>
<title>Overview and Summary</title>
<p>Current research aims to discover the mechanistic connection between mutations and disease. We focused on the protein kinase superfamily due to the enormous wealth of mentions in the literature associating different diseases, including cancer, with mutations in members of this superfamily.</p>
<p>In this article we have reviewed the different possibilities and limitations of state-of-the-art computational methods for the prediction of the pathogenicity of mutations and we have discussed the difficulties that arise to benchmark and evaluate the performance of the classifiers. We have proposed our recently published pipeline, SNP2L, for the automatic extraction and curation of mentions in the literature to collect a gold standard dataset that might be used in the benchmarking of the different predictors. Finally, we have introduced wKinMut as an example the integration of text mining with prediction methodologies to help in the interpretation of the consequences of mutations in the context of disease genome analysis with particular focus on cancer. We think that such applications might be of interest in the interpretation of patient genomes in the emerging field of personalized/stratified medicine in, hopefully, a near future.</p>
</sec>
<sec>
<title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Adzhubei</surname> <given-names>I. A.</given-names></name> <name><surname>Schmidt</surname> <given-names>S.</given-names></name> <name><surname>Peshkin</surname> <given-names>L.</given-names></name> <name><surname>Ramensky</surname> <given-names>V. E.</given-names></name> <name><surname>Gerasimova</surname> <given-names>A.</given-names></name> <name><surname>Bork</surname> <given-names>P.</given-names></name> <name><surname>Kondrashov</surname> <given-names>A. S.</given-names></name> <name><surname>Sunyaev</surname> <given-names>S. R.</given-names></name></person-group> (<year>2010</year>). <article-title>A method and server for predicting damaging missense mutations</article-title>. <source>Nat. Methods</source> <volume>7</volume>, <fpage>248</fpage>&#x02013;<lpage>249</lpage>.<pub-id pub-id-type="doi">10.1038/nmeth0410-248</pub-id><pub-id pub-id-type="pmid">20354512</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Amberger</surname> <given-names>J.</given-names></name> <name><surname>Bocchini</surname> <given-names>C.</given-names></name> <name><surname>Hamosh</surname> <given-names>A.</given-names></name></person-group> (<year>2011</year>). <article-title>A new face and new challenges for online mendelian inheritance in man (omim(<sup>&#x000AE;</sup>))</article-title>. <source>Hum. Mutat.</source> <volume>32</volume>, <fpage>564</fpage>&#x02013;<lpage>567</lpage>.<pub-id pub-id-type="doi">10.1002/humu.21466</pub-id><pub-id pub-id-type="pmid">21472891</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bairoch</surname> <given-names>A.</given-names></name> <name><surname>Apweiler</surname> <given-names>R.</given-names></name> <name><surname>Wu</surname> <given-names>C. H.</given-names></name> <name><surname>Barker</surname> <given-names>W. C.</given-names></name> <name><surname>Boeckmann</surname> <given-names>B.</given-names></name> <name><surname>Ferro</surname> <given-names>S.</given-names></name> <name><surname>Gasteiger</surname> <given-names>E.</given-names></name> <name><surname>Huang</surname> <given-names>H.</given-names></name> <name><surname>Lopez</surname> <given-names>R.</given-names></name> <name><surname>Magrane</surname> <given-names>M.</given-names></name> <name><surname>Martin</surname> <given-names>M. J.</given-names></name> <name><surname>Natale</surname> <given-names>D. A.</given-names></name> <name><surname>O&#x02019;Donovan</surname> <given-names>C.</given-names></name> <name><surname>Redaschi</surname> <given-names>N.</given-names></name> <name><surname>Yeh</surname> <given-names>L.-S. L.</given-names></name></person-group> (<year>2005</year>). <article-title>The universal protein resource (UniProt)</article-title>. <source>Nucleic Acids Res.</source> <volume>33</volume>, <fpage>D154</fpage>&#x02013;<lpage>D159</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gki070</pub-id><pub-id pub-id-type="pmid">15608167</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baker</surname> <given-names>C. J. O.</given-names></name> <name><surname>Rene</surname> <given-names>W.</given-names></name></person-group> (<year>2006</year>). <article-title>Mutation mining &#x02013; a prospector&#x02019;s tale</article-title>. <source>J. Inform. Syst. Front.</source> <volume>8</volume>, <fpage>47</fpage>&#x02013;<lpage>57</lpage>.<pub-id pub-id-type="doi">10.1007/s10796-006-6103-2</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bamford</surname> <given-names>S.</given-names></name> <name><surname>Dawson</surname> <given-names>E.</given-names></name> <name><surname>Forbes</surname> <given-names>S.</given-names></name> <name><surname>Clements</surname> <given-names>J.</given-names></name> <name><surname>Pettett</surname> <given-names>R.</given-names></name> <name><surname>Dogan</surname> <given-names>A.</given-names></name> <name><surname>Flanagan</surname> <given-names>A.</given-names></name> <name><surname>Teague</surname> <given-names>J.</given-names></name> <name><surname>Futreal</surname> <given-names>P. A.</given-names></name> <name><surname>Stratton</surname> <given-names>M. R.</given-names></name> <name><surname>Wooster</surname> <given-names>R.</given-names></name></person-group> (<year>2004</year>). <article-title>The cosmic (catalogue of somatic mutations in cancer) database and website</article-title>. <source>Br. J. Cancer</source> <volume>91</volume>, <fpage>355</fpage>&#x02013;<lpage>358</lpage>.<pub-id pub-id-type="pmid">15188009</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baudot</surname> <given-names>A.</given-names></name> <name><surname>Real</surname> <given-names>F.</given-names></name> <name><surname>Izarzugaza</surname> <given-names>J.</given-names></name> <name><surname>Valencia</surname> <given-names>A.</given-names></name></person-group> (<year>2009</year>). <article-title>From cancer genomes to cancer models: bridging the gaps</article-title>. <source>EMBO Rep.</source> <volume>10</volume>, <fpage>359</fpage>&#x02013;<lpage>366</lpage>.<pub-id pub-id-type="doi">10.1038/embor.2009.46</pub-id><pub-id pub-id-type="pmid">19305388</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Berman</surname> <given-names>H. M.</given-names></name> <name><surname>Westbrook</surname> <given-names>J.</given-names></name> <name><surname>Feng</surname> <given-names>Z.</given-names></name> <name><surname>Gilliland</surname> <given-names>G.</given-names></name> <name><surname>Bhat</surname> <given-names>T. N.</given-names></name> <name><surname>Weissig</surname> <given-names>H.</given-names></name> <name><surname>Shindyalov</surname> <given-names>I. N.</given-names></name> <name><surname>Bourne</surname> <given-names>P. E.</given-names></name></person-group> (<year>2000</year>). <article-title>The protein data bank</article-title>. <source>Nucleic Acids Res.</source> <volume>28</volume>, <fpage>235</fpage>&#x02013;<lpage>242</lpage>.<pub-id pub-id-type="doi">10.1093/nar/28.1.235</pub-id><pub-id pub-id-type="pmid">10592235</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Blaschke</surname> <given-names>C.</given-names></name> <name><surname>Valencia</surname> <given-names>A.</given-names></name></person-group> (<year>2001</year>). <article-title>The potential use of suiseki as a protein interaction discovery tool</article-title>. <source>Genome Inform.</source> <volume>12</volume>, <fpage>123</fpage>&#x02013;<lpage>134</lpage>.<pub-id pub-id-type="pmid">11791231</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bromberg</surname> <given-names>Y.</given-names></name> <name><surname>Rost</surname> <given-names>B.</given-names></name></person-group> (<year>2007</year>). <article-title>Snap: predict effect of non-synonymous polymorphisms on function</article-title>. <source>Nucleic Acids Res.</source> <volume>35</volume>, <fpage>3823</fpage>&#x02013;<lpage>3835</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkm238</pub-id><pub-id pub-id-type="pmid">17526529</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Caenepeel</surname> <given-names>S.</given-names></name> <name><surname>Charydczak</surname> <given-names>G.</given-names></name> <name><surname>Sudarsanam</surname> <given-names>S.</given-names></name> <name><surname>Hunter</surname> <given-names>T.</given-names></name> <name><surname>Manning</surname> <given-names>G.</given-names></name></person-group> (<year>2004</year>). <article-title>The mouse kinome: discovery and comparative genomics of all mouse protein kinases</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>101</volume>, <fpage>11707</fpage>&#x02013;<lpage>11712</lpage>.<pub-id pub-id-type="doi">10.1073/pnas.0306880101</pub-id><pub-id pub-id-type="pmid">15289607</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Calabrese</surname> <given-names>R.</given-names></name> <name><surname>Capriotti</surname> <given-names>E.</given-names></name> <name><surname>Fariselli</surname> <given-names>P.</given-names></name> <name><surname>Martelli</surname> <given-names>P. L.</given-names></name> <name><surname>Casadio</surname> <given-names>R.</given-names></name></person-group> (<year>2009</year>). <article-title>Functional annotations improve the predictive score of human disease-related mutations in proteins</article-title>. <source>Hum. Mutat.</source> <volume>30</volume>, <fpage>1237</fpage>&#x02013;<lpage>1244</lpage>.<pub-id pub-id-type="doi">10.1002/humu.21047</pub-id><pub-id pub-id-type="pmid">19514061</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Caporaso</surname> <given-names>J. G.</given-names></name> <name><surname>Baumgartner</surname> <given-names>W. A.</given-names></name> <name><surname>Randolph</surname> <given-names>D. A.</given-names></name> <name><surname>Cohen</surname> <given-names>K. B.</given-names></name> <name><surname>Hunter</surname> <given-names>L.</given-names></name></person-group> (<year>2007</year>). <article-title>MutationFinder: a high-performance system for extracting point mutation mentions from text</article-title>. <source>Bioinformatics</source> <volume>23</volume>, <fpage>1862</fpage>&#x02013;<lpage>1865</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btm235</pub-id><pub-id pub-id-type="pmid">17495998</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Care</surname> <given-names>M. A.</given-names></name> <name><surname>Needham</surname> <given-names>C. J.</given-names></name> <name><surname>Bulpitt</surname> <given-names>A. J.</given-names></name> <name><surname>Westhead</surname> <given-names>D. R.</given-names></name></person-group> (<year>2007</year>). <article-title>Deleterious SNP prediction: be mindful of your training data!</article-title> <source>Bioinformatics</source> <volume>23</volume>, <fpage>664</fpage>&#x02013;<lpage>672</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btl649</pub-id><pub-id pub-id-type="pmid">17234639</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clifford</surname> <given-names>R. J.</given-names></name> <name><surname>Edmonson</surname> <given-names>M. N.</given-names></name> <name><surname>Nguyen</surname> <given-names>C.</given-names></name> <name><surname>Buetow</surname> <given-names>K. H.</given-names></name></person-group> (<year>2004</year>). <article-title>Large-scale analysis of non-synonymous coding region single nucleotide polymorphisms</article-title>. <source>Bioinformatics</source> <volume>20</volume>, <fpage>1006</fpage>&#x02013;<lpage>1014</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/bth029</pub-id><pub-id pub-id-type="pmid">14751981</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cline</surname> <given-names>M.</given-names></name> <name><surname>Karchin</surname> <given-names>R.</given-names></name></person-group> (<year>2011</year>). <article-title>Using bioinformatics to predict the functional impact of SNVs</article-title>. <source>Bioinformatics</source> <volume>27</volume>, <fpage>441</fpage>&#x02013;<lpage>448</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btq695</pub-id><pub-id pub-id-type="pmid">21159622</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Consortium</surname> <given-names>I. C. G.</given-names></name> <name><surname>Hudson</surname> <given-names>T. J.</given-names></name> <name><surname>Anderson</surname> <given-names>W.</given-names></name> <name><surname>Artez</surname> <given-names>A.</given-names></name> <name><surname>Barker</surname> <given-names>A. D.</given-names></name> <name><surname>Bell</surname> <given-names>C.</given-names></name> <name><surname>Bernab&#x000E9;</surname> <given-names>R. R.</given-names></name> <name><surname>Bhan</surname> <given-names>M. K.</given-names></name> <name><surname>Calvo</surname> <given-names>F.</given-names></name> <name><surname>Eerola</surname> <given-names>I.</given-names></name> <name><surname>Gerhard</surname> <given-names>D. S.</given-names></name> <name><surname>Guttmacher</surname> <given-names>A.</given-names></name> <name><surname>Guyer</surname> <given-names>M.</given-names></name> <name><surname>Hemsley</surname> <given-names>F. M.</given-names></name> <name><surname>Jennings</surname> <given-names>J. L.</given-names></name> <name><surname>Kerr</surname> <given-names>D.</given-names></name> <name><surname>Klatt</surname> <given-names>P.</given-names></name> <name><surname>Kolar</surname> <given-names>P.</given-names></name> <name><surname>Kusada</surname> <given-names>J.</given-names></name> <name><surname>Lane</surname> <given-names>D. P.</given-names></name> <name><surname>Laplace</surname> <given-names>F.</given-names></name> <name><surname>Youyong</surname> <given-names>L.</given-names></name> <name><surname>Nettekoven</surname> <given-names>G.</given-names></name> <name><surname>Ozenberger</surname> <given-names>B.</given-names></name> <name><surname>Peterson</surname> <given-names>J.</given-names></name> <name><surname>Rao</surname> <given-names>T. S.</given-names></name> <name><surname>Remacle</surname> <given-names>J.</given-names></name> <name><surname>Schafer</surname> <given-names>A. J.</given-names></name> <name><surname>Shibata</surname> <given-names>T.</given-names></name> <name><surname>Stratton</surname> <given-names>M. R.</given-names></name> <name><surname>Vockley</surname> <given-names>J. G.</given-names></name> <name><surname>Watanabe</surname> <given-names>K.</given-names></name> <name><surname>Yang</surname> <given-names>H.</given-names></name> <name><surname>Yuen</surname> <given-names>M. M. F.</given-names></name> <name><surname>Knoppers</surname> <given-names>B. M.</given-names></name> <name><surname>Bobrow</surname> <given-names>M.</given-names></name> <name><surname>Cambon-Thomsen</surname> <given-names>A.</given-names></name> <name><surname>Dressler</surname> <given-names>L. G.</given-names></name> <name><surname>Dyke</surname> <given-names>S. O. M.</given-names></name> <name><surname>Joly</surname> <given-names>Y.</given-names></name> <name><surname>Kato</surname> <given-names>K.</given-names></name> <name><surname>Kennedy</surname> <given-names>K. L.</given-names></name> <name><surname>Nicol&#x000E1;s</surname> <given-names>P.</given-names></name> <name><surname>Parker</surname> <given-names>M. J.</given-names></name> <name><surname>Rial-Sebbag</surname> <given-names>E.</given-names></name> <name><surname>Romeo-Casabona</surname> <given-names>C. M.</given-names></name> <name><surname>Shaw</surname> <given-names>K. M.</given-names></name> <name><surname>Wallace</surname> <given-names>S.</given-names></name> <name><surname>Wiesner</surname> <given-names>G. L.</given-names></name> <name><surname>Zeps</surname> <given-names>N.</given-names></name> <name><surname>Lichter</surname> <given-names>P.</given-names></name> <name><surname>Biankin</surname> <given-names>A. V.</given-names></name> <name><surname>Chabannon</surname> <given-names>C.</given-names></name> <name><surname>Chin</surname> <given-names>L.</given-names></name> <name><surname>Cl&#x000E9;ment</surname> <given-names>B.</given-names></name> <name><surname>de Alava</surname> <given-names>E.</given-names></name> <name><surname>Degos</surname> <given-names>F.</given-names></name> <name><surname>Ferguson</surname> <given-names>M. L.</given-names></name> <name><surname>Geary</surname> <given-names>P.</given-names></name> <name><surname>Hayes</surname> <given-names>D. N.</given-names></name> <name><surname>Hudson</surname> <given-names>T. J.</given-names></name> <name><surname>Johns</surname> <given-names>A. L.</given-names></name> <name><surname>Kasprzyk</surname> <given-names>A.</given-names></name> <name><surname>Nakagawa</surname> <given-names>H.</given-names></name> <name><surname>Penny</surname> <given-names>R.</given-names></name> <name><surname>Piris</surname> <given-names>M. A.</given-names></name> <name><surname>Sarin</surname> <given-names>R.</given-names></name> <name><surname>Scarpa</surname> <given-names>A.</given-names></name> <name><surname>Shibata</surname> <given-names>T.</given-names></name> <name><surname>van de Vijver</surname> <given-names>M.</given-names></name> <name><surname>Futreal</surname> <given-names>P. A.</given-names></name> <name><surname>Aburatani</surname> <given-names>H.</given-names></name> <name><surname>Bay&#x000E9;s</surname> <given-names>M.</given-names></name> <name><surname>Botwell</surname> <given-names>D. D.</given-names></name> <name><surname>Campbell</surname> <given-names>P. J.</given-names></name> <name><surname>Estivill</surname> <given-names>X.</given-names></name> <name><surname>Gerhard</surname> <given-names>D. S.</given-names></name> <name><surname>Grimmond</surname> <given-names>S. M.</given-names></name> <name><surname>Gut</surname> <given-names>I.</given-names></name> <name><surname>Hirst</surname> <given-names>M.</given-names></name> <name><surname>L&#x000F3;pez-Ot&#x000ED;n</surname> <given-names>C.</given-names></name> <name><surname>Majumder</surname> <given-names>P.</given-names></name> <name><surname>Marra</surname> <given-names>M.</given-names></name> <name><surname>McPherson</surname> <given-names>J. D.</given-names></name> <name><surname>Nakagawa</surname> <given-names>H.</given-names></name> <name><surname>Ning</surname> <given-names>Z.</given-names></name> <name><surname>Puente</surname> <given-names>X. S.</given-names></name> <name><surname>Ruan</surname> <given-names>Y.</given-names></name> <name><surname>Shibata</surname> <given-names>T.</given-names></name> <name><surname>Stratton</surname> <given-names>M. R.</given-names></name> <name><surname>Stunnenberg</surname> <given-names>H. G.</given-names></name> <name><surname>Swerdlow</surname> <given-names>H.</given-names></name> <name><surname>Velculescu</surname> <given-names>V. E.</given-names></name> <name><surname>Wilson</surname> <given-names>R. K.</given-names></name> <name><surname>Xue</surname> <given-names>H. H.</given-names></name> <name><surname>Yang</surname> <given-names>L.</given-names></name> <name><surname>Spellman</surname> <given-names>P. T.</given-names></name> <name><surname>Bader</surname> <given-names>G. D.</given-names></name> <name><surname>Boutros</surname> <given-names>P. C.</given-names></name> <name><surname>Campbell</surname> <given-names>P. J.</given-names></name> <name><surname>Flicek</surname> <given-names>P.</given-names></name> <name><surname>Getz</surname> <given-names>G.</given-names></name> <name><surname>Guig&#x000F3;</surname> <given-names>R.</given-names></name> <name><surname>Guo</surname> <given-names>G.</given-names></name> <name><surname>Haussler</surname> <given-names>D.</given-names></name> <name><surname>Heath</surname> <given-names>S.</given-names></name> <name><surname>Hubbard</surname> <given-names>T. J.</given-names></name> <name><surname>Jiang</surname> <given-names>T.</given-names></name> <name><surname>Jones</surname> <given-names>S. M.</given-names></name> <name><surname>Li</surname> <given-names>Q.</given-names></name> <name><surname>L&#x000F3;pez-Bigas</surname> <given-names>N.</given-names></name> <name><surname>Luo</surname> <given-names>R.</given-names></name> <name><surname>Muthuswamy</surname> <given-names>L.</given-names></name> <name><surname>Ouellette</surname> <given-names>B. F.</given-names></name> <name><surname>Pearson</surname> <given-names>J. V.</given-names></name> <name><surname>Puente</surname> <given-names>X. S.</given-names></name> <name><surname>Quesada</surname> <given-names>V.</given-names></name> <name><surname>Raphael</surname> <given-names>B. J.</given-names></name> <name><surname>Sander</surname> <given-names>C.</given-names></name> <name><surname>Shibata</surname> <given-names>T.</given-names></name> <name><surname>Speed</surname> <given-names>T. P.</given-names></name> <name><surname>Stein</surname> <given-names>L. D.</given-names></name> <name><surname>Stuart</surname> <given-names>J. M.</given-names></name> <name><surname>Teague</surname> <given-names>J. W.</given-names></name> <name><surname>Totoki</surname> <given-names>Y.</given-names></name> <name><surname>Tsunoda</surname> <given-names>T.</given-names></name> <name><surname>Valencia</surname> <given-names>A.</given-names></name> <name><surname>Wheeler</surname> <given-names>D. A.</given-names></name> <name><surname>Wu</surname> <given-names>H.</given-names></name> <name><surname>Zhao</surname> <given-names>S.</given-names></name> <name><surname>Zhou</surname> <given-names>G.</given-names></name> <name><surname>Stein</surname> <given-names>L. D.</given-names></name> <name><surname>Guig&#x000F3;</surname> <given-names>R.</given-names></name> <name><surname>Hubbard</surname> <given-names>T. J.</given-names></name> <name><surname>Joly</surname> <given-names>Y.</given-names></name> <name><surname>Jones</surname> <given-names>S. M.</given-names></name> <name><surname>Kasprzyk</surname> <given-names>A.</given-names></name> <name><surname>Lathrop</surname> <given-names>M.</given-names></name> <name><surname>L&#x000F3;pez-Bigas</surname> <given-names>N.</given-names></name> <name><surname>Ouellette</surname> <given-names>B. F.</given-names></name> <name><surname>Spellman</surname> <given-names>P. T.</given-names></name> <name><surname>Teague</surname> <given-names>J. W.</given-names></name> <name><surname>Thomas</surname> <given-names>G.</given-names></name> <name><surname>Valencia</surname> <given-names>A.</given-names></name> <name><surname>Yoshida</surname> <given-names>T.</given-names></name> <name><surname>Kennedy</surname> <given-names>K. L.</given-names></name> <name><surname>Axton</surname> <given-names>M.</given-names></name> <name><surname>Dyke</surname> <given-names>S. O.</given-names></name> <name><surname>Futreal</surname> <given-names>P. A.</given-names></name> <name><surname>Gerhard</surname> <given-names>D. S.</given-names></name> <name><surname>Gunter</surname> <given-names>C.</given-names></name> <name><surname>Guyer</surname> <given-names>M.</given-names></name> <name><surname>Hudson</surname> <given-names>T. J.</given-names></name> <name><surname>McPherson</surname> <given-names>J. D.</given-names></name> <name><surname>Miller</surname> <given-names>L. J.</given-names></name> <name><surname>Ozenberger</surname> <given-names>B.</given-names></name> <name><surname>Shaw</surname> <given-names>K. M.</given-names></name> <name><surname>Kasprzyk</surname> <given-names>A.</given-names></name> <name><surname>Stein</surname> <given-names>L. D.</given-names></name> <name><surname>Zhang</surname> <given-names>J.</given-names></name> <name><surname>Haider</surname> <given-names>S. A.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name> <name><surname>Yung</surname> <given-names>C. K.</given-names></name> <name><surname>Cros</surname> <given-names>A.</given-names></name> <name><surname>Liang</surname> <given-names>Y.</given-names></name> <name><surname>Gnaneshan</surname> <given-names>S.</given-names></name> <name><surname>Guberman</surname> <given-names>J.</given-names></name> <name><surname>Hsu</surname> <given-names>J.</given-names></name> <name><surname>Bobrow</surname> <given-names>M.</given-names></name> <name><surname>Chalmers</surname> <given-names>D. R.</given-names></name> <name><surname>Hasel</surname> <given-names>K. W.</given-names></name> <name><surname>Joly</surname> <given-names>Y.</given-names></name> <name><surname>Kaan</surname> <given-names>T. S.</given-names></name> <name><surname>Kennedy</surname> <given-names>K. L.</given-names></name> <name><surname>Knoppers</surname> <given-names>B. M.</given-names></name> <name><surname>Lowrance</surname> <given-names>W. W.</given-names></name> <name><surname>Masui</surname> <given-names>T.</given-names></name> <name><surname>Nicol&#x000E1;s</surname> <given-names>P.</given-names></name> <name><surname>Rial-Sebbag</surname> <given-names>E.</given-names></name> <name><surname>Rodriguez</surname> <given-names>L. L.</given-names></name> <name><surname>Vergely</surname> <given-names>C.</given-names></name> <name><surname>Yoshida</surname> <given-names>T.</given-names></name> <name><surname>Grimmond</surname> <given-names>S. M.</given-names></name> <name><surname>Biankin</surname> <given-names>A. V.</given-names></name> <name><surname>Bowtell</surname> <given-names>D. D.</given-names></name> <name><surname>Cloonan</surname> <given-names>N.</given-names></name> <name><surname>deFazio</surname> <given-names>A.</given-names></name> <name><surname>Eshleman</surname> <given-names>J. R.</given-names></name> <name><surname>Etemadmoghadam</surname> <given-names>D.</given-names></name> <name><surname>Gardiner</surname> <given-names>B. B.</given-names></name> <name><surname>Kench</surname> <given-names>J. G.</given-names></name> <name><surname>Scarpa</surname> <given-names>A.</given-names></name> <name><surname>Sutherland</surname> <given-names>R. L.</given-names></name> <name><surname>Tempero</surname> <given-names>M. A.</given-names></name> <name><surname>Waddell</surname> <given-names>N. J.</given-names></name> <name><surname>Wilson</surname> <given-names>P. J.</given-names></name> <name><surname>McPherson</surname> <given-names>J. D.</given-names></name> <name><surname>Gallinger</surname> <given-names>S.</given-names></name> <name><surname>Tsao</surname> <given-names>M. S.</given-names></name> <name><surname>Shaw</surname> <given-names>P. A.</given-names></name> <name><surname>Petersen</surname> <given-names>G. M.</given-names></name> <name><surname>Mukhopadhyay</surname> <given-names>D.</given-names></name> <name><surname>Chin</surname> <given-names>L.</given-names></name> <name><surname>DePinho</surname> <given-names>R. A.</given-names></name> <name><surname>Thayer</surname> <given-names>S.</given-names></name> <name><surname>Muthuswamy</surname> <given-names>L.</given-names></name> <name><surname>Shazand</surname> <given-names>K.</given-names></name> <name><surname>Beck</surname> <given-names>T.</given-names></name> <name><surname>Sam</surname> <given-names>M.</given-names></name> <name><surname>Timms</surname> <given-names>L.</given-names></name> <name><surname>Ballin</surname> <given-names>V.</given-names></name> <name><surname>Lu</surname> <given-names>Y.</given-names></name> <name><surname>Ji</surname> <given-names>J.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Chen</surname> <given-names>F.</given-names></name> <name><surname>Hu</surname> <given-names>X.</given-names></name> <name><surname>Zhou</surname> <given-names>G.</given-names></name> <name><surname>Yang</surname> <given-names>Q.</given-names></name> <name><surname>Tian</surname> <given-names>G.</given-names></name> <name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Xing</surname> <given-names>X.</given-names></name> <name><surname>Li</surname> <given-names>X.</given-names></name> <name><surname>Zhu</surname> <given-names>Z.</given-names></name> <name><surname>Yu</surname> <given-names>Y.</given-names></name> <name><surname>Yu</surname> <given-names>J.</given-names></name> <name><surname>Yang</surname> <given-names>H.</given-names></name> <name><surname>Lathrop</surname> <given-names>M.</given-names></name> <name><surname>Tost</surname> <given-names>J.</given-names></name> <name><surname>Brennan</surname> <given-names>P.</given-names></name> <name><surname>Holcatova</surname> <given-names>I.</given-names></name> <name><surname>Zaridze</surname> <given-names>D.</given-names></name> <name><surname>Brazma</surname> <given-names>A.</given-names></name> <name><surname>Egevard</surname> <given-names>L.</given-names></name> <name><surname>Prokhortchouk</surname> <given-names>E.</given-names></name> <name><surname>Banks</surname> <given-names>R. E.</given-names></name> <name><surname>Uhl&#x000E9;n</surname> <given-names>M.</given-names></name> <name><surname>Cambon-Thomsen</surname> <given-names>A.</given-names></name> <name><surname>Viksna</surname> <given-names>J.</given-names></name> <name><surname>Ponten</surname> <given-names>F.</given-names></name> <name><surname>Skryabin</surname> <given-names>K.</given-names></name> <name><surname>Stratton</surname> <given-names>M. R.</given-names></name> <name><surname>Futreal</surname> <given-names>P. A.</given-names></name> <name><surname>Birney</surname> <given-names>E.</given-names></name> <name><surname>Borg</surname> <given-names>A.</given-names></name> <name><surname>B&#x000F8;rresen-Dale</surname> <given-names>A. L.</given-names></name> <name><surname>Caldas</surname> <given-names>C.</given-names></name> <name><surname>Foekens</surname> <given-names>J. A.</given-names></name> <name><surname>Martin</surname> <given-names>S.</given-names></name> <name><surname>Reis-Filho</surname> <given-names>J. S.</given-names></name> <name><surname>Richardson</surname> <given-names>A. L.</given-names></name> <name><surname>Sotiriou</surname> <given-names>C.</given-names></name> <name><surname>Stunnenberg</surname> <given-names>H. G.</given-names></name> <name><surname>Thoms</surname> <given-names>G.</given-names></name> <name><surname>van de Vijver</surname> <given-names>M.</given-names></name> <name><surname>van&#x02019;t Veer</surname> <given-names>L.</given-names></name> <name><surname>Calvo</surname> <given-names>F.</given-names></name> <name><surname>Birnbaum</surname> <given-names>D.</given-names></name> <name><surname>Blanche</surname> <given-names>H.</given-names></name> <name><surname>Boucher</surname> <given-names>P.</given-names></name> <name><surname>Boyault</surname> <given-names>S.</given-names></name> <name><surname>Chabannon</surname> <given-names>C.</given-names></name> <name><surname>Gut</surname> <given-names>I.</given-names></name> <name><surname>Masson-Jacquemier</surname> <given-names>J. D.</given-names></name> <name><surname>Lathrop</surname> <given-names>M.</given-names></name> <name><surname>Pauport&#x000E9;</surname> <given-names>I.</given-names></name> <name><surname>Pivot</surname> <given-names>X.</given-names></name> <name><surname>Vincent-Salomon</surname> <given-names>A.</given-names></name> <name><surname>Tabone</surname> <given-names>E.</given-names></name> <name><surname>Theillet</surname> <given-names>C.</given-names></name> <name><surname>Thomas</surname> <given-names>G.</given-names></name> <name><surname>Tost</surname> <given-names>J.</given-names></name> <name><surname>Treilleux</surname> <given-names>I.</given-names></name> <name><surname>Calvo</surname> <given-names>F.</given-names></name> <name><surname>Bioulac-Sage</surname> <given-names>P.</given-names></name> <name><surname>Cl&#x000E9;ment</surname> <given-names>B.</given-names></name> <name><surname>Decaens</surname> <given-names>T.</given-names></name> <name><surname>Degos</surname> <given-names>F.</given-names></name> <name><surname>Franco</surname> <given-names>D.</given-names></name> <name><surname>Gut</surname> <given-names>I.</given-names></name> <name><surname>Gut</surname> <given-names>M.</given-names></name> <name><surname>Heath</surname> <given-names>S.</given-names></name> <name><surname>Lathrop</surname> <given-names>M.</given-names></name> <name><surname>Samuel</surname> <given-names>D.</given-names></name> <name><surname>Thomas</surname> <given-names>G.</given-names></name> <name><surname>Zucman-Rossi</surname> <given-names>J.</given-names></name> <name><surname>Lichter</surname> <given-names>P.</given-names></name> <name><surname>Eils</surname> <given-names>R.</given-names></name> <name><surname>Brors</surname> <given-names>B.</given-names></name> <name><surname>Korbel</surname> <given-names>J. O.</given-names></name> <name><surname>Korshunov</surname> <given-names>A.</given-names></name> <name><surname>Landgraf</surname> <given-names>P.</given-names></name> <name><surname>Lehrach</surname> <given-names>H.</given-names></name> <name><surname>Pfister</surname> <given-names>S.</given-names></name> <name><surname>Radlwimmer</surname> <given-names>B.</given-names></name> <name><surname>Reifenberger</surname> <given-names>G.</given-names></name> <name><surname>Taylor</surname> <given-names>M. D.</given-names></name> <name><surname>von Kalle</surname> <given-names>C.</given-names></name> <name><surname>Majumder</surname> <given-names>P. P.</given-names></name> <name><surname>Sarin</surname> <given-names>R.</given-names></name> <name><surname>Rao</surname> <given-names>T. S.</given-names></name> <name><surname>Bhan</surname> <given-names>M. K.</given-names></name> <name><surname>Scarpa</surname> <given-names>A.</given-names></name> <name><surname>Pederzoli</surname> <given-names>P.</given-names></name> <name><surname>Lawlor</surname> <given-names>R. A.</given-names></name> <name><surname>Delledonne</surname> <given-names>M.</given-names></name> <name><surname>Bardelli</surname> <given-names>A.</given-names></name> <name><surname>Biankin</surname> <given-names>A. V.</given-names></name> <name><surname>Grimmond</surname> <given-names>S. M.</given-names></name> <name><surname>Gress</surname> <given-names>T.</given-names></name> <name><surname>Klimstra</surname> <given-names>D.</given-names></name> <name><surname>Zamboni</surname> <given-names>G.</given-names></name> <name><surname>Shibata</surname> <given-names>T.</given-names></name> <name><surname>Nakamura</surname> <given-names>Y.</given-names></name> <name><surname>Nakagawa</surname> <given-names>H.</given-names></name> <name><surname>Kusada</surname> <given-names>J.</given-names></name> <name><surname>Tsunoda</surname> <given-names>T.</given-names></name> <name><surname>Miyano</surname> <given-names>S.</given-names></name> <name><surname>Aburatani</surname> <given-names>H.</given-names></name> <name><surname>Kato</surname> <given-names>K.</given-names></name> <name><surname>Fujimoto</surname> <given-names>A.</given-names></name> <name><surname>Yoshida</surname> <given-names>T.</given-names></name> <name><surname>Campo</surname> <given-names>E.</given-names></name> <name><surname>L&#x000F3;pez-Ot&#x000ED;n</surname> <given-names>C.</given-names></name> <name><surname>Estivill</surname> <given-names>X.</given-names></name> <name><surname>Guig&#x000F3;</surname> <given-names>R.</given-names></name> <name><surname>de Sanjos&#x000E9;</surname> <given-names>S.</given-names></name> <name><surname>Piris</surname> <given-names>M. A.</given-names></name> <name><surname>Montserrat</surname> <given-names>E.</given-names></name> <name><surname>Gonz&#x000E1;lez-D&#x000ED;az</surname> <given-names>M.</given-names></name> <name><surname>Puente</surname> <given-names>X. S.</given-names></name> <name><surname>Jares</surname> <given-names>P.</given-names></name> <name><surname>Valencia</surname> <given-names>A.</given-names></name> <name><surname>Himmelbauer</surname> <given-names>H.</given-names></name> <name><surname>Quesada</surname> <given-names>V.</given-names></name> <name><surname>Bea</surname> <given-names>S.</given-names></name> <name><surname>Stratton</surname> <given-names>M. R.</given-names></name> <name><surname>Futreal</surname> <given-names>P. A.</given-names></name> <name><surname>Campbell</surname> <given-names>P. J.</given-names></name> <name><surname>Vincent-Salomon</surname> <given-names>A.</given-names></name> <name><surname>Richardson</surname> <given-names>A. L.</given-names></name> <name><surname>Reis-Filho</surname> <given-names>J. S.</given-names></name> <name><surname>van de Vijver</surname> <given-names>M.</given-names></name> <name><surname>Thomas</surname> <given-names>G.</given-names></name> <name><surname>Masson-Jacquemier</surname> <given-names>J. D.</given-names></name> <name><surname>Aparicio</surname> <given-names>S.</given-names></name> <name><surname>Borg</surname> <given-names>A.</given-names></name> <name><surname>B&#x000F8;rresen-Dale</surname> <given-names>A. L.</given-names></name> <name><surname>Caldas</surname> <given-names>C.</given-names></name> <name><surname>Foekens</surname> <given-names>J. A.</given-names></name> <name><surname>Stunnenberg</surname> <given-names>H. G.</given-names></name> <name><surname>van&#x02019;t Veer</surname> <given-names>L.</given-names></name> <name><surname>Easton</surname> <given-names>D. F.</given-names></name> <name><surname>Spellman</surname> <given-names>P. T.</given-names></name> <name><surname>Martin</surname> <given-names>S.</given-names></name> <name><surname>Barker</surname> <given-names>A. D.</given-names></name> <name><surname>Chin</surname> <given-names>L.</given-names></name> <name><surname>Collins</surname> <given-names>F. S.</given-names></name> <name><surname>Compton</surname> <given-names>C. C.</given-names></name> <name><surname>Ferguson</surname> <given-names>M. L.</given-names></name> <name><surname>Gerhard</surname> <given-names>D. S.</given-names></name> <name><surname>Getz</surname> <given-names>G.</given-names></name> <name><surname>Gunter</surname> <given-names>C.</given-names></name> <name><surname>Guttmacher</surname> <given-names>A.</given-names></name> <name><surname>Guyer</surname> <given-names>M.</given-names></name> <name><surname>Hayes</surname> <given-names>D. N.</given-names></name> <name><surname>Lander</surname> <given-names>E. S.</given-names></name> <name><surname>Ozenberger</surname> <given-names>B.</given-names></name> <name><surname>Penny</surname> <given-names>R.</given-names></name> <name><surname>Peterson</surname> <given-names>J.</given-names></name> <name><surname>Sander</surname> <given-names>C.</given-names></name> <name><surname>Shaw</surname> <given-names>K. M.</given-names></name> <name><surname>Speed</surname> <given-names>T. P.</given-names></name> <name><surname>Spellman</surname> <given-names>P. T.</given-names></name> <name><surname>Vockley</surname> <given-names>J. G.</given-names></name> <name><surname>Wheeler</surname> <given-names>D. A.</given-names></name> <name><surname>Wilson</surname> <given-names>R. K.</given-names></name> <name><surname>Hudson</surname> <given-names>T. J.</given-names></name> <name><surname>Chin</surname> <given-names>L.</given-names></name> <name><surname>Knoppers</surname> <given-names>B. M.</given-names></name> <name><surname>Lander</surname> <given-names>E. S.</given-names></name> <name><surname>Lichter</surname> <given-names>P.</given-names></name> <name><surname>Stein</surname> <given-names>L. D.</given-names></name> <name><surname>Stratton</surname> <given-names>M. R.</given-names></name> <name><surname>Anderson</surname> <given-names>W.</given-names></name> <name><surname>Barker</surname> <given-names>A. D.</given-names></name> <name><surname>Bell</surname> <given-names>C.</given-names></name> <name><surname>Bobrow</surname> <given-names>M.</given-names></name> <name><surname>Burke</surname> <given-names>W.</given-names></name> <name><surname>Collins</surname> <given-names>F. S.</given-names></name> <name><surname>Compton</surname> <given-names>C. C.</given-names></name> <name><surname>DePinho</surname> <given-names>R. A.</given-names></name> <name><surname>Easton</surname> <given-names>D. F.</given-names></name> <name><surname>Futreal</surname> <given-names>P. A.</given-names></name> <name><surname>Gerhard</surname> <given-names>D. S.</given-names></name> <name><surname>Green</surname> <given-names>A. R.</given-names></name> <name><surname>Guyer</surname> <given-names>M.</given-names></name> <name><surname>Hamilton</surname> <given-names>S. R.</given-names></name> <name><surname>Hubbard</surname> <given-names>T. J.</given-names></name> <name><surname>Kallioniemi</surname> <given-names>O. P.</given-names></name> <name><surname>Kennedy</surname> <given-names>K. L.</given-names></name> <name><surname>Ley</surname> <given-names>T. J.</given-names></name> <name><surname>Liu</surname> <given-names>E. T.</given-names></name> <name><surname>Lu</surname> <given-names>Y.</given-names></name> <name><surname>Majumder</surname> <given-names>P.</given-names></name> <name><surname>Marra</surname> <given-names>M.</given-names></name> <name><surname>Ozenberger</surname> <given-names>B.</given-names></name> <name><surname>Peterson</surname> <given-names>J.</given-names></name> <name><surname>Schafer</surname> <given-names>A. J.</given-names></name> <name><surname>Spellman</surname> <given-names>P. T.</given-names></name> <name><surname>Stunnenberg</surname> <given-names>H. G.</given-names></name> <name><surname>Wainwright</surname> <given-names>B. J.</given-names></name> <name><surname>Wilson</surname> <given-names>R. K.</given-names></name> <name><surname>Yang</surname> <given-names>H.</given-names></name></person-group> (<year>2010a</year>). <article-title>International network of cancer genome projects</article-title>. <source>Nature</source> <volume>464</volume>, <fpage>993</fpage>&#x02013;<lpage>998</lpage>.<pub-id pub-id-type="doi">10.1038/nature08987</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Consortium</surname> <given-names>I. H.</given-names></name> <name><surname>Altshuler</surname> <given-names>D. M.</given-names></name> <name><surname>Gibbs</surname> <given-names>R. A.</given-names></name> <name><surname>Peltonen</surname> <given-names>L.</given-names></name> <name><surname>Altshuler</surname> <given-names>D. M.</given-names></name> <name><surname>Gibbs</surname> <given-names>R. A.</given-names></name> <name><surname>Peltonen</surname> <given-names>L.</given-names></name> <name><surname>Dermitzakis</surname> <given-names>E.</given-names></name> <name><surname>Schaffner</surname> <given-names>S. F.</given-names></name> <name><surname>Yu</surname> <given-names>F.</given-names></name> <name><surname>Peltonen</surname> <given-names>L.</given-names></name> <name><surname>Dermitzakis</surname> <given-names>E.</given-names></name> <name><surname>Bonnen</surname> <given-names>P. E.</given-names></name> <name><surname>Altshuler</surname> <given-names>D. M.</given-names></name> <name><surname>Gibbs</surname> <given-names>R. A.</given-names></name> <name><surname>de Bakker</surname> <given-names>P. I. W.</given-names></name> <name><surname>Deloukas</surname> <given-names>P.</given-names></name> <name><surname>Gabriel</surname> <given-names>S. B.</given-names></name> <name><surname>Gwilliam</surname> <given-names>R.</given-names></name> <name><surname>Hunt</surname> <given-names>S.</given-names></name> <name><surname>Inouye</surname> <given-names>M.</given-names></name> <name><surname>Jia</surname> <given-names>X.</given-names></name> <name><surname>Palotie</surname> <given-names>A.</given-names></name> <name><surname>Parkin</surname> <given-names>M.</given-names></name> <name><surname>Whittaker</surname> <given-names>P.</given-names></name> <name><surname>Yu</surname> <given-names>F.</given-names></name> <name><surname>Chang</surname> <given-names>K.</given-names></name> <name><surname>Hawes</surname> <given-names>A.</given-names></name> <name><surname>Lewis</surname> <given-names>L. R.</given-names></name> <name><surname>Ren</surname> <given-names>Y.</given-names></name> <name><surname>Wheeler</surname> <given-names>D.</given-names></name> <name><surname>Gibbs</surname> <given-names>R. A.</given-names></name> <name><surname>Muzny</surname> <given-names>D. M.</given-names></name> <name><surname>Barnes</surname> <given-names>C.</given-names></name> <name><surname>Darvishi</surname> <given-names>K.</given-names></name> <name><surname>Hurles</surname> <given-names>M.</given-names></name> <name><surname>Korn</surname> <given-names>J. M.</given-names></name> <name><surname>Kristiansson</surname> <given-names>K.</given-names></name> <name><surname>Lee</surname> <given-names>C.</given-names></name> <name><surname>McCarrol</surname> <given-names>S. A.</given-names></name> <name><surname>Nemesh</surname> <given-names>J.</given-names></name> <name><surname>Dermitzakis</surname> <given-names>E.</given-names></name> <name><surname>Keinan</surname> <given-names>A.</given-names></name> <name><surname>Montgomery</surname> <given-names>S. B.</given-names></name> <name><surname>Pollack</surname> <given-names>S.</given-names></name> <name><surname>Price</surname> <given-names>A. L.</given-names></name> <name><surname>Soranzo</surname> <given-names>N.</given-names></name> <name><surname>Bonnen</surname> <given-names>P. E.</given-names></name> <name><surname>Gibbs</surname> <given-names>R. A.</given-names></name> <name><surname>Gonzaga-Jauregui</surname> <given-names>C.</given-names></name> <name><surname>Keinan</surname> <given-names>A.</given-names></name> <name><surname>Price</surname> <given-names>A. L.</given-names></name> <name><surname>Yu</surname> <given-names>F.</given-names></name> <name><surname>Anttila</surname> <given-names>V.</given-names></name> <name><surname>Brodeur</surname> <given-names>W.</given-names></name> <name><surname>Daly</surname> <given-names>M. J.</given-names></name> <name><surname>Leslie</surname> <given-names>S.</given-names></name> <name><surname>McVean</surname> <given-names>G.</given-names></name> <name><surname>Moutsianas</surname> <given-names>L.</given-names></name> <name><surname>Nguyen</surname> <given-names>H.</given-names></name> <name><surname>Schaffner</surname> <given-names>S. F.</given-names></name> <name><surname>Zhang</surname> <given-names>Q.</given-names></name> <name><surname>Ghori</surname> <given-names>M. J. R.</given-names></name> <name><surname>McGinnis</surname> <given-names>R.</given-names></name> <name><surname>McLaren</surname> <given-names>W.</given-names></name> <name><surname>Pollack</surname> <given-names>S.</given-names></name> <name><surname>Price</surname> <given-names>A. L.</given-names></name> <name><surname>Schaffner</surname> <given-names>S. F.</given-names></name> <name><surname>Takeuchi</surname> <given-names>F.</given-names></name> <name><surname>Grossman</surname> <given-names>S. R.</given-names></name> <name><surname>Shlyakhter</surname> <given-names>I.</given-names></name> <name><surname>Hostetter</surname> <given-names>E. B.</given-names></name> <name><surname>Sabeti</surname> <given-names>P. C.</given-names></name> <name><surname>Adebamowo</surname> <given-names>C. A.</given-names></name> <name><surname>Foster</surname> <given-names>M. W.</given-names></name> <name><surname>Gordon</surname> <given-names>D. R.</given-names></name> <name><surname>Licinio</surname> <given-names>J.</given-names></name> <name><surname>Manca</surname> <given-names>M. C.</given-names></name> <name><surname>Marshall</surname> <given-names>P. A.</given-names></name> <name><surname>Matsuda</surname> <given-names>I.</given-names></name> <name><surname>Ngare</surname> <given-names>D.</given-names></name> <name><surname>Wang</surname> <given-names>V. O.</given-names></name> <name><surname>Reddy</surname> <given-names>D.</given-names></name> <name><surname>Rotimi</surname> <given-names>C. N.</given-names></name> <name><surname>Royal</surname> <given-names>C. D.</given-names></name> <name><surname>Sharp</surname> <given-names>R. R.</given-names></name> <name><surname>Zeng</surname> <given-names>C.</given-names></name> <name><surname>Brooks</surname> <given-names>L. D.</given-names></name> <name><surname>McEwen</surname> <given-names>J. E.</given-names></name></person-group> (<year>2010b</year>). <article-title>Integrating common and rare genetic variation in diverse human populations</article-title>. <source>Nature</source> <volume>467</volume>, <fpage>52</fpage>&#x02013;<lpage>58</lpage>.<pub-id pub-id-type="doi">10.1038/nature09298</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Consortium</surname> <given-names>G. P.</given-names></name> <name><surname>Durbin</surname> <given-names>R. M.</given-names></name> <name><surname>Abecasis</surname> <given-names>G. R.</given-names></name> <name><surname>Altshuler</surname> <given-names>D. L.</given-names></name> <name><surname>Auton</surname> <given-names>A.</given-names></name> <name><surname>Brooks</surname> <given-names>L. D.</given-names></name> <name><surname>Durbin</surname> <given-names>R. M.</given-names></name> <name><surname>Gibbs</surname> <given-names>R. A.</given-names></name> <name><surname>Hurles</surname> <given-names>M. E.</given-names></name> <name><surname>McVean</surname> <given-names>G. A.</given-names></name></person-group> (<year>2010c</year>). <article-title>A map of human genome variation from population-scale sequencing</article-title>. <source>Nature</source> <volume>467</volume>, <fpage>1061</fpage>&#x02013;<lpage>1073</lpage>.<pub-id pub-id-type="doi">10.1038/nature09534</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Consortium</surname> <given-names>U.</given-names></name></person-group> (<year>2007</year>). <article-title>The universal protein resource (UniProt)</article-title>. <source>Nucleic Acids Res.</source> <volume>35</volume>, <fpage>D193</fpage>&#x02013;<lpage>D197</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkl929</pub-id><pub-id pub-id-type="pmid">17142230</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Erdogmus</surname> <given-names>M.</given-names></name> <name><surname>Sezerman</surname> <given-names>O. U.</given-names></name></person-group> (<year>2007</year>). <article-title>Application of automatic mutation-gene pair extraction to diseases</article-title>. <source>J. Bioinform. Comput. Biol.</source> <volume>5</volume>, <fpage>1261</fpage>&#x02013;<lpage>1275</lpage>.<pub-id pub-id-type="doi">10.1142/S021972000700317X</pub-id><pub-id pub-id-type="pmid">18172928</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ferrer-Costa</surname> <given-names>C.</given-names></name> <name><surname>Gelp&#x000ED;</surname> <given-names>J. L.</given-names></name> <name><surname>Zamakola</surname> <given-names>L.</given-names></name> <name><surname>Parraga</surname> <given-names>I.</given-names></name> <name><surname>de la Cruz</surname> <given-names>X.</given-names></name> <name><surname>Orozco</surname> <given-names>M.</given-names></name></person-group> (<year>2005</year>). <article-title>PMUT: a web-based tool for the annotation of pathological mutations on proteins</article-title>. <source>Bioinformatics</source> <volume>21</volume>, <fpage>3176</fpage>&#x02013;<lpage>3178</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/bti486</pub-id><pub-id pub-id-type="pmid">15879453</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ferrer-Costa</surname> <given-names>C.</given-names></name> <name><surname>Orozco</surname> <given-names>M.</given-names></name> <name><surname>de la Cruz</surname> <given-names>X.</given-names></name></person-group> (<year>2002</year>). <article-title>Characterization of disease-associated single amino acid polymorphisms in terms of sequence and structure properties</article-title>. <source>J. Mol. Biol.</source> <volume>315</volume>, <fpage>771</fpage>&#x02013;<lpage>786</lpage>.<pub-id pub-id-type="doi">10.1006/jmbi.2001.5255</pub-id><pub-id pub-id-type="pmid">11812146</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Finn</surname> <given-names>R. D.</given-names></name> <name><surname>Mistry</surname> <given-names>J.</given-names></name> <name><surname>Tate</surname> <given-names>J.</given-names></name> <name><surname>Coggill</surname> <given-names>P.</given-names></name> <name><surname>Heger</surname> <given-names>A.</given-names></name> <name><surname>Pollington</surname> <given-names>J. E.</given-names></name> <name><surname>Gavin</surname> <given-names>O. L.</given-names></name> <name><surname>Gunasekaran</surname> <given-names>P.</given-names></name> <name><surname>Ceric</surname> <given-names>G.</given-names></name> <name><surname>Forslund</surname> <given-names>K.</given-names></name> <name><surname>Holm</surname> <given-names>L.</given-names></name> <name><surname>Sonnhammer</surname> <given-names>E. L. L.</given-names></name> <name><surname>Eddy</surname> <given-names>S. R.</given-names></name> <name><surname>Bateman</surname> <given-names>A.</given-names></name></person-group> (<year>2010</year>). <article-title>The pfam protein families database</article-title>. <source>Nucleic Acids Res.</source> <volume>38</volume>, <fpage>D211</fpage>&#x02013;<lpage>D222</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkp985</pub-id><pub-id pub-id-type="pmid">19920124</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Flicek</surname> <given-names>P.</given-names></name> <name><surname>Amode</surname> <given-names>M. R.</given-names></name> <name><surname>Barrell</surname> <given-names>D.</given-names></name> <name><surname>Beal</surname> <given-names>K.</given-names></name> <name><surname>Brent</surname> <given-names>S.</given-names></name> <name><surname>Chen</surname> <given-names>Y.</given-names></name> <name><surname>Clapham</surname> <given-names>P.</given-names></name> <name><surname>Coates</surname> <given-names>G.</given-names></name> <name><surname>Fairley</surname> <given-names>S.</given-names></name> <name><surname>Fitzgerald</surname> <given-names>S.</given-names></name> <name><surname>Gordon</surname> <given-names>L.</given-names></name> <name><surname>Hendrix</surname> <given-names>M.</given-names></name> <name><surname>Hourlier</surname> <given-names>T.</given-names></name> <name><surname>Johnson</surname> <given-names>N.</given-names></name> <name><surname>K&#x000E4;h&#x000E4;ri</surname> <given-names>A.</given-names></name> <name><surname>Keefe</surname> <given-names>D.</given-names></name> <name><surname>Keenan</surname> <given-names>S.</given-names></name> <name><surname>Kinsella</surname> <given-names>R.</given-names></name> <name><surname>Kokocinski</surname> <given-names>F.</given-names></name> <name><surname>Kulesha</surname> <given-names>E.</given-names></name> <name><surname>Larsson</surname> <given-names>P.</given-names></name> <name><surname>Longden</surname> <given-names>I.</given-names></name> <name><surname>McLaren</surname> <given-names>W.</given-names></name> <name><surname>Overduin</surname> <given-names>B.</given-names></name> <name><surname>Pritchard</surname> <given-names>B.</given-names></name> <name><surname>Riat</surname> <given-names>H. S.</given-names></name> <name><surname>Rios</surname> <given-names>D.</given-names></name> <name><surname>Ritchie</surname> <given-names>G. R. S.</given-names></name> <name><surname>Ruffier</surname> <given-names>M.</given-names></name> <name><surname>Schuster</surname> <given-names>M.</given-names></name> <name><surname>Sobral</surname> <given-names>D.</given-names></name> <name><surname>Spudich</surname> <given-names>G.</given-names></name> <name><surname>Tang</surname> <given-names>Y. A.</given-names></name> <name><surname>Trevanion</surname> <given-names>S.</given-names></name> <name><surname>Vandrovcova</surname> <given-names>J.</given-names></name> <name><surname>Vilella</surname> <given-names>A. J.</given-names></name> <name><surname>White</surname> <given-names>S.</given-names></name> <name><surname>Wilder</surname> <given-names>S. P.</given-names></name> <name><surname>Zadissa</surname> <given-names>A.</given-names></name> <name><surname>Zamora</surname> <given-names>J.</given-names></name> <name><surname>Aken</surname> <given-names>B. L.</given-names></name> <name><surname>Birney</surname> <given-names>E.</given-names></name> <name><surname>Cunningham</surname> <given-names>F.</given-names></name> <name><surname>Dunham</surname> <given-names>I.</given-names></name> <name><surname>Durbin</surname> <given-names>R.</given-names></name> <name><surname>Fern&#x000E1;ndez-Suarez</surname> <given-names>X. M.</given-names></name> <name><surname>Herrero</surname> <given-names>J.</given-names></name> <name><surname>Hubbard</surname> <given-names>T. J. P.</given-names></name> <name><surname>Parker</surname> <given-names>A.</given-names></name> <name><surname>Proctor</surname> <given-names>G.</given-names></name> <name><surname>Vogel</surname> <given-names>J.</given-names></name> <name><surname>Searle</surname> <given-names>S. M. J.</given-names></name></person-group> (<year>2011</year>). <article-title>Ensembl 2011</article-title>. <source>Nucleic Acids Res.</source> <volume>39</volume>, <fpage>D800</fpage>&#x02013;<lpage>D806</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkq1064</pub-id><pub-id pub-id-type="pmid">21045057</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Furlong</surname> <given-names>L. I.</given-names></name> <name><surname>Dach</surname> <given-names>H.</given-names></name> <name><surname>Hofmann-Apitius</surname> <given-names>M.</given-names></name> <name><surname>Sanz</surname> <given-names>F.</given-names></name></person-group> (<year>2008</year>). <article-title>Osirisv1.2: a named entity recognition system for sequence variants of genes in biomedical literature</article-title>. <source>BMC Bioinformatics</source> <volume>9</volume>, <fpage>84</fpage>.<pub-id pub-id-type="doi">10.1186/1471-2105-9-84</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gonz&#x000E1;lez-P&#x000E9;rez</surname> <given-names>A.</given-names></name> <name><surname>L&#x000F3;pez-Bigas</surname> <given-names>N.</given-names></name></person-group> (<year>2011</year>). <article-title>Improving the assessment of the outcome of nonsynonymous snvs with a consensus deleteriousness score, condel</article-title>. <source>Am. J. Hum. Genet.</source> <volume>88</volume>, <fpage>440</fpage>&#x02013;<lpage>449</lpage>.<pub-id pub-id-type="doi">10.1016/j.ajhg.2011.03.004</pub-id><pub-id pub-id-type="pmid">21457909</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Greenman</surname> <given-names>C.</given-names></name> <name><surname>Stephens</surname> <given-names>P.</given-names></name> <name><surname>Smith</surname> <given-names>R.</given-names></name> <name><surname>Dalgliesh</surname> <given-names>G. L.</given-names></name> <name><surname>Hunter</surname> <given-names>C.</given-names></name> <name><surname>Bignell</surname> <given-names>G.</given-names></name> <name><surname>Davies</surname> <given-names>H.</given-names></name> <name><surname>Teague</surname> <given-names>J.</given-names></name> <name><surname>Butler</surname> <given-names>A.</given-names></name> <name><surname>Stevens</surname> <given-names>C.</given-names></name> <name><surname>Edkins</surname> <given-names>S.</given-names></name> <name><surname>O&#x02019;Meara</surname> <given-names>S.</given-names></name> <name><surname>Vastrik</surname> <given-names>I.</given-names></name> <name><surname>Schmidt</surname> <given-names>E. E.</given-names></name> <name><surname>Avis</surname> <given-names>T.</given-names></name> <name><surname>Barthorpe</surname> <given-names>S.</given-names></name> <name><surname>Bhamra</surname> <given-names>G.</given-names></name> <name><surname>Buck</surname> <given-names>G.</given-names></name> <name><surname>Choudhury</surname> <given-names>B.</given-names></name> <name><surname>Clements</surname> <given-names>J.</given-names></name> <name><surname>Cole</surname> <given-names>J.</given-names></name> <name><surname>Dicks</surname> <given-names>E.</given-names></name> <name><surname>Forbes</surname> <given-names>S.</given-names></name> <name><surname>Gray</surname> <given-names>K.</given-names></name> <name><surname>Halliday</surname> <given-names>K.</given-names></name> <name><surname>Harrison</surname> <given-names>R.</given-names></name> <name><surname>Hills</surname> <given-names>K.</given-names></name> <name><surname>Hinton</surname> <given-names>J.</given-names></name> <name><surname>Jenkinson</surname> <given-names>A.</given-names></name> <name><surname>Jones</surname> <given-names>D.</given-names></name> <name><surname>Menzies</surname> <given-names>A.</given-names></name> <name><surname>Mironenko</surname> <given-names>T.</given-names></name> <name><surname>Perry</surname> <given-names>J.</given-names></name> <name><surname>Raine</surname> <given-names>K.</given-names></name> <name><surname>Richardson</surname> <given-names>D.</given-names></name> <name><surname>Shepherd</surname> <given-names>R.</given-names></name> <name><surname>Small</surname> <given-names>A.</given-names></name> <name><surname>Tofts</surname> <given-names>C.</given-names></name> <name><surname>Varian</surname> <given-names>J.</given-names></name> <name><surname>Webb</surname> <given-names>T.</given-names></name> <name><surname>West</surname> <given-names>S.</given-names></name> <name><surname>Widaa</surname> <given-names>S.</given-names></name> <name><surname>Yates</surname> <given-names>A.</given-names></name> <name><surname>Cahill</surname> <given-names>D. P.</given-names></name> <name><surname>Louis</surname> <given-names>D. N.</given-names></name> <name><surname>Goldstraw</surname> <given-names>P.</given-names></name> <name><surname>Nicholson</surname> <given-names>A. G.</given-names></name> <name><surname>Brasseur</surname> <given-names>F.</given-names></name> <name><surname>Looijenga</surname> <given-names>L.</given-names></name> <name><surname>Weber</surname> <given-names>B. L.</given-names></name> <name><surname>Chiew</surname> <given-names>Y.-E.</given-names></name> <name><surname>DeFazio</surname> <given-names>A.</given-names></name> <name><surname>Greaves</surname> <given-names>M. F.</given-names></name> <name><surname>Green</surname> <given-names>A. R.</given-names></name> <name><surname>Campbell</surname> <given-names>P.</given-names></name> <name><surname>Birney</surname> <given-names>E.</given-names></name> <name><surname>Easton</surname> <given-names>D. F.</given-names></name> <name><surname>Chenevix-Trench</surname> <given-names>G.</given-names></name> <name><surname>Tan</surname> <given-names>M.-H.</given-names></name> <name><surname>Khoo</surname> <given-names>S. K.</given-names></name> <name><surname>Teh</surname> <given-names>B. T.</given-names></name> <name><surname>Yuen</surname> <given-names>S. T.</given-names></name> <name><surname>Leung</surname> <given-names>S. Y.</given-names></name> <name><surname>Wooster</surname> <given-names>R.</given-names></name> <name><surname>Futreal</surname> <given-names>P. A.</given-names></name> <name><surname>Stratton</surname> <given-names>M. R.</given-names></name></person-group> (<year>2007</year>). <article-title>Patterns of somatic mutation in human cancer genomes</article-title>. <source>Nature</source> <volume>446</volume>, <fpage>153</fpage>&#x02013;<lpage>158</lpage>.<pub-id pub-id-type="doi">10.1038/nature05610</pub-id><pub-id pub-id-type="pmid">17344846</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hoffmann</surname> <given-names>R.</given-names></name> <name><surname>Valencia</surname> <given-names>A.</given-names></name></person-group> (<year>2005</year>). <article-title>Implementing the ihop concept for navigation of biomedical literature</article-title>. <source>Bioinformatics</source> <volume>21</volume>(<issue>Suppl. 2</issue>), <fpage>ii252</fpage>&#x02013;<lpage>ii258</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/bti571</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Horn</surname> <given-names>F.</given-names></name> <name><surname>Lau</surname> <given-names>A. L.</given-names></name> <name><surname>Cohen</surname> <given-names>F. E.</given-names></name></person-group> (<year>2004</year>). <article-title>Automated extraction of mutation data from the literature: application of MuteXt to g protein-coupled receptors and nuclear hormone receptors</article-title>. <source>Bioinformatics</source> <volume>20</volume>, <fpage>557</fpage>&#x02013;<lpage>568</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btg449</pub-id><pub-id pub-id-type="pmid">14990452</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hunter</surname> <given-names>T.</given-names></name> <name><surname>Plowman</surname> <given-names>G. D.</given-names></name></person-group> (<year>1997</year>). <article-title>The protein kinases of budding yeast: six score and more</article-title>. <source>Trends Biochem. Sci.</source> <volume>22</volume>, <fpage>18</fpage>&#x02013;<lpage>22</lpage>.<pub-id pub-id-type="doi">10.1016/S0968-0004(97)80009-6</pub-id><pub-id pub-id-type="pmid">9020587</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hurst</surname> <given-names>J.</given-names></name> <name><surname>McMillan</surname> <given-names>L.</given-names></name> <name><surname>Porter</surname> <given-names>C.</given-names></name> <name><surname>Allen</surname> <given-names>J.</given-names></name> <name><surname>Fakorede</surname> <given-names>A.</given-names></name> <name><surname>Martin</surname> <given-names>A.</given-names></name></person-group> (<year>2009</year>). <article-title>The SAAPdb web resource: a large-scale structural analysis of mutant proteins</article-title>. <source>Hum. Mutat.</source> <volume>30</volume>, <fpage>616</fpage>&#x02013;<lpage>624</lpage>.<pub-id pub-id-type="doi">10.1002/humu.20898</pub-id><pub-id pub-id-type="pmid">19191322</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Izarzugaza</surname> <given-names>J. M.</given-names></name> <name><surname>Pozo</surname> <given-names>A.</given-names></name> <name><surname>Vazquez</surname> <given-names>M.</given-names></name> <name><surname>Valencia</surname> <given-names>A.</given-names></name></person-group> (<year>2012</year>). <article-title>Prioritization of pathogenic mutations in the protein kinase superfamily</article-title>. <source>BMC Genomics</source> <volume>13</volume>(<issue>Suppl. 4</issue>), <fpage>S3</fpage>.<pub-id pub-id-type="doi">10.1186/1471-2164-13-S2-S3</pub-id><pub-id pub-id-type="pmid">22759651</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kaminker</surname> <given-names>J. S.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Watanabe</surname> <given-names>C.</given-names></name> <name><surname>Zhang</surname> <given-names>Z.</given-names></name></person-group> (<year>2007a</year>). <article-title>CanPredict: a computational tool for predicting cancer-associated missense mutations</article-title>. <source>Nucleic Acids Res.</source> <volume>35</volume>, <fpage>W595</fpage>&#x02013;<lpage>W598</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkl1012</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kaminker</surname> <given-names>J. S.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Waugh</surname> <given-names>A.</given-names></name> <name><surname>Haverty</surname> <given-names>P. M.</given-names></name> <name><surname>Peters</surname> <given-names>B.</given-names></name> <name><surname>Sebisanovic</surname> <given-names>D.</given-names></name> <name><surname>Stinson</surname> <given-names>J.</given-names></name> <name><surname>Forrest</surname> <given-names>W. F.</given-names></name> <name><surname>Bazan</surname> <given-names>J. F.</given-names></name> <name><surname>Seshagiri</surname> <given-names>S.</given-names></name> <name><surname>Zhang</surname> <given-names>Z.</given-names></name></person-group> (<year>2007b</year>). <article-title>Distinguishing cancer-associated missense mutations from common polymorphisms</article-title>. <source>Cancer Res.</source> <volume>67</volume>, <fpage>465</fpage>&#x02013;<lpage>473</lpage>.<pub-id pub-id-type="doi">10.1158/0008-5472.CAN-06-1736</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Karchin</surname> <given-names>R.</given-names></name></person-group> (<year>2009</year>). <article-title>Next generation tools for the annotation of human SNPs</article-title>. <source>Brief Bioinform.</source> <volume>10</volume>, <fpage>35</fpage>&#x02013;<lpage>52</lpage>.<pub-id pub-id-type="doi">10.1093/bib/bbn047</pub-id><pub-id pub-id-type="pmid">19181721</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Karchin</surname> <given-names>R.</given-names></name> <name><surname>Diekhans</surname> <given-names>M.</given-names></name> <name><surname>Kelly</surname> <given-names>L.</given-names></name> <name><surname>Thomas</surname> <given-names>D. J.</given-names></name> <name><surname>Pieper</surname> <given-names>U.</given-names></name> <name><surname>Eswar</surname> <given-names>N.</given-names></name> <name><surname>Haussler</surname> <given-names>D.</given-names></name> <name><surname>Sali</surname> <given-names>A.</given-names></name></person-group> (<year>2005</year>). <article-title>LS-SNP: large-scale annotation of coding non-synonymous SNPs based on multiple information sources</article-title>. <source>Bioinformatics</source> <volume>21</volume>, <fpage>2814</fpage>&#x02013;<lpage>2820</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/bti442</pub-id><pub-id pub-id-type="pmid">15827081</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kinnings</surname> <given-names>S. L.</given-names></name> <name><surname>Jackson</surname> <given-names>R. M.</given-names></name></person-group> (<year>2009</year>). <article-title>Binding site similarity analysis for the functional classification of the protein kinase family</article-title>. <source>J. Chem. Inf. Model</source> <volume>49</volume>, <fpage>318</fpage>&#x02013;<lpage>329</lpage>.<pub-id pub-id-type="doi">10.1021/ci900204y</pub-id><pub-id pub-id-type="pmid">19434833</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Knight</surname> <given-names>J. D. R.</given-names></name> <name><surname>Qian</surname> <given-names>B.</given-names></name> <name><surname>Baker</surname> <given-names>D.</given-names></name> <name><surname>Kothary</surname> <given-names>R.</given-names></name></person-group> (<year>2007</year>). <article-title>Conservation, variability and the modeling of active protein kinases</article-title>. <source>PLoS ONE</source> <volume>2</volume>, <fpage>e982</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0000982</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krallinger</surname> <given-names>M.</given-names></name> <name><surname>Izarzugaza</surname> <given-names>J. M. G.</given-names></name> <name><surname>Rodriguez-Penagos</surname> <given-names>C.</given-names></name> <name><surname>Valencia</surname> <given-names>A.</given-names></name></person-group> (<year>2009</year>). <article-title>Extraction of human kinase mutations from literature, databases and genotyping studies</article-title>. <source>BMC Bioinformatics</source> <volume>10</volume>(<issue>Suppl. 8</issue>), <fpage>S1</fpage>.<pub-id pub-id-type="doi">10.1186/1471-2105-10-S12-S1</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krallinger</surname> <given-names>M.</given-names></name> <name><surname>Leitner</surname> <given-names>F.</given-names></name> <name><surname>Rodriguez-Penagos</surname> <given-names>C.</given-names></name> <name><surname>Valencia</surname> <given-names>A.</given-names></name></person-group> (<year>2008a</year>). <article-title>Overview of the protein-protein interaction annotation extraction task of BioCreative ii</article-title>. <source>Genome Biol.</source> <volume>9</volume>(<issue>Suppl. 2</issue>), <fpage>S4</fpage>.<pub-id pub-id-type="doi">10.1186/gb-2008-9-s2-s4</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krallinger</surname> <given-names>M.</given-names></name> <name><surname>Morgan</surname> <given-names>A.</given-names></name> <name><surname>Smith</surname> <given-names>L.</given-names></name> <name><surname>Leitner</surname> <given-names>F.</given-names></name> <name><surname>Tanabe</surname> <given-names>L.</given-names></name> <name><surname>Wilbur</surname> <given-names>J.</given-names></name> <name><surname>Hirschman</surname> <given-names>L.</given-names></name> <name><surname>Valencia</surname> <given-names>A.</given-names></name></person-group> (<year>2008b</year>). <article-title>Evaluation of text-mining systems for biology: overview of the Second BioCreative community challenge</article-title>. <source>Genome Biol.</source> <volume>9</volume>(<issue>Suppl. 2</issue>), <fpage>S1</fpage>.<pub-id pub-id-type="doi">10.1186/gb-2008-9-s1-s1</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krallinger</surname> <given-names>M.</given-names></name> <name><surname>Valencia</surname> <given-names>A.</given-names></name> <name><surname>Hirschman</surname> <given-names>L.</given-names></name></person-group> (<year>2008c</year>). <article-title>Linking genes to literature: text mining, information extraction, and retrieval applications for biology</article-title>. <source>Genome Biol.</source> <volume>9</volume>(<issue>Suppl. 2</issue>), <fpage>S8</fpage>.<pub-id pub-id-type="doi">10.1186/gb-2008-9-1-r8</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krallinger</surname> <given-names>M.</given-names></name> <name><surname>Leitner</surname> <given-names>F.</given-names></name> <name><surname>Valencia</surname> <given-names>A.</given-names></name></person-group> (<year>2010</year>). <article-title>Analysis of biological processes and diseases using text mining approaches</article-title>. <source>Methods Mol. Biol.</source> <volume>593</volume>, <fpage>341</fpage>&#x02013;<lpage>382</lpage>.<pub-id pub-id-type="doi">10.1007/978-1-60327-194-3_16</pub-id><pub-id pub-id-type="pmid">19957157</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krishnan</surname> <given-names>V. G.</given-names></name> <name><surname>Westhead</surname> <given-names>D. R.</given-names></name></person-group> (<year>2003</year>). <article-title>A comparative study of machine-learning methods to predict the effects of single nucleotide polymorphisms on protein function</article-title>. <source>Bioinformatics</source> <volume>19</volume>, <fpage>2199</fpage>&#x02013;<lpage>2209</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btg297</pub-id><pub-id pub-id-type="pmid">14630648</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Laskowski</surname> <given-names>R. A.</given-names></name> <name><surname>Chistyakov</surname> <given-names>V. V.</given-names></name> <name><surname>Thornton</surname> <given-names>J. M.</given-names></name></person-group> (<year>2005</year>). <article-title>PDBsum more: new summaries and analyses of the known 3D structures of proteins and nucleic acids</article-title>. <source>Nucleic Acids Res.</source> <volume>33</volume>, <fpage>D266</fpage>&#x02013;<lpage>D268</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gki414</pub-id><pub-id pub-id-type="pmid">15608193</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>L. C.</given-names></name> <name><surname>Horn</surname> <given-names>F.</given-names></name> <name><surname>Cohen</surname> <given-names>F. E.</given-names></name></person-group> (<year>2007</year>). <article-title>Automatic extraction of protein point mutations using a graph bigram association</article-title>. <source>PLoS Comput. Biol.</source> <volume>3</volume>, <fpage>e16</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pcbi.0030067</pub-id><pub-id pub-id-type="pmid">17274683</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>P. H.</given-names></name> <name><surname>Shatkay</surname> <given-names>H.</given-names></name></person-group> (<year>2008</year>). <article-title>F-SNP: computationally predicted functional SNPs for disease association studies</article-title>. <source>Nucleic Acids Res.</source> <volume>36</volume>, <fpage>D820</fpage>&#x02013;<lpage>D824</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkn540</pub-id><pub-id pub-id-type="pmid">17986460</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Leitner</surname> <given-names>F.</given-names></name> <name><surname>Chatr-aryamontri</surname> <given-names>A.</given-names></name> <name><surname>Mardis</surname> <given-names>S. A.</given-names></name> <name><surname>Ceol</surname> <given-names>A.</given-names></name> <name><surname>Krallinger</surname> <given-names>M.</given-names></name> <name><surname>Licata</surname> <given-names>L.</given-names></name> <name><surname>Hirschman</surname> <given-names>L.</given-names></name> <name><surname>Cesareni</surname> <given-names>G.</given-names></name> <name><surname>Valencia</surname> <given-names>A.</given-names></name></person-group> (<year>2010</year>). <article-title>The FEBS Letters/BioCreative ii.5 experiment: making biological information accessible</article-title>. <source>Nat. Biotechnol.</source> <volume>28</volume>, <fpage>897</fpage>&#x02013;<lpage>899</lpage>.<pub-id pub-id-type="doi">10.1038/nbt0910-897</pub-id><pub-id pub-id-type="pmid">20829821</pub-id></citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>L&#x000F3;pez</surname> <given-names>G.</given-names></name> <name><surname>Valencia</surname> <given-names>A.</given-names></name> <name><surname>Tress</surname> <given-names>M. L.</given-names></name></person-group> (<year>2007</year>). <article-title>FireDB &#x02013; a database of functionally important residues from proteins of known structure</article-title>. <source>Nucleic Acids Res.</source> <volume>35</volume>, <fpage>D219</fpage>&#x02013;<lpage>D223</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkl897</pub-id><pub-id pub-id-type="pmid">17132832</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Manning</surname> <given-names>G.</given-names></name></person-group> (<year>2005</year>). <article-title>Genomic overview of protein kinases</article-title>. <source>WormBook</source> <volume>13</volume>, <fpage>1</fpage>&#x02013;<lpage>19</lpage>.</citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Manning</surname> <given-names>G.</given-names></name> <name><surname>Plowman</surname> <given-names>G. D.</given-names></name> <name><surname>Hunter</surname> <given-names>T.</given-names></name> <name><surname>Sudarsanam</surname> <given-names>S.</given-names></name></person-group> (<year>2002a</year>). <article-title>Evolution of protein kinase signaling from yeast to man</article-title>. <source>Trends Biochem. Sci.</source> <volume>27</volume>, <fpage>514</fpage>&#x02013;<lpage>520</lpage>.<pub-id pub-id-type="doi">10.1016/S0968-0004(02)02179-5</pub-id></citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Manning</surname> <given-names>G.</given-names></name> <name><surname>Whyte</surname> <given-names>D. B.</given-names></name> <name><surname>Martinez</surname> <given-names>R.</given-names></name> <name><surname>Hunter</surname> <given-names>T.</given-names></name> <name><surname>Sudarsanam</surname> <given-names>S.</given-names></name></person-group> (<year>2002b</year>). <article-title>The protein kinase complement of the human genome</article-title>. <source>Science</source> <volume>298</volume>, <fpage>1912</fpage>&#x02013;<lpage>1934</lpage>.<pub-id pub-id-type="doi">10.1126/science.1075762</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McDonald</surname> <given-names>R. T.</given-names></name> <name><surname>Winters</surname> <given-names>R. S.</given-names></name> <name><surname>Mandel</surname> <given-names>M.</given-names></name> <name><surname>Jin</surname> <given-names>Y.</given-names></name> <name><surname>White</surname> <given-names>P. S.</given-names></name> <name><surname>Pereira</surname> <given-names>F.</given-names></name></person-group> (<year>2004</year>). <article-title>An entity tagger for recognizing acquired genomic variations in cancer literature</article-title>. <source>Bioinformatics</source> <volume>20</volume>, <fpage>3249</fpage>&#x02013;<lpage>3251</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/bth350</pub-id><pub-id pub-id-type="pmid">15180929</pub-id></citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Miranda-Saavedra</surname> <given-names>D.</given-names></name> <name><surname>Barton</surname> <given-names>G. J.</given-names></name></person-group> (<year>2007</year>). <article-title>Classification and functional annotation of eukaryotic protein kinases</article-title>. <source>Proteins</source> <volume>68</volume>, <fpage>893</fpage>&#x02013;<lpage>914</lpage>.<pub-id pub-id-type="doi">10.1002/prot.21444</pub-id><pub-id pub-id-type="pmid">17557329</pub-id></citation></ref>
<ref id="B55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Morgan</surname> <given-names>A. A.</given-names></name> <name><surname>Lu</surname> <given-names>Z.</given-names></name> <name><surname>Wang</surname> <given-names>X.</given-names></name> <name><surname>Cohen</surname> <given-names>A. M.</given-names></name> <name><surname>Fluck</surname> <given-names>J.</given-names></name> <name><surname>Ruch</surname> <given-names>P.</given-names></name> <name><surname>Divoli</surname> <given-names>A.</given-names></name> <name><surname>Fundel</surname> <given-names>K.</given-names></name> <name><surname>Leaman</surname> <given-names>R.</given-names></name> <name><surname>Hakenberg</surname> <given-names>J.</given-names></name> <name><surname>Sun</surname> <given-names>C.</given-names></name> <name><surname>hui Liu</surname> <given-names>H.</given-names></name> <name><surname>Torres</surname> <given-names>R.</given-names></name> <name><surname>Krauthammer</surname> <given-names>M.</given-names></name> <name><surname>Lau</surname> <given-names>W. W.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>Hsu</surname> <given-names>C.-N.</given-names></name> <name><surname>Schuemie</surname> <given-names>M.</given-names></name> <name><surname>Cohen</surname> <given-names>K. B.</given-names></name> <name><surname>Hirschman</surname> <given-names>L.</given-names></name></person-group> (<year>2008</year>). <article-title>Overview of BioCreative ii gene normalization</article-title>. <source>Genome Biol.</source> <volume>9</volume>(<issue>Suppl. 2</issue>), <fpage>S3</fpage>.<pub-id pub-id-type="doi">10.1186/gb-2008-9-1-r3</pub-id><pub-id pub-id-type="pmid">18834494</pub-id></citation></ref>
<ref id="B56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Network</surname> <given-names>C. G. A. R.</given-names></name></person-group> (<year>2011</year>). <article-title>Integrated genomic analyses of ovarian carcinoma</article-title>. <source>Nature</source> <volume>474</volume>, <fpage>609</fpage>&#x02013;<lpage>615</lpage>.<pub-id pub-id-type="doi">10.1038/nature10166</pub-id><pub-id pub-id-type="pmid">21720365</pub-id></citation></ref>
<ref id="B57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ng</surname> <given-names>P. C.</given-names></name> <name><surname>Henikoff</surname> <given-names>S.</given-names></name></person-group> (<year>2001</year>). <article-title>Predicting deleterious amino acid substitutions</article-title>. <source>Genome Res.</source> <volume>11</volume>, <fpage>863</fpage>&#x02013;<lpage>874</lpage>.<pub-id pub-id-type="doi">10.1101/gr.176601</pub-id><pub-id pub-id-type="pmid">11337480</pub-id></citation></ref>
<ref id="B58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ortutay</surname> <given-names>C.</given-names></name> <name><surname>V&#x000E4;liaho</surname> <given-names>J.</given-names></name> <name><surname>Stenberg</surname> <given-names>K.</given-names></name> <name><surname>Vihinen</surname> <given-names>M.</given-names></name></person-group> (<year>2005</year>). <article-title>KinMutBase: a registry of disease-causing mutations in protein kinase domains</article-title>. <source>Hum. Mutat.</source> <volume>25</volume>, <fpage>435</fpage>&#x02013;<lpage>442</lpage>.<pub-id pub-id-type="doi">10.1002/humu.20166</pub-id><pub-id pub-id-type="pmid">15832311</pub-id></citation></ref>
<ref id="B59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pieper</surname> <given-names>U.</given-names></name> <name><surname>Eswar</surname> <given-names>N.</given-names></name> <name><surname>Davis</surname> <given-names>F. P.</given-names></name> <name><surname>Braberg</surname> <given-names>H.</given-names></name> <name><surname>Madhusudhan</surname> <given-names>M. S.</given-names></name> <name><surname>Rossi</surname> <given-names>A.</given-names></name> <name><surname>Marti-Renom</surname> <given-names>M.</given-names></name> <name><surname>Karchin</surname> <given-names>R.</given-names></name> <name><surname>Webb</surname> <given-names>B. M.</given-names></name> <name><surname>Eramian</surname> <given-names>D.</given-names></name> <name><surname>Shen</surname> <given-names>M.-Y.</given-names></name> <name><surname>Kelly</surname> <given-names>L.</given-names></name> <name><surname>Melo</surname> <given-names>F.</given-names></name> <name><surname>Sali</surname> <given-names>A.</given-names></name></person-group> (<year>2006</year>). <article-title>MODBASE: a database of annotated comparative protein structure models and associated resources</article-title>. <source>Nucleic Acids Res.</source> <volume>34</volume>, <fpage>D291</fpage>&#x02013;<lpage>D295</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkj059</pub-id><pub-id pub-id-type="pmid">16381869</pub-id></citation></ref>
<ref id="B60"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ramensky</surname> <given-names>V.</given-names></name> <name><surname>Bork</surname> <given-names>P.</given-names></name> <name><surname>Sunyaev</surname> <given-names>S.</given-names></name></person-group> (<year>2002</year>). <article-title>Human non-synonymous SNPs: server and survey</article-title>. <source>Nucleic Acids Res.</source> <volume>30</volume>, <fpage>3894</fpage>&#x02013;<lpage>3900</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkf493</pub-id><pub-id pub-id-type="pmid">12202775</pub-id></citation></ref>
<ref id="B61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rebholz-Schuhmann</surname> <given-names>D.</given-names></name> <name><surname>Marcel</surname> <given-names>S.</given-names></name> <name><surname>Albert</surname> <given-names>S.</given-names></name> <name><surname>Tolle</surname> <given-names>R.</given-names></name> <name><surname>Casari</surname> <given-names>G.</given-names></name> <name><surname>Kirsch</surname> <given-names>H.</given-names></name></person-group> (<year>2004</year>). <article-title>Automatic extraction of mutations from Medline and cross-validation with OMIM</article-title>. <source>Nucleic Acids Res.</source> <volume>32</volume>, <fpage>135</fpage>&#x02013;<lpage>142</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkh162</pub-id><pub-id pub-id-type="pmid">14704350</pub-id></citation></ref>
<ref id="B62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Reumers</surname> <given-names>J.</given-names></name> <name><surname>Maurer-Stroh</surname> <given-names>S.</given-names></name> <name><surname>Schymkowitz</surname> <given-names>J.</given-names></name> <name><surname>Rousseau</surname> <given-names>F.</given-names></name></person-group> (<year>2006</year>). <article-title>SNPeffect v2.0: a new step in investigating the molecular phenotypic effects of human non-synonymous SNPs</article-title>. <source>Bioinformatics</source> <volume>22</volume>, <fpage>2183</fpage>&#x02013;<lpage>2185</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btl348</pub-id><pub-id pub-id-type="pmid">16809394</pub-id></citation></ref>
<ref id="B63"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Reva</surname> <given-names>B.</given-names></name> <name><surname>Antipin</surname> <given-names>Y.</given-names></name> <name><surname>Sander</surname> <given-names>C.</given-names></name></person-group> (<year>2011</year>). <article-title>Predicting the functional impact of protein mutations: application to cancer genomics</article-title>. <source>Nucleic Acids Res</source>. <volume>39</volume>, <fpage>e118</fpage>.<pub-id pub-id-type="doi">10.1093/nar/gkr407</pub-id><pub-id pub-id-type="pmid">21727090</pub-id></citation></ref>
<ref id="B64"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Richardson</surname> <given-names>C. J.</given-names></name> <name><surname>Gao</surname> <given-names>Q.</given-names></name> <name><surname>Mitsopoulous</surname> <given-names>C.</given-names></name> <name><surname>Zvelebil</surname> <given-names>M.</given-names></name> <name><surname>Pearl</surname> <given-names>L. H.</given-names></name> <name><surname>Pearl</surname> <given-names>F. M. G.</given-names></name></person-group> (<year>2009</year>). <article-title>MoKCa database&#x02013;mutations of kinases in cancer</article-title>. <source>Nucleic Acids Res.</source> <volume>37</volume>, <fpage>D824</fpage>&#x02013;<lpage>D831</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkn832</pub-id><pub-id pub-id-type="pmid">18986996</pub-id></citation></ref>
<ref id="B65"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schee</surname> <given-names>E. D.</given-names></name> <name><surname>Bourne</surname> <given-names>P. E.</given-names></name></person-group> (<year>2005</year>). <article-title>Structural evolution of the protein kinase-like superfamily</article-title>. <source>PLoS Comput. Biol.</source> <volume>1</volume>, <fpage>e49</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pcbi.0010049</pub-id></citation></ref>
<ref id="B66"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shchemelinin</surname> <given-names>I.</given-names></name> <name><surname>Sefc</surname> <given-names>L.</given-names></name> <name><surname>Necas</surname> <given-names>E.</given-names></name></person-group> (<year>2006</year>). <article-title>Protein kinases, their function and implication in cancer and other diseases</article-title>. <source>Folia Biol. (Praha)</source> <volume>52</volume>, <fpage>81</fpage>&#x02013;<lpage>100</lpage>.<pub-id pub-id-type="pmid">17089919</pub-id></citation></ref>
<ref id="B67"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sherry</surname> <given-names>S. T.</given-names></name> <name><surname>Ward</surname> <given-names>M. H.</given-names></name> <name><surname>Kholodov</surname> <given-names>M.</given-names></name> <name><surname>Baker</surname> <given-names>J.</given-names></name> <name><surname>Phan</surname> <given-names>L.</given-names></name> <name><surname>Smigielski</surname> <given-names>E. M.</given-names></name> <name><surname>Sirotkin</surname> <given-names>K.</given-names></name></person-group> (<year>2001</year>). <article-title>dbSNP: the NCBI database of genetic variation</article-title>. <source>Nucleic Acids Res.</source> <volume>29</volume>, <fpage>308</fpage>&#x02013;<lpage>311</lpage>.<pub-id pub-id-type="doi">10.1093/nar/29.1.308</pub-id><pub-id pub-id-type="pmid">11125122</pub-id></citation></ref>
<ref id="B68"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sj&#x000F6;blom</surname> <given-names>T.</given-names></name> <name><surname>Jones</surname> <given-names>S.</given-names></name> <name><surname>Wood</surname> <given-names>L. D.</given-names></name> <name><surname>Parsons</surname> <given-names>D. W.</given-names></name> <name><surname>Lin</surname> <given-names>J.</given-names></name> <name><surname>Barber</surname> <given-names>T. D.</given-names></name> <name><surname>Mandelker</surname> <given-names>D.</given-names></name> <name><surname>Leary</surname> <given-names>R. J.</given-names></name> <name><surname>Ptak</surname> <given-names>J.</given-names></name> <name><surname>Silliman</surname> <given-names>N.</given-names></name> <name><surname>Szabo</surname> <given-names>S.</given-names></name> <name><surname>Buckhaults</surname> <given-names>P.</given-names></name> <name><surname>Farrell</surname> <given-names>C.</given-names></name> <name><surname>Meeh</surname> <given-names>P.</given-names></name> <name><surname>Markowitz</surname> <given-names>S. D.</given-names></name> <name><surname>Willis</surname> <given-names>J.</given-names></name> <name><surname>Dawson</surname> <given-names>D.</given-names></name> <name><surname>Willson</surname> <given-names>J. K. V.</given-names></name> <name><surname>Gazdar</surname> <given-names>A. F.</given-names></name> <name><surname>Hartigan</surname> <given-names>J.</given-names></name> <name><surname>Wu</surname> <given-names>L.</given-names></name> <name><surname>Liu</surname> <given-names>C.</given-names></name> <name><surname>Parmigiani</surname> <given-names>G.</given-names></name> <name><surname>Park</surname> <given-names>B. H.</given-names></name> <name><surname>Bachman</surname> <given-names>K. E.</given-names></name> <name><surname>Papadopoulos</surname> <given-names>N.</given-names></name> <name><surname>Vogelstein</surname> <given-names>B.</given-names></name> <name><surname>Kinzler</surname> <given-names>K. W.</given-names></name> <name><surname>Velculescu</surname> <given-names>V. E.</given-names></name></person-group> (<year>2006</year>). <article-title>The consensus coding sequences of human breast and colorectal cancers</article-title>. <source>Science</source> <volume>314</volume>, <fpage>268</fpage>&#x02013;<lpage>274</lpage>.<pub-id pub-id-type="doi">10.1126/science.1133427</pub-id><pub-id pub-id-type="pmid">16959974</pub-id></citation></ref>
<ref id="B69"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stitziel</surname> <given-names>N. O.</given-names></name> <name><surname>Binkowski</surname> <given-names>T. A.</given-names></name> <name><surname>Tseng</surname> <given-names>Y. Y.</given-names></name> <name><surname>Kasif</surname> <given-names>S.</given-names></name> <name><surname>Liang</surname> <given-names>J.</given-names></name></person-group> (<year>2004</year>). <article-title>topoSNP: a topographic database of non-synonymous single nucleotide polymorphisms with and without known disease association</article-title>. <source>Nucleic Acids Res.</source> <volume>32</volume>, <fpage>D520</fpage>&#x02013;<lpage>D522</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkh104</pub-id><pub-id pub-id-type="pmid">14681472</pub-id></citation></ref>
<ref id="B70"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stoyanovich</surname> <given-names>J.</given-names></name> <name><surname>Pe&#x02019;er</surname> <given-names>I.</given-names></name></person-group> (<year>2008</year>). <article-title>MutaGeneSys: estimating individual disease susceptibility based on genome-wide SNP array data</article-title>. <source>Bioinformatics</source> <volume>24</volume>, <fpage>440</fpage>&#x02013;<lpage>442</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btm587</pub-id><pub-id pub-id-type="pmid">18048395</pub-id></citation></ref>
<ref id="B71"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tanramluk</surname> <given-names>D.</given-names></name> <name><surname>Schreyer</surname> <given-names>A.</given-names></name> <name><surname>Pitt</surname> <given-names>W. R.</given-names></name> <name><surname>Blundell</surname> <given-names>T. L.</given-names></name></person-group> (<year>2009</year>). <article-title>On the origins of enzyme inhibitor selectivity and promiscuity: a case study of protein kinase binding to staurosporine</article-title>. <source>Chem. Biol. Drug Des.</source> <volume>74</volume>, <fpage>16</fpage>&#x02013;<lpage>24</lpage>.<pub-id pub-id-type="doi">10.1111/j.1747-0285.2009.00832.x</pub-id><pub-id pub-id-type="pmid">19519740</pub-id></citation></ref>
<ref id="B72"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thomas</surname> <given-names>P. D.</given-names></name> <name><surname>Campbell</surname> <given-names>M. J.</given-names></name> <name><surname>Kejariwal</surname> <given-names>A.</given-names></name> <name><surname>Mi</surname> <given-names>H.</given-names></name> <name><surname>Karlak</surname> <given-names>B.</given-names></name> <name><surname>Daverman</surname> <given-names>R.</given-names></name> <name><surname>Diemer</surname> <given-names>K.</given-names></name> <name><surname>Muruganujan</surname> <given-names>A.</given-names></name> <name><surname>Narechania</surname> <given-names>A.</given-names></name></person-group> (<year>2003</year>). <article-title>Panther: a library of protein families and subfamilies indexed by function</article-title>. <source>Genome Res.</source> <volume>13</volume>, <fpage>2129</fpage>&#x02013;<lpage>2141</lpage>.<pub-id pub-id-type="doi">10.1101/gr.772403</pub-id><pub-id pub-id-type="pmid">12952881</pub-id></citation></ref>
<ref id="B73"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Torkamani</surname> <given-names>A.</given-names></name> <name><surname>Schork</surname> <given-names>N. J.</given-names></name></person-group> (<year>2007</year>). <article-title>Accurate prediction of deleterious protein kinase polymorphisms</article-title>. <source>Bioinformatics</source> <volume>23</volume>, <fpage>2918</fpage>&#x02013;<lpage>2925</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btm437</pub-id><pub-id pub-id-type="pmid">17855419</pub-id></citation></ref>
<ref id="B74"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ubersax</surname> <given-names>J. A.</given-names></name> <name><surname>Woodbury</surname> <given-names>E. L.</given-names></name> <name><surname>Quang</surname> <given-names>P. N.</given-names></name> <name><surname>Paraz</surname> <given-names>M.</given-names></name> <name><surname>Blethrow</surname> <given-names>J. D.</given-names></name> <name><surname>Shah</surname> <given-names>K.</given-names></name> <name><surname>Shokat</surname> <given-names>K. M.</given-names></name> <name><surname>Morgan</surname> <given-names>D. O.</given-names></name></person-group> (<year>2003</year>). <article-title>Targets of the cyclin-dependent kinase Cdk1</article-title>. <source>Nature</source> <volume>425</volume>, <fpage>859</fpage>&#x02013;<lpage>864</lpage>.<pub-id pub-id-type="doi">10.1038/nature02062</pub-id><pub-id pub-id-type="pmid">14574415</pub-id></citation></ref>
<ref id="B75"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Uzun</surname> <given-names>A.</given-names></name> <name><surname>Leslin</surname> <given-names>C. M.</given-names></name> <name><surname>Abyzov</surname> <given-names>A.</given-names></name> <name><surname>Ilyin</surname> <given-names>V.</given-names></name></person-group> (<year>2007</year>). <article-title>Structure SNP (StSNP): a web server for mapping and modeling nsSNPs on protein structures with linkage to metabolic pathways</article-title>. <source>Nucleic Acids Res.</source> <volume>35</volume>, <fpage>W384</fpage>&#x02013;<lpage>W392</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkm232</pub-id><pub-id pub-id-type="pmid">17537826</pub-id></citation></ref>
<ref id="B76"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wainreb</surname> <given-names>G.</given-names></name> <name><surname>Ashkenazy</surname> <given-names>H.</given-names></name> <name><surname>Bromberg</surname> <given-names>Y.</given-names></name> <name><surname>Starovolsky-Shitrit</surname> <given-names>A.</given-names></name> <name><surname>Haliloglu</surname> <given-names>T.</given-names></name> <name><surname>Ruppin</surname> <given-names>E.</given-names></name> <name><surname>Avraham</surname> <given-names>K. B.</given-names></name> <name><surname>Rost</surname> <given-names>B.</given-names></name> <name><surname>Ben-Tal</surname> <given-names>N.</given-names></name></person-group> (<year>2010</year>). <article-title>Mud: an interactive web server for the prediction of non-neutral substitutions using protein structural data</article-title>. <source>Nucleic Acids Res.</source> <volume>38</volume>(Suppl.), <fpage>W523</fpage>&#x02013;<lpage>W528</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkq1208</pub-id><pub-id pub-id-type="pmid">20542913</pub-id></citation></ref>
<ref id="B77"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>J. Y.</given-names></name></person-group> (<year>1998</year>). <article-title>Protein kinases entering the information age</article-title>. <source>J. Biomed. Sci.</source> <volume>5</volume>, <fpage>73</fpage>.<pub-id pub-id-type="doi">10.1159/000025314</pub-id><pub-id pub-id-type="pmid">9662064</pub-id></citation></ref>
<ref id="B78"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>Z.</given-names></name> <name><surname>Moult</surname> <given-names>J.</given-names></name></person-group> (<year>2001</year>). <article-title>SNPs, protein structure, and disease</article-title>. <source>Hum. Mutat.</source> <volume>17</volume>, <fpage>263</fpage>&#x02013;<lpage>270</lpage>.<pub-id pub-id-type="doi">10.1002/humu.22</pub-id><pub-id pub-id-type="pmid">11295823</pub-id></citation></ref>
<ref id="B79"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wong</surname> <given-names>W. C.</given-names></name> <name><surname>Kim</surname> <given-names>D.</given-names></name> <name><surname>Carter</surname> <given-names>H.</given-names></name> <name><surname>Diekhans</surname> <given-names>M.</given-names></name> <name><surname>Ryan</surname> <given-names>M. C.</given-names></name> <name><surname>Karchin</surname> <given-names>R.</given-names></name></person-group> (<year>2011</year>). <article-title>CHASM and SNVBox: toolkit for detecting biologically important single nucleotide mutations in cancer</article-title>. <source>Bioinformatics</source> <volume>27</volume>, <fpage>2147</fpage>&#x02013;<lpage>2148</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btr338</pub-id><pub-id pub-id-type="pmid">21685053</pub-id></citation></ref>
<ref id="B80"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wood</surname> <given-names>L. D.</given-names></name> <name><surname>Parsons</surname> <given-names>D. W.</given-names></name> <name><surname>Jones</surname> <given-names>S.</given-names></name> <name><surname>Lin</surname> <given-names>J.</given-names></name> <name><surname>Sj&#x000F6;blom</surname> <given-names>T.</given-names></name> <name><surname>Leary</surname> <given-names>R. J.</given-names></name> <name><surname>Shen</surname> <given-names>D.</given-names></name> <name><surname>Boca</surname> <given-names>S. M.</given-names></name> <name><surname>Barber</surname> <given-names>T. D.</given-names></name> <name><surname>Ptak</surname> <given-names>J.</given-names></name> <name><surname>Silliman</surname> <given-names>N.</given-names></name> <name><surname>Szabo</surname> <given-names>S.</given-names></name> <name><surname>Dezso</surname> <given-names>Z.</given-names></name> <name><surname>Ustyanksky</surname> <given-names>V.</given-names></name> <name><surname>Nikolskaya</surname> <given-names>T.</given-names></name> <name><surname>Nikolsky</surname> <given-names>Y.</given-names></name> <name><surname>Karchin</surname> <given-names>R.</given-names></name> <name><surname>Wilson</surname> <given-names>P. A.</given-names></name> <name><surname>Kaminker</surname> <given-names>J. S.</given-names></name> <name><surname>Zhang</surname> <given-names>Z.</given-names></name> <name><surname>Croshaw</surname> <given-names>R.</given-names></name> <name><surname>Willis</surname> <given-names>J.</given-names></name> <name><surname>Dawson</surname> <given-names>D.</given-names></name> <name><surname>Shipitsin</surname> <given-names>M.</given-names></name> <name><surname>Willson</surname> <given-names>J. K. V.</given-names></name> <name><surname>Sukumar</surname> <given-names>S.</given-names></name> <name><surname>Polyak</surname> <given-names>K.</given-names></name> <name><surname>Park</surname> <given-names>B. H.</given-names></name> <name><surname>Pethiyagoda</surname> <given-names>C. L.</given-names></name> <name><surname>Pant</surname> <given-names>P. V. K.</given-names></name> <name><surname>Ballinger</surname> <given-names>D. G.</given-names></name> <name><surname>Sparks</surname> <given-names>A. B.</given-names></name> <name><surname>Hartigan</surname> <given-names>J.</given-names></name> <name><surname>Smith</surname> <given-names>D. R.</given-names></name> <name><surname>Suh</surname> <given-names>E.</given-names></name> <name><surname>Papadopoulos</surname> <given-names>N.</given-names></name> <name><surname>Buckhaults</surname> <given-names>P.</given-names></name> <name><surname>Markowitz</surname> <given-names>S. D.</given-names></name> <name><surname>Parmigiani</surname> <given-names>G.</given-names></name> <name><surname>Kinzler</surname> <given-names>K. W.</given-names></name> <name><surname>Velculescu</surname> <given-names>V. E.</given-names></name> <name><surname>Vogelstein</surname> <given-names>B.</given-names></name></person-group> (<year>2007</year>). <article-title>The genomic landscapes of human breast and colorectal cancers</article-title>. <source>Science</source> <volume>318</volume>, <fpage>1108</fpage>&#x02013;<lpage>1113</lpage>.<pub-id pub-id-type="doi">10.1126/science.1145720</pub-id><pub-id pub-id-type="pmid">17932254</pub-id></citation></ref>
<ref id="B81"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yip</surname> <given-names>Y. L.</given-names></name> <name><surname>Famiglietti</surname> <given-names>M.</given-names></name> <name><surname>Gos</surname> <given-names>A.</given-names></name> <name><surname>Duek</surname> <given-names>P. D.</given-names></name> <name><surname>David</surname> <given-names>F. P. A.</given-names></name> <name><surname>Gateau</surname> <given-names>A.</given-names></name> <name><surname>Bairoch</surname> <given-names>A.</given-names></name></person-group> (<year>2008</year>). <article-title>Annotating single amino acid polymorphisms in the UniProt/Swiss-Prot knowledgebase</article-title>. <source>Hum. Mutat.</source> <volume>29</volume>, <fpage>361</fpage>&#x02013;<lpage>366</lpage>.<pub-id pub-id-type="doi">10.1002/humu.20671</pub-id><pub-id pub-id-type="pmid">18175334</pub-id></citation></ref>
<ref id="B82"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yip</surname> <given-names>Y. L.</given-names></name> <name><surname>Lachenal</surname> <given-names>N.</given-names></name> <name><surname>Pillet</surname> <given-names>V.</given-names></name> <name><surname>Veuthey</surname> <given-names>A.-L.</given-names></name></person-group> (<year>2007</year>). <article-title>Retrieving mutation-specific information for human proteins in UniProt/Swiss-Prot knowledgebase</article-title>. <source>J. Bioinform. Comput. Biol.</source> <volume>5</volume>, <fpage>1215</fpage>&#x02013;<lpage>1231</lpage>.<pub-id pub-id-type="doi">10.1142/S021972000700320X</pub-id><pub-id pub-id-type="pmid">18172926</pub-id></citation></ref>
<ref id="B83"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yue</surname> <given-names>P.</given-names></name> <name><surname>Li</surname> <given-names>Z.</given-names></name> <name><surname>Moult</surname> <given-names>J.</given-names></name></person-group> (<year>2005</year>). <article-title>Loss of protein structure stability as a major causative factor in monogenic disease</article-title>. <source>J. Mol. Biol.</source> <volume>353</volume>, <fpage>459</fpage>&#x02013;<lpage>473</lpage>.<pub-id pub-id-type="doi">10.1016/j.jmb.2005.08.020</pub-id><pub-id pub-id-type="pmid">16169011</pub-id></citation></ref>
<ref id="B84"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yue</surname> <given-names>P.</given-names></name> <name><surname>Melamud</surname> <given-names>E.</given-names></name> <name><surname>Moult</surname> <given-names>J.</given-names></name></person-group> (<year>2006</year>). <article-title>SNPs3D: candidate gene and SNP selection for association studies</article-title>. <source>BMC Bioinformatics</source> <volume>7</volume>, <fpage>166</fpage>.<pub-id pub-id-type="doi">10.1186/1471-2105-7-166</pub-id></citation></ref>
</ref-list>
<notes>
<sec>
<title>Research Topic:</title>
<p><ext-link ext-link-type="uri" xlink:href="http://www.frontiersin.org/Systems_Biology/researchtopics/From_structural_to_molecular_s/513">From structural to molecular systems biology: experimental and computational approaches to unravel mechanisms of kinase activity regulation in cancer and neurodegeneration</ext-link></p>
</sec>
<sec>
<title>Topic Editor:</title>
<p><ext-link ext-link-type="uri" xlink:href="http://community.frontiersin.org/people/MatteoBarberis/30688">Matteo Barberis</ext-link>, Humboldt University Berlin, Germany; Max Planck Institute for Molecular Genetics, Berlin, Germany</p>
</sec>
</notes>
<fn-group>
<fn id="fn1"><p><sup>1</sup><uri xlink:href="http://genomeinterpretation.org">http://genomeinterpretation.org</uri></p></fn>
<fn id="fn2"><p><sup>2</sup><uri xlink:href="http://wkinmut.bioinfo.cnio.es">http://wkinmut.bioinfo.cnio.es</uri></p></fn>
</fn-group>
</back>
</article>
