<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="review-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Gene.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Gene.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Research Foundation</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fgene.2012.00014</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Mini Review</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>The Impact of Classification of Interest on Predictive Toxicogenomics</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Kerns</surname> <given-names>Robnet T.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<!-- http://www.frontiersin.org/Community/WhosWhoDetails.aspx?UID=42902&d=1&sname=RobnetKerns&name=Science -->
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Bushel</surname> <given-names>Pierre R.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="author-notes" rid="fn001">&#x0002A;</xref>
<!-- http://www.frontiersin.org/Community/WhosWhoDetails.aspx?UID=37563&d=1&sname=PierreBushel&name=Science -->
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Microarray and Genome Informatics, National Institute of Environmental Health Sciences</institution> <country>Research Triangle Park, NC, USA</country></aff>
<aff id="aff2"><sup>2</sup><institution>Biostatistics Branch, National Institute of Environmental Health Sciences</institution> <country>Research Triangle Park, NC, USA</country></aff>
<aff id="aff3"><sup>3</sup><institution>SRA International, Inc.</institution> <country>Research Triangle Park, NC, USA</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Juergen Borlak, Fraunhofer Institute of Toxicology and Experimental Medicine Toxicology, Germany</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Sabina Halappanavar, Health Canada, Canada; Carolyn E. Banister, Brown University, USA</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Pierre R. Bushel, Biostatistics Branch, National Institute of Environmental Health Sciences, P.O. Box 12233, Research Triangle Park, NC 27709, USA. e-mail: <email>bushel&#x00040;niehs.nih.gov</email></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Frontiers in Toxicogenomics, a specialty of Frontiers in Genetics.</p></fn>
</author-notes>
<pub-date pub-type="epreprint">
<day>30</day>
<month>11</month>
<year>2011</year>
</pub-date>
<pub-date pub-type="epub">
<day>07</day>
<month>02</month>
<year>2012</year>
</pub-date>
<pub-date pub-type="collection">
<year>2012</year>
</pub-date>
<volume>3</volume>
<elocation-id>14</elocation-id>
<history>
<date date-type="received">
<day>02</day>
<month>11</month>
<year>2011</year>
</date>
<date date-type="accepted">
<day>21</day>
<month>01</month>
<year>2012</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2012 Kerns and Bushel.</copyright-statement>
<copyright-year>2012</copyright-year>
<license license-type="open-access" xlink:href="http://www.frontiersin.org/licenseagreement"><p>This is an open-access article distributed under the terms of the <uri xlink:href="http://creativecommons.org/licenses/by-nc/3.0/">Creative Commons Attribution Non Commercial License</uri>, which permits non-commercial use, distribution, and reproduction in other forums, provided the original authors and source are credited.</p></license>
</permissions>
<abstract>
<p>The era of toxicogenomics has introduced a new way of monitoring the effect of environmental stressors and toxicants on biological systems via quantification of changes in gene expression. Because the liver is one of the major organs for synthesis and secretion of substances which metabolize endogenous and exogenous materials, there has been a great deal of interest in elucidating predictive and mechanistic genomic markers of hepatotoxicity. This mini-review will bring context to a limited number of toxicogenomics studies which used genomics to evaluate the transcriptional changes in blood and liver in response to acetaminophen (APAP) or other liver toxicants, but differed according to the classification of interest (COI), i.e., the partitioning of the samples <italic>a priori</italic> according to a common toxicological characteristic. The toxicogenomics studies highlighted are characterized by a classification of either no/low vs. high APAP dose exposure, none vs. observed necrosis, and severity of necrosis. The overlap or lack thereof between the gene classifiers and the modulated biological processes that are elucidated will be discussed to enhance the understanding of the effect of the particular COI model and experimental design used for prediction.</p>
</abstract>
<kwd-group>
<kwd>toxicogenomics</kwd>
<kwd>prediction</kwd>
<kwd>classification</kwd>
<kwd>gene expression</kwd>
<kwd>genomics</kwd>
<kwd>biomarker</kwd>
</kwd-group>
<counts>
<fig-count count="1"/>
<table-count count="2"/>
<equation-count count="0"/>
<ref-count count="11"/>
<page-count count="6"/>
<word-count count="3286"/>
</counts>
</article-meta>
</front>
<body>
<sec>
<title>Toxicogenomics</title>
<p>In toxicogenomics, gene expression data has been used to understand the transcriptional changes in response to exposure to environmental stressors and toxicants. The transcriptome may be a more effective marker or predictor for drug-induced liver injury (DILI) compared to other biomarkers (for a review see Watkins, <xref ref-type="bibr" rid="B10">2009</xref>). Several toxicogenomics studies have monitored the gene expression changes in the liver and/or the blood following exposure to hepatotoxicants (Bushel et al., <xref ref-type="bibr" rid="B2">2007</xref>; Huang et al., <xref ref-type="bibr" rid="B5">2008</xref>, <xref ref-type="bibr" rid="B4">2010</xref>). Figure <xref ref-type="fig" rid="F1">1</xref>A illustrates a few examples of descriptors used in toxicogenomics studies. For a given classical dose response curve, regions can be ill-defined as having no effect, an observable effect on the transcription of genes, or bracketed into a maximum limit where the highest dose produces an effect without appreciable toxicity (Figure <xref ref-type="fig" rid="F1">1</xref>A left inset). In other cases, measurements of serum enzymes alanine transaminase (ALT) and aspartate transaminase (AST) are typically used to monitor liver damage and separate samples into responders and non-responders of an exposure to a toxicant. However, increases in these transaminases are not good prognosticators of liver injury and as such, have limitations in their use as biomarkers (Blei, <xref ref-type="bibr" rid="B1">2005</xref>). For instance, ALT measurements do not always correlate well with histopathological data. There can be cases where the variation of the ALT measure among samples sharing the same necrosis severity score is large (Huang et al., <xref ref-type="bibr" rid="B4">2010</xref>). In a more practical approach, liver injury severity measures can be grouped into a composite score or according to the similarity of the biological processes of the samples to reflective a more intuitive phenotypic representation of the toxicant effect (Figure <xref ref-type="fig" rid="F1">1</xref>A right inset). Furthermore, and to complicate matters even further, when time is a function of the exposure along with dose, biological and transcriptional responses take the form of those exhibited in an acute/short-term effect that may or may not be reversible vs. a chronic or repeated effect resulting in a long-term manifestation of an injury. Cleary these limited, albeit general, representations of the interplay between toxicology and genomics provide good examples of how the desired end result of a toxicogenomics study ultimately depend on how one structures the study design and categorizes the phenotypic response(s).</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p><bold>(A)</bold> A few examples of descriptors used in toxicogenomics studies. Main inset: ALT over time as a function of sample type. Left inset: classical dose response curve. MTD: minimum tolerated dose. Right inset: liver injury severity measures grouped according to similarity to gene ontology biological processes (GO BPs) <bold>(B,C)</bold> Intersection of COI-specific predictive gene signature probes <bold>(B)</bold> and gene ontologies enriched in those predictive gene signatures <bold>(C)</bold>. <bold>(D)</bold> Network analysis of predictive gene signatures shows a common central gene relational network. Corresponding gene symbols of predictive gene signatures were submitted to STRING to generate a Gene Relational Network (solid black lines; line width correlates to STRING combined score, which is the computed final confidence score for association between two proteins). Nodes are connected to COIs Dose, Severity, and DILI by dashed lines. Node color further represents COI. Green: dose. Brown: dose and DILI. Red: DILI. Violet: DILI and severity. Blue: severity.</p></caption>
<graphic xlink:href="fgene-03-00014-g001.tif"/>
</fig>
</sec>
<sec>
<title>Classification of Interest</title>
<p>A region of interest (ROI) is commonly used in scientific research and analysis to focus on a particular area or subset of data identified for a particular purpose. In digital imaging for example, a ROI might be the focal point of a digital representation of an object. The ability to focus on a specific subset of data is useful to investigators in order to have a broader understanding of the larger experimental system or problem. Similarly, in toxicogenomics the classification of interest (COI) is defined as the <italic>a priori</italic> partitioning of biological samples according to a common toxicological characteristic.</p>
<p>Here we examine a set of three similar toxicogenomics studies (Bushel et al., <xref ref-type="bibr" rid="B2">2007</xref>; Huang et al., <xref ref-type="bibr" rid="B5">2008</xref>, <xref ref-type="bibr" rid="B4">2010</xref>), each of which used the same toxicant for exposure, but varied according to COI. The first study, Bushel et al. (<xref ref-type="bibr" rid="B2">2007</xref>), investigated the utility of rat blood gene expression data to predict exposure to acetaminophen (APAP). The samples were segregated based on a binary COI defined as either no or low dose exposure vs. high dose exposure. In another study, Huang et al. (<xref ref-type="bibr" rid="B5">2008</xref>) targeted rat liver gene expression data to predict the extent of necrosis in the liver. Here the COI was defined as the severity of liver necrosis determined by visual histopathology. In the final study, Huang et al. (<xref ref-type="bibr" rid="B4">2010</xref>) explored the use of rat blood gene expression data to predict liver necrosis. The COI for this latter study was no visual liver necrosis vs. some observable sign of necrosis in the treated samples. Although each study was similarly designed to identify predictive markers based on rat gene expression in response to APAP exposure, the predictive gene signatures generated share very few microarray probes in common. Further, the predictive gene signatures from the three studies differ widely in enriched biological processes and thus biological interpretation. It is unclear how these subtle differences in the study COI contributed to the identification of predictor genes, biological interpretation of these gene signatures and therefore the ultimate conclusions.</p>
</sec>
<sec>
<title>Comparison of Studies</title>
<p>The three toxicogenomic studies examined each define similar predictive transcriptomic signatures based on related yet distinctive COI, i.e., blood transcriptomic signatures that predict dose of APAP exposure, liver transcriptomic signatures that predict the severity of liver necrosis, and blood and liver transcriptomic signatures that predict DILI (Table <xref ref-type="table" rid="T1">1</xref>). To give an overview of the studies, these experiments are summarized and characterized as follows.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p><bold>Summary of predictive transcriptomics for APAP/DILI</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Classification of interest</th>
<th align="left" valign="top">Tissue</th>
<th align="left" valign="top">Prediction</th>
<th align="left" valign="top">Study</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Dose</td>
<td align="left" valign="top">Blood</td>
<td align="left" valign="top">No/low vs. high dose</td>
<td align="left" valign="top">Bushel et al. (<xref ref-type="bibr" rid="B2">2007</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Severity</td>
<td align="left" valign="top">Liver</td>
<td align="left" valign="top">No vs. minimal or mild vs. moderate or marked</td>
<td align="left" valign="top">Huang et al. (<xref ref-type="bibr" rid="B5">2008</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">DILI</td>
<td align="left" valign="top">Blood</td>
<td align="left" valign="top">None vs. observed</td>
<td align="left" valign="top">Huang et al. (<xref ref-type="bibr" rid="B4">2010</xref>)</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Using a blood gene expression dataset in rat, Bushel et al. (<xref ref-type="bibr" rid="B2">2007</xref>) reported a gene signature that predicted an endpoint of APAP exposure and classified the endpoint by level (no dose/low dose vs. high dose). Here into called the <italic>Dose-Classification</italic>. Using the Agilent 011868 Rat Oligo Microarray G4130A, pooling of the highly accurate classifiers common to any two out of the four prediction algorithms (<italic>k</italic>-NN/DME-ANOVA, PCA/EPIG, fuzzy-ARTMAP/DME-ANOVA, fuzzy-ARTMAP/DCE-ANOVA)<xref ref-type="fn" rid="fn1"><sup>1</sup></xref> resulted in 27 gene-probes. The classifiers were built to take advantage of dose as a main effect in the study with treatments levels divided into non\sub-toxic (0, 150&#x02009;mg/kg) and toxic (1500, 2000&#x02009;mg/kg) without considering the duration of exposure.</p>
<p>Huang et al. (<xref ref-type="bibr" rid="B5">2008</xref>) used rat liver gene expression assayed on the same array platform as Bushel et al. (<xref ref-type="bibr" rid="B2">2007</xref>) but from samples exposed to eight hepatotoxicants as a training set to predict the severity of liver injury. Here into called the <italic>Severity-Classification</italic>. An independent test set comprised of liver gene expression from rats treated with APAP, carbon tetrachloride and allyl alcohol. The union of the classifiers from the Random Forest and support vector machine prediction algorithms resulted in 22 gene-probes. The classifiers were trained using the partition of groups created by merging severity levels of a histopathological endpoint depending on the similarity of the biological processes of samples (group1: no necrosis; group2: minimal and mild necrosis; group3: moderate and marked necrosis) irrespective of the dose and/or time of exposure.</p>
<p>Huang et al. (<xref ref-type="bibr" rid="B4">2010</xref>) used liver and blood gene expression data from the Huang et al. (<xref ref-type="bibr" rid="B5">2008</xref>) study but predicted DILI within and across tissues, i.e., used the same predictive gene signature trained on liver expression to predict DILI based on both liver and blood gene expression data (and vice verse). Here into called the <italic>DILI-Classification</italic>. DILI was defined as a binary response (no necrosis vs. some observable sign of necrosis) without considering the dose and/or time of exposure. From the various, highly accurate predictors, 10 gene-probes occurred most often.</p>
<p>A meta-analysis of the data from these studies illustrates the effect of the COI in the identification of predictor genes and the interpretation of results. Although each study was similarly designed to identify predictive markers related to samples treated with APAP, there are very few probes common to all three gene signatures. While no gene-probes are common to each predictive gene signature, half of the DILI-Classification predictive gene signature probes (<italic>n</italic>&#x02009;&#x0003D;&#x02009;10) overlap with either the Dose-Classification or Severity-Classification predictive gene signatures (<italic>n</italic>&#x02009;&#x0003D;&#x02009;27 and <italic>n</italic>&#x02009;&#x0003D;&#x02009;22, respectively), which have no gene-probes common amongst themselves (Figure <xref ref-type="fig" rid="F1">1</xref>B). The pattern of overlap of the gene signatures from the three studies is only partially reinforced by similar patterns of intersection of functionally enriched ontologies in the predictive gene signatures (Figure <xref ref-type="fig" rid="F1">1</xref>C) as the Dose-Classification signature is not functionally enriched for gene ontologies after multiple test correction. Yet, despite the presence of only a single gene-probe common to both predictive signatures (Figure <xref ref-type="fig" rid="F1">1</xref>C), the Severity-Classification and DILI-Classification studies revealed a large amount of functional overlap (Figure <xref ref-type="fig" rid="F1">1</xref>C; Table <xref ref-type="table" rid="T2">2</xref>). Although there are no gene-probes or enriched gene ontologies common to all three gene signatures, surprisingly, genes from the three signatures together comprise a STRING-generated Gene Relational Network (Figure <xref ref-type="fig" rid="F1">1</xref>D) that is enriched for inflammatory and immune response ontologies (Table <xref ref-type="table" rid="T2">2</xref>).</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p><bold>Top eight BiNGO gene ontology terms enriched in Severity and DILI predictive gene signatures, and STRING gene relational network generated from the three predictive signatures</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">GO-ID</th>
<th align="left" valign="top"><italic>p</italic></th>
<th align="left" valign="top"><italic>x</italic></th>
<th align="left" valign="top"><italic>n</italic></th>
<th align="left" valign="top">Description</th>
<th align="left" valign="top">Genes</th>
</tr>
</thead>
<tbody>
<tr>
<td colspan="6" align="left" valign="top"><bold>&#x02009;&#x02009;&#x02009;Severity COI</bold></td>
</tr>
<tr>
<td align="left" valign="top">1782</td>
<td align="left" valign="top">1.15E&#x02212;02</td>
<td align="left" valign="top">2</td>
<td align="left" valign="top">9</td>
<td align="left" valign="top">B cell homeostasis</td>
<td align="left" valign="top">BAK1|BCL2A1D</td>
</tr>
<tr>
<td align="left" valign="top">7275</td>
<td align="left" valign="top">1.15E&#x02212;02</td>
<td align="left" valign="top">10</td>
<td align="left" valign="top">2238</td>
<td align="left" valign="top">Multicellular organismal development</td>
<td align="left" valign="top">BAK1|RND1|CCL2|SEMA4G|CD44|LGALS3|GPNMB|LCP1|VASP|ANXA2</td>
</tr>
<tr>
<td align="left" valign="top">31100</td>
<td align="left" valign="top">1.15E&#x02212;02</td>
<td align="left" valign="top">3</td>
<td align="left" valign="top">80</td>
<td align="left" valign="top">Organ regeneration</td>
<td align="left" valign="top">BAK1|CCL2|LCP1</td>
</tr>
<tr>
<td align="left" valign="top">32879</td>
<td align="left" valign="top">1.15E&#x02212;02</td>
<td align="left" valign="top">6</td>
<td align="left" valign="top">708</td>
<td align="left" valign="top">Regulation of localization</td>
<td align="left" valign="top">BAK1|CCL2|CXCL16|LCP1|EHD4|ANXA2</td>
</tr>
<tr>
<td align="left" valign="top">6935</td>
<td align="left" valign="top">1.15E&#x02212;02</td>
<td align="left" valign="top">3</td>
<td align="left" valign="top">83</td>
<td align="left" valign="top">Chemotaxis</td>
<td align="left" valign="top">LSP1|CCL2|CXCL16</td>
</tr>
<tr>
<td align="left" valign="top">48731</td>
<td align="left" valign="top">1.15E&#x02212;02</td>
<td align="left" valign="top">9</td>
<td align="left" valign="top">1908</td>
<td align="left" valign="top">System development</td>
<td align="left" valign="top">BAK1|RND1|CCL2|CD44|LGALS3|GPNMB|LCP1|VASP|ANXA2</td>
</tr>
<tr>
<td align="left" valign="top">16043</td>
<td align="left" valign="top">1.24E&#x02212;02</td>
<td align="left" valign="top">9</td>
<td align="left" valign="top">1982</td>
<td align="left" valign="top">Cellular component organization</td>
<td align="left" valign="top">BAK1|RND1|CD44|LGALS3|TUBB6|LCP1|VASP|EHD4|ANXA2</td>
</tr>
<tr>
<td align="left" valign="top">5539</td>
<td align="left" valign="top">1.42E&#x02212;02</td>
<td align="left" valign="top">3</td>
<td align="left" valign="top">106</td>
<td align="left" valign="top">Glycosaminoglycan binding</td>
<td align="left" valign="top">CCL2|CD44|GPNMB</td>
</tr>
<tr>
<td colspan="6" align="left" valign="top"><bold>&#x02009;&#x02009;&#x02009;DILI COI</bold></td>
</tr>
<tr>
<td align="left" valign="top">5615</td>
<td align="left" valign="top">3.79E&#x02212;07</td>
<td align="left" valign="top">7</td>
<td align="left" valign="top">585</td>
<td align="left" valign="top">Extracellular space</td>
<td align="left" valign="top">CCL2|S100A8|MMP8|S100A9|SERPINB1A|HP|CXCL10</td>
</tr>
<tr>
<td align="left" valign="top">32496</td>
<td align="left" valign="top">8.22E&#x02212;07</td>
<td align="left" valign="top">5</td>
<td align="left" valign="top">166</td>
<td align="left" valign="top">Response to lipopolysaccharide</td>
<td align="left" valign="top">CCL2|S100A8|S100A9|HP|CXCL10</td>
</tr>
<tr>
<td align="left" valign="top">42330</td>
<td align="left" valign="top">2.44E&#x02212;06</td>
<td align="left" valign="top">4</td>
<td align="left" valign="top">83</td>
<td align="left" valign="top">Taxis</td>
<td align="left" valign="top">CCL2|S100A8|S100A9|CXCL10</td>
</tr>
<tr>
<td align="left" valign="top">9408</td>
<td align="left" valign="top">1.09E&#x02212;04</td>
<td align="left" valign="top">3</td>
<td align="left" valign="top">74</td>
<td align="left" valign="top">Response to heat</td>
<td align="left" valign="top">CCL2|HP|CXCL10</td>
</tr>
<tr>
<td align="left" valign="top">2544</td>
<td align="left" valign="top">3.34E&#x02212;04</td>
<td align="left" valign="top">2</td>
<td align="left" valign="top">14</td>
<td align="left" valign="top">Chronic inflammatory response</td>
<td align="left" valign="top">S100A8|S100A9</td>
</tr>
<tr>
<td align="left" valign="top">45471</td>
<td align="left" valign="top">4.01E&#x02212;04</td>
<td align="left" valign="top">3</td>
<td align="left" valign="top">126</td>
<td align="left" valign="top">Response to ethanol</td>
<td align="left" valign="top">CCL2|S100A8|S100A9</td>
</tr>
<tr>
<td align="left" valign="top">2690</td>
<td align="left" valign="top">4.36E&#x02212;04</td>
<td align="left" valign="top">2</td>
<td align="left" valign="top">17</td>
<td align="left" valign="top">Positive regulation of leukocyte chemotaxis</td>
<td align="left" valign="top">CCL2|CXCL10</td>
</tr>
<tr>
<td align="left" valign="top">30595</td>
<td align="left" valign="top">1.30E&#x02212;03</td>
<td align="left" valign="top">2</td>
<td align="left" valign="top">34</td>
<td align="left" valign="top">Leukocyte chemotaxis</td>
<td align="left" valign="top">CCL2|S100A9</td>
</tr>
<tr>
<td colspan="6" align="left" valign="top"><bold>&#x02009;&#x02009;&#x02009;STRING gene relational network</bold></td>
</tr>
<tr>
<td align="left" valign="top">6952</td>
<td align="left" valign="top">6.29E&#x02212;09</td>
<td align="left" valign="top">10</td>
<td align="left" valign="top">421</td>
<td align="left" valign="top">Defense response</td>
<td align="left" valign="top">LSP1|CCL2|CD44|S100A8|CXCL16|S100A9|MS4A1|HP|CFD|CXCL10</td>
</tr>
<tr>
<td align="left" valign="top">6954</td>
<td align="left" valign="top">1.81E&#x02212;08</td>
<td align="left" valign="top">8</td>
<td align="left" valign="top">216</td>
<td align="left" valign="top">Inflammatory response</td>
<td align="left" valign="top">CCL2|CD44|S100A8|S100A9|MS4A1|HP|CFD|CXCL10</td>
</tr>
<tr>
<td align="left" valign="top">42330</td>
<td align="left" valign="top">5.83E&#x02212;08</td>
<td align="left" valign="top">6</td>
<td align="left" valign="top">83</td>
<td align="left" valign="top">Taxis</td>
<td align="left" valign="top">LSP1|CCL2|S100A8|CXCL16|S100A9|CXCL10</td>
</tr>
<tr>
<td align="left" valign="top">44421</td>
<td align="left" valign="top">4.50E&#x02212;07</td>
<td align="left" valign="top">10</td>
<td align="left" valign="top">769</td>
<td align="left" valign="top">Extracellular region part</td>
<td align="left" valign="top">CCL2|LGALS3|S100A8|CXCL16|MMP8|S100A9|HP|CFD|CXCL10|ANXA2</td>
</tr>
<tr>
<td align="left" valign="top">40011</td>
<td align="left" valign="top">3.28E&#x02212;06</td>
<td align="left" valign="top">7</td>
<td align="left" valign="top">318</td>
<td align="left" valign="top">Locomotion</td>
<td align="left" valign="top">LSP1|CCL2|CD44|S100A8|CXCL16|S100A9|CXCL10</td>
</tr>
<tr>
<td align="left" valign="top">9605</td>
<td align="left" valign="top">7.24E&#x02212;06</td>
<td align="left" valign="top">8</td>
<td align="left" valign="top">561</td>
<td align="left" valign="top">Response to external stimulus</td>
<td align="left" valign="top">LSP1|CCL2|CD44|S100A8|CXCL16|S100A9|HP|CXCL10</td>
</tr>
<tr>
<td align="left" valign="top">5615</td>
<td align="left" valign="top">9.00E&#x02212;06</td>
<td align="left" valign="top">8</td>
<td align="left" valign="top">585</td>
<td align="left" valign="top">Extracellular space</td>
<td align="left" valign="top">CCL2|S100A8|CXCL16|MMP8|S100A9|HP|CFD|CXCL10</td>
</tr>
<tr>
<td align="left" valign="top">2376</td>
<td align="left" valign="top">1.63E&#x02212;05</td>
<td align="left" valign="top">8</td>
<td align="left" valign="top">640</td>
<td align="left" valign="top">Immune system process</td>
<td align="left" valign="top">CCL2|CXCL16|S100A9|SPIB|OAS1A|CFD|LCP1|CXCL10</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic><italic>p</italic>, Benjamini and Hochberg False Discovery Rate correction; <italic>x</italic>, number of genes from list in GO term; <italic>n</italic>, number of genes from background in GO term</italic>.</p>
</table-wrap-foot>
</table-wrap>
<p>Common to gene signatures that predict liver injury based on both liver and blood gene expression (Severity-Classification and DILI-Classification) was chemokine (C&#x02013;C motif) ligand 2 (Ccl2), a key chemokine that regulates migration and infiltration of monocytes and macrophages, and has a putative role in APAP exposure and DILI. The Ccl2 protein measured by ELISA was shown to be elevated in blood from patients following acute APAP overdose (James et al., <xref ref-type="bibr" rid="B6">2005</xref>). The role of Ccl2 in DILI was further implicated in a study of Ccl2-deficient mice which showed protection from necrosis when administered carbon tetrachloride (Zamara et al., <xref ref-type="bibr" rid="B11">2007</xref>). In addition, when a mouse model genetically deficient in the C-C chemokine receptor and negatively mediates Ccl2 expression is challenged with APAP, an ELISA revealed elevated liver necrosis and increased Ccl2 protein (Hogaboam et al., <xref ref-type="bibr" rid="B3">2000</xref>). The genes from the three COIs together form a relational network dominated by inflammatory responsive genes. These inflammatory responsive genes appear to be central to the major biological pathways that are presumably part of the mechanism(s) leading to APAP liver tissue damage.</p>
<p>One would certainly expect more overlap between the predictors for the different COIs. However, there is no reason why very few genes, even a single one would not have the predictive power to accurately classify the exposed samples. If more mechanistic information needs to be gleaned from the predictors, the number of genes in the classifiers can be enlarged. This will undoubtedly bring more biological context to the classifiers but definitely at the expense of losing prediction accuracy due to the addition of potentially noisy genes or genes as false positives. Where to draw the line is not abundantly clear. What is certain is that depending on a typical toxicogenomics study, if the gene expression data is analyzed using different COIs, then several different predictors can be derived as potential biomarkers which may have a small but biologically relevant common theme, yet still captures the underpinnings of the data surveyor&#x02019;s interest. Possible solutions to this problem may be (1) include more descriptors for a study design, i.e., increase the size of the data used, (2) leverage a composite phenotype of many endpoint measurements, and/or (3) focus on a more informative classifier and improved training set.</p>
</sec>
<sec>
<title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<app-group>
<app id="A1">
<title>Appendix</title>
<sec>
<title>Addendum: bioinformatics methods used for analysis</title>
<p>Gene ontologies enriched in the assembled gene signatures for the three studies were created using BiNGO as described below.</p>
<p>Biomolecular interaction networks of predictive gene signatures were generated using STRING (Szklarczyk et al., <xref ref-type="bibr" rid="B9">2011</xref>). Gene symbols from the three assembled gene signatures were submitted to STRING using the default settings, the output was downloaded as a text file and then imported into Cytoscape (Smoot et al., <xref ref-type="bibr" rid="B8">2011</xref>) for graphical visualization. Interactions between genes and nodes representing a COI were manually added prior to import into Cytoscape.</p>
<p>Over-representation analysis of gene signatures and networks rendered in Cytoscape was by the BiNGO v. 2.44 plug-in (Maere et al., <xref ref-type="bibr" rid="B7">2005</xref>) using the default settings, GO_Full as the selected ontology file and <italic>Rattus norvegicus</italic> as the selected annotation. For intersection comparison of gene ontologies enriched in gene signatures, all terms with Benjamini and Hochberg FDR correction <italic>p</italic>&#x02009;&#x0003C;&#x02009;0.05 were used. For reporting significant functionally enriched ontologies in tabular format, terms were filtered to remove less significant ontologies with redundant gene lists; and, only the top eight filtered ontologies ranked by increasing <italic>p</italic> were reported.</p>
</sec>
</app>
</app-group>
<ack>
<p>This research was supported, in part by, the Intramural Research Program of the National Institutes of Health (NIH) and National Institute of Environmental Health Sciences (NIEHS; Z01 ES102345-04).</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Blei</surname> <given-names>A. T.</given-names></name></person-group> (<year>2005</year>). <article-title>Selection for acute liver failure: have we got it right?</article-title> <source>Liver Transpl.</source> <volume>11</volume>(<issue>Suppl. 2</issue>), <fpage>S30</fpage>&#x02013;<lpage>S34</lpage>.<pub-id pub-id-type="doi">10.1002/lt.20595</pub-id><pub-id pub-id-type="pmid">16237684</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bushel</surname> <given-names>P. R.</given-names></name> <name><surname>Heinloth</surname> <given-names>A. N.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Huang</surname> <given-names>L.</given-names></name> <name><surname>Chou</surname> <given-names>J. W.</given-names></name> <name><surname>Boorman</surname> <given-names>G. A.</given-names></name> <name><surname>Malarkey</surname> <given-names>D. E.</given-names></name> <name><surname>Houle</surname> <given-names>C. D.</given-names></name> <name><surname>Ward</surname> <given-names>S. M.</given-names></name> <name><surname>Wilson</surname> <given-names>R. E.</given-names></name> <name><surname>Fannin</surname> <given-names>R. D.</given-names></name> <name><surname>Russo</surname> <given-names>M. W.</given-names></name> <name><surname>Watkins</surname> <given-names>P. B.</given-names></name> <name><surname>Tennant</surname> <given-names>R. W.</given-names></name> <name><surname>Paules</surname> <given-names>R. S.</given-names></name></person-group> (<year>2007</year>). <article-title>Blood gene expression signatures predict exposure levels</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>104</volume>, <fpage>18211</fpage>&#x02013;<lpage>18216</lpage>.<pub-id pub-id-type="doi">10.1073/pnas.0706987104</pub-id><pub-id pub-id-type="pmid">17984051</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hogaboam</surname> <given-names>C. M.</given-names></name> <name><surname>Bone-Larson</surname> <given-names>C. L.</given-names></name> <name><surname>Steinhauser</surname> <given-names>M. L.</given-names></name> <name><surname>Matsukawa</surname> <given-names>A.</given-names></name> <name><surname>Gosling</surname> <given-names>J.</given-names></name> <name><surname>Boring</surname> <given-names>L.</given-names></name> <name><surname>Charo</surname> <given-names>I. F.</given-names></name> <name><surname>Simpson</surname> <given-names>K. J.</given-names></name> <name><surname>Lukacs</surname> <given-names>N. W.</given-names></name> <name><surname>Kunkel</surname> <given-names>S. L.</given-names></name></person-group> (<year>2000</year>). <article-title>Exaggerated hepatic injury due to acetaminophen challenge in mice lacking C-C chemokine receptor 2</article-title>. <source>Am. J. Pathol.</source> <volume>156</volume>, <fpage>1245</fpage>&#x02013;<lpage>1252</lpage>.<pub-id pub-id-type="doi">10.1016/S0002-9440(10)64775-X</pub-id><pub-id pub-id-type="pmid">10751350</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>J.</given-names></name> <name><surname>Shi</surname> <given-names>W.</given-names></name> <name><surname>Zhang</surname> <given-names>J.</given-names></name> <name><surname>Chou</surname> <given-names>J. W.</given-names></name> <name><surname>Paules</surname> <given-names>R. S.</given-names></name> <name><surname>Gerrish</surname> <given-names>K.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Luo</surname> <given-names>J.</given-names></name> <name><surname>Wolfinger</surname> <given-names>R. D.</given-names></name> <name><surname>Bao</surname> <given-names>W.</given-names></name> <name><surname>Chu</surname> <given-names>T. M.</given-names></name> <name><surname>Nikolsky</surname> <given-names>Y.</given-names></name> <name><surname>Nikolskaya</surname> <given-names>T.</given-names></name> <name><surname>Dosymbekov</surname> <given-names>D.</given-names></name> <name><surname>Tsyganova</surname> <given-names>M. O.</given-names></name> <name><surname>Shi</surname> <given-names>L.</given-names></name> <name><surname>Fan</surname> <given-names>X.</given-names></name> <name><surname>Corton</surname> <given-names>J. C.</given-names></name> <name><surname>Chen</surname> <given-names>M.</given-names></name> <name><surname>Cheng</surname> <given-names>Y.</given-names></name> <name><surname>Tong</surname> <given-names>W.</given-names></name> <name><surname>Fang</surname> <given-names>H.</given-names></name> <name><surname>Bushel</surname> <given-names>P. R.</given-names></name></person-group> (<year>2010</year>). <article-title>Genomic indicators in the blood predict drug-induced liver injury</article-title>. <source>Pharmacogenomics J.</source> <volume>10</volume>, <fpage>267</fpage>&#x02013;<lpage>277</lpage>.<pub-id pub-id-type="doi">10.1038/tpj.2010.33</pub-id><pub-id pub-id-type="pmid">20676066</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>L.</given-names></name> <name><surname>Heinloth</surname> <given-names>A. N.</given-names></name> <name><surname>Zeng</surname> <given-names>Z. B.</given-names></name> <name><surname>Paules</surname> <given-names>R. S.</given-names></name> <name><surname>Bushel</surname> <given-names>P. R.</given-names></name></person-group> (<year>2008</year>). <article-title>Genes related to apoptosis predict necrosis of the liver as a phenotype observed in rats exposed to a compendium of hepatotoxicants</article-title>. <source>BMC Genomics</source> <volume>9</volume>, <fpage>288</fpage>.<pub-id pub-id-type="doi">10.1186/1471-2164-9-288</pub-id><pub-id pub-id-type="pmid">18558008</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>James</surname> <given-names>L. P.</given-names></name> <name><surname>Simpson</surname> <given-names>P. M.</given-names></name> <name><surname>Farrar</surname> <given-names>H. C.</given-names></name> <name><surname>Kearns</surname> <given-names>G. L.</given-names></name> <name><surname>Wasserman</surname> <given-names>G. S.</given-names></name> <name><surname>Blumer</surname> <given-names>J. L.</given-names></name> <name><surname>Reed</surname> <given-names>M. D.</given-names></name> <name><surname>Sullivan</surname> <given-names>J. E.</given-names></name> <name><surname>Hinson</surname> <given-names>J. A.</given-names></name></person-group> (<year>2005</year>). <article-title>Cytokines and toxicity in acetaminophen overdose</article-title>. <source>J. Clin. Pharmacol.</source> <volume>45</volume>, <fpage>1165</fpage>&#x02013;<lpage>1171</lpage>.<pub-id pub-id-type="doi">10.1177/0091270005280296</pub-id><pub-id pub-id-type="pmid">16172181</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maere</surname> <given-names>S.</given-names></name> <name><surname>Heymans</surname> <given-names>K.</given-names></name> <name><surname>Kuiper</surname> <given-names>M.</given-names></name></person-group> (<year>2005</year>). <article-title>BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks</article-title>. <source>Bioinformatics</source> <volume>21</volume>, <fpage>3448</fpage>&#x02013;<lpage>3449</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/bti551</pub-id><pub-id pub-id-type="pmid">15972284</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Smoot</surname> <given-names>M. E.</given-names></name> <name><surname>Ono</surname> <given-names>K.</given-names></name> <name><surname>Ruscheinski</surname> <given-names>J.</given-names></name> <name><surname>Wang</surname> <given-names>P. L.</given-names></name> <name><surname>Ideker</surname> <given-names>T.</given-names></name></person-group> (<year>2011</year>). <article-title>Cytoscape 2.8: new features for data integration and network visualization</article-title>. <source>Bioinformatics</source> <volume>27</volume>, <fpage>431</fpage>&#x02013;<lpage>432</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btr045</pub-id><pub-id pub-id-type="pmid">21149340</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Szklarczyk</surname> <given-names>D.</given-names></name> <name><surname>Franceschini</surname> <given-names>A.</given-names></name> <name><surname>Kuhn</surname> <given-names>M.</given-names></name> <name><surname>Simonovic</surname> <given-names>M.</given-names></name> <name><surname>Roth</surname> <given-names>A.</given-names></name> <name><surname>Minguez</surname> <given-names>P.</given-names></name> <name><surname>Doerks</surname> <given-names>T.</given-names></name> <name><surname>Stark</surname> <given-names>M.</given-names></name> <name><surname>Muller</surname> <given-names>J.</given-names></name> <name><surname>Bork</surname> <given-names>P.</given-names></name> <name><surname>Jensen</surname> <given-names>L. J.</given-names></name> <name><surname>Mering</surname> <given-names>C. V.</given-names></name></person-group> (<year>2011</year>). <article-title>The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored</article-title>. <source>Nucleic Acids Res.</source> <volume>39</volume>, <fpage>D561</fpage>&#x02013;<lpage>D568</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkq973</pub-id><pub-id pub-id-type="pmid">21045058</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Watkins</surname> <given-names>P. B.</given-names></name></person-group> (<year>2009</year>). <article-title>Biomarkers for the diagnosis and management of drug-induced liver injury</article-title>. <source>Semin. Liver Dis.</source> <volume>29</volume>, <fpage>393</fpage>&#x02013;<lpage>399</lpage>.<pub-id pub-id-type="doi">10.1055/s-0029-1240008</pub-id><pub-id pub-id-type="pmid">19826973</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zamara</surname> <given-names>E.</given-names></name> <name><surname>Galastri</surname> <given-names>S.</given-names></name> <name><surname>Aleffi</surname> <given-names>S.</given-names></name> <name><surname>Petrai</surname> <given-names>I.</given-names></name> <name><surname>Aragno</surname> <given-names>M.</given-names></name> <name><surname>Mastrocola</surname> <given-names>R.</given-names></name> <name><surname>Novo</surname> <given-names>E.</given-names></name> <name><surname>Bertolani</surname> <given-names>C.</given-names></name> <name><surname>Milani</surname> <given-names>S.</given-names></name> <name><surname>Vizzutti</surname> <given-names>F.</given-names></name> <name><surname>Vercelli</surname> <given-names>A.</given-names></name> <name><surname>Pinzani</surname> <given-names>M.</given-names></name> <name><surname>Laffi</surname> <given-names>G.</given-names></name> <name><surname>LaVilla</surname> <given-names>G.</given-names></name> <name><surname>Parola</surname> <given-names>M.</given-names></name> <name><surname>Marra</surname> <given-names>F.</given-names></name></person-group> (<year>2007</year>). <article-title>Prevention of severe toxic liver injury and oxidative stress in MCP-1-deficient mice</article-title>. <source>J. Hepatol.</source> <volume>46</volume>, <fpage>230</fpage>&#x02013;<lpage>238</lpage>.<pub-id pub-id-type="doi">10.1016/j.jhep.2006.09.007</pub-id><pub-id pub-id-type="pmid">17125873</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="fn1"><p><sup>1</sup>The prediction algorithms are designated as the method used for making the prediction followed by the method used to select the genes to build the classifier (i.e., k-NN/DME). k-NN, k-nearest neighbors; DME-ANOVA, an analysis of variance model with dose as the main effect; PCA, principal component analysis; EPIG, extracting patterns and identifying genes; fuzzy-ARTMAP, fuzzy adaptive resonance theory map (neural network); DCE-ANOVA, an analysis of variance model with dose as a confounded effect.</p></fn>
</fn-group>
</back>
</article>
