<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="review-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Gene.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Gene.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Research Foundation</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fgene.2012.00008</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Review Article</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Statistical Inference and Reverse Engineering of Gene Regulatory Networks from Observational Expression Data</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Emmert-Streib</surname> <given-names>Frank</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="author-notes" rid="fn001">&#x0002A;</xref>
<!-- http://www.frontiersin.org/people/frankemmert_streib/23749 -->
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Glazko</surname> <given-names>Galina V.</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="author-notes" rid="fn001">&#x0002A;</xref>
<!-- http://www.frontiersin.org/Community/WhosWhoDetails.aspx?UID=29842&d=1&sname=GalinaGlazko&name=Science -->
</contrib>
<contrib contrib-type="author">
<name><surname>Altay</surname> <given-names>G&#x000F6;kmen</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
<xref ref-type="aff" rid="aff5"><sup>5</sup></xref>
<!-- http://www.frontiersin.org/Community/WhosWhoDetails.aspx?UID=44911&d=1&sname=GokmenAltay&name=Science -->
</contrib>
<contrib contrib-type="author">
<name><surname>de Matos Simoes</surname> <given-names>Ricardo</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<!-- http://www.frontiersin.org/Community/WhosWhoDetails.aspx?UID=39291&d=1&sname=RicardoDe_Matos_Simoes&name=Science -->
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Computational Biology and Machine Learning Lab, School of Medicine, Dentistry and Biomedical Sciences, Center for Cancer Research and Cell Biology, Queen&#x02019;s University Belfast</institution> <country>Belfast, UK</country></aff>
<aff id="aff2"><sup>2</sup><institution>Statistics and Computational Biology Laboratory, Li Ka Shing Centre, Cancer Research UK Cambridge Research Institute, University of Cambridge</institution> <country>Cambridge, UK</country></aff>
<aff id="aff3"><sup>3</sup><institution>Division of Biomedical Informatics, University of Arkansas for Medical Sciences</institution> <country>Little Rock, AR, USA</country></aff>
<aff id="aff4"><sup>4</sup><institution>Department of Oncology, Cancer Research UK Cambridge Research Institute, University of Cambridge</institution> <country>Cambridge, UK</country></aff>
<aff id="aff5"><sup>5</sup><institution>Department of Electrical and Electronics Engineering, Bahcesehir University</institution> <country>Istanbul, Turkey</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Raya Khanin, Memorial Sloan-Kettering Cancer Center, USA</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Raya Khanin, Memorial Sloan-Kettering Cancer Center, USA; Erik Larsson, Memorial Sloan-Kettering Cancer Center, USA</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Frank Emmert-Streib, Cancer Research UK Cambridge Research Institute, Li Ka Shing Centre, University of Cambridge, Cambridge, UK. e-mail: <email>v&#x00040;bio-complexity.com</email>; Galina V. Glazko, Division of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR, USA. e-mail: <email>gvglazko&#x00040;uams.edu</email></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Frontiers in Bioinformatics and Computational Biology, a specialty of Frontiers in Genetics.</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>03</day>
<month>02</month>
<year>2012</year>
</pub-date>
<pub-date pub-type="collection">
<year>2012</year>
</pub-date>
<volume>3</volume>
<elocation-id>8</elocation-id>
<history>
<date date-type="received">
<day>07</day>
<month>12</month>
<year>2011</year>
</date>
<date date-type="accepted">
<day>10</day>
<month>01</month>
<year>2012</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2012 Emmert-Streib, Glazko, Altay and de Matos Simoes.</copyright-statement>
<copyright-year>2012</copyright-year>
<license license-type="open-access" xlink:href="http://www.frontiersin.org/licenseagreement"><p>This is an open-access article distributed under the terms of the <uri xlink:href="http://creativecommons.org/licenses/by-nc/3.0/">Creative Commons Attribution Non Commercial License</uri>, which permits non-commercial use, distribution, and reproduction in other forums, provided the original authors and source are credited.</p></license>
</permissions>
<abstract>
<p>In this paper, we present a systematic and conceptual overview of methods for inferring gene regulatory networks from observational gene expression data. Further, we discuss two classic approaches to infer causal structures and compare them with contemporary methods by providing a conceptual categorization thereof. We complement the above by surveying global and local evaluation measures for assessing the performance of inference algorithms.</p>
</abstract>
<kwd-group>
<kwd>gene regulatory networks</kwd>
<kwd>statistical inference</kwd>
<kwd>reverse engineering</kwd>
<kwd>causal relations</kwd>
<kwd>directed acyclic graphs</kwd>
<kwd>Bayesian network</kwd>
<kwd>information-theory methods</kwd>
</kwd-group>
<counts>
<fig-count count="7"/>
<table-count count="0"/>
<equation-count count="23"/>
<ref-count count="110"/>
<page-count count="15"/>
<word-count count="13174"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="introduction" id="s1">
<label>1</label> <title>Introduction</title>
<p>The purpose of this paper is to provide a systematic overview of methods used to estimate gene regulatory networks (GRN) from large-scale expression data. The inference of gene regulatory networks, which is sometimes also referred to as reverse engineering (Stolovitzky and Califano, <xref ref-type="bibr" rid="B90">2007</xref>; Stolovitzky et al., <xref ref-type="bibr" rid="B91">2009</xref>), is the process of estimating the direct physical (biochemical) interactions of a cellular system from data. That means one aims for identifying all molecular regulatory interactions among genes that are present in an organism to establish and maintain all required biological functions characterizing a certain physiological state of a cell. Depending on the data used for inferring the network, which, principally, may either come from DNA microarray, RNA-seq, proteomics or ChIP-chip experiments, or combinations thereof, the biological interpretation of an edge in these networks is dependent thereon. For expression data, inferred interactions may preferably indicate transcription regulation, but can also correspond to protein-protein interactions. Due to the causal character of these networks, which ensures a meaningful biological interpretation, the genome-wide inference of gene regulatory networks holds great promise in enhancing the understanding of normal cell physiology, and also complex pathological phenotypes (Barab&#x000E1;si and Oltvai, <xref ref-type="bibr" rid="B8">2004</xref>; Emmert-Streib, <xref ref-type="bibr" rid="B38">2007</xref>; Schadt, <xref ref-type="bibr" rid="B84">2009</xref>).</p>
<p>Due to the fact that this field is currently vastly expanding, this overview is inevitably incomplete. Instead of aiming to cover as many approaches as possible, we focus on conceptual clarity and methods for observational expression data. That means, we review statistical approaches from the literature we consider most important and show that they can be categorized nicely according to assumptions they make about the dynamic behavior of the data but also with respect to conceptual strategies they employ. In order to facilitate the understanding of the latter point we present also two seminal, and in the meanwhile classic, methods for the causal inference of networks and their theoretical foundations (Chow and Liu, <xref ref-type="bibr" rid="B24">1968</xref>; Pearl, <xref ref-type="bibr" rid="B77">1988</xref>; Spirtes et al., <xref ref-type="bibr" rid="B88">1993</xref>). Model-based approaches based on, e.g., Boolean networks or differential equations (de Jong, <xref ref-type="bibr" rid="B29">2002</xref>; Liu et al., <xref ref-type="bibr" rid="B64">2008</xref>) are not covered in this paper. Also, supervised and semi-supervised network inference methods (Ernst et al., <xref ref-type="bibr" rid="B41">2008</xref>; Mordelet and Vert, <xref ref-type="bibr" rid="B74">2008</xref>; Cerulo et al., <xref ref-type="bibr" rid="B19">2010</xref>) that require a training set will not be discussed. Further, we content ourselves with approaches for observational data (Rubin, <xref ref-type="bibr" rid="B83">1974</xref>; Rosenbaum, <xref ref-type="bibr" rid="B82">2002</xref>) neglecting methods that utilize interventional or perturbational data. In addition to the presentation of inference methods, we provide also an overview of global and local performance metrics frequently used to assess the inference abilities of such methods. Finally, we will emphasize and discuss the conceptual closeness of all current methods with respect to two classic methods (Chow and Liu, <xref ref-type="bibr" rid="B24">1968</xref>; Pearl, <xref ref-type="bibr" rid="B77">1988</xref>; Spirtes et al., <xref ref-type="bibr" rid="B88">1993</xref>).</p>
</sec>
<sec sec-type="materials|methods">
<label>2</label> <title>Methods</title>
<p>The inference of gene networks from high-throughput data is a very complex and vastly expanding area triggered by the invention of measurement technologies. In order to provide a systematic discussion of the underlying principles we limit this review to observational, steady-state gene expression data, and consider correlation- and mutual information-based inference methods only, as visualized in Figure <xref ref-type="fig" rid="F1">1</xref>. These methods are representative of linear and non-linear methods. Principally, there are three fundamental levels of a molecular system as given by the central dogma of molecular biology (Crick, <xref ref-type="bibr" rid="B26">1970</xref>), namely, the DNA, mRNA, and the protein level. Consequently, these levels imply sensible variables that can and should be measured to obtain information about the biological function of a cell. Specifically, one can distinguish between measurements that provide information about the protein-DNA binding (ChIP-chip, ChIP-seq), gene expression (DNA microarray, RNA-seq), and the protein-protein interaction (protein microarray) which can be used to infer various types of gene networks (Emmert-Streib and Glazko, <xref ref-type="bibr" rid="B40">2011</xref>). We focus in this review on observational gene expression data because to date, this data type is predominating.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p><bold>Conceptual categorization of network inference methods and the biology, technology, and data types they depend on</bold>. The red arrows and boxes represent the focus of this review article.</p></caption>
<graphic xlink:href="fgene-03-00008-g001.tif"/>
</fig>
<p>The complexity of the network inference problem can be visualized with the help of Figure <xref ref-type="fig" rid="F1">1</xref>. There are two major factors that contribute to it. First, almost all components shown in Figure <xref ref-type="fig" rid="F1">1</xref> as represented by the boxes can be connected with each other. That means, they are not mutually exclusive but can be combined in a great variety. This concerns the integration of different high-throughput data, but also the combination of different data types or even methods. Second, any network inference method is subject to statistical and computational variations in form of technical modifications. This may relate to newly developed statistical estimators or optimization methods, or to the design of efficient algorithms or their parallelization on a computer cluster.</p>
<sec>
<label>2.1</label> <title>Classic approaches to causal inference</title>
<p>We begin our presentation by some necessary preliminaries. Directed acyclic graphs (DAGs) are frequently employed to represent causal relations among variables (Wright, <xref ref-type="bibr" rid="B102">1934</xref>; Verma and Pearl, <xref ref-type="bibr" rid="B93">1990</xref>; Shipley, <xref ref-type="bibr" rid="B86">2000</xref>). In such a graph, <italic>G</italic>, a directed edge from node <italic>X</italic> to <italic>Y</italic> means that <italic>X</italic> is the cause for <italic>Y</italic>. For this reason these networks are also called causal graphs. For the causal inference of network structures, the graph theoretical measure d-separation is key. It can be defined in the following way (Pearl, <xref ref-type="bibr" rid="B77">1988</xref>; Verma and Pearl, <xref ref-type="bibr" rid="B93">1990</xref>).</p>
<p>Definition 1. Two nodes <italic>X</italic> and <italic>Y</italic> are called d-separated by set <italic>S</italic> if and only if <italic>X</italic>, <italic>Y</italic>, and <italic>S</italic> are disjoint and every undirected path from <italic>X</italic> to <italic>Y</italic> is blocked by the nodes in set <italic>S</italic>.</p>
<p>If <italic>X</italic> is d-separated from <italic>Y</italic> by <italic>S</italic> we write (X&#x02009;&#x02568;&#x02009;<italic>Y</italic>|<italic>S</italic>).</p>
<p>Definition 2. A path <italic>w</italic> is called d-separated or blocked by a set of nodes <italic>S</italic> if and only if either of the following two criteria hold:</p>
<list list-type="order">
<list-item><p>on <italic>w</italic> is a node <italic>s</italic> which is no collider and <italic>s</italic>&#x02009;&#x02208;&#x02009;<italic>S</italic></p></list-item>
<list-item><p>on <italic>w</italic> is a node <italic>s</italic> which is a collider and it holds <italic>s</italic>&#x02009;&#x02209;&#x02009;<italic>S</italic> and <italic>de</italic>(<italic>s</italic>)&#x02009;&#x02209;&#x02009;<italic>S</italic>.</p></list-item>
</list>
<p>Here, <italic>de</italic>(<italic>s</italic>) denotes the set of descendants of node <italic>s</italic>. A node <italic>Z</italic> is called a collider if the edge of the incoming as well as the edge of the outgoing link point toward <italic>Z</italic>. In Figure <xref ref-type="fig" rid="F2">2</xref> we clarify this notation. The top Figure shows a collider <italic>Z</italic>. For example, considering the path <italic>X</italic>&#x02009;&#x02192;&#x02009;<italic>Z</italic>&#x02009;&#x02190;&#x02009;<italic>Y</italic>, then from the definition of d-separation follows that <italic>X</italic> is d-separated from <italic>Y</italic> if conditioned on the empty set <italic>S</italic>&#x02009;&#x0003D;&#x02009;&#x02205;, (<italic>X</italic>&#x02009;&#x02568;&#x02009;<italic>Y</italic>|&#x02205;), or simply (<italic>X</italic>&#x02009;&#x02568;&#x02009;<italic>Y</italic>). However, if we condition on <italic>Z</italic> then (<italic>X</italic>&#x02009;<inline-graphic xlink:href="fgene-03-00008-i001.tif"/> <italic>Y</italic>|<italic>Z</italic>), according to Definition 2. In this case <italic>Z</italic> unblocks the path between <italic>X</italic> and <italic>Y</italic>. In the middle and bottom panels in Figure <xref ref-type="fig" rid="F2">2</xref> two non-colliders (a chain and a fork) are shown. We want to notice that in general one node is sufficient to block a path.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p><bold>Visualization of different path types</bold>. The orientation of edges with respect to the node <italic>Z</italic> allow the following distinction. Top: collider; middle: chain; bottom: fork.</p></caption>
<graphic xlink:href="fgene-03-00008-g002.tif"/>
</fig>
<p>A systematic connection between independence relations among variables <italic>V</italic> forming a DAG, and a process in form of a probability distribution <italic>P</italic>, can be given with the help of d-separation and Markov independencies. Formally, this connection is provided by an I-map (independency map). A DAG <italic>G</italic> is called an I-map of <italic>P</italic> if</p>
<disp-formula id="E1"><label>(1)</label><mml:math id="M7"><mml:msub><mml:mrow><mml:mfenced separators="" open="(" close=")"><mml:mrow><mml:mi>X</mml:mi><mml:mo>&#x02568;</mml:mo><mml:mi>Y</mml:mi><mml:mfenced separators="" open="|" close=""><mml:mrow><mml:mi>S</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mi>G</mml:mi></mml:mrow></mml:msub><mml:mo class="MathClass-rel">&#x021D2;</mml:mo><mml:msub><mml:mrow><mml:mfenced separators="" open="(" close=")"><mml:mrow><mml:mi>X</mml:mi><mml:mo>&#x02568;</mml:mo><mml:mi>Y</mml:mi><mml:mfenced separators="" open="|" close=""><mml:mrow><mml:mi>S</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mi>P</mml:mi></mml:mrow></mml:msub><mml:mo class="MathClass-punc">.</mml:mo></mml:math></disp-formula>
<p>Here the subscript <italic>G</italic> refers to the structure represented by a DAG and the independence relations provided by d-separation, and a &#x0201C;<italic>P&#x0201D;</italic> refers to the probability distribution whereas the independence relations corresponds to conditional Markov independencies. From equation (<xref ref-type="disp-formula" rid="E1">1</xref>) follows the definition of a Bayesian network (Pearl, <xref ref-type="bibr" rid="B77">1988</xref>).</p>
<p>Definition 3. <italic>Bayesian network</italic> Given a probability distribution <italic>P</italic> on a set of random variables <italic>V</italic>. A DAG, <italic>G</italic>&#x02009;&#x0003D;&#x02009;(<italic>V</italic>, <italic>E</italic>), is called a Bayesian network of <italic>P</italic> if and only if <italic>G</italic> is a minimal I-map of <italic>P</italic>.</p>
<p>Here minimal I-map means that if any edge in the DAG is deleted, <italic>G</italic> is no longer an I-map of <italic>P</italic>. There are alternative definitions of a Bayesian network (Pearl, <xref ref-type="bibr" rid="B78">2000</xref>), however, the definition given above emphasizes its connection to d-separation best. This is important to emphasize because Bayesian networks are first of all about independence relations, and not about specific probability distributions.</p>
<p>In our opinion, there are basically two principle approaches in the literature that are relevant for our contextual problem, which <italic>proof</italic> mathematically that, under certain conditions, they are capable of systematically inferring causal relations. The first principle approach, which is based on <italic>d-separation</italic>, has been independently developed by two group; Verma and Pearl (<xref ref-type="bibr" rid="B94">1991</xref>) and Spirtes et al. (<xref ref-type="bibr" rid="B88">1993</xref>). The second principle approach is from Chow and Liu (<xref ref-type="bibr" rid="B24">1968</xref>). Certainly, there are various variations of these two approaches, however, in the following, we focus only on the essential principles of both methods.</p>
<p>The first algorithm we discuss proven to reconstruct a causal structure is the <italic>inductive causation</italic> (IC) algorithm. For simplicity, we assume <italic>causal sufficiency</italic>, which means that latent variables are absent. The IC algorithm starts with the construction of an undirected dependency graph (UDG) by testing exhaustively for the independence of variable <italic>X</italic> from <italic>Y</italic> given a set <italic>S<sub>xy</sub></italic>. Exhaustively means, for any set <italic>S<sub>xy</sub></italic> that can be formed among the available variables. We denote this by (<italic>X</italic>&#x02009;&#x02568;&#x02009;<italic>Y</italic>&#x02009;|&#x02009;<italic>S<sub>xy</sub></italic>)<sub><italic>P</italic></sub> emphasizing explicitly that this independence is with respect to an underlying distribution <italic>P</italic>.</p>
<p>The basic principle on which the IC algorithm is based on, is a connection between the d-separation relations of a DAG <italic>G</italic> and independence relations of a probability distribution <italic>P</italic> consistent with the structure of the DAG <italic>G</italic>. The crucial point is that the distribution <italic>P</italic> and its independence relations are not given (known) but they need to be estimated from data, generated from the distribution <italic>P</italic>. Schematically, this is outlined in Figure <xref ref-type="fig" rid="F3">3</xref>. Here, the red line corresponds to the experimentally generation of the data, e.g., by means of DNA microarrays, and the blue line corresponds to the theoretical, however, unobservable (that&#x02019;s why this line is dashed), independence relations present in the DAG <italic>G</italic> that can be expressed by the concept of d-separation. Based on the data, an inference method aims for estimating these independence relations statistically. The gray box marks the range of the inference method.</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p><bold>Schematic visualization of the importance of the d-separation concept in the inferential task to recover the structure of a DAG <italic>G</italic></bold>. The red, green, and blue lines refer to experimental, inferential, or theoretical (unobservable) steps. For notational simplicity, we use also in the inferential step the symbol <italic>P</italic> instead of <inline-formula><mml:math id="M2"><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>P</mml:mi></mml:mrow><mml:mo class="MathClass-op">&#x00304;</mml:mo></mml:mover></mml:mrow></mml:math></inline-formula> to indicate an estimate of <italic>P</italic>.</p></caption>
<graphic xlink:href="fgene-03-00008-g003.tif"/>
</fig>
<p>The second algorithm proven to reconstruct a causal structure is from Chow and Liu (<xref ref-type="bibr" rid="B24">1968</xref>). It is based on the MWST (maximum weight spanning tree) algorithm, which connects node pairs in a way that the resulting network structure forms a tree with a minimal sum of edge weights. It is of interest to note that the proof of the MWST algorithm (see for example Pearl, <xref ref-type="bibr" rid="B77">1988</xref>) is based on the <italic>data processing inequality</italic> (DPI; Cover and Thomas, <xref ref-type="bibr" rid="B25">1991</xref>), which will be discussed later in the paper for ARACNE.</p>
</sec>
<sec>
<label>2.2</label> <title>Categorization of methods</title>
<p>In this section we present an overview and a categorization of methods that have been <italic>specifically</italic> introduced to infer gene regulatory networks from observational gene expression data. Previously, there have been other reviews that attempted to do this (Werhli et al., <xref ref-type="bibr" rid="B96">2006</xref>; Bansal et al., <xref ref-type="bibr" rid="B7">2007</xref>; Margolin and Califano, <xref ref-type="bibr" rid="B67">2007</xref>; Markowetz and Spang, <xref ref-type="bibr" rid="B70">2007</xref>; Hache et al., <xref ref-type="bibr" rid="B49">2009</xref>; Olsen et al., <xref ref-type="bibr" rid="B76">2009</xref>; de Smet and Marchal, <xref ref-type="bibr" rid="B32">2010</xref>; Penfold and Wild, <xref ref-type="bibr" rid="B79">2011</xref>), however, their emphasis and conceptualization deviates from ours.</p>
<sec>
<label>2.2.1</label> <title>Correlation-based estimation methods</title>
<sec>
<label>2.2.1.1</label> <title>Co-expression networks</title>
<p>There are two principally different ways to construct co-expression networks from microarray data. One approach follows a hard- and the other a soft-thresholding of correlation coefficients (Zhou et al., <xref ref-type="bibr" rid="B109">2002</xref>; Horvath and Dong, <xref ref-type="bibr" rid="B51">2008</xref>). In the following <italic>r<sub>ij</sub></italic>&#x02009;&#x0003D;&#x02009;|<italic>cor</italic>(<italic>i</italic>,<italic>j</italic>)| is the absolute value of the Pearson correlation coefficient. In the case of hard-thresholding, the correlation coefficient <italic>r<sub>ij</sub></italic> between gene <italic>i</italic> and <italic>j</italic> is assessed with respect to a threshold &#x003C4; and both genes are connected by an edge, <italic>A<sub>ij</sub></italic>&#x02009;&#x0003D;&#x02009;<italic>A<sub>ji</sub></italic>&#x02009;&#x0003D;&#x02009;1, if <italic>r<sub>ij</sub></italic>&#x02009;&#x02265;&#x02009;&#x003C4;, otherwise the genes remain unconnected, <italic>A<sub>ij</sub></italic>&#x02009;&#x0003D;&#x02009;<italic>A<sub>ji</sub></italic>&#x02009;&#x0003D;&#x02009;0. The threshold or cut-off parameter &#x003C4; can be obtained from randomization of the data allowing the assessment of statistical significance (Carter et al., <xref ref-type="bibr" rid="B18">2004</xref>). This results in an undirected, unweighted network.</p>
<p>For the soft-thresholding two types of adjacency functions are frequently used (Zhang and Horvath, <xref ref-type="bibr" rid="B106">2005</xref>). The sigmoid function,</p>
<disp-formula id="E2"><label>(2)</label><mml:math id="M8"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mrow><mml:mn>1</mml:mn><mml:mo>+</mml:mo><mml:mi>exp</mml:mi><mml:mo>&#x02061;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mo>&#x02212;</mml:mo><mml:mi>&#x003B1;</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>r</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02212;</mml:mo><mml:msub><mml:mi>&#x003C4;</mml:mi><mml:mn>0</mml:mn></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
<p>and the power adjacency function,</p>
<disp-formula id="E3"><label>(3)</label><mml:math id="M9"><mml:msub><mml:mrow><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo class="MathClass-rel">=</mml:mo><mml:mo class="MathClass-rel">|</mml:mo><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:msup><mml:mrow><mml:mo class="MathClass-rel">|</mml:mo></mml:mrow><mml:mrow><mml:mi>&#x003B2;</mml:mi></mml:mrow></mml:msup><mml:mo class="MathClass-punc">.</mml:mo></mml:math></disp-formula>
<p>Both types of adjacency functions lead to undirected but weighted networks. In order to choose the above parameters appropriately Zhang and Horvath (<xref ref-type="bibr" rid="B106">2005</xref>) suggested to adjust them in a way that the resulting network has approximately a scale-free degree distribution.</p>
<p>We would like to remark that the purpose for the construction of co-expression networks is different to all other methods discussed in this paper. Co-expression networks serve as means to explore the functionality of genes on a systems level (Zhang and Horvath, <xref ref-type="bibr" rid="B106">2005</xref>) and do not aim to be causal representations of regulatory networks. Nevertheless, we included them in this review to point out that also networks that are not causal can be very useful in gaining biological understanding and insights.</p>
</sec>
<sec>
<label>2.2.1.2</label> <title>Asymmetric-N</title>
<p>Asymmetric-N is an algorithm that takes the fact into account that biological networks contain hubs. It is a modified version of Symmetric-N (Agrawal, <xref ref-type="bibr" rid="B1">2002</xref>) which was designed for the construction of co-expression networks (Chen et al., <xref ref-type="bibr" rid="B20">2008a</xref>). The two-step algorithm of Agrawal (<xref ref-type="bibr" rid="B1">2002</xref>), Symmetric-N, utilizes the N-nearest-neighbor concept. In a first step, for each node in the network its correlation to other nodes is calculated and sorted in descending order. In a second step, each pair of nodes is evaluated one by one and if for a pair of nodes both are in the corresponding N-nearest-neighbors list, a connection between them is included. Otherwise, they are not connected. Here <italic>N</italic> is a pre-defined cut-off value for the number of neighbor nodes to be considered (Agrawal, <xref ref-type="bibr" rid="B1">2002</xref>). It is demonstrated in Agrawal (<xref ref-type="bibr" rid="B1">2002</xref>) that the inferred network has a scale-free topology.</p>
<p>Instead of using <italic>N</italic> neighbors for all nodes as potential connections, a higher number <italic>N<sub>C</sub></italic> is assigned to some core nodes (e.g., TFs) and a smaller number <italic>N<sub>P</sub></italic> is assigned to periphery nodes (e.g., non-TFs). Since the numbers of potential neighbors are different for core and peripheral nodes, the algorithm was called Asymmetric-N (Chen et al., <xref ref-type="bibr" rid="B20">2008a</xref>). Asymmetric-N employs the intuition that transcription factors (TFs) are more frequently connected to other genes than regulated genes. An interesting result found in Chen et al. (<xref ref-type="bibr" rid="B20">2008a</xref>) is that Asymmetric-N performed better than Bayesian networks for small sample sizes.</p>
</sec>
<sec>
<label>2.2.1.3</label> <title>SEM (structural equation model)</title>
<p>Xiong et al. (<xref ref-type="bibr" rid="B104">2004</xref>) use a structural equation model (SEM; J&#x000F6;reskog, <xref ref-type="bibr" rid="B56">1973</xref>; Bollen, <xref ref-type="bibr" rid="B13">1989</xref>) to reconstruct gene networks. Structural equation models combine path analysis (Wright, <xref ref-type="bibr" rid="B101">1921</xref>, <xref ref-type="bibr" rid="B102">1934</xref>) and confirmatory factor analysis (J&#x000F6;reskog, <xref ref-type="bibr" rid="B55">1969</xref>; Brown, <xref ref-type="bibr" rid="B16">2006</xref>).</p>
<p>Xiong et al.&#x02019;s (<xref ref-type="bibr" rid="B104">2004</xref>) approach starts by assuming that the expression level of genes, <italic>X</italic>, can be described linearly by</p>
<disp-formula id="E4"><label>(4)</label><mml:math id="M10"><mml:mi>X</mml:mi><mml:mo class="MathClass-rel">=</mml:mo><mml:mi>B</mml:mi><mml:mi>X</mml:mi><mml:mo class="MathClass-bin">&#x0002B;</mml:mo><mml:mi>&#x00393;</mml:mi><mml:mi>&#x003BE;</mml:mi><mml:mo class="MathClass-bin">&#x0002B;</mml:mo><mml:mi>&#x003B5;</mml:mi><mml:mo class="MathClass-punc">.</mml:mo></mml:math></disp-formula>
<p>Here <italic>X</italic> is a vector of <italic>p</italic> endogenous and &#x003BE; a vector of <italic>q</italic> exogenous variables, and <italic>B</italic> and &#x00393; are <italic>p</italic>&#x02009;&#x000D7;&#x02009;<italic>p</italic> respectively <italic>p</italic>&#x02009;&#x000D7;&#x02009;<italic>q</italic> coupling matrices. The last term in equation (<xref ref-type="disp-formula" rid="E4">4</xref>), &#x003B5;, represents noise. The structure of the regulatory gene network is coded by the entries of the coupling matrices <italic>B</italic> and &#x00393;. The structural equation in equation (<xref ref-type="disp-formula" rid="E4">4</xref>) is learned in two steps. First, assuming the structure of the network is known the coupling parameters are estimated via maximum likelihood optimizing the distance between the (<italic>p</italic>&#x02009;&#x0002B;&#x02009;<italic>q</italic>)&#x02009;&#x000D7;&#x02009;(<italic>p</italic>&#x02009;&#x0002B;&#x02009;<italic>q</italic>) covariance matrix &#x003A3;(&#x00398;) and the sample covariance matrix <italic>S</italic> estimated from the data. Here, &#x00398;&#x02009;&#x0003D;&#x02009;<italic>f</italic>&#x02009;(<italic>B</italic>, &#x00393;, &#x003A6;, &#x003A8;), is a function of the parameters of the SEM and the covariances of &#x003BE; (&#x003A6;) and &#x003B5; (&#x003A8;) that can be expressed analytically. The second step consists in finding the network structure (of <italic>B</italic> and &#x00393;) by an optimization method whereas the different models are assessed by Akaike&#x02019;s information criterion (AIC).</p>
</sec>
<sec>
<label>2.2.1.4</label> <title>Low-order partial correlation</title>
<p>Partial correlations of low-order have been employed in de la Fuente et al. (<xref ref-type="bibr" rid="B30">2004</xref>), Magwene and Kim (<xref ref-type="bibr" rid="B66">2004</xref>), Wille and B&#x000FC;hlmann (<xref ref-type="bibr" rid="B98">2006</xref>). This approach is capable to infer undirected networks only, because a correlation, of any order, is symmetric in its arguments. The principle working mechanism of this approach is as follows. Starting with a fully connected network, interactions between genes are iteratively excluded by moving toward an increasing order of the correlation coefficient. More specifically, starting from a fully connected adjacency matrix <italic>A</italic>, one calculates in the first step all pair-wise correlation coefficients, &#x003C1;<italic><sub>ij</sub></italic>&#x02009;&#x0003D;&#x02009;&#x003C1;(<italic>X<sub>i</sub></italic>, <italic>X<sub>j</sub></italic>). If &#x003C1;<italic><sub>ij</sub></italic>&#x02009;&#x0003D;&#x02009;0 the connection between gene <italic>X<sub>i</sub></italic> and <italic>X<sub>j</sub></italic> is deleted, <italic>A<sub>ij</sub></italic>&#x02009;&#x0003D;&#x02009;<italic>A<sub>ji</sub></italic>&#x02009;&#x0003D;&#x02009;0. Inthe second step all partial correlation coefficients of first-order, <inline-formula><mml:math id="M3"><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x003C1;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mo>|</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo class="MathClass-rel">=</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003C1;</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo class="MathClass-punc">,</mml:mo><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub><mml:mo class="MathClass-punc">,</mml:mo></mml:mrow></mml:math></inline-formula> are calculated for all triplets of genes with &#x003C1;<italic><sub>ij</sub></italic>&#x02009;&#x02260;&#x02009;0. Now, if there is at least one gene <italic>X<sub>k</sub></italic> for which &#x003C1;<italic><sub>ij</sub>|<italic>k</italic></italic>&#x02009;&#x0003D;&#x02009;0 holds the connection between gene <italic>X<sub>i</sub></italic> and <italic>X<sub>j</sub></italic> is deleted, <italic>A<sub>ij</sub></italic>&#x02009;&#x0003D;&#x02009;<italic>A<sub>ji</sub></italic>&#x02009;&#x0003D;&#x02009;0. Continuation to higher orders is analogously. The exclusion of interactions is based on testing the null hypothesis &#x003C1;<italic><sub>ij</sub></italic>&#x02009;&#x0003D;&#x02009;0 against the alternative hypothesis &#x003C1;<italic><sub>ij</sub></italic>&#x02009;&#x02260;&#x02009;0 for all orders of the (partial) correlation coefficients. In (de la Fuente et al., <xref ref-type="bibr" rid="B30">2004</xref>) this approach was applied up to order three. The resulting network from this procedure is frequently called an <italic>undirected dependency graph</italic> (UDG; Shipley, <xref ref-type="bibr" rid="B86">2000</xref>). An implementation of this approach is called ParCorA (de la Fuente et al., <xref ref-type="bibr" rid="B30">2004</xref>).</p>
</sec>
<sec>
<label>2.2.1.5</label> <title>GGM (graphical Gaussian model)</title>
<p>GGM, also known as covariance selection model, concentration graph, or Markov random field (Dempster, <xref ref-type="bibr" rid="B34">1972</xref>; Whittaker, <xref ref-type="bibr" rid="B97">1990</xref>; Koller and Friedman, <xref ref-type="bibr" rid="B59">2009</xref>), is a graphical model which assumes that all variables are distributed according to a multivariate normal distribution with a specific structure of the inverse of the covariance matrix, &#x003A9;&#x02009;&#x0003D;&#x02009;&#x003A3;<sup>&#x02212;1</sup>, also called precision or concentration matrix. Network inference methods based on GGM make use of the relation,</p>
<disp-formula id="E5"><label>(5)</label><mml:math id="M11"><mml:msub><mml:mrow><mml:mi>&#x003C1;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mo class="MathClass-rel">|</mml:mo><mml:mi>V</mml:mi><mml:mo class="MathClass-bin">\</mml:mo><mml:mrow><mml:mo class="MathClass-open">{</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow><mml:mo class="MathClass-close">}</mml:mo></mml:mrow></mml:mrow></mml:msub><mml:mo class="MathClass-rel">=</mml:mo><mml:mo class="MathClass-bin">-</mml:mo><mml:mfrac><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x003C9;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msqrt><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x003C9;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>&#x003C9;</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msqrt></mml:mrow></mml:mfrac><mml:mo class="MathClass-punc">,</mml:mo></mml:math></disp-formula>
<p>connecting the partial correlation of full-order with the elements of &#x003A9;, <italic>w<sub>ij</sub></italic>&#x02009;&#x02208;&#x02009;&#x003A9;. The partial correlation is of full-order (with respect to the number of genes) because <italic>V</italic>\{<italic>ij</italic>} is the set of all genes excluding <italic>i</italic> and <italic>j</italic>, i.e., the largest possible set of genes not considering <italic>i</italic> and <italic>j</italic>. In case of a vanishing partial correlation in equation (<xref ref-type="disp-formula" rid="E5">5</xref>) one can write</p>
<disp-formula id="E6"><label>(6)</label><mml:math id="M12"><mml:mfenced separators="" open="(" close=")"><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02568;</mml:mo><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo class="MathClass-rel">|</mml:mo><mml:mi>V</mml:mi><mml:mo class="MathClass-bin">\</mml:mo><mml:mrow><mml:mo class="MathClass-open">{</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow><mml:mo class="MathClass-close">}</mml:mo></mml:mrow></mml:mrow></mml:mfenced><mml:mo class="MathClass-rel">=</mml:mo><mml:mn>0</mml:mn><mml:mo class="MathClass-punc">.</mml:mo></mml:math></disp-formula>
<p>From this conditional independence relation, the principle way to infer a network structure from GGM becomes apparent estimating a network in the following way. If &#x003C1;<italic><sub>ij</sub></italic>|V\{<italic>ij</italic>}&#x02009;&#x02260;&#x02009;0, according to a hypothesis test, we include an edge, <italic>A<sub>ij</sub></italic>&#x02009;&#x0003D;&#x02009;<italic>A<sub>ji</sub></italic>&#x02009;&#x0003D;&#x02009;1, otherwise there is no edge between <italic>i</italic> and <italic>j</italic>.</p>
<p>Several approaches have been made to infer gene regulatory networks based on GGM (Wille et al., <xref ref-type="bibr" rid="B99">2004</xref>; Sch&#x000E4;fer and Strimmer, <xref ref-type="bibr" rid="B85">2005</xref>; Li and Gui, <xref ref-type="bibr" rid="B62">2006</xref>). These methods differ in the way the inverse of the covariance matrix, &#x003A3;<sup>&#x02212;1</sup>, is estimated and in the statistical tests employed to define significance. The reason for these technical variants comes from a variety of problems. First, if the number of samples is smaller than the number of genes, which is typically the case for genomics data, the sample covariance matrix is not positive definite and, hence, not invertible. Another problem is caused by the small samples size problem (Sch&#x000E4;fer and Strimmer, <xref ref-type="bibr" rid="B85">2005</xref>).</p>
</sec>
<sec>
<label>2.2.1.6</label> <title>BN (Bayesian networks)</title>
<p>Bayesian networks allow identifying a DAG structure of a network. BN were among the first methods that have been applied to expression data to infer GRN (Friedman et al., <xref ref-type="bibr" rid="B44">2000</xref>; Hartemink et al., <xref ref-type="bibr" rid="B50">2001</xref>). In order to overcome the problem that only acyclic networks can be inferred by BN, which would be a severe limitation considering the fact that real biological networks contain many feedback loops and are, hence, cyclic, a modified method called <italic>dynamic Bayesian network</italic> (DBN) is used instead. Briefly, DBN unfold cyclic processes by mapping them onto a sequence of acyclic events, which can then be analyzed with a BN (Dean and Kanazawa, <xref ref-type="bibr" rid="B33">1990</xref>; Koller and Friedman, <xref ref-type="bibr" rid="B59">2009</xref>). For the inference of GRN, this approach has been applied in Husmeier (<xref ref-type="bibr" rid="B52">2003</xref>), Perrin et al. (<xref ref-type="bibr" rid="B80">2003</xref>), Zou and Conzen (<xref ref-type="bibr" rid="B110">2005</xref>). The variations in these approaches provide different methods for the <italic>structure learning</italic> problem, which is the integral part of a BN learned from data. Principally, a major practical problem of a BN is that the computational complexity of algorithms to learn their structure has been shown to be NP-hard for score-based approaches (Chickering et al., <xref ref-type="bibr" rid="B23">1995</xref>; Chickering, <xref ref-type="bibr" rid="B22">1996</xref>). Hence, BN or DBN can be either only applied to quite small networks or heuristic approximation methods have to be used that separate the score into tractable units that can be optimized by locally constraint search techniques making the computational complexity manageable (Friedman et al., <xref ref-type="bibr" rid="B45">1999</xref>, <xref ref-type="bibr" rid="B44">2000</xref>).</p>
<p>In Figure <xref ref-type="fig" rid="F6">6</xref>, we categorize BN as correlation-based methods because the simplest statistical realization of a BN is obtained by estimating (<italic>X</italic>&#x02009;&#x02568;&#x02009;<italic>Y</italic>|<italic>S</italic>)<italic><sub>P</sub></italic>, see section <xref ref-type="sec" rid="s1">1</xref>, by means of partial correlation coefficients (Spirtes et al., <xref ref-type="bibr" rid="B88">1993</xref>; Pearl, <xref ref-type="bibr" rid="B78">2000</xref>). However, also non-linear approaches are possible.</p>
</sec>
</sec>
<sec>
<label>2.2.2</label> <title>Information-theory based estimation methods</title>
<p>In the following, we restrict our discussion to the working mechanisms of the discussed procedures. However, we want to emphasize that also for these methods the employed statistical estimators are of importance (Beirlant et al., <xref ref-type="bibr" rid="B10">1997</xref>; Steuer et al., <xref ref-type="bibr" rid="B89">2002</xref>; Daub et al., <xref ref-type="bibr" rid="B28">2004</xref>; Kraskov et al., <xref ref-type="bibr" rid="B60">2004</xref>; Khan et al., <xref ref-type="bibr" rid="B58">2007</xref>).</p>
<sec>
<label>2.2.2.1</label> <title>A. Mutual information-based</title>
<p><italic>RN (relevance networks)</italic>. The principle idea of RN (Butte and Kohane, <xref ref-type="bibr" rid="B17">2000</xref>) is to compute all mutual information (MI) values for all pairs of genes, for a given data set, and declare mutual information values as relevant if their corresponding value is larger than a given threshold <italic>I</italic><sub>0</sub>. The resulting network is constructed based on this threshold by including an edge between two genes in the respective adjacency matrix of the network, <italic>A<sub>ij</sub></italic>&#x02009;&#x0003D;&#x02009;<italic>A<sub>ji</sub></italic>&#x02009;&#x0003D;&#x02009;1, if <italic>I<sub>ij</sub></italic>&#x02009;&#x0003E;&#x02009;<italic>I</italic><sub>0</sub>, otherwise no edge is included between <italic>i</italic> and <italic>j</italic>. In Butte and Kohane (<xref ref-type="bibr" rid="B17">2000</xref>) the threshold <italic>I</italic><sub>0</sub> was found by randomization of the expression data set. From this randomization, mutual information values were re-calculated from which a reference distribution of mutual information values, resembling a null-distribution, was obtained. Based on this reference distribution the threshold <italic>I</italic><sub>0</sub> was obtained by heuristic arguments. For this reason, mutual information values that are larger than <italic>I</italic><sub>0</sub> are called relevant but cannot necessarily be called statistically significant.</p>
<p><italic>ARACNE (algorithm for the reconstruction of accurate cellular networks)</italic>. ARACNE (Basso et al., <xref ref-type="bibr" rid="B9">2005</xref>; Margolin et al., <xref ref-type="bibr" rid="B68">2006</xref>) is similar to RN because it also uses mutual information and a randomization of the data to obtain a threshold <italic>I</italic><sub>0</sub> allowing to declare mutual information values significant if <italic>I<sub>ij</sub></italic>&#x02009;&#x0003E;&#x02009;<italic>I</italic><sub>0</sub><xref ref-type="fn" rid="fn1"><sup>1</sup></xref>. If <italic>I<sub>ij</sub></italic> is found to be significant, than an edge is included in the corresponding adjacency matrix between gene <italic>i</italic> and <italic>j</italic>, <italic>A<sub>ij</sub></italic>&#x02009;&#x0003D;&#x02009;<italic>A<sub>ji</sub></italic>&#x02009;&#x0003D;&#x02009;1, otherwise no edge is included. However, in contrast to RN, ARACNE performs a second step testing all gene-triplets (three genes with mutual information values larger than <italic>I</italic><sub>0</sub>) such that, for each triplet (<italic>ijk</italic>), the edge corresponding to the lowest mutual information value <inline-formula><mml:math id="M4"><mml:mrow><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo class="MathClass-rel">=</mml:mo><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup><mml:msup><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo class="MathClass-punc">,</mml:mo></mml:mrow></mml:math></inline-formula> with (<italic>i</italic>&#x02032;<italic>j</italic>&#x02032;)&#x02009;&#x0003D;&#x02009;argmin{<italic>I<sub>ij</sub></italic>, <italic>I<sub>jk</sub></italic>, <italic>I<sub>ik</sub></italic>}, is eliminated from the adjacency matrix, if it is smaller than the second smallest MI value <italic>I</italic><sub>2</sub> multiplied by a factor, i.e.,</p>
<disp-formula id="E7"><label>(7)</label><mml:math id="M13"><mml:msub><mml:mrow><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup><mml:msup><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo class="MathClass-rel">=</mml:mo><mml:msub><mml:mrow><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup><mml:msup><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo class="MathClass-rel">=</mml:mo><mml:mfenced separators="" open="{" close=""><mml:mrow><mml:mtable style="text-align:axis;" equalrows="false" columnlines="none" equalcolumns="false" class="array"><mml:mtr><mml:mtd class="array" columnalign="left"><mml:mn>0</mml:mn><mml:mspace width="1em" class="quad"/></mml:mtd><mml:mtd class="array" columnalign="left"><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup><mml:msup><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo class="MathClass-rel">&#x02264;</mml:mo><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mn>1</mml:mn><mml:mo class="MathClass-bin">-</mml:mo><mml:mi>&#x003F5;</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="array" columnalign="left"><mml:mn>1</mml:mn><mml:mspace width="1em" class="quad"/></mml:mtd><mml:mtd class="array" columnalign="left"><mml:mstyle class="text"><mml:mtext>otherwise</mml:mtext></mml:mstyle><mml:mo class="MathClass-punc">.</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="array" columnalign="left"><mml:mspace width="1em" class="quad"/></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mfenced></mml:math></disp-formula>
<p>Here 0&#x02009;&#x02264;&#x02009;&#x02208;&#x02009;&#x02264;&#x02009;1. The introduction of this step has been motivated by the so called <italic>data processing inequality</italic> (DPI; Cover and Thomas, <xref ref-type="bibr" rid="B25">1991</xref>). The DPI is a relation between mutual information values which means loosely that a post-processing of data cannot increase its information content. Specifically, one can show (Cover and Thomas, <xref ref-type="bibr" rid="B25">1991</xref>) that the DPI for the following relation between the three random variables,</p>
<disp-formula id="E8"><label>(8)</label><mml:math id="M14"><mml:mi>X</mml:mi><mml:mo class="MathClass-rel">&#x02192;</mml:mo><mml:mi>Y</mml:mi><mml:mo class="MathClass-rel">&#x02192;</mml:mo><mml:mi>Z</mml:mi><mml:mo class="MathClass-punc">,</mml:mo></mml:math></disp-formula>
<p>implies that <italic>I</italic>(<italic>X</italic>,<italic>Z</italic>)&#x02009;&#x02264;&#x02009;<italic>I</italic>(<italic>X</italic>,<italic>Y</italic>). Due to the fact that the criteria in equation (<xref ref-type="disp-formula" rid="E8">8</xref>) is for &#x02208;&#x02009;&#x0003E;&#x02009;0 less stringent than the DPI [equation (<xref ref-type="disp-formula" rid="E9">9</xref>)], &#x02208; is called <italic>tolerance parameter</italic>.</p>
<p>To ensure that the application of equation (<xref ref-type="disp-formula" rid="E8">8</xref>) results in an unique solution, independent of the order the triples have been selected, the procedure starts by listing all possible triplets in the network <italic>G</italic>. Then all of these triplets are tested. Hence, the results of these tests have no influence on subsequent tests of triplets. At the end of the second step, the resulting network represents the final result.</p>
<p>ARACNE employs two parameters, <italic>I</italic><sub>0</sub> and &#x02208;. The cut-off parameter <italic>I</italic><sub>0</sub> is determined by a resampling method estimating the distribution of the null hypothesis corresponding to a vanishing mutual information. This allows to assign <italic>p</italic>-values to mutual information values. In contrast, the DPI is not directly connected to statistical inference, but serves as a filtering step. Optimal values of &#x02208; are found from simulation studies that allow a comparison with the underlying true network. Hence, <italic>I</italic><sub>0</sub> is found in an unsupervised and &#x02208; in a supervised way of learning.</p>
<p><italic>CLR (context likelihood of relatedness)</italic>. CLR (Faith et al., <xref ref-type="bibr" rid="B42">2007</xref>) is also an extension of RN. It starts by estimating the pair-wise mutual information values for all genes. Then, CLR estimates the statistical likelihood of each MI value <italic>I<sub>ij</sub></italic>, for a particular pair of genes (<italic>ij</italic>), by comparing this MI value to a background. Specifically, for each gene pair (<italic>ij</italic>) two <italic>z</italic>-scores are obtained, one for gene <italic>i</italic> and one for gene <italic>j</italic>, by comparing the mutual information value <italic>I<sub>ij</sub></italic> with gene specific distributions, <italic>p<sub>i</sub></italic> and <italic>p<sub>j</sub></italic>. Here the two distributions <italic>p<sub>i</sub></italic> and <italic>p<sub>j</sub></italic> correspond to the distributions of mutual information values related to gene <italic>i</italic> ({<italic>I<sub>ik</sub></italic>|<italic>k</italic>&#x02009;&#x02208;&#x02009;<italic>V</italic>}) and gene <italic>j</italic> ({<italic>I<sub>jk</sub></italic>|<italic>k</italic>&#x02009;&#x02208;&#x02009;<italic>V</italic>}). By making a normality assumption about these distributions, corresponding <italic>z</italic>-scores, <italic>z<sub>i</sub></italic> and <italic>z<sub>j</sub></italic>, can be obtained from which the joint likelihood measure <inline-formula><mml:math id="M5"><mml:mrow><mml:mover accent="false" class="mml-overline"><mml:mrow><mml:msub><mml:mrow><mml:mi>z</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo accent="true">&#x000AF;</mml:mo></mml:mover><mml:mo class="MathClass-rel">=</mml:mo><mml:msqrt><mml:mrow><mml:msubsup><mml:mrow><mml:mi>z</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo class="MathClass-bin">&#x0002B;</mml:mo><mml:msubsup><mml:mrow><mml:mi>z</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mrow></mml:msqrt></mml:mrow></mml:math></inline-formula> is calculated. In contrast to RN and ARACNE, which employ a global threshold <italic>I</italic><sub>0</sub> for each mutual information value correspondingly pair of genes, CLR estimates <italic>individual</italic> thresholds by considering an <italic>individual</italic> background for each pair of genes. This procedure can be seen as a first attempt to take the network context of gene pairs into account.</p>
<p><italic>C3NET (conservative causal core)</italic>. C3NET consists of two main steps (Altay and Emmert-Streib, <xref ref-type="bibr" rid="B2">2010a</xref>). The first step is for the elimination of non-significant edges, whereas the second step selects for each gene the edge among the remaining ones with maximum mutual in formation value. The first step is similar to RN (Butte and Kohane, <xref ref-type="bibr" rid="B17">2000</xref>), ARACNE (Margolin et al., <xref ref-type="bibr" rid="B68">2006</xref>), or CLR (Faith et al., <xref ref-type="bibr" rid="B42">2007</xref>) essential for eliminating non-significant links, according to a chosen significance level &#x003B1;, between gene pairs. In the second step, the most significant link for each gene is selected. This link corresponds also to the highest MI value among the neighbor edges for each gene. This implies that the highest possible number of edges that can be inferred by C3NET is equal to the number of genes under consideration. This number can decrease for several reasons. For example, when two genes have the same edge with maximum MI value. In this case, the same edge would be chosen by both genes to be included in the network. However, if an edge is already present another inclusion does not lead to an additional edge. Another case corresponds to the situation when a gene does not have significant edges at all. In this case, apparently, no edge can be included in the network. Since C3NET employs MI values as test statistics among genes, there is no directional information that can be inferred thereof. Hence, the resulting network is undirected and unweighted. For a detailed explanation of C3NET and technical details, the reader is referred to Altay and Emmert-Streib (<xref ref-type="bibr" rid="B2">2010a</xref>, <xref ref-type="bibr" rid="B4">2011</xref>), de Matos Simoes and Emmert-Streib (<xref ref-type="bibr" rid="B31">2011</xref>).</p>
</sec>
<sec>
<label>2.2.2.2</label> <title>B. Extensions of mutual information and conditional mutual information</title>
<p><italic>SA-CLR (synergy augmented CLR)</italic>. As indicated by its name, SA-CLR is based on CLR but includes synergistic effects between genes (Anastassiou, <xref ref-type="bibr" rid="B5">2007</xref>; Watkinson et al., <xref ref-type="bibr" rid="B95">2009</xref>). Synergy,</p>
<disp-formula id="E9"><label>(9)</label><mml:math id="M15"><mml:mstyle class="text"><mml:mtext>Syn</mml:mtext></mml:mstyle><mml:mfenced separators="" open="(" close=")"><mml:mrow><mml:mi>X</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>Y</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:mi>Z</mml:mi></mml:mrow></mml:mfenced><mml:mo class="MathClass-rel">=</mml:mo><mml:mi>I</mml:mi><mml:mfenced separators="" open="(" close=")"><mml:mrow><mml:mi>X</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>Y</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:mi>Z</mml:mi></mml:mrow></mml:mfenced><mml:mo class="MathClass-bin">-</mml:mo><mml:mfenced separators="" open="(" close=")"><mml:mrow><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>X</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:mi>Z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mo class="MathClass-bin">&#x0002B;</mml:mo><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>Y</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:mi>Z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow></mml:mfenced><mml:mo class="MathClass-punc">,</mml:mo></mml:math></disp-formula>
<p>is defined as the difference between the generalized two-way mutual information,</p>
<disp-formula id="E10"><label>(10)</label><mml:math id="M16"><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>X</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>Y</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:mi>Z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mo class="MathClass-rel">=</mml:mo><mml:msub><mml:mrow><mml:msub><mml:mrow><mml:mo>&#x1D53C;</mml:mo></mml:mrow><mml:mrow><mml:mi>p</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>x</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>y</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow></mml:msub><mml:mi>l</mml:mi><mml:mi>o</mml:mi><mml:mi>g</mml:mi><mml:mfrac><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>x</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>y</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>x</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>y</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:mo class="MathClass-punc">,</mml:mo></mml:math></disp-formula>
<p>and the mutual information values for two random variables. A visualization of synergy is shown in Figure <xref ref-type="fig" rid="F4">4</xref>. In this figure, the different intersections of the three variables <italic>X</italic>, <italic>Y</italic>, and <italic>Z</italic> are labeled by lower case letters, whereas &#x0201C;e&#x0201D; corresponds to the synergy, Syn(<italic>X</italic>,<italic>Y</italic>;<italic>Z</italic>). Using the definitions of the occurring mutual information values one obtains a simplified form of the synergy, given by</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p><bold>Generalized Venn diagram showing the synergy Syn(<italic>X</italic>,<italic>Y</italic>;<italic>Z</italic>)&#x02009;&#x0003D;&#x02009;&#x02212;&#x02009;<italic>I</italic>(<italic>X</italic>;<italic>Y</italic>;<italic>Z</italic>)</bold>. The synergy is shown in red and blue because it can assume positive as well as negative values.</p></caption>
<graphic xlink:href="fgene-03-00008-g004.tif"/>
</fig>
<disp-formula id="E11"><mml:math id="M17"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd class="eqnarray-1"><mml:mstyle class="text"><mml:mtext>Syn</mml:mtext></mml:mstyle><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>X</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>Y</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:mi>Z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mtd><mml:mtd class="eqnarray-2"><mml:mo class="MathClass-rel">=</mml:mo><mml:msub><mml:mrow><mml:mo>&#x1D53C;</mml:mo></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>x</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>y</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow></mml:msub><mml:mi>l</mml:mi><mml:mi>o</mml:mi><mml:mi>g</mml:mi><mml:mfrac><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>x</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>y</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>x</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>y</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:mo class="MathClass-bin">-</mml:mo></mml:mtd><mml:mtd class="eqnarray-3"></mml:mtd><mml:mtd class="eqnarray-4"><mml:mtext class="eqnarray">(11)</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="eqnarray-1"></mml:mtd><mml:mtd class="eqnarray-2"><mml:mspace width="1em" class="quad"/><mml:mspace width="0.3em" class="thinspace"/><mml:mspace width="0.3em" class="thinspace"/><mml:msub><mml:mrow><mml:mo>&#x1D53C;</mml:mo></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>x</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow></mml:msub><mml:mi>l</mml:mi><mml:mi>o</mml:mi><mml:mi>g</mml:mi><mml:mfrac><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>x</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:mo class="MathClass-bin">-</mml:mo><mml:msub><mml:mrow><mml:mo>&#x1D53C;</mml:mo></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>y</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow></mml:msub><mml:mi>l</mml:mi><mml:mi>o</mml:mi><mml:mi>g</mml:mi><mml:mfrac><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>y</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac></mml:mtd><mml:mtd class="eqnarray-3"></mml:mtd><mml:mtd class="eqnarray-4"><mml:mtext class="eqnarray">(12)</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="eqnarray-1"></mml:mtd><mml:mtd class="eqnarray-2"><mml:mo class="MathClass-rel">=</mml:mo><mml:msub><mml:mrow><mml:mo>&#x1D53C;</mml:mo></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>x</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>y</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow></mml:msub><mml:mi>l</mml:mi><mml:mi>o</mml:mi><mml:mi>g</mml:mi><mml:mfrac><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>x</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>y</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>x</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>y</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:mfrac><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>x</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:mfrac><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>y</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac></mml:mtd><mml:mtd class="eqnarray-3"></mml:mtd><mml:mtd class="eqnarray-4"><mml:mtext class="eqnarray">(13)</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="eqnarray-1"></mml:mtd><mml:mtd class="eqnarray-2"><mml:mo class="MathClass-rel">=</mml:mo><mml:msub><mml:mrow><mml:mo>&#x1D53C;</mml:mo></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>x</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>y</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow></mml:msub><mml:mi>l</mml:mi><mml:mi>o</mml:mi><mml:mi>g</mml:mi><mml:mfrac><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>x</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>y</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>x</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>y</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>y</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>y</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac></mml:mtd><mml:mtd class="eqnarray-3"></mml:mtd><mml:mtd class="eqnarray-4"><mml:mtext class="eqnarray"></mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="eqnarray-1"></mml:mtd><mml:mtd class="eqnarray-2"><mml:mspace width="1em" class="quad"/><mml:mo class="MathClass-rel">=</mml:mo><mml:mo class="MathClass-bin">-</mml:mo><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>X</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:mi>Y</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:mi>Z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mtd><mml:mtd class="eqnarray-3"></mml:mtd><mml:mtd class="eqnarray-4"><mml:mtext class="eqnarray">(14)</mml:mtext></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>which is the negative of the three-dimensional interaction information (Watkinson et al., <xref ref-type="bibr" rid="B95">2009</xref>). We want to remark that <italic>I</italic>(<italic>X</italic>; <italic>Y</italic>; <italic>Z</italic>) can assume positive as well as negative values (McGill, <xref ref-type="bibr" rid="B71">1954</xref>). For this reason the intersection &#x0201C;e&#x0201D; in Figure <xref ref-type="fig" rid="F4">4</xref> is shown in blue and red to emphasize this. Due to the fact that a Venn diagram visualizes logical relations between variables Figure <xref ref-type="fig" rid="F4">4</xref> is no Venn diagram in the strict sense but a generalization thereof, allowing also to express negative values by including colors in the representation. The idea of SA-CLR consists in utilizing the synergy given in equation (<xref ref-type="disp-formula" rid="E10">10</xref>) in the following way,</p>
<disp-formula id="E12"><label>(15)</label><mml:math id="M18"><mml:mi>S</mml:mi><mml:mfenced separators="" open="(" close=")"><mml:mrow><mml:mi>X</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>Y</mml:mi></mml:mrow></mml:mfenced><mml:mo class="MathClass-rel">=</mml:mo><mml:munder class="msub"><mml:mrow><mml:mo class="MathClass-op">max</mml:mo></mml:mrow><mml:mrow><mml:mspace width="1em" class="nbsp" /><mml:mtable style="text-align:axis;" equalrows="false" columnlines="none none none none none none none none none" equalcolumns="false" class="array"><mml:mtr><mml:mtd class="array" columnalign="center"><mml:mi>Z</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="array" columnalign="center"><mml:mstyle class="text"><mml:mtext>for</mml:mtext></mml:mstyle><mml:mspace width="2.77695pt" class="tmspace"/><mml:mi>X</mml:mi><mml:mo class="MathClass-rel">&#x02260;</mml:mo><mml:mi>Z</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>Y</mml:mi><mml:mo class="MathClass-rel">&#x02260;</mml:mo><mml:mi>Z</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="array" columnalign="center"><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>X</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:mi>Z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mspace width="2.77695pt" class="tmspace"/><mml:mo class="MathClass-rel">&#x0003C;</mml:mo><mml:mspace width="2.77695pt" class="tmspace"/><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>X</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:mi>Y</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="array" columnalign="center"><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>X</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:mi>Z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mspace width="2.77695pt" class="tmspace"/><mml:mo class="MathClass-rel">&#x0003C;</mml:mo><mml:mspace width="2.77695pt" class="tmspace"/><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>Y</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:mi>Z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="array" columnalign="center"></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:munder><mml:mfenced separators="" open="(" close=")"><mml:mrow><mml:mo class="MathClass-bin">-</mml:mo><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>X</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:mi>Y</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:mi>Z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow></mml:mfenced><mml:mo class="MathClass-punc">.</mml:mo></mml:math></disp-formula>
<p>This term is called <italic>synergistic regulation index</italic> (SRI; Watkinson et al., <xref ref-type="bibr" rid="B95">2009</xref>). The interpretation of SRI is that for two given genes, <italic>X</italic> and <italic>Y</italic>, we are searching a third one, <italic>Z</italic>, that maximizes the synergy Syn(<italic>X</italic>, <italic>Y</italic>; <italic>Z</italic>) obeying the above constraints. This gene, <italic>Z</italic>, can be be arbitrarily chosen among all available genes, besides <italic>X</italic> and <italic>Y</italic>. The two constraints that involve mutual information, are chosen assuming that gene <italic>X</italic> and <italic>Z</italic> regulate <italic>Y</italic>. In this case, the mutual information between the two regulators, <italic>X</italic> and <italic>Z</italic>, should be smallest, motivating the constraints. Interestingly, these constraints introduce an asymmetry in <italic>X</italic> and <italic>Y</italic><xref ref-type="fn" rid="fn2"><sup>2</sup></xref> making it possible to speak about regulator and modulator genes. In the case of a positive synergy, a significant high value of SRI may indicate that gene <italic>X</italic> regulates genes <italic>Y</italic> (Watkinson et al., <xref ref-type="bibr" rid="B95">2009</xref>).</p>
<p>In order to detect also non-cooperative effects, the measure actually used by SA-CLR is <italic>I</italic>(<italic>X</italic>, <italic>Y</italic>)&#x02009;&#x0002B;&#x02009;<italic>S</italic>(<italic>X</italic>, <italic>Y</italic>), the mutual information between two genes plus their <italic>synergistic regulation index</italic>. The significance of <italic>I</italic>(<italic>X</italic>, <italic>Y</italic>)&#x02009;&#x0002B;&#x02009;<italic>S</italic>(<italic>X</italic>, <italic>Y</italic>) is detected in a similar way as described above for CLR.</p>
<p>It is interesting to note that</p>
<disp-formula id="E13"><label>(16)</label><mml:math id="M19"><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>X</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:mi>Y</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mo class="MathClass-bin">&#x0002B;</mml:mo><mml:mi>S</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>X</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>Y</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mo class="MathClass-rel">=</mml:mo><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>X</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:mi>Y</mml:mi><mml:mo class="MathClass-rel">|</mml:mo><mml:mi>Z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:msub><mml:mrow><mml:mo class="MathClass-rel">|</mml:mo></mml:mrow><mml:mrow><mml:mi>S</mml:mi><mml:mi>y</mml:mi><mml:mi>n</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>X</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>Y</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:mi>Z</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mspace width="2.77695pt" class="tmspace"/><mml:mo class="MathClass-rel">&#x0003E;</mml:mo><mml:mspace width="2.77695pt" class="tmspace"/><mml:mn>0</mml:mn></mml:mrow></mml:msub></mml:math></disp-formula>
<p>holds formally but is not identical to the (general) conditional mutual information <italic>I</italic>(<italic>X</italic>; <italic>Y</italic>|<italic>Z</italic>) because the synergy for <italic>X</italic>, <italic>Y</italic>, and <italic>Z</italic> has to be positive. To make this clear, we included this constraint visibly in equation (<xref ref-type="disp-formula" rid="E16">20</xref>). This allows SA-CLR, in contrast to CLR, to obtain a directed network by assigning an edge from gene <italic>X</italic> to <italic>Y</italic> if <italic>I</italic>(<italic>X</italic>; <italic>Y</italic>)&#x02009;&#x0002B;&#x02009;<italic>S</italic>(<italic>X</italic>, <italic>Y</italic>) is significant.</p>
<p><italic>MRNET (maximum relevance, minimum redundance)</italic>. MRNET (Meyer et al., <xref ref-type="bibr" rid="B73">2007</xref>) is an iterative algorithm that identifies potential interaction partners of a target gene <italic>Y</italic> that maximize a scoring function. Its working mechanism is as follows,</p>
<disp-formula id="E14"><mml:math id="M20"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd class="eqnarray-1"><mml:msubsup><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>s</mml:mi></mml:mrow></mml:msubsup></mml:mtd><mml:mtd class="eqnarray-2"><mml:mo class="MathClass-rel">=</mml:mo><mml:munder class="msub"><mml:mrow><mml:mtext>argmax</mml:mtext></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo class="MathClass-rel">&#x02208;</mml:mo><mml:mi>V</mml:mi><mml:mo class="MathClass-bin">\</mml:mo><mml:mi>S</mml:mi></mml:mrow></mml:munder><mml:mspace width="0.3em" class="thinspace"/><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mtd><mml:mtd class="eqnarray-3"></mml:mtd><mml:mtd class="eqnarray-4"><mml:mtext class="eqnarray">(17)</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="eqnarray-1"><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mtd><mml:mtd class="eqnarray-2"><mml:mo class="MathClass-rel">=</mml:mo><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo class="MathClass-punc">;</mml:mo><mml:mi>Y</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mo class="MathClass-bin">-</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mo class="MathClass-rel">|</mml:mo><mml:mi>S</mml:mi><mml:mo class="MathClass-rel">|</mml:mo></mml:mrow></mml:mfrac><mml:munder class="msub"><mml:mrow><mml:mo mathsize="big">&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo class="MathClass-rel">&#x02208;</mml:mo><mml:mi>S</mml:mi></mml:mrow></mml:munder><mml:mspace width="0.3em" class="thinspace"/><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo class="MathClass-punc">;</mml:mo><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mo class="MathClass-punc">.</mml:mo></mml:mtd><mml:mtd class="eqnarray-3"></mml:mtd><mml:mtd class="eqnarray-4"><mml:mtext class="eqnarray">(18)</mml:mtext></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Whenever a gene, <italic>X<sub>j</sub></italic>, is found with a score that maximizes equation (<xref ref-type="disp-formula" rid="E17">21</xref>) and <italic>s<sub>j</sub></italic> is above a threshold, <italic>s</italic><sub>0</sub>, then this gene is added to the set <italic>S</italic>. The algorithm is iterated until no further gene can be found that would pass the threshold test. The basic idea of MRNET is to find interaction partners for <italic>Y</italic> that are of maximal relevance [first term in equation (<xref ref-type="disp-formula" rid="E17">22</xref>)] for <italic>Y</italic>, but introduce a minimum redundancy [second term in equation (<xref ref-type="disp-formula" rid="E17">22</xref>)] with respect to the already found interaction partners in the set <italic>S</italic>. Starting with a fully connected, undirected network among all genes, MRNET reduces successively edges between <italic>Y</italic> and <italic>V</italic><italic>S</italic>, which have not maximized the score in equation <xref ref-type="disp-formula" rid="E17">22</xref>.</p>
<p>We want to remark that the score <italic>s<sub>j</sub></italic> can be approximated by the difference between two mutual information values,</p>
<disp-formula id="E15"><label>(19)</label><mml:math id="M21"><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo class="MathClass-rel">&#x02248;</mml:mo><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo class="MathClass-punc">;</mml:mo><mml:mi>Y</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mo class="MathClass-bin">-</mml:mo><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo class="MathClass-punc">;</mml:mo><mml:mi>A</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mo class="MathClass-punc">,</mml:mo></mml:math></disp-formula>
<p>where the (auxiliary) random variable, <italic>A</italic>, represents the influence of the variables in the set <italic>S</italic>. Equation (<xref ref-type="disp-formula" rid="E17">23</xref>) is visualized in the left Figure <xref ref-type="fig" rid="F5">5</xref>. Here, <italic>I</italic>(<italic>X<sub>j</sub></italic>; <italic>Y</italic>) corresponds to &#x0201C;d&#x0201D; and <italic>I</italic>(<italic>X<sub>j</sub></italic>; <italic>A</italic>) to &#x0201C;b.&#x0201D; Due to the fact that the mutual information is always positive, <italic>I</italic>(<italic>X<sub>j</sub></italic>; <italic>Y</italic>) adds a positive and <italic>I</italic>(<italic>X<sub>j</sub></italic>; <italic>A</italic>) a negative value to <italic>s<sub>j</sub></italic>, indicated by blue and red colors. It has been argued in (Meyer et al., <xref ref-type="bibr" rid="B73">2007</xref>) that due to the maximization of equation (<xref ref-type="disp-formula" rid="E17">22</xref>), MRNET approximates the conditional mutual information <italic>I</italic>(<italic>X<sub>j</sub></italic>;<italic>Y</italic>|S). This is also motivated by Figure <xref ref-type="fig" rid="F5">5</xref>.</p>
<fig id="F5" position="float">
<label>Figure 5</label>
<caption><p><bold>Generalized Venn diagrams of the information captured by different measures</bold>. Left: score <italic>s<sub>j</sub></italic> [equation (<xref ref-type="disp-formula" rid="E17">23</xref>)] for MRNET. Right: the <italic>MI</italic><sub>3</sub> measure of the MI3 algorithm. Blue corresponds to positive and red to a negative values.</p></caption>
<graphic xlink:href="fgene-03-00008-g005.tif"/>
</fig>
<p><italic>MI3 (mutual information 3)</italic>. The MI3 algorithm (Luo et al., <xref ref-type="bibr" rid="B65">2008</xref>) uses three-way mutual information for the inference, hypothesizing that gene regulation commonly involves more than one regulator genes. The value of the measure <italic>MI<sub>3</sub></italic> is defined as,</p>
<disp-formula id="E16"><label>(20)</label><mml:math id="M22"><mml:mi>M</mml:mi><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mfenced separators="" open="(" close=")"><mml:mrow><mml:mi>T</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo class="MathClass-punc">,</mml:mo><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfenced><mml:mo class="MathClass-rel">=</mml:mo><mml:mn>2</mml:mn><mml:mi>I</mml:mi><mml:mfenced separators="" open="(" close=")"><mml:mrow><mml:mi>T</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo class="MathClass-punc">,</mml:mo><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfenced><mml:mo class="MathClass-bin">-</mml:mo><mml:mfenced separators="" open="(" close=")"><mml:mrow><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>T</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mo class="MathClass-bin">&#x0002B;</mml:mo><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>T</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow></mml:mfenced><mml:mo class="MathClass-punc">,</mml:mo></mml:math></disp-formula>
<p>which equals the difference of mutual informations between the target gene and the two regulators and the target gene with one of the regulators. Alternatively, <italic>MI</italic><sub>3</sub> can also be written as</p>
<disp-formula id="E17"><mml:math id="M23"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd class="eqnarray-1"><mml:mi>M</mml:mi><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>T</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo class="MathClass-punc">,</mml:mo><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mtd><mml:mtd class="eqnarray-2"><mml:mo class="MathClass-rel">=</mml:mo><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>T</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo class="MathClass-rel">|</mml:mo><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mo class="MathClass-bin">&#x0002B;</mml:mo><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>T</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo class="MathClass-rel">|</mml:mo><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mtd><mml:mtd class="eqnarray-3"></mml:mtd><mml:mtd class="eqnarray-4"><mml:mtext class="eqnarray">(21)</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="eqnarray-1"></mml:mtd><mml:mtd class="eqnarray-2"><mml:mo class="MathClass-rel">=</mml:mo><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>T</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo class="MathClass-punc">,</mml:mo><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mo class="MathClass-bin">&#x0002B;</mml:mo><mml:mi>S</mml:mi><mml:mi>y</mml:mi><mml:mi>n</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>T</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo class="MathClass-punc">,</mml:mo><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mtd><mml:mtd class="eqnarray-3"></mml:mtd><mml:mtd class="eqnarray-4"><mml:mtext class="eqnarray">(22)</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="eqnarray-1"></mml:mtd><mml:mtd class="eqnarray-2"><mml:mo class="MathClass-rel">=</mml:mo><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>T</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo class="MathClass-punc">,</mml:mo><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mo class="MathClass-bin">-</mml:mo><mml:mi>I</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>T</mml:mi><mml:mo class="MathClass-punc">;</mml:mo><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo class="MathClass-punc">;</mml:mo><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mo class="MathClass-punc">.</mml:mo></mml:mtd><mml:mtd class="eqnarray-3"></mml:mtd><mml:mtd class="eqnarray-4"><mml:mtext class="eqnarray">(23)</mml:mtext></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Equation (<xref ref-type="disp-formula" rid="E18">25</xref>) is the sum of two conditional mutual informations between the target gene and a regulator given the other regulator, which emphasizes the three-way nature of this measure. Interestingly, <italic>MI</italic><sub>3</sub> has also a simple relation to synergy, as shown in equation (<xref ref-type="disp-formula" rid="E18">26</xref>). This follows directly from equations (<xref ref-type="disp-formula" rid="E10">10</xref> and <xref ref-type="disp-formula" rid="E18">24</xref>).</p>
<p>In the right Figure <xref ref-type="fig" rid="F5">5</xref>, we show a visualization of <italic>MI</italic><sub>3</sub>. Here, the shown intersections result from the correspondence of <italic>I</italic>(<italic>T</italic>; <italic>R</italic><sub>1</sub>, <italic>R</italic><sub>2</sub>) to &#x0201C;d&#x02009;&#x0002B;&#x02009;e&#x02009;&#x0002B;&#x02009;f,&#x0201D; <italic>I</italic>(<italic>T</italic>; <italic>R</italic><sub>1</sub>) to &#x0201C;d&#x02009;&#x0002B;&#x02009;e,&#x0201D; and <italic>I</italic>(<italic>T</italic>; <italic>R</italic><sub>2</sub>) to &#x0201C;e&#x02009;&#x0002B;&#x02009;f.&#x0201D; Subtraction of these terms according to equation (<xref ref-type="disp-formula" rid="E18">24</xref>) results in &#x0201C;d&#x02009;&#x0002B;&#x02009;f.&#x0201D; Note that <italic>I</italic>(<italic>T</italic>; <italic>R</italic><sub>1</sub>, <italic>R</italic><sub>2</sub>) is counted twice.</p>
<p>The MI3 algorithm learns gene regulatory networks in two steps: first, local regulatory networks consisting of only three genes (<italic>T</italic>, <italic>R</italic><sub>1</sub>, and <italic>R</italic><sub>2</sub>) are learned. Starting from a given gene, <italic>T</italic>, two regulator genes are searched which maximize MI3 (<italic>T</italic>, <italic>R</italic><sub>1</sub>, <italic>R</italic><sub>2</sub>). As a result, directed edges between <italic>R</italic><sub>1</sub> and <italic>T</italic> and <italic>R</italic><sub>2</sub> and <italic>T</italic> are added constituting a small regulatory network. Second, the local regulatory networks learned in step one are assembled. To ensure that the resulting network is acyclic, there may be a need to remove edges forming cycles in the assembled network. This is solved heuristically, by identifying all local three-gene networks that contribute to a cycle and elimination of all edges of them from the overall network, except from the one with the highest <italic>MI</italic><sub>3</sub> value. Overall, the MI3 algorithm aims to learn the optimal two-parent causal model for each target variable in the form R1&#x02009;&#x02192;&#x02009;T and R2&#x02009;&#x02192;&#x02009;T.</p>
<p><italic>CMI (conditional mutual information)</italic>. Soranzo et al. (<xref ref-type="bibr" rid="B87">2007</xref>) use only the conditional mutual information (CMI) to infer regulatory networks. They estimate all conditional mutual information values I(<italic>X<sub>i</sub></italic>; <italic>X<sub>j</sub></italic>&#x02009;<italic>|&#x02009;X<sub>k</sub></italic>) between triplets of genes. Starting from a fully connected network, they successively remove edges <italic>A<sub>ij</sub></italic>&#x02009;&#x0003D;&#x02009;<italic>A<sub>ji</sub></italic>&#x02009;&#x0003D;&#x02009;0 if I(<italic>X<sub>i</sub></italic>; <italic>X<sub>j</sub></italic>&#x02009;|&#x02009;<italic>X<sub>k</sub></italic>)&#x02009;&#x0003D;&#x02009;0 for at least one gene <italic>X<sub>k</sub></italic>, according to a threshold condition.</p>
<p><italic>MI-CMI</italic>. The MI-CMI algorithm (Liang and Wang, <xref ref-type="bibr" rid="B63">2008</xref>) uses both mutual information and conditional mutual information to estimate the network. Starting from a fully unconnected network <italic>G</italic>, the algorithm consists of three steps. First, all pairs of mutual information values <italic>I</italic>(<italic>X<sub>i</sub></italic>; <italic>X<sub>j</sub></italic>) are estimated. If <italic>I</italic>(<italic>X<sub>i</sub></italic>; <italic>X<sub>j</sub></italic>)&#x02009;&#x02265;&#x02009;<italic>I</italic><sub>0</sub>, then an edge is included, <italic>A<sub>ij</sub></italic>&#x02009;&#x0003D;&#x02009;<italic>A<sub>ji</sub></italic>&#x02009;&#x0003D;&#x02009;1. Second, for all triplets (<italic>X<sub>i</sub></italic>, <italic>X<sub>j</sub></italic>, <italic>X<sub>k</sub></italic>) with <italic>I</italic>(<italic>X<sub>i</sub></italic>; <italic>X<sub>j</sub></italic>)&#x02009;&#x02265;&#x02009;<italic>I</italic><sub>0</sub>, <italic>I</italic>(<italic>X<sub>i</sub></italic>; <italic>X<sub>k</sub></italic>)&#x02009;&#x02265;&#x02009;<italic>I</italic><sub>0</sub>, and <italic>I</italic>(<italic>X<sub>j</sub></italic>; <italic>X<sub>k</sub></italic>)&#x02009;&#x02265;&#x02009;<italic>I</italic><sub>0</sub> the conditional mutual information for all combinations of <italic>I</italic>(<italic>X<sub>i</sub></italic>, <italic>X<sub>j</sub></italic>, <italic>X<sub>k</sub></italic>) is estimated. Then based on heuristic rules, is it decided which edges in <italic>G</italic> are likely to result from indirect regulations (interactions). These edges will be removed from <italic>G</italic>. Third, for all triplets (<italic>X<sub>i</sub></italic>, <italic>X<sub>j</sub></italic>, <italic>X<sub>k</sub></italic>) with <italic>I</italic>(<italic>X<sub>i</sub></italic>; <italic>X<sub>j</sub></italic>)&#x02009;&#x0003C;&#x02009;<italic>I</italic><sub>0</sub>, <italic>I</italic>(<italic>X<sub>i</sub></italic>; <italic>X<sub>k</sub></italic>)&#x02009;&#x0003C;&#x02009;<italic>I</italic><sub>0</sub>, and <italic>I</italic>(<italic>X<sub>j</sub></italic>; <italic>X<sub>k</sub></italic>)&#x02009;&#x0003C;&#x02009;<italic>I</italic><sub>0</sub> the conditional mutual information for all combinations of (<italic>X<sub>i</sub></italic>; <italic>X<sub>j</sub></italic>; <italic>X<sub>k</sub></italic>) is estimated. Again, following some heuristic rules, edges that are likely the result from interactive regulations, which could not be detected by mutual information, are included in <italic>G</italic>. Special emphasize was given to the used statistical estimators, for the mutual information and conditional mutual information values.</p>
<p>Mutual information in combination with conditional mutual information was also used in Zhao et al. (<xref ref-type="bibr" rid="B108">2008</xref>), Zhang et al. (<xref ref-type="bibr" rid="B107">2011</xref>). However, in contrast to (Liang and Wang, <xref ref-type="bibr" rid="B63">2008</xref>), these approaches are conceptually closer to the <sc>pc</sc> algorithm (Spirtes et al., <xref ref-type="bibr" rid="B88">1993</xref>; Shipley, <xref ref-type="bibr" rid="B86">2000</xref>).</p>
</sec>
</sec>
</sec>
<sec>
<label>2.3</label> <title>Categorization from a dynamical perspective</title>
<p>In Figure <xref ref-type="fig" rid="F6">6</xref> we present an overview of inference methods to infer regulatory networks, as discussed in the previous section. We separated these methods in two main groups, depending on if the method is correlation- (pink) or information-based (orange). Within these two groups, there are various ways for further subdivision, however, we just distinguish between basic forms and higher order extensions &#x02013; as indicated by darker colors (see Figure <xref ref-type="fig" rid="F6">6</xref>).</p>
<fig id="F6" position="float">
<label>Figure 6</label>
<caption><p><bold>Overview of inference methods to reconstruct regulatory networks from expression data</bold>. First column, name of the method. Second column, methodological base of the method. Third column, name of the software package, if available. Horizontal classification, first panel, linear methods (correlation-based). Second panel, non-linear methods (information-based). Darker colors represent extended methods.</p></caption>
<graphic xlink:href="fgene-03-00008-g006.tif"/>
</fig>
</sec>
<sec>
<label>2.4</label> <title>Multiple testing corrections</title>
<p>Due to the fact that all methods presented above involve many hypotheses that are tested, one needs to apply a multiple hypothesis correction method (Lehmann and Romano, <xref ref-type="bibr" rid="B61">2005</xref>; Dudoit and van der Laan, <xref ref-type="bibr" rid="B37">2007</xref>). From the investigation of multiple testing corrections it is known that the presence of a correlation structure in the data can lead to severe problems that counteracts an efficient control of an error measure, e.g., of the false discovery rate (FDR) or the family-wise error (Benjamini and Yekutieli, <xref ref-type="bibr" rid="B12">2001</xref>; Storey and Tibshirani, <xref ref-type="bibr" rid="B92">2003</xref>; Dudoit et al., <xref ref-type="bibr" rid="B36">2008</xref>). Unfortunately, in the specific context of the inference of regulatory networks, multiple testing corrections have not been studied well. For this reason, without conducting in depth investigations of this problem one should apply a conservative rather than a more liberal correction method. Hence, one can apply a Bonferroni correction (Bonferroni, <xref ref-type="bibr" rid="B14">1936</xref>; Dudoit and van der Laan, <xref ref-type="bibr" rid="B37">2007</xref>).</p>
</sec>
</sec>
<sec>
<label>3</label> <title>Evaluation measures</title>
<p>In order to assess the performance of inference methods several measures have been suggested. In the following, we present three different types of such measures: (1) General statistical measures, (2) Ontology-based measures, (3) Network-based measures. On overview of these different measures is given in Figure <xref ref-type="fig" rid="F7">7</xref>.</p>
<fig id="F7" position="float">
<label>Figure 7</label>
<caption><p><bold>Overview of different evaluation measures</bold>. Here, a circle/oval represents the entity that is assessed by a measure, which can be seen as its <italic>resolution</italic>. The figure visualizes general statistical measures <bold>(A)</bold>, ontology-based measures <bold>(B)</bold>, global network-based measures <bold>(C)</bold> and local network-based measures <bold>(D)</bold>. The lack of any circle/oval in <bold>(D)</bold> shall indicate that there is no restriction to any particular resolution level. The measures in green incorporate biological information, whereas the measures in purple do not.</p></caption>
<graphic xlink:href="fgene-03-00008-g007.tif"/>
</fig>
<sec>
<label>3.1</label> <title>General statistical measures</title>
<p>The most widely used statistical measures are based on,</p>
<disp-formula id="E18"><mml:math id="M24"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd class="eqnarray-1"><mml:mstyle class="text"><mml:mtext>sensitivity</mml:mtext></mml:mstyle></mml:mtd><mml:mtd class="eqnarray-2"><mml:mo class="MathClass-rel">=</mml:mo><mml:mfrac><mml:mrow><mml:mstyle class="text"><mml:mtext>TP</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle class="text"><mml:mtext>TP</mml:mtext></mml:mstyle><mml:mo class="MathClass-bin">&#x0002B;</mml:mo><mml:mstyle class="text"><mml:mtext>FN</mml:mtext></mml:mstyle></mml:mrow></mml:mfrac></mml:mtd><mml:mtd class="eqnarray-3"></mml:mtd><mml:mtd class="eqnarray-4"><mml:mtext class="eqnarray">(24)</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="eqnarray-1"><mml:mstyle class="text"><mml:mtext>specificity</mml:mtext></mml:mstyle></mml:mtd><mml:mtd class="eqnarray-2"><mml:mo class="MathClass-rel">=</mml:mo><mml:mfrac><mml:mrow><mml:mstyle class="text"><mml:mtext>TN</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle class="text"><mml:mtext>TN</mml:mtext></mml:mstyle><mml:mo class="MathClass-bin">&#x0002B;</mml:mo><mml:mstyle class="text"><mml:mtext>FP</mml:mtext></mml:mstyle></mml:mrow></mml:mfrac></mml:mtd><mml:mtd class="eqnarray-3"></mml:mtd><mml:mtd class="eqnarray-4"><mml:mtext class="eqnarray">(25)</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="eqnarray-1"><mml:mstyle class="text"><mml:mtext>complementary</mml:mtext></mml:mstyle><mml:mspace width="2.77695pt" class="tmspace"/><mml:mstyle class="text"><mml:mtext>sensitivity</mml:mtext></mml:mstyle></mml:mtd><mml:mtd class="eqnarray-2"><mml:mo class="MathClass-rel">=</mml:mo><mml:mn>1</mml:mn><mml:mo class="MathClass-bin">-</mml:mo><mml:mstyle class="text"><mml:mtext>sensitivity</mml:mtext></mml:mstyle><mml:mo class="MathClass-rel">=</mml:mo><mml:mfrac><mml:mrow><mml:mstyle class="text"><mml:mtext>FN</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle class="text"><mml:mtext>TP</mml:mtext></mml:mstyle><mml:mo class="MathClass-bin">&#x0002B;</mml:mo><mml:mstyle class="text"><mml:mtext>FN</mml:mtext></mml:mstyle></mml:mrow></mml:mfrac></mml:mtd><mml:mtd class="eqnarray-3"></mml:mtd><mml:mtd class="eqnarray-4"><mml:mtext class="eqnarray">(26)</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="eqnarray-1"><mml:mstyle class="text"><mml:mtext>precision</mml:mtext></mml:mstyle></mml:mtd><mml:mtd class="eqnarray-2"><mml:mo class="MathClass-rel">=</mml:mo><mml:mstyle class="text"><mml:mtext>P</mml:mtext></mml:mstyle><mml:mo class="MathClass-rel">=</mml:mo><mml:mfrac><mml:mrow><mml:mstyle class="text"><mml:mtext>TP</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle class="text"><mml:mtext>TP</mml:mtext></mml:mstyle><mml:mo class="MathClass-bin">&#x0002B;</mml:mo><mml:mstyle class="text"><mml:mtext>FP</mml:mtext></mml:mstyle></mml:mrow></mml:mfrac></mml:mtd><mml:mtd class="eqnarray-3"></mml:mtd><mml:mtd class="eqnarray-4"><mml:mtext class="eqnarray">(27)</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="eqnarray-1"><mml:mstyle class="text"><mml:mtext>recall</mml:mtext></mml:mstyle></mml:mtd><mml:mtd class="eqnarray-2"><mml:mo class="MathClass-rel">=</mml:mo><mml:mstyle class="text"><mml:mtext>R</mml:mtext></mml:mstyle><mml:mo class="MathClass-rel">=</mml:mo><mml:mstyle class="text"><mml:mtext>sensitivity</mml:mtext></mml:mstyle></mml:mtd><mml:mtd class="eqnarray-3"></mml:mtd><mml:mtd class="eqnarray-4"><mml:mtext class="eqnarray">(28)</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="eqnarray-1"><mml:mstyle class="text"><mml:mtext>accuracy</mml:mtext></mml:mstyle></mml:mtd><mml:mtd class="eqnarray-2"><mml:mo class="MathClass-rel">=</mml:mo><mml:mfrac><mml:mrow><mml:mstyle class="text"><mml:mtext>TP</mml:mtext></mml:mstyle><mml:mo class="MathClass-bin">&#x0002B;</mml:mo><mml:mstyle class="text"><mml:mtext>TN</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle class="text"><mml:mtext>TP</mml:mtext></mml:mstyle><mml:mo class="MathClass-bin">&#x0002B;</mml:mo><mml:mstyle class="text"><mml:mtext>TN</mml:mtext></mml:mstyle><mml:mo class="MathClass-bin">&#x0002B;</mml:mo><mml:mstyle class="text"><mml:mtext>FP</mml:mtext></mml:mstyle><mml:mo class="MathClass-bin">&#x0002B;</mml:mo><mml:mstyle class="text"><mml:mtext>FN</mml:mtext></mml:mstyle></mml:mrow></mml:mfrac></mml:mtd><mml:mtd class="eqnarray-3"></mml:mtd><mml:mtd class="eqnarray-4"><mml:mtext class="eqnarray">(29)</mml:mtext></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>obtained by comparison of a inferred (predicted) network with the true network underlying the data. From the measures listed above, three pair-wise combinations thereof are frequently used to assess the performance of an algorithm. The first measure is the area under the curve for the <italic>receiver operator characteristics</italic> (AUC-ROC; Fawcett, <xref ref-type="bibr" rid="B43">2006</xref>). The ROC curve represents the sensitivity as function of the complementary specificity obtained by using various threshold values &#x000FE;eta&#x02009;&#x02208;&#x02009;&#x00398;, instead of one specific, the algorithm depends on<xref ref-type="fn" rid="fn3"><sup>3</sup></xref>. This leads to a &#x003B8;-dependence of all quantities listed above and, hence, allows to obtain a functional behavior among these measures. The second measure is the area under the precision-recall curve (AUC-PR), obtained similarly as AUC-ROC, and the third is the <italic>F</italic>-score,</p>
<disp-formula id="E19"><label>(30)</label><mml:math id="M25"><mml:mstyle class="text"><mml:mtext>F</mml:mtext></mml:mstyle><mml:mo class="MathClass-rel">=</mml:mo><mml:mn>2</mml:mn><mml:mfrac><mml:mrow><mml:mstyle class="text"><mml:mtext>PR</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle class="text"><mml:mtext>P</mml:mtext></mml:mstyle><mml:mo class="MathClass-bin">&#x0002B;</mml:mo><mml:mstyle class="text"><mml:mtext>R</mml:mtext></mml:mstyle></mml:mrow></mml:mfrac><mml:mo class="MathClass-punc">,</mml:mo></mml:math></disp-formula>
<p>also called <italic>F</italic><sub>1</sub> because it is a special form of,</p>
<disp-formula id="E20"><label>(31)</label><mml:math id="M26"><mml:msub><mml:mrow><mml:mstyle class="text"><mml:mtext>F</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>&#x003B2;</mml:mi></mml:mrow></mml:msub><mml:mo class="MathClass-rel">=</mml:mo><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mn>1</mml:mn><mml:mo class="MathClass-bin">-</mml:mo><mml:msup><mml:mrow><mml:mi>&#x003B2;</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mfrac><mml:mrow><mml:mstyle class="text"><mml:mtext>PR</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>&#x003B2;</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mstyle class="text"><mml:mtext>P</mml:mtext></mml:mstyle><mml:mo class="MathClass-bin">&#x0002B;</mml:mo><mml:mstyle class="text"><mml:mtext>R</mml:mtext></mml:mstyle></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:mo class="MathClass-punc">.</mml:mo></mml:math></disp-formula>
<p>We want to emphasize that all measures presented above are general statistical measures used in statistics and data analysis. None of them is specific to our problem under consideration, namely, the inference of regulatory networks from expression data. In other words, none of these measures utilizes either biological or network specific information in any form. Further, each of these general statistical measures are global error measures because they evaluate the network inference performance as a whole, represented by a scalar value. As a consequence thereof, it is implicitly assumed that the inference process is homogeneous, i.e., each interaction should have about the same true positive rate, because otherwise it would be implausible to summarize the inference performance by just one value, e.g., a <italic>F</italic>-score. However, this is not justified, as we will discuss below.</p>
<p>In the following, we present extensions of general statistical measures for both types of information.</p>
</sec>
<sec>
<label>3.2</label> <title>Ontology-based measures</title>
<p>An evaluation strategy utilizing biological information to assess the performance of an inference method tries to quantify the <italic>biological relevance</italic> of the inferred network. In general, it is assumed that genes in a gene regulatory network are preferentially linked to genes involved in similar biological processes (Wolfe et al., <xref ref-type="bibr" rid="B100">2005</xref>). There are several publicly available resources of biological knowledge which can be used to test whether this holds true, for example, the Gene Ontology database (GO; Ashburner et al., <xref ref-type="bibr" rid="B6">2000</xref>), or KEGG (Kanehisa and Goto, <xref ref-type="bibr" rid="B57">2000</xref>). There are also several curated organism-specific knowledge databases, such as RegulonDB (<italic>E. coli</italic>; Gama-Castro et al., <xref ref-type="bibr" rid="B47">2008</xref>) and MIPS (yeast; Mewes et al., <xref ref-type="bibr" rid="B72">2002</xref>). Functional congruence of clusters of coexpressed genes is a popular validation measure for clustering algorithms (Datta and Datta, <xref ref-type="bibr" rid="B27">2006</xref>) and in principle can also be used for the validation of inferred networks.</p>
</sec>
<sec>
<label>3.3</label> <title>Network-based measures</title>
<p>The third type of measures for assessing the performance of an inference algorithm considers the network structure explicitly. That means, in contrast to the general statistical measures and also the ontology-based measures, network-based measures can only be used if there is a network that underlies the problem.</p>
<sec>
<label>3.3.1</label> <title>Global network-based measures</title>
<p>A measure that makes explicit use of the network structures of the true (<italic>G</italic>) and inferred (<italic>&#x0011C;</italic>) network was proposed by Zhao et al. (<xref ref-type="bibr" rid="B108">2008</xref>). They suggested to use,</p>
<disp-formula id="E21"><label>(32)</label><mml:math id="M27"><mml:mi>D</mml:mi><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>G</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>&#x0011C;</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mo class="MathClass-rel">=</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mo class="MathClass-rel">|</mml:mo><mml:mi>E</mml:mi><mml:mo class="MathClass-rel">|</mml:mo></mml:mrow></mml:mfrac><mml:mfenced separators="" open="(" close=")"><mml:mrow><mml:munder class="msub"><mml:mrow><mml:mo mathsize="big">&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo class="MathClass-rel">=</mml:mo><mml:mn>0</mml:mn><mml:mo class="MathClass-punc">,</mml:mo><mml:mspace width="1em" class="nbsp" /><mml:msub><mml:mrow><mml:mi>&#x000C2;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo class="MathClass-rel">=</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:munder><mml:mspace width="0.3em" class="thinspace"/><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>G</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo class="MathClass-punc">,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mo class="MathClass-bin">&#x0002B;</mml:mo><mml:munder class="msub"><mml:mrow><mml:mo mathsize="big">&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo class="MathClass-rel">=</mml:mo><mml:mn>1</mml:mn><mml:mo class="MathClass-punc">,</mml:mo><mml:mspace width="1em" class="nbsp" /><mml:msub><mml:mrow><mml:mi>&#x000C2;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo class="MathClass-rel">=</mml:mo><mml:mn>0</mml:mn></mml:mrow></mml:munder><mml:mspace width="0.3em" class="thinspace"/><mml:mn>1</mml:mn></mml:mrow></mml:mfenced><mml:mo class="MathClass-punc">,</mml:mo></mml:math></disp-formula>
<p>the weighted sum of false-positive edges (first term) plus the false-negative edges (second term). This measure is not only asymmetric in its arguments but also gives a different weight to type-1 (false positives) respectively type-2 (false negatives) errors. For type-1 errors, in the true network <italic>G</italic> the Dijkstra distance (Dijkstra, <xref ref-type="bibr" rid="B35">1959</xref>) from node <italic>i</italic> to <italic>j</italic>, <italic>d<sub>G</sub></italic>(<italic>i</italic>,<italic>j</italic>), is calculated whereas for type-2 errors each false-negative edge is assigned a constant weight of &#x0201C;1.&#x0201D; Overall, <italic>D</italic>(<italic>G,&#x0011C;</italic>) is a global measure, as were all measures discussed in section <xref ref-type="sec" rid="s1">1</xref>, which assesses the performance of an inference algorithm holistically.</p>
</sec>
<sec>
<label>3.3.2</label> <title>Local network-based measures</title>
<p>In contrast to all above measures, which were <italic>global</italic> measures, we present finally <italic>local</italic> network-based measures introduced in Altay and Emmert-Streib (<xref ref-type="bibr" rid="B3">2010b</xref>), Emmert-Streib and Altay (<xref ref-type="bibr" rid="B39">2010</xref>). These local network-based measures are based on <italic>ensemble data</italic>, &#x1D49F;&#x02009;&#x0003D;&#x02009;{<italic>D</italic><sub>1</sub>(<italic>G</italic>),&#x02026;, <italic>D<sub>E</sub></italic>(<italic>G</italic>)}, and the availability of a reference network that represents the &#x0201C;true&#x0201D; regulatory network <italic>G</italic>. Ensemble data means that there is more than one data set available from the biological phenomenon under investigation. This ensemble could be obtained either by bootstrapping from one data set or from simulation studies. The apparent advantage of ensemble data is that they allow to quantify the variability of the population that underlies the data. After having obtained an ensemble of estimated networks <inline-formula><mml:math id="M6"><mml:mrow><mml:msup><mml:mrow><mml:mi mathvariant="script">G</mml:mi></mml:mrow><mml:mrow><mml:mi>e</mml:mi></mml:mrow></mml:msup><mml:mo class="MathClass-rel">=</mml:mo><mml:msubsup><mml:mrow><mml:mrow><mml:mo class="MathClass-open">{</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mi>G</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>e</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo class="MathClass-close">}</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo class="MathClass-rel">=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>E</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> from &#x1D49F;&#x02009;&#x0003D;&#x02009;{<italic>D</italic><sub>1</sub>(<italic>G</italic>),&#x02026;, <italic>D<sub>E</sub></italic>(<italic>G</italic>)} by application of an inference algorithm, we can obtain the true positive rates ({<italic>TPR<sub>ij</sub></italic>}) of each edge and the true negative rates ({<italic>TNR<sub>ij</sub></italic>}) for each non-edge with respect to the underlying true network structure. For example, we can estimate the TPR of an edge between gene <italic>i</italic> and <italic>j</italic>, which is present in <italic>G</italic>, by</p>
<disp-formula id="E22"><label>(33)</label><mml:math id="M28"><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo class="MathClass-rel">=</mml:mo><mml:mfrac><mml:mrow><mml:mi>#</mml:mi><mml:mspace width="2.77695pt" class="tmspace"/><mml:mstyle class="text"><mml:mtext>of</mml:mtext></mml:mstyle><mml:mspace width="2.77695pt" class="tmspace"/><mml:mstyle class="text"><mml:mtext>edges</mml:mtext></mml:mstyle><mml:mspace width="2.77695pt" class="tmspace"/><mml:mstyle class="text"><mml:mtext>present</mml:mtext></mml:mstyle><mml:mspace width="2.77695pt" class="tmspace"/><mml:mstyle class="text"><mml:mtext>in</mml:mtext></mml:mstyle><mml:mspace width="2.77695pt" class="tmspace"/><mml:msup><mml:mrow><mml:mi mathvariant="script">G</mml:mi></mml:mrow><mml:mrow><mml:mi>e</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mi>E</mml:mi></mml:mrow></mml:mfrac></mml:math></disp-formula>
<p>which corresponds to <italic>Pr</italic>(gene <italic>i</italic> interacts with gene <italic>j</italic>&#x02009;|&#x02009;&#x1D4A2;<sup>e</sup>). Analogously, we can estimate the TNR.</p>
<p>Principally, any combination of true positive and true negative rates would result in a valid network-based measure consisting, e.g., of network motifs, subnetworks, or even only of individual edges. For such a measure representing a structural region within a network it is then possible to estimate its reconstruction rate. To provide a concrete example, we give the reconstruction rate of a three-gene motif that is given by the chain in Figure <xref ref-type="fig" rid="F2">2</xref></p>
<disp-formula id="E23"><label>(34)</label><mml:math id="M29"><mml:mi>p</mml:mi><mml:mo class="MathClass-rel">=</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:mfrac><mml:mfenced separators="" open="(" close=")"><mml:mrow><mml:mstyle class="text"><mml:mtext>TPR</mml:mtext></mml:mstyle><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mstyle class="text"><mml:mtext>X</mml:mtext></mml:mstyle><mml:mo class="MathClass-rel">&#x02192;</mml:mo><mml:mstyle class="text"><mml:mtext>Y</mml:mtext></mml:mstyle></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mo class="MathClass-bin">&#x0002B;</mml:mo><mml:mstyle class="text"><mml:mtext>TPR</mml:mtext></mml:mstyle><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mstyle class="text"><mml:mtext>Y</mml:mtext></mml:mstyle><mml:mo class="MathClass-rel">&#x02192;</mml:mo><mml:mstyle class="text"><mml:mtext>Z</mml:mtext></mml:mstyle></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow><mml:mo class="MathClass-bin">&#x0002B;</mml:mo><mml:mstyle class="text"><mml:mtext>TNR</mml:mtext></mml:mstyle><mml:mrow><mml:mo class="MathClass-open">(</mml:mo><mml:mrow><mml:mstyle class="text"><mml:mtext>X</mml:mtext></mml:mstyle><mml:mo class="MathClass-rel">&#x02194;&#x00338;</mml:mo><mml:mstyle class="text"><mml:mtext>Z</mml:mtext></mml:mstyle></mml:mrow><mml:mo class="MathClass-close">)</mml:mo></mml:mrow></mml:mrow></mml:mfenced><mml:mo class="MathClass-punc">.</mml:mo></mml:math></disp-formula>
<p>Also, if biological information about the genes in the network is available, this can also be used to define appropriate measures. For example, one could obtain a reconstruction rate for all interactions that are connected with transcription factors or of particular biological pathways involving only certain genes as defined, e.g., via the gene ontology database. Further examples of such measures can be found in Altay and Emmert-Streib (<xref ref-type="bibr" rid="B3">2010b</xref>), Emmert-Streib and Altay (<xref ref-type="bibr" rid="B39">2010</xref>).</p>
</sec>
</sec>
</sec>
<sec>
<label>4</label> <title>Comparison of Inference Methods</title>
<p>A question that is of practical relevance is which of the discussed inference methods, listed in Figure <xref ref-type="fig" rid="F6">6</xref>, is preferred. Unfortunately, there is no study that compared all these methods with each other. However, there are several studies that compared subsets thereof. For example, in Altay and Emmert-Streib (<xref ref-type="bibr" rid="B2">2010a</xref>, <xref ref-type="bibr" rid="B4">2011</xref>) RN, ARACNE, CLR, MRNET, and C3NET have been compared for a large variety of different conditions, including different network structures, sample sizes, and noise levels. For all these conditions, ensemble data have been generated that allow the application of local network-based measures providing the most detailed information about the inference characteristics of a method. Overall, it has been found that C3NET and MRNET perform better than ARACNE, CLR, and RN (in this order). There are two main differences between C3NET and MRNET. First, the computational complexity of C3NET is <italic>O</italic>(<italic>N</italic><sup>2</sup>) and for MRNET it is between <italic>O</italic>(<italic>N</italic><sup>2</sup>) and <italic>O</italic>(<italic>N</italic><sup>3</sup>) which is difficult to quantify exactly because of the iterative nature of the second step employed by MRNET. For the conducted simulations which involved networks in the order of <italic>O</italic>(10<sup>2</sup>) genes this was tractable, however, for real biological data the number of genes can reach over 10,000 causing series problems. In contrast, C3NET has been applied to a bootstrap ensemble from B cell lymphoma for 9.684 genes (de Matos Simoes et al., submitted). Second, due to the conservative character of C3NET, which is currently the most conservative method of all inference methods, the number of obtained interactions is easier to deal with than for other methods. For example, in Basso et al. (<xref ref-type="bibr" rid="B9">2005</xref>) ARACNE was applied to the same B cell data, inferring about 130,000 interactions whereas C3NET inferred about 10,000 interactions. Given the current impossibility to experimentally validate tens of thousands of interactions by wet lab experiments a conservative subnetwork having a low false-positive rate (Altay and Emmert-Streib, <xref ref-type="bibr" rid="B2">2010a</xref>, <xref ref-type="bibr" rid="B4">2011</xref>) is advantages.</p>
<p>We would like to remark that the methods studied in Altay and Emmert-Streib (<xref ref-type="bibr" rid="B2">2010a</xref>, <xref ref-type="bibr" rid="B4">2011</xref>) were selected on the ground of their conceptual similarity to provide a fair comparison, but they have also a (relatively) low computational complexity which allows the investigation for ensemble data and not just single data sets. Many other methods listed in Figure <xref ref-type="fig" rid="F6">6</xref> have a higher computational complexity forcing a comparative analysis to be performed with small networks, typically involving <italic>O</italic>(10<sup>1</sup>) genes, and individual data sets rather than ensembles. For example, in Werhli et al. (<xref ref-type="bibr" rid="B96">2006</xref>) RN, GGM, and BN have been compared for a network consisting of only 11 genes because of the high computational complexity of a BN. As a result, it is reported that GGM and the BN outperform RN and perform similarly compared to each other.</p>
</sec>
<sec>
<label>5</label> <title>Discussion</title>
<p>From the discussions of the preceding methods for the structural inference of regulatory networks one could get the impression that their number is sheerly unlimited as well as the principle ideas behind them. In our opinion the first point is probably true, the latter not. In order to see this more clearly we would like to outline the general procedure underlying the development of each method. First, a principle idea or hypothesis is raised about a mechanism for the inference of regulatory networks and, second, a method is conceived that could accomplish this. Formally, the first part relates to the conceptual or qualitative framework of a method whereas the latter refers to its quantitative realization, e.g., in form of statistical estimators. In order to learn about two classic inference algorithms, embodying two different conceptual ideas, we presented the IC (inductive causation) algorithm (Pearl, <xref ref-type="bibr" rid="B78">2000</xref>) and the MWST (maximum weight spanning tree) algorithm (Chow and Liu, <xref ref-type="bibr" rid="B24">1968</xref>). For the IC algorithm the conceptual core is d-separation, and for the MWST this is the DPI. When considered from this perspective, the inference algorithms shown in Figure <xref ref-type="fig" rid="F6">6</xref> can be grouped as follows. ARACNE is based on the idea of the MWST, whereas all other methods except SEM, SA-CLR, and MI3 are either close or approximate adaptations of d-separation. Here approximate adaptation of d-separation means that an algorithm employs d-separation in a certain way, for instance up to a fixed order of the correlation coefficient or the mutual information (like ParCorA or MI-CMI), but not in the exhaustive way as described for the IC algorithm. From this perspective follow at least three implications. First, the number of conceptually different ideas is very limited if technical approximations are not counted as different. Second, when one of the above inference algorithms represents only an &#x0201C;approximation&#x0201D; of one of the two classic conceptual frameworks its theoretical inference abilities would strictly speaking require a new mathematical proof because new assumptions made may not translate to these methods. Third, due to the fact that most methods presented in Figure <xref ref-type="fig" rid="F6">6</xref> lack such a strict mathematical proof our knowledge about their actual abilities is based on numerical studies. This does not mean that numerical studies are not capable of allowing a detailed analysis but that these investigations need to be conducted thoroughly acknowledging the statistical and biological nature of the problem. The former means that numerical studies need to be based on <italic>ensemble data</italic> in order to capture characteristics of the population, and the latter means that network-based metrics should be applied due to the heterogeneous inferability of local components of regulatory networks, as found in Altay and Emmert-Streib (<xref ref-type="bibr" rid="B3">2010b</xref>) and Emmert-Streib and Altay (<xref ref-type="bibr" rid="B39">2010</xref>).</p>
<p>We would like to finish this review with a brief outlook on future directions. In the introduction, we mentioned that due to the nature of gene expression data, which do not allow to derive unique predictions about the underlying molecular interactions between gene products, the resulting gene regulatory networks represent a mixture of a transcriptional regulatory network and a protein interaction network. For example, in Altay and Emmert-Streib (<xref ref-type="bibr" rid="B2">2010a</xref>) gene expression data from <italic>E. coli</italic> have been analyzed by inferring a regulatory network. Among the verified interactions found from comparing estimated interactions with experimentally reported results from the literature, transcriptional regulatory interactions, and also protein-protein interactions have been found. Similar results for <italic>S. cerevisiae</italic> can be found in de Matos Simoes and Emmert-Streib (submitted). In this study, over 800 protein-protein interactions from BioGRID (Breitkreutz et al., <xref ref-type="bibr" rid="B15">2008</xref>) have been identified in the inferred network. For a general discussion about the connection between gene expression and protein interactions, see Grigoriev (<xref ref-type="bibr" rid="B48">2001</xref>), Jansen et al. (<xref ref-type="bibr" rid="B53">2002</xref>).</p>
<p>In order to obtain more refined predictions about the type of interactions and also to improve the inference performance of the methods, complementary information from other types of high-throughput data is needed. For example, data from ChIP-chip or ChIP-Seq experiments could be used to obtain information about the potential gene targets of transcription factors, similarly, proteomics data could be employed to reveal protein-protein interactions. Ideally, information from all three data types (ChIP-Seq, gene expression, and proteomics) should be integrated to infer a more detailed network with a clearer interpretation of the inferred interactions between the gene products. Sporadically, methods have been already pioneered for such an integration (Nariai et al., <xref ref-type="bibr" rid="B75">2005</xref>; Xing and van der Laan, <xref ref-type="bibr" rid="B103">2005</xref>). However, due to the increased experimental effort and its associated costs the combined availability of several different types of large-scale high-throughput data sets, as necessary for such an analysis, is still rare. This lack of data hampered so far the systematic development of integrative methods. However, with the increasing use of next-generation sequencing technologies this might change is the near future. Other data integration approaches that need to be addressed are the combination of, e.g., gene expression data, from different experiments or platforms (Belcastro et al., <xref ref-type="bibr" rid="B11">2011</xref>). Due to the need for a normalization of such expression data, a pooling of these data is usually prohibitive making an analysis difficult.</p>
<p>Another very important topic, aside data integrative methods, relates to the generation of the data themselves. Specifically, in this review we focused on observational data only, however, experimental data consisting of gene interventions or perturbations form a very fruitful source of information that could be systematically exploited (Fr&#x000F6;hlich et al., <xref ref-type="bibr" rid="B46">2008</xref>; Markowetz, <xref ref-type="bibr" rid="B69">2010</xref>; Pinna et al., <xref ref-type="bibr" rid="B81">2010</xref>; Yip et al., <xref ref-type="bibr" rid="B105">2010</xref>).</p>
<p>This discussion emphasizes the need for a clear conceptual distinction between different methods and the information they are based on.</p>
</sec>
<sec>
<label>6</label> <title>Conclusion</title>
<p>In this paper we presented a systematic overview of methods for inferring gene regulatory networks. Although this field is currently vastly expanding making it very difficult to obtain such an overview, we assumed two different perspectives that allowed to categorize inference algorithms sensibly. The first perspective was based on the dynamical assumptions (linear vs. non-linear) methods make about the underlying data. The second considered the methods through the lense of classic approaches which use either d-separation or the DPI (Chow and Liu, <xref ref-type="bibr" rid="B24">1968</xref>; Verma and Pearl, <xref ref-type="bibr" rid="B94">1991</xref>; Spirtes et al., <xref ref-type="bibr" rid="B88">1993</xref>). We want to conclude this article by mentioning that the inference of regulatory networks may not only help in gaining a better understanding of the <italic>normal</italic> physiology of a cell, but also in elucidating the molecular basis of diseases (Emmert-Streib, <xref ref-type="bibr" rid="B38">2007</xref>; Chen et al., <xref ref-type="bibr" rid="B21">2008b</xref>; Jiang et al., <xref ref-type="bibr" rid="B54">2008</xref>).</p>
</sec>
<sec>
<title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<ack>
<p>Frank Emmert-Streib would like to thank Simon Tavar&#x000E9; and Florian Markowetz for fruitful discussions and the Department for Employment and Learning through its &#x0201C;Strengthening the all-Island Research Base&#x0201D; initiative and the School of Medicine, Dentistry and Biomedical Sciences for financial support. G&#x000F6;kmen Altay is funded by Cancer Research UK.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Agrawal</surname> <given-names>H.</given-names></name></person-group> (<year>2002</year>). <article-title>Extreme self-organization in networks constructed from gene expression data</article-title>. <source>Phys. Rev. Lett.</source> <volume>89</volume>, <fpage>268702</fpage>.<pub-id pub-id-type="doi">10.1103/PhysRevLett.89.268702</pub-id><pub-id pub-id-type="pmid">12484863</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Altay</surname> <given-names>G.</given-names></name> <name><surname>Emmert-Streib</surname> <given-names>F.</given-names></name></person-group> (<year>2010a</year>). <article-title>Inferring the conservative causal core of gene regulatory networks</article-title>. <source>BMC Syst. Biol.</source> <volume>4</volume>, <fpage>132</fpage>.<pub-id pub-id-type="doi">10.1186/1752-0509-4-132</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Altay</surname> <given-names>G.</given-names></name> <name><surname>Emmert-Streib</surname> <given-names>F.</given-names></name></person-group> (<year>2010b</year>). <article-title>Revealing differences in gene network inference algorithms on the network-level by ensemble methods</article-title>. <source>Bioinformatics</source> <volume>26</volume>, <fpage>1738</fpage>&#x02013;<lpage>1744</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btq259</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Altay</surname> <given-names>G.</given-names></name> <name><surname>Emmert-Streib</surname> <given-names>F.</given-names></name></person-group> (<year>2011</year>). <article-title>Structural Influence of gene networks on their inference: analysis of C3NET</article-title>. <source>Biol. Direct</source> <volume>6</volume>, <fpage>31</fpage>.<pub-id pub-id-type="doi">10.1186/1745-6150-6-31</pub-id><pub-id pub-id-type="pmid">21696592</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Anastassiou</surname> <given-names>D.</given-names></name></person-group> (<year>2007</year>). <article-title>Computational analysis of the synergy among multiple interacting genes</article-title>. <source>Mol. Syst. Biol.</source> <fpage>3</fpage>. Available at: <uri xlink:href="http://www.ncbi.nlm.nih.gov/pubmed/17299419">http://www.ncbi.nlm.nih.gov/pubmed/17299419</uri><pub-id pub-id-type="doi">10.1186/1745-6150-6-31</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ashburner</surname> <given-names>M.</given-names></name> <name><surname>Ball</surname> <given-names>C.</given-names></name> <name><surname>Blake</surname> <given-names>J.</given-names></name> <name><surname>Botstein</surname> <given-names>D.</given-names></name> <name><surname>Butler</surname> <given-names>H.</given-names></name> <name><surname>Cherry</surname> <given-names>J. M.</given-names></name> <name><surname>Davis</surname> <given-names>A. P.</given-names></name> <name><surname>Dolinski</surname> <given-names>K.</given-names></name> <name><surname>Dwight</surname> <given-names>S. S.</given-names></name> <name><surname>Eppig</surname> <given-names>J. T.</given-names></name> <name><surname>Harris</surname> <given-names>M. A.</given-names></name> <name><surname>Hill</surname> <given-names>D. P.</given-names></name> <name><surname>Issel-Tarver</surname> <given-names>L.</given-names></name> <name><surname>Kasarskis</surname> <given-names>A.</given-names></name> <name><surname>Lewis</surname> <given-names>S.</given-names></name> <name><surname>Matese</surname> <given-names>J. C.</given-names></name> <name><surname>Richardson</surname> <given-names>J. E.</given-names></name> <name><surname>Ringwald</surname> <given-names>M.</given-names></name> <name><surname>Rubin</surname> <given-names>G. M.</given-names></name> <name><surname>Sherlock</surname> <given-names>G.</given-names></name></person-group> (<year>2000</year>). <article-title>Gene ontology: tool for the unification of biology. The Gene Ontology Consortium</article-title>. <source>Nat. Genet.</source> <volume>25</volume>, <fpage>25</fpage>&#x02013;<lpage>29</lpage>.<pub-id pub-id-type="doi">10.1038/75556</pub-id><pub-id pub-id-type="pmid">10802651</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bansal</surname> <given-names>M.</given-names></name> <name><surname>Belcastro</surname> <given-names>V.</given-names></name> <name><surname>Ambesi-Impiombato</surname> <given-names>A.</given-names></name> <name><surname>di Bernardo</surname> <given-names>D.</given-names></name></person-group> (<year>2007</year>). <article-title>How to infer gene networks from expression profiles</article-title>. <source>Mol. Syst. Biol.</source> <volume>3</volume>, <fpage>78</fpage>.<pub-id pub-id-type="doi">10.1038/msb4100158</pub-id><pub-id pub-id-type="pmid">17299415</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Barab&#x000E1;si</surname> <given-names>A. L.</given-names></name> <name><surname>Oltvai</surname> <given-names>Z. N.</given-names></name></person-group> (<year>2004</year>). <article-title>Network biology: understanding the cell&#x02019;s functional organization</article-title>. <source>Nat. Rev. Genet.</source> <volume>5</volume>, <fpage>101</fpage>&#x02013;<lpage>113</lpage>.<pub-id pub-id-type="doi">10.1038/nrg1272</pub-id><pub-id pub-id-type="pmid">14735121</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Basso</surname> <given-names>K.</given-names></name> <name><surname>Margolin</surname> <given-names>A. A.</given-names></name> <name><surname>Stolovitzky</surname> <given-names>G.</given-names></name> <name><surname>Klein</surname> <given-names>U.</given-names></name> <name><surname>Dalla-Favera</surname> <given-names>R.</given-names></name> <name><surname>Califano</surname> <given-names>A.</given-names></name></person-group> (<year>2005</year>). <article-title>Reverse engineering of regulatory networks in human b cells</article-title>. <source>Nat. Genet.</source> <volume>37</volume>, <fpage>382</fpage>&#x02013;<lpage>390</lpage>.<pub-id pub-id-type="doi">10.1038/ng1532</pub-id><pub-id pub-id-type="pmid">15778709</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Beirlant</surname> <given-names>J.</given-names></name> <name><surname>Dudewicz</surname> <given-names>E.</given-names></name> <name><surname>Gyorfi</surname> <given-names>L.</given-names></name> <name><surname>van der Meulen</surname> <given-names>E.</given-names></name></person-group> (<year>1997</year>). <article-title>Nonparametric entropy estimation: an overview</article-title>. <source>Int. J. Math. Stat. Sci.</source> <volume>6</volume>, <fpage>17</fpage>&#x02013;<lpage>39</lpage>.</citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Belcastro</surname> <given-names>V.</given-names></name> <name><surname>Siciliano</surname> <given-names>V.</given-names></name> <name><surname>Gregoretti</surname> <given-names>F.</given-names></name> <name><surname>Mithbaokar</surname> <given-names>P.</given-names></name> <name><surname>Dharmalingam</surname> <given-names>G.</given-names></name> <name><surname>Berlingieri</surname> <given-names>S.</given-names></name> <name><surname>Iorio</surname> <given-names>F.</given-names></name> <name><surname>Oliva</surname> <given-names>G.</given-names></name> <name><surname>Polishchuck</surname> <given-names>R.</given-names></name> <name><surname>Brunetti-Pierri</surname> <given-names>N.</given-names></name> <name><surname>di Bernardo</surname> <given-names>D.</given-names></name></person-group> (<year>2011</year>). <article-title>Transcriptional gene network inference from a massive dataset elucidates transcriptome organization and gene function</article-title>. <source>Nucleic Acids Res.</source> <volume>39</volume>, <fpage>8677</fpage>&#x02013;<lpage>8688</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkr593</pub-id><pub-id pub-id-type="pmid">21785136</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Benjamini</surname> <given-names>Y.</given-names></name> <name><surname>Yekutieli</surname> <given-names>D.</given-names></name></person-group> (<year>2001</year>). <article-title>The control of the false discovery rate in multiple testing under dependency</article-title>. <source>Ann. Stat.</source> <volume>29</volume>, <fpage>1165</fpage>&#x02013;<lpage>1188</lpage>.<pub-id pub-id-type="doi">10.1214/aos/1013699998</pub-id></citation></ref>
<ref id="B13"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Bollen</surname> <given-names>K.</given-names></name></person-group> (<year>1989</year>). <source>Structural Equations with Latent Variables</source>. <publisher-loc>New York</publisher-loc>: <publisher-name>Wiley</publisher-name>.</citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bonferroni</surname> <given-names>E.</given-names></name></person-group> (<year>1936</year>). <article-title>Teoria statistica delle classi e calcolo delle probabilita</article-title>. <source>Pubbl. d. R. Ist. Super. di Sci. Econom. e Commerciali di Firenze</source> <volume>8</volume>, <fpage>3</fpage>&#x02013;<lpage>62</lpage>.</citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Breitkreutz</surname> <given-names>B.-J.</given-names></name> <name><surname>Stark</surname> <given-names>C.</given-names></name> <name><surname>Reguly</surname> <given-names>T.</given-names></name> <name><surname>Boucher</surname> <given-names>L.</given-names></name> <name><surname>Breitkreutz</surname> <given-names>A.</given-names></name> <name><surname>Livstone</surname> <given-names>M.</given-names></name> <name><surname>Oughtred</surname> <given-names>R.</given-names></name> <name><surname>Lackner</surname> <given-names>D. H.</given-names></name> <name><surname>Bahler</surname> <given-names>J.</given-names></name> <name><surname>Wood</surname> <given-names>V.</given-names></name> <name><surname>Dolinski</surname> <given-names>K.</given-names></name> <name><surname>Tyers</surname> <given-names>M.</given-names></name></person-group> (<year>2008</year>). <article-title>The BioGRID interaction database: 2008 update</article-title>. <source>Nucleic Acids Res.</source> <volume>36</volume>(<issue>Suppl. 1</issue>), <fpage>D637</fpage>&#x02013;<lpage>D640</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkm1001</pub-id><pub-id pub-id-type="pmid">18000002</pub-id></citation></ref>
<ref id="B16"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Brown</surname> <given-names>T.</given-names></name></person-group> (<year>2006</year>). <source>Confirmatory Factor Analysis for Applied Research</source>. <publisher-loc>New York</publisher-loc>: <publisher-name>The Guilford Press</publisher-name>.</citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Butte</surname> <given-names>A.</given-names></name> <name><surname>Kohane</surname> <given-names>I.</given-names></name></person-group> (<year>2000</year>). <article-title>Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements</article-title>. <source>Pac. Symp. Biocomp.</source> <volume>5</volume>, <fpage>415</fpage>&#x02013;<lpage>426</lpage>.</citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carter</surname> <given-names>S. L.</given-names></name> <name><surname>Brechbuhler</surname> <given-names>C. M.</given-names></name> <name><surname>Griffin</surname> <given-names>M.</given-names></name> <name><surname>Bond</surname> <given-names>A. T.</given-names></name></person-group> (<year>2004</year>). <article-title>Gene co-expression network topology provides a framework for molecular characterization of cellular state</article-title>. <source>Bioinformatics</source> <volume>20</volume>, <fpage>2242</fpage>&#x02013;<lpage>2250</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/bth234</pub-id><pub-id pub-id-type="pmid">15130938</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cerulo</surname> <given-names>L.</given-names></name> <name><surname>Elkan</surname> <given-names>C.</given-names></name> <name><surname>Ceccarelli</surname> <given-names>M.</given-names></name></person-group> (<year>2010</year>). <article-title>Learning gene regulatory networks from only positive and unlabeled data</article-title>. <source>BMC Bioinformatics</source> <volume>11</volume>, <fpage>228</fpage>.<pub-id pub-id-type="doi">10.1186/1471-2105-11-228</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>G.</given-names></name> <name><surname>Larsen</surname> <given-names>P.</given-names></name> <name><surname>Almasri</surname> <given-names>E.</given-names></name> <name><surname>Dai</surname> <given-names>Y.</given-names></name></person-group> (<year>2008a</year>). <article-title>Rank-based edge reconstruction for scale-free genetic regulatory networks</article-title>. <source>BMC Bioinformatics</source> <volume>9</volume>, <fpage>75</fpage>.<pub-id pub-id-type="doi">10.1186/1471-2105-9-75</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>Y.</given-names></name> <name><surname>Zhu</surname> <given-names>J.</given-names></name> <name><surname>Lum</surname> <given-names>P.</given-names></name> <name><surname>Yang</surname> <given-names>X.</given-names></name> <name><surname>Pinto</surname> <given-names>S.</given-names></name> <name><surname>MacNeil</surname> <given-names>D. J.</given-names></name> <name><surname>Zhang</surname> <given-names>C.</given-names></name> <name><surname>Lamb</surname> <given-names>J.</given-names></name> <name><surname>Edwards</surname> <given-names>S.</given-names></name> <name><surname>Sieberts</surname> <given-names>S. K.</given-names></name> <name><surname>Leonardson</surname> <given-names>A.</given-names></name> <name><surname>Castellini</surname> <given-names>L. W.</given-names></name> <name><surname>Wang</surname> <given-names>S.</given-names></name> <name><surname>Champy</surname> <given-names>M. F.</given-names></name> <name><surname>Zhang</surname> <given-names>B.</given-names></name> <name><surname>Emilsson</surname> <given-names>V.</given-names></name> <name><surname>Doss</surname> <given-names>S.</given-names></name> <name><surname>Ghazalpour</surname> <given-names>A.</given-names></name> <name><surname>Horvath</surname> <given-names>S.</given-names></name> <name><surname>Drake</surname> <given-names>T. A.</given-names></name> <name><surname>Lusis</surname> <given-names>A. J.</given-names></name> <name><surname>Schadt</surname> <given-names>E. E.</given-names></name></person-group> (<year>2008b</year>). <article-title>Variations in DNA elucidate molecular networks that cause disease</article-title>. <source>Nature</source> <volume>452</volume>, <fpage>429</fpage>&#x02013;<lpage>435</lpage>.<pub-id pub-id-type="doi">10.1038/nature06757</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chickering</surname> <given-names>D. M.</given-names></name></person-group> (<year>1996</year>). <article-title>Learning Bayesian networks is NP-complete</article-title>. <source>Networks</source> <volume>112</volume>, <fpage>121</fpage>&#x02013;<lpage>130</lpage>.</citation></ref>
<ref id="B23"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Chickering</surname> <given-names>D.</given-names></name> <name><surname>Geiger</surname> <given-names>D.</given-names></name> <name><surname>Heckerman</surname> <given-names>D.</given-names></name></person-group> (<year>1995</year>). <article-title>&#x0201C;Learning bayesian networks: search methods and experimental results,&#x0201D;</article-title> in <conf-name>Proceedings of Fifth Conference on Artificial lntelligence and Statistics</conf-name> (<conf-sponsor>Society for Artificial Intelligence in Statistics</conf-sponsor>), <fpage>569</fpage>&#x02013;<lpage>595</lpage>.</citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chow</surname> <given-names>C.</given-names></name> <name><surname>Liu</surname> <given-names>C.</given-names></name></person-group> (<year>1968</year>). <article-title>Approximating discrete probability distributions with dependence trees</article-title>. <source>IEEE Trans. Inf. Theory</source> <volume>14</volume>, <fpage>462</fpage>&#x02013;<lpage>467</lpage>.<pub-id pub-id-type="doi">10.1109/TIT.1968.1054142</pub-id></citation></ref>
<ref id="B25"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Cover</surname> <given-names>T.</given-names></name> <name><surname>Thomas</surname> <given-names>J.</given-names></name></person-group> (<year>1991</year>). <source>Information Theory</source>. <publisher-loc>Hoboken, NJ</publisher-loc>: <publisher-name>John Wiley &#x00026; Sons, Inc</publisher-name>.</citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Crick</surname> <given-names>F.</given-names></name></person-group> (<year>1970</year>). <article-title>Central dogma of molecular biology</article-title>. <source>Nature</source> <volume>227</volume>, <fpage>561</fpage>&#x02013;<lpage>563</lpage>.<pub-id pub-id-type="doi">10.1038/227561a0</pub-id><pub-id pub-id-type="pmid">4913914</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Datta</surname> <given-names>S.</given-names></name> <name><surname>Datta</surname> <given-names>S.</given-names></name></person-group> (<year>2006</year>). <article-title>Methods for evaluating clustering algorithms for gene expression data using a reference set of functional classes</article-title>. <source>BMC Bioinformatics</source> <volume>7</volume>, <fpage>397</fpage>.<pub-id pub-id-type="doi">10.1186/1471-2105-7-397</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Daub</surname> <given-names>C.</given-names></name> <name><surname>Steuer</surname> <given-names>R.</given-names></name> <name><surname>Selbig</surname> <given-names>J.</given-names></name> <name><surname>Kloska</surname> <given-names>S.</given-names></name></person-group> (<year>2004</year>). <article-title>Estimating mutual information using b-spline functions &#x02013; an improved similarity measure for analysing gene expression data</article-title>. <source>BMC Bioinformatics</source> <volume>5</volume>, <fpage>118</fpage>.<pub-id pub-id-type="doi">10.1186/1471-2105-5-118</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Jong</surname> <given-names>H.</given-names></name></person-group> (<year>2002</year>). <article-title>Modeling and simulation of genetic regulatory systems: a literature review</article-title>. <source>J. Comput. Biol.</source> <volume>9</volume>, <fpage>67</fpage>&#x02013;<lpage>103</lpage>.<pub-id pub-id-type="doi">10.1089/10665270252833208</pub-id><pub-id pub-id-type="pmid">11911796</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de la Fuente</surname> <given-names>A.</given-names></name> <name><surname>Bing</surname> <given-names>N.</given-names></name> <name><surname>Hoeschele</surname> <given-names>I.</given-names></name> <name><surname>Mendes</surname> <given-names>P.</given-names></name></person-group> (<year>2004</year>). <article-title>Discovery of meaningful associations in genomic data using partial correlation coefficients</article-title>. <source>Bioinformatics</source> <volume>20</volume>, <fpage>3565</fpage>&#x02013;<lpage>3574</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/bth445</pub-id><pub-id pub-id-type="pmid">15284096</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Matos Simoes</surname> <given-names>R.</given-names></name> <name><surname>Emmert-Streib</surname> <given-names>F.</given-names></name></person-group> (<year>2011</year>). <article-title>Influence of statistical estimators of mutual information and data heterogeneity on the inference of gene regulatory networks</article-title>. <source>PLoS ONE</source> <volume>6</volume>, <fpage>e29279</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0029279</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Smet</surname> <given-names>R.</given-names></name> <name><surname>Marchal</surname> <given-names>K.</given-names></name></person-group> (<year>2010</year>). <article-title>Advantages and limitations of current network inference methods</article-title>. <source>Nat. Rev. Microbiol.</source> <volume>8</volume>, <fpage>717</fpage>&#x02013;<lpage>729</lpage>.<pub-id pub-id-type="pmid">20805835</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dean</surname> <given-names>T.</given-names></name> <name><surname>Kanazawa</surname> <given-names>K.</given-names></name></person-group> (<year>1990</year>). <article-title>A model for reasoning about persistence and causation</article-title>. <source>Comput. Intell.</source> <volume>5</volume>, <fpage>142</fpage>&#x02013;<lpage>150</lpage>.<pub-id pub-id-type="doi">10.1111/j.1467-8640.1989.tb00324.x</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dempster</surname> <given-names>A.</given-names></name></person-group> (<year>1972</year>). <article-title>Covariance selection</article-title>. <source>Biometrics</source> <volume>28</volume>, <fpage>157</fpage>&#x02013;<lpage>175</lpage>.<pub-id pub-id-type="doi">10.2307/2528966</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dijkstra</surname> <given-names>E.</given-names></name></person-group> (<year>1959</year>). <article-title>A note on two problems in connection with graphs</article-title>. <source>Numer. Math.</source> <volume>1</volume>, <fpage>269</fpage>&#x02013;<lpage>271</lpage>.<pub-id pub-id-type="doi">10.1007/BF01386390</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dudoit</surname> <given-names>S.</given-names></name> <name><surname>Gilbert</surname> <given-names>H.</given-names></name> <name><surname>van der Laan</surname> <given-names>M.</given-names></name></person-group> (<year>2008</year>). <article-title>Resampling-based empirical bayes multiple testing procedures for controlling generalized tail probability and expected value error rates: focus on the false discovery rate and simulation study</article-title>. <source>Biom. J.</source> <volume>50</volume>, <fpage>716</fpage>&#x02013;<lpage>744</lpage>.<pub-id pub-id-type="doi">10.1002/bimj.200710473</pub-id><pub-id pub-id-type="pmid">18932138</pub-id></citation></ref>
<ref id="B37"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Dudoit</surname> <given-names>S.</given-names></name> <name><surname>van der Laan</surname> <given-names>M.</given-names></name></person-group> (<year>2007</year>). <source>Multiple Testing Procedures with Applications to Genomics</source>. <publisher-loc>New York</publisher-loc>: <publisher-name>Springer</publisher-name>.</citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Emmert-Streib</surname> <given-names>F.</given-names></name></person-group> (<year>2007</year>). <article-title>The chronic fatigue syndrome: a comparative pathway analysis</article-title>. <source>J. Comput. Biol.</source> <volume>14</volume>, <fpage>961</fpage>&#x02013;<lpage>972</lpage>.<pub-id pub-id-type="doi">10.1089/cmb.2007.0041</pub-id><pub-id pub-id-type="pmid">17803373</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Emmert-Streib</surname> <given-names>F.</given-names></name> <name><surname>Altay</surname> <given-names>G.</given-names></name></person-group> (<year>2010</year>). <article-title>Local network-based measures to assess the inferability of different regulatory networks</article-title>. <source>IET Syst. Biol.</source> <volume>4</volume>, <fpage>277</fpage>&#x02013;<lpage>288</lpage>.<pub-id pub-id-type="doi">10.1049/iet-syb.2010.0028</pub-id><pub-id pub-id-type="pmid">20632777</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Emmert-Streib</surname> <given-names>F.</given-names></name> <name><surname>Glazko</surname> <given-names>G.</given-names></name></person-group> (<year>2011</year>). <article-title>Network biology: a direct approach to study biological function</article-title>. <source>Wiley Interdiscip. Rev. Syst. Biol. Med.</source> <volume>3</volume>, <fpage>379</fpage>&#x02013;<lpage>391</lpage>.<pub-id pub-id-type="doi">10.1002/wsbm.134</pub-id><pub-id pub-id-type="pmid">21197659</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ernst</surname> <given-names>J.</given-names></name> <name><surname>Beg</surname> <given-names>Q. K.</given-names></name> <name><surname>Kay</surname> <given-names>K. A.</given-names></name> <name><surname>Balazsi</surname> <given-names>G.</given-names></name> <name><surname>Oltvai</surname> <given-names>Z. N.</given-names></name> <name><surname>Bar-Joseph</surname> <given-names>Z.</given-names></name></person-group> (<year>2008</year>). <article-title>A semi-supervised method for predicting transcription factor-gene interactions in <italic>Escherichia coli</italic></article-title>. <source>PLoS Comput. Biol.</source> <volume>4</volume>, <fpage>e1000044</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pcbi.1000044</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Faith</surname> <given-names>J. J.</given-names></name> <name><surname>Hayete</surname> <given-names>B.</given-names></name> <name><surname>Thaden</surname> <given-names>J. T.</given-names></name> <name><surname>Mogno</surname> <given-names>I.</given-names></name> <name><surname>Wierzbowski</surname> <given-names>J.</given-names></name> <name><surname>Cottarel</surname> <given-names>G.</given-names></name> <name><surname>Kasif</surname> <given-names>S.</given-names></name> <name><surname>Collins</surname> <given-names>J. J.</given-names></name> <name><surname>Gardner</surname> <given-names>T. S.</given-names></name></person-group> (<year>2007</year>). <article-title>Large-scale mapping and validation of <italic>Escherichia coli</italic> transcriptional regulation from a compendium of expression profiles</article-title>. <source>PLoS Biol.</source> <volume>5</volume>, <fpage>8</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pbio.0050008</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fawcett</surname> <given-names>T.</given-names></name></person-group> (<year>2006</year>). <article-title>An introduction to ROC analysis</article-title>. <source>Pattern Recognit. Lett.</source> <volume>27</volume>, <fpage>861</fpage>&#x02013;<lpage>874</lpage>.<pub-id pub-id-type="doi">10.1016/j.patrec.2005.10.012</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Friedman</surname> <given-names>N.</given-names></name> <name><surname>Linial</surname> <given-names>M.</given-names></name> <name><surname>Nachman</surname> <given-names>I.</given-names></name> <name><surname>Pe&#x02019;er</surname> <given-names>D.</given-names></name></person-group> (<year>2000</year>). <article-title>Using Bayesian network to analyze expression data</article-title>. <source>J. Comput. Biol.</source> <volume>7</volume>, <fpage>601</fpage>&#x02013;<lpage>620</lpage>.<pub-id pub-id-type="doi">10.1089/106652700750050961</pub-id><pub-id pub-id-type="pmid">11108481</pub-id></citation></ref>
<ref id="B45"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Friedman</surname> <given-names>N.</given-names></name> <name><surname>Nachman</surname> <given-names>I.</given-names></name> <name><surname>Pe&#x02019;er</surname> <given-names>D.</given-names></name></person-group> (<year>1999</year>). <article-title>&#x0201C;Learning Bayesian network structure from massive datasets: the &#x02018;Sparse Candidate&#x02019; algorithm,&#x0201D;</article-title> in <conf-name>Proceedings of Fifteenth Conference on Uncertainty in Artificial Intelligence (UAI)</conf-name>, (<conf-sponsor>Society for Artificial Intelligence in Statistics</conf-sponsor>), <fpage>206</fpage>&#x02013;<lpage>215</lpage>.</citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fr&#x000F6;hlich</surname> <given-names>H.</given-names></name> <name><surname>Bei&#x000DF;barth</surname> <given-names>T.</given-names></name> <name><surname>Tresch</surname> <given-names>A.</given-names></name> <name><surname>Kostka</surname> <given-names>D.</given-names></name> <name><surname>Jacob</surname> <given-names>J.</given-names></name> <name><surname>Spang</surname> <given-names>R.</given-names></name> <name><surname>Markowetz</surname> <given-names>F.</given-names></name></person-group> (<year>2008</year>). <article-title>Analyzing gene perturbation screens with nested effects models in r and bioconductor</article-title>. <source>Bioinformatics</source> <volume>24</volume>, <fpage>2549</fpage>&#x02013;<lpage>2550</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btn403</pub-id><pub-id pub-id-type="pmid">18718939</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gama-Castro</surname> <given-names>S.</given-names></name> <name><surname>Jimenez-Jacinto</surname> <given-names>V.</given-names></name> <name><surname>Peralta-Gil</surname> <given-names>M.</given-names></name> <name><surname>Santos-Zavaleta</surname> <given-names>A.</given-names></name> <name><surname>Penaloza-Spinola</surname> <given-names>M. I.</given-names></name> <name><surname>Contreras-Moreira</surname> <given-names>B.</given-names></name> <name><surname>Segura-Salazar</surname> <given-names>J.</given-names></name> <name><surname>Mu&#x000F1;iz-Rascado</surname> <given-names>L.</given-names></name> <name><surname>Mart&#x000ED;nez-Flores</surname> <given-names>I.</given-names></name> <name><surname>Salgado</surname> <given-names>H.</given-names></name> <name><surname>Bonavides-Mart&#x000ED;nez</surname> <given-names>C.</given-names></name> <name><surname>Abreu-Goodger</surname> <given-names>C.</given-names></name> <name><surname>Rodr&#x000ED;guez-Penagos</surname> <given-names>C.</given-names></name> <name><surname>Miranda-R&#x000ED;os</surname> <given-names>J.</given-names></name> <name><surname>Morett</surname> <given-names>E.</given-names></name> <name><surname>Merino</surname> <given-names>E.</given-names></name> <name><surname>Huerta</surname> <given-names>A. M.</given-names></name> <name><surname>Trevi&#x000F1;o-Quintanilla</surname> <given-names>L.</given-names></name> <name><surname>Collado-Vides</surname> <given-names>J.</given-names></name></person-group> (<year>2008</year>). <article-title>RegulonDB (version 6.0): gene regulation model of <italic>Escherichia coli</italic> K-12 beyond transcription, active (experimental) annotated promoters and Textpresso navigation</article-title>. <source>Nucleic Acids Res.</source> <volume>36</volume>(<issue>Suppl. 1</issue>), <fpage>D120</fpage>&#x02013;<lpage>D124</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkn491</pub-id><pub-id pub-id-type="pmid">18158297</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Grigoriev</surname> <given-names>A.</given-names></name></person-group> (<year>2001</year>). <article-title>A relationship between gene expression and protein interactions on the proteome scale: analysis of the bacteriophage T7 and the yeast <italic>Saccharomyces cerevisiae</italic></article-title>. <source>Nucleic Acids Res.</source> <volume>29</volume>, <fpage>3513</fpage>&#x02013;<lpage>3519</lpage>.<pub-id pub-id-type="doi">10.1093/nar/29.17.3513</pub-id><pub-id pub-id-type="pmid">11522820</pub-id></citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hache</surname> <given-names>H.</given-names></name> <name><surname>Lehrach</surname> <given-names>H.</given-names></name> <name><surname>Herwig</surname> <given-names>R.</given-names></name></person-group> (<year>2009</year>). <article-title>Reverse engineering of gene regulatory networks: a comparative study</article-title>. <source>EUROSIP J. Bioinform. Syst. Biol.</source> Available at: <uri xlink:href="http://www.hindawi.com/journals/bsb/2009/617281/">http://www.hindawi.com/journals/bsb/2009/617281/</uri></citation></ref>
<ref id="B50"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Hartemink</surname> <given-names>A.</given-names></name> <name><surname>Gifford</surname> <given-names>D.</given-names></name> <name><surname>Jaakkola</surname> <given-names>T.</given-names></name> <name><surname>Young</surname> <given-names>R.</given-names></name></person-group> (<year>2001</year>). <article-title>&#x0201C;Using graphical models and genomic expression data to statistically validate models of genetic regulatory networks,&#x0201D;</article-title> in <source>Pacific Symposium on Biocomputing</source> (<publisher-loc>NJ</publisher-loc>: <publisher-name>World Scientific</publisher-name>), <fpage>422</fpage>&#x02013;<lpage>433</lpage>.</citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Horvath</surname> <given-names>S.</given-names></name> <name><surname>Dong</surname> <given-names>J.</given-names></name></person-group> (<year>2008</year>). <article-title>Geometric interpretation of gene coexpression network analysis</article-title>. <source>PLoS Comput. Biol.</source> <volume>4</volume>, <fpage>e1000117</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pcbi.1000117</pub-id></citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Husmeier</surname> <given-names>D.</given-names></name></person-group> (<year>2003</year>). <article-title>Sensitivity and specificity of inferring genetic regulatory interactions from microarray experiments with dynamic bayesian networks</article-title>. <source>Bioinformatics</source> <volume>19</volume>, <fpage>2271</fpage>&#x02013;<lpage>2282</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btg313</pub-id><pub-id pub-id-type="pmid">14630656</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jansen</surname> <given-names>R.</given-names></name> <name><surname>Greenbaum</surname> <given-names>D.</given-names></name> <name><surname>Gerstein</surname> <given-names>M.</given-names></name></person-group> (<year>2002</year>). <article-title>Relating whole-genome expression data with protein-protein interactions</article-title>. <source>Genome Res.</source> <volume>12</volume>, <fpage>37</fpage>&#x02013;<lpage>46</lpage>.<pub-id pub-id-type="doi">10.1101/gr.205602</pub-id><pub-id pub-id-type="pmid">11779829</pub-id></citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jiang</surname> <given-names>W.</given-names></name> <name><surname>Li</surname> <given-names>X.</given-names></name> <name><surname>Rao</surname> <given-names>S.</given-names></name> <name><surname>Wang</surname> <given-names>L.</given-names></name> <name><surname>Du</surname> <given-names>L.</given-names></name> <name><surname>Li</surname> <given-names>C.</given-names></name> <name><surname>Wu</surname> <given-names>C.</given-names></name> <name><surname>Wang</surname> <given-names>H.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Yang</surname> <given-names>B.</given-names></name></person-group> (<year>2008</year>). <article-title>Constructing disease-specific gene networks using pair-wise relevance metric: application to colon cancer identifies interleukin 8, desmin and enolase 1 as the central elements</article-title>. <source>BMC Syst. Biol.</source> <volume>2</volume>, <fpage>72</fpage>.<pub-id pub-id-type="doi">10.1186/1752-0509-2-72</pub-id></citation></ref>
<ref id="B55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>J&#x000F6;reskog</surname> <given-names>K.</given-names></name></person-group> (<year>1969</year>). <article-title>A general approach to confirmatory maximum likelihood factor analysis</article-title>. <source>Psychometrica</source> <volume>34</volume>, <fpage>183</fpage>&#x02013;<lpage>202</lpage>.<pub-id pub-id-type="doi">10.1007/BF02290173</pub-id></citation></ref>
<ref id="B56"><citation citation-type="book"><person-group person-group-type="author"><name><surname>J&#x000F6;reskog</surname> <given-names>K.</given-names></name></person-group> (<year>1973</year>). <article-title>&#x0201C;A general method for estimating a linear structural equation system,&#x0201D;</article-title> in <source>Structural Equation Models in the Social Sciences</source>, eds <person-group person-group-type="editor"><name><surname>Goldberger</surname> <given-names>A.</given-names></name> <name><surname>Duncan</surname> <given-names>O.</given-names></name></person-group> (<publisher-loc>New York</publisher-loc>: <publisher-name>Seminar Press</publisher-name>), <fpage>85</fpage>&#x02013;<lpage>112</lpage>.</citation></ref>
<ref id="B57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kanehisa</surname> <given-names>M.</given-names></name> <name><surname>Goto</surname> <given-names>S.</given-names></name></person-group> (<year>2000</year>). <article-title>KEGG: kyoto encyclopedia of genes and genomes</article-title>. <source>Nucleic Acids Res.</source> <volume>28</volume>, <fpage>27</fpage>&#x02013;<lpage>30</lpage>.<pub-id pub-id-type="doi">10.1093/nar/28.7.e27</pub-id><pub-id pub-id-type="pmid">10592173</pub-id></citation></ref>
<ref id="B58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Khan</surname> <given-names>S.</given-names></name> <name><surname>Bandyopadhyay</surname> <given-names>S.</given-names></name> <name><surname>Ganguly</surname> <given-names>A.</given-names></name> <name><surname>Saigal</surname> <given-names>S.</given-names></name> <name><surname>Erickson</surname> <given-names>D.</given-names></name> <name><surname>Protopopescu</surname> <given-names>V.</given-names></name> <name><surname>Ostrouchov</surname> <given-names>G.</given-names></name></person-group> (<year>2007</year>). <article-title>Relative performance of mutual information estimation methods for quantifying the dependence among short and noisy data</article-title>. <source>Phys. Rev. E Stat. Nonlin. Soft Matter Phys.</source> <volume>76</volume>, <fpage>026209</fpage>.<pub-id pub-id-type="doi">10.1103/PhysRevE.76.026209</pub-id><pub-id pub-id-type="pmid">17930123</pub-id></citation></ref>
<ref id="B59"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Koller</surname> <given-names>D.</given-names></name> <name><surname>Friedman</surname> <given-names>N.</given-names></name></person-group> (<year>2009</year>). <source>Probabilistic Graphical Models: Principles and Techniques</source>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>.</citation></ref>
<ref id="B60"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kraskov</surname> <given-names>A.</given-names></name> <name><surname>St&#x000F6;gbauer</surname> <given-names>H.</given-names></name> <name><surname>Grassberger</surname> <given-names>P.</given-names></name></person-group> (<year>2004</year>). <article-title>Estimating mutual information</article-title>. <source>Phys. Rev. E</source> <volume>69</volume>, <fpage>066138</fpage>.<pub-id pub-id-type="doi">10.1103/PhysRevE.69.066138</pub-id></citation></ref>
<ref id="B61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lehmann</surname> <given-names>E.</given-names></name> <name><surname>Romano</surname> <given-names>J.</given-names></name></person-group> (<year>2005</year>). <article-title>Generalizations of the familywise error rate</article-title>. <source>Ann. Stat.</source> <volume>33</volume>, <fpage>1138</fpage>&#x02013;<lpage>1154</lpage>.<pub-id pub-id-type="doi">10.1214/009053605000000084</pub-id></citation></ref>
<ref id="B62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Gui</surname> <given-names>J.</given-names></name></person-group> (<year>2006</year>). <article-title>Gradient directed regularization for sparse Gaussian concentration graphs, with applications to inference of genetic networks</article-title>. <source>Biostatistics</source> <volume>7</volume>, <fpage>302</fpage>&#x02013;<lpage>317</lpage>.<pub-id pub-id-type="doi">10.1093/biostatistics/kxj008</pub-id><pub-id pub-id-type="pmid">16326758</pub-id></citation></ref>
<ref id="B63"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liang</surname> <given-names>K.</given-names></name> <name><surname>Wang</surname> <given-names>X.</given-names></name></person-group> (<year>2008</year>). <article-title>Gene regulatory network reconstruction using conditional mutual information</article-title>. <source>EURASIP J. Bioinform. Syst. Biol.</source> <volume>2008</volume>, <fpage>253894</fpage>.<pub-id pub-id-type="doi">10.1155/2008/253894</pub-id></citation></ref>
<ref id="B64"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>W.</given-names></name> <name><surname>L&#x000E4;hdesm&#x000E4;ki</surname> <given-names>H.</given-names></name> <name><surname>Dougherty</surname> <given-names>E.</given-names></name> <name><surname>Shmulevich</surname> <given-names>I.</given-names></name></person-group> (<year>2008</year>). <article-title>Inference of Boolean networks using sensitivity regularization</article-title>. <source>EURASIP J. Bioinform. Syst. Biol.</source> <volume>2008</volume>, <fpage>780541</fpage>.<pub-id pub-id-type="doi">10.1155/2008/780541</pub-id></citation></ref>
<ref id="B65"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Luo</surname> <given-names>W.</given-names></name> <name><surname>Hankenson</surname> <given-names>K. D.</given-names></name> <name><surname>Woolf</surname> <given-names>P. J.</given-names></name></person-group> (<year>2008</year>). <article-title>Learning transcriptional regulatory networks from high throughput gene expression data using continuous three-way mutual information</article-title>. <source>BMC Bioinformatics</source> <volume>9</volume>, <fpage>467</fpage>.<pub-id pub-id-type="doi">10.1186/1471-2105-9-467</pub-id><pub-id pub-id-type="pmid">18980677</pub-id></citation></ref>
<ref id="B66"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Magwene</surname> <given-names>P. M.</given-names></name> <name><surname>Kim</surname> <given-names>J.</given-names></name></person-group> (<year>2004</year>). <article-title>Estimating genomic coexpression networks using first-order conditional independence</article-title>. <source>Genome Biol.</source> <volume>5</volume>, <fpage>R100</fpage>.<pub-id pub-id-type="doi">10.1186/gb-2004-5-12-r100</pub-id><pub-id pub-id-type="pmid">15575966</pub-id></citation></ref>
<ref id="B67"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Margolin</surname> <given-names>A.</given-names></name> <name><surname>Califano</surname> <given-names>A.</given-names></name></person-group> (<year>2007</year>). <article-title>Theory and limitations of genetic network inference from microarray data</article-title>. <source>Ann. N. Y. Acad. Sci.</source> <volume>1115</volume>, <fpage>51</fpage>&#x02013;<lpage>72</lpage>.<pub-id pub-id-type="doi">10.1196/annals.1407.019</pub-id><pub-id pub-id-type="pmid">17925348</pub-id></citation></ref>
<ref id="B68"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Margolin</surname> <given-names>A.</given-names></name> <name><surname>Nemenman</surname> <given-names>I.</given-names></name> <name><surname>Basso</surname> <given-names>K.</given-names></name> <name><surname>Wiggins</surname> <given-names>C.</given-names></name> <name><surname>Stolovitzky</surname> <given-names>G.</given-names></name> <name><surname>Dalla Favera</surname> <given-names>R.</given-names></name> <name><surname>Califano</surname> <given-names>A.</given-names></name></person-group> (<year>2006</year>). <article-title>ARACNE: an algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context</article-title>. <source>BMC Bioinformatics</source> <volume>7</volume>, <fpage>S7</fpage>.<pub-id pub-id-type="doi">10.1186/1471-2105-7-S1-S7</pub-id></citation></ref>
<ref id="B69"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Markowetz</surname> <given-names>F.</given-names></name></person-group> (<year>2010</year>). <article-title>How to understand the cell by breaking it: network analysis of gene perturbation screens</article-title>. <source>PLoS Comput. Biol.</source> <volume>6</volume>, <fpage>e1000655</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pcbi.1000655</pub-id></citation></ref>
<ref id="B70"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Markowetz</surname> <given-names>F.</given-names></name> <name><surname>Spang</surname> <given-names>R.</given-names></name></person-group> (<year>2007</year>). <article-title>Inferring cellular networks&#x02013;a review</article-title>. <source>BMC Bioinformatics</source> <volume>8</volume>, <fpage>S5</fpage>.<pub-id pub-id-type="doi">10.1186/1471-2105-8-S6-S5</pub-id></citation></ref>
<ref id="B71"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McGill</surname> <given-names>W. J.</given-names></name></person-group> (<year>1954</year>). <article-title>Multivariate information transmission</article-title>. <source>Psychometrika</source> <volume>19</volume>, <fpage>97</fpage>&#x02013;<lpage>116</lpage>.<pub-id pub-id-type="doi">10.1007/BF02289159</pub-id></citation></ref>
<ref id="B72"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mewes</surname> <given-names>H. W.</given-names></name> <name><surname>Frishman</surname> <given-names>D.</given-names></name> <name><surname>Guldener</surname> <given-names>U.</given-names></name> <name><surname>Mannhaupt</surname> <given-names>G.</given-names></name> <name><surname>Mayer</surname> <given-names>K.</given-names></name> <name><surname>Mokrejs</surname> <given-names>M.</given-names></name> <name><surname>Morgenstern</surname> <given-names>B.</given-names></name> <name><surname>M&#x000FC;nsterk&#x000F6;tter</surname> <given-names>M.</given-names></name> <name><surname>Rudd</surname> <given-names>S.</given-names></name> <name><surname>Weil</surname> <given-names>B.</given-names></name></person-group> (<year>2002</year>). <article-title>MIPS: a database for genomes and protein sequences</article-title>. <source>Nucleic Acids Res.</source> <volume>30</volume>, <fpage>31</fpage>&#x02013;<lpage>34</lpage>.<pub-id pub-id-type="doi">10.1093/nar/30.1.31</pub-id><pub-id pub-id-type="pmid">11752246</pub-id></citation></ref>
<ref id="B73"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meyer</surname> <given-names>P. E.</given-names></name> <name><surname>Kontos</surname> <given-names>K.</given-names></name> <name><surname>Latiffe</surname> <given-names>F.</given-names></name> <name><surname>Bontempi</surname> <given-names>G.</given-names></name></person-group> (<year>2007</year>). <article-title>Information-theoretic inference of large transcriptional regulatory networks</article-title>. <source>EUROSIP J. Bioinform. Syst. Biol.</source> 2007, Article ID 79879.</citation></ref>
<ref id="B74"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mordelet</surname> <given-names>F.</given-names></name> <name><surname>Vert</surname> <given-names>J.-P.</given-names></name></person-group> (<year>2008</year>). <article-title>SIRENE: supervised inference of regulatory networks</article-title>. <source>Bioinformatics</source> <volume>24</volume>, <fpage>i76</fpage>&#x02013;<lpage>i82</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btn273</pub-id><pub-id pub-id-type="pmid">18689844</pub-id></citation></ref>
<ref id="B75"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nariai</surname> <given-names>N.</given-names></name> <name><surname>Tamada</surname> <given-names>Y.</given-names></name> <name><surname>Imoto</surname> <given-names>S.</given-names></name> <name><surname>Miyano</surname> <given-names>S.</given-names></name></person-group> (<year>2005</year>). <article-title>Estimating gene regulatory networks and protein-protein interactions of <italic>Saccharomyces cerevisiae</italic> from multiple genome-wide data</article-title>. <source>Bioinformatics</source> <volume>21</volume>(<issue>Suppl. 2</issue>), <fpage>206</fpage>&#x02013;<lpage>212</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/bti1133</pub-id></citation></ref>
<ref id="B76"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Olsen</surname> <given-names>C.</given-names></name> <name><surname>Meyer</surname> <given-names>P.</given-names></name> <name><surname>Bontempi</surname> <given-names>G.</given-names></name></person-group> (<year>2009</year>). <article-title>On the impact of entropy estimator in transcriptional regulatory network inference</article-title>. <source>EURASIP J. Bioinform. Syst. Biol.</source> <volume>2009</volume>, <fpage>308959</fpage>.<pub-id pub-id-type="doi">10.1155/2009/308959</pub-id></citation></ref>
<ref id="B77"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Pearl</surname> <given-names>J.</given-names></name></person-group> (<year>1988</year>). <source>Probabilistic Reasoning in Intelligent Systems</source>. <publisher-loc>San Francisco</publisher-loc>: <publisher-name>Morgan-Kaufmann</publisher-name>.</citation></ref>
<ref id="B78"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Pearl</surname> <given-names>J.</given-names></name></person-group> (<year>2000</year>). <source>Causality: Models, Reasoning, and Inference</source>. <publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>.</citation></ref>
<ref id="B79"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Penfold</surname> <given-names>C. A.</given-names></name> <name><surname>Wild</surname> <given-names>D. L.</given-names></name></person-group> (<year>2011</year>). <article-title>How to infer gene networks from expression profiles, revisited</article-title>. <source>Interface Focus</source> <volume>1</volume>, <fpage>857</fpage>&#x02013;<lpage>870</lpage>.<pub-id pub-id-type="doi">10.1098/rsfs.2011.0053</pub-id></citation></ref>
<ref id="B80"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Perrin</surname> <given-names>B.-E.</given-names></name> <name><surname>Ralaivola</surname> <given-names>L.</given-names></name> <name><surname>Mazurie</surname> <given-names>A.</given-names></name> <name><surname>Bottani</surname> <given-names>S.</given-names></name> <name><surname>Mallet</surname> <given-names>J.</given-names></name> <name><surname>d&#x02019;Alche Buc</surname> <given-names>F.</given-names></name></person-group> (<year>2003</year>). <article-title>Gene networks inference using dynamic Bayesian networks</article-title>. <source>Bioinformatics</source> <volume>19</volume>, <fpage>ii138</fpage>&#x02013;<lpage>ii148</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btg1071</pub-id><pub-id pub-id-type="pmid">14534183</pub-id></citation></ref>
<ref id="B81"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pinna</surname> <given-names>A.</given-names></name> <name><surname>Soranzo</surname> <given-names>N.</given-names></name> <name><surname>de la Fuente</surname> <given-names>A.</given-names></name></person-group> (<year>2010</year>). <article-title>From knockouts to networks: establishing direct cause-effect relationships through graph analysis</article-title>. <source>PLoS ONE</source> <volume>5</volume>, <fpage>e12912</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0012912</pub-id></citation></ref>
<ref id="B82"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Rosenbaum</surname> <given-names>P.</given-names></name></person-group> (<year>2002</year>). <source>Observational Studies</source>. <publisher-loc>New York</publisher-loc>: <publisher-name>Springer</publisher-name>.</citation></ref>
<ref id="B83"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rubin</surname> <given-names>R.</given-names></name></person-group> (<year>1974</year>). <article-title>Estimating causal effects of treatments in randomized and nonrandomized studies</article-title>. <source>J. Educ. Psychol.</source> <volume>66</volume>, <fpage>688</fpage>&#x02013;<lpage>701</lpage>.<pub-id pub-id-type="doi">10.1037/h0037350</pub-id></citation></ref>
<ref id="B84"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schadt</surname> <given-names>E.</given-names></name></person-group> (<year>2009</year>). <article-title>Molecular networks as sensors and drivers of common human diseases</article-title>. <source>Nature</source> <volume>461</volume>, <fpage>218</fpage>&#x02013;<lpage>223</lpage>.<pub-id pub-id-type="doi">10.1038/nature08454</pub-id><pub-id pub-id-type="pmid">19741703</pub-id></citation></ref>
<ref id="B85"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sch&#x000E4;fer</surname> <given-names>J.</given-names></name> <name><surname>Strimmer</surname> <given-names>K.</given-names></name></person-group> (<year>2005</year>). <article-title>An empirical bayes approach to inferring large-scale gene association networks</article-title>. <source>Bioinformatics</source> <volume>21</volume>, <fpage>754</fpage>&#x02013;<lpage>764</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/bti062</pub-id><pub-id pub-id-type="pmid">15479708</pub-id></citation></ref>
<ref id="B86"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Shipley</surname> <given-names>B.</given-names></name></person-group> (<year>2000</year>). <source>Cause and Correlation in Biology</source>. <publisher-loc>Cambridge, NY</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>.</citation></ref>
<ref id="B87"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Soranzo</surname> <given-names>N.</given-names></name> <name><surname>Bianconi</surname> <given-names>G.</given-names></name> <name><surname>Altafini</surname> <given-names>C.</given-names></name></person-group> (<year>2007</year>). <article-title>Comparing association network algorithms for reverse engineering of large-scale gene regulatory networks: synthetic versus real data</article-title>. <source>Bioinformatics</source> <volume>23</volume>, <fpage>1640</fpage>&#x02013;<lpage>1647</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btm163</pub-id><pub-id pub-id-type="pmid">17485431</pub-id></citation></ref>
<ref id="B88"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Spirtes</surname> <given-names>P.</given-names></name> <name><surname>Glymour</surname> <given-names>C.</given-names></name> <name><surname>Scheines</surname> <given-names>R.</given-names></name></person-group> (<year>1993</year>). <source>Causation, Prediction, and Search</source>. <publisher-loc>New York</publisher-loc>: <publisher-name>Springer</publisher-name>.</citation></ref>
<ref id="B89"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Steuer</surname> <given-names>R.</given-names></name> <name><surname>Kurths</surname> <given-names>J.</given-names></name> <name><surname>Daub</surname> <given-names>C. O.</given-names></name> <name><surname>Weise</surname> <given-names>J.</given-names></name> <name><surname>Selbig</surname> <given-names>J.</given-names></name></person-group> (<year>2002</year>). <article-title>The mutual information: detecting and evaluating dependencies between variables</article-title>. <source>Bioinformatics</source> <volume>18</volume>, <fpage>S231</fpage>&#x02013;<lpage>S240</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/18.suppl_2.S231</pub-id><pub-id pub-id-type="pmid">12386007</pub-id></citation></ref>
<ref id="B90"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Stolovitzky</surname> <given-names>G.</given-names></name> <name><surname>Califano</surname> <given-names>A.</given-names></name></person-group> (eds). (<year>2007</year>). <source>Reverse Engineering Biological Networks: Opportunities and Challenges in Computational Methods for Pathway Inference</source>. <publisher-loc>Boston</publisher-loc>: <publisher-name>Wiley-Blackwell</publisher-name>.</citation></ref>
<ref id="B91"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stolovitzky</surname> <given-names>G.</given-names></name> <name><surname>Prill</surname> <given-names>R.</given-names></name> <name><surname>Califano</surname> <given-names>A.</given-names></name></person-group> (<year>2009</year>). <article-title>Lessons from the DREAM 2 challenges</article-title>. <source>Ann. N. Y. Acad. Sci.</source> <volume>1158</volume>, <fpage>159</fpage>&#x02013;<lpage>195</lpage>.<pub-id pub-id-type="doi">10.1111/j.1749-6632.2009.04497.x</pub-id><pub-id pub-id-type="pmid">19348640</pub-id></citation></ref>
<ref id="B92"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Storey</surname> <given-names>J.</given-names></name> <name><surname>Tibshirani</surname> <given-names>R.</given-names></name></person-group> (<year>2003</year>). <article-title>Statistical significance for genomewide studies</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>100</volume>, <fpage>9440</fpage>&#x02013;<lpage>9445</lpage>.<pub-id pub-id-type="doi">10.1073/pnas.1530509100</pub-id><pub-id pub-id-type="pmid">12883005</pub-id></citation></ref>
<ref id="B93"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Verma</surname> <given-names>T.</given-names></name> <name><surname>Pearl</surname> <given-names>J.</given-names></name></person-group> (<year>1990</year>). <article-title>&#x0201C;Causal networks: semantics and expressiveness,&#x0201D;</article-title> in <conf-name>Proceedings of the 4th Workshop on Uncertainly in Artificial Intelligence</conf-name>, <conf-loc>Mountain View, CA</conf-loc>, <fpage>352</fpage>&#x02013;<lpage>359</lpage>.</citation></ref>
<ref id="B94"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Verma</surname> <given-names>T.</given-names></name> <name><surname>Pearl</surname> <given-names>J.</given-names></name></person-group> (<year>1991</year>). <article-title>&#x0201C;Equivalence and synthesis of causal models,&#x0201D;</article-title> in <conf-name>Proceedings of the 6th Workshop on Uncertainty in Artificial Intelligence</conf-name>, <conf-loc>Cambridge, MA</conf-loc>, <fpage>220</fpage>&#x02013;<lpage>227</lpage>.</citation></ref>
<ref id="B95"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Watkinson</surname> <given-names>J.</given-names></name> <name><surname>Liang</surname> <given-names>K.</given-names></name> <name><surname>Wang</surname> <given-names>X.</given-names></name> <name><surname>Zheng</surname> <given-names>T.</given-names></name> <name><surname>Anastassiou</surname> <given-names>D.</given-names></name></person-group> (<year>2009</year>). <article-title>Inference of regulatory gene interactions from expression data using three-way mutual information</article-title>. <source>Ann. N. Y. Acad. Sci.</source> <volume>1158</volume>, <fpage>302</fpage>&#x02013;<lpage>313</lpage>.<pub-id pub-id-type="doi">10.1111/j.1749-6632.2008.03757.x</pub-id><pub-id pub-id-type="pmid">19348651</pub-id></citation></ref>
<ref id="B96"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Werhli</surname> <given-names>A.</given-names></name> <name><surname>Grzegorczyk</surname> <given-names>M.</given-names></name> <name><surname>Husmeier</surname> <given-names>D.</given-names></name></person-group> (<year>2006</year>). <article-title>Comparative evaluation of reverse engineering gene regulatory networks with relevance networks, graphical Gaussian models and Bayesian networks</article-title>. <source>Bioinformatics</source> <volume>22</volume>, <fpage>2523</fpage>&#x02013;<lpage>2531</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btl391</pub-id><pub-id pub-id-type="pmid">16844710</pub-id></citation></ref>
<ref id="B97"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Whittaker</surname> <given-names>J.</given-names></name></person-group> (<year>1990</year>). <source>Graphical Models in Applied Multivariate Statistics</source>. <publisher-loc>Chichester</publisher-loc>: <publisher-name>John Wiley &#x00026; Sons</publisher-name>.</citation></ref>
<ref id="B98"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wille</surname> <given-names>A.</given-names></name> <name><surname>B&#x000FC;hlmann</surname> <given-names>P.</given-names></name></person-group> (<year>2006</year>). <article-title>Low-order conditional independence graphs for inferring genetic networks</article-title>. <source>Stat. Appl. Genet. Mol. Biol.</source> <volume>4</volume>, <fpage>32</fpage>.</citation></ref>
<ref id="B99"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wille</surname> <given-names>A.</given-names></name> <name><surname>Zimmermann</surname> <given-names>P.</given-names></name> <name><surname>Vranova</surname> <given-names>E.</given-names></name> <name><surname>Furholz</surname> <given-names>A.</given-names></name> <name><surname>Laule</surname> <given-names>O.</given-names></name> <name><surname>Bleuler</surname> <given-names>S.</given-names></name> <name><surname>Hennig</surname> <given-names>L.</given-names></name> <name><surname>Prelic</surname> <given-names>A.</given-names></name> <name><surname>von Rohr</surname> <given-names>P.</given-names></name> <name><surname>Thiele</surname> <given-names>L.</given-names></name> <name><surname>Zitzler</surname> <given-names>E.</given-names></name> <name><surname>Gruissem</surname> <given-names>W.</given-names></name> <name><surname>Buhlmann</surname> <given-names>P.</given-names></name></person-group> (<year>2004</year>). <article-title>Sparse graphical gaussian modeling of the isoprenoid gene network in <italic>Arabidopsis thaliana</italic></article-title>. <source>Genome Biol.</source> <volume>5</volume>, <fpage>R92</fpage>.<pub-id pub-id-type="doi">10.1186/gb-2004-5-11-r92</pub-id><pub-id pub-id-type="pmid">15535868</pub-id></citation></ref>
<ref id="B100"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wolfe</surname> <given-names>C.</given-names></name> <name><surname>Kohane</surname> <given-names>I.</given-names></name> <name><surname>Butte</surname> <given-names>A.</given-names></name></person-group> (<year>2005</year>). <article-title>Systematic survey reveals general applicability of &#x0201C;guilt-by-association&#x0201D; within gene coexpression networks</article-title>. <source>BMC Bioinformatics</source> <volume>6</volume>, <fpage>227</fpage>.<pub-id pub-id-type="doi">10.1186/1471-2105-6-227</pub-id></citation></ref>
<ref id="B101"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wright</surname> <given-names>S.</given-names></name></person-group> (<year>1921</year>). <article-title>Correlation of causation</article-title>. <source>J. Agric. Res.</source> <volume>20</volume>, <fpage>557</fpage>&#x02013;<lpage>585</lpage>.</citation></ref>
<ref id="B102"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wright</surname> <given-names>S.</given-names></name></person-group> (<year>1934</year>). <article-title>The method of path coefficients</article-title>. <source>Ann. Math. Stat.</source> <volume>5</volume>, <fpage>161</fpage>&#x02013;<lpage>215</lpage>.<pub-id pub-id-type="doi">10.1214/aoms/1177732676</pub-id></citation></ref>
<ref id="B103"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xing</surname> <given-names>B.</given-names></name> <name><surname>van der Laan</surname> <given-names>M.</given-names></name></person-group> (<year>2005</year>). <article-title>A causal inference approach for constructing transcriptional regulatory networks</article-title>. <source>Bioinformatics</source> <volume>21</volume>, <fpage>4007</fpage>&#x02013;<lpage>4013</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/bti648</pub-id><pub-id pub-id-type="pmid">16131521</pub-id></citation></ref>
<ref id="B104"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xiong</surname> <given-names>M.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Fang</surname> <given-names>X.</given-names></name></person-group> (<year>2004</year>). <article-title>Identification of genetic networks</article-title>. <source>Genetics</source> <volume>166</volume>, <fpage>1037</fpage>&#x02013;<lpage>1052</lpage>.<pub-id pub-id-type="doi">10.1534/genetics.166.2.1037</pub-id><pub-id pub-id-type="pmid">15020486</pub-id></citation></ref>
<ref id="B105"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yip</surname> <given-names>K. Y.</given-names></name> <name><surname>Alexander</surname> <given-names>R. P.</given-names></name> <name><surname>Yan</surname> <given-names>K.-K.</given-names></name> <name><surname>Gerstein</surname> <given-names>M.</given-names></name></person-group> (<year>2010</year>). <article-title>Improved reconstruction of in silico gene regulatory networks by integrating knockout and perturbation data</article-title>. <source>PLoS ONE</source> <volume>5</volume>, <fpage>e8121</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0008121</pub-id></citation></ref>
<ref id="B106"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>B.</given-names></name> <name><surname>Horvath</surname> <given-names>S.</given-names></name></person-group> (<year>2005</year>). <article-title>A general framework for weighted gene co-expression network analysis</article-title>. <source>Stat. Appl. Genet. Mol. Biol.</source> <volume>4</volume>, <fpage>17</fpage>.</citation></ref>
<ref id="B107"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Zhao</surname> <given-names>X.-M.</given-names></name> <name><surname>He</surname> <given-names>K.</given-names></name> <name><surname>Lu</surname> <given-names>L.</given-names></name> <name><surname>Cao</surname> <given-names>Y.</given-names></name> <name><surname>Liu</surname> <given-names>J.</given-names></name> <name><surname>Hao</surname> <given-names>J.-K.</given-names></name> <name><surname>Liu</surname> <given-names>Z.-P.</given-names></name> <name><surname>Chen</surname> <given-names>L.</given-names></name></person-group> (<year>2011</year>). <article-title>Inferring gene regulatory networks from gene expression data by PC-algorithm based on conditional mutual information</article-title>. <source>Bioinformatics</source>. <uri xlink:href="http://www.ncbi.nlm.nih.gov/pubmed/22088843">http://www.ncbi.nlm.nih.gov/pubmed/22088843</uri></citation></ref>
<ref id="B108"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhao</surname> <given-names>W.</given-names></name> <name><surname>Serpendin</surname> <given-names>E.</given-names></name> <name><surname>Dougherty</surname> <given-names>E. R.</given-names></name></person-group> (<year>2008</year>). <article-title>Inferring connectivity of genetic regulatory networks using information-theoretic criteria</article-title>. <source>IEEE/ACM Trans. Comput. Biol. Bioinform.</source> <volume>5</volume>, <fpage>262</fpage>&#x02013;<lpage>274</lpage>.<pub-id pub-id-type="doi">10.1109/TCBB.2007.1067</pub-id><pub-id pub-id-type="pmid">18451435</pub-id></citation></ref>
<ref id="B109"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>X.</given-names></name> <name><surname>Kao</surname> <given-names>M.</given-names></name> <name><surname>Wong</surname> <given-names>W.</given-names></name></person-group> (<year>2002</year>). <article-title>Transitive functional annotation by shortest-path analysis of gene expression data</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>99</volume>, <fpage>12783</fpage>&#x02013;<lpage>12788</lpage>.<pub-id pub-id-type="doi">10.1073/pnas.212392399</pub-id><pub-id pub-id-type="pmid">12196633</pub-id></citation></ref>
<ref id="B110"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zou</surname> <given-names>M.</given-names></name> <name><surname>Conzen</surname> <given-names>S. D.</given-names></name></person-group> (<year>2005</year>). <article-title>A new dynamic Bayesian network (DBN) approach for identifying gene regulatory networks from time course microarray data</article-title>. <source>Bioinformatics</source> <volume>21</volume>, <fpage>71</fpage>&#x02013;<lpage>79</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/bth463</pub-id><pub-id pub-id-type="pmid">15308537</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="fn1"><p><sup>1</sup>The details of the randomization are different to that used by RN allowing now to make statistical statements.</p></fn>
<fn id="fn2"><p><sup>2</sup>We want to remark that <italic>I</italic>(<italic>X</italic>; <italic>Y</italic>; <italic>Z</italic>) is symmetric in the three variables.</p></fn>
<fn id="fn3"><p><sup>3</sup>For example, the mutual information threshold <italic>I</italic><sub>0</sub>, used in the CLR algorithm.</p></fn>
</fn-group>
</back>
</article>
