<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fgene.2017.00096</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Quantifying Gene Regulatory Relationships with Association Measures: A Comparative Study</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Liu</surname> <given-names>Zhi-Ping</given-names></name>
<xref ref-type="author-notes" rid="fn001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/126999/overview"/>
</contrib>
</contrib-group>
<aff><institution>Department of Biomedical Engineering, School of Control Science and Engineering, Shandong University</institution> <country>Jinan, China</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Shihua Zhang, Academy of Mathematics and Systems Science (CAS), China</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Xingming Zhao, Tongji University, China; Lin Gao, Xidian University, China</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Zhi-Ping Liu <email>zpliu&#x00040;sdu.edu.cn</email></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Bioinformatics and Computational Biology, a section of the journal Frontiers in Genetics</p></fn></author-notes>
<pub-date pub-type="epub">
<day>13</day>
<month>07</month>
<year>2017</year>
</pub-date>
<pub-date pub-type="collection">
<year>2017</year>
</pub-date>
<volume>8</volume>
<elocation-id>96</elocation-id>
<history>
<date date-type="received">
<day>18</day>
<month>04</month>
<year>2017</year>
</date>
<date date-type="accepted">
<day>28</day>
<month>06</month>
<year>2017</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2017 Liu.</copyright-statement>
<copyright-year>2017</copyright-year>
<copyright-holder>Liu</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract><p>In this work, we provide a comparative study of the main available association measures for characterizing gene regulatory strengths. Detecting the association between genes (as well as RNAs, proteins, and other molecules) is very important to decipher their functional relationship from genomic data in bioinformatics. With the availability of more and more high-throughput datasets, the quantification of meaningful relationships by employing association measures will make great sense of the data. There are various quantitative measures have been proposed for identifying molecular associations. They are depended on different statistical assumptions, for different intentions, as well as with different computational costs in calculating the associations in thousands of genes. Here, we comprehensively summarize these association measures employed and developed for describing gene regulatory relationships. We compare these measures in their consistency and specificity of detecting gene regulations from both simulation and real gene expression profiling data. Obviously, these measures used in genes can be easily extended in other biological molecules or across them.</p></abstract>
<kwd-group>
<kwd>gene regulatory network</kwd>
<kwd>gene coexpression</kwd>
<kwd>association measure</kwd>
<kwd>high-throughput data</kwd>
<kwd>bioinformatics</kwd>
</kwd-group>
<contract-num rid="cn001">61572287</contract-num>
<contract-num rid="cn001">61533011</contract-num>
<contract-num rid="cn002">ZR2015FQ001</contract-num>
<contract-sponsor id="cn001">National Natural Science Foundation of China<named-content content-type="fundref-id">10.13039/501100001809</named-content></contract-sponsor>
<contract-sponsor id="cn002">Natural Science Foundation of Shandong Province<named-content content-type="fundref-id">10.13039/501100007129</named-content></contract-sponsor>
<counts>
<fig-count count="4"/>
<table-count count="2"/>
<equation-count count="25"/>
<ref-count count="60"/>
<page-count count="12"/>
<word-count count="8276"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>The high-throughput technologies, such as microarray (Schena et al., <xref ref-type="bibr" rid="B40">1995</xref>) and RNA-Seq (Wang et al., <xref ref-type="bibr" rid="B49">2009</xref>) in transcriptomic level, generate bunch of data of describing various perspectives of cell state. These data provide unprecedented opportunity to quantify molecular expressions and their relationships. From a systematic perspective, the molecules in a cell orchestrate together to form various integrated and condense network systems of performing comprehensive functions (Liu, <xref ref-type="bibr" rid="B25">2015</xref>). For instance, transcriptional interactions between transcription factor (TF) and target genes are often formulated into gene regulatory network of modeling biological processes (Liu et al., <xref ref-type="bibr" rid="B28">2014</xref>, <xref ref-type="bibr" rid="B27">2015</xref>). Deciphering gene relationships from high-throughput data are crucial to reversely engineer their inner interaction scenarios, as well as profoundly reveal the dysfunctions in certain disorders, such as complex diseases (Liu et al., <xref ref-type="bibr" rid="B26">2012</xref>).</p>
<p>Quantifying the relationship between molecular components becomes fundamental in the new research paradigm from data to knowledge. The data analysis techniques of association support the kind of investigation. Traditionally, when we explore the relationship between two variables, Pearson&#x00027;s correlation coefficient (PCC) is employed to qualify their linear relationship (Zou et al., <xref ref-type="bibr" rid="B60">2003</xref>). From entropy aspects, mutual information (MI) is often used for defining the non-linear relationship between gene variables (Butte and Kohane, <xref ref-type="bibr" rid="B10">2000</xref>). Mathematically, the assumptions underlying these measures are considerable in real applications. Association measures have been developed to meet the requirements of appropriateness and precision in defining relationships from various perspectives.</p>
<p>Detecting gene associations is a fundamental method to reconstruct gene regulatory network from gene expression profiling data (Liu, <xref ref-type="bibr" rid="B25">2015</xref>). Although more integrated methods such as ordinary differential equations are available to model the differential dynamics among genes, the association-based methods are direct, simple, and easy for interpretation as well. With introducing the independence, these measures have been extended to quantify the associations between many genes simultaneously (Stuart et al., <xref ref-type="bibr" rid="B45">2003</xref>). In typical microarray experiments, the gene expression data can often be represented by matrix <bold>G</bold>,</p>
<disp-formula id="E1"><mml:math id="M1"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>G</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mo stretchy="true">(</mml:mo><mml:mrow><mml:mtable style="text-align:axis;" equalrows="false" columnlines="" equalcolumns="false" class="array"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>G</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>G</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mo>&#x022EE;</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>G</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr></mml:mtable></mml:mrow><mml:mo stretchy="true">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:mo stretchy="true">(</mml:mo><mml:mrow><mml:mtable style="text-align:axis;" equalrows="false" columnlines="none none none none" equalcolumns="false" class="array"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>G</mml:mi></mml:mrow><mml:mrow><mml:mn>11</mml:mn></mml:mrow></mml:msub></mml:mtd><mml:mtd><mml:mo>&#x02026;</mml:mo></mml:mtd><mml:mtd><mml:msub><mml:mrow><mml:mi>G</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mtd><mml:mtd><mml:mo>&#x02026;</mml:mo></mml:mtd><mml:mtd><mml:msub><mml:mrow><mml:mi>G</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mo>&#x022EE;</mml:mo></mml:mtd><mml:mtd><mml:mo>&#x022F1;</mml:mo></mml:mtd><mml:mtd><mml:mo>&#x022EE;</mml:mo></mml:mtd><mml:mtd><mml:mo>&#x022F1;</mml:mo></mml:mtd><mml:mtd><mml:mo>&#x022EE;</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>G</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mtd><mml:mtd><mml:mo>&#x02026;</mml:mo></mml:mtd><mml:mtd><mml:msub><mml:mrow><mml:mi>G</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mtd><mml:mtd><mml:mo>&#x02026;</mml:mo></mml:mtd><mml:mtd><mml:msub><mml:mrow><mml:mi>G</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mo>&#x022EE;</mml:mo></mml:mtd><mml:mtd><mml:mo>&#x022F1;</mml:mo></mml:mtd><mml:mtd><mml:mo>&#x022EE;</mml:mo></mml:mtd><mml:mtd><mml:mo>&#x022F1;</mml:mo></mml:mtd><mml:mtd><mml:mo>&#x022EE;</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>G</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mtd><mml:mtd><mml:mo>&#x02026;</mml:mo></mml:mtd><mml:mtd><mml:msub><mml:mrow><mml:mi>G</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mtd><mml:mtd><mml:mo>&#x02026;</mml:mo></mml:mtd><mml:mtd><mml:msub><mml:mrow><mml:mi>G</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr></mml:mtable></mml:mrow><mml:mo stretchy="true">)</mml:mo></mml:mrow><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Where <italic>G</italic><sub><italic>ij</italic></sub> represents the gene expression value of the <italic>i</italic>-th gene (1 &#x02264; <italic>i</italic> &#x02264; <italic>m</italic>) in the <italic>j</italic>-th experiment (1 &#x02264; <italic>j</italic> &#x02264; <italic>n</italic>). It is noted that <italic>j</italic> refers to a sample or a time point with specific phenotype meaning. The association between gene <italic>X</italic> and gene <italic>Y</italic> (<italic>X, Y</italic> &#x02208; {<italic>G</italic><sub>1</sub>, <italic>G</italic><sub>2</sub>, &#x022EF;&#x000A0;, <italic>G</italic><sub><italic>m</italic></sub>}) is often to indicate their regulatory relationship (Zhang and Horvath, <xref ref-type="bibr" rid="B54">2005</xref>). Let gene expressions be <italic>X</italic> &#x0003D; (<italic>X</italic><sub>1</sub>, <italic>X</italic><sub>2</sub>, &#x02026;, <italic>X</italic><sub><italic>n</italic></sub>) and <italic>Y</italic> &#x0003D; (<italic>Y</italic><sub>1</sub>, <italic>Y</italic><sub>2</sub>, &#x02026;, <italic>Y</italic><sub><italic>n</italic></sub>). Based on the two vectors, we employ or define an association measure to assess their regulatory strength. Recently, some novel measures besides PCC and MI have been proposed to define the association between two variables (Reshef et al., <xref ref-type="bibr" rid="B37">2011</xref>). It is of great interest to investigate their performances in the reconstruction of gene regulatory network from gene expression data. Figure <xref ref-type="fig" rid="F1">1</xref> demonstrates the strategy of inferring gene regulatory network by gene coexpression analysis. Gene regulation, in a particular form of transcriptional regulation, often specifies the regulation from TF to target gene. The quantified gene coexpression evaluates the simultaneous patterns of two gene&#x00027;s redundancy across samples. The expression level of upstream TF&#x00027;s gene is often to approximate its downstream protein product. As shown in Figure <xref ref-type="fig" rid="F1">1C</xref>, if we set up which ones are TFs by prior knowledge in the gene association network, we can infer a directed gene regulatory network via an undirected association measure.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>The strategy of building gene coexpression-based regulatory network from gene expression data. <bold>(A)</bold> The gene expression patterns of <italic>m</italic> genes in <italic>n</italic> samples. <bold>(B)</bold> The gene coexpression patterns quantified by association measure. <bold>(C)</bold> With some prior knowledge of TFs, the gene coexpression relationships can be improved to be a gene regulatory network.</p></caption>
<graphic xlink:href="fgene-08-00096-g0001.tif"/>
</fig>
<p>The coexpression pattern between two genes implies their regulatory aspects. As shown in Figure <xref ref-type="fig" rid="F1">1C</xref>, it firstly indicates a direct regulatory interaction. In some biological state, gene coexpression exactly responds to the activation or inhibition regulation from a TF to its target gene. The regulation between them is reflected by their highly-related gene expression redundancy. Secondly, gene coexpression is about gene co-regulation. That is to imply the two genes are regulated by the same TF(s) and then they contain highly-related gene expression patterns. Third means that the two genes are functionally-related by participating in the same regulatory circuit or particular signaling pathway. Generally, the dynamic regulations in a cell are inherently embedded with temporal features. Gene regulation is often reflected by time-delayed gene expression patterns from the activation of TF&#x00027;s gene to the downstream target responds (Bar-Joseph et al., <xref ref-type="bibr" rid="B5">2012</xref>). For the simplicity of association measure, the coexpression-based methods are popular in inferring gene regulatory network from gene expression data (Zhang and Horvath, <xref ref-type="bibr" rid="B54">2005</xref>).</p>
<p>In this paper, we provide a comparative study on these available association measures of quantifying gene relationships in regulatory network. Fourteen most-popular association measures or indices will be summarized and compared. Based on some benchmark datasets of gene regulatory network inference challenges, we evaluate their individual performances in the reconstruction of gene regulatory networks. This provides a concise comparison of accuracy and quality in network inference by the association measures. In a case study, we compare the differences of these inferred regulations during the infection of hepatitis C virus on host cells. In data-driven network inference, the characteristics of the association measures in statistics and computations are also analyzed and discussed.</p>
</sec>
<sec id="s2">
<title>Association measures</title>
<p>Numerous association measures have been proposed to define the relationship between two random variables. For gene regulations, we collect 14 of them for our assessments of network inference power from data. Table <xref ref-type="table" rid="T1">1</xref> lists the 14 association measures with brief introduction of their statistical assumptions and fundamental properties individually. Some measures are well-known such as PCC, while some become available recently such as maximal information correlation (MIC). For the completeness of introduction and reference, we describe them in details respectively in this section.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Summary of some association measures used to quantify gene regulations.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Abbre</bold>.</th>
<th valign="top" align="left"><bold>Method</bold></th>
<th valign="top" align="left"><bold>Symbol</bold></th>
<th valign="top" align="left"><bold>Description</bold></th>
<th valign="top" align="left"><bold>References</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Pearson</td>
<td valign="top" align="left">Pearson&#x00027;s</td>
<td valign="top" align="left"><italic>r</italic></td>
<td valign="top" align="left">Linear, widely-used, no parameter, coeff. &#x02208; [&#x02212;1, 1]</td>
<td valign="top" align="left">Pearson, <xref ref-type="bibr" rid="B34">1895</xref></td>
</tr>
<tr>
<td valign="top" align="left">Spearman</td>
<td valign="top" align="left">Spearman&#x00027;s</td>
<td valign="top" align="left">&#x003C1;</td>
<td valign="top" align="left">Monotonic, rank-based, no parameter, coeff. &#x02208; [&#x02212;1, 1]</td>
<td valign="top" align="left">Spearman, <xref ref-type="bibr" rid="B44">1904</xref></td>
</tr>
<tr>
<td valign="top" align="left">Kendall</td>
<td valign="top" align="left">Kendall&#x00027;s</td>
<td valign="top" align="left">&#x003C4;</td>
<td valign="top" align="left">Monotonic, rank-based, no parameter, coeff. &#x02208; [&#x02212;1, 1]</td>
<td valign="top" align="left">Kendall, <xref ref-type="bibr" rid="B20">1938</xref></td>
</tr>
<tr>
<td valign="top" align="left">Hoeffding</td>
<td valign="top" align="left">Hoeffding&#x00027;s</td>
<td valign="top" align="left"><italic>D</italic></td>
<td valign="top" align="left">Non-linear, rank-based, no parameter, coeff. &#x02208; [0, 1]</td>
<td valign="top" align="left">Hoeffding, <xref ref-type="bibr" rid="B18">1948</xref></td>
</tr>
<tr>
<td valign="top" align="left">Blomqvist</td>
<td valign="top" align="left">Blomqvist&#x00027;s</td>
<td valign="top" align="left">&#x003B2;</td>
<td valign="top" align="left">Monotonic, rank-based, no parameter, coeff. &#x02208; [&#x02212;1, 1]</td>
<td valign="top" align="left">Blomqvist, <xref ref-type="bibr" rid="B7">1950</xref></td>
</tr>
<tr>
<td valign="top" align="left">Goodman</td>
<td valign="top" align="left">Goodman and Kruskal&#x00027;s</td>
<td valign="top" align="left">&#x003B3;</td>
<td valign="top" align="left">Monotonic, cross classifications, rank-based, no parameter, coeff. &#x02208; [&#x02212;1, 1]</td>
<td valign="top" align="left">Goodman and Kruskal, <xref ref-type="bibr" rid="B17">1954</xref></td>
</tr>
<tr>
<td valign="top" align="left">WWH</td>
<td valign="top" align="left">Wang, Waterman, Huang&#x00027;s</td>
<td valign="top" align="left"><italic>wwh</italic></td>
<td valign="top" align="left">Monotonic, rank-based, no parameter, coeff. &#x02208; [0, &#x0002B;&#x0221E;]</td>
<td valign="top" align="left">Wang et al., <xref ref-type="bibr" rid="B48">2014</xref></td>
</tr>
<tr>
<td valign="top" align="left">MI</td>
<td valign="top" align="left">Mutual information</td>
<td valign="top" align="left"><italic>I</italic></td>
<td valign="top" align="left">Non-linear, entropy-based, no parameter, coeff. &#x02208; [0, &#x0002B;&#x0221E;]</td>
<td valign="top" align="left">Shannon, <xref ref-type="bibr" rid="B41">1948</xref></td>
</tr>
<tr>
<td valign="top" align="left">MIC</td>
<td valign="top" align="left">Maximum information correlation</td>
<td valign="top" align="left"><italic>mic</italic></td>
<td valign="top" align="left">Non-linear, entropy-based, 1 parameter, coeff. &#x02208; [0, 1]</td>
<td valign="top" align="left">Reshef et al., <xref ref-type="bibr" rid="B37">2011</xref></td>
</tr>
<tr>
<td valign="top" align="left">Wilks</td>
<td valign="top" align="left">Wilks&#x00027;</td>
<td valign="top" align="left"><italic>W</italic></td>
<td valign="top" align="left">Linear, covariance-based, no parameter, coeff. &#x02208; [0, 1]</td>
<td valign="top" align="left">Wilks, <xref ref-type="bibr" rid="B50">1935</xref></td>
</tr>
<tr>
<td valign="top" align="left">KCCA</td>
<td valign="top" align="left">Kernel canonical correlation analysis</td>
<td valign="top" align="left"><italic>kcca</italic></td>
<td valign="top" align="left">Non-linear, covariance-based, 1 parameter, coeff. &#x02208; [0, 1]</td>
<td valign="top" align="left">Bach and Jordan, <xref ref-type="bibr" rid="B3">2002</xref></td>
</tr>
<tr>
<td valign="top" align="left">dCor</td>
<td valign="top" align="left">Distance correlation</td>
<td valign="top" align="left"><italic>dCor</italic></td>
<td valign="top" align="left">Non-linear, covariance-based, 1 parameter, coeff. &#x02208; [0, 1]</td>
<td valign="top" align="left">Szekely and Rizzo, <xref ref-type="bibr" rid="B46">2009</xref></td>
</tr>
<tr>
<td valign="top" align="left">CMMD</td>
<td valign="top" align="left">copula-based maximum mean discrepancy</td>
<td valign="top" align="left"><italic>cmmd</italic></td>
<td valign="top" align="left">Non-linear, copulas-based, 1 parameter, coeff. &#x02208; [0, 1]</td>
<td valign="top" align="left">Poczos et al., <xref ref-type="bibr" rid="B36">2012</xref></td>
</tr>
<tr>
<td valign="top" align="left">RDC</td>
<td valign="top" align="left">Randomized dependence coefficient</td>
<td valign="top" align="left"><italic>rdc</italic></td>
<td valign="top" align="left">Non-linear, copulas-based, 2 parameters, coeff. &#x02208; [0, 1]</td>
<td valign="top" align="left">Lopez-Paz et al., <xref ref-type="bibr" rid="B30">2013</xref></td>
</tr>
</tbody>
</table>
</table-wrap>
<sec>
<title>Pearson&#x00027;s correlation coefficient</title>
<p>PCC describes the linear relationship between two variables <italic>X</italic> and <italic>Y</italic> (Pearson, <xref ref-type="bibr" rid="B34">1895</xref>). In the microarray data of gene expression, it defines the correlation coefficient between gene <italic>X</italic> and <italic>Y</italic> as</p>
<disp-formula id="E2"><mml:math id="M2"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>r</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>X</mml:mi><mml:mo>,</mml:mo><mml:mi>Y</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mo>&#x0002D;</mml:mo></mml:mover></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mi>&#x00232;</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:msub><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <inline-formula><mml:math id="M3"><mml:mover accent="true"><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mo>&#x0002D;</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:mstyle displaystyle='true'><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, <inline-formula><mml:math id="M4"><mml:mi>&#x00232;</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle='true'><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> refer to the mean of two variables of gene expression in samples, and <inline-formula><mml:math id="M5"><mml:msub><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msqrt><mml:mrow><mml:mfrac><mml:mrow><mml:mstyle displaystyle='true'><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:msup><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mo>&#x0002D;</mml:mo></mml:mover></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mfrac></mml:mrow></mml:msqrt></mml:math></inline-formula>, <inline-formula><mml:math id="M6"><mml:msub><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msqrt><mml:mrow><mml:mfrac><mml:mrow><mml:mstyle displaystyle='true'><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:msup><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mi>&#x00232;</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mfrac></mml:mrow></mml:msqrt></mml:math></inline-formula> are their standard deviations. Generally, it assesses their linear relationship into a value between &#x02212;1 and 1, where 1 refers to total positive correlation and &#x02212;1 refers to total negative correlation, and 0 refers to no correlation.</p>
<p>When we implement the statistical test of its significance, PCC assumes the two variables are from two normal distributions and the two vectors are the corresponding pairs with independence in the observations (Zou et al., <xref ref-type="bibr" rid="B60">2003</xref>). It has been widely used to quantify the gene coexpression relationships in many studies, such as WGCNA (Zhang and Horvath, <xref ref-type="bibr" rid="B54">2005</xref>; Langfelder and Horvath, <xref ref-type="bibr" rid="B22">2008</xref>).</p>
</sec>
<sec>
<title>Spearman&#x00027;s rank correlation</title>
<p>Spearman&#x00027;s rank correlation &#x003C1; is a non-parametric measure of the relationship between two variables (Spearman, <xref ref-type="bibr" rid="B44">1904</xref>). The association between two variables <italic>X</italic> and <italic>Y</italic> is formulated as a monotonic function</p>
<disp-formula id="E3"><mml:math id="M7"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>&#x003C1;</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>-</mml:mo><mml:mfrac><mml:mrow><mml:mn>6</mml:mn><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:msubsup><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Where <italic>d</italic><sub><italic>i</italic></sub> &#x0003D; <italic>X</italic><sub><italic>i</italic></sub> &#x02212; <italic>Y</italic><sub><italic>i</italic></sub>, 1 &#x02264; <italic>i</italic> &#x02264; <italic>n</italic>. Instead of using the element values directly, it transforms the two vectors to the two rank vectors of these elements respectively. The differential rank vector is generated by the difference between two rank vectors.</p>
<p>When there are no repeated values in <italic>X</italic> and <italic>Y</italic> (no duplicated ranks), &#x003C1; reaches 1 and &#x02212;1 when a variable is a perfect monotone function of the other variable. The statistical independence between them refers to &#x003C1; = 0. In the statistical test, it still requires the dependence between the two ranking of two variables (Zar, <xref ref-type="bibr" rid="B53">1972</xref>). Compared to PCC, it contains a larger application scope because it does not require the normal distribution assumptions. It is equivalent to PCC between two ranked variables (Conover and Iman, <xref ref-type="bibr" rid="B11">1981</xref>). The following non-linear rank-based correlations contain the similar properties.</p>
</sec>
<sec>
<title>Kendall&#x00027;s tau coefficient</title>
<p>Similar to the former coefficients, Kendall&#x00027;s tau coefficient (Kendall, <xref ref-type="bibr" rid="B20">1938</xref>) is another measure of rank correlation between <italic>X</italic> and <italic>Y</italic>. It is defined as</p>
<disp-formula id="E4"><mml:math id="M8"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>&#x003C4;</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>/</mml:mo><mml:mn>2</mml:mn></mml:mrow></mml:mfrac><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>n</italic><sub><italic>c</italic></sub> &#x0003D; &#x00023;(<italic>concordantpairs</italic>) and <italic>n</italic><sub><italic>d</italic></sub> &#x0003D; &#x00023;(<italic>discordantpairs</italic>). Any pair of observations (<italic>X</italic><sub><italic>i</italic></sub>, <italic>Y</italic><sub><italic>i</italic></sub>) and (<italic>X</italic><sub><italic>j</italic></sub>, <italic>Y</italic><sub><italic>j</italic></sub>) in <italic>X</italic> and <italic>Y</italic>, where <italic>i</italic> &#x02260; <italic>j</italic>, are defined as concordant if the ranks for both elements agree, i.e., if both <italic>X</italic><sub><italic>i</italic></sub> &#x0003E; <italic>X</italic><sub><italic>j</italic></sub> and <italic>Y</italic><sub><italic>i</italic></sub> &#x0003E; <italic>Y</italic><sub><italic>j</italic></sub> or if both <italic>X</italic><sub><italic>i</italic></sub> &#x0003C; <italic>X</italic><sub><italic>j</italic></sub> and <italic>Y</italic><sub><italic>i</italic></sub> &#x0003C; <italic>Y</italic><sub><italic>j</italic></sub>. They are classified to be discordant if <italic>X</italic><sub><italic>i</italic></sub> &#x0003E; <italic>X</italic><sub><italic>j</italic></sub> and <italic>Y</italic><sub><italic>i</italic></sub> &#x0003C; <italic>Y</italic><sub><italic>j</italic></sub> or if <italic>X</italic><sub><italic>i</italic></sub> &#x0003C; <italic>X</italic><sub><italic>j</italic></sub> and <italic>Y</italic><sub><italic>i</italic></sub> &#x0003E; <italic>Y</italic><sub><italic>j</italic></sub>. If <italic>X</italic><sub><italic>i</italic></sub> &#x0003D; <italic>X</italic><sub><italic>j</italic></sub> or <italic>Y</italic><sub><italic>i</italic></sub> &#x0003D; <italic>Y</italic><sub><italic>j</italic></sub>, the pair is neither concordant nor discordant. Based on &#x003C4;, Somers&#x00027; <italic>D</italic> of <italic>Y</italic> with respect to <italic>X</italic> is defined as <italic>D</italic><sub><italic>YX</italic></sub> &#x0003D; &#x003C4;(<italic>X, Y</italic>)/&#x003C4;(<italic>X, X</italic>), where &#x003C4;(<italic>X, X</italic>) is the number of pairs with unequal values (Somers, <xref ref-type="bibr" rid="B43">1962</xref>). It is easy to find that the order of ranks in the two variables plays critical roles in the calculation of these non-parametric estimators.</p>
</sec>
<sec>
<title>Hoeffding&#x00027;s dependence coefficient</title>
<p>The original idea of Hoeffding&#x00027;s dependence measure <italic>D</italic> is to assess the independence of two datasets by their distance between distributions for continuous variables (Hoeffding, <xref ref-type="bibr" rid="B18">1948</xref>). It has been extended for the samples of <italic>X</italic> and <italic>Y</italic> as</p>
<disp-formula id="E5"><mml:math id="M9"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>D</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mn>2</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mn>3</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mn>2</mml:mn><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mn>2</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mn>2</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mn>3</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mn>4</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <inline-formula><mml:math id="M10"><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle='true'><mml:munder class="msub"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:munder></mml:mstyle><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>Q</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>Q</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mn>2</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M11"><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle='true'><mml:munder class="msub"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:munder></mml:mstyle><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mn>2</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mn>2</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M12"><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle='true'><mml:munder class="msub"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:munder></mml:mstyle><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mn>2</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mn>2</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>Q</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula>, <italic>R</italic><sub><italic>i</italic></sub> is the rank of <italic>X</italic><sub><italic>i</italic></sub>, <italic>S</italic><sub><italic>i</italic></sub> is the rank of <italic>Y</italic><sub><italic>i</italic></sub>, and <italic>Q</italic><sub><italic>i</italic></sub> is the bivariate rank, which refers to the number of points with both <italic>X</italic> and <italic>Y</italic> values less than the <italic>i</italic>th point, i.e., <italic>Q</italic><sub><italic>i</italic></sub> &#x0003D; &#x00023;(<italic>X</italic><sub><italic>j</italic></sub>, <italic>Y</italic><sub><italic>j</italic></sub>) <italic>s.t</italic>. <italic>X</italic><sub><italic>j</italic></sub> &#x0003C; <italic>X</italic><sub><italic>i</italic></sub> <italic>and Y</italic><sub><italic>j</italic></sub> &#x0003C; <italic>Y</italic><sub><italic>i</italic></sub>.</p>
</sec>
<sec>
<title>Blomqvist&#x00027;s &#x003B2;</title>
<p>A measure referred as Blomqvist&#x00027;s &#x003B2; has been developed for the medial correlation coefficient (Blomqvist, <xref ref-type="bibr" rid="B7">1950</xref>). For two random variables <italic>X</italic> and <italic>Y</italic>, let &#x0201C;<italic>x</italic> &#x02212; <italic>y</italic>&#x0201D;-plane be divided into four regions by the median lines of <inline-formula><mml:math id="M13"><mml:mover accent="true"><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo>&#x0007E;</mml:mo></mml:mover></mml:math></inline-formula> and &#x01EF9;. The relationship of <italic>X</italic> and <italic>Y</italic> can be obtained from the number of sample points in the four quadrants. In gene regulations, suppose the sample size takes even number (with minor modifications in odd number), it is defined as</p>
<disp-formula id="E6"><mml:math id="M14"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>&#x003B2;</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>2</mml:mn><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mo>-</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>n</italic><sub>1</sub> refers to the number of data in the first or third quadrant, and <italic>n</italic><sub>2</sub> refers to that in the second or fourth quadrant. It has some advantages such as its explicit form and low computational complexity in estimation (Blomqvist, <xref ref-type="bibr" rid="B7">1950</xref>).</p>
</sec>
<sec>
<title>Goodman and Kruskal&#x00027;s gamma coefficient</title>
<p>The Goodman and Kruskal&#x00027;s &#x003B3; coefficient (Goodman and Kruskal, <xref ref-type="bibr" rid="B17">1954</xref>) is another widely-used rand-based coefficient to measure the dependence between variables. It is defined as</p>
<disp-formula id="E7"><mml:math id="M15"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>&#x003B3;</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msub><mml:mrow><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mi>s</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mi>s</mml:mi></mml:mrow></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>P</italic><sub><italic>s</italic></sub>, <italic>P</italic><sub><italic>d</italic></sub> are the probabilities that a randomly selected pair of observations will relocate in the same or opposite order respectively, when ranked by both variables. It represents the symmetric distances between the two paired sets representing the binary relation of ranks. It is very close to Kendall&#x00027;s tau. In gene samples, its maximum likelihood estimation can be regarded as</p>
<disp-formula id="E8"><mml:math id="M16"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>G</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi>s</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi>s</mml:mi></mml:mrow></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>n</italic><sub><italic>s</italic></sub> is the number of concordant pairs, which refer to those pairs ranked in the same order one both variables. <italic>n</italic><sub><italic>d</italic></sub> is the number of discordant pairs, which are the number of pairs of cases ranked in reversed order. It computes the normalized difference between the numbers of concordant and discordant pairs such that it will take values between &#x02212;1 and &#x0002B;1. When it is specified into 2 &#x000D7; 2 matrices, it is exactly Yule&#x00027;s <italic>Q</italic> coefficient (Yule, <xref ref-type="bibr" rid="B52">1900</xref>).</p>
</sec>
<sec>
<title>WWH order correlation</title>
<p>The order statistics seems to provide a robust gene coexpression measure by taking local patterns in gene expression profiles into account. Wang, Huang, and Waterman (WWH; Wang et al., <xref ref-type="bibr" rid="B48">2014</xref>) proposed a count statistics method to define a new gene coexpression regulatory measure, i.e.,</p>
<disp-formula id="E9"><mml:math id="M17"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>w</mml:mi><mml:mi>w</mml:mi><mml:mi>h</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:munder class="msub"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mn>1</mml:mn><mml:mo>&#x02264;</mml:mo><mml:msub><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>&#x0003C;</mml:mo><mml:mo>&#x022EF;</mml:mo><mml:mo>&#x0003C;</mml:mo><mml:msub><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02264;</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:munder></mml:mstyle><mml:mi>F</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub><mml:mo>;</mml:mo><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Where <italic>X</italic> &#x0003D; (<italic>X</italic><sub>1</sub>, &#x02026;, <italic>X</italic><sub><italic>n</italic></sub>) and <italic>Y</italic> &#x0003D; (<italic>Y</italic><sub>1</sub>, &#x02026;, <italic>Y</italic><sub><italic>n</italic></sub>) are genes <italic>X</italic> and <italic>Y</italic> with expression levels from <italic>n</italic> samples. The function <italic>F</italic> is an indicator function comparing the rank patterns of the two subsequences with a length parameter <italic>k</italic>. This method aims to identify the consistency of rank orders of the two variables and expect to highlight the local corresponding features in expression profiles. The authors considered a special case in the time-series samples by constraining the consecutive subsequences and another general cases of samples (Wang et al., <xref ref-type="bibr" rid="B48">2014</xref>).</p>
</sec>
<sec>
<title>Mutual information</title>
<p>Mutual information is based on information theory (Shannon, <xref ref-type="bibr" rid="B41">1948</xref>). Suppose <italic>P</italic>(<italic>X, Y</italic>) is the joint probability distribution function of gene variables of <italic>X</italic> and <italic>Y</italic>, and <italic>P</italic>(<italic>X</italic>) and <italic>P</italic>(<italic>Y</italic>) are their marginal probability distribution functions respectively. The mutual information between <italic>X</italic> and <italic>Y</italic> is defined as</p>
<disp-formula id="E10"><mml:math id="M18"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>I</mml:mi><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mstyle displaystyle="true"><mml:munder class="msub"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02208;</mml:mo><mml:mi>X</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02208;</mml:mo><mml:mi>Y</mml:mi></mml:mrow></mml:munder></mml:mstyle><mml:mi>P</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo class="qopname">log</mml:mo><mml:mfrac><mml:mrow><mml:mi>P</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>P</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mi>P</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>The mutual information can also be represented as a Kullback&#x02013;Leibler divergence (Kullback and Leibler, <xref ref-type="bibr" rid="B21">1951</xref>), which is to measure of the difference between two probability distributions.</p>
</sec>
<sec>
<title>Maximal information correlation</title>
<p>Based on mutual information, MIC is defined to evaluate the margin probability by calculating the data point frequencies (Reshef et al., <xref ref-type="bibr" rid="B37">2011</xref>), i.e.,</p>
<disp-formula id="E11"><mml:math id="M19"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>M</mml:mi><mml:mi>I</mml:mi><mml:mi>C</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:munder><mml:mrow><mml:mo>max</mml:mo></mml:mrow><mml:mrow><mml:mo stretchy="false">|</mml:mo><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo><mml:mo stretchy="false">|</mml:mo><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo><mml:mo>&#x0003C;</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mi>B</mml:mi></mml:mrow></mml:munder></mml:mstyle><mml:mfrac><mml:mrow><mml:mi>I</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>X</mml:mi><mml:mo>,</mml:mo><mml:mi>Y</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mo class="qopname">log</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mo class="qopname">min</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mo stretchy="false">|</mml:mo><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo><mml:mo>,</mml:mo><mml:mo stretchy="false">|</mml:mo><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where (<italic>X</italic><sub><italic>i</italic></sub>) and (<italic>Y</italic><sub><italic>j</italic></sub>) are the two gene expressions across the samples individually. <italic>I</italic> refers to their mutual information. The <italic>B</italic> is a heuristically setting parameter such as <italic>B</italic> &#x0003D; <italic>N</italic><sup>0.6</sup>, and <italic>N</italic> is the cells of a grid <italic>G</italic> induced by <italic>X</italic> and <italic>Y</italic>.</p>
</sec>
<sec>
<title>Wilks&#x00027; <italic>W</italic></title>
<p>Wilks&#x00027; <italic>W</italic> statistic is the covariance-based measure of two vectors (Wilks, <xref ref-type="bibr" rid="B50">1935</xref>). It is defined as</p>
<disp-formula id="E12"><mml:math id="M20"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>W</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>-</mml:mo><mml:mfrac><mml:mrow><mml:mo class="qopname">det</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>&#x02211;</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mo class="qopname">det</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x02211;</mml:mi></mml:mrow><mml:mrow><mml:mn>11</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo class="qopname">det</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x02211;</mml:mi></mml:mrow><mml:mrow><mml:mn>22</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <inline-formula><mml:math id="M41"><mml:mrow><mml:mi>&#x003A3;</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mtable><mml:mtr><mml:mtd><mml:mrow><mml:msub><mml:mi>&#x003A3;</mml:mi><mml:mrow><mml:mn>11</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:msub><mml:mi>&#x003A3;</mml:mi><mml:mrow><mml:mn>12</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:msub><mml:mi>&#x003A3;</mml:mi><mml:mrow><mml:mn>21</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:msub><mml:mi>&#x003A3;</mml:mi><mml:mrow><mml:mn>22</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>, and &#x003A3;<sub><italic>ij</italic></sub> &#x0003D; cov(<italic>X</italic><sub><italic>i</italic></sub>, <italic>Y</italic><sub><italic>j</italic></sub>). It has close relationship with likelihood-ratio and multivariate analysis of variance (MANOVA) by integrating the covariances of two individual variables and their combinations. Similarly, Pillai&#x00027;s trace criterion performs similar ideas while with low popularity (Pillai, <xref ref-type="bibr" rid="B35">1955</xref>). Here, it is a special case only for two gene expression vectors.</p>
</sec>
<sec>
<title>Kernel canonical correlation analysis</title>
<p>Instead of directly calculating the relationship between <italic>X</italic> and <italic>Y</italic>, the canonical correlation analysis (CCA) is a statistical technique of maximizing the correlation between sets of projections of the two original vectors.</p>
<p>Let <italic>U</italic> &#x0003D; <italic>a</italic><sup><italic>T</italic></sup><italic>X</italic>, <italic>V</italic> &#x0003D; <italic>b</italic><sup><italic>T</italic></sup><italic>Y</italic>, <inline-formula><mml:math id="M21"><mml:mi>V</mml:mi><mml:mi>a</mml:mi><mml:mi>r</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>U</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:msub><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mn>11</mml:mn></mml:mrow></mml:msub><mml:mi>a</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M22"><mml:mi>V</mml:mi><mml:mi>a</mml:mi><mml:mi>r</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>V</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mi>b</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:msub><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mn>22</mml:mn></mml:mrow></mml:msub><mml:mi>b</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M23"><mml:mi>C</mml:mi><mml:mi>o</mml:mi><mml:mi>v</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>U</mml:mi><mml:mo>,</mml:mo><mml:mi>V</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:msub><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mn>12</mml:mn></mml:mrow></mml:msub><mml:mi>b</mml:mi></mml:math></inline-formula>,</p>
<p>where <inline-formula><mml:math id="M42"><mml:mrow><mml:mi>&#x003A3;</mml:mi><mml:mtext>&#x000A0;</mml:mtext><mml:mo>=</mml:mo><mml:mi>V</mml:mi><mml:mi>a</mml:mi><mml:mi>r</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>X</mml:mi><mml:mo>,</mml:mo><mml:mi>Y</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>=</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mtable><mml:mtr><mml:mtd><mml:mrow><mml:msub><mml:mi>&#x003A3;</mml:mi><mml:mrow><mml:mn>11</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:msub><mml:mi>&#x003A3;</mml:mi><mml:mrow><mml:mn>12</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:msub><mml:mi>&#x003A3;</mml:mi><mml:mrow><mml:mn>21</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:msub><mml:mi>&#x003A3;</mml:mi><mml:mrow><mml:mn>22</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>, &#x003A3;<sub>11</sub> &#x0003D; <italic>Var</italic>(<italic>X</italic>), &#x003A3;<sub>22</sub> &#x0003D; <italic>Var</italic>(<italic>Y</italic>), &#x003A3;<sub>12</sub> &#x0003D; <italic>Var</italic>(<italic>X, Y</italic>), &#x003A3;<sub>21</sub> &#x0003D; <italic>Var</italic>(<italic>Y, X</italic>).</p>
<p>So</p>
<disp-formula id="E13"><mml:math id="M24"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>C</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>U</mml:mi><mml:mo>,</mml:mo><mml:mi>V</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msup><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:msub><mml:mrow><mml:mi>&#x02211;</mml:mi></mml:mrow><mml:mrow><mml:mn>12</mml:mn></mml:mrow></mml:msub><mml:mi>b</mml:mi></mml:mrow><mml:mrow><mml:msqrt><mml:mrow><mml:msup><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:msub><mml:mrow><mml:mi>&#x02211;</mml:mi></mml:mrow><mml:mrow><mml:mn>11</mml:mn></mml:mrow></mml:msub><mml:mi>a</mml:mi></mml:mrow></mml:msqrt><mml:msqrt><mml:mrow><mml:msup><mml:mrow><mml:mi>b</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:msub><mml:mrow><mml:mi>&#x02211;</mml:mi></mml:mrow><mml:mrow><mml:mn>22</mml:mn></mml:mrow></mml:msub><mml:mi>b</mml:mi></mml:mrow></mml:msqrt></mml:mrow></mml:mfrac><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>We define the largest canonical correlation as <inline-formula><mml:math id="M25"><mml:msub><mml:mrow><mml:mi>&#x003C1;</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:munder><mml:mrow><mml:mo>sup</mml:mo></mml:mrow><mml:mrow><mml:mi>a</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:munder><mml:mtext>&#x000A0;</mml:mtext><mml:mi>C</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>U</mml:mi><mml:mo>,</mml:mo><mml:mi>V</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula>, where we set the second floor as a fix number. When we maximize the first floor by solving an optimization problem is to achieve the largest canonical correlation coefficient between the original <italic>X</italic> and <italic>Y</italic>.</p>
<p>In CCA, the vector of <italic>U</italic> and <italic>V</italic> are linear combinations of <italic>X</italic> and <italic>Y</italic>. When</p>
<disp-formula id="E14"><mml:math id="M26"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>K</mml:mi></mml:mrow><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mi>&#x003A6;</mml:mi><mml:msup><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:mi>&#x003A6;</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>K</mml:mi></mml:mrow><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mi>&#x003A6;</mml:mi><mml:msup><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:mi>&#x003A6;</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where &#x003A6; : &#x0211D;<sup><italic>n</italic></sup> &#x02192; &#x0211D;<sup><italic>N</italic></sup>(<italic>n</italic> &#x02264; <italic>N</italic>) is the kernel function of <italic>X</italic> and <italic>Y</italic> (can be different for them).</p>
<disp-formula id="E15"><mml:math id="M27"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>C</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>U</mml:mi><mml:mo>,</mml:mo><mml:mi>V</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msup><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:msub><mml:mrow><mml:mi>K</mml:mi></mml:mrow><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>K</mml:mi></mml:mrow><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mi>&#x003B2;</mml:mi></mml:mrow><mml:mrow><mml:msqrt><mml:mrow><mml:msup><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:msub><mml:mrow><mml:mi>K</mml:mi></mml:mrow><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>K</mml:mi></mml:mrow><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mi>&#x003B1;</mml:mi></mml:mrow></mml:msqrt><mml:msqrt><mml:mrow><mml:msup><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:msub><mml:mrow><mml:mi>K</mml:mi></mml:mrow><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>K</mml:mi></mml:mrow><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mi>&#x003B2;</mml:mi></mml:mrow></mml:msqrt></mml:mrow></mml:mfrac><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>and the kernel CCA is defined as <inline-formula><mml:math id="M28"><mml:mi>k</mml:mi><mml:mi>c</mml:mi><mml:mi>c</mml:mi><mml:mi>a</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>X</mml:mi><mml:mo>,</mml:mo><mml:mi>Y</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:munder><mml:mrow><mml:mo>sup</mml:mo></mml:mrow><mml:mrow><mml:mi>&#x003B1;</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003B2;</mml:mi></mml:mrow></mml:munder><mml:mtext>&#x000A0;</mml:mtext><mml:mi>C</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>U</mml:mi><mml:mo>,</mml:mo><mml:mi>V</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula>.</p>
</sec>
<sec>
<title>Distance correlation</title>
<p>Let (<italic>X</italic><sub><italic>i</italic></sub>, <italic>Y</italic><sub><italic>i</italic></sub>), 1 &#x02264; <italic>i</italic> &#x02264; <italic>n</italic> be statistical samples for two random variables (<italic>X, Y</italic>). The pairwise distances are</p>
<disp-formula id="E16"><mml:math id="M29"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mtable style="text-align:axis;" equalrows="false" columnlines="" equalcolumns="false" class="array"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo stretchy="false">|</mml:mo><mml:mo stretchy="false">|</mml:mo><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo><mml:mo stretchy="false">|</mml:mo><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:mi>n</mml:mi><mml:mo>,</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>b</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo stretchy="false">|</mml:mo><mml:mo stretchy="false">|</mml:mo><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo><mml:mo stretchy="false">|</mml:mo><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:mi>n</mml:mi><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where ||&#x025AA;|| denotes Euclidean norm, Then, two <italic>n</italic> &#x000D7; <italic>n</italic> distance matrices (<italic>a</italic><sub><italic>j,k</italic></sub>) and (<italic>b</italic><sub><italic>j,k</italic></sub>) are generated. For each element (<italic>j, k</italic>), two transformed values are defined as</p>
<disp-formula id="E17"><mml:math id="M30"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>&#x00101;</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mo>&#x025AA;</mml:mo></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>&#x00101;</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x025AA;</mml:mo><mml:mo>,</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x00101;</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x025AA;</mml:mo><mml:mo>,</mml:mo><mml:mo>&#x025AA;</mml:mo></mml:mrow></mml:msub><mml:mo>,</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>B</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi>b</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>b</mml:mi></mml:mrow><mml:mo>&#x0002D;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mo>&#x025AA;</mml:mo></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>b</mml:mi></mml:mrow><mml:mo>&#x0002D;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mo>&#x025AA;</mml:mo><mml:mo>,</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>b</mml:mi></mml:mrow><mml:mo>&#x0002D;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mo>&#x025AA;</mml:mo><mml:mo>,</mml:mo><mml:mo>&#x025AA;</mml:mo></mml:mrow></mml:msub><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where &#x00101;<sub><italic>j</italic>,&#x025AA;</sub> is the <italic>j</italic>-th row mean, &#x00101;<sub>&#x025AA;,<italic>k</italic></sub> is the <italic>k</italic>-th column mean, and &#x00101;<sub>&#x025AA;,&#x025AA;</sub> is the grand mean of the distance matrix of the <italic>X</italic> samples. The notations for <italic>b</italic> values have the similar meanings. The distance covariance is defined as the square root of</p>
<disp-formula id="E18"><mml:math id="M31"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msubsup><mml:mrow><mml:mi>V</mml:mi></mml:mrow><mml:mrow><mml:mi>X</mml:mi><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mfrac><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:msub><mml:mrow><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>B</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Then, distance correlation (dCor; Szekely and Rizzo, <xref ref-type="bibr" rid="B46">2009</xref>) between <italic>X</italic> and <italic>Y</italic> is defined as the square root of</p>
<disp-formula id="E19"><mml:math id="M32"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>d</mml:mi><mml:mi>C</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msubsup><mml:mrow><mml:mi>V</mml:mi></mml:mrow><mml:mrow><mml:mi>X</mml:mi><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>V</mml:mi></mml:mrow><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>V</mml:mi></mml:mrow><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>dCor satisfies 0 &#x02264; <italic>R</italic> &#x02264; 1, and <italic>R</italic> &#x0003D; 0 when <italic>X</italic> and <italic>Y</italic> are independent.</p>
</sec>
<sec>
<title>Copula-based maximum mean discrepancy</title>
<p>A copula is a multivariate probability distribution function defined on the unit hypercube with known uniform marginals (Nelsen, <xref ref-type="bibr" rid="B32">2006</xref>). It is popular in high-dimensional statistics for describing the relationships between variables. Specifically, the copula of two random gene variables <italic>X</italic> and <italic>Y</italic> is defined as a function</p>
<disp-formula id="E20"><mml:math id="M33"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>C</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>U</mml:mi><mml:mo>,</mml:mo><mml:mi>V</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mi>C</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mi>X</mml:mi><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:mi>y</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>F</italic><sub><italic>X</italic></sub>(<italic>x</italic>) &#x0003D; <italic>P</italic>(<italic>X</italic> &#x02264; <italic>x</italic>), <italic>F</italic><sub><italic>Y</italic></sub>(<italic>x</italic>) &#x0003D; <italic>P</italic>(<italic>Y</italic> &#x02264; <italic>y</italic>), and <italic>F</italic><sub><italic>XY</italic></sub>(<italic>x, y</italic>) &#x0003D; <italic>P</italic>(<italic>X</italic> &#x02264; <italic>x, Y</italic> &#x02264; <italic>y</italic>) are the two marginal distributions and the joint distributions (Sklar, <xref ref-type="bibr" rid="B42">1959</xref>).</p>
<p>cMMD is a copula-based kernel association measure between random variables (Poczos et al., <xref ref-type="bibr" rid="B36">2012</xref>). It extends the maximum mean discrepancy (MMD) method (Borgwardt et al., <xref ref-type="bibr" rid="B8">2006</xref>) of measuring dependence to the copula of the joint distribution. Suppose two copulas transformations have been implemented on the original variables, i.e., <italic>U</italic> &#x0003D; <italic>F</italic><sub>1</sub>(<italic>X</italic>) and <italic>V</italic> &#x0003D; <italic>F</italic><sub>2</sub>(<italic>Y</italic>), <italic>F</italic><sub>1</sub> and <italic>F</italic><sub>2</sub> are the empirical cumulative distribution functions for <italic>X</italic> and <italic>Y</italic> respectively (Lopez-Paz et al., <xref ref-type="bibr" rid="B30">2013</xref>). cMMD defines the relationship between <italic>X</italic> and <italic>Y</italic> as</p>
<disp-formula id="E21"><mml:math id="M34"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>c</mml:mi><mml:mi>m</mml:mi><mml:mi>m</mml:mi><mml:mi>d</mml:mi><mml:mtext>&#x000A0;</mml:mtext><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>X</mml:mi><mml:mo>,</mml:mo><mml:mi>Y</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mi>m</mml:mi><mml:mi>m</mml:mi><mml:mi>d</mml:mi><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>&#x02260;</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:mi>K</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>U</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>V</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>K</italic>(<italic>U</italic><sub><italic>i</italic></sub>, <italic>V</italic><sub><italic>j</italic></sub>) &#x0003D; &#x003A6;(<italic>U</italic><sub><italic>i</italic></sub>, <italic>U</italic><sub><italic>j</italic></sub>) &#x0002B; &#x003A6;(<italic>V</italic><sub><italic>i</italic></sub>, <italic>V</italic><sub><italic>j</italic></sub>) &#x02212; &#x003A6;(<italic>U</italic><sub><italic>i</italic></sub>, <italic>V</italic><sub><italic>j</italic></sub>) &#x02212; &#x003A6;(<italic>U</italic><sub><italic>j</italic></sub>, <italic>V</italic><sub><italic>i</italic></sub>), and &#x003A6; is a specified kernel function, e.g., Gaussian kernel.</p>
</sec>
<sec>
<title>Randomized dependence coefficient</title>
<p>Based on the former kernel CCA and copulas, the randomized dependence coefficient (RDC) provides a computationally efficient association measures between multivariate random variables. In details, it is defined as</p>
<disp-formula id="E22"><mml:math id="M35"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>r</mml:mi><mml:mi>d</mml:mi><mml:mi>c</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>X</mml:mi><mml:mo>,</mml:mo><mml:mi>Y</mml:mi><mml:mo>;</mml:mo><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi>s</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:munder><mml:mrow><mml:mo>sup</mml:mo></mml:mrow><mml:mrow><mml:mi>&#x003B1;</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003B2;</mml:mi></mml:mrow></mml:munder></mml:mstyle><mml:mi>C</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:mi>&#x003A6;</mml:mi><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>;</mml:mo><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi>s</mml:mi></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:msup><mml:mrow><mml:mi>&#x003B2;</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:mi>&#x003A6;</mml:mi><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>;</mml:mo><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi>s</mml:mi></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow><mml:mo>}</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where the functions are the same as the former ones, <italic>k</italic> &#x02208; &#x02115;<sup>&#x0002B;</sup> and <italic>s</italic> &#x02208; &#x0211D;<sup>&#x0002B;</sup> are the parameters which are often set as 20 and 0.6 respectively. RDC is proved to be capable of discovering a wide range of functional association patterns in multiple datasets.</p>
</sec>
</sec>
<sec id="s3">
<title>Results of comparison study</title>
<p>For a comparative study of these association measures in inferring gene regulatory relationships, we test these association measures in DREAM3 <italic>in silico</italic> network challenge datasets (Marbach et al., <xref ref-type="bibr" rid="B31">2010</xref>). In the challenges, gene expression datasets have been generated by some specified network structures. Then, the datasets are open without any information about the network structures. The task is to reconstruct the network structures from the open datasets by developing new inference methods. There are three sizes of networks with 10, 50, and 100 nodes respectively, and multiple datasets for each size (4 for 10-node network, 23 for 50-node network, and 46 for 100-node network). The assessment is to evaluate the consistency between the inferred network and the true network structure (gold standards). Figure <xref ref-type="fig" rid="F2">2</xref> illustrate the receiver operating characteristic (ROC) curves of inference performance by these association measures in the 10-node benchmark network. Due to the undirected regulations identified by all these association measures, we omit the regulatory directions when calculating the evaluation metrics of sensitivity (SN), specificity (SP), accuracy (ACC), Matthews correlation coefficient (MCC), F-measure, and area under ROC curve (AUC). Table <xref ref-type="table" rid="T2">2</xref> demonstrates these detailed values of evaluation metrics of these association measures. We find KCCA performs the best in the 14 association measures for inferring 10-node networks and it reaches the AUC of 0.623 &#x000B1; 0.083 (mean &#x000B1; standard deviation). Overall, the performances of these methods are comparable with each other in the 10-node network.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>The performances of different association measures in the inference of the 10-node regulatory network of DREAM challenges. <bold>(A)</bold> ROC curve of 14 association measures with maximum AUC in the four datasets. <bold>(B)</bold> Blox plots of AUC of 14 association measures.</p></caption>
<graphic xlink:href="fgene-08-00096-g0002.tif"/>
</fig>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>The performance details of inferring benchmark gene regulatory networks by 14 association measures.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Methods</bold></th>
<th valign="top" align="center"><bold>Node size</bold></th>
<th valign="top" align="center"><bold>SN</bold></th>
<th valign="top" align="center"><bold>SP</bold></th>
<th valign="top" align="center"><bold>ACC</bold></th>
<th valign="top" align="center"><bold>F-measure</bold></th>
<th valign="top" align="center"><bold>MCC</bold></th>
<th valign="top" align="center"><bold>AUC</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Pearson</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">0.500 &#x000B1; 0.093</td>
<td valign="top" align="center">0.545 &#x000B1; 0.166</td>
<td valign="top" align="center">0.506 &#x000B1; 0.098</td>
<td valign="top" align="center">0.518 &#x000B1; 0.121</td>
<td valign="top" align="center">0.030 &#x000B1; 0.162</td>
<td valign="top" align="center">0.592 &#x000B1; 0.048</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">50</td>
<td valign="top" align="center">0.536 &#x000B1; 0.102</td>
<td valign="top" align="center">0.510 &#x000B1; 0.121</td>
<td valign="top" align="center">0.535 &#x000B1; 0.099</td>
<td valign="top" align="center">0.507 &#x000B1; 0.074</td>
<td valign="top" align="center">0.014 &#x000B1; 0.044</td>
<td valign="top" align="center">0.554 &#x000B1; 0.027</td>
</tr>
<tr style="border-bottom: thin solid #000000;">
<td/>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.531 &#x000B1; 0.047</td>
<td valign="top" align="center">0.487 &#x000B1; 0.078</td>
<td valign="top" align="center">0.530 &#x000B1; 0.046</td>
<td valign="top" align="center">0.504 &#x000B1; 0.048</td>
<td valign="top" align="center">0.004 &#x000B1; 0.021</td>
<td valign="top" align="center">0.536 &#x000B1; 0.021</td>
</tr> <tr>
<td valign="top" align="left">Spearman</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">0.617 &#x000B1; 0.162</td>
<td valign="top" align="center">0.477 &#x000B1; 0.155</td>
<td valign="top" align="center">0.600 &#x000B1; 0.150</td>
<td valign="top" align="center">0.526 &#x000B1; 0.141</td>
<td valign="top" align="center">0.074 &#x000B1; 0.191</td>
<td valign="top" align="center">0.574 &#x000B1; 0.055</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">50</td>
<td valign="top" align="center">0.511 &#x000B1; 0.083</td>
<td valign="top" align="center">0.504 &#x000B1; 0.071</td>
<td valign="top" align="center">0.510 &#x000B1; 0.081</td>
<td valign="top" align="center">0.502 &#x000B1; 0.059</td>
<td valign="top" align="center">0.005 &#x000B1; 0.036</td>
<td valign="top" align="center">0.538 &#x000B1; 0.031</td>
</tr>
<tr style="border-bottom: thin solid #000000;">
<td/>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.501 &#x000B1; 0.055</td>
<td valign="top" align="center">0.506 &#x000B1; 0.086</td>
<td valign="top" align="center">0.501 &#x000B1; 0.053</td>
<td valign="top" align="center">0.497 &#x000B1; 0.043</td>
<td valign="top" align="center">0.002 &#x000B1; 0.019</td>
<td valign="top" align="center">0.533 &#x000B1; 0.025</td>
</tr> <tr>
<td valign="top" align="left">Kendall</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">0.601 &#x000B1; 0.192</td>
<td valign="top" align="center">0.500 &#x000B1; 0.117</td>
<td valign="top" align="center">0.589 &#x000B1; 0.175</td>
<td valign="top" align="center">0.536 &#x000B1; 0.125</td>
<td valign="top" align="center">0.082 &#x000B1; 0.198</td>
<td valign="top" align="center">0.574 &#x000B1; 0.057</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">50</td>
<td valign="top" align="center">0.499 &#x000B1; 0.098</td>
<td valign="top" align="center">0.518 &#x000B1; 0.083</td>
<td valign="top" align="center">0.500 &#x000B1; 0.095</td>
<td valign="top" align="center">0.498 &#x000B1; 0.053</td>
<td valign="top" align="center">0.005 &#x000B1; 0.034</td>
<td valign="top" align="center">0.536 &#x000B1; 0.031</td>
</tr>
<tr style="border-bottom: thin solid #000000;">
<td/>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.509 &#x000B1; 0.054</td>
<td valign="top" align="center">0.503 &#x000B1; 0.085</td>
<td valign="top" align="center">0.509 &#x000B1; 0.053</td>
<td valign="top" align="center">0.499 &#x000B1; 0.040</td>
<td valign="top" align="center">0.003 &#x000B1; 0.017</td>
<td valign="top" align="center">0.532 &#x000B1; 0.025</td>
</tr> <tr>
<td valign="top" align="left">Hoeffdings</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">0.519 &#x000B1; 0.591</td>
<td valign="top" align="center">0.591 &#x000B1; 0.091</td>
<td valign="top" align="center">0.528 &#x000B1; 0.080</td>
<td valign="top" align="center">0.544 &#x000B1; 0.042</td>
<td valign="top" align="center">0.073 &#x000B1; 0.062</td>
<td valign="top" align="center">0.539 &#x000B1; 0.039</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">50</td>
<td valign="top" align="center">0.507 &#x000B1; 0.072</td>
<td valign="top" align="center">0.494 &#x000B1; 0.102</td>
<td valign="top" align="center">0.507 &#x000B1; 0.070</td>
<td valign="top" align="center">0.492 &#x000B1; 0.064</td>
<td valign="top" align="center">0.00006 &#x000B1; 0.038</td>
<td valign="top" align="center">0.544 &#x000B1; 0.032</td>
</tr>
<tr style="border-bottom: thin solid #000000;">
<td/>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.504 &#x000B1; 0.071</td>
<td valign="top" align="center">0.523 &#x000B1; 0.061</td>
<td valign="top" align="center">0.504 &#x000B1; 0.069</td>
<td valign="top" align="center">0.508 &#x000B1; 0.042</td>
<td valign="top" align="center">0.006 &#x000B1; 0.018</td>
<td valign="top" align="center">0.535 &#x000B1; 0.025</td>
</tr> <tr>
<td valign="top" align="left">Blomqvist</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">0.563 &#x000B1; 0.069</td>
<td valign="top" align="center">0.409 &#x000B1; 0.189</td>
<td valign="top" align="center">0.544 &#x000B1; 0.060</td>
<td valign="top" align="center">0.451 &#x000B1; 0.136</td>
<td valign="top" align="center">&#x02212;0.019 &#x000B1; 0.125</td>
<td valign="top" align="center">0.570 &#x000B1; 0.030</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">50</td>
<td valign="top" align="center">0.457 &#x000B1; 0.126</td>
<td valign="top" align="center">0.496 &#x000B1; 0.134</td>
<td valign="top" align="center">0.458 &#x000B1; 0.120</td>
<td valign="top" align="center">0.444 &#x000B1; 0.069</td>
<td valign="top" align="center">&#x02212;0.016 &#x000B1; 0.028</td>
<td valign="top" align="center">0.535 &#x000B1; 0.030</td>
</tr>
<tr style="border-bottom: thin solid #000000;">
<td/>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.550 &#x000B1; 0.066</td>
<td valign="top" align="center">0.583 &#x000B1; 0.056</td>
<td valign="top" align="center">0.551 &#x000B1; 0.065</td>
<td valign="top" align="center">0.560 &#x000B1; 0.020</td>
<td valign="top" align="center">0.030 &#x000B1; 0.008</td>
<td valign="top" align="center">0.574 &#x000B1; 0.022</td>
</tr> <tr>
<td valign="top" align="left">Goodman</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">0.411 &#x000B1; 0.130</td>
<td valign="top" align="center">0.500 &#x000B1; 0.053</td>
<td valign="top" align="center">0.422 &#x000B1; 0.073</td>
<td valign="top" align="center">0.437 &#x000B1; 0.073</td>
<td valign="top" align="center">&#x02212;0.063 &#x000B1; 0.073</td>
<td valign="top" align="center">0.539 &#x000B1; 0.067</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">50</td>
<td valign="top" align="center">0.470 &#x000B1; 0.086</td>
<td valign="top" align="center">0.454 &#x000B1; 0.083</td>
<td valign="top" align="center">0.469 &#x000B1; 0.082</td>
<td valign="top" align="center">0.448 &#x000B1; 0.037</td>
<td valign="top" align="center">&#x02212;0.0246 &#x000B1; 0.0194</td>
<td valign="top" align="center">0.531 &#x000B1; 0.026</td>
</tr>
<tr style="border-bottom: thin solid #000000;">
<td/>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.531 &#x000B1; 0.068</td>
<td valign="top" align="center">0.529 &#x000B1; 0.059</td>
<td valign="top" align="center">0.531 &#x000B1; 0.067</td>
<td valign="top" align="center">0.524 &#x000B1; 0.027</td>
<td valign="top" align="center">0.014 &#x000B1; 0.011</td>
<td valign="top" align="center">0.527 &#x000B1; 0.018</td>
</tr> <tr>
<td valign="top" align="left">WWH</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">0.411 &#x000B1; 0.248</td>
<td valign="top" align="center">0.591 &#x000B1; 0.174</td>
<td valign="top" align="center">0.433 &#x000B1; 0.200</td>
<td valign="top" align="center">0.416 &#x000B1; 0.148</td>
<td valign="top" align="center">&#x02212;0.006 &#x000B1; 0.103</td>
<td valign="top" align="center">0.569 &#x000B1; 0.069</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">50</td>
<td valign="top" align="center">0.352 &#x000B1; 0.116</td>
<td valign="top" align="center">0.660 &#x000B1; 0.099</td>
<td valign="top" align="center">0.360 &#x000B1; 0.111</td>
<td valign="top" align="center">0.437 &#x000B1; 0.083</td>
<td valign="top" align="center">0.003 &#x000B1; 0.019</td>
<td valign="top" align="center">0.532 &#x000B1; 0.016</td>
</tr>
<tr style="border-bottom: thin solid #000000;">
<td/>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.392 &#x000B1; 0.137</td>
<td valign="top" align="center">0.619 &#x000B1; 0.145</td>
<td valign="top" align="center">0.395 &#x000B1; 0.134</td>
<td valign="top" align="center">0.442 &#x000B1; 0.070</td>
<td valign="top" align="center">0.003 &#x000B1; 0.009</td>
<td valign="top" align="center">0.522 &#x000B1; 0.018</td>
</tr> <tr>
<td valign="top" align="left">MI</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">0.557 &#x000B1; 0.149</td>
<td valign="top" align="center">0.409 &#x000B1; 0.241</td>
<td valign="top" align="center">0.539 &#x000B1; 0.111</td>
<td valign="top" align="center">0.416 &#x000B1; 0.115</td>
<td valign="top" align="center">&#x02212;0.022 &#x000B1; 0.111</td>
<td valign="top" align="center">0.534 &#x000B1; 0.041</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">50</td>
<td valign="top" align="center">0.470 &#x000B1; 0.100</td>
<td valign="top" align="center">0.443 &#x000B1; 0.081</td>
<td valign="top" align="center">0.470 &#x000B1; 0.098</td>
<td valign="top" align="center">0.448 &#x000B1; 0.069</td>
<td valign="top" align="center">&#x02212;0.028 &#x000B1; 0.043</td>
<td valign="top" align="center">0.569 &#x000B1; 0.046</td>
</tr>
<tr style="border-bottom: thin solid #000000;">
<td/>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.468 &#x000B1; 0.081</td>
<td valign="top" align="center">0.471 &#x000B1; 0.069</td>
<td valign="top" align="center">0.468 &#x000B1; 0.079</td>
<td valign="top" align="center">0.462 &#x000B1; 0.046</td>
<td valign="top" align="center">&#x02212;0.014 &#x000B1; 0.020</td>
<td valign="top" align="center">0.544 &#x000B1; 0.034</td>
</tr> <tr>
<td valign="top" align="left">MIC</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">0.500 &#x000B1; 0.051</td>
<td valign="top" align="center">0.636 &#x000B1; 0.196</td>
<td valign="top" align="center">0.517 &#x000B1; 0.042</td>
<td valign="top" align="center">0.547 &#x000B1; 0.066</td>
<td valign="top" align="center">0.090 &#x000B1; 0.121</td>
<td valign="top" align="center">0.573 &#x000B1; 0.062</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">50</td>
<td valign="top" align="center">0.515 &#x000B1; 0.120</td>
<td valign="top" align="center">0.494 &#x000B1; 0.084</td>
<td valign="top" align="center">0.515 &#x000B1; 0.116</td>
<td valign="top" align="center">0.492 &#x000B1; 0.070</td>
<td valign="top" align="center">0.003 &#x000B1; 0.044</td>
<td valign="top" align="center">0.551 &#x000B1; 0.031</td>
</tr>
<tr style="border-bottom: thin solid #000000;">
<td/>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.510 &#x000B1; 0.058</td>
<td valign="top" align="center">0.502 &#x000B1; 0.071</td>
<td valign="top" align="center">0.510 &#x000B1; 0.057</td>
<td valign="top" align="center">0.501 &#x000B1; 0.038</td>
<td valign="top" align="center">0.003 &#x000B1; 0.017</td>
<td valign="top" align="center">0.531 &#x000B1; 0.024</td>
</tr> <tr>
<td valign="top" align="left">Wilks</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">0.522 &#x000B1; 0.113</td>
<td valign="top" align="center">0.477 &#x000B1; 0.087</td>
<td valign="top" align="center">0.517 &#x000B1; 0.109</td>
<td valign="top" align="center">0.498 &#x000B1; 0.098</td>
<td valign="top" align="center">0.0004 &#x000B1; 0.13</td>
<td valign="top" align="center">0.592 &#x000B1; 0.048</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">50</td>
<td valign="top" align="center">0.536 &#x000B1; 0.102</td>
<td valign="top" align="center">0.509 &#x000B1; 0.120</td>
<td valign="top" align="center">0.536 &#x000B1; 0.099</td>
<td valign="top" align="center">0.507 &#x000B1; 0.073</td>
<td valign="top" align="center">0.014 &#x000B1; 0.044</td>
<td valign="top" align="center">0.554 &#x000B1; 0.027</td>
</tr>
<tr style="border-bottom: thin solid #000000;">
<td/>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.523 &#x000B1; 0.050</td>
<td valign="top" align="center">0.502 &#x000B1; 0.080</td>
<td valign="top" align="center">0.523 &#x000B1; 0.049</td>
<td valign="top" align="center">0.508 &#x000B1; 0.048</td>
<td valign="top" align="center">0.006 &#x000B1; 0.021</td>
<td valign="top" align="center">0.538 &#x000B1; 0.025</td>
</tr> <tr>
<td valign="top" align="left">KCCA</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">0.472 &#x000B1; 0.267</td>
<td valign="top" align="center">0.432 &#x000B1; 0.202</td>
<td valign="top" align="center">0.467 &#x000B1; 0.231</td>
<td valign="top" align="center">0.393 &#x000B1; 0.168</td>
<td valign="top" align="center">&#x02212;0.067 &#x000B1; 0.219</td>
<td valign="top" align="center">0.623 &#x000B1; 0.083</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">50</td>
<td valign="top" align="center">0.442 &#x000B1; 0.121</td>
<td valign="top" align="center">0.464 &#x000B1; 0.119</td>
<td valign="top" align="center">0.442 &#x000B1; 0.117</td>
<td valign="top" align="center">0.428 &#x000B1; 0.070</td>
<td valign="top" align="center">&#x02212;0.031 &#x000B1; 0.037</td>
<td valign="top" align="center">0.541 &#x000B1; 0.058</td>
</tr>
<tr style="border-bottom: thin solid #000000;">
<td/>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.453 &#x000B1; 0.100</td>
<td valign="top" align="center">0.502 &#x000B1; 0.090</td>
<td valign="top" align="center">0.454 &#x000B1; 0.098</td>
<td valign="top" align="center">0.462 &#x000B1; 0.058</td>
<td valign="top" align="center">&#x02212;0.011 &#x000B1; 0.024</td>
<td valign="top" align="center">0.541 &#x000B1; 0.036</td>
</tr> <tr>
<td valign="top" align="left">dCor</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">0.506 &#x000B1; 0.061</td>
<td valign="top" align="center">0.545 &#x000B1; 0.166</td>
<td valign="top" align="center">0.511 &#x000B1; 0.069</td>
<td valign="top" align="center">0.520 &#x000B1; 0.102</td>
<td valign="top" align="center">0.034 &#x000B1; 0.140</td>
<td valign="top" align="center">0.573 &#x000B1; 0.060</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">50</td>
<td valign="top" align="center">0.529 &#x000B1; 0.084</td>
<td valign="top" align="center">0.513 &#x000B1; 0.103</td>
<td valign="top" align="center">0.529 &#x000B1; 0.082</td>
<td valign="top" align="center">0.512 &#x000B1; 0.067</td>
<td valign="top" align="center">0.014 &#x000B1; 0.042</td>
<td valign="top" align="center">0.556 &#x000B1; 0.031</td>
</tr>
<tr style="border-bottom: thin solid #000000;">
<td/>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.514 &#x000B1; 0.061</td>
<td valign="top" align="center">0.510 &#x000B1; 0.091</td>
<td valign="top" align="center">0.514 &#x000B1; 0.060</td>
<td valign="top" align="center">0.505 &#x000B1; 0.049</td>
<td valign="top" align="center">0.006 &#x000B1; 0.021</td>
<td valign="top" align="center">0.538 &#x000B1; 0.025</td>
</tr> <tr>
<td valign="top" align="left">CMMD</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">0.573 &#x000B1; 0.201</td>
<td valign="top" align="center">0.545 &#x000B1; 0.129</td>
<td valign="top" align="center">0.569 &#x000B1; 0.176</td>
<td valign="top" align="center">0.540 &#x000B1; 0.112</td>
<td valign="top" align="center">0.085 &#x000B1; 0.164</td>
<td valign="top" align="center">0.611 &#x000B1; 0.066</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">50</td>
<td valign="top" align="center">0.508 &#x000B1; 0.081</td>
<td valign="top" align="center">0.491 &#x000B1; 0.088</td>
<td valign="top" align="center">0.508 &#x000B1; 0.079</td>
<td valign="top" align="center">0.494 &#x000B1; 0.065</td>
<td valign="top" align="center">&#x02212;0.00006 &#x000B1; 0.041</td>
<td valign="top" align="center">0.547 &#x000B1; 0.031</td>
</tr>
<tr style="border-bottom: thin solid #000000;">
<td/>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.512 &#x000B1; 0.071</td>
<td valign="top" align="center">0.505 &#x000B1; 0.068</td>
<td valign="top" align="center">0.512 &#x000B1; 0.070</td>
<td valign="top" align="center">0.503 &#x000B1; 0.044</td>
<td valign="top" align="center">0.004 &#x000B1; 0.019</td>
<td valign="top" align="center">0.532 &#x000B1; 0.028</td>
</tr>
<tr>
<td valign="top" align="left">RDC</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">0.522 &#x000B1; 0.147</td>
<td valign="top" align="center">0.568 &#x000B1; 0.227</td>
<td valign="top" align="center">0.528 &#x000B1; 0.139</td>
<td valign="top" align="center">0.527 &#x000B1; 0.143</td>
<td valign="top" align="center">0.062 &#x000B1; 0.203</td>
<td valign="top" align="center">0.599 &#x000B1; 0.038</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">50</td>
<td valign="top" align="center">0.518 &#x000B1; 0.085</td>
<td valign="top" align="center">0.522 &#x000B1; 0.076</td>
<td valign="top" align="center">0.518 &#x000B1; 0.083</td>
<td valign="top" align="center">0.515 &#x000B1; 0.076</td>
<td valign="top" align="center">0.013 &#x000B1; 0.039</td>
<td valign="top" align="center">0.551 &#x000B1; 0.032</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.517 &#x000B1; 0.070</td>
<td valign="top" align="center">0.515 &#x000B1; 0.042</td>
<td valign="top" align="center">0.517 &#x000B1; 0.069</td>
<td valign="top" align="center">0.051 &#x000B1; 0.04</td>
<td valign="top" align="center">0.007 &#x000B1; 0.018</td>
<td valign="top" align="center">0.534 &#x000B1; 0.026</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>For the association measures, it becomes more difficult to achieve high inference performances when the network size becomes bigger from 10, 50 to 100. Although each association measure cannot achieve good inferences for big networks, the performances of them decrease with the same tendency. For 50-node networks, mutual information (MI) achieves the best AUC of 0.569 &#x000B1; 0.046. Blomqvist&#x00027;s &#x003B2; performs the best for 100-node networks in the inference, while it is not stable for the small-size networks. Figure <xref ref-type="fig" rid="F3">3</xref> shows the ranks of their performances according to the mean AUCs in different size of networks individually. From the comparative study, mutual information (MI) performs relatively better with stable ranks for big networks with 50 and 100 nodes. PCC is also stable in the 14 association measures for various sizes of network, as well as KCCA and dCor. This indicates their relative reliability in detecting gene regulatory relationships from expression data. For the other association measures, they accomplish unreliable and unstable regulatory network inferences in the benchmarks.</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p>The ranks of 14 association measures in the inferences of regulatory networks with different node sizes. The numbers in the color blocks refer to the ranks of corresponding association measures by the means of AUC in these benchmark networks.</p></caption>
<graphic xlink:href="fgene-08-00096-g0003.tif"/>
</fig>
<p>From the inference performances, we find that most of association-based methods can only achieve limited accuracies in the reconstruction of gene regulatory network from the benchmark datasets, especially for large-size networks. The application scopes of these association measures are mainly determined by the assumptions and characteristics of their definitions listed in Table <xref ref-type="table" rid="T1">1</xref>. For instances, PCC is for linear regulatory relationship, MI is for non-linear relationship, KCCA and dCor measure the genuine relationship based on covariance, and the rank-based associations are robust to the noisy and outliers in gene expressions. In practical applications, the selection of suitable association measures could be subjectively determined by research purpose, experimental design, phenotypic condition and data quality. An ensemble and self-adaptive association measures selection strategy is desirable to be proposed for the co-existence of different gene regulatory relationships.</p>
<p>In real microarray data, we perform our comparative study of quantifying gene regulations during hepatitis C virus (HCV) infection on host Huh7 cells. The gene expression data are downloaded from NCBI GEO (accession ID <ext-link ext-link-type="NCBI:geo" xlink:href="GSE20948">GSE20948</ext-link>) (Edgar et al., <xref ref-type="bibr" rid="B15">2002</xref>). There are 28 samples of 14 HCV infected Huh7 hepatoma cell samples and 14 corresponding mock-infected samples, originally designed three replicates at 6, 12, 18, 24, and 48 h post-infections, respectively. Two samples at 6 h have not been enrolled after quality control. The details can be accessed from Ref. (Blackham et al., <xref ref-type="bibr" rid="B6">2010</xref>). We also download the hepatocellular carcinoma (HCC) gene set from KEGG (Kanehisa and Goto, <xref ref-type="bibr" rid="B19">2000</xref>). The gene set contains 123 genes with 94 genes containing their expression profiles in GSE20948 (Edgar et al., <xref ref-type="bibr" rid="B15">2002</xref>).</p>
<p>For evaluating the inference consistency of these association measures, we calculate the pairwise gene regulatory strengths in the HCC genes by the 14 association measures respectively. In the results of each association measure, the pairs with the top 5% association values are regarded as the identified gene regulations in the context of specific gene expression profiles after HCV infection.</p>
<p>Figure <xref ref-type="fig" rid="F4">4</xref> demonstrates the inferred gene coexpression regulatory network in the HCC genes by PCC. There is no information about direction, so we annotate the known human TFs and display them by different color nodes (cyan) with the other genes (green). From Figure <xref ref-type="fig" rid="F4">4</xref>, we can figure out the regulatory information about positive and negative relationships during HCV infection. As in the former comparisons, we compare the overlapping status of these inferred coexpression relationships by the four association measures with top performances, i.e., Pearson, MI, KCCA and dCor. There exists only one pair of genes (&#x0201C;IFNA1&#x0201D; and &#x0201C;IFNA13&#x0201D;) is identified by the four measures, and the relationship between the two genes can be detected by any of them. Interestingly, Pearson and dCor contain many overlaps (177 regulations). It provides direct evidence that dCor is mainly to extract the linear correlations between genes as that Pearson done in this case study. There are few overlaps (3 regulations) between Pearson and MI, which indicates the linear and non-linear information are inconsistent with each other, and different association measures might identify different gene associations. The selection of suitable association measures is again proved to be very important for inferring gene coexpression regulatory network. The few overlapping regulations also imply the complex and diversity of regulatory relationships underlying gene expressions. More advanced methods beyond association measures are urged for elucidating gene regulatory mechanism from high-throughput data. See Section Discussion for some already available methods.</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p>The reconstructed gene coexpression regulatory network during HCV infection. <bold>(A)</bold> The gene association network constructed by the PCC-based method. Isolated genes are not shown. <bold>(B)</bold> The overlapping status of the inferred gene regulations by four association measures, i.e., Pearson, MI, KCCA, and dCor.</p></caption>
<graphic xlink:href="fgene-08-00096-g0004.tif"/>
</fig>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>It is known association is different from causality and correlation does not imply causation (Altman and Krzywinski, <xref ref-type="bibr" rid="B1">2015</xref>). Detecting the causality between genes has been essential in gene regulatory network inference since the availability of high-throughput data (Opgen-Rhein and Strimmer, <xref ref-type="bibr" rid="B33">2007</xref>). Gene association network indicates more general gene-gene relationship than regulation, and gene regulatory network indicates more general gene-gene relationship than causality. The gene causality network, that is to say, the causal regulations between genes are directed in the gene-gene interaction graph with the detailed information of which ones are upstream regulators, and which ones are downstream targets. In the direct regulations, TFs or signal transductors causally affect their target gene expressions. The information flow transits between genes will be revealed if a causal relationship exists. So far, there is no association measure has been defined for describing the causal relationship between genes (Zhang et al., <xref ref-type="bibr" rid="B56">2014</xref>; Zhao et al., <xref ref-type="bibr" rid="B58">2016</xref>), while more advanced methods based on conditional probability, model-based regression and differential equation have been proposed to address the evaluations of causality.</p>
<p>Based on conditional independence, some improved association measures, such as partial correlation coefficient and conditional mutual information, have been proposed to eliminate false positive regulations from gene associations. The original association measures generate the footholds for detecting genuine relationships. Conditioning on another gene or gene set <italic>Z</italic>, partial correlation measure <italic>r</italic><sub><italic>XY</italic>&#x000B7;<italic>Z</italic></sub> between gene <italic>X</italic> and <italic>Y</italic> is to access the exact correlation between <italic>X</italic> and <italic>Y</italic> and that has no relationship with <italic>Z</italic> (de la Fuente et al., <xref ref-type="bibr" rid="B12">2004</xref>). It is defined as</p>
<disp-formula id="E23"><mml:math id="M36"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mi>X</mml:mi><mml:mi>Y</mml:mi><mml:mo>&#x000B7;</mml:mo><mml:mi>Z</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mi>X</mml:mi><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mi>X</mml:mi><mml:mi>Z</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mi>Y</mml:mi><mml:mi>Z</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msqrt><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>1</mml:mn><mml:mo>-</mml:mo><mml:msubsup><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mi>X</mml:mi><mml:mi>Z</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>1</mml:mn><mml:mo>-</mml:mo><mml:msubsup><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mi>Y</mml:mi><mml:mi>Z</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msqrt></mml:mrow></mml:mfrac><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Where <italic>r</italic> refers to PCC. In the similar philosophy of introducing other gene or gene set, the conditional mutual information (CMI; Liang and Wang, <xref ref-type="bibr" rid="B24">2008</xref>) is defined as</p>
<disp-formula id="E24"><mml:math id="M37"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>I</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo><mml:msub><mml:mrow><mml:mi>Z</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:munder class="msub"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02208;</mml:mo><mml:mi>X</mml:mi><mml:mo>,</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02208;</mml:mo><mml:mi>Y</mml:mi><mml:mo>,</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:msub><mml:mrow><mml:mi>Z</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02208;</mml:mo><mml:mi>Z</mml:mi></mml:mrow></mml:munder></mml:mstyle><mml:mi>p</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>Z</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo class="qopname">log</mml:mo><mml:mfrac><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo><mml:msub><mml:mrow><mml:mi>Z</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo><mml:msub><mml:mrow><mml:mi>Z</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mi>p</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo><mml:msub><mml:mrow><mml:mi>Z</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Based on CMI and the order of conditioned gene numbers, we proposed a gene regulatory inference method named PCA-CMI (Zhang et al., <xref ref-type="bibr" rid="B57">2012</xref>, <xref ref-type="bibr" rid="B55">2013</xref>), which detect out dedicate associations by removing undirect false positive regulations. For a pair of genes <italic>X</italic> and <italic>Y</italic>, Li proposed a conditional coexpression measure named liquid association (LA) between two genes by introducing a third gene <italic>Z</italic> (Li, <xref ref-type="bibr" rid="B23">2002</xref>). Based on <italic>Z</italic>, the gene relationship of <italic>X</italic> and <italic>Y</italic> is defined as</p>
<disp-formula id="E25"><mml:math id="M38"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>L</mml:mi><mml:mi>A</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>X</mml:mi><mml:mi>Y</mml:mi><mml:mo stretchy="false">|</mml:mo><mml:mi>Z</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mi>E</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>X</mml:mi><mml:mi>Y</mml:mi><mml:mo stretchy="false">|</mml:mo><mml:mi>Z</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:munder class="msub"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:munder></mml:mstyle><mml:mfrac><mml:mrow><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>Z</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:mfrac></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>n</italic> is the sample size. The LA activity determines the functional associations of gene <italic>X</italic> and <italic>Y</italic> in the condition of <italic>Z</italic>.</p>
<p>Currently, the causality between genes is often quantified via Bayesian models (Friedman et al., <xref ref-type="bibr" rid="B16">2000</xref>). According to data, the conditional probability of <inline-formula><mml:math id="M39"><mml:mi>P</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>X</mml:mi><mml:mo stretchy="false">|</mml:mo><mml:mi>Y</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>P</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>Y</mml:mi><mml:mo stretchy="false">|</mml:mo><mml:mi>X</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mi>P</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>P</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:mo>.</mml:mo></mml:math></inline-formula> The probability of gene <italic>X</italic> conditioned on gene <italic>Y</italic>, means <italic>Y</italic> have a causal effect on <italic>X</italic> because there exists a negative or positive values of the conditional probability. The structured model has been extended and formulated as diagrams using a graphical criterion known as <italic>d</italic>-separation (Bareinboim and Pearl, <xref ref-type="bibr" rid="B4">2016</xref>). Bayesian network provides a model-based detection of causal regulatory relationships. Gene regulations are then identified from the graphical models (Liu et al., <xref ref-type="bibr" rid="B29">2013</xref>).</p>
<p>Regression and other structured models often extract the effects of regulatory coefficients. The identification of model coefficients determines the global relationship of these individual genes (D&#x00027;Haeseleer et al., <xref ref-type="bibr" rid="B14">1999</xref>). Specifically, the regression models the response gene as the linear combinations of the other dependent genes, i.e., <italic>Y</italic> &#x0003D; <italic>c</italic><sub>0</sub> &#x0002B; <italic>c</italic><sub>1</sub><italic>X</italic><sub>1</sub> &#x0002B; <italic>c</italic><sub>2</sub><italic>X</italic><sub>2</sub> &#x0002B; &#x022EF; &#x0002B; <italic>c</italic><sub><italic>m</italic></sub><italic>X</italic><sub><italic>m</italic></sub> &#x0002B; &#x003B5;, <italic>m</italic> is the number of dependent genes in the regression and &#x003B5; is the error variable. In generalized linear models, the response gene is changed to &#x003B8;(<italic>Y</italic>), and <italic>X</italic><sub>1</sub>, &#x022EF;&#x000A0;, <italic>X</italic><sub><italic>m</italic></sub> are replaced by &#x003D5;<sub>1</sub>(<italic>X</italic><sub>1</sub>), &#x022EF;&#x000A0;, &#x003D5;<sub><italic>m</italic></sub>(<italic>X</italic><sub><italic>m</italic></sub>), respectively (Breiman and Friedman, <xref ref-type="bibr" rid="B9">1985</xref>). In the special case of simple linear regression with <italic>m</italic> &#x0003D; 1, the model is to detect the linear relationship between the response gene and the only one dependent gene. The coefficient of determination denoted by <italic>r</italic><sup>2</sup> is equal to the square of PCC (Altman and Krzywinski, <xref ref-type="bibr" rid="B2">2016</xref>). The coefficient of determination, which represents the proportion of variation due to their linear relationship, generalizes the correlation coefficient for relationships beyond simple linear regression. Often, the regression equations often model the associations between response genes and dependent genes in an inter-coupled system. From a system biology perspective, regression models consider the genes in an integrated manner. Compared to the former pairwise associations, they identify more complicated relationships among genes. After determining the coefficients, the relationships in these genes are quantified correspondingly. How to determine crucial regulators and targets via statistical variable selections techniques, such as lasso (Tibshirani, <xref ref-type="bibr" rid="B47">1996</xref>) and elastic net (Zou and Trevor, <xref ref-type="bibr" rid="B59">2005</xref>), are substantially important.</p>
<p>Similarly, ODE models the derivatives, i.e., <inline-formula><mml:math id="M40"><mml:mfrac><mml:mrow><mml:mi>d</mml:mi><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:mfrac><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi>c</mml:mi></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>c</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>c</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:mo>&#x022EF;</mml:mo><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>c</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, and so ODE quantifies the dynamics of the response as a function of the dependents in the system (Wu et al., <xref ref-type="bibr" rid="B51">2014</xref>). The expression change rate of a response gene is modeled by the expressions of dependence genes. The <italic>Y</italic> might be another dependence gene and thus the system is closed. The system identification is to evaluate the coefficients in the right-hand side of the equation and the coefficient values refer to gene regulatory strengths. When the coefficient is 0, there is no relationship between the responding gene and the depending gene, otherwise the regulatory strength can be represented by positive or negative numeric values.</p>
<p>Compared to association measure, regression model and differential equation model regard gene regulatory network as an integrative system. The gene regulatory network inference is then transformed to a system identification problem of solving the coupled equations. The gene regulation strengths refer to the identified coefficients. From a sequential modeling perspective, the causality between regulators and targets can also be reflected by these system biology techniques.</p>
<p>In machine learning techniques such as clustering (Rui and Wunsch, <xref ref-type="bibr" rid="B38">2005</xref>), there are some metrics have been developed for measuring the association between data points. The distances of Euclidean, cosine, Hamming, Manhattan are often used to measure gene relationships in gene expression clustering (D&#x00027;Haeseleer, <xref ref-type="bibr" rid="B13">2005</xref>). These distances evaluate the differences including dependences between genes, while these compared association measures focus on quantifying gene relationship such as regulation between genes. In gene expression data analyses of clustering and feature selection, distance metrics provide alternatives to define gene similarities. The distance metrics are not included in the comparative study for their diversity and case-intensity (Santini and Jain, <xref ref-type="bibr" rid="B39">1999</xref>).</p>
</sec>
<sec sec-type="conclusions" id="s5">
<title>Conclusions</title>
<p>In this paper, we summarized and compared the main proximities and metrics for quantifying gene regulatory associations. Written in full, the definitions and descriptions of 14 association measures are summarized and their characteristics with applications in regulatory network inference have been presented. From the benchmark challenge data and real gene expression data, we compared their performances and consistencies in the network inferences. Furthermore, their advantages and limitations are also analyzed and discussed. Currently, developing causality measure is an urgent research topic from driving gene association to regulation causality (Bareinboim and Pearl, <xref ref-type="bibr" rid="B4">2016</xref>). A powerful measure of causality will greatly benefit the discovery of important gene regulations. Moreover, the linear/non-linear regression and differential equation models regard many genes in dynamic systems and the parameters of these models represent the system in details. The model-based gene regulatory network inference methods seem to provide more powerful tools when compared to the association-based methods. However, the association measures contain their flexibility in sense, easy interpretation and large scope of applications.</p>
<p>In conclusion, gene association measures provide fundamental quantifications of detecting gene regulatory relationships from transcriptomic profiling data. The high-throughput technologies advance the measurements of thousands of genes in parallel manners. The association measures effectively accelerate the transformation processes from data to knowledge. Most of the proposed association measures are statistical techniques which focus only on the inter-relationships between genes, and they are very hard to get the causal gene relationships alone. With the improved conditional or joint association measures, such as partial correlation coefficient, conditional mutual information and liquid association, the causality between genes can be partially extracted out from data. The introduction of other genes in evaluating gene regulation provides promising alternatives to grasp the genuine regulations. For an entire system, many genes perform their functions coordinately and cooperatively. So more advanced models are extremely needed to describe the complex system of gene regulations. In such model as ODE, the time-varying regulations are exactly to quantify the gene regulatory interactions with temporal implications. For the model complexity and the data availability, the dynamics underlying the coefficients in regression and ODE will reveal much more complicated regulatory relationships.</p>
</sec>
<sec id="s6">
<title>Author contributions</title>
<p>ZL conceived and designed the study. ZL wrote the code and analyzed the data. ZL drafted the manuscript.</p>
<sec>
<title>Conflict of interest statement</title>
<p>The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p></sec>
</sec>
</body>
<back>
<ack><p>This work was partially supported by the National Natural Science Foundation of China (NSFC) under Grant Nos. 61572287 and 61533011; Natural Science Foundation of Shandong Province, China (ZR2015FQ001); the Fundamental Research Funds of Shandong University under Grant Nos. 2015QY001 and 2016JC007; the Scientific Research Foundation for the Returned Overseas Chinese Scholars, Ministry of Education of China. The paper was also funded by a Pilot Research Grant from School of Control Science and Engineering at Shandong University.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Altman</surname> <given-names>N.</given-names></name> <name><surname>Krzywinski</surname> <given-names>M.</given-names></name></person-group> (<year>2015</year>). <article-title>Points of significance: association, correlation and causation</article-title>. <source>Nat. Methods</source> <volume>12</volume>, <fpage>899</fpage>&#x02013;<lpage>900</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.3587</pub-id></citation></ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Altman</surname> <given-names>N.</given-names></name> <name><surname>Krzywinski</surname> <given-names>M.</given-names></name></person-group> (<year>2016</year>). <article-title>Points of significance: simple linear regression</article-title>. <source>Nat Methods</source> <volume>12</volume>, <fpage>999</fpage>&#x02013;<lpage>1000</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.3627</pub-id></citation></ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bach</surname> <given-names>F. R.</given-names></name> <name><surname>Jordan</surname> <given-names>M. I.</given-names></name></person-group> (<year>2002</year>). <article-title>Kernel independent component analysis</article-title>. <source>J. Mach. Learn. Res.</source> <volume>3</volume>, <fpage>1</fpage>&#x02013;<lpage>48</lpage>. <pub-id pub-id-type="doi">10.1109/ICASSP.2003.1202783</pub-id></citation></ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bareinboim</surname> <given-names>E.</given-names></name> <name><surname>Pearl</surname> <given-names>J.</given-names></name></person-group> (<year>2016</year>). <article-title>Causal inference and the data-fusion problem</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>113</volume>, <fpage>7345</fpage>&#x02013;<lpage>7352</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1510507113</pub-id><pub-id pub-id-type="pmid">27382148</pub-id></citation></ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bar-Joseph</surname> <given-names>Z.</given-names></name> <name><surname>Gitter</surname> <given-names>A.</given-names></name> <name><surname>Simon</surname> <given-names>I.</given-names></name></person-group> (<year>2012</year>). <article-title>Studying and modelling dynamic biological processes using time-series gene expression data</article-title>. <source>Nat. Rev. Genet.</source> <volume>13</volume>, <fpage>552</fpage>&#x02013;<lpage>564</lpage>. <pub-id pub-id-type="doi">10.1038/nrg3244</pub-id><pub-id pub-id-type="pmid">22805708</pub-id></citation></ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Blackham</surname> <given-names>S.</given-names></name> <name><surname>Baillie</surname> <given-names>A.</given-names></name> <name><surname>Al-Hababi</surname> <given-names>F.</given-names></name> <name><surname>Remlinger</surname> <given-names>K.</given-names></name> <name><surname>You</surname> <given-names>S.</given-names></name> <name><surname>Hamatake</surname> <given-names>R.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Gene expression profiling indicates the roles of host oxidative stress, apoptosis, lipid metabolism, and intracellular transport genes in the replication of hepatitis C virus</article-title>. <source>J. Virol.</source> <volume>84</volume>, <fpage>5404</fpage>&#x02013;<lpage>5414</lpage>. <pub-id pub-id-type="doi">10.1128/JVI.02529-09</pub-id><pub-id pub-id-type="pmid">20200238</pub-id></citation></ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Blomqvist</surname> <given-names>N.</given-names></name></person-group> (<year>1950</year>). <article-title>On a measure of dependence between two random variables</article-title>. <source>Ann. Math. Stat.</source> <volume>21</volume>, <fpage>593</fpage>&#x02013;<lpage>600</lpage>. <pub-id pub-id-type="doi">10.1214/aoms/1177729754</pub-id></citation></ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Borgwardt</surname> <given-names>K. M.</given-names></name> <name><surname>Gretton</surname> <given-names>A.</given-names></name> <name><surname>Rasch</surname> <given-names>M. J.</given-names></name> <name><surname>Kriegel</surname> <given-names>H. P.</given-names></name> <name><surname>Scholkopf</surname> <given-names>B.</given-names></name> <name><surname>Smola</surname> <given-names>A. J.</given-names></name></person-group> (<year>2006</year>). <article-title>Integrating structured biological data by kernel maximum mean discrepancy</article-title>. <source>Bioinformatics</source> <volume>22</volume>, <fpage>e49</fpage>&#x02013;<lpage>e57</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btl242</pub-id><pub-id pub-id-type="pmid">16873512</pub-id></citation></ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Breiman</surname> <given-names>L.</given-names></name> <name><surname>Friedman</surname> <given-names>J. H.</given-names></name></person-group> (<year>1985</year>). <article-title>Estimating optimal transformations for multiple regression and correlation</article-title>. <source>J. Am. Stat. Assoc.</source> <volume>80</volume>, <fpage>580</fpage>&#x02013;<lpage>598</lpage>. <pub-id pub-id-type="doi">10.1080/01621459.1985.10478157</pub-id></citation></ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Butte</surname> <given-names>A. J.</given-names></name> <name><surname>Kohane</surname> <given-names>I. S.</given-names></name></person-group> (<year>2000</year>). <article-title>Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements</article-title>. <source>Pac. Symp. Biocomput.</source> <volume>5</volume>, <fpage>418</fpage>&#x02013;<lpage>429</lpage>. <pub-id pub-id-type="doi">10.1142/9789814447331_0040</pub-id></citation></ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Conover</surname> <given-names>W. J.</given-names></name> <name><surname>Iman</surname> <given-names>R. L.</given-names></name></person-group> (<year>1981</year>). <article-title>Rank transformations as a bridge between parametric and nonparametric statistics</article-title>. <source>Am. Stat.</source> <volume>35</volume>, <fpage>124</fpage>&#x02013;<lpage>129</lpage>. <pub-id pub-id-type="doi">10.1080/00031305.1981.10479327</pub-id></citation></ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>de la Fuente</surname> <given-names>A.</given-names></name> <name><surname>Bing</surname> <given-names>N.</given-names></name> <name><surname>Hoeschele</surname> <given-names>I.</given-names></name> <name><surname>Mendes</surname> <given-names>P.</given-names></name></person-group> (<year>2004</year>). <article-title>Discovery of meaningful associations in genomic data using partial correlation coefficients</article-title>. <source>Bioinformatics</source> <volume>20</volume>, <fpage>3565</fpage>&#x02013;<lpage>3574</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bth445</pub-id><pub-id pub-id-type="pmid">15284096</pub-id></citation></ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>D&#x00027;Haeseleer</surname> <given-names>P.</given-names></name></person-group> (<year>2005</year>). <article-title>How does gene expression clustering work?</article-title> <source>Nat. Biotechnol.</source> <volume>23</volume>, <fpage>1499</fpage>&#x02013;<lpage>1501</lpage>. <pub-id pub-id-type="doi">10.1038/nbt1205-1499</pub-id><pub-id pub-id-type="pmid">16333293</pub-id></citation></ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>D&#x00027;Haeseleer</surname> <given-names>P.</given-names></name> <name><surname>Wen</surname> <given-names>X.</given-names></name> <name><surname>Fuhrman</surname> <given-names>S.</given-names></name> <name><surname>Somogyi</surname> <given-names>R.</given-names></name></person-group> (<year>1999</year>). <article-title>Linear modeling of mRNA expression levels during CNS development and injury</article-title>. <source>Pac. Symp. Biocomput.</source> <volume>4</volume>, <fpage>41</fpage>&#x02013;<lpage>52</lpage>. <pub-id pub-id-type="doi">10.1142/9789814447300_0005</pub-id></citation></ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Edgar</surname> <given-names>R.</given-names></name> <name><surname>Domrachev</surname> <given-names>M.</given-names></name> <name><surname>Lash</surname> <given-names>A. E.</given-names></name></person-group> (<year>2002</year>). <article-title>Gene Expression Omnibus: NCBI gene expression and hybridization array data repository</article-title>. <source>Nucleic Acids Res.</source> <volume>30</volume>, <fpage>207</fpage>&#x02013;<lpage>210</lpage>. <pub-id pub-id-type="doi">10.1093/nar/30.1.207</pub-id><pub-id pub-id-type="pmid">11752295</pub-id></citation></ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Friedman</surname> <given-names>N.</given-names></name> <name><surname>Linial</surname> <given-names>M.</given-names></name> <name><surname>Nachman</surname> <given-names>I.</given-names></name> <name><surname>Pe&#x00027;er</surname> <given-names>D.</given-names></name></person-group> (<year>2000</year>). <article-title>Using Bayesian networks to analyze expression data</article-title>. <source>J. Comput. Biol.</source> <volume>7</volume>, <fpage>601</fpage>&#x02013;<lpage>620</lpage>. <pub-id pub-id-type="doi">10.1089/106652700750050961</pub-id><pub-id pub-id-type="pmid">11108481</pub-id></citation></ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goodman</surname> <given-names>L. A.</given-names></name> <name><surname>Kruskal</surname> <given-names>W. H.</given-names></name></person-group> (<year>1954</year>). <article-title>Measures of association for cross classifications</article-title>. <source>J. Am. Stat. Assoc.</source> <volume>49</volume>, <fpage>732</fpage>&#x02013;<lpage>764</lpage>. <pub-id pub-id-type="doi">10.1080/01621459.1954.10501231</pub-id></citation></ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hoeffding</surname> <given-names>W.</given-names></name></person-group> (<year>1948</year>). <article-title>A non-parametric test of independence</article-title>. <source>Ann. Math. Stat.</source> <volume>19</volume>, <fpage>546</fpage>&#x02013;<lpage>557</lpage>. <pub-id pub-id-type="doi">10.1214/aoms/1177730150</pub-id></citation></ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kanehisa</surname> <given-names>M.</given-names></name> <name><surname>Goto</surname> <given-names>S.</given-names></name></person-group> (<year>2000</year>). <article-title>KEGG: kyoto encyclopedia of genes and genomes</article-title>. <source>Nucleic Acids Res.</source> <volume>28</volume>, <fpage>27</fpage>&#x02013;<lpage>30</lpage>. <pub-id pub-id-type="doi">10.1093/nar/28.1.27</pub-id><pub-id pub-id-type="pmid">10592173</pub-id></citation></ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kendall</surname> <given-names>M. G.</given-names></name></person-group> (<year>1938</year>). <article-title>A new measure of rank correlation</article-title>. <source>Biometrika</source> <volume>30</volume>, <fpage>81</fpage>&#x02013;<lpage>93</lpage>. <pub-id pub-id-type="doi">10.1093/biomet/30.1-2.81</pub-id></citation></ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kullback</surname> <given-names>S.</given-names></name> <name><surname>Leibler</surname> <given-names>R. A.</given-names></name></person-group> (<year>1951</year>). <article-title>On information and sufficiency</article-title>. <source>Ann. Math. Stat.</source> <volume>22</volume>, <fpage>79</fpage>&#x02013;<lpage>86</lpage>. <pub-id pub-id-type="doi">10.1214/aoms/1177729694</pub-id></citation></ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Langfelder</surname> <given-names>P.</given-names></name> <name><surname>Horvath</surname> <given-names>S.</given-names></name></person-group> (<year>2008</year>). <article-title>WGCNA: an R package for weighted correlation network analysis</article-title>. <source>BMC Bioinformatics</source> <volume>9</volume>:<fpage>559</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-9-559</pub-id><pub-id pub-id-type="pmid">19114008</pub-id></citation></ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>K. C.</given-names></name></person-group> (<year>2002</year>). <article-title>Genome-wide coexpression dynamics: theory and application</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>99</volume>, <fpage>16875</fpage>&#x02013;<lpage>16880</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.252466999</pub-id><pub-id pub-id-type="pmid">12486219</pub-id></citation></ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liang</surname> <given-names>K. C.</given-names></name> <name><surname>Wang</surname> <given-names>X.</given-names></name></person-group> (<year>2008</year>). <article-title>Gene regulatory network reconstruction using conditional mutual information</article-title>. <source>EURASIP J. Bioinform. Syst. Biol.</source> <volume>2008</volume>:<fpage>253894</fpage>. <pub-id pub-id-type="doi">10.1155/2008/253894</pub-id></citation></ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>Z. P.</given-names></name></person-group> (<year>2015</year>). <article-title>Reverse engineering of genome-wide gene regulatory networks from gene expression data</article-title>. <source>Curr. Genomics</source> <volume>16</volume>, <fpage>3</fpage>&#x02013;<lpage>22</lpage>. <pub-id pub-id-type="doi">10.2174/1389202915666141110210634</pub-id><pub-id pub-id-type="pmid">25937810</pub-id></citation></ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>Z. P.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Zhang</surname> <given-names>X. S.</given-names></name> <name><surname>Chen</surname> <given-names>L.</given-names></name></person-group> (<year>2012</year>). <article-title>Network-based analysis of complex diseases</article-title>. <source>IET Syst. Biol.</source> <volume>6</volume>, <fpage>22</fpage>&#x02013;<lpage>33</lpage>. <pub-id pub-id-type="doi">10.1049/iet-syb.2010.0052</pub-id><pub-id pub-id-type="pmid">22360268</pub-id></citation></ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>Z. P.</given-names></name> <name><surname>Wu</surname> <given-names>C.</given-names></name> <name><surname>Miao</surname> <given-names>H.</given-names></name> <name><surname>Wu</surname> <given-names>H.</given-names></name></person-group> (<year>2015</year>). <article-title>RegNetwork: an integrated database of transcriptional and post-transcriptional regulatory networks in human and mouse</article-title>. <source>Database</source> <volume>2015</volume>:<fpage>bav095</fpage>. <pub-id pub-id-type="doi">10.1093/database/bav095</pub-id><pub-id pub-id-type="pmid">26424082</pub-id></citation></ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>Z. P.</given-names></name> <name><surname>Wu</surname> <given-names>H.</given-names></name> <name><surname>Zhu</surname> <given-names>J.</given-names></name> <name><surname>Miao</surname> <given-names>H.</given-names></name></person-group> (<year>2014</year>). <article-title>Systematic identification of transcriptional and post-transcriptional regulations in human respiratory epithelial cells during influenza A virus infection</article-title>. <source>BMC Bioinformatics</source> <volume>15</volume>:<fpage>336</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-15-336</pub-id><pub-id pub-id-type="pmid">25281301</pub-id></citation></ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>Z. P.</given-names></name> <name><surname>Zhang</surname> <given-names>W.</given-names></name> <name><surname>Horimoto</surname> <given-names>K.</given-names></name> <name><surname>Chen</surname> <given-names>L.</given-names></name></person-group> (<year>2013</year>). <article-title>Gaussian graphical model for identifying significantly responsive regulatory networks from time course high-throughput data</article-title>. <source>IET Syst. Biol.</source> <volume>7</volume>, <fpage>143</fpage>&#x02013;<lpage>152</lpage>. <pub-id pub-id-type="doi">10.1049/iet-syb.2012.0062</pub-id><pub-id pub-id-type="pmid">24067414</pub-id></citation></ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lopez-Paz</surname> <given-names>D.</given-names></name> <name><surname>Hennig</surname> <given-names>P.</given-names></name> <name><surname>Scholkopf</surname> <given-names>B.</given-names></name></person-group> (<year>2013</year>). <article-title>The randomized dependence coefficient</article-title>. <source>Adv. Neural Inf. Process. Syst.</source> <volume>26</volume>, <fpage>1</fpage>&#x02013;<lpage>9</lpage>.</citation></ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marbach</surname> <given-names>D.</given-names></name> <name><surname>Prill</surname> <given-names>R. J.</given-names></name> <name><surname>Schaffter</surname> <given-names>T.</given-names></name> <name><surname>Mattiussi</surname> <given-names>C.</given-names></name> <name><surname>Floreano</surname> <given-names>D.</given-names></name> <name><surname>Stolovitzky</surname> <given-names>G.</given-names></name></person-group> (<year>2010</year>). <article-title>Revealing strengths and weaknesses of methods for gene network inference</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>107</volume>, <fpage>6286</fpage>&#x02013;<lpage>6291</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0913357107</pub-id><pub-id pub-id-type="pmid">20308593</pub-id></citation></ref>
<ref id="B32">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Nelsen</surname> <given-names>R. B.</given-names></name></person-group> (<year>2006</year>). <source>An Introduction to Copulas.</source> <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Springer-Verlag</publisher-name>.</citation></ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Opgen-Rhein</surname> <given-names>R.</given-names></name> <name><surname>Strimmer</surname> <given-names>K.</given-names></name></person-group> (<year>2007</year>). <article-title>From correlation to causation networks: a simple approximate learning algorithm and its application to high-dimensional plant gene expression data</article-title>. <source>BMC Syst. Biol.</source> <volume>1</volume>:<fpage>37</fpage>. <pub-id pub-id-type="doi">10.1186/1752-0509-1-37</pub-id><pub-id pub-id-type="pmid">17683609</pub-id></citation></ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pearson</surname> <given-names>K.</given-names></name></person-group> (<year>1895</year>). <article-title>Note on regression and inheritance in the case of two parents</article-title>. <source>Proc. R. Soc. Lond.</source> <volume>58</volume>, <fpage>240</fpage>&#x02013;<lpage>242</lpage>. <pub-id pub-id-type="doi">10.1098/rspl.1895.0041</pub-id></citation></ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pillai</surname> <given-names>K. C. S.</given-names></name></person-group> (<year>1955</year>). <article-title>Some new test criteria in multivariate analysis</article-title>. <source>Ann. Math. Stat.</source> <volume>26</volume>, <fpage>117</fpage>&#x02013;<lpage>121</lpage>. <pub-id pub-id-type="doi">10.1214/aoms/1177728599</pub-id></citation></ref>
<ref id="B36">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Poczos</surname> <given-names>B.</given-names></name> <name><surname>Ghahramani</surname> <given-names>Z.</given-names></name> <name><surname>Schneider</surname> <given-names>J.</given-names></name></person-group> (<year>2012</year>). <article-title>Copula-based kernel dependency measures</article-title>, in <source>Proceedings of International Conference on Machine Learning</source> (<publisher-loc>Edinburgh</publisher-loc>), <fpage>775</fpage>&#x02013;<lpage>782</lpage>.</citation></ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Reshef</surname> <given-names>D. N.</given-names></name> <name><surname>Reshef</surname> <given-names>Y. A.</given-names></name> <name><surname>Finucane</surname> <given-names>H. K.</given-names></name> <name><surname>Grossman</surname> <given-names>S. R.</given-names></name> <name><surname>McVean</surname> <given-names>G.</given-names></name> <name><surname>Turnbaugh</surname> <given-names>P. J.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Detecting novel associations in large data sets</article-title>. <source>Science</source> <volume>334</volume>, <fpage>1518</fpage>&#x02013;<lpage>1524</lpage>. <pub-id pub-id-type="doi">10.1126/science.1205438</pub-id><pub-id pub-id-type="pmid">22174245</pub-id></citation></ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rui</surname> <given-names>X.</given-names></name> <name><surname>Wunsch</surname> <given-names>D.</given-names></name></person-group> (<year>2005</year>). <article-title>Survey of clustering algorithms</article-title>. <source>IEEE Trans. Neural Netw.</source> <volume>16</volume>, <fpage>645</fpage>&#x02013;<lpage>678</lpage>. <pub-id pub-id-type="doi">10.1109/TNN.2005.845141</pub-id></citation></ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Santini</surname> <given-names>S.</given-names></name> <name><surname>Jain</surname> <given-names>R.</given-names></name></person-group> (<year>1999</year>). <article-title>Similarity measures</article-title>. <source>IEEE Trans. Pattern Anal. Mach. Intell.</source> <volume>21</volume>, <fpage>871</fpage>&#x02013;<lpage>883</lpage>. <pub-id pub-id-type="doi">10.1109/34.790428</pub-id></citation></ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schena</surname> <given-names>M.</given-names></name> <name><surname>Shalon</surname> <given-names>D.</given-names></name> <name><surname>Davis</surname> <given-names>R. W.</given-names></name> <name><surname>Brown</surname> <given-names>P. O.</given-names></name></person-group> (<year>1995</year>). <article-title>Quantitative monitoring of gene expression patterns with a complementary DNA microarray</article-title>. <source>Science</source> <volume>270</volume>, <fpage>467</fpage>&#x02013;<lpage>470</lpage>. <pub-id pub-id-type="doi">10.1126/science.270.5235.467</pub-id><pub-id pub-id-type="pmid">7569999</pub-id></citation></ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shannon</surname> <given-names>C. E.</given-names></name></person-group> (<year>1948</year>). <article-title>A mathematical theory of communication</article-title>. <source>Bell Syst. Tech. J.</source> <volume>27</volume>, <fpage>379</fpage>&#x02013;<lpage>423</lpage>. <pub-id pub-id-type="doi">10.1002/j.1538-7305.1948.tb01338.x</pub-id></citation></ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sklar</surname> <given-names>A.</given-names></name></person-group> (<year>1959</year>). <article-title>Fonctions de repartition a n dimensions et leurs marges</article-title>. <source>Publications de l&#x00027;Institut de Statistique de L&#x00027;Universite de Paris</source> <volume>8</volume>, <fpage>229</fpage>&#x02013;<lpage>231</lpage>.</citation></ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Somers</surname> <given-names>R. H.</given-names></name></person-group> (<year>1962</year>). <article-title>A new asymmetric measure of association for ordinal variables</article-title>. <source>Am. Sociol. Rev.</source> <volume>27</volume>, <fpage>799</fpage>&#x02013;<lpage>811</lpage>. <pub-id pub-id-type="doi">10.2307/2090408</pub-id></citation></ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spearman</surname> <given-names>C. C.</given-names></name></person-group> (<year>1904</year>). <article-title>The proof and measurement of association between two things</article-title>. <source>Am. J. Psychol.</source> <volume>15</volume>, <fpage>72</fpage>&#x02013;<lpage>101</lpage>. <pub-id pub-id-type="doi">10.2307/1412159</pub-id></citation></ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stuart</surname> <given-names>J. M.</given-names></name> <name><surname>Segal</surname> <given-names>E.</given-names></name> <name><surname>Koller</surname> <given-names>D.</given-names></name> <name><surname>Kim</surname> <given-names>S. K.</given-names></name></person-group> (<year>2003</year>). <article-title>A gene-coexpression network for global discovery of conserved genetic modules</article-title>. <source>Science</source> <volume>302</volume>, <fpage>249</fpage>&#x02013;<lpage>255</lpage>. <pub-id pub-id-type="doi">10.1126/science.1087447</pub-id><pub-id pub-id-type="pmid">12934013</pub-id></citation></ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Szekely</surname> <given-names>G. J.</given-names></name> <name><surname>Rizzo</surname> <given-names>M. L.</given-names></name></person-group> (<year>2009</year>). <article-title>Brownian distance covariance</article-title>. <source>Ann. Appl. Stat.</source> <fpage>1236</fpage>&#x02013;<lpage>1265</lpage>. <pub-id pub-id-type="doi">10.1214/09-AOAS312</pub-id></citation></ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tibshirani</surname> <given-names>R.</given-names></name></person-group> (<year>1996</year>). <article-title>Regression shrinkage and selection via the lasso</article-title>. <source>J. R. Stat. Soc. Ser. B</source> <volume>58</volume>, <fpage>267</fpage>&#x02013;<lpage>288</lpage>.</citation></ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>Y. X.</given-names></name> <name><surname>Waterman</surname> <given-names>M. S.</given-names></name> <name><surname>Huang</surname> <given-names>H.</given-names></name></person-group> (<year>2014</year>). <article-title>Gene coexpression measures in large heterogeneous samples using count statistics</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>111</volume>, <fpage>16371</fpage>&#x02013;<lpage>16376</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1417128111</pub-id><pub-id pub-id-type="pmid">25288767</pub-id></citation></ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>Z.</given-names></name> <name><surname>Gerstein</surname> <given-names>M.</given-names></name> <name><surname>Snyder</surname> <given-names>M.</given-names></name></person-group> (<year>2009</year>). <article-title>RNA-Seq: a revolutionary tool for transcriptomics</article-title>. <source>Nat. Rev. Genet.</source> <volume>10</volume>, <fpage>57</fpage>&#x02013;<lpage>63</lpage>. <pub-id pub-id-type="doi">10.1038/nrg2484</pub-id><pub-id pub-id-type="pmid">19015660</pub-id></citation></ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wilks</surname> <given-names>S. S.</given-names></name></person-group> (<year>1935</year>). <article-title>On the independence of k sets of normally distributed statistical variables</article-title>. <source>Econometrica</source> <volume>3</volume>, <fpage>309</fpage>&#x02013;<lpage>326</lpage>. <pub-id pub-id-type="doi">10.2307/1905324</pub-id></citation></ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>S.</given-names></name> <name><surname>Liu</surname> <given-names>Z. P.</given-names></name> <name><surname>Qiu</surname> <given-names>X.</given-names></name> <name><surname>Wu</surname> <given-names>H.</given-names></name></person-group> (<year>2014</year>). <article-title>Modeling genome-wide dynamic regulatory network in mouse lungs with influenza infection using high-dimensional ordinary differential equations</article-title>. <source>PLoS ONE</source> <volume>9</volume>:<fpage>e95276</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0095276</pub-id><pub-id pub-id-type="pmid">24802016</pub-id></citation></ref>
<ref id="B52">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yule</surname> <given-names>G. U.</given-names></name></person-group> (<year>1900</year>). <article-title>On the association of attributes in statistics: with illustrations from the material of the childhood society, &#x00026; c</article-title>. <source>Philos. Trans. R. Soc. Lond. Ser. A</source> <volume>194</volume>, <fpage>257</fpage>&#x02013;<lpage>319</lpage>. <pub-id pub-id-type="doi">10.1098/rsta.1900.0019</pub-id></citation></ref>
<ref id="B53">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zar</surname> <given-names>J. H.</given-names></name></person-group> (<year>1972</year>). <article-title>Significance testing of the spearman rank correlation coefficient</article-title>. <source>J. Am. Stat. Assoc.</source> <volume>67</volume>, <fpage>578</fpage>&#x02013;<lpage>580</lpage>. <pub-id pub-id-type="doi">10.1080/01621459.1972.10481251</pub-id></citation></ref>
<ref id="B54">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>B.</given-names></name> <name><surname>Horvath</surname> <given-names>S.</given-names></name></person-group> (<year>2005</year>). <article-title>A general framework for weighted gene co-expression network analysis</article-title>. <source>Stat. Appl. Genet. Mol. Biol.</source> <volume>4</volume>:<fpage>17</fpage>. <pub-id pub-id-type="doi">10.2202/1544-6115.1128</pub-id><pub-id pub-id-type="pmid">16646834</pub-id></citation></ref>
<ref id="B55">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Liu</surname> <given-names>K.</given-names></name> <name><surname>Liu</surname> <given-names>Z. P.</given-names></name> <name><surname>Duval</surname> <given-names>B.</given-names></name> <name><surname>Richer</surname> <given-names>J. M.</given-names></name> <name><surname>Zhao</surname> <given-names>X. M.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>NARROMI: a noise and redundancy reduction technique improves accuracy of gene regulatory network inference</article-title>. <source>Bioinformatics</source> <volume>29</volume>, <fpage>106</fpage>&#x02013;<lpage>113</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bts619</pub-id><pub-id pub-id-type="pmid">23080116</pub-id></citation></ref>
<ref id="B56">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Zhao</surname> <given-names>J.</given-names></name> <name><surname>Hao</surname> <given-names>J. K.</given-names></name> <name><surname>Zhao</surname> <given-names>X. M.</given-names></name> <name><surname>Chen</surname> <given-names>L.</given-names></name></person-group> (<year>2014</year>). <article-title>Conditional mutual inclusive information enables accurate quantification of associations in gene regulatory networks</article-title>. <source>Nucleic Acids Res.</source> <volume>43</volume>, <fpage>e31</fpage>. <pub-id pub-id-type="doi">10.1093/nar/gku1315</pub-id><pub-id pub-id-type="pmid">25539927</pub-id></citation></ref>
<ref id="B57">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Zhao</surname> <given-names>X. M.</given-names></name> <name><surname>He</surname> <given-names>K.</given-names></name> <name><surname>Lu</surname> <given-names>L.</given-names></name> <name><surname>Cao</surname> <given-names>Y.</given-names></name> <name><surname>Liu</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Inferring gene regulatory networks from gene expression data by path consistency algorithm based on conditional mutual information</article-title>. <source>Bioinformatics</source> <volume>28</volume>, <fpage>98</fpage>&#x02013;<lpage>104</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btr626</pub-id><pub-id pub-id-type="pmid">22088843</pub-id></citation></ref>
<ref id="B58">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhao</surname> <given-names>J.</given-names></name> <name><surname>Zhou</surname> <given-names>Y.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Chen</surname> <given-names>L.</given-names></name></person-group> (<year>2016</year>). <article-title>Part mutual information for quantifying direct associations in networks</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>113</volume>, <fpage>5130</fpage>&#x02013;<lpage>5135</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1522586113</pub-id><pub-id pub-id-type="pmid">27092000</pub-id></citation></ref>
<ref id="B59">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zou</surname> <given-names>H. H.</given-names></name> <name><surname>Trevor</surname> <given-names>H.</given-names></name></person-group> (<year>2005</year>). <article-title>Regularization and variable selection via the elastic net</article-title>. <source>J. R. Stat. Soc. B</source> <volume>67</volume>, <fpage>301</fpage>&#x02013;<lpage>320</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-9868.2005.00503.x</pub-id></citation></ref>
<ref id="B60">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zou</surname> <given-names>K. H.</given-names></name> <name><surname>Tuncali</surname> <given-names>K.</given-names></name> <name><surname>Silverman</surname> <given-names>S. G.</given-names></name></person-group> (<year>2003</year>). <article-title>Correlation and simple linear regression</article-title>. <source>Radiology</source> <volume>227</volume>, <fpage>617</fpage>&#x02013;<lpage>628</lpage>. <pub-id pub-id-type="doi">10.1148/radiol.2273011499</pub-id><pub-id pub-id-type="pmid">12773666</pub-id></citation></ref>
</ref-list>
</back>
</article>
