<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">792090</article-id>
<article-id pub-id-type="doi">10.3389/fgene.2022.792090</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Network Crosstalk as a Basis for Drug Repurposing</article-title>
<alt-title alt-title-type="left-running-head">Guala and Sonnhammer</alt-title>
<alt-title alt-title-type="right-running-head">Network Crosstalk for Drug Repurposing</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Guala</surname>
<given-names>Dimitri</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/642987/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Sonnhammer</surname>
<given-names>Erik L. L.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1634693/overview"/>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>Science for Life Laboratory</institution>, <institution>Department of Biochemistry and Biophysics</institution>, <institution>Stockholm University</institution>, <addr-line>Solna</addr-line>, <country>Sweden</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>Merck AB</institution>, <addr-line>Solna</addr-line>, <country>Sweden</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/35409/overview">Ovidiu Radulescu</ext-link>, Universit&#xe9; de Montpellier, France</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/106132/overview">Andrei Zinovyev</ext-link>, Institut Curie, France</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/91346/overview">Ana&#xef;s Baudot</ext-link>, INSERM U1251 Centre de G&#xe9;n&#xe9;tique M&#xe9;dicale de Marseille (MMG), France</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Erik L. L. Sonnhammer, <email>Erik.Sonnhammer@dbb.su.se</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Computational Genomics, a section of the journal Frontiers in Genetics</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>08</day>
<month>03</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>13</volume>
<elocation-id>792090</elocation-id>
<history>
<date date-type="received">
<day>09</day>
<month>10</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>27</day>
<month>01</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2022 Guala and Sonnhammer.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Guala and Sonnhammer</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>The need for systematic drug repurposing has seen a steady increase over the past decade and may be particularly valuable to quickly remedy unexpected pandemics. The abundance of functional interaction data has allowed mapping of substantial parts of the human interactome modeled using functional association networks, favoring network-based drug repurposing. Network crosstalk-based approaches have never been tested for drug repurposing despite their success in the related and more mature field of pathway enrichment analysis. We have, therefore, evaluated the top performing crosstalk-based approaches for drug repurposing. Additionally, the volume of new interaction data as well as more sophisticated network integration approaches compelled us to construct a new benchmark for performance assessment of network-based drug repurposing tools, which we used to compare network crosstalk-based methods with a state-of-the-art technique. We find that network crosstalk-based drug repurposing is able to rival the state-of-the-art method and in some cases outperform it.</p>
</abstract>
<kwd-group>
<kwd>drug repurposing</kwd>
<kwd>drug repositioning</kwd>
<kwd>network-based</kwd>
<kwd>benchmark</kwd>
<kwd>functional association network</kwd>
<kwd>network crosstalk</kwd>
<kwd>shortest path</kwd>
</kwd-group>
<contract-sponsor id="cn001">Vetenskapsr&#xe5;det<named-content content-type="fundref-id">10.13039/501100004359</named-content>
</contract-sponsor>
</article-meta>
</front>
<body>
<sec id="s1">
<title>1 Introduction</title>
<p>Drug repurposing or repositioning has seen a steady increase over the past decade, culminating in a doubling of the number of published articles in PubMed during 2020 (959) compared with 2019 (458). Increased time and cost associated with traditional forms of drug discovery as well as the increase in data for drug repurposing methods are driving the observed rise (<xref ref-type="bibr" rid="B22">Jourdan et al., 2020</xref>). The exponential increase seen during the last year can primarily be attributed to the start of the COVID-19 pandemic that launched the scientific community into a desperate search for a quick and affordable solution to the crisis (<xref ref-type="bibr" rid="B40">Singh et al., 2020</xref>).</p>
<p>The initial repurposing approaches, during 2020, turned to clinical experience and anecdata-driven predictions, resulting in futile clinical trials investigating the same treatments, such as hydroxychloroquine (<xref ref-type="bibr" rid="B30">Gysi et al., 2021</xref>). However, as the panic subsided, a more systematic approach to drug repurposing involving computational drug repurposing methods started to gain more traction. Taking advantage of the increased availability of functional interaction data such as protein&#x2013;protein interactions (PPIs) and co-regulation in pathways of biological processes, network-based approaches are shown to excel at the drug repurposing task (<xref ref-type="bibr" rid="B18">Guney et al., 2016</xref>).</p>
<p>At its core, network-based drug repurposing relies on some form of topological similarity between the set of disease-related proteins and the proteins targeted by a drug in the context of a PPI network. The network in question is the amalgamation of knowledge about the complex biology of the cellular mechanisms involved in the pathology of the studied disease and the mechanism of action of the drugs intended for repurposing. It is, therefore, imperative that the network used for the network-based approaches is comprehensive and biologically sound (<xref ref-type="bibr" rid="B17">Guala et al., 2019</xref>).</p>
<p>A previous study of drug repurposing methods (<xref ref-type="bibr" rid="B18">Guney et al., 2016</xref>) used a network that consisted of a collection of PPIs, retrieved from a few databases, to benchmark different ways to measure topological similarity and compared the best approach to other state-of-the-art computational drug repurposing methods. The network-based approach relying on the shortest path between the drug-targeted proteins and the closest disease-related protein outperformed other approaches, such as those based on the network distance between targets, network diffusion, chemical similarity, Gene Ontology (<xref ref-type="bibr" rid="B2">Ashburner et al., 2000</xref>) term overlap for drug&#x2013;drug similarity, co-regulation of genes in LINCS (<xref ref-type="bibr" rid="B41">Stathias et al., 2020</xref>), and shared side effects.</p>
<p>An adjacent area of bioinformatics that relies on robust and comprehensive functional association networks is functional enrichment analysis, also known as pathway analysis (<xref ref-type="bibr" rid="B26">Maleki et al., 2020</xref>). Pathway analysis is often used to uncover the underlying biological mechanisms and processes mediating the effects of drugs, and pathology in diseases. The newest methods, in pathway analysis, study network crosstalk, i.e., the interaction between sets of genes in the network, for example, the crosstalk between a set of disease-related genes and pathways with known biological functions. Given the substantial performance increase of crosstalk-based pathway analysis compared with other approaches (<xref ref-type="bibr" rid="B31">Ogris et al., 2017</xref>; <xref ref-type="bibr" rid="B11">Castresana-Aguirre and Sonnhammer, 2020</xref>), we propose to use crosstalk-based tools also for drug repurposing. One way to achieve this is to evaluate the similarity between drug-target proteins and disease-related proteins by measuring the crosstalk between the two sets compared with an expected crosstalk in the underlying network between similar sets of nodes in terms of size and connectivity. Another approach would be to use network diffusion (<xref ref-type="bibr" rid="B25">K&#xf6;hler et al., 2008</xref>), also commonly used for network-based pathway analysis (<xref ref-type="bibr" rid="B16">Glaab et al., 2012</xref>), to assess drug&#x2013;disease similarity. However, a network diffusion approach in which longer paths are down-weighted exponentially was already evaluated for drug repurposing and was outperformed by the shortest path approach previously (<xref ref-type="bibr" rid="B18">Guney et al., 2016</xref>). Additionally, the same shortest path&#x2013;based approach performed better than a random walk&#x2013;based diffusion method in an evaluation of drug repurposing for COVID-19 (<xref ref-type="bibr" rid="B30">Gysi et al., 2021</xref>).</p>
<p>The need for reliable computational drug repurposing methods was clear during the outbreak of the COVID-19 pandemic. Since the previous evaluation of drug repurposing methods (<xref ref-type="bibr" rid="B18">Guney et al., 2016</xref>), new methods have been developed, more data on drugs and their actions have become available, and the networks of functional associations have become more comprehensive (<xref ref-type="bibr" rid="B17">Guala et al., 2019</xref>). Given that new data only partially is used in the form of an updated network in the evaluation of drug repurposing for COVID-19 (<xref ref-type="bibr" rid="B30">Gysi et al., 2021</xref>), a complete update of drug repurposing evaluation is highly relevant. A further issue with the <xref ref-type="bibr" rid="B18">Guney et al. (2016)</xref> benchmark is that it assumes all unknown drug&#x2013;disease associations to be negatives, which is unnecessarily strict as some of them are yet to be discovered positives that can be identified by overlap. Taken together, this warrants an update of the benchmark in terms of both data and methodology. In this study, we, therefore, compare the performance of network crosstalk-based drug repurposing with the shortest path&#x2013;based method considered to be the state of the art on a much larger drug&#x2013;disease interaction data set in the context of a comprehensive functional association network.</p>
</sec>
<sec id="s2">
<title>2 Materials and Methods</title>
<sec id="s2-1">
<title>2.1 The Guney2016 Benchmark</title>
<p>To evaluate the performance of the studied drug repurposing tools, the data set initially published by Guney et al. was used (<xref ref-type="bibr" rid="B18">Guney et al., 2016</xref>). In that data set, disease-associated genes were retrieved from the Online Mendelian Inheritance of Man, OMIM (<xref ref-type="bibr" rid="B19">Hamosh et al., 2005</xref>) database and the GWAS catalog at PheGenI (<xref ref-type="bibr" rid="B34">Ramos et al., 2013</xref>), keeping 79 diseases with at least 20 genes in the underlying human PPI network from a previous study (<xref ref-type="bibr" rid="B29">Menche et al., 2015</xref>). The network comprised a collection of PPIs from a few experimental databases consisting of 13,329 proteins and 141,150 interactions. Drugs and drug targets were downloaded from DrugBank (<xref ref-type="bibr" rid="B44">Wishart et al., 2018</xref>) and mapped to diseases using medical subject headings. Drugs without targets in the underlying network as well as drugs with the same therapeutic indication and targets as already included drugs were removed, yielding 238 drugs. This produced a benchmark of 402 positive and 18,162 negative drug&#x2013;disease pairs.</p>
</sec>
<sec id="s2-2">
<title>2.2 The FunCoup Benchmark</title>
<sec id="s2-2-1">
<title>2.2.1 The Functional Association Network</title>
<p>One of the most comprehensive functional association networks, FunCoup (<xref ref-type="bibr" rid="B1">Alexeyenko et al., 2011</xref>; <xref ref-type="bibr" rid="B32">Persson et al., 2021</xref>), was used as the substrate for the network-based drug repurposing algorithms in this benchmark. FunCoup is a framework for integrating large-scale experimental data on gene&#x2013;gene and protein&#x2013;protein interactions. It uses na&#xef;ve Bayesian integration to combine both direct and indirect interactions stemming from 11 different types of data, pertaining to co-expression, co-localization, co-evolution, co-regulation, domain&#x2013;domain interaction, genetic interaction, and PPI (<xref ref-type="bibr" rid="B17">Guala et al., 2019</xref>). Each type of evidence is uniquely scored, shared via orthology across the included species, and trained using gold standards, such as metabolic and signaling pathways from Kyoto Encyclopedia of Genes and Genomes (KEGG) (<xref ref-type="bibr" rid="B23">Kanehisa, 2000</xref>), confirmed pairwise and complex PPIs from iRefIndex (<xref ref-type="bibr" rid="B35">Razick et al., 2008</xref>), CORUM (<xref ref-type="bibr" rid="B15">Giurgiu et al., 2019</xref>), and ComplexPortal (<xref ref-type="bibr" rid="B28">Meldal et al., 2019</xref>) as well as operon information from OperonDB (<xref ref-type="bibr" rid="B33">Pertea et al., 2009</xref>). The latest update of FunCoup, release 5, which was used in the study, encompasses genome-wide networks for 21 different species from all domains of life and the SARS-CoV2 virus&#x2013;human interactome (<xref ref-type="bibr" rid="B32">Persson et al., 2021</xref>); however, only its <italic>Homo sapiens</italic> network was used in this study. The default interaction confidence threshold of 0.8, applied to enrich the resulting network for high quality interactions, generated a <italic>H. sapiens</italic> network containing 12,848 proteins and 612,284 interactions.</p>
</sec>
<sec id="s2-2-2">
<title>2.2.2 Disease and Drug Target Data</title>
<p>The Comparative Toxicogenomic Database (CTD) (<xref ref-type="bibr" rid="B12">Davis et al., 2021</xref>) was used as the source of manually curated information about drugs and diseases including drug&#x2013;target interactions as well as disease-related genes, or disease genes in short. Drug target information in CTD is manually curated from published literature as well as public databases, such as DrugBank. The information on diseases and related genes is also manually curated from published literature or extracted from OMIM. For the purpose of this study, the initial set of CTD drugs was filtered for drugs with at least one drug target in the FunCoup network, resulting in 1370 drugs. A similar filtering was performed for the CTD diseases. However only diseases with at least 20 disease-related genes in the FunCoup network were kept as in similar evaluations (<xref ref-type="bibr" rid="B18">Guney et al., 2016</xref>), yielding 212 well-characterized diseases. The combination of diseases and drugs comprised 8518 drug&#x2013;disease associations, which were designated as positive in the continued analysis. The remaining 281,922 drug&#x2013;disease combinations were designated as negative.</p>
<p>For a negative drug&#x2013;disease combination with an overlap between drug targets and disease genes, the studied drug&#x2013;disease pair may be positive but yet unverified. To minimize the risk of misclassifying a positive drug&#x2013;disease combination as a false positive, all combinations with gene overlap were removed from the negatives set. The final benchmark contained 8518 positive and 159,144 negative drug&#x2013;disease pairs.</p>
</sec>
</sec>
<sec id="s2-3">
<title>2.3 The Time-Stamped Benchmark</title>
<p>To validate newly predicted drug&#x2013;disease combinations, we used new indications that have come about after the Guney2016 benchmark and used these together with the original network to construct a time-stamped benchmark. There were 40 diseases that could be matched by name in both the Guney2016 benchmark and the FCbench. Drugs that have been repurposed since the Guney2016 benchmark, i.e., drugs that received a new indication for at least one of the 40 diseases, caused a change in the label of the corresponding drug&#x2013;disease combinations, switching from negative status in the Guney2016 benchmark to positive status in the time-stamped benchmark. This produced 362 new drug&#x2013;disease combinations. After removing 473 negatives due to overlap between drug targets and disease genes, the time-stamped benchmark comprised 764 positive and 17,327 negative drug&#x2013;disease pairs.</p>
</sec>
<sec id="s2-4">
<title>2.4 Network-Based Drug Repurposing Methods</title>
<p>Network-based drug repurposing methods in this study used either shortest path or network crosstalk to assess the similarity between the set of drug targets and the set of disease genes (<xref ref-type="fig" rid="F1">Figure 1</xref>). There are different ways of measuring the shortest path between two sets of nodes in a network. In a previous study (<xref ref-type="bibr" rid="B18">Guney et al., 2016</xref>), the shortest path between drug targets and the closest disease gene is shown to outperform the other distance measures. We, therefore, used that method in our study and refer to it as &#x201c;proximity.&#x201d; Because crosstalk-based distance measures have never before been used for drug repurposing, we used the three most prominent crosstalk-based approaches here.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Network-based drug repurposing, proximity vs. crosstalk, for a given disease and drug. In crosstalk, the number of network links between all drug targets and disease genes is used while, in proximity, the length of the single shortest path between any drug target and any closest disease gene is used.</p>
</caption>
<graphic xlink:href="fgene-13-792090-g001.tif"/>
</fig>
<sec id="s2-4-1">
<title>2.4.1 Proximity</title>
<p>To assess the distance, <inline-formula id="inf1">
<mml:math id="m1">
<mml:mi>d</mml:mi>
</mml:math>
</inline-formula>, between a drug, <inline-formula id="inf2">
<mml:math id="m2">
<mml:mi>T</mml:mi>
</mml:math>
</inline-formula>, and a disease, <inline-formula id="inf3">
<mml:math id="m3">
<mml:mi>S</mml:mi>
</mml:math>
</inline-formula>, the shortest path between all drug targets and all disease-related proteins was calculated. The distance, <inline-formula id="inf4">
<mml:math id="m4">
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>T</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mi>S</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, was the average shortest path between every drug target and its closest disease gene, where <inline-formula id="inf5">
<mml:math id="m5">
<mml:mrow>
<mml:mo>&#x7c;</mml:mo>
<mml:mo>&#x7c;</mml:mo>
<mml:mi>T</mml:mi>
<mml:mo>&#x7c;</mml:mo>
<mml:mo>&#x7c;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> is the number of drug targets.<disp-formula id="equ1">
<mml:math id="m6">
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:mi mathvariant="normal">&#x2016;</mml:mi>
<mml:mi>T</mml:mi>
<mml:mi mathvariant="normal">&#x2016;</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mstyle displaystyle="true">
<mml:munder>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:munder>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mi>S</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi>d</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:math>
</disp-formula>
</p>
<p>To normalize the distance with respect to the topological properties of the network, a degree-aware node permutation approach was adopted. The nodes in the network were divided into bins according to node degree with 100 nodes in each bin. A random sample of drug targets and disease genes was performed, matching the sampled nodes in both the number and degree distribution by sampling nodes from the same bin as the observed targets/genes. The <inline-formula id="inf6">
<mml:math id="m7">
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>T</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>S</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> was calculated for the sampled sets and the procedure was repeated 1000 times. The mean, <inline-formula id="inf7">
<mml:math id="m8">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3bc;</mml:mi>
<mml:mi>R</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, and standard deviation, <inline-formula id="inf8">
<mml:math id="m9">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>R</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, of the distribution of randomly sampled distances was used to normalize the observed distance and to assess its significance. (<xref ref-type="bibr" rid="B18">Guney et al., 2016</xref>).<disp-formula id="equ2">
<mml:math id="m10">
<mml:mrow>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mrow>
<mml:mi>p</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>&#x3bc;</mml:mi>
<mml:mi>R</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>R</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
</disp-formula>
</p>
<p>Proximity was sped up and re-implemented in R for the purpose of this study from the original Python version (<ext-link ext-link-type="uri" xlink:href="https://github.com/emreg00/proximity">https://github.com/emreg00/proximity</ext-link>) published by <xref ref-type="bibr" rid="B18">Guney et al. (2016)</xref>.</p>
</sec>
<sec id="s2-4-2">
<title>2.4.2 NEAT</title>
<p>For this drug repurposing method, similarity between two sets of nodes in a network is estimated by measuring the crosstalk, i.e., the number of links, between the two sets. A larger crosstalk suggests a greater similarity between the two sets. To normalize the observed crosstalk, <inline-formula id="inf9">
<mml:math id="m11">
<mml:mi>x</mml:mi>
</mml:math>
</inline-formula>, the mean, <inline-formula id="inf10">
<mml:math id="m12">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3bc;</mml:mi>
<mml:mi>H</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, and the standard deviation, <inline-formula id="inf11">
<mml:math id="m13">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>H</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, of the expected crosstalk between sets of nodes with the same size as the drug targets and disease genes can be calculated assuming a hypergeometric distribution of links in the underlying functional association network (<xref ref-type="bibr" rid="B39">Signorelli et al., 2016</xref>). Below the number of drug targets, <inline-formula id="inf12">
<mml:math id="m14">
<mml:mi>K</mml:mi>
</mml:math>
</inline-formula>, the number of disease genes, <inline-formula id="inf13">
<mml:math id="m15">
<mml:mi>n</mml:mi>
</mml:math>
</inline-formula>, and the total number of genes in the network, <inline-formula id="inf14">
<mml:math id="m16">
<mml:mi>N</mml:mi>
</mml:math>
</inline-formula>, can be used to determine, the mean, <inline-formula id="inf15">
<mml:math id="m17">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3bc;</mml:mi>
<mml:mi>H</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, and the standard deviation, <inline-formula id="inf16">
<mml:math id="m18">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>H</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf17">
<mml:math id="m19">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3bc;</mml:mi>
<mml:mi>H</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>n</mml:mi>
<mml:mfrac>
<mml:mi>K</mml:mi>
<mml:mi>N</mml:mi>
</mml:mfrac>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf18">
<mml:math id="m20">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>H</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msqrt>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3bc;</mml:mi>
<mml:mi>H</mml:mi>
</mml:msub>
<mml:mfrac>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>N</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>K</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:mfrac>
<mml:mfrac>
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>n</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:msqrt>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
<p>Normalization of crosstalk is carried out as follows:<disp-formula id="equ3">
<mml:math id="m21">
<mml:mrow>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:mi>E</mml:mi>
<mml:mi>A</mml:mi>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>x</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>&#x3bc;</mml:mi>
<mml:mi>H</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>H</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
</disp-formula>
</p>
</sec>
<sec id="s2-4-3">
<title>2.4.3 BinoX</title>
<p>The BinoX approach to drug repurposing also uses crosstalk as a measure of similarity, but assumes a binomial distribution, i.e., <inline-formula id="inf19">
<mml:math id="m22">
<mml:mrow>
<mml:mi>B</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>n</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, of crosstalk between two sets of nodes in the underlying network (<xref ref-type="bibr" rid="B31">Ogris et al., 2017</xref>), where <inline-formula id="inf20">
<mml:math id="m23">
<mml:mrow>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the maximum possible crosstalk. To estimate the parameter <inline-formula id="inf21">
<mml:math id="m24">
<mml:mrow>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> of the underlying distribution, to serve as a comparator for the observed crosstalk, <inline-formula id="inf22">
<mml:math id="m25">
<mml:mi>x</mml:mi>
</mml:math>
</inline-formula>, the network is randomized <inline-formula id="inf23">
<mml:math id="m26">
<mml:mi>M</mml:mi>
</mml:math>
</inline-formula> times using topology-preserving link shuffling (<xref ref-type="bibr" rid="B27">McCormack et al., 2013</xref>).<disp-formula id="equ4">
<mml:math id="m27">
<mml:mrow>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>M</mml:mi>
</mml:mfrac>
<mml:msubsup>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mi>i</mml:mi>
<mml:mi>M</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
</disp-formula>
</p>
<p>The mean, <inline-formula id="inf24">
<mml:math id="m28">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3bc;</mml:mi>
<mml:mi>b</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, and the standard deviation, <inline-formula id="inf25">
<mml:math id="m29">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>b</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> can be calculated for the expected crosstalk, using <inline-formula id="inf26">
<mml:math id="m30">
<mml:mrow>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf27">
<mml:math id="m31">
<mml:mrow>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> and the observed crosstalk can be normalized producing a <italic>z</italic>-score as for NEAT.</p>
<p>For calculating <inline-formula id="inf28">
<mml:math id="m32">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3bc;</mml:mi>
<mml:mi>b</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf29">
<mml:math id="m33">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>b</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, the stand-alone version of the pathway enrichment tool BinoX (<ext-link ext-link-type="uri" xlink:href="https://bitbucket.org/sonnhammergroup/binox/wiki/Home">https://bitbucket.org/sonnhammergroup/binox/</ext-link>) was used (<xref ref-type="bibr" rid="B31">Ogris et al., 2017</xref>).</p>
</sec>
<sec id="s2-4-4">
<title>2.4.4 ANUBIX</title>
<p>The ANUBIX approach assumes that network crosstalk follows a beta-binomial distribution, where the probability of crosstalk is binomially distributed with the parameter <inline-formula id="inf30">
<mml:math id="m34">
<mml:mrow>
<mml:mo>&#xa0;</mml:mo>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>x</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> following a Beta distribution, i.e., <inline-formula id="inf31">
<mml:math id="m35">
<mml:mrow>
<mml:mi>B</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>t</mml:mi>
<mml:mi>a</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b2;</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>. To compare the observed crosstalk to the expected crosstalk, a degree-aware randomization procedure is applied. In this procedure, for each drug&#x2013;disease pair, a set of nodes with the same size and degree distribution as the set of drug targets is sampled from the network, and its crosstalk with the set of disease genes is calculated. The procedure is repeated 100 times, and maximum likelihood estimation is used to estimate the parameters of the Beta distribution. The mean and standard deviation of the sampled crosstalk are calculated as for <italic>BinoX</italic> and a similar normalization procedure is applied to obtain the normalized crosstalk score, <inline-formula id="inf32">
<mml:math id="m36">
<mml:mrow>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mrow>
<mml:mi>a</mml:mi>
<mml:mi>n</mml:mi>
<mml:mi>u</mml:mi>
<mml:mi>b</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. The estimation of parameters for the Beta distribution as well as the calculation of mean, standard deviation, and <italic>p</italic>-values was conducted using a re-implementation of the pathway enrichment tool ANUBIX (<xref ref-type="bibr" rid="B11">Castresana-Aguirre and Sonnhammer, 2020</xref>).</p>
</sec>
</sec>
<sec id="s2-5">
<title>2.5 Drug&#x2013;Drug Similarity&#x2013;Based Repurposing</title>
<p>For diseases that already have approved treatment options, a common repurposing technique is to study the similarity of new drugs to the already approved ones. We, therefore, applied the above-mentioned repurposing methodologies to assess drug&#x2013;drug similarity of new drugs to the drugs already approved for a disease.</p>
<p>For proximity, the drug&#x2013;drug similarity, <inline-formula id="inf33">
<mml:math id="m37">
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>T</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mi>S</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, between drug <inline-formula id="inf34">
<mml:math id="m38">
<mml:mi>T</mml:mi>
</mml:math>
</inline-formula> and disease <inline-formula id="inf35">
<mml:math id="m39">
<mml:mi>S</mml:mi>
</mml:math>
</inline-formula> is<disp-formula id="equ5">
<mml:math id="m40">
<mml:mrow>
<mml:msub>
<mml:mi>s</mml:mi>
<mml:mrow>
<mml:mi>p</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:msup>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>n</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>S</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</disp-formula>
</p>
<p>For the crosstalk-based methods, the drug&#x2013;drug similarity is<disp-formula id="equ6">
<mml:math id="m41">
<mml:mrow>
<mml:msub>
<mml:mi>s</mml:mi>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>s</mml:mi>
<mml:mi>s</mml:mi>
<mml:mi>t</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>l</mml:mi>
<mml:mi>k</mml:mi>
<mml:mo>&#xa0;</mml:mo>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>m</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>S</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
</p>
<p>The <inline-formula id="inf36">
<mml:math id="m42">
<mml:mi>z</mml:mi>
</mml:math>
</inline-formula> vector in both equations above refers to the method-specific normalized <italic>z</italic>-scores calculated between drugs indicated for a disease and each of the other candidate drugs, where <inline-formula id="inf37">
<mml:math id="m43">
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>s</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> denotes drugs approved for disease <inline-formula id="inf38">
<mml:math id="m44">
<mml:mi>S</mml:mi>
</mml:math>
</inline-formula> with <inline-formula id="inf39">
<mml:math id="m45">
<mml:mrow>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>s</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> being all the drug targets for the drug <inline-formula id="inf40">
<mml:math id="m46">
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>s</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>.<disp-formula id="equ7">
<mml:math id="m47">
<mml:mrow>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>S</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mo>&#x2200;</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>s</mml:mi>
</mml:msub>
<mml:mtext>&#x2009;</mml:mtext>
<mml:mi>a</mml:mi>
<mml:mi>p</mml:mi>
<mml:mi>p</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>v</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>d</mml:mi>
<mml:mtext>&#x2009;</mml:mtext>
<mml:mi>S</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi>z</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mi>T</mml:mi>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>s</mml:mi>
</mml:msub>
<mml:mo>&#x2208;</mml:mo>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>s</mml:mi>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
</p>
</sec>
<sec id="s2-6">
<title>2.6 Performance Evaluation</title>
<p>Prior to running the drug repurposing methods on the drug&#x2013;disease combinations in the three data sets, we examined the set of negative drug&#x2013;disease combinations for potential positives misclassified as negatives. The potential positives were deemed to be those drug&#x2013;disease combinations for which there was gene overlap between drug targets and disease genes. Potential positives were filtered from the analysis.</p>
<p>Running each drug repurposing approach on every drug&#x2013;disease combination produced <italic>z</italic>-scores and corrected <italic>p</italic>-values, which were used in the evaluation. To interpret the <italic>z</italic>-score output from a drug repurposing method as a classification of drug&#x2013;disease combinations into positives or negatives, a <italic>z</italic>-score threshold was required. In the Guney2016 benchmark, the threshold was set as the <italic>z</italic>-score at the intersection of the sensitivity and specificity curves when visualizing the change in sensitivity and specificity as functions of the <italic>z</italic>-score in the same graph. Sensitivity was defined as the proportion of correctly assigned instances of the positive class, also known as recall or true positives among all positives. Specificity was defined as the proportion of correctly assigned instances of the negative class, also known as true negatives among all negatives.</p>
<p>Performance metrics used in the Guney2016 benchmark and in this study are considered to be among the most commonly used performance metrics for evaluation of drug repurposing approaches (<xref ref-type="bibr" rid="B38">Schuler et al., 2020</xref>) and include receiver operating characteristic (ROC) curve and area under the ROC curve, <inline-formula id="inf41">
<mml:math id="m48">
<mml:mrow>
<mml:mi>A</mml:mi>
<mml:mi>U</mml:mi>
<mml:mi>R</mml:mi>
<mml:mi>O</mml:mi>
<mml:mi>C</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>. The ROC curve is a convenient visualization of the change in sensitivity also known as recall or true positive rate, <inline-formula id="inf42">
<mml:math id="m49">
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mi>P</mml:mi>
<mml:mi>R</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, and the false positive rate, <inline-formula id="inf43">
<mml:math id="m50">
<mml:mrow>
<mml:mi>F</mml:mi>
<mml:mi>P</mml:mi>
<mml:mi>R</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, at various thresholds, <inline-formula id="inf44">
<mml:math id="m51">
<mml:mi>T</mml:mi>
</mml:math>
</inline-formula>. The AUROC provides a summary metric for the performance of a classifier visualized in the ROC curve and can be interpreted as the probability of ranking a randomly chosen positive sample higher than a randomly chosen negative one (<xref ref-type="bibr" rid="B6">Bradley, 1997</xref>).<disp-formula id="equ8">
<mml:math id="m52">
<mml:mrow>
<mml:mi>A</mml:mi>
<mml:mi>U</mml:mi>
<mml:mi>R</mml:mi>
<mml:mi>O</mml:mi>
<mml:mi>C</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:munderover>
<mml:mstyle displaystyle="true">
<mml:mo>&#x222b;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>&#x221e;</mml:mi>
</mml:mrow>
<mml:mi>&#x221e;</mml:mi>
</mml:munderover>
<mml:mi>T</mml:mi>
<mml:mi>P</mml:mi>
<mml:mi>R</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>T</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi>F</mml:mi>
<mml:mi>P</mml:mi>
<mml:mi>R</mml:mi>
<mml:mo>&#x2032;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>T</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:math>
</disp-formula>
</p>
<p>AUROC was calculated both for the full data set of all drug&#x2013;disease combinations as well as on 100 randomly sampled sets containing all positives and equally many negatives. The sampling procedure was applied to address the inherent class imbalance of the data set. The Wilcoxon signed-rank test was used to compare the performance of different drug repurposing methods.</p>
<p>Additionally, we also assessed the precision recall (PR) curve as well as the area under it (AUPR) (<xref ref-type="bibr" rid="B24">Keilwagen et al., 2014</xref>) because both are also commonly used to assess performance of classifiers. The PR curve visualizes the change in precision and recall at different thresholds. Because the number of true positive drug&#x2013;disease combinations is much lower than the number of false positive ones, the AUPR punishes the top-ranked FPs more than AUROC (<xref ref-type="bibr" rid="B13">Davis and Goadrich, 2006</xref>). However, the whole premise of drug repurposing is that our knowledge of valid drug&#x2013;disease combinations is incomplete, meaning that many of the drug&#x2013;disease combinations that we currently consider as false positive are merely untested true positives. This notion is supported by both the time-stamped benchmark and other work (<xref ref-type="bibr" rid="B36">Rivero-Garc&#xed;a et al., 2021</xref>) and is causing the AUPR to be an unreasonably conservative and, thus, less suitable performance measure as compared with AUROC.</p>
<p>The abovementioned metrics rely on the knowledge of both true positives and true negatives. However, most of the drug&#x2013;disease combinations designated as negatives are, in fact, unknown. Although we did try to address this by filtering combinations with overlap in drug targets and disease genes, there may still be many potential positives left. We, therefore, also assessed performance of drug repurposing using the less rigorous metric of the recall at the <italic>z</italic>-score cutoff (<xref ref-type="bibr" rid="B7">Brown and Patel, 2018</xref>).</p>
</sec>
<sec id="s2-7">
<title>2.7 Multiple Hypothesis Testing</title>
<p>All <italic>p</italic>-values produced in the study were corrected for multiple hypothesis testing using the Benjamini&#x2013;Hochberg procedure (<xref ref-type="bibr" rid="B4">Benjamini and Hochberg, 1995</xref>).</p>
</sec>
</sec>
<sec id="s3">
<title>3 Results</title>
<sec id="s3-1">
<title>3.1 Benchmark Characteristics</title>
<p>The benchmark presented here comprises a total of 290,437 drug&#x2013;disease combinations, where 8,397 came from approved indications, 180 from clinical trials, 242 were contraindicated, and 121 were registered as off-label. The remaining 281,500 came from linking drugs to diseases with which they are not associated, denoted as &#x201c;unknown.&#x201d; Drug&#x2013;disease pairs with approved indication or used off-label were considered as positives. Contraindicated combinations and drug&#x2013;disease pairs in clinical trials were removed. The remaining drug&#x2013;disease combinations, i.e., the unknown, were denoted as negatives. Sensitivity analysis when adding back contraindicated drug&#x2013;disease pairs as well as pairs studied in clinical trials to the positive class was conducted and yielded the same results (<xref ref-type="sec" rid="s10">Supplementary Figure S1</xref>).</p>
<p>For a negative drug&#x2013;disease combination with an overlap between drug targets and disease genes the studied drug&#x2013;disease pair may be positive but yet unverified. This is supported by the fact that the proportion of drug&#x2013;disease combinations with overlapping genes was markedly enriched for approved or off-label annotations compared with those annotated as &#x201c;unknown&#x201d; (<xref ref-type="fig" rid="F2">Figure 2</xref>). Removing drug&#x2013;disease combinations with gene overlap from the negative set resulted in the FCbench containing 8,518 positives and 159,144 negatives. This represents a 21-fold increase of positives and a nine-fold increase of negatives compared with the Guney2016 benchmark.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>The proportion of gene overlap (marked &#x201c;TRUE&#x201d;) between drug targets and disease genes in the different sets of positive and negative drug&#x2013;disease combinations. The proportion of not overlapping genes is marked &#x201c;FALSE&#x201d;.</p>
</caption>
<graphic xlink:href="fgene-13-792090-g002.tif"/>
</fig>
<p>A subset of 40 diseases that appeared in both the Guney and the current benchmarks had received at least one new drug as a treatment. We, therefore, updated the classification of these new drug&#x2013;disease combinations in the Guney2016 benchmark from negative to positive to produce a time-stamped benchmark. The time-stamped benchmark comprises 764 positives and 17,327 negatives.</p>
</sec>
<sec id="s3-2">
<title>3.2 Evaluation of Methods to Score Drug&#x2013;Disease Association&#x2013;Based Drug Repurposing</title>
<p>The relationship between drug targets and disease genes in the context of a functional association network was assessed using four different drug repurposing approaches: one based on the shortest paths, proximity, and three based on crosstalk, NEAT, BinoX, and ANUBIX. Since the studied relationship was between drug targets and disease genes, we refer to the outcome as a drug&#x2013;disease association. Each of the evaluated drug repurposing approaches produced an association score between a drug and a disease with higher score, i.e., shorter distance for proximity and higher crosstalk for the other methods, suggesting a higher probability of the studied drug being used for the specific disease. The scoring was evaluated on three data sets with drug&#x2013;disease combinations, using ROC curves, AUROC, and recall (see <italic>Materials and Methods</italic>). PR curves, AUPR, and some other metrics, e.g., Matthews correlation coefficient (mcc) and F1-score, were also evaluated for completeness, but due to their unsuitability as performance metrics for this task were only included in the <xref ref-type="sec" rid="s10">Supplementary Material</xref>.</p>
<sec id="s3-2-1">
<title>3.2.1 Performance on the Guney2016 Benchmark</title>
<p>Visual inspection of the ROC curves for the full Guney2016 data set suggests superior performance of the BinoX approach compared with all the other approaches (<xref ref-type="fig" rid="F3">Figure 3A</xref>). The AUROC achieved by BinoX (median &#x3d; 0.714) stands out from the rest that tightly follow each other: proximity (median &#x3d; 0.677), ANUBIX (median &#x3d; 0.667), and NEAT (median &#x3d; 0.657) (<xref ref-type="table" rid="T1">Table 1</xref>). The AUROC under the equal label sampling from the benchmark confirms the order and indicates that the observed differences in performance were statistically significant (<italic>p</italic> &#x3c; .01).</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Drug&#x2013;disease association&#x2013;based performance on the Guney2016 benchmark. Performance of the different drug repurposing tools: ANUBIX (green), BinoX (orange), NEAT (purple), and proximity (prox, pink) on <bold>(A)</bold> ROC curves, where the dotted line represents random prediction, and <bold>(B)</bold> AUROC using sampled sets from the benchmark containing equal numbers of positive and negative drug&#x2013;disease combinations. The pairwise Wilcoxon rank sum test was used to assess the significance of difference on AUROC. FDR-corrected <italic>p</italic>-values were obtained using the Benjamini&#x2013;Hochberg procedure.</p>
</caption>
<graphic xlink:href="fgene-13-792090-g003.tif"/>
</fig>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Performance metrics for all three benchmarks. Performance on the three benchmarks, Guney 2016, time-stamped (time), and FCbench, was assessed for different drug repurposing tools: ANUBIX, BinoX, NEAT, and proximity (prox). The evaluated metrics include AUROC, recall, and recall on the pairs with changed label in the time-stamped benchmark (new positives).</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left"/>
<th colspan="3" align="center">AUROC</th>
<th colspan="4" align="center">recall</th>
</tr>
<tr>
<th align="left"/>
<th align="center">Guney2016</th>
<th align="center">Time</th>
<th align="center">FCbench</th>
<th align="center">Guney2016</th>
<th align="center">Time</th>
<th align="center">Time (new positives)</th>
<th align="center">FCbench</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">ANUBIX</td>
<td align="char" char=".">0.667</td>
<td align="char" char=".">0.645</td>
<td align="char" char=".">0.849</td>
<td align="char" char=".">0.433</td>
<td align="char" char=".">0.386</td>
<td align="char" char=".">0.334</td>
<td align="char" char=".">0.782</td>
</tr>
<tr>
<td align="left">BinoX</td>
<td align="char" char=".">0.714</td>
<td align="char" char=".">0.541</td>
<td align="char" char=".">0.728</td>
<td align="char" char=".">0.635</td>
<td align="char" char=".">0.126</td>
<td align="char" char=".">0.094</td>
<td align="char" char=".">0.672</td>
</tr>
<tr>
<td align="left">NEAT</td>
<td align="char" char=".">0.657</td>
<td align="char" char=".">0.535</td>
<td align="char" char=".">0.759</td>
<td align="char" char=".">0.602</td>
<td align="char" char=".">0.120</td>
<td align="char" char=".">0.099</td>
<td align="char" char=".">0.694</td>
</tr>
<tr>
<td align="left">prox</td>
<td align="char" char=".">0.677</td>
<td align="char" char=".">0.633</td>
<td align="char" char=".">0.868</td>
<td align="char" char=".">0.493</td>
<td align="char" char=".">0.425</td>
<td align="char" char=".">0.351</td>
<td align="char" char=".">0.793</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s3-2-2">
<title>3.2.2 Performances on the Time-Stamped Validation</title>
<p>In the time-stamped validation of the Guney2016 benchmark, 362 drug&#x2013;disease associations were changed from negatives to positives, resulting in almost a doubling of the number of positive combinations from 402 to 764. This resulted in a variable degree of loss of performance for all the methods (<xref ref-type="table" rid="T1">Table 1</xref>). The largest drop in AUROC performance was noted for BinoX by 17% (AUROC &#x3d; 0.541) followed by NEAT 12% (AUROC &#x3d; 0.535). As a result, BinoX no longer outperformed the other methods, and instead, the top methods became ANUBIX (AUROC &#x3d; 0.645) and proximity (AUROC &#x3d; 0.633), which were considerably more accurate. The difference in performance between the methods was once again statistically significant (<xref ref-type="fig" rid="F4">Figure 4</xref>).</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>Drug&#x2013;disease association&#x2013;based performance on the time-stamped benchmark. Performance of the different drug repurposing tools: ANUBIX (green), BinoX (orange), NEAT (purple), and proximity (prox, pink) on <bold>(A)</bold> ROC curves, where the dotted line represents random prediction, and <bold>(B)</bold> AUROC using sampled sets from the benchmark containing equal numbers of positive and negative drug&#x2013;disease combinations. The pairwise Wilcoxon rank sum test was used to assess the significance of difference on AUROC. FDR-corrected <italic>p</italic>-values were obtained using the Benjamini&#x2013;Hochberg procedure.</p>
</caption>
<graphic xlink:href="fgene-13-792090-g004.tif"/>
</fig>
<p>The explanation for the large drop in performance for BinoX and NEAT is the fact that both methods correctly classified only roughly 10% of the new positives as positives while ANUBIX and proximity correctly classified 33% and 35% of the new positives, respectively (<xref ref-type="table" rid="T1">Table 1</xref>).</p>
</sec>
<sec id="s3-2-3">
<title>3.2.3 Performances on the FCbench</title>
<p>The FCbench contains an order of magnitude more positives than the Guney2016 one (8,397 versus 402) and the time stamped one, 764 vs. 402. It is also based on a denser and more complete underlying network, which boosted the performance of all methods (<xref ref-type="fig" rid="F5">Figure 5</xref>). The difference in performance proved once again to be statistically significant with proximity outperforming ANUBIX with an AUROC of 0.868 vs. 0.849, followed by NEAT at 0.759, and finally BinoX at 0.728 (<xref ref-type="table" rid="T1">Table 1</xref>). A much higher recall was also observed, primarily for proximity and ANUBIX at 0.793 and 0.782, respectively, almost doubling the recall from the Guney2016 and time-stamped benchmarks. NEAT and BinoX had a recall of 0.694 and 0.672, respectively.</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>Drug&#x2013;disease association&#x2013;based performance on the FCbench. Performance of the different drug repurposing tools: ANUBIX (green), BinoX (orange), NEAT (purple), and proximity (prox, pink) on <bold>(A)</bold> ROC curves, where the dotted line represents random prediction, and <bold>(B)</bold> AUROC using sampled sets from the benchmark containing equal numbers of positive and negative drug&#x2013;disease combinations. The pairwise Wilcoxon rank sum test was used to assess the significance of difference on AUROC. FDR-corrected <italic>p</italic>-values were obtained using the Benjamini&#x2013;Hochberg procedure.</p>
</caption>
<graphic xlink:href="fgene-13-792090-g005.tif"/>
</fig>
<p>Performance of the tested drug repurposing methods differed for the used benchmarks with each benchmark having a different winner. However, since proximity was always ranked as the top or second best method, it could be considered an overall winner with ANUBIX performing equally well if the old benchmark is ignored. Both proximity and ANUBIX performed considerably better than NEAT and BinoX on the benchmarks based on new data.</p>
</sec>
</sec>
<sec id="s3-3">
<title>3.3 Application of Predicted Drug&#x2013;Disease Associations for Drug Repurposing</title>
<p>In the FCbench, the two top-performing methods, ANUBIX and proximity, produced similar AUROC and recall values with a difference of about 2% in AUROC in favor of ANUBIX and 1% in recall in favor of proximity. Although this difference is statistically significant (<italic>p</italic> &#x3c; .01) for AUROC, it is hardly meaningful in terms of overall performance.</p>
<p>Do these two approaches recover the same or similar sets of drug&#x2013;disease combinations? The intersection of correctly classified drug&#x2013;disease combinations for ANUBIX and proximity contained 6,261 elements while each method uniquely correctly classified 404 and 497 drug&#x2013;disease combinations, respectively; i.e., they overlapped by &#x223c;94%. Despite the relatively high coverage demonstrated by both methods and using the same prior knowledge in the form of the FunCoup network, there were still unique predictions provided by both methods.</p>
<p>To suggest potential drug repurposing opportunities, we investigated the drug&#x2013;disease combinations assigned as negatives, including combinations previously removed due to overlap between drug targets and disease genes. We chose to focus on predictions from the two top performing methods, ANUBIX and proximity. To increase the chances of identifying potential positives, we set a method-specific <italic>z</italic>-score cutoff, <inline-formula id="inf45">
<mml:math id="m53">
<mml:mrow>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>u</mml:mi>
<mml:mi>t</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>f</mml:mi>
<mml:mi>f</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>&#x3bc;</mml:mi>
<mml:mi>z</mml:mi>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>2</mml:mn>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>z</mml:mi>
</mml:msub>
<mml:mo>&#xa0;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>, only considering combinations with <italic>z</italic>-scores above the cutoff. This produced 5,654 combinations agreed upon by both methods (<xref ref-type="sec" rid="s10">Supplementary Table S2</xref>) as well as 5,967 and 6,302 combinations assigned uniquely by ANUBIX (<xref ref-type="sec" rid="s10">Supplementary Table S3</xref>) and proximity (<xref ref-type="sec" rid="s10">Supplementary Table S4</xref>), respectively. It should be noted that the <italic>z</italic>-score cutoff was higher for ANUBIX (<italic>z</italic> &#x3d; 8.25) vs. proximity (<italic>z</italic> &#x3d; 5.89), which can explain the slightly lower number of uniquely retrieved combinations. Among the identified combinations, 97% contained an overlap between drug targets and disease genes.</p>
<p>After sorting the retrieved drug&#x2013;disease combinations by the highest <italic>z</italic>-scores from proximity and ANUBIX, we looked at the top 10 suggestions. The top suggestion for repurposing was palbociclib (DB09073) for hepatocellular carcinoma (HCC). Palbociclib is a selective inhibitor of cyclin dependent kinases CD4/CD6, used to treat HER2-negative and HR-positive advanced or metastatic breast cancer (<xref ref-type="bibr" rid="B14">Finn et al., 2009</xref>). However, it also showed promising results as a novel therapeutic strategy in preclinical models of HCC (<xref ref-type="bibr" rid="B5">Bollard et al., 2017</xref>). Only looking at the results from proximity, this combination would have been in 10th place, but adding the prediction from ANUBIX, we were able to identify this combination as the top candidate for repurposing.</p>
<p>The second drug&#x2013;disease combination on the candidates list was azathioprine (DB00933) and HCC. Azathioprine is an immunosuppressant used to prevent renal transplant rejection as well as in treatment of autoimmune disorders, such as Crohn&#x2019;s disease and rheumatoid arthritis. Although a conclusive link between azathioprine and HCC has never been established, multiple case reports suggest a possible association (<xref ref-type="bibr" rid="B20">Heron et al., 2016</xref>). This finding exemplifies a case when the uncovered drug&#x2013;disease combination is of a potential side effect nature rather than a treatment effect, which is usually desired. At the same time, this supports previously observed results that network-based drug repurposing can identify potential side effects and negative drug interactions (<xref ref-type="bibr" rid="B3">Badkas et al., 2021</xref>).</p>
<p>Third on the list is testosterone (DB00624) and HCC. Testosterone is indicated for treatment of hypogonadism and hypogonadotropic hypogonadism. However, animal models suggest the implication of testosterone in the etiology of HCC, which was later supported by the results from a study in humans where elevated levels of testosterone were associated with increased risk of HCC (<xref ref-type="bibr" rid="B45">Yu and Chen, 1993</xref>). This type of association could be considered as disease-biomarker rather than disease treatment or drug&#x2013;disease side effect. It is not the primary goal of drug repurposing but could potentially open other avenues for the use of network-based approaches.</p>
<p>In the eighth place is the combination of one of the few gold-based drugs, Sodium aurothiomalate (DB09276) and the human influenza. The human influenza is 50%&#x2013;65% suppressed by the current vaccination program, so there may not be a real need for repurposing any drugs for it. However, there is evidence showing the potential of gold-based treatments against other viruses. One such example is auranofin that inhibits replication of the SARS-COV-2 virus (<xref ref-type="bibr" rid="B37">Rothan et al., 2020</xref>), suggesting that gold-based compounds may also be useful against influenza virus.</p>
<p>Looking at the combinations uniquely predicted for repurposing by ANUBIX, we found enalaprilat (DB09477) for left ventricular dysfunction (LVD) as one of the top suggestions with a <italic>z</italic>-score of 66.3 (<xref ref-type="sec" rid="s10">Supplementary Table S3</xref>). Enalaprilat is indicated for treatment of hypertension. However, there is clinical evidence of enalaprilat causing marked reduction of development of atrial fibrillation in patients with LVD (<xref ref-type="bibr" rid="B43">Vermes et al., 2003</xref>). In sixth place, the combination of tetrabenazine (DB04844) and Parkinsonian disorders received a <italic>z</italic>-score of 49.9. Tetrabenazine is approved for treatment of Huntington&#x2019;s disease but is widely used for various movement disorders for which, among other effects, it improves levodopa-induced peak-dose dyskinesias in patients with Parkinson&#x2019;s disease (<xref ref-type="bibr" rid="B8">Brusa et al., 2013</xref>).</p>
</sec>
<sec id="s3-4">
<title>3.4 Evaluation of Methods to Score Drug&#x2013;Drug Similarity&#x2013;Based Repurposing</title>
<p>A common approach in drug repurposing for diseases with already approved drugs is to identify other drugs that are similar to the already approved ones. To this end, we here calculate a score for a drug&#x2013;disease combination based on the similarity between a drug candidate and a drug already approved for that disease. This similarity is based on either the shortest path or crosstalk between the drug targets of a candidate drug and the drug targets of a drug approved for the disease, resulting in a <italic>z</italic>-score for the pair of drugs. The <italic>z</italic>-scores are then combined across all approved drugs to produce a drug&#x2013;disease similarity score for the drug candidate (see Methods).</p>
<sec id="s3-4-1">
<title>3.4.1 Performances on the Guney2016 Benchmark</title>
<p>The ROC and AUROC performance was markedly increased compared with drug&#x2013;disease association with over 30% improvements for proximity and ANUBIX<italic>,</italic> followed by 22% improvement for NEAT, and 17% for BinoX (<xref ref-type="table" rid="T1">Table 1</xref> and <xref ref-type="table" rid="T2">Table 2</xref>). The results show significantly (<italic>p</italic> &#x3c; .01) different performance for all pairwise comparisons except between NEAT and BinoX (<xref ref-type="fig" rid="F6">Figure 6</xref>). At an AUROC of 0.989, proximity showed 1.4% higher performance than ANUBIX (AUROC &#x3d; 0.975), followed by BinoX and NEAT at an AUROC of 0.885 and 0.883, respectively. The recall increased by about twofold compared with the drug&#x2013;disease evaluation for proximity and ANUBIX<italic>,</italic> resulting in 0.925 and 0.960, respectively, meaning that almost a complete recovery of all positives was achieved. An increased recall was observed for NEAT and BinoX as well.</p>
<table-wrap id="T2" position="float">
<label>TABLE 2</label>
<caption>
<p>Drug&#x2013;drug similarity&#x2013;based performance metrics for all three benchmarks. Performance on the three benchmarks, Guney 2016, time-stamped (time), and FCbench, was assessed for different drug repurposing tools: ANUBIX, BinoX, NEAT, and proximity (prox). The evaluated metrics included AUROC, recall, and recall on the pairs in the time-stamped benchmark (new positives).</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left"/>
<th colspan="3" align="center">AUROC</th>
<th colspan="4" align="center">recall</th>
</tr>
<tr>
<th align="left"/>
<th align="center">Guney2016</th>
<th align="center">Time</th>
<th align="center">FCbench</th>
<th align="center">Guney2016</th>
<th align="center">Time</th>
<th align="center">Time (new positives)</th>
<th align="center">FCbench</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">ANUBIX</td>
<td align="char" char=".">0.975</td>
<td align="char" char=".">0.829</td>
<td align="char" char=".">0.883</td>
<td align="char" char=".">0.925</td>
<td align="char" char=".">0.719</td>
<td align="char" char=".">0.391</td>
<td align="char" char=".">0.823</td>
</tr>
<tr>
<td align="left">BinoX</td>
<td align="char" char=".">0.885</td>
<td align="char" char=".">0.813</td>
<td align="char" char=".">0.823</td>
<td align="char" char=".">0.791</td>
<td align="char" char=".">0.766</td>
<td align="char" char=".">0.046</td>
<td align="char" char=".">0.740</td>
</tr>
<tr>
<td align="left">NEAT</td>
<td align="char" char=".">0.883</td>
<td align="char" char=".">0.813</td>
<td align="char" char=".">0.803</td>
<td align="char" char=".">0.831</td>
<td align="char" char=".">0.748</td>
<td align="char" char=".">0.046</td>
<td align="char" char=".">0.732</td>
</tr>
<tr>
<td align="left">prox</td>
<td align="char" char=".">0.989</td>
<td align="char" char=".">0.820</td>
<td align="char" char=".">0.973</td>
<td align="char" char=".">0.960</td>
<td align="char" char=".">0.730</td>
<td align="char" char=".">0.414</td>
<td align="char" char=".">0.910</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>Drug&#x2013;drug similarity&#x2013;based performance on the Guney2016 benchmark. Performance of the different drug repurposing tools: ANUBIX (green), BinoX (orange), NEAT (purple), and proximity (prox, pink) on <bold>(A)</bold> ROC curves, where the dotted line represents random prediction, and <bold>(B)</bold> AUROC using sampled sets from the benchmark containing equal numbers of positive and negative drug&#x2013;disease combinations. The pairwise Wilcoxon rank sum test was used to assess the significance of difference on AUROC. FDR-corrected <italic>p</italic>-values were obtained using the Benjamini&#x2013;Hochberg procedure.</p>
</caption>
<graphic xlink:href="fgene-13-792090-g006.tif"/>
</fig>
</sec>
<sec id="s3-4-2">
<title>3.4.2 Performances on the Time-Stamped Validation</title>
<p>Also for the time-stamped analysis, performance of drug&#x2013;drug similarity was higher for all studied approaches than when drug&#x2013;disease association was used. However, in comparison with the performance of drug&#x2013;drug similarity on the Guney2016 data set, the time-stamped performance was lower, in particular for the previously best-performing tools, proximity and ANUBIX (<xref ref-type="table" rid="T2">Table 2</xref>). With a 1% margin, ANUBIX significantly outperformed proximity in this analysis with an AUROC of 0.829 vs. 0.820. The difference in AUROC was not statistically significant between proximity, NEAT<italic>,</italic> and BinoX (<xref ref-type="fig" rid="F7">Figure 7</xref>). The recall for the time-stamped data set was markedly lower than in the Guney2016 benchmark for all methods except BinoX, which was only slightly lower. BinoX achieved the highest recall of 0.766 followed by NEAT at 0.748, proximity at 0.730, and ANUBIX at 0.719. Compared with drug&#x2013;disease association, the recall on the new positives was only marginally higher for proximity and ANUBIX, landing on 0.414 and 0.391, respectively. The already low recall of new positives seen for BinoX and NEAT in the drug&#x2013;disease association analysis shrank further to 0.046 for drug&#x2013;drug similarity (<xref ref-type="table" rid="T2">Table 2</xref>).</p>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption>
<p>Drug&#x2013;drug similarity&#x2013;based performance on the time-stamped benchmark. Performance of the different drug repurposing tools: ANUBIX (green), BinoX (orange), NEAT (purple), and proximity (prox, pink) on <bold>(A)</bold> ROC curves, where the dotted line represents random prediction, and <bold>(B)</bold> AUROC using sampled sets from the benchmark containing equal numbers of positive and negative drug&#x2013;disease combinations. The pairwise Wilcoxon rank sum test was used to assess the significance of difference on AUROC. FDR-corrected <italic>p</italic>-values were obtained using the Benjamini&#x2013;Hochberg procedure.</p>
</caption>
<graphic xlink:href="fgene-13-792090-g007.tif"/>
</fig>
</sec>
<sec id="s3-4-3">
<title>3.4.3 Performances on the FCbench</title>
<p>The drug&#x2013;drug similarity&#x2013;based performance on the FCbench also increased compared with the corresponding drug&#x2013;disease association&#x2013;based one for all studied approaches. Proximity reached the highest AUROC of 0.973, followed by ANUBIX at 0.883, BinoX at 0.823, and <italic>NEAT</italic> at 0.803 (<xref ref-type="fig" rid="F8">Figure 8</xref>). The same ranking was reflected in the recall values, where proximity, ANUBIX, BinoX, and NEAT reached 0.910, 0.823, 0.740, and 0.732, respectively (<xref ref-type="table" rid="T2">Table 2</xref>). All differences in AUROC were statistically significant (<italic>p</italic> &#x3c; .01) (<xref ref-type="fig" rid="F8">Figure 8</xref>).</p>
<fig id="F8" position="float">
<label>FIGURE 8</label>
<caption>
<p>Drug&#x2013;drug similarity&#x2013;based performance on the FCbench. Performance of the different drug repurposing tools: ANUBIX (green), BinoX (orange), NEAT (purple), and proximity (prox, pink) on <bold>(A)</bold> ROC curves, where the dotted line represents random prediction, and <bold>(B)</bold> AUROC using sampled sets from the benchmark containing equal numbers of positive and negative drug&#x2013;disease combinations. The pairwise Wilcoxon rank sum test was used to assess the significance of difference on AUROC. FDR-corrected <italic>p</italic>-values were obtained using the Benjamini&#x2013;Hochberg procedure.</p>
</caption>
<graphic xlink:href="fgene-13-792090-g008.tif"/>
</fig>
</sec>
</sec>
</sec>
<sec id="s4">
<title>4 Discussion</title>
<p>The last couple of years shows that the possibilities for drug repurposing were far from satisfactory and called for better methods, in particular computational network-based approaches that can combine all existing knowledge of drugs, targets, disease genes, and networks. This study introduces a new approach to network-based drug repurposing based on network crosstalk, rivalling available state-of-the-art methods, together with an updated and comprehensive <italic>in silico</italic> performance assessment platform.</p>
<p>The Guney2016 benchmark for drug repurposing tools introduced by <xref ref-type="bibr" rid="B18">Guney et al. (2016)</xref> was applied na&#xef;vely but also utilized as the basis for a time-stamped update. Although the BinoX tool outperformed the other approaches on the Guney2016 benchmark, the same achievement was not reproduced in the time-stamped approach nor in the new, more comprehensive FCbench. Except for the ranking on the Guney2016 benchmark, it was clear that the top crosstalk-based approach was <italic>ANUBIX</italic>. This can be attributed to the fact that the both BinoX and NEAT approaches are more prone to false positives compared with ANUBIX due to the beta-binomial distribution being a more accurate model of random crosstalk in the underlying network and that ANUBIX accounts for nonrandom intrapathway interactions (<xref ref-type="bibr" rid="B11">Castresana-Aguirre and Sonnhammer, 2020</xref>). An additional insight from the three benchmarks was that all methods performed much better on the FCbench, which can be attributed to the more comprehensive underlying functional association network, FunCoup.</p>
<p>When it comes to the choice of the underlying network, other comprehensive functional association networks exist, such as STRING (<xref ref-type="bibr" rid="B42">Szklarczyk et al., 2020</xref>) and HumanNet (<xref ref-type="bibr" rid="B21">Yang et al., 2018</xref>). The reason for using FunCoup is both because it relies solely on experimental evidence, avoiding the noise introduced by, e.g., text mining that forms the largest evidence type in STRING, and FunCoup&#x2019;s superior performance when it comes to link quality compared with the other networks (<xref ref-type="bibr" rid="B32">Persson et al., 2021</xref>). However, networks keep evolving constantly, and their comparison is nontrivial. Therefore, the current benchmarking approach is intentionally flexible and does not require FunCoup to be used, allowing for any other network to be used in FunCoup&#x2019;s place.</p>
<p>Performance of the studied tools was primarily assessed using ROC, AUROC, and recall. These metrics are commonly used in the field of drug repurposing. Despite being an incomplete metric by only focusing on the positive cases, the importance of recall is suggested to be favored for drug repurposing due to incomplete knowledge of all positive drug&#x2013;disease combinations (<xref ref-type="bibr" rid="B7">Brown and Patel, 2018</xref>). The latter is also the reason why other popular metrics, such as precision, PR curves, AUPR, Matthew&#x2019;s correlation coefficient, and F-measure, may be unreliably conservative for drug repurposing because they are based on the false positive rate, which may be highly overestimated. They were calculated but not included in the final tool evaluation.</p>
<p>Although statistically significant differences were noted between the top two methods, proximity and ANUBIX, the practical importance of one or a few percent differences is questionable. Besides, in some evaluations one tool outperformed the other while the situation was reversed in other assessments. In the end, both methods present complementary approaches contributing almost equally to a substantial proportion of unique predictions, i.e., 68.5% of all drug&#x2013;disease predictions being unique to one or the other tool. This is a clear indication of the potential benefit of using the top methods in concert both for validation of predictions from one tool by the other and to get a higher coverage of the repurposable space.</p>
<p>Another outcome of our analysis supports the findings of the Guney2016 benchmark. The performance of drug&#x2013;drug similarity&#x2013;based approaches was higher than was seen for the drug&#x2013;disease based ones, irrespective of the used drug repurposing method or benchmark. This could be driven by the higher recall when similarity between drug targets of approved drugs and candidate drugs is evaluated. The sets of disease genes are usually heterogeneous, containing both disease-causing genes, i.e., drivers, and genes merely affected by the disease, i.e., passengers. Drug target association with the drivers may yield a potential therapeutic effect while association with the passengers does not guarantee an effect on the disease. When association is assessed for the whole set of disease genes, the passenger genes may, therefore, introduce noise, decreasing signal-to-noise ratio and thereby also performance. A potential way to address this issue could be to purify the set of disease genes by preclustering, an approach that is shown to increase sensitivity in pathway analysis (<xref ref-type="bibr" rid="B9">Castresana-Aguirre, 2021</xref>). While many disease genes are known for each disease, most drugs have only one or a few drug targets, e.g., for the FDA-approved drugs in DrugBank, the median number of targets was three (<xref ref-type="bibr" rid="B36">Rivero-Garc&#xed;a et al., 2020</xref>). This suggests that the set of drug targets for a drug is, for the most part, a more homogeneous set. Many of the novel treatments consist of monoclonal antibodies with only one target, suggesting that even more specificity is on the horizon. This increases the signal-to-noise ratio when drug&#x2013;drug similarity is used for repurposing. However, the drawback is that drug&#x2013;drug similarity requires at least one drug to be approved for a disease to repurpose more drugs for that disease. On the other hand, the approach could be attempted even for diseases without approved treatments if related diseases do have drugs approved for them.</p>
<p>In the assessment of the potential drug repurposing candidates, it was apparent that not all the predictions were related to potential new treatment opportunities. Some drug&#x2013;disease combinations were suggestive of potential drug-related side effects. This supports the fact that network-based drug repurposing methodologies can also be used to identify potential side effects, something we hope to explore in the future.</p>
<p>In conclusion, in this study, we employed a comprehensive framework of functional interaction data to create a platform for <italic>in silico</italic> assessment of network-based drug repurposing, which can be utilized for robust evaluation of new drug repurposing strategies. Additionally, we show that network crosstalk-based approaches can rival the state-of-the-art network-based repurposing method, thereby adding to the arsenal of tools that can be used by the scientific community in the search for novel applications of already approved treatments for the benefit of patients.</p>
</sec>
</body>
<back>
<sec id="s5">
<title>Data Availability Statement</title>
<p>We used R (<ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="http://r-project.org">r-project.org</ext-link>) version 3.6.3 for statistical tests and data visualization as well as Python (<ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="http://python.org">python.org</ext-link>). The datasets generated for this study together with corresponding code can be found in the Sonnhammer group repository <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://bitbucket.org/sonnhammergroup/drugrepurposingbench/">https://bitbucket.org/sonnhammergroup/drugrepurposingbench/</ext-link>.</p>
</sec>
<sec id="s6">
<title>Author Contributions</title>
<p>DG: Conceptualization, software, data curation, writing&#x2014;original draft. ES: Conceptualization, supervision, writing&#x2014;review and editing, funding acquisition.</p>
</sec>
<sec id="s7">
<title>Funding</title>
<p>This work was supported by the Swedish Research Council Project 2015-05342. Open access funding was provided by Stockholm University.</p>
</sec>
<sec sec-type="COI-statement" id="s8">
<title>Conflict of Interest</title>
<p>Author DG was employed by company Merck AB, Sweden.</p>
<p>The remaining author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s9">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="s10">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fgene.2022.792090/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fgene.2022.792090/full&#x23;supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="Table1.XLSX" id="SM1" mimetype="application/XLSX" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="DataSheet1.docx" id="SM2" mimetype="application/docx" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Alexeyenko</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Schmitt</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Tjarnberg</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Guala</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Frings</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Sonnhammer</surname>
<given-names>E. L. L.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Comparative Interactomics with Funcoup 2.0</article-title>. <source>Nucleic Acids Res.</source> <volume>40</volume>, <fpage>D821</fpage>&#x2013;<lpage>D828</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkr1062</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ashburner</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Ball</surname>
<given-names>C. A.</given-names>
</name>
<name>
<surname>Blake</surname>
<given-names>J. A.</given-names>
</name>
<name>
<surname>Botstein</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Butler</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Cherry</surname>
<given-names>J. M.</given-names>
</name>
<etal/>
</person-group> (<year>2000</year>). <article-title>Gene Ontology: Tool for the Unification of Biology</article-title>. <source>Nat. Genet.</source> <volume>25</volume>, <fpage>25</fpage>&#x2013;<lpage>29</lpage>. <pub-id pub-id-type="doi">10.1038/75556</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Badkas</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>De Landtsheer</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Sauter</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Topological Network Measures for Drug Repositioning</article-title>. <source>Brief. Bioinform.</source> <volume>22</volume>, <fpage>1</fpage>&#x2013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbaa357</pub-id> </citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Benjamini</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Hochberg</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>1995</year>). <article-title>Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing</article-title>. <source>J. R. Stat. Soc. Ser. B (Methodological)</source> <volume>57</volume>, <fpage>289</fpage>&#x2013;<lpage>300</lpage>. <pub-id pub-id-type="doi">10.1111/j.2517-6161.1995.tb02031.x</pub-id> </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bollard</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Miguela</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Ruiz De Galarreta</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Venkatesh</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Bian</surname>
<given-names>C. B.</given-names>
</name>
<name>
<surname>Roberto</surname>
<given-names>M. P.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Palbociclib (PD-0332991), a Selective CDK4/6 Inhibitor, Restricts Tumour Growth in Preclinical Models of Hepatocellular Carcinoma</article-title>. <source>Gut</source> <volume>66</volume>, <fpage>1286</fpage>&#x2013;<lpage>1296</lpage>. <pub-id pub-id-type="doi">10.1136/gutjnl-2016-312268</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bradley</surname>
<given-names>A. P.</given-names>
</name>
</person-group> (<year>1997</year>). <article-title>The Use of the Area under the Roc Curve in the Evaluation of Machine Learning Algorithms</article-title>. <source>Pattern Recognition</source> <volume>30</volume>, <fpage>1145</fpage>&#x2013;<lpage>1159</lpage>. <pub-id pub-id-type="doi">10.1016/S0031-3203(96)00142-2</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Brown</surname>
<given-names>A. S.</given-names>
</name>
<name>
<surname>Patel</surname>
<given-names>C. J.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>A Review of Validation Strategies for Computational Drug Repositioning</article-title>. <source>Brief. Bioinform.</source> <volume>19</volume>, <fpage>174</fpage>&#x2013;<lpage>177</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbw110</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Brusa</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Orlacchio</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Stefani</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Galati</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Pierantozzi</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Iani</surname>
<given-names>C.</given-names>
</name>
<etal/>
</person-group> (<year>2013</year>). <article-title>Tetrabenazine Improves Levodopa-Induced Peak-Dose Dyskinesias in Patients with Parkinsons Disease</article-title>. <source>Fneur</source> <volume>28</volume>. <pub-id pub-id-type="doi">10.11138/fneur/2013.28.2.101</pub-id> </citation>
</ref>
<ref id="B9">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Castresana-Aguirre</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2021</year>). <source>From Networks to Pathway Analysis (PhD Dissertation</source>. <publisher-loc>Stockholm</publisher-loc>: <publisher-name>Department of Biochemistry and Biophysics, Stockholm University</publisher-name>. <comment>Retrieved from <ext-link ext-link-type="uri" xlink:href="http://urn.kb.se/resolve?urn=urn:nbn:se:su:diva-195048">http://urn.kb.se/resolve?urn&#x3d;urn:nbn:se:su:diva-195048</ext-link>.</comment> </citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Castresana-Aguirre</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Sonnhammer</surname>
<given-names>E. L. L.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Pathway-specific Model Estimation for Improved Pathway Annotation by Network Crosstalk</article-title>. <source>Sci. Rep.</source> <volume>10</volume>. <pub-id pub-id-type="doi">10.1038/s41598-020-70239-z</pub-id> </citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Davis</surname>
<given-names>A. P.</given-names>
</name>
<name>
<surname>Grondin</surname>
<given-names>C. J.</given-names>
</name>
<name>
<surname>Johnson</surname>
<given-names>R. J.</given-names>
</name>
<name>
<surname>Sciaky</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Wiegers</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wiegers</surname>
<given-names>T. C.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Comparative Toxicogenomics Database (CTD): Update 2021</article-title>. <source>Nucleic Acids Res.</source> <volume>49</volume>, <fpage>D1138</fpage>&#x2013;<lpage>D1143</lpage>. <pub-id pub-id-type="doi">10.1093/NAR/GKAA891</pub-id> </citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Davis</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Goadrich</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>The Relationship between Precision-Recall and ROC Curves</article-title>. <source>ACM Int. Conf. Proceeding Ser.</source>, <fpage>233</fpage>&#x2013;<lpage>240</lpage>. <pub-id pub-id-type="doi">10.1145/1143844.1143874</pub-id> </citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Finn</surname>
<given-names>R. S.</given-names>
</name>
<name>
<surname>Dering</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Conklin</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Kalous</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Cohen</surname>
<given-names>D. J.</given-names>
</name>
<name>
<surname>Desai</surname>
<given-names>A. J.</given-names>
</name>
<etal/>
</person-group> (<year>2009</year>). <article-title>PD 0332991, a Selective Cyclin D Kinase 4/6 Inhibitor, Preferentially Inhibits Proliferation of Luminal Estrogen Receptor-Positive Human Breast Cancer Cell Lines <italic>In Vitro</italic>
</article-title>. <source>Breast Cancer Res.</source> <volume>11</volume>. <pub-id pub-id-type="doi">10.1186/bcr2419</pub-id> </citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Giurgiu</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Reinhard</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Brauner</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Dunger-Kaltenbach</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Fobo</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Frishman</surname>
<given-names>G.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>CORUM: the Comprehensive Resource of Mammalian Protein Complexes-2019</article-title>. <source>Nucleic Acids Res.</source> <volume>47</volume>, <fpage>D559</fpage>&#x2013;<lpage>D563</lpage>. <pub-id pub-id-type="doi">10.1093/NAR/GKY973</pub-id> </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Glaab</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Baudot</surname>
<given-names>a.</given-names>
</name>
<name>
<surname>Krasnogor</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Schneider</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Valencia</surname>
<given-names>a.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>EnrichNet: Network-Based Gene Set Enrichment Analysis</article-title>. <source>Bioinformatics</source> <volume>28</volume>, <fpage>i451</fpage>&#x2013;<lpage>i457</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bts389</pub-id> </citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Guala</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Ogris</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>M&#xfc;ller</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Sonnhammer</surname>
<given-names>E. L. L.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Genome-wide Functional Association Networks: Background, Data &#x26; State-Of-The-Art Resources</article-title>. <source>Brief. Bioinform</source>. <pub-id pub-id-type="doi">10.1093/bib/bbz08910.1093/bib/bbz064</pub-id> </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Guney</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Menche</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Vidal</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Bar&#xe1;basi</surname>
<given-names>A.-L.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Network-based In Silico Drug Efficacy Screening</article-title>. <source>Nat. Commun.</source> <volume>7</volume>, <fpage>10331</fpage>. <pub-id pub-id-type="doi">10.1038/ncomms10331</pub-id> </citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hamosh</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Scott</surname>
<given-names>A. F.</given-names>
</name>
<name>
<surname>Amberger</surname>
<given-names>J. S.</given-names>
</name>
<name>
<surname>Bocchini</surname>
<given-names>C. A.</given-names>
</name>
<name>
<surname>McKusick</surname>
<given-names>V. A.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Online Mendelian Inheritance in Man (OMIM), a Knowledgebase of Human Genes and Genetic Disorders</article-title>. <source>Nucleic Acids Res.</source> <volume>33</volume>, <fpage>D514</fpage>&#x2013;<lpage>D517</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gki033</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Heron</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Fortinsky</surname>
<given-names>K. J.</given-names>
</name>
<name>
<surname>Spiegle</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Hilzenrat</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Szilagyi</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Resected Hepatocellular Carcinoma in a Patient with Crohn&#x27;s Disease on Azathioprine</article-title>. <source>Case Rep. Gastroenterol.</source> <volume>10</volume>, <fpage>54</fpage>&#x2013;<lpage>60</lpage>. <pub-id pub-id-type="doi">10.1159/000444012</pub-id> </citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hwang</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>C. Y.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Hart</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Marcotte</surname>
<given-names>E. M.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>HumanNet V2: Human Gene Networks for Disease Research</article-title>. <source>Nucleic Acids Res.</source> <volume>47</volume>, <fpage>D573</fpage>&#x2013;<lpage>D580</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gky1126</pub-id> </citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jourdan</surname>
<given-names>J.-P.</given-names>
</name>
<name>
<surname>Bureau</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Rochais</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Dallemagne</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Drug Repositioning: a Brief Overview</article-title>. <source>J. Pharm. Pharmacol.</source> <volume>72</volume>, <fpage>1145</fpage>&#x2013;<lpage>1151</lpage>. <pub-id pub-id-type="doi">10.1111/JPHP.13273</pub-id> </citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kanehisa</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>KEGG: Kyoto Encyclopedia of Genes and Genomes</article-title>. <source>Nucleic Acids Res.</source> <volume>28</volume>, <fpage>27</fpage>&#x2013;<lpage>30</lpage>. <pub-id pub-id-type="doi">10.1093/nar/28.1.27</pub-id> </citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Keilwagen</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Grosse</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Grau</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Area under Precision-Recall Curves for Weighted and Unweighted Data</article-title>. <source>PLoS One</source> <volume>9</volume>, <fpage>e92209</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0092209</pub-id> </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>K&#xf6;hler</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Bauer</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Horn</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Robinson</surname>
<given-names>P. N.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Walking the Interactome for Prioritization of Candidate Disease Genes</article-title>. <source>Am. J. Hum. Genet.</source> <volume>82</volume>, <fpage>949</fpage>&#x2013;<lpage>958</lpage>. <pub-id pub-id-type="doi">10.1016/j.ajhg.2008.02.013</pub-id> </citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Maleki</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Ovens</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Hogan</surname>
<given-names>D. J.</given-names>
</name>
<name>
<surname>Kusalik</surname>
<given-names>A. J.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Gene Set Analysis: Challenges, Opportunities, and Future Research</article-title>. <source>Front. Genet.</source> <volume>11</volume>, <fpage>654</fpage>. <pub-id pub-id-type="doi">10.3389/fgene.2020.00654</pub-id> </citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>McCormack</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Frings</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Alexeyenko</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Sonnhammer</surname>
<given-names>E. L. L.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Statistical Assessment of Crosstalk Enrichment between Gene Groups in Biological Networks</article-title>. <source>PLoS One</source> <volume>8</volume>, <fpage>e54945</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0054945</pub-id> </citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Meldal</surname>
<given-names>B. H. M.</given-names>
</name>
<name>
<surname>Bye-A-Jee</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Gajdo&#x161;</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Hammerov&#xe1;</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Hor&#xe1;&#x10d;kov&#xe1;</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Melicher</surname>
<given-names>F.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Complex Portal 2018: Extended Content and Enhanced Visualization Tools for Macromolecular Complexes</article-title>. <source>Nucleic Acids Res.</source> <volume>47</volume>, <fpage>D550</fpage>&#x2013;<lpage>D558</lpage>. <pub-id pub-id-type="doi">10.1093/NAR/GKY1001</pub-id> </citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Menche</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Sharma</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Kitsak</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Ghiassian</surname>
<given-names>S. D.</given-names>
</name>
<name>
<surname>Vidal</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Loscalzo</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>Uncovering Disease-Disease Relationships through the Incomplete Interactome</article-title>. <source>Science</source> <volume>347</volume>, <fpage>1257601</fpage>. <pub-id pub-id-type="doi">10.1126/science.1257601</pub-id> </citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Morselli Gysi</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Do Valle</surname>
<given-names>&#xcd;.</given-names>
</name>
<name>
<surname>Zitnik</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Ameli</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Gan</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Varol</surname>
<given-names>O.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Network Medicine Framework for Identifying Drug-Repurposing Opportunities for COVID-19</article-title>. <source>Proc. Natl. Acad. Sci. USA</source> <volume>118</volume>, <fpage>e2025581118</fpage>. <pub-id pub-id-type="doi">10.1073/pnas.2025581118</pub-id> </citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ogris</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Guala</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Helleday</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Sonnhammer</surname>
<given-names>E. L. L.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>A Novel Method for Crosstalk Analysis of Biological Networks: Improving Accuracy of Pathway Annotation</article-title>. <source>Nucleic Acids Res.</source> <volume>45</volume>, <fpage>e8</fpage>. <pub-id pub-id-type="doi">10.1093/nar/gkw849</pub-id> </citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Persson</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Castresana-Aguirre</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Buzzao</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Guala</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>SonnhammerErik</surname>
<given-names>L. L.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Funcoup 5: Functional Association Networks in All Domains of Life, Supporting Directed Links and Tissue-Specificity</article-title>. <source>J. Mol. Biol.</source> <pub-id pub-id-type="doi">10.1016/j.jmb.2021.166835</pub-id> </citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pertea</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Ayanbule</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Smedinghoff</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Salzberg</surname>
<given-names>S. L.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>OperonDB: A Comprehensive Database of Predicted Operons in Microbial Genomes</article-title>. <source>Nucleic Acids Res.</source> <volume>37</volume>, <fpage>D479</fpage>&#x2013;<lpage>D482</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkn784</pub-id> </citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ramos</surname>
<given-names>E. M.</given-names>
</name>
<name>
<surname>Hoffman</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Junkins</surname>
<given-names>H. A.</given-names>
</name>
<name>
<surname>Maglott</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Phan</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Sherry</surname>
<given-names>S. T.</given-names>
</name>
<etal/>
</person-group> (<year>2013</year>). <article-title>Phenotype-Genotype Integrator (PheGenI): Synthesizing Genome-wide Association Study (GWAS) Data with Existing Genomic Resources</article-title>. <source>Eur. J. Hum. Genet.</source> <volume>22</volume>, <fpage>144</fpage>&#x2013;<lpage>147</lpage>. <pub-id pub-id-type="doi">10.1038/ejhg.2013.96</pub-id> </citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Razick</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Magklaras</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Donaldson</surname>
<given-names>I. M.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>iRefIndex: a Consolidated Protein Interaction Database with Provenance</article-title>. <source>BMC Bioinformatics</source> <volume>9</volume>, <fpage>405</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-9-405</pub-id> </citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rivero-Garc&#xed;a</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Castresana-Aguirre</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Guglielmo</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Guala</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Sonnhammer</surname>
<given-names>E. L. L.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Drug Repurposing Improves Disease Targeting 11-fold and Can Be Augmented by Network Module Targeting, Applied to COVID-19</article-title>. <source>Sci. Rep.</source> <volume>11</volume>, <fpage>1</fpage>&#x2013;<lpage>11</lpage>. <pub-id pub-id-type="doi">10.1038/s41598-021-99721-y</pub-id> </citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rothan</surname>
<given-names>H. A.</given-names>
</name>
<name>
<surname>Stone</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Natekar</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Kumari</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Arora</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>The FDA-Approved Gold Drug Auranofin Inhibits Novel Coronavirus (SARS-COV-2) Replication and Attenuates Inflammation in Human Cells</article-title>. <source>Virology</source> <volume>547</volume>, <fpage>7</fpage>&#x2013;<lpage>11</lpage>. <pub-id pub-id-type="doi">10.1016/j.virol.2020.05.002</pub-id> </citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Schuler</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Falls</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Mangione</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Hudson</surname>
<given-names>M. L.</given-names>
</name>
<name>
<surname>Bruggemann</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Samudrala</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2020</year>). <source>Evaluating Performance of Drug Repurposing Technologies</source>. <pub-id pub-id-type="doi">10.1101/2020.12.03.410274</pub-id>
<source>bioRxiv</source> </citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Signorelli</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Vinciotti</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Wit</surname>
<given-names>E. C.</given-names>
</name>
<name>
<surname>Ashburner</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Ball</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Blake</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>NEAT: an Efficient Network Enrichment Analysis Test</article-title>. <source>BMC Bioinformatics</source> <volume>17</volume>, <fpage>352</fpage>. <pub-id pub-id-type="doi">10.1186/s12859-016-1203-6</pub-id> </citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Singh</surname>
<given-names>T. U.</given-names>
</name>
<name>
<surname>Parida</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Lingaraju</surname>
<given-names>M. C.</given-names>
</name>
<name>
<surname>Kesavan</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Singh</surname>
<given-names>R. K.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Drug Repurposing Approach to Fight COVID-19</article-title>. <source>Pharmacol. Rep.</source> <volume>72</volume>, <fpage>1479</fpage>&#x2013;<lpage>1508</lpage>. <pub-id pub-id-type="doi">10.1007/S43440-020-00155-6</pub-id> </citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Stathias</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Turner</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Koleti</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Vidovic</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Cooper</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Fazel-Najafabadi</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>LINCS Data Portal 2.0: Next Generation Access point for Perturbation-Response Signatures</article-title>. <source>Nucleic Acids Res.</source> <volume>48</volume>, <fpage>D431</fpage>&#x2013;<lpage>D439</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkz1023</pub-id> </citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Szklarczyk</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Gable</surname>
<given-names>A. L.</given-names>
</name>
<name>
<surname>Nastou</surname>
<given-names>K. C.</given-names>
</name>
<name>
<surname>Lyon</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Kirsch</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Pyysalo</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>The STRING Database in 2021: Customizable Protein-Protein Networks, and Functional Characterization of User-Uploaded Gene/measurement Sets</article-title>. <source>Nucleic Acids Res.</source> <volume>49</volume>, <fpage>D605</fpage>&#x2013;<lpage>D612</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkaa1074</pub-id> </citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vermes</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Tardif</surname>
<given-names>J.-C.</given-names>
</name>
<name>
<surname>Bourassa</surname>
<given-names>M. G.</given-names>
</name>
<name>
<surname>Racine</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Levesque</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>White</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2003</year>). <article-title>Enalapril Decreases the Incidence of Atrial Fibrillation in Patients with Left Ventricular Dysfunction</article-title>. <source>Circulation</source> <volume>107</volume>, <fpage>2926</fpage>&#x2013;<lpage>2931</lpage>. <pub-id pub-id-type="doi">10.1161/01.CIR.0000072793.81076.D4</pub-id> </citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wishart</surname>
<given-names>D. S.</given-names>
</name>
<name>
<surname>Feunang</surname>
<given-names>Y. D.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>A. C.</given-names>
</name>
<name>
<surname>Lo</surname>
<given-names>E. J.</given-names>
</name>
<name>
<surname>Marcu</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Grant</surname>
<given-names>J. R.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>DrugBank 5.0: A Major Update to the DrugBank Database for 2018</article-title>. <source>Nucleic Acids Res.</source> <volume>46</volume>, <fpage>D1074</fpage>&#x2013;<lpage>D1082</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkx1037</pub-id> </citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yu</surname>
<given-names>M. W.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>C. J.</given-names>
</name>
</person-group> (<year>1993</year>). <article-title>Elevated Serum Testosterone Levels and Risk of Hepatocellular Carcinoma</article-title>. <source>Cancer Res.</source> <volume>53</volume>, <fpage>790</fpage>&#x2013;<lpage>794</lpage>. </citation>
</ref>
</ref-list>
</back>
</article>