<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Mol. Biosci.</journal-id>
<journal-title>Frontiers in Molecular Biosciences</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Mol. Biosci.</abbrev-journal-title>
<issn pub-type="epub">2296-889X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">882738</article-id>
<article-id pub-id-type="doi">10.3389/fmolb.2022.882738</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Molecular Biosciences</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Decoding the Role of Epigenetics in Breast Cancer Using Formal Modeling and Machine-Learning Methods</article-title>
<alt-title alt-title-type="left-running-head">Asim et al.</alt-title>
<alt-title alt-title-type="right-running-head">Decoding the Role of Epigenetics in Breast Cancer</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Asim</surname>
<given-names>Ayesha</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/1625918/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Kiani</surname>
<given-names>Yusra Sajid</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/476067/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Saeed</surname>
<given-names>Muhammad Tariq</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/439155/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Jabeen</surname>
<given-names>Ishrat</given-names>
</name>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/491965/overview"/>
</contrib>
</contrib-group>
<aff>
<institution>School of Interdisciplinary Engineering and Sciences (SINES)</institution>, <institution>National University of Sciences and Technology, (NUST)</institution>, <addr-line>Islamabad</addr-line>, <country>Pakistan</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/204932/overview">Adil Mardinoglu</ext-link>, King&#x2019;s College London, United Kingdom</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/883783/overview">Fu Hui</ext-link>, Tianjin University of Traditional Chinese Medicine, China</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1702773/overview">Han Jin</ext-link>, Science for Life Laboratory (SciLifeLab), Sweden</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Ishrat Jabeen, <email>ishrat.jabeen@sines.nust.edu.pk</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Molecular Diagnostics and Therapeutics, a section of the journal Frontiers in Molecular Biosciences.</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>11</day>
<month>07</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>9</volume>
<elocation-id>882738</elocation-id>
<history>
<date date-type="received">
<day>24</day>
<month>02</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>25</day>
<month>05</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2022 Asim, Kiani, Saeed and Jabeen.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Asim, Kiani, Saeed and Jabeen</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>Breast carcinogenesis is known to be instigated by genetic and epigenetic modifications impacting multiple cellular signaling cascades, thus making its prevention and treatments a challenging endeavor. However, epigenetic modification, particularly DNA methylation-mediated silencing of key TSGs, is a hallmark of cancer progression. One such tumor suppressor gene (TSG) <italic>RUNX3</italic> (Runt-related transcription factor 3) has been a new insight in breast cancer known to be suppressed due to local promoter hypermethylation mediated by DNA methyltransferase 1 (DNMT1). However, the precise mechanism of epigenetic-influenced silencing of the <italic>RUNX3</italic> signaling resulting in cancer invasion and metastasis remains inadequately characterized. In this study, a biological regulatory network (BRN) has been designed to model the dynamics of the DNMT1&#x2013;<italic>RUNX3</italic> network augmented by other regulators such as p21, c-myc, and p53. For this purpose, the Ren&#xe9; Thomas qualitative modeling was applied to compute the unknown parameters and the subsequent trajectories signified important behaviors of the DNMT1&#x2013;<italic>RUNX3</italic> network (i.e., recovery cycle, homeostasis, and bifurcation state). As a result, the biological system was observed to invade cancer metastasis due to persistent activation of oncogene c-myc accompanied by consistent downregulation of TSG <italic>RUNX3.</italic> Conversely, homeostasis was achieved in the absence of c-myc and activated TSG <italic>RUNX3</italic>. Furthermore, DNMT1 was endorsed as a potential epigenetic drug target to be subjected to the implementation of machine-learning techniques for the classification of the active and inactive DNMT1 modulators. The best-performing ML model successfully classified the active and least-active DNMT1 inhibitors exhibiting 97% classification accuracy. Collectively, this study reveals the underlined epigenetic events responsible for <italic>RUNX3</italic>-implicated breast cancer metastasis along with the classification of DNMT1 modulators that can potentially drive the perception of epigenetic-based tumor therapy.</p>
</abstract>
<kwd-group>
<kwd>RUNX3 signaling pathway</kwd>
<kwd>Dnmt1</kwd>
<kwd>machine learning</kwd>
<kwd>qualitative modeling</kwd>
<kwd>c-myc</kwd>
<kwd>SMBioNet</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<title>1 Introduction</title>
<p>Breast cancer is one of the frequently diagnosed lethal malignancies affecting millions of women worldwide. The risk factors include both genetic and epigenetic abnormalities whereby the latter provides early prognostic biomarkers for breast cancer therapeutics (<xref ref-type="bibr" rid="B5">Chen, 2012</xref>; <xref ref-type="bibr" rid="B30">Sun et al., 2017</xref>). Furthermore, DNA methylation is the most prominent epigenetic marker that frequently influences the expression or silencing of genes involved in vital cellular activities such as cell proliferation, programmed cell death, and cell differentiation (<xref ref-type="bibr" rid="B5">Chen, 2012</xref>). DNA methylation is mediated by DNA methyltransferases (DNMTs), in which the DNMT1 isomer is explicitly responsible for the maintenance of methylation status during replication (S-phase) (<xref ref-type="bibr" rid="B23">Pouliot et al., 2015</xref>). Briefly, it transfers the methyl group (&#x2212;CH3) on cytosine in CpG dinucleotide regions by keeping the CpG islands non-methylated that are preferably located in the proximal promoter region of a gene (<xref ref-type="bibr" rid="B31">Tang et al., 2009</xref>). On the contrary, the hypermethylation of promoter CpG islands of TSGs due to the overexpression of DNMT1 is a popular mechanism of gene silencing and a hallmark of cancer (<xref ref-type="bibr" rid="B28">Subramaniam et al., 2014</xref>; <xref ref-type="bibr" rid="B23">Pouliot et al., 2015</xref>).</p>
<p>DNMT1-regulated hypermethylation of the TSG <italic>RUNX3</italic> promoter region is a new insight and an early event in breast cancer predominantly in TNBC (triple-negative breast cancer) (<xref ref-type="bibr" rid="B34">Widschwendter and Jones, 2002</xref>; <xref ref-type="bibr" rid="B15">Jung et al., 2007</xref>; <xref ref-type="bibr" rid="B13">Jiang et al., 2008</xref>; <xref ref-type="bibr" rid="B29">Subramaniam et al., 2009</xref>; <xref ref-type="bibr" rid="B27">Shin et al., 2016</xref>). Generally, <italic>RUNX3</italic> is known to elicit its tumor-suppressive ability through major cancer signaling pathways including TGF-&#x3b2;, Wnt/&#x3b2;-catenin, and KRAS (<xref ref-type="bibr" rid="B5">Chen, 2012</xref>). <italic>RUNX3</italic> either influences the downstream target of tumor suppressor signaling pathways or acts as an antagonist for oncogenic pathways to exert its antitumor activity. However, despite the evident role of <italic>RUNX3</italic> in breast cancer and the relation of DNMT1 with <italic>RUNX3</italic> gene, no study has explored the underlying epigenetic molecular events by which DNMT1 silences <italic>RUNX3</italic> to promote breast cancer metastasis.</p>
<p>Furthermore, due to the pivotal implication of DNMT1 in various tumors (<xref ref-type="bibr" rid="B20">Li et al., 2019</xref>) and the reversible nature of its methylation activity, targeting DNMT1 through small modulators is a promising pharmacological intervention to revive the suppressed TSGs (such as <italic>RUNX3</italic>). To date, two DNMT inhibitors (i.e., azacytidine and decitabine) have been approved by FDA for the treatment of myeloid malignancies (<xref ref-type="bibr" rid="B9">Gnyszka, Jastrz&#x119;bski, and Flis 2013</xref>). However, the potential side effects associated with these drugs limit their use in high-grade malignancies. Therefore, the development of more potent DNMT1 inhibitors acquiring potential binding pocket features is highly desirable to restore the suppressed TSGs (<italic>RUNX3</italic>) in cancer therapeutics. Previously, several <italic>in vivo</italic> and computational studies have targeted DNMT1 for therapeutic purposes (<xref ref-type="bibr" rid="B38">Yoo 2011</xref>; <xref ref-type="bibr" rid="B37">Yoo, Kim, and Robertson 2012</xref>; <xref ref-type="bibr" rid="B21">Mirza et al., 2013</xref>; <xref ref-type="bibr" rid="B18">Krishna et al., 2017</xref>); however, its impact on downstream target genes, particularly <italic>RUNX3,</italic> requires further elucidation.</p>
<p>Therefore, in the present study, a BRN of the DNMT1&#x2013;<italic>RUNX3</italic> signaling was developed to provide an insight into the epigenetic-mediated silencing of <italic>RUNX3</italic> leading to cancer development and metastasis. For this purpose, qualitative modeling by the Ren&#xe9; Thomas formalism was applied in a SMBioNet tool utilizing the existing wet-laboratory data in the form of computation tree logic (CTL) for parameter estimation and verification through the model checking technique (<xref ref-type="fig" rid="F1">Figure 1</xref>). The model trajectories were explored to identify the paths involved in the overexpression of DNMT1, activation of oncogene (c-myc), and suppression of tumor suppressor genes (<italic>RUNX3</italic>, p21, and p53) leading to cancer invasion or recovery (homeostasis). In addition, DNMT1 was advocated as a potential drug target and favored to be subjected to ML approaches to classify diverse DNMT1 modulators. Decision tree (DT) and neural network classifiers were built utilizing the data set of DNMT1 from the ChEMBL database to identify potential two-dimensional binding features essential for the modulation of DNMT1 activity (<xref ref-type="fig" rid="F1">Figure 1</xref>).</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Flowchart showing the overall methodology utilized in the current study. Qualitative modeling of the BRN to explore and model the DNMT1&#x2013;<italic>RUNX3</italic> network was followed by the application of ML techniques to construct the predictive ML (DT and ANN) models using the DNMT1 data set extracted from the ChEMBL database.</p>
</caption>
<graphic xlink:href="fmolb-09-882738-g001.tif"/>
</fig>
</sec>
<sec id="s2">
<title>2 Materials and Methods</title>
<p>The overall methodology used in the current study is provided in <xref ref-type="fig" rid="F1">Figure 1</xref>.</p>
<sec id="s2-1">
<title>2.1 Qualitative Modeling Framework</title>
<p>
<statement content-type="definition" id="Definition_1">
<label>Definition 1</label>
<p>Biological Regulatory Network (BRN)</p>
<p>A BRN is a labeled, directed graph G &#x3d; (V, E) in which biological regulators are denoted as a set of nodes or vertices V and their interactions as a set of edges E&#x2286; V &#xd7; V. Each edge is labeled by a pair of elements (<italic>&#x3c4;, &#x3c3;</italic>), where <italic>&#x3c4;</italic> is the threshold at which a gene U starts regulating a gene V, and <italic>&#x3c3;</italic> represents the type of interactions <italic>&#x3c3;</italic> &#x3d; &#x2b; (activation) and &#x2212; (inhibition) between the nodes (<xref ref-type="bibr" rid="B2">Bernot et al., 2004</xref>).</p>
</statement>
</p>
<p>
<statement content-type="definition" id="Definition_2">
<label>Definition 2</label>
<p>Qualitative State</p>
<p>The state of the BRN is a configuration of the expression level of all the nodes at a particular time instant. A state is n-tuple S &#x3d; {s<sub>v1</sub> , . . . , s<sub>vn</sub>}, &#x2200;s<sub>vi</sub> &#x2208; &#x3b4;<sub>vi</sub>, where s<sub>vi</sub> is the expression level of <sub>vi</sub> (<xref ref-type="bibr" rid="B2">Bernot et al., 2004</xref>).</p>
</statement>
</p>
<p>
<statement content-type="definition" id="Definition_3">
<label>Definition 3</label>
<p>Resources</p>
<p>Each <italic>vi</italic> in a state of the BRN is controlled by its precursors G<sup>
<bold>-</bold>
</sup>
<sub>vi</sub>, called the set of resources &#x3c9;<sub>vi</sub>.<list list-type="simple">
<list-item>
<p>Let G &#x3d; (V, E), at level s<sub>vj</sub>, &#x3c9;<sub>vi</sub>, defined as:</p>
</list-item>
<list-item>
<p>&#x3c9;<sub>vj</sub> &#x3d; vi &#x2208; G<sup>&#x2212;</sup>
<sub>vj</sub> &#x2223; (s<sub>vi</sub> &#x2265; &#x3c4;<sub>vi</sub>,<sub>vj</sub> and &#x3c3;<sub>vi,vj</sub> &#x3d; &#x2b;) or (s<sub>vi</sub> &#x3c; &#x3c4;<sub>vi,vj</sub> and &#x3c3;<sub>vi,vj</sub> &#x3d; &#x2212;) (<xref ref-type="bibr" rid="B2">Bernot et al., 2004</xref>).</p>
</list-item>
</list>
</p>
</statement>
</p>
<p>
<statement content-type="definition" id="Definition_4">
<label>Definition 4</label>
<p>Parameters</p>
<p>Parameters of a biological regulatory network are indexed by its set of resources. It is a cartesian product of each variable&#x2019;s element and resource (<xref ref-type="bibr" rid="B26">Saeed et al., 2016</xref>).</p>
</statement>
</p>
<p>
<statement content-type="definition" id="Definition_5">
<label>Definition 5</label>
<p>Betweenness Centrality</p>
<p>Network analysis techniques such as graph theory&#x2013;based approaches can be applied to further analyze the state graph by sorting it based on betweenness centrality (<xref ref-type="bibr" rid="B8">Freeman 2019</xref>). The betweenness centrality measures the extent to which a single vertex or node is more central/connected than all other nodes. For a particular node, the centrality metric is measured with the number of shortest paths that pass through it, whereas the betweenness would be high for a particular node if it appears in many shortest paths (<xref ref-type="bibr" rid="B10">Golbeck 2015</xref>). Likewise, the qualitative states with higher betweenness centrality value are more likely to occur in the system, implying that these entities are frequently expressed in biological phenomena. Moreover, the central or more connected nodes in the system might represent a potential therapeutic target (<xref ref-type="bibr" rid="B10">Golbeck 2015</xref>; <xref ref-type="bibr" rid="B26">Saeed et al., 2016</xref>).</p>
</statement>
</p>
</sec>
<sec id="s2-2">
<title>2.2 Qualitative Modeling and Parameter Estimation</title>
<p>Ren&#xe9; Thomas introduced a graph theory&#x2013;based approach for qualitative modeling of dynamic biological regulatory networks (<xref ref-type="bibr" rid="B32">Thomas 1978</xref>). In this method, each BRN (<xref ref-type="statement" rid="Definition_1">Definition 1</xref>) is modeled as a weighted, directed graph, which consists of a set of nodes and edges. Nodes represent a biological entity (i.e., genes or proteins), whereas edge represents the relationship of activation or inhibition between the nodes.</p>
<p>Herein, the role of the epigenetic (DNMT1)-mediated <italic>RUNX3</italic> silencing in cancer development and progression was examined by regulating key oncogenes and TSGs. The literature-driven information was used to generate the BRN, which signifies important entities involved in the DNMT1&#x2013;<italic>RUNX3</italic> signaling and the relationship (activation and inhibition) among the chosen entities. The unknown parameters were inferred by encoding wet-laboratory biological observations as propositional calculus or more precisely computation tree logic (CTL) and verified through the model checking technique as previously reported by <xref ref-type="bibr" rid="B25">Saeed et al. (2018</xref>). Briefly, model checking is an automatic technique based on the exhaustive exploration of the entire state space of a biological system, which therefore allows the analysis and cross-verification of a large number of possible outcomes of a network (<xref ref-type="bibr" rid="B2">Bernot et al., 2004</xref>). All compatible combinations of parameters (<xref ref-type="statement" rid="Definition_4">Definition 4</xref>) were generated and evaluated by a model checker using a SMBioNet tool, and the model parameters that violate laboratory data were eliminated.</p>
<p>Briefly, SMBioNet (<xref ref-type="bibr" rid="B16">Khalis et al., 2009</xref>) is a qualitative framework&#x2013;based tool that calls New Symbolic Model Verifier (NuSMV) (<xref ref-type="bibr" rid="B6">Cimatti et al., 2002</xref>) as a model checker. NuSMV works by considering a Model M of the BRN and its property <italic>&#x3c6;</italic>, which exhaustively explores M to verify <italic>&#x3c6;.</italic> The SMBioNet engages in this principle to identify logical parameters of the models that comply more with the known observations. Once the computational verification with laboratory data was completed, the resultant model and all its trajectories were further analyzed to understand how the systematic evolution of the DNMT1&#x2013;<italic>RUNX3</italic> system takes place with time. The network was explored using the concept of betweenness centrality to underline important trajectories of the dynamic biological system (i.e., homeostasis, bifurcation state, and recovery cycle). Furthermore, the paths involved in the activation of oncogenes, suppression of tumor suppressor genes, cancer invasion, and recovery were also identified. This study also highlighted a qualitative bifurcating at which the system can invade tumorigenesis or normal homeostasis depending upon the successive changes in the expression level of TSG and oncogenes.</p>
</sec>
<sec id="s2-3">
<title>2.3 Data Set Compilation or Collection</title>
<p>A total of 738 DNMT1 inhibitors were collected from the ChEMBL database (target ID ChEMBL1993). Only those compounds were extracted for which the biological activity was estimated experimentally as the inhibitory potency (IC<sub>50</sub> value). Initially, the removal of duplicates, small fragments (MW &#x3c; 200), and inconsistent activity values (IC<sub>50</sub>) was performed, which was followed by the exploration and manual correction of stereoisomers. The data preprocessing resulted in a final data set of 242 DNMT1 inhibitors with IC<sub>50</sub> values in the range of 0.01&#x2013;1,600&#xa0;&#x3bc;M (<xref ref-type="sec" rid="s2">Section 2</xref>, <xref ref-type="sec" rid="s11">Supplementary Tables S1, S2</xref>).</p>
<p>Briefly<bold>,</bold> our data set contains compounds of diverse origin that include natural, synthetic (nucleoside/non-nucleoside), and FDA-approved drugs against DNMT1, thus incorporating all major scaffolds of DNMT1 inhibitors available to date. The shortlisted DNMT1 inhibitors were used to build machine-learning (DT and ANN) models. Therefore, a diverse subset selection approach was utilized to divide the data set into a training set (80%) and a test set (20%) for model building and validation, respectively. Briefly, a diverse subset splits the data into two sections based on chemical diversity calculated as a function of distance between molecular descriptors (<xref ref-type="bibr" rid="B17">Koutsoukas et al., 2013</xref>). The absolute biological activity values were converted into binary numbers on the basis of an activity threshold value (IC<sub>50</sub> &#x2264; 10&#xa0;&#xb5;M) such that 1 represents active and 0 indicates least-active class of DNMT1 inhibitors. This binarization of DNMT1 data was supported by a histogram plot provided in <xref ref-type="sec" rid="s11">Supplementary Figure S1</xref>.</p>
</sec>
<sec id="s2-4">
<title>2.4 Calculation and Selection of 2D Chemical Descriptors</title>
<p>All 2D MOE descriptors (2019.01) (<xref ref-type="bibr" rid="B33">ULC 2018</xref>) by excluding energy-related descriptors were calculated for the training set. A total of 158 2D descriptors were computed through MOE after which the redundant, null, and constant value descriptors were excluded from the final set. The descriptors with negligible relevancy and weightage were also removed from the final set of descriptors to improve the overall predictive ability of ML models. Consequently, the selected descriptors were provided as an input to WEKA (3.9.3) for the construction of a DT classifier that was further used for shortlisting of the most relevant and decision-making attributes.</p>
</sec>
<sec id="s2-5">
<title>2.5 Machine-Learning Approaches</title>
<p>Herein, the DT and ANN classification models were built using the training set and the models were validated using the 20% test set compounds.</p>
<sec id="s2-5-1">
<title>2.5.1 Decision Tree</title>
<p>The C4.5 variant of the J48 algorithm implemented in WEKA was used to build the univariate tree of training set attributes. The J48 algorithm works by splitting the data into smaller subsets based on features that will produce the most uniform child node at each step (<xref ref-type="bibr" rid="B39">Yosipof, Guedes, and Garc&#xed;a-Sosa 2018</xref>). The process is repeated iteratively until no more splits can be made or data are uniformly classified into terminal nodes. Tree parameters were tuned to improve the overall performance of the model and limit the overfitting of data. Therefore, the lowest number of confidence factor was used to incur more pruning and a minimum of one instance per leaf was set for the splitting rule, with a 10-fold cross-validation approach.</p>
</sec>
<sec id="s2-5-2">
<title>2.5.2 Artificial Neural Network</title>
<p>Artificial neural networks (ANNs) are nonparametric human nervous system&#x2013;inspired computational models, which process complex input information to produce the output. A multilayer perceptron (MLP) function of WEKA (3.9.3) was utilized to build the ANN model of DNMT1 inhibitors. The MLP network typically consists of at least three layers: one input, one output, and one hidden layer (can be more than one). The perceptron calculates a linear combination of inputs and their weights to compute a sum, and the output is calculated through an activation function (most often the sigmoid) (<xref ref-type="bibr" rid="B14">Jiao et al., 2020</xref>). Moreover, MLP uses back-propagation to find the optimized input weights and builds hidden layer(s) for the classification of nonlinear data (<xref ref-type="bibr" rid="B7">Frank, Hall, and Witten 2017</xref>). Generally, MLP normalizes each attribute by default, to improve the performance of the network. Overall, model training was performed using a 10-fold cross-validation and the parameters were optimized on the basis of number and nodes of hidden layer by constructing several perceptron models. The default values of momentum, learning rate and training time were used in order to build the best-fit neural network.</p>
</sec>
<sec id="s2-5-3">
<title>2.5.3 Model Performance Validation</title>
<p>The DT and ANN models were trained using the training set (80%), and the classification was validated using the test set (20%). The performance of the machine-learning model was further evaluated using statistical parameters including accuracy (<xref ref-type="disp-formula" rid="e1">Eq. 1</xref>), sensitivity (<xref ref-type="disp-formula" rid="e2">Eq. 2</xref>), specificity (<xref ref-type="disp-formula" rid="e3">Eq. 3</xref>), F-measure (<xref ref-type="disp-formula" rid="e6">Eq. 6</xref>), and MCC measure (<xref ref-type="disp-formula" rid="e7">Eq. 7</xref>). Accuracy is the percentage of correctly classified active and least-active compounds, whereas specificity is the percentage of true least-active predicted compounds, and sensitivity is the proportion of true active compounds predicted from our model. In addition, F-score, also known as balanced accuracy, measures the precision (how many compounds are correctly classified) and robustness (resistance to errors) of ML models. Likewise, Matthew&#x2019;s correlation coefficient (MCC) measures the difference between actual and predicted values, and it is only high (near to 1) if all the classes are predicted in good proportion.</p>
<p>Overall accuracy:<disp-formula id="e1">
<mml:math id="m1">
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="bold-italic">TP</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="bold-italic">TN</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="bold-italic">TP</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="bold-italic">FP</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="bold-italic">TN</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="bold-italic">FN</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:math>
<label>(1)</label>
</disp-formula>
</p>
<p>Sensitivity (true-positive rate):<disp-formula id="e2">
<mml:math id="m2">
<mml:mrow>
<mml:mi mathvariant="bold-italic">Tp</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="bold-italic">TP</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="bold-italic">TP</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="bold-italic">FN</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:math>
<label>(2)</label>
</disp-formula>
</p>
<p>Specificity (true-negative rate):<disp-formula id="e3">
<mml:math id="m3">
<mml:mrow>
<mml:mi mathvariant="bold-italic">Tn</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="bold-italic">TN</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="bold-italic">TN</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="bold-italic">FP</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:math>
<label>(3)</label>
</disp-formula>
</p>
<p>Precision:<disp-formula id="e4">
<mml:math id="m4">
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="bold-italic">TP</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="bold-italic">TP</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="bold-italic">FP</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:math>
<label>(4)</label>
</disp-formula>
</p>
<p>Recall:<disp-formula id="e5">
<mml:math id="m5">
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="bold-italic">TP</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="bold-italic">TP</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="bold-italic">FN</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:math>
<label>(5)</label>
</disp-formula>
</p>
<p>F-measure:<disp-formula id="e6">
<mml:math id="m6">
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="bold-italic">precision</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="bold-italic">recall</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="bold-italic">precision</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="bold-italic">recall</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:math>
<label>(6)</label>
</disp-formula>
</p>
<p>Mathew&#x2019;s correlation coefficient:<disp-formula id="e7">
<mml:math id="m7">
<mml:mrow>
<mml:mi mathvariant="bold-italic">C</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="bold-italic">TP</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="bold-italic">TN</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi mathvariant="bold-italic">FP</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="bold-italic">FN</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msqrt>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold-italic">TP</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="bold-italic">FP</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold-italic">TP</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="bold-italic">FN</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold-italic">TN</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="bold-italic">FP</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold-italic">TN</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="bold-italic">FN</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:msqrt>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:math>
<label>(7)</label>
</disp-formula>
</p>
</sec>
</sec>
</sec>
<sec id="s3">
<title>3 Results</title>
<p>A schematic model depicting the stepwise process of replication and methylation of TSG <italic>RUNX3</italic> by DNMT1 and its subsequent influence on various cancer signaling pathways was established using the literature-driven information. Qualitative modeling was then performed to unfold the cellular events involved in the epigenetic, that is, DNMT1-mediated silencing of <italic>RUNX3</italic> that leads to cancer invasion. The comprehensive literature-driven pathway of DNMT1&#x2013;<italic>RUNX3</italic> is elaborated in the <xref ref-type="sec" rid="s11">Supplementary Material Section S3</xref>, <xref ref-type="fig" rid="F2">Figure 2</xref>. Briefly, a series of stepwise events at the replication fork led by macroprotein complexes (DNMT1, UHRF1, HAUSP, Tip60, HDAC1, and PCNA) ensures the methylation of the newly synthesized daughter strand during the S-phase (<xref ref-type="fig" rid="F2">Figure 2</xref>). As a result, the nascent <italic>RUNX3</italic> strand acquires the normal status of methylation, whereas the promoter region remains hypomethylated to facilitate active transcription of <italic>RUNX3</italic> gene. Subsequently, <italic>RUNX3</italic> (transcription factor) functions as a tumor suppressor protein to combat cancer initiation and metastasis through various signaling pathways. The TSG <italic>RUNX3</italic> elicits an antitumor activity by regulating the transcription of target genes (p21, c-myc, etc.) of the major cancer signaling cascades, such as transforming growth factor-beta (TGF-&#x3b2;), Wnt/&#x3b2;-catenin, and mitogenic KRAS pathway as explained in <xref ref-type="fig" rid="F2">Figure 2</xref>.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Schematic knowledge-based network is presented to illustrate the stepwise process of replication and methylation (by DNMT1) of TSG <italic>RUNX3</italic> and its implication in different cancer pathways. Step 1: At the replication fork, UHRF1 recognizes hemimethylated DNA and recruits other proteins including DNMT1, Tip60, HAUSP1, HDAC1, and PCNA to make a macroprotein complex. Step 2: DNMT1 in complex with these regulators transfers a methyl group through the base flip mechanism onto the nascent daughter strand. The green color highlighted in the daughter strand depicts the hypomethylated promoter region of the nascent <italic>RUNX3</italic> gene. The red-headed lollipop structure here mimics the normal methylation status of <italic>RUNX3</italic> gene. Step 3: Transcription machinery then successfully identifies the promoter region to translate the functional <italic>RUNX3</italic> protein, which acts as a tumor suppressor and combats the cancerous environment through the regulation of major signaling pathways including TGF-beta, Wnt/&#x3b2;, and KRAS pathways. TSG <italic>RUNX3</italic> exerts its antitumor activity by regulating the transcription of significant target genes including p21, c-myc, and p53 (oval blue structures at the bottom). Steps 4 and 5: The entire macroprotein complex after performing its function undergoes stepwise proteasomal degradation in the late S-phase of cell cycle. PCNA &#x3d; proliferating cell nuclear antigen; DNMT1 &#x3d; DNA methyltransferase 1; HAUSP1 &#x3d; herpesvirus-associated ubiquitin-specific protease; HDAC1 &#x3d; histone deacetylase1; Tip60 &#x3d; histone acetyltransferase; UHRF1 &#x3d; ubiquitin-like, containing PHD and RING finger domains 1; Ac&#x3d; acetylation; <italic>RUNX3</italic> &#x3d; Runt-related transcription factor 3; TGF-&#x3b2; &#x3d; transforming growth factor-beta; VEGF &#x3d; vascular endothelial growth factor; EMT &#x3d; epithelial&#x2013;mesenchymal transition; and TSG &#x3d; tumor suppressor gene.</p>
</caption>
<graphic xlink:href="fmolb-09-882738-g002.tif"/>
</fig>
<p>From literature-driven pathways described in <xref ref-type="fig" rid="F2">Figure 2</xref>, a qualitative biological regulatory network was generated based on preferred entities. DNMT1, <italic>RUNX3</italic>, p21, c-myc, p53, and MDM2 nodes were selected due to their well-established functionality in the biological system (<xref ref-type="sec" rid="s11">Supplementary Material Section S3</xref>). As a result, the BRN (<xref ref-type="fig" rid="F3">Figure 3A</xref>) composed of total six nodes and nine interactions exhibiting all the significant activation and inhibition relationships was obtained. Initially, negative feedback loops were observed from an inhibitory set of genes required by the system to generate the stable states. According to the interaction graph (<xref ref-type="fig" rid="F3">Figure 3A</xref>), <italic>RUNX3</italic> transactivates p21 to maintain the concentration of DNMT1 through a negative feedback loop. In addition, <italic>RUNX3</italic> inhibits the onset of oncogene c-myc that might upregulate the expression level of DNMT1 in a positive manner. Interestingly, p53 also prevents upregulation of DNMT1 through the activation of p21 and inhibition of oncogene c-myc. Furthermore, the proposed network characterized oscillatory behavior of two regulatory loops involving DNMT1 and <italic>RUNX3</italic>: 1) a negative feedback loop between p21 and DNMT1 through <italic>RUNX3</italic> and 2) a positive feedback loop between c-myc and DNMT1 through <italic>RUNX3</italic> (<xref ref-type="fig" rid="F3">Figure 3A</xref>)<italic>.</italic> It is notable that the representation of the BRN as a weighted, directed graph (<xref ref-type="fig" rid="F3">Figure 3A</xref>) was obtained using GENOTECH through the implementation of discrete formalism. Therefore, the interaction graph utilized the qualitative data only, that is, type of interaction (activation &#x2b;, or inhibition &#x2212;) among the nodes (genes), and the threshold value for each interaction. However, modeling the dynamic behavior of such complex systems that include both positive and negative loops requires the computation of accurate logical parameters. These parameters were generated in the SMBioNet software using the known experimental observations encoded as CTL formulas reported in <xref ref-type="fig" rid="F3">Figure 3B</xref>. The first CTL observation searches for a state with high expression of DNMT1 and oncogenes leading to tumor invasion. The later CTL formula seeks for the stable state or homeostasis that has normal expression level of tumor suppressor genes (<xref ref-type="fig" rid="F3">Figure 3B</xref>). These observations were coded as input in SMBioNet to generate the preferred set of logical parameters.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>
<bold>(A)</bold> DNMT1&#x2013;<italic>RUNX3</italic> interaction graph (BRN) was generated by utilizing the preferred entities to discover all the important activation (&#x2b;) and inhibition (&#x2212;) relationships among them. The network demonstrates predominantly two oscillatory behaviors in DNMT1&#x2013;<italic>RUNX3</italic> graph. The first one illustrates the <italic>RUNX3</italic>-stimulated onset of p21, which in turn positively regulates DNMT1 (shown with red arrows). The other loop exhibits the inhibition of c-myc by <italic>RUNX3</italic>, which also instigates the onset of DNMT1 (shown with green pointed arrows). In addition, p53 can also be seen regulating DNMT1 through the activation of p21 and inhibition of c-myc signals. <bold>(B)</bold> The second part of the figure demonstrates two CTL observations utilized by SMBioNet for the estimation of parameters that were later used to generate the state graph of the dynamic model. According to CTL formulas, the overexpression of DNMT1 is responsible for hypermethylation at the promoter region of <italic>RUNX3</italic> and ultimately its suppression, which is associated with many cancer types including breast cancer. Each circle/node represents a gene, and the arrows among them show the type of interaction they hold. Activation is denoted with green pointed arrows, and blunt red arrows represent inhibition whereby the weight of the arrows depicts threshold values of interactions.</p>
</caption>
<graphic xlink:href="fmolb-09-882738-g003.tif"/>
</fig>
<sec id="s3-1">
<title>3.1 Parameters</title>
<p>SMBioNet computed a total of 14 sets of logical parameters that are presented as a heat map in <xref ref-type="fig" rid="F4">Figure 4</xref>. The preferred set of logical parameters verified through model checking might reflect the probable biological trajectories in cancer invasion and recovery. According to the known biological observations, the most significant change occurs in the expression of DNMT1 and <italic>RUNX3</italic> in different types of malignancies including breast tumor. The heat map suggests eight critical resources of DNMT1 based on the presence and absence of its activators and inhibitors. For instance, (CMYC) indicates the presence of c-myc, and (CMYC, P21) represents the presence of activator c-myc and the absence of inhibitor p21. Likewise, (CMYC, P21, and P53) shows the presence of activator c-myc whereby no inhibitor is present in the system. The ability of each entity to evolve is described as a function of the presence or absence of its resource as shown in <xref ref-type="fig" rid="F4">Figure 4</xref>.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>Heat map of logical parameters computed on SMBioNet shows 14 distinct sets of parameters. A preferred set of parameters were estimated through model checking rendered as heat map along with their resources (M1&#x2013;M14). Each column represents a distinct set of logical parameters where a moderate expression of an entity is expressed using green color, an overexpression is expressed using red color, and an underexpression is illustrated using yellow color in the heat map.</p>
</caption>
<graphic xlink:href="fmolb-09-882738-g004.tif"/>
</fig>
<p>The trend of DNMT1 being induced in the presence of c-myc was observed in all 14 parameter sets. Therefore, a parameter set that allows DNMT1 to achieve its maximum threshold value of &#x201c;2&#x201d; was selected to generate the state graph, assuming that this concentration is lethal for breast cells. The selected parameters (M6) allow all nodes to interact with others corresponding to the natural dynamic phenomenon, while maintaining their interdependencies for the activation or suppression. The source code of input models is provided in the <xref ref-type="sec" rid="s11">Supplementary Material Section S1</xref>. The calculated parameters indicate that DNMT1 maintains a higher expression level in the presence of oncogene c-myc signaling. Conversely, an activation of an inhibitor p21 or p53 prevents the expression level of DNMT1 to exceed the normal threshold value; it is also observed that TSG <italic>RUNX3</italic> is activated in the presence of DNMT1 signal. The expression of p53 is increased when the MDM2 inhibition signals are absent in the system. However, the collective behavior of the genes in a dynamic biological system can only be concluded by interpreting the trajectories in the state transition graph.</p>
</sec>
<sec id="s3-2">
<title>3.2 State Graph</title>
<p>On the basis of selected logical parameters (M6) inferred from SMBioNet, a state graph was generated using GENOTECH and analyzed on the Cytoscape software to highlight significant genetic evolution including the recovery cycle (<xref ref-type="fig" rid="F5">Figure 5</xref>). The state transition graph illustrates all the possible qualitative states exhibited by the DNMT1&#x2013;<italic>RUNX3</italic> system as shown in <xref ref-type="fig" rid="F6">Figure 6</xref> (b). It contains a total of 32 nodes and 79 edges sorted on the basis of the betweenness centrality concept of graph theory. Our generated state graph highlights the temporal evolution of the system in the form of trajectories from one qualitative state to the other. It includes homeostasis, recovery trajectories, and bifurcation state from which the biological system can evolve either to homeostasis or to tumorigenesis. Here, we highlighted two important trajectories of the qualitative model involved in tumor progression and recovery/homeostasis. According to our model, the activation of oncogene c-myc signal (c-myc &#x3d; 1) introduces pathogenesis in the biological system, which is characterized by the qualitative states (0,0,1,<bold>1</bold>,0,0) and (0,1,0,<bold>1</bold>,1,0) highlighted in <xref ref-type="fig" rid="F5">Figures 5</xref> and <xref ref-type="fig" rid="F6">6C</xref>, respectively.</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>State graph of the recovery cycle from Model 6 (M6) is highlighted with black pointed arrows. Each circle indicates a unique qualitative state with gene entities in the following order: DNMT1, <italic>RUNX3</italic>, p21, c-myc, p53, and MDM2, sorted based on betweenness centrality. The recovery trajectory illustrates how a pathogenic system undergoes successive genetic evolution to reach normal homeostasis. The onset of oncogene c-myc introduces pathogenesis and tends to retain it by downregulating p21 and upregulating DNMT1. However, the activation of TSG <italic>RUNX3</italic> limits the overexpression of DNMT1 by inhibiting c-myc and restoring the p21 expression. The normal state characterized as (0,0,1,0,0,0) exhibits a high betweenness centrality as shown with a larger diameter and lighter color in the state graph. The color bar on the right side signifies the trend of betweenness centrality; that is, the lighter is the color and the larger is the diameter, the higher is the betweenness centrality of the qualitative state and vice versa.</p>
</caption>
<graphic xlink:href="fmolb-09-882738-g005.tif"/>
</fig>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>
<bold>(A)</bold> Heat map of the first 8 sets of parameters out of total 14 (refer <xref ref-type="fig" rid="F4">Figure 4</xref>) computed by SMBioNet is displayed, whereas Model 6 (M6) parameters were used to generate the state transition graph for network analysis. <bold>(B)</bold> State graph of M6 is shown, which composed of 32 nodes and 79 edges sorted on the basis of betweenness centrality. Each circle represents a unique state with gene expression in the order as follows: DNMT1, <italic>RUNX3</italic>, p21, c-myc, p53, and MDM2. The generated state transition graph illustrates all the possible qualitative states of the system. Trajectories of the graph were then further analyzed to identify important genetic evolution. <bold>(C)</bold> A bifurcation state is highlighted characterized by the qualitative state (010110). Trajectories display distinct paths from one common transition state characterized by the onset of oncogene c-myc leading to homeostasis or pathological loop based on successive genetic changes. From the bifurcating (0,1,0,1,1,0) state, the repressed <italic>RUNX3</italic> (0,0,0,1,1,0) converges the system toward pathological state with a successive onset of DNMT1 (1,0,1,1,1,0) and offset of p53 (1,0,0,1,0,0). Here, the qualitative state (1,0,0,1,0,0) is characterized as pathological state (highlighted in red box in the right trajectory) experienced by the system due to the consistent onset of oncogene c-myc along with persistent suppression of <italic>RUNX3</italic>. On the contrary, the system might evolve toward normal state (highlighted in green box in the left trajectory) if TSG <italic>RUNX3</italic> gets activated causing constant inhibition of oncogene c-myc (1,1,0,0,1,0), to control the moderate expression level of DNMT1 (0,1,1,0,1,0). The normal state is characterized by the activation of <italic>RUNX3</italic> along with the controlled expression level of DNMT1 (0,1,1,0,1,0), which is achieved by the system through continuous activation of TSG <italic>RUNX3</italic> along with the consistent inhibition of oncogene c-myc, ultimately leading to a typical reset state (0,0,0,0,0,0).</p>
</caption>
<graphic xlink:href="fmolb-09-882738-g006.tif"/>
</fig>
</sec>
<sec id="s3-3">
<title>3.3 Recovery</title>
<p>The qualitative model explains the step-by-step evolution of the system to recover from stress environment and maintain homeostasis. The onset of oncogenic c-myc (c-myc &#x3d; 1) initiates the pathogenesis by inhibiting cell cycle arrest protein/TSG p21 (p21 &#x3d; 0) leading to the qualitative state (0,0,0,1,0,0), which consequently activates DNMT1 (DNMT1 &#x3d; 1) (1,0,0,1,0,0) in the system. It is a typical pathogenic state characterized by high expression level of DNMT1 and c-myc along with a suppressed concentration of TSGs, that is, <italic>RUNX3</italic>, p21, and p53 as shown in <xref ref-type="fig" rid="F5">Figure 5</xref>.</p>
<p>DNMT1 activates <italic>RUNX3</italic> by maintaining the methylation status (<italic>RUNX3</italic> &#x3d; 1) (1,1,0,1,0,0), which employs its tumor suppressor activity and inhibits the onset of oncogene c-myc subsequently, limiting the hyperactivation of DNMT1 leading to the states (1,1,0,0,0,0) and (0,1,0,0,0,0), respectively. In addition, <italic>RUNX3</italic> transactivates p21 (0,1,1,0,0,0) to further keep a concentration check of DNMT1 expression level. The restoration of p21 signal supervises the abnormal and uncontrolled proliferation, thus causing the system to recover from a pathogenic environment. These successive genetic changes evolve into homeostasis that is generally characterized by the presence of TSGs with a moderate concentration of DNMT1, and repressed oncogenes (0,0,1,0,0,0) as shown in <xref ref-type="fig" rid="F5">Figure 5</xref>. Notably, the qualitative state (0,0,1,0,0,0) is the most central state of the system shown with lighter color and larger diameter. This suggests a high probability of the infected system to recover back to homeostasis through a series of steady states that are relatively less central (low betweenness centrality). The graph in <xref ref-type="fig" rid="F5">Figure 5</xref> with black pointed arrows describes all the states that the system exhibits to recover from a pathogenic state. Under normal circumstances, biological systems often exhibit oscillatory behavior or homeostasis during which the overall status of the system remains in a cycle of normal states. Thus, the qualitative model should expect pathogenic trajectories along with normal homeostasis behavior either as a cycle or as a closed path.</p>
</sec>
<sec id="s3-4">
<title>3.4 Bifurcation state</title>
<p>A transition bifurcation state highlighted in <xref ref-type="fig" rid="F6">Figure 6</xref> is characterized by the onset of oncogene c-myc from which the system could divert either toward pathogenic state or toward homeostasis depending on the successive genetic alterations. The suppressed level of TSG <italic>RUNX3</italic> (right trajectory of <xref ref-type="fig" rid="F6">Figure 6C</xref>) converges the system toward pathological path in which the persistent activation of oncogene c-myc (c-myc &#x3d; 1) with the subsequent consistent suppression of TSG <italic>RUNX3</italic> (<italic>RUNX3</italic> &#x3d; 0) augments the hyperactivation of DNMT1. Consequently, the activated c-myc also downregulates p21 and p53 to further assist the metastasis and tumor invasion (1,0,0,1,0,0) characterized by red square in <xref ref-type="fig" rid="F6">Figure 6C</xref>. Alternatively (left trajectory of <xref ref-type="fig" rid="F6">Figure 6C</xref>), activation of TSG <italic>RUNX3</italic> inhibits the c-myc signaling and restores p21 expression, which controls the concentration of DNMT1 to maintain normal homeostasis of the system. This leads to normal qualitative state (0,1,1,0,1,0) characterized by the moderate level of DNMT1, the low expression of oncogenes, and the presence of TSGs along with the oscillations of the p53&#x2013;MDM2 circuit as highlighted with green square on the left side of <xref ref-type="fig" rid="F6">Figure 6C</xref>. Moreover, the qualitative normal state (0,0,1,0,0,0) in the homeostasis trajectory is well connected as shown with lighter color and larger diameter, which explains the probability of this state to occur more in the system than any other or pathogenic state (1,0,0,1,0,0).</p>
</sec>
<sec id="s3-5">
<title>3.5 Machine-Learning Models</title>
<p>ML models were built in WEKA using the DNMT1 data set of diverse structures and FDA-approved drugs (decitabine, azacytidine) that were extracted from the ChEMBL database (target ID ChEMBL1993). 80% of the collected DNMT1 data set was utilized for training the model, while the remaining 20% was included in the test set to validate the classification models. The number of training and test set compounds along with other details (bioactivities, activity threshold) is presented in <xref ref-type="sec" rid="s4">Section 4</xref>, <xref ref-type="sec" rid="s11">Supplementary Table S3</xref>. In order to build classification models, an activity threshold was established for both the training and test sets such that compounds with IC<sub>50</sub> values &#x2264;10&#xa0;&#x3bc;M and &#x3e;18&#xa0;&#x3bc;M were categorized as active and least-active compounds, respectively. As a result, the training set contained 146 active (class label 1) and 68 least-active (class label 0) compounds. Initially, all the provided 2D MOE (2019.01) (<xref ref-type="bibr" rid="B33">ULC 2018</xref>) descriptors were calculated to train the DT model. The descriptors selected by the DT model were further utilized to build the ANN model of DNMT1 inhibitors using the MLP algorithm in WEKA. The descriptors including vsa_acc, h_logP, b_count, radius, PEOE_VSA-5, b_double, SlogP_VSA5, SMR_VSA2, Q_VSA_PNEG, and Kier3 (<xref ref-type="bibr" rid="B35">Wildman and Crippen 1999</xref>) were identified to be important for model learning and differentiating between active and least-active compounds efficiently. The detail of preferred descriptors is provided in <xref ref-type="sec" rid="s11">Supplementary Table S4</xref> of <xref ref-type="sec" rid="s4">Section 4</xref>.</p>
<sec id="s3-5-1">
<title>3.5.1 Decision Tree</title>
<p>The J48 algorithm implemented in the ML software WEKA was used to build the DT classification model. Parameters of DT were optimized, and a pruned tree was built in order to get the best accuracy and performance on the available data set. Therefore, a minimum of 1 instance per leaf was set for the splitting rule and the rest default parameters of the J48 algorithm were utilized to build an optimal DT model. A final DT model with an overall size of 23 and 12 terminal nodes was obtained using the DNMT1 data set (shown in <xref ref-type="sec" rid="s4">Section 4</xref>, <xref ref-type="sec" rid="s11">Supplementary Figure S2</xref>). The overall topology of the DT classifier is elaborated in <xref ref-type="sec" rid="s4">Section 4</xref> of the <xref ref-type="sec" rid="s11">Supplementary Material</xref>.</p>
<p>The model performance and overall predictive ability were evaluated through different statistical parameters including sensitivity (<xref ref-type="disp-formula" rid="e2">Eq. 2</xref>), specificity (<xref ref-type="disp-formula" rid="e3">Eq. 3</xref>), F-measure (<xref ref-type="disp-formula" rid="e6">Eq. 6</xref>), and MCC value (<xref ref-type="disp-formula" rid="e7">Eq. 7</xref>). The DT classifier attained accuracy, sensitivity, and specificity values of 0.974, 0.99, and 0.92, respectively, as shown in <xref ref-type="table" rid="T1">table 1</xref>. Overall, 97% of DNMT1 inhibitors were correctly classified into active and least-active class of compounds by the DT model. However, the ratios of correctly predicted active compounds (TPR) and the truly predicted least-active compounds (TNR) were observed to be 99% and 92%, respectively. The DT displayed an MCC value of 0.9 (<xref ref-type="table" rid="T1">table 1</xref>), indicating an optimal model efficiency with a strong positive correlation between the actual and predicted class labels. In addition, the F1-score was calculated to assess the balanced classification accuracy performance of the trained DT model, which turned out to be 0.98. As the DT showed an optimal performance on the training data in terms of classification and prediction, the descriptors identified by the DT model were further utilized to build the predictive ANN model.</p>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Statistical parameters of classification models, J-48 decision tree and MLP neural network, for training data calculated from WEKA.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">ML algorithm</th>
<th align="left">Accuracy</th>
<th align="left">Sensitivity</th>
<th align="left">Specificity</th>
<th align="left">Precision</th>
<th align="left">F-measure</th>
<th align="left">MCC</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td colspan="7" align="left">Training set (80%)</td>
</tr>
<tr>
<td align="left">&#x2003;DT</td>
<td align="center">0.974</td>
<td align="center">0.993</td>
<td align="center">0.920</td>
<td align="center">0.973</td>
<td align="center">0.983</td>
<td align="left">0.932</td>
</tr>
<tr>
<td align="left">&#x2003;ANN</td>
<td align="center">0.969</td>
<td align="center">0.979</td>
<td align="center">0.940</td>
<td align="center">0.979</td>
<td align="center">0.979</td>
<td align="left">0.919</td>
</tr>
<tr>
<td colspan="7" align="left">Test set (20%)</td>
</tr>
<tr>
<td align="left">&#x2003;DT</td>
<td align="center">0.826</td>
<td align="center">0.916</td>
<td align="center">0.500</td>
<td align="center">0.868</td>
<td align="center">0.891</td>
<td align="left">0.45</td>
</tr>
<tr>
<td align="left">&#x2003;ANN</td>
<td align="center">0.717</td>
<td align="center">0.805</td>
<td align="center">0.400</td>
<td align="center">0.828</td>
<td align="center">0.816</td>
<td align="left">0.20</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s3-5-2">
<title>3.5.2 Artificial Neural Network</title>
<p>A set of 10 selected 2D descriptors [vsa_acc, h_logP, b_count, radius, PEOE_VSA-5, b_double, SlogP_VSA5, SMR_VSA2, Q_VSA_PNEG, and Kier3 (<xref ref-type="bibr" rid="B35">Wildman and Crippen 1999</xref>)] were used to build the ANN model in WEKA utilizing the MLP algorithm. Several ANN models were developed exploiting different combinations of parameters in order to obtain an optimal classification model. The final ANN comprised of 10 input nodes, 1 layer of 4 hidden nodes, and two output nodes (i.e., &#x201c;1&#x201d; for active and &#x201c;0&#x201d; for least-active compounds) (<xref ref-type="sec" rid="s4">Section 4</xref>, <xref ref-type="sec" rid="s11">Supplementary Figure S3</xref>). The ANN model was optimized using a training time of 500, learning rate of 0.3, and momentum of 0.2 to improve the accuracy and speed of learning.</p>
<p>The classification performance of ANN was also evaluated using statistical measures including classification accuracy, F1-score, and MCC values as reported in <xref ref-type="table" rid="T1">table 1</xref>. The trained ANN model attained a MCC value of 0.91, an overall classification accuracy of 0.97, and a F1-score of 0.98, thus indicating an optimal binary classification of our data set. Overall, 97% of DNMT1 data was correctly classified into active and least-active class by our learned ANN, whereby the true-positive and true-negative prediction rates were observed to be 0.98 and 0.94 implying a higher specificity and sensitivity, respectively. Both the trained DT and ANN classifiers displayed optimal statistical performance, that is, greater than 96% and a good predictive ability for the training data.</p>
</sec>
<sec id="s3-5-3">
<title>3.5.3 Model Validation</title>
<p>The obtained DT and ANN models were validated using the 20% test set compounds that contain the most diverse chemical structures of the DNMT1 inhibitor data set (activity range of 0.01&#x2013;132&#xa0;&#x3bc;M). Overall, DT and ANN acquired a classification accuracy of greater than 70% for the test set whereby DT could correctly classify 83% of the data and the ANN model attained an accuracy of 73% (<xref ref-type="table" rid="T1">Table 1</xref>). Both the classifiers displayed an optimal value of sensitivity, that is, 0.92 and 0.80 for DT and ANN, respectively, implying a high true-positive prediction rate on the test set. Moreover, F1-score was calculated as 0.89 for the DT classifier and 0.81 for ANN indicative of the robustness and unbiased classification performance of our predictive ML classifiers. Despite the structural diversity of DNMT1 inhibitors, our models optimally retained the classification accuracy of 0.97 on the training set and 0.82 (DT) and 0.72 (ANN) on the test data. Notably, in comparison with the ANN model, the DT classifier could classify active and least-active compounds more efficiently as reported in <xref ref-type="table" rid="T1">table 1</xref>. The training and test sets utilized for model generation and validation in this study are shown in <xref ref-type="sec" rid="s2">Section 2</xref>, <xref ref-type="sec" rid="s11">Supplementary Tables S1, S2</xref>.</p>
</sec>
</sec>
</sec>
<sec id="s4">
<title>4 Discussion</title>
<p>TNBC is the most aggressive subtype of breast cancer that lacks obvious treatment options due to the availability of limited information about definite biomarkers. Previously, aberrant epigenetic modifications have been implicated in breast cancer, highlighting <italic>RUNX3</italic> as a promising prognostic biomarker (<xref ref-type="bibr" rid="B12">Wang et al., 2014</xref>). Moreover, several former studies have reported the depleted level of <italic>RUNX3</italic> in breast cancer cell lines predominantly due to local hypermethylation at the proximal promoter region of TSG <italic>RUNX3,</italic> which is an early event in carcinogenesis (<xref ref-type="bibr" rid="B5">Chen, 2012</xref>; <xref ref-type="bibr" rid="B19">Lau et al., 2006</xref>; <xref ref-type="bibr" rid="B4">Chen et al., 2016</xref>)<italic>.</italic> Therefore, the aberrant DNA methylation pattern is known as the hallmark of cancer epigenetics characterized by DNMT1, also known as the maintenance methyltransferase. So far, the individual role of <italic>RUNX3</italic> as a TSG in breast cancer and DNMT1 as the methylation maintenance protein has been reported in various scientific studies (<xref ref-type="bibr" rid="B13">Jiang et al., 2008</xref>; <xref ref-type="bibr" rid="B4">Chen et al., 2016</xref>; <xref ref-type="bibr" rid="B19">Lau et al., 2006</xref>; <xref ref-type="bibr" rid="B29">Subramaniam et al., 2009</xref>). However, the underlined epigenetic-mediated <italic>RUNX3</italic>-shared signaling that instigates and disseminates the tumorigenesis remains largely elusive. This study provides an insight into the integrated DNMT1&#x2013;<italic>RUNX3</italic> signaling by taking into account various cancer-related significant upstream and downstream regulators (such as p21, c-myc, p53, and MDM2) and presents the DNMT1&#x2013;<italic>RUNX3</italic> signaling cascades as one consolidated network. To the best of authors&#x2019; knowledge, this study is one of its kind to illustrate and model the epigenetic-mediated silencing of TSG <italic>RUNX3</italic> through the Ren&#xe9; Thomas framework modeling that has largely been a common practice of systems biology to investigate the dynamics of biological networks (<xref ref-type="bibr" rid="B25">Saeed et al., 2018</xref>).</p>
<p>Moreover, the model checking technique was adopted to build the interaction graph rendered as a state graph (<xref ref-type="fig" rid="F6">Figure 6B</xref>) by utilizing the literature-driven information of the DNMT1&#x2013;<italic>RUNX3</italic> signaling (<xref ref-type="fig" rid="F2">Figure 2</xref>). The similar approach of the model checking has been previously applied in different former studies such as parameter estimation through formal modeling (<xref ref-type="bibr" rid="B1">Ahmad et al., 2012</xref>; <xref ref-type="bibr" rid="B3">Bibi et al., 2017</xref>), tail resorption in tadpole metamorphosis (<xref ref-type="bibr" rid="B16">Khalis et al., 2009</xref>), and immunity control in bacteriophage lambda (<xref ref-type="bibr" rid="B24">Richard, Comet, and Bernot 2006</xref>). We utilized this technique in the SMBioNet tool to exhaustively explore the model space of the DNMT1&#x2013;<italic>RUNX3</italic> BRN for the estimation of precise parameters by encoding the existing wet-laboratory data in the form of computation tree logic (CTL) (<xref ref-type="fig" rid="F3">Figure 3B</xref>). Once the computational verification with laboratory data is completed, the resultant model parameters and all its trajectories were further analyzed to understand how systematic evolution of the DNMT1&#x2013;<italic>RUNX3</italic> system takes place with time, which might lead the system to invade cancer metastasis (<xref ref-type="fig" rid="F6">Figure 6B</xref>).</p>
<p>As a result, two important behaviors of the DNMT1&#x2013;<italic>RUNX3</italic> system were highlighted that shows the successive genetic events of the recovery cycle (<xref ref-type="fig" rid="F5">Figure 5</xref>) and a bifurcation state leading to cancer invasion (<xref ref-type="fig" rid="F6">Figure 6C</xref>). The trajectory (right side of <xref ref-type="fig" rid="F6">Figure 6C</xref>) from our model plotted as a state graph articulates that the onset of oncogene c-myc infects the system and anticipates the upregulation of DNMT1 by downregulating the <italic>RUNX3</italic> expression level. It generates a feedback loop where the consistent onset of oncogene c-myc accompanied by the persistent suppression of TSG <italic>RUNX3</italic> contributes toward the overproduction of DNMT1. These findings are in agreement with the previous experimental studies, which reported the transcriptional upregulation of DNMT1 in TNBC due to the amplified c-myc expression level, emphasizing the importance of DNMT1 and c-myc relation in tumorigenesis (<xref ref-type="bibr" rid="B36">Wu et al., 2021</xref>).</p>
<p>On the contrary, the activation of <italic>RUNX3</italic> (left side of <xref ref-type="fig" rid="F6">Figure 6C</xref>) and subsequent TSGs (such as p21, p53) regulate the adequate level of DNMT1 to acquire homeostasis. In addition, network analysis emphasized the tumor-suppressive role of <italic>RUNX3</italic> by demonstrating the higher probability of the infected system to recover to normal homeostasis in the presence of <italic>RUNX3</italic> (left side of <xref ref-type="fig" rid="F6">Figure 6C</xref>) than its likelihood to invade the tumorigenesis. This is in line with the former findings that have conversely related the reactivation of <italic>RUNX3</italic> with a reduced potential of metastasis and invasiveness in breast cancer cells (<xref ref-type="bibr" rid="B34">Widschwendter and Jones, 2002</xref>). Although computational models cannot replace experiments, they are a step to demonstrate whether or not a proposed mechanism is sufficient to produce an observed phenomenon or an underlying assumption on the basis of their mathematical framework.</p>
<p>Moreover, the findings of our qualitative modeling suggested DNMT1 as a critical hub regulator of various oncogenic and tumor suppressor proteins that determine the ultimate status of the system. These outcomes advocate DNMT1 as a potential drug target in epigenetic cancer signaling through logic-based temporal evolution. Notably, the hypermethylation of TSGs by DNMT1 is a reversible process; therefore, <italic>RUNX3</italic> expression could be restored using demethylating compounds. Consequently, designing specific small modulators that inhibit DNMT1 activity in order to restore the TSG <italic>RUNX3</italic> could represent new clinical avenues for breast cancer therapeutics. Several pioneer studies have discussed the probable restoration of <italic>RUNX3</italic> and a reduced carcinogenic potential in cancer cell lines when treated with demethylating drugs (<xref ref-type="bibr" rid="B19">Lau et al., 2006</xref>; <xref ref-type="bibr" rid="B15">Jung, Park, Young Kim, et al., 2007</xref>).</p>
<p>To date, only two FDA-approved drugs (azacytidine and decitabine) are available that target DNMT1 in addition to other DNMTs for the treatment of myelodysplastic syndrome (<xref ref-type="bibr" rid="B9">Gnyszka, Jastrz&#x119;bski, and Flis 2013</xref>). However, potential side effects of these drugs necessitate the design of less toxic and more specific inhibitors of DNMT1. Ever since the breakthrough of epigenetics in cancer treatment, several in silico studies such as pharmacophore and QSAR modeling (<xref ref-type="bibr" rid="B37">Yoo, Kim, and Robertson 2012</xref>; <xref ref-type="bibr" rid="B22">Phanus-Umporn et al., 2020</xref>) have been reported for drug discovery against DNMT1 through small modulators. However, to the best of authors&#x2019; knowledge, no model is developed that could learn the diverse features of DNMT1 inhibitors. Therefore, in this study, we adopted machine-learning approaches to identify potential 2D descriptors of DNMT1 modulators by utilizing all the available chemical scaffolds of DNMT1 inhibitors (i.e., natural compounds, synthetic, nucleoside inhibitors, and FDA drugs).</p>
<p>ML classifiers DT and ANN were developed to classify the active and least-active compounds of the DNMT1 data set. All 10 captured features of the best-performing model (DT and ANN) along with feature descriptions are provided in <xref ref-type="sec" rid="s4">Section 4</xref>, <xref ref-type="sec" rid="s11">Supplementary Table S4</xref>. The trained DT identified vsa_acc, h_logP, b_count, radius, PEOE_VSA-5, b_double, SlogP_VSA5, SMR_VSA2, Q_VSA_PNEG, and Kier3 as key descriptors of demethylating compounds targeting DNMT1. However, the overall vDW surface area of hydrogen bond acceptors (vsa_acc) was reflected as the most prominent and distinguishing 2D descriptor by the DT model. This feature has also been previously identified by <xref ref-type="bibr" rid="B11">Hassanzadeh et al. (2017</xref>), which further emphasizes the significance of HBA for the activity of DNMT1 modulators. Furthermore, the developed models were able to classify the training set and predict the test set with optimal accuracies (<xref ref-type="table" rid="T1">table 1</xref>). The final selected trained DT and ANN classifiers showed an optimal classification accuracy of 0.97 and high sensitivity and specificity along with MCC values of 0.93 and 0.92, respectively.</p>
<p>The subsequent screening of the diverse test set from the DT and the ANN model resulted in the classification accuracies of 0.83 and 0.72, respectively. However, the DT classifier outperformed the ANN model showing higher classification accuracy of 0.83 and sensitivity of 0.92. Notably, despite the structural diversity of the DNMT1 data set, the ML classifiers developed in the current study retained an optimal classification accuracy of 97% for the training data and greater than 70% for the test data. The relatively low predictive accuracy of ML models on the test set can be attributed to the diversity of DNMT1 inhibitors as they differ highly in terms of chemical structure, molecular weight, and other pharmacological variables such as clogP and lipophilicity. Therefore, the learned features of the training set might not be sufficient to fully predict and explain the behavior of the test set. Also, the FDA-approved drug (azacytidine) was fairly classified among most active modulators of DNMT1 by our developed model, which strengthens the classification and predictive ability of the ML models.</p>
</sec>
<sec id="s5">
<title>5 Conclusion</title>
<p>
<italic>RUNX3</italic> has been proposed as a potential biomarker in TNBC for an early prognosis, which is known to be downregulated by DNMT1. However, the precise mechanism of epigenetic-mediated silencing of TSG <italic>RUNX3,</italic> which results in cancer invasion and metastasis, has not yet been explored. This study deals with the formal modeling of the DNMT1&#x2013;<italic>RUNX3</italic> signaling and the development of ML models on a diverse data set of DNMT1 modulators. First, we employed a qualitative modeling approach to provide an insight into epigenetic-inspired <italic>RUNX3</italic> signaling. The results revealed that the onset of oncogene c-myc introduces pathogenesis in the system and its consistent activation along with persistent suppression of TSG <italic>RUNX3</italic> hyperactivates DNMT1 leading to cancer metastasis. Conversely, the activation of <italic>RUNX3</italic> leads the system to acquire normal homeostasis by transactivating other TSGs such as p21 and p53. Moreover, our findings advocate DNMT1 as a potential epigenetic drug target to revive the suppressed TSG <italic>RUNX3</italic> in breast cancer therapeutics. Second, predictive ML models (DT and ANN) have been developed that identify some potential 2D descriptors essential to modulate the DNMT1 activity, and the best-performing models have effectively classified the active and least-active inhibitors. The trained (DT and ANN) models acquired 97% classification accuracy on the training data set, and the subsequent screening of the test set through DT and ANN models achieved 83% and 72% predictive accuracy, respectively, emphasizing the optimal efficiency of the developed model. In general, the application of formal methods to unveil the network and model the underline genetic events responsible for DNMT1-inspired TSG <italic>RUNX3</italic> silencing along with ML approaches to predict the 2D attributes of hypomethylating compounds (targeting DNMT1) could present new computational avenues for the treatment of breast cancer requiring epigenetic therapy.</p>
</sec>
</body>
<back>
<sec id="s6" sec-type="data-availability">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/<xref ref-type="sec" rid="s11">Supplementary Material</xref>; further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="s7">
<title>Author Contributions</title>
<p>AA, IJ, and MS designed the study and performed the experiments. AA, YK, and IJ analyzed the data, collected materials/data/tools, wrote the manuscript, prepared figures/tables, analyzed the results, and reviewed the draft of the manuscript. AA and MS contributed analysis tools, investigated the results, reviewed and proofread the draft of the manuscript, and provided technical support. All authors approved the results of the final draft.</p>
</sec>
<sec id="s8">
<title>Funding</title>
<p>The publication fee for this research manuscript has been funded by the National University of Sciences and Technology, NUST.</p>
</sec>
<sec sec-type="COI-statement" id="s9">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s10">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="s11">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fmolb.2022.882738/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fmolb.2022.882738/full&#x23;supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="DataSheet1.docx" id="SM1" mimetype="application/docx" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ahmad</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Niazi</surname>
<given-names>U.</given-names>
</name>
<name>
<surname>Mansoor</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Siddique</surname>
<given-names>U.</given-names>
</name>
<name>
<surname>Bibby</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Formal Modeling and Analysis of the MAL-Associated Biological Regulatory Network: Insight into Cerebral Malaria</article-title>. <source>PLOS ONE</source> <volume>7</volume> (<issue>3</issue>), <fpage>e33532</fpage>. <pub-id pub-id-type="doi">10.1371/JOURNAL.PONE.0033532</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bernot</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Comet</surname>
<given-names>J.-P.</given-names>
</name>
<name>
<surname>Richard</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Guespin</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Application of Formal Methods to Biological Regulatory Networks: Extending Thomas&#x27; Asynchronous Logical Approach with Temporal Logic</article-title>. <source>J. Theor. Biol.</source> <volume>229</volume>, <fpage>339</fpage>&#x2013;<lpage>347</lpage>. <pub-id pub-id-type="doi">10.1016/j.jtbi.2004.04.003</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bibi</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Ahmad</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Siddiqa</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Paracha</surname>
<given-names>R. Z.</given-names>
</name>
<name>
<surname>Saeed</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Ali</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Formal Modeling of MTOR Associated Biological Regulatory Network Reveals Novel Therapeutic Strategy for the Treatment of Cancer</article-title>. <source>Front. Physiol.</source> <volume>8</volume> (<issue>JUN</issue>), <fpage>416</fpage>. <pub-id pub-id-type="doi">10.3389/FPHYS.2017.00416/BIBTEX</pub-id> </citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Bai</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Pei</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Zheng</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>The Emerging Role of RUNX3 in Cancer Metastasis (Review)</article-title>. <source>Oncol. Rep.</source> <volume>35</volume>, <fpage>1227</fpage>&#x2013;<lpage>1236</lpage>. <pub-id pub-id-type="doi">10.3892/or.2015.4515</pub-id> </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>L.-F.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Tumor Suppressor Function of RUNX3 in Breast Cancer</article-title>. <source>J. Cell. Biochem.</source>, <fpage>a</fpage>&#x2013;<lpage>n</lpage>. <pub-id pub-id-type="doi">10.1002/jcb.24074</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Cimatti</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Clarke</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Giunchiglia</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Giunchiglia</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Pistore</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Roveri</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2002</year>). &#x201c;<article-title>NuSMV 2: An Opensource Tool for Symbolic Model Checking</article-title>,&#x201d; in <source>Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics</source>, <fpage>359</fpage>&#x2013;<lpage>364</lpage>. <pub-id pub-id-type="doi">10.1007/3-540-45657-0_29</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Frank</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Hall</surname>
<given-names>M. A.</given-names>
</name>
<name>
<surname>Witten</surname>
<given-names>I. H.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>The WEKA Workbench</article-title>. <source>Data Min.</source>, <fpage>553</fpage>&#x2013;<lpage>571</lpage>. <pub-id pub-id-type="doi">10.1016/b978-0-12-804291-5.00024-6</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Freeman</surname>
<given-names>L. C.</given-names>
</name>
</person-group> (<year>20192019</year>). &#x201c;<article-title>A Set of Measures of Centrality Based on Betweenness</article-title>,&#x201d; in <source>American Sociological Association Stable</source> (<publisher-name>American Sociological Association Is Collaborating with JSTOR to Digit.&#x201d;</publisher-name>), <volume>40</volume>, <fpage>35</fpage>&#x2013;<lpage>41</lpage>. <pub-id pub-id-type="doi">10.2307/3033543</pub-id>
<issue>1</issue> </citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gnyszka</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Jastrz&#x119;bski</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Flis</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>DNA Methyltransferase Inhibitors and Their Emerging Role in Epigenetic Therapy of Cancer</article-title>. <source>Anticancer Res2996</source> <volume>33</volume>, <fpage>2989</fpage>&#x2013;<lpage>2996</lpage>. </citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Golbeck</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Analyzing Networks</article-title>. <source>Introd. Soc. Media Investigation</source>, <fpage>221</fpage>&#x2013;<lpage>235</lpage>. <pub-id pub-id-type="doi">10.1016/B978-0-12-801656-5.00021-4</pub-id> </citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hassanzadeh</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Kasymov</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Mahernia</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Adib</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Emperle</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Dukatz</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Discovery of Novel and Selective DNA Methyltransferase 1 Inhibitors by Pharmacophore and Docking-Based Virtual Screening</article-title>. <source>ChemistrySelect</source> <volume>2</volume> (<issue>27</issue>), <fpage>8383</fpage>&#x2013;<lpage>8392</lpage>. <pub-id pub-id-type="doi">10.1002/slct.201701734</pub-id> </citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hong</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Clinicopathological Significance and Potential Drug&#xa0;target of T-Cadherin in NSCLC</article-title>. <source>Dddt</source> <volume>9</volume>, <fpage>207</fpage>&#x2013;<lpage>216</lpage>. <pub-id pub-id-type="doi">10.2147/DDDT.S74259</pub-id> </citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jiang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Tong</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Lou</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Geng</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Expression of RUNX3 Gene, Methylation Status and Clinicopathological Significance in Breast Cancer and Breast Cancer Cell Lines</article-title>. <source>Pathobiology</source> <volume>75</volume> (<issue>4</issue>), <fpage>244</fpage>&#x2013;<lpage>251</lpage>. <pub-id pub-id-type="doi">10.1159/000132385</pub-id> </citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jiao</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Hu</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Q.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Machine Learning and Deep Learning in Chemical Health and Safety: A Systematic Review of Techniques and Applications</article-title>. <source>ACS Chem. Health Saf.</source> <volume>27</volume> (<issue>6</issue>), <fpage>316</fpage>&#x2013;<lpage>334</lpage>. <pub-id pub-id-type="doi">10.1021/acs.chas.0c00075</pub-id> </citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jung</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Park</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>T. Y.</given-names>
</name>
<name>
<surname>Jong</surname>
<given-names>H.-S.</given-names>
</name>
<name>
<surname>Im</surname>
<given-names>S.-A.</given-names>
</name>
<name>
<surname>Robertson</surname>
<given-names>K. D.</given-names>
</name>
<etal/>
</person-group> (<year>2007</year>). <article-title>Potential Advantages of DNA Methyltransferase 1 (DNMT1)-Targeted Inhibition for Cancer Therapy</article-title>. <source>J. Mol. Med.</source> <volume>85</volume> (<issue>10</issue>), <fpage>1137</fpage>&#x2013;<lpage>1148</lpage>. <pub-id pub-id-type="doi">10.1007/S00109-007-0216-Z</pub-id> </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Khalis</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Comet</surname>
<given-names>J-P.</given-names>
</name>
<name>
<surname>Richard</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Bernot</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>The SMBioNet Method for Discovering Models of Gene Regulatory Networks</article-title>. <source>Genes, Genomes Genomics</source> <volume>3</volume> (<issue>1</issue>), <fpage>15</fpage>&#x2013;<lpage>22</lpage>. </citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Koutsoukas</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Paricharak</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Galloway</surname>
<given-names>W. R. J. D.</given-names>
</name>
<name>
<surname>Spring</surname>
<given-names>D. R.</given-names>
</name>
<name>
<surname>IJzerman</surname>
<given-names>A. P.</given-names>
</name>
<name>
<surname>Glen</surname>
<given-names>R. C.</given-names>
</name>
<etal/>
</person-group> (<year>2013</year>). <article-title>How Diverse Are Diversity Assessment Methods? A Comparative Analysis and Benchmarking of Molecular Descriptor Space</article-title>. <source>J. Chem. Inf. Model.</source> <volume>54</volume> (<issue>1</issue>), <fpage>230</fpage>&#x2013;<lpage>242</lpage>. <pub-id pub-id-type="doi">10.1021/ci400469u</pub-id> </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Krishna</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Shukla</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Lakra</surname>
<given-names>A. D.</given-names>
</name>
<name>
<surname>Meeran</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Siddiqi</surname>
<given-names>M. I.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Identification of Potent Inhibitors of DNA Methyltransferase 1 (DNMT1) through a Pharmacophore-Based Virtual Screening Approach</article-title>. <source>J. Mol. Graph. Model.</source> <volume>75</volume>, <fpage>174</fpage>&#x2013;<lpage>188</lpage>. <pub-id pub-id-type="doi">10.1016/j.jmgm.2017.05.014</pub-id> </citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lau</surname>
<given-names>Q. C.</given-names>
</name>
<name>
<surname>Raja</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Salto-Tellez</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Ito</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Inoue</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2006</year>). <article-title>RUNX3 Is Frequently Inactivated by Dual Mechanisms of Protein Mislocalization and Promoter Hypermethylation in Breast Cancer</article-title>. <source>Cancer Res.</source> <volume>66</volume>, <fpage>6512</fpage>&#x2013;<lpage>6520</lpage>. <pub-id pub-id-type="doi">10.1158/0008-5472.CAN-06-0369</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Wan</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Xing</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Tan</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Y.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Deep Neural Network Classifier for Virtual Screening Inhibitors of (S)-Adenosyl-L-Methionine (SAM)-Dependent Methyltransferase Family</article-title>. <source>Front. Chem.</source> <volume>7</volume> (<issue>MAY</issue>), <fpage>1</fpage>&#x2013;<lpage>17</lpage>. <pub-id pub-id-type="doi">10.3389/fchem.2019.00324</pub-id> </citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mirza</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Sharma</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Parshad</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Gupta</surname>
<given-names>S. D.</given-names>
</name>
<name>
<surname>Pandya</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Ralhan</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Expression of DNA Methyltransferases in Breast Cancer Patients and to Analyze the Effect of Natural Compounds on DNA Methyltransferases and Associated Proteins</article-title>. <source>J. Breast Cancer</source> <volume>16</volume> (<issue>1</issue>), <fpage>23</fpage>&#x2013;<lpage>31</lpage>. <pub-id pub-id-type="doi">10.4048/jbc.2013.16.1.23</pub-id> </citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Phanus-Umporn</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Prachayasittikul</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Nantasenamat</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Prachayasittikul</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Prachayasittikul</surname>
<given-names>V.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>QSAR-driven Rational Design of Novel Dna Methyltransferase 1 Inhibitors</article-title>. <source>EXCLI J.</source> <volume>19</volume>, <fpage>458</fpage>&#x2013;<lpage>475</lpage>. <pub-id pub-id-type="doi">10.17179/EXCLI2020-1096</pub-id> </citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pouliot</surname>
<given-names>M. C.</given-names>
</name>
<name>
<surname>Labrie</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Diorio</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Durocher</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>The Role of Methylation in Breast Cancer Susceptibility and Treatment</article-title>. <source>Anticancer Res.</source> <volume>35</volume> (<issue>9</issue>), <fpage>4569</fpage>&#x2013;<lpage>4574</lpage>. <pub-id pub-id-type="doi">10.1007/s13566-015-0216-5</pub-id> </citation>
</ref>
<ref id="B24">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Richard</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Comet</surname>
<given-names>J.-P.</given-names>
</name>
<name>
<surname>Bernot</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2006</year>). &#x201c;<article-title>Formal Methods for Modeling Biological Regulatory Networks</article-title>,&#x201d; in <source>Modern Formal Methods and Applications</source>, <fpage>83</fpage>&#x2013;<lpage>122</lpage>. <pub-id pub-id-type="doi">10.1007/1-4020-4223-X_5</pub-id> </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Saeed</surname>
<given-names>M. T.</given-names>
</name>
<name>
<surname>Ahmad</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Baumbach</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Pauling</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Shafi</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Paracha</surname>
<given-names>R. Z.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Parameter Estimation of Qualitative Biological Regulatory Networks on High Performance Computing Hardware</article-title>. <source>BMC Syst. Biol.</source> <volume>12</volume> (<issue>1</issue>), <fpage>1</fpage>&#x2013;<lpage>15</lpage>. <pub-id pub-id-type="doi">10.1186/s12918-018-0670-y</pub-id> </citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Saeed</surname>
<given-names>M. T.</given-names>
</name>
<name>
<surname>Ahmad</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Kanwal</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Holowatyj</surname>
<given-names>A. N.</given-names>
</name>
<name>
<surname>Sheikh</surname>
<given-names>I. A.</given-names>
</name>
<name>
<surname>Zafar Paracha</surname>
<given-names>R.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>Formal Modeling and Analysis of the Hexosamine Biosynthetic Pathway: Role of O-Linked N-Acetylglucosamine Transferase in Oncogenesis and Cancer Progression</article-title>. <source>PeerJ</source> <volume>4</volume> (<issue>9</issue>), <fpage>e2348</fpage>&#x2013;<lpage>32</lpage>. <pub-id pub-id-type="doi">10.7717/peerj.2348</pub-id> </citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shin</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>LeeLee</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Koo</surname>
<given-names>J. S.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Differential Expression of the Epigenetic Methylation-Related Protein DNMT1 by Breast Cancer Molecular Subtype and Stromal Histology</article-title>. <source>J. Transl. Med.</source> <volume>14</volume> (<issue>1</issue>), <fpage>1</fpage>&#x2013;<lpage>11</lpage>. <pub-id pub-id-type="doi">10.1186/s12967-016-0840-x</pub-id> </citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Subramaniam</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Thombre</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Dhar</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Anant</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>DNA Methyltransferases: A Novel Target for Prevention and Therapy</article-title>. <source>Front. Oncol.</source> <volume>4</volume>. <pub-id pub-id-type="doi">10.3389/fonc.2014.00080</pub-id> </citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Subramaniam</surname>
<given-names>M. M.</given-names>
</name>
<name>
<surname>Chan</surname>
<given-names>J. Y.</given-names>
</name>
<name>
<surname>Yeoh</surname>
<given-names>K. G.</given-names>
</name>
<name>
<surname>Quek</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Ito</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Salto-Tellez</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Molecular Pathology of RUNX3 in Human Carcinogenesis</article-title>. <source>Biochimica Biophysica Acta (BBA) - Rev. Cancer</source> <volume>1796</volume>, <fpage>315</fpage>&#x2013;<lpage>331</lpage>. <pub-id pub-id-type="doi">10.1016/j.bbcan.2009.07.004</pub-id> </citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sun</surname>
<given-names>Y.-S.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>YangYang</surname>
<given-names>Z.-N.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>LuZhu</surname>
<given-names>H.-J. Zhi. Yong.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>Z.-Y.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Risk Factors and Preventions of Breast Cancer</article-title>. <source>Int. J. Biol. Sci.</source> <volume>13</volume>, <fpage>1387</fpage>&#x2013;<lpage>1397</lpage>. <pub-id pub-id-type="doi">10.7150/ijbs.21635</pub-id> </citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tang</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Xiao</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Potential of DNMT and its Epigenetic Regulation for Lung Cancer Therapy</article-title>. <source>Cg</source> <volume>10</volume>, <fpage>336</fpage>&#x2013;<lpage>352</lpage>. <pub-id pub-id-type="doi">10.2174/138920209788920994</pub-id> </citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Thomas</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>1978</year>). <article-title>Logical Analysis of Systems Comprising Feedback Loops</article-title>. <source>J. Theor. Biol</source>. <pub-id pub-id-type="doi">10.1016/0022-5193(78)90127-3</pub-id> </citation>
</ref>
<ref id="B33">
<citation citation-type="book">
<collab>ULC, Chemical Computing Group</collab> (<year>2018</year>). <source>Molecular Operating Environment (MOE), 2013.08.&#x201d; 1010 Sherbooke St. West, Suite &#x23;910</source>. <publisher-loc>Montreal, QC, Canada</publisher-loc>. <comment>
<italic>H3A 2R7</italic>
</comment>. </citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Widschwendter</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Jones</surname>
<given-names>P. A.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>DNA Methylation and Breast Carcinogenesis</article-title>. <source>Oncogene</source> <volume>21</volume> (<issue>35</issue>), <fpage>5462</fpage>&#x2013;<lpage>5482</lpage>. <pub-id pub-id-type="doi">10.1038/SJ.ONC.1205606</pub-id> </citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wildman</surname>
<given-names>S. A.</given-names>
</name>
<name>
<surname>Crippen</surname>
<given-names>G. M.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Prediction of Physicochemical Parameters by Atomic Contributions</article-title>. <source>J. Chem. Inf. Comput. Sci.</source> <volume>39</volume>, <fpage>868</fpage>&#x2013;<lpage>873</lpage>. <pub-id pub-id-type="doi">10.1021/ci990307l</pub-id> </citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname>
<given-names>S.-Y.</given-names>
</name>
<name>
<surname>Xiao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>WeiWei</surname>
<given-names>J.-L.</given-names>
</name>
<name>
<surname>XuXu</surname>
<given-names>X.-E.</given-names>
</name>
<name>
<surname>Jin</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Hu</surname>
<given-names>X.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>MYC Suppresses STING-dependent Innate Immunity by Transcriptionally Upregulating DNMT1 in Triple-Negative Breast Cancer</article-title>. <source>J. Immunother. Cancer</source> <volume>9</volume> (<issue>7</issue>), <fpage>e002528</fpage>. <pub-id pub-id-type="doi">10.1136/jitc-2021-002528</pub-id> </citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yoo</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>J. H.</given-names>
</name>
<name>
<surname>Robertson</surname>
<given-names>K. D.</given-names>
</name>
<name>
<surname>Medina-Franco</surname>
<given-names>J. L.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Molecular Modeling of Inhibitors of Human DNA Methyltransferase with a Crystal Structure</article-title>. <source>Eighth Edi</source> <volume>87</volume>, <fpage>219</fpage>&#x2013;<lpage>247</lpage>. <pub-id pub-id-type="doi">10.1016/B978-0-12-398312-1.00008-1</pub-id> </citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yoo</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Medina-Franco</surname>
<given-names>J. L.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Homology Modeling, Docking and Structure-Based Pharmacophore of Inhibitors of DNA Methyltransferase</article-title>. <source>J. Comput. Aided Mol. Des.</source> <volume>25</volume>, <fpage>555</fpage>&#x2013;<lpage>567</lpage>. <pub-id pub-id-type="doi">10.1007/s10822-011-9441-1</pub-id> </citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yosipof</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Guedes</surname>
<given-names>R. C.</given-names>
</name>
<name>
<surname>Garc&#xed;a-Sosa</surname>
<given-names>A. T.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Data Mining and Machine Learning Models for Predicting Drug Likeness and Their Disease or Organ Category</article-title>. <source>Front. Chem.</source> <volume>6</volume> (<issue>MAY</issue>), <fpage>1</fpage>&#x2013;<lpage>11</lpage>. <pub-id pub-id-type="doi">10.3389/fchem.2018.00162</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>