<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Epigenet. Epigenom.</journal-id>
<journal-title>Frontiers in Epigenetics and Epigenomics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Epigenet. Epigenom.</abbrev-journal-title>
<issn pub-type="epub">2813-706X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">1473789</article-id>
<article-id pub-id-type="doi">10.3389/freae.2024.1473789</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Epigenetics and Epigenomics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>MicroRNAs provide negative feedback and stability in gene regulatory network models of cell-state transitions</article-title>
<alt-title alt-title-type="left-running-head">Razavi-Mohseni and Beer</alt-title>
<alt-title alt-title-type="right-running-head">
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/freae.2024.1473789">10.3389/freae.2024.1473789</ext-link>
</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Razavi-Mohseni</surname>
<given-names>Milad</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/895943/overview"/>
<role content-type="https://credit.niso.org/contributor-roles/Writing - review &#x26; editing/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-original-draft/"/>
<role content-type="https://credit.niso.org/contributor-roles/formal-analysis/"/>
<role content-type="https://credit.niso.org/contributor-roles/conceptualization/"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Beer</surname>
<given-names>Michael A.</given-names>
</name>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/2765024/overview"/>
<role content-type="https://credit.niso.org/contributor-roles/Writing - review &#x26; editing/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-original-draft/"/>
<role content-type="https://credit.niso.org/contributor-roles/supervision/"/>
<role content-type="https://credit.niso.org/contributor-roles/formal-analysis/"/>
<role content-type="https://credit.niso.org/contributor-roles/conceptualization/"/>
</contrib>
</contrib-group>
<aff>
<institution>Department of Biomedical Engineering and McKusick-Nathans Department of Genetic Medicine</institution>, <institution>Johns Hopkins University</institution>, <addr-line>Baltimore</addr-line>, <addr-line>MD</addr-line>, <country>United States</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/2080929/overview">Sharon YR Dent</ext-link>, University of Texas MD Anderson Cancer Center, United States</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/2649002/overview">Hina Sultana</ext-link>, University of North Carolina System, United States</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/2819241/overview">Blerta Xhemalce</ext-link>, The University of Texas at Austin, United States</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Michael A. Beer, <email>mbeer@jhu.edu</email>
</corresp>
</author-notes>
<pub-date pub-type="epub">
<day>02</day>
<month>10</month>
<year>2024</year>
</pub-date>
<pub-date pub-type="collection">
<year>2024</year>
</pub-date>
<volume>2</volume>
<elocation-id>1473789</elocation-id>
<history>
<date date-type="received">
<day>31</day>
<month>07</month>
<year>2024</year>
</date>
<date date-type="accepted">
<day>16</day>
<month>09</month>
<year>2024</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2024 Razavi-Mohseni and Beer.</copyright-statement>
<copyright-year>2024</copyright-year>
<copyright-holder>Razavi-Mohseni and Beer</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>The development of multicellular organisms occurs through a series of cell state transitions controlled by gene regulatory networks. Central to these networks are transcription factors (TFs) which bind enhancers and activate the expression of other genes, some of which are also TFs. Gene regulatory networks (GRN) connect TFs and enhancers in a nonlinear circuit capable of producing complex behavior such as bifurcations between stable cell states. Our dynamic network modelling of the Embryonic Stem Cell (ESC) to Definitive Endoderm (DE) transition requires an as yet unknown negative feedback mechanism for stability. Here, we show that cell state specific microRNAs (miRNAs) can provide this negative feedback by inactivating other cell lineage determining TFs (ESC or DE) during the transition. Our model provides a mechanism to maintain stable cell states without requiring a large set of cell-type-specific repressive TFs, of which there are fewer known examples than activators. In support of this model, we use computational models and analyze gene and miRNA expression and chromatin accessibility data from human cell lines to detect enhancers activating the miRNAs consistent with our network model. Our analysis highlights the interplay between TFs and miRNAs during ESC to DE transition and proposes a novel model for gene regulation.</p>
</abstract>
<kwd-group>
<kwd>gene regulatory network (GRN)</kwd>
<kwd>microRNA</kwd>
<kwd>machine learning</kwd>
<kwd>cell-state transitions</kwd>
<kwd>dynamic network modeling</kwd>
<kwd>enhancers</kwd>
</kwd-group>
<contract-num rid="cn001">R01 HG012367</contract-num>
<contract-sponsor id="cn001">National Human Genome Research Institute<named-content content-type="fundref-id">10.13039/100000051</named-content>
</contract-sponsor>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Chromatin Epigenomics</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p>Cell state transitions and their proper regulation are central to the development of multicellular organisms. These transitions are controlled by gene regulatory networks which exhibit complex behavior in response to developmental and environmental perturbations via the nonlinear interactions between transcription factor (TF) binding at enhancers that activate other TF genes. We refer to lineage-specific TFs as core TFs, and their transcription factor binding sites (TFBS) can quantitatively predict the active enhancers (accessible peaks) in that cell type. Differentiated cell types are stable states of these gene regulatory networks (<xref ref-type="bibr" rid="B9">Davidson, 2010</xref>; <xref ref-type="bibr" rid="B6">Beer et al., 2020</xref>). In addition to being important for understanding congenital developmental disorders, gene regulatory networks are disrupted in cancer (<xref ref-type="bibr" rid="B53">Xing et al., 2020</xref>; <xref ref-type="bibr" rid="B54">Xu et al., 2023</xref>; <xref ref-type="bibr" rid="B44">Sheng et al., 2021</xref>; <xref ref-type="bibr" rid="B18">Ho et al., 2023</xref>; <xref ref-type="bibr" rid="B41">Razavi-Mohseni et al., 2024</xref>), and quantitative models may aid the identification of effective therapeutic targets.</p>
<p>We have recently developed quantitative dynamic gene regulatory network (GRN) models (<xref ref-type="bibr" rid="B32">Luo et al., 2023</xref>) of this cell state transition process, which reproduce many intriguing experimental observations in the embryonic stem cell (ESC) to definitive endoderm (DE) transition (<xref ref-type="bibr" rid="B32">Luo et al., 2023</xref>). In these experiments, the differentiation from ESC to DE is induced while perturbing enhancers with CRISPRi. Perturbation of single enhancers flanking core DE TF genes caused a significant decrease in expression of the core DE TF genes, but the transcriptional response was transient, and eventually fully recovered to unperturbed levels, in spite of the fact that ATAC-seq verified that the CRISPRi targeted enhancers remained inaccessible (<xref ref-type="bibr" rid="B32">Luo et al., 2023</xref>). This dynamic effect is explained in a nonlinear model by autoregulation of the cell-specific TFs (of which there are many, perhaps 5-10 per cell type) and redundant enhancers (of which there are also typically 5-10 enhancers per core TF gene). If only one enhancer is perturbed, the other enhancers are usually sufficient to activate the core TFs. In contrast, if the promoter is perturbed, the gene remains inactive. An additional component of this modeling is that all enhancers within a CTCF loop, or topologically associating domain (TAD), can activate the target gene, while those outside a CTCF loop have little or no effect (<xref ref-type="bibr" rid="B32">Luo et al., 2023</xref>). This observation greatly simplifies the complex problem of enhancer-promoter interactions (<xref ref-type="bibr" rid="B51">Xi and Beer, 2018</xref>; <xref ref-type="bibr" rid="B15">Gschwind et al., 2023</xref>; <xref ref-type="bibr" rid="B57">Yao et al., 2024</xref>) needed for modeling of gene regulatory networks, as CTCF binding and its action as anchors to cohesin extruded loops are, to a large degree, predictable with simple models (<xref ref-type="bibr" rid="B52">Xi and Beer, 2021</xref>) and are largely independent of cell type.</p>
<p>However, in this model (<xref ref-type="bibr" rid="B32">Luo et al., 2023</xref>) we only focused on the activation of core DE TF genes, and not the concomitant inactivation of the core ESC TF genes. In our earlier modeling (<xref ref-type="bibr" rid="B30">Li et al., 2019</xref>; <xref ref-type="bibr" rid="B6">Beer et al., 2020</xref>), we were able to model stable bifurcations between cell states by including negative feedback between core DE TFs (GATA6, SOX17, EOMES, MIXL1) and core ESC TFs (POU5F1(OCT4), SOX2, NANOG), as shown in <xref ref-type="fig" rid="F1">Figure 1A</xref> (TF expression shown in <xref ref-type="sec" rid="s10">Supplementary Figure S1</xref>). However, this mechanism requires a set of repressive TFs, or a repressive configuration of otherwise activating TFs, specific to each individual cell type, which may be difficult to satisfy given the relatively few known repressive TFs compared to activators. Our large-scale machine learning analysis of chromatin accessibility in a broad range of cell types and tissues in the ENCODE project identifies TFBS for some known repressive TFs, but we find that the majority of cell-specific TFBS signals we learn are activating. To be clear, by this we mean that the predictive cell-specific sequence features (TFBS) are dominated by those that have a positive model weight, increase the accessibility of peaks, and are recognized as TFBS active in the cell type. Yet it is experimentally clear that inducing the DE state somehow inactivates the ESC TFs. We do not know the direct consequences of having both ESC and DE TFs active at the same time, but it likely disrupts the proper expression of downstream genes required for cellular viability and function. Similarly, prior to the induction of DE, without a repressive effect of ESC TFs on DE TFs, autoregulation of DE TFs could inappropriately activate in response to signaling fluctuations or noise. Thus, our modeling implies that stable and precise development of all cell types (not only ESC and DE) requires some negative feedback acting on core TFs of cell types immediately upstream and downstream in the lineage.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>GRN models of cell-state transitions occur via activation of cell-specific TFs, but also require repressive mechanisms to inactivate TFs of the previous cell-state. <bold>(A)</bold> Autoregulatory GRN cell state transitions require an unknown negative feedback mechanism of cell type 1&#x2019;s TFs on cell type 2&#x2019;s TFs, and <italic>vice versa</italic>. <bold>(B)</bold> miRNAs activated in one cell type can provide this negative feedback mechanism by targeting the 3&#x2032; UTRs and translationally repressing the TFs of the other cell type; Gene regulatory network (GRN), microRNA (miRNA), embryonic stem cell (ESC), definitive endoderm (DE).</p>
</caption>
<graphic xlink:href="freae-02-1473789-g001.tif"/>
</fig>
<p>While many TF genes regulate gene expression through activation, other classes of genes, such as microRNAs (miRNAs), affect gene regulation largely through direct translational repression of their targets. miRNAs are &#x223c;22 nucleotide-long non-coding RNAs which, through their seed regions (&#x223c;7 nucleotides), bind to the 3&#x2032; untranslated region (3&#x2032; UTR) of their target mRNA and reduce the rate of translation (<xref ref-type="bibr" rid="B25">Lee et al., 1993</xref>; <xref ref-type="bibr" rid="B4">Bartel, 2004</xref>; <xref ref-type="bibr" rid="B26">Lewis et al., 2005</xref>). miRNAs play important roles in various biological processes such as apoptosis, cancer progression and embryonic differentiation (<xref ref-type="bibr" rid="B7">Chang et al., 2007</xref>; <xref ref-type="bibr" rid="B16">Hayes et al., 2014</xref>; <xref ref-type="bibr" rid="B47">Tay et al., 2008</xref>).</p>
<p>In this paper, we will present analysis in support of the hypothesis that miRNAs can act as a required negative feedback mechanism to prevent the misexpression of TFs whose activation is associated with immediate precursor and/or downstream cell types in the lineage. The pervasive autoregulation of core TFs makes activation of neighboring cell types susceptible to signaling noise and fluctuations which could disrupt normal development. We will show that negative feedback is required to repress precursor and downstream cell type TFs and provide stability to cell fates. As shown schematically in <xref ref-type="fig" rid="F1">Figure 1B</xref>, this could be achieved if DE TFs transcriptionally activate miRNAs which translationally repress ESC TFs, and if ESC TFs transcriptionally activate miRNAs which translationally repress DE TFs. Earlier work has shown that miRNAs can produce bifurcated cell-state transitions, but did not directly model core TF or miRNA enhancer activation (<xref ref-type="bibr" rid="B59">Zhao et al., 2019</xref>; <xref ref-type="bibr" rid="B22">Lai et al., 2016</xref>). DICER1 is required for human ESC renewal (<xref ref-type="bibr" rid="B48">Teijeiro et al., 2018</xref>), consistent with the possibility that lack of miRNA repression induces inappropriate expression of DE or other germ layer TFs and leads to slow cell death over time. We will present machine learning analysis of functional epigenomic datasets in the early ESC to DE transition in support of miRNAs potentially acting as the required negative feedback mechanism through enhancer-driven differential miRNA activation.</p>
</sec>
<sec sec-type="results" id="s2">
<title>Results</title>
<sec id="s2-1">
<title>Machine learning detects many activating TFs and few repressive TFs</title>
<p>We used a machine learning approach, gapped-kmer SVM (gkm-SVM) (<xref ref-type="bibr" rid="B12">Ghandi et al., 2014a</xref>; <xref ref-type="bibr" rid="B6">Beer et al., 2020</xref>; <xref ref-type="bibr" rid="B24">Lee et al., 2015</xref>) to identify TFBS driving variations in chromatin accessibility during cell state transitions. We chose gkm-SVM for its combination of predictive accuracy and interpretability (<xref ref-type="bibr" rid="B12">Ghandi et al., 2014a</xref>; <xref ref-type="bibr" rid="B6">Beer et al., 2020</xref>; <xref ref-type="bibr" rid="B24">Lee et al., 2015</xref>; <xref ref-type="bibr" rid="B46">Shigaki et al., 2019</xref>; <xref ref-type="bibr" rid="B5">Beer, 2017</xref>; <xref ref-type="bibr" rid="B56">Yan et al., 2021</xref>; <xref ref-type="bibr" rid="B21">Kreimer et al., 2017</xref>; <xref ref-type="bibr" rid="B11">Gate et al., 2018</xref>; <xref ref-type="bibr" rid="B36">Mo et al., 2016</xref>; <xref ref-type="bibr" rid="B19">Jain et al., 2024</xref>), and because of its advantages relative to deep neural networks (DNNs) when training on smaller numbers of differentially active peaks during a transition between related cell types (&#x223c;2000 peaks). However, we have shown that gkm-SVM and DNNs detect similar features when DNNs produce a robust model (<xref ref-type="bibr" rid="B45">Shigaki, 2024</xref>). We trained a gkm-SVM model on the top 10,000 distal ATAC-seq peaks against random negative genomic sequences, following our standard pipeline (<xref ref-type="bibr" rid="B6">Beer et al., 2020</xref>), for ESC and DE (after 2&#xa0;days of differentiation, DE-D2). Each gkm-SVM model summarizes the TFBS required to classify the ATAC-seq peaks in a vector of gapped kmer weights which can predict the accessibility of the sequences. These weight vectors have a tail of large positive weight kmers which map to activating TFs expressed or active in the cell type, and a much weaker tail of negative weight or repressive TFs, as shown in <xref ref-type="fig" rid="F2">Figure 2A</xref> for ESC and DE. In <xref ref-type="fig" rid="F2">Figure 2A</xref>, we also show the average of all weight vectors trained from every ENCODE DNase-seq dataset (1270 cell/tissues), and again find an imbalance between activating and repressive TFBS. Within the repressive tail of negative weight kmers learned in the full ENCODE set of tissues, we do learn kmers mapping to known repressive binding sites, most commonly SNAI1/2, TWIST, and ZEB1/2 (5&#x2032;-CACCT-3&#x2032; motif) which are repressors known to play a key role in EMT and cancer (<xref ref-type="bibr" rid="B8">Craene and Berx, 2013</xref>; <xref ref-type="bibr" rid="B50">Vandewalle et al., 2008</xref>); GLI3, GLIS2, and ZBTB7A (5&#x2032;-GACCCC-3&#x2032; motif) known to have a repressive role in Hedgehog signaling (<xref ref-type="bibr" rid="B28">Lex et al., 2022</xref>; <xref ref-type="bibr" rid="B27">2020</xref>) and fetal hemoglobin switching (<xref ref-type="bibr" rid="B31">Liu et al., 2021</xref>); and nuclear hormone receptor monomer and dimer elements (5&#x2032;-AGGTCA-3&#x2032;) bound by large families of co-repressors (<xref ref-type="bibr" rid="B39">Perissi et al., 2010</xref>). We also learn non-ZEB-like E-box motifs (5&#x2032;-CAGNTG-3&#x2032;) which are usually activating, but repressive in heart and fibroblasts, where they could be bound by E-box binding helix-loop-helix TFs (e.g., E47, E2A, also known as TCF3) and targeted by repressive ID factors (<xref ref-type="bibr" rid="B40">Perk et al., 2005</xref>). Although we detect several well documented repressive TFBS, in all of our ENCODE models the negative weight tail of repressive TFBS is only slightly enriched above the expected null normal distribution, while the positive weight tail is dramatically enriched in all cell/tissues and contains hundreds of activating cell-specific TFs (5-10 per cell/tissue). In addition, we find the repressive TFBS in broader sets of cell types and tissues than the activators, so while they are clearly playing an important repressive role, there may not be cell-specific repressors unique to each cell type. For the specific case of the ESC to DE cell-state transition, we trained models on ESC and DE ATAC-seq peaks, and extracted motifs from each weight distribution (Methods). We find motifs for the expected regulators OCT4-SOX2-NANOG in ESC and GATA in DE (<xref ref-type="fig" rid="F2">Figure 2B</xref>), but we found no or very weak negative weight motifs. Consistent with the results from our machine learning models, a systematic study of TF binding in HepG2 cells found that 34 out of 35&#xa0;TFs with strongly reproducible effects on expression were activating, while only REST was repressive (fraction significant &#x3e;0.8, &#x7c;median effect estimate&#x7c;&#x3e;0.5) (<xref ref-type="bibr" rid="B37">Moyers et al., 2023</xref>). The miRNA repression analyzed in this paper is an alternative repressive mechanism.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Sequence features identified by machine learning models can explain chromatin accessibility profiles during cell-state transitions with many activating TFBS, and very few repressive TFBS. <bold>(A)</bold> Machine learning models detect a strong tail of activating positive weight kmers mapping to TFBS in ESC, DE, and in all ENCODE cell/tissues, with only a few repressive TFBS detected (little deviation from expected null normal distribution on the negative tail). <bold>(B)</bold> gkm-SVM models detect expected regulators OCT4-SOX2-NANOG in ESC, and GATA in DE and only weak negative weight signals. <bold>(C)</bold> When trained on differentially active peaks in ESC vs. DE-D2, again only the activating signals are learned, with <bold>(D)</bold> OCT4-SOX2-NANOG with positive weight and GATA and EOMES with negative weight; DE day 2 (DE-D2), 10k randomly generated GC-matched genomic sequences (neg); LASSO regression weight on the gkm-SVM weight space is denoted by &#x201c;W.&#x201d; &#x201c;Z&#x201d; is the Z-score of the motif in gkm-SVM weight distribution. &#x201c;I&#x201d; is the relative error increase after deletion of the particular motif from the motif list explaining the weight vector.</p>
</caption>
<graphic xlink:href="freae-02-1473789-g002.tif"/>
</fig>
<p>We next trained a model on the most differentially active peaks in ESC vs. DE (n &#x3d; 3000 in each set). Now OCT4-SOX2-NANOG is learned as a positive signal, and GATA, EOMES, SMAD, and MIXL1 (Crx homeodomain motif) are learned as negative signals. This training design (ESC peaks vs. DE peaks) more sensitively detects the motifs required to explain all the accessibility changes induced during the cell-state transition from ESC to DE. Again, the known activating TFBS signals OCT4-SOX2-NANOG and GATA, EOMES, SMAD, and MIXL1 are learned (the same motifs learned when trained against negative genomic regions are learned as contributing positively to that cell type&#x2019;s differentially active peaks). If repressive TFs were contributing broadly to changes in ESC and DE chromatin accessibility, we would expect to see them. In some cases, we do learn an EBOX motif (5&#x2032;-CANNTG-3&#x2032;, which as discussed above can be repressive in some cell types) as contributing to ESC specific peaks, but this is most likely bound by MYC, which is strongly expressed in ESC but is downregulated in DE. This analysis does not preclude the possibility that DE TFs are activating a repressive TF that is only binding at a few core ESC TF enhancers (those flanking POU5F1 (OCT4), SOX2, and NANOG), but if this were a dominant mechanism, we would also expect to see the repressive TF having an effect at other peaks which have weak binding sites for the TF, and thus should appear in the weight vector. In conclusion, we see little evidence for a strong repressive TF expressed in DE which shuts down the ESC core TF enhancers, and little evidence for a strong repressive TF expressed in ESC which keeps the DE TFs repressed. The PRC2 complex may play this repressive role by being recruited to specific promoters, but how the complex achieves cell-specificity is unclear (<xref ref-type="bibr" rid="B58">Youmans et al., 2021</xref>).</p>
</sec>
<sec id="s2-2">
<title>Dynamic gene regulatory network modeling</title>
<p>The gene regulatory network model (<xref ref-type="bibr" rid="B32">Luo et al., 2023</xref>) we developed to understand the dynamic effect of CRISPRi enhancer perturbation is a two-state transcriptional model with a large enhancer activated transcription rate, <inline-formula id="inf1">
<mml:math id="m1">
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, and a low basal transcription rate, <inline-formula id="inf2">
<mml:math id="m2">
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> (see <xref ref-type="fig" rid="F3">Figure 3A</xref> for GRN model variables). When we learn the core TFs in any cell type with gkm-SVM, we also find binding sites for multiple core TFs in each core TF enhancer. For simplicity, we model this cooperative autoregulatory activity by assuming that all core TFs are activated together, and we call this activation <inline-formula id="inf3">
<mml:math id="m3">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> for cell type 1 and <inline-formula id="inf4">
<mml:math id="m4">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> for cell type 2. <inline-formula id="inf5">
<mml:math id="m5">
<mml:mrow>
<mml:mi>&#x3c8;</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> describes how the concentration of core TFs for each cell type vary as a function of time (Methods). Each cell type&#x2019;s enhancers are active proportional to <inline-formula id="inf6">
<mml:math id="m6">
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:msup>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>3</mml:mn>
</mml:msup>
</mml:mrow>
<mml:mrow>
<mml:mi>b</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>c</mml:mi>
<mml:msup>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>3</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
</inline-formula> and inactive proportional to <inline-formula id="inf7">
<mml:math id="m7">
<mml:mrow>
<mml:mfrac>
<mml:mi>b</mml:mi>
<mml:mrow>
<mml:mi>b</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>c</mml:mi>
<mml:msup>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>3</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
</inline-formula>. We will assume that miRNAs are similarly transcriptionally activated by enhancers, with the same rates, and repress the other cell type&#x2019;s core TF activity with an additional rate <inline-formula id="inf8">
<mml:math id="m8">
<mml:mrow>
<mml:mi>d</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>. Relaxing the assumption that core TFs and miRNAs are transcribed at the same rates <inline-formula id="inf9">
<mml:math id="m9">
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf10">
<mml:math id="m10">
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> does not qualitatively change our findings. The translational repression rate, <inline-formula id="inf11">
<mml:math id="m11">
<mml:mrow>
<mml:mi>d</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, includes miRNA processing and nuclear transport. Together this yields two coupled nonlinear differential equations for <inline-formula id="inf12">
<mml:math id="m12">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf13">
<mml:math id="m13">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> as shown in <xref ref-type="fig" rid="F3">Figure 3A</xref>. We have also added a stimulus, <inline-formula id="inf14">
<mml:math id="m14">
<mml:mrow>
<mml:mi>&#x3b4;</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, which acts at a subset of enhancers to induce the transition from cell type 1 to 2 by activating enhancers of cell type 2 through a mechanism independent of <inline-formula id="inf15">
<mml:math id="m15">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. Further details of the model are provided in Methods.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>A GRN model with miRNAs activated by enhancers in one cell type, which target TFs of the other cell type, can produce stable cell-state bifurcations. <bold>(A)</bold> Two-state autoregulatory GRN model with miRNA negative feedback. <bold>(B)</bold> With weak negative feedback (<italic>d</italic> &#x3d; 0.1), cell type 1&#x2019;s TFs remain on after a stimulus, <inline-formula id="inf16">
<mml:math id="m16">
<mml:mrow>
<mml:mi>&#x3b4;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, activates cell type 2&#x2019;s TFs. With modest negative feedback (<italic>d</italic> &#x3d; 1), cell type 1&#x2019;s TFs are repressed as cell type 2&#x2019;s TFs are activated. <bold>(C)</bold> Phase plane analysis shows a stable fixed point with both cell types&#x2019; TFs activated at <italic>d</italic> &#x3d; 0.1 <inline-formula id="inf17">
<mml:math id="m17">
<mml:mrow>
<mml:mfenced open="" close="(" separators="&#x7c;">
<mml:mrow>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:math>
</inline-formula>2.3,2.3), which becomes unstable with miRNA repression (<italic>d</italic> &#x3d; 1).</p>
</caption>
<graphic xlink:href="freae-02-1473789-g003.tif"/>
</fig>
<p>In the limit <inline-formula id="inf18">
<mml:math id="m18">
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, these two equations are completely uncoupled, and each has a stable OFF state at <inline-formula id="inf19">
<mml:math id="m19">
<mml:mrow>
<mml:mi>&#x3c8;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>/</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> and a stable ON state at <inline-formula id="inf20">
<mml:math id="m20">
<mml:mrow>
<mml:mi>&#x3c8;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>/</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>. When initial conditions are near the OFF state, a stimulus is required to transition into the ON state, but for <inline-formula id="inf21">
<mml:math id="m21">
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, and even for small <inline-formula id="inf22">
<mml:math id="m22">
<mml:mrow>
<mml:mi>d</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, it is possible to activate <inline-formula id="inf23">
<mml:math id="m23">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf24">
<mml:math id="m24">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> at the same time. As discussed in the introduction, this likely causes inappropriate expression of many downstream genes and interferes with normal cell function and viability. For small <inline-formula id="inf25">
<mml:math id="m25">
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0.1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, with a stimulus <inline-formula id="inf26">
<mml:math id="m26">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b4;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0.3</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> (see Methods) sufficient to induce the activation of cell type 2, <inline-formula id="inf27">
<mml:math id="m27">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> becomes activated (<xref ref-type="fig" rid="F3">Figure 3B</xref>, left), but cell type 1 remains active. In contrast, for larger miRNA repression of cell type 1&#x2019;s TFs <inline-formula id="inf28">
<mml:math id="m28">
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1.0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, at the same stimulus (<inline-formula id="inf29">
<mml:math id="m29">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b4;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0.3</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>), when <inline-formula id="inf30">
<mml:math id="m30">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> becomes activated, the activation of miRNAs repressing <inline-formula id="inf31">
<mml:math id="m31">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> TFs are sufficient to drive <inline-formula id="inf32">
<mml:math id="m32">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> fully to the OFF state (<xref ref-type="fig" rid="F3">Figure 3B</xref>, right).</p>
<p>Further understanding of this behavior is provided by phase plane analysis in <xref ref-type="fig" rid="F3">Figure 3C</xref>. For <inline-formula id="inf33">
<mml:math id="m33">
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0.1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, there is a stable fixed point at <inline-formula id="inf34">
<mml:math id="m34">
<mml:mrow>
<mml:mfenced open="" close="(" separators="&#x7c;">
<mml:mrow>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:math>
</inline-formula>2.3,2.3) (<xref ref-type="fig" rid="F3">Figure 3C</xref>, left), showing that a stable solution exists with both ESC and DE TFs activated. With sufficiently strong negative feedback, this stable fixed point disappears (<inline-formula id="inf35">
<mml:math id="m35">
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mo>&#x3e;</mml:mo>
<mml:mn>0.4</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>), and the only three stable fixed points are with <inline-formula id="inf36">
<mml:math id="m36">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> in the ON state and <inline-formula id="inf37">
<mml:math id="m37">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> in the OFF state <inline-formula id="inf38">
<mml:math id="m38">
<mml:mrow>
<mml:mfenced open="" close="(" separators="&#x7c;">
<mml:mrow>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:math>
</inline-formula>3,0), <inline-formula id="inf39">
<mml:math id="m39">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> in the OFF state and <inline-formula id="inf40">
<mml:math id="m40">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> in the ON state <inline-formula id="inf41">
<mml:math id="m41">
<mml:mrow>
<mml:mfenced open="" close="(" separators="&#x7c;">
<mml:mrow>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:math>
</inline-formula>0,3), and both <inline-formula id="inf42">
<mml:math id="m42">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf43">
<mml:math id="m43">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> in the OFF state <inline-formula id="inf44">
<mml:math id="m44">
<mml:mrow>
<mml:mfenced open="" close="(" separators="&#x7c;">
<mml:mrow>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:math>
</inline-formula>0,0) as shown in <xref ref-type="fig" rid="F3">Figure 3C</xref> (right), for <inline-formula id="inf45">
<mml:math id="m45">
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. This situation is more similar to what is seen experimentally in the ESC to DE transition, where activation of core DE TFs genes (GATA6, MIXL1, EOMES, SOX17) induces the inactivation of the core ESC TF genes (POU5F1, SOX2, and NANOG).</p>
</sec>
<sec id="s2-3">
<title>Cell-specific expression of miRNAs</title>
<p>Next, we analyze miRNA expression data to detect miRNAs that are differentially expressed in ESC and DE and might be repressing TFs in the other cell type. PCA of miRNA expression across 177 ENCODE samples (<xref ref-type="bibr" rid="B42">Reese et al., 2023</xref>) shows some degree of clustering and separation between samples originating from the three germ layers (endoderm, mesoderm and ectoderm) (<xref ref-type="fig" rid="F4">Figure 4A</xref>; <xref ref-type="sec" rid="s10">Supplementary Figure S2</xref>, Methods). The three human embryonic stem cell samples (hESC) are close to each other in the PCA space, and have an average expression correlation of 0.95 compared to average expression correlation of 0.77 for all 177 pairwise comparisons.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>In the transition from ESC to DE, there are multiple miRNAs expressed in a cell-specific manner, flanked by peaks of cell-specific chromatin accessibility consistent with enhancer activity driving the target miRNA expression. <bold>(A)</bold> PCA of miRNA expression in tissues and cell lines identifies clusters of samples. <bold>(B)</bold> Correlation heatmap of 835 miRNAs with detectable expression in multiple samples (Methods). Differentially expressed miRNAs in DE compared to ESC have some similar patterns of expression in ENCODE samples. <bold>(C)</bold> Number of upregulated miRNAs in DE compared to ESC. <bold>(D)</bold> There are differentially active ATAC peaks flanking differentially expressed miRNAs with higher activity in the tissue where the flanking miRNA expression is upregulated. &#x201c;DE-MIR-flanking&#x201d; peaks are ATAC peaks having a miRNA upregulated in DE within 100&#xa0;kb of them. &#x201c;ESC-MIR-flanking&#x201d; peaks are defined similarly. <bold>(E)</bold> DE and ESC TFs have higher binding to differentially accessible DE-MIR-flanking and ESC-MIR-flanking peaks, respectively (<italic>t</italic>-test, &#x2a;<italic>p</italic> &#x3c; 0.01). MYC, POU5F, and NANOG ChIP-seq experiments were performed in ESC (H1), while others were done in DE cells. hESC: human embryonic stem cell.</p>
</caption>
<graphic xlink:href="freae-02-1473789-g004.tif"/>
</fig>
<p>We performed a differential expression analysis for miRNAs in DE compared to the ESC cell line and found 59 miRNAs overexpressed in DE state and 100 miRNAs with higher expression in ESC (<xref ref-type="fig" rid="F4">Figures 4B, C</xref>). Some miRNAs upregulated in either ESC or DE are co-expressed across other tissues and cluster together (<xref ref-type="fig" rid="F4">Figure 4B</xref>), providing evidence that miRNA expression differences are informative and vary across differentiation states and tissues.</p>
</sec>
<sec id="s2-4">
<title>Differential miRNA expression is driven by differential chromatin accessibility and TF binding at putative enhancers</title>
<p>ESC and DE ATAC-seq along with H3K27ac (<xref ref-type="bibr" rid="B30">Li et al., 2019</xref>; <xref ref-type="bibr" rid="B32">Luo et al., 2023</xref>) shows that peaks flanking differentially expressed miRNAs are consistently more active in the tissue where the flanking miRNAs are overexpressed (<xref ref-type="fig" rid="F4">Figure 4D</xref>, Methods). Core DE and core ESC TFs bind preferentially to the most differentially accessible peaks between DE and ESC that are near a differentially expressed miRNA (<xref ref-type="fig" rid="F4">Figure 4E</xref>). GATA6, EOMES and SOX17 have a significantly higher binding to ATAC peaks flanking miRNAs upregulated in DE (named &#x201c;DE-MIR-flanking&#x201d; peaks), whereas MYC, POU5F1 and NANOG have a higher ChIP-seq signal at differentially accessible peaks in ESC flanking ESC miRNAs (&#x201c;ESC-MIR-flanking&#x201d; peaks) (<italic>t</italic>-test <italic>p</italic> &#x3c; 0.01, <xref ref-type="fig" rid="F4">Figure 4E</xref>). In addition, GATA and POU5F1 DNA-binding motif sequences have a significantly higher enrichment in differentially accessible DE-MIR-flanking peaks and ESC-MIR-flanking peaks, respectively (<italic>t</italic>-test <italic>p</italic> &#x3c; 0.05, <xref ref-type="sec" rid="s10">Supplementary Figure S3</xref>, Methods). Taken together, there are distinct patterns of accessibility and TF binding at peaks near differentially expressed miRNAs in ESC and DE which suggest that they are functioning as enhancers driving the differential expression of these miRNAs in ESC and DE, consistent with our modeling in <xref ref-type="fig" rid="F3">Figure 3</xref>.</p>
</sec>
<sec id="s2-5">
<title>Differentially expressed miRNAs are predicted to target 3&#x2032;UTRs of core DE and ESC TFs</title>
<p>Our dynamic network modeling (<xref ref-type="fig" rid="F1">Figures 1</xref>, <xref ref-type="fig" rid="F3">3</xref>) highlights the possibility of miRNAs repressing core ESC and DE TFs during embryonic differentiation. We used TargetScan and mirDIP (<xref ref-type="bibr" rid="B35">McGeary et al., 2019</xref>; <xref ref-type="bibr" rid="B49">Tokar et al., 2018</xref>) to find the set of candidate miRNAs targeting 3&#x2032;UTRs of core TFs (<xref ref-type="fig" rid="F5">Figure 5A</xref>, Methods) and discovered a subset of miRNAs predicted to target MYC, SOX2, POU5F1, EOMES and GATA6 (<xref ref-type="fig" rid="F5">Figure 5B</xref>). MIR92B is expressed at a higher level in ESC and is confidently predicted to target EOMES and GATA6 (<xref ref-type="fig" rid="F5">Figure 5C</xref>). MIR145 is upregulated in DE and consistent with the target prediction, MIR145 was shown to repress pluripotency by targeting ESC TFs POU5F1, SOX2 and KLF4 (<xref ref-type="bibr" rid="B55">Xu et al., 2009</xref>). MIR892A is upregulated in DE while it has low expression in the ESC state and is predicted to target POU5F1. Another core ESC TF is MYC predicted to be targeted by MIRLET7D (<xref ref-type="sec" rid="s10">Supplementary Figure S4</xref>). In addition, our analysis of differentially expressed miRNAs identified a potential combinatorial regulation of SOX2 by multiple DE miRNAs such as MIR145, MIR369 and MIRLET7F1 (<xref ref-type="fig" rid="F5">Figure 5D</xref>). This potential combinatorial repression is consistent with SOX2 being the core ESC TF having the highest and the fastest downregulation following ESC-DE transition through DE day 3 (<xref ref-type="sec" rid="s10">Supplementary Figure S1</xref>).</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>Some differentially expressed miRNAs target 3&#x2032; UTRs of core TFs of other cell types, consistent with a negative feedback role in cell-type specification. <bold>(A)</bold> Distribution of mirDIP prediction scores for pairs of [miRNA, TF] where we chose 0.2 &#x3c; mirDIP_score as the cutoff for miRNA target prediction. <bold>(B)</bold> Differentially expressed miRNAs in DE compared to ESC (&#x7c;log<sub>2</sub>FC&#x7c; &#x3e; 2). Labeled miRNAs are those with a flanking (&#x3c;100&#xa0;kb) differentially active ATAC peak which are predicted to target a core DE or ESC TF by mirDIP. Numbers are mirDIP target prediction scores. <bold>(C)</bold> miRNA seed sequences aligned to 3&#x2032;UTR of DE and ESC TFs. <bold>(D)</bold> The SOX2 3&#x2032;UTR contains multiple seed target sequences for DE expressed miRNAs, consistent with combinatorial translational repression.</p>
</caption>
<graphic xlink:href="freae-02-1473789-g005.tif"/>
</fig>
</sec>
<sec id="s2-6">
<title>Chromatin accessibility variations around miRNAs targeting ESC and DE TFs</title>
<p>We identified differentially expressed miRNAs putatively targeting DE or ESC TFs and found differentially active enhancer peaks consistent with their cell-specific expression. MIR92B is predicted to target DE TFs and has a differentially active candidate enhancer (<italic>E1</italic>) flanking it (<xref ref-type="fig" rid="F6">Figure 6A</xref>), where ESC TFs such as POU5F1 and NANOG bind. Conversely, DE TFs such as EOMES, GATA6 and SOX17 bind to the differentially accessible candidate enhancers (<italic>E2</italic> and <italic>E3</italic>) flanking MIR145 and MIR892A (<xref ref-type="fig" rid="F6">Figures 6B, C</xref>). These candidate enhancers are all within loop extrusion model predicted CTCF loops (<xref ref-type="bibr" rid="B52">Xi and Beer, 2021</xref>) enclosing the miRNA, providing more support that they are responsible for the miRNA differential expression. Together, these results suggest DE and ESC TFs, through activating <italic>cis</italic>-regulatory elements, upregulate miRNAs, which in turn repress core TFs for the other transition state.</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>Chromatin accessibility and TF binding at candidate enhancers activating ESC or DE specific miRNAs. Chromatin landscape around <bold>(A)</bold> MIR92B&#xa0;<bold>(B)</bold> MIR145 <bold>(C)</bold> MIR892A. <italic>E1</italic>, <italic>E2</italic> and <italic>E3</italic> are candidate enhancers within the same CTCF loop as the miRNAs. Their accessibility may be regulated through binding of core DE or ESC TFs and these candidate enhancers may control the expression of the nearby miRNAs which will repress TFs of the other transition state.</p>
</caption>
<graphic xlink:href="freae-02-1473789-g006.tif"/>
</fig>
</sec>
</sec>
<sec sec-type="discussion" id="s3">
<title>Discussion</title>
<p>Cell state transitions are characterized by the activation of lineage-specific core TF genes and the inactivation of at least some of the core TF genes of the precursor cell type. Our modeling shows that a repressive mechanism is required to achieve this inactivation. In our analysis of a very large set of chromatin accessibility datasets, we have found a deficit of repressive sequence signals in enhancers that could provide this shut-off mechanism, relative to activating sequence signals. We see evidence for TFBS for ZEB1/2, SNAI1/2, GLI, ZBTB7A, Nuclear Hormone Receptors, GFI1, REST and others acting as repressors, but in our systematic sequence analysis, there are far fewer repressors detected than activators, suggesting that there are too few repressors to act individually in multiple different cell types to negatively regulate the large set of known activators. On the other hand, it is quite easy to imagine each cell type expressing a small number of miRNAs to repress the TFs in precursor and downstream cell types by targeting the TF&#x2019;s 3&#x2032;UTRs. The PRC2 complex or other repressive complexes may play this repressive role by binding promoters, but how these complexes are recruited to different genomic locations in a cell specific manner is unclear. There is also strong autoactivation of core TFs in most tissues, where we have found that core TFs bind their own enhancers. This autoactivation is a key component of our dynamic gene network model, and helps explain our experimental observations of the effect of enhancer perturbation in the ESC-DE transition (<xref ref-type="bibr" rid="B32">Luo et al., 2023</xref>). However, the autoactivation also induces hysteresis in the gene regulatory network which makes it difficult to turn off cell-specific core TFs once they are activated. One could argue that the tendency for chromatin to be bound by nucleosomes and revert to a heterochromatic state could provide a generic shut-off mechanism that provides the required negative feedback, but mathematically this would be equivalent to using a lower basal transcription rate, controlled by parameters <inline-formula id="inf46">
<mml:math id="m46">
<mml:mrow>
<mml:mi>b</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf47">
<mml:math id="m47">
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> in our model, which are already quite small. The repressive effect could also be modeled by requiring stronger cooperativity to overcome the heterochromatic state, by using <inline-formula id="inf48">
<mml:math id="m48">
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:msup>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>4</mml:mn>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> instead of <inline-formula id="inf49">
<mml:math id="m49">
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:msup>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>3</mml:mn>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>, but changing <inline-formula id="inf50">
<mml:math id="m50">
<mml:mrow>
<mml:mi>b</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf51">
<mml:math id="m51">
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> or the cooperativity exponent does not qualitatively change the stability analysis shown in <xref ref-type="fig" rid="F3">Figures 3B, C</xref>. The point is that once autoactivation is significant enough to overcome any default heterochromatic repression, it remains strong enough to overcome it until another distinct repressive mechanism is induced. In this paper we have provided modeling and analysis of epigenomic datasets in support of miRNAs acting to provide the negative feedback necessary to shut off the core TFs of neighboring cell types and provide stability to cell types and cell state transitions.</p>
<p>Our miRNA model is generic and can apply to many different cell types, but our epigenomic analysis is specific to the ESC to DE transition for which we have ample high-quality datasets. We identified differentially expressed miRNAs in ESC and DE, and differentially active enhancers flanking these miRNAs which are likely responsible for their differential expression and which are bound by the relevant cell-specific core TFs. We found both ESC-expressed miRNAs which are driven by core ESC TF-bound enhancers that provide the translational repression of core DE TF genes through binding sites in their 3&#x2032;UTRs, and DE-expressed miRNAs which are driven by core DE TF-bound enhancers that carry out the translational repression of core ESC TF genes, again with miRNA binding sites in their 3&#x2032;UTRs.</p>
<p>More detailed experimental validation of these predictions is required, but if verified, a better understanding of the role of miRNAs in cell-state transitions may allow further improvement in the efficiency of stem cell differentiation for both scientific and therapeutic purposes, and may help better understand the role of miRNAs in cancer and as potential therapeutic targets.</p>
</sec>
<sec sec-type="methods" id="s4">
<title>Methods</title>
<sec id="s4-1">
<title>Gkm-SVM modeling</title>
<p>We trained gkm-SVM on ATAC-seq data from ESC and DE (<xref ref-type="bibr" rid="B30">Li et al., 2019</xref>), after removing promoter peaks within 2&#xa0;kb of an annotated TSS, non-cell specific peaks accessible in more than 30% of ENCODE samples, and CTCF binding sites. We removed promoters as they have similar TFBS and are usually active across all cell types (<xref ref-type="bibr" rid="B38">Oh and Beer, 2024</xref>). Gkm-SVM uses counts of gapped kmer features under the 300bp sequences centered on the ATAC-seq peaks, as gapped kmers are effective at describing regulatory sequence features (<xref ref-type="bibr" rid="B13">Ghandi et al., 2014b</xref>; <xref ref-type="bibr" rid="B2">Amanchy et al., 2011</xref>) and do not require previous knowledge of TFBS. Following the default training method in the gkmSVM-R package (<xref ref-type="bibr" rid="B14">Ghandi et al., 2016</xref>), the AUROC was evaluated on 5-fold cross validation (CV) and AUROC is the average of 5 CV test sets (0.902 for ESC, 0.949 for DE-D2, and 0.975 for DE-D2 vs. ESC). All cross-fold validation sets produce similar sets of features and have very similar weight vectors. We normalized weight vectors to have zero mean and unit standard deviation for comparison. We extracted TFBS from this weight vector using gkm-PWM (<xref ref-type="bibr" rid="B45">Shigaki, 2024</xref>), which produces a lasso weight reflecting the importance of each motif in explaining the full weight vector (W in <xref ref-type="fig" rid="F2">Figure 2</xref>). Other motif metrics Z and I describe the Z-score for kmers mapping to the motif and the error induced by removing the motif from the list.</p>
</sec>
<sec id="s4-2">
<title>Dynamic gene regulatory network modeling</title>
<p>We modified the gene regulatory network model (<xref ref-type="bibr" rid="B32">Luo et al., 2023</xref>) to include miRNA repression. The model describes the rate of change of cell-specific TFs, where one cell state&#x2019;s TF concentrations are described by <inline-formula id="inf52">
<mml:math id="m52">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, and the other cell state&#x2019;s TF concentrations are described by <inline-formula id="inf53">
<mml:math id="m53">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>. For simplicity, we assume that through autoregulation, each cell state&#x2019;s TFs turn on or off together, and can thus be represented with one concentration (<xref ref-type="bibr" rid="B32">Luo et al., 2023</xref>). The differential equation describes the change in concentration of the TFs, which is driven by the balance between a transcription rate, <inline-formula id="inf54">
<mml:math id="m54">
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>&#x3c8;</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, and degradation/export, <inline-formula id="inf55">
<mml:math id="m55">
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>r</mml:mi>
<mml:mi>&#x3c8;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, see, for example, (<xref ref-type="bibr" rid="B1">Alon, 2019</xref>). For both the TF and miRNA transcription rate we choose the form <inline-formula id="inf56">
<mml:math id="m56">
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>&#x3c8;</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mi>c</mml:mi>
<mml:msup>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>3</mml:mn>
</mml:msup>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mi>b</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>b</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>c</mml:mi>
<mml:msup>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>3</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
</inline-formula>. This rate depends on the TF activity, <inline-formula id="inf57">
<mml:math id="m57">
<mml:mrow>
<mml:mi>&#x3c8;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, through enhancer activation, and the parameters control the rate at which transcription transitions from low transcription to high transcription as a function of <inline-formula id="inf58">
<mml:math id="m58">
<mml:mrow>
<mml:mi>&#x3c8;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>. It is a simple Hill form (<xref ref-type="bibr" rid="B1">Alon, 2019</xref>), but can also be understood as a two-state model with a large enhancer activated transcription rate, <inline-formula id="inf59">
<mml:math id="m59">
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, and a low basal transcription rate, <inline-formula id="inf60">
<mml:math id="m60">
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. Each cell type&#x2019;s enhancers are active proportional to <inline-formula id="inf61">
<mml:math id="m61">
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:msup>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>3</mml:mn>
</mml:msup>
</mml:mrow>
<mml:mrow>
<mml:mi>b</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>c</mml:mi>
<mml:msup>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>3</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
</inline-formula> and inactive proportional to <inline-formula id="inf62">
<mml:math id="m62">
<mml:mrow>
<mml:mfrac>
<mml:mi>b</mml:mi>
<mml:mrow>
<mml:mi>b</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>c</mml:mi>
<mml:msup>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>3</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
</inline-formula>. We will assume that miRNAs are similarly activated with enhancers, and repress the other cell type&#x2019;s core TF activity with a rate <inline-formula id="inf63">
<mml:math id="m63">
<mml:mrow>
<mml:mi>d</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>. This rate includes miRNA processing and nuclear transport. The core TFs are degraded and/or exported from the nucleus at a rate <inline-formula id="inf64">
<mml:math id="m64">
<mml:mrow>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>. Together this yields two coupled nonlinear differential equations for <inline-formula id="inf65">
<mml:math id="m65">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf66">
<mml:math id="m66">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> as shown in <xref ref-type="fig" rid="F3">Figure 3A</xref>. Transitions from cell type 1 to cell type 2 are modeled (<xref ref-type="bibr" rid="B32">Luo et al., 2023</xref>) by adding a time-dependent stimulus, <inline-formula id="inf67">
<mml:math id="m67">
<mml:mrow>
<mml:mi>&#x3b4;</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, which acts at a subset of cell type 2 enhancers to induce activation of <inline-formula id="inf68">
<mml:math id="m68">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, by increasing the probability that <inline-formula id="inf69">
<mml:math id="m69">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>&#x2019;s enhancers are active. The probability that <inline-formula id="inf70">
<mml:math id="m70">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is in the basal state is <inline-formula id="inf71">
<mml:math id="m71">
<mml:mrow>
<mml:mi>b</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, and the probability that <inline-formula id="inf72">
<mml:math id="m72">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>&#x2019;s enhancers are active is <inline-formula id="inf73">
<mml:math id="m73">
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:msubsup>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
<mml:mn>3</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> without stimulus and <inline-formula id="inf74">
<mml:math id="m74">
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:msubsup>
<mml:mi>&#x3c8;</mml:mi>
<mml:mn>2</mml:mn>
<mml:mn>3</mml:mn>
</mml:msubsup>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>&#x3b4;</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> with the stimulus, with <inline-formula id="inf75">
<mml:math id="m75">
<mml:mrow>
<mml:mi>&#x3b4;</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>&#x3b4;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mrow>
<mml:mfenced open="[" close="]" separators="&#x7c;">
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2b;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>erf</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:msqrt>
<mml:mn>2</mml:mn>
</mml:msqrt>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>/</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> This stimulus turns on gradually at <inline-formula id="inf76">
<mml:math id="m76">
<mml:mrow>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>3</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. The other parameters are <inline-formula id="inf77">
<mml:math id="m77">
<mml:mrow>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>b</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>c</mml:mi>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>d</mml:mi>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x3b4;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mn>0.5</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>0.1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>3</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> unless specified otherwise.</p>
</sec>
<sec id="s4-3">
<title>ENCODE miRNA expression data</title>
<p>We downloaded 177 ENCODE miRNA-seq samples from the ENCODE portal (<xref ref-type="bibr" rid="B42">Reese et al., 2023</xref>) (<xref ref-type="sec" rid="s10">Supplementary Table S1</xref>). Raw miRNA expression counts were mean-normalized across the 177 samples and log<sub>2</sub>-transformed. 835 miRNAs of the annotated 1877 miRNAs had a log<sub>2</sub>-normalized expression above 4 (&#x223c;16 normalized counts) in at least two samples. We used this subset of 835 miRNAs to plot a PCA and heatmap of miRNAs across different tissues. We generated a table assigning a likely germ layer of origin to miRNA-seq tissue and cell line experiments (<xref ref-type="sec" rid="s10">Supplementary Table S1</xref>). PCA and heatmaps were generated in R using &#x2018;prcomp&#x2019;, ggplot2 and pheatmap. We used ENCSR588ZRK (H1 embryonic stem cells) labeled &#x201c;ESC&#x201d; and ENCSR820BMF (endodermal cell originated from H1) labeled &#x201c;DE&#x201d; for miRNA expression analysis.</p>
<p>We used a threshold of &#x7c;log<sub>2</sub>FC&#x7c; &#x3e; 2 to call differentially expressed miRNAs between ESC and DE. Ensembl bioMart V.110 and V.111 were used for miRNA gene name conversion and transcription start site annotation (<xref ref-type="bibr" rid="B34">Martin et al., 2023</xref>).</p>
<sec id="s4-3-1">
<title>DE and ESC core TFs and RNA-seq analysis</title>
<p>We used RNA-seq data from GEO GSE213394 (<xref ref-type="bibr" rid="B32">Luo et al., 2023</xref>) to define core DE and core ESC TFs. RNA-seq expression values for HUES8 (ESC) and SOX17&#x2b; HUES8 (DE) were upper-quartile normalized. Expression values for DE-D3 (DE day 3) and ESC samples were averaged across the three replicated and differentially expressed genes were called using a threshold of &#x7c;log<sub>2</sub>FC &#x3e; 1.5&#x7c;. The list of human TFs was obtained from Lambert et al. (<xref ref-type="bibr" rid="B23">Lambert et al., 2018</xref>).</p>
</sec>
</sec>
<sec id="s4-4">
<title>ATAC and ChIP-seq analysis</title>
<p>ATAC-seq, ChIP-seq (TFs and H3K27ac) fastq and bigwig files were obtained from GEO GSE213394 (<xref ref-type="bibr" rid="B30">Li et al., 2019</xref>). They were processed according to the pipeline described by Shigaki et al. (<xref ref-type="bibr" rid="B46">Shigaki et al., 2019</xref>). We compared ATAC-seq and ChIP-seq samples in ESC to DE-D2 (day 2), as some TF ChIP-seq data was only available in DE-D2 (and not in DE-D3). We used ATAC-seq and H3K27ac to identify differentially active candidate enhancers regulating differentially expressed miRNAs in DE and ESC. We extended the ATAC peak summit by&#xa0;&#xb1;&#xa0;500bp or 150bp on each side (peak length &#x3d; 1000bp for H3K27ac and 300bp for ATAC) and calculated the average H3K27ac signal across the extended region, as H3K27ac peaks typically flank the ATAC peak summit. Differentially active regions were called from the set of top 25,000 strongest ATAC peaks in DE and ESC which satisfy the following: i) ATAC &#x7c;log<sub>2</sub>FC&#x7c; &#x3e; 1; ii) H3K27ac &#x7c;log<sub>2</sub>FC&#x7c; &#x3e; 0.25; iii) fall within 100kbp of a differentially expressed miRNA. Peak color indicates whether they are close to a miRNA upregulated in DE (&#x201c;DE-MIR-flanking&#x201d; peak) or to a miRNA upregulated in ESC (&#x201c;ESC-MIR-flanking&#x201d; peak). Of the 101 DE-MIR-flanking and 183 ESC-MIR-flanking peaks, only seven peaks are shared among the two sets and this subset of peaks has a relatively similar ATAC signal in DE and ESC (not differentially active and not affecting the rest of the analysis).</p>
<p>GATA6, EOMES and SOX17 ChIP-seq samples in DE were obtained from GSE213394, while the bigwig files for MYC (ENCFF145JGY, Snyder Lab), POU5F1 (ENCFF106YHB, Myers Lab) and NANOG (ENCFF512EZC, Myers Lab) ChIP-seq in H1 (ESC) were downloaded from the ENCODE portal (<xref ref-type="bibr" rid="B33">Luo et al., 2020</xref>). UCSC bigWigAverageOverBed was used to calculate average ChIP-seq signal over DE-MIR-flanking and ESC-MIR-flanking peaks and the values were log<sub>2</sub>-transformed and <italic>t</italic>-test was used.</p>
</sec>
<sec id="s4-5">
<title>TF motif analysis</title>
<p>For motif enrichment analysis, we used STORM (<xref ref-type="bibr" rid="B43">Schones et al., 2007</xref>) and the following PWMs: GATA (MEME &#x201c;MOTIF GATA1-ext&#x201d;), GSC (JASPAR &#x201c;MA0648.1&#x201d;), SOX (JASPAR &#x201c;MA0143.4&#x201d;), EOMES (MEME &#x201c;MOTIF Eomes&#x201d;), NANOG (HOMER &#x201c;motif224&#x201d;), POU5F1 (JASPAR &#x201c;MA1115.1&#x201d;) (<xref ref-type="bibr" rid="B3">Bailey et al., 2015</xref>; <xref ref-type="bibr" rid="B10">Fornes et al., 2020</xref>; <xref ref-type="bibr" rid="B17">Heinz et al., 2010</xref>). <italic>t</italic>-test was used. To remove CTCF binding sites we scored with ScanACE (<xref ref-type="bibr" rid="B20">Karnik and Beer, 2015</xref>) with a score threshold of 5.0.</p>
</sec>
<sec id="s4-6">
<title>miRNA target prediction</title>
<p>We used mirDIP to find a set of candidate miRNAs targeting core DE/ESC TFs and selected pairs of miRNA and core TFs whose mirDIP score is greater than 0.20. TargetScan was used to validate mirDIP predictions and align 3&#x2032;UTRs to miRNA seed regions.</p>
</sec>
<sec id="s4-7">
<title>Genome browser</title>
<p>We used WashU Epigenome Browser (<xref ref-type="bibr" rid="B29">Li et al., 2022</xref>) for <xref ref-type="fig" rid="F6">Figure 6</xref>. For better visual clarity in the figure, we used RefGene annotations (genomic coordinates) for MIR145 and MIR92B, while we used GENCODE V39 for MIR892A coordinates.</p>
</sec>
</sec>
</body>
<back>
<sec sec-type="data-availability" id="s5">
<title>Data availability statement</title>
<p>Publicly available datasets were analyzed in this study. The datasets can be found here: <ext-link ext-link-type="uri" xlink:href="https://encodeproject.org">https://encodeproject.org</ext-link> and <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/geo/">https://www.ncbi.nlm.nih.gov/geo/</ext-link> and the ENCODE (ENCFF) and GEO (GSE) accession numbers can be found in the Methods section.</p>
</sec>
<sec id="s6">
<title>Author contributions</title>
<p>MR-M: Writing&#x2013;review and editing, Writing&#x2013;original draft, Formal Analysis, Conceptualization. MB: Writing&#x2013;review and editing, Writing&#x2013;original draft, Supervision, Formal Analysis, Conceptualization.</p>
</sec>
<sec sec-type="funding-information" id="s7">
<title>Funding</title>
<p>The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. MAB and MR-M were supported by National Human Genome Research Institute (NHGRI)&#x2014;National Institutes of Health (NIH) grant R01 HG012367 to MAB. Funder played no role in the research.</p>
</sec>
<ack>
<p>We gratefully acknowledge useful discussions with Danwei Huangfu in our ongoing collaborations on stem cell differentiation.</p>
</ack>
<sec sec-type="COI-statement" id="s8">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s9">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="s10">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/freae.2024.1473789/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/freae.2024.1473789/full&#x23;supplementary-material</ext-link>
</p>
<supplementary-material>
<label>SUPPLEMENTARY FIGURE S1</label>
<caption>
<p>DE and ESC TFs.</p>
</caption>
</supplementary-material>
<supplementary-material>
<label>SUPPLEMENTARY FIGURE S2</label>
<caption>
<p>Correlation heatmap of 177 ENCODE miRNA expression samples.</p>
</caption>
</supplementary-material>
<supplementary-material>
<label>SUPPLEMENTARY FIGURE S3</label>
<caption>
<p>DNA binding site (motif) enrichment in loci flanking DE and ESC miRNAs.</p>
</caption>
</supplementary-material>
<supplementary-material>
<label>SUPPLEMENTARY FIGURE S4</label>
<caption>
<p>Alignment of seed regions to 3&#x2032;UTRs of TFs.</p>
</caption>
</supplementary-material>
<supplementary-material>
<label>SUPPLEMENTARY TABLE S1</label>
<caption>
<p>ENCODE miRNA expression datasets.</p>
</caption>
</supplementary-material>
<supplementary-material xlink:href="Table1.xlsx" id="SM1" mimetype="application/xlsx" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Image1.pdf" id="SM2" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Alon</surname>
<given-names>U.</given-names>
</name>
</person-group> (<year>2019</year>). <source>An introduction to systems biology: design principles of biological circuits</source>. <publisher-name>Chapman and Hall/CRC</publisher-name>.</citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Amanchy</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Kandasamy</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Mathivanan</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Periaswamy</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Reddy</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Yoon</surname>
<given-names>W.-H.</given-names>
</name>
<etal/>
</person-group> (<year>2011</year>). <article-title>Identification of novel phosphorylation motifs through an integrative computational and experimental analysis of the human phosphoproteome</article-title>. <source>J. Proteomics Bioinform</source> <volume>4</volume>, <fpage>22</fpage>&#x2013;<lpage>35</lpage>. <pub-id pub-id-type="doi">10.4172/jpb.1000163</pub-id>
</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bailey</surname>
<given-names>T. L.</given-names>
</name>
<name>
<surname>Johnson</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Grant</surname>
<given-names>C. E.</given-names>
</name>
<name>
<surname>Noble</surname>
<given-names>W. S.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>The MEME suite</article-title>. <source>Nucleic Acids Res.</source> <volume>43</volume>, <fpage>W39</fpage>&#x2013;<lpage>W49</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkv416</pub-id>
</citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bartel</surname>
<given-names>D. P.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>MicroRNAs: genomics, biogenesis, mechanism, and function</article-title>. <source>Cell</source> <volume>116</volume>, <fpage>281</fpage>&#x2013;<lpage>297</lpage>. <pub-id pub-id-type="doi">10.1016/s0092-8674(04)00045-5</pub-id>
</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Beer</surname>
<given-names>M. A.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Predicting enhancer activity and variant impact using gkm-SVM</article-title>. <source>Hum. Mutat.</source> <volume>38</volume>, <fpage>1251</fpage>&#x2013;<lpage>1258</lpage>. <pub-id pub-id-type="doi">10.1002/humu.23185</pub-id>
</citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Beer</surname>
<given-names>M. A.</given-names>
</name>
<name>
<surname>Shigaki</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Huangfu</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Enhancer predictions and genome-wide regulatory circuits</article-title>. <source>Annu. Rev. Genom Hum. Genet.</source> <volume>21</volume>, <fpage>37</fpage>&#x2013;<lpage>54</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-genom-121719-010946</pub-id>
</citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chang</surname>
<given-names>T.-C.</given-names>
</name>
<name>
<surname>Wentzel</surname>
<given-names>E. A.</given-names>
</name>
<name>
<surname>Kent</surname>
<given-names>O. A.</given-names>
</name>
<name>
<surname>Ramachandran</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Mullendore</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>K. H.</given-names>
</name>
<etal/>
</person-group> (<year>2007</year>). <article-title>Transactivation of miR-34a by p53 broadly influences gene expression and promotes apoptosis</article-title>. <source>Mol. Cell</source> <volume>26</volume>, <fpage>745</fpage>&#x2013;<lpage>752</lpage>. <pub-id pub-id-type="doi">10.1016/j.molcel.2007.05.010</pub-id>
</citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Craene</surname>
<given-names>B. D.</given-names>
</name>
<name>
<surname>Berx</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Regulatory networks defining EMT during cancer initiation and progression</article-title>. <source>Nat. Rev. Cancer</source> <volume>13</volume>, <fpage>97</fpage>&#x2013;<lpage>110</lpage>. <pub-id pub-id-type="doi">10.1038/nrc3447</pub-id>
</citation>
</ref>
<ref id="B9">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Davidson</surname>
<given-names>E. H.</given-names>
</name>
</person-group> (<year>2010</year>). <source>The regulatory genome: gene regulatory networks in development and evolution</source>. <publisher-name>Elsevier</publisher-name>.</citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fornes</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Castro-Mondragon</surname>
<given-names>J. A.</given-names>
</name>
<name>
<surname>Khan</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>van der Lee</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Richmond</surname>
<given-names>P. A.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>JASPAR 2020: update of the open-access database of transcription factor binding profiles</article-title>. <source>Nucleic Acids Res.</source> <volume>48</volume>, <fpage>D87-D92</fpage>&#x2013;<lpage>D92</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkz1001</pub-id>
</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gate</surname>
<given-names>R. E.</given-names>
</name>
<name>
<surname>Cheng</surname>
<given-names>C. S.</given-names>
</name>
<name>
<surname>Aiden</surname>
<given-names>A. P.</given-names>
</name>
<name>
<surname>Siba</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Tabaka</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Lituiev</surname>
<given-names>D.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Genetic determinants of co-accessible chromatin regions in activated T cells across humans</article-title>. <source>Nat. Genet.</source> <volume>50</volume>, <fpage>1140</fpage>&#x2013;<lpage>1150</lpage>. <pub-id pub-id-type="doi">10.1038/s41588-018-0156-2</pub-id>
</citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ghandi</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Mohammad-Noori</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Beer</surname>
<given-names>M. A.</given-names>
</name>
</person-group> (<year>2014a</year>). <article-title>Enhanced regulatory sequence prediction using gapped k-mer features</article-title>. <source>PLoS Comput. Biol.</source> <volume>10</volume>, <fpage>e1003711</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1003711</pub-id>
</citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ghandi</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Mohammad-Noori</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Beer</surname>
<given-names>M. A.</given-names>
</name>
</person-group> (<year>2014b</year>). <article-title>Robust k-mer frequency estimation using gapped k-mers</article-title>. <source>J. Math. Biol.</source> <volume>69</volume>, <fpage>469</fpage>&#x2013;<lpage>500</lpage>. <pub-id pub-id-type="doi">10.1007/s00285-013-0705-3</pub-id>
</citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ghandi</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Mohammad-Noori</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Ghareghani</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Garraway</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Beer</surname>
<given-names>M. A.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>gkmSVM: an R package for gapped-kmer SVM</article-title>. <source>Bioinformatics</source> <volume>32</volume>, <fpage>2205</fpage>&#x2013;<lpage>2207</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btw203</pub-id>
</citation>
</ref>
<ref id="B15">
<citation citation-type="web">
<person-group person-group-type="author">
<name>
<surname>Gschwind</surname>
<given-names>A. R.</given-names>
</name>
<name>
<surname>Mualim</surname>
<given-names>K. S.</given-names>
</name>
<name>
<surname>Karbalayghareh</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Sheth</surname>
<given-names>M. U.</given-names>
</name>
<name>
<surname>Dey</surname>
<given-names>K. K.</given-names>
</name>
<name>
<surname>Jagoda</surname>
<given-names>E.</given-names>
</name>
<etal/>
</person-group> (<year>2023</year>). <article-title>An encyclopedia of enhancer-gene regulatory interactions in the human genome</article-title>. <comment>Available at: <ext-link ext-link-type="uri" xlink:href="https://www.biorxiv.org/content/10.1101/2023.11.09.563812v1">https://www.biorxiv.org/content/10.1101/2023.11.09.563812v1</ext-link> (Accessed June 2, 2024)</comment>.</citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hayes</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Peruzzi</surname>
<given-names>P. P.</given-names>
</name>
<name>
<surname>Lawler</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>MicroRNAs in cancer: biomarkers, functions and therapy</article-title>. <source>Trends Mol. Med.</source> <volume>20</volume>, <fpage>460</fpage>&#x2013;<lpage>469</lpage>. <pub-id pub-id-type="doi">10.1016/j.molmed.2014.06.005</pub-id>
</citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Heinz</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Benner</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Spann</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Bertolino</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>Y. C.</given-names>
</name>
<name>
<surname>Laslo</surname>
<given-names>P.</given-names>
</name>
<etal/>
</person-group> (<year>2010</year>). <article-title>Simple combinations of lineage-determining transcription factors prime <italic>cis</italic>-regulatory elements required for macrophage and B cell identities</article-title>. <source>Mol. Cell</source> <volume>38</volume>, <fpage>576</fpage>&#x2013;<lpage>589</lpage>. <pub-id pub-id-type="doi">10.1016/j.molcel.2010.05.004</pub-id>
</citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ho</surname>
<given-names>S. W. T.</given-names>
</name>
<name>
<surname>Sheng</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Xing</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Ooi</surname>
<given-names>W. F.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Sundar</surname>
<given-names>R.</given-names>
</name>
<etal/>
</person-group> (<year>2023</year>). <article-title>Regulatory enhancer profiling of mesenchymal-type gastric cancer reveals subtype-specific epigenomic landscapes and targetable vulnerabilities</article-title>. <source>Gut</source> <volume>72</volume>, <fpage>226</fpage>&#x2013;<lpage>241</lpage>. <pub-id pub-id-type="doi">10.1136/gutjnl-2021-326483</pub-id>
</citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jain</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Bakolitsa</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Brenner</surname>
<given-names>S. E.</given-names>
</name>
<name>
<surname>Radivojac</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Moult</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Repo</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2024</year>). <article-title>CAGI, the Critical Assessment of Genome Interpretation, establishes progress and prospects for computational genetic variant interpretation methods</article-title>. <source>Genome Biol.</source> <volume>25</volume>, <fpage>53</fpage>. <pub-id pub-id-type="doi">10.1186/s13059-023-03113-6</pub-id>
</citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Karnik</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Beer</surname>
<given-names>M. A.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Identification of predictive cis-regulatory elements using a discriminative objective function and a dynamic search space</article-title>. <source>PLOS ONE</source> <volume>10</volume>, <fpage>e0140557</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0140557</pub-id>
</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kreimer</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Zeng</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Edwards</surname>
<given-names>M. D.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Tian</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Shin</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Predicting gene expression in massively parallel reporter assays: a comparative study</article-title>. <source>Hum. Mutat.</source> <volume>38</volume>, <fpage>1240</fpage>&#x2013;<lpage>1250</lpage>. <pub-id pub-id-type="doi">10.1002/humu.23197</pub-id>
</citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lai</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Wolkenhauer</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Vera</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Understanding microRNA-mediated gene regulatory networks through mathematical modelling</article-title>. <source>Nucleic Acids Res.</source> <volume>44</volume>, <fpage>6019</fpage>&#x2013;<lpage>6035</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkw550</pub-id>
</citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lambert</surname>
<given-names>S. A.</given-names>
</name>
<name>
<surname>Jolma</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Campitelli</surname>
<given-names>L. F.</given-names>
</name>
<name>
<surname>Das</surname>
<given-names>P. K.</given-names>
</name>
<name>
<surname>Yin</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Albu</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>The human transcription factors</article-title>. <source>Cell</source> <volume>172</volume>, <fpage>650</fpage>&#x2013;<lpage>665</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2018.01.029</pub-id>
</citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lee</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Gorkin</surname>
<given-names>D. U.</given-names>
</name>
<name>
<surname>Baker</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Strober</surname>
<given-names>B. J.</given-names>
</name>
<name>
<surname>Asoni</surname>
<given-names>A. L.</given-names>
</name>
<name>
<surname>McCallion</surname>
<given-names>A. S.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>A method to predict the impact of regulatory variants from DNA sequence</article-title>. <source>Nat. Genet.</source> <volume>47</volume>, <fpage>955</fpage>&#x2013;<lpage>961</lpage>. <pub-id pub-id-type="doi">10.1038/ng.3331</pub-id>
</citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lee</surname>
<given-names>R. C.</given-names>
</name>
<name>
<surname>Feinbaum</surname>
<given-names>R. L.</given-names>
</name>
<name>
<surname>Ambros</surname>
<given-names>V.</given-names>
</name>
</person-group> (<year>1993</year>). <article-title>The <italic>C. elegans</italic> heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14</article-title>. <source>Cell</source> <volume>75</volume>, <fpage>843</fpage>&#x2013;<lpage>854</lpage>. <pub-id pub-id-type="doi">10.1016/0092-8674(93)90529-y</pub-id>
</citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lewis</surname>
<given-names>B. P.</given-names>
</name>
<name>
<surname>Burge</surname>
<given-names>C. B.</given-names>
</name>
<name>
<surname>Bartel</surname>
<given-names>D. P.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are MicroRNA targets</article-title>. <source>Cell</source> <volume>120</volume>, <fpage>15</fpage>&#x2013;<lpage>20</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2004.12.035</pub-id>
</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lex</surname>
<given-names>R. K.</given-names>
</name>
<name>
<surname>Ji</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Falkenstein</surname>
<given-names>K. N.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Henry</surname>
<given-names>J. L.</given-names>
</name>
<name>
<surname>Ji</surname>
<given-names>H.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>GLI transcriptional repression regulates tissue-specific enhancer activity in response to Hedgehog signaling</article-title>. <source>Elife</source> <volume>9</volume>, <fpage>e50670</fpage>. <pub-id pub-id-type="doi">10.7554/elife.50670</pub-id>
</citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lex</surname>
<given-names>R. K.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Ji</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Falkenstein</surname>
<given-names>K. N.</given-names>
</name>
<name>
<surname>Schuler</surname>
<given-names>K. E.</given-names>
</name>
<name>
<surname>Windsor</surname>
<given-names>K. E.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>GLI transcriptional repression is inert prior to Hedgehog pathway activation</article-title>. <source>Nat. Commun.</source> <volume>13</volume>, <fpage>808</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-022-28485-4</pub-id>
</citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Purushotham</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Harrison</surname>
<given-names>J. K.</given-names>
</name>
<name>
<surname>Hsu</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Zhuo</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Fan</surname>
<given-names>C.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>WashU Epigenome browser update 2022</article-title>. <source>Nucleic Acids Res.</source> <volume>50</volume>, <fpage>W774</fpage>&#x2013;<lpage>W781</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkac238</pub-id>
</citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>Q. V.</given-names>
</name>
<name>
<surname>Dixon</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Verma</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Rosen</surname>
<given-names>B. P.</given-names>
</name>
<name>
<surname>Gordillo</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Luo</surname>
<given-names>R.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Genome-scale screens identify JNK&#x2013;JUN signaling as a barrier for pluripotency exit and endoderm differentiation</article-title>. <source>Nat. Genet.</source> <volume>51</volume>, <fpage>999</fpage>&#x2013;<lpage>1010</lpage>. <pub-id pub-id-type="doi">10.1038/s41588-019-0408-9</pub-id>
</citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Yao</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Kai</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Hsu</surname>
<given-names>J. Y.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Transcription factor competition at the &#x3b3;-globin promoters controls hemoglobin switching</article-title>. <source>Nat. Genet.</source> <volume>53</volume>, <fpage>511</fpage>&#x2013;<lpage>520</lpage>. <pub-id pub-id-type="doi">10.1038/s41588-021-00798-y</pub-id>
</citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Luo</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Yan</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Oh</surname>
<given-names>J. W.</given-names>
</name>
<name>
<surname>Xi</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Shigaki</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Wong</surname>
<given-names>W.</given-names>
</name>
<etal/>
</person-group> (<year>2023</year>). <article-title>Dynamic network-guided CRISPRi screen identifies CTCF-loop-constrained nonlinear enhancer gene regulatory activity during cell state transitions</article-title>. <source>Nat. Genet.</source> <volume>55</volume>, <fpage>1336</fpage>&#x2013;<lpage>1346</lpage>. <pub-id pub-id-type="doi">10.1038/s41588-023-01450-7</pub-id>
</citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Luo</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Hitz</surname>
<given-names>B. C.</given-names>
</name>
<name>
<surname>Gabdank</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Hilton</surname>
<given-names>J. A.</given-names>
</name>
<name>
<surname>Kagda</surname>
<given-names>M. S.</given-names>
</name>
<name>
<surname>Lam</surname>
<given-names>B.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>New developments on the Encyclopedia of DNA Elements (ENCODE) data portal</article-title>. <source>Nucleic Acids Res.</source> <volume>48</volume>, <fpage>D882</fpage>&#x2013;<lpage>D889</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkz1062</pub-id>
</citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Martin</surname>
<given-names>F. J.</given-names>
</name>
<name>
<surname>Amode</surname>
<given-names>M. R.</given-names>
</name>
<name>
<surname>Aneja</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Austine-Orimoloye</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Azov</surname>
<given-names>A. G.</given-names>
</name>
<name>
<surname>Barnes</surname>
<given-names>I.</given-names>
</name>
<etal/>
</person-group> (<year>2023</year>). <article-title>Ensembl 2023</article-title>. <source>Nucleic Acids Res.</source> <volume>51</volume>, <fpage>D933</fpage>&#x2013;<lpage>D941</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkac958</pub-id>
</citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>McGeary</surname>
<given-names>S. E.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>K. S.</given-names>
</name>
<name>
<surname>Shi</surname>
<given-names>C. Y.</given-names>
</name>
<name>
<surname>Pham</surname>
<given-names>T. M.</given-names>
</name>
<name>
<surname>Bisaria</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Kelley</surname>
<given-names>G. M.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>The biochemical basis of microRNA targeting efficacy</article-title>. <source>Science</source> <volume>366</volume>, <fpage>eaav1741</fpage>. <pub-id pub-id-type="doi">10.1126/science.aav1741</pub-id>
</citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mo</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Luo</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Davis</surname>
<given-names>F. P.</given-names>
</name>
<name>
<surname>Mukamel</surname>
<given-names>E. A.</given-names>
</name>
<name>
<surname>Henry</surname>
<given-names>G. L.</given-names>
</name>
<name>
<surname>Nery</surname>
<given-names>J. R.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>Epigenomic landscapes of retinal rods and cones</article-title>. <source>eLife</source> <volume>5</volume>, <fpage>e11613</fpage>. <pub-id pub-id-type="doi">10.7554/elife.11613</pub-id>
</citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Moyers</surname>
<given-names>B. A.</given-names>
</name>
<name>
<surname>Partridge</surname>
<given-names>E. C.</given-names>
</name>
<name>
<surname>Mackiewicz</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Betti</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Darji</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Meadows</surname>
<given-names>S. K.</given-names>
</name>
<etal/>
</person-group> (<year>2023</year>). <article-title>Characterization of human transcription factor function and patterns of gene regulation in HepG2 cells</article-title>. <source>Genome Res.</source> <volume>33</volume>, <fpage>1879</fpage>&#x2013;<lpage>1892</lpage>. <pub-id pub-id-type="doi">10.1101/gr.278205.123</pub-id>
</citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Oh</surname>
<given-names>J. W.</given-names>
</name>
<name>
<surname>Beer</surname>
<given-names>M. A.</given-names>
</name>
</person-group> (<year>2024</year>). <article-title>Gapped-kmer sequence modeling robustly identifies regulatory vocabularies and distal enhancers conserved between evolutionarily distant mammals</article-title>. <source>Nat. Commun.</source> <volume>15</volume>, <fpage>6464</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-024-50708-z</pub-id>
</citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Perissi</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Jepsen</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Glass</surname>
<given-names>C. K.</given-names>
</name>
<name>
<surname>Rosenfeld</surname>
<given-names>M. G.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Deconstructing repression: evolving models of co-repressor action</article-title>. <source>Nat. Rev. Genet.</source> <volume>11</volume>, <fpage>109</fpage>&#x2013;<lpage>123</lpage>. <pub-id pub-id-type="doi">10.1038/nrg2736</pub-id>
</citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Perk</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Iavarone</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Benezra</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Id family of helix-loop-helix proteins in cancer</article-title>. <source>Nat. Rev. Cancer</source> <volume>5</volume>, <fpage>603</fpage>&#x2013;<lpage>614</lpage>. <pub-id pub-id-type="doi">10.1038/nrc1673</pub-id>
</citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Razavi-Mohseni</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>Y. A.</given-names>
</name>
<name>
<surname>Shigaki</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Ho</surname>
<given-names>S. W. T.</given-names>
</name>
<name>
<surname>Tan</surname>
<given-names>P.</given-names>
</name>
<etal/>
</person-group> (<year>2024</year>). <article-title>Machine learning identifies activation of RUNX/AP-1 as drivers of mesenchymal and fibrotic regulatory programs in gastric cancer</article-title>. <source>Genome Res.</source> <volume>34</volume>, <fpage>680</fpage>&#x2013;<lpage>695</lpage>. <pub-id pub-id-type="doi">10.1101/gr.278565.123</pub-id>
</citation>
</ref>
<ref id="B42">
<citation citation-type="web">
<person-group person-group-type="author">
<name>
<surname>Reese</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Williams</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Balderrama-Gutierrez</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Wyman</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>&#xc7;elik</surname>
<given-names>M. H.</given-names>
</name>
<name>
<surname>Rebboah</surname>
<given-names>E.</given-names>
</name>
<etal/>
</person-group> (<year>2023</year>). <article-title>The ENCODE4 long-read RNA-seq collection reveals distinct classes of transcript structure diversity</article-title>. <comment>Available at: <ext-link ext-link-type="uri" xlink:href="https://www.biorxiv.org/content/10.1101/2023.05.15.540865v1">https://www.biorxiv.org/content/10.1101/2023.05.15.540865v1</ext-link> (Accessed July 16, 2024)</comment>.</citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Schones</surname>
<given-names>D. E.</given-names>
</name>
<name>
<surname>Smith</surname>
<given-names>A. D.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>M. Q.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Statistical significance of cis-regulatory modules</article-title>. <source>BMC Bioinforma.</source> <volume>8</volume>, <fpage>19</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-8-19</pub-id>
</citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sheng</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Ho</surname>
<given-names>S. W. T.</given-names>
</name>
<name>
<surname>Ooi</surname>
<given-names>W. F.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Xing</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Padmanabhan</surname>
<given-names>N.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Integrative epigenomic and high-throughput functional enhancer profiling reveals determinants of enhancer heterogeneity in gastric cancer</article-title>. <source>Genome Med.</source> <volume>13</volume>, <fpage>158</fpage>. <pub-id pub-id-type="doi">10.1186/s13073-021-00970-3</pub-id>
</citation>
</ref>
<ref id="B45">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Shigaki</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2024</year>). <source>Learning the sequence determinants of mammalian transcriptional gene regulation across cell-types</source>. <publisher-loc>Baltimore, Maryland</publisher-loc>: <publisher-name>Johns Hopkins University</publisher-name>.</citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shigaki</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Adato</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Adhikari</surname>
<given-names>A. N.</given-names>
</name>
<name>
<surname>Dong</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Hawkins-Hooker</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Inoue</surname>
<given-names>F.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Integration of multiple epigenomic marks improves prediction of variant impact in saturation mutagenesis reporter assay</article-title>. <source>Hum. Mutat.</source> <volume>40</volume>, <fpage>1280</fpage>&#x2013;<lpage>1291</lpage>. <pub-id pub-id-type="doi">10.1002/humu.23797</pub-id>
</citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tay</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Thomson</surname>
<given-names>A. M.</given-names>
</name>
<name>
<surname>Lim</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Rigoutsos</surname>
<given-names>I.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>MicroRNAs to Nanog, Oct4 and Sox2 coding regions modulate embryonic stem cell differentiation</article-title>. <source>Nature</source> <volume>455</volume>, <fpage>1124</fpage>&#x2013;<lpage>1128</lpage>. <pub-id pub-id-type="doi">10.1038/nature07299</pub-id>
</citation>
</ref>
<ref id="B48">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Teijeiro</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Majumdar</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Gonz&#xe1;lez</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Rickert</surname>
<given-names>R. W.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>C.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>DICER1 is essential for self-renewal of human embryonic stem cells</article-title>. <source>Stem Cell Rep.</source> <volume>11</volume>, <fpage>616</fpage>&#x2013;<lpage>625</lpage>. <pub-id pub-id-type="doi">10.1016/j.stemcr.2018.07.013</pub-id>
</citation>
</ref>
<ref id="B49">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tokar</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Pastrello</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Rossos</surname>
<given-names>A. E. M.</given-names>
</name>
<name>
<surname>Abovsky</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Hauschild</surname>
<given-names>A.-C.</given-names>
</name>
<name>
<surname>Tsay</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>mirDIP 4.1&#x2014;integrative database of human microRNA target predictions</article-title>. <source>Nucleic Acids Res.</source> <volume>46</volume>, <fpage>D360</fpage>&#x2013;<lpage>D370</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkx1144</pub-id>
</citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vandewalle</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Van Roy</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Berx</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>The role of the ZEB family of transcription factors in development and disease</article-title>. <source>Cell Mol. Life Sci.</source> <volume>66</volume>, <fpage>773</fpage>&#x2013;<lpage>787</lpage>. <pub-id pub-id-type="doi">10.1007/s00018-008-8465-8</pub-id>
</citation>
</ref>
<ref id="B51">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xi</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Beer</surname>
<given-names>M. A.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Local epigenomic state cannot discriminate interacting and non-interacting enhancer&#x2013;promoter pairs with high accuracy</article-title>. <source>PLOS Comput. Biol.</source> <volume>14</volume>, <fpage>e1006625</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1006625</pub-id>
</citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xi</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Beer</surname>
<given-names>M. A.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Loop competition and extrusion model predicts CTCF interaction specificity</article-title>. <source>Nat. Commun.</source> <volume>12</volume>, <fpage>1046</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-021-21368-0</pub-id>
</citation>
</ref>
<ref id="B53">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xing</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Ooi</surname>
<given-names>W. F.</given-names>
</name>
<name>
<surname>Tan</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Qamra</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>P.-H.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Z.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Genomic and epigenomic EBF1 alterations modulate TERT expression in gastric cancer</article-title>. <source>J. Clin. Investigation</source> <volume>130</volume>, <fpage>3005</fpage>&#x2013;<lpage>3020</lpage>. <pub-id pub-id-type="doi">10.1172/JCI126726</pub-id>
</citation>
</ref>
<ref id="B54">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>K. K.</given-names>
</name>
<name>
<surname>Law</surname>
<given-names>J. H.</given-names>
</name>
<name>
<surname>Chua</surname>
<given-names>J. S.</given-names>
</name>
<name>
<surname>Sheng</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Flores</surname>
<given-names>N. M.</given-names>
</name>
<etal/>
</person-group> (<year>2023</year>). <article-title>Comprehensive molecular phenotyping of ARID1A-deficient gastric cancer reveals pervasive epigenomic reprogramming and therapeutic opportunities</article-title>. <source>Gut</source> <volume>72</volume>, <fpage>1651</fpage>&#x2013;<lpage>1663</lpage>. <pub-id pub-id-type="doi">10.1136/gutjnl-2022-328332</pub-id>
</citation>
</ref>
<ref id="B55">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Papagiannakopoulos</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Pan</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Thomson</surname>
<given-names>J. A.</given-names>
</name>
<name>
<surname>Kosik</surname>
<given-names>K. S.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>MicroRNA-145 regulates OCT4, SOX2, and KLF4 and represses pluripotency in human embryonic stem cells</article-title>. <source>Cell</source> <volume>137</volume>, <fpage>647</fpage>&#x2013;<lpage>658</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2009.02.038</pub-id>
</citation>
</ref>
<ref id="B56">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yan</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Qiu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Ribeiro dos Santos</surname>
<given-names>A. M.</given-names>
</name>
<name>
<surname>Yin</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y. E.</given-names>
</name>
<name>
<surname>Vinckier</surname>
<given-names>N.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Systematic analysis of binding of transcription factors to noncoding variants</article-title>. <source>Nature</source> <volume>591</volume>, <fpage>147</fpage>&#x2013;<lpage>151</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-021-03211-0</pub-id>
</citation>
</ref>
<ref id="B57">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yao</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Tycko</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Oh</surname>
<given-names>J. W.</given-names>
</name>
<name>
<surname>Bounds</surname>
<given-names>L. R.</given-names>
</name>
<name>
<surname>Gosai</surname>
<given-names>S. J.</given-names>
</name>
<name>
<surname>Lataniotis</surname>
<given-names>L.</given-names>
</name>
<etal/>
</person-group> (<year>2024</year>). <article-title>Multicenter integrated analysis of noncoding CRISPRi screens</article-title>. <source>Nat. Methods</source> <volume>21</volume>, <fpage>723</fpage>&#x2013;<lpage>734</lpage>. <pub-id pub-id-type="doi">10.1038/s41592-024-02216-7</pub-id>
</citation>
</ref>
<ref id="B58">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Youmans</surname>
<given-names>D. T.</given-names>
</name>
<name>
<surname>Gooding</surname>
<given-names>A. R.</given-names>
</name>
<name>
<surname>Dowell</surname>
<given-names>R. D.</given-names>
</name>
<name>
<surname>Cech</surname>
<given-names>T. R.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Competition between PRC2.1 and 2.2 subcomplexes regulates PRC2 chromatin occupancy in human stem cells</article-title>. <source>Mol. Cell</source> <volume>81</volume>, <fpage>488</fpage>&#x2013;<lpage>501.e9</lpage>. <pub-id pub-id-type="doi">10.1016/j.molcel.2020.11.044</pub-id>
</citation>
</ref>
<ref id="B59">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhao</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Popel</surname>
<given-names>A. S.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Mechanistic computational models of MicroRNA-mediated signaling networks in human diseases</article-title>. <source>Int. J. Mol. Sci.</source> <volume>20</volume>, <fpage>421</fpage>. <pub-id pub-id-type="doi">10.3390/ijms20020421</pub-id>
</citation>
</ref>
</ref-list>
</back>
</article>