<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Plant Sci.</journal-id>
<journal-title>Frontiers in Plant Science</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Plant Sci.</abbrev-journal-title>
<issn pub-type="epub">1664-462X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpls.2017.00607</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Plant Science</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Characterization of the Polycomb-Group Mark H3K27me3 in Unicellular Algae</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Mikulski</surname> <given-names>Pawel</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Komarynets</surname> <given-names>Olga</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/417267/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Fachinelli</surname> <given-names>Fabio</given-names></name>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/63061/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Weber</surname> <given-names>Andreas P.M.</given-names></name>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/12158/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Schubert</surname> <given-names>Daniel</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="author-notes" rid="fn001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/32001/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Institute of Biology, Free University of Berlin</institution> <country>Berlin, Germany</country></aff>
<aff id="aff2"><sup>2</sup><institution>Institute of Genetics, Heinrich-Heine-Universit&#x00E4;t D&#x00FC;sseldorf</institution> <country>D&#x00FC;sseldorf, Germany</country></aff>
<aff id="aff3"><sup>3</sup><institution>Faculty of Medicine, University of Geneva</institution> <country>Geneva, Switzerland</country></aff>
<aff id="aff4"><sup>4</sup><institution>Institute of Plant Biochemistry, Heinrich-Heine-Universit&#x00E4;t D&#x00FC;sseldorf</institution> <country>D&#x00FC;sseldorf, Germany</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: <italic>Dao-Xiu Zhou, Universit&#x00E9; Paris-Sud, France</italic></p></fn>
<fn fn-type="edited-by"><p>Reviewed by: <italic>Yuhai Cui, Agriculture and Agri-Food Canada, Canada; Crisanto Gutierrez, Consejo Superior de Investigaciones Cient&#x00ED;ficas (CSIC), Spain</italic></p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x002A;Correspondence: <italic>Daniel Schubert, <email>dan.schubert@fu-berlin.de</email></italic></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Plant Genetics and Genomics, a section of the journal Frontiers in Plant Science</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>26</day>
<month>04</month>
<year>2017</year>
</pub-date>
<pub-date pub-type="collection">
<year>2017</year>
</pub-date>
<volume>8</volume>
<elocation-id>607</elocation-id>
<history>
<date date-type="received">
<day>18</day>
<month>02</month>
<year>2017</year>
</date>
<date date-type="accepted">
<day>04</day>
<month>04</month>
<year>2017</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2017 Mikulski, Komarynets, Fachinelli, Weber and Schubert.</copyright-statement>
<copyright-year>2017</copyright-year>
<copyright-holder>Mikulski, Komarynets, Fachinelli, Weber and Schubert</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Polycomb Group (PcG) proteins mediate chromatin repression in plants and animals by catalyzing H3K27 methylation and H2AK118/119 mono-ubiquitination through the activity of the Polycomb repressive complex 2 (PRC2) and PRC1, respectively. PcG proteins were extensively studied in higher plants, but their function and target genes in unicellular branches of the green lineage remain largely unknown. To shed light on PcG function and <italic>modus operandi</italic> in a broad evolutionary context, we demonstrate phylogenetic relationship of core PRC1 and PRC2 proteins and H3K27me3 biochemical presence in several unicellular algae of different phylogenetic subclades. We focus then on one of the species, the model red alga <italic>Cyanidioschizon merolae</italic>, and show that H3K27me3 occupies both, genes and repetitive elements, and mediates the strength of repression depending on the differential occupancy over gene bodies. Furthermore, we report that H3K27me3 in <italic>C. merolae</italic> is enriched in telomeric and subtelomeric regions of the chromosomes and has unique preferential binding toward intein-containing genes involved in protein splicing. Thus, our study gives important insight for Polycomb-mediated repression in lower eukaryotes, uncovering a previously unknown link between H3K27me3 targets and protein splicing.</p>
</abstract>
<kwd-group>
<kwd>Polycomb</kwd>
<kwd>PcG</kwd>
<kwd>H3K27me3</kwd>
<kwd><italic>Cyanidioschyzon merolae</italic></kwd>
<kwd>algae</kwd>
<kwd>intein</kwd>
<kwd>protein splicing</kwd>
<kwd>telomere</kwd>
</kwd-group>
<contract-num rid="cn001">SFB973</contract-num>
<contract-num rid="cn002">EpiTRAITS</contract-num>
<contract-sponsor id="cn001">Deutsche Forschungsgemeinschaft<named-content content-type="fundref-id">10.13039/501100001659</named-content></contract-sponsor>
<contract-sponsor id="cn002">Seventh Framework Programme<named-content content-type="fundref-id">10.13039/100011102</named-content></contract-sponsor>
<counts>
<fig-count count="10"/>
<table-count count="1"/>
<equation-count count="0"/>
<ref-count count="84"/>
<page-count count="17"/>
<word-count count="0"/>
</counts>
</article-meta>
</front>
<body>
<sec><title>Introduction</title>
<p>In the eukaryotic cells, transcription state is dependent on the underlying chromatin state. The fundamental structure of chromatin is based on a nucleosome, a complex of 147bp-long fragments of DNA wrapped around the core histone proteins (H2A, H2B, H3, H4). Chromatin state can be influenced by post-translational modifications deposited on the histones, either by direct structural changes in the nucleosomes or recruitment/displacement of secondary proteins involved in chromatin remodeling or transcription. The presence of particular histone modifications often determines the type of the chromatin and correlates with transcriptional activity of the target DNA. Among those, tri-methylation of lysine 27 on histone H3 (H3K27me3) and mono-ubiquitination of histone H2AK118/H2AK119 (H2AK118ub/H2AK119ub) are commonly associated with transcriptionally silent facultative heterochromatin.</p>
<p>Deposition of H3K27me3 and H2AK118ub/H2AK119ub is mediated by Polycomb group (PcG) proteins, whose function was initially shown to control developmentally regulated processes and maintain cell identity (<xref ref-type="bibr" rid="B52">Papp and M&#x00FC;ller, 2006</xref>; <xref ref-type="bibr" rid="B61">Schuettengruber et al., 2007</xref>) in both, animals [reviewed in <xref ref-type="bibr" rid="B62">Schwartz and Pirrotta (2008)</xref>] and plants [reviewed in <xref ref-type="bibr" rid="B34">K&#x00F6;hler and Villar (2008)</xref>]. PcG proteins form distinct complexes, like Polycomb Repressive Complex 2 (PRC2), involved in H3K27 trimethylation [in metazoans Pcl-PRC2 complex (<xref ref-type="bibr" rid="B47">Nekrasov et al., 2007</xref>; <xref ref-type="bibr" rid="B46">M&#x00FC;ller and Verrijzer, 2009</xref>)], and Polycomb Repressive Complex 1 (PRC1), responsible for H2A mono-ubiquitination. PRC2 in <italic>Drosophila melanogaster</italic>, where it was initially discovered, consists of four core subunits: Enhancer of zeste [E(z)], a catalytic component needed for methylation of H3K27; Extra Sex Combs (ESC), a WD40 motif-containing protein that scaffolds interactions within the complex; Suppressor of zeste (Su(z)12), a Zinc Finger subunit essential for binding of PRC2 to nucleosomes and p55, a nucleosome remodeling factor (<xref ref-type="bibr" rid="B63">Schwartz and Pirrotta, 2013</xref>). In turn, PRC1 is formed by: dRING/Sex Combs Extra (Sce); Posterior Sex Combs (Psc), both responsible for mono-ubiquitination activity; Polyhomeotic (Ph), essential for maintaining protein-protein interactions; and Polycomb (Pc), involved in recruitment of the complex to chromatin; and Sex comb on midleg (Scm), important for spreading of PcG silencing (<xref ref-type="bibr" rid="B63">Schwartz and Pirrotta, 2013</xref>). Both complexes are functionally connected with each other. In the canonical hierarchical model, initially introduced H3K27me3 mark is recognized by PRC1 and followed by H2A mono-ubiquitination to maintain the repression. However, it was shown that the mechanism of PcG-mediated repression can happen as well in the opposite order, with PRC1 introducing its modification prior to PRC2 activity (<xref ref-type="bibr" rid="B79">Yang et al., 2013</xref>; <xref ref-type="bibr" rid="B15">Del Prete et al., 2015</xref>).</p>
<p>Polycomb-group complexes were identified also in the plant kingdom. Core PRC1 complex in model flowering plant <italic>Arabidopsis thaliana</italic> consist of AtRING1a/b (equivalent to dRING/Sce), AtBMI1a/b/c and EMF1 (equivalent to Psc), and LHP1 (equivalent to Pc). Although a wide range of PRC1-associated proteins exists (<xref ref-type="bibr" rid="B13">Chen et al., 2016</xref>), it is unclear how the combinations of different subunits depend on specific temporal and spatial conditions. In contrast to PRC1, composition of the <italic>Arabidopsis</italic> PRC2 complex is better characterized. <italic>Arabidopsis</italic> PRC2 subunits underwent gene duplication, resulting in the presence of different, partially redundant or independently acting PRC2 complexes (<xref ref-type="bibr" rid="B18">Derkacheva and Hennig, 2014</xref>). In the <italic>Arabidopsis</italic> genome the following PRC2 components can be identified: three E(z)-homologs &#x2013; CURLY LEAF (CLF), SWINGER (SWN), MEDEA (MEA); three Su(z)12-homologs &#x2013; EMBRYONIC FLOWER 2 (EMF2), VERNALIZATION 2 (VRN2), FERTILIZATION INDEPENDENT SEED 2 (FIS2); as well as single gene subunits: ESC-homolog &#x2013; FERTILIZATION INDEPENDENT ENDOSPERM (FIE) and p55-homologs &#x2013; MULTICOPY SUPPRESSOR OF IRA 1-5 (MSI1-5).</p>
<p>One of the aspects of PRC2 biology concerns its origins and the abundance in lower eukaryotes. Due to the absence of PRC2 in the model unicellular species <italic>Saccharomyces cerevisiae</italic> and <italic>S. pombe</italic> (<xref ref-type="bibr" rid="B35">Lachner et al., 2004</xref>; <xref ref-type="bibr" rid="B73">Veerappan et al., 2008</xref>), the PRC2 appearance was previously thought to co-occur with the emergence of multicellularity. However, high conservation of PRC2 subunits in both, animal and plant lineages implies that PcG genes appeared already in the last common unicellular ancestor (<xref ref-type="bibr" rid="B5">Bowman et al., 2007</xref>), before those two kingdoms diverged. In an elegant study, Shaver <italic>et al</italic> (<xref ref-type="bibr" rid="B65">Shaver et al., 2010</xref>) identified the novel computational PRC2-homologs in several unicellular species and showed that E(z)-homolog in unicellular green algae <italic>Chlamydomonas reinhardtii</italic> is responsible for mono- and di-methylation of lysine 27 on histone H3. As the tri-methylation activity of <italic>Chlamydomonas</italic> E(z) homolog was not found, this study highlights important differences in the functional conservation of PRC2 components. For instance, although H3K27 tri-methylation is considered to be a prominent histone mark in PRC2-mediated repression, there is a range of H3K27 methylation levels catalyzed by PRC2 that varies between plant and <italic>Drosophila</italic> or mammals. In <italic>Arabidopsis</italic>, PRC2 controls H3K27 tri-methylation and H3K27 di-methylation in the euchromatin (<xref ref-type="bibr" rid="B3">Bastow et al., 2004</xref>; <xref ref-type="bibr" rid="B39">Lindroth et al., 2004</xref>; <xref ref-type="bibr" rid="B67">Sung et al., 2006</xref>), with mono-methylation being catalyzed by ATXR5 and ATXR6 (<xref ref-type="bibr" rid="B28">Jacob and Michaels, 2009</xref>; <xref ref-type="bibr" rid="B27">Jacob et al., 2009</xref>). However, in metazoans homologs of ATXR5/6 were not found and their PRC2 complexes mediate H3K27 methylation in all contexts (<xref ref-type="bibr" rid="B45">M&#x00FC;ller et al., 2002</xref>; <xref ref-type="bibr" rid="B21">Ebert et al., 2004</xref>; <xref ref-type="bibr" rid="B23">Ferrari et al., 2014</xref>). Another difference between species is the type of DNA elements targeted by PRC2. <italic>Drosophila</italic> PRC2 targets both, genes and transposable elements/repetitive sequences (<xref ref-type="bibr" rid="B81">Yin et al., 2011</xref>), whereas in <italic>Arabidopsis</italic> the H3K27me3 mark is excluded from the majority of transposable elements/repetitive sequences (<xref ref-type="bibr" rid="B36">Lafos et al., 2011</xref>; <xref ref-type="bibr" rid="B16">Deleris et al., 2012</xref>; <xref ref-type="bibr" rid="B53">Park et al., 2012</xref>).</p>
<p>Even though PRC2-mediated repression has been extensively studied in higher plants and animals, its characterization in lower eukaryotes gathered much less attention. The existence of H3K27me3 in lower eukaryotes is largely unknown (<xref ref-type="bibr" rid="B65">Shaver et al., 2010</xref>) and distribution of the mark in the genome and its function were reported only for a handful of species, such as <italic>Neurospora crassa</italic>, <italic>Phaeodactylum tricornutum</italic>, or <italic>Tetrahymena thermophila</italic>. In <italic>N. crassa</italic> H3K27me3 is arranged into broad domains covering 774 genes connected with a full spectrum of functions (<xref ref-type="bibr" rid="B29">Jamieson et al., 2013</xref>). In <italic>T. thermophila</italic>, the mark associates with the developmentally regulated genome rearrangements as it occupies sequences eliminated during differentiation of the macronucleus (<xref ref-type="bibr" rid="B40">Liu et al., 2007</xref>). In <italic>P. tricornutum</italic>, H3K27me3 covers transposable elements and genes, and its targets are involved in, i.e., signal transduction, development and cell cycle control (<xref ref-type="bibr" rid="B74">Veluchamy et al., 2015</xref>).</p>
<p>Given the distinct degrees of phylogenetic conservation of PRC2 and PRC1, and poor characterization of H3K27me3 in lower eukaryotes, we sought to investigate the presence of Polycomb-group homologs and H3K27me3 abundance in several representative unicellular, photosynthetically active eukaryotes. Our scope included the representatives of red algae, green algae and <italic>Glaucophyta</italic> that vary in genome size, genome architecture, ecological niche and metabolism (<xref ref-type="bibr" rid="B42">Matsuzaki et al., 2004</xref>; <xref ref-type="bibr" rid="B51">Palenik et al., 2007</xref>; <xref ref-type="bibr" rid="B78">Worden et al., 2009</xref>; <xref ref-type="bibr" rid="B44">Merchant et al., 2010</xref>; <xref ref-type="bibr" rid="B56">Price et al., 2012</xref>). After an initial small-scale screen on H3K27me3 presence, we focused on one of the species, <italic>Cyanidioschyzon merolae</italic>, to study genome-wide occupation of H3K27me3 and characterize its targets. <italic>C. merolae</italic> is a unicellular red alga, living in highly acidic environment with high temperatures. It contains a small (16 Mb) genome which was fully sequenced as the first algal genome (<xref ref-type="bibr" rid="B42">Matsuzaki et al., 2004</xref>) and assembled as first 100% complete eukaryotic genome (<xref ref-type="bibr" rid="B49">Nozaki et al., 2007</xref>). Interestingly, the genome shows extremely simplified structure that contains almost exclusively intron-lacking genes (only 26 genes contain an intron), very low percentage (0,7%) of transposable elements and a novel class of a repetitive element, corresponding to the truncated ORF from White spot syndrome virus (WSV repeat) (<xref ref-type="bibr" rid="B49">Nozaki et al., 2007</xref>). Given its unicellularity, evolutionary ancestry, primitive architecture of the genome and availability of sequencing data, we argue that <italic>C. merolae</italic> is a suitable model for studying chromatin repression in an evolutionary context.</p>
<p>Our results confirmed the high conservation of PRC2 core members in lower eukaryotes and the widespread presence of H3K27me3 modification. We report that in <italic>C. merolae</italic> H3K27me3 mark targets both, genes and repetitive elements and is anti-correlated with transcriptional activity. We show that H3K27me3 distribution over its targets is not uniform, but can be grouped into several different clusters that correlate with different levels of repression. Furthermore, we demonstrate that H3K27me3 has a preferential chromosomal localization toward telomeric and subtelomeric regions. For the first time, we also reveal that H3K27me3 target genes are enriched in the functional class of intein-mediated protein splicing. Moreover, by deposition of RNA- and ChIP-sequencing data, we provide a resource for studies on the chromatin and transcriptome in the model red alga <italic>C. merolae</italic>.</p>
</sec>
<sec id="s1" sec-type="materials|methods">
<title>Materials and Methods</title>
<sec><title>Phylogenetic Analysis and Homology Search</title>
<p>Sequences of PRC1 and PRC2 components from <italic>A. thaliana</italic> and <italic>D. melanogaster</italic> were selected from TAIR10 and NCBI databases, respectively, and used as the queries. Homology searches were done using BLAST against organism-specific databases at: <ext-link ext-link-type="uri" xlink:href="https://phytozome.jgi.doe.gov/pz/portal.html">https://phytozome.jgi.doe.gov/pz/portal.html</ext-link> (<italic>C. reinhardtii</italic> v5.5 and <italic>Micromonas pusilla</italic> CCMP1545 v3.0), <ext-link ext-link-type="uri" xlink:href="http://merolae.biol.s.u-tokyo.ac.jp/">http://merolae.biol.s.u-tokyo.ac.jp/</ext-link> (<italic>C. merolae</italic>), <ext-link ext-link-type="uri" xlink:href="http://genome.jgi.doe.gov/Ostta4/Ostta4.home.html">http://genome.jgi.doe.gov/Ostta4/Ostta4.home.html</ext-link> (<italic>Ostreococcus tauri</italic> v2.0), <ext-link ext-link-type="uri" xlink:href="http://cyanophora.rutgers.edu/cyanophora/home.php">http://cyanophora.rutgers.edu/cyanophora/home.php</ext-link> (<italic>Cyanophora paradoxa</italic>).</p>
<p>The highest scoring candidate proteins were used for BLAST searches against the TAIR10 genome annotation to confirm a reciprocal match to the protein used as an initial query. The sequences were aligned using ClustalXv2.0 (<xref ref-type="bibr" rid="B37">Larkin et al., 2007</xref>) and conserved blocks were selected by gBlocks v0.91b (<xref ref-type="bibr" rid="B68">Talavera and Castresana, 2007</xref>). For substitution model estimation, ProtTestv3.4 was employed (<xref ref-type="bibr" rid="B1">Abascal et al., 2005</xref>). Phylogenetic trees were created using a Bayesian approach in Mr Bayesv3.2 (<xref ref-type="bibr" rid="B60">Ronquist et al., 2012</xref>), self-compiled developer version r1067 with implemented LG model. The runs were done with 20 mln generations. Sequences from <italic>D. melanogaster</italic> were selected as an out group. Consensus trees and alignments were visualized with Figtree v1.4.2 (<xref ref-type="bibr" rid="B58">Rambaut, 2009</xref>) and Jalviewv2 (<xref ref-type="bibr" rid="B75">Waterhouse et al., 2009</xref>), respectively.</p>
</sec>
<sec><title>Algae Growth</title>
<p><italic>Cyanidioschyzon merolae</italic> cells were grown at 42&#x00B0;C with shaking at 150 rpm. The cultures were kept under continuous light, with light intensity of 70 &#x03BC;mol.</p>
</sec>
<sec><title>Western Blot</title>
<p>Around 50.000 algae cells were diluted in Laemmli&#x2019;s sample buffer, and 3&#x2013;7 &#x03BC;l of total protein/lane was subjected to sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) using 12% SDS-polyacrylamide gel. The resolved proteins were transferred onto polyvinylidenedifluoride (PVDF) membranes (Immobilon-P membrane, Milipore) using a Vertical Electrophoresis Cell (Bio-Rad). Following transfer, membranes were washed with phosphate buffered saline containing 0.05% Tween-20 (PBST) and then blocked in 5% milk diluted in PBST for 1 h at room temperature. The membranes were then incubated overnight with anti_H3K27me3 (N 07-449, Millipore or C15410195, Diagenode) and anti H3 (ab1791-100, Abcam) rabbit polyclonal antibodies 1:2000 in PBST containing 5% milk at 4&#x00B0;C. After washing three times for 10 min in PBST, membranes were incubated with peroxidase-labeled secondary antibody for 1 h at room temperature. The membranes were washed three times for 10 min in PBST, incubated with SuperSignal West Femto Chemiluminescence Substrate (Thermo Scientific) for 1 min, and exposed for 1&#x2013;2 min. Commercial histone extract from calf thymus (Sigma, #9064-47-5) was used as a positive control for H3K27me3 detection.</p>
</sec>
<sec><title>Competition Assay</title>
<p>Pre-incubation of anti-H3K27me3 antibody (C15410195, Diagenode) and H3K27me3 peptide (Intavis) was done for 2 h in RT in PBST, with occasional mixing. The peptide-to-antibody molar ratio was 50:1. A control solution &#x2013; PBST with just antibody at 2X final concentration &#x2013; was used at the same time. At the end of the pre-incubation, 4% BSA blocking solution was added to the peptide/antibody mixture to a final concentration of 3% BSA, mixed briefly and added to the membranes. After washing three times for 10 min in PBST, membranes were incubated with peroxidase-labeled secondary antibody for 1 h at room temperature. The membranes were washed three times for 10 min in PBST, incubated with SuperSignal West Femto Chemiluminescence Substrate (Thermo Scientific) for 1 min, and exposed for 1&#x2013;2 min.</p>
</sec>
<sec><title>Chromatin Immunoprecipitation</title>
<p><italic>Cyanidioschyzon merolae</italic> cells were grown until late log-phase. 40 ml samples were fixed [Formaldehyde, 1% (v/v)] for 10 min, until addition of glycine (to final concentration of 125 mM, 5 min incubation). Superfluous formaldehyde was removed by three washes with ice cold PBS buffer and the remaining cell pellet was resuspended in 4 ml of Extraction Buffer (50 mM Tris-HCl pH 8, 10 mM EDTA, 1% SDS) with Protease Inhibitor Cocktail (Roche). The samples were sonicated for 5 min with 30 s ON/30 s OFF cycle using Bioruptor Plus device (Diagenode) and cleared by two rounds of centrifugation (13000 rpm, 4&#x00B0;C, 10 min). Subsequent steps were performed as in the Plant ChIP-seq kit protocol (Diagenode) with higher volume of sample taken aside as an input (1:5 of chromatin for IP). Immunoprecipitation was done using anti-H3K27me3 Polyclonal Premium antibody (C15410195, Diagenode) and, as a negative control, IgG fraction from rabbit (C15410206, Diagenode). Quality and fragment size of immunoprecipitated DNA and input samples were measured using agarose gel electrophoresis and Bioanalyzer 2100 (Agilent Technologies).</p>
</sec>
<sec><title>Real Time PCR</title>
<p>DNA samples obtained from ChIP were used for H3K27me3 enrichment analysis for several target genes by real-time quantitative PCR. Reactions were prepared using KAPA SYBR FAST qPCR Mastermix (KapaBiosystems), according to the manufacturer&#x2019;s protocol, and run on an iQ5 detection system (Biorad) using a 2-step program. Differences in H3K27me3 enrichment on target genes were scored by comparison of % recovery of input and standard error values. 5&#x2032;- > 3&#x2032; sequences of oligonucleotides used for amplification were as follows: CmMADS- forward: GGATGAGAAAGCGAGAAATACGA and reverse: TCACAATGCCGATCTCACAG; CmEIF-4A &#x2013; forward: TGTACGATATGATCCAGAGAAGAG and reverse: TGTAGATTTGCTCCTTGAAACC; Cm60S &#x2013; forward: AAGTTTCGCTGTACGCTTGG and reverse: TAACCAGGACCATATCGCCG.</p>
</sec>
<sec><title>ChIP-seq</title>
<p>DNA library was prepared with MicroPlex Library Preparation Kit (Diagenode) and sequenced on HiSeq 2000 sequencer (Illumina Inc.). Quality of paired-end raw reads was assessed using FastQC v0.11.4. The reads were trimmed according to the quality and mapped to the reference in Bowtie v2.2.6, with standard options. Peaks were called using MACS v2.1 with exclusion of clonal reads. The new annotation from the <italic>C. merolae</italic> Genome Project v3<sup><xref ref-type="fn" rid="fn01">1</xref></sup> was employed as a reference. The correlation between ChIP-seq replicates was scored using Pearson correlation and shown in Supplementary Figure <xref ref-type="supplementary-material" rid="SM3">S3A</xref>. The data was deposited in the Gene Expression Omnibus under the series GSE93913.</p>
</sec>
<sec><title>RNA Extraction and RNA-seq</title>
<p><italic>Cyanidioschyzon merolae</italic> cells were grown until late log-phase. RNA was isolated using RNeasy Plant Mini Kit (Qiagen), following a standard protocol. cDNA synthesis and DNase treatment were performed using RevertAid First Strand cDNA Synthesis Kit (Thermo Scientific) with oligo(dT) primers, according to a standard manual. The quality and concentration of samples were measured spectrometrically using Nanodrop 1000 (Thermo Scientific) and electrophoretically using Bioanalyzer 2100 (Agilent Technologies). Libraries were constructed using TruSeq RNA Library Prep kit (Illumina) with gel-free library purification based on Agencourt AMPure XP beads (Beckman Coulter, Inc.). The samples were sequenced on HiSeq 2000 device (Illumina Inc.). Paired-end raw sequencing reads were analyzed using Galaxy implementations (usegalaxy.org) of FastQC program and Tuxedo protocol (<xref ref-type="bibr" rid="B71">Trapnell et al., 2012</xref>). The new annotation from <italic>C. merolae</italic> Genome Project v3 was employed as a reference. The correlation between replicates in RNA-seq experiment was scored using Pearson correlation and shown in Supplementary Figure <xref ref-type="supplementary-material" rid="SM3">S4</xref>. The data was deposited in the Gene Expression Omnibus, under the series GSE93912.</p>
</sec>
<sec><title>Bioinformatic Secondary Analysis of NGS Data</title>
<p>Identified H3K27me3 peaks were annotated by intersection with the reference using Bedtools v2.17 (<xref ref-type="bibr" rid="B57">Quinlan and Hall, 2010</xref>). Sequences from unannotated peaks were translated <italic>in silico</italic> to ORF in HMMER2GO v0.17 software and the longest ORF was used for homology searches by BLASTP/Pfam against respective protein databases. Alternatively, unannotated peak sequences were used for homology searches by BLASTN against NCBI nucleotide databases: nr/nt and ref_seq. Homology searches were performed in BLAST+ standalone package v2.2.28+ (<xref ref-type="bibr" rid="B10">Camacho et al., 2009</xref>). Alignment threshold was set as follows: >50% alignment length and &#x003C;0.5 <italic>E</italic>-value. Complete genome records in nucleotide databases were used to form negative GI list and excluded from the BLASTN searches. Only the top alignment hit was included in the further analyses. Distance between peak and annotated genomic feature was obtained using &#x2018;closest&#x2019; command from Bedtoolsv2.17. Mapping of the reads was visualized in IGV v2.3 (<xref ref-type="bibr" rid="B69">Thorvaldsd&#x00F3;ttir et al., 2013</xref>). Clustering of H3K27me3 enrichment was done on deepTools2 (<xref ref-type="bibr" rid="B59">Ramirez et al., 2016</xref>). Gene ontology was inferred by using Singular Enrichment Analysis on the AgriGO server (<xref ref-type="bibr" rid="B19">Du et al., 2010</xref>) against complete GO list. Protein annotations were extracted from UniProtKB database and used as reference. To select significantly enriched GO terms, Fisher test with Yekutieli adjustment was used a statistical method. <italic>P</italic>-value was set to 0.05 and minimal number of entries kept at 5.</p>
</sec>
</sec>
<sec><title>Results</title>
<sec><title>Conservation of PRC1 and PRC2 Homologs</title>
<p>In order to assess evolutionary conservation of Polycomb complexes, we performed phylogenetic analysis in the species from various groups of lower plants. We focused on the representatives from subclades: Chlorophyta (green algae), Rhodophyta (red algae), Glaucophyta and Embryophyta. For comparison, a representative from Metazoa, <italic>D. melanogaster</italic>, was included as well. Using reciprocal BLAST searches with full length proteins from <italic>A. thaliana</italic> or <italic>D. melanogaster</italic>, we identified homologs of PRC1 and PRC2 members in several species (<bold>Table <xref ref-type="table" rid="T1">1</xref></bold>).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Number of identified homologs of PRC2 and PRC1 components in selected species.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left">Clade</th>
<td valign="top" align="left"></td>
<th valign="top" align="center">E(z)</th>
<th valign="top" align="center">ESC</th>
<th valign="top" align="center">Suz12</th>
<th valign="top" align="center">P55</th>
<th valign="top" align="center">BMI1/Psc</th>
<th valign="top" align="center">RING1A/Sce</th>
<th valign="top" align="center">LHP1/Pc</th>
<th valign="top" align="center">EMF1</th>
<th valign="top" align="center">Scm</th>
<th valign="top" align="center">H3</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Metazoa</td>
<td valign="top" align="left"><italic>D. melanogaster</italic></td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">&#x223C;100</td>
</tr>
<tr>
<td valign="top" align="left">Embryophyta</td>
<td valign="top" align="left"><italic>A. thaliana</italic></td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">15</td>
</tr>
<tr>
<td valign="top" align="left">Chlorophyta</td>
<td valign="top" align="left"><italic>C. reinhardtii</italic></td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">35</td>
</tr>
<tr>
<td valign="top" align="left">Chlorophyta</td>
<td valign="top" align="left"><italic>M. pusilla</italic></td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">5</td>
</tr>
<tr>
<td valign="top" align="left">Chlorophyta</td>
<td valign="top" align="left"><italic>O. tauri</italic></td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">4</td>
</tr>
<tr>
<td valign="top" align="left">Rhodophyta</td>
<td valign="top" align="left"><italic>C. merolae</italic></td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">4</td>
</tr>
<tr>
<td valign="top" align="left">Glaucophyta</td>
<td valign="top" align="left"><italic>C. paradoxa</italic></td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">12</td>
</tr>
<tr>
<td valign="top" align="left"></td>
</tr>
</tbody>
</table>
</table-wrap>
<p>We found a general absence of PRC1 components: Psc/BMI1, Pc/LHP1, EMF1 and Scm in the algal genomes analyzed, with an exception of Psc/BMI1 homolog in <italic>C. reinhardtii</italic>. In contrast, the RING1 subunit is widely conserved, suggesting a mono-ubiquitylation activity unrelated to PRC1 in algae and protein moonlighting (<xref ref-type="bibr" rid="B30">Jeffery, 2003</xref>). In general, an abundance of RING1 homologs in chlorophytes, but the absence of other PRC1 components in lower branches of Archaeplastida used here is consistent with the previous studies (<xref ref-type="bibr" rid="B24">Hennig and Derkacheva, 2009</xref>; <xref ref-type="bibr" rid="B4">Berke and Snel, 2015</xref>; <xref ref-type="bibr" rid="B13">Chen et al., 2016</xref>) and agrees with the notion of lower PRC1 conservation and potential loss of the complex in several phylogenetic branches.</p>
<p>Homology searches on PRC2 revealed broad distribution of complex components: E(z), ESC and p55, consistently with results published elsewhere (<xref ref-type="bibr" rid="B65">Shaver et al., 2010</xref>; <xref ref-type="bibr" rid="B9">Butenko and Ohad, 2011</xref>; <xref ref-type="bibr" rid="B32">Kim et al., 2013</xref>). Importantly, p55 in higher eukaryotes has various functions and participates also in other complexes than PRC2 (i.e., chromatin assembly factor 1 [CAF1)]. Therefore it still remains to be proven whether a role of p55 homologs is Polycomb-related in lower plants. Moreover, we observed an absence of Suz12 from <italic>C. reinhardtii</italic> and <italic>C. paradoxa</italic>. Similarly to the other studies (<xref ref-type="bibr" rid="B65">Shaver et al., 2010</xref>), the lower abundance of this subunit shows that Suz12 was lost in several species, despite the presence of the other PRC2 components. Interestingly, we did not detect any core PRC2 members in the <italic>Cyanophora</italic> representative. Given that E(z) and ESC homologs were present in the representatives of Rhodophyta and Chlorophyta, it suggests the existence of PRC2 in the common algal ancestor and subsequent loss of the complex from <italic>Cyanophora</italic>. However, we could not exclude technical issues coming from incomplete annotation and incorrectly predicated protein models in the <italic>Cyanophora</italic> genome. In addition, we identified mostly one homolog per complex component in all screened species, apart from <italic>Arabidopsis</italic>, which adds another piece of evidence to the frequent gene duplication occurring in flowering plants.</p>
<p>Overall, our results agree with the ancient presence of core PRC2 components and the frequent losses of the Suz12 subunit and PRC1 complex members.</p>
</sec>
<sec><title>Phylogenetic Relationship of PRC2 Homologs</title>
<p>Phylogenetic analyses on the conserved domains or full sequences of core PRC2 homologs in lower and higher plants with the creation of Neighbor-Joining (NJ) trees revealed grouping into several distinct classes (<xref ref-type="bibr" rid="B65">Shaver et al., 2010</xref>; <xref ref-type="bibr" rid="B32">Kim et al., 2013</xref>; <xref ref-type="bibr" rid="B26">Huang et al., 2016</xref>). In a different approach, we extracted amino acid sequences from the most abundant PRC2 members, aligned them and automatically selected only the conserved blocks in the multiple alignment (see Material and Methods). With such prepared sequences we created Bayesian trees for E(z)z, ESC and Su(z)12 homologs. Consistently with the published data, the homologs formed defined clades. Bayesian trees for E(z) and ESC sequences generally resolved the evolutionary distance between groups (<bold>Figures <xref ref-type="fig" rid="F1">1A,B</xref></bold>). In those cases, we found separation of representatives of Rhodophyta and Viridiplantae, with species from Chlorophyta and Embryophyta forming separate subclades within the latter. The exceptions included one of the ESC homologs from <italic>C. reinhardtii</italic> (CrESC.1), which clustered together with the <italic>C. merolae</italic> homolog, rather than the other green algae. In turn, our analysis on Su(z)12 detected distinct clades for the organisms from Embryophyta, Chlorophyta and Rhodophyta, without grouping of <italic>Arabidopsis</italic> and green algae sequences (<bold>Figure <xref ref-type="fig" rid="F1">1C</xref></bold>). Moreover, we noted a phylogenetic distance between <italic>C. merolae</italic> and any other species for all the three proteins.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p><bold>Bayesian-inferred phylogenetic trees on conserved blocks from homologs of core PRC2 components.</bold> Consensus trees were generated by MrBayes v2 and visualized in FigTree v1.4. Clade credibility values are depicted at the nodes. Branch length corresponds to the rate of expected mutations per site in the protein sequence, according to relevant scale bar below each tree. Sequences from <italic>D. melanogaster</italic> were selected as outgroups. Two-letter abbreviations represent species&#x2019; names: Dm &#x2013; <italic>D. melanogaster</italic>; Cm &#x2013; <italic>C. merolae</italic>; At &#x2013; <italic>A. thaliana</italic>; Mp &#x2013; <italic>M. pusilla</italic>; Ot &#x2013; <italic>O. tauri</italic>; Cr &#x2013; <italic>C. reinhardtii</italic>. <bold>(A)</bold> Phylogenetic tree on E(z) homologs. Conserved blocks selected from multiple alignment on following homologs (UniProtKB ID or protein/transcript name from resource databases indicated): DmE(z) &#x2013; P42124; CmE(z) &#x2013; CMQ156C; AtCLF &#x2013; AT2G23380; AtSWN &#x2013; AT4G02020; AtMEA &#x2013; AT1G02580; MpE(z) &#x2013; 59369; OtE(z) &#x2013; 6642; CrE(z) &#x2013; Cre17.g746247.t1.1. <bold>(B)</bold> Phylogenetic tree on ESC homologs. Homologs included in the analysis: DmESC &#x2013; Q24338; CmESC &#x2013; CMK173C; AtFIE &#x2013; AT3G20740; MpESC &#x2013; 49065; OtESC &#x2013; 22117; CrESC.1 &#x2013; Cre16.g693750.t1.1; CrESC.2 &#x2013; Cre03.g180050.t1.1. <bold>(C)</bold> Phylogenetic tree on Su(z)12 homologs. Homologs included in the analysis &#x2013; DmSu(z)12 &#x2013; Q9NJG9; CmSu(z)12 &#x2013; CML082C; AtFIS2 &#x2013; AT2G35670; AtEMF2 &#x2013; AT5G51230; AtVRN2 &#x2013; AT4G16845; MpSu(z)12 &#x2013; 9357; OtSu(z) &#x2013; 13623.</p></caption>
<graphic xlink:href="fpls-08-00607-g001.tif"/>
</fig>
<p>In summary, our results on Bayesian phylogenetic trees on conserved sequence blocks are in agreement with the published NJ trees using the full length (<xref ref-type="bibr" rid="B32">Kim et al., 2013</xref>) or domain-only (<xref ref-type="bibr" rid="B65">Shaver et al., 2010</xref>) sequences. Given high conservation of domain architecture in core PRC2 components between the subclades (<xref ref-type="bibr" rid="B65">Shaver et al., 2010</xref>; <xref ref-type="bibr" rid="B32">Kim et al., 2013</xref>), we conclude that PRC2 is widely distributed and already evolved in a common unicellular ancestor.</p>
</sec>
<sec><title>Conservation of Histone H3 Sequences in Various Eukaryotic Organisms</title>
<p>As PRC2 has the canonical function to methylate lysine 27 in histone H3, we decided to analyze protein sequence conservation of histone H3. Amino acid sequences of algal histone H3 were retrieved from the databases of respective genome sequencing projects (see materials and methods) or NCBI, using the canonical histone H3.1 from <italic>Arabidopsis</italic> as a query (<xref ref-type="bibr" rid="B50">Okada et al., 2005</xref>). The results revealed a presence of multiple H3 genes in all of the species studied (<bold>Table <xref ref-type="table" rid="T1">1</xref></bold>). Noteworthy, we detected an equal amount of histone H3 gene copies as in the published reports for <italic>M. pusilla</italic> and <italic>C. reinhardtii</italic> (<xref ref-type="bibr" rid="B14">Cui et al., 2015</xref>). The number of H3 gene copies in <italic>D.</italic> was taken from the published results (<xref ref-type="bibr" rid="B43">McKay et al., 2015</xref>).</p>
<p>Next, the closest H3 homologs between the species were aligned using ClustalX and visualized with Jalview. The alignment revealed overall high amino acid conservation and a presence of lysine-27 in all studied species (<bold>Figure <xref ref-type="fig" rid="F2">2</xref></bold>). Moreover, we noted that the sequences around lysine-27, including the underlying motif ARKS, does not show any sequence divergence with an exception of <italic>C. reinhardtii</italic>, which contains threonine-28 instead of serine-28. The results suggested that there is the potential for introduction of the H3K27me3 modification in most of species analyzed. However, in <italic>C. reinhardtii</italic> the presence of threonine-28 next to lysine-27 might not permit detection using commercial antibodies against the methylated H3K27. Moreover, a mass spectrometry analysis on <italic>Chlamydomonas</italic> histone H3 did not reveal the presence of the H3K27me3 modification (<xref ref-type="bibr" rid="B65">Shaver et al., 2010</xref>).</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p><bold>H3 sequence conservation.</bold> AtH3.1 was used a query for BLAST searches against databases of respective species. The closest homologs were selected and used for multiple alignment in ClustalX, visualized afterward in Jalview v2.9. Color-shading marks non-conserved residues. Red frame correspond to lysine-27, a residue typically modified by PRC2.</p></caption>
<graphic xlink:href="fpls-08-00607-g002.tif"/>
</fig>
</sec>
<sec><title>Detection of H3K27me3</title>
<p>Given the phylogenetic conservation of PRC2 core components and high similarity of amino acid composition of histone H3 in the vast majority of the species, we sought to examine H3K27me3 presence in total protein extracts from selected organisms. Isolated proteins from crude extract were separated on a SDS-PAGE gel and detected using two independently raised anti-H3K27me3 antibodies. Our results revealed the presence of the H3K27me3 modification in <italic>M. pusilla</italic>, <italic>O. tauri</italic>, <italic>C. paradoxa</italic>, and <italic>C. merolae</italic> (<bold>Figures <xref ref-type="fig" rid="F3">3A,B</xref></bold>). In order to decipher H3K27me3 relative abundance, we performed western blotting for histone H3 (<bold>Figure <xref ref-type="fig" rid="F3">3D</xref></bold>) and calculated H3K27me3/H3 ratios based on band intensity quantification. We detected different relative amounts of the modification in the studied species (<bold>Figure <xref ref-type="fig" rid="F3">3E</xref></bold>), with the lowest H3K27me3/H3 ratios for <italic>M. pusilla</italic> and <italic>O. tauri</italic>, intermediate for <italic>C. merolae</italic> and the highest for <italic>C. paradoxa</italic>.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p><bold>Biochemical detection of H3K27me3 histone mark.</bold> <bold>(A,B,D)</bold> Western blot analysis of H3K27me3 in different algae (C &#x2013; histone powder control, Mp &#x2013; <italic>M. pusilla</italic>, Ot &#x2013; <italic>O. tauri</italic>, Cp &#x2013; <italic>C. paradoxa</italic>, Cm &#x2013; <italic>C. merolae</italic>). Membranes were incubated with antibodies against H3K27me3 [Millipore, N07-449 <bold>(A)</bold> and Diagenode, C15410206 <bold>(B)</bold>] and H3 [Abcam, ab1791-100 <bold>(D)</bold>]. The arrows indicate the band of interest. <bold>(C)</bold> Competition assay done after Western blotting with primary antibodies against H3K27me3 from <bold>(B)</bold>. Note the order change of the samples (red arrow). <bold>(E)</bold> Quantification of Western blotting. Intensity of the bands was obtained using ImageJ. H3K27me3/H3 protein level ratio of three independent experiments is shown. <italic>P</italic>-values were calculated using Student <italic>t</italic>-test. A single asterisk indicates a significance level of <italic>p</italic> &#x003C; 0.05. Error bars represent standard deviation.</p></caption>
<graphic xlink:href="fpls-08-00607-g003.tif"/>
</fig>
<p>Due to the fact that one of the antibodies detected proteins in the sizes not corresponding to histones, we performed protein competition assay for the confirmation of specific band reactivity. After preincubation with H3K27me3 peptide, we did not detect any band corresponding to this modification for a commercial histone extract from calf thymus (positive control), as well as for <italic>M. pusilla</italic>, <italic>O. tauri</italic>, and <italic>C. merolae</italic> (<bold>Figure <xref ref-type="fig" rid="F3">3C</xref></bold>), suggesting a specificity of the antibody. Intensity decrease of the band corresponding to H3K27me3 for <italic>C. paradoxa</italic> was accompanied by the overall signal loss, including the bands of the other molecular weight proteins. Therefore, the presence of H3K27me3 in <italic>C. paradoxa</italic> remains inconclusive.</p>
<p>In summary, we were able to biochemically identify and confirm a presence of H3K27me3 in <italic>M. pusilla</italic>, <italic>O. tauri</italic> and <italic>C. merolae</italic>. This result is consistent with our phylogenetic analyses and suggests that the PRC2 complex is active in these species. For <italic>C. paradoxa</italic>, the biochemical results are unclear: potential absence of H3K27me3 mark would correspond to the lack of PRC2 homologs identified in its genome. Differences in relative H3K27me3 amounts between species imply, e.g., a distinct abundance of genomic targets of H3K27me3 or different H3K27me3 enrichment levels per genomic region. Further work should decipher an impact of such differences in the evolutionary context.</p>
</sec>
<sec><title>Characterization of H3K27me3 Target Genes</title>
<p>Based on the broad distribution of PRC2 genes, high sequence conservation of histone H3 and specific signal detected with the anti-H3K27me3 antibody on a protein blot, we selected one of the species, <italic>C. merolae</italic>, to investigate the H3K27me3 abundance further. Firstly, we asked whether the modification has similar targets as in the other organisms. Reciprocal BLAST searches between protein databases of <italic>C. merolae</italic> and <italic>A. thaliana</italic> were used to detect homologous genes of known <italic>A. thaliana</italic> PRC2 targets (Supplementary Figure <xref ref-type="supplementary-material" rid="SM3">S1</xref>). We selected a MADS box-containing gene CMA095C (CmMADS) as nearly all <italic>Arabidopsis</italic> MADS box genes carry H3K27me3. For negative controls, estimated not to be targeted by H3K27me3, homologous genes for EUKARYOTIC ELONGATION FACTOR EIF4A (CMK028C, CmEIF4A) and 60S RIBOSOMAL PROTEIN L23 (CMS262C, Cm60S) were taken.</p>
<p>Next, we performed Chromatin-immunoprecipitation (ChIP) using anti-H3K27me3 antibody and analyzed the expression of candidate genes by RT-PCR. As a result, we were able to show significant enrichment of H3K27me3 on CmMADS and low abundance of the mark on negative targets: CmEIF4A and Cm60S (<bold>Figure <xref ref-type="fig" rid="F4">4</xref></bold>). Overall, we were able to identify positive target of H3K27me3 and show high enrichment of the mark comparing to negative loci.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p><bold>H3K27me3 abundance on selected genes from <italic>C. merolae</italic>.</bold> Chromatin immunoprecipitation was performed using H3K27me3 antibody (#C15410195, Diagenode) and IgG (#C15410206, Diagenode) for negative control, followed by RT-PCR analysis on a putative H3K27me3 target (CmMADS) and housekeeping genes as the putative non-H3K27me3 targets (Cm60S, CmEIF4A). Selected genes relate to following locus IDs from <italic>C. merolae</italic> genome project: CmMADS &#x2013; CMA095C, Cm60S &#x2013; CMS262C, CmEIF4A &#x2013; CMK028C. Error bars correspond to the standard deviation from three biological replicates. Significance was calculated using Student&#x2019;s <italic>t</italic>-test for the comparison between CmMADS and either, Cm60S, or CmEIF4A. A triple asterisk indicates the significance level of <italic>p</italic> &#x003C; 0.001.</p></caption>
<graphic xlink:href="fpls-08-00607-g004.tif"/>
</fig>
</sec>
<sec><title>H3K27me3 Genome-wide Distribution &#x2013; Peak Identification</title>
<p>In order to characterize the targets of H3K27me3 and the distribution of H3K27me3 peaks in a genome-wide scale, we created a sequencing library and performed chromatin-immunoprecipitation coupled with sequencing (ChIP-seq) for the samples: H3K27me3-bound DNA, H3-bound DNA (both in triplicate) and input.</p>
<p>Cleaning raw sequencing reads, mapping to the reference genome and peak calling with input or H3 sample for normalization, let us identify more than 1300 H3K27me3 peaks per replicate (<bold>Figure <xref ref-type="fig" rid="F5">5A</xref></bold>). The peaks cover 14% of total nuclear genome length, with an average peak size of 1792 bp. Good quality of the data was confirmed by a high number of H3K27me3 peaks in the overlaps between the biological replicates, as well as the high correlation values of the reads&#x2019; occupancy (Supplementary Figure <xref ref-type="supplementary-material" rid="SM3">S3</xref>)</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption><p><bold>H3K27me3 genome-wide distribution and peak annotation.</bold> <bold>(A,B)</bold> &#x2013; Number of peaks identified in three different biological replicates from chromatin immunoprecipitation experiment (IP1-3) in &#x2018;high overlap annotated peaks,&#x2019; &#x2018;low overlap annotated peaks&#x2019; category, &#x2018;<italic>de novo</italic>-annotated&#x2019; and &#x2018;unannotated&#x2019; categories (see main text). <bold>(C,D)</bold> &#x2013; Characterization of H3K27me3 peaks according to genomic element type. High overlap H3K27me3 peaks cover 242 genes, which amounts to 4% of total gene number <bold>(C)</bold> or 172 repetitive elements, 50% of total repetitive element number <bold>(D)</bold>. Both, loci of hypothetical proteins and transcripts, were included in total gene number.</p></caption>
<graphic xlink:href="fpls-08-00607-g005.tif"/>
</fig>
<p>Next, we compared H3K27me3-peak coordinates with the loci in <italic>C. merolae</italic> reference genome. We applied a threshold of 50% overlap of the locus length and successfully annotated ca. Six hundred peaks (&#x2018;high overlap annotated peaks&#x2019;) (<bold>Figure <xref ref-type="fig" rid="F5">5A</xref></bold>). The remaining peaks corresponded to the reference loci covered by H3K27me3 in less than 50% of their length (&#x2018;low overlap annotated peaks&#x2019;) and to the unannotated peaks without any overlap with reference loci. We concluded that the unannotated peaks are the intergenic regions and/or uncharacterized genomic features missing in the current annotation. Considering the second possibility, we used several strategies to find <italic>de novo</italic> annotation of unknown peaks.</p>
<p>Firstly, we <italic>in silico</italic> translated sequences from the unannotated peaks and searched for open reading frames (ORFs) within them. We found ORFs for all of the unannotated peaks and used them for BLASTP/Pfam searches against the NCBI non-redundant protein sequence database (nr database) or Pfam protein database. Using a specific alignment threshold (alignment length > 50%; <italic>E</italic>-value &#x003C; 0.5), we could annotate further 178, 164, and 195 peaks for biological replicate 1, 2, and 3, respectively (<bold>Figure <xref ref-type="fig" rid="F5">5B</xref></bold>). In the second approach, we extracted DNA sequences of the unknown peaks and used BLASTN searches against the NCBI nucleotide collection (nr/nt database) and NCBI transcript reference sequences (refseq_rna database). The hits surpassing the alignment threshold (alignment length > 50%, <italic>E</italic>-value &#x003C; 0.5) were compared to BLASTP/Pfam output, which allowed to annotate next 4 to 8 peaks, depending on the biological replicate (<bold>Figure <xref ref-type="fig" rid="F5">5B</xref></bold>). In general, our sequence homology searches helped to detect <italic>de novo</italic> annotation of 13&#x2013;15% peaks from their total number, depending on the biological replicate (<bold>Figure <xref ref-type="fig" rid="F5">5A</xref></bold>).</p>
<p>We reasoned that the remaining unknown peaks after <italic>de novo</italic> annotation corresponded to the regions containing promoters and the other regulatory elements of neighboring annotated elements. Given the high density of genes along the chromosomes in <italic>C. merolae</italic> (<xref ref-type="bibr" rid="B42">Matsuzaki et al., 2004</xref>), such regions would reside close to 5&#x2032; or 3&#x2032; gene ends, but would not be present in the annotation. In order to validate this idea, we calculated average distances between gene locations and unannotated peaks. We compared distributions between peaks successfully aligned to BLASTP database (annotated peaks) and the ones showing no significant alignment (unannotated peaks), assuming that BLASTP-aligned peaks truly correspond to novel genes missing in our reference, rather than uncharacterized <italic>cis</italic> elements. However, we did not observe major differences between distances calculated for annotated and unannotated peaks (Supplementary Table <xref ref-type="supplementary-material" rid="SM3">S1</xref>), suggesting that unannotated peaks do not exclusively correspond to <italic>cis</italic> regulatory elements of known genomic elements. Further development of <italic>C. merolae</italic> genome annotation should deepen our understanding about these remaining uncharacterized H3K27me3 peaks.</p>
</sec>
<sec><title>H3K27me3 Genome-wide Distribution &#x2013; Genomic Feature</title>
<p>Next, we sought to investigate the genomic feature profile underlying H3K27me3 peaks. We picked a stringent group of high overlap annotated targets, removed duplicates (two peaks spanning one genomic element) for each replicate and selected a consensus set of features that appear in any two out of three replicates. As a result, we found that H3K27me3 covers 242 genes, which amounts to 4% of <italic>C. merolae</italic> total gene number (<bold>Figure <xref ref-type="fig" rid="F5">5C</xref></bold>). On the other hand, we also detected 172 H3K27me3-bound repetitive elements. Given very low repetitive element occupancy in the <italic>C. merolae</italic> genome (0,7% transposable elements + 5% WSV repeats) (<xref ref-type="bibr" rid="B49">Nozaki et al., 2007</xref>), H3K27me3 seems to be predominantly present on repetitive elements, covering as much as 50% of their total number (<bold>Figure <xref ref-type="fig" rid="F5">5D</xref></bold>). We noted that the enrichment on both, genes and repetitive elements, is in concordance with the studies in <italic>Drosophila</italic> (<xref ref-type="bibr" rid="B81">Yin et al., 2011</xref>). Interestingly, the predominant enrichment of H3K27me3 on repetitive elements was found also in diatom <italic>P. tricornutum</italic> (<xref ref-type="bibr" rid="B74">Veluchamy et al., 2015</xref>), suggesting an ancestral role of Polycomb-mediated gene regulation in guarding the genome.</p>
</sec>
<sec><title>H3K27me3 Is a Silencing Mark in <italic>C. merolae</italic></title>
<p>Next, we wanted to characterize the correlation of H3K27me3 enrichment with the expression of its targets. In order to decipher expression level on a genome-wide scale, we performed a RNA-seq experiment on reverse-transcribed RNA extracted from <italic>C. merolae</italic> cultures. The sequencing reads were mapped to the reference genome and expression level was assessed based on the FPKM values. We intersected the FPKM data with H3K27me3-binding profile and distinguished genes and repetitive elements for the analyses.</p>
<p>We found that both, H3K27me3-covered genes and H3K27me3-covered repetitive elements are on average significantly less expressed than non-H3K27me3 targets (<bold>Figure <xref ref-type="fig" rid="F6">6</xref></bold>), confirming that the H3K27me3 modification in <italic>C. merolae</italic> is highly correlated with gene repression. Moreover, we noted that the general level of repetitive element expression is lower than gene expression, irrespectively of the H3K27me3 status. These results suggest the existence of additional mechanisms involved in repression of the repetitive elements.</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption><p><bold>Relationship between expression and H3K27me3 presence.</bold> H3K27me3 occupancy is anti-correlated with expression level in both, genes and TEs. Expression was estimated based on FPKM values obtained from RNA-seq experiment. Student <italic>t</italic>-test was used to calculate significance. <sup>&#x2217;&#x2217;</sup><italic>p</italic>-value &#x003C; 0.01, <sup>&#x2217;</sup><italic>p</italic>-value &#x003C; 0.05. Error bars represent standard error (SE).</p></caption>
<graphic xlink:href="fpls-08-00607-g006.tif"/>
</fig>
</sec>
<sec><title>H3K27me3 Gene-body Distribution</title>
<p>H3K27me3 enrichment has a broad distribution over gene bodies in plants (<xref ref-type="bibr" rid="B83">Zhang et al., 2007</xref>; <xref ref-type="bibr" rid="B80">Yang et al., 2014</xref>) or gene bodies and flanking regions in animals (<xref ref-type="bibr" rid="B54">Pauler et al., 2009</xref>; <xref ref-type="bibr" rid="B12">Cerase et al., 2014</xref>) and diatoms (<xref ref-type="bibr" rid="B74">Veluchamy et al., 2015</xref>). To inspect the distribution of the mark in <italic>C. merolae</italic>, we calculated average H3K27me3 enrichment over genic coordinates and observed that H3K27me3 covers gene bodies, 0.5 kb upstream and 0.5 kb downstream regions (Supplementary Figure <xref ref-type="supplementary-material" rid="SM3">S2</xref>). This general approach obscured whether all targets show similar, broad distribution of H3K27me3 or whether the differential enrichment subgroups are present among the targets. We therefore performed k-means clustering on the results. Our analysis identified four clusters with the preferential presence of H3K27me3 at gene bodies (cluster 1), the 0.5 kb downstream region (cluster 2), and the 0.5 kb upstream region (cluster 3) (<bold>Figure <xref ref-type="fig" rid="F7">7A</xref></bold>). Cluster 4 contains non-H3K27me3 targets.</p>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption><p><bold>Clustering of H3K27me3 gene body enrichment.</bold> Coverage of ChIP-seq H3K27me3 reads normalized to input was calculated and used to generate matrix with gene <bold>(A)</bold> or repetitive element <bold>(B)</bold> locations as reference. The reads were scaled to 500 bp windows and flanking regions set to 500 bp. Matrix files served as an input for heatmap generation and K-means clustering in deepTools2.</p></caption>
<graphic xlink:href="fpls-08-00607-g007.tif"/>
</fig>
<p>As distinct H3K27me3 enrichment clusters impact target gene activity differently in some other species (<xref ref-type="bibr" rid="B82">Young et al., 2011</xref>; <xref ref-type="bibr" rid="B74">Veluchamy et al., 2015</xref>), we asked whether gene expression level varies between the H3K27me3 clusters in <italic>C. merolae</italic>. H3K27me3 differential enrichment compared with RNA-seq data revealed that clusters 1&#x2013;3 correlate with the gene repression, albeit to an uneven extent. Cluster 1 correlated with the strongest repression level, whereas clusters 2 and 3 correlated with a similar, milder repression (<bold>Figure <xref ref-type="fig" rid="F8">8A</xref></bold>). Thus, the strongest silencing effect was seen for H3K27me3 distribution at the gene bodies, similarly to the diatom <italic>P. tricornutum</italic> (<xref ref-type="bibr" rid="B74">Veluchamy et al., 2015</xref>), and the H3K27me3 enrichment at the flanking regions is correlated with the milder repression.</p>
<fig id="F8" position="float">
<label>FIGURE 8</label>
<caption><p><bold>Relationship between H3K27me3 occupancy and expression after clustering analysis.</bold> Identified clusters (<bold>Figure <xref ref-type="fig" rid="F7">7</xref></bold>) were correlated with expression level measured as FPKM in RNA-seq experiment. Expression level was determined for H3K27me3 occupancy clusters from genes <bold>(A)</bold> and repetitive elements <bold>(B)</bold>. Error bars represent standard error.</p></caption>
<graphic xlink:href="fpls-08-00607-g008.tif"/>
</fig>
<p>We performed similar clustering analysis to characterize H3K27me3 enrichment also on the repetitive elements. We could distinguish two clusters with histone mark presence on 0.5 kb upstream region and gene bodies (cluster 1) or gene bodies only (cluster 2) (<bold>Figure <xref ref-type="fig" rid="F7">7B</xref></bold>). Cluster 3 corresponds to non-H3K27me3 targets. In contrast to the genes, we found that cluster 1 and 2 correlate with reduced gene expression to the same extent (<bold>Figure <xref ref-type="fig" rid="F8">8B</xref></bold>), suggesting that gene body H3K27me3 distribution has a predominant effect on gene repression. On the other hand, clusters present at repetitive elements are not fully analogous to those at genes as for the former we did not identify a cluster with enrichment exclusively at the flanking regions.</p>
</sec>
<sec><title>H3K27me3 Location on Chromosomes</title>
<p>Looking at the whole-chromosome level of H3K27me3 distribution, we found a preferential enrichment of the mark on the chromosome ends (<bold>Figure <xref ref-type="fig" rid="F9">9</xref></bold>). Such observation concerns 36 out 40 ends from 20 chromosomes. The exceptions, in which the closest H3K27me3 domain was detected only 4&#x2013;5 kb away from the chromosome end, include: 5&#x2032; end of chromosome 3 and 3&#x2032; end of chromosomes 7, 14, 16 (Supplementary Table <xref ref-type="supplementary-material" rid="SM3">S2</xref> distance). Telomeric repeats in plants span regions from 0.5 kb in <italic>Chlorella vulgaris</italic> to 150 kb in <italic>Nicotiana tabacum</italic> (<xref ref-type="bibr" rid="B22">Fajkus et al., 1995</xref>; <xref ref-type="bibr" rid="B25">Higashiyama et al., 1995</xref>). <italic>C. merolae</italic> telomere length is relatively short and varies from 400bp to 700bp (<xref ref-type="bibr" rid="B49">Nozaki et al., 2007</xref>). Hence, the H3K27me3-domains at the chromosome ends detected in <italic>C. merolae</italic> cover the regions of telomeres and subtelomeres, similarly to what has been shown for the fungi: <italic>N. crassa</italic> (<xref ref-type="bibr" rid="B29">Jamieson et al., 2013</xref>) and <italic>Cryptococcus neoformans</italic> (<xref ref-type="bibr" rid="B20">Dumesic et al., 2014</xref>).</p>
<fig id="F9" position="float">
<label>FIGURE 9</label>
<caption><p><bold>H3K27me3 domain enrichment on chromosome ends.</bold> 5&#x2032; end of chromosome 13 is used as an example. Results from three biological replicates (IP1-3) taken for sequencing are shown. Bars below H3K27me3 tracks correspond to peaks scored by MACS2 with input (top bar) or H3 (bottom bar) used for normalization. The lowest track correspond to <italic>C. merolae</italic> genome annotation (merolae.biol.s.u-tokyo.ac.jp). The data was visualized in IGV v2.3.</p></caption>
<graphic xlink:href="fpls-08-00607-g009.tif"/>
</fig>
</sec>
<sec><title>H3K27me3-target Gene Ontology</title>
<p>In higher eukaryotes, H3K27me3 is known to target, i.e., developmental, tissue-specific or stress-responsive genes (<xref ref-type="bibr" rid="B6">Boyer et al., 2006</xref>; <xref ref-type="bibr" rid="B7">Bracken et al., 2006</xref>; <xref ref-type="bibr" rid="B83">Zhang et al., 2007</xref>; <xref ref-type="bibr" rid="B36">Lafos et al., 2011</xref>; <xref ref-type="bibr" rid="B33">Kleinmanns and Schubert, 2014</xref>). We sought to investigate whether similar functional association can be found in <italic>C. merolae</italic>, a unicellular organism with primitive developmental program.</p>
<p>In order to decipher functional classification of H3K27me3 targets, we performed gene ontology analysis with Singular Enrichment Analysis from the AgriGO toolkit (<xref ref-type="bibr" rid="B19">Du et al., 2010</xref>). GO classes of H3K27me3-targets and full lists of <italic>C. merolae</italic> genes used as a reference were extracted from the UniProtKB-database. We found a GO annotation for 3085 genes (50,5% from total number of 6108, including hypothetical proteins and transcripts) in the reference with 144 GO-annotated genes among H3K27me3-targets (62,6% from total number of 230). Statistical significance was determined based on the adjusted <italic>p</italic>-value with Yekutieli test for FDR. For statistical tests, we selected only those GO classes that were represented by at least five entries (see material and methods). All three different GO sub-ontologies were taken into account: molecular function (GO:0003674), biological process (GO::0008150), and cellular component (GO::0008372).</p>
<p>We did not reveal any significantly (adjusted <italic>p</italic>-value &#x003C; 0.05) enriched GO terms in cellular component sub-ontology. In contrast, we detected three GO term branches in biological process and one in molecular function sub-ontologies (<bold>Figure <xref ref-type="fig" rid="F10">10</xref></bold>). Consistently with the known H3K27me3 function in the repression of developmental programs, we detected significantly enriched GO terms related to organismal process and development. Surprisingly, among enriched terms we found also those corresponding to protein maturation and intein-mediated protein splicing.</p>
<fig id="F10" position="float">
<label>FIGURE 10</label>
<caption><p><bold>Gene ontology analysis on H3K27me3 target genes.</bold> Significantly enriched terms from molecular function and biological process subontologies (no significant hits found for cellular component subontology). Color coding represents significance level (1&#x2013;9) below adjusted <italic>p</italic>-value threshold (<italic>p</italic> &#x003C; 0.05). Adjusted <italic>p</italic>-values are shown next to GO IDs for each GO term. Relationships between GO terms are reflected in the arrow types. Red frames highlight GO terms connected with intein-mediated protein splicing. Numbers below subcategory name correspond to: GO subcategory gene number in query/total gene number in query &#x007C; GO subcategory gene number in reference (TAIR10)/total gene number in reference.</p></caption>
<graphic xlink:href="fpls-08-00607-g010.tif"/>
</fig>
<p>Protein splicing is a protein maturation mechanism based on the excision of a protein fragment (intein) from a precursor and ligation of the flanking polypeptides (exteins) (<xref ref-type="bibr" rid="B70">Topilina and Mills, 2014</xref>). Comparative analyses of amino acid sequences revealed a homology of protein splice-junction motif from inteins to the C-terminal Hog domains in Hedgehog proteins, the secretory proteins controlling developmental processes in Metazoa (<xref ref-type="bibr" rid="B31">Jiang and Hui, 2008</xref>).</p>
<p>In concordance to the protein splicing mechanism and the function of intein-containing genes, we observed that the majority of protein entries present in significantly enriched GO terms (organismal process and development, cell communication and peptidase function) overlap with the protein splicing group (Supplementary Table <xref ref-type="supplementary-material" rid="SM1">S3</xref>). Moreover, we found that all of the H3K27me3 targets present in the protein splicing term indeed possess a Hog domain as the Hedgehog proteins.</p>
<p>Interestingly, the genes encoding Hedgehog proteins in <italic>C. merolae</italic> reside in the telomeric and subtelomeric regions of the chromosomes (<xref ref-type="bibr" rid="B49">Nozaki et al., 2007</xref>), which is in agreement to our results on the preferential H3K27me3 enrichment at the chromosome ends (<bold>Figure <xref ref-type="fig" rid="F9">9</xref></bold>). Moreover, comparing with clustering data, we observed that Hedgehog loci were enriched in cluster 3 (Supplementary Table <xref ref-type="supplementary-material" rid="SM2">S4</xref>), bound by H3K27me3 preferentially to the upstream regions and moderately repressed.</p>
</sec>
</sec>
<sec><title>Discussion</title>
<p>Polycomb-group mediated regulation of gene expression in multicellular organisms is intensively studied in plants and has important roles in stress responses and developmental phase transitions (<xref ref-type="bibr" rid="B34">K&#x00F6;hler and Villar, 2008</xref>; <xref ref-type="bibr" rid="B33">Kleinmanns and Schubert, 2014</xref>). However, a model for Pc-G function in unicellular, photosynthetic organisms is currently lacking. We therefore performed homology searches and Bayesian phylogenetic tree analyses on the conserved alignment blocks. Our results confirmed a frequent absence of the PRC1 complex, ancient origin of PRC2 complex and the widespread distribution of core components of PRC2 in the green lineage [see also: (<xref ref-type="bibr" rid="B65">Shaver et al., 2010</xref>; <xref ref-type="bibr" rid="B32">Kim et al., 2013</xref>)]. We detected the PcG mark H3K27me3 in prasinophytes and red algae, arguing for the functionality of PRC2 in these organisms. Subsequent analyses of H3K27me3 occupancy in the model red alga, <italic>C. merolae</italic>, and correlative analyses with transcriptomic data revealed several important observations in this unicellular organism.</p>
<p>We observed that H3K27me3 in <italic>C. merolae</italic> is present on both, genes and repetitive elements, covering 4% of total gene number and 50% of total repetitive element number. The occupancy on both genomic types is in the agreement with the results from <italic>Drosophila</italic> and <italic>Phaeodactylum</italic>, suggesting an ancestral function of PcG in repetitive element silencing. As <italic>Arabidopsis</italic> H3K27me3 occupies preferentially genes, our results suggest also a partial divergence from PcG-mediated repetitive element repression in higher plants.</p>
<p>In <italic>Arabidopsis</italic> endosperm, the vicinity of repetitive elements was shown to impact H3K27me3-mediated gene imprinting (<xref ref-type="bibr" rid="B76">Weinhofer et al., 2010</xref>; <xref ref-type="bibr" rid="B77">Wolff et al., 2011</xref>). In contrast, the H3K27me3-covered genes in <italic>C. merolae</italic> are not in the vicinity of repetitive elements. However, we cannot exclude the possibility that the genome annotation is incomplete, with unknown types of repetitive elements being not reported.</p>
<p>We showed that H3K27me3 occupancy anti-correlates with the expression of its targets, consistently with its function in the multicellular organisms. As the other lysine-27 methylation marks, H3K27me1 and H3K27me2, have a repressive function in higher plants (<xref ref-type="bibr" rid="B3">Bastow et al., 2004</xref>; <xref ref-type="bibr" rid="B27">Jacob et al., 2009</xref>), similarly to the H3K27me3, the future steps should include profiling of those modifications in <italic>C. merolae</italic> genome. We also observed that the degree of repression is tightly correlated with the H3K27me3 profile at the genic level. The lowest expression was found for the genes covered by H3K27me3 over the gene body, whereas the targets with H3K27me3 occupancy on 5&#x2032; and 3&#x2032; flanking regions associate with intermediate repression.</p>
<p>Furthermore, H3K27me3 shows an enrichment on chromosome ends and covers a broad region including telomeric repeats and subtelomeric genes in <italic>C. merolae</italic>. Interestingly, several other species were reported to show an enrichment of H3K27me3 at the subtelomeric and telomeric regions (<xref ref-type="bibr" rid="B66">Smith et al., 2008</xref>; <xref ref-type="bibr" rid="B72">Vaquero-sedas et al., 2012</xref>). Moreover, Polycomb proteins were proved to be associated with telomere-binding factors (<xref ref-type="bibr" rid="B84">Zhou et al., 2013</xref>) and telobox motifs demonstrated to be enriched at PcG-peaks (<xref ref-type="bibr" rid="B17">Deng et al., 2013</xref>), arguing for a functional connection between PcG-mediated repression and chromosomal ends.</p>
<p>Our gene ontology analysis showed that H3K27me3 covers genes associated with intein-mediated protein splicing. Inteins are considered as ancient mobile elements present in all three domains of life (eukaryotes, archaea, bacteria) and viruses (<xref ref-type="bibr" rid="B55">Pietrokovski, 2001</xref>; <xref ref-type="bibr" rid="B48">Novikova et al., 2014</xref>). Hence, our results suggest that the H3K27me3-mediated repression might function in guarding the <italic>C. merolae</italic> genome. Such notion would be consistent with the predominant enrichment of H3K27me3 on the repetitive elements (<bold>Figure <xref ref-type="fig" rid="F5">5D</xref></bold>). On the other hand, inteins may also develop to become post-translational regulatory elements in the course of evolution, as seen for the conditional protein splicing of Hog domains in animals (<xref ref-type="bibr" rid="B64">Shah and Muir, 2014</xref>). Therefore, it is of special importance to decipher the function of intein-containing Hedgehog genes and the conditions for their de-repression in <italic>C. merolae.</italic></p>
<p>Interestingly, we also observed that the intein-containing genes reside in the subtelomeric regions and are moderately repressed by H3K27me3 that occupies upstream regions from their TSS. It is currently unclear how protein splicing, moderate H3K27me3-mediated repression and subtelomeric localization are inter-connected. It is feasible that the integration of intein-containing genes appeared relatively recently and that these &#x2018;young&#x2019; H3K27me3 targets display H3K27me3 enrichment at regions upstream of their TSS, in agreement with their moderate repression. Comparative genomics between <italic>C. merolae</italic> and related species should be used to validate this idea. In turn, subtelomeres are dynamic regions and the objects of ectopic recombination events, which facilitate gene diversification, phenotypic diversity and adaptation to different environments, exemplified by an olfactory receptor gene family in humans (<xref ref-type="bibr" rid="B38">Linardopoulou et al., 2001</xref>), contingency systems in various pathogens (<xref ref-type="bibr" rid="B2">Barry et al., 2003</xref>) and carbon-source metabolism genes in <italic>S. cerevisiae</italic> (<xref ref-type="bibr" rid="B11">Carlson and Botstein, 1983</xref>; <xref ref-type="bibr" rid="B41">Louis et al., 1994</xref>; <xref ref-type="bibr" rid="B8">Brown et al., 2010</xref>). Hence, it is possible that the intein-containing subtelomeric H3K27me3-covered genes are the targets of rapid adaptive evolution. Characterization of their duplication frequency and diversification within their gene families should validate such notion.</p>
<p>Last but not the least, our results on H3K27me3 presence in various algae and genome-wide characterization of expression status and H3K27me3 occupancy in the model red alga, <italic>C. merolae</italic>, provide a resource for chromatin and transcriptomic studies. An existence of genetic tools, optimized growth conditions and the sequenced genome in <italic>C. merolae</italic> offer a possibility to decipher genome regulation in a broad evolutionary context, including the organisms with super-reduced genomes.</p>
</sec>
<sec><title>Author Contributions</title>
<p>PM, OK, and DS designed the research. FF and AW designed <italic>C. merolae</italic> growth scheme. FF handled <italic>C. merolae</italic> cell cultures. PM performed bioinformatic analyses, chromatin immunoprecipitations and sequencing library preparations. OK performed competition assay and western blot experiments. PM and DS performed homology searches for H3K27me3 targets used in ChIP-RT-qPCR. The manuscript was written by PM and revised by the other authors.</p>
</sec>
<sec><title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<fn-group>
<fn fn-type="financial-disclosure">
<p><bold>Funding.</bold> PM, OK, and DS were supported by the DFG-funded SFB973, the Marie Curie International Training Network &#x201C;EpiTRAITS&#x201D; and by Freie Universit&#x00E4;t Berlin within the Excellence Initiative of the German Research Foundation.</p>
</fn>
</fn-group>
<ack>
<p>We thank Prof. Motomichi Matsuzaki for providing <italic>C. merolae</italic> genome annotation and Dr. Michael Gr&#x00FC;nst&#x00E4;udl for crucial insights in phylogenetic tree creation. We thank Julia Kleinmanns and Dorota Komar for critical revision of the manuscript.</p>
</ack>
<sec sec-type="supplementary material">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="http://journal.frontiersin.org/article/10.3389/fpls.2017.00607/full#supplementary-material">http://journal.frontiersin.org/article/10.3389/fpls.2017.00607/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table_3.xlsx" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink">
</supplementary-material>
<supplementary-material xlink:href="Table_4.xlsx" id="SM2" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink">
</supplementary-material>
<supplementary-material xlink:href="Data_Sheet_1.pdf" id="SM3" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink">
</supplementary-material>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Abascal</surname> <given-names>F.</given-names></name> <name><surname>Zardoya</surname> <given-names>R.</given-names></name> <name><surname>Posada</surname> <given-names>D.</given-names></name> <name><surname>Miller</surname> <given-names>C. J.</given-names></name> <name><surname>Lou</surname> <given-names>H. J.</given-names></name> <name><surname>Johnson</surname> <given-names>A. D.</given-names></name><etal/></person-group> (<year>2005</year>). <article-title>ProtTest: selection of best-fit models of protein evolution.</article-title> <source><italic>Bioinformatics</italic></source> <volume>21</volume> <fpage>2104</fpage>&#x2013;<lpage>2105</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bti263</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Barry</surname> <given-names>J. D.</given-names></name> <name><surname>Ginger</surname> <given-names>M. L.</given-names></name> <name><surname>Burton</surname> <given-names>P.</given-names></name> <name><surname>McCulloch</surname> <given-names>R.</given-names></name></person-group> (<year>2003</year>). <article-title>Why are parasite contingency genes often associated with telomeres?</article-title> <source><italic>Int. J. Parasitol.</italic></source> <volume>33</volume> <fpage>29</fpage>&#x2013;<lpage>45</lpage>. <pub-id pub-id-type="doi">10.1016/S0020-7519(02)00247-3</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bastow</surname> <given-names>R.</given-names></name> <name><surname>Mylne</surname> <given-names>J. S.</given-names></name> <name><surname>Lister</surname> <given-names>C.</given-names></name> <name><surname>Lippman</surname> <given-names>Z.</given-names></name> <name><surname>Martienssen</surname> <given-names>R. A.</given-names></name> <name><surname>Dean</surname> <given-names>C.</given-names></name></person-group> (<year>2004</year>). <article-title>Vernalization requires epigenetic silencing of FLC by histone methylation.</article-title> <source><italic>Nature</italic></source> <volume>427</volume> <fpage>164</fpage>&#x2013;<lpage>167</lpage>. <pub-id pub-id-type="doi">10.1038/nature02269</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Berke</surname> <given-names>L.</given-names></name> <name><surname>Snel</surname> <given-names>B.</given-names></name></person-group> (<year>2015</year>). <article-title>The plant Polycomb repressive complex 1 (PRC1) existed in the ancestor of seed plants and has a complex duplication history.</article-title> <source><italic>BMC Evol. Biol.</italic></source> <volume>15</volume>:<issue>44</issue>. <pub-id pub-id-type="doi">10.1186/s12862-015-0319-z</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bowman</surname> <given-names>J. L.</given-names></name> <name><surname>Floyd</surname> <given-names>S. K.</given-names></name> <name><surname>Sakakibara</surname> <given-names>K.</given-names></name></person-group> (<year>2007</year>). <article-title>Green genes-comparative genomics of the green branch of life.</article-title> <source><italic>Cell</italic></source> <volume>129</volume> <fpage>229</fpage>&#x2013;<lpage>234</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2007.04.004</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Boyer</surname> <given-names>L. A.</given-names></name> <name><surname>Plath</surname> <given-names>K.</given-names></name> <name><surname>Zeitlinger</surname> <given-names>J.</given-names></name> <name><surname>Brambrink</surname> <given-names>T.</given-names></name> <name><surname>Medeiros</surname> <given-names>L. A.</given-names></name> <name><surname>Lee</surname> <given-names>T. I.</given-names></name><etal/></person-group> (<year>2006</year>). <article-title>Polycomb complexes repress developmental regulators in murine embryonic stem cells.</article-title> <source><italic>Nature</italic></source> <volume>441</volume> <fpage>349</fpage>&#x2013;<lpage>353</lpage>. <pub-id pub-id-type="doi">10.1038/nature04733</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bracken</surname> <given-names>A. P.</given-names></name> <name><surname>Dietrich</surname> <given-names>N.</given-names></name> <name><surname>Pasini</surname> <given-names>D.</given-names></name> <name><surname>Hansen</surname> <given-names>K. H.</given-names></name> <name><surname>Helin</surname> <given-names>K.</given-names></name></person-group> (<year>2006</year>). <article-title>Genome-wide mapping of Polycomb target genes unravels their roles in cell fate transitions.</article-title> <source><italic>Genes Dev.</italic></source> <volume>20</volume> <fpage>1123</fpage>&#x2013;<lpage>1136</lpage>. <pub-id pub-id-type="doi">10.1101/gad.381706.present</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brown</surname> <given-names>C. A.</given-names></name> <name><surname>Murray</surname> <given-names>A. W.</given-names></name> <name><surname>Verstrepen</surname> <given-names>K. J.</given-names></name></person-group> (<year>2010</year>). <article-title>Rapid expansion and functional divergence of subtelomeric gene families in yeasts.</article-title> <source><italic>Curr. Biol.</italic></source> <volume>20</volume> <fpage>895</fpage>&#x2013;<lpage>903</lpage>. <pub-id pub-id-type="doi">10.1016/j.cub.2010.04.027</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Butenko</surname> <given-names>Y.</given-names></name> <name><surname>Ohad</surname> <given-names>N.</given-names></name></person-group> (<year>2011</year>). <article-title>Polycomb-group mediated epigenetic mechanisms through plant evolution.</article-title> <source><italic>Biochim. Biophys. Acta</italic></source> <volume>1809</volume> <fpage>395</fpage>&#x2013;<lpage>406</lpage>. <pub-id pub-id-type="doi">10.1016/j.bbagrm.2011.05.013</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Camacho</surname> <given-names>C.</given-names></name> <name><surname>Coulouris</surname> <given-names>G.</given-names></name> <name><surname>Avagyan</surname> <given-names>V.</given-names></name> <name><surname>Ma</surname> <given-names>N.</given-names></name> <name><surname>Papadopoulos</surname> <given-names>J.</given-names></name> <name><surname>Bealer</surname> <given-names>K.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>BLAST+: architecture and applications.</article-title> <source><italic>BMC Bioinformatics</italic></source> <volume>10</volume>:<issue>421</issue>. <pub-id pub-id-type="doi">10.1186/1471-2105-10-421</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carlson</surname> <given-names>M.</given-names></name> <name><surname>Botstein</surname> <given-names>D.</given-names></name></person-group> (<year>1983</year>). <article-title>Organization of the <italic>SUC</italic> gene family in <italic>Saccharomyces</italic>.</article-title> <source><italic>Mol. Cell. Biol.</italic></source> <volume>3</volume> <fpage>351</fpage>&#x2013;<lpage>359</lpage>. <pub-id pub-id-type="doi">10.1128/MCB.3.3.351</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cerase</surname> <given-names>A.</given-names></name> <name><surname>Smeets</surname> <given-names>D.</given-names></name> <name><surname>Tang</surname> <given-names>Y. A.</given-names></name> <name><surname>Gdula</surname> <given-names>M.</given-names></name> <name><surname>Kraus</surname> <given-names>F.</given-names></name> <name><surname>Spivakov</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Spatial separation of Xist RNA and polycomb proteins revealed by superresolution microscopy.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>111</volume> <fpage>2235</fpage>&#x2013;<lpage>2240</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1312951111</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>D.</given-names></name> <name><surname>Huang</surname> <given-names>Y.</given-names></name> <name><surname>Ruan</surname> <given-names>Y.</given-names></name> <name><surname>Shen</surname> <given-names>W. H.</given-names></name></person-group> (<year>2016</year>). <article-title>The evolutionary landscape of PRC1 core components in green lineage.</article-title> <source><italic>Planta</italic></source> <volume>243</volume> <fpage>825</fpage>&#x2013;<lpage>846</lpage>. <pub-id pub-id-type="doi">10.1007/s00425-015-2451-9</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cui</surname> <given-names>J.</given-names></name> <name><surname>Zhang</surname> <given-names>Z.</given-names></name> <name><surname>Shao</surname> <given-names>Y.</given-names></name> <name><surname>Zhang</surname> <given-names>K.</given-names></name> <name><surname>Leng</surname> <given-names>P.</given-names></name> <name><surname>Liang</surname> <given-names>Z.</given-names></name></person-group> (<year>2015</year>). <article-title>Genome-wide identification, evolutionary, and expression analyses of histone H3 variants in plants.</article-title> <source><italic>Biomed. Res. Int.</italic></source> <volume>2015</volume>:<issue>341598</issue>. <pub-id pub-id-type="doi">10.1155/2015/341598</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Del Prete</surname> <given-names>S.</given-names></name> <name><surname>Mikulski</surname> <given-names>P.</given-names></name> <name><surname>Schubert</surname> <given-names>D.</given-names></name> <name><surname>Gaudin</surname> <given-names>V.</given-names></name></person-group> (<year>2015</year>). <article-title>One, two, three: Polycomb proteins hit all dimensions of gene regulation.</article-title> <source><italic>Genes</italic></source> <volume>6</volume> <fpage>520</fpage>&#x2013;<lpage>542</lpage>. <pub-id pub-id-type="doi">10.3390/genes6030520</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Deleris</surname> <given-names>A.</given-names></name> <name><surname>Stroud</surname> <given-names>H.</given-names></name> <name><surname>Bernatavichute</surname> <given-names>Y.</given-names></name> <name><surname>Johnson</surname> <given-names>E.</given-names></name> <name><surname>Klein</surname> <given-names>G.</given-names></name> <name><surname>Schubert</surname> <given-names>D.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Loss of the DNA methyltransferase MET1 Induces H3K9 hypermethylation at PcG target genes and redistribution of H3K27 trimethylation to transposons in <italic>Arabidopsis thaliana</italic>.</article-title> <source><italic>PLoS Genet.</italic></source> <volume>8</volume>:<issue>e1003062</issue>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1003062</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Deng</surname> <given-names>W.</given-names></name> <name><surname>Buzas</surname> <given-names>D. M.</given-names></name> <name><surname>Ying</surname> <given-names>H.</given-names></name> <name><surname>Robertson</surname> <given-names>M.</given-names></name> <name><surname>Taylor</surname> <given-names>J.</given-names></name> <name><surname>Peacock</surname> <given-names>W. J.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Arabidopsis Polycomb Repressive Complex 2 binding sites contain putative GAGA factor binding motifs within coding regions of genes.</article-title> <source><italic>BMC Genomics</italic></source> <volume>14</volume>:<issue>593</issue>. <pub-id pub-id-type="doi">10.1186/1471-2164-14-593</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Derkacheva</surname> <given-names>M.</given-names></name> <name><surname>Hennig</surname> <given-names>L.</given-names></name></person-group> (<year>2014</year>). <article-title>Variations on a theme: Polycomb group proteins in plants.</article-title> <source><italic>J. Exp. Bot.</italic></source> <volume>65</volume> <fpage>2769</fpage>&#x2013;<lpage>2784</lpage>. <pub-id pub-id-type="doi">10.1093/jxb/ert410</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Du</surname> <given-names>Z.</given-names></name> <name><surname>Zhou</surname> <given-names>X.</given-names></name> <name><surname>Ling</surname> <given-names>Y.</given-names></name> <name><surname>Zhang</surname> <given-names>Z.</given-names></name> <name><surname>Su</surname> <given-names>Z.</given-names></name></person-group> (<year>2010</year>). <article-title>agriGO: a GO analysis toolkit for the agricultural community.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>38</volume> <fpage>W64</fpage>&#x2013;<lpage>W70</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkq310</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dumesic</surname> <given-names>P. A.</given-names></name> <name><surname>Homer</surname> <given-names>C. M.</given-names></name> <name><surname>Moresco</surname> <given-names>J. J.</given-names></name> <name><surname>Pack</surname> <given-names>L. R.</given-names></name> <name><surname>Shanle</surname> <given-names>E. K.</given-names></name> <name><surname>Coyle</surname> <given-names>S. M.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Product binding enforces the genomic specificity of a yeast polycomb repressive complex.</article-title> <source><italic>Cell</italic></source> <volume>160</volume> <fpage>204</fpage>&#x2013;<lpage>218</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2014.11.039</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ebert</surname> <given-names>A.</given-names></name> <name><surname>Schotta</surname> <given-names>G.</given-names></name> <name><surname>Lein</surname> <given-names>S.</given-names></name> <name><surname>Kubicek</surname> <given-names>S.</given-names></name> <name><surname>Krauss</surname> <given-names>V.</given-names></name> <name><surname>Jenuwein</surname> <given-names>T.</given-names></name><etal/></person-group> (<year>2004</year>). <article-title>Su(var) genes regulate the balance between euchromatin and heterochromatin in <italic>Drosophila</italic>.</article-title> <source><italic>Genes Dev.</italic></source> <volume>18</volume> <fpage>2973</fpage>&#x2013;<lpage>2983</lpage>. <pub-id pub-id-type="doi">10.1101/gad.323004</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fajkus</surname> <given-names>J.</given-names></name> <name><surname>Kovar&#x00ED;k</surname> <given-names>A.</given-names></name> <name><surname>Kr&#x00E1;lovics</surname> <given-names>R.</given-names></name> <name><surname>Bezd&#x011B;k</surname> <given-names>M.</given-names></name></person-group> (<year>1995</year>). <article-title>Organization of telomeric and subtelomeric chromatin in the higher plant <italic>Nicotiana tabacum</italic>.</article-title> <source><italic>Mol. Gen. Genet.</italic></source> <volume>247</volume> <fpage>633</fpage>&#x2013;<lpage>638</lpage>. <pub-id pub-id-type="doi">10.1007/BF00290355</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ferrari</surname> <given-names>K. J.</given-names></name> <name><surname>Scelfo</surname> <given-names>A.</given-names></name> <name><surname>Jammula</surname> <given-names>S.</given-names></name> <name><surname>Cuomo</surname> <given-names>A.</given-names></name> <name><surname>Barozzi</surname> <given-names>I.</given-names></name> <name><surname>St&#x00FC;tzer</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Polycomb-dependent H3K27me1 and H3K27me2 regulate active transcription and enhancer fidelity.</article-title> <source><italic>Mol. Cell</italic></source> <volume>53</volume> <fpage>49</fpage>&#x2013;<lpage>62</lpage>. <pub-id pub-id-type="doi">10.1016/j.molcel.2013.10.030</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hennig</surname> <given-names>L.</given-names></name> <name><surname>Derkacheva</surname> <given-names>M.</given-names></name></person-group> (<year>2009</year>). <article-title>Diversity of Polycomb group complexes in plants: same rules, different players?</article-title> <source><italic>Trends Genet.</italic></source> <volume>25</volume> <fpage>414</fpage>&#x2013;<lpage>423</lpage>. <pub-id pub-id-type="doi">10.1016/j.tig.2009.07.002</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Higashiyama</surname> <given-names>T.</given-names></name> <name><surname>Maki</surname> <given-names>S.</given-names></name> <name><surname>Yamada</surname> <given-names>T.</given-names></name></person-group> (<year>1995</year>). <article-title>Molecular organization of <italic>Chlorella vulgaris</italic> chromosome I: presence of telomeric repeats that are conserved in higher plants.</article-title> <source><italic>Mol. Gen. Genet.</italic></source> <volume>246</volume> <fpage>29</fpage>&#x2013;<lpage>36</lpage>. <pub-id pub-id-type="doi">10.1007/BF00290130</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>Y.</given-names></name> <name><surname>Chen</surname> <given-names>D.-H.</given-names></name> <name><surname>Liu</surname> <given-names>B.-Y.</given-names></name> <name><surname>Shen</surname> <given-names>W.-H.</given-names></name> <name><surname>Ruan</surname> <given-names>Y.</given-names></name></person-group> (<year>2016</year>). <article-title>Conservation and diversification of polycomb repressive complex 2 (PRC2) proteins in the green lineage.</article-title> <source><italic>Brief. Funct. Genomics</italic></source> <volume>16</volume> <fpage>106</fpage>&#x2013;<lpage>119</lpage>. <pub-id pub-id-type="doi">10.1093/bfgp/elw007</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jacob</surname> <given-names>Y.</given-names></name> <name><surname>Feng</surname> <given-names>S.</given-names></name> <name><surname>LeBlanc</surname> <given-names>C. A.</given-names></name> <name><surname>Bernatavichute</surname> <given-names>Y. V.</given-names></name> <name><surname>Stroud</surname> <given-names>H.</given-names></name> <name><surname>Cokus</surname> <given-names>S.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>ATXR5 and ATXR6 are H3K27 monomethyltransferases required for chromatin structure and gene silencing.</article-title> <source><italic>Nat. Struct. Mol. Biol.</italic></source> <volume>16</volume> <fpage>763</fpage>&#x2013;<lpage>768</lpage>. <pub-id pub-id-type="doi">10.1038/nsmb.1611</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jacob</surname> <given-names>Y.</given-names></name> <name><surname>Michaels</surname> <given-names>S. D.</given-names></name></person-group> (<year>2009</year>). <article-title>H3K27me1 is E(z) in animals, but not in plants.</article-title> <source><italic>Epigenetics</italic></source> <volume>4</volume> <fpage>366</fpage>&#x2013;<lpage>369</lpage>. <pub-id pub-id-type="doi">10.4161/epi.4.6.9713</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jamieson</surname> <given-names>K.</given-names></name> <name><surname>Rountree</surname> <given-names>M. R.</given-names></name> <name><surname>Lewis</surname> <given-names>Z. A.</given-names></name> <name><surname>Stajich</surname> <given-names>J. E.</given-names></name> <name><surname>Selker</surname> <given-names>E. U.</given-names></name></person-group> (<year>2013</year>). <article-title>Regional control of histone H3 lysine 27 methylation in <italic>Neurospora</italic>.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>110</volume> <fpage>6027</fpage>&#x2013;<lpage>6032</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1303750110</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jeffery</surname> <given-names>C. J.</given-names></name></person-group> (<year>2003</year>). <article-title>Moonlighting proteins: old proteins learning new tricks.</article-title> <source><italic>Trends Genet.</italic></source> <volume>19</volume> <fpage>415</fpage>&#x2013;<lpage>417</lpage>. <pub-id pub-id-type="doi">10.1016/S0168-9525(03)00167-7</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jiang</surname> <given-names>J.</given-names></name> <name><surname>Hui</surname> <given-names>C.-C.</given-names></name></person-group> (<year>2008</year>). <article-title>Hedgehog signaling in development and cancer.</article-title> <source><italic>Dev. Cell</italic></source> <volume>15</volume> <fpage>801</fpage>&#x2013;<lpage>812</lpage>. <pub-id pub-id-type="doi">10.1016/j.devcel.2008.11.010</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>E.</given-names></name> <name><surname>Ma</surname> <given-names>X.</given-names></name> <name><surname>Cerutti</surname> <given-names>H.</given-names></name></person-group> (<year>2013</year>). <article-title>Gene silencing in microalgae?: mechanisms and biological roles.</article-title> <source><italic>Bioresour. Technol.</italic></source> <volume>184</volume> <fpage>23</fpage>&#x2013;<lpage>32</lpage>. <pub-id pub-id-type="doi">10.1016/j.biortech.2014.10.119</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kleinmanns</surname> <given-names>J. A.</given-names></name> <name><surname>Schubert</surname> <given-names>D.</given-names></name></person-group> (<year>2014</year>). <article-title>Polycomb and Trithorax group protein-mediated control of stress responses in plants.</article-title> <source><italic>Biol. Chem.</italic></source> <volume>395</volume> <fpage>1291</fpage>&#x2013;<lpage>1300</lpage>. <pub-id pub-id-type="doi">10.1515/hsz-2014-0197</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>K&#x00F6;hler</surname> <given-names>C.</given-names></name> <name><surname>Villar</surname> <given-names>C. B. R.</given-names></name></person-group> (<year>2008</year>). <article-title>Programming of gene expression by Polycomb group proteins.</article-title> <source><italic>Trends Cell Biol.</italic></source> <volume>18</volume> <fpage>236</fpage>&#x2013;<lpage>243</lpage>. <pub-id pub-id-type="doi">10.1016/j.tcb.2008.02.005</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lachner</surname> <given-names>M.</given-names></name> <name><surname>Sengupta</surname> <given-names>R.</given-names></name> <name><surname>Schotta</surname> <given-names>G.</given-names></name> <name><surname>Jenuwein</surname> <given-names>T.</given-names></name></person-group> (<year>2004</year>). <article-title>Trilogies of histone lysine methylation as epigenetic landmarks of the eukaryotic genome.</article-title> <source><italic>Cold Spring Harb. Symp. Quant. Biol.</italic></source> <volume>69</volume> <fpage>209</fpage>&#x2013;<lpage>218</lpage>. <pub-id pub-id-type="doi">10.1101/sqb.2004.69.209</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lafos</surname> <given-names>M.</given-names></name> <name><surname>Kroll</surname> <given-names>P.</given-names></name> <name><surname>Hohenstatt</surname> <given-names>M. L.</given-names></name> <name><surname>Thorpe</surname> <given-names>F. L.</given-names></name> <name><surname>Clarenz</surname> <given-names>O.</given-names></name> <name><surname>Schubert</surname> <given-names>D.</given-names></name></person-group> (<year>2011</year>). <article-title>Dynamic regulation of H3K27 trimethylation during <italic>Arabidopsis</italic> differentiation.</article-title> <source><italic>PLoS Genet.</italic></source> <volume>7</volume>:<issue>e1002040</issue>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1002040</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Larkin</surname> <given-names>M. A.</given-names></name> <name><surname>Blackshields</surname> <given-names>G.</given-names></name> <name><surname>Brown</surname> <given-names>N. P.</given-names></name> <name><surname>Chenna</surname> <given-names>R.</given-names></name> <name><surname>Mcgettigan</surname> <given-names>P. A.</given-names></name> <name><surname>McWilliam</surname> <given-names>H.</given-names></name><etal/></person-group> (<year>2007</year>). <article-title>Clustal W and Clustal X version 2.0.</article-title> <source><italic>Bioinformatics</italic></source> <volume>23</volume> <fpage>2947</fpage>&#x2013;<lpage>2948</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btm404</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Linardopoulou</surname> <given-names>E.</given-names></name> <name><surname>Mefford</surname> <given-names>H. C.</given-names></name> <name><surname>Nguyen</surname> <given-names>O.</given-names></name> <name><surname>Friedman</surname> <given-names>C.</given-names></name> <name><surname>van den Engh</surname> <given-names>G.</given-names></name> <name><surname>Farwell</surname> <given-names>D. G.</given-names></name><etal/></person-group> (<year>2001</year>). <article-title>Transcriptional activity of multiple copies of a subtelomerically located olfactory receptor gene that is polymorphic in number and location.</article-title> <source><italic>Hum. Mol. Genet.</italic></source> <volume>10</volume> <fpage>2373</fpage>&#x2013;<lpage>2383</lpage>. <pub-id pub-id-type="doi">10.1093/hmg/10.21.2373</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lindroth</surname> <given-names>A. M.</given-names></name> <name><surname>Shultis</surname> <given-names>D.</given-names></name> <name><surname>Jasencakova</surname> <given-names>Z.</given-names></name> <name><surname>Fuchs</surname> <given-names>J.</given-names></name> <name><surname>Johnson</surname> <given-names>L.</given-names></name> <name><surname>Schubert</surname> <given-names>D.</given-names></name><etal/></person-group> (<year>2004</year>). <article-title>Dual histone H3 methylation marks at lysines 9 and 27 required for interaction with CHROMOMETHYLASE3.</article-title> <source><italic>EMBO J.</italic></source> <volume>23</volume> <fpage>4286</fpage>&#x2013;<lpage>4296</lpage>. <pub-id pub-id-type="doi">10.1038/sj.emboj.7600430</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>Y.</given-names></name> <name><surname>Taverna</surname> <given-names>S. D.</given-names></name> <name><surname>Muratore</surname> <given-names>T. L.</given-names></name> <name><surname>Shabanowitz</surname> <given-names>J.</given-names></name> <name><surname>Hunt</surname> <given-names>D. F.</given-names></name> <name><surname>Allis</surname> <given-names>C. D.</given-names></name></person-group> (<year>2007</year>). <article-title>RNAi-dependent H3K27 methylation is required for heterochromatin formation and DNA elimination in <italic>Tetrahymena</italic>.</article-title> <source><italic>Genes Dev.</italic></source> <volume>21</volume> <fpage>1530</fpage>&#x2013;<lpage>1545</lpage>. <pub-id pub-id-type="doi">10.1101/gad.1544207</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Louis</surname> <given-names>E. J.</given-names></name> <name><surname>Naumova</surname> <given-names>E. S.</given-names></name> <name><surname>Lee</surname> <given-names>A.</given-names></name> <name><surname>Naumov</surname> <given-names>G.</given-names></name> <name><surname>Haber</surname> <given-names>J. E.</given-names></name></person-group> (<year>1994</year>). <article-title>The chromosome end in yeast: its mosaic nature and influence on recombinational dynamics.</article-title> <source><italic>Genetics</italic></source> <volume>136</volume> <fpage>789</fpage>&#x2013;<lpage>802</lpage>.</citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Matsuzaki</surname> <given-names>M.</given-names></name> <name><surname>Misumi</surname> <given-names>O.</given-names></name> <name><surname>Shin-I</surname> <given-names>T.</given-names></name> <name><surname>Maruyama</surname> <given-names>S.</given-names></name> <name><surname>Takahara</surname> <given-names>M.</given-names></name> <name><surname>Miyagishima</surname> <given-names>S.-Y.</given-names></name><etal/></person-group> (<year>2004</year>). <article-title>Genome sequence of the ultrasmall unicellular red alga <italic>Cyanidioschyzon merolae</italic> 10D.</article-title> <source><italic>Nature</italic></source> <volume>428</volume> <fpage>653</fpage>&#x2013;<lpage>657</lpage>. <pub-id pub-id-type="doi">10.1038/nature02398</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McKay</surname> <given-names>D. J.</given-names></name> <name><surname>Klusza</surname> <given-names>S.</given-names></name> <name><surname>Penke</surname> <given-names>T. J. R.</given-names></name> <name><surname>Meers</surname> <given-names>M. P.</given-names></name> <name><surname>Curry</surname> <given-names>K. P.</given-names></name> <name><surname>McDaniel</surname> <given-names>S. L.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Interrogating the function of metazoan histones using engineered gene clusters.</article-title> <source><italic>Dev. Cell</italic></source> <volume>32</volume> <fpage>373</fpage>&#x2013;<lpage>386</lpage>. <pub-id pub-id-type="doi">10.1016/j.devcel.2014.12.025</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Merchant</surname> <given-names>S. S.</given-names></name> <name><surname>Prochnik</surname> <given-names>S. E.</given-names></name> <name><surname>Vallon</surname> <given-names>O.</given-names></name> <name><surname>Harris</surname> <given-names>E. H.</given-names></name> <name><surname>Karpowicz</surname> <given-names>J.</given-names></name> <name><surname>Witman</surname> <given-names>G. B.</given-names></name><etal/></person-group> (<year>2010</year>). <article-title>The <italic>Chlamydomonas</italic> genome reveals the evolution of key animal and plant functions.</article-title> <source><italic>Science</italic></source> <volume>318</volume> <fpage>245</fpage>&#x2013;<lpage>250</lpage>. <pub-id pub-id-type="doi">10.1126/science.1143609.The</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>M&#x00FC;ller</surname> <given-names>J.</given-names></name> <name><surname>Hart</surname> <given-names>C. M.</given-names></name> <name><surname>Francis</surname> <given-names>N. J.</given-names></name> <name><surname>Vargas</surname> <given-names>M. L.</given-names></name> <name><surname>Sengupta</surname> <given-names>A.</given-names></name> <name><surname>Wild</surname> <given-names>B.</given-names></name><etal/></person-group> (<year>2002</year>). <article-title>Histone methyltransferase activity of a <italic>Drosophila</italic> Polycomb group repressor complex.</article-title> <source><italic>Cell</italic></source> <volume>111</volume> <fpage>197</fpage>&#x2013;<lpage>208</lpage>. <pub-id pub-id-type="doi">10.1016/S0092-8674(02)00976-5</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>M&#x00FC;ller</surname> <given-names>J.</given-names></name> <name><surname>Verrijzer</surname> <given-names>P.</given-names></name></person-group> (<year>2009</year>). <article-title>Biochemical mechanisms of gene regulation by polycomb group protein complexes.</article-title> <source><italic>Curr. Opin. Genet. Dev.</italic></source> <volume>19</volume> <fpage>150</fpage>&#x2013;<lpage>158</lpage>. <pub-id pub-id-type="doi">10.1016/j.gde.2009.03.001</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nekrasov</surname> <given-names>M.</given-names></name> <name><surname>Klymenko</surname> <given-names>T.</given-names></name> <name><surname>Fraterman</surname> <given-names>S.</given-names></name> <name><surname>Papp</surname> <given-names>B.</given-names></name> <name><surname>Oktaba</surname> <given-names>K.</given-names></name> <name><surname>K&#x00F6;cher</surname> <given-names>T.</given-names></name><etal/></person-group> (<year>2007</year>). <article-title>Pcl-PRC2 is needed to generate high levels of H3-K27 trimethylation at Polycomb target genes.</article-title> <source><italic>EMBO J.</italic></source> <volume>26</volume> <fpage>4078</fpage>&#x2013;<lpage>4088</lpage>. <pub-id pub-id-type="doi">10.1038/sj.emboj.7601837</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Novikova</surname> <given-names>O.</given-names></name> <name><surname>Topilina</surname> <given-names>N.</given-names></name> <name><surname>Belfort</surname> <given-names>M.</given-names></name></person-group> (<year>2014</year>). <article-title>Enigmatic distribution, evolution, and function of inteins.</article-title> <source><italic>J. Biol. Chem.</italic></source> <volume>289</volume> <fpage>14490</fpage>&#x2013;<lpage>14497</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.R114.548255</pub-id></citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nozaki</surname> <given-names>H.</given-names></name> <name><surname>Takano</surname> <given-names>H.</given-names></name> <name><surname>Misumi</surname> <given-names>O.</given-names></name> <name><surname>Terasawa</surname> <given-names>K.</given-names></name> <name><surname>Matsuzaki</surname> <given-names>M.</given-names></name> <name><surname>Maruyama</surname> <given-names>S.</given-names></name><etal/></person-group> (<year>2007</year>). <article-title>A 100%-complete sequence reveals unusually simple genomic features in the hot-spring red alga <italic>Cyanidioschyzon merolae</italic>.</article-title> <source><italic>BMC Biol.</italic></source> <volume>5</volume>:<issue>28</issue>. <pub-id pub-id-type="doi">10.1186/1741-7007-5-28</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Okada</surname> <given-names>T.</given-names></name> <name><surname>Endo</surname> <given-names>M.</given-names></name> <name><surname>Singh</surname> <given-names>M. B.</given-names></name> <name><surname>Bhalla</surname> <given-names>P. L.</given-names></name></person-group> (<year>2005</year>). <article-title>Analysis of the histone H3 gene family in Arabidopsis and identification of the male-gamete-specific variant <italic>AtMGH3</italic>.</article-title> <source><italic>Plant J.</italic></source> <volume>44</volume> <fpage>557</fpage>&#x2013;<lpage>568</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-313X.2005.02554.x</pub-id></citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Palenik</surname> <given-names>B.</given-names></name> <name><surname>Grimwood</surname> <given-names>J.</given-names></name> <name><surname>Aerts</surname> <given-names>A.</given-names></name> <name><surname>Rouz&#x00E9;</surname> <given-names>P.</given-names></name> <name><surname>Salamov</surname> <given-names>A.</given-names></name> <name><surname>Putnam</surname> <given-names>N.</given-names></name><etal/></person-group> (<year>2007</year>). <article-title>The tiny eukaryote <italic>Ostreococcus</italic> provides genomic insights into the paradox of plankton speciation.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>104</volume> <fpage>7705</fpage>&#x2013;<lpage>7710</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0611046104</pub-id></citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Papp</surname> <given-names>B.</given-names></name> <name><surname>M&#x00FC;ller</surname> <given-names>J.</given-names></name></person-group> (<year>2006</year>). <article-title>Histone trimethylation and the maintenance of transcriptional ON and OFF states by trxG and PcG proteins.</article-title> <source><italic>Genes Dev.</italic></source> <volume>20</volume> <fpage>2041</fpage>&#x2013;<lpage>2054</lpage>. <pub-id pub-id-type="doi">10.1101/gad.388706</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Park</surname> <given-names>S.</given-names></name> <name><surname>Oh</surname> <given-names>S.</given-names></name> <name><surname>van Nocker</surname> <given-names>S.</given-names></name></person-group> (<year>2012</year>). <article-title>Genomic and gene-level distribution of histone H3 dimethyl lysine-27 (H3K27me2) in Arabidopsis.</article-title> <source><italic>PLoS ONE</italic></source> <volume>7</volume>:<issue>e52855</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0052855</pub-id></citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pauler</surname> <given-names>F. M.</given-names></name> <name><surname>Sloane</surname> <given-names>M. A.</given-names></name> <name><surname>Huang</surname> <given-names>R.</given-names></name> <name><surname>Regha</surname> <given-names>K.</given-names></name> <name><surname>Koerner</surname> <given-names>M. V.</given-names></name> <name><surname>Tamir</surname> <given-names>I.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>H3K27me3 forms BLOCs over silent genes and intergenic regions and specifies a histone banding pattern on a mouse autosomal chromosome.</article-title> <source><italic>Genome Res.</italic></source> <volume>19</volume> <fpage>221</fpage>&#x2013;<lpage>233</lpage>. <pub-id pub-id-type="doi">10.1101/gr.080861.108.4</pub-id></citation></ref>
<ref id="B55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pietrokovski</surname> <given-names>S.</given-names></name></person-group> (<year>2001</year>). <article-title>Intein spread and extinction in evolution.</article-title> <source><italic>Trends Genet.</italic></source> <volume>17</volume> <fpage>465</fpage>&#x2013;<lpage>472</lpage>. <pub-id pub-id-type="doi">10.1016/S0168-9525(01)02365-4</pub-id></citation></ref>
<ref id="B56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Price</surname> <given-names>D. C.</given-names></name> <name><surname>Chan</surname> <given-names>C. X.</given-names></name> <name><surname>Yoon</surname> <given-names>H. S.</given-names></name> <name><surname>Yang</surname> <given-names>E. C.</given-names></name> <name><surname>Qiu</surname> <given-names>H.</given-names></name> <name><surname>Weber</surname> <given-names>A. P.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Cyanophora paradoxa genome elucidates origin of photosynthesis in algae and plants.</article-title> <source><italic>Science</italic></source> <volume>335</volume> <fpage>843</fpage>&#x2013;<lpage>847</lpage>. <pub-id pub-id-type="doi">10.1126/science.1213561</pub-id></citation></ref>
<ref id="B57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Quinlan</surname> <given-names>A. R.</given-names></name> <name><surname>Hall</surname> <given-names>I. M.</given-names></name></person-group> (<year>2010</year>). <article-title>BEDTools: a flexible suite of utilities for comparing genomic features.</article-title> <source><italic>Bioinformatics</italic></source> <volume>26</volume> <fpage>841</fpage>&#x2013;<lpage>842</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btq033</pub-id></citation></ref>
<ref id="B58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rambaut</surname> <given-names>A.</given-names></name></person-group> (<year>2009</year>). <source><italic>FigTree, a Graphical Viewer of Phylogenetic Trees</italic>.</source> Available at: <ext-link ext-link-type="uri" xlink:href="http://tree.bio.ed.ac.uk/software/figtree/">http://tree.bio.ed.ac.uk/software/figtree/</ext-link></citation></ref>
<ref id="B59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ramirez</surname> <given-names>F.</given-names></name> <name><surname>Ryan</surname> <given-names>D. P.</given-names></name> <name><surname>Gruning</surname> <given-names>B.</given-names></name> <name><surname>Bhardwaj</surname> <given-names>V.</given-names></name> <name><surname>Kilpert</surname> <given-names>F.</given-names></name> <name><surname>Richter</surname> <given-names>A. S.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>deepTools2: a next generation web server for deep-sequencing data analysis.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>44</volume> <fpage>160</fpage>&#x2013;<lpage>165</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkw257</pub-id></citation></ref>
<ref id="B60"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ronquist</surname> <given-names>F.</given-names></name> <name><surname>Teslenko</surname> <given-names>M.</given-names></name> <name><surname>Van Der Mark</surname> <given-names>P.</given-names></name> <name><surname>Ayres</surname> <given-names>D. L.</given-names></name> <name><surname>Darling</surname> <given-names>A.</given-names></name> <name><surname>H&#x00F6;hna</surname> <given-names>S.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Mrbayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space.</article-title> <source><italic>Syst. Biol.</italic></source> <volume>61</volume> <fpage>539</fpage>&#x2013;<lpage>542</lpage>. <pub-id pub-id-type="doi">10.1093/sysbio/sys029</pub-id></citation></ref>
<ref id="B61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schuettengruber</surname> <given-names>B.</given-names></name> <name><surname>Chourrout</surname> <given-names>D.</given-names></name> <name><surname>Vervoort</surname> <given-names>M.</given-names></name> <name><surname>Leblanc</surname> <given-names>B.</given-names></name> <name><surname>Cavalli</surname> <given-names>G.</given-names></name></person-group> (<year>2007</year>). <article-title>Genome regulation by Polycomb and Trithorax proteins.</article-title> <source><italic>Cell</italic></source> <volume>128</volume> <fpage>735</fpage>&#x2013;<lpage>745</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2007.02.009</pub-id></citation></ref>
<ref id="B62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schwartz</surname> <given-names>Y. B.</given-names></name> <name><surname>Pirrotta</surname> <given-names>V.</given-names></name></person-group> (<year>2008</year>). <article-title>Polycomb complexes and epigenetic states.</article-title> <source><italic>Curr. Opin. Cell Biol.</italic></source> <volume>20</volume> <fpage>266</fpage>&#x2013;<lpage>273</lpage>. <pub-id pub-id-type="doi">10.1016/j.ceb.2008.03.002</pub-id></citation></ref>
<ref id="B63"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schwartz</surname> <given-names>Y. B.</given-names></name> <name><surname>Pirrotta</surname> <given-names>V.</given-names></name></person-group> (<year>2013</year>). <article-title>A new world of Polycombs: unexpected partnerships and emerging functions.</article-title> <source><italic>Nat. Rev. Genet.</italic></source> <volume>14</volume> <fpage>853</fpage>&#x2013;<lpage>864</lpage>. <pub-id pub-id-type="doi">10.1038/nrg3603</pub-id></citation></ref>
<ref id="B64"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shah</surname> <given-names>N. H.</given-names></name> <name><surname>Muir</surname> <given-names>T. W.</given-names></name></person-group> (<year>2014</year>). <article-title>Inteins: nature&#x2019;s gift to protein chemists.</article-title> <source><italic>Chem. Sci.</italic></source> <volume>5</volume> <fpage>446</fpage>&#x2013;<lpage>461</lpage>. <pub-id pub-id-type="doi">10.1039/C3sc52951g</pub-id></citation></ref>
<ref id="B65"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shaver</surname> <given-names>S.</given-names></name> <name><surname>Casas-Mollano</surname> <given-names>J. A.</given-names></name> <name><surname>Cerny</surname> <given-names>R. L.</given-names></name> <name><surname>Cerutti</surname> <given-names>H.</given-names></name></person-group> (<year>2010</year>). <article-title>Origin of the polycomb repressive complex 2 and gene silencing by an E(z) homolog in the unicellular alga <italic>Chlamydomonas</italic>.</article-title> <source><italic>Epigenetics</italic></source> <volume>5</volume> <fpage>301</fpage>&#x2013;<lpage>312</lpage>. <pub-id pub-id-type="doi">10.4161/epi.5.4.11608</pub-id></citation></ref>
<ref id="B66"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Smith</surname> <given-names>K. M.</given-names></name> <name><surname>Kothe</surname> <given-names>G. O.</given-names></name> <name><surname>Matsen</surname> <given-names>C. B.</given-names></name> <name><surname>Khlafallah</surname> <given-names>T. K.</given-names></name> <name><surname>Adhvaryu</surname> <given-names>K. K.</given-names></name> <name><surname>Hemphill</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2008</year>). <article-title>The fungus <italic>Neurospora crassa</italic> displays telomeric silencing mediated by multiple sirtuins and by methylation of histone H3 lysine 9.</article-title> <source><italic>Epigenetics Chromatin</italic></source> <volume>1</volume>:<issue>5</issue>. <pub-id pub-id-type="doi">10.1186/1756-8935-1-5</pub-id></citation></ref>
<ref id="B67"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sung</surname> <given-names>S.</given-names></name> <name><surname>Schmitz</surname> <given-names>R. J.</given-names></name> <name><surname>Amasino</surname> <given-names>R. M.</given-names></name></person-group> (<year>2006</year>). <article-title>A PHD finger protein involved in both the vernalization and photoperiod pathways in <italic>Arabidopsis</italic>.</article-title> <source><italic>Genes Dev.</italic></source> <volume>20</volume> <fpage>3244</fpage>&#x2013;<lpage>3248</lpage>. <pub-id pub-id-type="doi">10.1101/gad.1493306</pub-id></citation></ref>
<ref id="B68"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Talavera</surname> <given-names>G.</given-names></name> <name><surname>Castresana</surname> <given-names>J.</given-names></name></person-group> (<year>2007</year>). <article-title>Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments.</article-title> <source><italic>Syst. Biol.</italic></source> <volume>56</volume> <fpage>564</fpage>&#x2013;<lpage>577</lpage>. <pub-id pub-id-type="doi">10.1080/10635150701472164</pub-id></citation></ref>
<ref id="B69"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thorvaldsd&#x00F3;ttir</surname> <given-names>H.</given-names></name> <name><surname>Robinson</surname> <given-names>J. T.</given-names></name> <name><surname>Mesirov</surname> <given-names>J. P.</given-names></name></person-group> (<year>2013</year>). <article-title>Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration.</article-title> <source><italic>Brief. Bioinform.</italic></source> <volume>14</volume> <fpage>178</fpage>&#x2013;<lpage>192</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbs017</pub-id></citation></ref>
<ref id="B70"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Topilina</surname> <given-names>N. I.</given-names></name> <name><surname>Mills</surname> <given-names>K. V.</given-names></name></person-group> (<year>2014</year>). <article-title>Recent advances in <italic>in vivo</italic> applications of intein-mediated protein splicing.</article-title> <source><italic>Mob. DNA</italic></source> <volume>5</volume>:<issue>5</issue>. <pub-id pub-id-type="doi">10.1186/1759-8753-5-5</pub-id></citation></ref>
<ref id="B71"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Trapnell</surname> <given-names>C.</given-names></name> <name><surname>Roberts</surname> <given-names>A.</given-names></name> <name><surname>Goff</surname> <given-names>L.</given-names></name> <name><surname>Pertea</surname> <given-names>G.</given-names></name> <name><surname>Kim</surname> <given-names>D.</given-names></name> <name><surname>Kelley</surname> <given-names>D. R.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks.</article-title> <source><italic>Nat. Protoc.</italic></source> <volume>7</volume> <fpage>562</fpage>&#x2013;<lpage>578</lpage>. <pub-id pub-id-type="doi">10.1038/nprot.2012.016</pub-id></citation></ref>
<ref id="B72"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vaquero-sedas</surname> <given-names>I.</given-names></name> <name><surname>Luo</surname> <given-names>C.</given-names></name> <name><surname>Vega-palas</surname> <given-names>M. A.</given-names></name></person-group> (<year>2012</year>). <article-title>Analysis of the epigenetic status of telomeres by using ChIP-seq data.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>40</volume>:<issue>e163</issue>. <pub-id pub-id-type="doi">10.1093/nar/gks730</pub-id></citation></ref>
<ref id="B73"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Veerappan</surname> <given-names>C. S.</given-names></name> <name><surname>Avramova</surname> <given-names>Z.</given-names></name> <name><surname>Moriyama</surname> <given-names>E. N.</given-names></name></person-group> (<year>2008</year>). <article-title>Evolution of SET-domain protein families in the unicellular and multicellular Ascomycota fungi.</article-title> <source><italic>BMC Evol. Biol.</italic></source> <volume>8</volume>:<issue>190</issue>. <pub-id pub-id-type="doi">10.1186/1471-2148-8-190</pub-id></citation></ref>
<ref id="B74"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Veluchamy</surname> <given-names>A.</given-names></name> <name><surname>Rastogi</surname> <given-names>A.</given-names></name> <name><surname>Lin</surname> <given-names>X.</given-names></name> <name><surname>Lombard</surname> <given-names>B.</given-names></name> <name><surname>Murik</surname> <given-names>O.</given-names></name> <name><surname>Thomas</surname> <given-names>Y.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>An integrative analysis of post-translational histone modifications in the marine diatom <italic>Phaeodactylum tricornutum</italic>.</article-title> <source><italic>Genome Biol.</italic></source> <volume>16</volume>:<issue>102</issue>. <pub-id pub-id-type="doi">10.1186/s13059-015-0671-8</pub-id></citation></ref>
<ref id="B75"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Waterhouse</surname> <given-names>A. M.</given-names></name> <name><surname>Procter</surname> <given-names>J. B.</given-names></name> <name><surname>Martin</surname> <given-names>D. M. A.</given-names></name> <name><surname>Clamp</surname> <given-names>M.</given-names></name> <name><surname>Barton</surname> <given-names>G. J.</given-names></name></person-group> (<year>2009</year>). <article-title>Jalview Version 2&#x2013;A multiple sequence alignment editor and analysis workbench.</article-title> <source><italic>Bioinformatics</italic></source> <volume>25</volume> <fpage>1189</fpage>&#x2013;<lpage>1191</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btp033</pub-id></citation></ref>
<ref id="B76"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Weinhofer</surname> <given-names>I.</given-names></name> <name><surname>Hehenberger</surname> <given-names>E.</given-names></name> <name><surname>Roszak</surname> <given-names>P.</given-names></name> <name><surname>Hennig</surname> <given-names>L.</given-names></name> <name><surname>K&#x00F6;hler</surname> <given-names>C.</given-names></name></person-group> (<year>2010</year>). <article-title>H3K27me3 profiling of the endosperm implies exclusion of polycomb group protein targeting by DNA methylation.</article-title> <source><italic>PLoS Genet.</italic></source> <volume>6</volume>:<issue>e1001152</issue>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1001152</pub-id></citation></ref>
<ref id="B77"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wolff</surname> <given-names>P.</given-names></name> <name><surname>Weinhofer</surname> <given-names>I.</given-names></name> <name><surname>Seguin</surname> <given-names>J.</given-names></name> <name><surname>Roszak</surname> <given-names>P.</given-names></name> <name><surname>Beisel</surname> <given-names>C.</given-names></name> <name><surname>Donoghue</surname> <given-names>M. T. A.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>High-resolution analysis of parent-of-origin allelic expression in the Arabidopsis endosperm.</article-title> <source><italic>PLoS Genet.</italic></source> <volume>7</volume>:<issue>e1002126</issue>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1002126</pub-id></citation></ref>
<ref id="B78"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Worden</surname> <given-names>A. Z.</given-names></name> <name><surname>Lee</surname> <given-names>J.-H.</given-names></name> <name><surname>Mock</surname> <given-names>T.</given-names></name> <name><surname>Rouz&#x00E9;</surname> <given-names>P.</given-names></name> <name><surname>Simmons</surname> <given-names>M. P.</given-names></name> <name><surname>Aerts</surname> <given-names>A. L.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>Green evolution and dynamic adaptations revealed by genomes of the marine picoeukaryotes <italic>Micromonas</italic>.</article-title> <source><italic>Science</italic></source> <volume>324</volume> <fpage>268</fpage>&#x2013;<lpage>272</lpage>. <pub-id pub-id-type="doi">10.1126/science.1167222</pub-id></citation></ref>
<ref id="B79"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>C.</given-names></name> <name><surname>Bratzel</surname> <given-names>F.</given-names></name> <name><surname>Hohmann</surname> <given-names>N.</given-names></name> <name><surname>Koch</surname> <given-names>M.</given-names></name> <name><surname>Turck</surname> <given-names>F.</given-names></name> <name><surname>Calonje</surname> <given-names>M.</given-names></name></person-group> (<year>2013</year>). <article-title>VAL-and AtBMI1-mediated H2Aub initiate the switch from embryonic to postgerminative growth in <italic>Arabidopsis</italic>.</article-title> <source><italic>Curr. Biol.</italic></source> <volume>23</volume> <fpage>1324</fpage>&#x2013;<lpage>1329</lpage>. <pub-id pub-id-type="doi">10.1016/j.cub.2013.05.050</pub-id></citation></ref>
<ref id="B80"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>H.</given-names></name> <name><surname>Howard</surname> <given-names>M.</given-names></name> <name><surname>Dean</surname> <given-names>C.</given-names></name></person-group> (<year>2014</year>). <article-title>Antagonistic roles for H3K36me3 and H3K27me3 in the cold-induced epigenetic switch at <italic>Arabidopsis</italic> FLC.</article-title> <source><italic>Curr. Biol.</italic></source> <volume>24</volume> <fpage>1793</fpage>&#x2013;<lpage>1797</lpage>. <pub-id pub-id-type="doi">10.1016/j.cub.2014.06.047</pub-id></citation></ref>
<ref id="B81"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yin</surname> <given-names>H.</given-names></name> <name><surname>Sweeney</surname> <given-names>S.</given-names></name> <name><surname>Raha</surname> <given-names>D.</given-names></name> <name><surname>Snyder</surname> <given-names>M.</given-names></name> <name><surname>Lin</surname> <given-names>H.</given-names></name></person-group> (<year>2011</year>). <article-title>A high-resolution whole-genome map of key chromatin modifications in the adult <italic>Drosophila melanogaster</italic>.</article-title> <source><italic>PLoS Genet.</italic></source> <volume>7</volume>:<issue>e1002380</issue>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1002380</pub-id></citation></ref>
<ref id="B82"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Young</surname> <given-names>M. D.</given-names></name> <name><surname>Willson</surname> <given-names>T. A.</given-names></name> <name><surname>Wakefield</surname> <given-names>M. J.</given-names></name> <name><surname>Trounson</surname> <given-names>E.</given-names></name> <name><surname>Hilton</surname> <given-names>D. J.</given-names></name> <name><surname>Blewitt</surname> <given-names>M. E.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>ChIP-seq analysis reveals distinct H3K27me3 profiles that correlate with transcriptional activity.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>39</volume> <fpage>7415</fpage>&#x2013;<lpage>7427</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkr416</pub-id></citation></ref>
<ref id="B83"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Clarenz</surname> <given-names>O.</given-names></name> <name><surname>Cokus</surname> <given-names>S.</given-names></name> <name><surname>Bernatavichute</surname> <given-names>Y. V.</given-names></name> <name><surname>Pellegrini</surname> <given-names>M.</given-names></name> <name><surname>Goodrich</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2007</year>). <article-title>Whole-genome analysis of histone H3 lysine 27 trimethylation in <italic>Arabidopsis</italic>.</article-title> <source><italic>PLoS Biol.</italic></source> <volume>5</volume>:<issue>e129</issue>. <pub-id pub-id-type="doi">10.1371/journal.pbio.0050129</pub-id></citation></ref>
<ref id="B84"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>Y.</given-names></name> <name><surname>Hartwig</surname> <given-names>B.</given-names></name> <name><surname>James</surname> <given-names>G. V.</given-names></name> <name><surname>Schneeberger</surname> <given-names>K.</given-names></name> <name><surname>Turck</surname> <given-names>F.</given-names></name></person-group> (<year>2013</year>). <article-title>Complementary activities of TELOMERE REPEAT BINDING proteins and polycomb group complexes in transcriptional regulation of target genes.</article-title> <source><italic>Plant Cell.</italic></source> <volume>28</volume> <fpage>87</fpage>&#x2013;<lpage>101</lpage>. <pub-id pub-id-type="doi">10.1105/tpc.15.00787</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="fn01"><label>1</label><p><ext-link ext-link-type="uri" xlink:href="http://czon.jp/download/annot_list.txt">czon.jp/download/annot_list.txt</ext-link></p></fn>
</fn-group>
</back>
</article>