<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Microbiol.</journal-id>
<journal-title>Frontiers in Microbiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Microbiol.</abbrev-journal-title>
<issn pub-type="epub">1664-302X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmicb.2023.1112734</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Microbiology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Complete genome assembly and methylome dissection of <italic>Methanococcus aeolicus</italic> PL15/H<italic><sup>p</sup></italic></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Fomenkov</surname> <given-names>Alexey</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1029715/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Weigele</surname> <given-names>Peter</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/1755836/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>McClung</surname> <given-names>Colleen</given-names></name>
</contrib>
<contrib contrib-type="author">
<name><surname>Madinger</surname> <given-names>Casey</given-names></name>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Roberts</surname> <given-names>Richard J.</given-names></name>
<xref ref-type="corresp" rid="c002"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/168347/overview"/>
</contrib>
</contrib-group>
<aff><institution>New England Biolabs Inc.</institution>, <addr-line>Ipswich, MA</addr-line>, <country>United States</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Solenne Ithurbide, University of Freiburg, Germany</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Barny Whitman, University of Georgia, United States; Mohea Couturier, Vrije Universiteit Brussel, Belgium</p></fn>
<corresp id="c001">&#x002A;Correspondence: Alexey Fomenkov, <email>fomenkov@neb.com</email></corresp>
<corresp id="c002">Richard J. Roberts, <email>roberts@neb.com</email></corresp>
<fn fn-type="other" id="fn004"><p>This article was submitted to Biology of Archaea, a section of the journal Frontiers in Microbiology</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>05</day>
<month>04</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>14</volume>
<elocation-id>1112734</elocation-id>
<history>
<date date-type="received">
<day>30</day>
<month>11</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>21</day>
<month>03</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2023 Fomenkov, Weigele, McClung, Madinger and Roberts.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Fomenkov, Weigele, McClung, Madinger and Roberts</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Although restriction-modification systems are found in both Eubacterial and Archaeal kingdoms, comparatively less is known about patterns of DNA methylation and genome defense systems in archaea. Here we report the complete closed genome sequence and methylome analysis of <italic>Methanococcus aeolicus</italic> PL15/H<italic><sup>p</sup></italic>, a strain of the CO<sub>2</sub>-reducing methanogenic archaeon and a commercial source for <italic>Mae</italic>I, <italic>Mae</italic>II, and <italic>Mae</italic>III restriction endonucleases. The <italic>M. aeolicus</italic> PL15/H<italic><sup>p</sup></italic> genome consists of a 1.68 megabase circular chromosome predicted to contain 1,615 protein coding genes and 38 tRNAs. A combination of methylome sequencing, homology-based genome annotation, and recombinant gene expression identified five restriction-modification systems encoded by this organism, including the methyltransferase and site-specific endonuclease of <italic>Mae</italic>III. The <italic>Mae</italic>III restriction endonuclease was recombinantly expressed, purified and shown to have site-specific DNA cleavage activity <italic>in vitro</italic>.</p>
</abstract>
<kwd-group>
<kwd>complete genome</kwd>
<kwd>methylome</kwd>
<kwd><italic>Methanococcus aeolicus</italic> PL15/H<italic><sup>p</sup></italic></kwd>
<kwd>restriction</kwd>
<kwd>modification</kwd>
<kwd>systems</kwd>
</kwd-group>
<counts>
<fig-count count="6"/>
<table-count count="0"/>
<equation-count count="0"/>
<ref-count count="27"/>
<page-count count="10"/>
<word-count count="4875"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="intro">
<title>Introduction</title>
<p>The role of DNA methylation in the restriction-modification (RM) mechanisms of bacteria and archaea was discovered more than 50 years ago (<xref ref-type="bibr" rid="B1">Arber, 1974</xref>; <xref ref-type="bibr" rid="B15">Loenen et al., 2014</xref>). Bacterial and archaeal DNA methyltransferases (MTases) create an epigenetic landscape in DNA by modifying specific motifs and, in concert with their cognate restriction endonucleases, determine the boundaries between bacterial species and protect against the entry of bacteriophages, conjugative plasmids and other mobile DNA elements. In addition to their roles in specifying self/non-self during genome defense, DNA methyltransferases appear as standalone genes; i.e., not associated with restriction endonucleases. Such &#x201C;orphan&#x201D; methyltransferases take on epigenetic functions such as regulation of genes for pathogenesis (<xref ref-type="bibr" rid="B10">Hernday et al., 2003</xref>) and ensuring fidelity of DNA repair (<xref ref-type="bibr" rid="B19">Schlagman et al., 1986</xref>).</p>
<p>Compared to their bacterial counterparts, the RM systems of archaea are relatively understudied. For example, of 54,450 genomes currently known to harbor RM systems, only 613 are from archaea (<xref ref-type="bibr" rid="B18">Roberts et al., 2022</xref>). Three restriction endonucleases, <italic>Mae</italic>I (C&#x2193;TAG), <italic>Mae</italic>II (A&#x2193;CGT), and <italic>Mae</italic>III (&#x2193;GTNAC) have previously been characterized from <italic>Methanococcus aeolicus</italic> PL15/H<italic><sup>p</sup></italic> (<xref ref-type="bibr" rid="B20">Schmid et al., 1984</xref>). <italic>M. aeolicus</italic> PL15/H<italic><sup>p</sup></italic>, originally isolated from marine sediments of Lipari Island near Sicily (<xref ref-type="bibr" rid="B13">Kendall et al., 2006</xref>), is a strain of CO<sub>2</sub>-reducing methanogenic Euryarchaeota. Because the genome sequence of this strain has not been reported, we decided to perform SMRT sequencing of this archaeon to identify the Restriction-Modification (RM) system genes. Next generation sequencing platforms such as Single Molecule Real Time (SMRT) (<xref ref-type="bibr" rid="B4">Clark et al., 2012</xref>), Rapid Identification Methylase Specificity (RIMS) (<xref ref-type="bibr" rid="B2">Baum et al., 2021</xref>), and Oxford Nanopore Technology (ONT) (<xref ref-type="bibr" rid="B23">Tourancheau et al., 2021</xref>) allow not only the sequence and complete assembly of bacterial and archaeal genomes, but also enable the determination of methylation patterns and the confirmation of the MTase genes responsible for the modified motifs.</p>
<p>Herein, we report the genome sequence and methylome of <italic>Methanococcus aeolicus</italic> PL15/H<italic><sup>p</sup></italic>, results which enabled the identification of genes encoding five RM-systems. Of these, methyltransferase activities of M.MaeIV and M.MaeV, as well as the restriction endonuclease activity of <italic>Mae</italic>III, were further verified and characterized experimentally. These findings reveal the genetic basis of commercially relevant restriction endonucleases, connect methyltransferase genes to the epigenetic state of <italic>M. aeolicus</italic>, and extend what is known about RM systems in the Methanococcales branch of the Euryarchaeal lineage.</p>
</sec>
<sec id="S2" sec-type="materials|methods">
<title>Materials and methods</title>
<sec id="S2.SS1">
<title>Strains, plasmids, and reagents</title>
<p>Cells of <italic>Methanococcus aeolicus</italic> PL15/H<italic><sup>p</sup></italic> were a gift from Prof. Dina Grohmann (Universit&#x00E4;t Regensburg), grown as previously described (<xref ref-type="bibr" rid="B13">Kendall et al., 2006</xref>). All laboratory <italic>Escherichia coli</italic> strains used in this study are listed in <xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref>. Cultures were grown in Luria-Bertani (LB) broth or agar on appropriate antibiotics. All plasmids used in this study are listed in <xref ref-type="supplementary-material" rid="TS2">Supplementary Table 2</xref>. Restriction enzymes, except <italic>Mae</italic>III (Roche Diagnostic, GmbH, Mannheim, Germany), DNA substrates, DNA and protein markers were from New England Biolabs (NEB, MA). Q5 &#x201C;Hot Start&#x201D; DNA polymerase (M0543, NEB, MA) was used for PCR amplification of Methyltransferase and Restriction endonuclease genes for cloning. All PCR primers were synthesized by IDT, IA and are listed in <xref ref-type="supplementary-material" rid="TS3">Supplementary Table 3</xref>. The cloning of PCR amplified genes in appropriate vectors were performed using NEBuilder HiFi DNA assembly Master mix (E2621, NEB, MA). The plasmid DNA was purified using a Monarch Plasmid Miniprep kit (T1010, NEB, MA).</p>
</sec>
<sec id="S2.SS2">
<title>SMRT sequencing and genome assembly</title>
<p>Genomic DNA from <italic>Methanococcus aeolicus</italic> PL15/H<italic><sup>p</sup></italic> was purified using a Monarch Genome Purification kit (T3010, NEB, Ipswich, MA, USA) and the DNA sample was sheared to an average size of &#x223C; 10 kb using the G-tube protocol (Covaris, Woburn, MA, USA). DNA libraries were prepared using a SMRTbell express template prep kit 2.0 (100&#x2013;938&#x2013;900, Pacific Biosciences, Menlo Park, CA, USA) and ligated with hairpin barcoded adapter lbc1-lbc1. Incompletely formed SMRTbell templates were removed by digestion with a combination of exonuclease III and exonuclease VII (NEB, Ipswich, MA, USA). The qualification and quantification of the SMRTbell libraries were made on a Qubit fluorimeter (Invitrogen, OR) and a 2,100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA). SMRT sequencing was performed using an SQ1 (Pacific Biosciences, Menlo Park, CA, USA) based on the multiplex protocol for 10 kb SMRTbell library inserts. Sequencing reads were collected and <italic>de novo</italic> assembled using the Microbial Assembly version 10.1.0.1119588 program with default quality and read length parameters. In addition to genome assembly (<xref ref-type="bibr" rid="B3">Chin et al., 2013</xref>), the SMRT Analysis pipeline from Pacific Biosciences<sup><xref ref-type="fn" rid="footnote1">1</xref></sup> enables the determination of the epigenetic status of sequenced DNA by identifying the m6A and m4C modified motifs (<xref ref-type="bibr" rid="B5">Flusberg et al., 2010</xref>; <xref ref-type="bibr" rid="B4">Clark et al., 2012</xref>; <xref ref-type="bibr" rid="B14">Korlach and Turner, 2012</xref>).</p>
</sec>
<sec id="S2.SS3">
<title>Computational and experimental identification of <italic>M. aeolicus</italic> PL15/H<italic><sup>p</sup></italic> methyltransferases</title>
<p>Following an approach nicknamed the &#x201C;Hungarian trick&#x201D; (<xref ref-type="bibr" rid="B21">Szomol&#x00E1;nyi et al., 1980</xref>), 5 &#x03BC;g of <italic>M. aeolicus</italic> PL15/H<italic><sup>p</sup></italic> gDNA was digested with <italic>Bam</italic>HI, <italic>Bgl</italic>II and partially digested with <italic>Sau</italic>3AI restriction enzymes. DNA fragments were cloned into the compatible dephosphorylated <italic>Bam</italic>HI sites of pUC19, pBR322 and pACYC184 vectors yielding 9 shot gun libraries. The foreign restriction and modification genes are usually very toxic for <italic>E. coli</italic>, therefore in order to clone both we need high level of methylation. These three vectors have different copy numbers pUC19 (high), pBR322 (medium) and pACYC184 (low). Moreover, they utilized different promoters P<sub><italic>lac</italic></sub> and P<sub><italic>tet</italic></sub>, since native <italic>Methanococcus</italic> promoters may or may not work in <italic>E. coli</italic>. Each purified plasmid library (1 &#x03BC;g) was challenged with 10 units of <italic>Mae</italic>III restriction endonuclease and the digestion mixtures were transformed into <italic>E. coli</italic> strain ER2683. Plasmids were recovered from surviving transformants and analyzed by Sanger sequencing and restriction digest.</p>
<p>Homology-based searches of the <italic>M. aeolicus</italic> genome for putative methyltransferase genes were performed using the Seqware program (<xref ref-type="bibr" rid="B18">Roberts et al., 2022</xref>). Additional searches for MTase genes were performed using HMMer (HMMER 3.3.2)<sup><xref ref-type="fn" rid="footnote2">2</xref></sup> to annotate each predicted protein coding sequence feature (CDS) of the <italic>Methanococcus aeolicus</italic> PL15/H<italic><sup>p</sup></italic> genome. Genes matching MTase sequence profiles were further examined by structure prediction using the ColabFold implementation of AlphaFold2 (<xref ref-type="bibr" rid="B12">Jumper et al., 2021</xref>; <xref ref-type="bibr" rid="B17">Mirdita et al., 2022</xref>), followed by structure similarity search using predicted MTase models as query inputs to DALI (<xref ref-type="bibr" rid="B11">Holm, 2022</xref>).</p>
<p>Gene loci for M.MaeV and M1-M2.MaeIV were PCR amplified with P1-P2 primers and P3-P4 primers, respectively, and assembled into low copy pACYC&#x0394;tet vectors under control of a constitutive P<sub><italic>tet</italic></sub> promoter and expressed in <italic>E. coli</italic> strain ER2683 (<xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref>). The correct clones were verified by Sanger sequencing of the inserts using an ABI DNA sequencer with pTetF and pTetR sequencing primers and transformed into the methylation deficient <italic>E. coli</italic> strain ER2796 (<xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref>).</p>
</sec>
<sec id="S2.SS4">
<title>Liquid chromatography-tandem mass spectrometry analysis of functionally purified <italic>Mae</italic>III endonuclease</title>
<p>Purified <italic>Mae</italic>III endonuclease was subjected to 10&#x2013;20% Tris-Glycine SDS-PAGE and stained with SimplyBlue SafeStain (Invitrogen, OR). The major band at approximately 35 kDa was excised for in-gel digestion (<xref ref-type="fig" rid="F4">Figure 4A</xref>). The gel band was sliced into 1 mm<sup>3</sup> pieces and the pieces were divided into two aliquots for multiple enzymatic digestions (<xref ref-type="bibr" rid="B25">Wi&#x015B;niewski et al., 2019</xref>). The gel pieces were destained, and the protein was reduced with DTT, alkylated with iodoacetamide, and then digested with either Trypsin-ultra, Mass Spectrometry Grade (P8101; NEB, Ipswich, MA, USA) or subtilisin (Sigma-Aldrich, St. Louis, MO, USA) at a protease concentration of 2 ng/&#x03BC;L. Digestions were conducted for 1 h (50&#x00B0;C for trypsin, 37&#x00B0;C for subtilisin) and were quenched with trifluoroacetic acid at a final concentration of 0.5% for downstream analysis.</p>
<p>The resulting peptides were loaded <italic>via</italic> Proxeon Easy-nLC 1,200 (ThermoFisher, Waltham, MA, USA) onto a reversed phase analytical column 25 cm &#x00D7; 100 &#x03BC;m ID, Reprosil 3 &#x03BC;m C18 (New Objective, Littleton, MA, USA) and eluted at a flow rate of 0.3 &#x03BC;L/min over a 35 min gradient of organic solvent (mobile phase A: 0.1% formic acid in water; mobile phase B: 90% acetonitrile/0.1% formic acid) as follows: 1-min equilibration at 10% B, 10&#x2013;40% B in 25 min, 40&#x2013;50% B in 4 min, and 50&#x2013;78% B in 5 min. Peptides were electro-sprayed into a LTQ Orbitrap XL with a Nanospray Flex Ion Source (Thermo Fisher, Waltham, MA, USA) and fragmentation spectra were generated by collision-induced dissociation (CID). Full scans between 400&#x2013;1,600 m/z were acquired with 30 k resolution. MS/MS spectra were generated in a data-dependent manner, with a normalized collision energy of 35.0 and dynamic exclusion of 30 s. The five most intense ions (including + 1 charge states for subtilisin-generated peptides) were selected for fragmentation.</p>
<p>Acquired MS/MS spectra were analyzed using PEAKS Studio Xpro (Bioinformatics Solutions, Inc., Waterloo, ON, USA). The data were searched against <italic>Methanococcus aeolicus</italic> PL15/H<italic><sup>p</sup></italic> translated ORFs from the genome sequence, allowing for a fixed carbamidomethyl modification on cysteines, variable oxidation of methionine, and two missed cleavages (tryptic peptides). The precursor and fragment mass tolerances were set to 20 ppm and 0.6 Da, respectively. Identified proteins were filtered by a 1% false discovery rate and required at least two unique peptides per protein.</p>
</sec>
<sec id="S2.SS5">
<title>Protein purification and endonuclease assay</title>
<p>The recombinant <italic>Mae</italic>III restriction enzyme was partially purified on Ni-NTA magnetic beads (S1423L, NEB, Ipswich, MA, USA) according to the manufacturer&#x2019;s protocol. DNA digestion patterns obtained by titration of restriction endonuclease activity from 250 mM imidazole Ni-NTA column elution fraction from <italic>E. coli</italic> clone 3081[pM.<italic>Mae</italic>III + pET21bMaeIII] expressing <italic>Mae</italic>III enzyme compared to <italic>Mae</italic>III restriction enzyme from Roche on the same DNA substrate on &#x03BB; DNA substrate. 1 &#x03BC;g of &#x03BB; DNA incubated with Ni-NTA imidazole elution fraction of 6&#x00D7;His:<italic>Mae</italic>III serial 3&#x00D7; dilution (from 9 to 0.1 &#x03BC;l) and serial 3&#x00D7; dilution of commercial <italic>Mae</italic>III enzyme (from 9 to 0.1 units) for 1 h at 55&#x00B0;C in 1&#x00D7; <italic>Mae</italic>III commercial buffer (Roche Diagnostic, GmbH, Mannheim, Germany). 2&#x00D7; Buffer contains 40 mM Tris&#x2013;HCl, 550 mM NaCl, 12 mM MgCl<sub>2</sub>, 14 mM 2-mercaptoethanol, pH 8.2 (+55&#x00B0;C). The reaction mixtures were separated on 0.8% agarose gel in TBE buffer and visualized by staining with ethidium bromide.</p>
</sec>
</sec>
<sec id="S3" sec-type="results">
<title>Results</title>
<sec id="S3.SS1">
<title>DNA-sequencing and genome assembly</title>
<p>Monarch purified gDNA (2 &#x03BC;g) from a <italic>Methanococcus aeolicus</italic> PL15/H<italic><sup>p</sup></italic> culture was used to make one barcoded (lbc1-lbc1) 10 kb SMRT bell library sequenced on an SQ1 instrument for 20 h. 418,400 long continuous sequencing subreads with 2,583 bp mean subread lengths were converted to 12,577 HiFi subreads with 5,328 bp mean subread length, yielding 43.7 Mb of high-quality sequencing data. The polished assembly generated one closed circular genomic element of 1,677,738 bp with 26-fold genome coverage and 29.93% G + C content for the chromosome (<xref ref-type="fig" rid="F1">Figure 1A</xref>). The assembled sequence was annotated using the National Center for Biotechnology Information (NCBI) Prokaryotic Genome Annotation Pipeline (PGAP) (<xref ref-type="bibr" rid="B22">Tatusova et al., 2016</xref>; <xref ref-type="bibr" rid="B9">Haft et al., 2018</xref>) which identified 1,615 protein coding features and 38 tRNA genes. The annotated genome sequence of <italic>M. aeolicus</italic> PL15/H<italic><sup>p</sup></italic> is available from NCBI <italic>via</italic> the Bioproject <ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="PRJNA622823">PRJNA622823</ext-link>; BioSample: SAMN30825808, SRA: SRS15238210, and genome sequence: NZ_CP104873.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p><bold>(A)</bold> Schematic presentation of the complete closed circular genome of <italic>Methanococcus aeolicus</italic> PL15/H<italic><sup>p</sup></italic> showing the location of the RM systems. The M.<italic>Mae</italic>III gene was not directly identified by the Seqware program (&#x002A;). <bold>(B)</bold> Detection of modified DNA motifs identified by SMRT sequencing in <italic>M. aeolicus</italic> PL15/H<italic><sup>p</sup></italic> genome. Note that 5mC methylation by the M.<italic>Mae</italic>II gene was not directly identified by SMRT sequencing.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-14-1112734-g001.tif"/>
</fig>
</sec>
<sec id="S3.SS2">
<title>SMRT motif and modification analysis of <italic>Methanococcus aeolicus</italic> PL15/H<italic><sup>p</sup></italic> genome</title>
<p>One advantage of the SMRT sequencing platform is its ability to directly detect the epigenetic state of sequenced DNA. As the DNA sequencing polymerase traverses methylated nucleobases in the sequencing template, changes in the amount of time between the nucleotide incorporation events (also known as the Interpulse Duration, or IPD) are recorded and analyzed by the SMRT-Link software yielding m6A and m4C modification motifs (<xref ref-type="bibr" rid="B5">Flusberg et al., 2010</xref>; <xref ref-type="bibr" rid="B4">Clark et al., 2012</xref>). The complete motif and modification analysis of assembled <italic>Methanococcus aeolicus</italic> PL15/H<italic><sup>p</sup></italic> using these kinetic signatures detected three m6A and one m4C modified motifs (<xref ref-type="fig" rid="F1">Figure 1B</xref>). Two of the motifs with m4C <bold>C</bold>TA<underline>G</underline> and m6A C<underline>T</underline>N<bold>A</bold>G were recognized as signatures of the previously discovered <italic>Mae</italic>I and <italic>Mae</italic>III RM systems (<xref ref-type="bibr" rid="B20">Schmid et al., 1984</xref>). The m5C motif of <italic>Mae</italic>II system A<bold>C</bold><underline>G</underline>T cannot be directly detected by SMRT sequencing due to the weak IPD ratio signal generated by DNA polymerase on m5C modified base. Two additional m6A motifs with single DNA strand modification GC<bold>A</bold>CY and CG<bold>A</bold>G were also detected by SMRT motif and modification analysis.</p>
</sec>
<sec id="S3.SS3">
<title>Matching putative DNA MTase genes to appropriate motifs</title>
<p>The primary scanning of the <italic>M. aeolicus</italic> PL15/H<italic><sup>p</sup></italic> assembled genome using Seqware identified 4 putative methyltransferase genes. Two of them for M.<italic>Mae</italic>I and M.<italic>Mae</italic>II where assigned bioinformatically based on homology search against methyltransferase genes in REBASE with known specificities (<xref ref-type="fig" rid="F2">Figures 2A, B</xref>). The two other putative methyltransferase genes were assigned experimentally. Gene loci for M.MaeV and M1-M2.MaeIV were cloned into low copy pACYC&#x0394;tet vectors under the control of a constitutive P<sub><italic>tet</italic></sub> promoter and their expression induced in <italic>E. coli</italic> strain ER2683 (<xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref>). SMRT sequencing and methylation analysis of <italic>E. coli</italic> gDNA recovered from pM1-2.MaeIV and pM.MaeV expressing clones revealed that M.MaeV recognized the m6A modified motif CG<bold>A</bold>G. M1.MaeIV and M2.MaeIV belong to a family of so called split methylase/RM genes similar to DrdVI (<xref ref-type="bibr" rid="B6">Fomenkov et al., 2019</xref>) and recognize the m6A GC<bold>A</bold>CY modified motif (<xref ref-type="fig" rid="F3">Figures 3A, B</xref>). However, at that point, we were not able to assign the m6A G<underline>T</underline>N<bold>A</bold>C motif, corresponding to M.<italic>Mae</italic>III, as there was no candidate gene identified by the PGAP annotation.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>Schematic presentation of RM systems assigned bioinformatically based on homology search against DNA methylase genes with known specificity in REBASE. M.<italic>Mae</italic>I with the top 3 blast hits <bold>(A)</bold>, M.<italic>Mae</italic>II with the 2 top blast hits <bold>(B)</bold>. The flags represent the conserved motifs that are used to identify the methyltransferase genes.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-14-1112734-g002.tif"/>
</fig>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>Schematic presentation of RM systems assigned experimentally by cloning and SMRT motif and modification analysis. The correct motifs were verified by analyzing SMRT kinetic data using MotStat software (<xref ref-type="bibr" rid="B24">Vincze, unpublished</xref>) for M.MaeIV <bold>(A)</bold> and M.MaeV <bold>(B)</bold>. The flags represent the conserved motifs that are used to identify the methyltransferase genes.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-14-1112734-g003.tif"/>
</fig>
</sec>
<sec id="S3.SS4">
<title>Searching for the <italic>Mae</italic>III restriction-modification system</title>
<p>To find the missing gene for M.<italic>Mae</italic>III we decided to apply an old methylation selection method known as the &#x201C;Hungarian trick&#x201D; (<xref ref-type="bibr" rid="B21">Szomol&#x00E1;nyi et al., 1980</xref>). Basically, this method is based on selection of clone(s) expressing the methyltransferase by treatment of the library with the cognate restriction endonuclease to destroy unmodified plasmid DNAs, leaving only plasmids that contain an active methyltransferase gene. In addition, we made a fosmid library in the pCC1Fos vector (Epicenter, WI). The plasmid and fosmid DNAs from survival clones were tested using a methylation protection assay for <italic>Mae</italic>III resistance to digestion. Unfortunately, after selection and screening we were not able to find any <italic>Mae</italic>III resistant clones (data not shown). Therefore, we decided to use a different approach to find the missing genes.</p>
<p>We used an MS/MS method to identify the protein sequence in functionally purified <italic>Mae</italic>III from Roche (<xref ref-type="fig" rid="F4">Figure 4A</xref>). This yielded a top target hit corresponding to locus_tag N6C89_05690 from <italic>M. aeolicus</italic> PL15/H<italic><sup>p</sup></italic> (<xref ref-type="fig" rid="F4">Figure 4B</xref>). In addition, the <italic>M. aeolicus</italic> PL15/H<italic><sup>p</sup></italic> assembled genome was scanned for additional DNA methyltransferase genes using an automated implementation of HMMer, which produced another potential hit corresponding to locus_tag N6C89_5685. The best scoring match to an AlphaFold2 predicted model of N6C89_5685 was 7lt5, the PDB structure of CamA, an adenine methyltransferase of <italic>Clostridioides difficile</italic> (<xref ref-type="fig" rid="F4">Figure 4C</xref>; <xref ref-type="bibr" rid="B27">Zhou et al., 2021</xref>). Thus, the close proximity of the two identified target genes strongly suggested that they corresponded to the <italic>Mae</italic>III RM system.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p><italic>Mae</italic>III restriction endonuclease separated on 10&#x2013;20% Tris-Glycine SDS-PAGE and stained with SimplyBlue SafeStain. The protein band at &#x223C; 35 kD was excised and used for MS/MS analysis <bold>(A)</bold>. The sequence matched locus_tag N6C89_05690 from the <italic>M. aeolicus</italic> PL15/H<italic><sup>p</sup></italic> genome was detected by MS/MS analysis as putative <italic>Mae</italic>III restriction enzyme <bold>(B)</bold>. <italic>M. aeolicus</italic> PL15/H<italic><sup>p</sup></italic> assembled genome was scanned for additional DNA methyltransferase genes using an automated implementation of HMMer, which produced another potential hit corresponding to locus_tag N6C89_5685. The predicted structure of the M.<italic>Mae</italic>III protein through AlphaFold2 monomeric modeling <bold>(C)</bold>. The best scoring match to an AlphaFold2 predicted model of N6C89_5685 locus was 7lt5, the PDB structure of CamA, an adenine methyltransferase of <italic>Clostridioides difficile</italic> (<xref ref-type="bibr" rid="B27">Zhou et al., 2021</xref>).</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-14-1112734-g004.tif"/>
</fig>
</sec>
<sec id="S3.SS5">
<title>Cloning and expression of the <italic>Mae</italic>III restriction-modification system</title>
<p>The genes corresponding to locus_tags 5,685 and 5,690 (<xref ref-type="fig" rid="F5">Figure 5A</xref>) were cloned into the low copy pACYC&#x0394;tet vector under control of a constitutive P<sub><italic>tet</italic></sub> promoter for expression in <italic>E. coli</italic> T7 expression strain, ER3081. The plasmid and gDNA from this strain were purified and challenged with 10 units of the <italic>Mae</italic>III restriction enzyme to test for methylation protection (<xref ref-type="fig" rid="F5">Figure 5C</xref>). In addition, the DNA obtained from <italic>E. coli</italic> expressing this putative M.<italic>Mae</italic>III methyltransferase was sequenced using the SMRT analysis platform, identifying the sequence GTNAC as modified (<xref ref-type="fig" rid="F5">Figure 5B</xref>), consistent with the M.<italic>Mae</italic>III modification motif previously reported (<xref ref-type="bibr" rid="B20">Schmid et al., 1984</xref>). These results clearly indicated that locus_tag N6C89_5685 did indeed code for M.<italic>Mae</italic>III. Therefore the strain was designated as ER3081[pM.<italic>Mae</italic>III].</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption><p>Schematic presentation of the <italic>Mae</italic>III restriction-modification system <bold>(A)</bold>. Motif and modification analysis by SMRT sequencing of gDNA from <italic>E. coli</italic> strain ER3081 [pM.<italic>Mae</italic>III] <bold>(B)</bold>. Note that this strain contains a Dam MTase in addition to M.<italic>Mae</italic>III. <italic>Mae</italic>III methylation protection assay on both plasmid and gDNA from <italic>E. coli</italic> 3081[pM.<italic>Mae</italic>III] strain <bold>(C)</bold>. Plasmid and gDNA from <italic>E. coli</italic> 3081[pM.<italic>Mae</italic>III] were purified and 1 &#x03BC;g of each DNA was incubated without (w/o) as a negative control or with 1 unit of <italic>Mae</italic>III restriction enzyme at 55&#x00B0;C for 1 h in 1&#x00D7; <italic>Mae</italic>III commercial buffer (Roche Diagnostic, GmbH, Mannheim, Germany). The results demonstrated that both plasmid, and gDNA are resistant to <italic>Mae</italic>III restriction endonuclease digestion due to modification of the DNA <italic>in vivo</italic> by M.<italic>Mae</italic>III methyltransferase at GTNAC sites. &#x03BB; DNA digestion patterns obtained by titration of restriction endonuclease activity from 250 mM imidazole Ni-NTA column elution fraction from <italic>E. coli</italic> clone 3081[pM.<italic>Mae</italic>III + pET21b 6&#x00D7;His:<italic>Mae</italic>III] expressing <italic>Mae</italic>III enzyme compared to <italic>Mae</italic>III restriction enzyme from Roche on the same DNA substrate <bold>(D)</bold>. 1 &#x03BC;g of &#x03BB; DNA incubated with Ni-NTA imidazole elution fraction of 6&#x00D7;His:<italic>Mae</italic>III serial 3&#x00D7; dilution (from 9 to 0.1 &#x03BC;l) and serial 3&#x00D7; dilution of commercial <italic>Mae</italic>III enzyme (from 9 to 0.1 units) for 1 h at 55&#x00B0;C in 1&#x00D7; <italic>Mae</italic>III commercial buffer (Roche Diagnostic, GmbH, Mannheim, Germany). 2&#x00D7; Buffer contains 40 mM Tris&#x2013;HCl, 550 mM NaCl, 12 mM MgCl<sub>2</sub>, 14 mM 2-mercaptoethanol, pH 8.2 (+55&#x00B0;C). The reaction mixtures were separated on 0.8% agarose gel in TBE buffer and visualized by staining with ethidium bromide.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-14-1112734-g005.tif"/>
</fig>
<p>A PCR amplicon spanning the presumptive endonuclease gene, locus_tag 5690, was cloned in-frame with the N-terminal 6&#x00D7; His tag and linker sequence of the pET21b expression vector under the control of the pT7 promoter and transformed into an <italic>E. coli</italic> strain previously transformed with a compatible M.<italic>Mae</italic>III expressing plasmid. The resulting two-plasmid strain encoding M.<italic>Mae</italic>III and the putative <italic>Mae</italic>III endonuclease gene was induced and the his-tagged putative endonuclease purified using Ni-NTA beads as described. <italic>Mae</italic>III restriction endonuclease activity was subsequently detected in a 250 mM imidazole elution fraction (<xref ref-type="fig" rid="F5">Figure 5D</xref>).</p>
</sec>
</sec>
<sec id="S4" sec-type="discussion">
<title>Discussion</title>
<p>In this paper we report the complete closed genome of <italic>Methanococcus aeolicus</italic> PL15/H<italic><sup>p</sup></italic>. The sequencing data are publicly available from NCBI. In addition, the complete methylome of this strain and the identified RM systems may help other investigators to develop novel genetic systems for <italic>M. aeolicus</italic> and create efficient transformation protocols for this archaeal species.</p>
<p>In addition we performed genomic neighborhood analysis of M.<italic>Mae</italic>III homologs across representative archaea. <xref ref-type="fig" rid="F6">Figure 6</xref> shows the methyltransferases (indicated in yellow) either in an apparent &#x201C;orphan&#x201D; configuration (i.e., no neighboring restriction endonuclease) or associated with an endonuclease sequence (indicated in red). <italic>M. aeolicus</italic> PL-15:H and <italic>M. aeolicus</italic> Nankai-3 are highly syntenic at this locus, however, <italic>M. aeolicus</italic> Nankai-3 completely lacks the <italic>Mae</italic>III RM system. Like the Nankai-3 strain, <italic>M. jannaschii</italic> lacks a M.<italic>Mae</italic>III homolog. An <italic>Mae</italic>III-like RM system is found in the thermophilic bacterium <italic>Hippea alviniae</italic> EP5-r. M.<italic>Mae</italic>III homologs are found in a variety of unrelated contexts across representatives from Euryarchaeota and Lokiarchaeota. One conserved configuration of an M.<italic>Mae</italic>III-like gene occurs between <italic>Methanothermococcus</italic> sp., SCGC AD-144-N22 and <italic>Methanofervidicoccus</italic> sp., A16 where apparent orphan methylases are found in syntenic loci sharing a high degree of amino-acid identity. The function of these orphan methyltransferases is currently unknown.</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption><p>Genomic neighborhoods of M.<italic>Mae</italic>III homologs across representative archaea. The m6A DNA methylase (M.<italic>Mae</italic>III) of <italic>M. aeolicus</italic> PL-15:H was used as a query sequence for a BLAST search taxonomically limited to archaea (taxid:2,157) at NCBI. Eight genomes from cultured archaea and one bacterium containing matching sequences to M.<italic>Mae</italic>III with an <italic>e</italic>-value of less than 10<sup>&#x2013;30</sup> were selected for genome neighborhood analysis. Sub genomic sequences surrounding the M.<italic>Mae</italic>III homologs were manually obtained by computationally extracting 7.5 kb of sequence, plus annotations, upstream and downstream of the methyltransferase encoding sequence. The resulting gene clusters were aligned and rendered using &#x201C;clinker&#x201D; gene diagramming software (<xref ref-type="bibr" rid="B8">Gilchrist and Chooi, 2020</xref>) with the methyltransferase colored yellow and endonuclease colored in red. The degree of amino acid identity among coding sequences are indicated by grayscale links connecting homologous genes.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-14-1112734-g006.tif"/>
</fig>
<p>Taken together, the data presented herein extend what is known about the occurrence and function of RM systems in the Methanococcales branch of the Euryarchaeota lineage. The synteny/neighborhood analysis presented in <xref ref-type="fig" rid="F5">Figure 5</xref> are consistent with previously reported findings in the Haloarchaea (<xref ref-type="bibr" rid="B7">Fullmer et al., 2019</xref>); i.e., the <italic>Mae</italic>III RM system does not seem to be conserved between closely related <italic>Methanococcus</italic> species and yet is found intact in a eubacterial extremophile. These observations are suggestive of a mobile and dynamic distribution not only of the intact RM system, but also homologous methyltransferases found in &#x201C;orphan&#x201D; configuration. It remains to be seen what, if any, would be the impacts of the RM systems and orphan methyltransferases of <italic>Methanococcus</italic> on mating (<xref ref-type="bibr" rid="B16">Makkay et al., 2020</xref>), transformability (<xref ref-type="bibr" rid="B26">Zatopek et al., 2021</xref>), and DNA repair.</p>
</sec>
<sec id="S5" sec-type="data-availability">
<title>Data availability statement</title>
<p>The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/genbank/">https://www.ncbi.nlm.nih.gov/genbank/</ext-link>, <ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="CP104873">CP104873</ext-link>.</p>
</sec>
<sec id="S6" sec-type="author-contributions">
<title>Author contributions</title>
<p>AF performed SMRT sequencing, genome assembly, and writing&#x2014;original manuscript. PW performed bioinformatic analysis. CMc and CMa performed MS/MS analysis. RR performed methylome analysis and editing final manuscript. All authors contributed to the article and approved the submitted version.</p>
</sec>
</body>
<back>
<sec id="S7" sec-type="funding-information">
<title>Funding</title>
<p>This work was supported by New England Biolabs Inc.</p>
</sec>
<ack><p>We thank Yi Sun and Sean Johnson with their help on bioinformatic analysis and L&#x00E9;a Chuzel for help with the construction of the fosmid library.</p>
</ack>
<sec id="S8" sec-type="COI-statement">
<title>Conflict of interest</title>
<p>The authors work for New England Biolabs Inc., a company that sells research reagents, including restriction enzymes and DNA methyltransferases, to the scientific community.</p>
</sec>
<sec id="S9" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="S10" sec-type="supplementary-material">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fmicb.2023.1112734/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fmicb.2023.1112734/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table_1.XLSX" id="TS1" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_2.XLSX" id="TS2" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_3.XLSX" id="TS3" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<fn-group>
<fn id="footnote1">
<label>1</label>
<p><ext-link ext-link-type="uri" xlink:href="http://www.pacbiodevnet.com/SMRT-Analysis/Software/SMRT-Pipe">http://www.pacbiodevnet.com/SMRT-Analysis/Software/SMRT-Pipe</ext-link></p></fn>
<fn id="footnote2">
<label>2</label>
<p><ext-link ext-link-type="uri" xlink:href="http://hmmer.org/">http://hmmer.org/</ext-link></p></fn>
</fn-group>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arber</surname> <given-names>W.</given-names></name></person-group> (<year>1974</year>). <article-title>DNA modification and restriction.</article-title> <source><italic>Prog. Nucleic Acid Res. Mol. Biol.</italic></source> <volume>14</volume> <fpage>1</fpage>&#x2013;<lpage>37</lpage>.</citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baum</surname> <given-names>C.</given-names></name> <name><surname>Lin</surname> <given-names>Y. C.</given-names></name> <name><surname>Fomenkov</surname> <given-names>A.</given-names></name> <name><surname>Anton</surname> <given-names>B. P.</given-names></name> <name><surname>Chen</surname> <given-names>L.</given-names></name> <name><surname>Yan</surname> <given-names>B.</given-names></name><etal/></person-group> (<year>2021</year>). <article-title>Rapid identification of methylase specificity (RIMS-seq) jointly identifies methylated motifs and generates shotgun sequencing of bacterial genomes.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>49</volume>:<issue>e113</issue>. <pub-id pub-id-type="doi">10.1093/nar/gkab705</pub-id> <pub-id pub-id-type="pmid">34417598</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chin</surname> <given-names>C.-H.</given-names></name> <name><surname>Alexander</surname> <given-names>D. H.</given-names></name> <name><surname>Marks</surname> <given-names>P.</given-names></name> <name><surname>Klammer</surname> <given-names>A. A.</given-names></name> <name><surname>Drake</surname> <given-names>J.</given-names></name> <name><surname>Heiner</surname> <given-names>C.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Nonhybrid, finished microbial genome assemblies from long-read SMRTsequencing data.</article-title> <source><italic>Nat. Methods</italic></source> <volume>10</volume> <fpage>563</fpage>&#x2013;<lpage>569</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.2474</pub-id> <pub-id pub-id-type="pmid">23644548</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clark</surname> <given-names>T. A.</given-names></name> <name><surname>Murray</surname> <given-names>I. A.</given-names></name> <name><surname>Morgan</surname> <given-names>R. D.</given-names></name> <name><surname>Kislyuk</surname> <given-names>A. O.</given-names></name> <name><surname>Spittle</surname> <given-names>K. E.</given-names></name> <name><surname>Boitano</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Characterization of DNA methyltransferase specificities using single-molecule, real-time DNA sequencing.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>40</volume> <fpage>11450</fpage>&#x2013;<lpage>11462</lpage>.</citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Flusberg</surname> <given-names>B. A.</given-names></name> <name><surname>Webster</surname> <given-names>D. R.</given-names></name> <name><surname>Lee</surname> <given-names>J. H.</given-names></name> <name><surname>Travers</surname> <given-names>K. J.</given-names></name> <name><surname>Olivares</surname> <given-names>E. C.</given-names></name> <name><surname>Clark</surname> <given-names>T. A.</given-names></name><etal/></person-group> (<year>2010</year>). <article-title>Direct detection of DNA methylation during single-molecule, real-time sequencing.</article-title> <source><italic>Nat. Methods</italic></source> <volume>7</volume> <fpage>461</fpage>&#x2013;<lpage>465</lpage>.</citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fomenkov</surname> <given-names>A.</given-names></name> <name><surname>Luyten</surname> <given-names>Y.</given-names></name> <name><surname>Vincze</surname> <given-names>T.</given-names></name> <name><surname>Anton</surname> <given-names>B. P.</given-names></name> <name><surname>Roberts</surname> <given-names>R. J.</given-names></name> <name><surname>Morgan</surname> <given-names>R. D.</given-names></name></person-group> (<year>2019</year>). <article-title>Complete Genome Sequence and Methylome Analysis of Deinococcus wulumuqiensis 479.</article-title> <source><italic>Microbiol Resour Announc.</italic></source> <volume>12</volume>:<issue>e00066-19</issue>. <pub-id pub-id-type="doi">10.1128/MRA.00066-19</pub-id> <pub-id pub-id-type="pmid">30938698</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fullmer</surname> <given-names>M. S.</given-names></name> <name><surname>Ouellette</surname> <given-names>M.</given-names></name> <name><surname>Louyakis</surname> <given-names>A. S.</given-names></name> <name><surname>Papke</surname> <given-names>R. T.</given-names></name> <name><surname>Gogarten</surname> <given-names>J. P.</given-names></name></person-group> (<year>2019</year>). <article-title>The Patchy Distribution of Restriction&#x2013;Modification System Genes and the Conservation of Orphan Methyltransferases in Halobacteria.</article-title> <source><italic>Genes</italic></source> <volume>10</volume>:<issue>233</issue>. <pub-id pub-id-type="doi">10.3390/genes10030233</pub-id> <pub-id pub-id-type="pmid">30893937</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gilchrist</surname> <given-names>C. L. M.</given-names></name> <name><surname>Chooi</surname> <given-names>Y.-H.</given-names></name></person-group> (<year>2020</year>). <article-title>Clinker &#x0026; clustermap.js: Automatic generation of gene cluster comparison figures.</article-title> <source><italic>Bioinformatics</italic></source> <volume>37</volume> <fpage>2473</fpage>&#x2013;<lpage>2475</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btab007</pub-id> <pub-id pub-id-type="pmid">33459763</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Haft</surname> <given-names>D. H.</given-names></name> <name><surname>DiCuccio</surname> <given-names>M.</given-names></name> <name><surname>Badretdin</surname> <given-names>A.</given-names></name> <name><surname>Brover</surname> <given-names>V.</given-names></name> <name><surname>Chetvernin</surname> <given-names>V.</given-names></name> <name><surname>O&#x2019;Neill</surname> <given-names>K.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>RefSeq: an update on prokaryotic genome annotation and curation.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>46</volume> <fpage>D851</fpage>&#x2013;<lpage>D860</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkx1068.6</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hernday</surname> <given-names>A.</given-names></name> <name><surname>Bruce</surname> <given-names>A. B.</given-names></name> <name><surname>Low</surname> <given-names>D.</given-names></name></person-group> (<year>2003</year>). <article-title>The mechanism by which DNA adenine methylase and PapI activate the pap epigenetic switch.</article-title> <source><italic>Mol. Cell</italic></source> <volume>12</volume> <fpage>947</fpage>&#x2013;<lpage>957</lpage>.</citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Holm</surname> <given-names>L.</given-names></name></person-group> (<year>2022</year>). <article-title>Dali server: Structural unification of protein families.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>50</volume> <fpage>W210</fpage>&#x2013;<lpage>W215</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkac387</pub-id> <pub-id pub-id-type="pmid">35610055</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jumper</surname> <given-names>J.</given-names></name> <name><surname>Evans</surname> <given-names>R.</given-names></name> <name><surname>Pritzel</surname> <given-names>A.</given-names></name> <name><surname>Green</surname> <given-names>T.</given-names></name> <name><surname>Figurnov</surname> <given-names>M.</given-names></name> <name><surname>Ronneberger</surname> <given-names>O.</given-names></name><etal/></person-group> (<year>2021</year>). <article-title>Highly accurate protein structure prediction with AlphaFold.</article-title> <source><italic>Nature</italic></source> <volume>2021</volume> <fpage>583</fpage>&#x2013;<lpage>589</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-021-03819-2</pub-id> <pub-id pub-id-type="pmid">34265844</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kendall</surname> <given-names>M. M.</given-names></name> <name><surname>Liu</surname> <given-names>Y.</given-names></name> <name><surname>Sieprawska-Lupa</surname> <given-names>M.</given-names></name> <name><surname>Stetter</surname> <given-names>K. O.</given-names></name> <name><surname>Whitman</surname> <given-names>W. B.</given-names></name> <name><surname>Boone</surname> <given-names>D. R.</given-names></name></person-group> (<year>2006</year>). <article-title>Methanococcus aeolicus sp. nov., a mesophilic, methanogenic archaeon from shallow and deep marine sediments.</article-title> <source><italic>Int. J. Syst. Evol. Microbiol.</italic></source> <volume>56</volume> <fpage>1525</fpage>&#x2013;<lpage>1529</lpage>. <pub-id pub-id-type="doi">10.1099/ijs.0.642160</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Korlach</surname> <given-names>J.</given-names></name> <name><surname>Turner</surname> <given-names>S. W.</given-names></name></person-group> (<year>2012</year>). <article-title>Going beyond five bases in DNA sequencing.</article-title> <source><italic>Curr. Opin. Struct. Biol.</italic></source> <volume>22</volume> <fpage>251</fpage>&#x2013;<lpage>261</lpage>. <pub-id pub-id-type="doi">10.1016/j.sbi.2012.04.002</pub-id> <pub-id pub-id-type="pmid">22575758</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Loenen</surname> <given-names>W. A.</given-names></name> <name><surname>Dryden</surname> <given-names>D. T.</given-names></name> <name><surname>Raleigh</surname> <given-names>E. A.</given-names></name> <name><surname>Wilson</surname> <given-names>G. G.</given-names></name> <name><surname>Murray</surname> <given-names>N. E.</given-names></name></person-group> (<year>2014</year>). <article-title>Highlights of the DNA cutters: A short history of the restriction enzymes.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>42</volume> <fpage>3</fpage>&#x2013;<lpage>19</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkt990</pub-id> <pub-id pub-id-type="pmid">24141096</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Makkay</surname> <given-names>A. M.</given-names></name> <name><surname>Louyakis</surname> <given-names>A. S.</given-names></name> <name><surname>Ram-Mohan</surname> <given-names>N.</given-names></name> <name><surname>Gophna</surname> <given-names>U.</given-names></name> <name><surname>Gogarten</surname> <given-names>J. P.</given-names></name> <name><surname>Papke</surname> <given-names>R. T.</given-names></name></person-group> (<year>2020</year>). <article-title>Insights into gene expression changes under conditions that facilitate horizontal gene transfer (mating) of a model archaeon.</article-title> <source><italic>Sci. Rep</italic></source> <volume>10</volume>:<issue>22297</issue>. <pub-id pub-id-type="doi">10.1038/s41598-020-79296-w</pub-id> <pub-id pub-id-type="pmid">33339886</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mirdita</surname> <given-names>M.</given-names></name> <name><surname>Sch&#x00FC;tze</surname> <given-names>K.</given-names></name> <name><surname>Moriwaki</surname> <given-names>Y.</given-names></name> <name><surname>Heo</surname> <given-names>L.</given-names></name> <name><surname>Ovchinnikov</surname> <given-names>S.</given-names></name> <name><surname>Steinegger</surname> <given-names>M.</given-names></name></person-group> (<year>2022</year>). <article-title>ColabFold: Making protein folding accessible to all.</article-title> <source><italic>Nat. Methods</italic></source> <volume>19</volume> <fpage>679</fpage>&#x2013;<lpage>682</lpage>. <pub-id pub-id-type="doi">10.1038/s41592-022-01488-1</pub-id> <pub-id pub-id-type="pmid">35637307</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roberts</surname> <given-names>R. J.</given-names></name> <name><surname>Vincze</surname> <given-names>T.</given-names></name> <name><surname>Posfai</surname> <given-names>J.</given-names></name> <name><surname>Macelis</surname> <given-names>D.</given-names></name></person-group> (<year>2022</year>). <article-title>REBASE&#x2013;a database for DNA restriction and modification: Enzymes, genes and genomes.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <comment>[Epub ahead of print]</comment>. <pub-id pub-id-type="doi">10.1093/nar/gkac975</pub-id> <pub-id pub-id-type="pmid">36318248</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schlagman</surname> <given-names>S.</given-names></name> <name><surname>Hattman</surname> <given-names>S.</given-names></name> <name><surname>Marinus</surname> <given-names>M.</given-names></name></person-group> (<year>1986</year>). <article-title>Direct role of the <italic>Escherichia coli</italic> Dam DNA methyltransferase in methylation-directed mismatch repair.</article-title> <source><italic>J. Bacteriol.</italic></source> <volume>165</volume> <fpage>896</fpage>&#x2013;<lpage>900</lpage>. <pub-id pub-id-type="doi">10.1128/jb.165.3.896-900.1986</pub-id> <pub-id pub-id-type="pmid">3512529</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmid</surname> <given-names>K.</given-names></name> <name><surname>Thomm</surname> <given-names>M.</given-names></name> <name><surname>Laminet</surname> <given-names>A.</given-names></name> <name><surname>Laue</surname> <given-names>F. G.</given-names></name> <name><surname>Kessler</surname> <given-names>C.</given-names></name> <name><surname>Stetter</surname> <given-names>K. O.</given-names></name><etal/></person-group> (<year>1984</year>). <article-title>Three new restriction endonucleases MaeI, MaeII and MaeIII from Methanococcus aeolicus.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>6</volume> <fpage>2619</fpage>&#x2013;<lpage>2628</lpage>. <pub-id pub-id-type="doi">10.1093/nar/12.6.2619</pub-id> <pub-id pub-id-type="pmid">6324124</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Szomol&#x00E1;nyi</surname> <given-names>E.</given-names></name> <name><surname>Kiss</surname> <given-names>A.</given-names></name> <name><surname>Venetianer</surname> <given-names>P.</given-names></name></person-group> (<year>1980</year>). <article-title>Cloning the modification methylase gene of Bacillus sphaericus R in <italic>Escherichia coli</italic>.</article-title> <source><italic>Gene</italic></source> <volume>3</volume> <fpage>219</fpage>&#x2013;<lpage>225</lpage>. <pub-id pub-id-type="doi">10.1016/0378-1119(80)90051-7</pub-id> <pub-id pub-id-type="pmid">6254840</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tatusova</surname> <given-names>T.</given-names></name> <name><surname>DiCuccio</surname> <given-names>M.</given-names></name> <name><surname>Badretdin</surname> <given-names>A.</given-names></name> <name><surname>Chetvernin</surname> <given-names>V.</given-names></name> <name><surname>Nawrocki</surname> <given-names>E. P.</given-names></name> <name><surname>Zaslavsky</surname> <given-names>L.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>NCBI prokaryotic genome annotation pipeline.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>44</volume> <fpage>6614</fpage>&#x2013;<lpage>6624</lpage>.</citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tourancheau</surname> <given-names>A.</given-names></name> <name><surname>Mead</surname> <given-names>E. A.</given-names></name> <name><surname>Zhang</surname> <given-names>X. S.</given-names></name> <name><surname>Fang</surname> <given-names>G.</given-names></name></person-group> (<year>2021</year>). <article-title>Discovering multiple types of DNA methylation from bacteria and microbiome using nanopore sequencing.</article-title> <source><italic>Nat. Methods</italic></source> <volume>5</volume> <fpage>491</fpage>&#x2013;<lpage>498</lpage>. <pub-id pub-id-type="doi">10.1038/s41592-021-01109-3</pub-id> <pub-id pub-id-type="pmid">33820988</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vincze</surname> <given-names>T.</given-names></name></person-group> (<year>unpublished</year>). <source><italic>MotStat software for SMRT motif and modification analysis</italic></source>.</citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wi&#x015B;niewski</surname> <given-names>J. R.</given-names></name> <name><surname>Wegler</surname> <given-names>C.</given-names></name> <name><surname>Artursson</surname> <given-names>P.</given-names></name></person-group> (<year>2019</year>). <article-title>Multiple-Enzyme-Digestion Strategy Improves Accuracy and Sensitivity of Label- and Standard-Free Absolute Quantification to a Level That Is Achievable by Analysis with Stable Isotope-Labeled Standard Spiking.</article-title> <source><italic>J. Proteome Res.</italic></source> <volume>18</volume> <fpage>217</fpage>&#x2013;<lpage>224</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jproteome.8b00549</pub-id> <pub-id pub-id-type="pmid">30336047</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zatopek</surname> <given-names>K. M.</given-names></name> <name><surname>Burkhart</surname> <given-names>B. W.</given-names></name> <name><surname>Morgan</surname> <given-names>R. D.</given-names></name> <name><surname>Gehring</surname> <given-names>A. M.</given-names></name> <name><surname>Scott</surname> <given-names>K. A.</given-names></name> <name><surname>Santangelo</surname> <given-names>T. J.</given-names></name><etal/></person-group> (<year>2021</year>). <article-title>The Hyperthermophilic Restriction-Modification Systems of Thermococcus kodakarensis Protect Genome Integrity.</article-title> <source><italic>Front. Microbiol.</italic></source> <volume>12</volume>:<issue>657356</issue>. <pub-id pub-id-type="doi">10.3389/fmicb.2021.657356</pub-id> <pub-id pub-id-type="pmid">34093470</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>J.</given-names></name> <name><surname>Horton</surname> <given-names>J. R.</given-names></name> <name><surname>Blumenthal</surname> <given-names>R. M.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Cheng</surname> <given-names>X.</given-names></name></person-group> (<year>2021</year>). <article-title>Clostridioides difficile specific DNA adenine methyltransferase CamA squeezes and flips adenine out of DNA helix.</article-title> <source><italic>Nat. Commun.</italic></source> <volume>12</volume>:<issue>3436</issue>. <pub-id pub-id-type="doi">10.1038/s41467-021-23693-w</pub-id> <pub-id pub-id-type="pmid">34103525</pub-id></citation></ref>
</ref-list>
</back>
</article>
