<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Microbiol.</journal-id>
<journal-title>Frontiers in Microbiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Microbiol.</abbrev-journal-title>
<issn pub-type="epub">1664-302X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmicb.2016.01645</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Microbiology</subject>
<subj-group>
<subject>Methods</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>PIMMS (Pragmatic Insertional Mutation Mapping System) Laboratory Methodology a Readily Accessible Tool for Identification of Essential Genes in <italic>Streptococcus</italic></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Blanchard</surname> <given-names>Adam M.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/222661/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Egan</surname> <given-names>Sharon A.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/222664/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Emes</surname> <given-names>Richard D.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/23877/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Warry</surname> <given-names>Andrew</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/376940/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Leigh</surname> <given-names>James A.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/217601/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>School of Veterinary Medicine and Science, University of Nottingham</institution> <country>Sutton Bonington, UK</country></aff>
<aff id="aff2"><sup>2</sup><institution>Advanced Data Analysis Centre, University of Nottingham</institution> <country>Sutton Bonington, UK</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: <italic>Martin G. Klotz, Queens College, City University of New York, USA</italic></p></fn>
<fn fn-type="edited-by"><p>Reviewed by: <italic>Kevin S. McIver, University of Maryland, College Park, USA; Awdhesh Kalia, University of Texas MD Anderson Cancer Center, USA</italic></p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x002A;Correspondence: <italic>James A. Leigh, <email>james.leigh@nottingham.ac.uk</email></italic></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Microbial Physiology and Metabolism, a section of the journal Frontiers in Microbiology</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>25</day>
<month>10</month>
<year>2016</year>
</pub-date>
<pub-date pub-type="collection">
<year>2016</year>
</pub-date>
<volume>7</volume>
<elocation-id>1645</elocation-id>
<history>
<date date-type="received">
<day>01</day>
<month>07</month>
<year>2016</year>
</date>
<date date-type="accepted">
<day>03</day>
<month>10</month>
<year>2016</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2016 Blanchard, Egan, Emes, Warry and Leigh.</copyright-statement>
<copyright-year>2016</copyright-year>
<copyright-holder>Blanchard, Egan, Emes, Warry and Leigh</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>The Pragmatic Insertional Mutation Mapping (PIMMS) laboratory protocol was developed alongside various bioinformatics packages (<xref ref-type="bibr" rid="B3">Blanchard et al., 2015</xref>) to enable detection of essential and conditionally essential genes in <italic>Streptococcus</italic> and related bacteria. This extended the methodology commonly used to locate insertional mutations in individual mutants to the analysis of mutations in populations of bacteria. In <italic>Streptococcus uberis</italic>, a pyogenic <italic>Streptococcus</italic> associated with intramammary infection and mastitis in ruminants, the mutagen pGhost9:ISS1 was shown to integrate across the entire genome. Analysis of >80,000 mutations revealed 196 coding sequences, which were not be mutated and a further 67 where mutation only occurred beyond the 90th percentile of the coding sequence. These sequences showed good concordance with sequences within the database of essential genes and typically matched sequences known to be associated with basic cellular functions. Due to the broad utility of this mutagen and the simplicity of the methodology it is anticipated that PIMMS will be of value to a wide range of laboratories in functional genomic analysis of a wide range of Gram positive bacteria (<italic>Streptococcus, Enterococcus</italic>, and <italic>Lactococcus</italic>) of medical, veterinary, and industrial significance.</p>
</abstract>
<kwd-group>
<kwd>mutagenesis</kwd>
<kwd>insertion sequencing</kwd>
<kwd>essential genome</kwd>
<kwd><italic>Streptococcus</italic></kwd>
<kwd>laboratory protocol</kwd>
</kwd-group>
<contract-sponsor id="cn001">University of Nottingham<named-content content-type="fundref-id">10.13039/501100000837</named-content></contract-sponsor>
<counts>
<fig-count count="4"/>
<table-count count="4"/>
<equation-count count="0"/>
<ref-count count="40"/>
<page-count count="12"/>
<word-count count="0"/>
</counts>
</article-meta>
</front>
<body>
<sec><title>Introduction</title>
<p>Streptococci are significant pathogens of man, animals, aquatic mammals, and fish (<xref ref-type="bibr" rid="B6">Chanter, 1997</xref>). Some show a high degree of host and disease specificity whilst others are able to cause a wide array of different pathologies in distinct host targets (<xref ref-type="bibr" rid="B31">Steer et al., 2012</xref>). Many streptococcal species (including pathogens) are also able to co-exist in an asymptomatic carriage state with their host (<xref ref-type="bibr" rid="B25">Murphy and Frick, 2013</xref>); while others previously considered benign commensals, are now associated with colon cancer and endocarditis in humans (<xref ref-type="bibr" rid="B5">Chadfield et al., 2004</xref>; <xref ref-type="bibr" rid="B39">zur Hausen, 2006</xref>).</p>
<p><italic>Streptococcus uberis</italic> is a member of the pyogenic cluster of <italic>Streptococcus</italic>. Although, able to colonize the bovine gut asymptomatically, intramammary infection with this bacterium is one of the most common causes of bovine mastitis worldwide (<xref ref-type="bibr" rid="B4">Bradley et al., 2007</xref>); resulting in huge financial losses to the dairy industry and the requirement for large quantities of therapeutic antibiotics (<xref ref-type="bibr" rid="B27">Pol and Ruegg, 2007</xref>). <italic>S. uberis</italic> is amenable to insertional mutagenesis with the temperature sensitive mutagen, pGhost9:ISS1 (<xref ref-type="bibr" rid="B34">Ward et al., 2001</xref>), which has been used similarly in other species of <italic>Streptococcus, Lactococcus</italic>, and <italic>Enterococcus</italic> to gain insight of the role of individual bacterial sequences (<xref ref-type="bibr" rid="B23">Maguin et al., 1996</xref>; <xref ref-type="bibr" rid="B30">Spellerberg et al., 1999</xref>; <xref ref-type="bibr" rid="B2">Biswas and Biswas, 2011</xref>; <xref ref-type="bibr" rid="B1">Baureder and Hederstedt, 2012</xref>).</p>
<p>An understanding of the contribution of the entire bacterial genome to biological processes will enable a more comprehensive evaluation of microbial physiology and biochemistry. In doing so, it will be possible to identify bacterial gene products and combinations of gene products responsible for bacterial proliferation and survival against which new therapeutics and preventative disease controlling agents can be developed.</p>
<p>The use of random insertional mutagenesis coupled with high throughput sequencing technologies has enabled identification of essential and conditionally essential genes for many pathogenic bacteria. Various protocols have been developed to achieve this including; Tn-Seq (<xref ref-type="bibr" rid="B32">van Opijnen et al., 2009</xref>), INSeq (<xref ref-type="bibr" rid="B10">Goodman et al., 2009</xref>), HITS (<xref ref-type="bibr" rid="B9">Gawronski et al., 2009</xref>), and TraDIS (<xref ref-type="bibr" rid="B19">Langridge et al., 2009</xref>). These each require a series of complex steps to produce mutants and isolate DNA fragments flanking insertions and in some cases specialized sequencing procedures are required to generate the final data set. Various bioinformatic approaches and predictive modeling strategies have been used to analyze the vast amounts of data produced from such protocols (<xref ref-type="bibr" rid="B38">Zomer et al., 2012</xref>; <xref ref-type="bibr" rid="B8">Chao et al., 2013</xref>; <xref ref-type="bibr" rid="B28">Pritchard et al., 2014</xref>; <xref ref-type="bibr" rid="B3">Blanchard et al., 2015</xref>).</p>
<p>The use of inverse PCR of re-circularized restriction fragments to amplify sequences flanking insertions has been used for determination of the sites of individual mutations in various bacterial species including; <italic>Pseudomonas abietaniphila</italic> (<xref ref-type="bibr" rid="B24">Martin and Mohn, 1999</xref>), <italic>Mycoplasma genitalium</italic> (<xref ref-type="bibr" rid="B16">Hutchison et al., 1999</xref>), <italic>Xanthomonas albilineans</italic> (<xref ref-type="bibr" rid="B15">Huang et al., 2000</xref>), <italic>Helicobacter pylori</italic> (<xref ref-type="bibr" rid="B29">Salama et al., 2004</xref>), and <italic>S. uberis</italic> (<xref ref-type="bibr" rid="B34">Ward et al., 2001</xref>). However, methods that combine this commonly used strategy with high throughput technologies, to enable simultaneous analysis of bacterial mutant populations, have not been developed. The wide applicability of pGh9:ISS1 as a mutagen in <italic>Streptococcus</italic> (and related bacterial species) makes this an attractive target around which such technology may be produced.</p>
<p>In this communication, we describe the development and application of a simple, accessible laboratory protocol Pragmatic Insertional Mutation Mapping (PIMMS laboratory protocol), with wide applicability to any bacterial species mutated with pGh9:ISS1, using an existing bank of <italic>S. uberis</italic> mutants (<xref ref-type="bibr" rid="B34">Ward et al., 2001</xref>).</p>
</sec>
<sec><title>Methods</title>
<sec><title>Generation of Bacterial Mutant Pools</title>
<p>A culture of the bovine isolate of <italic>S. uberis</italic> 0140J that had been mutagenized (<xref ref-type="bibr" rid="B34">Ward et al., 2001</xref>) with the thermosensitive plasmid containing the insertion sequence element S1 (pGh9:ISS1) and stored at -80&#x00B0;C was used throughout this study. The viability and frequency of pGh9:ISS1 insertions within the culture were assessed by serial dilution plate counts on Todd-Hewitt agar (THA; Oxoid, UK) in the presence and absence of erythromycin (Ery; 1 &#x03BC;g/ml; Sigma Aldrich, UK). Total counts in the presence/absence of Ery were used to calculate the total number and the proportion of mutant bacteria within the culture.</p>
<p>Subsequently, a sample of the mutagenised culture, diluted appropriately, was plated and grown to single colonies on THA containing Ery (1 &#x03BC;g/ml). Ten pools each containing approximately 10<sup>4</sup> colonies were scraped from plates into phosphate buffered saline (PBS; Gibco, ThermoFisher, UK) and resulting bacterial suspension collected by centrifugation (8000 &#x00D7; <italic>g</italic>, for 10 min) washed (three times) in PBS, and finally suspended in pyrogen free saline (Sigma Aldrich, UK) containing 50% (v/v) glycerol. These pools were stored in aliquots and frozen at -80&#x00B0;C.</p>
</sec>
<sec><title>DNA Extraction</title>
<p>Chromosomal DNA was extracted from bacteria according to the method (<xref ref-type="bibr" rid="B13">Hill and Leigh, 1989</xref>). The final DNA sample was obtained by centrifugation (12, 000 &#x00D7; <italic>g</italic> for 5 min), and following removal of the supernatant was allowed to air dry at ambient temperature before being suspended in TE buffer containing 20 &#x03BC;g/ml RNAse A. DNA was quantified using the Qubit dsDNA Broad Range Fluorometric Assay kit (Life Sciences, UK), according to manufacturer&#x2019;s instructions.</p>
</sec>
<sec><title>Preparation of DNA for Inverse PCR Reaction</title>
<p>Restriction digests of DNA using <italic>HindIII</italic> and <italic>EcoR1</italic> were performed by the addition of 10 units of restriction enzyme to a total reaction volume of 50 &#x03BC;l using 1 &#x03BC;g of DNA and incubated for 1 h at 37&#x00B0;C. The reaction was then heat inactivated at 80&#x00B0;C for 20 min. The digested DNA was purified using PCR cleanup kit (Machery and Nagel, USA) and eluted using 30 &#x03BC;l of pre-heated (70&#x00B0;C) elution buffer.</p>
<p>Approximately 6 &#x03BC;g of genomic DNA was suspended in 200 &#x03BC;l TE buffer and fragmented to an average size of 3 kb using Covaris Adaptive Focused Acoustics (Covaris, Inc., USA) according to the manufacturer&#x2019;s protocol. The fragmented DNA was purified using Agencourt SPRI beads (Beckman Coulter, UK) according to the manufacturer&#x2019;s protocol; briefly 1.8x volume of beads was added to the DNA, mixed by pipetting and allowed to incubate at room temperature for 5 min. The beads containing DNA were separated from the supernatant using a magnetic stand and the supernatant aspirated. Beads were washed twice with high purity 70% ethanol, and the DNA eluted in molecular biology grade water (Fisher Scientific, UK). The size distribution of DNA fragments was quantified using an Agilent Bioanalyser in line with the standard protocols. Fragmented DNA was blunt end repaired using the NEBNext End Repair module (New England Biolabs, Inc., USA), purified with 1.8x SPRI beads (as previously described) and resuspended in 50 &#x03BC;l molecular biology grade water.</p>
<p>The end-repaired or restriction digested DNA (1 &#x03BC;g) was suspended in 750 &#x03BC;l ligase buffer in the presence of 1000U T4 ligase (New England Biolabs, Inc., USA); and incubated at 22&#x00B0;C overnight. The DNA was purified and concentrated using a PCR clean-up kit (Machery and Nagel, USA) and eluted using 30 &#x03BC;l of pre-heated (70&#x00B0;C) elution buffer.</p>
</sec>
<sec><title>Inverse PCR</title>
<p>An inverse PCR was conducted to enrich the sequence flanking the ISS1 element. In a 50 &#x03BC;l reaction volume, 100 ng of re-circularized DNA was used as template with 2 mM dNTPs and 10 pmol of each primer (P082 5&#x2032;-CCAACAGCGACAATAATCACATC-3&#x2032; and P064 5&#x2032;-AGAACCGAAGAATTCGAACGCTC-3&#x2032;). The reaction was incubated for 5 min at 98&#x00B0;C before the addition of 1 U of Phusion High fidelity DNA polymerase (New England Biolabs, Inc., USA) to initiate the reaction (denature of 98&#x00B0;C for 2 min followed by 35 cycles of 98&#x00B0;C for 10 s, 63&#x00B0;C for 30 s, 72&#x00B0;C for 1 min with a final extension of 8 min at 72&#x00B0;C). The PCR products were isolated using 1.8-volumes of Agencourt SPRI beads (Beckman and Coulter, UK) as previously described and suspended in 30 &#x03BC;l of Molecular biology water (Fisher Scientific, UK).</p>
</sec>
<sec><title>Nucleotide Sequencing</title>
<p>The purified PCR products were fragmented to 550 bp using Covaris Adaptive Focused Acoustics (Covaris, Inc., USA) following the manufacturer&#x2019;s directions. This size distribution of DNA fragments was estimated using an Agilent Bioanalyser 2100 using the DNA7500 kit (Agilent Technologies, USA) in line with the standard protocols. The samples were prepared for sequencing on the Illumina MiSeq platform at 2 &#x00D7; 250 bp reads using the Illumina TruSeq Nano library preparation kit (Illumina, Inc., USA).</p>
</sec>
<sec><title>Analysis of Data</title>
<p>Raw FASTQ files containing all fragment reads accompanied by quality scores and read identifiers from the sequence run generated by the MiSeq were analyzed using freely available software on a Linux system. The PIMMS pipeline (<xref ref-type="bibr" rid="B3">Blanchard et al., 2015</xref>) was used to process the reads and map them to the <italic>S. uberis</italic> 0140J reference genome [accession number AM946015 (<xref ref-type="bibr" rid="B35">Ward et al., 2009</xref>)]. Briefly, each sequence read was assessed for the presence of the terminal portion of ISS1 [accession number (of pGh9:ISS1) EU223008.1] and once identified, the remaining sequence was analyzed for quality. To ensure high quality mapping to the bacterial genome, each read was required to provide a Phred score of >30 (each base has a 99.9% confidence level) and adhere to a minimum (21 bp) length restriction, to ensure an unequivocal alignment. Only reads that reached these satisfactory quality and length requirements were mapped against the <italic>S. uberis</italic> genome. The mapping parameters were set to restrict any mismatch, to maintain high alignment accuracy and minimizing the likelihood of sequence ambiguity.</p>
</sec>
</sec>
<sec><title>Results</title>
<sec><title>Development of the PIMMS Protocol</title>
<p>The quality of the sequence data generated from each of the enriched samples was comparable to that obtained following direct sequencing of gDNA; all sequence data had an average Phred score of >35 and no single base had a Phred score of lower than 31 (<bold>Table <xref ref-type="table" rid="T1">1</xref></bold>).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Analysis of sequences by inverse PCR obtained from gDNA and preparations enriched for DNA flanking pGh9:IS<italic>S1</italic> insertions.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left"></td>
<th valign="top" align="center" colspan="4">DNA fragmentation method<hr/></th></tr>
<tr>
<th valign="top" align="left">Measured parameter</th>
<th valign="top" align="center"><italic>Eco</italic>R1</th>
<th valign="top" align="center"><italic>Hin</italic>dIII</th>
<th valign="top" align="center">Covaris</th>
<th valign="top" align="center">None<sup>1</sup></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Total sequence reads (TSRs)</td>
<td valign="top" align="center">3,848,211</td>
<td valign="top" align="center">3,294,706</td>
<td valign="top" align="center">2,977,706</td>
<td valign="top" align="center">3,377,794</td>
</tr>
<tr>
<td valign="top" align="left">Average Phred score</td>
<td valign="top" align="center">37.26</td>
<td valign="top" align="center">37.31</td>
<td valign="top" align="center">37.48</td>
<td valign="top" align="center">36.75</td>
</tr>
<tr>
<td valign="top" align="left">Minimum base Phred score</td>
<td valign="top" align="center">33.33</td>
<td valign="top" align="center">33.6</td>
<td valign="top" align="center">32.23</td>
<td valign="top" align="center">31.59</td></tr>
<tr>
<td valign="top" align="left">Maximum base Phred score</td>
<td valign="top" align="center">38.48</td>
<td valign="top" align="center">38.48</td>
<td valign="top" align="center">38.35</td>
<td valign="top" align="center">38.62</td>
</tr>
<tr>
<td valign="top" align="left">Matched reads (%)<sup>2</sup></td>
<td valign="top" align="center">2,400,817 (62.39)</td>
<td valign="top" align="center">2,075,014 (62.98)</td>
<td valign="top" align="center">1,380,699 (40.88)</td>
<td valign="top" align="center">5,817 (0.20)</td>
</tr>
<tr>
<td valign="top" align="left">Mapped reads (%)<sup>3</sup></td>
<td valign="top" align="center">937,081 (39.03)</td>
<td valign="top" align="center">613,441 (29.56)</td>
<td valign="top" align="center">341,562 (24.74)</td>
<td valign="top" align="center">1,439 (24.74)</td>
</tr>
<tr>
<td valign="top" align="left">Unique insertion points (UIP)</td>
<td valign="top" align="center">5,835</td>
<td valign="top" align="center">9,657</td>
<td valign="top" align="center">21,834</td>
<td valign="top" align="center">721</td>
</tr>
<tr>
<td valign="top" align="left">Average read depth (TSR/UIP)</td>
<td valign="top" align="center">160.6</td>
<td valign="top" align="center">63.5</td>
<td valign="top" align="center">15.6</td>
<td valign="top" align="center">1.9</td>
</tr>
<tr>
<td valign="top" align="left">Average inter-insertion distance (bp)</td>
<td valign="top" align="center">89</td>
<td valign="top" align="center">72</td>
<td valign="top" align="center">41</td>
<td valign="top" align="center">1054</td></tr>
</tbody></table>
<table-wrap-foot>
<attrib><italic>Data on each CDS is located in <bold>Supplementary Table <xref ref-type="supplementary-material" rid="SM1">S1</xref></bold>. <sup>1</sup>Not subjected to Inverse PCR enrichment gDNA only. <sup>2</sup>Number of sequence reads containing 23 bp of either terminus of IS<italic>S1.</italic> <sup>3</sup>Number of Matched sequence reads containing >20 bp of unambiguous chromosomal sequence adjacent to IS<italic>S1</italic> sequence.</italic></attrib>
</table-wrap-foot>
</table-wrap>
<p>Sequence data was analyzed and mapped back to the original genome using the PIMMS bioinformatic pipeline (<xref ref-type="bibr" rid="B3">Blanchard et al., 2015</xref>). Comparative analysis indicated that the number of sequence reads that included 23 bp from either terminus of the insertion sequence (Matched Reads; <bold>Table <xref ref-type="table" rid="T1">1</xref></bold>) was similar for the <italic>EcoR1</italic> and <italic>HindIII</italic> digested samples (62.3% and 62.9%, respectively). Proportionally fewer sequence reads (40.8%) produced from acoustically fragmented samples contained the equivalent ISS1 sequences and only very few (0.2%) of the sequence reads obtained directly from the untreated gDNA sample contained either terminus of the insertion sequence (<bold>Table <xref ref-type="table" rid="T1">1</xref></bold>).</p>
<p>A bioinformatic pipeline (<xref ref-type="bibr" rid="B3">Blanchard et al., 2015</xref>) was used to map the sequence directly adjacent to the IS<italic>S1</italic> terminus to the <italic>S. uberis</italic> reference genome (accession number AM946015; <xref ref-type="bibr" rid="B34">Ward et al., 2001</xref>). This revealed that between 39 and 25% of the matched reads mapped unambiguously to the source genome. However, the number of unique matches (unique mutations) showed that the libraries produced from endonuclease digestion were markedly less diverse than that generated by random acoustic shearing of gDNA (<bold>Table <xref ref-type="table" rid="T1">1</xref></bold>).</p>
<p>In each case, these libraries were enriched for flanking sequences (matched reads) compared to that produced from non-enriched gDNA; the <italic>EcoR1</italic>-based library was enriched 651-fold and those generated using <italic>HindIII</italic> and acoustically (randomly) sheared DNA were enriched 426 and 237 fold, respectively (<bold>Table <xref ref-type="table" rid="T1">1</xref></bold>). In all cases, locations of mutations were shown to be dispersed around the entire bacterial chromosome (<bold>Figure <xref ref-type="fig" rid="F1">1</xref></bold>).</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p><bold>Representation of unique insertions following analysis of different sequencing libraries.</bold> The genomic coordinates of each unique insertion were used to generate corresponding Circos plots (<xref ref-type="bibr" rid="B17">Krzywinski et al., 2009</xref>) of mutations in the genome of <italic>Streptococcus uberis</italic>. The height of each line represents mutation read depth (set to a maximum of fifty reads and a minimum of three). The libraries were generated from <bold>(A)</bold> <italic>EcoR1</italic> digested DNA; <bold>(B)</bold> <italic>HindIII</italic> digested DNA; <bold>(C)</bold> Acoustically fragmented DNA; or <bold>(D)</bold> Unenriched gDNA.</p></caption>
<graphic xlink:href="fmicb-07-01645-g001.tif"/>
</fig>
</sec>
<sec><title>Comparison of the Sample Production Methods on Insertion Discovery</title>
<p>As fewer unique insertion points were detected in the two libraries generated from inverse PCR products of endonuclease digested gDNA than in that generated by randomly shearing DNA prior to re-circularisation, it can be assumed that endonuclease fragmentation with either <italic>HindIII</italic> or <italic>EcoRI</italic> introduced bias in the process. This was assessed using <italic>in silico</italic> digestion of the <italic>S. uberis</italic> genome and the association of all insertions found within boundaries of each pair of restriction sites (<bold>Figure <xref ref-type="fig" rid="F2">2</xref></bold>). This demonstrated the tendency for smaller restriction fragments to yield insertion data (<bold>Figure <xref ref-type="fig" rid="F2">2A</xref></bold>); mapping insertions from the randomly sheared samples was largely independent of the length of the theoretical restriction fragments (<bold>Figure <xref ref-type="fig" rid="F2">2B</xref></bold>).</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p><bold>Mapping insertions to restriction fragments following fragmentation with the corresponding endonuclease or by random acoustic shearing.</bold> Fragment length produced by either endonuclease was calculated. The number of insertions located to each fragment was determined and a linear regression of normalized insertion (NIM; <xref ref-type="bibr" rid="B3">Blanchard et al., 2015</xref>) was used to show any trends. Data represents NIM (dots right axis) mapped to fragments produced with endonuclease (left axis, orange bars = <italic>HindIII</italic>; blue bars = <italic>EcoRI</italic>). <bold>(A)</bold> NIM from corresponding endonuclease generated sample or <bold>(B)</bold> or acoustic sheared sample. <italic>R</italic>-values chart <bold>(A)</bold> HindIII (0.5476) and EcoR1 (0.5283); chart <bold>(B)</bold> HindIII (0.0255) and EcoR1 (0.0057).</p></caption>
<graphic xlink:href="fmicb-07-01645-g002.tif"/>
</fig>
<p>Correspondence of the insertion data from the three enrichment protocols was investigated at the level of unique insertion discovery and at the resolution of coding sequence disruption (<bold>Figure <xref ref-type="fig" rid="F3">3</xref></bold>). A high proportion (87.5%) were discovered in the library originating from randomly sheared gDNA, whereas considerably fewer (21.7 and 35.8%) were identified in the libraries produced by digestion with <italic>EcoR1</italic> and <italic>HindIII</italic>, respectively. However, more than half the insertions (57.5%) were detected by combining data obtained from both libraries produced from endonuclease digested gDNA. Despite the clear superiority of using randomly sheared gDNA as the starting material in this process, the number of mutated coding sequences detected in libraries generated with randomly sheared or endonuclease fragmented gDNA were similar. Of 1474 mutated coding sequences (CDS), a very high proportion (98%) was identified using the randomly sheared sample library and 83.4 and 60.0% were identified using the <italic>HindIII</italic> and <italic>EcoRI</italic> generated samples, respectively. Cumulatively, the endonuclease generated libraries yielded insertion data in approximately 90% of the total mutated CDS identified in the study.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p><bold>Comparison of mutations detected in samples prepared by endonuclease digestion and random acoustic fragmentation (Covaris).</bold> Venn diagrams to demonstrate the overlap of mapped sequences detected from sample prepared using different procedures; <bold>(A)</bold> indicates the number of unique insertions using each procedure; <bold>(B)</bold> indicates the number of unique insertions using both endonuclease generated preparations compared to that generated by random acoustic shearing (Covaris); <bold>(C,D)</bold> as <bold>(A,B)</bold>, respectively, using mutated coding sequence as the unit of definition.</p></caption>
<graphic xlink:href="fmicb-07-01645-g003.tif"/>
</fig>
</sec>
<sec><title>The Application of PIMMS for Identification of Genes Essential for Bacterial Growth</title>
<p>Mutated cultures were plated on to solid media containing erythromycin and harvested in saline and stored at -80&#x00B0;C in the presence of glycerol to produce 10 amplified pools of mutants these were processed as previously described using acoustically sheared gDNA as the starting material (<bold>Table <xref ref-type="table" rid="T2">2</xref></bold>).</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Evaluation of sequences generated from 10 pools of <italic>Streptococcus uberis</italic> mutants.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left">Pool number</th>
<th valign="top" align="center">Total number of sequence reads</th>
<th valign="top" align="center">Matched reads (%)<sup>1</sup></th>
<th valign="top" align="center">Mapped reads (%)<sup>2</sup></th>
<th valign="top" align="center">Unique insertionswith &#x2265;3 occurrences<sup>3</sup></th>
<th valign="top" align="center">Average read depth</th>
<th valign="top" align="center">Number of CDSlacking insertions</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="center">8,953,812</td>
<td valign="top" align="center">4,437,287 (49.6)</td>
<td valign="top" align="center">2,178,162 (49)</td>
<td valign="top" align="center">10,647</td>
<td valign="top" align="center">204</td>
<td valign="top" align="center">574</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="center">7,861,879</td>
<td valign="top" align="center">2,426,829 (30.9)</td>
<td valign="top" align="center">1,071,066 (44)</td>
<td valign="top" align="center">13,669</td>
<td valign="top" align="center">78</td>
<td valign="top" align="center">473</td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="center">8,277,653</td>
<td valign="top" align="center">4,183,217 (50.5)</td>
<td valign="top" align="center">1,335,140 (32)</td>
<td valign="top" align="center">15,287</td>
<td valign="top" align="center">87</td>
<td valign="top" align="center">450</td>
</tr>
<tr>
<td valign="top" align="left">4</td>
<td valign="top" align="center">8,115,818</td>
<td valign="top" align="center">3,003,635 (37)</td>
<td valign="top" align="center">1,153,172 (38)</td>
<td valign="top" align="center">15,611</td>
<td valign="top" align="center">73</td>
<td valign="top" align="center">446</td>
</tr>
<tr>
<td valign="top" align="left">5</td>
<td valign="top" align="center">8,268,603</td>
<td valign="top" align="center">3,672,825 (44.4)</td>
<td valign="top" align="center">176,272 (4.8)</td>
<td valign="top" align="center">6,004</td>
<td valign="top" align="center">28</td>
<td valign="top" align="center">648</td>
</tr>
<tr>
<td valign="top" align="left">6</td>
<td valign="top" align="center">6,761,822</td>
<td valign="top" align="center">1,942,516 (28.7)</td>
<td valign="top" align="center">797,029 (41)</td>
<td valign="top" align="center">7,356</td>
<td valign="top" align="center">108</td>
<td valign="top" align="center">695</td>
</tr>
<tr>
<td valign="top" align="left">7</td>
<td valign="top" align="center">8,636,714</td>
<td valign="top" align="center">2,446,844 (28.3)</td>
<td valign="top" align="center">836,874 (34)</td>
<td valign="top" align="center">8,856</td>
<td valign="top" align="center">94</td>
<td valign="top" align="center">596</td>
</tr>
<tr>
<td valign="top" align="left">8</td>
<td valign="top" align="center">6,478,983</td>
<td valign="top" align="center">1,795,475 (27.7)</td>
<td valign="top" align="center">881,896 (49)</td>
<td valign="top" align="center">9,719</td>
<td valign="top" align="center">90</td>
<td valign="top" align="center">618</td>
</tr>
<tr>
<td valign="top" align="left">9</td>
<td valign="top" align="center">7,266,178</td>
<td valign="top" align="center">2,187,573 (30.1)</td>
<td valign="top" align="center">961,217 (43.9)</td>
<td valign="top" align="center">8,780</td>
<td valign="top" align="center">109</td>
<td valign="top" align="center">650</td>
</tr>
<tr>
<td valign="top" align="left">10</td>
<td valign="top" align="center">8,526,696</td>
<td valign="top" align="center">4,220,350 (49.5)</td>
<td valign="top" align="center">2,143,901 (50.7)</td>
<td valign="top" align="center">19,561</td>
<td valign="top" align="center">109</td>
<td valign="top" align="center">375</td>
</tr>
<tr>
<td valign="top" align="left">Pooled<sup>4</sup></td>
<td valign="top" align="center"><bold>79,148,158</bold></td>
<td valign="top" align="center"><bold>30,316,551 (38.3)</bold></td>
<td valign="top" align="center"><bold>11,534,729 (38)</bold></td>
<td valign="top" align="center"><bold>80,617</bold></td>
<td valign="top" align="center"><bold>182</bold></td>
<td valign="top" align="center"><bold>196</bold></td></tr>
</tbody></table>
<table-wrap-foot>
<attrib><italic>Data on each CDS is located in <bold>Supplementary Table <xref ref-type="supplementary-material" rid="SM1">S1</xref></bold>. <sup>1</sup>Sequence reads which contain 23 bp of IS<italic>S1</italic> terminal sequences. <sup>2</sup>Matched sequence reads in which 21&#x2013;50 bases adjacent to the ISS1 terminus have also been mapped to the reference genome [accession number AM946015 (<xref ref-type="bibr" rid="B35">Ward et al., 2009</xref>)]. <sup>3</sup>The number of unique flanking sequences identified within the mapped reads with >3 occurrences (<xref ref-type="bibr" rid="B10">Goodman et al., 2009</xref>). <sup>4</sup>Total unique mutations mapped in all pools.</italic></attrib>
</table-wrap-foot>
</table-wrap>
<p>Analysis of the combined sequence data from the mutant pools (approximately 10<sup>5</sup> individual mutant colonies) identified 80,617 unique insertion points; each mutated CDS having an average of 31 unique insertions. Analysis of the locations of all unique insertions within the genome identified 196 CDS where no insertion event was identified (<bold>Table <xref ref-type="table" rid="T3">3</xref></bold>) and a further 67 CDS where mutations were only detected in the last 10th percentile of the CDS; termed truncated genes (<bold>Table <xref ref-type="table" rid="T4">4</xref></bold>). Essential and truncated sequences were classified using RAST (Rapid Annotation using Subsystem Technology; <xref ref-type="bibr" rid="B40">Overbeek et al., 2014</xref>). The majority of non-mutated sequences were associated with transcription and translation and other basic cellular functions including those involved in catabolism and cell cycle (<bold>Table <xref ref-type="table" rid="T3">3</xref></bold>). The truncated sequences were dominated by genes associated with the synthesis of ribosomal proteins (<bold>Table <xref ref-type="table" rid="T4">4</xref></bold>).</p>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p>Rapid Annotation using Subsystem Technology (RAST) classification of <italic>S. uberis</italic> CDS containing no insertions.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left">RAST category</th>
<th valign="top" align="center">Count</th>
<th valign="top" align="left">Gene</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Amino acids and derivatives</td>
<td valign="top" align="center">2</td>
<td valign="top" align="left"><italic>aspS, alr</italic></td>
</tr>
<tr>
<td valign="top" align="left">Carbohydrates</td>
<td valign="top" align="center">12</td>
<td valign="top" align="left"><italic>fba, pgk, plr, gpmA, acoL, pfk, pdhC, pdhB, ptsH, gapN, pgi, gpsA</italic></td>
</tr>
<tr>
<td valign="top" align="left">Cell division and cell cycle</td>
<td valign="top" align="center">8</td>
<td valign="top" align="left"><italic>ftsA recU, ftsL, ftsZ</italic>, SUB1127, 1285, 1404, 1092</td>
</tr>
<tr>
<td valign="top" align="left">Cell wall and capsule</td>
<td valign="top" align="center">14</td>
<td valign="top" align="left"><italic>glr, dltC, murF, ddlA, murG, rmlB, murE, pbpX, glmS, rmlA, rlmC</italic>, SUB0010, 0696, 0697</td>
</tr>
<tr>
<td valign="top" align="left">Cofactors, vitamins, prosthetic groups</td>
<td valign="top" align="center">4</td>
<td valign="top" align="left"><italic>ppnK, dpfB, birA</italic>, SUB0641</td>
</tr>
<tr>
<td valign="top" align="left">DNA metabolism</td>
<td valign="top" align="center">11</td>
<td valign="top" align="left"><italic>mecA, dnaI, dnaG, hlpA, ssb, dnaC, parE, dnaH, plsX, xseB</italic>, SUB1777</td>
</tr>
<tr>
<td valign="top" align="left">Fatty acids, lipids, and isoprenoids</td>
<td valign="top" align="center">18</td>
<td valign="top" align="left"><italic>aacA, aacD, aacC, fabZ, uppS, dgk, mvaD, fni, fabD, fabK, acpP, fabH, a, mvaS, fabE, fabF</italic>, SUB1015, 1501</td>
</tr>
<tr>
<td valign="top" align="left">Hypothetical proteins</td>
<td valign="top" align="center">9</td>
<td valign="top" align="left"><italic>veg</italic>, SUB0149, 0332, 0388, 0726, 0930, 1286, 1547, 1619</td>
</tr>
<tr>
<td valign="top" align="left">Iron acquisition and metabolism</td>
<td valign="top" align="center">0</td>
<td valign="top" align="left">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">Membrane transport</td>
<td valign="top" align="center">6</td>
<td valign="top" align="left">SUB0581, 1004, 1158, 1413, 1799, 1853</td>
</tr>
<tr>
<td valign="top" align="left">Miscellaneous</td>
<td valign="top" align="center">11</td>
<td valign="top" align="left"><italic>prsA1, secG, ftsE</italic>, SUB0223, 0382, 0399, 1093, 1472, 1620, 1732, 1775</td>
</tr>
<tr>
<td valign="top" align="left">Nucleosides and nucleotides</td>
<td valign="top" align="center">6</td>
<td valign="top" align="left"><italic>prsA2, tmk, nrdH, adk, ybeY</italic> SUB1225</td>
</tr>
<tr>
<td valign="top" align="left">Phosphorus metabolism</td>
<td valign="top" align="center">0</td>
<td valign="top" align="left">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">Protein metabolism</td>
<td valign="top" align="center">58</td>
<td valign="top" align="left"><italic>alaS, engA, engC, fus, gatA, gatB, gltX, groES, infA, infB, infC, pheS, prfB, rimM, rplB, rplD, rplD, rplE, rplF, rplJ, rplK, rplK, rplN, rplO, rplP, rplR, rplS, rplU, rplV, rplW, rplX, rpmA, rpmB, rpmC, rpmD, rpme, rpmF, rpml, rpmJ, rpsB, rspC, rpsD, rpsE, rpsF, rpsG, rpsH, rpsI, rpsJ, rpsK, rpsL, rpsM, rpsN, rpsO, rpsQ, rpsS, rpsU, tufA</italic>, SUB1008, 1732A</td>
</tr>
<tr>
<td valign="top" align="left">Regulation and cell signaling</td>
<td valign="top" align="center">1</td>
<td valign="top" align="left"><italic>hisS</italic></td>
</tr>
<tr>
<td valign="top" align="left">Respiration</td>
<td valign="top" align="center">7</td>
<td valign="top" align="left"><italic>atpB, atpD, atpE, atpF, atpA, atpG, atpH</italic></td>
</tr>
<tr>
<td valign="top" align="left">RNA metabolism</td>
<td valign="top" align="center">13</td>
<td valign="top" align="left"><italic>leuS, asnS, glyS, proS, rnpA, era, trmD</italic>, SUB0849, 1470, 1467, 1616, 1618, 1847</td>
</tr>
<tr>
<td valign="top" align="left">Stress response</td>
<td valign="top" align="center">3</td>
<td valign="top" align="left"><italic>dnaJ, grpE</italic>, SUB0009</td>
</tr>
<tr>
<td valign="top" align="left">Virulence, disease and defense</td>
<td valign="top" align="center">4</td>
<td valign="top" align="left"><italic>vicR</italic>, SUB0502, 0505, 0506</td>
</tr>
<tr>
<td valign="top" align="left">Total</td>
<td valign="top" align="center">196</td>
<td valign="top" align="left"></td></tr>
</tbody>
</table>
</table-wrap>
<table-wrap position="float" id="T4">
<label>Table 4</label>
<caption><p>Rapid Annotation using Subsystem Technology classification of <italic>S. uberis</italic> CDS containing insertions only within the last 10% of the CDS.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left">RAST category</th>
<th valign="top" align="left">Count</th>
<th valign="top" align="left">Gene</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Amino acids and derivatives</td>
<td valign="top" align="left">2</td>
<td valign="top" align="left"><italic>cysE, mtnN</italic></td>
</tr>
<tr>
<td valign="top" align="left">Carbohydrates</td>
<td valign="top" align="left">2</td>
<td valign="top" align="left"><italic>ptsK, tpi</italic></td>
</tr>
<tr>
<td valign="top" align="left">Cell division and cell cycle</td>
<td valign="top" align="left">2</td>
<td valign="top" align="left"><italic>ftsW, ftsX</italic></td>
</tr>
<tr>
<td valign="top" align="left">Cell wall and capsule</td>
<td valign="top" align="left">10</td>
<td valign="top" align="left"><italic>bacA, gcaD, hasC, mraY, murB, murC, murM, rgpG</italic>, SUB426, 0700</td>
</tr>
<tr>
<td valign="top" align="left">Cofactors, vitamins, prosthetic groups</td>
<td valign="top" align="left">7</td>
<td valign="top" align="left"><italic>coaA, coaD, coaC, dyr, metK, nadE</italic>, SUB0356</td>
</tr>
<tr>
<td valign="top" align="left">DNA metabolism</td>
<td valign="top" align="left">5</td>
<td valign="top" align="left"><italic>dnaA, dnaN, holB, parC, gyrA</italic></td>
</tr>
<tr>
<td valign="top" align="left">Fatty acids, lipids, and isoprenoids</td>
<td valign="top" align="left">5</td>
<td valign="top" align="left"><italic>mvaA, mvaK2</italic>, SUB253, 0333, 1246</td>
</tr>
<tr>
<td valign="top" align="left">Hypothetical proteins</td>
<td valign="top" align="left">2</td>
<td valign="top" align="left">SUB1434, 1834</td>
</tr>
<tr>
<td valign="top" align="left">Iron acquisition and metabolism</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">Membrane transport</td>
<td valign="top" align="left">5</td>
<td valign="top" align="left"><italic>secA</italic>, SUB0511, 1005, 1852, 1854</td>
</tr>
<tr>
<td valign="top" align="left">Miscellaneous</td>
<td valign="top" align="left">4</td>
<td valign="top" align="left"><italic>tyrS</italic>, SUB0019, 0393, 0704</td>
</tr>
<tr>
<td valign="top" align="left">Nucleosides and nucleotides</td>
<td valign="top" align="left">5</td>
<td valign="top" align="left"><italic>gmk, pgmA, thiD</italic>, SUB0745, 1227</td>
</tr>
<tr>
<td valign="top" align="left">Phosphorus metabolism</td>
<td valign="top" align="left">1</td>
<td valign="top" align="left"><italic>ppaC</italic></td>
</tr>
<tr>
<td valign="top" align="left">Protein metabolism</td>
<td valign="top" align="left">11</td>
<td valign="top" align="left"><italic>argS, efp, prfA, pth, rplL, rplM, rplQ, serS, thrS, trsA</italic>, SUB0345</td>
</tr>
<tr>
<td valign="top" align="left">Regulation and cell signaling</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">Respiration</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">RNA metabolism</td>
<td valign="top" align="left">6</td>
<td valign="top" align="left"><italic>ileS, nusA, rpoD</italic>, SUB0013, 0873, 1479</td>
</tr>
<tr>
<td valign="top" align="left">Stress response</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">Virulence, disease and defense</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">Total</td>
<td valign="top" align="left">67</td>
<td valign="top" align="left"></td></tr>
</tbody>
</table>
</table-wrap>
<p>Insertion coordinate data was used to determine their location in the genome (<bold>Figure <xref ref-type="fig" rid="F4">4</xref></bold>). As previously detected, insertions were dispersed around the entire genome (<bold>Figure <xref ref-type="fig" rid="F4">4A</xref></bold>) and a kernel density plot showed insertions typically occurred along the entire length of mutated CDS (<bold>Figure <xref ref-type="fig" rid="F4">4B</xref></bold>) The PIMMS counts package (<xref ref-type="bibr" rid="B3">Blanchard et al., 2015</xref>) was used to evaluate the density of mutations identified within mutated CDS, the rate of insertion was found to be an average of 140 insertions per kb of CDS within the mutated genome. The genome sequence preceding the insertion point was evaluated to assess for the presence of any insertional motif; none was detected.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p><bold>Analysis of >80,000 unique insertions detected within the genome of <italic>S. uberis</italic> following PIMMS. (A)</bold> Circos graphical representation (<xref ref-type="bibr" rid="B17">Krzywinski et al., 2009</xref>) of the distribution of unique insertions identified within the <italic>S. uberis</italic> genome. <bold>(B)</bold> Kernal density plot displaying the proportion of individual mutations identified each centile position of mutated CDS within the <italic>S. uberis</italic> genome. <bold>(C)</bold> Box and whisker plot showing inter quartile range (box), median (line) and range (whiskers) of unique insertions detected following a random permutation test (1000 permutations of 1 million randomly selected sequence reads from each pool were used to calculate mean number of disrupted CDS that may be detected from any 1 of 10, 2 of 10, 3 of 10, etc. to 10 of 10 pools).</p></caption>
<graphic xlink:href="fmicb-07-01645-g004.tif"/>
</fig>
<p>To assess the level of redundancy in data acquisition the raw sequence-read data were subjected to a random permutation test using 1000 permutations for each parameter. Initially, samples (1 million sequence reads) randomly selected from each pool was used to calculate the number of mutated CDS detected in each. These data were used to calculate the mean number of mutated CDS detected and the upper and lower quartiles. The process was repeated for any 2 out of 10 pools, any 3 out of 10 pools and so on up to 10 out of 10 pools (<bold>Figure <xref ref-type="fig" rid="F4">4C</xref></bold>). This indicated that obtaining 1 million sequence reads from any five pools of mutants was likely to generate >85% of the total informative data relating to CDS requirement gained from sequencing all 10 mutant pools.</p>
<p>Using orthologous genes, the position and frequency of unique insertions were superimposed on to the glycolytic pathway (Kegg pathway identifier:sub00010) using Pathview (<xref ref-type="bibr" rid="B22">Luo and Brouwer, 2013</xref>). This revealed that most genes with unique 1:1 orthologs contained no insertions or contained insertion beyond the 90th percentile of their sequence. However, both enolase (sub0655) and pyruvate kinase (sub1000) tolerated mutation starting at the 39th and 46th percentile of their sequences, respectively. Mutations were present at low frequencies (sub0655 = 3.1 and sub1000 = 0.66 unique insertions/kb CDS) and these insertions were not abundant with Normalized Read Scores (NRM; <xref ref-type="bibr" rid="B3">Blanchard et al., 2015</xref>) of 16.86 and 0.17, respectively for each CDS; compared to a mean NRM of 539.33 for all (mutated and non-mutated) CDS in the genome. In this pathway, interconversion of glycerate-2P and glycerate-3P can be effected by two distinct enzymes, phosphoglycerate mutase and 2,3-di-phosphoglycerate-dependant phosphoglycerate mutase. Phosphoglycerate mutase has four orthologs in <italic>S. uberis</italic> (sub0594; sub0838, sub0839, and sub1509) all of which showed insertions throughout their sequences at frequencies ranging from 38.7 to 72.6 unique insertions/kb. Whereas, 2,3-di-phosphoglycerate-dependant phosphoglycerate mutase has only a single ortholog (sub1263; <italic>gpmA</italic>), this did not contain insertions.</p>
</sec>
</sec>
<sec><title>Discussion</title>
<p>The laboratory methodology (PIMMS laboratory protocol) described in this communication was an adaption and extension of techniques used previously to map individual mutations generated by the insertional mutagen pGh9:IS<italic>S1</italic> (<xref ref-type="bibr" rid="B34">Ward et al., 2001</xref>). By combining high-density random mutagenesis and readily available DNA sequencing protocols, PIMMS laboratory protocol was able to comprehensively generate data to identify mutated sequences in populations of <italic>S. uberis</italic> using the PIMMS bioinformatics packages (<xref ref-type="bibr" rid="B3">Blanchard et al., 2015</xref>). The ability to utilize bacteria mutagenized with pGh9:IS<italic>S1</italic> in this manner is a significant advance in the repertoire of tools available for functional genomics of <italic>Streptococcus, Lactococcus</italic>, and <italic>Enterococcus</italic> in which production of banks of random mutants using this mutagen is similarly straight forward (<xref ref-type="bibr" rid="B23">Maguin et al., 1996</xref>).</p>
<p>The PIMMS protocols are relatively simple and sequence data was produced using conventional, and thus readily available, library preparation, and sequencing protocols. In line with previous studies using this mutagen (<xref ref-type="bibr" rid="B23">Maguin et al., 1996</xref>) no obvious insertion motif was detected; the PIMMS bioinformatic pipeline provided data on base frequencies of insertion positions and whilst a slight bias for AT was seen (28.8% A and 25% T) this is consistent with the AT nucleotide content (63.4%) of <italic>S. uberis</italic>.</p>
<p>Detection of insertions within the non-enriched gDNA sample was in line with that predicted using the formula: (RS)/g = gn where &#x2018;R&#x2019; is the number of sequence reads (paired end) for the sample (2,977,706); &#x2018;S&#x2019; is the number of bases sequenced per read (500); &#x2018;g&#x2019; is the length of the genome (1,852,352 bp); and &#x2018;gn&#x2019; equals the number of genome equivalents sequenced; the predicted value for gn was 803. The actual number of unique insertions detected was 721; &#x223C;90% of that predicted.</p>
<p>The efficiency with which the protocol detected IS<italic>S1</italic> junctions using endonuclease-fragmented gDNA was higher than that obtained with randomly sheared gDNA. However, the diversity of the insertions mapped using endonuclease digested samples was reduced compared to those data obtained using acoustically fragmented gDNA. This may be explained by the proximity of the specific restriction enzyme recognition sites and the point of insertion, as we were able to demonstrate that insertions were preferentially detected from shorter restriction fragments in their corresponding endonuclease prepared samples. Furthermore, this effect was abolished when mapping insertions detected using the randomly sheared gDNA sample to these sequences (<bold>Figure <xref ref-type="fig" rid="F2">2</xref></bold>). The irregular distribution of such restriction sites within <italic>S. uberis</italic> (this particular strain contains 1086 <italic>HindIII</italic> and 495 <italic>EcoR1</italic> restriction sites) renders these more simple methodologies for gDNA fragmentation less useful in a comprehensive, genome-wide analysis of mutated populations. However, when data was analyzed by CDS mutation, the combined data obtained from both libraries produced by <italic>EcoRI</italic> and <italic>HindIII</italic> could be used to detect 90% of the total mutated CDS, of which 87.5% could be detected using gDNA fragmented with <italic>HindIII</italic> alone (compared to 98% detected in libraries produced from randomly sheared gDNA). Consequently, in the absence of the capability to generate precisely-sized, randomly sheared gDNA and with the application of carefully controlled experimental protocols (<xref ref-type="bibr" rid="B7">Chao et al., 2016</xref>), analysis of combinations of sequencing libraries made from inverse PCR products derived from endonuclease digestion of gDNA may be a practical alternative to identify conditionally essential CDS.</p>
<p>The movement toward next generation high-throughput transposon insertion-site sequencing in Streptococci has been shown in studies on <italic>Streptococcus pneumoniae</italic> (<xref ref-type="bibr" rid="B32">van Opijnen et al., 2009</xref>), <italic>Streptococcus pyogenes</italic> (<xref ref-type="bibr" rid="B20">Le Breton et al., 2015</xref>), and <italic>Streptococcus agalactiae</italic> (<xref ref-type="bibr" rid="B14">Hooven et al., 2016</xref>) and these provide a benchmark for the PIMMS protocol. In all cases the Tn-seq protocol was used for detection of the insertion junction sequences. In the initial studies using <italic>S. pneumoniae</italic> a population of 150,000 mutants was used and only 23,875 unique insertions (15.9%) were detected. In the present study, the pGh9:ISS1 mutagen compared favorably; in a population of approximately 115,000 mutants, 80,617 (&#x223C;70%) unique mutations were detected. These data further support the assertion that pGh9:ISS1 has neither a transposition bias to a specific insertion motif (<xref ref-type="bibr" rid="B11">Green et al., 2012</xref>) nor holds an insertional preference to specific structural features of DNA (<xref ref-type="bibr" rid="B18">Lampe et al., 1998</xref>); either of which can lead to pseudo-random insertions, limiting the range and variability of mutations that can be created within a population. In the studies in <italic>S. pyogenes</italic> (<xref ref-type="bibr" rid="B20">Le Breton et al., 2015</xref>) and <italic>S. agalactiae</italic> (<xref ref-type="bibr" rid="B14">Hooven et al., 2016</xref>) the mutation/mutant ratios were not reported.</p>
<p>Mutagenesis with pGhost9:ISS1 is very straight forward and enables production of mutagenized pools of bacteria that have undergone very little manipulation and/or inter-strain competition. In the current study, the only post-mutagenesis selection of mutants was their ability to survive a short outgrowth period (typically 2.5 h after transfer to the non-permissive temperature; <xref ref-type="bibr" rid="B34">Ward et al., 2001</xref>), storage at -80&#x00B0;C and to produce a colony on solid media containing erythromycin. Viable counts of harvested pools indicated that total and erythromycin resistant counts were the same and that approximately 10<sup>7</sup> cfu of <italic>S. uberis</italic> were obtained per harvested colony; thus each suspension contains an amplified pool of mutated bacteria in which the frequency of mutations approximated to the number of mutant colonies harvested. This enabled pools of mutants to be prepared as a reagent for subsequent repeated use, thus permitting greater cross comparison between studies of different phenotypic selections.</p>
<p>The efficiency with which the PIMMS pipeline identified matched and mapped reads was in line with expectations in consideration of the associated laboratory protocols. The average length of the PCR product from the randomly sheared and re-circularized gDNA template is &#x223C;2 kb; fragmentation of this to 550 bp for sequencing library preparation would generate 2&#x2013;4 fragments for sequencing equating to approximately 25&#x2013;50% containing the IS<italic>S1</italic> terminus (matched reads). We identified matched reads at a mean frequency of 38% (range from 28 to 51%; <bold>Table <xref ref-type="table" rid="T2">2</xref></bold>). It may be possible to generate smaller fragments of gDNA in the initial stages of this protocol and these might be expected to yield corresponding shorter inverse PCR products, which would proportionally increase the information content of each PCR product. However, experimentally, following the procedures outlined by <xref ref-type="bibr" rid="B12">Hartl and Ochman (1994)</xref>, an initial fragment size of &#x223C;3 kb was deemed optimal for generation of a re-circularized inverse PCR template.</p>
<p>Within this study we deemed a gene was essential when no insertion event was detected in the CDS. However, this was expanded to include those CDS where insertion events were detected only beyond the 90th percentile of the sequence. Such CDS may produce an incomplete but sometimes functional protein (carrying a relatively short C-terminal deletion of the gene product). The sequences of the essential and truncated genes detected using PIMMS were compared with other known essential genes obtained from the database of essential genes (DEGs; <xref ref-type="bibr" rid="B37">Zhang et al., 2004</xref>; <xref ref-type="bibr" rid="B36">Zhang and Lin, 2009</xref>; <xref ref-type="bibr" rid="B21">Luo et al., 2014</xref>). In <italic>S. uberis</italic>, genes encoding ribosomal proteins and transfer RNA adaptor molecules dominated the non-mutated (50%) and truncated (30%) CDS. Such sequences are highly conserved across most of the different bacterial species and were also essential in 82% of the genomes contained within the DEG, further indicating the suitability of the PIMMS laboratory protocol and bioinformatic pipeline for detection of essential (and/or conditionally essential) genes. Interestingly, a number of conserved hypothetical sequences within the essential <italic>S. uberis</italic> dataset (sub0149, sub223, sub399, sub1158, sub1413, sub1468, sub1619, sub1832A) were also identified as essential, in other Streptococcal species in the DEG.</p>
<p>Comparison of the ability of CDS associated with glycolysis to tolerate mutation indicated, not unexpectedly, this pathway to be comprised mainly of essential and/or terminally mutated sequences. Where clear redundancy existed, for instance in the case of phosphoglycerate mutase, the orthologous CDS (sub0594; sub0838, sub0839, and sub1509) were all mutated; suggesting that none had functional dominance. The interconversion of glycerate-2P to glycerate-3P may also be effected by a distinct activity, 2,3-di-phosphoglycerate-dependant phosphoglycerate mutase, for which only one ortholog (sub1263; <italic>gpmA</italic>) was detected and this was devoid of mutations indicating its essentiality and/or functional dominance. <italic>S. pyogenes</italic>, also contains multiple sequences encoding both activities capable of interconversion of glycreate-2P and glycerate-3P (<xref ref-type="bibr" rid="B26">Pancholi and Caparon, 2016</xref>). An investigation of two strains of <italic>S. pyogenes</italic> (<xref ref-type="bibr" rid="B20">Le Breton et al., 2015</xref>) identified three orthologs of gpm. In one strain of <italic>S. pyogenes, gpmA</italic> was essential whilst in another strain none of the sequences was clearly identified as essential. Somewhat surprisingly, and in contrast to the findings of <xref ref-type="bibr" rid="B20">Le Breton et al. (2015)</xref>, enolase (sub0655) was mutated in our study. Although insertions were not highly prevalent (four unique insertions in the CDS) in the population, indicating a high likelihood of a major role in bacterial fitness, these were not located at the extreme sequence termini implying this activity could be removed (at some fitness cost) under the conditions used. Similarly, detection of a single mutation (present at very low prevalence) in pyruvate kinase around the midpoint of its CDS suggests it also plays a major role in bacterial fitness. Alternative metabolic routes to pyruvate exist via products of the pentose phosphate pathway and from metabolism of acetyl CoA<sup><xref ref-type="fn" rid="fn01">1</xref></sup>. In addition, other activities encoded within the many hypothetical sequences may play key/redundant roles in metabolism.</p>
<p>The PIMMS pipeline may also be used in line with the annotation independent procedures described by <xref ref-type="bibr" rid="B7">Chao et al. (2016)</xref> to examine the role of non-coding regions of DNA. The functional understanding of the roles of non-coding DNA is still in its infancy, however variably sized areas of non-coding DNA fragments are known to bind to transcriptional factors to form enhancer or silencer regulatory regions (<xref ref-type="bibr" rid="B33">van Wolfswinkel and Ketting, 2010</xref>). The regions of the <italic>S. uberis</italic> genome that do not code for protein, account for approximately 9% of the total genome sequence. Analysis of these regions revealed a total of 1,149,915 insertion events; 9.8% of all detected insertions. There were 465 intragenic regions where no mutation events could be detected; suggesting some potential functional role may be associated with these sequences, but further detailed analysis is required to substantiate these claims.</p>
<p>Whilst there appears to be new techniques emerging for insertion mutation mapping, the straightforward and pragmatic nature of the laboratory protocol described in this communication: application of a mutagenic technique that is simple, randomly integrating and that does not require a highly transformable host, alongside readily accessible molecular biology techniques and conventional (commercially available) sequencing library preparation and sequencing protocols highlights the PIMMS laboratory methodology as a technology that is very accessible to the wider scientific community to enable functional description and annotation of an increasing list genome-sequenced <italic>Lactococcus, Streptococcus</italic>, and <italic>Enterococcus</italic>.</p>
</sec>
<sec><title>Author Contributions</title>
<p>JL and RE conceived the study; SE and AB developed the methodology; AW conducted pathway analysis. JL, AB, RE, and SE wrote the manuscript.</p>
</sec>
<sec><title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<ack>
<p>This project was funded by the University of Nottingham in collaboration with Zoetis.</p>
</ack>
<sec sec-type="supplementary material">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="http://journal.frontiersin.org/article/10.3389/fmicb.2016.01645">http://journal.frontiersin.org/article/10.3389/fmicb.2016.01645</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.XLS" id="SM1" mimetype="application/vnd.ms-excel" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>TABLE S1</label>
<caption><p><bold>Mapped insertion data for each coding sequence in the genome of <italic>S. uberis</italic> 0140J using PIMMS libraries generated with <italic>HindIII</italic>, <italic>EcoR1</italic>, acoustic shearing (Covaris), no treatment (gDNA); summarized in Table <xref ref-type="table" rid="T1">1</xref>.</bold> Mapped insertion data for each coding sequence in the genome of <italic>S. uberis</italic> 0140J using the combined pools of over >80,000 insertions in the final protocol; summarized in <bold>Table <xref ref-type="table" rid="T2">2</xref></bold>.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="Data_Sheet_1.XLS" id="S1" mimetype="application/vnd.ms-excel" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baureder</surname> <given-names>M.</given-names></name> <name><surname>Hederstedt</surname> <given-names>L.</given-names></name></person-group> (<year>2012</year>). <article-title>Genes important for catalase activity in <italic>Enterococcus faecalis</italic>.</article-title> <source><italic>PLoS ONE</italic></source> <volume>7</volume>:<issue>e36725</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0036725</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Biswas</surname> <given-names>S.</given-names></name> <name><surname>Biswas</surname> <given-names>I.</given-names></name></person-group> (<year>2011</year>). <article-title>Role of VltAB, an ABC transporter complex, in viologen tolerance in <italic>Streptococcus mutans</italic>. <italic>Antimicrob.</italic></article-title> <source><italic>Agents Chemother.</italic></source> <volume>55</volume> <fpage>1460</fpage>&#x2013;<lpage>1469</lpage>. <pub-id pub-id-type="doi">10.1128/AAC.01094-10</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Blanchard</surname> <given-names>A. M.</given-names></name> <name><surname>Leigh</surname> <given-names>J. A.</given-names></name> <name><surname>Egan</surname> <given-names>S. A.</given-names></name> <name><surname>Emes</surname> <given-names>R. D.</given-names></name></person-group> (<year>2015</year>). <article-title>Transposon insertion mapping with PIMMS &#x2013; pragmatic insertional mutation mapping system.</article-title> <source><italic>Front. Genet.</italic></source> <volume>6</volume>:<issue>139</issue>. <pub-id pub-id-type="doi">10.3389/fgene.2015.00139</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bradley</surname> <given-names>A. J.</given-names></name> <name><surname>Leach</surname> <given-names>K. A.</given-names></name> <name><surname>Breen</surname> <given-names>J. E.</given-names></name> <name><surname>Green</surname> <given-names>L. E.</given-names></name> <name><surname>Green</surname> <given-names>M. J.</given-names></name></person-group> (<year>2007</year>). <article-title>Survey of the incidence and aetiology of mastitis on dairy farms in England and Wales.</article-title> <source><italic>Vet. Rec.</italic></source> <volume>160</volume> <fpage>253</fpage>&#x2013;<lpage>257</lpage>. <pub-id pub-id-type="doi">10.1136/vr.160.8.253</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chadfield</surname> <given-names>M. S.</given-names></name> <name><surname>Christensen</surname> <given-names>J. P.</given-names></name> <name><surname>Christensen</surname> <given-names>H.</given-names></name> <name><surname>Bisgaard</surname> <given-names>M.</given-names></name></person-group> (<year>2004</year>). <article-title>Characterization of streptococci and enterococci associated with septicaemia in broiler parents with a high prevalence of endocarditis.</article-title> <source><italic>Avian Pathol.</italic></source> <volume>33</volume> <fpage>610</fpage>&#x2013;<lpage>617</lpage>.</citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chanter</surname> <given-names>N.</given-names></name></person-group> (<year>1997</year>). <article-title>Streptococci and enterococci as animal pathogens.</article-title> <source><italic>Soc. Appl. Bacteriol. Symp. Ser.</italic></source> <volume>26</volume> <fpage>100S</fpage>&#x2013;<lpage>109S</lpage>. <pub-id pub-id-type="doi">10.1046/j.1365-2672.83.s1.11.x</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chao</surname> <given-names>M. C.</given-names></name> <name><surname>Abel</surname> <given-names>S.</given-names></name> <name><surname>Davis</surname> <given-names>B. M.</given-names></name> <name><surname>Waldor</surname> <given-names>M. K.</given-names></name></person-group> (<year>2016</year>). <article-title>The design and analysis of transposon insertion sequencing experiments.</article-title> <source><italic>Nat. Rev. Microbiol.</italic></source> <volume>14</volume> <fpage>119</fpage>&#x2013;<lpage>128</lpage>. <pub-id pub-id-type="doi">10.1038/nrmicro.2015.7</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chao</surname> <given-names>M. C.</given-names></name> <name><surname>Pritchard</surname> <given-names>J. R.</given-names></name> <name><surname>Zhang</surname> <given-names>Y. J.</given-names></name> <name><surname>Rubin</surname> <given-names>E. J.</given-names></name> <name><surname>Livny</surname> <given-names>J.</given-names></name> <name><surname>Davis</surname> <given-names>B. M.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>High-resolution definition of the <italic>Vibrio cholerae</italic> essential gene set with hidden Markov model-based analyses of transposon-insertion sequencing data.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>41</volume> <fpage>9033</fpage>&#x2013;<lpage>9048</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkt654</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gawronski</surname> <given-names>J. D.</given-names></name> <name><surname>Wong</surname> <given-names>S. M. S.</given-names></name> <name><surname>Giannoukos</surname> <given-names>G.</given-names></name> <name><surname>Ward</surname> <given-names>D. V.</given-names></name> <name><surname>Akerley</surname> <given-names>B. J.</given-names></name></person-group> (<year>2009</year>). <article-title>Tracking insertion mutants within libraries by deep sequencing and a genome-wide screen for <italic>Haemophilus</italic> genes required in the lung.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>106</volume> <fpage>16422</fpage>&#x2013;<lpage>16427</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0906627106</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goodman</surname> <given-names>A. L.</given-names></name> <name><surname>McNulty</surname> <given-names>N. P.</given-names></name> <name><surname>Zhao</surname> <given-names>Y.</given-names></name> <name><surname>Leip</surname> <given-names>D.</given-names></name> <name><surname>Mitra</surname> <given-names>R. D.</given-names></name> <name><surname>Lozupone</surname> <given-names>C. A.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>Identifying genetic determinants needed to establish a human gut symbiont in its habitat.</article-title> <source><italic>Cell Host Microbe</italic></source> <volume>6</volume> <fpage>279</fpage>&#x2013;<lpage>289</lpage>. <pub-id pub-id-type="doi">10.1016/j.chom.2009.08.003</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Green</surname> <given-names>B.</given-names></name> <name><surname>Bouchier</surname> <given-names>C.</given-names></name> <name><surname>Fairhead</surname> <given-names>C.</given-names></name> <name><surname>Craig</surname> <given-names>N. L.</given-names></name> <name><surname>Cormack</surname> <given-names>B. P.</given-names></name></person-group> (<year>2012</year>). <article-title>Insertion site preference of Mu, Tn5 and Tn7 transposons.</article-title> <source><italic>Mob. DNA</italic></source> <volume>3</volume>:<issue>3</issue>. <pub-id pub-id-type="doi">10.1186/1759-8753-3-3</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hartl</surname> <given-names>D. L.</given-names></name> <name><surname>Ochman</surname> <given-names>H.</given-names></name></person-group> (<year>1994</year>). <article-title>Inverse polymerase chain reaction.</article-title> <source><italic>Methods Mol. Biol.</italic></source> <volume>31</volume> <fpage>187</fpage>&#x2013;<lpage>196</lpage>. <pub-id pub-id-type="doi">10.1385/0-89603-258-2:187</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hill</surname> <given-names>A.</given-names></name> <name><surname>Leigh</surname> <given-names>J.</given-names></name></person-group> (<year>1989</year>). <article-title>DNA fingerprinting of <italic>Streptococcus uberis</italic>: a useful tool for epidemiology of bovine mastitis.</article-title> <source><italic>Epidemiol. Infect.</italic></source> <volume>103</volume> <fpage>165</fpage>&#x2013;<lpage>171</lpage>. <pub-id pub-id-type="doi">10.1017/S0950268800030466</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hooven</surname> <given-names>T. A.</given-names></name> <name><surname>Catomeris</surname> <given-names>A. J.</given-names></name> <name><surname>Akabas</surname> <given-names>L. H.</given-names></name> <name><surname>Randis</surname> <given-names>T. M.</given-names></name> <name><surname>Maskell</surname> <given-names>D. J.</given-names></name> <name><surname>Peters</surname> <given-names>S. E.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>The essential genome of <italic>Streptococcus agalactiae</italic>.</article-title> <source><italic>BMC Genomics</italic></source> <volume>17</volume>:<issue>406</issue>. <pub-id pub-id-type="doi">10.1186/s12864-016-2741-z</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>G.</given-names></name> <name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Birch</surname> <given-names>R. G.</given-names></name></person-group> (<year>2000</year>). <article-title>Rapid amplification and cloning of Tn5 flanking fragments by inverse PCR.</article-title> <source><italic>Lett. Appl. Microbiol.</italic></source> <volume>31</volume> <fpage>149</fpage>&#x2013;<lpage>153</lpage>. <pub-id pub-id-type="doi">10.1046/j.1365-2672.2000.00781.x</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hutchison</surname> <given-names>C. A.</given-names> <suffix>III</suffix></name> <name><surname>Peterson</surname> <given-names>S. N. N.</given-names></name> <name><surname>Gill</surname> <given-names>S. R. R.</given-names></name> <name><surname>Cline</surname> <given-names>R. T. T.</given-names></name> <name><surname>White</surname> <given-names>O.</given-names></name> <name><surname>Fraser</surname> <given-names>C. M. M.</given-names></name><etal/></person-group> (<year>1999</year>). <article-title>Global transposon mutagenesis and a minimal mycoplasma genome.</article-title> <source><italic>Science</italic></source> <volume>286</volume> <fpage>2165</fpage>&#x2013;<lpage>2169</lpage>. <pub-id pub-id-type="doi">10.1126/science.286.5447.2165</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krzywinski</surname> <given-names>M. I.</given-names></name> <name><surname>Schein</surname> <given-names>J. E.</given-names></name> <name><surname>Birol</surname> <given-names>I.</given-names></name> <name><surname>Connors</surname> <given-names>J.</given-names></name> <name><surname>Gascoyne</surname> <given-names>R.</given-names></name> <name><surname>Horsman</surname> <given-names>D.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>Circos: an information aesthetic for comparative genomics.</article-title> <source><italic>Genome Res.</italic></source> <volume>19</volume> <fpage>1639</fpage>&#x2013;<lpage>1645</lpage>. <pub-id pub-id-type="doi">10.1101/gr.092759.109</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lampe</surname> <given-names>D. J.</given-names></name> <name><surname>Grant</surname> <given-names>T. E.</given-names></name> <name><surname>Robertson</surname> <given-names>H. M.</given-names></name></person-group> (<year>1998</year>) <article-title>Factors affecting transposition of the <italic>Himar1 mariner</italic> transposon <italic>in vitro</italic>.</article-title> <source><italic>Genetics</italic></source> <volume>149</volume> <fpage>179</fpage>&#x2013;<lpage>187</lpage>.</citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Langridge</surname> <given-names>G. C.</given-names></name> <name><surname>Phan</surname> <given-names>M.</given-names></name> <name><surname>Turner</surname> <given-names>D. J.</given-names></name> <name><surname>Perkins</surname> <given-names>T. T.</given-names></name> <name><surname>Parts</surname> <given-names>L.</given-names></name> <name><surname>Haase</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>Simultaneous assay of every <italic>Salmonella</italic> Typhi gene using one million transposon mutants.</article-title> <source><italic>Genome Res.</italic></source> <volume>19</volume> <fpage>2308</fpage>&#x2013;<lpage>2316</lpage>. <pub-id pub-id-type="doi">10.1101/gr.097097.109</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Le Breton</surname> <given-names>Y.</given-names></name> <name><surname>Belew</surname> <given-names>A. T.</given-names></name> <name><surname>Valdes</surname> <given-names>K. M.</given-names></name> <name><surname>Islam</surname> <given-names>E.</given-names></name> <name><surname>Curry</surname> <given-names>P.</given-names></name> <name><surname>Tettelin</surname> <given-names>H.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Essential genes in the core genome of the human pathogen <italic>Streptococcus pyogenes</italic>.</article-title> <source><italic>Sci. Rep.</italic></source> <volume>5</volume>:<issue>9838</issue>. <pub-id pub-id-type="doi">10.1038/srep09838</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Luo</surname> <given-names>H.</given-names></name> <name><surname>Lin</surname> <given-names>Y.</given-names></name> <name><surname>Gao</surname> <given-names>F.</given-names></name> <name><surname>Zhang</surname> <given-names>C.-T.</given-names></name> <name><surname>Zhang</surname> <given-names>R.</given-names></name></person-group> (<year>2014</year>). <article-title>DEG 10 an update of the database of essential genes that includes both protein-coding genes and noncoding genomic elements.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>42</volume> <fpage>574</fpage>&#x2013;<lpage>580</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkt1131</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Luo</surname> <given-names>W.</given-names></name> <name><surname>Brouwer</surname> <given-names>C.</given-names></name></person-group> (<year>2013</year>). <article-title>Pathview: an R/Bioconductor package for pathway-based data integration and visualization.</article-title> <source><italic>Bioinformatics</italic></source> <volume>29</volume> <fpage>1830</fpage>&#x2013;<lpage>1831.</lpage> <pub-id pub-id-type="doi">10.1093/bioinformatics/btt285</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maguin</surname> <given-names>E.</given-names></name> <name><surname>Pr&#x00E9;vost</surname> <given-names>H.</given-names></name> <name><surname>Ehrlich</surname> <given-names>S. D.</given-names></name> <name><surname>Gruss</surname> <given-names>A.</given-names></name></person-group> (<year>1996</year>). <article-title>Efficient insertional mutagenesis in lactococci and other gram-positive bacteria. <italic>J.</italic></article-title> <source><italic>Bacteriol.</italic></source> <volume>178</volume> <fpage>931</fpage>&#x2013;<lpage>935</lpage>.</citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Martin</surname> <given-names>V.</given-names></name> <name><surname>Mohn</surname> <given-names>W.</given-names></name></person-group> (<year>1999</year>). <article-title>An alternative inverse PCR (IPCR) method to amplify DNA sequences flanking Tn5 transposon insertions.</article-title> <source><italic>J. Microbiol. Methods</italic></source> <volume>35</volume> <fpage>163</fpage>&#x2013;<lpage>166</lpage>. <pub-id pub-id-type="doi">10.1016/S0167-7012(98)00115-8</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Murphy</surname> <given-names>E. C.</given-names></name> <name><surname>Frick</surname> <given-names>I. M.</given-names></name></person-group> (<year>2013</year>). <article-title>Gram-positive anaerobic cocci - commensals and opportunistic pathogens.</article-title> <source><italic>FEMS Microbiol. Rev.</italic></source> <volume>37</volume> <fpage>520</fpage>&#x2013;<lpage>553</lpage>. <pub-id pub-id-type="doi">10.1111/1574-6976.12005</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Overbeek</surname> <given-names>R.</given-names></name> <name><surname>Olson</surname> <given-names>R.</given-names></name> <name><surname>Pusch</surname> <given-names>G. D.</given-names></name> <name><surname>Olsen</surname> <given-names>G. J.</given-names></name> <name><surname>Davis</surname> <given-names>J. J.</given-names></name> <name><surname>Disz</surname> <given-names>T.</given-names></name><etal/></person-group>. (<year>2014</year>). <article-title>The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST).</article-title> <source><italic>Nucl. Acids Res.</italic></source> <volume>42</volume> <fpage>D206</fpage>&#x2013;<lpage>D214</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkt1226</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pancholi</surname> <given-names>V.</given-names></name> <name><surname>Caparon</surname> <given-names>M.</given-names></name></person-group> (<year>2016</year>). <article-title>&#x201C;<italic>Streptococcus pyogenes</italic> Metabolism,&#x201D; in</article-title> <source><italic>Streptococcus pyogenes: Basic Biology to Clinical Manifestations</italic></source> <role>eds</role> <person-group person-group-type="editor"><name><surname>Ferretti</surname> <given-names>J. J.</given-names></name> <name><surname>Stevens</surname> <given-names>D. L.</given-names></name> <name><surname>Fischetti</surname> <given-names>V. A.</given-names></name></person-group> (<publisher-loc>Oklahoma City, OK</publisher-loc>: <publisher-name>University of Oklahoma Health Sciences Center</publisher-name>).</citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pol</surname> <given-names>M.</given-names></name> <name><surname>Ruegg</surname> <given-names>P. L.</given-names></name></person-group> (<year>2007</year>). <article-title>Relationship between antimicrobial drug usage and antimicrobial susceptibility of gram-positive mastitis pathogens.</article-title> <source><italic>J. Dairy Sci.</italic></source> <volume>90</volume> <fpage>262</fpage>&#x2013;<lpage>273</lpage>. <pub-id pub-id-type="doi">10.3168/jds.S0022-0302(07)72627-9</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pritchard</surname> <given-names>J. R.</given-names></name> <name><surname>Chao</surname> <given-names>M. C.</given-names></name> <name><surname>Abel</surname> <given-names>S.</given-names></name> <name><surname>Davis</surname> <given-names>B. M.</given-names></name> <name><surname>Baranowski</surname> <given-names>C.</given-names></name> <name><surname>Zhang</surname> <given-names>Y. J.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>ARTIST: high-resolution genome-wide assessment of fitness using transposon-insertion sequencing.</article-title> <source><italic>PLoS Genet.</italic></source> <volume>10</volume>:<issue>e1004782</issue>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1004782</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Salama</surname> <given-names>N. R.</given-names></name> <name><surname>Shepherd</surname> <given-names>B.</given-names></name> <name><surname>Falkow</surname> <given-names>S.</given-names></name></person-group> (<year>2004</year>). <article-title>Global transposon mutagenesis and essential gene analysis of <italic>Helicobacter</italic> pylori.</article-title> <source><italic>J. Bacteriol.</italic></source> <volume>186</volume> <fpage>7926</fpage>&#x2013;<lpage>7935</lpage>. <pub-id pub-id-type="doi">10.1128/JB.186.23.7926-7935.2004</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spellerberg</surname> <given-names>B.</given-names></name> <name><surname>Pohl</surname> <given-names>B.</given-names></name> <name><surname>Haase</surname> <given-names>G.</given-names></name> <name><surname>Martin</surname> <given-names>S.</given-names></name> <name><surname>Weber-Heynemann</surname> <given-names>J.</given-names></name> <name><surname>L&#x00FC;tticken</surname> <given-names>R.</given-names></name></person-group> (<year>1999</year>). <article-title>Identification of genetic determinants for the hemolytic activity of <italic>Streptococcus agalactiae</italic> by ISS1 transposition. <italic>J.</italic></article-title> <source><italic>Bacteriol.</italic></source> <volume>181</volume> <fpage>3212</fpage>&#x2013;<lpage>3219</lpage>.</citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Steer</surname> <given-names>A. C.</given-names></name> <name><surname>Lamagni</surname> <given-names>T.</given-names></name> <name><surname>Curtis</surname> <given-names>N.</given-names></name> <name><surname>Carapetis</surname> <given-names>J. R.</given-names></name></person-group> (<year>2012</year>). <article-title>Invasive group a streptococcal disease: Epidemiology, pathogenesis and management.</article-title> <source><italic>Drugs</italic></source> <volume>72</volume> <fpage>1213</fpage>&#x2013;<lpage>1227</lpage>. <pub-id pub-id-type="doi">10.2165/11634180-000000000-00000</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>van Opijnen</surname> <given-names>T.</given-names></name> <name><surname>Bodi</surname> <given-names>K. L.</given-names></name> <name><surname>Camilli</surname> <given-names>A.</given-names></name></person-group> (<year>2009</year>). <article-title>Tn-seq: high-throughput parallel sequencing for fitness and genetic interaction studies in microorganiams.</article-title> <source><italic>Nat. Methods</italic></source> <volume>6</volume> <fpage>767</fpage>&#x2013;<lpage>772</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.1377</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>van Wolfswinkel</surname> <given-names>J. C.</given-names></name> <name><surname>Ketting</surname> <given-names>R. F.</given-names></name></person-group> (<year>2010</year>). <article-title>The role of small non-coding RNAs in genome stability and chromatin organization.</article-title> <source><italic>J. Cell Sci.</italic></source> <volume>123</volume> <fpage>1825</fpage>&#x2013;<lpage>1839.</lpage> <pub-id pub-id-type="doi">10.1242/jcs.061713</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ward</surname> <given-names>P. N.</given-names></name> <name><surname>Field</surname> <given-names>T. R.</given-names></name> <name><surname>Ditcham</surname> <given-names>W. G. F.</given-names></name> <name><surname>Maguin</surname> <given-names>E.</given-names></name> <name><surname>Leigh</surname> <given-names>J. A.</given-names></name></person-group> (<year>2001</year>). <article-title>Identification and disruption of two discrete loci encoding hyaluronic acid capsule biosynthesis genes hasA, hasB, and hasC in <italic>Streptococcus uberis</italic>.</article-title> <source><italic>Am. J. Microbiol.</italic></source> <volume>69</volume> <fpage>392</fpage>&#x2013;<lpage>399</lpage>.</citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ward</surname> <given-names>P. N.</given-names></name> <name><surname>Holden</surname> <given-names>M. T. G.</given-names></name> <name><surname>Leigh</surname> <given-names>J. A.</given-names></name> <name><surname>Lennard</surname> <given-names>N.</given-names></name> <name><surname>Bignell</surname> <given-names>A.</given-names></name> <name><surname>Barron</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>Evidence for niche adaptation in the genome of the bovine pathogen <italic>Streptococcus uberis</italic>.</article-title> <source><italic>BMC Genomics</italic></source> <volume>10</volume>:<issue>54</issue>. <pub-id pub-id-type="doi">10.1186/1471-2164-10-54</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>R.</given-names></name> <name><surname>Lin</surname> <given-names>Y.</given-names></name></person-group> (<year>2009</year>). <article-title>DEG 5.0 a database of essential genes in both prokaryotes and eukaryotes.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>37</volume> <fpage>455</fpage>&#x2013;<lpage>458</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkn858</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>R.</given-names></name> <name><surname>Ou</surname> <given-names>H.-Y.</given-names></name> <name><surname>Zhang</surname> <given-names>C.-T.</given-names></name></person-group> (<year>2004</year>). <article-title>DEG: a database of essential genes.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>32</volume> <fpage>271</fpage>&#x2013;<lpage>272</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkh024</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zomer</surname> <given-names>A.</given-names></name> <name><surname>Burghout</surname> <given-names>P.</given-names></name> <name><surname>Bootsma</surname> <given-names>H. J.</given-names></name> <name><surname>Hermans</surname> <given-names>P. W. M.</given-names></name> <name><surname>van Hijum</surname> <given-names>S. A.</given-names></name></person-group> (<year>2012</year>). <article-title>ESSENTIALS: software for rapid analysis of high throughput transposon insertion sequencing data.</article-title> <source><italic>PLoS ONE</italic></source> <volume>7</volume>:<issue>e43012</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0043012</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>zur Hausen</surname> <given-names>H.</given-names></name></person-group> (<year>2006</year>). <article-title><italic>Streptococcus bovis</italic>: causal or incidental involvement in cancer of the colon?</article-title> <source><italic>Int. J. Cancer</italic></source> <volume>119</volume> <fpage>xi</fpage>&#x2013;<lpage>xii</lpage>. <pub-id pub-id-type="doi">10.1002/ijc.22314</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="fn01">
<label>1</label>
<p><ext-link ext-link-type="uri" xlink:href="http://www.genome.jp/kegg-bin/show_pathway?sub00010">http://www.genome.jp/kegg-bin/show_pathway?sub00010</ext-link></p>
</fn>
</fn-group>
</back>
</article>