<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Microbiol.</journal-id>
<journal-title>Frontiers in Microbiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Microbiol.</abbrev-journal-title>
<issn pub-type="epub">1664-302X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmicb.2017.01700</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Microbiology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Whole-Genome Sequencing for National Surveillance of <italic>Shigella flexneri</italic></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Chattaway</surname> <given-names>Marie A.</given-names></name>
<xref ref-type="author-notes" rid="fn001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/328272/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Greig</surname> <given-names>David R.</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/452983/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Gentle</surname> <given-names>Amy</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/473013/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Hartman</surname> <given-names>Hassan B.</given-names></name>
</contrib>
<contrib contrib-type="author">
<name><surname>Dallman</surname> <given-names>Timothy J.</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/320577/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Jenkins</surname> <given-names>Claire</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/308964/overview"/>
</contrib>
</contrib-group>
<aff><institution>Gastrointestinal Bacteria Reference Unit, Public Health England</institution> <country>London, United Kingdom</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: <italic>Fumito Maruyama, Kyoto University, Japan</italic></p></fn>
<fn fn-type="edited-by"><p>Reviewed by: <italic>Mar&#x00ED;a de Toro, Centro de Investigaci&#x00F3;n Biom&#x00E9;dica de La Rioja (CIBIR), Spain; Miklos Fuzi, Semmelweis University, Hungary</italic></p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x002A;Correspondence: <italic>Marie A. Chattaway, <email>marie.chattaway@phe.gov.uk</email></italic></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Infectious Diseases, a section of the journal Frontiers in Microbiology</p></fn></author-notes>
<pub-date pub-type="epub">
<day>19</day>
<month>09</month>
<year>2017</year>
</pub-date>
<pub-date pub-type="collection">
<year>2017</year>
</pub-date>
<volume>8</volume>
<elocation-id>1700</elocation-id>
<history>
<date date-type="received">
<day>21</day>
<month>06</month>
<year>2017</year>
</date>
<date date-type="accepted">
<day>23</day>
<month>08</month>
<year>2017</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2017 Chattaway, Greig, Gentle, Hartman, Dallman and Jenkins.</copyright-statement>
<copyright-year>2017</copyright-year>
<copyright-holder>Chattaway, Greig, Gentle, Hartman, Dallman and Jenkins</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>National surveillance of <italic>Shigella flexneri</italic> ensures the rapid detection of outbreaks to facilitate public health investigation and intervention strategies. In this study, we used whole-genome sequencing (WGS) to type <italic>S. flexneri</italic> in order to detect linked cases and support epidemiological investigations. We prospectively analyzed 330 isolates of <italic>S. flexneri</italic> received at the Gastrointestinal Bacteria Reference Unit at Public Health England between August 2015 and January 2016. Traditional phenotypic and WGS sub-typing methods were compared. PCR was carried out on isolates exhibiting phenotypic/genotypic discrepancies with respect to serotype. Phylogenetic relationships between isolates were analyzed by WGS using single nucleotide polymorphism (SNP) typing to facilitate cluster detection. For 306/330 (93%) isolates there was concordance between serotype derived from the genome and phenotypic serology. Discrepant results between the phenotypic and genotypic tests were attributed to novel O-antigen synthesis/modification gene combinations or indels identified in O-antigen synthesis/modification genes rendering them dysfunctional. SNP typing identified 36 clusters of two isolates or more. WGS provided microbiological evidence of epidemiologically linked clusters and detected novel O-antigen synthesis/modification gene combinations associated with two outbreaks. WGS provided reliable and robust data for monitoring trends in the incidence of different serotypes over time. SNP typing can be used to facilitate outbreak investigations in real-time thereby informing surveillance strategies and providing the opportunities for implementing timely public health interventions.</p>
</abstract>
<kwd-group>
<kwd><italic>Shigella flexneri</italic></kwd>
<kwd>whole-genome sequencing</kwd>
<kwd>surveillance</kwd>
<kwd>outbreaks</kwd>
<kwd>phylogeny</kwd>
</kwd-group>
<contract-num rid="cn001">109524</contract-num>
<contract-sponsor id="cn001">National Institute for Health Research<named-content content-type="fundref-id">10.13039/501100000272</named-content></contract-sponsor>
<counts>
<fig-count count="1"/>
<table-count count="2"/>
<equation-count count="0"/>
<ref-count count="29"/>
<page-count count="6"/>
<word-count count="0"/>
</counts>
</article-meta>
</front>
<body>
<sec><title>Introduction</title>
<p>Shigellosis is caused by four species of <italic>Shigella</italic>, including <italic>S. boydii, S. dysenteriae, S. flexneri</italic> and <italic>S. sonnei</italic>, transmitted via the fecal oral route. Symptoms typically start 1&#x2013;2 days after exposure and include diarrhea, bloody diarrhea abdominal pain, fever, and tenesmus. The burden of shigellosis is highest in developing countries with up to 167 million episodes of diarrhea annually, leading to over a million deaths (<xref ref-type="bibr" rid="B14">Kotloff et al., 1999</xref>). A multicenter study of shigellosis in six Asian countries indicated the incidence rate to be highest in children under the age of 4 years old and in adults over 70 years old (<xref ref-type="bibr" rid="B26">von Seidlein et al., 2006</xref>). In the United Kingdom, <italic>S. flexneri</italic> is most commonly associated with causing travelers&#x2019; diarrhea and outbreaks of gastrointestinal symptoms in men who have sex with men (MSM) (<xref ref-type="bibr" rid="B21">Simms et al., 2015</xref>). Furthermore, there are reports of increased intercontinental dissemination of multidrug resistant <italic>S. flexneri</italic> (<xref ref-type="bibr" rid="B2">Baker et al., 2015</xref>).</p>
<p>Between 2004 and 2015, 18,266 <italic>Shigella</italic> cases were reported by GBRU for England and Wales with <italic>S. flexneri</italic> accounting for 7075 (39%) of these infections (<italic>S. sonnei n</italic> = 8897, 49%; <italic>S. boydii n</italic> = 1364, 7%; <italic>S. dysenteriae n</italic> = 808, 4%; <italic>Shigella</italic> species unknown <italic>n</italic> = 122, 1%).</p>
<p><italic>Shigella flexneri</italic> are traditionally serotyped phenotypically using antisera raised in rabbits, although molecular PCR methods have been implemented in a number of reference laboratories (<xref ref-type="bibr" rid="B29">Zhang et al., 2012</xref>; <xref ref-type="bibr" rid="B11">Gentle et al., 2016</xref>). Serotyping provides limited resolution with serotypes 2a (<italic>n</italic> = 2448, 35%), 3a (<italic>n</italic> = 1476, 21%), 6 (<italic>n</italic> = 1047, 15%), and 1b (<italic>n</italic> = 711, 10%) accounting for 81% of <italic>S. flexneri</italic> cases. Without a higher level of discrimination, outbreak detection is dependent on the identification of epidemiological links between cases belonging to the same serotype.</p>
<p>Whole-genome sequencing (WGS) has been shown to have potential in replacing traditional phenotypic and PCR methods (<xref ref-type="bibr" rid="B11">Gentle et al., 2016</xref>) for routine surveillance. This approach has the added value of further discriminating strains by their genetic relatedness to a single nucleotide polymorphism (SNP) level and has been used to investigate multiple gastrointestinal outbreaks at Public Health England (PHE) (<xref ref-type="bibr" rid="B18">McDonnell et al., 2013</xref>; <xref ref-type="bibr" rid="B9">Dallman et al., 2015</xref>, <xref ref-type="bibr" rid="B10">2016</xref>).</p>
<p>The aim of this study was to compare traditional serotyping with WGS for serotyping <italic>S. flexneri</italic> for routine public health surveillance and to evaluate utility of WGS data to support epidemiologically linked clusters and outbreak investigations.</p>
</sec>
<sec id="s1" sec-type="materials|methods">
<title>Materials and Methods</title>
<sec><title>Bacterial Strains</title>
<p>Bacterial isolates of <italic>S. flexneri</italic> from 330 cases were submitted to the Gastrointestinal Bacterial Reference Unit between August 2015 and January 2016, from local and regional hospital laboratories in England and Wales. This strain set comprised the following phenotypic serotypes (numbers of isolates belonging to each serotype in parenthesis): 1a (3), 1b (16), 1c (25), 2a (185), 2b (16), 3a (41), 3b (7), 4av/E1037 (9), 6 (19) X (4), Y (4), X and Y (1). Epidemiological data on age, sex, and region of residence were available from laboratory report forms. Travel history was available for 176 of cases. All isolates were serotyped and WGS.</p>
</sec>
<sec><title>Serotyping and PCR</title>
<p>Phenotypic identification of <italic>S. flexneri</italic> isolates were confirmed using the Omnilog GenIII MicroPlate (Biolog, Hayward, CA, United States). Serotyping was carried out using standard methods by slide agglutination using both commercially available monovalent antisera (Denka Seiken, Japan) and monoclonal antibody reagents (Reagensia AB, Sweden) and in-house antisera raised in rabbits (<xref ref-type="bibr" rid="B13">Gross and Rowe, 1985</xref>). Molecular PCR was carried out on the discrepancies between phenotypic and WGS typing results as previously described (<xref ref-type="bibr" rid="B11">Gentle et al., 2016</xref>) and a serotype assigned according to the gene combination detected (<bold>Supplementary Table <xref ref-type="supplementary-material" rid="SM1">S1</xref></bold>).</p>
</sec>
<sec><title>Whole-Genome Sequencing</title>
<p>Genome sequencing and sequencing analysis were carried out as previously described (<xref ref-type="bibr" rid="B9">Dallman et al., 2015</xref>). Genomic DNA extracted using the QiaSymphony DNA extraction platform (Qiagen) from 330 <italic>S. flexneri</italic> was fragmented and tagged for multiplexing with Nextera XT DNA Sample Preparation Kits (Illumina) and sequenced using the Illumina HiSeq 2500 at PHE. A reference database containing the gene sequences encoding the 12 O-antigen synthesis or modification genes described by <xref ref-type="bibr" rid="B25">Sun et al. (2011</xref>, <xref ref-type="bibr" rid="B23">2012a</xref>,<xref ref-type="bibr" rid="B24">b</xref>), including <italic>wzxc1-5, wzxe1-5, wzx6, gtrI, gtrII, gtrIV, gtrV, gtrX, gtr1c, oac oac1b</italic> and <italic>opt</italic>, was constructed. Using the <italic>GeneFinder</italic> tool (Doumith, unpublished), FASTQ reads were mapped to the <italic>S. flexneri</italic> O-antigen synthesis or modification genes using Bowtie 2 (<xref ref-type="bibr" rid="B15">Langmead and Salzberg, 2012</xref>) and the best match to each target was reported with metrics including coverage, depth, mixture and nucleotide similarity in XML format for quality assessment. Only <italic>in silico</italic> predictions of serotype that matched to a gene determinant at >80% nucleotide identity over >80% target gene length were accepted. FASTQ sequences were deposited in the National Center for Biotechnology Information Short Read Archive under the bioproject <ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="PRJNA315192">PRJNA315192</ext-link> (see <bold>Supplementary Table <xref ref-type="supplementary-material" rid="SM2">S2</xref></bold> for SRA identifiers).</p>
</sec>
<sec><title>Cluster Detection</title>
<p>Short reads were quality trimmed and mapped to the reference <italic>S. flexneri</italic> serotype 2a strain 2457T (AE014073.1) (<xref ref-type="bibr" rid="B27">Wei et al., 2003</xref>) or the reference strain NC_007613 if <italic>S. flexneri</italic> serotype 6, using BWA v0.75 (<xref ref-type="bibr" rid="B17">Li et al., 2009</xref>; <xref ref-type="bibr" rid="B4">Bolger et al., 2014</xref>). The Sequence Alignment Map output from BWA was sorted and indexed to produce a Binary Alignment Map (BAM) using Samtools (<xref ref-type="bibr" rid="B16">Li and Durbin, 2010</xref>). GATK v2.6.5 was used to create a Variant Call Format (VCF) file from each of the BAMs, which were further parsed to extract only SNP positions which were of high quality (MQ > 30, DP > 10, GQ > 30, Variant Ratio > 0.9) (<xref ref-type="bibr" rid="B19">McKenna et al., 2010</xref>). Gubbins v2.0.0 (<xref ref-type="bibr" rid="B8">Croucher et al., 2015</xref>) was used to identify recombinant regions of the genome which were subsequently masked for phylogenetic analysis. Pseudosequences of polymorphic positions were used to create maximum likelihood trees using RAxML v8.1.17 (<xref ref-type="bibr" rid="B22">Stamatakis, 2014</xref>). <italic>De novo</italic> assembly was carried out using Spades 3.5.0 using &#x2018;&#x2013;careful&#x2019; and &#x2018; -k 21,33,55,65,77,83,91&#x2019; options (<xref ref-type="bibr" rid="B3">Bankevich et al., 2012</xref>).</p>
<p>To proactively detect outbreaks from WGS data, SNP typing was carried out on <italic>S. flexneri</italic> isolates belonging to clonal complex (CC) 245 and CC145. CC145 mostly comprises <italic>S. boydii</italic> serotypes but includes <italic>S. flexneri</italic> serotype 6, as this <italic>S. flexneri</italic> serotype was misidentified historically (<xref ref-type="bibr" rid="B28">Wirth et al., 2006</xref>; <xref ref-type="bibr" rid="B6">Chattaway et al., 2017</xref>). Hierarchical single linkage clustering was performed at seven descending thresholds of SNP distance (&#x0394;250, &#x0394;100, &#x0394;50, &#x0394;25, &#x0394;10, &#x0394;5, &#x0394;0) as previously described (<xref ref-type="bibr" rid="B10">Dallman et al., 2016</xref>). This clustering results in a discrete seven digit code where each number represents the cluster membership at each descending SNP distance threshold. The resultant SNP addresses describes an isolates position in the <italic>S. flexneri</italic> population structure where two isolates with the same SNP addresses have 0 SNP differences between them.</p>
</sec>
</sec>
<sec><title>Results</title>
<sec><title>Demographic of Patients</title>
<p>There were 330 isolates reported between 1st August 2015 and 18th January 2016, 236 (71%) were from males, 85 (26%) from females, and for nine (3%) cases the sex was not stated. Travel history was not provided for 154 (47%) cases, 85 cases reported travel 7 days prior to onset of symptoms and 91 (27%) reported that they did not travel during the 7 days prior to onset of symptoms. The most frequently reported destinations were India (<italic>n</italic> = 16, 5%) and Pakistan (<italic>n</italic> = 11, 3%) with other counties accounting for less &#x003C; 1% each.</p>
</sec>
<sec><title>Comparison of WGS Predicted Serotype versus PCR Predicted Serotype Results</title>
<p>Of the 330 cultures tested prospectively by WGS, 306 (93%) had concordant results with phenotypic serotyping. Three of the mismatched results between the phenotypic and genotypic tests were attributed to mutations identified in O-antigen synthesis or modification genes (<bold>Table <xref ref-type="table" rid="T1">1</xref></bold>), including nonsense mutations resulting in early stop codons (<italic>n</italic> = 2) and a frameshift mutation. Repeat testing of sample 4 revealed the discrepancy was due to an auto-agglutination reaction of the strain with the sera (<bold>Table <xref ref-type="table" rid="T1">1</xref></bold>).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Summary of mismatched phenotypic and genotypic results.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left">Reference no.</th>
<th valign="top" align="center">Sample no.</th>
<th valign="top" align="center">Phenotypic serotype</th>
<th valign="top" align="center">PCR serotype</th>
<th valign="top" align="center">WGS serotype</th>
<th valign="top" align="left">Explanation of mismatch</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">SRR4787737</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1a</td>
<td valign="top" align="center">1b</td>
<td valign="top" align="center">1b</td>
<td valign="top" align="left"><italic>OacIb</italic> contains an early stop codon at position 128, resulting in the 1a phenotype</td>
</tr>
<tr>
<td valign="top" align="left">SRR4786841</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">X variant/Y variant</td>
<td valign="top" align="center">3a</td>
<td valign="top" align="center">3a</td>
<td valign="top" align="left"><italic>Oac</italic> contains an early stop codon at position 20 resulting in the X phenotype</td>
</tr>
<tr>
<td valign="top" align="left">SRR5018321</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">Y variant</td>
<td valign="top" align="center">2a</td>
<td valign="top" align="center">2a</td>
<td valign="top" align="left">Frameshift mutation (insertion of two A&#x2019;s) at position 1116 rendering <italic>gtrII</italic> non-functional and resulting in the Y phenotype</td>
</tr>
<tr>
<td valign="top" align="left">SRR5017471</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">X variant</td>
<td valign="top" align="center">Y</td>
<td valign="top" align="center">Y</td>
<td valign="top" align="left">Auto agglutination strain resulting in incorrect phenotype</td>
</tr>
<tr>
<td valign="top" align="left">SRR4788187 SRR4897598</td>
<td valign="top" align="center">5&#x2013;6</td>
<td valign="top" align="center">1c</td>
<td valign="top" align="center">1c</td>
<td valign="top" align="center">1a</td>
<td valign="top" align="left">No explanation for the mismatch could be determined</td>
</tr>
<tr>
<td valign="top" align="left">Cluster 2 (13 isolates)</td>
<td valign="top" align="center">7&#x2013;19</td>
<td valign="top" align="center">Negative serology</td>
<td valign="top" align="center">1cv</td>
<td valign="top" align="center">1cv</td>
<td valign="top" align="left">Novel serotype</td>
</tr>
<tr>
<td valign="top" align="left">Cluster 4 (five isolates)</td>
<td valign="top" align="center">20&#x2013;24</td>
<td valign="top" align="center">3a</td>
<td valign="top" align="center">3av</td>
<td valign="top" align="center">3av</td>
<td valign="top" align="left">Novel serotype</td></tr>
</tbody></table>
<table-wrap-foot>
<attrib><italic>Clusters 2 and 4 contain multiple isolates, please see <bold>Supplementary Table <xref ref-type="supplementary-material" rid="SM2">S2</xref></bold> for the Reference numbers.</italic></attrib>
</table-wrap-foot>
</table-wrap>
<p>The remaining mismatches were novel serotype gene profiles detected by WGS, both associated with outbreaks (Cluster 2, <italic>n</italic> = 13; Cluster 4, <italic>n</italic> = 5) (<bold>Tables <xref ref-type="table" rid="T1">1</xref>, <xref ref-type="table" rid="T2">2</xref></bold>). Cluster 2 comprised 13 isolates associated with a local community outbreak. The isolates failed to agglutinate any of the serotype specific <italic>S. flexneri</italic> antisera and were positive for <italic>wzx1-5</italic> and <italic>gtrIc</italic> gene targets in the PCR but were negative for <italic>gtrI</italic>. Typical strains of serotype 1c are positive for <italic>wzx1, gtrI</italic>, and <italic>gtrIc</italic> (<bold>Supplementary Table <xref ref-type="supplementary-material" rid="SM1">S1</xref></bold>). These strains were designated 1c variants (1cv). Cluster 4 comprised five isolates of <italic>S. flexneri</italic> associated with an outbreak of gastrointestinal symptoms in five captive chimpanzees phenotypically identified as serotype 3a. The isolates had <italic>wzx1-5, oac</italic>, and <italic>grtX</italic> but were also positive for <italic>gtrII</italic> and were designated 3a variant (3av).</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Summary of Clusters detected by WGS.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left">Cluster no.</th>
<th valign="top" align="center">No. cases</th>
<th valign="top" align="center">Travel</th>
<th valign="top" align="center">M:F ratio</th>
<th valign="top" align="center">Age range</th>
<th valign="top" align="center">Serotype</th>
<th valign="top" align="left">Epidemiological context/transmission route</th>
<th valign="top" align="center">SNP address</th>
<th valign="top" align="center">Minimum SNP difference</th>
<th valign="top" align="center">Maximum SNP difference</th>
<th valign="top" align="center">Median SNP difference</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="center">121</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">119:0<sup>&#x2217;</sup></td>
<td valign="top" align="center">20&#x2013;63</td>
<td valign="top" align="center">2a</td>
<td valign="top" align="left">MSM</td>
<td valign="top" align="center">34.42.42.42.#</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">40&#x02C6;</td>
<td valign="top" align="center">17</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="center">13</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">4:9</td>
<td valign="top" align="center">23&#x2013;96</td>
<td valign="top" align="center">$1cv</td>
<td valign="top" align="left">Community</td>
<td valign="top" align="center">3.45.45.46.46.46.47</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">3:4</td>
<td valign="top" align="center">21&#x2013;53</td>
<td valign="top" align="center">2b</td>
<td valign="top" align="left">Restaurant</td>
<td valign="top" align="center">8.9.9.138.140.154.167</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">4</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">N/A</td>
<td valign="top" align="center">N/A</td>
<td valign="top" align="center">N/A</td>
<td valign="top" align="center">$3av</td>
<td valign="top" align="left">Captive chimpanzees</td>
<td valign="top" align="center">45.130.197.328.353.405.#</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">2.5</td></tr>
</tbody></table>
<table-wrap-foot>
<attrib><italic>Table summarising the four clusters (containing five or more isolates) detected between 1st August 2015 and 18th January 2016. SNP address: hierarchical single linkage clustering performed on the pairwise SNP difference between all isolates at descending distance thresholds (250, 100, 50, 25, 10, 5, 0). Each number of the SNP address represents the cluster membership at that threshold level. SNP addresses with the &#x2018;#&#x2019; suffix match all subsequent clusters within that SNP address. &#x02C6; SNP distance corrected for recombination. <sup>&#x2217;</sup>2 cases gender were unknown and not included the ratio. $ determined as variant (v) due to absence of gtrl for 1cv or additional presence of gtrll for 3av.</italic></attrib>
</table-wrap-foot>
</table-wrap>
</sec>
<sec><title>Outbreak Investigation Using SNP Clustering Typing</title>
<p>Analysis of the WGS data organized 161 of the 330 isolates into 36 five SNP single-linkage clusters of two or more isolates associated with CC245 and one cluster in CC145. The median number of cases in these clusters was 2 and ranged from 2 to 27; 32 (89%) clusters investigated comprised less than 5 isolates (<bold>Table <xref ref-type="table" rid="T2">2</xref></bold>). It was not possible to identify epidemiological links associated with these small clusters using only the limited epidemiological data available from laboratory report forms. However, 9/32 (28%) comprised at least one case reporting recent travel abroad prior to onset of symptoms.</p>
<p>During the period of the study, four outbreaks were identified following routine surveillance of local hospital reports of gastrointestinal symptoms caused by <italic>S. flexneri</italic>. SNP typing confirmed that all the isolates belonging to each outbreak cluster were closely related and monophyletic. Cluster 1 was the largest cluster and had a high male to female ratio with the 97% of cases being adult males (for two cases the gender was not stated) (<bold>Table <xref ref-type="table" rid="T2">2</xref></bold>). Isolates from this cluster were observed throughout the study period. The minimum SNP distance between isolates in this cluster was zero with the median distance 17. The maximum SNP distance between any two isolates was 64, however the majority of these SNPs were caused by a transposase mediated recombination of <italic>pic</italic>, encoding a serine protease. This demographic of adult males has previously been shown to be characteristic of clusters linked to sexual transmission among the MSM community (<xref ref-type="bibr" rid="B5">Borg et al., 2012</xref>; <xref ref-type="bibr" rid="B12">Gilbart et al., 2015</xref>; <xref ref-type="bibr" rid="B2">Baker et al., 2015</xref>; <xref ref-type="bibr" rid="B21">Simms et al., 2015</xref>). Cluster 1 was part of larger outbreak of <italic>S. flexneri</italic> serotype 2a previously described by <xref ref-type="bibr" rid="B21">Simms et al. (2015)</xref>.</p>
<p>Clusters 2 and 3 were community outbreaks and cases were geographically linked. They were temporally restricted and genetically homogenous, exhibiting 0&#x2013;1 SNPs difference in the core genome between isolates. Despite a thorough epidemiological investigation, the source and route of transmission associated with Cluster 2 could not be determined. Cluster 3 was linked to consumption of contaminated food at a restaurant, most likely due to an infected food handler. Cluster 4 was associated with an outbreak of gastrointestinal symptoms in a group of captive chimpanzees with a minimum SNP distance between isolates of one and a maximum SNP distance of four.</p>
<p>The <italic>S. flexneri</italic> population structure clusters into seven distinct phylogenetic groups (PGs) (<xref ref-type="bibr" rid="B7">Connor et al., 2015</xref>). Phylogenetic analysis of the diversity of CC245 in the PHE collection is shown in <bold>Figure <xref ref-type="fig" rid="F1">1</xref></bold>. Four out of seven clades are represented by samples received by PHE through routine surveillance. With respect to the clusters described in this study, Clusters 1, 3, and 4 fall within PG3 and Cluster 2 falls within PG1.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>Maximum Likelihood tree showing a single representative from each 10 SNP single linkage cluster (<italic>N</italic> = 156) within the PHE clonal complex 245 database (<italic>N</italic> = 1333). Phylogenetic Groups as defined by <xref ref-type="bibr" rid="B7">Connor et al. (2015)</xref> are listed 1&#x2013;7 and the four outbreak clusters annotated. Refer to <bold>Table <xref ref-type="table" rid="T2">2</xref></bold> for details of the outbreak clusters.</p></caption>
<graphic xlink:href="fmicb-08-01700-g001.tif"/>
</fig>
</sec>
</sec>
<sec><title>Discussion</title>
<p>This study showed WGS to be a robust and reliable method for serotyping <italic>S. flexneri</italic> isolates and provided additional strain discrimination at the SNP level. There was high correlation between phenotypic serotyping and WGS serotyping (93%) thus facilitating the prospective comparison of WGS data with historical phenotypic data and ensuring continuity for monitoring trends in the incidence of different serotypes over time.</p>
<p>In three of the mismatches, the WGS derived serotype was shown to predict serotype based on gene presence which may not be expressed phenotypically. WGS provided insight on the effect of mutations on O-antigen modification genes and potential mechanisms that inactivate phenotypic expression. WGS analysis also identified novel serotype gene profiles associated with two outbreaks.</p>
<p>Serotype is not a robust phylogenetic marker, as the O-antigen synthesis/modification genes are encoded on mobile genetic elements (prophages) (<xref ref-type="bibr" rid="B7">Connor et al., 2015</xref>). Prior to the implementation of WGS, detection of outbreaks of <italic>S. flexneri</italic> at PHE relied on the identification of epidemiological links between cases as serotyping was not discriminatory enough to detect outbreaks during routine surveillance. During this study, SNP typing provided microbiological evidence that the isolates associated with each of the four outbreaks identified were closely related. SNP typing has been used previously to investigate outbreaks of <italic>S. sonnei</italic>, and this study provides further evidence of the utility of this approach (<xref ref-type="bibr" rid="B18">McDonnell et al., 2013</xref>; <xref ref-type="bibr" rid="B10">Dallman et al., 2016</xref>; <xref ref-type="bibr" rid="B20">Mook et al., 2016</xref>; <xref ref-type="bibr" rid="B1">Baker et al., 2017</xref>).</p>
<p>Three of the outbreak clusters were temporally and genetically restricted with less than five SNP differences observed between outbreak isolates. In contrast, an outbreak representing on-going person-to-person transmission within the MSM community exhibited between zero and 64 SNP differences between isolates. <italic>S. flexneri</italic> may become endemic in defined populations or communities, such as religious communities or sexual networks. Over time circulating strains will accumulate mutations leading to an increase in observed SNPs between cases in that network. Similarly, extended transmission provides greater opportunities for horizontal gene transfer in the population and therefore bioinformatic analyses have to be robust to such influxes of variation.</p>
<p>This heterogeneity in genetic conservation between epidemiologically linked cases highlights the need to be flexible with respect to the case cluster definition. Epidemiological data associated with each outbreak, and WGS analyses of the deeper phylogenetic relationship between isolates, should be used in concert to inform outbreak investigations.</p>
<p>The use of WGS for routine surveillance of <italic>S. flexneri</italic> provides reliable and robust data that can be used to monitor trends in the incidence of different serotypes over time. WGS derived serotyping data ensures backward compatibility with historical phenotypic serotyping data. SNP typing can be used to facilitate outbreak investigations in real-time thereby enhancing surveillance strategies and providing the opportunities for implementing rapid public health interventions.</p>
</sec>
<sec><title>Informed Consent</title>
<p>Informed consent was not required as all data in this study was anonymized.</p>
</sec>
<sec><title>Author Contributions</title>
<p>DG and AG performed the DNA extractions, PCR, phenotypic serotyping and identification. MC and CJ implemented the wet lab WGS pipelines and performed analysis. TD, HH, and DG performed bioinformatic analysis. MC and CJ wrote the manuscript and TD, DG, and HH contributed to the manuscript.</p>
</sec>
<sec><title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<fn-group>
<fn fn-type="financial-disclosure">
<p><bold>Funding.</bold> This work was supported by the National Institute for Health Research Health Protection Research Unit in Gastrointestinal Infections (#109524). The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR, the Department of Health or Public Health England.</p></fn>
</fn-group>
<ack>
<p>We would like to thank Vivienne do Nascimento and Dawn Hedges for curation of the serotyping scheme in GBRU.</p>
</ack>
<sec sec-type="supplementary material">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="http://journal.frontiersin.org/article/10.3389/fmicb.2017.01700/full#supplementary-material">http://journal.frontiersin.org/article/10.3389/fmicb.2017.01700/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table_1.docx" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>TABLE S1</label>
<caption><p>Serotype by gene presence.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="Table_1.docx" id="SM3" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_2.XLSX" id="SM2" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>TABLE S2</label>
<caption><p>Sample list and results.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="Table_2.XLSX" id="SM4" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baker</surname> <given-names>K. S.</given-names></name> <name><surname>Campos</surname> <given-names>J.</given-names></name> <name><surname>Pichel</surname> <given-names>M.</given-names></name> <name><surname>Della Gaspera</surname> <given-names>A.</given-names></name> <name><surname>Duarte-Mart&#x00ED;nez</surname> <given-names>F.</given-names></name> <name><surname>Campos-Chac&#x00F3;n</surname> <given-names>E.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>Whole genome sequencing of <italic>Shigella sonnei</italic> through PulseNet Latin America and Caribbean: advancing global surveillance of foodborne illnesses.</article-title> <source><italic>Clin. Microbiol. Infect.</italic></source> <pub-id pub-id-type="doi">10.1016/j.cmi.2017.03.021</pub-id> <comment>[Epub ahead of print]</comment>.</citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baker</surname> <given-names>K. S.</given-names></name> <name><surname>Dallman</surname> <given-names>T. J.</given-names></name> <name><surname>Ashton</surname> <given-names>P. M.</given-names></name> <name><surname>Day</surname> <given-names>M.</given-names></name> <name><surname>Hughes</surname> <given-names>G.</given-names></name> <name><surname>Crook</surname> <given-names>P. D.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Intercontinental dissemination of azithromycin-resistant shigellosis through sexual transmission: a cross-sectional study.</article-title> <source><italic>Lancet Infect. Dis.</italic></source> <volume>15</volume> <fpage>913</fpage>&#x2013;<lpage>921</lpage>. <pub-id pub-id-type="doi">10.1016/S1473-3099(15)00002-X</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bankevich</surname> <given-names>A.</given-names></name> <name><surname>Nurk</surname> <given-names>S.</given-names></name> <name><surname>Antipov</surname> <given-names>D.</given-names></name> <name><surname>Gurevich</surname> <given-names>A. A.</given-names></name> <name><surname>Dvorkin</surname> <given-names>M.</given-names></name> <name><surname>Kulikov</surname> <given-names>A. S.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.</article-title> <source><italic>J. Comput. Biol.</italic></source> <volume>19</volume> <fpage>455</fpage>&#x2013;<lpage>477</lpage>. <pub-id pub-id-type="doi">10.1089/cmb.2012.0021</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bolger</surname> <given-names>A. M.</given-names></name> <name><surname>Lohse</surname> <given-names>M.</given-names></name> <name><surname>Usadel</surname> <given-names>B.</given-names></name></person-group> (<year>2014</year>). <article-title>Trimmomatic: a flexible trimmer for Illumina sequence data.</article-title> <source><italic>Bioinformatics</italic></source> <volume>30</volume> <fpage>2114</fpage>&#x2013;<lpage>2120</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btu170</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Borg</surname> <given-names>M. L.</given-names></name> <name><surname>Modi</surname> <given-names>A.</given-names></name> <name><surname>Tostmann</surname> <given-names>A.</given-names></name> <name><surname>Gobin</surname> <given-names>M.</given-names></name> <name><surname>Cartwright</surname> <given-names>J.</given-names></name> <name><surname>Quigley</surname> <given-names>C.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Ongoing outbreak of <italic>Shigella flexneri</italic> serotype 3a in men who have sex with men in England and Wales, data from 2009&#x2013;2011.</article-title> <source><italic>Euro Surveill</italic></source>. <volume>17</volume>:<issue>20137</issue>.</citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chattaway</surname> <given-names>M. A.</given-names></name> <name><surname>Schaefer</surname> <given-names>U.</given-names></name> <name><surname>Tewolde</surname> <given-names>R.</given-names></name> <name><surname>Dallman</surname> <given-names>T. J.</given-names></name> <name><surname>Jenkins</surname> <given-names>C.</given-names></name></person-group> (<year>2017</year>). <article-title>Identification of <italic>Escherichia coli</italic> and <italic>Shigella</italic> species from whole-genome sequences.</article-title> <source><italic>J. Clin. Microbiol.</italic></source> <volume>55</volume> <fpage>616</fpage>&#x2013;<lpage>623</lpage>. <pub-id pub-id-type="doi">10.1128/JCM.01790-16</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Connor</surname> <given-names>T. R.</given-names></name> <name><surname>Barker</surname> <given-names>C. R.</given-names></name> <name><surname>Baker</surname> <given-names>K. S.</given-names></name> <name><surname>Weill</surname> <given-names>F. X.</given-names></name> <name><surname>Talukder</surname> <given-names>K. A.</given-names></name> <name><surname>Smith</surname> <given-names>A. M.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Species-wide whole genome sequencing reveals historical global spread and recent local persistence in <italic>Shigella flexneri</italic>.</article-title> <source><italic>eLife</italic></source> <volume>4</volume>:<issue>e07335</issue>. <pub-id pub-id-type="doi">10.7554/eLife.07335</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Croucher</surname> <given-names>N. J.</given-names></name> <name><surname>Page</surname> <given-names>A. J.</given-names></name> <name><surname>Connor</surname> <given-names>T. R.</given-names></name> <name><surname>Delaney</surname> <given-names>A. J.</given-names></name> <name><surname>Keane</surname> <given-names>J. A.</given-names></name> <name><surname>Bentley</surname> <given-names>S. D.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>43</volume> <issue>e15</issue>. <pub-id pub-id-type="doi">10.1093/nar/gku1196</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dallman</surname> <given-names>T. J.</given-names></name> <name><surname>Byrne</surname> <given-names>L.</given-names></name> <name><surname>Ashton</surname> <given-names>P. M.</given-names></name> <name><surname>Cowley</surname> <given-names>L. A.</given-names></name> <name><surname>Perry</surname> <given-names>N. T.</given-names></name> <name><surname>Adak</surname> <given-names>G.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Whole-genome sequencing for national surveillance of Shiga toxin-producing <italic>Escherichia coli</italic> O157.</article-title> <source><italic>Clin. Infect. Dis.</italic></source> <volume>61</volume> <fpage>305</fpage>&#x2013;<lpage>312</lpage>. <pub-id pub-id-type="doi">10.1093/cid/civ318</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dallman</surname> <given-names>T. J.</given-names></name> <name><surname>Chattaway</surname> <given-names>M. A.</given-names></name> <name><surname>Mook</surname> <given-names>P.</given-names></name> <name><surname>Godbole</surname> <given-names>G.</given-names></name> <name><surname>Crook</surname> <given-names>P. D.</given-names></name> <name><surname>Jenkins</surname> <given-names>C.</given-names></name></person-group> (<year>2016</year>). <article-title>Use of whole-genome sequencing for the public health surveillance of <italic>Shigella sonnei</italic> in England and Wales, 2015.</article-title> <source><italic>J. Med. Microbiol.</italic></source> <volume>65</volume> <fpage>882</fpage>&#x2013;<lpage>884</lpage>. <pub-id pub-id-type="doi">10.1099/jmm.0.000296</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gentle</surname> <given-names>A.</given-names></name> <name><surname>Ashton</surname> <given-names>P. M.</given-names></name> <name><surname>Dallman</surname> <given-names>T. J.</given-names></name> <name><surname>Jenkins</surname> <given-names>C.</given-names></name></person-group> (<year>2016</year>). <article-title>Evaluation of molecular methods for serotyping <italic>Shigella flexneri</italic>.</article-title> <source><italic>J. Clin. Microbiol.</italic></source> <volume>54</volume> <fpage>1456</fpage>&#x2013;<lpage>1461</lpage>. <pub-id pub-id-type="doi">10.1128/JCM.03386-15</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gilbart</surname> <given-names>V. L.</given-names></name> <name><surname>Simms</surname> <given-names>I.</given-names></name> <name><surname>Jenkins</surname> <given-names>C.</given-names></name> <name><surname>Furegato</surname> <given-names>M.</given-names></name> <name><surname>Gobin</surname> <given-names>M.</given-names></name> <name><surname>Oliver</surname> <given-names>I.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Sex, drugs and smart phone applications: findings from semistructured interviews with men who have sex with men diagnosed with <italic>Shigella flexneri</italic> 3a in England and Wales.</article-title> <source><italic>Sex Transm. Infect.</italic></source> <volume>91</volume> <fpage>598</fpage>&#x2013;<lpage>602</lpage>. <pub-id pub-id-type="doi">10.1136/sextrans-2015-052014</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gross</surname> <given-names>R. J.</given-names></name> <name><surname>Rowe</surname> <given-names>B.</given-names></name></person-group> (<year>1985</year>). <article-title>&#x201C;Serotyping of <italic>Escherichia coli</italic>,&#x201D; in</article-title> <source><italic>The Virulence of Escherichia coli</italic></source> <role>ed.</role> <person-group person-group-type="editor"><name><surname>Sussman</surname> <given-names>M.</given-names></name></person-group> (<publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>) <fpage>345</fpage>&#x2013;<lpage>360</lpage>.</citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kotloff</surname> <given-names>K. L.</given-names></name> <name><surname>Winickoff</surname> <given-names>J. P.</given-names></name> <name><surname>Ivanoff</surname> <given-names>B.</given-names></name> <name><surname>Clemens</surname> <given-names>J. D.</given-names></name> <name><surname>Swerdlow</surname> <given-names>D. L.</given-names></name> <name><surname>Sansonetti</surname> <given-names>P. J.</given-names></name><etal/></person-group> (<year>1999</year>). <article-title>Global burden of <italic>Shigella</italic> infections: implications for vaccine development and implementation of control strategies.</article-title> <source><italic>Bull. World Health Organ.</italic></source> <volume>77</volume> <fpage>651</fpage>&#x2013;<lpage>666</lpage>.</citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Langmead</surname> <given-names>B.</given-names></name> <name><surname>Salzberg</surname> <given-names>S. L.</given-names></name></person-group> (<year>2012</year>). <article-title>Fast gapped-read alignment with Bowtie 2.</article-title> <source><italic>Nat. Methods</italic></source> <volume>9</volume> <fpage>357</fpage>&#x2013;<lpage>359</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.1923</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Durbin</surname> <given-names>R.</given-names></name></person-group> (<year>2010</year>). <article-title>Fast and accurate long-read alignment with Burrows-Wheeler transform.</article-title> <source><italic>Bioinformatics</italic></source> <volume>26</volume> <fpage>589</fpage>&#x2013;<lpage>595</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btp698</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Handsaker</surname> <given-names>B.</given-names></name> <name><surname>Wysoker</surname> <given-names>A.</given-names></name> <name><surname>Fennell</surname> <given-names>T.</given-names></name> <name><surname>Ruan</surname> <given-names>J.</given-names></name> <name><surname>Homer</surname> <given-names>N.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>The sequence alignment/map format and SAMtools.</article-title> <source><italic>Bioinformatics</italic></source> <volume>25</volume> <fpage>2078</fpage>&#x2013;<lpage>2079</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btp352</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McDonnell</surname> <given-names>J.</given-names></name> <name><surname>Dallman</surname> <given-names>T.</given-names></name> <name><surname>Atkin</surname> <given-names>S.</given-names></name> <name><surname>Turbitt</surname> <given-names>D. A.</given-names></name> <name><surname>Connor</surname> <given-names>T. R.</given-names></name> <name><surname>Grant</surname> <given-names>K. A.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Retrospective analysis of whole genome sequencing compared to prospective typing data in further informing the epidemiological investigation of an outbreak of <italic>Shigella sonnei</italic> in the UK.</article-title> <source><italic>Epidemiol. Infect.</italic></source> <volume>141</volume> <fpage>2568</fpage>&#x2013;<lpage>2575</lpage>. <pub-id pub-id-type="doi">10.1017/S0950268813000137</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McKenna</surname> <given-names>A.</given-names></name> <name><surname>Hanna</surname> <given-names>M.</given-names></name> <name><surname>Banks</surname> <given-names>E.</given-names></name> <name><surname>Sivachenko</surname> <given-names>A.</given-names></name> <name><surname>Cibulskis</surname> <given-names>K.</given-names></name> <name><surname>Kernytsky</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2010</year>). <article-title>The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.</article-title> <source><italic>Genome Res.</italic></source> <volume>20</volume> <fpage>1297</fpage>&#x2013;<lpage>1303</lpage>. <pub-id pub-id-type="doi">10.1101/gr.107524.110</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mook</surname> <given-names>P.</given-names></name> <name><surname>McCormick</surname> <given-names>J.</given-names></name> <name><surname>Bains</surname> <given-names>M.</given-names></name> <name><surname>Cowley</surname> <given-names>L. A.</given-names></name> <name><surname>Chattaway</surname> <given-names>M. A.</given-names></name> <name><surname>Jenkins</surname> <given-names>C.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>ESBL-producing and macrolide-resistant <italic>Shigella sonnei</italic> infections among men who have sex with men, England, 2015.</article-title> <source><italic>Emerg. Infect. Dis.</italic></source> <volume>22</volume> <fpage>1948</fpage>&#x2013;<lpage>1952</lpage>. <pub-id pub-id-type="doi">10.3201/eid2211.160653</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Simms</surname> <given-names>I.</given-names></name> <name><surname>Field</surname> <given-names>N.</given-names></name> <name><surname>Jenkins</surname> <given-names>C.</given-names></name> <name><surname>Childs</surname> <given-names>T.</given-names></name> <name><surname>Gilbart</surname> <given-names>V. L.</given-names></name> <name><surname>Dallman</surname> <given-names>T. J.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Intensified shigellosis epidemic associated with sexual transmission in men who have sex with men&#x2013;<italic>Shigella flexneri</italic> and <italic>S. sonnei</italic> in England, 2004 to end of February 2015.</article-title> <source><italic>Euro Surveill.</italic></source> <volume>20</volume> <issue>21097</issue>.</citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stamatakis</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies.</article-title> <source><italic>Bioinformatics</italic></source> <volume>30</volume> <fpage>1312</fpage>&#x2013;<lpage>1313</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btu033</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sun</surname> <given-names>Q.</given-names></name> <name><surname>Knirel</surname> <given-names>Y. A.</given-names></name> <name><surname>Lan</surname> <given-names>R.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name> <name><surname>Senchenkova</surname> <given-names>S. N.</given-names></name> <name><surname>Jin</surname> <given-names>D.</given-names></name><etal/></person-group> (<year>2012a</year>). <article-title>A novel plasmid-encoded serotype conversion mechanism through addition of phosphoethanolamine to the O-antigen of <italic>Shigella flexneri</italic>.</article-title> <source><italic>PLOS ONE</italic></source> <volume>7</volume>:<issue>e46095</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0046095</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sun</surname> <given-names>Q.</given-names></name> <name><surname>Lan</surname> <given-names>R.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name> <name><surname>Xia</surname> <given-names>S.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name><etal/></person-group> (<year>2012b</year>). <article-title>Identification of a divergent <italic>O</italic>-acetyltransferase gene <italic>oac</italic><sub>1b</sub> from <italic>Shigella flexneri</italic> serotype 1b strains.</article-title> <source><italic>Emerg. Microbes Infect.</italic></source> <volume>1</volume> <issue>e21</issue>. <pub-id pub-id-type="doi">10.1038/emi.2012.22</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sun</surname> <given-names>Q.</given-names></name> <name><surname>Lan</surname> <given-names>R.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Zhao</surname> <given-names>A.</given-names></name> <name><surname>Zhang</surname> <given-names>S.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>Development of a multiplex PCR assay targeting O-antigen modification genes for molecular serotyping of <italic>Shigella flexneri</italic>.</article-title> <source><italic>J. Clin. Microbiol.</italic></source> <volume>49</volume> <fpage>3766</fpage>&#x2013;<lpage>3770</lpage>. <pub-id pub-id-type="doi">10.1128/JCM.01259-11</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>von Seidlein</surname> <given-names>L.</given-names></name> <name><surname>Kim</surname> <given-names>D. R.</given-names></name> <name><surname>Ali</surname> <given-names>M.</given-names></name> <name><surname>Lee</surname> <given-names>H.</given-names></name> <name><surname>Wang</surname> <given-names>X.</given-names></name> <name><surname>Thiem</surname> <given-names>V. D.</given-names></name><etal/></person-group> (<year>2006</year>). <article-title>A multicentre study of <italic>Shigella</italic> diarrhoea in six Asian countries: disease burden, clinical manifestations, and microbiology.</article-title> <source><italic>PLOS Med.</italic></source> <volume>3</volume>:<issue>e353</issue>. <pub-id pub-id-type="doi">10.1371/journal.pmed.0030353</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wei</surname> <given-names>J.</given-names></name> <name><surname>Goldberg</surname> <given-names>M. B.</given-names></name> <name><surname>Burland</surname> <given-names>V.</given-names></name> <name><surname>Venkatesan</surname> <given-names>M. M.</given-names></name> <name><surname>Deng</surname> <given-names>W.</given-names></name> <name><surname>Fournier</surname> <given-names>G.</given-names></name><etal/></person-group> (<year>2003</year>). <article-title>Complete genome sequence and comparative genomics of <italic>Shigella flexneri</italic> serotype 2a strain 2457T.</article-title> <source><italic>Infect. Immun.</italic></source> <volume>71</volume> <fpage>2775</fpage>&#x2013;<lpage>2786</lpage>. <pub-id pub-id-type="doi">10.1128/IAI.71.5.2775-2786.2003</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wirth</surname> <given-names>T.</given-names></name> <name><surname>Falush</surname> <given-names>D.</given-names></name> <name><surname>Lan</surname> <given-names>R.</given-names></name> <name><surname>Colles</surname> <given-names>F.</given-names></name> <name><surname>Mensa</surname> <given-names>P.</given-names></name> <name><surname>Wieler</surname> <given-names>L. H.</given-names></name><etal/></person-group> (<year>2006</year>). <article-title>Sex and virulence in <italic>Escherichia coli</italic>: an evolutionary perspective.</article-title> <source><italic>Mol. Microbiol.</italic></source> <volume>60</volume> <fpage>1136</fpage>&#x2013;<lpage>1151</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-2958.2006.05172.x</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>S. X.</given-names></name> <name><surname>Wang</surname> <given-names>J. P.</given-names></name> <name><surname>Jin</surname> <given-names>D.</given-names></name> <name><surname>Chen</surname> <given-names>D. L.</given-names></name> <name><surname>Liu</surname> <given-names>K.</given-names></name> <name><surname>Wang</surname> <given-names>Y. T.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Development of a singleplex PCR assay targeting O-antigen modification genes for molecular serotyping of <italic>Shigella flexneri</italic>.</article-title> <source><italic>Zhonghua Liu Xing Bing Xue Za Zhi</italic></source> <volume>33</volume> <fpage>328</fpage>&#x2013;<lpage>331</lpage>.</citation></ref>
</ref-list>
</back>
</article>