<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Cell. Infect. Microbiol.</journal-id>
<journal-title>Frontiers in Cellular and Infection Microbiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Cell. Infect. Microbiol.</abbrev-journal-title>
<issn pub-type="epub">2235-2988</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fcimb.2021.772574</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Cellular and Infection Microbiology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Improved Genomic Identification, Clustering, and Serotyping of Shiga Toxin-Producing <italic>Escherichia coli</italic> Using Cluster/Serotype-Specific Gene Markers</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Zhang</surname>
<given-names>Xiaomei</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/662079"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Payne</surname>
<given-names>Michael</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/663938"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Kaur</surname>
<given-names>Sandeep</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/1259291"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Lan</surname>
<given-names>Ruiting</given-names>
</name>
<xref ref-type="author-notes" rid="fn001">
<sup>*</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/94718"/>
</contrib>
</contrib-group>
<aff id="aff1">
<institution>School of Biotechnology and Biomolecular Sciences, University of New South Wales</institution>, <addr-line>Sydney, NSW</addr-line>, <country>Australia</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>Edited by: Floriana Campanile, University of Catania, Italy</p>
</fn>
<fn fn-type="edited-by">
<p>Reviewed by: Roberto Mauricio Vidal, University of Chile, Chile; Karen Keddy, South African Medical Research Council, South Africa; Joseph M. Bosilevac, U.S. Meat Animal Research Center, Agricultural Research Service (USDA), United States</p>
</fn>
<fn fn-type="corresp" id="fn001">
<p>*Correspondence: Ruiting Lan, <email xlink:href="mailto:r.lan@unsw.edu.au">r.lan@unsw.edu.au</email>
</p>
</fn>
<fn fn-type="other" id="fn002">
<p>This article was submitted to Clinical Microbiology, a section of the journal Frontiers in Cellular and Infection Microbiology</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>10</day>
<month>01</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>11</volume>
<elocation-id>772574</elocation-id>
<history>
<date date-type="received">
<day>08</day>
<month>09</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>03</day>
<month>12</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2022 Zhang, Payne, Kaur and Lan</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Zhang, Payne, Kaur and Lan</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>Shiga toxin-producing <italic>Escherichia coli</italic> (STEC) have more than 470 serotypes. The well-known STEC O157:H7 serotype is a leading cause of STEC infections in humans. However, the incidence of non-O157:H7 STEC serotypes associated with foodborne outbreaks and human infections has increased in recent years. Current detection and serotyping assays are focusing on O157 and top six (&#x201c;Big six&#x201d;) non-O157 STEC serogroups. In this study, we performed phylogenetic analysis of nearly 41,000 publicly available STEC genomes representing 460 different STEC serotypes and identified 19 major and 229 minor STEC clusters. STEC cluster-specific gene markers were then identified through comparative genomic analysis. We further identified serotype-specific gene markers for the top 10 most frequent non-O157:H7 STEC serotypes. The cluster or serotype specific gene markers had 99.54% accuracy and more than 97.25% specificity when tested using 38,534 STEC and 14,216 non-STEC <italic>E. coli</italic> genomes, respectively. In addition, we developed a freely available <italic>in silico</italic> serotyping pipeline named STECFinder that combined these robust gene markers with established <italic>E. coli</italic> serotype specific O and H antigen genes and <italic>stx</italic> genes for accurate identification, cluster determination and serotyping of STEC. STECFinder can assign 99.85% and 99.83% of 38,534 STEC isolates to STEC clusters using assembled genomes and Illumina reads respectively and can simultaneously predict <italic>stx</italic> subtypes and STEC serotypes. Using shotgun metagenomic sequencing reads of STEC spiked food samples from a published study, we demonstrated that STECFinder can detect the spiked STEC serotypes, accurately. The cluster/serotype-specific gene markers could also be adapted for culture independent typing, facilitating rapid STEC typing. STECFinder is available as an installable package (<uri xlink:href="https://github.com/LanLab/STECFinder">https://github.com/LanLab/STECFinder</uri>) and will be useful for <italic>in silico</italic> STEC cluster identification and serotyping using genome data.</p>
</abstract>
<kwd-group>
<kwd>STEC O157:H7</kwd>
<kwd>non-O157:H7 STEC serotypes</kwd>
<kwd>STEC phylogenetic clusters</kwd>
<kwd>cluster/serotype-specific gene markers</kwd>
<kwd>STEC serotyping</kwd>
<kwd>
<italic>in silico</italic> STEC tying pipeline STECFinder</kwd>
<kwd>metagenomics</kwd>
</kwd-group>
<counts>
<fig-count count="4"/>
<table-count count="3"/>
<equation-count count="0"/>
<ref-count count="101"/>
<page-count count="14"/>
<word-count count="7928"/>
</counts>
</article-meta>
</front>
<body>
<sec id="s1" sec-type="intro">
<title>Introduction</title>
<p>Shiga toxin-producing <italic>Escherichia coli</italic> (STEC) are an important cause of foodborne disease worldwide (<xref ref-type="bibr" rid="B88">Tuttle et&#xa0;al., 1999</xref>; <xref ref-type="bibr" rid="B87">Teunis et&#xa0;al., 2008</xref>; <xref ref-type="bibr" rid="B92">World Health Organization, 2019</xref>). STEC causes human infections ranging from mild non-bloody diarrhea to haemorrhagic colitis (HC), haemolytic uraemic syndrome (HUS), end-stage renal disease (ESRD) and death (<xref ref-type="bibr" rid="B77">Paton and Paton, 1998</xref>; <xref ref-type="bibr" rid="B85">Tarr et&#xa0;al., 2005</xref>; <xref ref-type="bibr" rid="B28">Gould et&#xa0;al., 2009</xref>). Globally, an estimated 2.8 million STEC infections resulted in 3,890 cases of HUS, 270 cases of ESRD and 230 deaths in 2010 (<xref ref-type="bibr" rid="B58">Majowicz et&#xa0;al., 2014</xref>). Importantly, STEC infections were more frequent and severe in children younger than five years old (<xref ref-type="bibr" rid="B28">Gould et&#xa0;al., 2009</xref>; <xref ref-type="bibr" rid="B13">Buvens et&#xa0;al., 2012</xref>; <xref ref-type="bibr" rid="B56">Lozer et&#xa0;al., 2013</xref>).</p>    <p>Currently, there are over 470 STEC serotypes recognized based on <italic>E. coli</italic> O antigen (determination of O serogroup) and H (flagellar) antigen typing (<xref ref-type="bibr" rid="B33">Gyles, 2007</xref>; <xref ref-type="bibr" rid="B64">Mora et&#xa0;al., 2011</xref>; <xref ref-type="bibr" rid="B57">Ludwig et&#xa0;al., 2020</xref>). More than 130 STEC serotypes are associated with human STEC infections (<xref ref-type="bibr" rid="B41">Johnson et&#xa0;al., 1996</xref>; <xref ref-type="bibr" rid="B5">Bettelheim, 2000</xref>; <xref ref-type="bibr" rid="B42">Johnson et&#xa0;al., 2006</xref>; <xref ref-type="bibr" rid="B89">Valilis et&#xa0;al., 2018</xref>). STEC O157:H7 is the most frequent STEC serotype associated with foodborne outbreaks and human infections (<xref ref-type="bibr" rid="B5">Bettelheim, 2000</xref>; <xref ref-type="bibr" rid="B80">Qin et&#xa0;al., 2015</xref>; <xref ref-type="bibr" rid="B52">Li et&#xa0;al., 2017</xref>). However, other non-O157:H7 STEC serotypes have also been a major cause of foodborne outbreaks and sporadic cases, and are responsible for up to 50% STEC infections in recent years (<xref ref-type="bibr" rid="B78">Paton et&#xa0;al., 1999</xref>; <xref ref-type="bibr" rid="B59">McCarthy et&#xa0;al., 2001</xref>; <xref ref-type="bibr" rid="B74">Paciorek, 2002</xref>; <xref ref-type="bibr" rid="B54">Lipt&#xe1;kov&#xe1; et&#xa0;al., 2005</xref>; <xref ref-type="bibr" rid="B42">Johnson et&#xa0;al., 2006</xref>; <xref ref-type="bibr" rid="B96">Zhang et&#xa0;al., 2007</xref>; <xref ref-type="bibr" rid="B19">European Food Safety Authority, 2011</xref>; <xref ref-type="bibr" rid="B22">Frank et&#xa0;al., 2011a</xref>; <xref ref-type="bibr" rid="B46">K&#xe4;ppeli et&#xa0;al., 2011</xref>; <xref ref-type="bibr" rid="B90">Verstraete et&#xa0;al., 2013</xref>; <xref ref-type="bibr" rid="B101">Zweifel et&#xa0;al., 2013</xref>; <xref ref-type="bibr" rid="B66">Morton et&#xa0;al., 2017</xref>). Among STEC non-O157:H7 serotypes, six serogroups O26, O45, O103, O111, O121 and O45, also known as &#x201c;The Big six&#x201d; (comprising nine serotypes: O26:H11/H-; O45:H2; O103:H2, H11, H25; O111:H8/H-; O121:H19 or H7; and O145:H28/H-) account for over 70% of non-O157:H7 STEC infections (<xref ref-type="bibr" rid="B11">Brooks et&#xa0;al., 2005</xref>; <xref ref-type="bibr" rid="B35">Hedican et&#xa0;al., 2009</xref>; <xref ref-type="bibr" rid="B8">Bosilevac and Koohmaraie, 2011</xref>).</p>
<p>Shiga toxin (Stx) is the main characteristic that defines STEC (<xref ref-type="bibr" rid="B68">Nataro and Kaper, 1998</xref>; <xref ref-type="bibr" rid="B85">Tarr et&#xa0;al., 2005</xref>), which is encoded by <italic>stx</italic> genes located within lambdoid prophages (Stx-converting phages or Stx-phages) (<xref ref-type="bibr" rid="B72">O'Brien et&#xa0;al., 1989</xref>; <xref ref-type="bibr" rid="B63">Mizutani et&#xa0;al., 1999</xref>; <xref ref-type="bibr" rid="B12">Bryan et&#xa0;al., 2015</xref>; <xref ref-type="bibr" rid="B48">Lacher et&#xa0;al., 2016</xref>). Shiga toxins are classified into two types, Stx1 and Stx2. Each of Stx type comprises several subtypes with three subtypes for Stx1 (Stx1a, Stx1c and Stx1d) and 10 subtypes for Stx2 (Stx2a, Stx2b, Stx2c, Stx2d, Stx2e, Stx2f, Stx2g, Stx2h, Stx2i and Stx2k) (<xref ref-type="bibr" rid="B81">Scheutz et&#xa0;al., 2012</xref>; <xref ref-type="bibr" rid="B48">Lacher et&#xa0;al., 2016</xref>; <xref ref-type="bibr" rid="B1">Bai et&#xa0;al., 2018</xref>; <xref ref-type="bibr" rid="B93">Yang et&#xa0;al., 2020</xref>). Stx1 and/or Stx2 carrying STEC can cause human disease, however, Stx2 is more often associated with HC and HUS (<xref ref-type="bibr" rid="B49">Lentz et&#xa0;al., 2011</xref>; <xref ref-type="bibr" rid="B47">Kr&#xfc;ger and Lucchesi, 2015</xref>). Among Stx2 subtypes, Stx2a is the most prevalent subtype association with severe disease, followed by Stx2c and Stx2d (<xref ref-type="bibr" rid="B20">Feng and Reddy, 2013</xref>; <xref ref-type="bibr" rid="B61">Melton-Celsa, 2014</xref>; <xref ref-type="bibr" rid="B47">Kr&#xfc;ger and Lucchesi, 2015</xref>). <italic>Shigella dystenteriae</italic> and some strains of <italic>Shigella sonnei</italic>, <italic>Shigella flexneri</italic> and <italic>E. albertii</italic> also produce Stx (<xref ref-type="bibr" rid="B6">Beutin et&#xa0;al., 1999</xref>; <xref ref-type="bibr" rid="B31">Gupta et&#xa0;al., 2007</xref>; <xref ref-type="bibr" rid="B73">Ooka et&#xa0;al., 2012</xref>; <xref ref-type="bibr" rid="B29">Gray et&#xa0;al., 2014</xref>; <xref ref-type="bibr" rid="B67">Murakami et&#xa0;al., 2014</xref>; <xref ref-type="bibr" rid="B9">Brandal et&#xa0;al., 2015</xref>). In addition to Shiga toxin, some STEC serotypes also carry the locus of enterocyte effacement (LEE) pathogenicity island (<xref ref-type="bibr" rid="B60">McDaniel and Kaper, 1997</xref>; <xref ref-type="bibr" rid="B45">Kaper et&#xa0;al., 2004</xref>) responsible for adherence during STEC infections.</p>
<p>STEC serotype detection and identification rely on the detection of Stx proteins by enzyme immune assays or detection of the presence of <italic>stx</italic> genes by molecular methods such as PCR (<xref ref-type="bibr" rid="B10">Brian et&#xa0;al., 1992</xref>; <xref ref-type="bibr" rid="B62">Milley and Sekla, 1993</xref>; <xref ref-type="bibr" rid="B4">B&#xe9;langer et&#xa0;al., 2002</xref>; <xref ref-type="bibr" rid="B34">Hara-Kudo et&#xa0;al., 2007</xref>; <xref ref-type="bibr" rid="B86">Teel et&#xa0;al., 2007</xref>; <xref ref-type="bibr" rid="B94">Zhang et&#xa0;al., 2012</xref>). Conventional phenotypic serotyping through antigenic agglutination can further classify STEC to the serotype level (<xref ref-type="bibr" rid="B33">Gyles, 2007</xref>). However, cross-reactivity, lack of expression of O antigens, a focus on STEC O157:H7 and novel serotypes may all prevent accurate serotyping and lead to under-detection of non-O157:H7 STEC (<xref ref-type="bibr" rid="B55">Liu et&#xa0;al., 2008</xref>; <xref ref-type="bibr" rid="B84">Stigi et&#xa0;al., 2012</xref>). Molecular methods, including microarrays, utilising the sequence variations in the O antigen gene clusters, have been developed to serotype STEC O157:H7, &#x201c;Big six&#x201d; STEC non-O157:H7 and other STEC serotypes (<xref ref-type="bibr" rid="B17">DebRoy et&#xa0;al., 2004</xref>; <xref ref-type="bibr" rid="B26">Gonzales et&#xa0;al., 2011</xref>; <xref ref-type="bibr" rid="B53">Lin et&#xa0;al., 2011</xref>; <xref ref-type="bibr" rid="B71">Norman et&#xa0;al., 2012</xref>; <xref ref-type="bibr" rid="B37">Iguchi et&#xa0;al., 2015</xref>; <xref ref-type="bibr" rid="B57">Ludwig et&#xa0;al., 2020</xref>). More recently, WGS based methods have been developed for <italic>in silico</italic> serotyping STEC, which allow phenotypically untypeable isolates be serotyped <italic>in silico</italic> using O antigen and flagellin H antigen genes (<xref ref-type="bibr" rid="B38">Inouye et&#xa0;al., 2014</xref>; <xref ref-type="bibr" rid="B40">Joensen et&#xa0;al., 2015</xref>).</p>
<p>Alongside STEC serotyping which is useful in outbreak investigation and for prevalence surveillance (<xref ref-type="bibr" rid="B30">FAO/WHO STEC EXPERT GROUP, 2019</xref>), other subtyping methods such as pulsed-field gel electrophoresis (PFGE), multiple locus variable-number tandem repeat analysis (MLVA) and multilocus sequence typing (MLST) were also used for STEC outbreak investigations (<xref ref-type="bibr" rid="B25">Gerner-Smidt et&#xa0;al., 2006</xref>; <xref ref-type="bibr" rid="B33">Gyles, 2007</xref>; <xref ref-type="bibr" rid="B23">Frank et&#xa0;al., 2011b</xref>). Recently, WGS based typing and metagenomic sequencing have shown great potential for STEC surveillance and outbreak investigation with high resolution and specificity (<xref ref-type="bibr" rid="B50">Leonard et&#xa0;al., 2015</xref>; <xref ref-type="bibr" rid="B76">Parsons et&#xa0;al., 2016</xref>).</p>
<p>STEC serotypes with the same O and H antigens were generally clustered together and share a common ancestor (<xref ref-type="bibr" rid="B44">Ju et&#xa0;al., 2012</xref>). A recent phylogenetic analysis on 276 STEC isolates belonging to 81 serotypes revealed that some STECs formed discrete clades with clustering associated with sequence types and serotypes (<xref ref-type="bibr" rid="B27">Gonz&#xe1;lez-Escalona and Kase, 2019</xref>). Our present study aimed to i), identify phylogenetic clusters of STEC through large scale examination of publicly available genomes; ii), identify cluster/serotype-specific genes for detection of STEC isolates and for detection and serotyping of most frequent STEC serotypes through comparative genomic analysis of accessory genomes; iii), develop an automated pipeline for STEC <italic>in silico</italic> cluster typing and serotyping from WGS data based on cluster/serotype-specific gene markers combined with <italic>E. coli</italic> O and H antigen genes.</p>
</sec>
<sec id="s2" sec-type="materials|methods">
<title>Materials and Methods</title>
<sec id="s2_1">
<title>Identification of STEC Isolates From NCBI Database</title>
<p>
<italic>E. coli</italic> isolates from the NCBI SRA (National Center for Biotechnology Information Sequence Read Archive) in June of 2020 were queried. The keyword &#x201c;<italic>Escherichia coli</italic>&#x201d; was used to retrieve SRA accession numbers of <italic>E. coli</italic> isolates. Raw reads were retrieved from ENA (European Nucleotide Archive). The <italic>stx</italic> genes (<italic>stx<sub>1</sub>
</italic>, GenBank accession number M19437; <italic>stx<sub>2</sub>
</italic> GenBank accession number X07865) and <italic>ipaH</italic> gene (GenBank accession number M32063) were used to screen <italic>E. coli</italic> reads using Salmon v0.13.0 (<xref ref-type="bibr" rid="B79">Patro et&#xa0;al., 2017</xref>). Taxonomic classification for <italic>E. coli</italic> was confirmed by Kraken v1.1.1 (<xref ref-type="bibr" rid="B91">Wood and Salzberg, 2014</xref>). Isolates that were positive to any <italic>stx</italic> genes and negative to the <italic>ipaH</italic> gene (the latter to exclude <italic>Shigella</italic> or enteroinvasive <italic>E. coli</italic> [EIEC]) were selected to form the STEC dataset.</p>
<p>A control dataset that represented the sequence types (STs) and ribosomal STs (rSTs) of <italic>stx</italic> negative <italic>E. coli</italic> (&#x201c;non-STEC&#x201d;) isolates was constructed. STs and rSTs of non-STEC isolates were obtained from the <italic>E. coli/Shigella</italic> database in Enterobase on August 2020 (<xref ref-type="bibr" rid="B99">Zhou et&#xa0;al., 2020</xref>). For STs and rSTs with only one isolate, the isolate was selected. For STs and rSTs with more than one isolate, one representative isolate for each ST and rST were randomly selected. In total, 14,126 <italic>stx</italic>-negative <italic>E. coli</italic> isolates representing 4,354 STs and 11,520 rSTs were selected as non-STEC control database.</p>
</sec>
<sec id="s2_2">
<title>Genome Assembly and Data Processing</title>
<p>Raw reads were assembled <italic>de novo</italic> using SPADES v3.14.0 assembler with default settings [<uri xlink:href="http://bioinf.spbau.ru/spades">http://bioinf.spbau.ru/spades</uri>] (<xref ref-type="bibr" rid="B2">Bankevich et&#xa0;al., 2012</xref>). The metrics of assembled genomes were obtained with QUAST v5.0.0 (<xref ref-type="bibr" rid="B32">Gurevich et&#xa0;al., 2013</xref>). Three standard deviations (SD) from the mean for contig number, largest contig, total length, GC, N50 and genes were used as quality filter for assembled genomes.</p>
<p>The STs for isolates in the STEC database were checked using mlst (<uri xlink:href="https://github.com/tseemann/mlst">https://github.com/tseemann/mlst</uri>) with the <italic>E. coli</italic> scheme from PubMLST (<xref ref-type="bibr" rid="B43">Jolley and Maiden, 2010</xref>). rSTs were extracted from the <italic>E. coli/Shigella</italic> rMLST database in Enterobase on August 2020 (<xref ref-type="bibr" rid="B99">Zhou et&#xa0;al., 2020</xref>). Serotyping of <italic>E. coli</italic> O and H antigen types were predicted by using SerotypeFinder v2.0.1 (<xref ref-type="bibr" rid="B40">Joensen et&#xa0;al., 2015</xref>). The phylogroups of STEC isolates were obtained using ClermonTyping (<xref ref-type="bibr" rid="B3">Beghain et&#xa0;al., 2018</xref>).</p>
</sec>
<sec id="s2_3">
<title>Selection of Isolates for STEC Identification Dataset</title>
<p>Representative isolates for each ST, rST and serotype in the STEC dataset were selected to form the identification dataset. For STs, rSTs and serotypes with only one isolate, the isolate was selected. For STs, rSTs and serotypes with more than one isolate, one representative isolate for each ST, rST and serotype was randomly selected. For rSTs in the top six STs, one representative isolate for each rST with two or more isolates was randomly selected. A further 691 isolates including 72 <italic>Escherichia coli</italic> reference (ECOR) strains downloaded from Enterobase, 573 non-STEC <italic>E.coli</italic> isolates representing 573 STs with more than nine genomes, 41 <italic>Shigella</italic> and EIEC isolates representing each cluster identified in our previous study (<xref ref-type="bibr" rid="B98">Zhang et&#xa0;al., 2021</xref>), three <italic>E. albertii</italic> isolates and two <italic>E. fergusonii</italic> isolates were used as controls for the identification dataset. The details of the identification dataset are listed in <xref ref-type="supplementary-material" rid="ST1">
<bold>Table S1</bold>
</xref>. The remaining STEC isolates in the STEC database were referred to as the validation dataset (<xref ref-type="supplementary-material" rid="ST2">
<bold>Table S2</bold>
</xref>).</p>
<p>The identification dataset was used to identify the phylogenetic relationships of STEC isolates and was also used to identify cluster/serotype-specific gene markers. The validation dataset was used to evaluate the performance of cluster/serotype-specific gene markers relative to phylogenetic relationships.</p>
</sec>
<sec id="s2_4">
<title>Phylogeny of STEC Isolates Based on WGS</title>
<p>Phylogenetic trees including an identification tree and 15 validation trees were constructed by using Quicktree v1.3 (<xref ref-type="bibr" rid="B36">Hu et&#xa0;al., 2020</xref>) with default parameters to identify and confirm the phylogenetic clustering of STEC isolates. The phylogenetic trees were visualised by Grapetree and ITOL v5 (<xref ref-type="bibr" rid="B100">Zhou et&#xa0;al., 2018</xref>; <xref ref-type="bibr" rid="B51">Letunic and Bork, 2019</xref>).</p>
<p>The identification phylogenetic tree was generated using isolates in the identification dataset for the identification of clusters of STEC isolates. The validation trees were constructed using isolates in the STEC validation dataset and a subset of isolates known to represent each identified cluster from the identification dataset to assign validation dataset isolates to the clusters defined.</p>
</sec>
<sec id="s2_5">
<title>Identification of the Cluster/Serotype-Specific Gene Markers</title>
<p>Cluster/serotype-specific gene markers were identified from STEC accessory genomes. The genomes from the identification dataset were annotated using PROKKA v1.13.3 (<xref ref-type="bibr" rid="B82">Seemann, 2014</xref>). Pan- and core-genomes were analysed by Roary v3.12.0 (<xref ref-type="bibr" rid="B75">Page et&#xa0;al., 2015</xref>) using an 80% sequence identity threshold. The candidate gene markers specific to each cluster/serotype were identified from accessory genes with an in-house python script from our previous study (<xref ref-type="bibr" rid="B98">Zhang et&#xa0;al., 2021</xref>). The best performing specific gene marker set was selected from the candidates by using BLASTN to search against the identification dataset.</p>
<p>As in our previous studies (<xref ref-type="bibr" rid="B97">Zhang et&#xa0;al., 2019</xref>; <xref ref-type="bibr" rid="B98">Zhang et&#xa0;al., 2021</xref>) the genomes from a given cluster containing all specific gene markers for that cluster were termed true positives (TP), the genomes from the same cluster lacking any of those same gene markers were termed false negatives (FN). The genomes from other clusters containing all of those same gene markers were termed false positives (FP). The sensitivity (True positive rate, TPR) of each cluster-specific gene marker was defined as TP/(TP+FN). The specificity (True negative rate, TNR) was defined as TN/(TN+FP).</p>
</sec>
<sec id="s2_6">
<title>Validation of the Cluster/Serotype-Specific Gene Markers</title>
<p>The specific gene markers were examined by using BLASTN to search against the validation dataset (<xref ref-type="supplementary-material" rid="ST2">
<bold>Table S2</bold>
</xref>) and non-STEC <italic>E. coli</italic> control database for the presence of any of the cluster/serotype-specific gene markers. The BLASTN thresholds were defined as 80% sequence identity and 50% gene length coverage.</p>
</sec>
<sec id="s2_7">
<title>Development of STECFinder, an Automated Pipeline for Molecular Serotyping of STEC</title>
<p>STECFinder was developed for STEC serotyping from either paired end Illumina genome sequencing reads or assembled genomes. The typing reference sequences used for construction of STECFinder included specific gene marker sets identified in this study, established <italic>E. coli</italic> O antigen and H antigen gene sequences collected from SerotypeFinder (<xref ref-type="bibr" rid="B40">Joensen et&#xa0;al., 2015</xref>), <italic>stx</italic> subtype sequences collected from VirulenceFinder and three other studies (<xref ref-type="bibr" rid="B39">Joensen et&#xa0;al., 2014</xref>; <xref ref-type="bibr" rid="B48">Lacher et&#xa0;al., 2016</xref>; <xref ref-type="bibr" rid="B1">Bai et&#xa0;al., 2018</xref>; <xref ref-type="bibr" rid="B93">Yang et&#xa0;al., 2020</xref>), the <italic>ipaH</italic> gene sequence downloaded from NCBI, and seven House Keeping (HK) genes -<italic>recA</italic>, <italic>purA</italic>, <italic>mdh</italic>, <italic>icd</italic>, <italic>gyrB</italic>, <italic>fumC</italic> and <italic>adk</italic> from the <italic>E. coli</italic> MLST scheme (<xref ref-type="bibr" rid="B43">Jolley and Maiden, 2010</xref>) for contamination checking (<xref ref-type="fig" rid="f1">
<bold>Figure&#xa0;1</bold>
</xref>). All sequences are available in fasta format at <uri xlink:href="https://github.com/LanLab/STECFinder">https://github.com/LanLab/STECFinder</uri> with cluster specific genes named with the following convention: STEC-cluster-gene_number (i.e. STEC-C1-gene_1 for the first gene in the C1 specific set).</p>
<fig id="f1" position="float">
<label>Figure&#xa0;1</label>
<caption>
<p>
<italic>in silico</italic> serotyping pipeline workflow. Schematic of <italic>in silico</italic> serotyping STEC by cluster/serotype-specific genes combined with the <italic>ipaH</italic> gene, <italic>stx</italic> genes including all available subtypes and <italic>E. coli</italic> O antigen and H antigen genes, implemented in STECFinder. Both assembled genomes and raw reads are accepted as data input.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fcimb-11-772574-g001.tif"/>
</fig>
<p>For the analysis of sequence data as raw reads, KMA (<italic>k</italic>-mer alignment) v1.3.15 (<xref ref-type="bibr" rid="B16">Clausen et&#xa0;al., 2018</xref>) was used to align the raw reads to the typing reference sequences. KMA utilizes <italic>k</italic>-mer seeding and the Needleman-Wunsch algorithm (<xref ref-type="bibr" rid="B70">Needleman and Wunsch, 1970</xref>) to accurately align reads to genes of interest. The best-aligning template was chosen from a novel sorting scheme ConClave scheme incorporated into KMA (<xref ref-type="bibr" rid="B16">Clausen et&#xa0;al., 2018</xref>). To determine whether the genes were present or absent, the mapping length coverage and a minimum depth were used as the thresholds.</p>
<p>For the submission of sequence data as assembled genomes, BLASTN v2.9.0 (<xref ref-type="bibr" rid="B15">Camacho et&#xa0;al., 2009</xref>) was used to search against the typing reference sequences with 80% sequence identity. The presence or absence of genes was determined by the gene length coverage.</p>
<p>The presence or absence of genes in STECFinder was determined by the cutoff value of gene length coverage for assembled genomes and the mapping length coverage and a minimum mapping depth for raw reads. For assembled genomes, length coverage of 50% for all cluster/serotype-specific genes, 60% for O and H antigen genes and 10% for <italic>ipaH</italic> gene and <italic>stx</italic> genes were used as cutoff value for determination of the presence of genes. For raw reads, mapping length coverage of 50% for all cluster/serotype-specific genes, 60% for O and H antigen genes, 10% for <italic>ipaH</italic> gene and <italic>stx</italic> genes and a minimum depth of 10 for all cluster-specific genes, a minimum depth of one for O and H antigen genes, <italic>ipaH</italic> gene and <italic>stx</italic> genes were used to define the gene as present. In addition, when multiple O and H genes were detected the bitscore was incorporated into STECFinder for filtering and ranking O and H antigen. The highest match was chosen as the O or H antigen present, when multiple O or H variants were present.</p>
<p>The major and minor clusters and top 10 non-O157:H7 STEC serotypes were assigned based on the presence of cluster/serotype-specific gene marker set together with the presence of <italic>stx</italic> subtypes and the absence of <italic>ipaH</italic> gene. All genes in a cluster/serotype-specific gene set must be defined as present for a cluster or serotype to be called. An &#x2018;unclustered&#x2019; was assigned for isolate that cannot be detected by any of cluster-specific gene marker set. Unclustered STEC could be new clusters or isolates that contained all genes in the marker set but one or more genes from marker set did not pass the cutoff value.</p>
<p>Additional subsets of gene marker sets were added to increase the accuracy of clusters and calling of the top 10 non-O157:H7 STEC serotypes. For example, the combination of the specific gene marker set of O157:H7 and AM18 can eliminate the known false presences of AM18 gene set in O157:H7. The isolate is assigned as AM18 if both gene sets are present while the isolate is assigned as O157:H7 if AM18 specific gene set is absent. The subsets of combined gene sets were incorporated into the STECFinder for elimination of false cluster assignment are listed in <xref ref-type="supplementary-material" rid="ST6">
<bold>Table S6</bold>
</xref>.</p>
<p>STECFinder was tested with identification dataset. The accuracy and specificity of STECFinder for prediction of clusters and serotypes were evaluated with STEC validation dataset and non-STEC <italic>E.coli</italic> control dataset.</p>
</sec>
<sec id="s2_8">
<title>Application of STECFinder in STEC Typing Using Metagenomics Data From STEC Spiked Food Samples</title>
<p>STECFinder can take input from metagenomics sequencing reads for STEC typing. The application of STECFinder in metagenomics analysis was evaluated using 17 metagenomic sequencing read sets from samples published by Buytaers et al. (<xref ref-type="bibr" rid="B14">Buytaers et&#xa0;al., 2020</xref>). These 17 shotgun metagenomic sequencing reads (<xref ref-type="bibr" rid="B14">Buytaers et&#xa0;al., 2020</xref>) were downloaded from ENA.</p>
</sec>
</sec>
<sec id="s3" sec-type="results">
<title>Results</title>
<sec id="s3_1">
<title>Screening Sequenced Genomes for STEC Isolates</title>
<p>The presence of any of <italic>stx</italic> genes and the absence of the <italic>ipaH</italic> gene were used to identify STEC isolates. We examined 140,348 isolates with the species annotation of <italic>E. coli</italic> with paired end Illumina sequencing reads available in ENA database. Of the 140,348 isolates, 43,960 isolates were positive to <italic>stx<sub>1</sub>
</italic> and/or <italic>stx<sub>2</sub>
</italic> genes and negative for the <italic>ipaH</italic> gene. 41,101 of the 43,960 isolates passed taxonomic classification and genome assembly quality filters and were selected to form the STEC dataset.</p>
<p>Isolates in the STEC dataset were typed using MLST, rMLST and SerotypeFinder. MLST typed the 41,101 STEC isolates into 817 STs (202 isolates not typed by MLST) of which 368 STs were represented by a single isolate, 424 STs represented by two to 100 isolates each and accounted for 12% of the STEC isolates, whereas 25 STs contained more than 100 isolates each and encompassed 86.61% of the STEC isolates, of which ST11 is the largest, accounting for 37.12% of the STEC isolates, followed by ST21 (14.71%), ST17 (11.91%), ST16 (6.72%), ST655 (2,71%) and ST32 (2.46%). rMLST divided the 41,101 STEC isolates into 2,911 rSTs (12,208 isolates not typed by rMLST).</p>
<p>Using SerotypeFinder, 38,958 of the 41,101 (94.79%) isolates were assigned to 460 <italic>E. coli</italic> O:H antigen types, 2,039 isolates (4.96%) were not assigned an O antigen and were typed for H antigens only with 38 H antigen types, of which H7, H2, H8, H11 and H21 were the most frequent types, 96 isolates (0.23%) were typed as multiple O:H types and six isolates (0.01%) were untypeable.</p>
</sec>
<sec id="s3_2">
<title>The Frequency of STEC Serotypes</title>
<p>The 38,958 STEC O:H antigen typeable isolates belonged to 460 different serotypes including O157:H7 (38.55% of 38,958 typeable isolates) and 459 non-O157:H7 serotypes (61.45% of 38,958 typeable isolates).</p>
<p>Of the 459 non-O157:H7 serotypes, the top 28 serotypes were present in more than 100 isolates each and accounted for 50.8% of 38,958 typeable STEC isolates, of which the 10 most frequent serotypes (41.66% of 38,958 typeable STEC isolates) were O26:H11, O103:H2, O111:H8, O121:H9, O145:H28, O45:H2, O91:H14, O118/O151:H16, O123/O186:H2 and O146:H21. The top 6 serotypes belonged to the well-known &#x201c;Big six&#x201d; STEC non-O157 serogroups (<xref ref-type="bibr" rid="B11">Brooks et&#xa0;al., 2005</xref>; <xref ref-type="bibr" rid="B35">Hedican et&#xa0;al., 2009</xref>; <xref ref-type="bibr" rid="B8">Bosilevac and Koohmaraie, 2011</xref>).  It should be noted that three serotypes, O103:H11, O103:H25, and O121:H7, belonging to the &#x201c;Big six&#x201d; non-O157 STEC serogroups were outside the top 10 serotypes. The 116 serotypes present with 10 to 100 isolates each, belonged to 8.64% of typeable STEC isolates. The remaining 315 serotypes with less than 10 isolates each represented 2% of the typeable STEC isolates (<xref ref-type="fig" rid="f2">
<bold>Figure&#xa0;2</bold>
</xref>).</p>
<fig id="f2" position="float">
<label>Figure&#xa0;2</label>
<caption>
<p>The frequency of 460 STEC serotypes. The graph shows the frequency of 460 STEC serotypes. STEC O157:H7 and top 28 non-O157:H7 serotypes are listed separately. The number on top of each stacked column refers to the number of isolates for each serotype.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fcimb-11-772574-g002.tif"/>
</fig>
</sec>
<sec id="s3_3">
<title>Identification of STEC Clusters</title>
<p>To identify any phylogenetic clusters containing one or more STEC serotypes from the 41,101 STEC isolates, we selected representative isolates to perform phylogenetic analysis as it was impractical to construct a tree with all isolates. The selection was performed on the basis of ST, rST and serotype of the 41,101 STEC isolates. One isolate was selected to represent each ST, rST and serotype for a total of 2,567 STEC isolates. Note that in the case that STs or rSTs overlapped with serotype, an isolate was only selected once to avoid duplicates of the same isolate. The selection included 817 STs, 1,413 rSTs, 460 STEC serotypes and 102 partial antigen types (H antigen only and multiple O/H types). A further 691 isolates consisting of 72 ECOR isolates, 573 non-STEC <italic>E.coli</italic> isolates, 41 <italic>Shigella</italic> and EIEC isolates, three <italic>E. albertii</italic> isolates and two <italic>E. fergusonii</italic> isolates were also included. The identification dataset consisted of 3,258 isolates in total. Details are listed in <xref ref-type="supplementary-material" rid="ST1">
<bold>Table S1</bold>
</xref>. A phylogenetic tree was constructed using 3,258 isolates in the identification dataset to identify the clusters (<xref ref-type="fig" rid="f3">
<bold>Figure&#xa0;3</bold>
</xref>).</p>
<fig id="f3" position="float">
<label>Figure&#xa0;3</label>
<caption>
<p>STEC cluster identification phylogenetic tree. Representative isolates from the identification dataset were used to construct the phylogenetic tree by Quicktree v1.3 to identify STEC (Shiga toxin-producing <italic>E. coli</italic>) clusters and visualised using Grapetree. The dendrogram shows the phylogenetic relationships of 2,567 STEC isolates represented in the identification dataset. Branch lengths are log scale for clarity. The scale bar represents 0.1 substitutions per site. STEC clusters are coloured. Numbers in square brackets after cluster name are the number of isolates for each identified cluster. ECOLI is <italic>E. coli.</italic> EIEC is Enteroinvasive <italic>E.&#xa0;coli</italic>. MC indicates a minor STEC cluster.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fcimb-11-772574-g003.tif"/>
</fig>
<p>The identification of clusters was focused on O157:H7 and the top 28 non-O157:H7 serotypes. A major cluster was defined if the cluster node had a bootstrap value of above 80% and contained STEC isolates belonging to O157:H7 and top 28 non-O157:H7 serotypes. The isolates of O157:H7 were grouped into one large cluster. A further 18 major clusters (C1-C18) all of which carried only non-O157:H7 serotypes (<xref ref-type="fig" rid="f3">
<bold>Figure&#xa0;3</bold>
</xref>; <xref ref-type="table" rid="T1">
<bold>Table&#xa0;1</bold>
</xref> and <xref ref-type="supplementary-material" rid="SF1">
<bold>Figure S1</bold>
</xref>), were identified. The isolates of top 28 non-O157:H7 serotypes fell into these 18 major clusters. Of the 2,567 STEC isolates, 1,412 fell within the O157:H7 cluster or one of the 18 non-O157:H7 major STEC clusters.</p>
<table-wrap id="T1" position="float">
<label>Table&#xa0;1</label>
<caption>
<p>Major STEC clusters identified in identification dataset.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top" align="left">Cluster</th>
<th valign="top" align="center">No. of isolates</th>
<th valign="top" align="center">No. of serotypes</th>
<th valign="top" align="center">No. of STs</th>
<th valign="top" align="center">Top 28 non-O157:H7 serotypes*</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">O157:H7</td>
<td valign="top" align="center">356</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">83</td>
<td valign="top" align="left">O157:H7</td>
</tr>
<tr>
<td valign="top" align="left">C1</td>
<td valign="top" align="center">414</td>
<td valign="top" align="center">30</td>
<td valign="top" align="center">97</td>
<td valign="top" align="left">1-O26:H11, 3-O111:H8, 8-O118/O151:H16,</td>
</tr>
<tr>
<td valign="top" align="left"/>
<td valign="top" align="center"/>
<td valign="top" align="center"/>
<td valign="top" align="center"/>
<td valign="top" align="left">12-O71:H11, 15-O103:H11, 18-O69:H11</td>
</tr>
<tr>
<td valign="top" align="left">C2</td>
<td valign="top" align="center">181</td>
<td valign="top" align="center">16</td>
<td valign="top" align="center">42</td>
<td valign="top" align="left">2-O103:H2, 6-O45:H2, 9-O123/O186:H2</td>
</tr>
<tr>
<td valign="top" align="left"/>
<td valign="top" align="center"/>
<td valign="top" align="center"/>
<td valign="top" align="center"/>
<td valign="top" align="left">11-O118/O151:H2</td>
</tr>
<tr>
<td valign="top" align="left">C3</td>
<td valign="top" align="center">45</td>
<td valign="top" align="center">18</td>
<td valign="top" align="center">12</td>
<td valign="top" align="left">19-O103:H25, 25-O156:H25, 6-O45:H2</td>
</tr>
<tr>
<td valign="top" align="left">C4</td>
<td valign="top" align="center">89</td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">21</td>
<td valign="top" align="left">13-O5:H9, 20-O165:H25, 24-O177:H25</td>
</tr>
<tr>
<td valign="top" align="left">C5</td>
<td valign="top" align="center">29</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">5</td>
<td valign="top" align="left">4-O121:H19</td>
</tr>
<tr>
<td valign="top" align="left">C6</td>
<td valign="top" align="center">41</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">6</td>
<td valign="top" align="left">5-O145:H28</td>
</tr>
<tr>
<td valign="top" align="left">C7</td>
<td valign="top" align="center">40</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">13</td>
<td valign="top" align="left">7-O91:H14</td>
</tr>
<tr>
<td valign="top" align="left">C8</td>
<td valign="top" align="center">40</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">14</td>
<td valign="top" align="left">10-O146:H21</td>
</tr>
<tr>
<td valign="top" align="left">C9</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="left">10-O146:H21</td>
</tr>
<tr>
<td valign="top" align="left">C10</td>
<td valign="top" align="center">50</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">15</td>
<td valign="top" align="left">14-O128ab:H2</td>
</tr>
<tr>
<td valign="top" align="left">C11</td>
<td valign="top" align="center">27</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">6</td>
<td valign="top" align="left">16-O117:H7</td>
</tr>
<tr>
<td valign="top" align="left">C12</td>
<td valign="top" align="center">21</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">6</td>
<td valign="top" align="left">17-O76:H19</td>
</tr>
<tr>
<td valign="top" align="left">C13</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">7</td>
<td valign="top" align="left">21-O113:H21</td>
</tr>
<tr>
<td valign="top" align="left">C14</td>
<td valign="top" align="center">16</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">2</td>
<td valign="top" align="left">22-O113:H4</td>
</tr>
<tr>
<td valign="top" align="left">C15</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="left">23-O104:H4</td>
</tr>
<tr>
<td valign="top" align="left">C16</td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">4</td>
<td valign="top" align="left">26-O8:H19</td>
</tr>
<tr>
<td valign="top" align="left">C17</td>
<td valign="top" align="center">24</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">7</td>
<td valign="top" align="left">27-O130:H11</td>
</tr>
<tr>
<td valign="top" align="left">C18</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="left">28-O55:H7</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>*The serotypes in each non-O157:H7 cluster are listed with their rank by isolate frequency for the top 28 non-O157:H7 serotypes followed by the serotype.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>Of the remaining 1,155 STEC isolates, 877 isolates were grouped into 229 STEC minor clusters with two or more isolates in a cluster, whereas 278 isolates were singletons separated from other clusters by non-STEC <italic>E. coli</italic> isolates. We further typed the isolates from minor clusters using phylogroup typing (<xref ref-type="bibr" rid="B11">Brooks et&#xa0;al., 2005</xref>) and each minor cluster was named by phylogroup and lineage number, for example, phylogroup A minor cluster 1 (AM1). Most of the minor clusters belonged to phylogroup B1 (<xref ref-type="table" rid="T2">
<bold>Table&#xa0;2</bold>
</xref>).</p>
<table-wrap id="T2" position="float">
<label>Table&#xa0;2</label>
<caption>
<p>Summary of identified STEC minor clusters in identification dataset.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top" align="left">Phylogroup</th>
<th valign="top" align="center">No. of MC*</th>
<th valign="top" align="center">Name of MC</th>
<th valign="top" align="center">No. of isolates</th>
<th valign="top" align="center">No. of serotypes</th>
<th valign="top" align="center">No. of STs</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">A</td>
<td valign="top" align="center">37</td>
<td valign="top" align="left">AM1-AM37</td>
<td valign="top" align="center">139</td>
<td valign="top" align="center">64</td>
<td valign="top" align="center">42</td>
</tr>
<tr>
<td valign="top" align="left">B1</td>
<td valign="top" align="center">126</td>
<td valign="top" align="left">B1M1-B1M126</td>
<td valign="top" align="center">519</td>
<td valign="top" align="center">157</td>
<td valign="top" align="center">186</td>
</tr>
<tr>
<td valign="top" align="left">B2</td>
<td valign="top" align="center">14</td>
<td valign="top" align="left">B2M1-B2M14</td>
<td valign="top" align="center">35</td>
<td valign="top" align="center">20</td>
<td valign="top" align="center">17</td>
</tr>
<tr>
<td valign="top" align="left">C</td>
<td valign="top" align="center">7</td>
<td valign="top" align="left">CM1-CM7</td>
<td valign="top" align="center">17</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">8</td>
</tr>
<tr>
<td valign="top" align="left">D</td>
<td valign="top" align="center">22</td>
<td valign="top" align="left">DM1-DM22</td>
<td valign="top" align="center">67</td>
<td valign="top" align="center">26</td>
<td valign="top" align="center">29</td>
</tr>
<tr>
<td valign="top" align="left">E</td>
<td valign="top" align="center">19</td>
<td valign="top" align="left">EM1-EM19</td>
<td valign="top" align="center">73</td>
<td valign="top" align="center">26</td>
<td valign="top" align="center">34</td>
</tr>
<tr>
<td valign="top" align="left">G</td>
<td valign="top" align="center">4</td>
<td valign="top" align="left">GM1-GM4</td>
<td valign="top" align="center">27</td>
<td valign="top" align="center">12</td>
<td valign="top" align="center">12</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>*MC, minor clusters.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>In total, 19 major STEC clusters including one O157:H7 and 18 non-O157:H7 clusters (Top 28 non-O157:H7 serotypes) and 229 STEC minor clusters containing other non-O157:H7 serotypes were identified. Of the 19 major clusters, 12 had a single serotype and seven had two or more serotypes. The frequency of non-O157:H7 STEC serotypes in the major clusters are shown in <xref ref-type="fig" rid="f4">
<bold>Figure&#xa0;4</bold>
</xref>. For the 229 STEC minor clusters, 103 contained a single serotype, 109 consisted of two or more serotypes and the remaining 17 comprised of isolates with H antigen types only.</p>
<fig id="f4" position="float">
<label>Figure&#xa0;4</label>
<caption>
<p>The frequency of the top 28 non-O157:H7 STEC serotypes in STEC major clusters. The graph shows the frequency of top 28 non-O157:H7 serotypes in the 18 STEC major clusters. Clusters are shown per colour legend and also at the top of the bar. X-axis shows the serotype while y-axis shows the number of isolates.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fcimb-11-772574-g004.tif"/>
</fig>
<p>Among the top 10 non-O157:H7 serotypes, O121:H19 (C5), O145:H28 (C6), O91:H14 (C7) had a single origin while O146:H21 (C8 and C9) was a paraphyletic serotype. O26:H11 and O118/O151:H16 were grouped into C1. O123/O186:H2 was grouped into C2. O103:H2, O111:H8 and O45:H2 had polyphyletic origins. O103:H2 and O111:H8 were grouped into C2 and B1M118, C1 and B1M119, respectively. O45:H2 had three lineages which were clustered into C2, C3 and AM37. Three serotypes (O128ac:H2, O8:H19 and O113:H21) of the remaining top 28 non-O157:H7 serotypes were polyphyletic serotypes. Thirty non top 28 non-O157:H7 serotypes also had polyphyletic origins.</p>
<p>Apart from STEC isolates, 26 of the 573 <italic>stx</italic> negative <italic>E. coli</italic> isolates from the identification dataset were grouped into clusters. Of the 19 major clusters identified, 12 contained <italic>stx</italic> negative <italic>E. coli</italic> isolates (ST11 in O157:H7; ST765 and ST29 in C1; ST17, and ST376 in C2; ST343 andST300 in C3, ST342 in C4; ST655 in C5; ST32 in C6; ST442 and ST1992 in C8, ST335 in C18). These STs containing <italic>stx</italic> negative <italic>E. coli</italic> isolates were the most frequent STs in the STEC database, suggesting these <italic>stx</italic> negative <italic>E. coli</italic> isolates may have lost the <italic>stx</italic> genes. The details of STEC clusters and lineages were listed in <xref ref-type="supplementary-material" rid="ST3">
<bold>Table S3</bold>
</xref>.</p>
<p>However, 11 STEC minor clusters also contained <italic>stx</italic> negative <italic>E. coli</italic> isolates. Therefore, we further examined STs with more than two isolates from all minor STEC clusters that were also found within the 14,126 <italic>stx</italic> negative <italic>E. coli</italic> (&#x201c;non-STEC&#x201d;) isolates. Of the 229 minor STEC clusters, the STs in 58 clusters contained <italic>stx</italic> positive isolates only and the STs in 171 clusters contained both <italic>stx</italic> negative and <italic>stx</italic> positive isolates. Of these 171 minor STEC clusters, the STs in four clusters consisted of <italic>stx</italic> positive isolates and <italic>E. coli</italic> isolates that didn&#x2019;t carry typical pathotype specific genes (data not shown). While STs in the remaining 167 clusters consisted of <italic>stx</italic> positive isolates and isolates that carried pathotype specific genes from other <italic>E. coli</italic> pathotypes (data not shown). Thus, these STEC minor clusters are a mix of STEC and other pathotypes.</p>
</sec>
<sec id="s3_4">
<title>Identification of the Cluster/Serotype-Specific Gene Markers</title>
<p>In this study, we used the same definition and approach as used to find the <italic>Shigella</italic>/EIEC cluster specific genes (<xref ref-type="bibr" rid="B98">Zhang et&#xa0;al., 2021</xref>). We searched for potential specific gene marker sets for the 19 major and 229 minor clusters using the accessory genomes from the 3,258 identification dataset isolates. Genes associated with STEC O antigen gene clusters were excluded from the analysis to identify O antigen gene independent markers. Multiple candidate cluster/serotype-specific gene marker sets for each of the 19 major STEC clusters and 229 minor STEC clusters were identified. The single gene marker set with 100% sensitive and the highest specificity were then selected from candidate cluster-specific gene marker sets by BLASTN searches against genomes in the identification dataset using 80% sequence identity and 50% gene length threshold.</p>
<p>We also searched for specific gene markers for six of the top 10 non-O157:H7 serotypes (O26:H11, O111:H8, O118/O151:H16, O103:H2, O45:H2 and O123/O186:H2) which were not in a cluster of their own. The best performing gene marker set for each of six of top 10 non-O157:H7 serotypes were identified using the same approach as used to identify and select cluster-specific gene marker sets.</p>
<p>The sensitivity and specificity of each major STEC cluster and six non-O157:H7 serotype specific gene marker set for the identification dataset were listed in <xref ref-type="table" rid="T3">
<bold>Table&#xa0;3</bold>
</xref>. The major STEC cluster and six non-O157:H7 serotype specific gene marker sets were all 100% sensitive and the specificity varied from 99.72% to 100% for major STEC cluster-specific gene marker sets and from 99.41% to 100% for non-O157:H7 serotype-specific gene marker sets. The STEC minor cluster-specific gene marker sets were 100% specific with the exception of 12 minor clusters which had specificity ranging from 99.85% to 99.97% (<xref ref-type="supplementary-material" rid="ST4">
<bold>Table S4</bold>
</xref>).</p>
<table-wrap id="T3" position="float">
<label>Table&#xa0;3</label>
<caption>
<p>The sensitivity and specificity of STEC cluster/serotype-specific gene markers.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top" rowspan="2" align="left">Clusters</th>
<th valign="top" rowspan="2" align="center">Cluster-specific genes marker sets</th>
<th valign="top" colspan="3" align="center">Identification dataset (3,258 isolates)</th>
</tr>
<tr>
<th valign="top" align="center">No. of isolates</th>
<th valign="top" align="center">Sensitivity</th>
<th valign="top" align="center">Specificity*</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">O157:H7</td>
<td valign="top" align="left">Set of 6 genes</td>
<td valign="top" align="center">356</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">99.72</td>
</tr>
<tr>
<td valign="top" align="left">C1</td>
<td valign="top" align="left">Set of 4 genes</td>
<td valign="top" align="center">414</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">99.82</td>
</tr>
<tr>
<td valign="top" align="left">C2</td>
<td valign="top" align="left">Set of 4 genes</td>
<td valign="top" align="center">181</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">99.97</td>
</tr>
<tr>
<td valign="top" align="left">C3</td>
<td valign="top" align="left">Set of 3 genes</td>
<td valign="top" align="center">45</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">100</td>
</tr>
<tr>
<td valign="top" align="left">C4</td>
<td valign="top" align="left">Set of 3 genes</td>
<td valign="top" align="center">89</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">99.97</td>
</tr>
<tr>
<td valign="top" align="left">C5</td>
<td valign="top" align="left">Set of 4 genes</td>
<td valign="top" align="center">29</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">100</td>
</tr>
<tr>
<td valign="top" align="left">C6</td>
<td valign="top" align="left">Set of 3 genes</td>
<td valign="top" align="center">41</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">99.88</td>
</tr>
<tr>
<td valign="top" align="left">C7</td>
<td valign="top" align="left">Set of 4 genes</td>
<td valign="top" align="center">40</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">99.97</td>
</tr>
<tr>
<td valign="top" align="left">C8</td>
<td valign="top" align="left">Set of 5 genes</td>
<td valign="top" align="center">40</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">99.97</td>
</tr>
<tr>
<td valign="top" align="left">C9</td>
<td valign="top" align="left">Set of 2 genes</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">100</td>
</tr>
<tr>
<td valign="top" align="left">C10</td>
<td valign="top" align="left">Set of 2 genes</td>
<td valign="top" align="center">50</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">100</td>
</tr>
<tr>
<td valign="top" align="left">C11</td>
<td valign="top" align="left">Single gene</td>
<td valign="top" align="center">27</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">100</td>
</tr>
<tr>
<td valign="top" align="left">C12</td>
<td valign="top" align="left">Set of 2 genes</td>
<td valign="top" align="center">21</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">100</td>
</tr>
<tr>
<td valign="top" align="left">C13</td>
<td valign="top" align="left">Set of 4 genes</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">100</td>
</tr>
<tr>
<td valign="top" align="left">C14</td>
<td valign="top" align="left">Set of 4 genes</td>
<td valign="top" align="center">16</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">99.97</td>
</tr>
<tr>
<td valign="top" align="left">C15</td>
<td valign="top" align="left">Set of 2 genes</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">100</td>
</tr>
<tr>
<td valign="top" align="left">C16</td>
<td valign="top" align="left">Set of 4 genes</td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">99.97</td>
</tr>
<tr>
<td valign="top" align="left">C17</td>
<td valign="top" align="left">Set of 3 genes</td>
<td valign="top" align="center">24</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">99.97</td>
</tr>
<tr>
<td valign="top" align="left">C18</td>
<td valign="top" align="left">Set of 3 genes</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">99.97</td>
</tr>
<tr>
<td valign="top" align="left">O26:H11</td>
<td valign="top" align="left">Set of 6 genes</td>
<td valign="top" align="center">204</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">99.41</td>
</tr>
<tr>
<td valign="top" align="left">O103:H2</td>
<td valign="top" align="left">Set of 4 genes</td>
<td valign="top" align="center">121</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">99.87</td>
</tr>
<tr>
<td valign="top" align="left">O111:H8</td>
<td valign="top" align="left">Set of 3 genes</td>
<td valign="top" align="center">96</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">100</td>
</tr>
<tr>
<td valign="top" align="left">O45:H2 (C2)</td>
<td valign="top" align="left">Set of 5 genes</td>
<td valign="top" align="center">22</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">99.97</td>
</tr>
<tr>
<td valign="top" align="left">O45:H2 (C3)</td>
<td valign="top" align="left">Set of 3 genes</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">100</td>
</tr>
<tr>
<td valign="top" align="left">O118/O156:H16</td>
<td valign="top" align="left">Set of 4 genes</td>
<td valign="top" align="center">17</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">99.94</td>
</tr>
<tr>
<td valign="top" align="left">O123/O186:H2</td>
<td valign="top" align="left">Set of 3 genes</td>
<td valign="top" align="center">21</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">100</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>
<sup>*</sup>The specificity of cluster-specific gene set less than 100% was due to at least one false positive found in that set.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="s3_5">
<title>Validation of Cluster/Serotype-Specific Gene Markers</title>
<p>The STEC cluster/serotype-specific gene marker sets were evaluated with 38,534 STEC isolates from the validation dataset and 14,126 isolates from non-STEC <italic>E.coli</italic> control dataset.</p>
<p>The STEC cluster-specific gene marker sets were able to assign 35,464 of 38,534 (92.03%) STEC isolates to the major clusters and 2,703 (7.01%) STEC isolates to minor clusters. In total, 38,155 of 38,534 (99.02%) STEC isolates can be assigned to clusters by cluster-specific gene marker sets, while 150 of the 38,534 (0.39%) STEC isolates were assigned with more than one cluster and 217 of the 38,534 (0.56%) STEC isolates were not assigned to any cluster by STEC cluster-specific gene marker sets.</p>
<p>Validation phylogenetic trees (<xref ref-type="supplementary-material" rid="SF2">
<bold>Figure S2</bold>
</xref>) were then constructed to confirm the assignment of cluster-specific gene marker sets. We divided the 38,534 STEC validation isolates into 15 subgroups. Each of the 15 subgroups isolates together with a subset of 476 STEC isolates with known clusters and 691 non-STEC isolates from identification dataset were used to generate validation trees for a total of 15 validation trees. The validation isolates were considered to truly belong to a given cluster if the isolates were found within a branch that only contained identification dataset isolates from that cluster with a bootstrap value of 80% or greater. In total 38,340 (99.5%) validation isolates were assigned to major and minor STEC clusters with 35,574 (92.32%) and 2,766 (7.18%) respectively, while the remaining 194 isolates (0.5%) were not assigned to any clusters.</p>
<p>Compared to cluster assignment by phylogenetic trees as the ground truth, cluster-specific gene marker sets correctly assigned 35,461 validation isolates to major clusters and 2,704 validation isolates to minor clusters. Cluster -specific gene marker sets also correctly identified 191 of the 194 isolates without cluster assignments. In total the accuracy of assignments by cluster -specific gene marker sets were 99.54%. The sensitivity and specificity for each cluster-specific gene marker set for validation dataset were listed in <xref ref-type="supplementary-material" rid="ST4">
<bold>Table S4</bold>
</xref>.</p>
<p>The STEC cluster specific gene marker sets were validated on 14,216 non-STEC <italic>E. coli</italic> isolates. The specificity of the STEC cluster-specific gene markers set for major clusters varied from 99.38% to 100% and the specificity of the STEC cluster-specific gene marker sets for minor clusters ranged from 97.25% to 100%. Details are listed in <xref ref-type="supplementary-material" rid="ST5">
<bold>Table S5</bold>
</xref>.</p>
</sec>
<sec id="s3_6">
<title>STECFinder for Molecular Serotyping of STEC Isolates and Its Accuracy and Specificity</title>
<p>STECFinder was developed for cluster and serotype identification of STEC isolates. Cluster was identified using cluster -specific gene marker sets and serotype was identified using serotype-specific gene markers as well as <italic>E. coli</italic> O and H antigen genes within clusters. Either paired end Illumina genome sequencing reads or assembled genomes can be used. STECFinder is available on github (<uri xlink:href="https://github.com/LanLab/STECFinder">https://github.com/LanLab/STECFinder</uri>).</p>
<p>The accuracy and specificity of STECFinder for STEC typing were tested with 3,258 isolates from the identification dataset. For assembled genomes, all 1,412 STEC isolates belonging to 19 major clusters and all 877 STEC isolates belonging to 229 minor clusters were correctly predicted, while 26 of 573 <italic>stx</italic> negative <italic>E.coli</italic> isolates were assigned to STEC clusters by their corresponding cluster-specific gene marker sets. Eighteen STEC singletons were assigned to clusters or minor clusters. For read mapping, two of 1,412 isolates belonging to the 19 major clusters and 25 of 877 isolates from minor clusters were not detected by cluster-specific gene marker sets, while 26 <italic>stx</italic> negative <italic>E.coli</italic> was assigned to STEC clusters similar to the assignment using the assembled genomes. The accuracy of STECFinder for cluster assignments was 99.45% and 98.5% for assembled genomes and read mapping respectively. The accuracy of cluster assignment for the top 10 non-O157:H7 serotypes was 99.14% and 99.11% for assembled genomes and read mapping, respectively.</p>
<p>STECFinder was validated on 38,534 isolates from the STEC validation dataset. Compared to the ground truth assignments determined using phylogenetic analysis, STECFinder assigned 99.85% and 99.83% of validation isolates correctly to clusters for assembled genomes and read mapping, respectively. The accuracy of cluster assignment for top 10 non-O157:H7 serotypes was 99.72% for assembled genomes and 99.65% for read mapping. For the 38,534 <italic>stx</italic>-positive isolates from validation dataset, STECFinder demonstrated 100% cluster assignment specificity for both assembled genomes and read mapping. The cluster assignment specificity of STECFinder was further evaluated using the 14,126 <italic>stx</italic>-negative <italic>E. coli</italic> isolates from the &#x201c;non-STEC&#x201d; control dataset. The specificity was 87.07% and 85.12% for assembled genomes and read mapping, respectively. Further investigation of the false positive isolates found that 1,074 false positive isolates belonged to the STEC cluster based on phylogenetic analysis. After removing all of these false positive isolates, the specificity was 94.66% and 92.72% for assembled genomes and read mapping respectively.</p>
<p>STECFinder can assign STEC isolates to serotype level within predicted clusters. The comparison of <italic>in silico</italic> serotyping of the total of 41,101 STEC isolates between STECFinder and SerotypeFinder (<xref ref-type="bibr" rid="B40">Joensen et&#xa0;al., 2015</xref>) was performed. For assembled genomes, the serotype prediction of 40,912 of 41,101 (99.54%) STEC isolates by STECFinder agreed with that by SerotypeFinder when applying the same cutoff values of 80% sequence identity and 60% length coverage. For the remaining 189 STEC isolates with non-identical serotype prediction, STECFinder predicted serotypes were largely a subset of O:H types predicted by SerotypeFinder. For example, an isolate may be assigned as wzx_O103 and H2 by STECFinder while SerotypeFinder predicted as a mixed wzx_O103/O26 and H2/H11.</p>
<p>There were 40,618 of 41,101 (98.82%) STEC isolates with the same serotype prediction by STECFinder and SerotypeFinder from read mapping. For the remaining 483 cases, STECfinder assigned a full serotype while SerotypeFinder assigned 257 isolates with H antigen only, 117 isolates with O antigen only and 109 isolates with multiple O:H types.</p>
</sec>
<sec id="s3_7">
<title>Detection of STEC Clusters and Serotypes Using STECFinder in Spiked Food Samples Using Shotgun Metagenomic Sequencing Reads</title>
<p>The application of STECFinder in metagenomics analysis was evaluated with 17 metagenomic sequencing reads from samples published by Buytaers et al. (<xref ref-type="bibr" rid="B14">Buytaers et&#xa0;al., 2020</xref>). The 17 metagenomic samples consisted of nine minced beef meat samples spiked with a STEC O157:H7 isolate, one fresh goat cheese sample each spiked with STEC O145:H28 isolate, O103:H2 isolate and co-spiked with STEC O103:H2 and O145:H28 isolates and five STEC negative control food samples. Samples were spiked with STEC isolates at the lowest infectious dose (&lt;10 CFU for 25 g of food) (<xref ref-type="bibr" rid="B14">Buytaers et&#xa0;al., 2020</xref>).</p>
<p>STECFinder assigned the nine samples spiked with STEC O157:H7 to O157:H7 cluster, one sample with STEC O145:H28 to C6 (O145:H8), one sample spiked with STEC O103:H2 to C2 and O103:H2 (O103:H2 is within C2). One sample co-spiked with STEC O103:H2 and O145:H28 was assigned to C2 and O103:H2 (O103:H2 is within C2), and C6 (O145:H8). The cluster/serotype-specific gene marker sets were not detected in the five control samples and STECFinder assigned the five sequenced reads of STEC negative control to &#x201c;Other-<italic>E. coli</italic>&#x201d;. STECFinder correctly typed the spiked samples using cluster/serotype-specific gene markers.</p>
</sec>
</sec>
<sec id="s4" sec-type="discussion">
<title>Discussion</title>
<p>In this study, we performed genomic analysis of more than 41,000 STEC genomes representing 460 different serotypes and identified 19 major phylogenetic clusters including one O157:H7 cluster and 18 non-O157:H7 clusters containing the 28 most frequent non-O157:H7 serotypes, and 229 minor clusters. WGS-based phylogenetic analysis of such a large set of genome data found that STEC had far greater genetic diversity than what has been observed previously with clusters containing one or more serotypes. The close phylogenetic relationship between O26:H11, O111:H8 and O103:H11 in C1, O103:H2 and O45:H2 in C2 agreed with previous studies (<xref ref-type="bibr" rid="B27">Gonz&#xe1;lez-Escalona and Kase, 2019</xref>; <xref ref-type="bibr" rid="B95">Zhang et&#xa0;al., 2020</xref>). With the large number of serotypes (460) as well as polyphyletic or paraphyletic origin of 37 serotypes, identification of serotype specific markers for all serotypes was not possible. However, cluster specific markers were identified and used to develop a pipeline, STECFinder, to facilitate cluster and serotype identification of STEC isolates.</p>
<p>STEC infections have a significant impact on public health worldwide (<xref ref-type="bibr" rid="B30">FAO/WHO STEC EXPERT GROUP, 2019</xref>). Early detection and differentiation of STEC is vital for food safety surveillance and public health. The initial screening of <italic>stx</italic> genes for STEC serotype detection may lead to misdiagnosis of STEC because <italic>stx</italic> genes can be lost or gained (<xref ref-type="bibr" rid="B30">FAO/WHO STEC EXPERT GROUP, 2019</xref>). We identified a small number of <italic>stx</italic>-negative <italic>E. coli</italic> isolates that were grouped into STEC clusters with the corresponding STEC serotypes and STs. Whether these <italic>stx</italic>-negative <italic>E.coli</italic> isolates lost <italic>stx</italic>-containing prophages or were the progenitors of STEC remains unknown. It may also be possible that only a subset of isolates within those STs were <italic>stx</italic> positive due to recent acquisition of <italic>stx</italic>. However, human infections caused by <italic>stx</italic>-negative isolates with typical STEC serotypes have been reported previously (<xref ref-type="bibr" rid="B7">Bielaszewska et&#xa0;al., 2007</xref>; <xref ref-type="bibr" rid="B65">Mora et&#xa0;al., 2012</xref>; <xref ref-type="bibr" rid="B21">Ferdous et&#xa0;al., 2015</xref>). STECFinder will predict these typical STEC serotypes based on cluster/serotype-specific gene markers even if <italic>stx</italic> is absent. It should be noted that STECFinder does not make determination whether a given isolate is an STEC as its key utility is to predict predefined STEC clusters and serotypes. The presence and identity of <italic>stx</italic> genes is also reported to allow the user to make their own determination.</p>
<p>Our analysis found some minor clusters as well as STs contain both <italic>stx</italic> negative and <italic>stx</italic> positive isolates with <italic>stx</italic> negative isolates being of other <italic>E. coli</italic> pathotypes, which suggests that the STEC within those clusters and STs may be hybrid pathogens. Such hybrids have been recognised in recent years including the well-known STEC/EAEC (enteroaggregative <italic>E. coli</italic>) hybrid O104:H4 (ST678) and STEC/UPEC (uropathogenic <italic>E. coli</italic>) hybrid O2:H6 (ST141) (<xref ref-type="bibr" rid="B69">Navarro-Garcia, 2014</xref>; <xref ref-type="bibr" rid="B24">Gati et&#xa0;al., 2019</xref>). Therefore, for STEC clusters, serotypes or STs that carry isolates with different pathogenicity, a note of caution on the use of STECFinder is required as such clusters identified may not uniquely contain STEC pathogens. More data are needed to determine how many serotypes or STs carry different pathotypes and STECFinder does not attempt to determine other or hybrid pathogenic types. Determining whether an isolate is a hybrid pathogen is often difficult as some pathogenic types were not well defined by the presence of virulence genes.</p>
<p>Serotyping provides valuable information on identification of potential pathogenic STEC (<xref ref-type="bibr" rid="B33">Gyles, 2007</xref>; <xref ref-type="bibr" rid="B92">World Health Organization, 2019</xref>). Current serotyping methods focus on well-known O157 and &#x201c;Big six&#x201d; non-O157 serogroups which account for about 70% of STEC infections. There are many challenges for the detection of other non-O157:H7 serotypes which cause the remaining 30% of STEC infections (<xref ref-type="bibr" rid="B18">DebRoy et&#xa0;al., 2011</xref>; <xref ref-type="bibr" rid="B71">Norman et&#xa0;al., 2012</xref>; <xref ref-type="bibr" rid="B101">Zweifel et&#xa0;al., 2013</xref>; <xref ref-type="bibr" rid="B83">Smith et&#xa0;al., 2014</xref>). In addition, not all STEC can be serotyped <italic>in silico</italic> or predicted based on O or H type genes from genome sequencing data (<xref ref-type="bibr" rid="B40">Joensen et&#xa0;al., 2015</xref>; <xref ref-type="bibr" rid="B27">Gonz&#xe1;lez-Escalona and Kase, 2019</xref>). STECFinder can accurately predict STEC serotypes including those most frequently associated with foodborne outbreaks and severe disease. STECFinder can also accurately predict other non-&#x201d;Big six&#x201d; non-O157:H7 STEC serotypes. This could be beneficial for identification of the most frequent STEC serotypes for early diagnosis and for clinical management and will better inform the genomic surveillance of STEC serotypes.</p>
<p>We verified the serotype of STEC isolates predicted using STECFinder by phylogenetic cluster assignment and shared STs with STEC isolates of known serotypes. Compared with the existing pipeline for <italic>E. coli in silico</italic> serotyping, SerotypeFinder (<xref ref-type="bibr" rid="B40">Joensen et&#xa0;al., 2015</xref>), cluster/serotype-specific gene markers based STECFinder can eliminate the majority of uncertain antigen type calls and provides more accurate STEC serotyping within predicted clusters. STECFinder will be useful for epidemiological and diagnostic investigations as well as providing an alternative <italic>in silico</italic> STEC typing method.</p>
<p>We were unable to validate 43 of the 229 minor cluster-specific gene marker sets as these minor clusters had few isolates and once isolates were included in the identification dataset, no isolates remained for validation. Therefore, markers for these 43 minor clusters are tentative and require future validation when more genomes become available. Genes specific to each of these STEC minor clusters were also based on very small number of genomes and should be used with caution. However, since these minor clusters are rarely isolated, they have relatively little effect on the overall applicability of the cluster-specific gene marker sets to STEC typing.</p>
<p>Culture-independent approaches such as shotgun metagenomic analysis may be used for detection of contaminating STEC directly from food samples or enriched food samples (<xref ref-type="bibr" rid="B50">Leonard et&#xa0;al., 2015</xref>; <xref ref-type="bibr" rid="B14">Buytaers et&#xa0;al., 2020</xref>). However, it is difficult to determine STEC serotype from food or faecal samples directly as O and H antigen genes cannot uniquely identify a STEC serotype in a mixed sample. We showed that the cluster/serotype-specific gene marker sets of interest were detected in the spiked food samples by STECfinder using shotgun metagenomic sequencing reads from the study of <xref ref-type="bibr" rid="B14">Buytaers et al. (2020)</xref>. Our cluster or serotype specific genes provide proxy markers to identify these serotypes in original or non-pure culture samples. These gene markers could be adapted for metagenomics based diagnosis and culture independent typing, facilitating rapid identification of known STEC clusters and serotypes.</p>
</sec>
<sec id="s5">
<title>Conclusion</title>
<p>This study analysed 41,101 publicly available genomes of STEC isolates and identified 19 major and 229 minor STEC clusters. Specific gene marker sets for the 19 major and 229 minor clusters were identified and found to be valuable for <italic>in silico</italic> typing. We also identified serotype specific markers for the top 10 non-O157:H7 STEC serotypes. These markers can be used as proxy markers to identify the serotypes. We additionally developed STECFinder, a freely available <italic>in silico</italic> serotyping pipeline incorporating the cluster/serotype specific gene markers to facilitate serotyping of STEC isolates using genome sequences with high specificity and sensitivity. The STECFinder pipeline was tested on published metagenomics samples to determine the serotype of known STECs and the results show that cluster and serotype specific markers have potential for culture independent STEC serotyping.</p>
</sec>
<sec id="s6" sec-type="data-availability">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/<xref ref-type="supplementary-material" rid="SF1">
<bold>Supplementary Material</bold>
</xref>. Further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="s7" sec-type="author-contributions">
<title>Author Contributions</title>
<p>RL designed the study. XZ, MP, and SK performed the bioinformatic analysis. XZ, MP, and RL analysed the results. XZ drafted the manuscript. MP and RL provided critical revision of the manuscript. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec id="s8" sec-type="COI-statement">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="s9" sec-type="disclaimer">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<ack>
<title>Acknowledgments</title>
<p>The authors thank Duncan Smith and Robin Heron from UNSW Research Technology Services for high performance computing assistance. This work was funded in part by a National Health and Medical Research Council project grant (grant number 1129713) and an Australian Research Council Discovery Grant (DP170101917).</p>
</ack>
<sec id="s10" sec-type="supplementary-material">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fcimb.2021.772574/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fcimb.2021.772574/full#supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="Image_1.tif" id="SF1" mimetype="image/tiff">
<label>Supplementary Figure&#xa0;S1</label>
<caption>
<p>Identification phylogenetic tree. The identification phylogenetic tree was constructed using Quicktree v1.3 as <xref ref-type="fig" rid="f3">
<bold>Figure&#xa0;3</bold>
</xref> and was visualised using ITOL v5 which allowed bootstrap values to be displayed by colouring the internal nodes. The scalar bar represents 0.01 substitutions per site. STEC (Shiga toxin producing <italic>E. coli</italic>) clusters are colored per cluster legend and shown as the ring. The internal branches are colored to represent the bootstrap values per colour legend with green and red indicating the maximum (1) and minimum bootstrap values (0). Each cluster is supported by bootstrap value of 80% or greater. ECOLI is <italic>E. coli</italic>. EIEC is Enteroinvasive <italic>E. coli</italic>. MC is STEC minor clusters.</p>
</caption>
</supplementary-material>
<supplementary-material xlink:href="Image_2.tiff" id="SF2" mimetype="image/tiff">
<label>Supplementary Figure S2</label>
<caption>
<p>The representative validation phylogenetic tree.</p>
</caption>
</supplementary-material>
<supplementary-material xlink:href="Image_2.tiff" id="SF3" mimetype="image/tiff">
<label>Supplementary Figure S2-A</label>
<caption>
<p>The 38,534 STEC (Shiga toxin-producing <italic>E. coli</italic>) validation isolates were divided randomly into fifteen subsets and each subset of the isolates were combined with the identification dataset to construct a phylogenetic tree (tree 1 to 15) using Quicktree v1.3 and visualised using Grapetree&#x2019;s to assign isolates in validation dataset to clusters. The representative tree (tree 1) is shown in detail as an example and all the others are similar. The scalar bar represents 0.02 substitutions per site. Known STEC clusters from identification dataset are coloured. Numbers in square brackets after cluster name are the number of isolates of each identified cluster. Isolates in validation dataset (valdb) are coloured white. An &#x201c;validation&#x201d; isolate was assigned to a STEC cluster if the isolate was found within a branch that exclusively contained identification dataset isolates from that cluster and that branch had a bootstrap support value of 80% or greater. ECOLI is <italic>E. coli</italic>. EIEC is Enteroinvasive <italic>E. coli</italic>. MC is STEC minor clusters.</p>
</caption>
</supplementary-material>
<supplementary-material xlink:href="Image_3.tiff" id="SF4" mimetype="image/tiff">
<label>Supplementary Figure S2-B</label>
<caption>
<p>The same phylogenetic tree as Figure S2-A was visualised using ITOL v5 which allowed bootstrap values to be displayed by colouring the internal nodes. The scalar bar represents 0.01substitutions per site. STEC (Shiga toxin-producing <italic>E. coli</italic>) clusters are coloured per cluster legend and shown as the ring. The internal branches are coloured to represent the bootstrap values per colour legend with green and red indicating the maximum (1) and minimum bootstrap values (0). Each cluster is supported by bootstrap value of 80% or greater. ECOLI is <italic>E. coli</italic>. EIEC is Enteroinvasive <italic>E. coli</italic>. MC is STEC minor clusters.</p>
</caption>
</supplementary-material>
<supplementary-material xlink:href="Table_1.xlsx" id="ST1" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet"/>
<supplementary-material xlink:href="Table_2.xlsx" id="ST2" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet"/>
<supplementary-material xlink:href="Table_3.xlsx" id="ST3" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet"/>
<supplementary-material xlink:href="Table_4.xlsx" id="ST4" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet"/>
<supplementary-material xlink:href="Table_5.xlsx" id="ST5" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet"/>
<supplementary-material xlink:href="Table_6.xlsx" id="ST6" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet"/>
</sec>
<sec id="s11">
<title>Abbreviations</title>
<p>STEC, Shiga toxin-producing <italic>Escherichia coli</italic>; HC, haemorrhagic colitis; HUS, haemolytic uraemic syndrome; ESRD, end-stage renal disease; Stx, Shiga toxin; LEE, locus of enterocyte effacement; EIEC, enteroinvasive <italic>E. coli</italic>; MLST, multi-locus sequence typing; rSTs, ribosomal MLST STs; TP, true positives; TPR, true positive rate; TN, true negatives; TNR, true negative rate; FN, false negatives; FP, false positives.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bai</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Fu</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Fan</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Xu</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Sun</surname> <given-names>H.</given-names>
</name>
<etal/>
</person-group>. (<year>2018</year>). <article-title>Identification and Pathogenomic Analysis of an <italic>Escherichia coli</italic> Strain Producing a Novel Shiga Toxin 2 Subtype</article-title>. <source>Sci. Rep.</source> <volume>8</volume> (<issue>1</issue>), <fpage>6756</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s41598-018-25233-x</pub-id>
</citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bankevich</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Nurk</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Antipov</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Gurevich</surname> <given-names>A. A.</given-names>
</name>
<name>
<surname>Dvorkin</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Kulikov</surname> <given-names>A. S.</given-names>
</name>
<etal/>
</person-group>. (<year>2012</year>). <article-title>SPAdes: A New Genome Assembly Algorithm and its Applications to Single-Cell Sequencing</article-title>. <source>J. Comput. Biol.</source> <volume>19</volume> (<issue>5</issue>), <fpage>455</fpage>&#x2013;<lpage>477</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1089/cmb.2012.0021</pub-id>
</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Beghain</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Bridier-Nahmias</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Le Nagard</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Denamur</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Clermont</surname> <given-names>O.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>ClermonTyping: An Easy-to-Use and Accurate <italic>in Silico</italic> Method for <italic>Escherichia</italic> Genus Strain Phylotyping</article-title>. <source>Microb. Genom</source> <volume>4</volume> (<issue>7</issue>). doi:&#xa0;<pub-id pub-id-type="doi">10.1099/mgen.0.000192</pub-id>
</citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>B&#xe9;langer</surname> <given-names>S. D.</given-names>
</name>
<name>
<surname>Boissinot</surname> <given-names>M.</given-names>
</name>
<name>
<surname>M&#xe9;nard</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Picard</surname> <given-names>F. J.</given-names>
</name>
<name>
<surname>Bergeron</surname> <given-names>M. G.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>Rapid Detection of Shiga Toxin-Producing Bacteria in Feces by Multiplex PCR With Molecular Beacons on the Smart Cycler</article-title>. <source>J. Clin. Microbiol.</source> <volume>40</volume> (<issue>4</issue>), <fpage>1436</fpage>&#x2013;<lpage>1440</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/jcm.40.4.1436-1440.2002</pub-id>
</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bettelheim</surname> <given-names>K. A.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>Role of non-O157 VTEC</article-title>. <source>Symp Ser. Soc. Appl. Microbiol.</source> <volume>29)</volume>, <fpage>38s</fpage>&#x2013;<lpage>50s</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/j.1365-2672.2000.tb05331.x</pub-id>
</citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Beutin</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Strauch</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Fischer</surname> <given-names>I.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Isolation of <italic>Shigella sonnei</italic> Lysogenic for a Bacteriophage Encoding Gene for Production of Shiga Toxin</article-title>. <source>Lancet (London England)</source> <volume>353</volume> (<issue>9163</issue>), <fpage>1498</fpage>&#x2013;<lpage>1498</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/S0140-6736(99)00961-7</pub-id>
</citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bielaszewska</surname> <given-names>M.</given-names>
</name>
<name>
<surname>K&#xf6;ck</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Friedrich</surname> <given-names>A. W.</given-names>
</name>
<name>
<surname>von Eiff</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Zimmerhackl</surname> <given-names>L. B.</given-names>
</name>
<name>
<surname>Karch</surname> <given-names>H.</given-names>
</name>
<etal/>
</person-group>. (<year>2007</year>). <article-title>Shiga Toxin-Mediated Hemolytic Uremic Syndrome: Time to Change the Diagnostic Paradigm</article-title>? <source>PloS One</source> <volume>2</volume> (<issue>10</issue>), <fpage>e1024</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1371/journal.pone.0001024</pub-id>
</citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bosilevac</surname> <given-names>J. M.</given-names>
</name>
<name>
<surname>Koohmaraie</surname> <given-names>M.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Prevalence and Characterization of non-O157 Shiga Toxin-Producing <italic>Escherichia coli</italic> Isolates From Commercial Ground Beef in the United States</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>77</volume> (<issue>6</issue>), <fpage>2103</fpage>&#x2013;<lpage>2112</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/aem.02833-10</pub-id>
</citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Brandal</surname> <given-names>L. T.</given-names>
</name>
<name>
<surname>Tunsj&#xf8;</surname> <given-names>H. S.</given-names>
</name>
<name>
<surname>Ranheim</surname> <given-names>T. E.</given-names>
</name>
<name>
<surname>L&#xf8;bersli</surname> <given-names>I.</given-names>
</name>
<name>
<surname>Lange</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Wester</surname> <given-names>A. L.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Shiga Toxin 2a in <italic>Escherichia albertii</italic>
</article-title>. <source>J. Clin. Microbiol.</source> <volume>53</volume> (<issue>4</issue>), <fpage>1454</fpage>&#x2013;<lpage>1455</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/jcm.03378-14</pub-id>
</citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Brian</surname> <given-names>M. J.</given-names>
</name>
<name>
<surname>Frosolono</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Murray</surname> <given-names>B. E.</given-names>
</name>
<name>
<surname>Miranda</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Lopez</surname> <given-names>E. L.</given-names>
</name>
<name>
<surname>Gomez</surname> <given-names>H. F.</given-names>
</name>
<etal/>
</person-group>. (<year>1992</year>). <article-title>Polymerase Chain Reaction for Diagnosis of Enterohemorrhagic <italic>Escherichia coli</italic> Infection and Hemolytic-Uremic Syndrome</article-title>. <source>J. Clin. Microbiol.</source> <volume>30</volume> (<issue>7</issue>), <fpage>1801</fpage>&#x2013;<lpage>1806</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/jcm.30.7.1801-1806.1992</pub-id>
</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Brooks</surname> <given-names>J. T.</given-names>
</name>
<name>
<surname>Sowers</surname> <given-names>E. G.</given-names>
</name>
<name>
<surname>Wells</surname> <given-names>J. G.</given-names>
</name>
<name>
<surname>Greene</surname> <given-names>K. D.</given-names>
</name>
<name>
<surname>Griffin</surname> <given-names>P. M.</given-names>
</name>
<name>
<surname>Hoekstra</surname> <given-names>R. M.</given-names>
</name>
<etal/>
</person-group>. (<year>2005</year>). <article-title>Non-O157 Shiga Toxin-Producing <italic>Escherichia coli</italic> Infections in the United State</article-title>
<article-title>-2002</article-title>. <source>J. Infect. Dis.</source> <volume>192</volume> (<issue>8</issue>), <fpage>1422</fpage>&#x2013;<lpage>1429</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1086/466536</pub-id>
</citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bryan</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Youngster</surname> <given-names>I.</given-names>
</name>
<name>
<surname>McAdam</surname> <given-names>A. J.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Shiga Toxin Producing <italic>Escherichia coli</italic>
</article-title>. <source>Clin. Lab. Med.</source> <volume>35</volume> (<issue>2</issue>), <fpage>247</fpage>&#x2013;<lpage>272</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.cll.2015.02.004</pub-id>
</citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Buvens</surname> <given-names>G.</given-names>
</name>
<name>
<surname>De Gheldre</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Dediste</surname> <given-names>A.</given-names>
</name>
<name>
<surname>de Moreau</surname> <given-names>A. I.</given-names>
</name>
<name>
<surname>Mascart</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Simon</surname> <given-names>A.</given-names>
</name>
<etal/>
</person-group>. (<year>2012</year>). <article-title>Incidence and Virulence Determinants of Verocytotoxin-Producing <italic>Escherichia coli</italic> Infections in the Brussels-Capital Region, Belgium, in 2008-2010</article-title>. <source>J. Clin. Microbiol.</source> <volume>50</volume> (<issue>4</issue>), <fpage>1336</fpage>&#x2013;<lpage>1345</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/jcm.05317-11</pub-id>
</citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Buytaers</surname> <given-names>F. E.</given-names>
</name>
<name>
<surname>Saltykova</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Denayer</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Verhaegen</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Vanneste</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Roosens</surname> <given-names>N. H. C.</given-names>
</name>
<etal/>
</person-group>. (<year>2020</year>). <article-title>A Practical Method to Implement Strain-Level Metagenomics-Based Foodborne Outbreak Investigation and Source Tracking in Routine</article-title>. <source>Microorganisms</source> <volume>8</volume> (<issue>8</issue>), <fpage>1191</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.3390/microorganisms8081191</pub-id>
</citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Camacho</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Coulouris</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Avagyan</surname> <given-names>V.</given-names>
</name>
<name>
<surname>Ma</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Papadopoulos</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Bealer</surname> <given-names>K.</given-names>
</name>
<etal/>
</person-group>. (<year>2009</year>). <article-title>BLAST+: Architecture and Applications</article-title>. <source>BMC Bioinf.</source> <volume>10</volume>:<elocation-id>421</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/1471-2105-10-421</pub-id>
</citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Clausen</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Aarestrup</surname> <given-names>F. M.</given-names>
</name>
<name>
<surname>Lund</surname> <given-names>O.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Rapid and Precise Alignment of Raw Reads Against Redundant Databases With KMA</article-title>. <source>BMC Bioinf.</source> <volume>19</volume> (<issue>1</issue>), <fpage>307</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/s12859-018-2336-6</pub-id>
</citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>DebRoy</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Roberts</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Kundrat</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Davis</surname> <given-names>M. A.</given-names>
</name>
<name>
<surname>Briggs</surname> <given-names>C. E.</given-names>
</name>
<name>
<surname>Fratamico</surname> <given-names>P. M.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Detection of <italic>Escherichia coli</italic> Serogroups O26 and O113 by PCR Amplification of the <italic>wzx</italic> and <italic>wzy</italic> Genes</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>70</volume> (<issue>3</issue>), <fpage>1830</fpage>&#x2013;<lpage>1832</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/aem.70.3.1830-1832.2004</pub-id>
</citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>DebRoy</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Roberts</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Valadez</surname> <given-names>A. M.</given-names>
</name>
<name>
<surname>Dudley</surname> <given-names>E. G.</given-names>
</name>
<name>
<surname>Cutter</surname> <given-names>C. N.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Detection of Shiga Toxin-Producing <italic>Escherichia coli</italic> O26, O45, O103, O111, O113, O121, O145, and O157 Serogroups by Multiplex Polymerase Chain Reaction of the <italic>wzx</italic> Gene of the O-Antigen Gene Cluster</article-title>. <source>Foodborne Pathog. Dis.</source> <volume>8</volume> (<issue>5</issue>), <fpage>651</fpage>&#x2013;<lpage>652</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1089/fpd.2010.0769</pub-id>
</citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<collab>European Food Safety Authority, E.C.f.D.P.C</collab>
</person-group> (<year>2011</year>). <article-title>The European Union Summary Report on Trends and Sources of Zoonoses, Zoonotic Agents and Food-Borne Outbreaks in 2009</article-title>. <source>EFSA Journal: Eur. Food Standards Agency</source> <volume>9</volume> (<issue>3</issue>), <elocation-id>2090</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.2903/j.efsa.2011.2090</pub-id>
</citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Feng</surname> <given-names>P. C.</given-names>
</name>
<name>
<surname>Reddy</surname> <given-names>S.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Prevalences of Shiga Toxin Subtypes and Selected Other Virulence Factors Among Shiga-Toxigenic <italic>Escherichia coli</italic> Strains Isolated From Fresh Produce</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>79</volume> (<issue>22</issue>), <fpage>6917</fpage>&#x2013;<lpage>6923</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/aem.02455-13</pub-id>
</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ferdous</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Zhou</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Mellmann</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Morabito</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Croughs</surname> <given-names>P. D.</given-names>
</name>
<name>
<surname>de Boer</surname> <given-names>R. F.</given-names>
</name>
<etal/>
</person-group>. (<year>2015</year>). <article-title>Is Shiga Toxin-Negative <italic>Escherichia coli</italic> O157:H7 Enteropathogenic or Enterohemorrhagic <italic>Escherichia coli</italic>? Comprehensive Molecular Analysis Using Whole-Genome Sequencing</article-title>. <source>J. Clin. Microbiol.</source> <volume>53</volume> (<issue>11</issue>), <fpage>3530</fpage>&#x2013;<lpage>3538</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/jcm.01899-15</pub-id>
</citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Frank</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Faber</surname> <given-names>M. S.</given-names>
</name>
<name>
<surname>Askar</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Bernard</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Fruth</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Gilsdorf</surname> <given-names>A.</given-names>
</name>
<etal/>
</person-group>. (<year>2011</year>a). <article-title>Large and Ongoing Outbreak of Haemolytic Uraemic Syndrome, Germany, May 2011</article-title>. <source>Euro Surveill</source> <volume>16</volume> (<issue>21</issue>), <fpage>19878</fpage>. doi: <pub-id pub-id-type="doi">10.2807/ese.16.21.19878-en</pub-id>
</citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Frank</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Werber</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Cramer</surname> <given-names>J. P.</given-names>
</name>
<name>
<surname>Askar</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Faber</surname> <given-names>M.</given-names>
</name>
<name>
<surname>an der Heiden</surname> <given-names>M.</given-names>
</name>
<etal/>
</person-group>. (<year>2011</year>b). <article-title>Epidemic Profile of Shiga-Toxin-Producing <italic>Escherichia coli</italic> O104:H4 Outbreak in Germany</article-title>. <source>N Engl. J. Med.</source> <volume>365</volume> (<issue>19</issue>), <fpage>1771</fpage>&#x2013;<lpage>1780</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1056/NEJMoa1106483</pub-id>
</citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gati</surname> <given-names>N. S.</given-names>
</name>
<name>
<surname>Middendorf-Bauchart</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Bletz</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Dobrindt</surname> <given-names>U.</given-names>
</name>
<name>
<surname>Mellmann</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Origin and Evolution of Hybrid Shiga Toxin-Producing and Uropathogenic <italic>Escherichia coli</italic> Strains of Sequence Type 141</article-title>. <source>J. Clin. Microbiol.</source> <volume>58</volume> (<issue>1</issue>), <page-range>e01309&#x2013;19</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/jcm.01309-19</pub-id>
</citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gerner-Smidt</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Hise</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Kincaid</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Hunter</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Rolando</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Hyyti&#xe4;-Trees</surname> <given-names>E.</given-names>
</name>
<etal/>
</person-group>. (<year>2006</year>). <article-title>PulseNet USA: A Five-Year Update</article-title>. <source>Foodborne Pathog. Dis.</source> <volume>3</volume> (<issue>1</issue>), <fpage>9</fpage>&#x2013;<lpage>19</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1089/fpd.2006.3.9</pub-id>
</citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gonzales</surname> <given-names>T. K.</given-names>
</name>
<name>
<surname>Kulow</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Park</surname> <given-names>D. J.</given-names>
</name>
<name>
<surname>Kaspar</surname> <given-names>C. W.</given-names>
</name>
<name>
<surname>Anklam</surname> <given-names>K. S.</given-names>
</name>
<name>
<surname>Pertzborn</surname> <given-names>K. M.</given-names>
</name>
<etal/>
</person-group>. (<year>2011</year>). <article-title>A High-Throughput Open-Array qPCR Gene Panel to Identify, Virulotype, and Subtype O157 and Non-O157 Enterohemorrhagic <italic>Escherichia coli</italic>
</article-title>. <source>Mol. Cell Probes</source> <volume>25</volume> (<issue>5-6</issue>), <fpage>222</fpage>&#x2013;<lpage>230</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.mcp.2011.08.004</pub-id>
</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gonz&#xe1;lez-Escalona</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Kase</surname> <given-names>J. A.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Virulence Gene Profiles and Phylogeny of Shiga Toxin-Positive <italic>Escherichia coli</italic> Strains Isolated From FDA Regulated Foods During 2010-2017</article-title>. <source>PloS One</source> <volume>14</volume> (<issue>4</issue>), <fpage>e0214620</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1371/journal.pone.0214620</pub-id>
</citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gould</surname> <given-names>L. H.</given-names>
</name>
<name>
<surname>Demma</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Jones</surname> <given-names>T. F.</given-names>
</name>
<name>
<surname>Hurd</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Vugia</surname> <given-names>D. J.</given-names>
</name>
<name>
<surname>Smith</surname> <given-names>K.</given-names>
</name>
<etal/>
</person-group>. (<year>2009</year>). <article-title>Hemolytic Uremic Syndrome and Death in Persons With <italic>Escherichia coli</italic> O157:H7 Infection, Foodborne Diseases Active Surveillance Network Site</article-title>
<article-title>-2006</article-title>. <source>Clin. Infect. Dis.</source> <volume>49</volume> (<issue>10</issue>), <fpage>1480</fpage>&#x2013;<lpage>1485</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1086/644621</pub-id>
</citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gray</surname> <given-names>M. D.</given-names>
</name>
<name>
<surname>Lampel</surname> <given-names>K. A.</given-names>
</name>
<name>
<surname>Strockbine</surname> <given-names>N. A.</given-names>
</name>
<name>
<surname>Fernandez</surname> <given-names>R. E.</given-names>
</name>
<name>
<surname>Melton-Celsa</surname> <given-names>A. R.</given-names>
</name>
<name>
<surname>Maurelli</surname> <given-names>A. T.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Clinical Isolates of Shiga Toxin 1a-Producing <italic>Shigella flexneri</italic> With an Epidemiological Link to Recent Travel to Hispa&#xf1;iola</article-title>. <source>Emerg. Infect. Dis.</source> <volume>20</volume> (<issue>10</issue>), <fpage>1669</fpage>&#x2013;<lpage>1677</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.3201/eid2010.140292</pub-id>
</citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<collab>GROUP, F.W.S.E</collab>
</person-group> (<year>2019</year>). <article-title>Hazard Identification and Characterization: Criteria for Categorizing Shiga Toxin-Producing <italic>Escherichia coli</italic> on a Risk Basis(&#x2020;)</article-title>. <source>J. Food Protectionv</source> <volume>82</volume> (<issue>1</issue>), <fpage>7</fpage>&#x2013;<lpage>21</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.4315/0362-028x.Jfp-18-291</pub-id>
</citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gupta</surname> <given-names>S. K.</given-names>
</name>
<name>
<surname>Strockbine</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Omondi</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Hise</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Fair</surname> <given-names>M. A.</given-names>
</name>
<name>
<surname>Mintz</surname> <given-names>E.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Emergence of Shiga Toxin 1 Genes Within <italic>Shigella dysenteriae</italic> Type 4 Isolates From Travelers Returning From the Island of Hispa&#xf1;ola</article-title>. <source>Am. J. Trop. Med. Hyg</source> <volume>76</volume> (<issue>6</issue>), <fpage>1163</fpage>&#x2013;<lpage>1165</lpage>. doi: <pub-id pub-id-type="doi">10.4269/ajtmh.2007.76.1163</pub-id>
</citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gurevich</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Saveliev</surname> <given-names>V.</given-names>
</name>
<name>
<surname>Vyahhi</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Tesler</surname> <given-names>G.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>QUAST: Quality Assessment Tool for Genome Assemblies</article-title>. <source>Bioinformatics</source> <volume>29</volume> (<issue>8</issue>), <fpage>1072</fpage>&#x2013;<lpage>1075</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/btt086</pub-id>
</citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gyles</surname> <given-names>C. L.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Shiga Toxin-Producing <italic>Escherichia coli</italic>: An Overview</article-title>. <source>J. Anim. Sci.</source> <volume>85</volume> (<supplement>13 Suppl</supplement>), <fpage>E45</fpage>&#x2013;<lpage>E62</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.2527/jas.2006-508</pub-id>
</citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hara-Kudo</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Nemoto</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Ohtsuka</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Segawa</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Takatori</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Kojima</surname> <given-names>T.</given-names>
</name>
<etal/>
</person-group>. (<year>2007</year>). <article-title>Sensitive and Rapid Detection of Vero Toxin-Producing <italic>Escherichia coli</italic> Using Loop-Mediated Isothermal Amplification</article-title>. <source>J. Med. Microbiol.</source> <volume>56</volume> (<issue>Pt 3</issue>), <fpage>398</fpage>&#x2013;<lpage>406</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1099/jmm.0.46819-0</pub-id>
</citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hedican</surname> <given-names>E. B.</given-names>
</name>
<name>
<surname>Medus</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Besser</surname> <given-names>J. M.</given-names>
</name>
<name>
<surname>Juni</surname> <given-names>B. A.</given-names>
</name>
<name>
<surname>Koziol</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Taylor</surname> <given-names>C.</given-names>
</name>
<etal/>
</person-group>. (<year>2009</year>). <article-title>Characteristics of O157 Versus Non-O157 Shiga Toxin-Producing <italic>Escherichia coli</italic> Infections in Minnesot</article-title>
<article-title>-2006</article-title>. <source>Clin. Infect. Dis.</source> <volume>49</volume> (<issue>3</issue>), <fpage>358</fpage>&#x2013;<lpage>364</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1086/600302</pub-id>
</citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hu</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Reeves</surname> <given-names>P. R.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Living Trees: High-Quality Reproducible and Reusable Construction of Bacterial Phylogenetic Trees</article-title>. <source>Mol. Biol. Evol.</source> <volume>37</volume> (<issue>2</issue>), <fpage>563</fpage>&#x2013;<lpage>575</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/molbev/msz241</pub-id>
</citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Iguchi</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Iyoda</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Seto</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Morita-Ishihara</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Scheutz</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Ohnishi</surname> <given-names>M.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>
<italic>Escherichia coli</italic> O-Genotyping PCR: A Comprehensive and Practical Platform for Molecular O Serogrouping</article-title>. <source>J. Clin. Microbiol.</source> <volume>53</volume> (<issue>8</issue>), <fpage>2427</fpage>&#x2013;<lpage>2432</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/jcm.00321-15</pub-id>
</citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Inouye</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Dashnow</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Raven</surname> <given-names>L. A.</given-names>
</name>
<name>
<surname>Schultz</surname> <given-names>M. B.</given-names>
</name>
<name>
<surname>Pope</surname> <given-names>B. J.</given-names>
</name>
<name>
<surname>Tomita</surname> <given-names>T.</given-names>
</name>
<etal/>
</person-group>. (<year>2014</year>). <article-title>SRST2: Rapid Genomic Surveillance for Public Health and Hospital Microbiology Labs</article-title>. <source>Genome Med.</source> <volume>6</volume> (<issue>11</issue>):<elocation-id>90</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/s13073-014-0090-6</pub-id>
</citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Joensen</surname> <given-names>K. G.</given-names>
</name>
<name>
<surname>Scheutz</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Lund</surname> <given-names>O.</given-names>
</name>
<name>
<surname>Hasman</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Kaas</surname> <given-names>R. S.</given-names>
</name>
<name>
<surname>Nielsen</surname> <given-names>E. M.</given-names>
</name>
<etal/>
</person-group>. (<year>2014</year>). <article-title>Real-Time Whole-Genome Sequencing for Routine Typing, Surveillance, and Outbreak Detection of Verotoxigenic <italic>Escherichia coli</italic>
</article-title>. <source>J.&#xa0;Clin. Microbiol.</source> <volume>52</volume> (<issue>5</issue>), <fpage>1501</fpage>&#x2013;<lpage>1510</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/jcm.03617-13</pub-id>
</citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Joensen</surname> <given-names>K. G.</given-names>
</name>
<name>
<surname>Tetzschner</surname> <given-names>A. M.</given-names>
</name>
<name>
<surname>Iguchi</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Aarestrup</surname> <given-names>F. M.</given-names>
</name>
<name>
<surname>Scheutz</surname> <given-names>F.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Rapid and Easy <italic>In Silico</italic> Serotyping of <italic>Escherichia coli</italic> Isolates by Use of Whole-Genome Sequencing Data</article-title>. <source>J. Clin. Microbiol.</source> <volume>53</volume> (<issue>8</issue>), <fpage>2410</fpage>&#x2013;<lpage>2426</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/jcm.00008-15</pub-id>
</citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Johnson</surname> <given-names>R. P.</given-names>
</name>
<name>
<surname>Clarke</surname> <given-names>R. C.</given-names>
</name>
<name>
<surname>Wilson</surname> <given-names>J. B.</given-names>
</name>
<name>
<surname>Read</surname> <given-names>S. C.</given-names>
</name>
<name>
<surname>Rahn</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Renwick</surname> <given-names>S. A.</given-names>
</name>
<etal/>
</person-group>. (<year>1996</year>). <article-title>Growing Concerns and Recent Outbreaks Involving Non-O157:H7 Serotypes of Verotoxigenic <italic>Escherichia coli</italic>
</article-title>. <source>J. Food Prot</source> <volume>59</volume> (<issue>10</issue>), <fpage>1112</fpage>&#x2013;<lpage>1122</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.4315/0362-028x-59.10.1112</pub-id>
</citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Johnson</surname> <given-names>K. E.</given-names>
</name>
<name>
<surname>Thorpe</surname> <given-names>C. M.</given-names>
</name>
<name>
<surname>Sears</surname> <given-names>C. L.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>The Emerging Clinical Importance of non-O157 Shiga Toxin-Producing <italic>Escherichia coli</italic>
</article-title>. <source>Clin. Infect. Dis.</source> <volume>43</volume> (<issue>12</issue>), <fpage>1587</fpage>&#x2013;<lpage>1595</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1086/509573</pub-id>
</citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jolley</surname> <given-names>K. A.</given-names>
</name>
<name>
<surname>Maiden</surname> <given-names>M. C.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>BIGSdb: Scalable Analysis of Bacterial Genome Variation at the Population Level</article-title>. <source>BMC Bioinf.</source> <volume>11</volume>, <elocation-id>595</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/1471-2105-11-595</pub-id>
</citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ju</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Cao</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Rump</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Strain</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Luo</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Timme</surname> <given-names>R.</given-names>
</name>
<etal/>
</person-group>. (<year>2012</year>). <article-title>Phylogenetic Analysis of non-O157 Shiga Toxin-Producing <italic>Escherichia coli</italic> Strains by Whole-Genome Sequencing</article-title>. <source>J. Clin. Microbiol.</source> <volume>50</volume> (<issue>12</issue>), <fpage>4123</fpage>&#x2013;<lpage>4127</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/jcm.02262-12</pub-id>
</citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kaper</surname> <given-names>J. B.</given-names>
</name>
<name>
<surname>Nataro</surname> <given-names>J. P.</given-names>
</name>
<name>
<surname>Mobley</surname> <given-names>H. L.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Pathogenic <italic>Escherichia coli</italic>
</article-title>. <source>Nat. Rev. Microbiol.</source> <volume>2</volume> (<issue>2</issue>), <fpage>123</fpage>&#x2013;<lpage>140</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/nrmicro818</pub-id>
</citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>K&#xe4;ppeli</surname> <given-names>U.</given-names>
</name>
<name>
<surname>H&#xe4;chler</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Giezendanner</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Beutin</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Stephan</surname> <given-names>R.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Human Infections With non-O157 Shiga Toxin-Producing <italic>Escherichia coli</italic>, Switzerlan</article-title>
<article-title>-2009</article-title>. <source>Emerg. Infect. Dis.</source> <volume>17</volume> (<issue>2</issue>), <fpage>180</fpage>&#x2013;<lpage>185</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.3201/eid1702.100909</pub-id>
</citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kr&#xfc;ger</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Lucchesi</surname> <given-names>P. M.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Shiga Toxins and Stx Phages: Highly Diverse Entities</article-title>. <source>Microbiol. (Reading)</source> <volume>161</volume> (<issue>Pt 3</issue>), <fpage>451</fpage>&#x2013;<lpage>462</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1099/mic.0.000003</pub-id>
</citation>
</ref>
<ref id="B48">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lacher</surname> <given-names>D. W.</given-names>
</name>
<name>
<surname>Gangiredla</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Patel</surname> <given-names>I.</given-names>
</name>
<name>
<surname>Elkins</surname> <given-names>C. A.</given-names>
</name>
<name>
<surname>Feng</surname> <given-names>P. C.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Use of the <italic>Escherichia coli</italic> Identification Microarray for Characterizing the Health Risks of Shiga Toxin-Producing <italic>Escherichia coli</italic> Isolated From Foods</article-title>. <source>J. Food Prot</source> <volume>79</volume> (<issue>10</issue>), <fpage>1656</fpage>&#x2013;<lpage>1662</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.4315/0362-028x.Jfp-16-176</pub-id>
</citation>
</ref>
<ref id="B49">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lentz</surname> <given-names>E. K.</given-names>
</name>
<name>
<surname>Leyva-Illades</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Lee</surname> <given-names>M. S.</given-names>
</name>
<name>
<surname>Cherla</surname> <given-names>R. P.</given-names>
</name>
<name>
<surname>Tesh</surname> <given-names>V. L.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Differential Response of the Human Renal Proximal Tubular Epithelial Cell Line HK-2 to Shiga Toxin Types 1 and 2</article-title>. <source>Infection Immun.</source> <volume>79</volume> (<issue>9</issue>), <fpage>3527</fpage>&#x2013;<lpage>3540</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/iai.05139-11</pub-id>
</citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Leonard</surname> <given-names>S. R.</given-names>
</name>
<name>
<surname>Mammel</surname> <given-names>M. K.</given-names>
</name>
<name>
<surname>Lacher</surname> <given-names>D. W.</given-names>
</name>
<name>
<surname>Elkins</surname> <given-names>C. A.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Application of Metagenomic Sequencing to Food Safety: Detection of Shiga Toxin-Producing <italic>Escherichia coli</italic> on Fresh Bagged Spinach</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>81</volume> (<issue>23</issue>), <fpage>8183</fpage>&#x2013;<lpage>8191</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/aem.02601-15</pub-id>
</citation>
</ref>
<ref id="B51">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Letunic</surname> <given-names>I.</given-names>
</name>
<name>
<surname>Bork</surname> <given-names>P.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Interactive Tree Of Life (iTOL) V4: Recent Updates and New Developments</article-title>. <source>Nucleic Acids Res.</source> <volume>47</volume> (<issue>W1</issue>), <fpage>W256</fpage>&#x2013;<lpage>w259</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkz239</pub-id>
</citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>W.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Multiplex Real-Time PCR Assay for Detection of <italic>Escherichia coli</italic> O157:H7 and Screening for Non-O157 Shiga Toxin-Producing <italic>E. coli</italic>
</article-title>. <source>BMC Microbiol.</source> <volume>17</volume> (<issue>1</issue>), <fpage>215</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/s12866-017-1123-2</pub-id>
</citation>
</ref>
<ref id="B53">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lin</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Nguyen</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Lee</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Clotilde</surname> <given-names>L. M.</given-names>
</name>
<name>
<surname>Kase</surname> <given-names>J. A.</given-names>
</name>
<name>
<surname>Son</surname> <given-names>I.</given-names>
</name>
<etal/>
</person-group>. (<year>2011</year>). <article-title>Rapid O Serogroup Identification of the Ten Most Clinically Relevant STECs by Luminex Microbead-Based Suspension Array</article-title>. <source>J. Microbiol. Methods</source> <volume>87</volume> (<issue>1</issue>), <fpage>105</fpage>&#x2013;<lpage>110</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.mimet.2011.07.019</pub-id>
</citation>
</ref>
<ref id="B54">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lipt&#xe1;kov&#xe1;</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Siegfried</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Kmetov&#xe1;</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Birosov&#xe1;</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Kotulov&#xe1;</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Benc&#xe1;tov&#xe1;</surname> <given-names>A.</given-names>
</name>
<etal/>
</person-group>. (<year>2005</year>). <article-title>Hemolytic Uremic Syndrome Caused by Verotoxin-Producing <italic>Escherichia coli</italic> O26. Case Report</article-title>. <source>Folia Microbiol. (Praha)</source> <volume>50</volume> (<issue>2</issue>), <fpage>95</fpage>&#x2013;<lpage>98</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/bf02931454</pub-id>
</citation>
</ref>
<ref id="B55">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Knirel</surname> <given-names>Y. A.</given-names>
</name>
<name>
<surname>Feng</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Perepelov</surname> <given-names>A. V.</given-names>
</name>
<name>
<surname>Senchenkova</surname> <given-names>S. N.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>Q.</given-names>
</name>
<etal/>
</person-group>. (<year>2008</year>). <article-title>Structure and Genetics of <italic>Shigella</italic> O Antigens</article-title>. <source>FEMS Microbiol. Rev.</source> <volume>32</volume> (<issue>4</issue>), <fpage>627</fpage>&#x2013;<lpage>653</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/j.1574-6976.2008.00114.x</pub-id>
</citation>
</ref>
<ref id="B56">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lozer</surname> <given-names>D. M.</given-names>
</name>
<name>
<surname>Souza</surname> <given-names>T. B.</given-names>
</name>
<name>
<surname>Monfardini</surname> <given-names>M. V.</given-names>
</name>
<name>
<surname>Vicentini</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Kitagawa</surname> <given-names>S. S.</given-names>
</name>
<name>
<surname>Scaletsky</surname> <given-names>I. C.</given-names>
</name>
<etal/>
</person-group>. (<year>2013</year>). <article-title>Genotypic and Phenotypic Analysis of Diarrheagenic <italic>Escherichia coli</italic> Strains Isolated From Brazilian Children Living in Low Socioeconomic Level Communities</article-title>. <source>BMC Infect. Dis.</source> <volume>13</volume>:<elocation-id>418</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/1471-2334-13-418</pub-id>
</citation>
</ref>
<ref id="B57">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ludwig</surname> <given-names>J. B.</given-names>
</name>
<name>
<surname>Shi</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Shridhar</surname> <given-names>P. B.</given-names>
</name>
<name>
<surname>Roberts</surname> <given-names>E. L.</given-names>
</name>
<name>
<surname>DebRoy</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Phebus</surname> <given-names>R. K.</given-names>
</name>
<etal/>
</person-group>. (<year>2020</year>). <article-title>Multiplex PCR Assays for the Detection of One Hundred and Thirty Seven Serogroups of Shiga Toxin-Producing <italic>Escherichia coli</italic> Associated With Cattle</article-title>. <source>Front. Cell Infect. Microbiol.</source> <volume>10</volume>, <elocation-id>378</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.3389/fcimb.2020.00378</pub-id>
</citation>
</ref>
<ref id="B58">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Majowicz</surname> <given-names>S. E.</given-names>
</name>
<name>
<surname>Scallan</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Jones-Bitton</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Sargeant</surname> <given-names>J. M.</given-names>
</name>
<name>
<surname>Stapleton</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Angulo</surname> <given-names>F. J.</given-names>
</name>
<etal/>
</person-group>. (<year>2014</year>). <article-title>Global Incidence of Human Shiga Toxin-Producing <italic>Escherichia coli</italic> Infections and Deaths: A Systematic Review and Knowledge Synthesis</article-title>. <source>Foodborne Pathog. Dis.</source> <volume>11</volume> (<issue>6</issue>), <fpage>447</fpage>&#x2013;<lpage>455</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1089/fpd.2013.1704</pub-id>
</citation>
</ref>
<ref id="B59">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>McCarthy</surname> <given-names>T. A.</given-names>
</name>
<name>
<surname>Barrett</surname> <given-names>N. L.</given-names>
</name>
<name>
<surname>Hadler</surname> <given-names>J. L.</given-names>
</name>
<name>
<surname>Salsbury</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Howard</surname> <given-names>R. T.</given-names>
</name>
<name>
<surname>Dingman</surname> <given-names>D. W.</given-names>
</name>
<etal/>
</person-group>. (<year>2001</year>). <article-title>Hemolytic-Uremic Syndrome and <italic>Escherichia coli</italic> O121 at a Lake in Connecticu</article-title>. <source>Pediatrics</source> <volume>108</volume> (<issue>4</issue>), <fpage>E59</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1542/peds.108.4.e59</pub-id>
</citation>
</ref>
<ref id="B60">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>McDaniel</surname> <given-names>T. K.</given-names>
</name>
<name>
<surname>Kaper</surname> <given-names>J. B.</given-names>
</name>
</person-group> (<year>1997</year>). <article-title>A Cloned Pathogenicity Island From Enteropathogenic <italic>Escherichia coli</italic> Confers the Attaching and Effacing Phenotype on E. Coli K-12</article-title>. <source>Mol. Microbiol.</source> <volume>23</volume> (<issue>2</issue>), <fpage>399</fpage>&#x2013;<lpage>407</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1046/j.1365-2958.1997.2311591.x</pub-id>
</citation>
</ref>
<ref id="B61">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Melton-Celsa</surname> <given-names>A. R.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Shiga Toxin (Stx) Classification, Structure, and Function</article-title>. <source>Microbiol. Spectr.</source> <volume>2</volume> (<issue>4</issue>), <fpage>Ehec</fpage>&#x2013;<lpage>0024-2013</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/microbiolspec.EHEC-0024-2013</pub-id>
</citation>
</ref>
<ref id="B62">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Milley</surname> <given-names>D. G.</given-names>
</name>
<name>
<surname>Sekla</surname> <given-names>L. H.</given-names>
</name>
</person-group> (<year>1993</year>). <article-title>An Enzyme-Linked Immunosorbent Assay-Based Isolation Procedure for Verotoxigenic <italic>Escherichia coli</italic>
</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>59</volume> (<issue>12</issue>), <fpage>4223</fpage>&#x2013;<lpage>4229</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/aem.59.12.4223-4229.1993</pub-id>
</citation>
</ref>
<ref id="B63">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mizutani</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Nakazono</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Sugino</surname> <given-names>Y.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>The So-Called Chromosomal Verotoxin Genes are Actually Carried by Defective Prophages</article-title>. <source>DNA Res.</source> <volume>6</volume> (<issue>2</issue>), <fpage>141</fpage>&#x2013;<lpage>143</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/dnares/6.2.141</pub-id>
</citation>
</ref>
<ref id="B64">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mora</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Herrrera</surname> <given-names>A.</given-names>
</name>
<name>
<surname>L&#xf3;pez</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Dahbi</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Mamani</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Pita</surname> <given-names>J. M.</given-names>
</name>
<etal/>
</person-group>. (<year>2011</year>). <article-title>Characteristics of the Shiga-Toxin-Producing Enteroaggregative <italic>Escherichia coli</italic> O104:H4 German Outbreak Strain and of STEC Strains Isolated in Spain</article-title>. <source>Int. Microbiol.</source> <volume>14</volume> (<issue>3</issue>), <fpage>121</fpage>&#x2013;<lpage>141</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.2436/20.1501.01.142</pub-id>
</citation>
</ref>
<ref id="B65">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mora</surname> <given-names>A.</given-names>
</name>
<name>
<surname>L&#xf3;pez</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Dhabi</surname> <given-names>G.</given-names>
</name>
<name>
<surname>L&#xf3;pez-Beceiro</surname> <given-names>A. M.</given-names>
</name>
<name>
<surname>Fidalgo</surname> <given-names>L. E.</given-names>
</name>
<name>
<surname>D&#xed;az</surname> <given-names>E. A.</given-names>
</name>
<etal/>
</person-group>. (<year>2012</year>). <article-title>Seropathotypes, Phylogroups, Stx Subtypes, and Intimin Types of Wildlife-Carried, Shiga Toxin-Producing <italic>Escherichia coli</italic> Strains With the Same Characteristics as Human-Pathogenic Isolates</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>78</volume> (<issue>8</issue>), <fpage>2578</fpage>&#x2013;<lpage>2585</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/aem.07520-11</pub-id>
</citation>
</ref>
<ref id="B66">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Morton</surname> <given-names>V.</given-names>
</name>
<name>
<surname>Cheng</surname> <given-names>J. M.</given-names>
</name>
<name>
<surname>Sharma</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Kearney</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Notes From the Field: An Outbreak of Shiga Toxin-Producing <italic>Escherichia coli</italic> O121 Infections Associated With Flour - Canad</article-title>
<article-title>-2017</article-title>. <source>MMWR Morb Mortal Wkly Rep.</source> <volume>66</volume> (<issue>26</issue>), <fpage>705</fpage>&#x2013;<lpage>706</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.15585/mmwr.mm6626a6</pub-id>
</citation>
</ref>
<ref id="B67">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Murakami</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Etoh</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Tanaka</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Ichihara</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Horikawa</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Kawano</surname> <given-names>K.</given-names>
</name>
<etal/>
</person-group>. (<year>2014</year>). <article-title>Shiga Toxin 2f-Producing <italic>Escherichia albertii</italic> From a Symptomatic Human</article-title>. <source>Jpn J. Infect. Dis.</source> <volume>67</volume> (<issue>3</issue>), <fpage>204</fpage>&#x2013;<lpage>208</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.7883/yoken.67.204</pub-id>
</citation>
</ref>
<ref id="B68">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nataro</surname> <given-names>J. P.</given-names>
</name>
<name>
<surname>Kaper</surname> <given-names>J. B.</given-names>
</name>
</person-group> (<year>1998</year>). <article-title>Diarrheagenic <italic>Escherichia coli</italic>
</article-title>. <source>Clin. Microbiol. Rev.</source> <volume>11</volume> (<issue>1</issue>), <fpage>142</fpage>&#x2013;<lpage>201</lpage>. doi: <pub-id pub-id-type="doi">10.1128/CMR.11.1.142</pub-id>
</citation>
</ref>
<ref id="B69">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Navarro-Garcia</surname> <given-names>F.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>
<italic>Escherichia coli</italic> O104:H4 Pathogenesis: An Enteroaggregative <italic>E. coli</italic>/Shiga Toxin-Producing <italic>E. coli</italic> Explosive Cocktail of High Virulence</article-title>. <source>Microbiol. Spectr.</source> <volume>2</volume> (<issue>6</issue>), <fpage>2</fpage>&#x2013;<lpage>6</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/microbiolspec.EHEC-0008-2013</pub-id>
</citation>
</ref>
<ref id="B70">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Needleman</surname> <given-names>S. B.</given-names>
</name>
<name>
<surname>Wunsch</surname> <given-names>C. D.</given-names>
</name>
</person-group> (<year>1970</year>). <article-title>A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins</article-title>. <source>J. Mol. Biol.</source> <volume>48</volume> (<issue>3</issue>), <fpage>443</fpage>&#x2013;<lpage>453</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/0022-2836(70)90057-4</pub-id>
</citation>
</ref>
<ref id="B71">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Norman</surname> <given-names>K. N.</given-names>
</name>
<name>
<surname>Strockbine</surname> <given-names>N. A.</given-names>
</name>
<name>
<surname>Bono</surname> <given-names>J. L.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Association of Nucleotide Polymorphisms Within the O-Antigen Gene Cluster of <italic>Escherichia coli</italic> O26, O45, O103, O111, O121, and O145 With Serogroups and Genetic Subtypes</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>78</volume> (<issue>18</issue>), <fpage>6689</fpage>&#x2013;<lpage>6703</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/aem.01259-12</pub-id>
</citation>
</ref>
<ref id="B72">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>O'Brien</surname> <given-names>A. D.</given-names>
</name>
<name>
<surname>Marques</surname> <given-names>L. R.</given-names>
</name>
<name>
<surname>Kerry</surname> <given-names>C. F.</given-names>
</name>
<name>
<surname>Newland</surname> <given-names>J. W.</given-names>
</name>
<name>
<surname>Holmes</surname> <given-names>R. K.</given-names>
</name>
</person-group> (<year>1989</year>). <article-title>Shiga-Like Toxin Converting Phage of Enterohemorrhagic <italic>Escherichia coli</italic> Strain 933</article-title>. <source>Microb. Pathog.</source> <volume>6</volume> (<issue>5</issue>), <fpage>381</fpage>&#x2013;<lpage>390</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/0882-4010(89)90080-6</pub-id>
</citation>
</ref>
<ref id="B73">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ooka</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Seto</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Kawano</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Kobayashi</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Etoh</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Ichihara</surname> <given-names>S.</given-names>
</name>
<etal/>
</person-group>. (<year>2012</year>). <article-title>Clinical Significance of <italic>Escherichia albertii</italic>
</article-title>. <source>Emerg. Infect. Dis.</source> <volume>18</volume> (<issue>3</issue>), <fpage>488</fpage>&#x2013;<lpage>492</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.3201/eid1803.111401</pub-id>
</citation>
</ref>
<ref id="B74">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Paciorek</surname> <given-names>J.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>Virulence Properties of <italic>Escherichia coli</italic> Faecal Strains Isolated in Poland From Healthy Children and Strains Belonging to Serogroups O18, O26, O44, O86, O126 and O127 Isolated From Children With Diarrhoea</article-title>. <source>J.&#xa0;Med. Microbiol.</source> <volume>51</volume> (<issue>7</issue>), <fpage>548</fpage>&#x2013;<lpage>571</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1099/0022-1317-51-7-548</pub-id>
</citation>
</ref>
<ref id="B75">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Page</surname> <given-names>A. J.</given-names>
</name>
<name>
<surname>Cummins</surname> <given-names>C. A.</given-names>
</name>
<name>
<surname>Hunt</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Wong</surname> <given-names>V. K.</given-names>
</name>
<name>
<surname>Reuter</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Holden</surname> <given-names>M. T.</given-names>
</name>
<etal/>
</person-group>. (<year>2015</year>). <article-title>Roary: Rapid Large-Scale Prokaryote Pan Genome Analysis</article-title>. <source>Bioinformatics</source> <volume>31</volume> (<issue>22</issue>), <fpage>3691</fpage>&#x2013;<lpage>3693</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/btv421</pub-id>
</citation>
</ref>
<ref id="B76">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Parsons</surname> <given-names>B. D.</given-names>
</name>
<name>
<surname>Zelyas</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Berenger</surname> <given-names>B. M.</given-names>
</name>
<name>
<surname>Chui</surname> <given-names>L.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Detection, Characterization, and Typing of Shiga Toxin-Producing <italic>Escherichia coli</italic>
</article-title>. <source>Front. Microbiol.</source> <volume>7</volume>, <elocation-id>478</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.3389/fmicb.2016.00478</pub-id>
</citation>
</ref>
<ref id="B77">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Paton</surname> <given-names>J. C.</given-names>
</name>
<name>
<surname>Paton</surname> <given-names>A. W.</given-names>
</name>
</person-group> (<year>1998</year>). <article-title>Pathogenesis and Diagnosis of Shiga Toxin-Producing <italic>Escherichia coli</italic> Infections</article-title>. <source>Clin. Microbiol. Rev.</source> <volume>11</volume> (<issue>3</issue>), <fpage>450</fpage>&#x2013;<lpage>479</lpage>. doi: <pub-id pub-id-type="doi">10.1128/CMR.11.3.450</pub-id>
</citation>
</ref>
<ref id="B78">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Paton</surname> <given-names>A. W.</given-names>
</name>
<name>
<surname>Woodrow</surname> <given-names>M. C.</given-names>
</name>
<name>
<surname>Doyle</surname> <given-names>R. M.</given-names>
</name>
<name>
<surname>Lanser</surname> <given-names>J. A.</given-names>
</name>
<name>
<surname>Paton</surname> <given-names>J. C.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Molecular Characterization of a Shiga Toxigenic <italic>Escherichia coli</italic> O113:H21 Strain Lacking Eae Responsible for a Cluster of Cases of Hemolytic-Uremic Syndrome</article-title>. <source>J. Clin. Microbiol.</source> <volume>37</volume> (<issue>10</issue>), <fpage>3357</fpage>&#x2013;<lpage>3361</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/jcm.37.10.3357-3361.1999</pub-id>
</citation>
</ref>
<ref id="B79">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Patro</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Duggal</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Love</surname> <given-names>M. I.</given-names>
</name>
<name>
<surname>Irizarry</surname> <given-names>R. A.</given-names>
</name>
<name>
<surname>Kingsford</surname> <given-names>C.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Salmon Provides Fast and Bias-Aware Quantification of Transcript Expression</article-title>. <source>Nat. Methods</source> <volume>14</volume> (<issue>4</issue>), <fpage>417</fpage>&#x2013;<lpage>419</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/nmeth.4197</pub-id>
</citation>
</ref>
<ref id="B80">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Qin</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Klein</surname> <given-names>E. J.</given-names>
</name>
<name>
<surname>Galanakis</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Thomas</surname> <given-names>A. A.</given-names>
</name>
<name>
<surname>Stapp</surname> <given-names>J. R.</given-names>
</name>
<name>
<surname>Rich</surname> <given-names>S.</given-names>
</name>
<etal/>
</person-group>. (<year>2015</year>). <article-title>Real-Time PCR Assay for Detection and Differentiation of Shiga Toxin-Producing <italic>Escherichia coli</italic> From Clinical Samples</article-title>. <source>J. Clin. Microbiol.</source> <volume>53</volume> (<issue>7</issue>), <fpage>2148</fpage>&#x2013;<lpage>2153</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/jcm.00115-15</pub-id>
</citation>
</ref>
<ref id="B81">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Scheutz</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Teel</surname> <given-names>L. D.</given-names>
</name>
<name>
<surname>Beutin</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Pi&#xe9;rard</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Buvens</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Karch</surname> <given-names>H.</given-names>
</name>
<etal/>
</person-group>. (<year>2012</year>). <article-title>Multicenter Evaluation of a Sequence-Based Protocol for Subtyping Shiga Toxins and Standardizing Stx Nomenclature</article-title>. <source>J. Clin. Microbiol.</source> <volume>50</volume> (<issue>9</issue>), <fpage>2951</fpage>&#x2013;<lpage>2963</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/jcm.00860-12</pub-id>
</citation>
</ref>
<ref id="B82">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Seemann</surname> <given-names>T.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Prokka: Rapid Prokaryotic Genome Annotation</article-title>. <source>Bioinformatics</source> <volume>30</volume> (<issue>14</issue>), <fpage>2068</fpage>&#x2013;<lpage>2069</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/btu153</pub-id>
</citation>
</ref>
<ref id="B83">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Smith</surname> <given-names>J. L.</given-names>
</name>
<name>
<surname>Fratamico</surname> <given-names>P. M.</given-names>
</name>
<name>
<surname>Gunther</surname> <given-names>N.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Shiga Toxin-Producing <italic>Escherichia coli</italic>
</article-title>. <source>Adv. Appl. Microbiol.</source> <volume>86</volume>, <fpage>145</fpage>&#x2013;<lpage>197</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/b978-0-12-800262-9.00003-2</pub-id>
</citation>
</ref>
<ref id="B84">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Stigi</surname> <given-names>K. A.</given-names>
</name>
<name>
<surname>Macdonald</surname> <given-names>J. K.</given-names>
</name>
<name>
<surname>Tellez-Marfin</surname> <given-names>A. A.</given-names>
</name>
<name>
<surname>Lofy</surname> <given-names>K. H.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Laboratory Practices and Incidence of non-O157 Shiga Toxin-Producing <italic>Escherichia coli</italic> Infections</article-title>. <source>Emerg. Infect. Dis.</source> <volume>18</volume> (<issue>3</issue>), <fpage>477</fpage>&#x2013;<lpage>479</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.3201/eid1803.111358</pub-id>
</citation>
</ref>
<ref id="B85">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tarr</surname> <given-names>P. I.</given-names>
</name>
<name>
<surname>Gordon</surname> <given-names>C. A.</given-names>
</name>
<name>
<surname>Chandler</surname> <given-names>W. L.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Shiga-Toxin-Producing <italic>Escherichia coli</italic> and Haemolytic Uraemic Syndrome</article-title>. <source>Lancet</source> <volume>365</volume> (<issue>9464</issue>), <fpage>1073</fpage>&#x2013;<lpage>1086</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/s0140-6736(05)71144-2</pub-id>
</citation>
</ref>
<ref id="B86">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Teel</surname> <given-names>L. D.</given-names>
</name>
<name>
<surname>Daly</surname> <given-names>J. A.</given-names>
</name>
<name>
<surname>Jerris</surname> <given-names>R. C.</given-names>
</name>
<name>
<surname>Maul</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Svanas</surname> <given-names>G.</given-names>
</name>
<name>
<surname>O'Brien</surname> <given-names>A. D.</given-names>
</name>
<etal/>
</person-group>. (<year>2007</year>). <article-title>Rapid Detection of Shiga Toxin-Producing <italic>Escherichia coli</italic> by Optical Immunoassay</article-title>. <source>J. Clin. Microbiol.</source> <volume>45</volume> (<issue>10</issue>), <fpage>3377</fpage>&#x2013;<lpage>3380</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/jcm.00837-07</pub-id>
</citation>
</ref>
<ref id="B87">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Teunis</surname> <given-names>P. F.</given-names>
</name>
<name>
<surname>Ogden</surname> <given-names>I. D.</given-names>
</name>
<name>
<surname>Strachan</surname> <given-names>N. J.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Hierarchical Dose Response of <italic>E. coli</italic> O157:H7 From Human Outbreaks Incorporating Heterogeneity in Exposure</article-title>. <source>Epidemiol. Infect.</source> <volume>136</volume> (<issue>6</issue>), <fpage>761</fpage>&#x2013;<lpage>770</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1017/s0950268807008771</pub-id>
</citation>
</ref>
<ref id="B88">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tuttle</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Gomez</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Doyle</surname> <given-names>M. P.</given-names>
</name>
<name>
<surname>Wells</surname> <given-names>J. G.</given-names>
</name>
<name>
<surname>Zhao</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Tauxe</surname> <given-names>R. V.</given-names>
</name>
<etal/>
</person-group>. (<year>1999</year>). <article-title>Lessons From a Large Outbreak of <italic>Escherichia coli</italic> O157:H7 Infections: Insights Into the Infectious Dose and Method of Widespread Contamination of Hamburger Patties</article-title>. <source>Epidemiol. Infect.</source> <volume>122</volume> (<issue>2</issue>), <fpage>185</fpage>&#x2013;<lpage>192</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1017/s0950268898001976</pub-id>
</citation>
</ref>
<ref id="B89">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Valilis</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Ramsey</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Sidiq</surname> <given-names>S.</given-names>
</name>
<name>
<surname>DuPont</surname> <given-names>H. L.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Non-O157 Shiga Toxin-Producing <italic>Escherichia coli</italic>-A Poorly Appreciated Enteric Pathogen: Systematic Review</article-title>. <source>Int. J. Infect. Dis.</source> <volume>76</volume>, <fpage>82</fpage>&#x2013;<lpage>87</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.ijid.2018.09.002</pub-id>
</citation>
</ref>
<ref id="B90">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Verstraete</surname> <given-names>K.</given-names>
</name>
<name>
<surname>De Reu</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Van Weyenberg</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Pi&#xe9;rard</surname> <given-names>D.</given-names>
</name>
<name>
<surname>De Zutter</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Herman</surname> <given-names>L.</given-names>
</name>
<etal/>
</person-group>. (<year>2013</year>). <article-title>Genetic Characteristics of Shiga Toxin-Producing <italic>E. coli</italic> O157, O26, O103, O111 and O145 Isolates From Humans, Food, and Cattle in Belgium</article-title>. <source>Epidemiol. Infect.</source> <volume>141</volume> (<issue>12</issue>), <fpage>2503</fpage>&#x2013;<lpage>2515</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1017/s0950268813000307</pub-id>.</citation>
</ref>
<ref id="B91">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wood</surname> <given-names>D. E.</given-names>
</name>
<name>
<surname>Salzberg</surname> <given-names>S. L.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Kraken: Ultrafast Metagenomic Sequence Classification Using Exact Alignments</article-title>. <source>Genome Biol.</source> <volume>15</volume> (<issue>3</issue>), <fpage>R46</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/gb-2014-15-3-r46</pub-id>
</citation>
</ref>
<ref id="B92">
<citation citation-type="book">
<person-group person-group-type="author">
<collab>World Health Organization</collab>
</person-group> (<year>2019</year>). <source>Shiga Toxin-Producing Escherichia coli (STEC) and Food: Attribution Characterization and Monitoring</source>. (<publisher-loc>Rome</publisher-loc>: <publisher-name>World Health Organization</publisher-name>).</citation>
</ref>
<ref id="B93">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Bai</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Sun</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Fu</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Fan</surname> <given-names>R.</given-names>
</name>
<etal/>
</person-group>. (<year>2020</year>). <article-title>
<italic>Escherichia coli</italic> Strains Producing a Novel Shiga Toxin 2 Subtype Circulate in China</article-title>. <source>Int. J. Med. Microbiol.</source> <volume>310</volume> (<issue>1</issue>):<elocation-id>151377</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.ijmm.2019.151377</pub-id>
</citation>
</ref>
<ref id="B94">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Bielaszewska</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Bauwens</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Fruth</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Mellmann</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Karch</surname> <given-names>H.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Real-Time Multiplex PCR for Detecting Shiga Toxin 2-Producing <italic>Escherichia coli</italic> O104:H4 in Human Stools</article-title>. <source>J. Clin. Microbiol.</source> <volume>50</volume> (<issue>5</issue>), <fpage>1752</fpage>&#x2013;<lpage>1754</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/jcm.06817-11</pub-id>
</citation>
</ref>
<ref id="B95">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Liao</surname> <given-names>Y. T.</given-names>
</name>
<name>
<surname>Sun</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Wu</surname> <given-names>V. C. H.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Is Shiga Toxin-Producing <italic>Escherichia coli</italic> O45 No Longer a Food Safety Threat? The Danger is Still Out There</article-title>. <source>Microorganisms</source> <volume>8</volume> (<issue>5</issue>). doi:&#xa0;<pub-id pub-id-type="doi">10.3390/microorganisms8050782</pub-id>
</citation>
</ref>
<ref id="B96">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Mellmann</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Sonntag</surname> <given-names>A. K.</given-names>
</name>
<name>
<surname>Wieler</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Bielaszewska</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Tsch&#xe4;pe</surname> <given-names>H.</given-names>
</name>
<etal/>
</person-group>. (<year>2007</year>). <article-title>Structural and Functional Differences Between Disease-Associated Genes of Enterohaemorrhagic <italic>Escherichia coli</italic> O111</article-title>. <source>Int. J. Med. Microbiol.</source> <volume>297</volume> (<issue>1</issue>), <fpage>17</fpage>&#x2013;<lpage>26</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.ijmm.2006.10.004</pub-id>
</citation>
</ref>
<ref id="B97">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Payne</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Lan</surname> <given-names>R.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>
<italic>In Silico</italic> Identification of Serovar-Specific Genes for <italic>Salmonella</italic> Serotyping</article-title>. <source>Front. Microbiol.</source> <volume>10</volume>:<elocation-id>835</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.3389/fmicb.2019.00835</pub-id>
</citation>
</ref>
<ref id="B98">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Payne</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Nguyen</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Kaur</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Lan</surname> <given-names>R.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Cluster-Specific Gene Markers Enhance <italic>Shigella</italic> and Enteroinvasive <italic>Escherichia coli In Silico</italic> Serotyping</article-title>. <source>Microb. Genom.</source> <volume>7</volume> (<issue>12</issue>). doi:&#xa0;<pub-id pub-id-type="doi">10.1099/mgen.0.000704</pub-id>. 2001.2030.428723</citation>
</ref>
<ref id="B99">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhou</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Alikhan</surname> <given-names>N. F.</given-names>
</name>
<name>
<surname>Mohamed</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Fan</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Achtman</surname> <given-names>M.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>The EnteroBase User's Guide, With Case Studies on <italic>Salmonella</italic> Transmissions, Yersinia Pestis Phylogeny, and <italic>Escherichia</italic> Core Genomic Diversity</article-title>. <source>Genome Res.</source> <volume>30</volume> (<issue>1</issue>), <fpage>138</fpage>&#x2013;<lpage>152</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1101/gr.251678.119</pub-id>
</citation>
</ref>
<ref id="B100">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhou</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Alikhan</surname> <given-names>N. F.</given-names>
</name>
<name>
<surname>Sergeant</surname> <given-names>M. J.</given-names>
</name>
<name>
<surname>Luhmann</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Vaz</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Francisco</surname> <given-names>A. P.</given-names>
</name>
<etal/>
</person-group>. (<year>2018</year>). <article-title>GrapeTree: Visualization of Core Genomic Relationships Among 100,000 Bacterial Pathogens</article-title>. <source>Genome Res.</source> <volume>28</volume> (<issue>9</issue>), <fpage>1395</fpage>&#x2013;<lpage>1404</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1101/gr.232397.117</pub-id>
</citation>
</ref>
<ref id="B101">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zweifel</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Cernela</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Stephan</surname> <given-names>R.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Detection of the Emerging Shiga Toxin-Producing <italic>Escherichia coli</italic> O26:H11/H- Sequence Type 29 (ST29) Clone in Human Patients and Healthy Cattle in Switzerland</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>79</volume> (<issue>17</issue>), <fpage>5411</fpage>&#x2013;<lpage>5413</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/aem.01728-13</pub-id>
</citation>
</ref>
</ref-list>
</back>
</article>