<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Microbiol.</journal-id>
<journal-title>Frontiers in Microbiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Microbiol.</abbrev-journal-title>
<issn pub-type="epub">1664-302X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmicb.2017.02604</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Microbiology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Comprehensive Analysis Reveals Two Distinct Evolution Patterns of <italic>Salmonella</italic> Flagellin Gene Clusters</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Liu</surname> <given-names>Yue</given-names></name>
<xref ref-type="author-notes" rid="fn003"><sup>&#x02020;</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Zhang</surname> <given-names>Dao-Feng</given-names></name>
<xref ref-type="author-notes" rid="fn003"><sup>&#x02020;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/311048/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Zhou</surname> <given-names>Xiujuan</given-names></name>
</contrib>
<contrib contrib-type="author">
<name><surname>Xu</surname> <given-names>Li</given-names></name>
</contrib>
<contrib contrib-type="author">
<name><surname>Zhang</surname> <given-names>Lida</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/353353/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Shi</surname> <given-names>Xianming</given-names></name>
<xref ref-type="author-notes" rid="fn001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/312626/overview"/>
</contrib>
</contrib-group>
<aff><institution>MOST-USDA Joint Research Center for Food Safety, School of Agriculture and Biology, and State Key Laboratory of Microbial Metabolism, Shanghai Jiao Tong University</institution>, <addr-line>Shanghai</addr-line>, <country>China</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: John R. Battista, Louisiana State University, United States</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Xiaohui Zhou, University of Connecticut, United States; Sunil D. Saroj, Symbiosis International University, India</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Xianming Shi <email>xmshi&#x00040;sjtu.edu.cn</email></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Evolutionary and Genomic Microbiology, a section of the journal Frontiers in Microbiology</p></fn>
<fn fn-type="other" id="fn003"><p>&#x02020;These authors have contributed equally to this work.</p></fn></author-notes>
<pub-date pub-type="epub">
<day>22</day>
<month>12</month>
<year>2017</year>
</pub-date>
<pub-date pub-type="collection">
<year>2017</year>
</pub-date>
<volume>8</volume>
<elocation-id>2604</elocation-id>
<history>
<date date-type="received">
<day>01</day>
<month>09</month>
<year>2017</year>
</date>
<date date-type="accepted">
<day>14</day>
<month>12</month>
<year>2017</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2017 Liu, Zhang, Zhou, Xu, Zhang and Shi.</copyright-statement>
<copyright-year>2017</copyright-year>
<copyright-holder>Liu, Zhang, Zhou, Xu, Zhang and Shi</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract><p><italic>Salmonella</italic> is one of the primary causes of foodborne disease, especially <italic>Salmonella enterica</italic> subsp. <italic>enterica</italic> (I) which has caused &#x0007E;99% of clinical salmonellosis cases for humans and domestic mammals. The flagella genes, <italic>fliC</italic> and <italic>fljB</italic>, which encode the <italic>Salmonella</italic> phase 1 and phase 2 antigens respectively, are considered as the <italic>Salmonella</italic> serotype determinant genes, and contribute to the virulence of <italic>Salmonella</italic>. However, the evolution of the two flagellin genes is still not well-understood. In this study, the <italic>fliC</italic> and <italic>fljB</italic> gene clusters were analyzed among 205 <italic>S. enterica</italic> subspecies I genomes. The dataset covered 87 different serovars of <italic>S. enterica</italic> subsp. <italic>enterica</italic> and included 9 genomes (six serovars) of four other <italic>Salmonella</italic> subspecies. Based on a pan-genome definition and flanked gene linkages, the <italic>fliC</italic> and <italic>fljB</italic> gene clusters were identified in 207 (91 serovars) and 138 (61 serovars) genomes, respectively. A phylogenetic tree constructed based on SNPs (Single Nucleotide Polymorphisms) of core genes were used to reflect the essential evolutionary relationships among various serovars. Congruence analysis was performed among the core genome and each gene of <italic>fliC</italic> and <italic>fljB</italic> gene clusters, with only <italic>fliA</italic> and <italic>fliS</italic> showing congruence to <italic>Salmonella</italic> core genome. Congruence was also observed among <italic>fliB, fliC/fljB</italic>, and <italic>fliD</italic> genes, and their phylogeny revealed a division into two major groups, which strongly corresponded to monophasic and biphasic serovars. Besides, homologous recombination events referring <italic>fliB, fliC</italic>, and <italic>fliD</italic> were found to have mainly occurred within each group. These results suggested two distinct evolutionary patterns of <italic>Salmonella</italic> flagellin gene clusters. Further insight on the evolutionary implication of the two patterns and a framework for phase variation mechanism are needed to be further processed.</p></abstract>
<kwd-group>
<kwd><italic>Salmonella enterica</italic></kwd>
<kwd>flagellin genes</kwd>
<kwd><italic>fliC</italic></kwd>
<kwd><italic>fljB</italic></kwd>
<kwd>recombination</kwd>
</kwd-group>
<counts>
<fig-count count="6"/>
<table-count count="1"/>
<equation-count count="0"/>
<ref-count count="53"/>
<page-count count="12"/>
<word-count count="7806"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p><italic>Salmonella</italic> is one of the primary causes of foodborne illness around the world, leading to a huge burden of illness calculated at nearly 93.8 million global cases annually (Majowicz et al., <xref ref-type="bibr" rid="B24">2010</xref>). For understanding the epidemiology of these microorganisms and the investigation of outbreaks caused by <italic>Salmonella</italic>, serotyping has been greatly facilitated by the characterization of isolates. The <italic>Salmonella</italic> genus is composed of two species, <italic>Salmonella bongori</italic> and <italic>Salmonella enterica</italic>, the latter being further classified into six subspecies, <italic>enterica</italic> (I), <italic>salamae</italic> (II), <italic>arizonae</italic> (IIIa), <italic>diarizonae</italic> (IIIb), <italic>houtenae</italic> (IV), and <italic>indica</italic> (VI) (Grimont and Weill, <xref ref-type="bibr" rid="B13">2007</xref>). <italic>Salmonella</italic> is an extremely diverse species, of which more than 2,500 serovars has been reported. <italic>S. enterica</italic> subsp. <italic>enterica</italic> (I) strains, the vast majority of <italic>Salmonella</italic> strains, could be isolated from humans and warm-blooded animals, while all the other subspecies and <italic>S. bongori</italic> are more frequently isolated from cold-blooded animals (Nataro et al., <xref ref-type="bibr" rid="B33">2011</xref>). Importantly, <italic>S. enterica</italic> subsp. <italic>enterica</italic> (I) has caused &#x0007E;99% of diagnosed <italic>salmonellosis</italic> cases for humans and domestic mammals (Centers for Disease Control and Prevention (CDC), <xref ref-type="bibr" rid="B6">2011</xref>). Strains with different serotypes show different levels of variation, a single host (e.g., <italic>S</italic>. Typhi) being adapted by monomorphic serotypes comprise a single strain in a single lineage, whilst polymorphic serotypes comprise multiple lineages of strains which may or may not be host adapted (e.g., <italic>S</italic>. Choleraesuis) (Wain and O&#x00027;Grady, <xref ref-type="bibr" rid="B47">2017</xref>).</p>
<p>Based on White-Kauffmann scheme, the <italic>Salmonella</italic> strains have been identified by their surface antigens, on the basis of somatic (O), flagellar (H), and capsular (Vi) antigens (Brenner et al., <xref ref-type="bibr" rid="B5">2000</xref>) and their reactivity to antisera (Schrader et al., <xref ref-type="bibr" rid="B39">2008</xref>). Flagella (<italic>fliC</italic> or <italic>fljB</italic>) protein is the primary globular protein forming the filament of flagellum in the majority of <italic>Salmonella</italic> strains, which functions as a main structural subunit (Haiko and Westerlund-Wikstrom, <xref ref-type="bibr" rid="B14">2013</xref>). These two flagellin genes in most <italic>Salmonella</italic> strains, <italic>fliC</italic> and <italic>fljB</italic>, encode the phase 1 and phase 2 flagellins, respectively. The <italic>fliC</italic> gene is existent in all <italic>Salmonella</italic> cells and has homologs in other enteric bacteria, located in one of the flagellar biosynthesis operons (Macnab, <xref ref-type="bibr" rid="B23">1992</xref>). The <italic>fljB</italic> gene is unique to <italic>S. enterica</italic> and is present in four of the <italic>S. enterica</italic> subspecies (subspecies I, II, IIIb, and VI), separately located on the chromosome from the <italic>fliC</italic> gene (McQuiston et al., <xref ref-type="bibr" rid="B29">2004</xref>). However, these two flagellin genes cannot be expressed simultaneously. The second flagellin locus is considered as a genetic &#x0201C;spare tyre&#x0201D; availed in certain environmental circumstances (McQuiston et al., <xref ref-type="bibr" rid="B28">2008</xref>). <italic>Salmonella</italic> isolates are indicated biphasic, monophasic, or non-motile according to the &#x0201C;phase variation,&#x0201D; expressing both, one, or no flagellar phases, which is regulated by a switch mechanism, an invertible element <italic>hin</italic> (Bonifield and Hughes, <xref ref-type="bibr" rid="B4">2003</xref>). The <italic>fljB</italic> is considered as a duplication of <italic>fliC</italic> but is flanked with the genes, <italic>fljA</italic> and <italic>hin</italic> (Szekely and Simon, <xref ref-type="bibr" rid="B46">1983</xref>), involving in the switch mechanism, which appears to be specific to <italic>Salmonella</italic> (Zieg et al., <xref ref-type="bibr" rid="B52">1978</xref>). Recombination of these genes is the basic source of flagella antigenic variation (Smith et al., <xref ref-type="bibr" rid="B42">1990</xref>) and may occur between <italic>fliC</italic> of co-infecting organisms or <italic>fliC</italic> and <italic>fljB</italic> of the same isolate (Okazaki et al., <xref ref-type="bibr" rid="B34">1993</xref>). The function of flagellar genes have been studied well; the <italic>fliA</italic> existing upstream of the coding region is specific to promoters for flagellar operons (Ohnishi et al., <xref ref-type="bibr" rid="B37">1990</xref>), <italic>fliB</italic> is responsible for flagellar methylation (Parkhill et al., <xref ref-type="bibr" rid="B38">2001</xref>), <italic>fliC</italic> encodes the flagellar filament protein (Ogushi et al., <xref ref-type="bibr" rid="B36">2001</xref>), and <italic>fliD</italic> encodes the filament cap protein of the flagellar apparatus (Yokoseki et al., <xref ref-type="bibr" rid="B49">1995</xref>). The expression of <italic>fliS</italic> facilitates the export of flagellin through the flagellum-specific export pathway (Yokoseki et al., <xref ref-type="bibr" rid="B49">1995</xref>), whereas the <italic>fljB</italic> gene, which expresses phase II flagellin protein, constitutes an operon with the <italic>fljA</italic> gene, which encodes a negative regulator for <italic>fliC</italic> expression (Yamamoto and Kutsukake, <xref ref-type="bibr" rid="B48">2006</xref>).</p>
<p>The flagellin genes are useful as a model system for studying genetic differentiation (Okazaki et al., <xref ref-type="bibr" rid="B34">1993</xref>). Previous study has revealed that the 5&#x02032; and 3&#x02032; ends of the <italic>fliC</italic> and <italic>fljB</italic> gene sequences from different antigenic types are highly conserved, but the central region sequences can be highly variable (Okazaki et al., <xref ref-type="bibr" rid="B34">1993</xref>). A molecular serotyping for the H antigens by targeting the genes encoding the immunologically recognized antigen could be linked to traditional serotyping methods (McQuiston et al., <xref ref-type="bibr" rid="B30">2011</xref>). Based on phylogenetic relationships adapted from previous studies, the acquisition of phase variation, the biphasic <italic>Salmonella</italic> flourished (McQuiston et al., <xref ref-type="bibr" rid="B28">2008</xref>). However, this conclusion was drawn from analysis of limited numbers of samples and the basic evolutionary process of this apparently high level of surface-antigen diversity in the <italic>Salmonella</italic> is not well-understood (McQuiston et al., <xref ref-type="bibr" rid="B28">2008</xref>). There is also a lack of data on the detailed phylogenetic description of the flagellar gene relationship among the <italic>Salmonella</italic>, as a framework for evolutionary studies. Therefore, it is worthwhile to seek full evolutionary insights and the evolutionary relationships of the <italic>Salmonella</italic> flagellin alleles to estimate the recombination within these loci. So, in this study, comparative genomics was performed to obtain additional evolutionary information regarding flagellin genes, which showed that there are two distinct evolutionary patterns of <italic>fliC</italic> and <italic>fljB</italic> clusters during the speciation and serovar diversification of <italic>Salmonella</italic>.</p>
</sec>
<sec sec-type="materials and methods" id="s2">
<title>Materials and methods</title>
<sec>
<title>Data collection</title>
<p>A total of 214 <italic>Salmonella</italic> complete genomes and genome assemblies of highest quality (Table <xref ref-type="supplementary-material" rid="SM1">S1</xref>) were retrieved from the NCBI (National Center for Biotechnology Information, <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/">https://www.ncbi.nlm.nih.gov/</ext-link>). All the <italic>Salmonella</italic> genomes with annotations of different serotypes were downloaded as of May, 2016, including most of complete genomes as well as draft genomes of type strains or strains of some serovars for which a complete genome was not available. Five subspecies of <italic>S. enterica</italic> (I), <italic>salamae</italic> (II), <italic>arizonae</italic> (IIIa), <italic>diarizonae</italic> (IIIb), and <italic>houtenae</italic> (IV), and a total of 87 serovars of <italic>enterica</italic> (I), were under consideration. MLST (Multilocus Sequence Typing) (Achtman et al., <xref ref-type="bibr" rid="B1">2012</xref>), together with checking the original studies was implemented to identify their serovars. Housekeeping genes, <italic>thrA, purE, sucA, hisD, aroC, hemD</italic>, and <italic>dnaN</italic> were targeted for the MLST scheme (Achtman et al., <xref ref-type="bibr" rid="B1">2012</xref>). The nucleotide sequences were then aligned against seven housekeeping genes. Outcome of sequences were subsequently submitted to the MLST database (<ext-link ext-link-type="uri" xlink:href="http://mlst.warwick.ac.uk/mlst/dbs/Senterica">http://mlst.warwick.ac.uk/mlst/dbs/Senterica</ext-link>) and assigned existing or novel allele type numbers. The composite sequence STs (sequence types) were defined by the database based on the set of allelic profiles derived from each of the seven loci.</p>
</sec>
<sec>
<title>Determination of homologous groups</title>
<p>OrthoMCL (version 2.0.9) (Li et al., <xref ref-type="bibr" rid="B22">2003</xref>) was used to determine homologous clusters of the pan-genome for subsequent analyses. All extracted protein sequences were adjusted to a prescribed format and were grouped into homologous clusters based on sequence similarity. The BLAST reciprocal best hit algorithm (Moreno-Hagelsieb and Latimer, <xref ref-type="bibr" rid="B31">2008</xref>) was employed with 70% match cutoff, and Markov Cluster Algorithms (MCL) (Enright et al., <xref ref-type="bibr" rid="B10">2002</xref>) were applied with an inflation index of 1.5.</p>
</sec>
<sec>
<title>Phylogenetic analysis</title>
<p>To determine the phylogenetic relationships among <italic>Salmonella</italic> subspecies and different serovars, a phylogenetic tree was inferred from single nucleotide polymorphisms (SNPs) of core genes. According to the identification of homologous clusters, a total of 127,544 genes presented in single-copy orthologous genes shared by all 214 <italic>Salmonella</italic> strains were selected for phylogenetic analysis. For each orthologous cluster, nucleotide sequences were aligned and then concatenated. Custom Perl scripts were used to parse the concatenated sequence and extract SNPs. The organismal phylogeny of SNPs and other genes were inferred using the maximum likelihood (ML) algorithm in RAxML version 8.2.9 (Stamatakis, <xref ref-type="bibr" rid="B43">2014</xref>). The tree reconstruction is repeated 1,000 times, and the percentage of times each interior branch which is noted as bootstrap-value is given. Each tree of genes in the two flagellin gene clusters was constructed by MEGA (Kumar et al., <xref ref-type="bibr" rid="B20">2016</xref>), using Neighbor-Joining method with 1,000 bootstraps.</p>
</sec>
<sec>
<title>Identification of <italic>fliC</italic> and <italic>fljB</italic></title>
<p>To characterize the evolution pattern of flagellin genes, reference sequences of <italic>fliC, fljB</italic>, and the flanked genes, <italic>fliA, fliB, fliD, fliS, fljA</italic>, and <italic>hin</italic>, were extracted from the <italic>S</italic>. Typhimurium LT2 genome (GenBank accession number <ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="AE006468">AE006468</ext-link>) (McClelland et al., <xref ref-type="bibr" rid="B27">2001</xref>). The relevant homologous clusters in the pan-genome were determined by BLAST with 70% match cutoff and 50% coverage cutoff. In cases of shared homologous genes related to flagellin genes, phylogenetic trees were inferred by ML algorithms in RAxML version 8.2.9 (Stamatakis, <xref ref-type="bibr" rid="B43">2014</xref>). Together with the phylogenetic trees, the flanked gene contents were taken in account to confirm the gene loci of <italic>fliA, fliB, fliC, fliD, fliS, fljA, fljB</italic>, and <italic>hin</italic>.</p>
</sec>
<sec>
<title>Congruence analysis</title>
<p>PAUP version 4.0a152 (Swofford, <xref ref-type="bibr" rid="B45">1998</xref>) was used to conduct maximum-likelihood (ML) analysis of the congruence of gene trees, as described by Feil et al. (<xref ref-type="bibr" rid="B11">2001</xref>). The HKY85 model of DNA substitutions was used to estimate &#x003B1; parameter, and the transition/transversion (Ti/Tv) ratio, assuming gamma distribution. The difference between the log likelihood scores of the trees of two genes &#x00394;-lnL was calculated, generating from the comparison of one gene tree against another based on the 99th percentile of the distribution of log likelihood scores for 200 trees from random topology. If the &#x00394;-lnL values for the comparisons is less than that reference gene comparisons within the 99th percentile of this distribution, then these two gene trees should be considered significantly congruent (Octavia and Lan, <xref ref-type="bibr" rid="B35">2006</xref>).</p>
</sec>
<sec>
<title>Regression analysis for evolutionary synchronization</title>
<p>A key element of our study was to determine whether the evolution of flagella gene was associated with the evolutionary rate estimate. To test for this pattern we conducted a least squares linear regression for the genome-wide rate as a function of sampling distance. The MEGA version 7 (Kumar et al., <xref ref-type="bibr" rid="B20">2016</xref>) was used for pairwise distance calculating. A potential shortcoming of this analysis is that the data do not necessarily represent independent samples because some correspond to closely related lineages. This can be addressed by using phylogenetic independent contrasts, or phylogenetic generalized least squares. For the regression of the rate as a function of sampling evolutionary consensus the degree of association was estimated when fitting a regression for a ratio as a function of its denominator.</p>
</sec>
<sec>
<title>Recombination analysis</title>
<p>Potential recombination within <italic>fliB, fliC</italic>, and <italic>fliD</italic> genes was screened using seven methods (RDP, GENECONV, MaxChi, Bootscan, Chimera, SiScan, and 3Seq) implemented in the Recombination Detection Program version 4.88 (RDP4) (Martin et al., <xref ref-type="bibr" rid="B25">2015</xref>). The breakpoints were also defined by RDP4. Similarity between the recombinants and their possible major and minor parents was estimated using BootScan, embedded in RDP4 (Stamatakis, <xref ref-type="bibr" rid="B43">2014</xref>). The recombination event evaluated by RDP4 was considered significant if it satisfied at least two criteria when the <italic>P</italic>-value (<italic>p</italic>) &#x0003C; 0.05 and the RDP recombination consensus score (RDPRCS) was &#x0003E;0.4. When <italic>p</italic> &#x0003C; 0.05 and the RDPRCS was between 0.4 and 0.6, the recombination event was possible. An RDPRCS &#x0003C; 0.4 with <italic>p</italic> &#x0003C; 0.05 indicated the rejection of the recombination event (Martin et al., <xref ref-type="bibr" rid="B25">2015</xref>).</p>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec>
<title>Core genome tree revealed detailed phylogenetic relationships among different serovars</title>
<p>A total of 214 <italic>Salmonella</italic> genomes were analyzed in this study, consisting of 205 <italic>S. enterica</italic> subsp. <italic>enterica</italic> genomes and 9 genomes of other four <italic>S. enterica</italic> subspecies, which covered 87 serovars. More than one genome (maximum twelve) was included for some famous serovars (e.g. <italic>S</italic>. Enteritidis and <italic>S</italic>. Typhi), while only one genome was considered for most other serovars because no more genomes were available. More detailed information on these genomes was summarized in Table <xref ref-type="supplementary-material" rid="SM1">S1</xref>. After homologous gene clustering, the total 947,793 proteins were grouped into 14,583 homologous clusters, including common genes represented by 596 single-copy groups of 712 core homologous clusters. A total of 57,946 SNPs in the &#x0201C;core&#x0201D; regions were identified that were present and aligned with high confidence in all taxa. These SNPs were used to construct the phylogeny (Figure <xref ref-type="fig" rid="F1">1</xref>).</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>Phylogenetic tree of <italic>Salmonella enterica</italic>. Maximum Likelihood phylogenetic tree based on 57,946 SNPs of 596 single copy core genes of 214 genomes. Bootstrap values (&#x0003E;70%) are presented on the branches. The inner dots indicated the presence of <italic>fliC</italic> gene clusters, while the outer dots indicated <italic>fljB</italic> gene clusters. The bootstrap values near the tips of the trees were omitted. The other four subspecies of <italic>S</italic>. <italic>salamae</italic> (II), <italic>arizonae</italic> (IIIa), <italic>diarizonae</italic> (IIIb), and <italic>houtenae</italic> (IV) were far distant from subspecies <italic>enterica</italic> (I) and omitted.</p></caption>
<graphic xlink:href="fmicb-08-02604-g0001.tif"/>
</fig>
<p>The role of the phylogenetic tree based on SNPs in genome variation and evolution has been documented previously (Filliol et al., <xref ref-type="bibr" rid="B12">2006</xref>; Holt et al., <xref ref-type="bibr" rid="B16">2008</xref>). The core genome tree can support the major genetic group phylogeny and reveal much more phylogenetic detail, including the divergence of major phylogenetic groups. Moreover, the core genome tree also revealed a number of strong concordances with serotyping. From Figure <xref ref-type="fig" rid="F1">1</xref>, core genome tree was able to categorize <italic>S</italic>. Typhimurium isolates but unable to ultimately separate the <italic>S</italic>. Stanleyville from the <italic>S</italic>. Gallinarum with high resolution. In most cases, the genomes within a cluster belonged to the same serovar, as for <italic>S</italic>. Typhi or Enteritidis. Although some isolates were mixed up with each other, for example, it was hard to differentiate <italic>S</italic>. Give, <italic>S</italic>. Schwarzengrund, and <italic>S</italic>. Bredney, the core genome tree generally separated the serovars from each other.</p>
</sec>
<sec>
<title>The monophasic and biphasic strains were less grouped into the same cluster on core genome tree</title>
<p>The prevalence of phase I and phase II related flagelllin gene locus of the 214 genomes was also indicated in Figure <xref ref-type="fig" rid="F1">1</xref>. In total, <italic>fliC</italic> gene cluster was found in 207 genomes (96.7%) of 91 serovars (97.8%), including subspecies, while <italic>fljB</italic> gene cluster was found in 138 genomes (64.5%) of 61 serovars (65.6%). The detailed list (Table <xref ref-type="supplementary-material" rid="SM2">S2</xref>) was dominated by <italic>fliC</italic> and <italic>fljB</italic> genes and included their flanked genes and <italic>hin</italic>. Some well-studied serovars (Enteritidis, Dublin, Typhi etc.) are monophasic the same as the previous study (Selander et al., <xref ref-type="bibr" rid="B40">1992</xref>; Baker et al., <xref ref-type="bibr" rid="B3">2007</xref>). Nevertheless, the <italic>fliC</italic> gene was not found present in the genomes of strain <italic>S</italic>. Poona 2010K-2244, <italic>S</italic>. Heidelberg B182, <italic>S</italic>. Abortusovis SS44, <italic>S. enterica</italic> subsp. <italic>salamae</italic> 3588/07, <italic>S</italic>. Bovismorbificans A1608, <italic>S</italic>. Hadar ABBSB1020-2, and <italic>S</italic>. London CFSAN001081, of which only a draft genome was obtained for these strains. Duplicated annotations and misprediction of some genes that span contig boundaries may occur in draft genomes. The homologs (Table <xref ref-type="supplementary-material" rid="SM2">S2</xref>) were able to mitigate but not eliminate these limitations. We further checked the contigs and related annotation of seven genomes of <italic>fliC</italic> gene cluster absent, and found some residue of the gene cluster at some contigs&#x00027; ends in each genome. On the other hand, there was no report on the absence of <italic>fliC</italic> in these serovars, for example, <italic>fliC</italic> should be present in all <italic>S</italic>. Heidelberg (Macnab, <xref ref-type="bibr" rid="B23">1992</xref>). Therefore, the absence of <italic>fliC</italic> gene in these genomes should be a result of misassembly or the related fragment not fully sequenced.</p>
<p>Interestingly, the monophase and biphase being linked to the evolution of <italic>S. enterica</italic> subsp. <italic>enterica</italic> showed discrete phylogenetic clusters on the core genome tree (Figure <xref ref-type="fig" rid="F1">1</xref>). A few serovars with identical phase, clustered together in the core genome tree. In one larger monophasic clade, with 25 genomes, containing <italic>S</italic>. Dublin, Gallinarum, Pullorum, Stanleyville, Blegdam, Enteritidis, and Berta, all the genomes of each serovar were located together in one distinct cluster in the core genome tree. For the other monophasic serovars, the 45 genomes were identified at different locations, of which another larger clade of 12 genomes, containing <italic>S. enterica</italic> subsp. <italic>enterica</italic> serovars Cubana, Rissen, Tennessee, Senftenberg, and Albany were located adjacent to Derby, Havana, and Agona, but were disrupted by Mbandaka and Kentucky of biphase.</p>
</sec>
<sec>
<title>Evolutionary congruence and phylogenetic analysis implied two distinct evolution patterns of the flagellin gene clusters</title>
<p>Evolutionary analysis of flagellin gene <italic>fliC</italic> and <italic>fljB</italic> in <italic>Salmonella</italic> has been performed previously (Mortimer et al., <xref ref-type="bibr" rid="B32">2007</xref>). However, the co-evolution of these two genes and their flanking genes has never been studied. To evaluate evolutionary congruence between and within the <italic>fliC</italic> and <italic>fljB</italic> clusters, ML analysis was carried out, and the results were summarized in Table <xref ref-type="table" rid="T1">1</xref>. As is shown, none of the genes in <italic>fliC</italic> gene cluster were congruent to all the genes in <italic>fljB</italic> gene cluster, and none of the genes in <italic>fljB</italic> gene cluster were congruent to all the genes in <italic>fliC</italic> gene cluster. The gene trees with the largest number of congruencies were those of <italic>fliC</italic> and <italic>hin</italic>, which were congruent to four other gene trees. More concretely, <italic>fliC</italic> gene tree was similar to those of <italic>fliB, fliD, fljA</italic>, and <italic>fljB</italic> gene, and <italic>hin</italic> gene tree was similar to those of <italic>fliB, fliD, fljA</italic>, and <italic>fljB</italic> gene. As for <italic>fliB</italic> (equaling to <italic>fljA</italic> and <italic>fljB</italic> gene trees) and <italic>fljB</italic> (equaling to <italic>fliB</italic> and <italic>fljA</italic> trees), they were congruent to two gene trees. And <italic>fliA</italic> and <italic>fliD</italic> trees were congruent to <italic>fljA</italic> and <italic>fliB</italic> respectively. Overall, only 29.2% (14/48) of the flagellar gene tree comparisons were congruent among these <italic>S</italic>. <italic>enterica</italic> strains. As for the <italic>fliC</italic> and its neighbored genes, <italic>fliB</italic> and <italic>fliD</italic>, they were closely related, while for <italic>fljA</italic> and <italic>fljB</italic> genes, their relationship was more distant. Moreover, there was generally higher level of congruence between <italic>fliC</italic> and <italic>fljB</italic>/<italic>fliD</italic> genes than between <italic>fliA</italic> and <italic>fliB</italic> genes and between <italic>fliD</italic> and <italic>fliS</italic> genes. However, correlation calculations were not available between <italic>fliC</italic> and <italic>fliS</italic> genes, because approximation limit dynamically readjusted too low.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>ML analysis for congruence between each gene tree of the flagella genes analyzed in this study.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th/>
<th valign="top" align="center"><bold>&#x003B1;<xref ref-type="table-fn" rid="TN1"><sup>a</sup></xref></bold></th>
<th valign="top" align="center"><bold>Ti/Tv ratio<xref ref-type="table-fn" rid="TN2"><sup>b</sup></xref></bold></th>
<th valign="top" align="center"><bold>-lnL</bold></th>
<th valign="top" align="center"><bold>&#x00394;-lnL (99th)<xref ref-type="table-fn" rid="TN3"><sup>c</sup></xref></bold></th>
<th valign="top" align="center" style="border-bottom: thin solid #000000;"><bold><italic>fliA</italic></bold></th>
<th valign="top" align="center" style="border-bottom: thin solid #000000;"><bold><italic>fliB</italic></bold></th>
<th valign="top" align="center" style="border-bottom: thin solid #000000;"><bold><italic>fliC</italic></bold></th>
<th valign="top" align="center" style="border-bottom: thin solid #000000;"><bold><italic>fliD</italic></bold></th>
<th valign="top" align="center" style="border-bottom: thin solid #000000;"><bold><italic>fliS</italic></bold></th>
<th valign="top" align="center" style="border-bottom: thin solid #000000;"><bold><italic>fljA</italic></bold></th>
<th valign="top" align="center" style="border-bottom: thin solid #000000;"><bold><italic>fljB</italic></bold></th>
<th valign="top" align="center" style="border-bottom: thin solid #000000;"><bold><italic>hin</italic></bold></th>
</tr>
<tr>
<th/>
<th/>
<th/>
<th/>
<th/>
<th valign="top" align="center" colspan="8">&#x00394;<bold>-lnL (99 percentile)</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><italic>fliA</italic></td>
<td valign="top" align="center">0.02</td>
<td valign="top" align="center">7.59</td>
<td valign="top" align="center">1,442.96</td>
<td valign="top" align="center">1,333.38</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">1,446.74</td>
<td valign="top" align="center">1,855.03</td>
<td valign="top" align="center">1,588.38</td>
<td valign="top" align="center">1,336.41</td>
<td valign="top" align="center"><bold>1,332.98</bold></td>
<td valign="top" align="center">1,559.4</td>
<td valign="top" align="center">1,392.09</td>
</tr>
<tr>
<td valign="top" align="left"><italic>fliB</italic></td>
<td valign="top" align="center">0.2</td>
<td valign="top" align="center">3.37</td>
<td valign="top" align="center">7,850.6</td>
<td valign="top" align="center">24,309.07</td>
<td valign="top" align="center">28,488.81</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">26,254.26</td>
<td valign="top" align="center">24,838.09</td>
<td valign="top" align="center">29,386.81</td>
<td valign="top" align="center"><bold>24,142.47</bold></td>
<td valign="top" align="center"><bold>24,295.95</bold></td>
<td valign="top" align="center">27,444.31</td>
</tr>
<tr>
<td valign="top" align="left"><italic>fliC</italic></td>
<td valign="top" align="center">0.52</td>
<td valign="top" align="center">1.29</td>
<td valign="top" align="center">25,042.12</td>
<td valign="top" align="center">127,229.25</td>
<td valign="top" align="center">152,120.99</td>
<td valign="top" align="center"><bold>124,348.02</bold></td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center"><bold>125,530.74</bold></td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center"><bold>125,113.23</bold></td>
<td valign="top" align="center"><bold>124,009.55</bold></td>
<td valign="top" align="center">145,249.22</td>
</tr>
<tr>
<td valign="top" align="left"><italic>fliD</italic></td>
<td valign="top" align="center">0.29</td>
<td valign="top" align="center">3.2</td>
<td valign="top" align="center">7,535.46</td>
<td valign="top" align="center">35,561.8</td>
<td valign="top" align="center">43,616.44</td>
<td valign="top" align="center"><bold>35,031.96</bold></td>
<td valign="top" align="center">40,180.51</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">44,377.23</td>
<td valign="top" align="center">36,789.97</td>
<td valign="top" align="center">36,477.65</td>
<td valign="top" align="center">43,028.63</td>
</tr>
<tr>
<td valign="top" align="left"><italic>fliS</italic></td>
<td valign="top" align="center">0.01</td>
<td valign="top" align="center">8.72</td>
<td valign="top" align="center">877.71</td>
<td valign="top" align="center">802.22</td>
<td valign="top" align="center">815.62</td>
<td valign="top" align="center">850.98</td>
<td valign="top" align="center">1,059.74</td>
<td valign="top" align="center">902.58</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">809.76</td>
<td valign="top" align="center">900.25</td>
<td valign="top" align="center">842.57</td>
</tr>
<tr>
<td valign="top" align="left"><italic>fljA</italic></td>
<td valign="top" align="center">0.15</td>
<td valign="top" align="center">3.2</td>
<td valign="top" align="center">1,649.33</td>
<td valign="top" align="center">2,071.13</td>
<td valign="top" align="center">2,224.6</td>
<td valign="top" align="center">2,100.08</td>
<td valign="top" align="center">2,443.48</td>
<td valign="top" align="center">2,172.55</td>
<td valign="top" align="center">2,236.68</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">2,143.27</td>
<td valign="top" align="center">2,236.19</td>
</tr>
<tr>
<td valign="top" align="left"><italic>fljB</italic></td>
<td valign="top" align="center">0.25</td>
<td valign="top" align="center">2.27</td>
<td valign="top" align="center">8,965.19</td>
<td valign="top" align="center">24,268.16</td>
<td valign="top" align="center">28,402.47</td>
<td valign="top" align="center"><bold>23,710.94</bold></td>
<td valign="top" align="center">25,860.96</td>
<td valign="top" align="center">24,279.29</td>
<td valign="top" align="center">28,701.65</td>
<td valign="top" align="center"><bold>24,179.59</bold></td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">27,639.31</td>
</tr>
<tr>
<td valign="top" align="left"><italic>hin</italic></td>
<td valign="top" align="center">0.02</td>
<td valign="top" align="center">2.67</td>
<td valign="top" align="center">1,674.34</td>
<td valign="top" align="center">2,095.84</td>
<td valign="top" align="center">2,101.85</td>
<td valign="top" align="center"><bold>1,832.47</bold></td>
<td valign="top" align="center">2,264.01</td>
<td valign="top" align="center"><bold>1,947.05</bold></td>
<td valign="top" align="center">2,097.23</td>
<td valign="top" align="center"><bold>1,795.33</bold></td>
<td valign="top" align="center"><bold>1,955.46</bold></td>
<td valign="top" align="center">&#x02013;</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="TN1">
<label>a</label>
<p><italic>Nucleotide substitution rate variation between sites, with gamma distribution as the parameter</italic>.</p></fn>
<fn id="TN2">
<label>b</label>
<p><italic>Estimated transition/transversion ratio</italic>.</p></fn>
<fn id="TN3">
<label>c</label>
<p><italic>Difference in -lnL score from -lnL column (reference data) and the 99th percentile from random topology</italic>.</p></fn>
<p><italic><sup>d</sup> Difference in -lnL score from reference data to each calculated score from other genes. Trees are deemed to be congruent if &#x00394;-lnL is equal to or lower than the 99th percentile of random trees, when compared with reference data (Feil et al., <xref ref-type="bibr" rid="B11">2001</xref>; Octavia and Lan, <xref ref-type="bibr" rid="B35">2006</xref>). Bold character in the table symbolizes congruent tree between these two genes</italic>.</p>
</table-wrap-foot>
</table-wrap>
<p>Besides the relationship among the genes in the <italic>fliC</italic> and <italic>fljB</italic> gene clusters, we further studied their congruence to the core genome using linear regression analysis instead of ML analysis, because datasets of the core genome were too large to calculate -lnL score. The mutual information analysis indicated a weak correlation between each flagellin gene and core genome. The low values of both trees for each linear regression indicate a high level of incongruence between individual tree of each gene and the core genome tree, as shown in Figures <xref ref-type="fig" rid="F2">2</xref>, <xref ref-type="fig" rid="F3">3</xref>. The topology of <italic>fljB, fliC</italic>, and <italic>fliD</italic> gene trees was much distinct from that of the core genome tree, while those of the <italic>fliA</italic> and <italic>fliS</italic> genes were more similar to the core genome tree than those of <italic>fljB, fliC</italic>, and <italic>fliD</italic> genes. However, the <italic>fljA</italic> tree for serovars displayed a similar topology to that of the core genome tree, while <italic>fljB</italic> did not.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>Regressions between each gene of the <italic>fliC</italic> gene clusters and core genetic distance (expected nucleotide substitutions per site) as a function of sampling the evolutionary distance for <italic>S. enterica</italic> (I). <bold>(A)</bold>, <italic>fliA</italic>; <bold>(B)</bold>, <italic>fliB</italic>; <bold>(C)</bold>, <italic>fliC</italic>; <bold>(D)</bold>, <italic>fliD</italic>; and <bold>(E)</bold>, <italic>fliS</italic>. Each dot corresponds to a pair of sampled genomes, and the linear regression is shown as a solid line using least squares; the <italic>R</italic><sup>2</sup> coefficients are also shown.</p></caption>
<graphic xlink:href="fmicb-08-02604-g0002.tif"/>
</fig>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p>Regressions between each gene of the <italic>fljB</italic> gene clusters and core genetic distance (expected nucleotide substitutions per site) as a function of sampling the evolutionary distance for <italic>S. enterica</italic> (I). <bold>(A)</bold>, <italic>fljA</italic>; and <bold>(B)</bold>, <italic>fljB</italic>. Each dot corresponds to a pair of sampled genomes, and the linear regression is shown as a solid line using least squares; the <italic>R</italic><sup>2</sup> coefficients are also shown.</p></caption>
<graphic xlink:href="fmicb-08-02604-g0003.tif"/>
</fig>
<p>To infer evolutionary relationships among different serovars, based upon similarities and differences in their genetic characteristics, we constructed phylogenetic trees of <italic>fliA, fliB, fliC, fliD, fliS</italic>, and <italic>fljA, fljB, hin</italic>, and <italic>fliC</italic>/<italic>fljB</italic> (Figure <xref ref-type="fig" rid="F4">4</xref>). Diagrammatic representation of <italic>fliB, fliC</italic>, and <italic>fliD</italic> indicated bifurcation to form two distinct lineages, where other <italic>Salmonella enteric</italic> subspecies joined together with subspecies enteric serovars. The other four subspecies could be easily recognized from <italic>fliA</italic> and <italic>fliS</italic> gene tree because of their distant branches, which was more consistent with core genome tree. Subspecies <italic>salamae</italic> was close to three neighbors, serovar Weltervreden, Sloterdijk and Kentucky, in <italic>fljB</italic> tree, while these serovars were scattered to different branches in <italic>fljA</italic> tree. The results of phylogenetic trees were consistent to the congruence analysis (Table <xref ref-type="table" rid="T1">1</xref>, Figure <xref ref-type="fig" rid="F2">2</xref>, <xref ref-type="fig" rid="F3">3</xref>). Dramatically, the individual gene trees of <italic>fliB, fliC</italic> (<italic>fliC</italic> of <italic>S</italic>. Typhi was an exception) and <italic>fliD</italic> were consistent with a clustering dividing monophasic and biphasic serovars. This seemed to imply two distinct evolutionary paths of <italic>fliC</italic>/<italic>fljB</italic> corresponding to monophasic and biphasic serovars.</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p>Phylogenetic trees of <italic>Salmonella</italic> flagella related genes. Neighbor-joining phylogenetic trees of flagella related genes are based on the nucleotide sequences: <italic>fliA</italic> <bold>(A)</bold>, <italic>fliB</italic> <bold>(B)</bold>, <italic>fliC</italic> <bold>(C)</bold>, <italic>fliD</italic> <bold>(D)</bold>, <italic>fliS</italic> <bold>(E)</bold>, <italic>fljA</italic> <bold>(F)</bold>, <italic>fljB</italic> <bold>(G)</bold>, <italic>hin</italic> <bold>(H)</bold> and <italic>fliC/fljB</italic> <bold>(I)</bold>. Different color dot implies the monophasic or biphasic serovars.</p></caption>
<graphic xlink:href="fmicb-08-02604-g0004.tif"/>
</fig>
</sec>
<sec>
<title>Phylogeny of <italic>fliC</italic> and <italic>fljB</italic> showed complex relationships</title>
<p>Because different genes of the flagellin diverge under different evolutionary pressures, we manually curated the phylogenetic pattern of genes in flagellin gene clusters. For all flagellin and their flanked genes, the individual gene trees (Figure <xref ref-type="fig" rid="F4">4</xref>) can be used to demonstrate the evolutionary story on individual gene basis. To study the genetic evolution of the 207 <italic>fliC</italic> and 138 <italic>fljB</italic> allelic variants, phylogenetic tree based on nucleotide sequences was constructed (Figure <xref ref-type="fig" rid="F4">4I</xref>). There is a clear separation of monophasic <italic>Salmonella</italic> groups from the biphasic groups based on the <italic>fliC</italic> gene, while all the <italic>fljB</italic> alleles clustered with the biphasic <italic>fliC</italic> alleles. This may either reflect that monophasic were acting as incipient serotypes with a barrier to recombination with biphasic serotypes, or indicate that there were insufficient incidents in which recombination could have taken place between monophasic and biphasic serotypes. However, <italic>S</italic>. Typhi was not clustered into other monophasic genomes as one branch together. Similarly, previous study had also identified that monophasic <italic>S</italic>. Typhimurium existed, of which the serotype was always biphasic (Laorden et al., <xref ref-type="bibr" rid="B21">2010</xref>).</p>
<p>Furthermore, all biphasic <italic>fliC</italic> and <italic>fljB</italic> alleles were analyzed based on the amino acid sequences (Figure <xref ref-type="supplementary-material" rid="SM3">S1</xref>). The genetic distance between <italic>fliC</italic> gene and <italic>fljB</italic> gene sequences showed no relationship within/among serovars, which was reflected in the phylogenetic tree. For example, serovar Schwarzengrund, Liverpool, and Livingstone were closely related based on <italic>fliC</italic> gene while they were distant based on <italic>fljB</italic>. <italic>S</italic>. Heidelberg and <italic>S</italic>. Newport were clustered together according to the <italic>fljB</italic> gene, while they were clustered separately according to the <italic>fliC</italic> gene. It suggested that the evolutionary histories were independent between <italic>fliC</italic> and <italic>fljB</italic> genes and frequent homologous recombination referring <italic>fliC</italic> and <italic>fljB</italic> occurred between biphasic serovars. The phylogenetic tree of 5&#x02032; and 3&#x02032; conserved segment of <italic>fliC</italic> and <italic>fljB</italic> genes was also constructed (data not shown), showing the similar division.</p>
</sec>
<sec>
<title>Less recombination events were found between monophasic and biphasic serovars</title>
<p>Recombination is a significant source of genetic diversity in <italic>Salmonella</italic> (Silverman et al., <xref ref-type="bibr" rid="B41">1979</xref>). This phenomenon, which occurs during co-infection with different strains, is usually observed at the <italic>fliC</italic> and its flanked genes. It is apparent that there were different levels of recombination within flagellar genes of <italic>S. enterica</italic>. Comparison of recombination of monophasic genes with biphasic genes, which represents the unique ability to change flagellar composition, by both ML (Table <xref ref-type="table" rid="T1">1</xref>) and recombination detective analysis (Figure <xref ref-type="fig" rid="F5">5</xref>), showed that recombination occured far more frequently within monophasic or biphasic gene clusters than between them. For example, the serovar Heidelberg, whose major parent (a sequence closely related to that from which the greater part of the recombinant&#x00027;s sequence may have been derived, Martin et al., <xref ref-type="bibr" rid="B25">2015</xref>) was Virchow, and minor parent (a sequence closely related to that from which sequences in the proposed recombinant region may have been derived, Martin et al., <xref ref-type="bibr" rid="B25">2015</xref>) was Newport, did not recombine to any monophasic genes. It was identical to Hartford, whose major parent was Pomona, and minor parent was Anatum. Even though it was found that Senftenberg (monophase) was minor parent for Newport (biphase), and other similar recombinant events of referring small segments were also found, recombination was believed mainly to have occurred within the same phase. Moreover, there were lots of breakpoints and the breakpoints tended to cluster far more around the edges of genes than they did within the central parts of genes (Figure <xref ref-type="fig" rid="F5">5</xref>).</p>
<fig id="F5" position="float">
<label>Figure 5</label>
<caption><p>Recombination events in <italic>Salmonella fliBCD</italic> gene cluster. The recombination properties of flagella genes are indicated as different colors on the right, indicating positions of potential recombination events. Piece of sequence from major parent refer to the top of rectangles. Piece of sequence from minor parent is a graphical representation of a sequence fragment that has potentially been derived through recombination from a sequence resembling the next two bars of the rectangle. On the left, a phylogenetic tree of the phase I alleles based on concatenated <italic>fliBCD</italic> nucleotide sequences was constructed by using the maximum-likelihood method with a bootstrap value of 1,000.</p></caption>
<graphic xlink:href="fmicb-08-02604-g0005.tif"/>
</fig>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>With the cost of whole genome sequencing (WGS) decreasing dramatically (Didelot et al., <xref ref-type="bibr" rid="B9">2012</xref>) and the technology becoming available for routine use, WGS is very useful and practical in various bacterial infectious studies including diagnostic and public health microbiology (Koser et al., <xref ref-type="bibr" rid="B19">2012</xref>). Many <italic>Salmonella</italic> serovars are human-specific, which may cause gastroenteritis, a self-limiting illness. A few <italic>Salmonella</italic> serovars, especially <italic>S</italic>. Typhi, may elicit typhoid-like diseases and serious systemic infections (Parkhill et al., <xref ref-type="bibr" rid="B38">2001</xref>; Crump and Mintz, <xref ref-type="bibr" rid="B8">2010</xref>). Recent studies support the phylogenetic tree based on SNPs as an outbreak surveillance tool (Allard et al., <xref ref-type="bibr" rid="B2">2012</xref>; Hoffmann et al., <xref ref-type="bibr" rid="B15">2013</xref>; Zhou et al., <xref ref-type="bibr" rid="B51">2013</xref>). The performance of these 214 genomes based on SNPs was validated on the phylogenetic analysis. The fractions of the core genome tree (Figure <xref ref-type="fig" rid="F1">1</xref>) supported the partition of the taxa in previous study and reflected robust evolutionary relationship of <italic>Salmonella</italic> serovars (Chan et al., <xref ref-type="bibr" rid="B7">2003</xref>; Jacobsen et al., <xref ref-type="bibr" rid="B18">2011</xref>; Zou et al., <xref ref-type="bibr" rid="B53">2014</xref>). Moreover, the rich information of <italic>Salmonella</italic> subspecies <italic>enterica</italic>, covering 87 serovars, provided a framework for determination of the evolutionary relationships of the flagellin alleles to assess the co-evolution and recombination events.</p>
<p>Flagellin genes are considered as a dominant antigen of the adaptive immune response (Haiko and Westerlund-Wikstrom, <xref ref-type="bibr" rid="B14">2013</xref>). The <italic>fliC</italic> gene is present in most of <italic>Salmonella</italic> isolates, and has a homologous locus in <italic>Escherichia coli</italic> (Macnab, <xref ref-type="bibr" rid="B23">1992</xref>). Selection within specific niches might have an impact in antigenic variation (McQuiston et al., <xref ref-type="bibr" rid="B28">2008</xref>). For adapting to environmental changes, <italic>Salmonella</italic> adjust their expression, introducing a second set of antigens, to survive in the new niches. The complex branching division of diphasic serotypes, compared to monophasic serotypes, is suggestive of an expanded mutation rate or recombination in the flagellin when <italic>Salmonella</italic> acquired the second flagellin gene. Perhaps monophasic serotypes obtain enough antigenic diversity from SNPs, letting these serotypes to be successful as monophasic strains (Mortimer et al., <xref ref-type="bibr" rid="B32">2007</xref>). However, the reason why <italic>S</italic>. Typhi is a monophasic serotype and cluster with biphasic serotypes in the phylogenetic trees of <italic>fliBCD</italic> is still unclear. We hypothesize that <italic>S</italic>. Typhi had a trend to be diphasic, and the substitute of <italic>fljB</italic> was present in the history but got lost during speciation.</p>
<p>As an important virulence factor in pathogenic bacteria, especially in <italic>Salmonella</italic> genus, flagellin might regulate their virulence and pathogenesis (Stecher et al., <xref ref-type="bibr" rid="B44">2004</xref>). For example, it is beneficial to immunologic escape by turning on/off the expression of genes related to these structures, niche adaptation by expressing the second flagellin and avoiding predation by different motility or antigenic expression. All of these selective advantages are importantly involved in evolutionary step of having the second flagellin. The gain or loss of their unique ability to encode the major flagellin protein by switching expression can be discussed by an evolutionary model. Based on the phylogenetic tree of the <italic>fliC</italic> and <italic>fljB</italic> genes in the broad serovars involved, it is supposed that the phylogeny of flagellin genes illustrated a strong serovar-specific lineage, with strains of the same serovar clustering either together or in a few distinct branches (Figure <xref ref-type="fig" rid="F1">1</xref>), as described before (Yue et al., <xref ref-type="bibr" rid="B50">2015</xref>). Clustering of the <italic>fliC</italic> and <italic>fljB</italic> genes showed that <italic>fljB</italic> arose from <italic>fliC</italic> gene duplication (Mortimer et al., <xref ref-type="bibr" rid="B32">2007</xref>), which have highly conserved 5&#x02032; and 3&#x02032; ends and highly variable center regions (Masten and Joys, <xref ref-type="bibr" rid="B26">1993</xref>; McQuiston et al., <xref ref-type="bibr" rid="B29">2004</xref>). Most studies of flagellin genes focus either on its evolutionary relationships or as a dominant antigen of the sequence diversity (Mortimer et al., <xref ref-type="bibr" rid="B32">2007</xref>) but do not undertake subsequent congruence analysis of their evolutionary process. The systematic approach, ML analysis and regression analysis (Table <xref ref-type="table" rid="T1">1</xref>, Figures <xref ref-type="fig" rid="F2">2</xref>, <xref ref-type="fig" rid="F3">3</xref>), determined that <italic>fliB, fliC</italic>, and <italic>fliD</italic> gene trees were evolutionarily congruent to each other but not to the core genome. In the regression analysis of <italic>fliA</italic> and <italic>fliS</italic> against the core genome, low levels of correlation were found (low <italic>R</italic><sup>2</sup> values). The <italic>fliA</italic> and <italic>fliS</italic> genes were highly conserved within subspecies <italic>enterica</italic> with limited variation (Figure <xref ref-type="fig" rid="F2">2</xref>), which will disturb the regression analysis, while their phylogenetic trees indicated a clear classification as that of core genome, especially other subspecies (Figure <xref ref-type="fig" rid="F4">4</xref>). These results suggested that <italic>fliA</italic> and <italic>fliS</italic> evolved congruently with the core genome. Notably, homologous recombination of <italic>fliC</italic> and flanked genes occurred frequently within monophasic and diphasic serovars (Figure <xref ref-type="fig" rid="F5">5</xref>). These results suggested that the level of clonality of flagellar genes within subspecies I was low.</p>
<p>Dramatically, the phylogenetic trees of <italic>fliB, fliC</italic>, and <italic>fliD</italic> revealed a similar division of the serovars into two distinct groups, corresponding to monophasic and biphasic serovars, respectively. This implied two distinct evolutionary patterns of <italic>Salmonella fliC</italic>/<italic>fljB</italic> gene cluster. This division was not pointed out previously, but similar phylogenetic structures of <italic>fliC</italic>/<italic>fljB</italic> were reported (Smith et al., <xref ref-type="bibr" rid="B42">1990</xref>; Mortimer et al., <xref ref-type="bibr" rid="B32">2007</xref>). It was divided into Non-G antigens and G antigens based on <italic>fliC</italic><sub>g, m, s</sub> encoding three factors. Through the phylogenetic tree in Mortimer (Mortimer et al., <xref ref-type="bibr" rid="B32">2007</xref>) study, Non-G group and G group were clustered into different branches, which equaled to biphasic and monophasic groups respectively. According to these results, a flow diagram was constructed to demonstrate the evolutionary path of the monophasic and biphasic gene clusters (Figure <xref ref-type="fig" rid="F6">6</xref>). The <italic>Salmonella</italic> ancestor should just harbor a <italic>fliC</italic> gene cluster as most of <italic>E. coli</italic> isolates (McQuiston et al., <xref ref-type="bibr" rid="B28">2008</xref>). Subsequently, some lineages suffered <italic>fliC</italic> duplication to assemble an <italic>fljB</italic> gene cluster under certain natural selection conditions (biphasic), while the others not (monophasic). Both monophasic and biphasic lineages evolved and adapted to different environments and occupied diverse ecological niches, resulting in H antigen (serovar) diversification. Ancestors of some biphasic serovars might have lost <italic>fljB</italic> gene cluster under natural selection because it was useless, becoming monophasic again (e g., <italic>S</italic>. Typhi and some <italic>S</italic>. Typhimurium isolates). Frequent homologous recombination occurred within monophasic and biphasic serovars. Finally, the current diversity of <italic>Salmonella</italic> was formed. This theory assumed that <italic>fljB</italic> gene cluster should perform a role together with <italic>fliC</italic> cluster in most biphasic serovars, but present knowledge suggested that just one of them was expressed in a cell (Imre et al., <xref ref-type="bibr" rid="B17">2005</xref>). Therefore, the role of <italic>fljB</italic> needs to be further determined to clarify the implication of the two distinct evolutionary patterns.</p>
<fig id="F6" position="float">
<label>Figure 6</label>
<caption><p>Schema diagram showing the two distinct evolution patterns of <italic>Salmonella</italic> fagellin gene clusters. Homologous recombination (represented by black arrows) rarely occurred between monophasic and biphasic serovars, but frequently occurred within the same phase. Some serovars and some isolates of certain serovars should have been biphasic historically but become monophasic because of gene loss of <italic>fljB</italic> gene cluster, such as serovar Typhi and Typhimurium.</p></caption>
<graphic xlink:href="fmicb-08-02604-g0006.tif"/>
</fig>
<p>Collectively, we provide the viewpoint that flagellin genes of <italic>S. enterica</italic> subsp. <italic>enterica</italic> represents two evolutionary patterns with a barrier between monophasic and biphasic serovars. The evolutionary barrier may inhibit an overlapping recombination. However, it is still unclear what the implication of the barriers on the immunological escape, niche adaptation, and avoiding predation is. That is an interesting area for further study. Besides, different serovar genome repertoires were included, suggesting that the core genome evolution of <italic>S. enterica</italic> was accompanied by <italic>fliA</italic>/<italic>fliS</italic> but distinct from those of <italic>fliBCD</italic> and <italic>fljB</italic> gene cluster. This study will provide a robust genetic framework for future studies on the evolution of the phase I and phase II flagellar characteristics in <italic>S. enterica</italic>.</p>
</sec>
<sec id="s5">
<title>Author contributions</title>
<p>YL and DFZ designed and initiated the study, interpreted the results, and wrote the manuscript; YL downloaded and analyzed the data; LZ, XZ, LX, and XS contributed to discussion of the results and improvement of the manuscript and designed the outline of this study and manuscript and provided laboratory equipment and space.</p>
<sec>
<title>Conflict of interest statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</sec>
</body>
<back>
<ack><p>This work was supported by the National Natural Science Foundation of China [No. 31230058] and the National Key R&#x00026;D program of China [No. 2016YFE0106100]. The authors would like to thank Dr. Harold Corke, Dr. Hongyu Ou, Dr. Xiangyu Deng, and Dr. Arun Bhunia for their useful suggestions on this manuscript. In addition, we want to thank Mr. Ben Jia and Mr. Lincheng Shi who kindly provided help in custom script coding for this work.</p>
</ack>
<sec sec-type="supplementary-material" id="s6">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fmicb.2017.02604/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fmicb.2017.02604/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table1.XLSX" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Table S1</label>
<caption><p>Complete list of all <italic>Salmonella</italic> genome sequences. The 214 genomes included 205 <italic>Salmonella enterica</italic> subsp. <italic>enterica</italic> and nine genomes of four other subspecies, <italic>arizonae, diarizonae, houtenae</italic>, and <italic>salamae</italic>.</p></caption></supplementary-material>
<supplementary-material xlink:href="Table2.XLSX" id="SM2" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Table S2</label>
<caption><p>Presence/absence of flagellin genes of all analyzed <italic>Salmonella</italic>. The <italic>fliA, fliB, fliC, fliD</italic>, and <italic>fliS</italic> for <italic>fliC</italic> clusters, and <italic>fljA, fljB</italic>, and <italic>hin</italic> for <italic>fljB</italic> clusters were analyzed. a &#x02013;, absence of the gene; gene name, the locus tag in the genome.</p></caption></supplementary-material>
<supplementary-material xlink:href="Image1.JPEG" id="SM3" mimetype="image/jpeg" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Figure S1</label>
<caption><p>The genetic relationships of biphasic <italic>fliC</italic> and <italic>fljB</italic> genes. Maximum Likelihood phylogenetic tree shows the genetic relationships of biphasic <italic>fliC</italic> and <italic>fljB</italic> genes. It shows the details of the biphasic lineage in Figure <xref ref-type="fig" rid="F4">4I</xref>. The <italic>fliC</italic> and <italic>fljB</italic> genes were shown in green and blue characters, respectively. Each square represented a group of alleles with identical amino acid (83 sequence types in total). The <italic>fliC</italic> and <italic>fljB</italic> genes from the same genomes were linked by solid lines.</p></caption></supplementary-material>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Achtman</surname> <given-names>M.</given-names></name> <name><surname>Wain</surname> <given-names>J.</given-names></name> <name><surname>Weill</surname> <given-names>F. X.</given-names></name> <name><surname>Nair</surname> <given-names>S.</given-names></name> <name><surname>Zhou</surname> <given-names>Z.</given-names></name> <name><surname>Sangal</surname> <given-names>V.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Multilocus sequence typing as a replacement for serotyping in <italic>Salmonella enterica</italic></article-title>. <source>PLoS Pathog.</source> <volume>8</volume>:<fpage>e1002776</fpage>. <pub-id pub-id-type="doi">10.1371/journal.ppat.1002776</pub-id><pub-id pub-id-type="pmid">22737074</pub-id></citation></ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Allard</surname> <given-names>M. W.</given-names></name> <name><surname>Luo</surname> <given-names>Y.</given-names></name> <name><surname>Strain</surname> <given-names>E.</given-names></name> <name><surname>Li</surname> <given-names>C.</given-names></name> <name><surname>Keys</surname> <given-names>C. E.</given-names></name> <name><surname>Son</surname> <given-names>I.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>High resolution clustering of <italic>Salmonella enterica</italic> serovar Montevideo strains using a next-generation sequencing approach</article-title>. <source>BMC Genomics</source> <volume>13</volume>:<fpage>32</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-13-32</pub-id><pub-id pub-id-type="pmid">22260654</pub-id></citation></ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baker</surname> <given-names>S.</given-names></name> <name><surname>Hardy</surname> <given-names>J.</given-names></name> <name><surname>Sanderson</surname> <given-names>K. E.</given-names></name> <name><surname>Quail</surname> <given-names>M.</given-names></name> <name><surname>Goodhead</surname> <given-names>I.</given-names></name> <name><surname>Kingsley</surname> <given-names>R. A.</given-names></name> <etal/></person-group>. (<year>2007</year>). <article-title>A novel linear plasmid mediates flagellar variation in <italic>Salmonella</italic> Typhi</article-title>. <source>PLoS Pathog.</source> <volume>3</volume>:<fpage>e59</fpage>. <pub-id pub-id-type="doi">10.1371/journal.ppat.0030059</pub-id><pub-id pub-id-type="pmid">17500588</pub-id></citation></ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bonifield</surname> <given-names>H. R.</given-names></name> <name><surname>Hughes</surname> <given-names>K. T.</given-names></name></person-group> (<year>2003</year>). <article-title>Flagellar phase variation in <italic>Salmonella enterica</italic> is mediated by a posttranscriptional control mechanism</article-title>. <source>J. Bacteriol.</source> <volume>185</volume>, <fpage>3567</fpage>&#x02013;<lpage>3574</lpage>. <pub-id pub-id-type="doi">10.1128/JB.185.12.3567-3574.2003</pub-id><pub-id pub-id-type="pmid">12775694</pub-id></citation></ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brenner</surname> <given-names>F. W.</given-names></name> <name><surname>Villar</surname> <given-names>R. G.</given-names></name> <name><surname>Angulo</surname> <given-names>F. J.</given-names></name> <name><surname>Tauxe</surname> <given-names>R.</given-names></name> <name><surname>Swaminathan</surname> <given-names>B.</given-names></name></person-group> (<year>2000</year>). <article-title><italic>Salmonella</italic> nomenclature</article-title>. <source>J. Clin. Microbiol.</source> <volume>38</volume>, <fpage>2465</fpage>&#x02013;<lpage>2467</lpage>. <pub-id pub-id-type="pmid">10878026</pub-id></citation></ref>
<ref id="B6">
<citation citation-type="book"><person-group person-group-type="author"><collab>Centers for Disease Control and Prevention (CDC)</collab></person-group> (<year>2011</year>). <source>National Salmonella Surveillance Annual Summary 2009</source>. <publisher-loc>Atlanta, GA</publisher-loc>: <publisher-name>US Department of Health and Human Services, CDC</publisher-name>.</citation></ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chan</surname> <given-names>K.</given-names></name> <name><surname>Baker</surname> <given-names>S.</given-names></name> <name><surname>Kim</surname> <given-names>C. C.</given-names></name> <name><surname>Detweiler</surname> <given-names>C. S.</given-names></name> <name><surname>Dougan</surname> <given-names>G.</given-names></name> <name><surname>Falkow</surname> <given-names>S.</given-names></name></person-group> (<year>2003</year>). <article-title>Genomic comparison of <italic>Salmonella enterica</italic> serovars and <italic>Salmonella bongori</italic> by use of an <italic>S. enterica</italic> serovar typhimurium DNA microarray</article-title>. <source>J. Bacteriol.</source> <volume>185</volume>, <fpage>553</fpage>&#x02013;<lpage>563</lpage>. <pub-id pub-id-type="doi">10.1128/JB.185.2.553-563.2003</pub-id><pub-id pub-id-type="pmid">12511502</pub-id></citation></ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Crump</surname> <given-names>J. A.</given-names></name> <name><surname>Mintz</surname> <given-names>E. D.</given-names></name></person-group> (<year>2010</year>). <article-title>Global trends in typhoid and paratyphoid Fever</article-title>. <source>Clin. Infect. Dis.</source> <volume>50</volume>, <fpage>241</fpage>&#x02013;<lpage>246</lpage>. <pub-id pub-id-type="doi">10.1086/649541</pub-id><pub-id pub-id-type="pmid">20014951</pub-id></citation></ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Didelot</surname> <given-names>X.</given-names></name> <name><surname>Bowden</surname> <given-names>R.</given-names></name> <name><surname>Wilson</surname> <given-names>D. J.</given-names></name> <name><surname>Peto</surname> <given-names>T. E.</given-names></name> <name><surname>Crook</surname> <given-names>D. W.</given-names></name></person-group> (<year>2012</year>). <article-title>Transforming clinical microbiology with bacterial genome sequencing</article-title>. <source>Nat. Rev. Genet.</source> <volume>13</volume>, <fpage>601</fpage>&#x02013;<lpage>612</lpage>. <pub-id pub-id-type="doi">10.1038/nrg3226</pub-id><pub-id pub-id-type="pmid">22868263</pub-id></citation></ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Enright</surname> <given-names>A. J.</given-names></name> <name><surname>Van Dongen</surname> <given-names>S.</given-names></name> <name><surname>Ouzounis</surname> <given-names>C. A.</given-names></name></person-group> (<year>2002</year>). <article-title>An efficient algorithm for large-scale detection of protein families</article-title>. <source>Nucleic Acids Res.</source> <volume>30</volume>, <fpage>1575</fpage>&#x02013;<lpage>1584</lpage>. <pub-id pub-id-type="doi">10.1093/nar/30.7.1575</pub-id><pub-id pub-id-type="pmid">11917018</pub-id></citation></ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Feil</surname> <given-names>E. J.</given-names></name> <name><surname>Holmes</surname> <given-names>E. C.</given-names></name> <name><surname>Bessen</surname> <given-names>D. E.</given-names></name> <name><surname>Chan</surname> <given-names>M. S.</given-names></name> <name><surname>Day</surname> <given-names>N. P. J.</given-names></name> <name><surname>Enright</surname> <given-names>M. C.</given-names></name> <etal/></person-group>. (<year>2001</year>). <article-title>Recombination within natural populations of pathogenic bacteria: short-term empirical estimates and long-term phylogenetic consequences</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>98</volume>, <fpage>182</fpage>&#x02013;<lpage>187</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.98.1.182</pub-id><pub-id pub-id-type="pmid">11136255</pub-id></citation></ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Filliol</surname> <given-names>I.</given-names></name> <name><surname>Motiwala</surname> <given-names>A. S.</given-names></name> <name><surname>Cavatore</surname> <given-names>M.</given-names></name> <name><surname>Qi</surname> <given-names>W.</given-names></name> <name><surname>Hazb&#x000F3;n</surname> <given-names>M. H.</given-names></name> <name><surname>Bobadilla del Valle</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2006</year>). <article-title>Global phylogeny of <italic>Mycobacterium tuberculosis</italic> based on single nucleotide polymorphism (SNP) analysis: insights into tuberculosis evolution, phylogenetic accuracy of other DNA fingerprinting systems, and recommendations for a minimal standard SNP set</article-title>. <source>J. Bacteriol.</source> <volume>188</volume>, <fpage>759</fpage>&#x02013;<lpage>772</lpage>. <pub-id pub-id-type="doi">10.1128/JB.188.2.759-772.2006</pub-id><pub-id pub-id-type="pmid">16385065</pub-id></citation></ref>
<ref id="B13">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Grimont</surname> <given-names>P. A. D.</given-names></name> <name><surname>Weill</surname> <given-names>F. X.</given-names></name></person-group> (<year>2007</year>). <source>Antigenic Formulae of the Salmonella serovars, 9th Edn.</source> <publisher-loc>Paris</publisher-loc>: <publisher-name>WHO Collaborating Centre for Reference and Research on Salmonella</publisher-name>.</citation></ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Haiko</surname> <given-names>J.</given-names></name> <name><surname>Westerlund-Wikstr&#x000F6;m</surname> <given-names>B.</given-names></name></person-group> (<year>2013</year>). <article-title>The role of the bacterial flagellum in adhesion and virulence</article-title>. <source>Biology</source> <volume>2</volume>, <fpage>1242</fpage>&#x02013;<lpage>1267</lpage>. <pub-id pub-id-type="doi">10.3390/biology2041242</pub-id><pub-id pub-id-type="pmid">24833223</pub-id></citation></ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hoffmann</surname> <given-names>M.</given-names></name> <name><surname>Luo</surname> <given-names>Y.</given-names></name> <name><surname>Lafon</surname> <given-names>P. C.</given-names></name> <name><surname>Timme</surname> <given-names>R.</given-names></name> <name><surname>Allard</surname> <given-names>M. W.</given-names></name> <name><surname>McDermott</surname> <given-names>P. F.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Genome sequences of <italic>Salmonella enterica</italic> serovar Heidelberg isolates isolated in the United States from a multistate outbreak of human <italic>Salmonella</italic> infections</article-title>. <source>Genome Announc.</source> <volume>1</volume>:<fpage>e00004</fpage>&#x02013;<lpage>12</lpage>. <pub-id pub-id-type="doi">10.1128/genomeA.00004-12</pub-id><pub-id pub-id-type="pmid">23405335</pub-id></citation></ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Holt</surname> <given-names>K. E.</given-names></name> <name><surname>Parkhill</surname> <given-names>J.</given-names></name> <name><surname>Mazzoni</surname> <given-names>C. J.</given-names></name> <name><surname>Roumagnac</surname> <given-names>P.</given-names></name> <name><surname>Weill</surname> <given-names>F. X.</given-names></name> <name><surname>Goodhead</surname> <given-names>I.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>High-throughput sequencing provides insights into genome variation and evolution in <italic>Salmonella</italic> Typhi</article-title>. <source>Nat. Genet.</source> <volume>40</volume>, <fpage>987</fpage>&#x02013;<lpage>993</lpage>. <pub-id pub-id-type="doi">10.1038/ng.195</pub-id><pub-id pub-id-type="pmid">18660809</pub-id></citation></ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Imre</surname> <given-names>A.</given-names></name> <name><surname>Olasz</surname> <given-names>F.</given-names></name> <name><surname>Nagy</surname> <given-names>B.</given-names></name></person-group> (<year>2005</year>). <article-title>Development of a PCR system for the characterisation of <italic>Salmonella</italic> flagellin genes</article-title>. <source>Acta Vet. Hung.</source> <volume>53</volume>, <fpage>163</fpage>&#x02013;<lpage>172</lpage>. <pub-id pub-id-type="doi">10.1556/AVet.53.2005.2.2</pub-id><pub-id pub-id-type="pmid">15959975</pub-id></citation></ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jacobsen</surname> <given-names>A.</given-names></name> <name><surname>Hendriksen</surname> <given-names>R. S.</given-names></name> <name><surname>Aaresturp</surname> <given-names>F. M.</given-names></name> <name><surname>Ussery</surname> <given-names>D. W.</given-names></name> <name><surname>Friis</surname> <given-names>C.</given-names></name></person-group> (<year>2011</year>). <article-title>The <italic>Salmonella enterica</italic> pan-genome</article-title>. <source>Microb. Ecol.</source> <volume>62</volume>, <fpage>487</fpage>&#x02013;<lpage>504</lpage>. <pub-id pub-id-type="doi">10.1007/s00248-011-9880-1</pub-id><pub-id pub-id-type="pmid">21643699</pub-id></citation></ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>K&#x000F6;ser</surname> <given-names>C. U.</given-names></name> <name><surname>Ellington</surname> <given-names>M. J.</given-names></name> <name><surname>Cartwright</surname> <given-names>E. J.</given-names></name> <name><surname>Gillespie</surname> <given-names>S. H.</given-names></name> <name><surname>Brown</surname> <given-names>N. M.</given-names></name> <name><surname>Farrington</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Routine use of microbial whole genome sequencing in diagnostic and public health microbiology</article-title>. <source>PLoS Pathog.</source> <volume>8</volume>:<fpage>e1002824</fpage>. <pub-id pub-id-type="doi">10.1371/journal.ppat.1002824</pub-id><pub-id pub-id-type="pmid">22876174</pub-id></citation></ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kumar</surname> <given-names>S.</given-names></name> <name><surname>Stecher</surname> <given-names>G.</given-names></name> <name><surname>Tamura</surname> <given-names>K.</given-names></name></person-group> (<year>2016</year>). <article-title>MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets</article-title>. <source>Mol. Biol. Evol.</source> <volume>33</volume>, <fpage>1870</fpage>&#x02013;<lpage>1874</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/msw054</pub-id><pub-id pub-id-type="pmid">27004904</pub-id></citation></ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Laorden</surname> <given-names>L.</given-names></name> <name><surname>Herrera-Leon</surname> <given-names>S.</given-names></name> <name><surname>Martinez</surname> <given-names>I.</given-names></name> <name><surname>Sanchez</surname> <given-names>A.</given-names></name> <name><surname>Kromidas</surname> <given-names>L.</given-names></name> <name><surname>Bikandi</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Genetic evolution of the Spanish multidrug-resistant <italic>Salmonella enterica</italic> 4,5,12:i:- monophasic variant</article-title>. <source>J. Clin. Microbiol.</source> <volume>48</volume>, <fpage>4563</fpage>&#x02013;<lpage>4566</lpage>. <pub-id pub-id-type="doi">10.1128/JCM.00337-10</pub-id><pub-id pub-id-type="pmid">20943866</pub-id></citation></ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>L.</given-names></name> <name><surname>Stoeckert</surname> <given-names>C. J.</given-names></name> <name><surname>Roos</surname> <given-names>D. S.</given-names></name></person-group> (<year>2003</year>). <article-title>OrthoMCL: identification of ortholog groups for Eukaryotic genomes</article-title>. <source>Genome Res.</source> <volume>13</volume>:<fpage>2178</fpage>. <pub-id pub-id-type="doi">10.1101/gr.1224503</pub-id><pub-id pub-id-type="pmid">12952885</pub-id></citation></ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Macnab</surname> <given-names>R. M.</given-names></name></person-group> (<year>1992</year>). <article-title>Genetics and biogenesis of bacterial flagella</article-title>. <source>Annu. Rev. Genet.</source> <volume>26</volume>, <fpage>131</fpage>&#x02013;<lpage>158</lpage>. <pub-id pub-id-type="doi">10.1146/annurev.ge.26.120192.001023</pub-id><pub-id pub-id-type="pmid">1482109</pub-id></citation></ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Majowicz</surname> <given-names>S. E.</given-names></name> <name><surname>Musto</surname> <given-names>J.</given-names></name> <name><surname>Scallan</surname> <given-names>E.</given-names></name> <name><surname>Angulo</surname> <given-names>F. J.</given-names></name> <name><surname>Kirk</surname> <given-names>M.</given-names></name> <name><surname>O&#x00027;Brien</surname> <given-names>S. J.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>The global burden of nontyphoidal <italic>Salmonella</italic> gastroenteritis</article-title>. <source>Clin. Infect. Dis.</source> <volume>50</volume>, <fpage>882</fpage>&#x02013;<lpage>889</lpage>. <pub-id pub-id-type="doi">10.1086/650733</pub-id><pub-id pub-id-type="pmid">20158401</pub-id></citation></ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Martin</surname> <given-names>D. P.</given-names></name> <name><surname>Murrell</surname> <given-names>B.</given-names></name> <name><surname>Golden</surname> <given-names>M.</given-names></name> <name><surname>Khoosal</surname> <given-names>A.</given-names></name> <name><surname>Muhire</surname> <given-names>B.</given-names></name></person-group> (<year>2015</year>). <article-title>RDP4: Detection and analysis of recombination patterns in virus genomes</article-title>. <source>Virus Evol.</source> <volume>1</volume>:<fpage>vev003</fpage>. <pub-id pub-id-type="doi">10.1093/ve/vev003</pub-id><pub-id pub-id-type="pmid">27774277</pub-id></citation></ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Masten</surname> <given-names>B. J.</given-names></name> <name><surname>Joys</surname> <given-names>T. M.</given-names></name></person-group> (<year>1993</year>). <article-title>Molecular analyses of the <italic>Salmonella</italic> g&#x02026; flagellar antigen complex</article-title>. <source>J. Bacteriol.</source> <volume>175</volume>, <fpage>5359</fpage>&#x02013;<lpage>5365</lpage>. <pub-id pub-id-type="doi">10.1128/jb.175.17.5359-5365.1993</pub-id></citation></ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>McClelland</surname> <given-names>M.</given-names></name> <name><surname>Sanderson</surname> <given-names>K. E.</given-names></name> <name><surname>Spieth</surname> <given-names>J.</given-names></name> <name><surname>Clifton</surname> <given-names>S. W.</given-names></name> <name><surname>Latreille</surname> <given-names>P.</given-names></name> <name><surname>Courtney</surname> <given-names>L.</given-names></name> <etal/></person-group>. (<year>2001</year>). <article-title>Complete genome sequence of <italic>Salmonella enterica</italic> serovar Typhimurium LT2</article-title>. <source>Nature</source> <volume>413</volume>, <fpage>852</fpage>&#x02013;<lpage>856</lpage>. <pub-id pub-id-type="doi">10.1038/35101614</pub-id><pub-id pub-id-type="pmid">11677609</pub-id></citation></ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>McQuiston</surname> <given-names>J. R.</given-names></name> <name><surname>Fields</surname> <given-names>P. I.</given-names></name> <name><surname>Tauxe</surname> <given-names>R. V.</given-names></name> <name><surname>Logsdon</surname> <given-names>J. M.</given-names> <suffix>Jr.</suffix></name></person-group> (<year>2008</year>). <article-title>Do <italic>Salmonella</italic> carry spare tyres?</article-title> <source>Trends Microbiol.</source> <volume>16</volume>, <fpage>142</fpage>&#x02013;<lpage>148</lpage>. <pub-id pub-id-type="doi">10.1016/j.tim.2008.01.009</pub-id><pub-id pub-id-type="pmid">18375124</pub-id></citation></ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>McQuiston</surname> <given-names>J. R.</given-names></name> <name><surname>Parrenas</surname> <given-names>R.</given-names></name> <name><surname>Ortiz-Rivera</surname> <given-names>M.</given-names></name> <name><surname>Gheesling</surname> <given-names>L.</given-names></name> <name><surname>Brenner</surname> <given-names>F.</given-names></name> <name><surname>Fields</surname> <given-names>P. I.</given-names></name></person-group> (<year>2004</year>). <article-title>Sequencing and comparative analysis of flagellin genes <italic>fliC, fljB</italic>, and <italic>flpA</italic> from <italic>Salmonella</italic></article-title>. <source>J. Clin. Microbiol.</source> <volume>42</volume>, <fpage>1923</fpage>&#x02013;<lpage>1932</lpage>. <pub-id pub-id-type="doi">10.1128/JCM.42.5.1923-1932.2004</pub-id><pub-id pub-id-type="pmid">15131150</pub-id></citation></ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>McQuiston</surname> <given-names>J. R.</given-names></name> <name><surname>Waters</surname> <given-names>R. J.</given-names></name> <name><surname>Dinsmore</surname> <given-names>B. A.</given-names></name> <name><surname>Mikoleit</surname> <given-names>M. L.</given-names></name> <name><surname>Fields</surname> <given-names>P. I.</given-names></name></person-group> (<year>2011</year>). <article-title>Molecular determination of H antigens of <italic>Salmonella</italic> by use of a microsphere-based liquid array</article-title>. <source>J. Clin. Microbiol.</source> <volume>49</volume>, <fpage>565</fpage>&#x02013;<lpage>573</lpage>. <pub-id pub-id-type="doi">10.1128/JCM.01323-10</pub-id><pub-id pub-id-type="pmid">21159932</pub-id></citation></ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Moreno-Hagelsieb</surname> <given-names>G.</given-names></name> <name><surname>Latimer</surname> <given-names>K.</given-names></name></person-group> (<year>2008</year>). <article-title>Choosing BLAST options for better detection of orthologs as reciprocal best hits</article-title>. <source>Bioinformatics</source> <volume>24</volume>, <fpage>319</fpage>&#x02013;<lpage>324</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btm585</pub-id><pub-id pub-id-type="pmid">18042555</pub-id></citation></ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mortimer</surname> <given-names>C. K.</given-names></name> <name><surname>Gharbia</surname> <given-names>S. E.</given-names></name> <name><surname>Logan</surname> <given-names>J. M.</given-names></name> <name><surname>Peters</surname> <given-names>T. M.</given-names></name> <name><surname>Arnold</surname> <given-names>C.</given-names></name></person-group> (<year>2007</year>). <article-title>Flagellin gene sequence evolution in <italic>Salmonella</italic></article-title>. <source>Infect. Genet. Evol.</source> <volume>7</volume>, <fpage>411</fpage>&#x02013;<lpage>415</lpage>. <pub-id pub-id-type="doi">10.1016/j.meegid.2006.12.001</pub-id><pub-id pub-id-type="pmid">17251067</pub-id></citation></ref>
<ref id="B33">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Nataro</surname> <given-names>J. P.</given-names></name> <name><surname>Bopp</surname> <given-names>C. A.</given-names></name> <name><surname>Fields</surname> <given-names>P. I.</given-names></name> <name><surname>Kaper</surname> <given-names>J. B.</given-names></name> <name><surname>Strockbine</surname> <given-names>N. A.</given-names></name> <name><surname>Murray</surname> <given-names>P. R.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title><italic>Escherichia, Shigella</italic>, and <italic>Salmonella</italic></article-title>, in <source>Manual of Clinical Microbiology, 10th Edn</source>, eds <person-group person-group-type="editor"><name><surname>Murray</surname> <given-names>P. R.</given-names></name> <name><surname>Baro</surname> <given-names>E. J.</given-names></name> <name><surname>Jorgensen</surname> <given-names>J. H.</given-names></name> <name><surname>Pfaller</surname> <given-names>M. A.</given-names></name> <name><surname>Yolken</surname> <given-names>R. H.</given-names></name></person-group> (<publisher-loc>Washington, DC</publisher-loc>: <publisher-name>ASM Press</publisher-name>), <fpage>603</fpage>&#x02013;<lpage>626</lpage>.</citation></ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Okazaki</surname> <given-names>N.</given-names></name> <name><surname>Matsuo</surname> <given-names>S.</given-names></name> <name><surname>Saito</surname> <given-names>A.</given-names></name> <name><surname>Tominaga</surname> <given-names>M.</given-names></name> <name><surname>Enomoto</surname> <given-names>M.</given-names></name></person-group> (<year>1993</year>). <article-title>Conversion of the <italic>Salmonella</italic> phase 1 flagellin gene <italic>fliC</italic> to the phase 2 gene <italic>fljB</italic> on the <italic>Escherichia coli</italic> K-12 chromosome</article-title>. <source>J. Bacteriol.</source> <volume>175</volume>, <fpage>758</fpage>&#x02013;<lpage>766</lpage>. <pub-id pub-id-type="doi">10.1128/jb.175.3.758-766.1993</pub-id><pub-id pub-id-type="pmid">8423149</pub-id></citation></ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Octavia</surname> <given-names>S.</given-names></name> <name><surname>Lan</surname> <given-names>R. T.</given-names></name></person-group> (<year>2006</year>). <article-title>Frequent recombination and low level of clonality within <italic>Salmonella enterica</italic> subspecies I</article-title>. <source>Microbiology Sgm</source> <volume>152</volume>, <fpage>1099</fpage>&#x02013;<lpage>1108</lpage>. <pub-id pub-id-type="doi">10.1099/mic.0.28486-0</pub-id><pub-id pub-id-type="pmid">16549673</pub-id></citation></ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ogushi</surname> <given-names>K.</given-names></name> <name><surname>Wada</surname> <given-names>A.</given-names></name> <name><surname>Niidome</surname> <given-names>T.</given-names></name> <name><surname>Mori</surname> <given-names>N.</given-names></name> <name><surname>Oishi</surname> <given-names>K.</given-names></name> <name><surname>Nagatake</surname> <given-names>T.</given-names></name> <etal/></person-group>. (<year>2001</year>). <article-title><italic>Salmonella enteritidis</italic> FliC (flagella filament protein) induces human beta-defensin-2 mRNA production by Caco-2 cells</article-title>. <source>J. Biol. Chem.</source> <volume>276</volume>, <fpage>30521</fpage>&#x02013;<lpage>30526</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.M011618200</pub-id><pub-id pub-id-type="pmid">11387317</pub-id></citation></ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ohnishi</surname> <given-names>K.</given-names></name> <name><surname>Kutsukake</surname> <given-names>K.</given-names></name> <name><surname>Suzuki</surname> <given-names>H.</given-names></name> <name><surname>Iino</surname> <given-names>T.</given-names></name></person-group> (<year>1990</year>). <article-title>Gene <italic>fliA</italic> encodes an alternative sigma factor specific for flagellar operons in <italic>Salmonella</italic> typhimurium</article-title>. <source>Mol. Gen. Genet.</source> <volume>221</volume>, <fpage>139</fpage>&#x02013;<lpage>147</lpage>. <pub-id pub-id-type="doi">10.1007/BF00261713</pub-id><pub-id pub-id-type="pmid">2196428</pub-id></citation></ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parkhill</surname> <given-names>J.</given-names></name> <name><surname>Dougan</surname> <given-names>G.</given-names></name> <name><surname>James</surname> <given-names>K. D.</given-names></name> <name><surname>Thomson</surname> <given-names>N. R.</given-names></name> <name><surname>Pickard</surname> <given-names>D.</given-names></name> <name><surname>Wain</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2001</year>). <article-title>Complete genome sequence of a multiple drug resistant <italic>Salmonella enterica</italic> serovar Typhi CT18</article-title>. <source>Nature</source> <volume>413</volume>, <fpage>848</fpage>&#x02013;<lpage>852</lpage>. <pub-id pub-id-type="doi">10.1038/35101607</pub-id><pub-id pub-id-type="pmid">11677608</pub-id></citation></ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schrader</surname> <given-names>K. N.</given-names></name> <name><surname>Fernandez-Castro</surname> <given-names>A.</given-names></name> <name><surname>Cheung</surname> <given-names>W. K.</given-names></name> <name><surname>Crandall</surname> <given-names>C. M.</given-names></name> <name><surname>Abbott</surname> <given-names>S. L.</given-names></name></person-group> (<year>2008</year>). <article-title>Evaluation of commercial antisera for <italic>Salmonella</italic> serotyping</article-title>. <source>J. Clin. Microbiol.</source> <volume>46</volume>, <fpage>685</fpage>&#x02013;<lpage>688</lpage>. <pub-id pub-id-type="doi">10.1128/JCM.01808-07</pub-id><pub-id pub-id-type="pmid">18094130</pub-id></citation></ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Selander</surname> <given-names>R. K.</given-names></name> <name><surname>Smith</surname> <given-names>N. H.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Beltran</surname> <given-names>P.</given-names></name> <name><surname>Ferris</surname> <given-names>K. E.</given-names></name> <name><surname>Kopecko</surname> <given-names>D. J.</given-names></name> <etal/></person-group>. (<year>1992</year>). <article-title>Molecular evolutionary genetics of the cattle-adapted serovar <italic>Salmonella</italic> dublin</article-title>. <source>J. Bacteriol.</source> <volume>174</volume>, <fpage>3587</fpage>&#x02013;<lpage>3592</lpage>. <pub-id pub-id-type="doi">10.1128/jb.174.11.3587-3592.1992</pub-id><pub-id pub-id-type="pmid">1592813</pub-id></citation></ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Silverman</surname> <given-names>M.</given-names></name> <name><surname>Zieg</surname> <given-names>J.</given-names></name> <name><surname>Hilmen</surname> <given-names>M.</given-names></name> <name><surname>Simon</surname> <given-names>M.</given-names></name></person-group> (<year>1979</year>). <article-title>Phase variation in <italic>Salmonella</italic>: genetic analysis of a recombinational switch</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>76</volume>, <fpage>391</fpage>&#x02013;<lpage>395</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.76.1.391</pub-id><pub-id pub-id-type="pmid">370828</pub-id></citation></ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Smith</surname> <given-names>N. H.</given-names></name> <name><surname>Beltran</surname> <given-names>P.</given-names></name> <name><surname>Selander</surname> <given-names>R. K.</given-names></name></person-group> (<year>1990</year>). <article-title>Recombination of <italic>Salmonella</italic> phase 1 flagellin genes generates new serovars</article-title>. <source>J. Bacteriol.</source> <volume>172</volume>, <fpage>2209</fpage>&#x02013;<lpage>2216</lpage>. <pub-id pub-id-type="doi">10.1128/jb.172.5.2209-2216.1990</pub-id><pub-id pub-id-type="pmid">2129534</pub-id></citation></ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stamatakis</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies</article-title>. <source>Bioinformatics</source> <volume>30</volume>, <fpage>1312</fpage>&#x02013;<lpage>1313</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btu033</pub-id><pub-id pub-id-type="pmid">24451623</pub-id></citation></ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stecher</surname> <given-names>B.</given-names></name> <name><surname>Hapfelmeier</surname> <given-names>S.</given-names></name> <name><surname>M&#x000FC;ller</surname> <given-names>C.</given-names></name> <name><surname>Kremer</surname> <given-names>M.</given-names></name> <name><surname>Stallmach</surname> <given-names>T.</given-names></name> <name><surname>Hardt</surname> <given-names>W. D.</given-names></name></person-group> (<year>2004</year>). <article-title>Flagella and chemotaxis are required for efficient induction of <italic>Salmonella enterica</italic> serovar Typhimurium colitis in streptomycin-pretreated mice</article-title>. <source>Infect. Immun.</source> <volume>72</volume>, <fpage>4138</fpage>&#x02013;<lpage>4150</lpage>. <pub-id pub-id-type="doi">10.1128/IAI.72.7.4138-4150.2004</pub-id><pub-id pub-id-type="pmid">15213159</pub-id></citation></ref>
<ref id="B45">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Swofford</surname> <given-names>D. L.</given-names></name></person-group> (<year>1998</year>). <source>PAUP<sup>&#x0002A;</sup>. Phylogenetic Analysis Using Parsimony (<sup>&#x0002A;</sup>and Other Methods)</source>. <publisher-loc>Sunderland, MA</publisher-loc>: <publisher-name>Sinauer Associates</publisher-name>.</citation></ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Szekely</surname> <given-names>E.</given-names></name> <name><surname>Simon</surname> <given-names>M.</given-names></name></person-group> (<year>1983</year>). <article-title>DNA sequence adjacent to flagellar genes and evolution of flagellar-phase variation</article-title>. <source>J. Bacteriol.</source> <volume>155</volume>, <fpage>74</fpage>&#x02013;<lpage>81</lpage>. <pub-id pub-id-type="pmid">6305924</pub-id></citation></ref>
<ref id="B47">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Wain</surname> <given-names>J.</given-names></name> <name><surname>O&#x00027;Grady</surname> <given-names>J.</given-names></name></person-group> (<year>2017</year>). <article-title>Genomic diversity in Salmonella enterica</article-title>, in <source>Applied Genomics of Foodborne Pathogens</source>, eds <person-group person-group-type="editor"><name><surname>Deng</surname> <given-names>X.</given-names></name> <name><surname>den Bakker</surname> <given-names>H. C.</given-names></name> <name><surname>Hendriksen</surname> <given-names>R. S.</given-names></name></person-group> (<publisher-loc>Cham</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>91</fpage>&#x02013;<lpage>107</lpage>.</citation></ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yamamoto</surname> <given-names>S.</given-names></name> <name><surname>Kutsukake</surname> <given-names>K.</given-names></name></person-group> (<year>2006</year>). <article-title>FljA-mediated posttranscriptional control of phase 1 flagellin expression in flagellar phase variation of <italic>Salmonella enterica</italic> serovar Typhimurium</article-title>. <source>J. Bacteriol.</source> <volume>188</volume>, <fpage>958</fpage>&#x02013;<lpage>967</lpage>. <pub-id pub-id-type="doi">10.1128/JB.188.3.958-967.2006</pub-id><pub-id pub-id-type="pmid">16428400</pub-id></citation></ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yokoseki</surname> <given-names>T.</given-names></name> <name><surname>Kutsukake</surname> <given-names>K.</given-names></name> <name><surname>Ohnishi</surname> <given-names>K.</given-names></name> <name><surname>Iino</surname> <given-names>T.</given-names></name></person-group> (<year>1995</year>). <article-title>Functional analysis of the flagellar genes in the <italic>fliD</italic> operon of <italic>Salmonella</italic> typhimurium</article-title>. <source>Microbiology</source> <volume>141</volume> (<issue>Pt 7</issue>), <fpage>1715</fpage>&#x02013;<lpage>1722</lpage>. <pub-id pub-id-type="doi">10.1099/13500872-141-7-1715</pub-id><pub-id pub-id-type="pmid">7551038</pub-id></citation></ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yue</surname> <given-names>M.</given-names></name> <name><surname>Han</surname> <given-names>X.</given-names></name> <name><surname>De Masi</surname> <given-names>L.</given-names></name> <name><surname>Zhu</surname> <given-names>C.</given-names></name> <name><surname>Ma</surname> <given-names>X.</given-names></name> <name><surname>Zhang</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Allelic variation contributes to bacterial host specificity</article-title>. <source>Nat. Commun.</source> <volume>6</volume>:<fpage>8754</fpage>. <pub-id pub-id-type="doi">10.1038/ncomms9754</pub-id><pub-id pub-id-type="pmid">26515720</pub-id></citation></ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>Z.</given-names></name> <name><surname>McCann</surname> <given-names>A.</given-names></name> <name><surname>Litrup</surname> <given-names>E.</given-names></name> <name><surname>Murphy</surname> <given-names>R.</given-names></name> <name><surname>Cormican</surname> <given-names>M.</given-names></name> <name><surname>Fanning</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Neutral genomic microevolution of a recently emerged pathogen, <italic>Salmonella enterica</italic> serovar Agona</article-title>. <source>PLoS Genet.</source> <volume>9</volume>:<fpage>e1003471</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1003471</pub-id><pub-id pub-id-type="pmid">23637636</pub-id></citation></ref>
<ref id="B52">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zieg</surname> <given-names>J.</given-names></name> <name><surname>Hilmen</surname> <given-names>M.</given-names></name> <name><surname>Simon</surname> <given-names>M.</given-names></name></person-group> (<year>1978</year>). <article-title>Regulation of gene expression by site-specific inversion</article-title>. <source>Cell</source> <volume>15</volume>, <fpage>237</fpage>&#x02013;<lpage>244</lpage>. <pub-id pub-id-type="doi">10.1016/0092-8674(78)90098-3</pub-id><pub-id pub-id-type="pmid">699043</pub-id></citation></ref>
<ref id="B53">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zou</surname> <given-names>Q. H.</given-names></name> <name><surname>Li</surname> <given-names>R. Q.</given-names></name> <name><surname>Liu</surname> <given-names>G. R.</given-names></name> <name><surname>Liu</surname> <given-names>S. L.</given-names></name></person-group> (<year>2014</year>). <article-title>Comparative genomic analysis between typhoidal and non-typhoidal <italic>Salmonella</italic> serovars reveals typhoid-specific protein families</article-title>. <source>Infect. Genet. Evol.</source> <volume>26</volume>, <fpage>295</fpage>&#x02013;<lpage>302</lpage>. <pub-id pub-id-type="doi">10.1016/j.meegid.2014.06.008</pub-id><pub-id pub-id-type="pmid">24951835</pub-id></citation></ref>
</ref-list>
</back>
</article>