<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Cell. Inf. Microbio.</journal-id>
<journal-title>Frontiers in Cellular and Infection Microbiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Cell. Inf. Microbio.</abbrev-journal-title>
<issn pub-type="epub">2235-2988</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fcimb.2012.00116</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Microbiology</subject>
<subj-group>
<subject>Perspective Article</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>The genetic integrity of bacterial species: the core genome and the accessory genome, two different stories</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Segerman</surname> <given-names>Bo</given-names></name>
<xref ref-type="author-notes" rid="fn001"><sup>&#x0002A;</sup></xref>
</contrib>
</contrib-group>
<aff><institution>National Veterinary Institute</institution> <country>Uppsala, Sweden</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Didier Raoult, Universit&#x000E9; de la M&#x000E9;diterran&#x000E9;e, France</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Kalai Mathee, Florida International University, USA; Nick S. Jakubovics, Newcastle University, UK; Jean-Marc Rolain, URMITE CNRS-IRD UMR 6236, France</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Bo Segerman, National Veterinary Institute SVA, SE-751 89 Uppsala, Sweden. e-mail: <email>bo.segerman&#x00040;sva.se</email></p></fn>
</author-notes>
<pub-date pub-type="epreprint">
<day>04</day>
<month>05</month>
<year>2012</year>
</pub-date>
<pub-date pub-type="epub">
<day>06</day>
<month>09</month>
<year>2012</year>
</pub-date>
<pub-date pub-type="collection">
<year>2012</year>
</pub-date>
<volume>2</volume>
<elocation-id>116</elocation-id>
<history>
<date date-type="received">
<day>01</day>
<month>03</month>
<year>2012</year>
</date>
<date date-type="accepted">
<day>20</day>
<month>08</month>
<year>2012</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2012 Segerman.</copyright-statement>
<copyright-year>2012</copyright-year>
<license license-type="open-access" xlink:href="http://www.frontiersin.org/licenseagreement"><p>This is an open-access article distributed under the terms of the <uri xlink:href="http://creativecommons.org/licenses/by/3.0/">Creative Commons Attribution License</uri>, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.</p>
</license>
</permissions>
<abstract><p>Strains within a bacterial species typically have a set of conserved core genes and a variable set of accessory genes. The accessory genes often appear to move laterally between strains, thereby forming new trait combinations. Sometimes, genetic material also moves laterally between species, thereby resulting in diffuse borders between them. The growing number of genome sequences offers new possibilities to study these processes. Ten species for which abundant genomic data exists were here selected for analysis of the species border integrity. The average core genome similarities and relative core genome sizes (RCGSs) were determined for strain pairs within the species and for strain pairs crossing the species border. The variability within the species as well as the border integrity varies for different bacterial species. Some have very distinct borders while others are more or less indefinable. From the growing amount of genomic data, it becomes even clearer that the concept of bacterial species is, in many cases, far from absolute.</p></abstract>
<kwd-group>
<kwd>accessory genome</kwd>
<kwd>bacterial species</kwd>
<kwd>core genome</kwd>
<kwd>lateral gene transfer</kwd>
<kwd>species border</kwd>
</kwd-group>
<counts>
<fig-count count="2"/>
<table-count count="1"/>
<equation-count count="0"/>
<ref-count count="31"/>
<page-count count="8"/>
<word-count count="4644"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="introduction" id="s1">
<title>Introduction</title>
<p>In higher eukaryotes, a species is often defined as a group of organisms that are so reproductively isolated that interbreeding with other species cannot occur or does not result in a fertile offspring. This is believed to maintain the genetic integrity of the species over time. A genetic pool that is much larger than that present in each individual is maintained within the species and sexual reproduction accounts for the formation of new allele combinations. Thus, the genetic material is inherited vertically from the combined genetic pool of the parents and the apparent universality of this inheritance mode led Darwin to propose that all organisms could be organized into a &#x0201C;tree of life.&#x0201D;</p>
<p>In prokaryotes, the situation is somewhat different. Their genetic material is asexually inherited from the ancestral cell. Accumulation of mutations during this clonal expansion can give rise to sub-populations with selective advantages. If prokaryotes were to rely on only this mechanism for adaptation, new trait combinations would require &#x0201C;reinventing the wheel&#x0201D; over and over. Thus, it is not surprising that lateral gene transfer (LGT) mechanisms exist in prokaryotes (Ochman et al., <xref ref-type="bibr" rid="B20">2000</xref>). LGT allows advantageous genes to sweep through populations (Shapiro et al., <xref ref-type="bibr" rid="B26">2012</xref>). As a consequence of the lateral movements of genetic material, the organisms becomes chimerical and a strict tree model cannot adequately represent their phylogenetic relationships (Bapteste et al., <xref ref-type="bibr" rid="B2">2009</xref>). To reflect this, the phrase &#x0201C;rhizome of life&#x0201D; is sometimes used as an alternative to &#x0201C;tree of life&#x0201D; (Raoult, <xref ref-type="bibr" rid="B21">2010</xref>). It is clear that different models to represent this complex evolutionary history are necessary, depending on what scientific questions we are addressing, i.e., we need to use &#x0201C;pattern pluralism&#x0201D; in our way of thinking (Doolittle and Bapteste, <xref ref-type="bibr" rid="B7">2007</xref>). However, the relative extent to which the tree model should, or must, be discarded in favor of alternative models is still debated and varies depending on the species in question. A further consequence of LGT is that the actual concept of the bacterial species becomes partially undermined (Doolittle and Papke, <xref ref-type="bibr" rid="B8">2006</xref>). A broader viewpoint for describing bacterial population structures constitutes the presence of sympatric species complexes with high plasticity and lateral gene exchange from which specialized allopatric species, such as pathogens, can escape (Georgiades and Raoult, <xref ref-type="bibr" rid="B10">2010</xref>). Reductive evolution accompanying the pathogenic lifestyle will, in many cases, confine the species (Merhej et al., <xref ref-type="bibr" rid="B18">2009</xref>).</p>
<p>Recently, a clearer understanding of bacterial genome evolution has emerged. This is mainly because of the intense technological development of high-throughput sequencing (Metzker, <xref ref-type="bibr" rid="B19">2010</xref>). Sequencing is now done in enormous amounts of randomly primed parallel sequencing reactions from the same sample. The technologies are often collected under the name Next Generation Sequencing (NGS). NGS has unquestionably had an enormous impact on the growth rate of the bacterial genome database (Figure <xref ref-type="fig" rid="F1">1A</xref>). As parallel sequencing machines are moving from the core facilities into regular laboratories, it is likely that the number of sequences will continue to increase exponentially. In the early days of bacterial genome sequencing, most projects typically aimed to produce a complete genome sequence as the final product. Making a complete genome sequence still requires a lot of resources. In contrast, draft sequences have become easy and affordable to produce and are often sufficient for answering many biological questions. Consequently, the draft genome database is growing more rapidly than the complete one. At the time of the analysis presented in this paper, there were over 2000 completed genomes and almost 3000 draft genomes in the form of contigs/scaffolds. There are also a large number of draft genome assemblies not yet submitted to the database.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p><bold>The genome database</bold>. The figure is based on the NCBI GenomeReport dated 21 Feb, 2012 and only includes genomes projects where completed sequences or draft assemblies were available for download. <bold>(A)</bold> The growth of the databases for draft and complete bacterial genomes. <bold>(B)</bold> The number of species with 1, 2, 3, 4, and 5 (or more) strains sequenced Aug 2008 and Feb. 2012, respectively. <bold>(C)</bold> The species with most genome sequences. Black represents completed genomes and grey draft genomes.</p></caption>
<graphic xlink:href="fcimb-02-00116-g0001.tif"/>
</fig>
<p>How are the current sequencing efforts being directed? Figure <xref ref-type="fig" rid="F1">1B</xref> shows the number of species in the genome database in August 2008 and February 2012. The number of species has grown from approximately 700 to almost 1000 (a 1.4-fold increase) and the number of sequences from 1255 to &#x0007E;4900 (a 3.9-fold increase). This illustrates that sequencing activities aimed at producing more sequences from strains belonging to already represented species are far more intense than sequencing projects directed towards new species. The ten most sequenced species account for 22% of the genome database. The bacterial species for which the highest number of genome sequences are available (at the time of this writing) are shown in Figure <xref ref-type="fig" rid="F1">1C</xref>. The large number of available genomic sequences has given us the opportunity to compare genomic variation within species as well as between them. However, sequencing efforts are undoubtedly biased towards strains of medical or economical importance and this may very well bias the conclusions we make.</p>
<p>The genomic sequencing efforts have made us realize that, similar to eukaryotes, bacterial species maintain a &#x0201C;genetic pool&#x0201D; much larger than the one present in each strain. Each strain has a conserved set of core genes and additionally, a number of accessory genes. The dynamics of the accessory genes can give rise to strains with &#x0201C;customized genomic repertoires&#x0201D; (Mathee et al., <xref ref-type="bibr" rid="B16">2008</xref>). Thus, different strains may have different sets of accessory genes and the superset of all different genes present in a species is often referred to as the pan-genome (Tettelin et al., <xref ref-type="bibr" rid="B30">2005</xref>). It is likely that the accessory genes constitute a reservoir for functionality that can be transferred laterally to create new trait combinations. However, a large fraction of the accessory genes have often no functional annotation; our knowledge about many of these genes is poor. The accessory genome has become an important field of study (Sim et al., <xref ref-type="bibr" rid="B27">2008</xref>; Bennett et al., <xref ref-type="bibr" rid="B3">2010</xref>; Kung et al., <xref ref-type="bibr" rid="B13">2010</xref>).</p>
<p>In this study, a limited dataset of species representing different life strategies and having a high number of both draft and complete genome sequences was selected for a more detailed analysis (Table <xref ref-type="table" rid="T1">1</xref>). The genomic information was here used to study the genetic integrity of these bacterial species in terms of core genome sequence variability and variations in the relative size of the core genome/accessory genome.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p><bold>Species included in the analysis</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left"><bold>Genus</bold></th>
<th align="left"><bold>Species analyzed</bold></th>
<th align="left"><bold>Number of species represented in the genome sequence database</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left"><italic>Staphylococcus</italic></td>
<td align="left"><italic>S. aureus</italic></td>
<td align="left">12</td>
</tr>
<tr>
<td align="left"><italic>Streptococcus</italic></td>
<td align="left"><italic>S. pneumoniae</italic></td>
<td align="left">37</td>
</tr>
<tr>
<td align="left"><italic>Escherichia</italic></td>
<td align="left"><italic>E. coli</italic></td>
<td align="left">3</td>
</tr>
<tr>
<td align="left"><italic>Salmonella</italic></td>
<td align="left"><italic>S. enterica</italic></td>
<td align="left">2</td>
</tr>
<tr>
<td align="left"><italic>Mycobacterium</italic></td>
<td align="left"><italic>M. tuberculosis</italic></td>
<td align="left">17</td>
</tr>
<tr>
<td align="left"><italic>Neisseria</italic></td>
<td align="left"><italic>N. meningitidis</italic></td>
<td align="left">15</td>
</tr>
<tr>
<td align="left"><italic>Helicobacter</italic></td>
<td align="left"><italic>H. pylori</italic></td>
<td align="left">12</td>
</tr>
<tr>
<td align="left"><italic>Clostridium</italic></td>
<td align="left"><italic>C. botulinum</italic></td>
<td align="left">14</td>
</tr>
<tr>
<td align="left"><italic>Bacillus</italic></td>
<td align="left"><italic>B. anthracis</italic></td>
<td align="left">20</td>
</tr>
<tr>
<td align="left"><italic>Burkholderia</italic></td>
<td align="left"><italic>B. pseudomallei</italic></td>
<td align="left">17</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec>
<title>Comparative analysis</title>
<p>The degree of sequence similarity within the core genome is considered to be one of the best phylogenomic measures for comparing microbial genomes (Rokas et al., <xref ref-type="bibr" rid="B25">2003</xref>). By averaging comparisons of a large number of genes, the risk for disturbances in the result caused by laterally moved genetic material is minimized. In this study, pairwise average core genome similarity (ACGS) values and relative core genome size (RCGS) values were calculated using the Gegenees fragmented alignment method (&#x000C5;gren et al., <xref ref-type="bibr" rid="B1">2012</xref>). In brief, the genomes were fragmented into overlapping 200-basepair pieces and for each fragment a BLASTN score was calculated. The core genome was defined as the regions constituted by fragments with scores of at least 25% of the score value of a perfect match. The average similarity was normalized towards the value obtained when the genome was compared to itself. The RCGS value represents the core genome size relative to the whole genome size (core genome &#x0002B; accessory genome).</p>
<p>If genetic material were only inherited vertically, the differences in size of the accessory genetic material between two strains would depend on gene loss events and duplication events followed by acquisition of new functionality (neo-functionalization or sub-functionalization). It would then be expected that RCGS would gradually decrease as ACGS decrease. However, if lateral movement of accessory genetic material occur, a much greater variation in RCGS values would be expected because lateral movements would be more or less uncoupled with the vertical inheritance.</p>
<p>In this study, the relationship between ACGS and RCGS values and the integrity of the species border in terms of these values were examined by analysis of a large number of pairwise comparisons within a selected set of bacterial genera. Ten species, with good representation in the sequence database (Table <xref ref-type="table" rid="T1">1</xref>), were selected. The ACGS and RCGS values were then calculated, pairwise, for every possible genome combination in the genus. Thus, there were both intra- and inter-species comparisons. The data were used to create a diagram with the highest ACGS (shortest vertical inheritance distance) first, and then the pairwise comparisons plotted in descending order. In the same diagram, the corresponding RCGS values were plotted and the part of the diagram that represented intra-species comparisons was indicated. Hereafter, this diagram type is referred to as a &#x0201C;species integrity diagram.&#x0201D; The software Gegenees (&#x000C5;gren et al., <xref ref-type="bibr" rid="B1">2012</xref>) can, from version 1.1.5, generate this type of diagram and also gives interactive annotations of individual data points.</p>
</sec>
<sec>
<title>Species integrity of ten selected species</title>
<p>In the intra-species pairwise comparison of <italic>Staphylococcus aureus</italic> strains, ACGS values varied between 90% and 100% (Figure <xref ref-type="fig" rid="F2">2A</xref>). The RCGS values fluctuated between 90&#x02013;99% (i.e., up to 10% accessory genetic material) with modest correlation with the ACGS values. This indicates that the accessory genetic material is mobile between strains within the species border. Lateral transfer of genetic material in <italic>S. aureus</italic> has been described previously in relation to mobile genetic elements (Deurenberg and Stobberingh, <xref ref-type="bibr" rid="B5">2008</xref>; Lindsay, <xref ref-type="bibr" rid="B14">2010</xref>). The closest strain in the inter-species comparison had distinctly lower ACGS and RCGS values. Thus, on the basis of the genomic data available today, there seems to be a distinct border between the species <italic>S. aureus</italic> and its closest neighbors.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p><bold>Species integrity diagrams for 10 selected species</bold>. Pairwise genome comparisons defining average core genome similarity (ACGS) are shown in red and relative core genome size (RCGS) in blue. ACGS measures phylogenomic distance while RCGS measures the size of the core genome relative to the total genome size (core &#x0002B; accessory genome). Intra-species comparisons are plotted first (indicated with the black line) and inter-species comparisons thereafter. Data are sorted by ASCG. <bold>(A&#x02013;J)</bold> Intra-species comparisons of strains from the selected species [<italic>Staphylococcus aureus, Streptococcus pneumoniae, Escherichia coli, Salmonella enterica, Mycobacterium tuberculosis, Neisseria meningitidis, Helicobacter pylori, Clostridium botulinum, Bacillus anthracis</italic> (a part of the <italic>Bacillus cereus</italic> group), and <italic>Burkholderia pseudomallei</italic>] and inter-species comparison with strains from other species in the same genus.</p></caption>
<graphic xlink:href="fcimb-02-00116-g0002.tif"/>
</fig>
<p>In <italic>Streptococcus pneumoniae</italic>, the intra-species comparison results were very similar to those found in <italic>S. aureus</italic> (Figure <xref ref-type="fig" rid="F2">2B</xref>). Lateral transfer within <italic>S. pneumoniae</italic> has been described previously (Coffey et al., <xref ref-type="bibr" rid="B4">1991</xref>). Inter-species comparisons show that the distance between <italic>S. pneumoniae</italic> and its closest related strains outside the species border (belonging to <italic>S. mitis</italic>) is quite small and there is a continuous decline in similarity with not fully distinct plateaus. This indicates that <italic>S. pneumoniae</italic> is exchanging genes with other <italic>Streptococcus</italic> spp. The population structure and dynamics of <italic>S. pneumoniae</italic> in terms of the pan genome has been studied previously (Donati et al., <xref ref-type="bibr" rid="B6">2010</xref>).</p>
<p>In the genus <italic>Escherichia</italic>, almost all sequences came from <italic>E. coli</italic>. The high amount of accessory genetic material and the large RCGS fluctuations between strain pairs with similar ACGS values (Figure <xref ref-type="fig" rid="F2">2C</xref>) suggest that there is an extraodinary mobility of accessory genetic material in <italic>E. coli</italic>. The border with other <italic>Escherichia</italic>. spp is not distinct although the sequencing database is underrepresented for non <italic>E. coli</italic> species. Like <italic>Escherichia</italic>, the related <italic>Salmonella</italic> genome database is dominated by a single species, <italic>S. enterica</italic>. <italic>S. enterica</italic> strains typically have an ACGS value of &#x0007E;55% compared to <italic>E. coli</italic> strains. The diagram shows that <italic>S. enterica strains</italic> have a lower variability in accessory genome size compared to <italic>E. coli</italic> (Figure <xref ref-type="fig" rid="F2">2D</xref>).</p>
<p>In contrast to the examples above, <italic>Mycobacterium tuberculosis</italic> is very distinct (Figure <xref ref-type="fig" rid="F2">2E</xref>). Low variability was seen for both for RCGS and ACGS values as was a clear difference to related, non-<italic>M. tuberculosis</italic> strains (except for <italic>M. bovis</italic> as discussed below). This indicates lateral gene movements are less frequent in this species. However, another type of problem in species designation becomes apparent here. <italic>M. bovis</italic> is, on a genomic level, an indistinguishable part of <italic>M. tuberculosis</italic>. Detailed genome analysis of a <italic>M. bovis</italic> strain has also shown that there is a very high genomic similarity to <italic>M. tuberculosis</italic> (Garnier et al., <xref ref-type="bibr" rid="B9">2003</xref>).</p>
<p><italic>Neisseria meningitidis</italic> is distinct but closely related to strains outside the species (Figure <xref ref-type="fig" rid="F2">2F</xref>). The closest strains belonged to the species <italic>N. gonorrhoeae</italic>. Many <italic>N. meningitidis</italic> strains showed only 10% lower RCGS values when compared to <italic>N. gonorrhoeae</italic> strains than when compared to the other strains within the species. This suggests crossover of genetic material can occur over the species border; nevertheless the ACGS values show clear distinct plateaus. The accessory genome has been observed to evolve differently from the core genome in <italic>Neisseria</italic> (Bennett et al., <xref ref-type="bibr" rid="B3">2010</xref>).</p>
<p>In <italic>Helicobacter pylori</italic>, the ACGS values were, in most intra-species comparisons, relatively low (Figure <xref ref-type="fig" rid="F2">2G</xref>). This suggests that <italic>Helicobacter</italic> has a comparatively high mutation rate. This has also been discussed previously (Wang et al., <xref ref-type="bibr" rid="B31">1999</xref>). The RCGS values fluctuate, as described in many species above, indicating the lateral movement of genes.</p>
<p><italic>Clostridium botulinum</italic> is a classical example of how long distance lateral gene movements can affect the integrity of what we call a bacterial species. The Botulinum Neuroxin (BoNT) gene, <italic>bont</italic>, has during several occasions jumped between quite distantly related <italic>Clostridium</italic> strains and this can be seen in the species integrity diagram (Figure <xref ref-type="fig" rid="F2">2H</xref>). <italic>C. botulinum</italic> can actually be seen as four distinct species that all are able to produce BoNT (Hill et al., <xref ref-type="bibr" rid="B11">2007</xref>; Skarin and Segerman, <xref ref-type="bibr" rid="B29">2011</xref>; Skarin et al., <xref ref-type="bibr" rid="B28">2011</xref>). Some strains without a functional <italic>bont</italic> gene can, from a genomic point of view, be considered to be the same species as <italic>C. botulinum</italic> but they go under other names (e.g., <italic>C. sporogenes, C. novyi).</italic> Furthermore, the <italic>bont</italic> gene can also be found in some <italic>C. baratii</italic> strains and some <italic>C. butyricum</italic> strains.</p>
<p><italic>Bacillus anthracis</italic> is an example of a species that represents a monophyletic clade within a larger group of related strains (Figure <xref ref-type="fig" rid="F2">2I</xref>). <italic>B. anthracis</italic> strains all have two virulence plasmids. Apart from the plasmids, they are extremely similar to other related strains (Kolsto et al., <xref ref-type="bibr" rid="B12">2009</xref>). There is a very diffuse boarder between the species in a large group of <italic>Bacillus</italic> strains containing <italic>B. anthracis</italic>, <italic>B. cereus</italic>, <italic>B. thuringiensis</italic>, <italic>B. mycoides, B. weihenstephanensis</italic>, and <italic>B. pseudomycoides</italic>. This group is commonly called the <italic>B. cereus</italic> group (Rasko et al., <xref ref-type="bibr" rid="B22">2005</xref>). Many <italic>B. cereus</italic> strains are much more closely related to <italic>B. thuringiensis</italic> strains than to other <italic>B. cereus</italic> strains.</p>
<p>Finally <italic>Burkholderia pseudomallei</italic> was analyzed (Figure <xref ref-type="fig" rid="F2">2J</xref>). The most striking observation is its relation to <italic>B. mallei. B. mallei</italic> is a lineage of <italic>B. pseudomallei</italic>, that has undergone massive reductive evolution (Losada et al., <xref ref-type="bibr" rid="B15">2010</xref>). Hence, the pronounced drop in the RCGS values without a corresponding drop in ACGS values.</p>
</sec>
<sec sec-type="conclusions" id="s2">
<title>Conclusions</title>
<p>When comparing different bacterial species from a genomic perspective, large differences can be found in what we call a species. A large part of the sequencing efforts today are focused on important human pathogens. The niches these species colonize are quite different from those for most environmental species. During their speciation, pathogens have generally shifted from a sympatric to an allopatric lifestyle and this is usually accompanied by reductive evolution and reduced pan genome size. Most of these pathogenic species are more or less distinct, based on core genome conservation, although lateral gene movement channels probably are quite common. In environmental species, however, we see a much larger variability and more poorly defined borders between the species. However, more data are needed on environmental species.</p>
<p>One extreme species is <italic>B. anthracis</italic>. It resides as a spore, without growth. When it does grow, it does so mainly without contact with other bacterial species before returning to the dormant spore state. Thus, it has little opportunity for contact with other bacterial species and hence low genetic variability. On the other extreme is <italic>E coli</italic>. This species has an extraordinarily large and variable set of accessory genes. It also lives in an environment where it is surrounded by a community of different, competing bacteria. Recently, we saw an example of an <italic>E. coli</italic> strain that took up new genetic material by lateral transfer and thereby transformed in to a new, highly virulent variant (Mellmann et al., <xref ref-type="bibr" rid="B17">2011</xref>; Rasko et al., <xref ref-type="bibr" rid="B23">2011</xref>; Rohde et al., <xref ref-type="bibr" rid="B24">2011</xref>).</p>
<p>Other species are located in between these extremes. They may be very close to neighboring species, but are typically still more or less distinguishable based on core sequence similarity. They have specialized in separate, environmental niches. As more strains are sequenced, we will probably find more intermediate forms of these species, showing that the bacterial species concept does not have an absolute definition. However, some draft genomes are of poor quality or are incorrectly annotated and must be treated with caution in any analysis of them.</p>
<p>How should we then relate to bacterial species? For most purposes, species of importance for humans are definable based on the core genome similarity. This is basically what we measure with the classical 16S analysis. However, we must be aware that the species borders are often (perhaps almost always) not absolute, and there will be intermediate strains occurring every now and then. Finally, it is definitely going to be equally important to classify strains according to their accessory gene content. This type of analysis will become much more efficient with the new upcoming sequencing technologies. We are standing at the beginning of a period with an enormous genome database growth and the possibility to greatly increase our understanding of what defines a bacterial species as such.</p>
<sec>
<title>Conflict of interest statement</title>
<p>The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</sec>
</body>
<back>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>&#x000C5;gren</surname> <given-names>J.</given-names></name> <name><surname>Sundstr&#x000F6;m</surname> <given-names>A.</given-names></name> <name><surname>H&#x000E5;fstr&#x000F6;m</surname> <given-names>T.</given-names></name> <name><surname>Segerman</surname> <given-names>B.</given-names></name></person-group> (<year>2012</year>). <article-title>Gegenees: fragmented alignment of multiple genomes for determining phylogenomic distances and genetic signatures unique for specified target groups</article-title>. <source>PLoS ONE</source> <volume>7</volume>:<fpage>e39107</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0039107</pub-id><pub-id pub-id-type="pmid">22723939</pub-id></citation>
</ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bapteste</surname> <given-names>E.</given-names></name> <name><surname>O&#x00027;Malley</surname> <given-names>M. A.</given-names></name> <name><surname>Beiko</surname> <given-names>R. G.</given-names></name> <name><surname>Ereshefsky</surname> <given-names>M.</given-names></name> <name><surname>Gogarten</surname> <given-names>J. P.</given-names></name> <name><surname>Franklin-Hall</surname> <given-names>L.</given-names></name> <name><surname>Lapointe</surname> <given-names>F. J.</given-names></name> <name><surname>Dupre</surname> <given-names>J.</given-names></name> <name><surname>Dagan</surname> <given-names>T.</given-names></name> <name><surname>Boucher</surname> <given-names>Y.</given-names></name> <name><surname>Martin</surname> <given-names>W.</given-names></name></person-group> (<year>2009</year>). <article-title>Prokaryotic evolution and the tree of life are two different things</article-title>. <source>Biol. Direct</source> <volume>4</volume>, <fpage>34</fpage>. <pub-id pub-id-type="doi">10.1186/1745-6150-4-34</pub-id><pub-id pub-id-type="pmid">19788731</pub-id></citation>
</ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bennett</surname> <given-names>J. S.</given-names></name> <name><surname>Bentley</surname> <given-names>S. D.</given-names></name> <name><surname>Vernikos</surname> <given-names>G. S.</given-names></name> <name><surname>Quail</surname> <given-names>M. A.</given-names></name> <name><surname>Cherevach</surname> <given-names>I.</given-names></name> <name><surname>White</surname> <given-names>B.</given-names></name> <name><surname>Parkhill</surname> <given-names>J.</given-names></name> <name><surname>Maiden</surname> <given-names>M. C.</given-names></name></person-group> (<year>2010</year>). <article-title>Independent evolution of the core and accessory gene sets in the genus <italic>Neisseria</italic>: insights gained from the genome of <italic>Neisseria lactamica</italic> isolate 020&#x02013;006</article-title>. <source>BMC Genomics</source> <volume>11</volume>, <fpage>652</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-11-652</pub-id><pub-id pub-id-type="pmid">21092259</pub-id></citation>
</ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Coffey</surname> <given-names>T. J.</given-names></name> <name><surname>Dowson</surname> <given-names>C. G.</given-names></name> <name><surname>Daniels</surname> <given-names>M.</given-names></name> <name><surname>Zhou</surname> <given-names>J.</given-names></name> <name><surname>Martin</surname> <given-names>C.</given-names></name> <name><surname>Spratt</surname> <given-names>B. G.</given-names></name> <name><surname>Musser</surname> <given-names>J. M.</given-names></name></person-group> (<year>1991</year>). <article-title>Horizontal transfer of multiple penicillin-binding protein genes, and capsular biosynthetic genes, in natural populations of <italic>Streptococcus pneumoniae</italic></article-title>. <source>Mol. Microbiol</source>. <volume>5</volume>, <fpage>2255</fpage>&#x02013;<lpage>2260</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-2958.1991.tb02155.x</pub-id><pub-id pub-id-type="pmid">1766389</pub-id></citation>
</ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Deurenberg</surname> <given-names>R. H.</given-names></name> <name><surname>Stobberingh</surname> <given-names>E. E.</given-names></name></person-group> (<year>2008</year>). <article-title>The evolution of <italic>Staphylococcus aureus</italic></article-title>. <source>Infect. Genet. Evol</source>. <volume>8</volume>, <fpage>747</fpage>&#x02013;<lpage>763</lpage>. <pub-id pub-id-type="doi">10.1016/j.meegid.2008.07.007</pub-id><pub-id pub-id-type="pmid">18718557</pub-id></citation>
</ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Donati</surname> <given-names>C.</given-names></name> <name><surname>Hiller</surname> <given-names>N. L.</given-names></name> <name><surname>Tettelin</surname> <given-names>H.</given-names></name> <name><surname>Muzzi</surname> <given-names>A.</given-names></name> <name><surname>Croucher</surname> <given-names>N. J.</given-names></name> <name><surname>Angiuoli</surname> <given-names>S. V.</given-names></name> <name><surname>Oggioni</surname> <given-names>M.</given-names></name> <name><surname>Dunning Hotopp</surname> <given-names>J. C.</given-names></name> <name><surname>Hu</surname> <given-names>F. Z.</given-names></name> <name><surname>Riley</surname> <given-names>D. R.</given-names></name> <name><surname>Covacci</surname> <given-names>A.</given-names></name> <name><surname>Mitchell</surname> <given-names>T. J.</given-names></name> <name><surname>Bentley</surname> <given-names>S. D.</given-names></name> <name><surname>Kilian</surname> <given-names>M.</given-names></name> <name><surname>Ehrlich</surname> <given-names>G. D.</given-names></name> <name><surname>Rappuoli</surname> <given-names>R.</given-names></name> <name><surname>Moxon</surname> <given-names>E. R.</given-names></name> <name><surname>Masignani</surname> <given-names>V.</given-names></name></person-group> (<year>2010</year>). <article-title>Structure and dynamics of the pan-genome of <italic>Streptococcus pneumoniae</italic> and closely related species</article-title>. <source>Genome Biol</source>. <volume>11</volume>, <fpage>R107</fpage>. <pub-id pub-id-type="doi">10.1186/gb-2010-11-10-r107</pub-id><pub-id pub-id-type="pmid">21034474</pub-id></citation>
</ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Doolittle</surname> <given-names>W. F.</given-names></name> <name><surname>Bapteste</surname> <given-names>E.</given-names></name></person-group> (<year>2007</year>). <article-title>Pattern pluralism and the Tree of Life hypothesis</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>104</volume>, <fpage>2043</fpage>&#x02013;<lpage>2049</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0610699104</pub-id><pub-id pub-id-type="pmid">17261804</pub-id></citation>
</ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Doolittle</surname> <given-names>W. F.</given-names></name> <name><surname>Papke</surname> <given-names>R. T.</given-names></name></person-group> (<year>2006</year>). <article-title>Genomics and the bacterial species problem</article-title>. <source>Genome Biol</source>. <volume>7</volume>, <fpage>116</fpage>. <pub-id pub-id-type="doi">10.1186/gb-2006-7-9-116</pub-id><pub-id pub-id-type="pmid">17020593</pub-id></citation>
</ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Garnier</surname> <given-names>T.</given-names></name> <name><surname>Eiglmeier</surname> <given-names>K.</given-names></name> <name><surname>Camus</surname> <given-names>J. C.</given-names></name> <name><surname>Medina</surname> <given-names>N.</given-names></name> <name><surname>Mansoor</surname> <given-names>H.</given-names></name> <name><surname>Pryor</surname> <given-names>M.</given-names></name> <name><surname>Duthoy</surname> <given-names>S.</given-names></name> <name><surname>Grondin</surname> <given-names>S.</given-names></name> <name><surname>Lacroix</surname> <given-names>C.</given-names></name> <name><surname>Monsempe</surname> <given-names>C.</given-names></name> <name><surname>Simon</surname> <given-names>S.</given-names></name> <name><surname>Harris</surname> <given-names>B.</given-names></name> <name><surname>Atkin</surname> <given-names>R.</given-names></name> <name><surname>Doggett</surname> <given-names>J.</given-names></name> <name><surname>Mayes</surname> <given-names>R.</given-names></name> <name><surname>Keating</surname> <given-names>L.</given-names></name> <name><surname>Wheeler</surname> <given-names>P. R.</given-names></name> <name><surname>Parkhill</surname> <given-names>J.</given-names></name> <name><surname>Barrell</surname> <given-names>B. G.</given-names></name> <name><surname>Cole</surname> <given-names>S. T.</given-names></name> <name><surname>Gordon</surname> <given-names>S. V.</given-names></name> <name><surname>Hewinson</surname> <given-names>R. G.</given-names></name></person-group> (<year>2003</year>). <article-title>The complete genome sequence of <italic>Mycobacterium bovis</italic></article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>100</volume>, <fpage>7877</fpage>&#x02013;<lpage>7882</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1130426100</pub-id><pub-id pub-id-type="pmid">12788972</pub-id></citation>
</ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Georgiades</surname> <given-names>K.</given-names></name> <name><surname>Raoult</surname> <given-names>D.</given-names></name></person-group> (<year>2010</year>). <article-title>Defining pathogenic bacterial species in the genomic era</article-title>. <source>Front. Microbiol</source>. <volume>1</volume>:<issue>151</issue>. <pub-id pub-id-type="doi">10.3389/fmicb.2010.00151</pub-id><pub-id pub-id-type="pmid">21687765</pub-id></citation>
</ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hill</surname> <given-names>K. K.</given-names></name> <name><surname>Smith</surname> <given-names>T. J.</given-names></name> <name><surname>Helma</surname> <given-names>C. H.</given-names></name> <name><surname>Ticknor</surname> <given-names>L. O.</given-names></name> <name><surname>Foley</surname> <given-names>B. T.</given-names></name> <name><surname>Svensson</surname> <given-names>R. T.</given-names></name> <name><surname>Brown</surname> <given-names>J. L.</given-names></name> <name><surname>Johnson</surname> <given-names>E. A.</given-names></name> <name><surname>Smith</surname> <given-names>L. A.</given-names></name> <name><surname>Okinaka</surname> <given-names>R. T.</given-names></name> <name><surname>Jackson</surname> <given-names>P. J.</given-names></name> <name><surname>Marks</surname> <given-names>J. D.</given-names></name></person-group> (<year>2007</year>). <article-title>Genetic diversity among Botulinum Neurotoxin-producing clostridial strains</article-title>. <source>J. Bacteriol</source>. <volume>189</volume>, <fpage>818</fpage>&#x02013;<lpage>832</lpage>. <pub-id pub-id-type="doi">10.1128/JB.01180-06</pub-id><pub-id pub-id-type="pmid">17114256</pub-id></citation>
</ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kolsto</surname> <given-names>A. B.</given-names></name> <name><surname>Tourasse</surname> <given-names>N. J.</given-names></name> <name><surname>Okstad</surname> <given-names>O. A.</given-names></name></person-group> (<year>2009</year>). <article-title>What sets <italic>Bacillus anthracis</italic> apart from other Bacillus species?</article-title> <source>Annu. Rev. Microbiol.</source> <volume>63</volume>, <fpage>451</fpage>&#x02013;<lpage>476</lpage>. <pub-id pub-id-type="doi">10.1146/annurev.micro.091208.073255</pub-id><pub-id pub-id-type="pmid">19514852</pub-id></citation>
</ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kung</surname> <given-names>V. L.</given-names></name> <name><surname>Ozer</surname> <given-names>E. A.</given-names></name> <name><surname>Hauser</surname> <given-names>A. R.</given-names></name></person-group> (<year>2010</year>). <article-title>The accessory genome of <italic>Pseudomonas aeruginosa</italic></article-title>. <source>Microbiol. Mol. Biol. Rev</source>. <volume>74</volume>, <fpage>621</fpage>&#x02013;<lpage>641</lpage>. <pub-id pub-id-type="doi">10.1128/MMBR.00027-10</pub-id><pub-id pub-id-type="pmid">21119020</pub-id></citation>
</ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lindsay</surname> <given-names>J. A.</given-names></name></person-group> (<year>2010</year>). <article-title>Genomic variation and evolution of <italic>Staphylococcus aureus</italic></article-title>. <source>Int. J. Med. Microbiol</source>. <volume>300</volume>, <fpage>98</fpage>&#x02013;<lpage>103</lpage>. <pub-id pub-id-type="doi">10.1016/j.ijmm.2009.08.013</pub-id><pub-id pub-id-type="pmid">19811948</pub-id></citation>
</ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Losada</surname> <given-names>L.</given-names></name> <name><surname>Ronning</surname> <given-names>C. M.</given-names></name> <name><surname>Deshazer</surname> <given-names>D.</given-names></name> <name><surname>Woods</surname> <given-names>D.</given-names></name> <name><surname>Fedorova</surname> <given-names>N.</given-names></name> <name><surname>Kim</surname> <given-names>H. S.</given-names></name> <name><surname>Shabalina</surname> <given-names>S. A.</given-names></name> <name><surname>Pearson</surname> <given-names>T. R.</given-names></name> <name><surname>Brinkac</surname> <given-names>L.</given-names></name> <name><surname>Tan</surname> <given-names>P.</given-names></name> <name><surname>Nandi</surname> <given-names>T.</given-names></name> <name><surname>Crabtree</surname> <given-names>J.</given-names></name> <name><surname>Badger</surname> <given-names>J.</given-names></name> <name><surname>Beckstrom-Sternberg</surname> <given-names>S.</given-names></name> <name><surname>Saqib</surname> <given-names>M.</given-names></name> <name><surname>Schutzer</surname> <given-names>S. E.</given-names></name> <name><surname>Keim</surname> <given-names>P.</given-names></name> <name><surname>Nierman</surname> <given-names>W. C.</given-names></name></person-group> (<year>2010</year>). <article-title>Continuing evolution of <italic>Burkholderia mallei</italic> through genome reduction and large-scale rearrangements</article-title>. <source>Genome Biol. Evol</source>. <volume>2</volume>, <fpage>102</fpage>&#x02013;<lpage>116</lpage>. <pub-id pub-id-type="doi">10.1093/gbe/evq003</pub-id><pub-id pub-id-type="pmid">20333227</pub-id></citation>
</ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mathee</surname> <given-names>K.</given-names></name> <name><surname>Narasimhan</surname> <given-names>G.</given-names></name> <name><surname>Valdes</surname> <given-names>C.</given-names></name> <name><surname>Qiu</surname> <given-names>X.</given-names></name> <name><surname>Matewish</surname> <given-names>J. M.</given-names></name> <name><surname>Koehrsen</surname> <given-names>M.</given-names></name> <name><surname>Rokas</surname> <given-names>A.</given-names></name> <name><surname>Yandava</surname> <given-names>C. N.</given-names></name> <name><surname>Engels</surname> <given-names>R.</given-names></name> <name><surname>Zeng</surname> <given-names>E.</given-names></name> <name><surname>Olavarietta</surname> <given-names>R.</given-names></name> <name><surname>Doud</surname> <given-names>M.</given-names></name> <name><surname>Smith</surname> <given-names>R. S.</given-names></name> <name><surname>Montgomery</surname> <given-names>P.</given-names></name> <name><surname>White</surname> <given-names>J. R.</given-names></name> <name><surname>Godfrey</surname> <given-names>P. A.</given-names></name> <name><surname>Kodira</surname> <given-names>C.</given-names></name> <name><surname>Birren</surname> <given-names>B.</given-names></name> <name><surname>Galagan</surname> <given-names>J. E.</given-names></name> <name><surname>Lory</surname> <given-names>S.</given-names></name></person-group> (<year>2008</year>). <article-title>Dynamics of <italic>Pseudomonas aeruginosa</italic> genome evolution</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>105</volume>, <fpage>3100</fpage>&#x02013;<lpage>3105</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0711982105</pub-id><pub-id pub-id-type="pmid">18287045</pub-id></citation>
</ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mellmann</surname> <given-names>A.</given-names></name> <name><surname>Harmsen</surname> <given-names>D.</given-names></name> <name><surname>Cummings</surname> <given-names>C. A.</given-names></name> <name><surname>Zentz</surname> <given-names>E. B.</given-names></name> <name><surname>Leopold</surname> <given-names>S. R.</given-names></name> <name><surname>Rico</surname> <given-names>A.</given-names></name> <name><surname>Prior</surname> <given-names>K.</given-names></name> <name><surname>Szczepanowski</surname> <given-names>R.</given-names></name> <name><surname>Ji</surname> <given-names>Y.</given-names></name> <name><surname>Zhang</surname> <given-names>W.</given-names></name> <name><surname>McLaughlin</surname> <given-names>S. F.</given-names></name> <name><surname>Henkhaus</surname> <given-names>J. K.</given-names></name> <name><surname>Leopold</surname> <given-names>B.</given-names></name> <name><surname>Bielaszewska</surname> <given-names>M.</given-names></name> <name><surname>Prager</surname> <given-names>R.</given-names></name> <name><surname>Brzoska</surname> <given-names>P. M.</given-names></name> <name><surname>Moore</surname> <given-names>R. L.</given-names></name> <name><surname>Guenther</surname> <given-names>S.</given-names></name> <name><surname>Rothberg</surname> <given-names>J. M.</given-names></name> <name><surname>Karch</surname> <given-names>H.</given-names></name></person-group> (<year>2011</year>). <article-title>Prospective genomic characterization of the German enterohemorrhagic <italic>Escherichia coli</italic> O104, H4 outbreak by rapid next generation sequencing technology</article-title>. <source>PLoS ONE</source> <volume>6</volume>:<fpage>e22751</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0022751</pub-id><pub-id pub-id-type="pmid">21799941</pub-id></citation>
</ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Merhej</surname> <given-names>V.</given-names></name> <name><surname>Royer-Carenzi</surname> <given-names>M.</given-names></name> <name><surname>Pontarotti</surname> <given-names>P.</given-names></name> <name><surname>Raoult</surname> <given-names>D.</given-names></name></person-group> (<year>2009</year>). <article-title>Massive comparative genomic analysis reveals convergent evolution of specialized bacteria</article-title>. <source>Biol. Direct</source> <volume>4</volume>, <fpage>13</fpage>. <pub-id pub-id-type="doi">10.1186/1745-6150-4-13</pub-id><pub-id pub-id-type="pmid">19361336</pub-id></citation>
</ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Metzker</surname> <given-names>M. L.</given-names></name></person-group> (<year>2010</year>). <article-title>Sequencing technologies&#x02013;the next generation</article-title>. <source>Nat. Rev. Genet</source>. <volume>11</volume>, <fpage>31</fpage>&#x02013;<lpage>46</lpage>. <pub-id pub-id-type="doi">10.1038/nrg2626</pub-id><pub-id pub-id-type="pmid">19997069</pub-id></citation>
</ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ochman</surname> <given-names>H.</given-names></name> <name><surname>Lawrence</surname> <given-names>J. G.</given-names></name> <name><surname>Groisman</surname> <given-names>E. A.</given-names></name></person-group> (<year>2000</year>). <article-title>Lateral gene transfer and the nature of bacterial innovation</article-title>. <source>Nature</source> <volume>405</volume>, <fpage>299</fpage>&#x02013;<lpage>304</lpage>. <pub-id pub-id-type="doi">10.1038/35012500</pub-id><pub-id pub-id-type="pmid">10830951</pub-id></citation>
</ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Raoult</surname> <given-names>D.</given-names></name></person-group> (<year>2010</year>). <article-title>The post-Darwinist rhizome of life</article-title>. <source>Lancet</source> <volume>375</volume>, <fpage>104</fpage>&#x02013;<lpage>105</lpage>. <pub-id pub-id-type="doi">10.1016/S0140-6736(09)61958-9</pub-id><pub-id pub-id-type="pmid">20109873</pub-id></citation>
</ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rasko</surname> <given-names>D. A.</given-names></name> <name><surname>Altherr</surname> <given-names>M. R.</given-names></name> <name><surname>Han</surname> <given-names>C. S.</given-names></name> <name><surname>Ravel</surname> <given-names>J.</given-names></name></person-group> (<year>2005</year>). <article-title>Genomics of the <italic>Bacillus cereus</italic> group of organisms</article-title>. <source>FEMS Microbiol. Rev</source>. <volume>29</volume>, <fpage>303</fpage>&#x02013;<lpage>329</lpage>. <pub-id pub-id-type="doi">10.1016/j.femsre.2004.12.005</pub-id><pub-id pub-id-type="pmid">15808746</pub-id></citation>
</ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rasko</surname> <given-names>D. A.</given-names></name> <name><surname>Webster</surname> <given-names>D. R.</given-names></name> <name><surname>Sahl</surname> <given-names>J. W.</given-names></name> <name><surname>Bashir</surname> <given-names>A.</given-names></name> <name><surname>Boisen</surname> <given-names>N.</given-names></name> <name><surname>Scheutz</surname> <given-names>F.</given-names></name> <name><surname>Paxinos</surname> <given-names>E. E.</given-names></name> <name><surname>Sebra</surname> <given-names>R.</given-names></name> <name><surname>Chin</surname> <given-names>C. S.</given-names></name> <name><surname>Iliopoulos</surname> <given-names>D.</given-names></name> <name><surname>Klammer</surname> <given-names>A.</given-names></name> <name><surname>Peluso</surname> <given-names>P.</given-names></name> <name><surname>Lee</surname> <given-names>L.</given-names></name> <name><surname>Kislyuk</surname> <given-names>A. O.</given-names></name> <name><surname>Bullard</surname> <given-names>J.</given-names></name> <name><surname>Kasarskis</surname> <given-names>A.</given-names></name> <name><surname>Wang</surname> <given-names>S.</given-names></name> <name><surname>Eid</surname> <given-names>J.</given-names></name> <name><surname>Rank</surname> <given-names>D.</given-names></name> <name><surname>Redman</surname> <given-names>J. C.</given-names></name> <name><surname>Steyert</surname> <given-names>S. R.</given-names></name> <name><surname>Frimodt-Moller</surname> <given-names>J.</given-names></name> <name><surname>Struve</surname> <given-names>C.</given-names></name> <name><surname>Petersen</surname> <given-names>A. M.</given-names></name> <name><surname>Krogfelt</surname> <given-names>K. A.</given-names></name> <name><surname>Nataro</surname> <given-names>J. P.</given-names></name> <name><surname>Schadt</surname> <given-names>E. E.</given-names></name> <name><surname>Waldor</surname> <given-names>M. K.</given-names></name></person-group> (<year>2011</year>). <article-title>Origins of the <italic>E. coli</italic> strain causing an outbreak of hemolytic-uremic syndrome in Germany</article-title>. <source>N. Engl. J. Med</source>. <volume>365</volume>, <fpage>709</fpage>&#x02013;<lpage>717</lpage>. <pub-id pub-id-type="doi">10.1056/NEJMoa1106920</pub-id><pub-id pub-id-type="pmid">21793740</pub-id></citation>
</ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rohde</surname> <given-names>H.</given-names></name> <name><surname>Qin</surname> <given-names>J.</given-names></name> <name><surname>Cui</surname> <given-names>Y.</given-names></name> <name><surname>Li</surname> <given-names>D.</given-names></name> <name><surname>Loman</surname> <given-names>N. J.</given-names></name> <name><surname>Hentschke</surname> <given-names>M.</given-names></name> <name><surname>Chen</surname> <given-names>W.</given-names></name> <name><surname>Pu</surname> <given-names>F.</given-names></name> <name><surname>Peng</surname> <given-names>Y.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Xi</surname> <given-names>F.</given-names></name> <name><surname>Li</surname> <given-names>S.</given-names></name> <name><surname>Li</surname> <given-names>Y.</given-names></name> <name><surname>Zhang</surname> <given-names>Z.</given-names></name> <name><surname>Yang</surname> <given-names>X.</given-names></name> <name><surname>Zhao</surname> <given-names>M.</given-names></name> <name><surname>Wang</surname> <given-names>P.</given-names></name> <name><surname>Guan</surname> <given-names>Y.</given-names></name> <name><surname>Cen</surname> <given-names>Z.</given-names></name> <name><surname>Zhao</surname> <given-names>X.</given-names></name> <name><surname>Christner</surname> <given-names>M.</given-names></name> <name><surname>Kobbe</surname> <given-names>R.</given-names></name> <name><surname>Loos</surname> <given-names>S.</given-names></name> <name><surname>Oh</surname> <given-names>J.</given-names></name> <name><surname>Yang</surname> <given-names>L.</given-names></name> <name><surname>Danchin</surname> <given-names>A.</given-names></name> <name><surname>Gao</surname> <given-names>G. F.</given-names></name> <name><surname>Song</surname> <given-names>Y.</given-names></name> <name><surname>Yang</surname> <given-names>H.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name> <name><surname>Xu</surname> <given-names>J.</given-names></name> <name><surname>Pallen</surname> <given-names>M. J.</given-names></name> <name><surname>Aepfelbacher</surname> <given-names>M.</given-names></name> <name><surname>Yang</surname> <given-names>R.</given-names></name></person-group> (<year>2011</year>). <article-title>Open-source genomic analysis of Shiga-toxin-producing <italic>E. coli</italic> O104, H4</article-title>. <source>N. Engl. J. Med</source>. <volume>365</volume>, <fpage>718</fpage>&#x02013;<lpage>724</lpage>. <pub-id pub-id-type="doi">10.1056/NEJMoa1107643</pub-id><pub-id pub-id-type="pmid">21793736</pub-id></citation>
</ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rokas</surname> <given-names>A.</given-names></name> <name><surname>Williams</surname> <given-names>B. L.</given-names></name> <name><surname>King</surname> <given-names>N.</given-names></name> <name><surname>Carroll</surname> <given-names>S. B.</given-names></name></person-group> (<year>2003</year>). <article-title>Genome-scale approaches to resolving incongruence in molecular phylogenies</article-title>. <source>Nature</source> <volume>425</volume>, <fpage>798</fpage>&#x02013;<lpage>804</lpage>. <pub-id pub-id-type="doi">10.1038/nature02053</pub-id><pub-id pub-id-type="pmid">14574403</pub-id></citation>
</ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shapiro</surname> <given-names>B. J.</given-names></name> <name><surname>Friedman</surname> <given-names>J.</given-names></name> <name><surname>Cordero</surname> <given-names>O. X.</given-names></name> <name><surname>Preheim</surname> <given-names>S. P.</given-names></name> <name><surname>Timberlake</surname> <given-names>S. C.</given-names></name> <name><surname>Szabo</surname> <given-names>G.</given-names></name> <name><surname>Polz</surname> <given-names>M. F.</given-names></name> <name><surname>Alm</surname> <given-names>E. J.</given-names></name></person-group> (<year>2012</year>). <article-title>Population genomics of early events in the ecological differentiation of bacteria</article-title>. <source>Science</source> <volume>336</volume>, <fpage>48</fpage>&#x02013;<lpage>51</lpage>. <pub-id pub-id-type="doi">10.1126/science.1218198</pub-id><pub-id pub-id-type="pmid">22491847</pub-id></citation>
</ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sim</surname> <given-names>S. H.</given-names></name> <name><surname>Yu</surname> <given-names>Y.</given-names></name> <name><surname>Lin</surname> <given-names>C. H.</given-names></name> <name><surname>Karuturi</surname> <given-names>R. K.</given-names></name> <name><surname>Wuthiekanun</surname> <given-names>V.</given-names></name> <name><surname>Tuanyok</surname> <given-names>A.</given-names></name> <name><surname>Chua</surname> <given-names>H. H.</given-names></name> <name><surname>Ong</surname> <given-names>C.</given-names></name> <name><surname>Paramalingam</surname> <given-names>S. S.</given-names></name> <name><surname>Tan</surname> <given-names>G.</given-names></name> <name><surname>Tang</surname> <given-names>L.</given-names></name> <name><surname>Lau</surname> <given-names>G.</given-names></name> <name><surname>Ooi</surname> <given-names>E. E.</given-names></name> <name><surname>Woods</surname> <given-names>D.</given-names></name> <name><surname>Feil</surname> <given-names>E.</given-names></name> <name><surname>Peacock</surname> <given-names>S. J.</given-names></name> <name><surname>Tan</surname> <given-names>P.</given-names></name></person-group> (<year>2008</year>). <article-title>The core and accessory genomes of <italic>Burkholderia pseudomallei:</italic> implications for human melioidosis</article-title>. <source>PLoS Pathog</source>. <volume>4</volume>:<fpage>e1000178</fpage>. <pub-id pub-id-type="doi">10.1371/journal.ppat.1000178</pub-id><pub-id pub-id-type="pmid">18927621</pub-id></citation>
</ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Skarin</surname> <given-names>H.</given-names></name> <name><surname>Hafstrom</surname> <given-names>T.</given-names></name> <name><surname>Westerberg</surname> <given-names>J.</given-names></name> <name><surname>Segerman</surname> <given-names>B.</given-names></name></person-group> (<year>2011</year>). <article-title><italic>Clostridium botulinum</italic> group III: a group with dual identity shaped by plasmids, phages and mobile elements</article-title>. <source>BMC Genomics</source> <volume>12</volume>, <fpage>185</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-12-185</pub-id><pub-id pub-id-type="pmid">21486474</pub-id></citation>
</ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Skarin</surname> <given-names>H.</given-names></name> <name><surname>Segerman</surname> <given-names>B.</given-names></name></person-group> (<year>2011</year>). <article-title>Horizontal gene transfer of toxin genes in <italic>Clostridium botulinum</italic>: involvement of mobile elements and plasmids</article-title>. <source>Mob. Genet. Elements</source> <volume>1</volume>, <fpage>213</fpage>&#x02013;<lpage>215</lpage>. <pub-id pub-id-type="doi">10.4161/mge.1.3.17617</pub-id><pub-id pub-id-type="pmid">22479689</pub-id></citation>
</ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tettelin</surname> <given-names>H.</given-names></name> <name><surname>Masignani</surname> <given-names>V.</given-names></name> <name><surname>Cieslewicz</surname> <given-names>M. J.</given-names></name> <name><surname>Donati</surname> <given-names>C.</given-names></name> <name><surname>Medini</surname> <given-names>D.</given-names></name> <name><surname>Ward</surname> <given-names>N. L.</given-names></name> <name><surname>Angiuoli</surname> <given-names>S. V.</given-names></name> <name><surname>Crabtree</surname> <given-names>J.</given-names></name> <name><surname>Jones</surname> <given-names>A. L.</given-names></name> <name><surname>Durkin</surname> <given-names>A. S.</given-names></name> <name><surname>Deboy</surname> <given-names>R. T.</given-names></name> <name><surname>Davidsen</surname> <given-names>T. M.</given-names></name> <name><surname>Mora</surname> <given-names>M.</given-names></name> <name><surname>Scarselli</surname> <given-names>M.</given-names></name> <name><surname>Margarit y Ros</surname> <given-names>I.</given-names></name> <name><surname>Peterson</surname> <given-names>J. D.</given-names></name> <name><surname>Hauser</surname> <given-names>C. R.</given-names></name> <name><surname>Sundaram</surname> <given-names>J. P.</given-names></name> <name><surname>Nelson</surname> <given-names>W. C.</given-names></name> <name><surname>Madupu</surname> <given-names>R.</given-names></name> <name><surname>Brinkac</surname> <given-names>L. M.</given-names></name> <name><surname>Dodson</surname> <given-names>R. J.</given-names></name> <name><surname>Rosovitz</surname> <given-names>M. J.</given-names></name> <name><surname>Sullivan</surname> <given-names>S. A.</given-names></name> <name><surname>Daugherty</surname> <given-names>S. C.</given-names></name> <name><surname>Haft</surname> <given-names>D. H.</given-names></name> <name><surname>Selengut</surname> <given-names>J.</given-names></name> <name><surname>Gwinn</surname> <given-names>M. L.</given-names></name> <name><surname>Zhou</surname> <given-names>L.</given-names></name> <name><surname>Zafar</surname> <given-names>N.</given-names></name> <name><surname>Khouri</surname> <given-names>H.</given-names></name> <name><surname>Radune</surname> <given-names>D.</given-names></name> <name><surname>Dimitrov</surname> <given-names>G.</given-names></name> <name><surname>Watkins</surname> <given-names>K.</given-names></name> <name><surname>O&#x00027;connor</surname> <given-names>K. J.</given-names></name> <name><surname>Smith</surname> <given-names>S.</given-names></name> <name><surname>Utterback</surname> <given-names>T. R.</given-names></name> <name><surname>White</surname> <given-names>O.</given-names></name> <name><surname>Rubens</surname> <given-names>C. E.</given-names></name> <name><surname>Grandi</surname> <given-names>G.</given-names></name> <name><surname>Madoff</surname> <given-names>L. C.</given-names></name> <name><surname>Kasper</surname> <given-names>D. L.</given-names></name> <name><surname>Telford</surname> <given-names>J. L.</given-names></name> <name><surname>Wessels</surname> <given-names>M. R.</given-names></name> <name><surname>Rappuoli</surname> <given-names>R.</given-names></name> <name><surname>Fraser</surname> <given-names>C. M.</given-names></name></person-group> (<year>2005</year>). <article-title>Genome analysis of multiple pathogenic isolates of <italic>Streptococcus agalactiae</italic>: implications for the microbial &#x0201C;pan-genome.&#x0201D;</article-title> <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>102</volume>, <fpage>13950</fpage>&#x02013;<lpage>13955</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0506758102</pub-id><pub-id pub-id-type="pmid">16172379</pub-id></citation>
</ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>G.</given-names></name> <name><surname>Humayun</surname> <given-names>M. Z.</given-names></name> <name><surname>Taylor</surname> <given-names>D. E.</given-names></name></person-group> (<year>1999</year>). <article-title>Mutation as an origin of genetic variability in <italic>Helicobacter pylori</italic></article-title>. <source>Trends Microbiol</source>. <volume>7</volume>, <fpage>488</fpage>&#x02013;<lpage>493</lpage>. <pub-id pub-id-type="pmid">10603484</pub-id></citation>
</ref>
</ref-list>
</back>
</article>