<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Plant Sci.</journal-id>
<journal-title>Frontiers in Plant Science</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Plant Sci.</abbrev-journal-title>
<issn pub-type="epub">1664-462X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpls.2014.00406</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Plant Science</subject>
<subj-group>
<subject>Original Research Article</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Use of whole genome sequences to develop a molecular phylogenetic framework for <italic>Rhodococcus fascians</italic> and the <italic>Rhodococcus</italic> genus</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Creason</surname> <given-names>Allison L.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="author-notes" rid="fn003"><sup>&#x02020;</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Davis</surname> <given-names>Edward W.</given-names> <suffix>II</suffix></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="author-notes" rid="fn003"><sup>&#x02020;</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/176895"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Putnam</surname> <given-names>Melodie L.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Vandeputte</surname> <given-names>Olivier M.</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/176995"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Chang</surname> <given-names>Jeff H.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
<xref ref-type="author-notes" rid="fn001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/113606"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Botany and Plant Pathology, Oregon State University</institution> <country>Corvallis, OR, USA</country></aff>
<aff id="aff2"><sup>2</sup><institution>Molecular and Cellular Biology Program, Oregon State University</institution> <country>Corvallis, OR, USA</country></aff>
<aff id="aff3"><sup>3</sup><institution>Laboratoire de Biotechnologie Vegetale, Universite Libre de Bruxelles</institution> <country>Gosselies, Belgium</country></aff>
<aff id="aff4"><sup>4</sup><institution>Center for Genome Research and Biocomputing, Oregon State University</institution> <country>Corvallis, OR, USA</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Gail Preston, University of Oxford, UK</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: David John Studholme, University of Exeter, UK; Leighton Pritchard, James Hutton Institute, UK</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Jeff H. Chang, Department of Botany and Plant Pathology, Oregon State University, 2082 Cordley Hall, Corvallis, OR 97331, USA e-mail: <email>changj&#x00040;science.oregonstate.edu</email></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Plant-Microbe Interaction, a section of the journal Frontiers in Plant Science.</p></fn>
<fn fn-type="present-address" id="fn003"><p>&#x02020;These authors have contributed equally to this work.</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>19</day>
<month>08</month>
<year>2014</year>
</pub-date>
<pub-date pub-type="collection">
<year>2014</year>
</pub-date>
<volume>5</volume>
<elocation-id>406</elocation-id>
<history>
<date date-type="received">
<day>20</day>
<month>06</month>
<year>2014</year>
</date>
<date date-type="accepted">
<day>30</day>
<month>07</month>
<year>2014</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2014 Creason, Davis, Putnam, Vandeputte and Chang.</copyright-statement>
<copyright-year>2014</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract><p>The accurate diagnosis of diseases caused by pathogenic bacteria requires a stable species classification. <italic>Rhodococcus fascians</italic> is the only documented member of its ill-defined genus that is capable of causing disease on a wide range of agriculturally important plants. Comparisons of genome sequences generated from isolates of <italic>Rhodococcus</italic> associated with diseased plants revealed a level of genetic diversity consistent with them representing multiple species. To test this, we generated a tree based on more than 1700 homologous sequences from plant-associated isolates of <italic>Rhodococcus</italic>, and obtained support from additional approaches that measure and cluster based on genome similarities. Results were consistent in supporting the definition of new <italic>Rhodococcus</italic> species within clades containing phytopathogenic members. We also used the genome sequences, along with other rhodococcal genome sequences to construct a molecular phylogenetic tree as a framework for resolving the <italic>Rhodococcus</italic> genus. Results indicated that <italic>Rhodococcus</italic> has the potential for having 20 species and also confirmed a need to revisit the taxonomic groupings within <italic>Rhodococcus</italic>.</p></abstract>
<kwd-group>
<kwd>Gram-positive</kwd>
<kwd>plant pathogen</kwd>
<kwd>prokaryotic taxonomy</kwd>
<kwd>leafy gall</kwd>
<kwd>average nucleotide identity</kwd>
</kwd-group>
<counts>
<fig-count count="5"/>
<table-count count="1"/>
<equation-count count="0"/>
<ref-count count="59"/>
<page-count count="12"/>
<word-count count="9087"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="introduction" id="s1">
<title>Introduction</title>
<p>Defining bacteria into stable and coherent genetically similar species has many practical implications. However, multiple factors including effective population size, horizontal gene transfer and bacterial recombination, and their barriers, affect cohesiveness of different groups of bacteria to varying degrees (Doolittle and Zhaxybayeva, <xref ref-type="bibr" rid="B11">2009</xref>). As a consequence, a unifying concept for bacterial species has yet to be adopted, which has made it difficult to develop criteria and thresholds that can be generally applied for defining bacterial species.</p>
<p>Traditional polyphasic approaches define bacterial species as a monophyletic group with at least one discriminative phenotypic trait. Though pragmatic and widely adopted, the traditional approaches are weighted toward phenotypic traits and cannot keep pace with the rate in which new genotypes are being discovered and sequenced. With major advances in contemporary methods in sequencing, operational criteria based on whole genome sequences have been developed and adopted to assist in resolving bacterial phylogeny (Konstantinidis et al., <xref ref-type="bibr" rid="B26">2006</xref>). Multi-locus sequence analysis (MLSA) and trees based on whole genome sequences are powerful methods for inferring evolutionary relationships (Staley, <xref ref-type="bibr" rid="B52">2009</xref>). Alternative criteria based on the degree of similarities in genome signatures have also been developed (Konstantinidis et al., <xref ref-type="bibr" rid="B26">2006</xref>; Bohlin et al., <xref ref-type="bibr" rid="B3">2008</xref>; Richter and Rossell&#x000F3;-M&#x000F3;ra, <xref ref-type="bibr" rid="B43">2009</xref>). Average nucleotide identity (ANI), for example, is a simple measure of genetic relatedness based on sequences conserved among compared genomes and has gained acceptance as a method for defining bacterial species (Konstantinidis et al., <xref ref-type="bibr" rid="B26">2006</xref>; Chan et al., <xref ref-type="bibr" rid="B6">2012</xref>; Kim et al., <xref ref-type="bibr" rid="B24">2014</xref>). ANI has also been developed as a method for codifying bacteria based on genome similarity (Marakeby et al., <xref ref-type="bibr" rid="B35">2014</xref>).</p>
<p>Genome-enabled comparisons and the recognition of environmental niches in structuring gene flow have revealed a diversity of population structures for groups of plant-associated bacteria. <italic>Pseudomonas fluorescens</italic>, for example, occupies multiple niches and has a level of heterogeneity consistent with limited gene flow between sub-clades that challenge their taxonomy (Loper et al., <xref ref-type="bibr" rid="B34">2012</xref>). Likewise, a change to the taxonomy of <italic>Agrobacterium tumefaciens</italic> has been proposed to reflect genome-enabled discovery of clade-specific traits (Lassalle et al., <xref ref-type="bibr" rid="B30">2011</xref>). The ANI method has been used to assign newly discovered isolates to known plant-associated species and discover new species of plant pathogens (Dudnik et al., <xref ref-type="bibr" rid="B12">2014</xref>; Dur&#x000E1;n et al., <xref ref-type="bibr" rid="B13">2014</xref>; van der Wolf et al., <xref ref-type="bibr" rid="B56">2014</xref>).</p>
<p>The Gram-positive <italic>Rhodococcus</italic> genus is a member of the <italic>Nocardiaceae</italic> family and forms a distinct group with 30 valid species published (Jones and Goodfellow, <xref ref-type="bibr" rid="B21">2012</xref>). The genus has diverse members that inhabit a wide range of terrestrial as well as aquatic habitats and are renowned for their catabolic functions and ability to degrade a large number of organic compounds (Larkin et al., <xref ref-type="bibr" rid="B29">2005</xref>). Additionally, members of <italic>Rhodococcus</italic> have been recovered from extreme environments such as the deep-sea, oil-contaminated soils, and freeze-thaw tundra on glacial margins (Sheng et al., <xref ref-type="bibr" rid="B49">2011</xref>; Shevtsov et al., <xref ref-type="bibr" rid="B50">2013</xref>; Konishi et al., <xref ref-type="bibr" rid="B25">2014</xref>). Because of their biotechnological applications and potential in bioremediation, there has been a dramatic increase in the number of sequenced <italic>Rhodococcus</italic> genomes. Their genomes are high in GC content and range in size from 4.3 megabases (Mb) to over 10 Mb. Most genomes are larger than 5 Mb and their large sizes have been attributed to both horizontal gene transfer and gene duplication (Letek et al., <xref ref-type="bibr" rid="B31">2010</xref>). Partly due to the historical reliance on phenotypic traits and use of 16S rDNA sequence information, the <italic>Rhodococcus</italic> phylogeny still remains poorly resolved (G&#x000FC;rtler et al., <xref ref-type="bibr" rid="B17">2004</xref>).</p>
<p>To date, <italic>Rhodococcus fascians</italic> and <italic>Rhodococcus equi</italic> are the only two members of the genus that are well documented as being pathogenic (von Bargen and Haas, <xref ref-type="bibr" rid="B58">2009</xref>; Stes et al., <xref ref-type="bibr" rid="B54">2011</xref>). <italic>R. fascians</italic> can infect a broad range of plants. After breaching the plant cuticle, the pathogen collapses the epidermal layer, and forms ingression sites beneath epiphytic colonies (Cornelis et al., <xref ref-type="bibr" rid="B7">2001</xref>). <italic>R. fascians</italic> then grows inside the host tissue and provokes cell differentiation and <italic>de novo</italic> organogenesis, resulting in proliferations and abnormal growths called witches&#x00027; brooms or leafy galls (Putnam and Miller, <xref ref-type="bibr" rid="B41">2007</xref>). To gain insights into the mechanisms and evolution of virulence, we determined the genome sequences for 20 isolates of <italic>Rhodococcus</italic> (Creason et al., <xref ref-type="bibr" rid="B8">2014</xref>). Like <italic>R. equi</italic>, <italic>R. fascians</italic> has few horizontally-acquired virulence genes, which are predicted to be augmented by co-option of core genes, that contribute to the ability of the bacterium to infect and cause disease (Crespi et al., <xref ref-type="bibr" rid="B9">1992</xref>; Letek et al., <xref ref-type="bibr" rid="B32">2008</xref>, <xref ref-type="bibr" rid="B31">2010</xref>; Creason et al., <xref ref-type="bibr" rid="B8">2014</xref>). Because of this mechanism of virulence evolution, phytopathogenicity is not expected to be a distinguishing trait suitable for classifying these <italic>Rhodococcus</italic> isolates.</p>
<p>In this study, we tested the hypothesis that leafy gall disease is caused by members of multiple species of <italic>Rhodococcus</italic>. Results from four independent methods were consistent and supported the hypothesis. Analysis of the 20 genome sequences showed the isolates formed two well supported clades, with one consisting of 16 isolates and having complex substructure indicative of multiple species. Analysis of the <italic>Rhodococcus</italic> genus associated four isolates collected from extreme environments or found in association with healthy plants to the two clades of <italic>Rhodococcus</italic> with plant-pathogenic members. Lastly, the need for revision of taxonomic grouping in <italic>Rhodococcus</italic> is suggested, as determined based on ANI distances calculated for all possible pairwise comparisons between members with available genome sequences.</p>
</sec>
<sec sec-type="materials and methods" id="s2">
<title>Materials and methods</title>
<sec>
<title>Isolation of phytopathogenic <italic>Rhodococcus</italic></title>
<p>Symptomatic tissue of <italic>Leucanthemum</italic> &#x000D7; superbum &#x0201C;Becky,&#x0201D; received by the Oregon State University Plant Clinic, was washed, macerated in sterile saline, and incubated at room temperature for 30 min. <italic>Rhodococcus</italic> cells were selected for by culturing on semi-selective D2 media (Kado and Heskett, <xref ref-type="bibr" rid="B22">1970</xref>). Isolate A22b was selected and verified as phytopathogenic based on its ability to cause leafy gall disease on pea seedlings and positive amplification for the <italic>fasA</italic> gene.</p>
</sec>
<sec>
<title>Nucleic acid preparations</title>
<p>A22b was grown in LB at 28&#x000B0;C with shaking (Bertani, <xref ref-type="bibr" rid="B2">1951</xref>). Genomic DNA from A22b was extracted from cells grown directly from stocks. The Wizard Genomic DNA Purification Kit was used, according to the instructions of the manufacturer, to extract genomic DNA (Promega Corporation, Madison, WI, USA).</p>
</sec>
<sec>
<title>Next-generation sequencing, assembly, and annotation</title>
<p>Library construction and sequencing on an Illumina MiSeq were done in the Center for Genome Research and Biocomputing at Oregon State University. The A22b genome was assembled using Velvet (v1.2.08), with a hash length of 125 (Zerbino and Birney, <xref ref-type="bibr" rid="B59">2008</xref>). The insert size was determined based on the estimated fragment size of the library preparation. Multiple assemblies were done, in which coverage cutoff, expected coverage, and hash length parameters were changed (Creason et al., <xref ref-type="bibr" rid="B8">2014</xref>). The highest quality assembly was identified based on the number of contigs and having a sum total size between 5 and 6 Mb. Contigs were reordered using the genome sequence of <italic>R. fascians</italic> A44a as a reference and the Mauve Contig Mover (Rissman et al., <xref ref-type="bibr" rid="B44">2009</xref>). The genome was annotated using Prokka (Seemann, <xref ref-type="bibr" rid="B46">2014</xref>). As part of the Prokka pipeline, CDSs were annotated in part, based on BLASTP analysis and a database of genomes core to the <italic>Rhodococcus</italic> genus, including whole-genome assemblies from <italic>Rhodococcus jostii</italic> RHA1, <italic>Rhodococcus opacus</italic> B4, <italic>Rhodococcus erythropolis</italic> PR4, <italic>R. equi</italic> 103S, and the <italic>R. fascians</italic> linear plasmid, pFiD188 (Na et al., <xref ref-type="bibr" rid="B38">2005</xref>; McLeod et al., <xref ref-type="bibr" rid="B36">2006</xref>; Sekine et al., <xref ref-type="bibr" rid="B47">2006</xref>; Letek et al., <xref ref-type="bibr" rid="B31">2010</xref>; Francis et al., <xref ref-type="bibr" rid="B14">2012</xref>). The whole genome shotgun project for A22b has been deposited at DDJB/EMBL/GenBank under the accession JOKB00000000 (BioProject PRJNA252927, BioSample SAMN02864791). The version described in this paper is version JOKB01000000. The A22b short reads and annotated genome sequences are available for download (SRS641819, <ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.7267/N9PN93H8">http://dx.doi.org/10.7267/N9PN93H8</ext-link>). In order to be consistent, publicly available wgs sequences used in this study were similarly annotated. Their genome annotations are available upon request.</p>
</sec>
<sec>
<title>Phylogenetic analyses</title>
<p>We used Hal (&#x02212;a muscle and &#x02212;y 100 settings) to construct the multi-gene tree of the 20 isolates and the <italic>Nocardia farcinica</italic> type strain as the outgroup (Ishikawa et al., <xref ref-type="bibr" rid="B20">2004</xref>; Robbertse et al., <xref ref-type="bibr" rid="B45">2011</xref>).</p>
<p>Sequences for the maximum-likelihood MLSA tree were gathered from the NCBI nt and wgs databases, using FtsY, InfB, RpoB, RsmA, SecY, TsaD, and YchF from <italic>Rhodococcus jostii</italic> RHA1 and <italic>Bifidobacterium longum</italic> subsp. <italic>infantis</italic> ATCC 15697 as queries in TBLASTN&#x0002B; (v2.2.29) searches (with default settings; Ad&#x000E9;kambi et al., <xref ref-type="bibr" rid="B1">2011</xref>). The query sequences were selected to provide coverage of the Actinobacteria phylum. Duplicate results from the two TBLASTN&#x0002B; (v2.2.29) results, and strains in which all seven translated sequences were not detected, were filtered out. A total of 1316 strains passed filter.</p>
<p>The filtered sequences were aligned using the L-INS-i algorithm in MAFFT (v7.149b) with the &#x02013;legacygappenalty flag. Gblocks (v0.91b) was used to trim the alignments prior to concatenation with half gapped positions allowed (&#x02212;b5 &#x0003D; h setting; Castresana, <xref ref-type="bibr" rid="B5">2000</xref>). Concatenated sequences with 100% identity, excluding those in the <italic>Rhodococcus</italic> genus, were collapsed into one entry, resulting in 961 sequences as input for tree generation. The most appropriate models of substitution for each gene were selected using the ProteinModelSelection.pl script provided with RAxML (v8; Katoh and Standley, <xref ref-type="bibr" rid="B23">2013</xref>; Stamatakis, <xref ref-type="bibr" rid="B53">2014</xref>). Trees were generated using RAxML (v8), based on the guidelines provided in the users manual (Stamatakis, <xref ref-type="bibr" rid="B53">2014</xref>). Briefly, five starting parsimony trees were generated using the &#x02212;y option; fixed initial arrangements were run on the five trees separately with the &#x02212;i 10 setting. Automatic initial arrangements were also run on the five trees. The best log likelihood scores were used to choose the proper initial arrangement setting for further tree generation (&#x02212;i 10 was the best for the dataset). A total of 500 rapid bootstraps (&#x02212;x setting with &#x02212;N 500) were performed on this dataset, and 10 distinct (&#x02212;f d setting with &#x02212;N 10) trees were generated. Bootstrap values were mapped on the best of the 10 distinct trees using the &#x02212;f b setting. See Supplemental Data <xref ref-type="supplementary-material" rid="SM2">1</xref> for the full tree, accession values, and duplicated sequences that were removed.</p>
<p>Alignments were visualized using Belvu (Sonnhammer and Hollich, <xref ref-type="bibr" rid="B51">2005</xref>). Images were generated using the iTOL (Letunic and Bork, <xref ref-type="bibr" rid="B33">2011</xref>).</p>
</sec>
<sec>
<title>Bioinformatic analyses</title>
<p>The progressiveMauve (v2.3.1) alignment was produced using default settings and as input, the chromosomal sequences for isolates D188, A21d2, 05-339-1, and A44a (Darling et al., <xref ref-type="bibr" rid="B10">2010</xref>).</p>
<p>JSpecies with whole genome FASTA sequences as input, was used to calculate average nucleotide identities (BLAST; ANIb) and do pairwise comparisons of tetranucleotide frequencies (TETRA; Richter and Rossell&#x000F3;-M&#x000F3;ra, <xref ref-type="bibr" rid="B43">2009</xref>). Codon usage tables were constructed using EMBOSS cusp and sum difference statistics were calculated using EMBOSS codcmp (default settings; <ext-link ext-link-type="uri" xlink:href="http://emboss.sourceforge.net/apps/cvs/emboss/apps/cusp.html">http://emboss.sourceforge.net/apps/cvs/emboss/apps/cusp.html</ext-link>). Reciprocal best BLASTP analysis was done according to methods previously reported (Creason et al., <xref ref-type="bibr" rid="B8">2014</xref>).</p>
<p>The ANI values used to generate the distance dendrogram were calculated using published methods (Konstantinidis and Tiedje, <xref ref-type="bibr" rid="B27">2005</xref>). The following were automated using <italic>ad-hoc</italic> scripts. Genome sequences were split into 1020-nucleotide long segments. The genome segments were used as queries in BLASTN&#x0002B; (v2.2.27) searches against all other complete genomes in an all-by-all pairwise analysis. BLASTN&#x0002B; (v2.2.27) was used, with the extra settings, &#x0201C;blastn &#x02013;task blastn &#x02013;dust no &#x02013;xdrop_gap 150 &#x02013;penalty &#x02212;1 &#x02013;reward 1 &#x02013;gapopen 5 &#x02013;gapextend 2,&#x0201D; for the searches (Camacho et al., <xref ref-type="bibr" rid="B4">2009</xref>). Sequences with less than 70% coverage and 30% identity were filtered out, the number of results above the cutoffs were counted, and the average nucleotide identity of the resulting sequences were calculated. Results were comparable to those calculated using jSpecies and were better at handling the larger number of samples (Richter and Rossell&#x000F3;-M&#x000F3;ra, <xref ref-type="bibr" rid="B43">2009</xref>).</p>
<p>The distance dendrogram was generated using the all-by-all pairwise ANI divergence values as input, which is defined as 100%&#x02014;ANI (Chan et al., <xref ref-type="bibr" rid="B6">2012</xref>). The hcluster Python package was installed along with all dependencies (<ext-link ext-link-type="uri" xlink:href="http://scipy-cluster.googlecode.com/">http://scipy-cluster.googlecode.com/</ext-link>). IPython, in interactive mode, was used to generate the dendogram (Perez and Granger, <xref ref-type="bibr" rid="B39">2007</xref>). The matplotlib library was also required (ipython &#x02013;matplotlib; Hunter, <xref ref-type="bibr" rid="B19">2007</xref>; <ext-link ext-link-type="uri" xlink:href="http://matplotlib.org">http://matplotlib.org</ext-link>). The pdist() function from hcluster was used to calculate the Euclidean distance between the ANI divergence values, and the complete linkage on the distance matrix was calculated using the complete() function. The dendrogram was generated using the dendrogram() function of hcluster. Input data and the resulting script from the interactive IPython session can be found in Supplemental Data <xref ref-type="supplementary-material" rid="SM3">2</xref>.</p>
<p>Graphs were generated in R (R Core Team, <xref ref-type="bibr" rid="B42">2013</xref>). The 3-D scatter plot was generated using plot3d{rgl} and quads3d{rgl}. Heatmaps were generated using heatmap.2{gplots}.</p>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec>
<title>Whole genome-based phylogeny supports multiple lineages of plant pathogenic <italic>Rhodococcus</italic></title>
<p>In our first sequencing effort, we used hybrid approaches to generate high quality assemblies for isolates D188 and A44a (Creason et al., <xref ref-type="bibr" rid="B8">2014</xref>). Unexpectedly, initial attempts to align the genome sequences were challenging, leading us to hypothesize that the two isolates represented different species of <italic>Rhodococcus</italic>. In order to test this hypothesis, we determined the genome sequences for 18 additional isolates of <italic>Rhodococcus</italic> identified from diseased plants or initially typed as <italic>R. fascians</italic> (Table <xref ref-type="table" rid="T1">1</xref>; Miteva et al., <xref ref-type="bibr" rid="B37">2004</xref>). The alignment of four genome sequences shows conservation of collinear blocks, with A44a being the most disparate in respect to the level of conservation and number and size of gaps between blocks (Figure <xref ref-type="supplementary-material" rid="SM1">S1</xref>).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p><bold>Isolates of <italic>Rhodococcus</italic> selected for whole genome sequencing</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left"><bold>Isolate<xref ref-type="table-fn" rid="TN1"><sup>&#x0002A;</sup></xref></bold></th>
<th align="left"><bold>Source</bold></th>
<th align="left"><bold>Geographic location</bold></th>
<th align="left"><bold>Year<xref ref-type="table-fn" rid="TN2"><sup>&#x000A7;</sup></xref></bold></th>
<th align="left"><bold>Group<xref ref-type="table-fn" rid="TN3"><sup>&#x02020;</sup></xref></bold></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">GIC26</td>
<td align="left">Greenland glacial ice core</td>
<td align="left">Greenland</td>
<td align="left">&#x0003E;120,000 years</td>
<td align="left">Sub-clade i</td>
</tr>
<tr>
<td align="left">GIC36</td>
<td align="left">Greenland glacial ice core</td>
<td align="left">Greenland</td>
<td align="left">&#x0003E;120,000 years</td>
<td align="left">Sub-clade i</td>
</tr>
<tr>
<td align="left">05-561-1</td>
<td align="left"><italic>Lavandula angustifolia</italic> &#x0201C;Violet Intrigue&#x0201D;</td>
<td align="left">Washington, USA</td>
<td align="left">2005</td>
<td align="left">Sub-clade i</td>
</tr>
<tr>
<td align="left">LMG3605</td>
<td align="left"><italic>Chrysanthemum</italic> &#x000D7; <italic>morifolium</italic></td>
<td align="left">United Kingdom</td>
<td align="left">Unknown</td>
<td align="left">Sub-clade i</td>
</tr>
<tr>
<td align="left"><italic>D188</italic></td>
<td align="left"><italic>Chrysanthemum</italic> &#x000D7; <italic>morifolium</italic></td>
<td align="left">Europe</td>
<td align="left">1984</td>
<td align="left">Sub-clade i</td>
</tr>
<tr>
<td align="left">LMG3602</td>
<td align="left"><italic>Lilium longiflorum</italic></td>
<td align="left">Moerbeke, Belgium</td>
<td align="left">Unknown</td>
<td align="left">Sub-clade i</td>
</tr>
<tr>
<td align="left"><bold>LMG3623 (Tilford&#x00027;s strain)</bold></td>
<td align="left"><italic>Lathyrus odoratus</italic></td>
<td align="left">USA</td>
<td align="left">Unknown</td>
<td align="left">Sub-clade i</td>
</tr>
<tr>
<td align="left">A3b</td>
<td align="left"><italic>Heliopsis helianthoides</italic> &#x0201C;Loraine Sunshine&#x0201D;</td>
<td align="left">Michigan, USA</td>
<td align="left">2005</td>
<td align="left">Sub-clade i</td>
</tr>
<tr>
<td align="left">LMG3616</td>
<td align="left"><italic>Lathyrus odoratus</italic></td>
<td align="left">United Kingdom</td>
<td align="left">Unknown</td>
<td align="left">Sub-clade i</td>
</tr>
<tr>
<td align="left">A78</td>
<td align="left"><italic>Leucanthemum</italic> &#x000D7; <italic>superbum</italic> &#x0201C;Becky&#x0201D;</td>
<td align="left">Pennsylvania, USA</td>
<td align="left">2002</td>
<td align="left">Sub-clade i</td>
</tr>
<tr>
<td align="left">A21d2</td>
<td align="left"><italic>Oenothera speciosa</italic> &#x0201C;Siskiyou&#x0201D;</td>
<td align="left">Michigan, USA</td>
<td align="left">2002</td>
<td align="left">Sub-clade ii</td>
</tr>
<tr>
<td align="left">04-516</td>
<td align="left"><italic>Aster</italic> &#x000D7; &#x0201C;Woods Pink&#x0201D;</td>
<td align="left">Florida, USA</td>
<td align="left">2004</td>
<td align="left">Sub-clade ii</td>
</tr>
<tr>
<td align="left">A25f</td>
<td align="left"><italic>Nemesia</italic> &#x000D7; &#x0201C;Natalie&#x0201D;</td>
<td align="left">Washington, USA</td>
<td align="left">2002</td>
<td align="left">Sub-clade ii</td>
</tr>
<tr>
<td align="left">LMG3625</td>
<td align="left"><italic>Lathyrus odoratus</italic></td>
<td align="left">United Kingdom</td>
<td align="left">1958</td>
<td align="left">Sub-clade ii</td>
</tr>
<tr>
<td align="left">05-339-1</td>
<td align="left"><italic>Hosta</italic> &#x0201C;Blue Umbrellas&#x0201D;</td>
<td align="left">Michigan, USA</td>
<td align="left">2005</td>
<td align="left">Sub-clade iii</td>
</tr>
<tr>
<td align="left">A76</td>
<td align="left"><italic>Veronica spicata</italic> &#x0201C;Royal Candles&#x0201D;</td>
<td align="left">Michigan, USA</td>
<td align="left">2002</td>
<td align="left">Sub-clade iii</td>
</tr>
<tr>
<td align="left"><italic>A44a</italic></td>
<td align="left"><italic>Veronica spicata</italic> &#x0201C;Minuet&#x0201D;</td>
<td align="left">Oregon, USA</td>
<td align="left">2002</td>
<td align="left">Clade II</td>
</tr>
<tr>
<td align="left">02-815</td>
<td align="left"><italic>Campanula</italic> &#x000D7; &#x0201C;Sarastro&#x0201D;</td>
<td align="left">Michigan, USA</td>
<td align="left">2002</td>
<td align="left">Clade II</td>
</tr>
<tr>
<td align="left">02-816c</td>
<td align="left"><italic>Viola</italic> &#x000D7; &#x0201C;Purple Showers&#x0201D;</td>
<td align="left">Michigan, USA</td>
<td align="left">2002</td>
<td align="left">Clade II</td>
</tr>
<tr>
<td align="left">A73a</td>
<td align="left"><italic>Aster amellus</italic> &#x0201C;Violet Queen&#x0201D;</td>
<td align="left">Pennsylvania, USA</td>
<td align="left">2003</td>
<td align="left">Clade II</td>
</tr>
<tr>
<td align="left">A22b</td>
<td align="left"><italic>Leucanthemum</italic> &#x000D7; superbum &#x0201C;Becky&#x0201D;</td>
<td align="left">Washington, USA</td>
<td align="left">2002</td>
<td align="left">Clade II</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="TN1"><label>&#x0002A;</label><p><italic>Isolates designated with LMG were obtained from Belgium co-ordinated collection of micro-organisms (BCCM); GIC isolates are from a Greenland glacial ice core; remaining isolates were obtained from diseased plants submitted to the Oregon State University (OSU) Plant Clinic. Italicized isolates, first sequenced using a hybrid approach; bold, type strain.</italic></p></fn>
<fn id="TN2"><label>&#x000A7;</label><p><italic>Year deposited (BCCM), isolated (OSU plant clinic), or trapped in ice (GIC isolates).</italic></p></fn>
<fn id="TN3"><label>&#x02020;</label><p><italic>Group designation is based on this study; sub-clades i-iii all belong to Clade I.</italic></p></fn>
</table-wrap-foot>
</table-wrap>
<p>We constructed a multi-gene phylogenetic tree for the 20 isolates of <italic>Rhodococcus</italic> based on the whole genome sequences to infer the evolutionary relationships. Homologous sequences were identified from all 20 and from the type strain of <italic>Nocardia</italic>, which we used as an outgroup (Ishikawa et al., <xref ref-type="bibr" rid="B20">2004</xref>). Clusters with paralogs, but not those with sequences potentially acquired via recombination, were filtered out, leaving 1727 clusters. The corresponding sequences from each isolate were concatenated, aligned, and used to derive a neighbor-joining (NJ) tree. The isolates formed two distinct clades (Figure <xref ref-type="fig" rid="F1">1</xref>). Clade I has substructure, with the largest and deepest branching sub-clade i consisting of the type strain LMG3623, D188, the two glacial ice core isolates, three other culture collection isolates, and three isolated from diseased plants. Sub-clade ii includes three isolates obtained directly from diseased plants and one obtained from a culture collection. Sub-clade iii is represented by just two isolates, 05-339-1, and A76, collected from diseased plants. As indicated by the longer branch lengths, the two isolates are more diverged. Clade II consists of four isolates from the US that includes A44a.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p><bold>Neighbor-joining tree based on 1727 homologous genes.</bold> A rooted neighbor-joining tree was constructed using translated sequences from 1727 genes present in all 20 isolates and the <italic>Nocardia farcinica</italic> type strain (not shown). Clade and sub-clade designations are indicated at the corresponding node. Scale bar, number of amino acid substitutions per site; only branches with lengths greater than zero are indicated. Isolates with a linear plasmid are denoted with a diagonal bar; empty boxes, absence of linear plasmid. Phytopathogenic isolates are indicated with, &#x0201C;&#x0002B;&#x0201D;; non-pathogenic isolates are indicated with, &#x0201C;&#x02212;.&#x0201D;</p></caption>
<graphic xlink:href="fpls-05-00406-g0001.tif"/>
</fig>
<p>We projected the presence/absence of a linear virulence plasmid and trait of phytopathogenicity on to the phylogenetic tree (Figure <xref ref-type="fig" rid="F1">1</xref>; Creason et al., <xref ref-type="bibr" rid="B8">2014</xref>). Phytopathogenicity is not exclusive to any clade or sub-clade. The five non-pathogenic isolates clustered in Clade I with four and one found in sub-clades i and ii, respectively. Sub-clade ii is the most variable in respect to virulence loci structure, as phytopathogenic A21d2 and A25f lack a linear virulence plasmid and A21d2 also lacks the entire <italic>fas</italic> operon (Creason et al., <xref ref-type="bibr" rid="B8">2014</xref>). These data are consistent with our hypothesis that leafy gall disease is caused by multiple species of <italic>Rhodococcus</italic> and explain why initial attempts in aligning the whole genome sequences of D188 and A44a were difficult.</p>
</sec>
<sec>
<title>Alternative whole-genome based analyses support the multiple lineages of <italic>Rhodococcus</italic></title>
<p>To support the existence of distinct groups of plant pathogenic <italic>Rhodococcus</italic>, we used alternative methods to cluster the bacteria based on similarities in their genome sequence features. We determined the average nucleotide identity values (ANIb; calculated with the BLAST algorithm) and tetranucleotide usage patterns (TETRA) for all pairwise comparisons (Teeling et al., <xref ref-type="bibr" rid="B55">2004</xref>; Konstantinidis and Tiedje, <xref ref-type="bibr" rid="B27">2005</xref>; Richter and Rossell&#x000F3;-M&#x000F3;ra, <xref ref-type="bibr" rid="B43">2009</xref>). A plot of ANI vs. TETRA formed three distinct clouds (Figure <xref ref-type="fig" rid="F2">2</xref>). The plots of the comparisons of isolates within sub-clades i and ii as well as Clade II coalesced into cloud 1. These comparisons were wholly within the calibrated and strictest thresholds of 96% ANI and 0.997 TETRA values that are recommended for circumscribing prokaryotic taxa (Richter and Rossell&#x000F3;-M&#x000F3;ra, <xref ref-type="bibr" rid="B43">2009</xref>). Results were identical when we relaxed ANI thresholds to 94% (data not shown). The reciprocal comparisons between isolates 05-339-1 and A76 of sub-clade iii associated with Cloud 1 but fell below ANI thresholds, regardless of which strictness level was used. The failure to exceed threshold is consistent with the greater divergence between these isolates, as observed in the NJ tree. Cloud 2 represented all possible comparisons between isolates in different sub-clades of Clade I. This cloud spanned the TETRA threshold value but was well below the ANI threshold values (Richter and Rossell&#x000F3;-M&#x000F3;ra, <xref ref-type="bibr" rid="B43">2009</xref>). Cloud 3 contains the most dissimilar comparisons between isolates of Clades I and II. The values within this cloud are similar to those derived from a comparison between <italic>R. equi</italic> 103S and an environmental isolate of <italic>Rhodococcus</italic> (McLeod et al., <xref ref-type="bibr" rid="B36">2006</xref>; Letek et al., <xref ref-type="bibr" rid="B31">2010</xref>). Thus, not only was the structure of the <italic>Rhodococcus</italic> samples supported by analysis with ANI and TETRA, but the consistency in results reconfirmed the use of ANI for inferring genetic relatedness, as was previously demonstrated by others (Goris et al., <xref ref-type="bibr" rid="B16">2007</xref>; Richter and Rossell&#x000F3;-M&#x000F3;ra, <xref ref-type="bibr" rid="B43">2009</xref>).</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p><bold>Cloud plot of ANIb vs. TETRA for all possible pairwise comparisons.</bold> Average nucleotide identity (ANIb; x-axis) and tetranucleotide (TETRA; y-axis) usage patterns were calculated and plotted as a factor of isolate grouping (z-axis). All pairwise comparisons, including reciprocal comparisons are presented, with colors assigned based on the clade membership of the isolate being compared to. Gray colored areas demark 96% ANI and 0.997 TETRA thresholds. Clouds are demarcated by dotted lines. Circles with black borders are below the TETRA threshold. Circles with yellow borders (comparisons between isolates of sub-clade iii; purple) exceed the TETRA threshold but not the ANIb threshold. The black circle represents a comparison between <italic>R. jostii</italic> RHA1 and <italic>R. equi</italic> 103S.</p></caption>
<graphic xlink:href="fpls-05-00406-g0002.tif"/>
</fig>
<p>The genetic code is nearly universal but synonymous codons are not used with equal frequencies across species because of a complex balance of multiple forces (Plotkin and Kudla, <xref ref-type="bibr" rid="B40">2011</xref>). Because this codon bias can be used to distinguish between dissimilar groups of organisms, we compared all possible pairwise combinations between the 20 isolates and displayed the similarity values as a heat map (Sharp and Li, <xref ref-type="bibr" rid="B48">1987</xref>). Codon usage preferences clearly differentiated the isolates of Clade I from those of Clade II (Figure <xref ref-type="fig" rid="F3">3A</xref>). Codon usage preferences also revealed a pattern consistent with the substructure observed in the NJ tree and ANI vs. TETRA analysis (Figures <xref ref-type="fig" rid="F1">1</xref>, <xref ref-type="fig" rid="F2">2</xref>). The three sub-clades were easily discernable, though the relationships within sub-clade i differed slightly from those inferred from the NJ tree.</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p><bold>Heatmaps of codon usage preference and homology. (A)</bold> Codon usage preference similarities were calculated for all possible pairwise comparisons. Lower values indicate fewer differences. Isolates were ordered according to their phylogenetic relationships. <bold>(B)</bold> Reciprocal BLASTP analysis was used to determine percent homology for all possible pairwise comparisons. Larger values indicate greater similarities. Isolates were ordered according to their phylogenetic relationships.</p></caption>
<graphic xlink:href="fpls-05-00406-g0003.tif"/>
</fig>
<p>The core genome hypothesis suggests that coherent clusters of bacteria have a core set of functions, and each member augments the core with a variable accessory genome that contributes functions for niche adaptation (Lan and Reeves, <xref ref-type="bibr" rid="B28">1996</xref>). The 20 sequenced isolates share a core of 3063 genes. However, sample size indubitably affects estimation of core genome identities and sizes. Given the small sample size of our collection and the imbalance in numbers of isolates between clades, we elected to cluster based on the percent of shared homologs rather than on core genomes (Figure <xref ref-type="fig" rid="F3">3B</xref>). The members of Clades I and II separated into distinct clusters, which could be taken as evidence for limited gene flow between clades. Relative to results from other approaches, the relationship of the isolates within Clade I were noticeably different, as sub-clade ii and sub-clade iii were not clearly demarcated. Regardless of these subtle differences, the results were entirely consistent in separating the 20 isolates into two groups of phytopathogenic <italic>Rhodococcus</italic>, with one also having evidence for substructure.</p>
</sec>
<sec>
<title>A molecular phylogeny based on whole genome sequences provides a framework for resolving the <italic>Rhodococcus</italic> genus</title>
<p>To develop a framework for resolving the <italic>Rhodococcus</italic> phylogeny, we constructed a multi-locus sequence analysis (MLSA) maximum likelihood (ML) tree based on 961 concatenated sequences representing 1316 members of the Actinobacteria phylum (Figure <xref ref-type="fig" rid="F4">4</xref>). We used seven marker genes that were previously identified as conserved and informative for the subclass Actinobacteridae (Ad&#x000E9;kambi et al., <xref ref-type="bibr" rid="B1">2011</xref>). <italic>Rhodococcus</italic> and <italic>Nocardia</italic> formed sister groups and the members of the <italic>Rhodococcus</italic> genus formed a well-supported phylogenetically coherent cluster (bootstrap percentage of 98%). We were able to identify two relatively defined groups and one small, less defined, group within the <italic>Rhodococcus</italic> genus (Figure <xref ref-type="fig" rid="F4">4</xref>; marked as a, b, and c). The two larger groups were consistent with the two clades previously described in a phylogeny based on 16S rDNA sequences (Jones and Goodfellow, <xref ref-type="bibr" rid="B21">2012</xref>). In contrast, in the MLSA ML tree, the smaller <italic>R. equi</italic> clade is within the larger <italic>Rhodococcus</italic> clade (bootstrap percentage of 93%), unlike previous studies, which associated the <italic>R. equi</italic> clade with <italic>Nocardia</italic>.</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p><bold>Multi-locus sequence analysis maximum likelihood tree of the Actinobacterium phylum.</bold> Translated sequences for <italic>ftsY</italic>, <italic>infB</italic>, <italic>rpoB</italic>, <italic>rsmA</italic>, <italic>secY</italic>, <italic>tsaD</italic>, and <italic>ychF</italic> from 1316 members were identified using TBLASTN, aligned, and used to generate a multi-locus maximum likelihood tree. A total of 961 sequences were used as input for tree generation. The 21 <italic>Rhodococcus</italic> isolates sequenced by our group are shown in bold and the two clades that include phytopathogenic <italic>Rhodococcus</italic> spp. are indicated. The <italic>R. erythropolis</italic>, <italic>R. rhodochrous</italic>, and <italic>R. equi</italic> clades, previously identified based on a 16S rDNA phylogeny of <italic>Rhodococcus</italic> are labeled with a, b, and c, respectively. Type strains are indicated with a superscript &#x0201C;T.&#x0201D; Branches outside of the <italic>Rhodococcus</italic> genus were collapsed at the genus, family, suborder, order, and subclass level, as appropriate, with the corners of the triangle indicating the shortest and longest total branch lengths for the members of the collapsed clade. A total of 500 rapid bootstraps were performed on this dataset, and branches are colored on a gradient to indicate bootstrap percentage (Green-Cyan-Red, with cutoffs of 100-75-50 and below, respectively). Scale bar, mean number of amino acid substitutions per site.</p></caption>
<graphic xlink:href="fpls-05-00406-g0004.tif"/>
</fig>
<p>The 20 isolates of interest in this study formed a distinct subgroup (bootstrap percentage of 100%) within the MLSA ML tree and also included five other isolates (Figure <xref ref-type="fig" rid="F4">4</xref>). Phytopathogenic isolate A22b, which was identified from a diseased plant and sequenced independently from the 20 isolates, clustered in Clade II. Two of the isolates that clustered with the 20 of interest in this study, were identified independent of plants and in extreme environments. <italic>Rhodococcus</italic> sp. JG-3 was isolated from permafrost (GenBank BioProject PRJNA195882) and <italic>Rhodococcus</italic> sp. AW25M09 was isolated from the stomach of an Atlantic Hagfish (Hjerde et al., <xref ref-type="bibr" rid="B18">2013</xref>). Two others, <italic>Rhodococcus</italic> spp. 29MFTsu3.1 and 114MTsu3.1 were found associated with the rhizosphere or endosphere of <italic>Arabidopsis thaliana</italic> (GenBank BioProject PRJNA201196).</p>
<p>The topology of the tree outside of the <italic>Rhodococcus</italic> genus was similar to previously reported trees and revealed inconsistencies in currently defined taxonomic groups, as previously observed (Ad&#x000E9;kambi et al., <xref ref-type="bibr" rid="B1">2011</xref>; Gao and Gupta, <xref ref-type="bibr" rid="B15">2012</xref>; Jones and Goodfellow, <xref ref-type="bibr" rid="B21">2012</xref>; Verma et al., <xref ref-type="bibr" rid="B57">2013</xref>). The Micrococcineae formed a polyphyletic group, with three to four distinct clades, depending on the tree and tree generation method (bootstrap percentages of 88&#x02013;100%). We also had difficulties in accurately placing the Frankineae into a discrete group, as they were found throughout the phylogeny. The <italic>Actinomyces neuii</italic> species formed a separate, but somewhat poorly supported (bootstrap percentage of 63%) clade with <italic>Mobiluncus curtisii</italic>, as was the case in a phylogeny based on 16S rDNA sequences (Jones and Goodfellow, <xref ref-type="bibr" rid="B21">2012</xref>). We observed a cryptic relationship with Actinopolysporineae included within the Pseudonocardineae clade. However, some of the branches within the Pseudonocardineae were poorly supported (bootstrap percentages &#x0003C;50%), indicating poor resolution with the clade as a whole. One key addition of the MLSA ML tree was that the branching of <italic>Actinomyces</italic> was well supported and is consistent with there being two large clades (bootstrap percentages of 90 and 100%).</p>
</sec>
<sec>
<title>ANI provides a framework for resolving the <italic>Rhodococcus</italic> genus</title>
<p>Because of the relatively few informative sites, bootstrap values at the tips of the MLSA ML tree were often low and insufficient for resolving the <italic>Rhodococcus</italic> genus; compare for example, the two clades of <italic>Rhodococcus</italic> with plant pathogenic members (Figures <xref ref-type="fig" rid="F1">1</xref>, <xref ref-type="fig" rid="F4">4</xref>). Therefore, to develop a molecular framework for the <italic>Rhodococcus</italic> genus, we used ANIb as a tool for inferring similarity. The values were calculated for 3422 pairwise comparisons of the 59 <italic>Rhodococcus</italic> isolates, compiled into a distance matrix, and used to generate a divergence dendogram (Figure <xref ref-type="fig" rid="F5">5</xref>; Chan et al., <xref ref-type="bibr" rid="B6">2012</xref>). Seven distinct clusters formed with inter-group comparisons that exceeded ANI values of 70&#x02013;75%, a range that is typically found between members of the same genus.</p>
<fig id="F5" position="float">
<label>Figure 5</label>
<caption><p><bold>Average nucleotide identity dendogram for 59 isolates of <italic>Rhodococcus</italic>.</bold> Complete genome sequences for 59 isolates of <italic>Rhodococcus</italic> were used to generate an ANI matrix. The matrix was used to calculate an ANI divergence dendrogram. Groups are color coded according to groups represented in the MLSA ML tree. Branches are colored using cutoffs for pairwise comparisons of all taxa after the nodal point. The 21 <italic>Rhodococcus</italic> isolates sequenced by our group are shown in bold; type strains are designated with a superscript &#x0201C;T.&#x0201D; Clades and sub-clades of the phytopathogenic <italic>Rhodococcus</italic> isolates are labeled at the corresponding node. <sup>&#x0002A;</sup>Indicates conflict between placement within the dendogram and calculated ANI values (see corresponding text for details).</p></caption>
<graphic xlink:href="fpls-05-00406-g0005.tif"/>
</fig>
<p>The two clades of phytopathogenic <italic>Rhodococcus</italic> spp. represent seven different species, when using ANI and a 94% threshold. As previously observed, Clade I has complex substructure and represents four different species (Figures <xref ref-type="fig" rid="F1">1</xref>&#x02013;<xref ref-type="fig" rid="F3">3</xref>, <xref ref-type="fig" rid="F5">5</xref>). Like conclusions described above, subclade i represents the originally named <italic>R. fascians</italic> species, as it includes the type strain LMG3623, along with the most sequenced isolates. Isolates in subclade iii were not as closely grouped, which was also consistent with results from the NJ tree and analysis of codon usage. In fact, based on the single criterion of ANI, 05-339-1, and A76 should be considered as separate species. Subclade iv is represented by a single isolate, <italic>Rhodococcus</italic> sp. AW25M09. In Clade II, isolate A22b is just below the 94% ANI cutoff and may represent a separate subspecies from those that formed this second major clade in previously described analyses (Figures <xref ref-type="fig" rid="F1">1</xref>&#x02013;<xref ref-type="fig" rid="F3">3</xref>, <xref ref-type="fig" rid="F5">5</xref>).</p>
<p>The other isolates of <italic>Rhodococcus</italic>, which are not known as plant-associated, formed five additional clades. Within these clades, and including single isolates (singletons), there were 13 smaller groups defined by ANI values of 94% ANI or greater. These relationships can be used to infer species groups.</p>
<p>Clade III consists mostly of isolates named as <italic>Rhodococcus erythropolis</italic>. The group also included <italic>Rhodococcus qinshengii</italic> BKS 20-40, <italic>Rhodococcus rhodochrous</italic> ATCC 17895, and <italic>Rhodococcus</italic> sp. P27, which based on their association to the clade, could be considered members of the <italic>R. erythropolis</italic> species.</p>
<p>Clade IV has two subgroups and two singletons. The first subgroup is represented by <italic>Rhodococcus jostii</italic> RHA1 and also included <italic>Rhodococcus</italic> spp. DK17 and JVH1. The second subgroup varied in terms of named species and consisted of two isolates of <italic>Rhodococcus opacus</italic>, the type strain of <italic>Rhodococcus imtechensis</italic> and <italic>Rhodococcus wratislaviensis</italic> IFP 2016. Because the latter strain did not cluster with the type strain of <italic>R. wratislaviensis</italic>, NBRC 100605, we suggest that IFP 2016 is not a member of the <italic>R. wratislaviensis</italic> species and it belongs to a different species of <italic>Rhodococcus</italic>. The other singleton isolate was <italic>R. opacus</italic> B4. A whole genome sequence of the <italic>R. opacus</italic> type strain is unavailable. Thus, we cannot suggest whether the subgroup or the singleton should be designated as the <italic>R. opacus</italic> species. Furthermore, 16S rDNA sequences of the three named <italic>R. opacus</italic> isolates are too similar for resolving this issue. Interestingly, <italic>R. opacus</italic> PD630 is most similar to <italic>R. wratislaviensis</italic> NBRC 100605 in respect to ANI values, but the two did not associate with one another in the divergence dendogram.</p>
<p>Clade V contains a tightly clustered group with members of the <italic>Rhodococcus equi</italic> species. Its placement within <italic>Rhodococcus</italic> was consistent with the MLSA ML tree.</p>
<p>Clade VI consists of two smaller subgroups. The first subgroup has <italic>R. rhodochrous</italic> ATCC 21198 and two undesignated isolates (EsD8 and BCP1). The low ANI of 72% placed <italic>R. rhodochrous</italic> ATCC 21198 in a clade separate from isolate <italic>R. rhodochrous</italic> BKS6-46. To further investigate this discrepancy, we used the 16S rDNA sequence from <italic>R. rhodochrous</italic> ATCC 21198 as a query in a BLASTN&#x0002B; search. The sequence identified corresponding sequences from <italic>Rhodococcus aetherivorans</italic> (100% identity, 100% subject coverage), including the type strain DSM 44752. When we used the 16S rDNA sequence from the type strain of <italic>R. rhodochrous</italic> as a query, it showed greater similarity to the corresponding sequence of <italic>R. rhodochrous</italic> BKS6-46 rather than ATCC 21198. In all, these data suggest the identity of ATCC 211983 should be revisited. The second subgroup of Clade VI consists of two <italic>Rhodococcus ruber</italic> isolates (BKS 20-38 and Chol-4) and another undesignated isolate (P14). Thus, we suggest that P14 is a member of the <italic>R. ruber</italic> species.</p>
<p>Clade VII has one cluster of defined species and two singletons. Two isolates of <italic>Rhodococcus pyridinivorans</italic> (AK37 and SB3094) and <italic>Rhodococcus</italic> sp. R04 clustered, which we suggest represents the <italic>R. pyridinivorans</italic> species. <italic>R. rhodochrous</italic> BKS6-46 is a singleton in this clade but its precise placement within this clade of the dendrogram is somewhat misleading. The ANI values for the pairwise comparisons of <italic>R. rhodochrous</italic> BKS6-46 to the two named <italic>R. pyridinivorans</italic>, but not <italic>Rhodococcus</italic> sp. R04, exceeded the 94% threshold used to define a species relationship. The values derived from comparison with <italic>Rhodococcus</italic> sp. R04 likely caused <italic>R. rhodochrous</italic> BKS6-46 to form its own branch. The last singleton is <italic>Rhodococcus</italic> sp. R1101, which had ANI values around 90% relative to the other isolates of Clade VII.</p>
<p>There were two outliers in the dendogram. <italic>Rhodococcus rhodnii</italic> and <italic>Rhodococcus triatomae</italic> were identified as singletons and placed in a clade closest to Clade V (<italic>R. equi</italic>). These isolates had low ANI values (between 71 and 75%) in comparison to all of the sequenced <italic>Rhodococcus</italic> isolates.</p>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>Leafy gall disease is a substantial economic problem for the horticultural industry. The pathogen has an extensive host range that includes most plants important to the industry (Putnam and Miller, <xref ref-type="bibr" rid="B41">2007</xref>). Current management strategies rely on visual inspection and the only method of control is the destruction of infected plant material. While visual inspection of plants for disease is superficially trivial, the absence of fundamental information on its epidemiology and the lack of robust, on-site diagnostics contribute to make disease management challenging. Whole genome sequencing is a cost-effective and facile approach for studying bacterial species and has important practical implications for diagnosis of disease. In this study, we used whole genome sequences to analyze the genetic diversity of plant pathogenic <italic>Rhodococcus</italic> as a first step toward the development of better management strategies for this pathogen.</p>
<p>Twenty isolates, many of which were identified from diseased plants, were previously selected for whole genome sequencing (Creason et al., <xref ref-type="bibr" rid="B8">2014</xref>). Based on a tree derived from 1727 homologous genes, we demonstrated that the 20 isolates separated into distinct clades and sub-clades (Figure <xref ref-type="fig" rid="F1">1</xref>). Similarities in genome features, including ANI, TETRA, codon usage preference, and degree of genome homology, were all consistent in clustering the isolates into distinct and coherent groups (Figures <xref ref-type="fig" rid="F2">2</xref>, <xref ref-type="fig" rid="F3">3</xref>). The similarity in results between the tree and ANI was encouraging and gave us confidence in using the latter for inferring evolutionary relationships of isolates within a larger dataset (Figure <xref ref-type="fig" rid="F5">5</xref>).</p>
<p>The phytopathogenic isolates formed a subgroup distinct from other members of the <italic>Rhodococcus</italic> genus, which could be taken as evidence for cohesion (Figures <xref ref-type="fig" rid="F4">4</xref>, <xref ref-type="fig" rid="F5">5</xref>). However, within this subgroup, phytopathogenicity is not a discriminative trait (Figures <xref ref-type="fig" rid="F1">1</xref>, <xref ref-type="fig" rid="F4">4</xref>, <xref ref-type="fig" rid="F5">5</xref>). Although, we speculate that members of this subgroup are potentiated toward phytopathogenicity. Virulence evolution in <italic>Rhodococcus</italic> has been modeled according to a mechanism of gene co-option whereby limited, but key horizontally acquired virulence genes, trigger the co-option of core genes for virulence (Letek et al., <xref ref-type="bibr" rid="B31">2010</xref>; Creason et al., <xref ref-type="bibr" rid="B8">2014</xref>). As few as four functions, most often conferred by a cluster of genes vectored by a conjugative virulence plasmid, are hypothesized to be sufficient for phytopathogenicity for members of these genetically diverse clades of <italic>Rhodococcus</italic>.</p>
<p>The non-pathogenic isolates recovered from a Greenland glacier ice core, GIC26, and GIC36, as well as <italic>Rhodococcus</italic> sp. JG-3 from permafrost and AW25M09 from the stomach of the Atlantic Hagfish, clustered with phytopathogenic isolates in Clade I (Miteva et al., <xref ref-type="bibr" rid="B37">2004</xref>; Hjerde et al., <xref ref-type="bibr" rid="B18">2013</xref>). <italic>Rhodococcus</italic> spp. 29MFTsu3.1 and 114MTsu3.1, both of which associate with plants, clustered in Clade II. Inspection of the genome sequences of isolates JG-3, AW25M09, 29MFTsu3.1, and 114MTsu3.1, failed to reveal any of the virulence genes known to be necessary for <italic>Rhodococcus</italic> to cause leafy gall disease. We did detect a linear plasmid-like sequence in the draft genome of <italic>Rhodococcus</italic> sp. AW25M09, but it lacks genes known to be necessary for virulence toward plants (Hjerde et al., <xref ref-type="bibr" rid="B18">2013</xref>). It would be interesting to test whether these isolates, upon acquisition of genes that confer the four key virulence functions, gain the ability to infect and cause disease to plants.</p>
<p>Our genome sequencing effort of plant-associated isolates contributed to increasing the number of sequenced <italic>Rhodococcus</italic> isolates by 50%. Not since a decade ago, have genomic features been used to address the <italic>Rhodococcus</italic> genus (G&#x000FC;rtler et al., <xref ref-type="bibr" rid="B17">2004</xref>). We therefore used these genome sequences, an additional sequence we generated for phytopathogenic isolate A22b, along with most currently available rhodococcal genome sequences, to construct a molecular phylogeny to help resolve the genus and shed light on the phylum Actinobacteria as a whole. MLSA provided a framework for defining genus level relationships that can be further explored (Figure <xref ref-type="fig" rid="F4">4</xref>). In this study, the <italic>R. equi</italic> species was placed within the <italic>Rhodococcus</italic> genus, consistent with previously reported results derived from whole-genome based approaches, and in contrast to phylogenetic analysis based on 16S rDNA sequences (Letek et al., <xref ref-type="bibr" rid="B31">2010</xref>; Jones and Goodfellow, <xref ref-type="bibr" rid="B21">2012</xref>). Use of MLSA to infer phylogeny revealed inconsistencies in the placement of Frankineae, in contrast to a 16S rDNA-based phylogeny, which formed a Frankineae cluster near the root of the phylum (Jones and Goodfellow, <xref ref-type="bibr" rid="B21">2012</xref>). However, MLSA was not sufficient for resolving some of the branches within the Pseudonocardineae (bootstrap percentages &#x0003C;50%). More genome sequences, or informative marker sequences for members of Actinopolysporineae might help discern the finer details within the suborder. Other minor discrepancies in single isolate naming were also noted (Supplemental Data <xref ref-type="supplementary-material" rid="SM2">1</xref>).</p>
<p>With ANI, we were able to place the 59 <italic>Rhodococcus</italic> isolates with sequenced genomes into seven large groups. Results were consistent with the MLSA ML tree but ANI provided greater resolution of their relationships (Figure <xref ref-type="fig" rid="F5">5</xref>). We were also able to infer at least 20 different species from these 59 isolates. Some conflicts between clustering and species naming were noted and suggest a need to revisit their taxonomical groupings. However, we recognize that the use of ANI to infer taxonomical groupings merely provides a framework and is not sufficient, by itself, for defining bacterial species. At present, in order to validly designate a species, it needs to be further characterized for at least one discriminative trait. Moreover, the designation of a species would also require inclusion of their corresponding type strain.</p>
<p>The methods that were used were congruent in supporting the existence of multiple species of <italic>Rhodococcus</italic>. However, each of the methods has limitations. The tree developed based on whole genome sequences was computationally and time intensive and precluded us from using methods with stronger statistical frameworks for phylogenetic reconstruction and also became too time-intensive for larger datasets. MLSA is limited by the need for a sufficient set of generalizable and informative marker genes. Despite the reduced number of sequences, the MLSA approach was still computationally and time intensive with larger datasets. ANIb can be limiting in projecting relationships in an evolutionary context (Figure <xref ref-type="fig" rid="F2">2</xref>). Though the dendrogram constructed based on Euclidean distances was a convenient way for visualizing dissimilarities, there were nevertheless some discrepancies between placement of isolates and their measured ANI values (Figure <xref ref-type="fig" rid="F5">5</xref>). For example, <italic>R. opacus</italic> PD630 and <italic>R. wratislaviensis</italic> NBRC 100605 have an ANI value that warranted consideration as a species, but their dissimilarities to other isolates prevented the two from clustering in the dendogram (Figure <xref ref-type="fig" rid="F5">5</xref>). Lastly, similarities based on codon usage preferences and genome inventories were inadequate for resolving relationships between isolates with highly similar genome signatures (Figure <xref ref-type="fig" rid="F3">3</xref>). Nevertheless, when multiple methods were coupled, we were able to make strong inferences regarding taxonomical relationships.</p>
<p>In summary, we used whole genome sequences to resolve the phytopathogenic members of <italic>Rhodococcus</italic> into multiple sister species and developed a dataset that contributes to reconstructing the phylogeny of the <italic>Rhodococcus</italic> genus.</p>
</sec>
<sec>
<title>Author contributions</title>
<p>These authors contributed equally. Allison L. Creason, Edward W. Davis II, Melodie L. Putnam, Olivier M. Vandeputte and Jeff H. Chang contributed to the conception and design of the work, analysis of data, and wrote and approved the manuscript. Allison L. Creason, Edward W. Davis II, Melodie L. Putnam, and Olivier M. Vandeputte also contributed to the acquisition of isolates and/or genome sequences. The authors are accountable for all aspects of the work.</p>
<sec>
<title>Conflict of interest statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p></sec>
</sec>
</body>
<back>
<ack>
<p>We would like to thank Alex Buchanan, Ethan Chang, and Tyler Horton, of the Chang lab for their assistance. We would also like to thank Joyce Loper for critical reading of this manuscript. This work is supported by a grant from the Agricultural Research Foundation awarded to Jeff H. Chang. Edward W. Davis II is supported by a Provost&#x00027;s Distinguished Graduate Fellowship awarded by Oregon State University. This material is based upon work supported by the National Science Foundation Graduate Research Fellowship under Grant No. DGE-1314109 to Edward W. Davis II. Any opinion, findings, and conclusions or recommendations expressed in this material are those of the authors(s) and do not necessarily reflect the views of the National Science Foundation. This work is also supported in part by the Society of American Florists and a cooperative agreement with USDA-ARS awarded to Melodie L. Putnam. Olivier M. Vandeputte was supported by grants from FRS-FNRS and the <italic>Fonds David et Alice Van Buuren</italic> (Belgium). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</p>
</ack>
<sec sec-type="supplementary material" id="s5">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="http://www.frontiersin.org/journal/10.3389/fpls.2014.00406/abstract">http://www.frontiersin.org/journal/10.3389/fpls.2014.00406/abstract</ext-link></p>
<supplementary-material xlink:href="Image1.JPEG" id="SM1" mimetype="image/jpeg" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Figure S1</label>
<caption><p><bold>Genome alignments of four members of phytopathogenic <italic>Rhodococcus</italic>.</bold> Sequences of the chromosomes of isolates D188, A21d2, 05-339-1, and A44a were aligned using progressiveMauve. Each colored square represents a block of sequences that is collinear to a corresponding block of sequences in another genome sequence; linear collinear blocks (LCBs). The extent of homology within LCBs is represented by the height of the colored plot in each block.</p></caption></supplementary-material>
<supplementary-material xlink:href="DataSheet1.ZIP" id="SM2" mimetype="application/zip" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Supplemental Data 1</label>
<caption><p><bold>Compressed folder with MLSA tree in Newick tree format file and its associated data.</bold> The phylogenetic tree of the entire Actinobacteria phylum is available as a Newick file. Also included are three comma separate files. (1) SD1_identical_seqs.csv: lists the names of the duplicate sequences that were removed prior to generating the final tree. The names of the species used in generating the tree are listed in the first column. The remaining columns contain the names of the species that were filtered out based on having an identical sequence. (2) SD1_names_identifiers.csv: associates genome sequence identifiers (either GI number for complete genomes, or 4 letter code for wgs sequences) to corresponding isolates in the tree. (3) SD1_taxonomy.csv: lists the taxonomy values for each of the taxa.</p></caption></supplementary-material>
<supplementary-material xlink:href="DataSheet2.ZIP" id="SM3" mimetype="application/zip" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Supplemental Data 2</label>
<caption><p><bold>Compressed folder with Python script and input data for ANI dendrogram generation.</bold> A Python script file is included that can generate an ANI dendrogram from the two input files, one that includes the ANI divergence matrix (SD2_ani_data.txt), and one that includes the names of the genomes included in the analysis (SD2_ani_names.txt).</p></caption></supplementary-material>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ad&#x000E9;kambi</surname> <given-names>T.</given-names></name> <name><surname>Butler</surname> <given-names>R. W.</given-names></name> <name><surname>Hanrahan</surname> <given-names>F.</given-names></name> <name><surname>Delcher</surname> <given-names>A. L.</given-names></name> <name><surname>Drancourt</surname> <given-names>M.</given-names></name> <name><surname>Shinnick</surname> <given-names>T. M.</given-names></name></person-group> (<year>2011</year>). <article-title>Core gene set as the basis of multilocus sequence analysis of the subclass Actinobacteridae</article-title>. <source>PLoS ONE</source> <volume>6</volume>:<fpage>e14792</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0014792</pub-id><pub-id pub-id-type="pmid">21483493</pub-id></citation>
</ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bertani</surname> <given-names>G.</given-names></name></person-group> (<year>1951</year>). <article-title>Studies on lysogenesis. I. The mode of phage liberation by lysogenic <italic>Escherichia coli</italic></article-title>. <source>J. Bacteriol</source>. <volume>62</volume>, <fpage>293</fpage>&#x02013;<lpage>300</lpage>. <pub-id pub-id-type="pmid">14888646</pub-id></citation>
</ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bohlin</surname> <given-names>J.</given-names></name> <name><surname>Skjerve</surname> <given-names>E.</given-names></name> <name><surname>Ussery</surname> <given-names>D. W.</given-names></name></person-group> (<year>2008</year>). <article-title>Reliability and applications of statistical methods based on oligonucleotide frequencies in bacterial and archaeal genomes</article-title>. <source>BMC Genomics</source> <volume>9</volume>:<fpage>104</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-9-104</pub-id><pub-id pub-id-type="pmid">18307761</pub-id></citation>
</ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Camacho</surname> <given-names>C.</given-names></name> <name><surname>Coulouris</surname> <given-names>G.</given-names></name> <name><surname>Avagyan</surname> <given-names>V.</given-names></name> <name><surname>Ma</surname> <given-names>N.</given-names></name> <name><surname>Papadopoulos</surname> <given-names>J.</given-names></name> <name><surname>Bealer</surname> <given-names>K.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>BLAST&#x0002B;: architecture and applications</article-title>. <source>BMC Bioinformatics</source> <volume>10</volume>:<fpage>421</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-10-421</pub-id><pub-id pub-id-type="pmid">20003500</pub-id></citation>
</ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Castresana</surname> <given-names>J.</given-names></name></person-group> (<year>2000</year>). <article-title>Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis</article-title>. <source>Mol. Biol. Evol</source>. <volume>17</volume>, <fpage>540</fpage>&#x02013;<lpage>552</lpage>. <pub-id pub-id-type="doi">10.1093/oxfordjournals.molbev.a026334</pub-id><pub-id pub-id-type="pmid">10742046</pub-id></citation>
</ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chan</surname> <given-names>J. Z.-M.</given-names></name> <name><surname>Halachev</surname> <given-names>M. R.</given-names></name> <name><surname>Loman</surname> <given-names>N. J.</given-names></name> <name><surname>Constantinidou</surname> <given-names>C.</given-names></name> <name><surname>Pallen</surname> <given-names>M. J.</given-names></name></person-group> (<year>2012</year>). <article-title>Defining bacterial species in the genomic era: insights from the genus Acinetobacter</article-title>. <source>BMC Microbiol</source>. <volume>12</volume>:<fpage>302</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2180-12-302</pub-id><pub-id pub-id-type="pmid">23259572</pub-id></citation>
</ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cornelis</surname> <given-names>K.</given-names></name> <name><surname>Ritsema</surname> <given-names>T.</given-names></name> <name><surname>Nijsse</surname> <given-names>J.</given-names></name> <name><surname>Holsters</surname> <given-names>M.</given-names></name> <name><surname>Goethals</surname> <given-names>K.</given-names></name> <name><surname>Jaziri</surname> <given-names>M.</given-names></name></person-group> (<year>2001</year>). <article-title>The plant pathogen <italic>Rhodococcus fascians</italic> colonizes the exterior and interior of the aerial parts of plants</article-title>. <source>Mol. Plant Microbe. Interact</source>. <volume>14</volume>, <fpage>599</fpage>&#x02013;<lpage>608</lpage>. <pub-id pub-id-type="doi">10.1094/MPMI.2001.14.5.599</pub-id><pub-id pub-id-type="pmid">11332724</pub-id></citation>
</ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Creason</surname> <given-names>A. L.</given-names></name> <name><surname>Vandeputte</surname> <given-names>O. M.</given-names></name> <name><surname>Savory</surname> <given-names>E. A.</given-names></name> <name><surname>Davis</surname> <given-names>E. W.</given-names></name> <name><surname>Putnam</surname> <given-names>M. L.</given-names></name> <name><surname>Hu</surname> <given-names>E.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Analysis of genome sequences from plant pathogenic <italic>Rhodococcus</italic> reveals genetic novelties in virulence Loci</article-title>. <source>PLoS ONE</source> <volume>9</volume>:<fpage>e101996</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0101996</pub-id><pub-id pub-id-type="pmid">25010934</pub-id></citation>
</ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Crespi</surname> <given-names>M.</given-names></name> <name><surname>Messens</surname> <given-names>E.</given-names></name> <name><surname>Caplan</surname> <given-names>A. B.</given-names></name> <name><surname>van Montagu</surname> <given-names>M.</given-names></name> <name><surname>Desomer</surname> <given-names>J.</given-names></name></person-group> (<year>1992</year>). <article-title>Fasciation induction by the phytopathogen <italic>Rhodococcus fascians</italic> depends upon a linear plasmid encoding a cytokinin synthase gene</article-title>. <source>EMBO J</source>. <volume>11</volume>, <fpage>795</fpage>&#x02013;<lpage>804</lpage>. <pub-id pub-id-type="pmid">1547783</pub-id></citation>
</ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Darling</surname> <given-names>A. E.</given-names></name> <name><surname>Mau</surname> <given-names>B.</given-names></name> <name><surname>Perna</surname> <given-names>N. T.</given-names></name></person-group> (<year>2010</year>). <article-title>progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement</article-title>. <source>PLoS ONE</source> <volume>5</volume>:<fpage>e11147</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0011147</pub-id><pub-id pub-id-type="pmid">20593022</pub-id></citation>
</ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Doolittle</surname> <given-names>W. F.</given-names></name> <name><surname>Zhaxybayeva</surname> <given-names>O.</given-names></name></person-group> (<year>2009</year>). <article-title>On the origin of prokaryotic species</article-title>. <source>Genome. Res</source>. <volume>19</volume>, <fpage>744</fpage>&#x02013;<lpage>756</lpage>. <pub-id pub-id-type="doi">10.1101/gr.086645.108</pub-id><pub-id pub-id-type="pmid">19411599</pub-id></citation>
</ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dudnik</surname> <given-names>A.</given-names></name> <name><surname>Bigler</surname> <given-names>L.</given-names></name> <name><surname>Dudler</surname> <given-names>R.</given-names></name></person-group> (<year>2014</year>). <article-title>The Endophytic Strain <italic>Rhizobium</italic> sp. AP16 produces the proteasome inhibitor Syringolin A</article-title>. <source>Appl. Environ. Microbiol</source>. <volume>80</volume>, <fpage>3741</fpage>&#x02013;<lpage>3748</lpage>. <pub-id pub-id-type="doi">10.1128/AEM.00395-14</pub-id></citation>
</ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dur&#x000E1;n</surname> <given-names>D.</given-names></name> <name><surname>Rey</surname> <given-names>L.</given-names></name> <name><surname>Mayo</surname> <given-names>J.</given-names></name> <name><surname>Zu&#x000F1;iga-Davila</surname> <given-names>D.</given-names></name> <name><surname>Imperial</surname> <given-names>J.</given-names></name> <name><surname>Ruiz-Arg&#x000FC;eso</surname> <given-names>T.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title><italic>Bradyrhizobium paxllaeri</italic> sp. nov. and <italic>Bradyrhizobium icense</italic> sp. nov., nitrogen-fixing rhizobial symbionts of Lima bean (<italic>Phaseolus lunatus</italic> L.) in Peru</article-title>. <source>Int. J. Syst. Evol. Microbiol</source>. <volume>64</volume>, <fpage>2072</fpage>&#x02013;<lpage>2078</lpage>. <pub-id pub-id-type="doi">10.1099/ijs.0.060426-0</pub-id><pub-id pub-id-type="pmid">24664579</pub-id></citation>
</ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Francis</surname> <given-names>I.</given-names></name> <name><surname>De Keyser</surname> <given-names>A.</given-names></name> <name><surname>De Backer</surname> <given-names>P.</given-names></name> <name><surname>Sim&#x000F3;n-Mateo</surname> <given-names>C.</given-names></name> <name><surname>Kalkus</surname> <given-names>J.</given-names></name> <name><surname>Pertry</surname> <given-names>I.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>pFiD188, the Linear Virulence Plasmid of <italic>Rhodococcus fascians</italic> D188</article-title>. <source>Mol. Plant Microbe. Interact</source>. <volume>25</volume>, <fpage>637</fpage>&#x02013;<lpage>647</lpage>. <pub-id pub-id-type="doi">10.1094/MPMI-08-11-0215</pub-id><pub-id pub-id-type="pmid">22482837</pub-id></citation>
</ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gao</surname> <given-names>B.</given-names></name> <name><surname>Gupta</surname> <given-names>R. S.</given-names></name></person-group> (<year>2012</year>). <article-title>Phylogenetic framework and molecular signatures for the main clades of the phylum Actinobacteria</article-title>. <source>Microbiol. Mol. Biol. Rev</source>. <volume>76</volume>, <fpage>66</fpage>&#x02013;<lpage>112</lpage>. <pub-id pub-id-type="doi">10.1128/MMBR.05011-11</pub-id><pub-id pub-id-type="pmid">22390973</pub-id></citation>
</ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goris</surname> <given-names>J.</given-names></name> <name><surname>Konstantinidis</surname> <given-names>K. T.</given-names></name> <name><surname>Klappenbach</surname> <given-names>J. A.</given-names></name> <name><surname>Coenye</surname> <given-names>T.</given-names></name> <name><surname>Vandamme</surname> <given-names>P.</given-names></name> <name><surname>Tiedje</surname> <given-names>J. M.</given-names></name></person-group> (<year>2007</year>). <article-title>DNA-DNA hybridization values and their relationship to whole-genome sequence similarities</article-title>. <source>Int. J. Syst. Evol. Microbiol</source>. <volume>57</volume>, <fpage>81</fpage>&#x02013;<lpage>91</lpage>. <pub-id pub-id-type="doi">10.1099/ijs.0.64483-0</pub-id><pub-id pub-id-type="pmid">17220447</pub-id></citation>
</ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>G&#x000FC;rtler</surname> <given-names>V.</given-names></name> <name><surname>Mayall</surname> <given-names>B. C.</given-names></name> <name><surname>Seviour</surname> <given-names>R.</given-names></name></person-group> (<year>2004</year>). <article-title>Can whole genome analysis refine the taxonomy of the genus <italic>Rhodococcus</italic>?</article-title> <source>FEMS Microbiol. Rev</source>. <volume>28</volume>, <fpage>377</fpage>&#x02013;<lpage>403</lpage>. <pub-id pub-id-type="doi">10.1016/j.femsre.2004.01.001</pub-id><pub-id pub-id-type="pmid">15449609</pub-id></citation>
</ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hjerde</surname> <given-names>E.</given-names></name> <name><surname>Pierechod</surname> <given-names>M. M.</given-names></name> <name><surname>Williamson</surname> <given-names>A. K.</given-names></name> <name><surname>Bjerga</surname> <given-names>G. E. K.</given-names></name> <name><surname>Willassen</surname> <given-names>N. P.</given-names></name> <name><surname>Smal&#x000E5;s</surname> <given-names>A. O.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Draft genome sequence of the Actinomycete <italic>Rhodococcus</italic> sp. Strain AW25M09, Isolated from the Hadsel Fjord, Northern Norway</article-title>. <source>Genome Announc</source>. <volume>1</volume>:<fpage>e0005513</fpage>. <pub-id pub-id-type="doi">10.1128/genomeA.00055-13</pub-id><pub-id pub-id-type="pmid">23516194</pub-id></citation>
</ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hunter</surname> <given-names>J. D.</given-names></name></person-group> (<year>2007</year>). <article-title>Matplotlib: a 2D graphics environment</article-title>. <source>Comput. Sci. Eng</source>. <volume>9</volume>, <fpage>90</fpage>&#x02013;<lpage>95</lpage>. <pub-id pub-id-type="doi">10.1109/MCSE.2007.55</pub-id></citation>
</ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ishikawa</surname> <given-names>J.</given-names></name> <name><surname>Yamashita</surname> <given-names>A.</given-names></name> <name><surname>Mikami</surname> <given-names>Y.</given-names></name> <name><surname>Hoshino</surname> <given-names>Y.</given-names></name> <name><surname>Kurita</surname> <given-names>H.</given-names></name> <name><surname>Hotta</surname> <given-names>K.</given-names></name> <etal/></person-group>. (<year>2004</year>). <article-title>The complete genomic sequence of <italic>Nocardia farcinica</italic> IFM 10152</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>101</volume>, <fpage>14925</fpage>&#x02013;<lpage>14930</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0406410101</pub-id><pub-id pub-id-type="pmid">15466710</pub-id></citation>
</ref>
<ref id="B21">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Jones</surname> <given-names>A. L.</given-names></name> <name><surname>Goodfellow</surname> <given-names>M. L.</given-names></name></person-group> (<year>2012</year>). <article-title>Genus IV. <italic>Rhodococcus</italic> (Zopf 1891) emend Goodfellow, Alderson and Chun 1998a</article-title>, in <source>Bergey&#x00027;s Manual of Systematic Bacteriology 2nd Edn. Vol</source>. 5 <italic>The Actinobacteria, Part A</italic>, eds <person-group person-group-type="editor"><name><surname>Goodfellow</surname> <given-names>M.</given-names></name> <name><surname>Kampfer</surname> <given-names>P.</given-names></name> <name><surname>Busee</surname> <given-names>H. J.</given-names></name> <name><surname>Trujillo</surname> <given-names>M. E.</given-names></name> <name><surname>Suzuki</surname> <given-names>K.</given-names></name> <name><surname>Ludwig</surname> <given-names>W.</given-names></name> <name><surname>Whitman</surname> <given-names>W. B.</given-names></name></person-group> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>437</fpage>&#x02013;<lpage>477</lpage>.</citation>
</ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kado</surname> <given-names>C. I.</given-names></name> <name><surname>Heskett</surname> <given-names>M. G.</given-names></name></person-group> (<year>1970</year>). <article-title>Selective media for isolation of <italic>Agrobacterium, Corynebacterium</italic>, <italic>Erwinia, Pseudomonas</italic>, and <italic>Xanthomonas</italic></article-title>. <source>Phytopathology</source> <volume>60</volume>, <fpage>969</fpage>&#x02013;<lpage>976</lpage>. <pub-id pub-id-type="doi">10.1094/Phyto-60-969</pub-id><pub-id pub-id-type="pmid">5469886</pub-id></citation>
</ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Katoh</surname> <given-names>K.</given-names></name> <name><surname>Standley</surname> <given-names>D. M.</given-names></name></person-group> (<year>2013</year>). <article-title>MAFFT Multiple Sequence Alignment Software Version 7: improvements in performance and usability</article-title>. <source>Mol. Biol. Evol</source>. <volume>30</volume>, <fpage>772</fpage>&#x02013;<lpage>780</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/mst010</pub-id><pub-id pub-id-type="pmid">23329690</pub-id></citation>
</ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>M.</given-names></name> <name><surname>Oh</surname> <given-names>H.-S.</given-names></name> <name><surname>Park</surname> <given-names>S.-C.</given-names></name> <name><surname>Chun</surname> <given-names>J.</given-names></name></person-group> (<year>2014</year>). <article-title>Towards a taxonomic coherence between average nucleotide identity and 16S rRNA gene sequence similarity for species demarcation of prokaryotes</article-title>. <source>Int. J. Syst. Evol. Microbiol</source>. <volume>64</volume>, <fpage>346</fpage>&#x02013;<lpage>351</lpage>. <pub-id pub-id-type="doi">10.1099/ijs.0.059774-0</pub-id><pub-id pub-id-type="pmid">24505072</pub-id></citation>
</ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Konishi</surname> <given-names>M.</given-names></name> <name><surname>Nishi</surname> <given-names>S.</given-names></name> <name><surname>Fukuoka</surname> <given-names>T.</given-names></name> <name><surname>Kitamoto</surname> <given-names>D.</given-names></name> <name><surname>Watsuji</surname> <given-names>T.-O.</given-names></name> <name><surname>Nagano</surname> <given-names>Y.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Deep-sea <italic>Rhodococcus</italic> sp. BS-15, Lacking the Phytopathogenic <italic>fas</italic> Genes, Produces a Novel Glucotriose Lipid Biosurfactant</article-title>. <source>Mar. Biotechnol</source>. <volume>16</volume>, <fpage>484</fpage>&#x02013;<lpage>493</lpage>. <pub-id pub-id-type="doi">10.1007/s10126-014-9568-x</pub-id><pub-id pub-id-type="pmid">24510374</pub-id></citation>
</ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Konstantinidis</surname> <given-names>K. T.</given-names></name> <name><surname>Ramette</surname> <given-names>A.</given-names></name> <name><surname>Tiedje</surname> <given-names>J. M.</given-names></name></person-group> (<year>2006</year>). <article-title>The bacterial species definition in the genomic era</article-title>. <source>Philos. Trans. R. Soc. Lond. B Biol. Sci</source>. <volume>361</volume>, <fpage>1929</fpage>&#x02013;<lpage>1940</lpage>. <pub-id pub-id-type="doi">10.1098/rstb.2006.1920</pub-id><pub-id pub-id-type="pmid">17062412</pub-id></citation>
</ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Konstantinidis</surname> <given-names>K. T.</given-names></name> <name><surname>Tiedje</surname> <given-names>J. M.</given-names></name></person-group> (<year>2005</year>). <article-title>Genomic insights that advance the species definition for prokaryotes</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>102</volume>, <fpage>2567</fpage>&#x02013;<lpage>2572</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0409727102</pub-id><pub-id pub-id-type="pmid">15701695</pub-id></citation>
</ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lan</surname> <given-names>R.</given-names></name> <name><surname>Reeves</surname> <given-names>P. R.</given-names></name></person-group> (<year>1996</year>). <article-title>Gene transfer is a major factor in bacterial evolution</article-title>. <source>Mol. Biol. Evol</source>. <volume>13</volume>, <fpage>47</fpage>&#x02013;<lpage>55</lpage>. <pub-id pub-id-type="doi">10.1093/oxfordjournals.molbev.a025569</pub-id><pub-id pub-id-type="pmid">8583905</pub-id></citation>
</ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Larkin</surname> <given-names>M. J.</given-names></name> <name><surname>Kulakov</surname> <given-names>L. A.</given-names></name> <name><surname>Allen</surname> <given-names>C. C. R.</given-names></name></person-group> (<year>2005</year>). <article-title>Biodegradation and <italic>Rhodococcus</italic>&#x02013;masters of catabolic versatility</article-title>. <source>Curr. Opin. Biotechnol</source>. <volume>16</volume>, <fpage>282</fpage>&#x02013;<lpage>290</lpage>. <pub-id pub-id-type="doi">10.1016/j.copbio.2005.04.007</pub-id><pub-id pub-id-type="pmid">15961029</pub-id></citation>
</ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lassalle</surname> <given-names>F.</given-names></name> <name><surname>Campillo</surname> <given-names>T.</given-names></name> <name><surname>Vial</surname> <given-names>L.</given-names></name> <name><surname>Baude</surname> <given-names>J.</given-names></name> <name><surname>Costechareyre</surname> <given-names>D.</given-names></name> <name><surname>Chapulliot</surname> <given-names>D.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Genomic species are ecological species as revealed by comparative genomics in <italic>Agrobacterium tumefaciens</italic></article-title>. <source>Genome Biol. Evol</source>. <volume>3</volume>, <fpage>762</fpage>&#x02013;<lpage>781</lpage>. <pub-id pub-id-type="doi">10.1093/gbe/evr070</pub-id><pub-id pub-id-type="pmid">21795751</pub-id></citation>
</ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Letek</surname> <given-names>M.</given-names></name> <name><surname>Gonz&#x000E1;lez</surname> <given-names>P.</given-names></name> <name><surname>Macarthur</surname> <given-names>I.</given-names></name> <name><surname>Rodr&#x000ED;guez</surname> <given-names>H.</given-names></name> <name><surname>Freeman</surname> <given-names>T. C.</given-names></name> <name><surname>Valero-Rello</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>The genome of a pathogenic <italic>Rhodococcus</italic>: cooptive virulence underpinned by key gene acquisitions</article-title>. <source>PLoS Genet</source>. <volume>6</volume>:<fpage>e1001145</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1001145</pub-id><pub-id pub-id-type="pmid">20941392</pub-id></citation>
</ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Letek</surname> <given-names>M.</given-names></name> <name><surname>Ocampo-Sosa</surname> <given-names>A. A.</given-names></name> <name><surname>Sanders</surname> <given-names>M.</given-names></name> <name><surname>Fogarty</surname> <given-names>U.</given-names></name> <name><surname>Buckley</surname> <given-names>T.</given-names></name> <name><surname>Leadon</surname> <given-names>D. P.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>Evolution of the <italic>Rhodococcus equi vap</italic> pathogenicity island seen through comparison of host-associated <italic>vapA</italic> and <italic>vapB</italic> virulence plasmids</article-title>. <source>J. Bacteriol</source>. <volume>190</volume>, <fpage>5797</fpage>&#x02013;<lpage>5805</lpage>. <pub-id pub-id-type="doi">10.1128/JB.00468-08</pub-id><pub-id pub-id-type="pmid">18606735</pub-id></citation>
</ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Letunic</surname> <given-names>I.</given-names></name> <name><surname>Bork</surname> <given-names>P.</given-names></name></person-group> (<year>2011</year>). <article-title>Interactive Tree Of Life v2: online annotation and display of phylogenetic trees made easy</article-title>. <source>Nucleic Acids Res</source>. <volume>39</volume>, <fpage>W475</fpage>&#x02013;<lpage>W478</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkr201</pub-id><pub-id pub-id-type="pmid">21470960</pub-id></citation>
</ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Loper</surname> <given-names>J. E.</given-names></name> <name><surname>Hassan</surname> <given-names>K. A.</given-names></name> <name><surname>Mavrodi</surname> <given-names>D. V.</given-names></name> <name><surname>Davis</surname> <given-names>E. W.</given-names></name> <name><surname>Lim</surname> <given-names>C. K.</given-names></name> <name><surname>Shaffer</surname> <given-names>B. T.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Comparative genomics of plant-associated <italic>Pseudomonas</italic> spp.: insights into diversity and inheritance of traits involved in multitrophic interactions</article-title>. <source>PLoS Genet</source>. <volume>8</volume>:<fpage>e1002784</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1002784</pub-id><pub-id pub-id-type="pmid">22792073</pub-id></citation>
</ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marakeby</surname> <given-names>H.</given-names></name> <name><surname>Badr</surname> <given-names>E.</given-names></name> <name><surname>Torkey</surname> <given-names>H.</given-names></name> <name><surname>Song</surname> <given-names>Y.</given-names></name> <name><surname>Leman</surname> <given-names>S.</given-names></name> <name><surname>Monteil</surname> <given-names>C. L.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>A system to automatically classify and name any individual genome-sequenced organism independently of current biological classification and nomenclature</article-title>. <source>PLoS ONE</source> <volume>9</volume>:<fpage>e89142</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0089142</pub-id><pub-id pub-id-type="pmid">24586551</pub-id></citation>
</ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>McLeod</surname> <given-names>M. P.</given-names></name> <name><surname>Warren</surname> <given-names>R. L.</given-names></name> <name><surname>Hsiao</surname> <given-names>W. W. L.</given-names></name> <name><surname>Araki</surname> <given-names>N.</given-names></name> <name><surname>Myhre</surname> <given-names>M.</given-names></name> <name><surname>Fernandes</surname> <given-names>C.</given-names></name> <etal/></person-group>. (<year>2006</year>). <article-title>The complete genome of <italic>Rhodococcus</italic> sp. RHA1 provides insights into a catabolic powerhouse</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>103</volume>, <fpage>15582</fpage>&#x02013;<lpage>15587</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0607048103</pub-id><pub-id pub-id-type="pmid">17030794</pub-id></citation>
</ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Miteva</surname> <given-names>V. I.</given-names></name> <name><surname>Sheridan</surname> <given-names>P. P.</given-names></name> <name><surname>Brenchley</surname> <given-names>J. E.</given-names></name></person-group> (<year>2004</year>). <article-title>Phylogenetic and physiological diversity of microorganisms isolated from a deep Greenland glacier ice core</article-title>. <source>Appl. Environ. Microbiol</source>. <volume>70</volume>, <fpage>202</fpage>&#x02013;<lpage>213</lpage>. <pub-id pub-id-type="doi">10.1128/AEM.70.1.202-213.2004</pub-id><pub-id pub-id-type="pmid">14711643</pub-id></citation>
</ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Na</surname> <given-names>K.-S.</given-names></name> <name><surname>Kuroda</surname> <given-names>A.</given-names></name> <name><surname>Takiguchi</surname> <given-names>N.</given-names></name> <name><surname>Ikeda</surname> <given-names>T.</given-names></name> <name><surname>Ohtake</surname> <given-names>H.</given-names></name> <name><surname>Kato</surname> <given-names>J.</given-names></name></person-group> (<year>2005</year>). <article-title>Isolation and characterization of benzene-tolerant <italic>Rhodococcusopacus</italic> strains</article-title>. <source>J. Biosci. Bioeng</source>. <volume>99</volume>, <fpage>378</fpage>&#x02013;<lpage>382</lpage>. <pub-id pub-id-type="doi">10.1263/jbb.99.378</pub-id><pub-id pub-id-type="pmid">16233805</pub-id></citation>
</ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Perez</surname> <given-names>F.</given-names></name> <name><surname>Granger</surname> <given-names>B. E.</given-names></name></person-group> (<year>2007</year>). <article-title>IPython: a system for interactive scientific computing</article-title>. <source>Comput. Sci. Eng</source>. <volume>9</volume>, <fpage>21</fpage>&#x02013;<lpage>29</lpage>. <pub-id pub-id-type="doi">10.1109/MCSE.2007.53</pub-id></citation>
</ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Plotkin</surname> <given-names>J. B.</given-names></name> <name><surname>Kudla</surname> <given-names>G.</given-names></name></person-group> (<year>2011</year>). <article-title>Synonymous but not the same: the causes and consequences of codon bias</article-title>. <source>Nat. Rev. Genet</source>. <volume>12</volume>, <fpage>32</fpage>&#x02013;<lpage>42</lpage>. <pub-id pub-id-type="doi">10.1038/nrg2899</pub-id></citation>
</ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Putnam</surname> <given-names>M. L.</given-names></name> <name><surname>Miller</surname> <given-names>M. L.</given-names></name></person-group> (<year>2007</year>). <article-title><italic>Rhodococcusfascians</italic> in herbaceous perennials</article-title>. <source>Plant Dis</source>. <volume>91</volume>, <fpage>1064</fpage>&#x02013;<lpage>1076</lpage>. <pub-id pub-id-type="doi">10.1094/PDIS-91-9-1064</pub-id></citation>
</ref>
<ref id="B42">
<citation citation-type="web"><person-group person-group-type="author"><collab>R Core Team.</collab></person-group> (<year>2013</year>). <source>R: A Language and Environment for Statistical Computing</source>. R Foundation for Statistical Computing. Available online at: <ext-link ext-link-type="uri" xlink:href="http://www.R-project.org">http://www.R-project.org</ext-link></citation>
</ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Richter</surname> <given-names>M.</given-names></name> <name><surname>Rossell&#x000F3;-M&#x000F3;ra</surname> <given-names>R.</given-names></name></person-group> (<year>2009</year>). <article-title>Shifting the genomic gold standard for the prokaryotic species definition</article-title>. <source>Proc. Nat. Acad. Sci. U.S.A</source>. <volume>106</volume>, <fpage>19126</fpage>&#x02013;<lpage>19131</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0906412106</pub-id><pub-id pub-id-type="pmid">19855009</pub-id></citation>
</ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rissman</surname> <given-names>A. I.</given-names></name> <name><surname>Mau</surname> <given-names>B.</given-names></name> <name><surname>Biehl</surname> <given-names>B. S.</given-names></name> <name><surname>Darling</surname> <given-names>A. E.</given-names></name> <name><surname>Glasner</surname> <given-names>J. D.</given-names></name> <name><surname>Perna</surname> <given-names>N. T.</given-names></name></person-group> (<year>2009</year>). <article-title>Reordering contigs of draft genomes using the Mauve aligner</article-title>. <source>Bioinformatics</source> <volume>25</volume>, <fpage>2071</fpage>&#x02013;<lpage>2073</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btp356</pub-id><pub-id pub-id-type="pmid">19515959</pub-id></citation>
</ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Robbertse</surname> <given-names>B.</given-names></name> <name><surname>Yoder</surname> <given-names>R. J.</given-names></name> <name><surname>Boyd</surname> <given-names>A.</given-names></name> <name><surname>Reeves</surname> <given-names>J.</given-names></name> <name><surname>Spatafora</surname> <given-names>J. W.</given-names></name></person-group> (<year>2011</year>). <article-title>Hal: an automated pipeline for phylogenetic analyses of genomic data</article-title>. <source>PLoS Curr</source>. <volume>3</volume>:<fpage>RRN1213</fpage>. <pub-id pub-id-type="doi">10.1371/currents.RRN1213</pub-id><pub-id pub-id-type="pmid">21327165</pub-id></citation>
</ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Seemann</surname> <given-names>T.</given-names></name></person-group> (<year>2014</year>). <article-title>Prokka: rapid prokaryotic genome annotation</article-title>. <source>Bioinformatics</source> <volume>30</volume>, <fpage>2068</fpage>&#x02013;<lpage>2069</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btu153</pub-id><pub-id pub-id-type="pmid">24642063</pub-id></citation>
</ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sekine</surname> <given-names>M.</given-names></name> <name><surname>Tanikawa</surname> <given-names>S.</given-names></name> <name><surname>Omata</surname> <given-names>S.</given-names></name> <name><surname>Saito</surname> <given-names>M.</given-names></name> <name><surname>Fujisawa</surname> <given-names>T.</given-names></name> <name><surname>Tsukatani</surname> <given-names>N.</given-names></name> <etal/></person-group>. (<year>2006</year>). <article-title>Sequence analysis of three plasmids harboured in <italic>Rhodococcus erythropolis</italic> strain PR4</article-title>. <source>Environ. Microbiol</source>. <volume>8</volume>, <fpage>334</fpage>&#x02013;<lpage>346</lpage>. <pub-id pub-id-type="doi">10.1111/j.1462-2920.2005.00899.x</pub-id><pub-id pub-id-type="pmid">16423019</pub-id></citation>
</ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sharp</surname> <given-names>P. M.</given-names></name> <name><surname>Li</surname> <given-names>W. H.</given-names></name></person-group> (<year>1987</year>). <article-title>The codon Adaptation Index&#x02013;a measure of directional synonymous codon usage bias, and its potential applications</article-title>. <source>Nucleic Acids Res</source>. <volume>15</volume>, <fpage>1281</fpage>&#x02013;<lpage>1295</lpage>. <pub-id pub-id-type="doi">10.1093/nar/15.3.1281</pub-id><pub-id pub-id-type="pmid">3547335</pub-id></citation>
</ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sheng</surname> <given-names>H. M.</given-names></name> <name><surname>Gao</surname> <given-names>H. S.</given-names></name> <name><surname>Xue</surname> <given-names>L. G.</given-names></name> <name><surname>Ding</surname> <given-names>S.</given-names></name> <name><surname>Song</surname> <given-names>C. L.</given-names></name> <name><surname>Feng</surname> <given-names>H. Y.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Analysis of the composition and characteristics of culturable endophytic bacteria within subnival plants of the Tianshan Mountains, northwestern China</article-title>. <source>Curr. Microbiol</source>. <volume>62</volume>, <fpage>923</fpage>&#x02013;<lpage>932</lpage>. <pub-id pub-id-type="doi">10.1007/s00284-010-9800-5</pub-id><pub-id pub-id-type="pmid">21061126</pub-id></citation>
</ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shevtsov</surname> <given-names>A.</given-names></name> <name><surname>Tarlykov</surname> <given-names>P.</given-names></name> <name><surname>Zholdybayeva</surname> <given-names>E.</given-names></name> <name><surname>Momynkulov</surname> <given-names>D.</given-names></name> <name><surname>Sarsenova</surname> <given-names>A.</given-names></name> <name><surname>Moldagulova</surname> <given-names>N.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Draft genome sequence of <italic>Rhodococcus erythropolis</italic> DN1, a crude oil biodegrader</article-title>. <source>Genome Announc</source>. <volume>1</volume>:<fpage>e00846-13</fpage>. <pub-id pub-id-type="doi">10.1128/genomeA.00846-13</pub-id><pub-id pub-id-type="pmid">24136850</pub-id></citation>
</ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sonnhammer</surname> <given-names>E. L. L.</given-names></name> <name><surname>Hollich</surname> <given-names>V.</given-names></name></person-group> (<year>2005</year>). <article-title>Scoredist: a simple and robust protein sequence distance estimator</article-title>. <source>BMC Bioinformatics</source> <volume>6</volume>:<fpage>108</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-6-108</pub-id><pub-id pub-id-type="pmid">15857510</pub-id></citation>
</ref>
<ref id="B52">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Staley</surname> <given-names>J. T.</given-names></name></person-group> (<year>2009</year>). <article-title>The phylogenomic species concept for Bacteria and Archaea</article-title>. <source>Microbe</source> <volume>4</volume>, <fpage>361</fpage>&#x02013;<lpage>365</lpage>.</citation>
</ref>
<ref id="B53">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stamatakis</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies</article-title>. <source>Bioinformatics</source> <volume>30</volume>, <fpage>1312</fpage>&#x02013;<lpage>1313</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btu033</pub-id><pub-id pub-id-type="pmid">24451623</pub-id></citation>
</ref>
<ref id="B54">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stes</surname> <given-names>E.</given-names></name> <name><surname>Vandeputte</surname> <given-names>O. M.</given-names></name> <name><surname>El Jaziri</surname> <given-names>M.</given-names></name> <name><surname>Holsters</surname> <given-names>M.</given-names></name> <name><surname>Vereecke</surname> <given-names>D.</given-names></name></person-group> (<year>2011</year>). <article-title>A successful bacterial coup d&#x00027;&#x000E9;tat: how <italic>Rhodococcus fascians</italic> redirects plant development</article-title>. <source>Annu. Rev. Phytopathol</source>. <volume>49</volume>, <fpage>69</fpage>&#x02013;<lpage>86</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-phyto-072910-095217</pub-id><pub-id pub-id-type="pmid">21495844</pub-id></citation>
</ref>
<ref id="B55">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Teeling</surname> <given-names>H.</given-names></name> <name><surname>Waldmann</surname> <given-names>J.</given-names></name> <name><surname>Lombardot</surname> <given-names>T.</given-names></name> <name><surname>Bauer</surname> <given-names>M.</given-names></name> <name><surname>Gl&#x000F6;ckner</surname> <given-names>F. O.</given-names></name></person-group> (<year>2004</year>). <article-title>TETRA: a web-service and a stand-alone program for the analysis and comparison of tetranucleotide usage patterns in DNA sequences</article-title>. <source>BMC Bioinformatics</source> <volume>5</volume>:<fpage>163</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-5-163</pub-id><pub-id pub-id-type="pmid">15507136</pub-id></citation>
</ref>
<ref id="B56">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>van der Wolf</surname> <given-names>J. M.</given-names></name> <name><surname>Nijhuis</surname> <given-names>E. H.</given-names></name> <name><surname>Kowalewska</surname> <given-names>M. J.</given-names></name> <name><surname>Saddler</surname> <given-names>G. S.</given-names></name> <name><surname>Parkinson</surname> <given-names>N.</given-names></name> <name><surname>Elphinstone</surname> <given-names>J. G.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title><italic>Dickeya solani</italic> sp. nov., a pectinolytic plant-pathogenic bacterium isolated from potato (<italic>Solanum tuberosum</italic>)</article-title>. <source>Int. J. Syst. Evol. Microbiol</source>. <volume>64</volume>, <fpage>768</fpage>&#x02013;<lpage>774</lpage>. <pub-id pub-id-type="doi">10.1099/ijs.0.052944-0</pub-id><pub-id pub-id-type="pmid">24225027</pub-id></citation>
</ref>
<ref id="B57">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Verma</surname> <given-names>M.</given-names></name> <name><surname>Lal</surname> <given-names>D.</given-names></name> <name><surname>Kaur</surname> <given-names>J.</given-names></name> <name><surname>Saxena</surname> <given-names>A.</given-names></name> <name><surname>Kaur</surname> <given-names>J.</given-names></name> <name><surname>Anand</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Phylogenetic analyses of phylum actinobacteria based on whole genome sequences</article-title>. <source>Res. Microbiol</source>. <volume>164</volume>, <fpage>718</fpage>&#x02013;<lpage>728</lpage>. <pub-id pub-id-type="doi">10.1016/j.resmic.2013.04.002</pub-id><pub-id pub-id-type="pmid">23608518</pub-id></citation>
</ref>
<ref id="B58">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>von Bargen</surname> <given-names>K.</given-names></name> <name><surname>Haas</surname> <given-names>A.</given-names></name></person-group> (<year>2009</year>). <article-title>Molecular and infection biology of the horse pathogen <italic>Rhodococcus equi</italic></article-title>. <source>FEMS Microbiol. Rev</source>. <volume>33</volume>, <fpage>870</fpage>&#x02013;<lpage>891</lpage>. <pub-id pub-id-type="doi">10.1111/j.1574-6976.2009.00181.x</pub-id><pub-id pub-id-type="pmid">19453748</pub-id></citation>
</ref>
<ref id="B59">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zerbino</surname> <given-names>D. R.</given-names></name> <name><surname>Birney</surname> <given-names>E.</given-names></name></person-group> (<year>2008</year>). <article-title>Velvet: algorithms for de novo short read assembly using de Bruijn graphs</article-title>. <source>Genome Res</source>. <volume>18</volume>, <fpage>821</fpage>&#x02013;<lpage>829</lpage>. <pub-id pub-id-type="doi">10.1101/gr.074492.107</pub-id><pub-id pub-id-type="pmid">18349386</pub-id></citation>
</ref>
</ref-list>
</back>
</article>