<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Microbio.</journal-id>
<journal-title>Frontiers in Microbiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Microbio.</abbrev-journal-title>
<issn pub-type="epub">1664-302X</issn>
<publisher>
<publisher-name>Frontiers Research Foundation</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmicb.2012.00252</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Microbiology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Global Biodiversity of Aquatic Ammonia-Oxidizing Archaea is Partitioned by Habitat</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Biller</surname> <given-names>Steven J.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn002"><sup>&#x02020;</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Mosier</surname> <given-names>Annika C.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn002"><sup>&#x02020;</sup></xref>
<xref ref-type="author-notes" rid="fn003"><sup>&#x02021;</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Wells</surname> <given-names>George F.</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="author-notes" rid="fn002"><sup>&#x02020;</sup></xref>
<xref ref-type="author-notes" rid="fn003"><sup>&#x02021;</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Francis</surname> <given-names>Christopher A.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn001">&#x0002A;</xref>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Environmental Earth System Science, Stanford University</institution> <country>Stanford, CA, USA</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Civil and Environmental Engineering, Stanford University</institution> <country>Stanford, CA, USA</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Karla B. Heidelberg, University of Southern California, USA</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Zhe-Xue Quan, Fudan University, China; Anne Bernhard, Connecticut College, USA</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Christopher A. Francis, Department of Environmental Earth System Science, Stanford University, 473 Via Ortega, Y2E2 Building Room 140, Stanford, CA 94305-4216, USA. e-mail: <email>caf&#x00040;stanford.edu</email></p></fn>
<fn fn-type="present-address" id="fn002"><p><sup>&#x02020;</sup>Present address: Steven J. Biller, Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA; Annika C. Mosier, Department of Earth and Planetary Science, University of California, Berkeley, CA, USA; George F. Wells, Department of Process Engineering, Eawag Swiss Federal Institute of Aquatic Science and Technology, D&#x000FC;bendorf, Switzerland.</p></fn>
<fn fn-type="other" id="fn003"><p><sup>&#x02021;</sup>Annika C. Mosier and George F. Wells have contributed equally to this work.</p></fn>
<fn fn-type="other" id="fn004"><p>This article was submitted to Frontiers in Aquatic Microbiology, a specialty of Frontiers in Microbiology.</p></fn>
</author-notes>
<pub-date pub-type="epreprint">
<day>30</day>
<month>04</month>
<year>2012</year>
</pub-date>
<pub-date pub-type="epub">
<day>18</day>
<month>07</month>
<year>2012</year>
</pub-date>
<pub-date pub-type="collection">
<year>2012</year>
</pub-date>
<volume>3</volume>
<elocation-id>252</elocation-id>
<history>
<date date-type="received">
<day>10</day>
<month>04</month>
<year>2012</year>
</date>
<date date-type="accepted">
<day>27</day>
<month>06</month>
<year>2012</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2012 Biller, Mosier, Wells and Francis.</copyright-statement>
<copyright-year>2012</copyright-year>
<license license-type="open-access" xlink:href="http://www.frontiersin.org/licenseagreement"><p>This is an open-access article distributed under the terms of the <uri xlink:href="http://creativecommons.org/licenses/by/3.0/">Creative Commons Attribution License</uri>, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.</p></license>
</permissions>
<abstract>
<p>Archaea play an important role in nitrification and are, thus, inextricably linked to the global carbon and nitrogen cycles. Since the initial discovery of an ammonia monooxygenase &#x003B1;-subunit (<italic>amoA</italic>) gene associated with an archaeal metagenomic fragment, archaeal <italic>amoA</italic> sequences have been detected in a wide variety of nitrifying environments. Recent sequencing efforts have revealed extensive diversity of archaeal <italic>amoA</italic> sequences within different habitats. In this study, we have examined over 8000 <italic>amoA</italic> sequences from the literature and public databases in an effort to understand the ecological factors influencing the distribution and diversity of ammonia-oxidizing archaea (AOA), with a particular focus on sequences from aquatic habitats. This broad survey provides strong statistical support for the hypothesis that different environments contain distinct clusters of AOA <italic>amoA</italic> sequences, as surprisingly few sequences are found in more than one habitat type. Within aquatic environments, salinity, depth in the water column, and temperature were significantly correlated with the distribution of sequence types. These findings support the existence of multiple distinct aquatic AOA populations in the environment and suggest some possible selective pressures driving the partitioning of AOA <italic>amoA</italic> diversity.</p>
</abstract>
<kwd-group>
<kwd>biogeography</kwd>
<kwd>ammonia-oxidizing archaea</kwd>
<kwd>nitrification</kwd>
<kwd>Thaumarchaeota</kwd>
<kwd><italic>amoA</italic></kwd>
</kwd-group>
<counts>
<fig-count count="6"/>
<table-count count="5"/>
<equation-count count="0"/>
<ref-count count="55"/>
<page-count count="15"/>
<word-count count="9494"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="introduction">
<title>Introduction</title>
<p>Nitrification &#x02013; the two-step oxidation of ammonia (NH<sub>3</sub>) to nitrate <inline-formula><mml:math id="M1"><mml:mrow><mml:msubsup><mml:mrow><mml:mtext>NO</mml:mtext></mml:mrow><mml:mn>3</mml:mn><mml:mtext>&#x02212;</mml:mtext></mml:msubsup></mml:mrow></mml:math></inline-formula> via nitrite <inline-formula><mml:math id="M2"><mml:mrow><mml:msubsup><mml:mrow><mml:mtext>NO</mml:mtext></mml:mrow><mml:mn>2</mml:mn><mml:mtext>&#x02212;</mml:mtext></mml:msubsup></mml:mrow></mml:math></inline-formula> &#x02013; is a critical component of the global nitrogen cycle. While bacteria were long thought to be the sole organisms capable of carrying out nitrification, there is now considerable evidence that members of the Archaeal domain are also capable of carrying out the first step of nitrification, the oxidation of NH<sub>3</sub> to <inline-formula><mml:math id="M3"><mml:mrow><mml:msubsup><mml:mrow><mml:mtext>NO</mml:mtext></mml:mrow><mml:mn>2</mml:mn><mml:mtext>&#x02212;</mml:mtext></mml:msubsup></mml:mrow></mml:math></inline-formula> (reviewed by Francis et al., <xref ref-type="bibr" rid="B9">2007</xref>). Ammonia-oxidizing archaea (AOA) are now known to comprise a diverse and seemingly ubiquitous group of microorganisms that may make a substantial contribution to the global nitrogen and carbon cycles. Intriguingly, recent studies have shown that the AOA belong to a new phylum of Archaea, the Thaumarchaea (Brochier-Armanet et al., <xref ref-type="bibr" rid="B5">2008</xref>).</p>
<p>The <italic>amoA</italic> gene has proven to be a useful molecular marker for aerobic ammonia oxidizers in the environment. <italic>amoA</italic> encodes the &#x003B1;-subunit of the ammonia monooxygenase enzyme, which catalyzes the initial and rate-limiting step in nitrification. The first suggestion that ammonia oxidation could occur within the Archaea came from the discovery of <italic>amoA</italic> homologs associated with crenarchaeal metagenomic fragments (Venter et al., <xref ref-type="bibr" rid="B51">2004</xref>; Treusch et al., <xref ref-type="bibr" rid="B50">2005</xref>). Definitive evidence for ammonia oxidation capability within the Archaeal domain has come from the successful enrichment and cultivation of AOA, including <italic>Nitrosopumilus maritimus</italic> (K&#x000F6;nneke et al., <xref ref-type="bibr" rid="B23">2005</xref>), <italic>Nitrososphaera gargensis</italic> (Hatzenpichler et al., <xref ref-type="bibr" rid="B16">2008</xref>), <italic>Nitrosocaldus yellowstonii</italic> (De La Torre et al., <xref ref-type="bibr" rid="B7">2008</xref>), <italic>Nitrosoarchaeum limnia</italic> (Blainey et al., <xref ref-type="bibr" rid="B4">2011</xref>; Mosier et al., <xref ref-type="bibr" rid="B37">2012</xref>), <italic>Nitrosotalea devanaterra</italic> (Lehtovirta-Morley et al., <xref ref-type="bibr" rid="B27">2011</xref>), <italic>Nitrososphaera viennensis</italic> (Tourna et al., <xref ref-type="bibr" rid="B49">2011</xref>), and <italic>Nitrosoarchaeum koreensis</italic> (Jung et al., <xref ref-type="bibr" rid="B21">2011</xref>).</p>
<p>Numerous studies have provided evidence that archaeal <italic>amoA</italic> sequences are present, and often prevalent, in diverse nitrifying environments such as estuarine sediments, soils, and in the ocean water column (e.g., Francis et al., <xref ref-type="bibr" rid="B10">2005</xref>; Leininger et al., <xref ref-type="bibr" rid="B28">2006</xref>; Park et al., <xref ref-type="bibr" rid="B40">2006</xref>; He et al., <xref ref-type="bibr" rid="B17">2007</xref>; Mincer et al., <xref ref-type="bibr" rid="B33">2007</xref>; Agogu&#x000E9; et al., <xref ref-type="bibr" rid="B1">2008</xref>; Beman et al., <xref ref-type="bibr" rid="B2">2008</xref>; Hansel et al., <xref ref-type="bibr" rid="B15">2008</xref>; Mosier and Francis, <xref ref-type="bibr" rid="B35">2008</xref>; Santoro et al., <xref ref-type="bibr" rid="B44">2008</xref>; Jia and Conrad, <xref ref-type="bibr" rid="B20">2009</xref>; Kalanetra et al., <xref ref-type="bibr" rid="B22">2009</xref>; Moin et al., <xref ref-type="bibr" rid="B34">2009</xref>; Wells et al., <xref ref-type="bibr" rid="B53">2009</xref>). Studies have shown that the majority of Marine Group I (MGI) Thaumarchaea contain at least one copy of this gene, underscoring the abundance and likely functional importance of <italic>amoA</italic> in the marine environment (Mincer et al., <xref ref-type="bibr" rid="B33">2007</xref>; Beman et al., <xref ref-type="bibr" rid="B3">2010</xref>; Church et al., <xref ref-type="bibr" rid="B6">2010</xref>; Santoro et al., <xref ref-type="bibr" rid="B45">2010</xref>). While archaeal <italic>amoA</italic> sequences tend to cluster into a few major phylogenetic groups, each of those clades harbors significant fine-scale diversity (Gubry-Rangin et al., <xref ref-type="bibr" rid="B11">2011</xref>; Pester et al., <xref ref-type="bibr" rid="B41">2012</xref>). In environments where the relative diversity of <italic>amoA</italic> sequences from AOA and ammonia-oxidizing bacteria (AOB) have been compared, AOA <italic>amoA</italic> diversity is usually observed to be much higher than that of AOB <italic>amoA</italic> (e.g., Mosier and Francis, <xref ref-type="bibr" rid="B35">2008</xref>; Santoro et al., <xref ref-type="bibr" rid="B44">2008</xref>; Wankel et al., <xref ref-type="bibr" rid="B52">2011</xref>), although the reasons for this difference are not clear.</p>
<p>Most AOA are thought to be chemolithoautotrophs, but recent work in wastewater treatment plants has suggested that some may be capable of oxidizing organic compounds as energy and carbon sources (Mussmann et al., <xref ref-type="bibr" rid="B38">2011</xref>). Field studies have demonstrated the functional importance of Thaumarchaea in ammonia oxidation in both aquatic and soil environments (Leininger et al., <xref ref-type="bibr" rid="B28">2006</xref>; Lam et al., <xref ref-type="bibr" rid="B26">2009</xref>; Santoro et al., <xref ref-type="bibr" rid="B45">2010</xref>; Zhang et al., <xref ref-type="bibr" rid="B55">2010</xref>), although the presence of archaeal <italic>amoA</italic> genes has not always been found to correlate with active nitrification <italic>in situ</italic> (Mussmann et al., <xref ref-type="bibr" rid="B38">2011</xref>). The relationship between the relative abundance and activity of AOA and AOB appears to be complex, and depends at least in part on salinity and other environmental variables (e.g., Erguder et al., <xref ref-type="bibr" rid="B8">2009</xref>, and references therein).</p>
<p>Inspection of <italic>amoA</italic> phylogenies has suggested that sequences tend to cluster with others from the same environment, but the significance of this association has remained largely unclear. In this study, we sought to determine the importance of habitat type in explaining <italic>amoA</italic> phylogeny and to investigate environmental and ecological factors that may be responsible for partitioning AOA <italic>amoA</italic> genotypic diversity. To this end, we used a bioinformatic approach to characterize the diversity and distribution of over 8000 AOA <italic>amoA</italic> sequences from the GenBank database, with a particular focus on <italic>amoA</italic> sequences from aquatic habitats. Multiple independent approaches, including a complete phylogenetic analysis of the entire dataset of unique sequences, support the hypothesis that different environmental habitats harbor distinct and largely coherent groups of <italic>amoA</italic> sequences. We find a significant role for salinity, temperature, and other environmental parameters in partitioning AOA diversity in aquatic habitats. Together, these results indicate the existence of largely distinct populations of AOA occupying different habitats in the environment.</p>
</sec>
<sec sec-type="materials|methods">
<title>Materials and Methods</title>
<sec>
<title><italic>amoA</italic> sequence dataset</title>
<p>Archaeal <italic>amoA</italic> sequences were extracted from GenBank (release 175) by searching for records identified as environmental samples containing the search terms &#x0201C;<italic>amoA</italic>&#x0201D; and &#x0201C;uncultured ammonia-oxidizing archaeon,&#x0201D; &#x0201C;uncultured archaeon,&#x0201D; or &#x0201C;uncultured crenarchaeote.&#x0201D; Metadata for each sequence was either downloaded directly from GenBank or manually annotated by referring to the original publication (where available). We assigned each sequence to one of 13 habitat categories: aquaria and biofilters, caves, coastal sediments, coral and sponges, groundwater (including groundwater treatment), hot springs, hydrothermal vents, lakes and rivers, marine sediments, seas, soils, water column (marine), and wastewater treatment (including wastewater treatment plants, activated sludge, and bioreactors).</p>
<p>Archaeal <italic>amoA</italic> sequences were aligned in ARB (Ludwig et al., <xref ref-type="bibr" rid="B31">2004</xref>) using a seed-alignment constructed in MEGA v4.0.2 (Tamura et al., <xref ref-type="bibr" rid="B48">2007</xref>) based on nucleotide sequences. Four partial-length betaproteobacterial ammonia-oxidizing bacterial <italic>amoA</italic> sequences [<italic>Nitrosomonas europaea</italic> (AF058691), <italic>Nitrosospira briensis</italic> (U76553), <italic>Nitrosospira multiformis</italic> (AF042171), <italic>Nitrosomonas cryotolerans</italic> (AF314753)], one partial-length gammaproteobacterial ammonia-oxidizing bacterial <italic>amoA</italic> sequence [<italic>Nitrosococcus oceanus</italic> (AF047705)], and two partial-length methane-oxidizing bacterial <italic>pmoA</italic> sequences [<italic>Methylosinus trichosporium</italic> OB3b (U31650) and <italic>Methylococcus capsulatus</italic> (L40804)] were aligned against amino acid translations of selected archaeal <italic>amoA</italic> sequences in Geneious v4.8.5 (Biomatters Ltd., Auckland, New Zealand) for use as an outgroup. The alignment was trimmed to a final length of 534&#x02009;bp to maximize the number of sequences included in the final dataset while still removing low-quality sequence ends. Sequences of insufficient length (e.g., DGGE bands) or with insufficient metadata were discarded.</p>
</sec>
<sec>
<title>Phylogenetic tree construction</title>
<p>Phylogeny was inferred with RAxML v7.2.6 (Stamatakis, <xref ref-type="bibr" rid="B47">2006</xref>) as implemented in the CIPRES portal (Miller et al., <xref ref-type="bibr" rid="B32">2009</xref>). Two-hundred independent maximum likelihood inferences were run on the alignment, starting from independent randomized maximum parsimony trees; the best-scoring maximum likelihood tree was used as the final tree. Habitat associations (based on 13 manually defined categories) were mapped onto the tree using the interactive Tree of Life (iTOL) program (Letunic and Bork, <xref ref-type="bibr" rid="B29">2007</xref>).</p>
</sec>
<sec>
<title>Bioinformatic analyses</title>
<p>MOTHUR v1.11.0 (Schloss et al., <xref ref-type="bibr" rid="B46">2009</xref>) was used to determine the number of operational taxonomic units (OTUs) present in the AOA <italic>amoA</italic> dataset at varying levels of sequence identity, calculate rarefaction curves, and compute beta-diversity metrics between sequences from different habitats. LIBSHUFF analysis was also performed from within MOTHUR, using the default settings. Calculations of the Jaccard index for protein sequences were performed using a custom Python script. Due to the nature of this dataset, we did not have sufficient abundance information to calculate quantitative indices of alpha or beta-diversity that incorporate species richness information.</p>
<p>Average pairwise identities between sequences at both the nucleotide and amino acid level were carried out using a custom Python script. Tests for selection pressures on AmoA were conducted using the maximum likelihood-based SLAC methodology (Kosakovsky Pond and Frost, <xref ref-type="bibr" rid="B24">2005</xref>) as implemented in the HyPhy package (Pond et al., <xref ref-type="bibr" rid="B43">2005</xref>) and run using the web interface at <uri xlink:href="http://www.datamonkey.org">http://www.datamonkey.org</uri> (Pond and Frost, <xref ref-type="bibr" rid="B42">2005</xref>). To look for evidence of selection in the overall dataset, we analyzed representative sequences of OTUs at the 85% identity level as determined by MOTHUR; this was done both due to computational limitations and to increase the likelihood that the sequences being analyzed represented fixed lineages from distinct populations, and not simply polymorphisms within a population (Kryazhimskiy and Plotkin, <xref ref-type="bibr" rid="B25">2008</xref>). For analyses within a habitat type, sequences representing the 90% identity OTUs were used to ensure that a sufficient number of sequences were analyzed. Automatic nucleotide substitution model selection and recombination detection (using GARD, when possible) were both carried out before the SLAC analysis.</p>
<p>Sequences assigned to the &#x0201C;coastal sediments&#x0201D; and &#x0201C;lakes and rivers&#x0201D; habitats were analyzed further using AdaptML (Hunt et al., <xref ref-type="bibr" rid="B19">2008</xref>). AdaptML defines ecologically meaningful phylogenetic groups using an evolutionary hidden Markov model that identifies populations as groups of related strains sharing a common projected habitat. The default parameters were used except for our use of a more precise numerical optimization for the habitat transition rate parameter. Sequences were assigned to a habitat sub-category based on the metadata: coastal, surf zone, estuary, salt marsh, lake, heathland pool, or river. Additionally, each sequence was assigned to a high (&#x02265;15&#x02009;ppt) or low (&#x0003C;15&#x02009;ppt) salinity category. Clonal sequences were removed from the sequence dataset, except where unique sequences were found in more than one habitat type. Out of 2470 total sequences in the broad coastal sediment, lake, and river habitat categories, 1997 sequences were used in the AdaptML analysis (1962 unique sequences and 35 additional non-unique sequences representing a different sub-category). Phylogenetic trees used as an input into AdaptML were reconstructed using PhyML v.2.4.4 using the ATGC bioinformatics platform (Guindon and Gascuel, <xref ref-type="bibr" rid="B12">2003</xref>) with the following parameter settings: DNA substitution was modeled using the HKY parameter; the transition/transversion ratio was set to 4.0; PhyML estimated the proportion of invariable nucleotide sites; the gamma distribution parameter was set to 1.0; 4 gamma rate categories were used; a BIONJ tree was initially used; and branch lengths and rate parameters were optimized by PhyML. <italic>Nitrosomonas europaea</italic> (AF058691) was used as an outgroup. AdaptML output files were visualized using iTOL (Letunic and Bork, <xref ref-type="bibr" rid="B29">2007</xref>).</p>
</sec>
<sec>
<title>Statistical analyses</title>
<p>Principal components analysis was carried out using the unweighted FastUnifrac algorithm on the Unifrac website (<uri xlink:href="http://bmf2.colorado.edu/fastunifrac/index.psp">http://bmf2.colorado.edu/fastunifrac/index.psp</uri>; Hamady et al., <xref ref-type="bibr" rid="B14">2010</xref>), using the best RAxML tree and a file assigning each sequence to one of 13 different habitat groupings as input. Analyses were carried out using the default parameters or as indicated in the text. ANOSIM and perMANOVA analyses were conducted with 1000 permutations in the R statistical programming environment v2.11.1 using algorithms implemented in the package vegan v1.17 (Oksanen et al., <xref ref-type="bibr" rid="B39">2010</xref>) with distance matrices generated in MOTHUR v1.11.0 and associated metadata as inputs. Experimental factors tested were derived from sequence metadata and included habitat (categories as described above), temperature (psychrophilic, mesophilic, and thermophilic, defined as &#x0003C;15&#x000B0;C, between 15 and 40&#x000B0;C, and &#x0003E;40&#x000B0;C, respectively), salinity (low and high, defined as &#x0003C;15 and &#x02265;15&#x02009;ppt, respectively), latitude (low-, mid-, and high-latitude, defined as &#x0003C;23.4&#x000B0;, between 23.4&#x000B0; and 66.6&#x000B0;, and &#x0003E;66.6&#x000B0;, respectively), and ocean water column depth (surface, mid, and deep, corresponding to &#x0003C;199&#x02009;m, between 200&#x02013;399&#x02009;m, and &#x02265;400&#x02009;m depth, respectively).</p>
</sec>
</sec>
<sec>
<title>Results and Discussion</title>
<sec>
<title>Overview of archaeal <italic>amo</italic><italic>A</italic> sequence diversity</title>
<p>We compiled and aligned 8296 archaeal <italic>amoA</italic> sequences from GenBank, representing nearly 100 different environmental- and cultivation-based studies (published and unpublished) from around the world. The vast majority of <italic>amoA</italic> sequences came from coastal sediments (&#x0007E;30%) and soils (&#x0007E;32%). Caves, seas, hydrothermal vents, and wastewater treatment were the most underrepresented habitats in terms of total number of sequences in the database (Table <xref ref-type="table" rid="T1">1</xref>). Of the 8296 total amoA sequences within our alignment, 6203 (&#x0007E;75%) were unique. On average, <italic>amoA</italic> sequences were 76% identical to each other at the nucleotide level (Figure <xref ref-type="fig" rid="FA1">A1</xref> in Appendix; Table <xref ref-type="table" rid="T1">1</xref>). Although some of this apparent &#x0201C;diversity&#x0201D; could arise from experimental artifacts, we note that even allowing for &#x0007E;5 PCR- or sequencing-based errors in each <italic>amoA</italic> fragment (corresponding to the 99% identity level) still yields extensive diversity among AOA (Table <xref ref-type="table" rid="T1">1</xref>).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p><bold>Summary statistics for the AOA <italic>amoA</italic> sequence dataset</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left"/>
<th align="left">Total sequences</th>
<th colspan="5" align="center">Number of nucleotide sequence OTUs (identity level)<hr/></th>
<th align="left">Unique protein sequences</th>
</tr>
<tr>
<th align="left"/>
<th align="left"/>
<th align="left">Unique</th>
<th align="left">99%</th>
<th align="left">95%</th>
<th align="left">90%</th>
<th align="left">85%</th>
<th align="left"/>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Overall</td>
<td align="left">8296</td>
<td align="left">6203</td>
<td align="left">2494</td>
<td align="left">805</td>
<td align="left">315</td>
<td align="left">138</td>
<td align="left">3729</td>
</tr>
<tr>
<td align="left">Aquaria&#x02009;&#x0002B;&#x02009;Biofilters</td>
<td align="left">277</td>
<td align="left">159</td>
<td align="left">48</td>
<td align="left">18</td>
<td align="left">8</td>
<td align="left">4</td>
<td align="left">104</td>
</tr>
<tr>
<td align="left">Caves</td>
<td align="left">82</td>
<td align="left">52</td>
<td align="left">6</td>
<td align="left">5</td>
<td align="left">4</td>
<td align="left">3</td>
<td align="left">41</td>
</tr>
<tr>
<td align="left">Coastal sediments</td>
<td align="left">2459</td>
<td align="left">1947</td>
<td align="left">864</td>
<td align="left">355</td>
<td align="left">150</td>
<td align="left">66</td>
<td align="left">1296</td>
</tr>
<tr>
<td align="left">Coral&#x02009;&#x0002B;&#x02009;Sponges</td>
<td align="left">407</td>
<td align="left">282</td>
<td align="left">101</td>
<td align="left">56</td>
<td align="left">42</td>
<td align="left">30</td>
<td align="left">194</td>
</tr>
<tr>
<td align="left">Groundwater</td>
<td align="left">215</td>
<td align="left">180</td>
<td align="left">81</td>
<td align="left">26</td>
<td align="left">15</td>
<td align="left">10</td>
<td align="left">123</td>
</tr>
<tr>
<td align="left">Hot springs</td>
<td align="left">197</td>
<td align="left">183</td>
<td align="left">121</td>
<td align="left">80</td>
<td align="left">51</td>
<td align="left">31</td>
<td align="left">169</td>
</tr>
<tr>
<td align="left">Hydrothermal vents</td>
<td align="left">120</td>
<td align="left">95</td>
<td align="left">43</td>
<td align="left">25</td>
<td align="left">17</td>
<td align="left">12</td>
<td align="left">77</td>
</tr>
<tr>
<td align="left">Lakes&#x02009;&#x0002B;&#x02009;Rivers</td>
<td align="left">445</td>
<td align="left">355</td>
<td align="left">150</td>
<td align="left">72</td>
<td align="left">39</td>
<td align="left">27</td>
<td align="left">228</td>
</tr>
<tr>
<td align="left">Marine sediments</td>
<td align="left">342</td>
<td align="left">299</td>
<td align="left">232</td>
<td align="left">125</td>
<td align="left">65</td>
<td align="left">41</td>
<td align="left">243</td>
</tr>
<tr>
<td align="left">Seas</td>
<td align="left">95</td>
<td align="left">73</td>
<td align="left">22</td>
<td align="left">8</td>
<td align="left">6</td>
<td align="left">5</td>
<td align="left">52</td>
</tr>
<tr>
<td align="left">Soils</td>
<td align="left">2621</td>
<td align="left">1748</td>
<td align="left">704</td>
<td align="left">280</td>
<td align="left">137</td>
<td align="left">70</td>
<td align="left">1031</td>
</tr>
<tr>
<td align="left">Water column</td>
<td align="left">902</td>
<td align="left">767</td>
<td align="left">301</td>
<td align="left">64</td>
<td align="left">27</td>
<td align="left">16</td>
<td align="left">304</td>
</tr>
<tr>
<td align="left">Wastewater treatment</td>
<td align="left">134</td>
<td align="left">100</td>
<td align="left">36</td>
<td align="left">25</td>
<td align="left">20</td>
<td align="left">17</td>
<td align="left">84</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Pester et al. (<xref ref-type="bibr" rid="B41">2012</xref>) proposed that <italic>amoA</italic> sequences with less than 87% nucleic acid sequence identity are likely to represent two different AOA species. Bracketing this value, when grouping sequences at a 90% nucleotide identity level, we identified 315 AOA &#x0201C;species&#x0201D; across all environments sampled; at 85% identity, we observed 138 <italic>amoA</italic>-based &#x0201C;species&#x0201D; (Table <xref ref-type="table" rid="T1">1</xref>). Although the degree of genetic diversification reflecting species differentiation is controversial (for both 16S rRNA genes and functional genes such as <italic>amoA</italic>), these numbers reflect a general estimate of the overall diversity of AOA across wide-ranging environments. Although the observed number of OTUs decreased rapidly from the 99% identity level to the 95 and 90% identity levels, rarefaction analysis indicates that the unique diversity of <italic>amoA</italic> sequences observed thus far is far from saturating (Figure <xref ref-type="fig" rid="FA2">A2</xref> in Appendix). Sequences from coastal sediments have the highest number of observed OTUs (at all identity levels), followed by soils. Caves and seas have the fewest observed OTUs.</p>
<p>The diversity of <italic>amoA</italic> sequences from hot springs and marine sediments appear to be the most undersampled, based on the high percentage of unique nucleotide sequences and OTUs (at all levels) relative to the total number of sequences (Table <xref ref-type="table" rid="T1">1</xref>). Conversely, rarefaction analysis suggests that the diversity within aquaria and caves appear to be the most oversampled (Figure <xref ref-type="fig" rid="FA2">A2</xref> in Appendix); however, it is important to note that only a few studies have looked at AOA from these environments. It is possible, if not likely, that additional data from other types of aquaria, biofilters, or caves would increase the diversity of sequences observed within these groups. While the primers used to amplify <italic>amoA</italic> can potentially affect the relative diversity observed in these habitats, our analysis did not exclude any particular primer set. Overall, AOA <italic>amoA</italic> diversity appears to have been well sampled in the literature, but this analysis suggests that the number of undiscovered <italic>amoA</italic> sequence types in the environment is still potentially vast.</p>
</sec>
<sec>
<title>Different environments contain distinct groups of <italic>amo</italic><italic>A</italic> sequence types</title>
<p>Early studies of archaeal <italic>amoA</italic> diversity (Francis et al., <xref ref-type="bibr" rid="B10">2005</xref>) suggested that <italic>amoA</italic> sequences from sediment and water column samples formed distinct phylogenetic clusters. We sought to determine whether this pattern could still be observed in a more current and significantly larger <italic>amoA</italic> dataset. Habitat assignments were mapped onto a maximum likelihood phylogenetic tree of all 6203 unique <italic>amoA</italic> sequences (Figure <xref ref-type="fig" rid="F1">1</xref>). Four major clusters were evident in the tree: one representing sequences from coastal and marine sediments, one from soils, and two groups from the marine water column. While the groups were not explicitly defined by habitat definition, the sequences were strongly correlated with environment. For example, within the two marine water column clusters, nearly all sequences that did not come from that environment instead came from other marine habitats: corals, seas, hydrothermal vents, and marine sediments. Water column sequences were rarely observed outside of the two water column groups. Coral sequences fell predominantly in the sediment and water column cluster. Most of the hot spring and wastewater treatment sequences grouped within the soil cluster, whereas most of the other habitat types grouped predominantly within the sediment cluster. Statistical analyses support the idea that there is a strong correlation between <italic>amoA</italic> phylogenetic and ecological differentiation. Both a <italic>P</italic>-test and overall Unifrac significance test indicated that there was significant clustering of sequences as grouped by habitat on the tree (<italic>P</italic>-test <italic>P</italic>&#x02009;&#x0003D;&#x02009;0; Unifrac test <italic>P</italic>&#x02009;&#x0003C;&#x02009;0.001). In addition, each of the 13 habitat-defined groups of <italic>amoA</italic> sequences was significantly different from all others by the LIBSHUFF test (<italic>P</italic>&#x02009;&#x0003C;&#x02009;0.0001).</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p><bold>Maximum likelihood phylogeny and habitat distribution among all 6203 unique AOA <italic>amoA</italic> sequences</bold>. Colored bars in the outer ring correspond to the habitat assignment for each individual sequence. The bacterial <italic>amoA</italic> out group has been excluded from the tree for visualization purposes.</p></caption>
<graphic xlink:href="fmicb-03-00252-g001.tif"/>
</fig>
<p>We used Unifrac distances between <italic>amoA</italic> sequences from each habitat to better understand the phylogenetic relationships between these groups. Unifrac provides a measure of the amount of evolution in a tree that is confined to a specific habitat grouping, or how much more unique branch length is attributed to a particular environment than would be expected by chance. Analysis of Unifrac distances between individual habitat types indicated that sequences found in coastal sediments, corals and sponges, groundwater, hot springs, marine sediments, and soils were significantly clustered on the phylogenetic tree (<italic>P</italic>&#x02009;&#x0003C;&#x02009;0.001 for all except groundwater, <italic>P</italic>&#x02009;&#x0003D;&#x02009;0.013). This result corroborates previous findings that suggested the presence of distinct groupings from coastal sediments and soil habitats (e.g., Francis et al., <xref ref-type="bibr" rid="B10">2005</xref>) and supports the hypothesis that different environmental habitats tend to contain distinct groups of <italic>amoA</italic> sequence types. In contrast to the LIBSHUFF results, <italic>amoA</italic> sequences from aquaria and biofilters, caves, hydrothermal vents, lakes, and rivers, seas, wastewater treatment, and the water column categories were not distinct from all others by the Unifrac test. This may reflect artifacts of our metadata-based habitat definitions not truly reflecting environmentally relevant groupings, the number of sequences examined per category, or result from some shared selective pressures on <italic>amoA</italic> among these groups.</p>
<p>&#x003B2;-diversity analysis of the distribution of <italic>amoA</italic> OTUs also indicated that <italic>amoA</italic> sequence types are strongly partitioned by environment. Pairwise comparisons of the number of OTUs shared between any two habitats (using the Jaccard index) showed that, even at a coarse 90% identity level, many environments had no OTUs in common; at most, 23% of the total OTUs observed in any two habitats were found in both (Table <xref ref-type="table" rid="T2">2</xref>). The proportion of shared OTUs dropped markedly at the 95 and 99% identity level (less than 14 and 3%, respectively, of the total OTUs found in any two habitats were shared; Table <xref ref-type="table" rid="TA1">A1</xref> in Appendix). ANOSIM (<italic>R</italic>&#x02009;&#x0003D;&#x02009;0.378, <italic>P</italic>&#x02009;&#x0003D;&#x02009;0.001) and perMANOVA (<italic>R</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.221, <italic>P</italic>&#x02009;&#x0003D;&#x02009;0.001) analyses confirmed highly significant habitat partitioning among archaeal <italic>amoA</italic> sequence types (Table <xref ref-type="table" rid="T3">3</xref>; Table <xref ref-type="table" rid="TA2">A2</xref> in Appendix). Taken together, these results suggest that groups of AOA (as defined by their <italic>amoA</italic> sequence) found in different environments are significantly different from one another.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p><bold>&#x003B2;-Diversity among environment types</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left"/>
<th align="left">Aquaria&#x02009;&#x0002B;&#x02009;biofilters</th>
<th align="left">Caves</th>
<th align="left">Coastal sediments</th>
<th align="left">Coral&#x02009;&#x0002B;&#x02009;sponges</th>
<th align="left">Groundwater</th>
<th align="left">Hot springs</th>
<th align="left">Hydrothermal vents</th>
<th align="left">Lakes&#x02009;&#x0002B;&#x02009;rivers</th>
<th align="left">Marine sediments</th>
<th align="left">Seas</th>
<th align="left">Soils</th>
<th align="left">Ocean water column</th>
<th align="left">Wastewater treatment</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Aquaria&#x02009;&#x0002B;&#x02009;biofilters</td>
<td align="left" style="background-color:gray"/>
<td align="left">0</td>
<td align="left">0.053</td>
<td align="left">0.087</td>
<td align="left">0.045</td>
<td align="left">0</td>
<td align="left">0.042</td>
<td align="left">0</td>
<td align="left">0.090</td>
<td align="left">0.077</td>
<td align="left">0</td>
<td align="left">0.094</td>
<td align="left">0.037</td>
</tr>
<tr>
<td align="left">Caves</td>
<td align="left">0</td>
<td align="left" style="background-color:gray"/>
<td align="left">0.007</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.058</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.015</td>
<td align="left">0</td>
<td align="left">0.007</td>
<td align="left">0</td>
<td align="left">0</td>
</tr>
<tr>
<td align="left">Coastal sediments</td>
<td align="left">0.008</td>
<td align="left">0.001</td>
<td align="left" style="background-color:gray"/>
<td align="left">0.129</td>
<td align="left">0.058</td>
<td align="left">0.117</td>
<td align="left">0.037</td>
<td align="left">0.167</td>
<td align="left">0.229</td>
<td align="left">0.033</td>
<td align="left">0.221</td>
<td align="left">0.099</td>
<td align="left">0.097</td>
</tr>
<tr>
<td align="left">Coral&#x02009;&#x0002B;&#x02009;sponges</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.007</td>
<td align="left" style="background-color:gray"/>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.054</td>
<td align="left">0</td>
<td align="left">0.081</td>
<td align="left">0.067</td>
<td align="left">0</td>
<td align="left">0.113</td>
<td align="left">0.016</td>
</tr>
<tr>
<td align="left">Groundwater</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.006</td>
<td align="left">0</td>
<td align="left" style="background-color:gray"/>
<td align="left">0.031</td>
<td align="left">0</td>
<td align="left">0.125</td>
<td align="left">0.039</td>
<td align="left">0</td>
<td align="left">0.048</td>
<td align="left">0</td>
<td align="left">0.029</td>
</tr>
<tr>
<td align="left">Hot springs</td>
<td align="left">0</td>
<td align="left">0.014</td>
<td align="left">0.005</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left" style="background-color:gray"/>
<td align="left">0</td>
<td align="left">0.233</td>
<td align="left">0.084</td>
<td align="left">0</td>
<td align="left">0.182</td>
<td align="left">0.013</td>
<td align="left">0.109</td>
</tr>
<tr>
<td align="left">Hydrothermal vents</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.007</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left" style="background-color:gray"/>
<td align="left">0</td>
<td align="left">0.206</td>
<td align="left">0.095</td>
<td align="left">0</td>
<td align="left">0.158</td>
<td align="left">0</td>
</tr>
<tr>
<td align="left">Lakes&#x02009;&#x0002B;&#x02009;rivers</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.014</td>
<td align="left">0</td>
<td align="left">0.014</td>
<td align="left">0.018</td>
<td align="left">0</td>
<td align="left" style="background-color:gray"/>
<td align="left">0.118</td>
<td align="left">0</td>
<td align="left">0.205</td>
<td align="left">0.015</td>
<td align="left">0.157</td>
</tr>
<tr>
<td align="left">Marine sediments</td>
<td align="left">0.009</td>
<td align="left">0</td>
<td align="left">0.018</td>
<td align="left">0.002</td>
<td align="left">0.003</td>
<td align="left">0.01</td>
<td align="left">0.022</td>
<td align="left">0.015</td>
<td align="left" style="background-color:gray"/>
<td align="left">0.044</td>
<td align="left">0.092</td>
<td align="left">0.136</td>
<td align="left">0.076</td>
</tr>
<tr>
<td align="left">Seas</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.001</td>
<td align="left">0.008</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.008</td>
<td align="left">0</td>
<td align="left">0.003</td>
<td align="left" style="background-color:gray"/>
<td align="left">0</td>
<td align="left">0.179</td>
<td align="left">0</td>
</tr>
<tr>
<td align="left">Soils</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.022</td>
<td align="left">0.001</td>
<td align="left">0.004</td>
<td align="left">0.008</td>
<td align="left">0</td>
<td align="left">0.021</td>
<td align="left">0.014</td>
<td align="left">0</td>
<td align="left" style="background-color:gray"/>
<td align="left">0.006</td>
<td align="left">0.090</td>
</tr>
<tr>
<td align="left">Ocean water column</td>
<td align="left">0.005</td>
<td align="left">0</td>
<td align="left">0.006</td>
<td align="left">0.020</td>
<td align="left">0</td>
<td align="left">0.002</td>
<td align="left">0.016</td>
<td align="left">0.002</td>
<td align="left">0.009</td>
<td align="left">0.017</td>
<td align="left">0.001</td>
<td align="left" style="background-color:gray"/>
<td align="left">0.044</td>
</tr>
<tr>
<td align="left">Wastewater treatment</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.005</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.008</td>
<td align="left">0</td>
<td align="left">0.010</td>
<td align="left">0.009</td>
<td align="left">0</td>
<td align="left">0.005</td>
<td align="left">0.003</td>
<td align="left" style="background-color:gray"/>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>Values represent the Jaccard similarity index for (above the diagonal) <italic>amoA</italic> nucleotide OTUs at the 90% identity level and (below the diagonal) unique AmoA protein sequence types</italic>.</p>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p><bold>Environmental determinants of archaeal <italic>amoA</italic> sequence diversity</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Factor</th>
<th colspan="3" align="center">Sequences included in analysis<hr/></th>
</tr>
<tr>
<th align="left"/>
<th align="left">All sequences with associated metadata (<italic>n</italic>&#x02009;&#x0003D;&#x02009;2014)<xref ref-type="table-fn" rid="tfn1"><sup>1</sup></xref></th>
<th align="left">Marine water column sequences (<italic>n</italic>&#x02009;&#x0003D;&#x02009;290)<xref ref-type="table-fn" rid="tfn2"><sup>2</sup></xref></th>
<th align="left">Aquatic sequences (water column, groundwater, sea, lakes/rivers; <italic>n</italic>&#x02009;&#x0003D;&#x02009;984)<xref ref-type="table-fn" rid="tfn3"><sup>3</sup></xref></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Habitat (13 levels)</td>
<td align="left"><bold>0.221</bold></td>
<td align="left">NA</td>
<td align="left">NA</td>
</tr>
<tr>
<td align="left">Latitude (low, mid, high)</td>
<td align="left"><bold>0.010</bold></td>
<td align="left"><bold>0.092</bold></td>
<td align="left"><bold>0.057</bold></td>
</tr>
<tr>
<td align="left">Temperature (pychrophilic, mesophilic, thermophilic)</td>
<td align="left"><bold>0.017</bold></td>
<td align="left"><bold>0.023</bold></td>
<td align="left">NA</td>
</tr>
<tr>
<td align="left">Water depth (surface, mid, deep)</td>
<td align="left">NA</td>
<td align="left"><bold>0.335</bold></td>
<td align="left">NA</td>
</tr>
<tr>
<td align="left">Salinity (high, low)</td>
<td align="left">NA</td>
<td align="left">NA</td>
<td align="left"><bold>0.197</bold></td>
</tr>
<tr>
<td align="left">Habitat and temperature</td>
<td align="left"><bold>0.017</bold></td>
<td align="left">NA</td>
<td align="left">NA</td>
</tr>
<tr>
<td align="left">Habitat and latitude</td>
<td align="left"><bold>0.012</bold></td>
<td align="left">NA</td>
<td align="left">NA</td>
</tr>
<tr>
<td align="left">Temperature and latitude</td>
<td align="left">0.0005</td>
<td align="left">0.003</td>
<td align="left"/>
</tr>
<tr>
<td align="left">Depth and latitude</td>
<td align="left">NA</td>
<td align="left"><bold>0.019</bold></td>
<td align="left">NA</td>
</tr>
<tr>
<td align="left">Temperature and depth</td>
<td align="left">NA</td>
<td align="left"><bold>0.016</bold></td>
<td align="left">NA</td>
</tr>
<tr>
<td align="left">Latitude and salinity</td>
<td align="left">NA</td>
<td align="left">NA</td>
<td align="left"><bold>0.023</bold></td>
</tr>
<tr>
<td align="left">Total variation explained by combined factors</td>
<td align="left">0.248</td>
<td align="left">0.469</td>
<td align="left">0.277</td>
</tr>
<tr>
<td align="left">Residual variation</td>
<td align="left">0.723</td>
<td align="left">0.531</td>
<td align="left">0.723</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic><italic>R</italic><sup>2</sup> values for factors tested via perMANOVA for association with variation in <italic>amoA</italic> diversity are indicated; significant <italic>R</italic><sup>2</sup> values (<italic>P</italic>&#x02009;&#x0003C;&#x02009;0.001) are highlighted in bold. Analyses were carried out on the indicated subset of sequences for which the relevant metadata was available. Level designations for each habitat are provided; definitions for levels are given in the text. For each sequence grouping, all factors were concurrently subjected to perMANOVA. <italic>R</italic><sup>2</sup> indicates the proportion of variation each factor contributes to the total variation in the dataset</italic>.</p>
<p><italic>NA, not applicable</italic>.</p>
<fn id="tfn1"><p><italic><sup>1</sup>Of 6203 unique archaeal <italic>amoA</italic> sequences, both latitude and temperature data were available for 2014</italic>.</p></fn>
<fn id="tfn2"><p><italic><sup>2</sup>Of 902 sequences classified as water column, latitude, depth, and temperature metadata were available for 290</italic>.</p></fn>
<fn id="tfn3"><p><italic><sup>3</sup>Of 1657 water-associated sequences, latitude and salinity metadata were available for 984. Temperature designation was not included in this perMANOVA due to the fact that all 290 available sequences with temperature sub-category assignments were classified as water column sequences, and thus the perMANOVA analysis with temperature would be identical to the Water Column-specific analysis (the middle column in the table)</italic>.</p></fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec>
<title>Potential for <italic>amo</italic><italic>A</italic> functional diversity in different environments</title>
<p>The existence of distinct <italic>amoA</italic> sequences in different environments could be explained by varying selective pressures on the function of the AMO enzyme complex. To explore the potential for functional diversity within the ammonia monooxygenase &#x003B1;-subunit, we examined the distribution of AmoA sequence types at the amino acid level. The 6203 unique <italic>amoA</italic> sequences in our dataset yielded 3729 unique protein variants, which had an average pairwise identity of 86% (Figure <xref ref-type="fig" rid="FA1">A1</xref> in Appendix). Interestingly, no amino acid position was completely conserved across the entire <italic>amoA</italic> dataset. When comparing sequences from individual habitats, we found that AmoA was most similar (96%) within each of the aquaria/biofilters and sea categories; this may be attributable to the relatively constant nature of these environments. The lowest average pairwise identities within an environment were 84 and 85% from sequences in hot springs and lakes/rivers, respectively. We note that there was no correlation between the number of unique protein sequences sampled in an environment and the average amino acid identity (<italic>R</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.03). Qualitative &#x000DF;-diversity analysis of the translated AmoA protein sequences indicated that, like the nucleotide sequences, biodiversity at the amino acid level was strongly partitioned within each habitat type; no more than &#x0007E;2% of AmoA protein sequences were shared between two or more habitats (Table <xref ref-type="table" rid="T2">2</xref>).</p>
<p>To gain insight into the strength of selective pressures acting on AmoA, we calculated the ratio of non-synonymous to synonymous substitutions (dN/dS) in our dataset. Using sequences representing the 90% OTUs, the dN/dS ratio for <italic>amoA</italic> was 0.048; this indicates that deleterious <italic>amoA</italic> mutations are removed from archaeal populations through purifying (negative) selection. Furthermore, there was no evidence for positive selection at any individual codon position in our alignment (SLAC algorithm; <italic>P</italic>&#x02009;&#x0003C;&#x02009;0.05).</p>
<p>We were curious whether the selective pressures acting upon ammonia-oxidation functions might differ among environments. To address this, we calculated dN/dS ratios for AmoA sequences from each of the broad habitat categories (Figure <xref ref-type="fig" rid="FA2">A2</xref> in Appendix). While the dN/dS value was indistinguishable from the overall value in many environments, there was a small but significant increase in dN/dS (i.e., weaker purifying selection) on <italic>amoA</italic> sequences from five environments: caves, groundwater, hot springs, hydrothermal vents, and the marine water column (95% confidence intervals do not overlap with the value from the overall dataset). The relaxation of purifying selection (or increased positive selection) in these five environments could arise from changes in environmental conditions selecting for functional differences in AmoA, or a change in the expression levels or selective importance of AmoA to the overall fitness of archaea in these environments. These differences could also reflect changes in the effective population sizes that might affect the efficiency of purifying selection in these environments.</p>
</sec>
<sec>
<title>Environmental parameters influence the distribution of <italic>amo</italic><italic>A</italic> sequence types</title>
<p>Given the evidence for partitioning of <italic>amoA</italic> sequence diversity by habitat type, we wanted to understand the environmental factors that might be principally responsible for driving these patterns. As a first step, we compared the groupings of <italic>amoA</italic> sequences from different habitats using Unifrac. The largest component of the unweighted Unifrac distances between environments generally separated samples from soil- or sediment-associated environments versus samples from aquatic habitats (Figure <xref ref-type="fig" rid="F2">2</xref>A). Principal component 2 corresponded to a division between sequences from marine and freshwater/terrestrial environments. ANOSIM analyses confirmed a strong and significant divergence between sequences associated with marine versus freshwater/terrestrial environments (<italic>R</italic>&#x02009;&#x0003D;&#x02009;0.422, <italic>P</italic>&#x02009;&#x0003C;&#x02009;0.001) and moderate divergence between sequence groupings associated with aquatic versus soil or sediment environments (<italic>R</italic>&#x02009;&#x0003D;&#x02009;0.099, <italic>P</italic>&#x02009;&#x0003C;&#x02009;0.001; Table <xref ref-type="table" rid="TA2">A2</xref> in Appendix).</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p><bold>Principal components analysis of unweighted Unifrac distances between different groups of <italic>amoA</italic> sequences, as categorized by habitat from which the sequences were isolated</bold>. <bold>(A)</bold> Unifrac analysis of <italic>amoA</italic> sequences from all habitats annotated in the complete <italic>amoA</italic> sequence dataset. The first two principal components account for 49% of the variation between all habitats. Unifrac analysis was conducted based on the maximum likelihood tree shown in Figure <xref ref-type="fig" rid="F1">1</xref>. Habitats are colored as either marine (blue) or freshwater/terrestrial (brown). <bold>(B)</bold> Unifrac analysis of <italic>amoA</italic> sequences from aquatic environments only. A maximum likelihood tree was computed for all unique sequences from the indicated habitats using RAxML and analyzed by Unifrac as above. The major principal component correlates with salinity.</p></caption>
<graphic xlink:href="fmicb-03-00252-g002.tif"/>
</fig>
<p>Surveys of AOA <italic>amoA</italic> diversity in soils have shown that different groups of <italic>amoA</italic> sequences are found in different geographic locations (Gubry-Rangin et al., <xref ref-type="bibr" rid="B11">2011</xref>; Pester et al., <xref ref-type="bibr" rid="B41">2012</xref>). Although many sequences lacked detailed location data, we asked whether two broad categories generally associated with geographic variation &#x02013; temperature and latitude &#x02013; were correlated with sequence diversity in our dataset. ANOSIM analyses indicated a moderate association with temperature, and a weak but significant association with latitude (Table <xref ref-type="table" rid="TA2">A2</xref> in Appendix). The perMANOVA analyses found that the combination of habitat type, latitude, and temperature could explain approximately 25% (<italic>R</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.248) of the variance in our global alignment, but the vast majority of this variance was accounted for by habitat type (Table <xref ref-type="table" rid="T3">3</xref>). While latitude and temperature were both significantly associated with variation in sequence type (<italic>P</italic>&#x02009;&#x0003C;&#x02009;0.001), they could account for a mere 1.7 and 1% of variance in sequence diversity, after accounting for the influence of habitat type. Interaction effects for these factors could account for an additional 2.7% of sequence variation. When assessed independently from habitat type, temperature, and latitude explain 9.7 and 3.4%, respectively, of <italic>amoA</italic> sequence variation. In total, these results suggest that the observed diversity among AOA <italic>amoA</italic> across all environments is linked in part to biogeographic (latitudinal) variation (corroborating the findings of Pester et al., <xref ref-type="bibr" rid="B41">2012</xref>) and temperature, but these factors likely play relatively minor roles compared to the influence of habitat type. Our analysis was limited to metadata available in GenBank or the associated publications and, therefore, it is certainly possible that other environmental variables not analyzed here are correlated with sequence diversity in the overall dataset.</p>
<p>We wanted to determine what environmental parameters might specifically explain the differences in AOA from distinct aquatic habitats, where the associated environmental metadata was most complete. To address this question, we examined the relative importance of depth, salinity, and other environmental factors that could potentially drive <italic>amoA</italic> sequence divergence. We began by examining the phylogenetic relationship between <italic>amoA</italic> sequences from the marine water column, seas, lakes and rivers, and groundwater samples. A global Unifrac significance test on the phylogenetic relationship among these aquatic <italic>amoA</italic> sequences confirmed the earlier inference that there is significant clustering of these environmental groupings (<italic>P</italic>&#x02009;&#x0003C;&#x02009;0.001).</p>
<p>To assess the potential for differences between <italic>amoA</italic> sequences from different depths in the ocean, we further separated water column sequences into three subgroups: surface (0&#x02013;199&#x02009;m), middle (200&#x02013;399&#x02009;m), and deep (&#x02265;400&#x02009;m). We found that depth can explain &#x0007E;30% of the sequence variation in the marine water column (perMANOVA <italic>R</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.335, <italic>P</italic>&#x02009;&#x0003C;&#x02009;0.001; Table <xref ref-type="table" rid="T3">3</xref>). The correlation between <italic>amoA</italic> groupings and ocean depth might be explained by multiple selective factors such as ammonium availability, competition with other organisms, light levels, oxygen concentrations, and physical partitioning of these populations that may have reduced the opportunity for migration. Sequences from surface waters were separated from sequences found at middle or deep depths by the second principal component of variation in the Unifrac distance matrix (Figure <xref ref-type="fig" rid="F2">2</xref>B); the third principal component roughly correlated with depth among the water column sequences, and explained 11% of the variation within <italic>amoA</italic> sequences from all aquatic habitats. This result is in keeping with previous studies arguing that archaeal <italic>amoA</italic> sequences from the open ocean fall into two phylogenetically distinct groups corresponding to surface and deep water ecotypes (Francis et al., <xref ref-type="bibr" rid="B10">2005</xref>; Hallam et al., <xref ref-type="bibr" rid="B13">2006</xref>; Mincer et al., <xref ref-type="bibr" rid="B33">2007</xref>; Beman et al., <xref ref-type="bibr" rid="B2">2008</xref>; Santoro et al., <xref ref-type="bibr" rid="B45">2010</xref>; Hu et al., <xref ref-type="bibr" rid="B18">2011</xref>; Mosier and Francis, <xref ref-type="bibr" rid="B36">2011</xref>). Given the current level of known <italic>amoA</italic> sequence diversity in the oceans, it is apparent that two major water column ecotypes exist; however, other still-unknown factors in addition to depth are influencing the distribution of these sequence types. Indeed, perMANOVA tests also indicated that the combination of latitude and temperature could account for 37% of sequence variation amongst water column sequences (<italic>P</italic>&#x02009;&#x0003C;&#x02009;0.001).</p>
<p>Principal components analysis of the Unifrac distances between each group of aquatic <italic>amoA</italic> sequences revealed that 60% of the variation could be explained by an axis that roughly corresponded to salinity (Figure <xref ref-type="fig" rid="F2">2</xref>B). <italic>amoA</italic> Sequences from freshwater environments (lakes and rivers) formed a phylogenetic cluster that was significantly different from all other groups of aquatic sequences from saline environments (<italic>P</italic>&#x02009;&#x0003C;&#x02009;0.002; pairwise Unifrac significance test). ANOSIM (<italic>R</italic>&#x02009;&#x0003D;&#x02009;0.576, <italic>P</italic>&#x02009;&#x0003C;&#x02009;0.001; Table <xref ref-type="table" rid="TA2">A2</xref> in Appendix) and perMANOVA (19.7%; Table <xref ref-type="table" rid="T3">3</xref>) confirmed that salinity was strongly associated with <italic>amoA</italic> sequence clustering amongst the aquatic habitats. However, a large percentage of the total variation in the aquatic <italic>amoA</italic> dataset could not be explained by salinity alone, suggesting that additional environmental variables significantly influence sequence variation in aquatic environments.</p>
<p>Salinity is a major determinant of overall microbial community composition in the environment (Lozupone and Knight, <xref ref-type="bibr" rid="B30">2007</xref>), and also influences the abundance of AOA in some estuary and surf zone sediments (Mosier and Francis, <xref ref-type="bibr" rid="B35">2008</xref>; Santoro et al., <xref ref-type="bibr" rid="B44">2008</xref>). To further explore whether phylogenetic clustering of AOA <italic>amoA</italic> was associated with salinity, we employed AdaptML (Hunt et al., <xref ref-type="bibr" rid="B19">2008</xref>) to define ecologically coherent populations (ecotypes) amongst the coastal sediment, lake, and river sequences. AdaptML is an evolutionary model that defines ecotypes or inferred habitats within the dataset based on the environmental characteristics of the sampling location (e.g., salinity and environmental setting) and the observed phylogeny. Each sequence was assigned to a high (&#x02265;15&#x02009;ppt) or low (&#x0003C;15&#x02009;ppt) salinity category based on the reported data from the sampling location. Additionally, the coastal sediment, lake, and river sequences were assigned to sub-categories based on environmental setting: coastal, surf zone, estuary, salt marsh, lake, heathland pool, or river.</p>
<p>The analysis identified six distinct ecotypes or inferred habitats with strong signals from salinity and environmental setting (Figures <xref ref-type="fig" rid="F3">3</xref>A,B). Habitat A was dominated by estuary sequences from high-salinity sites, whereas Habitat B mainly represented estuary sequences from low-salinity sites. Habitats C and D showed a similar salinity distinction amongst surf zone sequences. Habitat E was more cosmopolitan in nature and combined high-salinity sequences from estuaries, salt marshes, and heathland pools. Habitat F corresponded to low-salinity lake sequences. Habitat subcategories (coastal, surf zone, estuary, salt marsh, lake, heathland pool, or river) alone could explain more than 10% of the variation in <italic>amoA</italic> sequence diversity, based on perMANOVA analysis (<italic>R</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.103, <italic>P</italic>&#x02009;&#x0003C;&#x02009;0.001), while salinity and interaction effects with habitat subcategories explained an additional 2 and 1.1%, respectively.</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p><bold>Salinity and microenvironment ecotype predictions amongst coastal sediment, lake, and river AOA <italic>amoA</italic> sequences as inferred by AdaptML</bold>. <bold>(A)</bold> Maximum likelihood phylogenetic tree showing environmental characteristics and habitat predictions. Characteristics of the sampling locations are plotted on the inner and outer rings: the inner ring indicates high (&#x02265;15&#x02009;ppt) or low (&#x0003C;15&#x02009;ppt) salinity and the outer ring indicates the microenvironment. Ecological habitats predicted by the model are shown as nodes on the tree. <bold>(B)</bold> The distribution of each habitat among microenvironments and salinity. The habitat and environment colors match the legend in <bold>(A)</bold>.</p></caption>
<graphic xlink:href="fmicb-03-00252-g003.tif"/>
</fig>
</sec>
<sec>
<title>Considerations and future areas of research</title>
<p>Previous work has noted the existence of distinct groups of <italic>amoA</italic> sequences associated with different environments, such as soil- and water-associated phylogenetic clades. In this study, we inquired whether the phylogenetic relationships between <italic>amoA</italic> groups and ecological habitats were simply an artifact of early limited sampling or instead reflected a broad trend separating AOA communities in the environment. Our analysis of an aggregated global <italic>amoA</italic> sequence dataset highlights the immense diversity of this gene in the environment and demonstrates that distinct groupings of phylogenetically related <italic>amoA</italic> sequences are indeed found in particular habitats; however, with increased sampling of <italic>amoA</italic> diversity, nuances have begun to emerge.</p>
<p>We find a strong distinction between sediment/soil-associated <italic>amoA</italic> sequence types and those found in other aquatic systems. Factors affecting <italic>amoA</italic> diversity within soils, such as pH, have been discussed extensively elsewhere (Gubry-Rangin et al., <xref ref-type="bibr" rid="B11">2011</xref>; Pester et al., <xref ref-type="bibr" rid="B41">2012</xref>) and were not the focus of this study. Among <italic>amoA</italic> sequences from aquatic ecosystems, environmental factors such as salinity, temperature, and depth in the water column appear to play a role in selecting for and/or maintaining <italic>amoA</italic> sequence diversity. We do not yet know whether the environmental variables studied here are selecting specifically for variants of AmoA with specific functional differences, or are instead selecting on corresponding differences in the rest of the AMO enzyme or associated pathways. It is also possible that <italic>amoA</italic> could play a role in other cellular functions besides ammonia oxidation, and that this could in turn affect the selective pressures on this gene. We also note that the effects of recent recombination and horizontal gene transfer could complicate interpretation of the ecological clustering.</p>
<p>We found that one of the major barriers in trying to place sequence diversity into an environmental context was the lack of standardized environmental metadata from the sites sampled (e.g., salinity, temperature, latitude and longitude, nutrient concentrations, dissolved oxygen concentration, pH, among others). We support efforts to establish a set of standardized metadata submission requirements to accompany sequences from environmental marker genes such as MIMARKS (Yilmaz et al., <xref ref-type="bibr" rid="B54">2011</xref>). A better understanding of the exact selective factors that influence the distribution of <italic>amoA</italic> sequences in different habitats will require such annotations; even still, environmental factors beyond the scope of standardized metadata (e.g., competition for ammonia with other organisms, abiotic processes, etc.), including the range of physicochemical parameters experienced at a given sampling location, likely play a role in shaping the overall distribution and activity of AOA. These limitations argue strongly for the necessity of additional studies on cultivated AOA (both pure isolates and enrichment cultures), to better understand the functional differences between different <italic>amoA</italic> sequence types and the specific factors that drive AOA niche differentiation.</p>
<p>While there is clearly a large amount of diversity at the AmoA protein level, the functional implications of these protein variants is impossible to know at this time. The successful culturing of AOA from different environments provides hope that, in time, physiological studies could shed some light on the functional differences between <italic>amoA</italic> alleles, interactions between mutations within different subunits of the AMO complex, the fitness advantage conferred by different alleles, and the effect of other factors (e.g., gene expression patterns) on the selection of different <italic>amoA</italic> sequences in different habitats.</p>
<p>The dramatic advances in sequencing technology in recent years will undoubtedly allow future studies to more thoroughly survey the diversity of <italic>amoA</italic> sequences in the environment. Our results suggest that the novel <italic>amoA</italic> sequences are most likely to be found in hot springs and marine sediments. Future use of single-cell genomics approaches to uncover other genomic features that covary with <italic>amoA</italic> sequence types will likely lead to a better understanding of the functional context and implications of this diversity, as well as the relative roles of selection, mutation, migration, and other processes in partitioning <italic>amoA</italic> diversity.</p>
</sec>
</sec>
<sec>
<title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="supplementary-material">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at <uri xlink:href="http://www.frontiersin.org/Aquatic_Microbiology/10.3389/fmicb.2012.00252/abstract">http://www.frontiersin.org/Aquatic_Microbiology/10.3389/fmicb.2012.00252/abstract</uri></p>
</sec>
</body>
<back>
<app-group>
<app id="A1">
<title>Appendix</title>
<table-wrap position="float" id="TA1">
<label>Table A1</label>
<caption><p><bold>&#x003B2;-Diversity between environment types</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left"/>
<th align="left">Aquaria&#x02009;&#x0002B;&#x02009;biofilters</th>
<th align="left">Caves</th>
<th align="left">Coastal sediments</th>
<th align="left">Coral&#x02009;&#x0002B;&#x02009;sponges</th>
<th align="left">Groundwater</th>
<th align="left">Hot springs</th>
<th align="left">Hydrothermal vents</th>
<th align="left">Lakes&#x02009;&#x0002B;&#x02009;rivers</th>
<th align="left">Marine sediments</th>
<th align="left">Seas</th>
<th align="left">Soils</th>
<th align="left">Ocean water column</th>
<th align="left">Wastewater treatment</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Aquaria&#x02009;&#x0002B;&#x02009;biofilters</td>
<td align="left" style="background-color:gray"/>
<td align="left">0</td>
<td align="left">0.010</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0</td>
</tr>
<tr>
<td align="left">Caves</td>
<td align="left">0</td>
<td align="left" style="background-color:gray"/>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0</td>
</tr>
<tr>
<td align="left">Coastal sediments</td>
<td align="left">0.042</td>
<td align="left">0</td>
<td align="left" style="background-color:gray"/>
<td align="left">0.009</td>
<td align="left">0.004</td>
<td align="left">0.004</td>
<td align="left">0</td>
<td align="left">0.013</td>
<td align="left">0.013</td>
<td align="left">0.003</td>
<td align="left">0.031</td>
<td align="left">0.010</td>
<td align="left">0.008</td>
</tr>
<tr>
<td align="left">Coral&#x02009;&#x0002B;&#x02009;sponges</td>
<td align="left">0.028</td>
<td align="left">0</td>
<td align="left">0.046</td>
<td align="left" style="background-color:gray"/>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.008</td>
<td align="left">0</td>
<td align="left">0.023</td>
<td align="left">0</td>
</tr>
<tr>
<td align="left">Groundwater</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.024</td>
<td align="left">0</td>
<td align="left" style="background-color:gray"/>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.022</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.004</td>
<td align="left">0</td>
<td align="left">0.009</td>
</tr>
<tr>
<td align="left">Hot springs</td>
<td align="left">0</td>
<td align="left">0.037</td>
<td align="left">0.041</td>
<td align="left">0</td>
<td align="left">0.029</td>
<td align="left" style="background-color:gray"/>
<td align="left">0</td>
<td align="left">0.019</td>
<td align="left">0.006</td>
<td align="left">0</td>
<td align="left">0.015</td>
<td align="left">0</td>
<td align="left">0.006</td>
</tr>
<tr>
<td align="left">Hydrothermal vents</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.008</td>
<td align="left">0.013</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left" style="background-color:gray"/>
<td align="left">0</td>
<td align="left">0.026</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.027</td>
<td align="left">0</td>
</tr>
<tr>
<td align="left">Lakes&#x02009;&#x0002B;&#x02009;rivers</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.087</td>
<td align="left">0</td>
<td align="left">0.089</td>
<td align="left">0.134</td>
<td align="left">0</td>
<td align="left" style="background-color:gray"/>
<td align="left">0.003</td>
<td align="left">0</td>
<td align="left">0.035</td>
<td align="left">0</td>
<td align="left">0.016</td>
</tr>
<tr>
<td align="left">Marine sediments</td>
<td align="left">0.021</td>
<td align="left">0.008</td>
<td align="left">0.109</td>
<td align="left">0.017</td>
<td align="left">0.007</td>
<td align="left">0.057</td>
<td align="left">0.103</td>
<td align="left">0.048</td>
<td align="left" style="background-color:gray"/>
<td align="left">0</td>
<td align="left">0.017</td>
<td align="left">0.008</td>
<td align="left">0.011</td>
</tr>
<tr>
<td align="left">Seas</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.011</td>
<td align="left">0.032</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.031</td>
<td align="left">0</td>
<td align="left">0.008</td>
<td align="left" style="background-color:gray"/>
<td align="left">0</td>
<td align="left">0.019</td>
<td align="left">0</td>
</tr>
<tr>
<td align="left">Soils</td>
<td align="left">0</td>
<td align="left">0</td>
<td align="left">0.126</td>
<td align="left">0</td>
<td align="left">0.027</td>
<td align="left">0.091</td>
<td align="left">0</td>
<td align="left">0.121</td>
<td align="left">0.074</td>
<td align="left">0</td>
<td align="left" style="background-color:gray"/>
<td align="left">0</td>
<td align="left">0.012</td>
</tr>
<tr>
<td align="left">Ocean water column</td>
<td align="left">0.025</td>
<td align="left">0</td>
<td align="left">0.050</td>
<td align="left">0.071</td>
<td align="left">0</td>
<td align="left">0.007</td>
<td align="left">0.113</td>
<td align="left">0.007</td>
<td align="left">0.038</td>
<td align="left">0.075</td>
<td align="left">0.003</td>
<td align="left" style="background-color:gray"/>
<td align="left">0</td>
</tr>
<tr>
<td align="left">Wastewater treatment</td>
<td align="left">0.024</td>
<td align="left">0</td>
<td align="left">0.044</td>
<td align="left">0</td>
<td align="left">0.020</td>
<td align="left">0.061</td>
<td align="left">0</td>
<td align="left">0.090</td>
<td align="left">0.056</td>
<td align="left">0</td>
<td align="left">0.045</td>
<td align="left">0</td>
<td align="left" style="background-color:gray"/>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>Values represent the Jaccard similarity index for (above the diagonal) <italic>amoA</italic> nucleotide OTUs at the 99% identity level and (below the diagonal) 95% identity level</italic>.</p>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="TA2">
<label>Table A2</label>
<caption><p><bold><italic>R</italic> values for factors tested via ANOSIM for association with variation in archaeal <italic>amoA</italic> sequence diversity</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Factor</th>
<th colspan="3" align="center">Sequences included in analysis<hr/></th>
</tr>
<tr>
<th align="left"/>
<th align="left">Global alignment</th>
<th align="left">Water column sequences</th>
<th align="left">Water associated sequences (water column, groundwater, sea, lakes/rivers)</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Habitat (13 levels)</td>
<td align="left"><bold>0.378</bold> (<italic>n</italic>&#x02009;&#x0003D;&#x02009;6202)</td>
<td align="left">NA</td>
<td align="left">NA</td>
</tr>
<tr>
<td align="left">Latitude (low, mid, high)</td>
<td align="left"><bold>0.026</bold> (<italic>n</italic>&#x02009;&#x0003D;&#x02009;7138)</td>
<td align="left"><bold>0.296</bold> (<italic>n</italic>&#x02009;&#x0003D;&#x02009;440)</td>
<td align="left">0.049 (<italic>n</italic>&#x02009;&#x0003D;&#x02009;984)</td>
</tr>
<tr>
<td align="left">Temperature (psychrophilic, mesophilic, thermophilic)</td>
<td align="left"><bold>0.170</bold> (<italic>n</italic>&#x02009;&#x0003D;&#x02009;2014)</td>
<td align="left"><bold>0.263</bold> (<italic>n</italic>&#x02009;&#x0003D;&#x02009;290)</td>
<td align="left">NA</td>
</tr>
<tr>
<td align="left">Water depth (surface, mid, deep)</td>
<td align="left">NA</td>
<td align="left"><bold>0.464</bold> (<italic>n</italic>&#x02009;&#x0003D;&#x02009;898)</td>
<td align="left">NA</td>
</tr>
<tr>
<td align="left">Salinity (high, low)</td>
<td align="left">NA</td>
<td align="left">NA</td>
<td align="left"><bold>0.576</bold> (<italic>n</italic>&#x02009;&#x0003D;&#x02009;1402)</td>
</tr>
<tr>
<td align="left">Marine versus freshwater / terrestrial<xref ref-type="table-fn" rid="tfn4"><sup>1</sup></xref></td>
<td align="left"><bold>0.422</bold> (<italic>n</italic>&#x02009;&#x0003D;&#x02009;6202)</td>
<td align="left">NA</td>
<td align="left">NA</td>
</tr>
<tr>
<td align="left">Soil/sediment versus aquatic</td>
<td align="left"><bold>0.099</bold> (<italic>n</italic>&#x02009;&#x0003D;&#x02009;6202)</td>
<td align="left">NA</td>
<td align="left">NA</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>Significant <italic>R</italic> values (<italic>P</italic>&#x0003C;0.001) are indicated in bold. Level designations for each habitat are provided; definitions for levels are given in the text. The number of sequences included in each analysis (<italic>n</italic>) is given below <italic>R</italic> values. All factors are individually tested in ANOSIM analyses</italic>.</p>
<p><italic>NA, not applicable</italic>.</p>
<fn id="tfn4"><p><italic><sup>1</sup>The marine level included sequences associated with aquaria and biofilters, coastal sediments, marine sediments, hydrothermal vents, seas, water column (marine), coral/sponge, and selected wastewater treatment sequences obtained from high-salinity bioreactors in Hong Kong (EU860262&#x02013;EU860273, EU870438&#x02013;EU870442). The terrestrial level included sequences associated with soils, lakes and rivers, caves, groundwater, hot springs, and the majority of the wastewater treatment sequences</italic>.</p></fn>
</table-wrap-foot>
</table-wrap>
<fig id="FA1" position="float">
<label>Figure A1</label>
<caption><p><bold>Average pairwise <italic>amoA</italic> nucleotide and protein sequence identity</bold>. Error bars represent the standard error of the mean.</p></caption>
<graphic xlink:href="fmicb-03-00252-a001.tif"/>
</fig>
<fig id="FA2" position="float">
<label>Figure A2</label>
<caption><p><bold>Rarefaction analysis of <italic>amoA</italic> sequence diversity, by habitat type</bold>. Curves represent rarefaction analysis of unique sequences and OTUs defined by 99, 95, and 90% identity.</p></caption>
<graphic xlink:href="fmicb-03-00252-a002.tif"/>
</fig>
<fig id="FA3" position="float">
<label>Figure A3</label>
<caption><p><bold>Selective pressure (dN/dS) on <italic>amoA</italic> sequences from different habitats as calculated by the SLAC algorithm</bold>. Error bars represent 95% confidence intervals.</p></caption>
<graphic xlink:href="fmicb-03-00252-a003.tif"/>
</fig>
</app>
</app-group>
<ack>
<p>The authors would like to thank Paul Rainey for guidance in the development and data analysis of this project. We also thank Marie Lund for helpful discussions, Lawrence David for assistance with AdaptML, and Paul Berube for critical reading of the manuscript. The Stanford Center for Computational Earth and Environmental Science and the Hopkins Microbiology Course kindly provided access to computational resources. This work was supported in part by EPA STAR Graduate Fellowships (to Annika C. Mosier and George F. Wells), as well as National Science Foundation grants MCB-0604270 and OCE-0847266 to Christopher A. Francis.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Agogu&#x000E9;</surname> <given-names>H.</given-names></name> <name><surname>Brink</surname> <given-names>M.</given-names></name> <name><surname>Dinasquet</surname> <given-names>J.</given-names></name> <name><surname>Herndl</surname> <given-names>G. J.</given-names></name></person-group> (<year>2008</year>). <article-title>Major gradients in putatively nitrifying and non-nitrifying archaea in the deep North Atlantic</article-title>. <source>Nature</source> <volume>456</volume>, <fpage>788</fpage>&#x02013;<lpage>791</lpage>.<pub-id pub-id-type="doi">10.1038/nature07535</pub-id><pub-id pub-id-type="pmid">19037244</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Beman</surname> <given-names>J.</given-names></name> <name><surname>Popp</surname> <given-names>B.</given-names></name> <name><surname>Francis</surname> <given-names>C.</given-names></name></person-group> (<year>2008</year>). <article-title>Molecular and biogeochemical evidence for ammonia oxidation by marine crenarchaeota in the Gulf of California</article-title>. <source>ISME J.</source> <volume>2</volume>, <fpage>429</fpage>&#x02013;<lpage>441</lpage>.<pub-id pub-id-type="doi">10.1038/ismej.2008.33</pub-id><pub-id pub-id-type="pmid">18200070</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Beman</surname> <given-names>J. M.</given-names></name> <name><surname>Sachdeva</surname> <given-names>R.</given-names></name> <name><surname>Fuhrman</surname> <given-names>J. A.</given-names></name></person-group> (<year>2010</year>). <article-title>Population ecology of nitrifying archaea and bacteria in the Southern California bight</article-title>. <source>Environ. Microbiol.</source> <volume>12</volume>, <fpage>1282</fpage>&#x02013;<lpage>1292</lpage>.<pub-id pub-id-type="doi">10.1111/j.1462-2920.2010.02172.x</pub-id><pub-id pub-id-type="pmid">20192962</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Blainey</surname> <given-names>P. C.</given-names></name> <name><surname>Mosier</surname> <given-names>A. C.</given-names></name> <name><surname>Potanina</surname> <given-names>A.</given-names></name> <name><surname>Francis</surname> <given-names>C. A.</given-names></name> <name><surname>Quake</surname> <given-names>S. R.</given-names></name></person-group> (<year>2011</year>). <article-title>Genome of a low-salinity ammonia-oxidizing archaeon determined by single-cell and metagenomic analysis</article-title>. <source>PLoS ONE</source> <volume>6</volume>, <fpage>e16626</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0016626</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brochier-Armanet</surname> <given-names>C.</given-names></name> <name><surname>Boussau</surname> <given-names>B.</given-names></name> <name><surname>Gribaldo</surname> <given-names>S.</given-names></name> <name><surname>Forterre</surname> <given-names>P.</given-names></name></person-group> (<year>2008</year>). <article-title>Mesophilic crenarchaeota: proposal for a third archaeal phylum, the Thaumarchaeota</article-title>. <source>Nat. Rev. Microbiol.</source> <volume>6</volume>, <fpage>245</fpage>&#x02013;<lpage>252</lpage>.<pub-id pub-id-type="doi">10.1038/nrmicro1852</pub-id><pub-id pub-id-type="pmid">18274537</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Church</surname> <given-names>M. J.</given-names></name> <name><surname>Wai</surname> <given-names>B.</given-names></name> <name><surname>Karl</surname> <given-names>D. M.</given-names></name> <name><surname>Delong</surname> <given-names>E. F.</given-names></name></person-group> (<year>2010</year>). <article-title>Abundances of crenarchaeal amoA genes and transcripts in the Pacific Ocean</article-title>. <source>Environ. Microbiol.</source> <volume>12</volume>, <fpage>679</fpage>&#x02013;<lpage>688</lpage>.<pub-id pub-id-type="doi">10.1111/j.1462-2920.2009.02108.x</pub-id><pub-id pub-id-type="pmid">20002133</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>De La Torre</surname> <given-names>J.</given-names></name> <name><surname>Walker</surname> <given-names>C.</given-names></name> <name><surname>Ingalls</surname> <given-names>A.</given-names></name> <name><surname>Konneke</surname> <given-names>M.</given-names></name> <name><surname>Stahl</surname> <given-names>D.</given-names></name></person-group> (<year>2008</year>). <article-title>Cultivation of a thermophilic ammonia oxidizing archaeon synthesizing crenarchaeol</article-title>. <source>Environ. Microbiol.</source> <volume>10</volume>, <fpage>810</fpage>&#x02013;<lpage>818</lpage>.<pub-id pub-id-type="doi">10.1111/j.1462-2920.2007.01506.x</pub-id><pub-id pub-id-type="pmid">18205821</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Erguder</surname> <given-names>T. H.</given-names></name> <name><surname>Boon</surname> <given-names>N.</given-names></name> <name><surname>Wittebolle</surname> <given-names>L.</given-names></name> <name><surname>Marzorati</surname> <given-names>M.</given-names></name> <name><surname>Verstraete</surname> <given-names>W.</given-names></name></person-group> (<year>2009</year>). <article-title>Environmental factors shaping the ecological niches of ammonia-oxidizing archaea</article-title>. <source>FEMS Microbiol. Rev.</source> <volume>33</volume>, <fpage>855</fpage>&#x02013;<lpage>869</lpage>.<pub-id pub-id-type="doi">10.1111/j.1574-6976.2009.00179.x</pub-id><pub-id pub-id-type="pmid">19453522</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Francis</surname> <given-names>C. A.</given-names></name> <name><surname>Beman</surname> <given-names>J. M.</given-names></name> <name><surname>Kuypers</surname> <given-names>M. M.</given-names></name></person-group> (<year>2007</year>). <article-title>New processes and players in the nitrogen cycle: the microbial ecology of anaerobic and archaeal ammonia oxidation</article-title>. <source>ISME J.</source> <volume>1</volume>, <fpage>19</fpage>&#x02013;<lpage>27</lpage>.<pub-id pub-id-type="doi">10.1038/ismej.2007.8</pub-id><pub-id pub-id-type="pmid">18043610</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Francis</surname> <given-names>C. A.</given-names></name> <name><surname>Roberts</surname> <given-names>K. J.</given-names></name> <name><surname>Beman</surname> <given-names>J. M.</given-names></name> <name><surname>Santoro</surname> <given-names>A. E.</given-names></name> <name><surname>Oakley</surname> <given-names>B. B.</given-names></name></person-group> (<year>2005</year>). <article-title>Ubiquity and diversity of ammonia-oxidizing archaea in water columns and sediments of the ocean</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>102</volume>, <fpage>14683</fpage>&#x02013;<lpage>14688</lpage>.<pub-id pub-id-type="doi">10.1073/pnas.0506625102</pub-id><pub-id pub-id-type="pmid">16186488</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gubry-Rangin</surname> <given-names>C.</given-names></name> <name><surname>Hai</surname> <given-names>B.</given-names></name> <name><surname>Quince</surname> <given-names>C.</given-names></name> <name><surname>Engel</surname> <given-names>M.</given-names></name> <name><surname>Thomson</surname> <given-names>B. C.</given-names></name> <name><surname>James</surname> <given-names>P.</given-names></name> <name><surname>Schloter</surname> <given-names>M.</given-names></name> <name><surname>Griffiths</surname> <given-names>R. I.</given-names></name> <name><surname>Prosser</surname> <given-names>J. I.</given-names></name> <name><surname>Nicol</surname> <given-names>G. W.</given-names></name></person-group> (<year>2011</year>). <article-title>Niche specialization of terrestrial archaeal ammonia oxidizers</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>108</volume>, <fpage>21206</fpage>&#x02013;<lpage>21211</lpage>.<pub-id pub-id-type="doi">10.1073/pnas.1109000108</pub-id><pub-id pub-id-type="pmid">22158986</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Guindon</surname> <given-names>S.</given-names></name> <name><surname>Gascuel</surname> <given-names>O.</given-names></name></person-group> (<year>2003</year>). <article-title>A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood</article-title>. <source>Syst. Biol.</source> <volume>52</volume>, <fpage>696</fpage>&#x02013;<lpage>704</lpage>.<pub-id pub-id-type="doi">10.1080/10635150390235520</pub-id><pub-id pub-id-type="pmid">14530136</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hallam</surname> <given-names>S.</given-names></name> <name><surname>Mincer</surname> <given-names>T.</given-names></name> <name><surname>Schleper</surname> <given-names>C.</given-names></name> <name><surname>Preston</surname> <given-names>C.</given-names></name> <name><surname>Roberts</surname> <given-names>K.</given-names></name> <name><surname>Richardson</surname> <given-names>P.</given-names></name> <name><surname>Delong</surname> <given-names>E.</given-names></name></person-group> (<year>2006</year>). <article-title>Pathways of carbon assimilation and ammonia oxidation suggested by environmental genomic analyses of marine crenarchaeota</article-title>. <source>PLoS Biol.</source> <volume>4</volume>, <fpage>e95</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pbio.0040095</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hamady</surname> <given-names>M.</given-names></name> <name><surname>Lozupone</surname> <given-names>C.</given-names></name> <name><surname>Knight</surname> <given-names>R.</given-names></name></person-group> (<year>2010</year>). <article-title>Fast uniFrac: facilitating high-throughput phylogenetic analyses of microbial communities including analysis of pyrosequencing and phylochip data</article-title>. <source>ISME J.</source> <volume>4</volume>, <fpage>17</fpage>&#x02013;<lpage>27</lpage>.<pub-id pub-id-type="doi">10.1038/ismej.2009.97</pub-id><pub-id pub-id-type="pmid">19710709</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hansel</surname> <given-names>C.</given-names></name> <name><surname>Fendorf</surname> <given-names>S.</given-names></name> <name><surname>Jardine</surname> <given-names>P.</given-names></name> <name><surname>Francis</surname> <given-names>C.</given-names></name></person-group> (<year>2008</year>). <article-title>Changes in bacterial and archaeal community structure and functional diversity along a geochemically variable soil profile</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>74</volume>, <fpage>1620</fpage>&#x02013;<lpage>1633</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.01787-07</pub-id><pub-id pub-id-type="pmid">18192411</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hatzenpichler</surname> <given-names>R.</given-names></name> <name><surname>Lebedeva</surname> <given-names>E.</given-names></name> <name><surname>Spieck</surname> <given-names>E.</given-names></name> <name><surname>Stoecker</surname> <given-names>K.</given-names></name> <name><surname>Richter</surname> <given-names>A.</given-names></name> <name><surname>Daims</surname> <given-names>H.</given-names></name> <name><surname>Wagner</surname> <given-names>M.</given-names></name></person-group> (<year>2008</year>). <article-title>A moderately thermophilic ammonia-oxidizing crenarchaeote from a hot spring</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>105</volume>, <fpage>2134</fpage>&#x02013;<lpage>2139</lpage>.<pub-id pub-id-type="doi">10.1073/pnas.0708857105</pub-id><pub-id pub-id-type="pmid">18250313</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>He</surname> <given-names>J.</given-names></name> <name><surname>Shen</surname> <given-names>J.</given-names></name> <name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Zhu</surname> <given-names>Y.</given-names></name> <name><surname>Zheng</surname> <given-names>Y.</given-names></name> <name><surname>Xu</surname> <given-names>M.</given-names></name> <name><surname>Di</surname> <given-names>H.</given-names></name></person-group> (<year>2007</year>). <article-title>Quantitative analyses of the abundance and composition of ammonia-oxidizing bacteria and ammonia-oxidizing archaea of a Chinese upland red soil under long-term fertilization practices</article-title>. <source>Environ. Microbiol.</source> <volume>9</volume>, <fpage>2364</fpage>&#x02013;<lpage>2374</lpage>.<pub-id pub-id-type="doi">10.1111/j.1462-2920.2007.01358.x</pub-id><pub-id pub-id-type="pmid">17686032</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hu</surname> <given-names>A.</given-names></name> <name><surname>Jiao</surname> <given-names>N.</given-names></name> <name><surname>Zhang</surname> <given-names>R.</given-names></name> <name><surname>Yang</surname> <given-names>Z.</given-names></name></person-group> (<year>2011</year>). <article-title>Niche partitioning of marine group I crenarchaeota in the euphotic and upper mesopelagic zones of the East China Sea</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>77</volume>, <fpage>7469</fpage>&#x02013;<lpage>7478</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.00294-11</pub-id><pub-id pub-id-type="pmid">21873485</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hunt</surname> <given-names>D. E.</given-names></name> <name><surname>David</surname> <given-names>L. A.</given-names></name> <name><surname>Gevers</surname> <given-names>D.</given-names></name> <name><surname>Preheim</surname> <given-names>S. P.</given-names></name> <name><surname>Alm</surname> <given-names>E. J.</given-names></name> <name><surname>Polz</surname> <given-names>M. F.</given-names></name></person-group> (<year>2008</year>). <article-title>Resource partitioning and sympatric differentiation among closely related bacterioplankton</article-title>. <source>Science</source> <volume>320</volume>, <fpage>1081</fpage>&#x02013;<lpage>1085</lpage>.<pub-id pub-id-type="doi">10.1126/science.1157890</pub-id><pub-id pub-id-type="pmid">18497299</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jia</surname> <given-names>Z.</given-names></name> <name><surname>Conrad</surname> <given-names>R.</given-names></name></person-group> (<year>2009</year>). <article-title>Bacteria rather than archaea dominate microbial ammonia oxidation in an agricultural soil</article-title>. <source>Environ. Microbiol.</source> <volume>11</volume>, <fpage>1658</fpage>&#x02013;<lpage>1671</lpage>.<pub-id pub-id-type="doi">10.1111/j.1462-2920.2009.01891.x</pub-id><pub-id pub-id-type="pmid">19236445</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jung</surname> <given-names>M. Y.</given-names></name> <name><surname>Park</surname> <given-names>S. J.</given-names></name> <name><surname>Min</surname> <given-names>D.</given-names></name> <name><surname>Kim</surname> <given-names>J. S.</given-names></name> <name><surname>Rijpstra</surname> <given-names>W. I.</given-names></name> <name><surname>Sinninghe Damste</surname> <given-names>J. S.</given-names></name> <name><surname>Kim</surname> <given-names>G. J.</given-names></name> <name><surname>Madsen</surname> <given-names>E. L.</given-names></name> <name><surname>Rhee</surname> <given-names>S. K.</given-names></name></person-group> (<year>2011</year>). <article-title>Enrichment and characterization of an autotrophic ammonia-oxidizing archaeon of mesophilic crenarchaeal group I.1a from an agricultural soil</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>77</volume>, <fpage>8635</fpage>&#x02013;<lpage>8647</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.05787-11</pub-id><pub-id pub-id-type="pmid">22003023</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kalanetra</surname> <given-names>K. M.</given-names></name> <name><surname>Bano</surname> <given-names>N.</given-names></name> <name><surname>Hollibaugh</surname> <given-names>J. T.</given-names></name></person-group> (<year>2009</year>). <article-title>Ammonia-oxidizing archaea in the Arctic Ocean and Antarctic coastal waters</article-title>. <source>Environ. Microbiol.</source> <volume>11</volume>, <fpage>2434</fpage>&#x02013;<lpage>2445</lpage>.<pub-id pub-id-type="doi">10.1111/j.1462-2920.2009.01974.x</pub-id><pub-id pub-id-type="pmid">19601959</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>K&#x000F6;nneke</surname> <given-names>M.</given-names></name> <name><surname>Bernhard</surname> <given-names>A.</given-names></name> <name><surname>De La Torre</surname> <given-names>J.</given-names></name> <name><surname>Walker</surname> <given-names>C.</given-names></name> <name><surname>Waterbury</surname> <given-names>J.</given-names></name> <name><surname>Stahl</surname> <given-names>D.</given-names></name></person-group> (<year>2005</year>). <article-title>Isolation of an autotrophic ammonia-oxidizing marine archaeon</article-title>. <source>Nature</source> <volume>437</volume>, <fpage>543</fpage>&#x02013;<lpage>546</lpage>.<pub-id pub-id-type="doi">10.1038/nature03911</pub-id><pub-id pub-id-type="pmid">16177789</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kosakovsky Pond</surname> <given-names>S. L.</given-names></name> <name><surname>Frost</surname> <given-names>S. D. W.</given-names></name></person-group> (<year>2005</year>). <article-title>Not so different after all: a comparison of methods for detecting amino acid sites under selection</article-title>. <source>Mol. Biol. Evol.</source> <volume>22</volume>, <fpage>1208</fpage>&#x02013;<lpage>1222</lpage>.<pub-id pub-id-type="doi">10.1093/molbev/msi105</pub-id><pub-id pub-id-type="pmid">15703242</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kryazhimskiy</surname> <given-names>S.</given-names></name> <name><surname>Plotkin</surname> <given-names>J. B.</given-names></name></person-group> (<year>2008</year>). <article-title>The population genetics of dN/dS</article-title>. <source>PLoS Genet.</source> <volume>4</volume>, <fpage>e1000304</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pgen.1000304</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lam</surname> <given-names>P.</given-names></name> <name><surname>Lavik</surname> <given-names>G.</given-names></name> <name><surname>Jensen</surname> <given-names>M.</given-names></name> <name><surname>Van De Vossenberg</surname> <given-names>J.</given-names></name> <name><surname>Schmid</surname> <given-names>M.</given-names></name> <name><surname>Woebken</surname> <given-names>D.</given-names></name> <name><surname>Gutierrez</surname> <given-names>D.</given-names></name> <name><surname>Amann</surname> <given-names>R.</given-names></name> <name><surname>Jetten</surname> <given-names>M.</given-names></name> <name><surname>Kuypers</surname> <given-names>M.</given-names></name></person-group> (<year>2009</year>). <article-title>Revising the nitrogen cycle in the Peruvian oxygen minimum zone</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>106</volume>, <fpage>4752</fpage>&#x02013;<lpage>4757</lpage>.<pub-id pub-id-type="doi">10.1073/pnas.0900608106</pub-id><pub-id pub-id-type="pmid">19255441</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lehtovirta-Morley</surname> <given-names>L. E.</given-names></name> <name><surname>Stoecker</surname> <given-names>K.</given-names></name> <name><surname>Vilcinskas</surname> <given-names>A.</given-names></name> <name><surname>Prosser</surname> <given-names>J. I.</given-names></name> <name><surname>Nicol</surname> <given-names>G. W.</given-names></name></person-group> (<year>2011</year>). <article-title>Cultivation of an obligate acidophilic ammonia oxidizer from a nitrifying acid soil</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>108</volume>, <fpage>15892</fpage>&#x02013;<lpage>15897</lpage>.<pub-id pub-id-type="doi">10.1073/pnas.1107196108</pub-id><pub-id pub-id-type="pmid">21896746</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Leininger</surname> <given-names>S.</given-names></name> <name><surname>Urich</surname> <given-names>T.</given-names></name> <name><surname>Schloter</surname> <given-names>M.</given-names></name> <name><surname>Schwark</surname> <given-names>L.</given-names></name> <name><surname>Qi</surname> <given-names>J.</given-names></name> <name><surname>Nicol</surname> <given-names>G.</given-names></name> <name><surname>Prosser</surname> <given-names>J.</given-names></name> <name><surname>Schuster</surname> <given-names>S.</given-names></name> <name><surname>Schleper</surname> <given-names>C.</given-names></name></person-group> (<year>2006</year>). <article-title>Archaea predominate among ammonia-oxidizing prokaryotes in soils</article-title>. <source>Nature</source> <volume>442</volume>, <fpage>806</fpage>&#x02013;<lpage>809</lpage>.<pub-id pub-id-type="doi">10.1038/nature04983</pub-id><pub-id pub-id-type="pmid">16915287</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Letunic</surname> <given-names>I.</given-names></name> <name><surname>Bork</surname> <given-names>P.</given-names></name></person-group> (<year>2007</year>). <article-title>Interactive tree of life (itol): an online tool for phylogenetic tree display and annotation</article-title>. <source>Bioinformatics</source> <volume>23</volume>, <fpage>127</fpage>&#x02013;<lpage>128</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btl529</pub-id><pub-id pub-id-type="pmid">17050570</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lozupone</surname> <given-names>C. A.</given-names></name> <name><surname>Knight</surname> <given-names>R.</given-names></name></person-group> (<year>2007</year>). <article-title>Global patterns in bacterial diversity</article-title>. <source>Proc. Natl. Acad. Sci.U.S.A.</source> <volume>104</volume>, <fpage>11436</fpage>&#x02013;<lpage>11440</lpage>.<pub-id pub-id-type="doi">10.1073/pnas.0611525104</pub-id><pub-id pub-id-type="pmid">17592124</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ludwig</surname> <given-names>W.</given-names></name> <name><surname>Strunk</surname> <given-names>O.</given-names></name> <name><surname>Westram</surname> <given-names>R.</given-names></name> <name><surname>Richter</surname> <given-names>L.</given-names></name> <name><surname>Meier</surname> <given-names>H.</given-names></name> <name><surname>Yadhukumar</surname></name> <name><surname>Buchner</surname> <given-names>A.</given-names></name> <name><surname>Lai</surname> <given-names>T.</given-names></name> <name><surname>Steppi</surname> <given-names>S.</given-names></name> <name><surname>Jobb</surname> <given-names>G.</given-names></name> <name><surname>Forster</surname> <given-names>W.</given-names></name> <name><surname>Brettske</surname> <given-names>I.</given-names></name> <name><surname>Gerber</surname> <given-names>S.</given-names></name> <name><surname>Ginhart</surname> <given-names>A.</given-names></name> <name><surname>Gross</surname> <given-names>O.</given-names></name> <name><surname>Grumann</surname> <given-names>S.</given-names></name> <name><surname>Hermann</surname> <given-names>S.</given-names></name> <name><surname>Jost</surname> <given-names>R.</given-names></name> <name><surname>Konig</surname> <given-names>A.</given-names></name> <name><surname>Liss</surname> <given-names>T.</given-names></name> <name><surname>Lussmann</surname> <given-names>R.</given-names></name> <name><surname>May</surname> <given-names>M.</given-names></name> <name><surname>Nonhoff</surname> <given-names>B.</given-names></name> <name><surname>Reichel</surname> <given-names>B.</given-names></name> <name><surname>Strehlow</surname> <given-names>R.</given-names></name> <name><surname>Stamatakis</surname> <given-names>A.</given-names></name> <name><surname>Stuckmann</surname> <given-names>N.</given-names></name> <name><surname>Vilbig</surname> <given-names>A.</given-names></name> <name><surname>Lenke</surname> <given-names>M.</given-names></name> <name><surname>Ludwig</surname> <given-names>T.</given-names></name> <name><surname>Bode</surname> <given-names>A.</given-names></name> <name><surname>Schleifer</surname> <given-names>K.</given-names></name></person-group> (<year>2004</year>). <article-title>ARB: a software environment for sequence data</article-title>. <source>Nucleic Acids Res.</source> <volume>32</volume>, <fpage>1363</fpage>&#x02013;<lpage>1371</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkh293</pub-id><pub-id pub-id-type="pmid">14985472</pub-id></citation></ref>
<ref id="B32"><citation citation-type="web"><person-group person-group-type="author"><name><surname>Miller</surname> <given-names>M.</given-names></name> <name><surname>Holder</surname> <given-names>M.</given-names></name> <name><surname>Vos</surname> <given-names>R.</given-names></name> <name><surname>Midford</surname> <given-names>P.</given-names></name> <name><surname>Liebowitz</surname> <given-names>T.</given-names></name> <name><surname>Chan</surname> <given-names>L.</given-names></name> <name><surname>Hoover</surname> <given-names>P.</given-names></name> <name><surname>Warnow</surname> <given-names>T.</given-names></name></person-group> (<year>2009</year>). <source>The CIPRES Portals</source>. Available at: <uri xlink:href="http://www.phylo.org/sub_sections/portal">http://www.phylo.org/sub_sections/portal</uri> [Accessed: 2009-08-04]. (Archived by WebCite(r) at <uri xlink:href="http://www.webcitation.org/5imQlJeQa">http://www.webcitation.org/5imQlJeQa</uri>) [Online]. [Accessed].</citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mincer</surname> <given-names>T. J.</given-names></name> <name><surname>Church</surname> <given-names>M. J.</given-names></name> <name><surname>Taylor</surname> <given-names>L. T.</given-names></name> <name><surname>Preston</surname> <given-names>C.</given-names></name> <name><surname>Karl</surname> <given-names>D. M.</given-names></name> <name><surname>Delong</surname> <given-names>E. F.</given-names></name></person-group> (<year>2007</year>). <article-title>Quantitative distribution of presumptive archaeal and bacterial nitrifiers in Monterey Bay and the North Pacific Subtropical Gyre</article-title>. <source>Environ. Microbiol.</source> <volume>9</volume>, <fpage>1162</fpage>&#x02013;<lpage>1175</lpage>.<pub-id pub-id-type="doi">10.1111/j.1462-2920.2007.01239.x</pub-id><pub-id pub-id-type="pmid">17472632</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Moin</surname> <given-names>N. S.</given-names></name> <name><surname>Nelson</surname> <given-names>K. A.</given-names></name> <name><surname>Bush</surname> <given-names>A.</given-names></name> <name><surname>Bernhard</surname> <given-names>A. E.</given-names></name></person-group> (<year>2009</year>). <article-title>Distribution and diversity of archaeal and bacterial ammonia oxidizers in salt marsh sediments</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>75</volume>, <fpage>7461</fpage>&#x02013;<lpage>7468</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.01001-09</pub-id><pub-id pub-id-type="pmid">19801456</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mosier</surname> <given-names>A. C.</given-names></name> <name><surname>Francis</surname> <given-names>C. A.</given-names></name></person-group> (<year>2008</year>). <article-title>Relative abundance and diversity of ammonia-oxidizing archaea and bacteria in the San Francisco Bay estuary</article-title>. <source>Environ. Microbiol.</source> <volume>10</volume>, <fpage>3002</fpage>&#x02013;<lpage>3016</lpage>.<pub-id pub-id-type="doi">10.1111/j.1462-2920.2008.01764.x</pub-id><pub-id pub-id-type="pmid">18973621</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mosier</surname> <given-names>A. C.</given-names></name> <name><surname>Francis</surname> <given-names>C. A.</given-names></name></person-group> (<year>2011</year>). <article-title>Determining the distribution of marine and coastal ammonia-oxidizing archaea and bacteria using a quantitative approach</article-title>. <source>Meth. Enzymol.</source> <volume>486</volume>, <fpage>205</fpage>&#x02013;<lpage>221</lpage>.<pub-id pub-id-type="doi">10.1016/B978-0-12-381294-0.00009-2</pub-id><pub-id pub-id-type="pmid">21185437</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mosier</surname> <given-names>A. C.</given-names></name> <name><surname>Lund</surname> <given-names>M. B.</given-names></name> <name><surname>Francis</surname> <given-names>C. A.</given-names></name></person-group> (<year>2012</year>). <article-title>Ecophysiology of an ammonia-oxidizing archaeon adapted to low-salinity habitats</article-title>. <source>Microb. Ecol.</source><pub-id pub-id-type="doi">10.1007/s00248-012-0075-1</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mussmann</surname> <given-names>M.</given-names></name> <name><surname>Brito</surname> <given-names>I.</given-names></name> <name><surname>Pitcher</surname> <given-names>A.</given-names></name> <name><surname>Sinninghe Damst&#x000E9;</surname> <given-names>J. S.</given-names></name> <name><surname>Hatzenpichler</surname> <given-names>R.</given-names></name> <name><surname>Richter</surname> <given-names>A.</given-names></name> <name><surname>Nielsen</surname> <given-names>J. L.</given-names></name> <name><surname>Nielsen</surname> <given-names>P. H.</given-names></name> <name><surname>M&#x000FC;ller</surname> <given-names>A.</given-names></name> <name><surname>Daims</surname> <given-names>H.</given-names></name> <name><surname>Wagner</surname> <given-names>M.</given-names></name> <name><surname>Head</surname> <given-names>I. M.</given-names></name></person-group> (<year>2011</year>). <article-title>Thaumarchaeotes abundant in refinery nitrifying sludges express amoA but are not obligate autotrophic ammonia oxidizers</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>108</volume>, <fpage>16771</fpage>&#x02013;<lpage>16776</lpage>.<pub-id pub-id-type="doi">10.1073/pnas.1106427108</pub-id><pub-id pub-id-type="pmid">21930919</pub-id></citation></ref>
<ref id="B39"><citation citation-type="web"><person-group person-group-type="author"><name><surname>Oksanen</surname> <given-names>J.</given-names></name> <name><surname>Blanchet</surname> <given-names>F. G.</given-names></name> <name><surname>Kindt</surname> <given-names>R.</given-names></name> <name><surname>Legendre</surname> <given-names>P.</given-names></name> <name><surname>O&#x02019;hara</surname> <given-names>R. B.</given-names></name> <name><surname>Simpson</surname> <given-names>G. L.</given-names></name> <name><surname>Solymos</surname> <given-names>P.</given-names></name> <name><surname>Henry</surname> <given-names>M.</given-names></name> <name><surname>Stevens</surname> <given-names>H.</given-names></name> <name><surname>Wagner</surname> <given-names>H.</given-names></name></person-group> (<year>2010</year>). <source>Vegan: Community Ecology Package. R Package Version 1.17-3</source>. Available at: <uri xlink:href="http://CRAN.R-project.org/package=vegan">http://CRAN.R-project.org/package=vegan</uri></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Park</surname> <given-names>H.</given-names></name> <name><surname>Wells</surname> <given-names>G.</given-names></name> <name><surname>Bae</surname> <given-names>H.</given-names></name> <name><surname>Criddle</surname> <given-names>C.</given-names></name> <name><surname>Francis</surname> <given-names>C.</given-names></name></person-group> (<year>2006</year>). <article-title>Occurrence of ammonia-oxidizing archaea in wastewater treatment plant bioreactors</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>72</volume>, <fpage>5643</fpage>&#x02013;<lpage>5647</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.00402-06</pub-id><pub-id pub-id-type="pmid">16885322</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pester</surname> <given-names>M.</given-names></name> <name><surname>Rattei</surname> <given-names>T.</given-names></name> <name><surname>Flechl</surname> <given-names>S.</given-names></name> <name><surname>Gr&#x000F6;ngr&#x000F6;ft</surname> <given-names>A.</given-names></name> <name><surname>Richter</surname> <given-names>A.</given-names></name> <name><surname>Overmann</surname> <given-names>J.</given-names></name> <name><surname>Reinhold-Hurek</surname> <given-names>B.</given-names></name> <name><surname>Loy</surname> <given-names>A.</given-names></name> <name><surname>Wagner</surname> <given-names>M.</given-names></name></person-group> (<year>2012</year>). <article-title>amoA-Based consensus phylogeny of ammonia-oxidizing archaea and deep sequencing of amoA genes from soils of four different geographic regions</article-title>. <source>Environ. Microbiol.</source> <volume>14</volume>, <fpage>525</fpage>&#x02013;<lpage>539</lpage>.<pub-id pub-id-type="doi">10.1111/j.1462-2920.2011.02666.x</pub-id><pub-id pub-id-type="pmid">22141924</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pond</surname> <given-names>S. L.</given-names></name> <name><surname>Frost</surname> <given-names>S. D.</given-names></name></person-group> (<year>2005</year>). <article-title>Datamonkey: rapid detection of selective pressure on individual sites of codon alignments</article-title>. <source>Bioinformatics</source> <volume>21</volume>, <fpage>2531</fpage>&#x02013;<lpage>2533</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/bti320</pub-id><pub-id pub-id-type="pmid">15713735</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pond</surname> <given-names>S. L.</given-names></name> <name><surname>Frost</surname> <given-names>S. D.</given-names></name> <name><surname>Muse</surname> <given-names>S. V.</given-names></name></person-group> (<year>2005</year>). <article-title>HyPhy: hypothesis testing using phylogenies</article-title>. <source>Bioinformatics</source> <volume>21</volume>, <fpage>676</fpage>&#x02013;<lpage>679</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/bti320</pub-id><pub-id pub-id-type="pmid">15509596</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Santoro</surname> <given-names>A. E.</given-names></name> <name><surname>Francis</surname> <given-names>C. A.</given-names></name> <name><surname>De Sieyes</surname> <given-names>N. R.</given-names></name> <name><surname>Boehm</surname> <given-names>A. B.</given-names></name></person-group> (<year>2008</year>). <article-title>Shifts in the relative abundance of ammonia-oxidizing bacteria and archaea across physicochemical gradients in a subterranean estuary</article-title>. <source>Environ. Microbiol.</source> <volume>10</volume>, <fpage>1068</fpage>&#x02013;<lpage>1079</lpage>.<pub-id pub-id-type="doi">10.1111/j.1462-2920.2007.01547.x</pub-id><pub-id pub-id-type="pmid">18266758</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Santoro</surname> <given-names>A. E.</given-names></name> <name><surname>Casciotti</surname> <given-names>K. L.</given-names></name> <name><surname>Francis</surname> <given-names>C. A.</given-names></name></person-group> (<year>2010</year>). <article-title>Activity, abundance, and diversity of nitrifying archaea and bacteria in the central California current</article-title>. <source>Environ. Microbiol.</source> <volume>12</volume>, <fpage>1989</fpage>&#x02013;<lpage>2006</lpage>.<pub-id pub-id-type="doi">10.1111/j.1462-2920.2010.02205.x</pub-id><pub-id pub-id-type="pmid">20345944</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schloss</surname> <given-names>P. D.</given-names></name> <name><surname>Westcott</surname> <given-names>S. L.</given-names></name> <name><surname>Ryabin</surname> <given-names>T.</given-names></name> <name><surname>Hall</surname> <given-names>J. R.</given-names></name> <name><surname>Hartmann</surname> <given-names>M.</given-names></name> <name><surname>Hollister</surname> <given-names>E. B.</given-names></name> <name><surname>Lesniewski</surname> <given-names>R. A.</given-names></name> <name><surname>Oakley</surname> <given-names>B. B.</given-names></name> <name><surname>Parks</surname> <given-names>D. H.</given-names></name> <name><surname>Robinson</surname> <given-names>C. J.</given-names></name> <name><surname>Sahl</surname> <given-names>J. W.</given-names></name> <name><surname>Stres</surname> <given-names>B.</given-names></name> <name><surname>Thallinger</surname> <given-names>G. G.</given-names></name> <name><surname>Van Horn</surname> <given-names>D. J.</given-names></name> <name><surname>Weber</surname> <given-names>C. F.</given-names></name></person-group> (<year>2009</year>). <article-title>Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>75</volume>, <fpage>7537</fpage>&#x02013;<lpage>7541</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.01541-09</pub-id><pub-id pub-id-type="pmid">19801464</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stamatakis</surname> <given-names>A.</given-names></name></person-group> (<year>2006</year>). <article-title>RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models</article-title>. <source>Bioinformatics</source> <volume>22</volume>, <fpage>2688</fpage>&#x02013;<lpage>2690</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btl446</pub-id><pub-id pub-id-type="pmid">16928733</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tamura</surname> <given-names>K.</given-names></name> <name><surname>Dudley</surname> <given-names>J.</given-names></name> <name><surname>Nei</surname> <given-names>M.</given-names></name> <name><surname>Kumar</surname> <given-names>S.</given-names></name></person-group> (<year>2007</year>). <article-title>MEGA4: molecular evolutionary genetics analysis (mega) software version 4.0</article-title> <source>. Mol. Biol. Evol.</source> <volume>24</volume>, <fpage>1596</fpage>&#x02013;<lpage>1599</lpage>.</citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tourna</surname> <given-names>M.</given-names></name> <name><surname>Stieglmeier</surname> <given-names>M.</given-names></name> <name><surname>Spang</surname> <given-names>A.</given-names></name> <name><surname>Konneke</surname> <given-names>M.</given-names></name> <name><surname>Schintlmeister</surname> <given-names>A.</given-names></name> <name><surname>Urich</surname> <given-names>T.</given-names></name> <name><surname>Engel</surname> <given-names>M.</given-names></name> <name><surname>Schloter</surname> <given-names>M.</given-names></name> <name><surname>Wagner</surname> <given-names>M.</given-names></name> <name><surname>Richter</surname> <given-names>A.</given-names></name> <name><surname>Schleper</surname> <given-names>C.</given-names></name></person-group> (<year>2011</year>). <article-title><italic>Nitrososphaera viennensis</italic>, an ammonia oxidizing archaeon from soil</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>108</volume>, <fpage>8420</fpage>&#x02013;<lpage>8425</lpage>.</citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Treusch</surname> <given-names>A.</given-names></name> <name><surname>Leininger</surname> <given-names>S.</given-names></name> <name><surname>Kletzin</surname> <given-names>A.</given-names></name> <name><surname>Schuster</surname> <given-names>S.</given-names></name> <name><surname>Klenk</surname> <given-names>H.</given-names></name> <name><surname>Schleper</surname> <given-names>C.</given-names></name></person-group> (<year>2005</year>). <article-title>Novel genes for nitrite reductase and amo-related proteins indicate a role of uncultivated mesophilic crenarchaeota in nitrogen cycling</article-title>. <source>Environ. Microbiol.</source> <volume>7</volume>, <fpage>1985</fpage>&#x02013;<lpage>1995</lpage>.<pub-id pub-id-type="doi">10.1111/j.1462-2920.2005.00906.x</pub-id><pub-id pub-id-type="pmid">16309395</pub-id></citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Venter</surname> <given-names>J.</given-names></name> <name><surname>Remington</surname> <given-names>K.</given-names></name> <name><surname>Heidelberg</surname> <given-names>J.</given-names></name> <name><surname>Halpern</surname> <given-names>A.</given-names></name> <name><surname>Rusch</surname> <given-names>D.</given-names></name> <name><surname>Eisen</surname> <given-names>J.</given-names></name> <name><surname>Wu</surname> <given-names>D.</given-names></name> <name><surname>Paulsen</surname> <given-names>I.</given-names></name> <name><surname>Nelson</surname> <given-names>K.</given-names></name> <name><surname>Nelson</surname> <given-names>W.</given-names></name> <name><surname>Fouts</surname> <given-names>D.</given-names></name> <name><surname>Levy</surname> <given-names>S.</given-names></name> <name><surname>Knap</surname> <given-names>A.</given-names></name> <name><surname>Lomas</surname> <given-names>M.</given-names></name> <name><surname>Nealson</surname> <given-names>K.</given-names></name> <name><surname>White</surname> <given-names>O.</given-names></name> <name><surname>Peterson</surname> <given-names>J.</given-names></name> <name><surname>Hoffman</surname> <given-names>J.</given-names></name> <name><surname>Parsons</surname> <given-names>R.</given-names></name> <name><surname>Baden-Tillson</surname> <given-names>H.</given-names></name> <name><surname>Pfannkoch</surname> <given-names>C.</given-names></name> <name><surname>Rogers</surname> <given-names>Y.</given-names></name> <name><surname>Smith</surname> <given-names>H.</given-names></name></person-group> (<year>2004</year>). <article-title>Environmental genome shotgun sequencing of the Sargasso Sea</article-title>. <source>Science</source> <volume>304</volume>, <fpage>66</fpage>&#x02013;<lpage>74</lpage>.<pub-id pub-id-type="doi">10.1126/science.1093857</pub-id><pub-id pub-id-type="pmid">15001713</pub-id></citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wankel</surname> <given-names>S. D.</given-names></name> <name><surname>Mosier</surname> <given-names>A. C.</given-names></name> <name><surname>Hansel</surname> <given-names>C. M.</given-names></name> <name><surname>Paytan</surname> <given-names>A.</given-names></name> <name><surname>Francis</surname> <given-names>C. A.</given-names></name></person-group> (<year>2011</year>). <article-title>Spatial variability in nitrification rates and ammonia-oxidizing microbial communities in the agriculturally impacted Elkhorn Slough estuary, California</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>77</volume>, <fpage>269</fpage>&#x02013;<lpage>280</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.01318-10</pub-id><pub-id pub-id-type="pmid">21057023</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wells</surname> <given-names>G. F.</given-names></name> <name><surname>Park</surname> <given-names>H.-D.</given-names></name> <name><surname>Yeung</surname> <given-names>C.-H.</given-names></name> <name><surname>Eggleston</surname> <given-names>B.</given-names></name> <name><surname>Francis</surname> <given-names>C. A.</given-names></name> <name><surname>Criddle</surname> <given-names>C. S.</given-names></name></person-group> (<year>2009</year>). <article-title>Ammonia-oxidizing communities in a highly aerated full-scale activated sludge bioreactor: betaproteobacterial dynamics and low relative abundance of crenarchaea</article-title>. <source>Environ. Microbiol.</source> <volume>11</volume>, <fpage>2310</fpage>&#x02013;<lpage>2328</lpage>.<pub-id pub-id-type="doi">10.1111/j.1462-2920.2009.01958.x</pub-id><pub-id pub-id-type="pmid">19515200</pub-id></citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yilmaz</surname> <given-names>P.</given-names></name> <name><surname>Kottmann</surname> <given-names>R.</given-names></name> <name><surname>Field</surname> <given-names>D.</given-names></name> <name><surname>Knight</surname> <given-names>R.</given-names></name> <name><surname>Cole</surname> <given-names>J. R.</given-names></name> <name><surname>Amaral-Zettler</surname> <given-names>L.</given-names></name> <name><surname>Gilbert</surname> <given-names>J. A.</given-names></name> <name><surname>Karsch-Mizrachi</surname> <given-names>I.</given-names></name> <name><surname>Johnston</surname> <given-names>A.</given-names></name> <name><surname>Cochrane</surname> <given-names>G.</given-names></name> <name><surname>Vaughan</surname> <given-names>R.</given-names></name> <name><surname>Hunter</surname> <given-names>C.</given-names></name> <name><surname>Park</surname> <given-names>J.</given-names></name> <name><surname>Morrison</surname> <given-names>N.</given-names></name> <name><surname>Rocca-Serra</surname> <given-names>P.</given-names></name> <name><surname>Sterk</surname> <given-names>P.</given-names></name> <name><surname>Arumugam</surname> <given-names>M.</given-names></name> <name><surname>Bailey</surname> <given-names>M.</given-names></name> <name><surname>Baumgartner</surname> <given-names>L.</given-names></name> <name><surname>Birren</surname> <given-names>B. W.</given-names></name> <name><surname>Blaser</surname> <given-names>M. J.</given-names></name> <name><surname>Bonazzi</surname> <given-names>V.</given-names></name> <name><surname>Booth</surname> <given-names>T.</given-names></name> <name><surname>Bork</surname> <given-names>P.</given-names></name> <name><surname>Bushman</surname> <given-names>F. D.</given-names></name> <name><surname>Buttigieg</surname> <given-names>P. L.</given-names></name> <name><surname>Chain</surname> <given-names>P. S. G.</given-names></name> <name><surname>Charlson</surname> <given-names>E.</given-names></name> <name><surname>Costello</surname> <given-names>E. K.</given-names></name> <name><surname>Huot-Creasy</surname> <given-names>H.</given-names></name> <name><surname>Dawyndt</surname> <given-names>P.</given-names></name> <name><surname>Desantis</surname> <given-names>T.</given-names></name> <name><surname>Fierer</surname> <given-names>N.</given-names></name> <name><surname>Fuhrman</surname> <given-names>J. A.</given-names></name> <name><surname>Gallery</surname> <given-names>R. E.</given-names></name> <name><surname>Gevers</surname> <given-names>D.</given-names></name> <name><surname>Gibbs</surname> <given-names>R. A.</given-names></name> <name><surname>San Gil</surname> <given-names>I.</given-names></name> <name><surname>Gonzalez</surname> <given-names>A.</given-names></name> <name><surname>Gordon</surname> <given-names>J. I.</given-names></name> <name><surname>Guralnick</surname> <given-names>R.</given-names></name> <name><surname>Hankeln</surname> <given-names>W.</given-names></name> <name><surname>Highlander</surname> <given-names>S.</given-names></name> <name><surname>Hugenholtz</surname> <given-names>P.</given-names></name> <name><surname>Jansson</surname> <given-names>J.</given-names></name> <name><surname>Kau</surname> <given-names>A. L.</given-names></name> <name><surname>Kelley</surname> <given-names>S. T.</given-names></name> <name><surname>Kennedy</surname> <given-names>J.</given-names></name> <name><surname>Knights</surname> <given-names>D.</given-names></name> <name><surname>Koren</surname> <given-names>O.</given-names></name> <name><surname>Kuczynski</surname> <given-names>J.</given-names></name> <name><surname>Kyrpides</surname> <given-names>N.</given-names></name> <name><surname>Larsen</surname> <given-names>R.</given-names></name> <name><surname>Lauber</surname> <given-names>C. L.</given-names></name> <name><surname>Legg</surname> <given-names>T.</given-names></name> <name><surname>Ley</surname> <given-names>R. E.</given-names></name> <name><surname>Lozupone</surname> <given-names>C. A.</given-names></name> <name><surname>Ludwig</surname> <given-names>W.</given-names></name> <name><surname>Lyons</surname> <given-names>D.</given-names></name> <name><surname>Maguire</surname> <given-names>E.</given-names></name> <name><surname>Meth&#x000E9;</surname> <given-names>B. A.</given-names></name> <name><surname>Meyer</surname> <given-names>F.</given-names></name> <name><surname>Muegge</surname> <given-names>B.</given-names></name> <name><surname>Nakielny</surname> <given-names>S.</given-names></name> <name><surname>Nelson</surname> <given-names>K. E.</given-names></name> <name><surname>Nemergut</surname> <given-names>D.</given-names></name> <name><surname>Neufeld</surname> <given-names>J. D.</given-names></name> <name><surname>Newbold</surname> <given-names>L. K.</given-names></name> <name><surname>Oliver</surname> <given-names>A. E.</given-names></name> <name><surname>Pace</surname> <given-names>N. R.</given-names></name> <name><surname>Palanisamy</surname> <given-names>G.</given-names></name> <name><surname>Peplies</surname> <given-names>J.</given-names></name> <name><surname>Petrosino</surname> <given-names>J.</given-names></name> <name><surname>Proctor</surname> <given-names>L.</given-names></name> <name><surname>Pruesse</surname> <given-names>E.</given-names></name> <name><surname>Quast</surname> <given-names>C.</given-names></name> <name><surname>Raes</surname> <given-names>J.</given-names></name> <name><surname>Ratnasingham</surname> <given-names>S.</given-names></name> <name><surname>Ravel</surname> <given-names>J.</given-names></name> <name><surname>Relman</surname> <given-names>D. A.</given-names></name> <name><surname>Assunta-Sansone</surname> <given-names>S.</given-names></name> <name><surname>Schloss</surname> <given-names>P. D.</given-names></name> <name><surname>Schriml</surname> <given-names>L.</given-names></name> <name><surname>Sinha</surname> <given-names>R.</given-names></name> <name><surname>Smith</surname> <given-names>M. I.</given-names></name> <name><surname>Sodergren</surname> <given-names>E.</given-names></name> <name><surname>Spo</surname> <given-names>A.</given-names></name> <name><surname>Stombaugh</surname> <given-names>J.</given-names></name> <name><surname>Tiedje</surname> <given-names>J. M.</given-names></name> <name><surname>Ward</surname> <given-names>D. V.</given-names></name> <name><surname>Weinstock</surname> <given-names>G. M.</given-names></name> <name><surname>Wendel</surname> <given-names>D.</given-names></name> <name><surname>White</surname> <given-names>O.</given-names></name> <name><surname>Whiteley</surname> <given-names>A.</given-names></name> <name><surname>Wilke</surname> <given-names>A.</given-names></name> <name><surname>Wortman</surname> <given-names>J. R.</given-names></name> <name><surname>Yatsunenko</surname> <given-names>T.</given-names></name> <name><surname>Gl&#x000F6;ckner</surname> <given-names>F. O.</given-names></name></person-group> (<year>2011</year>). <article-title>Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications</article-title>. <source>Nat. Biotechnol.</source> <volume>29</volume>, <fpage>415</fpage>&#x02013;<lpage>420</lpage>.<pub-id pub-id-type="doi">10.1038/nbt.1823</pub-id><pub-id pub-id-type="pmid">21552244</pub-id></citation></ref>
<ref id="B55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>L.-M.</given-names></name> <name><surname>Offre</surname> <given-names>P. R.</given-names></name> <name><surname>He</surname> <given-names>J.-Z.</given-names></name> <name><surname>Verhamme</surname> <given-names>D. T.</given-names></name> <name><surname>Nicol</surname> <given-names>G. W.</given-names></name> <name><surname>Prosser</surname> <given-names>J. I.</given-names></name></person-group> (<year>2010</year>). <article-title>Autotrophic ammonia oxidation by soil thaumarchaea</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>107</volume>, <fpage>17240</fpage>&#x02013;<lpage>17245</lpage>.<pub-id pub-id-type="doi">10.1073/pnas.1002506107</pub-id><pub-id pub-id-type="pmid">20855593</pub-id></citation></ref>
</ref-list>
</back>
</article>