<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Microbiol.</journal-id>
<journal-title>Frontiers in Microbiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Microbiol.</abbrev-journal-title>
<issn pub-type="epub">1664-302X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmicb.2023.1114548</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Microbiology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Influence of soil nutrients on the presence and distribution of CPR bacteria in a long-term crop rotation experiment</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Santana-Pereira</surname>
<given-names>Alinne L. R.</given-names>
</name>
<xref rid="fn0002" ref-type="author-notes"><sup>&#x2020;</sup></xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Moen</surname>
<given-names>Francesco S.</given-names>
</name>
<xref rid="fn0002" ref-type="author-notes"><sup>&#x2020;</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/2126735/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Severance</surname>
<given-names>Beatrice</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/2178251/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Liles</surname>
<given-names>Mark R.</given-names>
</name>
<xref rid="c001" ref-type="corresp"><sup>&#x002A;</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/219367/overview"/>
</contrib>
</contrib-group>
<aff><institution>Department of Biological Sciences, Auburn University</institution>, <addr-line>Auburn, AL</addr-line>, <country>United States</country></aff>
<author-notes>
<fn fn-type="edited-by" id="fn0003"><p>Edited by: Ludmila Chistoserdova, University of Washington, United States</p></fn>
<fn fn-type="edited-by" id="fn0004"><p>Reviewed by: Anne-Kristin Kaster, Karlsruhe Institute of Technology (KIT), Germany; Tobias Goris, German Institute of Human Nutrition Potsdam-Rehbruecke (DIfE), Germany</p></fn>
<corresp id="c001">&#x002A;Correspondence: Mark R. Liles, <email>lilesma@auburn.edu</email></corresp>
<fn fn-type="equal" id="fn0002"><p><sup>&#x2020;</sup>These authors have contributed equally to this work</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>27</day>
<month>07</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>14</volume>
<elocation-id>1114548</elocation-id>
<history>
<date date-type="received">
<day>02</day>
<month>12</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>10</day>
<month>07</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2023 Santana-Pereira, Moen, Severance and Liles.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Santana-Pereira, Moen, Severance and Liles</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>Bacteria affiliated with the Candidate Phyla Radiation (CPR) are a hyper-diverse group of ultra-small bacteria with versatile yet sparse metabolisms. However, most insights into this group come from a surprisingly small number of environments, and recovery of CPR bacteria from soils has been hindered due to their extremely low abundance within complex microbial assemblages. In this study we enriched soil samples from 14 different soil fertility treatments for ultra-small (&#x003C;0.45&#x2009;&#x03BC;m) bacteria in order to study rare soil CPR. 42 samples were sequenced, enabling the reconstruction of 27 quality CPR metagenome-assembled genomes (MAGs) further classified as Parcubacteria/Paceibacteria, Saccharibacteria/Saccharimonadia and ABY1, in addition to representative genomes from Gemmatimonadetes, Dependentiae and Chlamydae phyla. These genomes were fully annotated and used to reconstruct the CPR community across all 14 plots. Additionally, for five of these plots, the entire microbiota was reconstructed using 16S amplification, showing that specific soil CPR may form symbiotic relationships with a varied and circumstantial range of hosts. Cullars CPR had a prevalence of enzymes predicted to degrade plant-derived carbohydrates, which suggests they have a role in plant biomass degradation. Parcubacteria appear to be more apt at microfauna necromass degradation. Cullars Saccharibacteria and a Parcubacteria group were shown to carry a possible aerotolerance mechanism coupled with potential for aerobic respiration, which appear to be a unique adaptation to the oxic soil environment. Reconstruction of CPR communities across treatment plots showed that they were not impacted by changes in nutrient levels or microbiota composition, being only impacted by extreme conditions, causing some CPR to dominate the community. These findings corroborate the understanding that soil-dwelling CPR bacteria have a very broad symbiont range and have metabolic capabilities associated to soil environments which allows them to scavenge resources and form resilient communities. The contributions of these microbial dark matter species to soil ecology and plant interactions will be of significant interest in future studies.</p>
</abstract>
<kwd-group>
<kwd>candidate phyla radiation</kwd>
<kwd>metagenome-assembled genomes</kwd>
<kwd>metabolism</kwd>
<kwd>phylogenetic diversity</kwd>
<kwd>soil ecology</kwd>
</kwd-group>
<counts>
<fig-count count="6"/>
<table-count count="0"/>
<equation-count count="0"/>
<ref-count count="89"/>
<page-count count="13"/>
<word-count count="9734"/>
</counts>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Evolutionary and Genomic Microbiology</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="sec1">
<label>1.</label>
<title>Introduction</title>
<p>Since the discovery of Microgenomatia (OD11) from the Obsidian pools in Yellowstone National Park (<xref ref-type="bibr" rid="ref29">Hugenholtz et al., 1998</xref>), the CPR have been reported in diverse environments leading to the discovery of Parcubacteria (OD1) and Asconditabacteria (SR1) taxa (<xref ref-type="bibr" rid="ref24">Harris et al., 2004</xref>). The systematic study of these uncultured bacteria led to the description of the hyper-diverse monophyletic group described as the CPR (<xref ref-type="bibr" rid="ref7">Brown et al., 2015</xref>), comprising more than 20% of known bacterial diversity (<xref ref-type="bibr" rid="ref28">Hug et al., 2016</xref>; <xref ref-type="bibr" rid="ref62">Parks et al., 2017</xref>), significantly increasing the previously described Patescibacteria clade (<xref ref-type="bibr" rid="ref67">Rinke et al., 2013</xref>).</p>
<p>Members of this group have small cells (0.009&#x2009;&#x00B1;&#x2009;0.002&#x2009;&#x03BC;m<sup>3</sup>) (<xref ref-type="bibr" rid="ref48">Luef et al., 2015</xref>) and patchy metabolisms with incomplete or lacking major pathways such as nucleotide, amino acid and lipid metabolism, TCA cycle or electron transport chains (<xref ref-type="bibr" rid="ref10">Castelle et al., 2018</xref>), and are predicted to rely on alternate mechanisms to generate and exploit the proton motor force (<xref ref-type="bibr" rid="ref85">Wrighton et al., 2012</xref>; <xref ref-type="bibr" rid="ref33">Kantor et al., 2013</xref>). Despite these observed genome reductions, CPR bacteria have innovative features such as alternate fermentation enzymes (<xref ref-type="bibr" rid="ref33">Kantor et al., 2013</xref>), hybrid RuBisCOs that function in an archaeal-like fashion (<xref ref-type="bibr" rid="ref83">Wrighton et al., 2016</xref>), and deployment of other archaeal-like enzymes (<xref ref-type="bibr" rid="ref21">Ghuneim et al., 2018</xref>). Additionally, they have been predicted to participate in nitrogen, sulfur and carbon cycling (<xref ref-type="bibr" rid="ref84">Wrighton et al., 2014</xref>; <xref ref-type="bibr" rid="ref17">Danczak et al., 2017</xref>). The foundation of our current understanding of CPR taxa focused on aquifers, groundwater, and associated sediment (<xref ref-type="bibr" rid="ref85">Wrighton et al., 2012</xref>; <xref ref-type="bibr" rid="ref12">Castelle et al., 2013</xref>; <xref ref-type="bibr" rid="ref7">Brown et al., 2015</xref>; <xref ref-type="bibr" rid="ref48">Luef et al., 2015</xref>; <xref ref-type="bibr" rid="ref2">Anantharaman et al., 2016</xref>; <xref ref-type="bibr" rid="ref17">Danczak et al., 2017</xref>). Therefore, the study of these ultra-small bacteria in other environmental contexts is imperative to better understand the extent of their diversity, metabolic capabilities, and ecological roles.</p>
<p>Soil environments harbor enormous bacterial diversity (<xref ref-type="bibr" rid="ref80">Torsvik and Ovreas, 2002</xref>); however, members of the CPR group have only sparsely been reported in soil microbiome surveys (<xref ref-type="bibr" rid="ref87">Yeates et al., 2000</xref>; <xref ref-type="bibr" rid="ref81">Wagner et al., 2009</xref>; <xref ref-type="bibr" rid="ref40">Lemos et al., 2020</xref>; <xref ref-type="bibr" rid="ref69">Santana-Pereira et al., 2020</xref>). Recovery of soil dwelling CPR is challenging due to their extremely low abundances (<xref ref-type="bibr" rid="ref26">Herrmann et al., 2019</xref>) and intronic 16S rRNA genes that evade amplicon-based surveys (<xref ref-type="bibr" rid="ref7">Brown et al., 2015</xref>). Despite the advent of genome-resolved metagenomics (<xref ref-type="bibr" rid="ref73">Sharon and Banfield, 2013</xref>), the challenges in assembling sequences from complex soil metagenomes has hindered the reconstruction of these low abundant CPR genomes from soil and to date only a few studies have achieved this goal (<xref ref-type="bibr" rid="ref36">Kroeger et al., 2018</xref>; <xref ref-type="bibr" rid="ref82">Woodcroft et al., 2018</xref>; <xref ref-type="bibr" rid="ref40">Lemos et al., 2020</xref>; <xref ref-type="bibr" rid="ref74">Sharrar et al., 2020</xref>; <xref ref-type="bibr" rid="ref55">Nicolas et al., 2021</xref>).</p>
<p>Another significant challenge in accessing CPR genomes is their low relative abundance within soil microbiota. Other studies have used size fractionation to isolate distinct populations of microbial cells (<xref ref-type="bibr" rid="ref64">Portillo et al., 2013</xref>), and to enrich for novel phylogenetic groups including CPR bacteria that may not be observed from studies of bulk soil (<xref ref-type="bibr" rid="ref54">Nakai, 2020</xref>; <xref ref-type="bibr" rid="ref39">Lee et al., 2021</xref>; <xref ref-type="bibr" rid="ref55">Nicolas et al., 2021</xref>). We therefore used an ultra-small (&#x003C;0.45&#x2009;&#x03BC;m) bacterial cell enrichment protocol to enrich for low abundance CPR bacteria, and other members of the rare biosphere, from soil samples of the Cullars Rotation (Auburn, AL). The Cullars Rotation is the oldest ongoing soil fertility experiment in the southern United States, with a 3-year rotation of corn, cotton, soybean and winter legumes across 14 plots that vary significantly in NPK fertilizer and lime amendments (<xref ref-type="bibr" rid="ref89">Zhao et al., 2015</xref>). Direct sequencing of the small cell-derived metagenomic DNA allowed the reconstruction of 31 MAGs predominantly from CPR taxa. CPR genomes were annotated for their predicted potential for biomass degradation, and their abundance across Cullars Rotation soil treatment groups was correlated with differences in nutrient, pH, and metal levels, shedding light into their possible ecological roles and distribution patterns in soil.</p>
</sec>
<sec sec-type="materials|methods" id="sec2">
<label>2.</label>
<title>Materials and methods</title>
<sec id="sec3">
<label>2.1.</label>
<title>Sampling site</title>
<p>The Cullars Rotation is a century old crop rotation experiment alternating between five crops during the course of a year (cotton, crimson clover, corn, wheat and soybean) (<xref ref-type="bibr" rid="ref53">Mitchell et al., 2011</xref>). The rotation is composed of 14 plots for the study of the effects of controlled deficiencies of phosphorus (P), potassium (K), nitrogen (N) and sulfur (S), as well as other soil amendments such as lime (for pH control) and addition of a micronutrient mix containing boron (B), zinc (Zn), manganese (Mn), copper (Cu) and iron (Fe) on crop yield (<xref ref-type="bibr" rid="ref52">Mitchell et al., 2005</xref>) (<xref rid="fig1" ref-type="fig">Figure 1</xref>).</p>
<fig position="float" id="fig1">
<label>Figure 1</label>
<caption>
<p>Representation of Cullars Rotation plots and Fertilization treatments. <bold>(A)</bold>. Schematic representation of Cullars Rotation Plot layout and Soil treatments administered. pH correction: pH correction with lime. Fractions represent the amount added to the plot with Standard Fertilization as reference (3/3). <bold>(B)</bold>. Table with the Standard Fertilization treatments applied to each plot (<xref ref-type="bibr" rid="ref52">Mitchell et al., 2005</xref>). <bold>(C)</bold>. Average soil nutrient parameters from sampled bulk soil. <bold>(D)</bold>. Overview of the methods employed to reconstruct the MAGs from the filtration enrichment (mini-metagenome), reconstruct the MAG community and reconstruct the total bacterial community using 16S amplification from soil total DNA extraction. </p>
</caption>
<graphic xlink:href="fmicb-14-1114548-g001.tif"/>
</fig>
</sec>
<sec id="sec4">
<label>2.2.</label>
<title>Sample collection and soil analysis</title>
<p>Soil samples were obtained from the Cullars Rotation in March 2019. Three topsoil samples from each one of the 14 plots in Cullars Rotation were collected for soil DNA extraction, alongside ~400&#x2009;g of bulk soil for analysis. The samples for DNA extraction were immediately transported to the lab and kept refrigerated at 4&#x00B0;C until processing. Processing of the 42 soil samples (3 replicates/14 plots) were completed on the same day as sampling. The bulk soil samples were submitted for analysis at Auburn University&#x2019;s Soil Testing Laboratory the same day. Soil nutrients were obtained using Mehlich I Extraction and analyzed by ICP. Measurements were obtained for calcium (Ca), K, magnesium (Mg), P, aluminum (Al), B, Cu, Fe, Mn, sodium (Na), Zn, Nitrate-Nitrogen (NO<sub>3</sub>-N), all in parts per million (ppm). Percent carbon &#x00A9;, N, and pH were also measured.</p>
</sec>
<sec id="sec5">
<label>2.3.</label>
<title>Ultra-small cell enrichment</title>
<p>For each sample, 5&#x2009;g of soil was suspended in 25&#x2009;mL of 0.2% tetrasodium pyrophosphate buffer (pH 8.0), which has been shown to dissociate microbial cells from soil particulates (e.g., clay, sand and silt) while retaining the microbial community structure associated with bulk soil (<xref ref-type="bibr" rid="ref64">Portillo et al., 2013</xref>; <xref ref-type="bibr" rid="ref88">Yuan et al., 2018</xref>). Suspensions were homogenized for 10&#x2009;min, with gentle inversions, followed by 10&#x2009;cycles of intermittent vortexing for 5&#x2009;s. The soil slurry was subjected to a low-speed centrifugation (5,000 x <italic>g</italic> for 3&#x2009;min) to precipitate larger soil particulates and eukaryotic cells. The supernatant was collected and serially filtered through a 5.0&#x2009;&#x03BC;m cellulose membrane, then 1.2&#x2009;&#x03BC;m and 0.45&#x2009;&#x03BC;m SUPOR&#x00AE; membranes (Pall Corporation, Port Washington, New York) in tandem. The &#x003C;0.45&#x2009;&#x03BC;m fraction was then pelleted by centrifugation (20,000 x <italic>g</italic> for 20&#x2009;min) and DNA was isolated from the cell pellet using the Power Soil DNA isolation kit (MoBIO Labs) to obtain the &#x003C;0.45&#x2009;&#x03BC;m fraction metagenomic DNA (<xref rid="fig1" ref-type="fig">Figure 1</xref>). There were three topsoil replicates from each of the 14 plots in the Cullars Rotation, resulting in 42 different samples from which a&#x2009;&#x003C;&#x2009;0.45&#x2009;&#x03BC;m fraction metagenome was generated.</p>
</sec>
<sec id="sec6">
<label>2.4.</label>
<title>Shotgun sequencing, binning, and genome annotation</title>
<p>Each &#x003C;0.45&#x2009;&#x03BC;m fraction metagenome was uniquely indexed and paired-end sequenced using an Illumina HiSeq platform in one dedicated run (<xref rid="fig1" ref-type="fig">Figure 1</xref>). Raw reads from all metagenomes were processed using Trimmomatic v0.39 (<xref ref-type="bibr" rid="ref5">Bolger et al., 2014</xref>). The remaining reads from all 42 samples were combined and assembled with MEGAHIT v1.2.9 (<xref ref-type="bibr" rid="ref46">Li et al., 2015</xref>). The reads from each sample were mapped back against the assembled contigs using BWA v0.7.17 (<xref ref-type="bibr" rid="ref44">Li and Durbin, 2009</xref>). Parallel binning was conducted with MetaBAT2 v2.15 (<xref ref-type="bibr" rid="ref32">Kang et al., 2019</xref>), MaxBin 2.0 v2.2.7 (<xref ref-type="bibr" rid="ref86">Wu et al., 2016</xref>) and CONCOCT v1.1.0 (<xref ref-type="bibr" rid="ref1">Alneberg et al., 2014</xref>). Resulting genomic bins were curated using DasTool v1.1.4 (<xref ref-type="bibr" rid="ref75">Sieber et al., 2018</xref>). The completeness and contamination of each bin was assessed using CheckM v1.1.3 (<xref ref-type="bibr" rid="ref61">Parks et al., 2015</xref>) using the standard bacterial marker set and a CPR-specific phylogenetic marker set (<xref ref-type="bibr" rid="ref7">Brown et al., 2015</xref>). Bins with completeness below 65%, contamination above 10% or strain heterogeneity above 10% were dropped from subsequent analyses. Bins with completeness above 85%, contamination below 5% and strain heterogeneity below 5% were considered high-quality MAGs, while bins with completeness above 75%, contamination below 5% and strain heterogeneity below 5% were considered good-quality MAGs and the remainder were considered draft MAGs. To aid in genome assembly and binning, an additional set of reads from a similar sequencing run of a pilot enrichment for Cullars Rotation&#x2019;s Plot C was added to the read pool for a second round of assembly and binning following the same steps and criteria above. The total bins obtained were dereplicated using Derep v0.12.8 (<xref ref-type="bibr" rid="ref57">Olm et al., 2017</xref>) and used in the subsequent analysis.</p>
</sec>
<sec id="sec7">
<label>2.5.</label>
<title>MAG phylogenetic classification</title>
<p>MAGs were taxonomically classified using two strategies: 16S rRNA gene sequence homology and phylogenetic marker gene voting. First, 16S rRNA sequences were extracted using CheckM v1.1.3 (<xref ref-type="bibr" rid="ref61">Parks et al., 2015</xref>) and a custom package for CPR 16S rRNA extraction and intron removal (<xref ref-type="bibr" rid="ref7">Brown et al., 2015</xref>). The MAG-associated 16S rRNA genes were classified against the SILVA database (<xref ref-type="bibr" rid="ref66">Quast et al., 2012</xref>) (accessed February, 2020) using SINA (<xref ref-type="bibr" rid="ref65">Pruesse et al., 2012</xref>) through the web client. A homology threshold of 75% was used for phylum classification and 80% for class classification. For the second approach, 13 phylogenetically informative marker genes (<xref rid="sec23" ref-type="sec">Supplementary Table S1</xref>) corresponding to ribosomal proteins with similar phylogenetic signal to 16S rRNA (<xref ref-type="bibr" rid="ref23">Gregor et al., 2016</xref>), were extracted from each bin using CheckM v1.1.3. The markers were annotated against the nr/nt NCBI database. Then each marker &#x201C;casts a vote&#x201D; according to their taxonomic annotation. The taxonomy with the higher number of &#x201C;votes&#x201D; was assigned to the MAG. Finally, the consensus classification from the two strategies was applied to the MAG. Both strategies used NCBI taxonomy, which is itself based on the taxonomy and nomenclature proposed by <xref ref-type="bibr" rid="ref7">Brown et al. (2015)</xref>.</p>
<p>MAGs classified as CPR were selected for a robust additional classification by constructing a concatenated maximum likelihood tree. A curated collection of reference CPR genomes was constructed by downloading 1,200 CPR genomes from NCBI (accessed October 2019) and filtering out low quality MAGs with the program CheckM v1.1.3 by the following requirements: &#x003E;75% completeness, &#x003C; 5% contamination and strain heterogeneity. The set of 13 phylogenetic markers was extracted from each reference genome. For each marker, the Cullars MAG sequences and the references sequences were aligned using MAFFT v7.475 (<xref ref-type="bibr" rid="ref34">Katoh and Standley, 2013</xref>), trimmed with TrimAl v1.4.1 (<xref ref-type="bibr" rid="ref8">Capella-Gutierrez et al., 2009</xref>). Then, the marker alignments were concatenated using Phylemon 2.0 (<xref ref-type="bibr" rid="ref68">Sanchez et al., 2011</xref>) and manually inspected with Geneious 10.3.<xref rid="fn0001" ref-type="fn"><sup>1</sup></xref> The concatenated alignment of 13 markers was used to generate a maximum likelihood tree with 1,000 bootstraps using RAxML v8.2.9 (<xref ref-type="bibr" rid="ref77">Stamatakis, 2014</xref>). Marker genes from the Dependentiae MAGs were used to root the tree. The final tree was visualized and annotated using iTOL (<xref ref-type="bibr" rid="ref42">Letunic and Bork, 2019</xref>), using <xref ref-type="bibr" rid="ref7">Brown et al. (2015)</xref> proposed nomenclature. In an effort to also comply with the efforts toward an standardized bacterial classification and taxonomy (<xref ref-type="bibr" rid="ref58">Parks et al., 2020</xref>), our MAGs were classified according to GTDB (<xref ref-type="bibr" rid="ref59">Parks et al., 2022</xref>) using GTDB-Tk v2.2 (<xref ref-type="bibr" rid="ref14">Chaumeil et al., 2022</xref>), and utilized the (<xref ref-type="bibr" rid="ref7">Brown et al., 2015</xref>) proposed nomenclature for phylogenetic affiliations.</p>
</sec>
<sec id="sec8">
<label>2.6.</label>
<title>MAG metabolic potential analysis</title>
<p>To assess metabolic and nutrient utilization potential for each of our CPR MAGs, each of the sufficient quality MAGs and members of the same taxa from the curated collection of reference genomes, were annotated using DRAM v1.3.5 (<xref ref-type="bibr" rid="ref72">Shaffer et al., 2020</xref>). This software utilizes a series of databases (UniRef90, PFAM, dbCAN, Refseq viral, VOGDB, and MEROPS peptidase) to annotate genes and reconstruct the organism&#x2019;s metabolic profile. DRAM annotations were analyzed with special attention to carbohydrate utilization <italic>via</italic> carbohydrate-active enzymes (CAZymes) (<xref ref-type="bibr" rid="ref47">Lombard et al., 2014</xref>). Profiles between MAGs and their corresponding references were compared using a 2-way analysis of variance (ANOVA) based on a linear regression model using the lme4 (<xref ref-type="bibr" rid="ref4">Bates et al., 2015</xref>) and emmeans (<xref ref-type="bibr" rid="ref41">Lenth et al., 2019</xref>) packages. Output was analyzed in RStudio v. 4.0.3.</p>
</sec>
<sec id="sec9">
<label>2.7.</label>
<title>Cullars &#x003C;0.45&#x2009;&#x03BC;m fraction bacterial community reconstruction and distribution across Cullars plots</title>
<p>The &#x003C;0.45&#x2009;&#x03BC;m fraction bacterial community from Cullars soil was reconstructed from the assembled MAGs. To estimate MAG counts, reads from each sample were individually mapped back against each MAG using Bowtie2 (<xref ref-type="bibr" rid="ref38">Langmead and Salzberg, 2012</xref>) with high sensitivity. Well mapped reads were counted using SAMTools v1.3 (<xref ref-type="bibr" rid="ref45">Li et al., 2009</xref>). The matrix with read counts per sample was imported into Rstudio. Library sizes were normalized using the cumulative scale sum strategy (CSS) with metagenomeSeq (<xref ref-type="bibr" rid="ref63">Paulson et al., 2013</xref>). Counts were then normalized by genome length by dividing the total read length by the MAG genome size (bp). Coverages below 1% were discarded as inconclusive and assigned to zero (<xref ref-type="bibr" rid="ref43">Levy-Booth et al., 2021</xref>). Fractions were rounded up to generate a list of normalized natural counts. These counts were used to reconstruct MAG community and calculate average MAG relative abundance per plot.</p>
<p>Rarefaction curves for each sample were calculated using the package Ranacapa (<xref ref-type="bibr" rid="ref31">Kandlikar et al., 2018</xref>). For both MAG&#x2013; and 16S rRNA gene-based community assessments, non-metric multidimensional scaling (NMDS) and correlation analysis (CA) were performed with Bray-Curtis calculated distances using the package PhyloSeq (<xref ref-type="bibr" rid="ref51">McMurdie and Holmes, 2013</xref>). Differences in community composition across plots were accessed by performing PERMANOVA with 999 permutations and subsequent multivariate homogeneity of groups dispersions (Betadisper) using the package Vegan (<xref ref-type="bibr" rid="ref56">Oksanen et al., 2013</xref>). The same package was used to perform vector fitting and surface fitting of Cullars Rotation soil parameters with significant correlation to community changes (<italic>p</italic>&#x2009;&#x003C;&#x2009;0.05).</p>
</sec>
<sec id="sec10">
<label>2.8.</label>
<title>Bulk soil bacterial community reconstruction and distribution across Cullars rotation plots</title>
<p>Total DNA was extracted from bulk soil samples which were the same used for CPR cell enrichment. DNA from plots 03, 08, 10 and 11 were used for 16S rRNA gene PCR amplification. The V4 region of the 16S rRNA gene was amplified and paired-end sequenced. Paired-end reads were processed using FLASH (<xref ref-type="bibr" rid="ref49">Magoc and Salzberg, 2011</xref>) to produce raw tags, which were quality filtered following the QIIME2 protocol (<xref ref-type="bibr" rid="ref9">Caporaso et al., 2010</xref>). The clean high-quality tags were compared against the SILVA database and chimeras were removed using UCHIME (<xref ref-type="bibr" rid="ref18">Edgar et al., 2011</xref>). Sequences with &#x003E;97% similarity were clustered into the same operational taxonomic unit (OTU) using Uparse. Taxonomic annotation of OTU representative sequences was carried out against the SILVA SSU database using Mothur (<xref ref-type="bibr" rid="ref70">Schloss et al., 2009</xref>).</p>
<p>OTU absolute abundance was normalized by rarefaction, and the subsequent counts were used for alpha and beta diversity analyses. The community comparisons between plots and ordinations were constructed using the same programs outlines above for the MAG communities. Linear discriminant analysis Effect Size (LEfSE) analysis was conducted using the LEfSe software (<xref ref-type="bibr" rid="ref71">Segata et al., 2011</xref>).</p>
</sec>
<sec id="sec11">
<label>2.9.</label>
<title>Network analysis</title>
<p>Normalized counts from the 10 most abundant phyla in the16S and MAG abundance data were combined to access the possible relationships between the relative abundance of CPR MAGs and 16S ribotypes from soil bacteria. The resulting count table was used to build the network using the package SPIEC-EASI (<xref ref-type="bibr" rid="ref37">Kurtz et al., 2015</xref>). The graphs were drawn using the package iGraph (<xref ref-type="bibr" rid="ref16">Csardi and Nepusz, 2006</xref>) and exported into Cytoscape (<xref ref-type="bibr" rid="ref76">Smoot et al., 2011</xref>) for visualization.</p>
</sec>
</sec>
<sec sec-type="results" id="sec12">
<label>3.</label>
<title>Results</title>
<sec id="sec13">
<label>3.1.</label>
<title>Recovering ultra-small bacteria genomes from Cullars rotation soil</title>
<p>A previous study of a Cullars Rotation soil metagenome revealed that CPR bacteria were present in this soil in extremely low abundance, but were accessible using a direct cloning and sequencing approach that avoided PCR and 16S rRNA gene-associated biases (<xref ref-type="bibr" rid="ref69">Santana-Pereira et al., 2020</xref>). Therefore, for each of three samples from the 14 plots at Cullars Rotation, ultra-small cells &#x003C;0.45&#x2009;&#x03BC;m were enriched from soil samples to directly sequence and obtain CPR genome bins. From the 124 non-replicated bins, a total of 42 genomic bins were curated ranging from 400 Kb to 3.5&#x2009;Mb (average 1.17&#x2009;Mb). We identified 27 MAGs with acceptable quality, which was defined as completeness below 65%, contamination over 10% or strain heterogeneity over 10% (<xref rid="sec23" ref-type="sec">Supplementary Figure S1</xref>). These MAGs had an average completeness of 86.8%, with 5 bins later identified as CPR with 100% completeness, and an average genome size of 1.13&#x2009;Mb in accordance with the reduced genome size characteristic of CPR bacteria. A noticeable exception was MAG DT 16 with a genome size of 3.55&#x2009;Mb and completeness of 78.6%; however, this relatively larger genome was affiliated with the phylum Gemmatimonadetes which is not known to have a small size but has been previously observed to be capable of transmission through a small sized filter (<xref ref-type="bibr" rid="ref64">Portillo et al., 2013</xref>).</p>
</sec>
<sec id="sec14">
<label>3.2.</label>
<title>Taxonomical classification of Cullars MAGs</title>
<p>CPR classification has experienced several revisions (<xref ref-type="bibr" rid="ref67">Rinke et al., 2013</xref>; <xref ref-type="bibr" rid="ref7">Brown et al., 2015</xref>; <xref ref-type="bibr" rid="ref62">Parks et al., 2017</xref>, <xref ref-type="bibr" rid="ref60">2018</xref>), highlighting the challenges associated with conducting phylogenetic analyses of uncultured microorganisms (<xref ref-type="bibr" rid="ref35">Konstantinidis et al., 2017</xref>). Initial studies on CPR taxonomy, based on up ta few thousands of genomes, categorized it as a monophyletic group encompassing several individual candidate phyla, such as Saccharibacteria, and the superphyla Parcubacteria and Microgenomates, each containing numerous candidate phyla (<xref ref-type="bibr" rid="ref7">Brown et al., 2015</xref>; <xref ref-type="bibr" rid="ref2">Anantharaman et al., 2016</xref>; <xref ref-type="bibr" rid="ref28">Hug et al., 2016</xref>). This classification is most congruent to NCBI taxonomy and thus popularized. With the increasing availability of MAGs and genomes, much more comprehensive studies using from 8 to more than 90 thousand genomes made widespread rank normalization and standardization possible, settling on Patescibacteria/CPR as a single phylum (<xref ref-type="bibr" rid="ref62">Parks et al., 2017</xref>, <xref ref-type="bibr" rid="ref60">2018</xref>). This framework was used to develop the Genome Taxonomy DataBase (GTDB) (<xref ref-type="bibr" rid="ref13">Chaumeil et al., 2019</xref>; <xref ref-type="bibr" rid="ref58">Parks et al., 2020</xref>), which proposed new nomenclatures and taxonomic ranks for CPR. Thus the previous candidate phyla Saccharibacteria, Parcubacteria and ABY1, correspond to the classes Saccharimonadia, Paceibacteria and ABY1, respectively (<xref rid="sec23" ref-type="sec">Supplementary Table S2</xref>). Due to their widespread use and congruence with popular NCBI nomenclature, the names CPR, Saccharibacteria and Parcubacteria will be favored throughout the paper.</p>
<p>A collection of 13 phylogenetically informative marker genes (<xref rid="sec23" ref-type="sec">Supplementary Table S1</xref>) and 16S rRNA genes were extracted from Cullars MAGs and were independently compared to the NCBI GenBank nr/nt and SILVA databases, respectively, to form a consensus homology classification of the MAGs. All curated MAGs were classified, with one representative MAG recovered from each of the phyla Actinobacteria, Chlamydiae, Dependentiae and Gemmatimonadetes, and the remaining belonging to the CPR group. Within CPR, there were representatives of the Parcubacteria (16), Saccharibacteria (10) and ABY1 (1) groups (<xref rid="fig2" ref-type="fig">Figure 2</xref>). Additionally, CPR MAGs were classified using the GTDB classifier, showing high congruence with the previous classification, since Parcubacteria, Saccharibacteria and ABY1 MAGs, were classified as Paceibacteria, Saccharimonadia and ABY1, respectively. Moreover, GTDB provided finer classification up to family level. Interestingly, a number of MAGs associated with underrepresented taxa in GTDB were recovered from Cullars Soil, such as representatives of families VEQN01 and CAYRIW01, which have only 3 and 1 reference genomes, respectively.</p>
<fig position="float" id="fig2">
<label>Figure 2</label>
<caption>
<p>Classification of all Cullars MAGs and the proportion of each phyla. &#x002A;Total MAGs does not include MAGs filtered out by the completeness, strain heterogeneity and contamination thresholds. Concatenated maximum likelihood tree of 13 phylogenetic marker genes from Cullars CPR and 920 reference genomes allowed further classification of CPR MAGs. MAGs were classified as Saccharibacteria (Saccharimonadia), ABY1 and Parcubacteria (Paceibacteria). The Parcubacteria group is expanded in the tree due to its intrinsic diversity. Zoomed in regions offers context to MAGs placement. Red dots denote Cullars MAGs.</p>
</caption>
<graphic xlink:href="fmicb-14-1114548-g002.tif"/>
</fig>
<p>Given the high degree of diversity among members of the CPR group, the candidate CPR MAGs were further classified by constructing a concatenated maximum likelihood tree using 13 phylogenetic markers and 920 reference genomes, allowing the selection of closest relatives for metabolic potential comparisons (<xref rid="fig2" ref-type="fig">Figure 2</xref>). The tree confirmed the previous taxonomical assignments. Cullars CPR MAGs affiliated with the Parcubacteria/Paceibacteria class, were all from the order UBA9983_A and distributed across 10 families that correlate to the broad groups Nomurabacteria, Taylorbacteria, Adlerbacteria and Kaiserbacteria in NCBI taxonomy. Additionally, two MAGs (DT 10 and DT 38) clustered with an unclassified CPR MAG to form a deeper branch within Saccharibacteria (<xref rid="fig2" ref-type="fig">Figure 2</xref>). These MAGs were classified by GTDB as belonging to the order UBA4664, which carries only 9 references, corresponding to 1.25% of all Saccharibacteria genomes in the database. Indeed, this low representation, whether due to sampling or environmental rarity, might explain the observed deeper branching, although it might be mitigated with the insertion of more reference genomes. The ABY1 group MAG was further classified into the Uhrbacteria group.</p>
</sec>
<sec id="sec15">
<label>3.3.</label>
<title>Metabolic potential of soil dwelling CPR</title>
<p>The MAGs recovered from Cullars Rotation soil represent three distinct CPR classes including the Saccharibacteria (10), Parcubacteria (16), and ABY1 (1). These MAG genomes (<xref rid="fig3" ref-type="fig">Figure 3</xref>) were compared to a curated collection of 124 genome references from NCBI (Saccharibacteria, 43; Parcubacteria, 49; ABY1, 32) to assess their metabolic potential utilizing the metabolic profiling tool DRAM (<xref rid="sec23" ref-type="sec">Supplementary Figure S2</xref>).</p>
<fig position="float" id="fig3">
<label>Figure 3</label>
<caption>
<p>Energetic metabolic potential of Cullars MAGs. % Complete: level of completeness of the pathway, measured as presence of genes coding for necessary enzymes to carry out each step.</p>
</caption>
<graphic xlink:href="fmicb-14-1114548-g003.tif"/>
</fig>
<p>ABY1, Saccharibacteria, and Parcubacteria MAGs carried partially complete glycolysis, pentose phosphate and reductive pentose phosphate pathways, as consistently observed among these classes of CPR. Additionally, they were found to have nearly complete F-type ATPase. All MAGs were lacking a complete tricarboxylic acid cycle (TCA), subunits of nicotinamide adenine dinucleotide (NADH) dehydrogenase and electron transport chain (ETC), which indicates an anaerobic and fermentative metabolism, although the presence of complete or nearly complete F-type ATPases was widespread. Interestingly, Parcubacteria, namely from the family UBA2103, and Saccharibacteria from Cullars were more likely to carry mostly complete operons coding for cytochrome o ubiquitol oxidase. Cytochrome o functions by catalyzing the oxidation of ubiquinol (CoQH<sub>2</sub>) into ubiquinone (CoQ), coupled with the reduction of O<sub>2</sub> into water, supplying energy to pump H+ into the outer membrane contributing to the formation of a proton motive force (<xref ref-type="bibr" rid="ref3">Aussel et al., 2014</xref>). Interestingly, it was more common among the soil CPR than the references.</p>
<p>The CAZyme database and its enzyme classes were utilized to understand Cullars MAGs carbohydrate metabolic potential, namely glycoside hydrolases (GH, carbohydrate degradation), glycotransferases (GT, carbohydrate biosynthesis), carbohydrate esterases (CE, hydrolysis of carbohydrate esters), carbohydrate-binding modules (CBM, adhesion of carbohydrates), and auxiliary activities (AA, redox coenzymes for lignocellulose degradation) (<xref rid="sec23" ref-type="sec">Supplementary Figure S4</xref>). Most Cullars MAGs contained genes for enzymes associated with starch, cellulose/hemicellulose, and chitin degradation with the top hits among the Saccharibacteria and Parcubacteria, but not present in the Cullars ABY1 MAG (<xref rid="fig4" ref-type="fig">Figure 4</xref>). Interestingly, only Cullars Saccharibacteria carried the potential for lignin degradation. This continues to support soil CPR as containing the necessary enzymes for degradation of complex plant carbohydrates and soil necromass including bacterial and fungal cell walls.</p>
<fig position="float" id="fig4">
<label>Figure 4</label>
<caption>
<p>Carbohydrate degradation potential of Cullars CPR MAGs and their references. Hits correspond to GH genes identified against the CAZy database, which was also used to obtain substract information. Circles represent the averaged hits for references and Cullars MAGs per substrate category.</p>
</caption>
<graphic xlink:href="fmicb-14-1114548-g004.tif"/>
</fig>
<p>ABY1 and Saccharibacteria carbon substrate metabolic profiles were not found to be statistically different from that of the reference genomes available on NCBI (<italic>p</italic>&#x2009;&#x003E;&#x2009;0.05). However, Cullars Parcubacteria MAGs did demonstrate a strongly statistically difference from their references (<italic>p</italic>&#x2009;&#x003C;&#x2009;0.001). Parcubacteria was the most represented CPR in our MAGs and particular samples (i.e., DT.28) contained several hits for particular enzyme classes for peptidoglycan and chitin degradation, suggesting these soil microorganisms may be particularly adept for scavenging dead or decaying soil microflora.</p>
<p>Cullars CPR MAGs did not demonstrate any ability to participate in nitrogen or methane cycling. Select Saccharibacteria were predicted to have some capacity for dissimilatory and assimilatory sulfate reduction. Cullars MAGs were predicted to degrade organic forms of nitrogen, particularly, the amino acid methionine in the cysteine biosynthesis pathway. Likewise, Cullars CPR had many predicted peptidases, indicating the ability to degrade peptides from the soil environment.</p>
</sec>
<sec id="sec16">
<label>3.4.</label>
<title>Distribution of the ultra-small bacteria in Cullars rotation soil</title>
<p>Sampling the Cullars Rotation soil fertility plots offers a unique opportunity to assess the influence of nutrient availability and soil parameters on the soil CPR community, due to its century-old controlled nutrient deficiency experiments (<xref ref-type="bibr" rid="ref53">Mitchell et al., 2011</xref>) (<xref rid="fig1" ref-type="fig">Figure 1</xref>). To reconstruct the MAG community in each plot, reads from each sample were mapped back against the curated MAGs and normalized to account for variations in library size and genome length (<xref rid="fig1" ref-type="fig">Figure 1</xref>). The normalized, filtered counts were used to reconstruct the communities in each of the plots in Cullars Rotation (<xref rid="fig5" ref-type="fig">Figure 5A</xref>). Additionally, soil nutrient content for each plot was also measured creating a profile that could be correlated with changes in community structure (<xref rid="fig5" ref-type="fig">Figure 5B</xref>).</p>
<fig position="float" id="fig5">
<label>Figure 5</label>
<caption>
<p><bold>(A)</bold> MAG community reconstruction from Cullars soil at different plots corresponding MAG average relative abundance, ranked by Phyla. <bold>(B)</bold> NMDS ordination of all sample communities with Bray-Curtis distances. Colors emphasize the varying nutrients and pH on the plots. Ctrl +: Plot 3 (Standard Fertilization); Ctrl -: Plot C (No fertilization or amendment). Green lines: Aluminum levels; Red lines: Iron levels. Vectors are soil parameters significantly related to community changes (p &#x003C; 0.05). Emphasis on Plot 8 samples, which are pulled in the direction of the highest Fe and Al concentrations. <bold>(C)</bold> Overall community reconstruction from Cullars soil of selected Cullars plots using 16S amplification. <bold>(D)</bold> Ordination plot by PCA of all sample communities with Bray-Curtis distances. </p>
</caption>
<graphic xlink:href="fmicb-14-1114548-g005.tif"/>
</fig>
<p>MAG communities are composed predominantly of Saccharibacteria, followed by Gemmatimonadetes and Dependentiae, even though the latter two have only one representative MAG each. Despite having the higher and most diverse number of representatives, Parcubacteria accounted for a significantly smaller proportion of the community, indicating that taxa dominance was observed in the sampled metagenomes. Interestingly, there was an observed spike in Actinobacteria abundance in Plot 08, which received no pH correction treatment and had a significantly lower pH than other treatment groups.</p>
<p>MAG abundance within the different CPR taxa varied among the different experimental treatment groups (<xref rid="sec23" ref-type="sec">Supplementary Figure S5</xref>). In particular, Plot 08 (no pH correction) showed the greatest shift in community structure, with 78% of the relative abundance corresponding to one Actinobacteria and two Saccharibacteria bins, which were dominant exclusively in this plot, while also having the highest number of recovered Parcubacteria genomes. Dependentiae and Gemmatimonadetes genomes were also recovered across all plots, with the later peaking in abundance under increased phosphorous amendment (Plot 05) and dropping to marginal abundance when there was no soil pH correction (Plot 08) indicating a susceptibility to very acidic soils. Otherwise, the abundance of CPR genomes was not significantly different among the different Cullars Rotation plots.</p>
<p>NMDS ordination was used to visualize communities, dissimilarities in relation to fertilization treatment. To further explore the role of soil parameters in community compositions, the measurements of soil macro and micro-nutrients were modeled to the community distance matrix (<xref rid="fig5" ref-type="fig">Figure 5B</xref>). Community profiles did vary significantly across plots (PERMANOVA, <italic>p</italic>&#x2009;=&#x2009;0.001 and Betadisper, <italic>p</italic>&#x2009;=&#x2009;0.8537) with the soil plots with an acidic pH having the most dramatic effect in clustering (<italic>p</italic>&#x2009;=&#x2009;0.015). The lack of pH correction coupled with the addition of fertilization treatment resulted in the most acidic pH across all plots for Plot 08 (pH&#x2009;=&#x2009;4.08&#x2009;&#x00B1;&#x2009;0.08), surpassing even Plot C which received no amendment nor fertilization (pH&#x2009;=&#x2009;4.58&#x2009;&#x00B1;&#x2009;0.04); however, pH alone could not account for the observed differences (PERMANOVA, <italic>p</italic>&#x2009;=&#x2009;0.199), with similar trends not observed for Plot C, which also has a very acidic soil. The environmental fit analysis reveals that the factors with the strongest correlation to the changes in the community were aluminum (Al) and iron (Fe) concentrations (<italic>p</italic>&#x2009;=&#x2009;0.001 and <italic>p</italic>&#x2009;=&#x2009;0.0001, respectively).</p>
</sec>
<sec id="sec17">
<label>3.5.</label>
<title>Overall microbiota community and network analysis</title>
<p>Next, the relationship between the MAG community and overall bacterial community was explored. 16S rRNA gene amplicon data were used to reconstruct the overall bacterial community, recovering representatives of 40 phyla. The overall community was dominated by Proteobacteria, Actinobacteria, Acidobacteria, Bacteroidota, and Firmicutes (<xref rid="fig5" ref-type="fig">Figure 5C</xref>). Plot 08 had lower alpha diversity (<xref rid="sec23" ref-type="sec">Supplementary Figure S6</xref>) and an overrepresentation of the phylum Chloroflexi. Communities between plots were significantly different (<italic>p</italic>&#x2009;=&#x2009;0.001), with Plot 08 also clustering away from other plots in PCA ordinations (<xref rid="fig5" ref-type="fig">Figure 5D</xref>). Similarly to the MAG community shifts, an acidic pH was correlated with significant 16S community differences (<italic>p</italic>&#x2009;=&#x2009;0.004); however, the only soil parameter with a significant correlation to the community was Al concentration (<italic>p</italic>&#x2009;=&#x2009;0.001).</p>
<p>Linear discriminant analysis effect size (LEfSe) was used to identify taxa highly correlated to community shifts in each plot that can act as biomarkers. 18 OTUs were marked as Plot 08 biomarkers at varying taxonomical levels, with the strongest signal belonging to the phylum Chloroflexi, class Ktedonobacteria (<xref rid="sec23" ref-type="sec">Supplementary Figure S7</xref>).</p>
<p>The potential relationship between Cullars&#x2019; MAGs and members of the soil microbiome was explored by constructing networks between Cullars&#x2019; MAGS and soil phyla (<xref rid="fig6" ref-type="fig">Figure 6</xref>). The MAGs were correlated with the occurrence of diverse phyla, albeit the weight of the edges was low. No particularly strong relationship was identified between the MAGs or any particular taxa in the overall soil microbiome, even when focusing on OTUs and MAGs distinctively associated with Plot 08, such as MAGs DT35 and DT24, or Plot 08-specific biomarkers such as Chloroflexi or Acidobacteria taxa.</p>
<fig position="float" id="fig6">
<label>Figure 6</label>
<caption>
<p><bold>(A)</bold> Network analysis filtered by all the OTUS belonging to the main phyla from the overall microbiota that interacts positively with the Cullars MAGs. Line thickness corresponds to correlation weight <bold>(B)</bold> Network analysis filtered by all the OTUS belonging to the plot biomarkers (defined by LEfS analysis) that interacts positively with the Cullars MAGs. Diamond shaped nodes: Prominent MAGs and biomarkers in Plot 08. </p>
</caption>
<graphic xlink:href="fmicb-14-1114548-g006.tif"/>
</fig>
</sec>
</sec>
<sec sec-type="discussions" id="sec18">
<label>4.</label>
<title>Discussion</title>
<p>Most studies of ultra-small bacteria have centered around the CPR phylum due to their unusual metabolic potential (<xref ref-type="bibr" rid="ref30">Jaffe et al., 2020</xref>; <xref ref-type="bibr" rid="ref15">Chiriac et al., 2022</xref>) and astounding phylogenetic diversity (<xref ref-type="bibr" rid="ref28">Hug et al., 2016</xref>; <xref ref-type="bibr" rid="ref25">He et al., 2021</xref>). Despite their notable presence in groundwater, aquifers and associated sediment (<xref ref-type="bibr" rid="ref48">Luef et al., 2015</xref>; <xref ref-type="bibr" rid="ref2">Anantharaman et al., 2016</xref>; <xref ref-type="bibr" rid="ref15">Chiriac et al., 2022</xref>), it was indicated that aquatic CPR are mobilized from soils (<xref ref-type="bibr" rid="ref26">Herrmann et al., 2019</xref>). Hence soil became a focus of CPR study, supported by the reconstruction of CPR genomes from Amazonian (<xref ref-type="bibr" rid="ref36">Kroeger et al., 2018</xref>) and Antarctic soils (<xref ref-type="bibr" rid="ref82">Woodcroft et al., 2018</xref>), via deep-sequencing.</p>
<p>Following the observation of CPR in Cullars rotation soil in very low abundances (<xref ref-type="bibr" rid="ref69">Santana-Pereira et al., 2020</xref>), we employed an enrichment protocol targeting the small fraction (&#x003C; 0.45&#x2009;&#x03BC;m) of the microbiota for the recovery of soil CPR with much modest sequencing resources then the otherwise terabyte efforts previously reported (<xref ref-type="bibr" rid="ref2">Anantharaman et al., 2016</xref>; <xref ref-type="bibr" rid="ref82">Woodcroft et al., 2018</xref>), although consideration of possible biases and incomplete sampling must be taken into account. This approach allowed the study of 14 different cultivated plots under different fertilization regimens, offering a unique opportunity to observe soil CPR communities under controlled soil nutrient variation. This study demonstrated that CPR are widespread in Cullars Rotation soils, and potentially in the soil environment generally, albeit in low abundance. Genome reduction (&#x003C;1.5Mbp) and sparse metabolisms were also observed in Cullars Rotation soil-derived CPR genomes, pointing toward symbiotic life-styles, as previously observed (<xref ref-type="bibr" rid="ref7">Brown et al., 2015</xref>; <xref ref-type="bibr" rid="ref10">Castelle et al., 2018</xref>; <xref ref-type="bibr" rid="ref26">Herrmann et al., 2019</xref>).</p>
<p>Cullars CPR had similar metabolic restrictions as their aquatic counterparts: incomplete nucleotide, amino-acid and lipid metabolisms. The lack of an TCA cycle and ETC elements, suggest that soil CPR are mainly fermentative, akin to previously characterized CPR (<xref ref-type="bibr" rid="ref10">Castelle et al., 2018</xref>). However, while those CPR were associated with anoxic environments, Cullars CPR were collected from an aerobic fraction of the topsoil (<xref ref-type="bibr" rid="ref11">Castelle et al., 2017</xref>). Interestingly, soil Saccharibacteria and UBA2103 (Parcubacteria/Paceibacteria) were more likely to carry a Cytochrome o complex, suggesting the ability to reduce O<sub>2</sub> to H<sub>2</sub>O. Although uncommon, the presence of ETC components have been described in CPR and associated with O<sub>2</sub> scavenging instead of energy production (<xref ref-type="bibr" rid="ref7">Brown et al., 2015</xref>). Indeed, Coenzyme Q<sub>8</sub> (CoQ<sub>8</sub>) plays an important role in oxidative stress by neutralizing reactive oxygen species (ROS) (<xref ref-type="bibr" rid="ref3">Aussel et al., 2014</xref>), therefore it is possible that the combined action of CoQ<sub>8</sub> and Cytochrome o confers aerotolerance to soil CPR by scavenging ROS and O<sub>2</sub> simultaneously <italic>via</italic> CoQ<sub>8</sub> redox cycling (ubiquinone &#x2194; ubiquinol). However, the simultaneous presence of Cytochrome o and ATPases, suggest that Cullars Saccharibacteria and UBA2103 (Parcubacteria/Paceibacteria) carry some capacity for respiration, which has been previously observed and hypothesized in other CPR recovered from soil (<xref ref-type="bibr" rid="ref55">Nicolas et al., 2021</xref>). Although, no NADH or succinate dehydrogenases were identified in Cullars CPR, ROS scavenging <italic>via</italic> ubiquinol formation could be a source of electrons for Cytochrome o and proton motor force creation. It is noteworthy that most MAGs carried ATPases, even in the absence Cytochrome o, or any other ETC component, suggesting that they might be used in yet unknown ways.</p>
<p>CPR bacteria have been predicted to ferment complex substrates (<xref ref-type="bibr" rid="ref33">Kantor et al., 2013</xref>), and Cullars Saccharibacteria and Parcubacteria were particularly adept at degrading a variety of carbohydrates including plant-derived sugars (<xref ref-type="bibr" rid="ref78">Starr et al., 2018</xref>; <xref ref-type="bibr" rid="ref79">Taubert et al., 2019</xref>), suggesting their participation in plant necromass turn over and Carbon cycling in the soil. However, due to their patchy fermentation pathways, it is unclear if these organisms preferentially derive their energy from direct biomass degradation or from scavenging processed derivatives (<xref ref-type="bibr" rid="ref79">Taubert et al., 2019</xref>; <xref ref-type="bibr" rid="ref19">Figueroa-Gonzalez et al., 2020</xref>).</p>
<p>Despite different selective pressures in aquatic environments and soils, the sugar degradation profiles of Cullars CPR and references were surprisingly similar, with the retention of plant-matter degradation enzymes by the later, thus corroborating the observation that aquatic CPR are mobilized from soils (<xref ref-type="bibr" rid="ref26">Herrmann et al., 2019</xref>). On the other hand, Cullars Parcubacteria displayed distinct enrichment of chitin and peptidoglycan degradation enzymes and cytochrome o complexes, suggesting soil specific adaptations toward scavenging dead or decaying soil microfauna under oxic conditions.</p>
<p>While some CPR have been shown to live an endosymbiotic lifestyle (<xref ref-type="bibr" rid="ref22">Gong et al., 2014</xref>; <xref ref-type="bibr" rid="ref6">Bor et al., 2018</xref>), we did not find specific correlations between CPR MAGs and specific bacterial taxa, which points toward reliance on available bacterial cohorts in each plot for the supply of the needed building blocks (<xref ref-type="bibr" rid="ref2">Anantharaman et al., 2016</xref>), perhaps forming episymbiotic relationships with diverse hosts (<xref ref-type="bibr" rid="ref25">He et al., 2021</xref>), instead of forming strict host-specific interactions. Unsurprisingly, the mini-metagenome approach identified other symbiotic bacteria within these soils including environmental Chlamydia and Dependentiae (TM6), which are other examples of bacterial lifestyles that result in genome reduction (<xref ref-type="bibr" rid="ref50">McCutcheon and Moran, 2011</xref>).</p>
<p>The Cullars Rotation soil fertility experiment offered a unique resource to observe how the CPR community responds to controlled nutrient deficiencies in soil. Although they were not likely directly used by CPR, they have great influence in plant matter input and microbiota activity (<xref ref-type="bibr" rid="ref89">Zhao et al., 2015</xref>). Yet, nutrient availability changes did not influence CPR communities, indicating that their particular niches are not disturbed by the observed fluctuations in the overall microbiota composition and activity, nor by nutrient bioavailability and crop growth. Moreover the lack of taxa-specific trends in CPR distribution across the plots and the ability of all Cullars CPR to withstand and grow under diverse conditions suggests that soil CPR have dynamic ecological roles (<xref ref-type="bibr" rid="ref30">Jaffe et al., 2020</xref>).</p>
<p>CPR communities, and the overall microbiota, were the most affected by a sharp decrease in pH caused by lack of pH correction resulting in heavy metal solubilization and potential toxicity, leading to decreased crop yields (<xref ref-type="bibr" rid="ref52">Mitchell et al., 2005</xref>), microbiota diversity and activity (<xref ref-type="bibr" rid="ref89">Zhao et al., 2015</xref>). Despite that, some Cullars CPR, especially two particular Saccharibacteria MAGs, thrived and outcompeted others, dominating the community. Although CPR have been observed in extreme conditions, usually Microgenomates and Parcubacteria (<xref ref-type="bibr" rid="ref24">Harris et al., 2004</xref>; <xref ref-type="bibr" rid="ref20">Gagen et al., 2020</xref>; <xref ref-type="bibr" rid="ref27">Hosokawa et al., 2021</xref>), Saccharibacteria are not commonly associated with these environments.</p>
<p>More extensive sampling could have the potential to uncover more CPR diversity, since rarefaction curves show that most of the plots were under-sampled. Indeed, Cullars CPR sequences previously captured in a soil metagenomic library (<xref ref-type="bibr" rid="ref69">Santana-Pereira et al., 2020</xref>) were not observed in this study. More stringent filtering (<xref ref-type="bibr" rid="ref55">Nicolas et al., 2021</xref>) or increased sequencing depth could be implemented in future studies to better MAG construction and thus diversity recovery.</p>
<p>Nevertheless, these findings offer valuable insight into the soil dwelling CPR community and their possible ecological niches. We have shown that CPR are rare but widespread in soils, despite differences in nutrient availability and microbiota composition. Although soil CPR share many of the metabolic limitations of groundwater or sediment-associated CPR (<xref ref-type="bibr" rid="ref30">Jaffe et al., 2020</xref>; <xref ref-type="bibr" rid="ref40">Lemos et al., 2020</xref>), there is a notable bias toward degradation of plant-derived matter and even the development of specific adaptations of oxic environments, including the potential for a unique respiration pathway coupled with ROS scavenging. By observing the CPR communities in parallel, we were able to verify that Cullars CPR do not form strict symbiotic relationships, nor depend on specific hosts for survival, behaving like scavenging ectosymbionts. Cullars CPR appear to live in the fringes of the microbiota undisturbed due to their low abundances and resourceful metabolisms, allowing them to thrive under adverse conditions. Future studies should explore the contributions of soil CPR bacteria to important biogeochemical processes and especially plant-microbial interactions.</p>
</sec>
<sec sec-type="data-availability" id="sec19">
<title>Data availability statement</title>
<p>The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found at: <ext-link xlink:href="https://www.ncbi.nlm.nih.gov/genbank/" ext-link-type="uri">https://www.ncbi.nlm.nih.gov/genbank/</ext-link>, BioProject accession number PRJNA901447.</p>
</sec>
<sec id="sec20">
<title>Author contributions</title>
<p>ML conceived of the research and secured funding for this project. AS-P and FM conducted the research experiments and analysed the results. AS-P, FM, and BS conducted the bioinformatic analyses. AS-P was primarily responsible for writing the first draft and all authors edited and approved the manuscript for submission. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec sec-type="funding-information" id="sec21">
<title>Funding</title>
<p>This publication was supported by the Alabama Agricultural Experiment Station and the Hatch program of the National Institute of Food and Agriculture, U.S. Department of Agriculture, Hatch project # ALA021-1-13733.</p>
</sec>
<sec sec-type="COI-statement" id="sec22">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="COI-statement" id="sec143">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<ack>
<p>We thank the members of the Liles laboratory for their support, particularly Stacey LaFrentz for her technical support and advice.</p>
</ack>
<sec id="sec23" sec-type="supplementary-material">
<title>Supplementary material</title>
<p>The Supplementary material for this article can be found online at: <ext-link xlink:href="https://www.frontiersin.org/articles/10.3389/fmicb.2023.1114548/full#supplementary-material" ext-link-type="uri">https://www.frontiersin.org/articles/10.3389/fmicb.2023.1114548/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.pdf" id="SM1" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_1.XLSX" id="SM2" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="ref1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Alneberg</surname> <given-names>J.</given-names></name> <name><surname>Bjarnason</surname> <given-names>B. S.</given-names></name> <name><surname>de Bruijn</surname> <given-names>I.</given-names></name> <name><surname>Schirmer</surname> <given-names>M.</given-names></name> <name><surname>Quick</surname> <given-names>J.</given-names></name> <name><surname>Ijaz</surname> <given-names>U. Z.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Binning metagenomic contigs by coverage and composition</article-title>. <source>Nat. Methods</source> <volume>11</volume>, <fpage>1144</fpage>&#x2013;<lpage>1146</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nmeth.3103</pub-id>, PMID: <pub-id pub-id-type="pmid">25218180</pub-id></citation></ref>
<ref id="ref2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Anantharaman</surname> <given-names>K.</given-names></name> <name><surname>Brown</surname> <given-names>C. T.</given-names></name> <name><surname>Hug</surname> <given-names>L. A.</given-names></name> <name><surname>Sharon</surname> <given-names>I.</given-names></name> <name><surname>Castelle</surname> <given-names>C. J.</given-names></name> <name><surname>Probst</surname> <given-names>A. J.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Thousands of microbial genomes shed light on interconnected biogeochemical processes in an aquifer system</article-title>. <source>Nat. Commun.</source> <volume>7</volume>:<fpage>13219</fpage>. doi: <pub-id pub-id-type="doi">10.1038/ncomms13219</pub-id>, PMID: <pub-id pub-id-type="pmid">27774985</pub-id></citation></ref>
<ref id="ref3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Aussel</surname> <given-names>L.</given-names></name> <name><surname>Pierrel</surname> <given-names>F.</given-names></name> <name><surname>Loiseau</surname> <given-names>L.</given-names></name> <name><surname>Lombard</surname> <given-names>M.</given-names></name> <name><surname>Fontecave</surname> <given-names>M.</given-names></name> <name><surname>Barras</surname> <given-names>F.</given-names></name></person-group> (<year>2014</year>). <article-title>Biosynthesis and physiology of coenzyme Q in bacteria</article-title>. <source>Biochim. Biophys. Acta</source> <volume>1837</volume>, <fpage>1004</fpage>&#x2013;<lpage>1011</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.bbabio.2014.01.015</pub-id>, PMID: <pub-id pub-id-type="pmid">24480387</pub-id></citation></ref>
<ref id="ref4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bates</surname> <given-names>D.</given-names></name> <name><surname>M&#x00E4;chler</surname> <given-names>M.</given-names></name> <name><surname>Bolker</surname> <given-names>B.</given-names></name> <name><surname>Walker</surname> <given-names>S.</given-names></name></person-group> (<year>2015</year>). <article-title>Fitting linear mixed-effects models Usinglme4</article-title>. <source>J. Stat. Softw.</source> <volume>67</volume>:<fpage>i01</fpage>. doi: <pub-id pub-id-type="doi">10.18637/jss.v067.i01</pub-id>, PMID: <pub-id pub-id-type="pmid">37415114</pub-id></citation></ref>
<ref id="ref5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bolger</surname> <given-names>A. M.</given-names></name> <name><surname>Lohse</surname> <given-names>M.</given-names></name> <name><surname>Usadel</surname> <given-names>B.</given-names></name></person-group> (<year>2014</year>). <article-title>Trimmomatic: a flexible trimmer for Illumina sequence data</article-title>. <source>Bioinformatics</source> <volume>30</volume>, <fpage>2114</fpage>&#x2013;<lpage>2120</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btu170</pub-id>, PMID: <pub-id pub-id-type="pmid">24695404</pub-id></citation></ref>
<ref id="ref6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bor</surname> <given-names>B.</given-names></name> <name><surname>McLean</surname> <given-names>J. S.</given-names></name> <name><surname>Foster</surname> <given-names>K. R.</given-names></name> <name><surname>Cen</surname> <given-names>L.</given-names></name> <name><surname>To</surname> <given-names>T. T.</given-names></name> <name><surname>Serrato-Guillen</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Rapid evolution of decreased host susceptibility drives a stable relationship between ultrasmall parasite TM7x and its bacterial host</article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source> <volume>115</volume>, <fpage>12277</fpage>&#x2013;<lpage>12282</lpage>. doi: <pub-id pub-id-type="doi">10.1073/pnas.1810625115</pub-id>, PMID: <pub-id pub-id-type="pmid">30442671</pub-id></citation></ref>
<ref id="ref7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brown</surname> <given-names>C. T.</given-names></name> <name><surname>Hug</surname> <given-names>L. A.</given-names></name> <name><surname>Thomas</surname> <given-names>B. C.</given-names></name> <name><surname>Sharon</surname> <given-names>I.</given-names></name> <name><surname>Castelle</surname> <given-names>C. J.</given-names></name> <name><surname>Singh</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Unusual biology across a group comprising more than 15% of domain bacteria</article-title>. <source>Nature</source> <volume>523</volume>, <fpage>208</fpage>&#x2013;<lpage>211</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nature14486</pub-id>, PMID: <pub-id pub-id-type="pmid">26083755</pub-id></citation></ref>
<ref id="ref8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Capella-Gutierrez</surname> <given-names>S.</given-names></name> <name><surname>Silla-Martinez</surname> <given-names>J. M.</given-names></name> <name><surname>Gabaldon</surname> <given-names>T.</given-names></name></person-group> (<year>2009</year>). <article-title>trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses</article-title>. <source>Bioinformatics</source> <volume>25</volume>, <fpage>1972</fpage>&#x2013;<lpage>1973</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btp348</pub-id>, PMID: <pub-id pub-id-type="pmid">19505945</pub-id></citation></ref>
<ref id="ref9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Caporaso</surname> <given-names>J. G.</given-names></name> <name><surname>Kuczynski</surname> <given-names>J.</given-names></name> <name><surname>Stombaugh</surname> <given-names>J.</given-names></name> <name><surname>Bittinger</surname> <given-names>K.</given-names></name> <name><surname>Bushman</surname> <given-names>F. D.</given-names></name> <name><surname>Costello</surname> <given-names>E. K.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>QIIME allows analysis of high-throughput community sequencing data</article-title>. <source>Nat. Methods</source> <volume>7</volume>, <fpage>335</fpage>&#x2013;<lpage>336</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nmeth.f.303</pub-id>, PMID: <pub-id pub-id-type="pmid">20383131</pub-id></citation></ref>
<ref id="ref10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Castelle</surname> <given-names>C. J.</given-names></name> <name><surname>Brown</surname> <given-names>C. T.</given-names></name> <name><surname>Anantharaman</surname> <given-names>K.</given-names></name> <name><surname>Probst</surname> <given-names>A. J.</given-names></name> <name><surname>Huang</surname> <given-names>R. H.</given-names></name> <name><surname>Banfield</surname> <given-names>J. F.</given-names></name></person-group> (<year>2018</year>). <article-title>Biosynthetic capacity, metabolic variety and unusual biology in the CPR and DPANN radiations</article-title>. <source>Nat. Rev. Microbiol.</source> <volume>16</volume>, <fpage>629</fpage>&#x2013;<lpage>645</lpage>. doi: <pub-id pub-id-type="doi">10.1038/s41579-018-0076-2</pub-id>, PMID: <pub-id pub-id-type="pmid">30181663</pub-id></citation></ref>
<ref id="ref11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Castelle</surname> <given-names>C. J.</given-names></name> <name><surname>Brown</surname> <given-names>C. T.</given-names></name> <name><surname>Thomas</surname> <given-names>B. C.</given-names></name> <name><surname>Williams</surname> <given-names>K. H.</given-names></name> <name><surname>Banfield</surname> <given-names>J. F.</given-names></name></person-group> (<year>2017</year>). <article-title>Unusual respiratory capacity and nitrogen metabolism in a Parcubacterium (OD1) of the candidate phyla radiation</article-title>. <source>Sci. Rep.</source> <volume>7</volume>:<fpage>40101</fpage>. doi: <pub-id pub-id-type="doi">10.1038/srep40101</pub-id>, PMID: <pub-id pub-id-type="pmid">28067254</pub-id></citation></ref>
<ref id="ref12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Castelle</surname> <given-names>C. J.</given-names></name> <name><surname>Hug</surname> <given-names>L. A.</given-names></name> <name><surname>Wrighton</surname> <given-names>K. C.</given-names></name> <name><surname>Thomas</surname> <given-names>B. C.</given-names></name> <name><surname>Williams</surname> <given-names>K. H.</given-names></name> <name><surname>Wu</surname> <given-names>D.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Extraordinary phylogenetic diversity and metabolic versatility in aquifer sediment</article-title>. <source>Nat. Commun.</source> <volume>4</volume>:<fpage>2120</fpage>. doi: <pub-id pub-id-type="doi">10.1038/ncomms3120</pub-id>, PMID: <pub-id pub-id-type="pmid">23979677</pub-id></citation></ref>
<ref id="ref13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chaumeil</surname> <given-names>P. A.</given-names></name> <name><surname>Mussig</surname> <given-names>A. J.</given-names></name> <name><surname>Hugenholtz</surname> <given-names>P.</given-names></name> <name><surname>Parks</surname> <given-names>D. H.</given-names></name></person-group> (<year>2019</year>). <article-title>GTDB-Tk: a toolkit to classify genomes with the genome taxonomy database</article-title>. <source>Bioinformatics</source> <volume>36</volume>, <fpage>1925</fpage>&#x2013;<lpage>1927</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btz848</pub-id>, PMID: <pub-id pub-id-type="pmid">31730192</pub-id></citation></ref>
<ref id="ref14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chaumeil</surname> <given-names>P. A.</given-names></name> <name><surname>Mussig</surname> <given-names>A. J.</given-names></name> <name><surname>Hugenholtz</surname> <given-names>P.</given-names></name> <name><surname>Parks</surname> <given-names>D. H.</given-names></name></person-group> (<year>2022</year>). <article-title>GTDB-Tk v2: memory friendly classification with the genome taxonomy database</article-title>. <source>Bioinformatics</source> <volume>38</volume>, <fpage>5315</fpage>&#x2013;<lpage>5316</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btac672</pub-id>, PMID: <pub-id pub-id-type="pmid">36218463</pub-id></citation></ref>
<ref id="ref15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chiriac</surname> <given-names>M. C.</given-names></name> <name><surname>Bulzu</surname> <given-names>P. A.</given-names></name> <name><surname>Andrei</surname> <given-names>A. S.</given-names></name> <name><surname>Okazaki</surname> <given-names>Y.</given-names></name> <name><surname>Nakano</surname> <given-names>S. I.</given-names></name> <name><surname>Haber</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2022</year>). <article-title>Ecogenomics sheds light on diverse lifestyle strategies in freshwater CPR</article-title>. <source>Microbiome</source> <volume>10</volume>:<fpage>84</fpage>. doi: <pub-id pub-id-type="doi">10.1186/s40168-022-01274-3</pub-id>, PMID: <pub-id pub-id-type="pmid">35659305</pub-id></citation></ref>
<ref id="ref16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Csardi</surname> <given-names>G.</given-names></name> <name><surname>Nepusz</surname> <given-names>T.</given-names></name></person-group> (<year>2006</year>). <article-title>The igraph software package for complex network research</article-title>. <source>InterJ complex syst</source> <volume>1695</volume>, <fpage>1</fpage>&#x2013;<lpage>9</lpage>.</citation></ref>
<ref id="ref17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Danczak</surname> <given-names>R. E.</given-names></name> <name><surname>Johnston</surname> <given-names>M. D.</given-names></name> <name><surname>Kenah</surname> <given-names>C.</given-names></name> <name><surname>Slattery</surname> <given-names>M.</given-names></name> <name><surname>Wrighton</surname> <given-names>K. C.</given-names></name> <name><surname>Wilkins</surname> <given-names>M. J.</given-names></name></person-group> (<year>2017</year>). <article-title>Members of the candidate phyla radiation are functionally differentiated by carbon&#x2013; and nitrogen-cycling capabilities</article-title>. <source>Microbiome</source> <volume>5</volume>:<fpage>112</fpage>. doi: <pub-id pub-id-type="doi">10.1186/s40168-017-0331-1</pub-id>, PMID: <pub-id pub-id-type="pmid">28865481</pub-id></citation></ref>
<ref id="ref18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Edgar</surname> <given-names>R. C.</given-names></name> <name><surname>Haas</surname> <given-names>B. J.</given-names></name> <name><surname>Clemente</surname> <given-names>J. C.</given-names></name> <name><surname>Quince</surname> <given-names>C.</given-names></name> <name><surname>Knight</surname> <given-names>R.</given-names></name></person-group> (<year>2011</year>). <article-title>UCHIME improves sensitivity and speed of chimera detection</article-title>. <source>Bioinformatics</source> <volume>27</volume>, <fpage>2194</fpage>&#x2013;<lpage>2200</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btr381</pub-id>, PMID: <pub-id pub-id-type="pmid">21700674</pub-id></citation></ref>
<ref id="ref19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Figueroa-Gonzalez</surname> <given-names>P. A.</given-names></name> <name><surname>Bornemann</surname> <given-names>T. L. V.</given-names></name> <name><surname>Adam</surname> <given-names>P. S.</given-names></name> <name><surname>Plewka</surname> <given-names>J.</given-names></name> <name><surname>R&#x00E9;v&#x00E9;sz</surname> <given-names>F.</given-names></name> <name><surname>von Hagen</surname> <given-names>C. A.</given-names></name> <etal/></person-group>. (<year>2020</year>). <article-title>Saccharibacteria as organic carbon sinks in hydrocarbon-Fueled communities</article-title>. <source>Front. Microbiol.</source> <volume>11</volume>:<fpage>587782</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fmicb.2020.587782</pub-id>, PMID: <pub-id pub-id-type="pmid">33424787</pub-id></citation></ref>
<ref id="ref20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gagen</surname> <given-names>E. J.</given-names></name> <name><surname>Levett</surname> <given-names>A.</given-names></name> <name><surname>Paz</surname> <given-names>A.</given-names></name> <name><surname>Bostelmann</surname> <given-names>H.</given-names></name> <name><surname>Valadares</surname> <given-names>R. B. S.</given-names></name> <name><surname>Bitencourt</surname> <given-names>J. A. P.</given-names></name> <etal/></person-group>. (<year>2020</year>). <article-title>Accelerating microbial iron cycling promotes re-cementation of surface crusts in iron ore regions</article-title>. <source>Microb. Biotechnol.</source> <volume>13</volume>, <fpage>1960</fpage>&#x2013;<lpage>1971</lpage>. doi: <pub-id pub-id-type="doi">10.1111/1751-7915.13646</pub-id>, PMID: <pub-id pub-id-type="pmid">32812342</pub-id></citation></ref>
<ref id="ref21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ghuneim</surname> <given-names>L. J.</given-names></name> <name><surname>Jones</surname> <given-names>D. L.</given-names></name> <name><surname>Golyshin</surname> <given-names>P. N.</given-names></name> <name><surname>Golyshina</surname> <given-names>O. V.</given-names></name></person-group> (<year>2018</year>). <article-title>Nano-sized and filterable bacteria and archaea: biodiversity and function</article-title>. <source>Front. Microbiol.</source> <volume>9</volume>:<fpage>1971</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fmicb.2018.01971</pub-id>, PMID: <pub-id pub-id-type="pmid">30186275</pub-id></citation></ref>
<ref id="ref22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gong</surname> <given-names>J.</given-names></name> <name><surname>Qing</surname> <given-names>Y.</given-names></name> <name><surname>Guo</surname> <given-names>X.</given-names></name> <name><surname>Warren</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>"Candidatus Sonnebornia yantaiensis", a member of candidate division OD1, as intracellular bacteria of the ciliated protist <italic>Paramecium bursaria</italic> (Ciliophora, Oligohymenophorea)</article-title>. <source>Syst. Appl. Microbiol.</source> <volume>37</volume>, <fpage>35</fpage>&#x2013;<lpage>41</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.syapm.2013.08.007</pub-id>, PMID: <pub-id pub-id-type="pmid">24231291</pub-id></citation></ref>
<ref id="ref23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gregor</surname> <given-names>I.</given-names></name> <name><surname>Droge</surname> <given-names>J.</given-names></name> <name><surname>Schirmer</surname> <given-names>M.</given-names></name> <name><surname>Quince</surname> <given-names>C.</given-names></name> <name><surname>McHardy</surname> <given-names>A. C.</given-names></name></person-group> (<year>2016</year>). <article-title>PhyloPythiaS+: a self-training method for the rapid reconstruction of low-ranking taxonomic bins from metagenomes</article-title>. <source>PeerJ</source> <volume>4</volume>:<fpage>e1603</fpage>. doi: <pub-id pub-id-type="doi">10.7717/peerj.1603</pub-id>, PMID: <pub-id pub-id-type="pmid">26870609</pub-id></citation></ref>
<ref id="ref24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Harris</surname> <given-names>J. K.</given-names></name> <name><surname>Kelley</surname> <given-names>S. T.</given-names></name> <name><surname>Pace</surname> <given-names>N. R.</given-names></name></person-group> (<year>2004</year>). <article-title>New perspective on uncultured bacterial phylogenetic division OP11</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>70</volume>, <fpage>845</fpage>&#x2013;<lpage>849</lpage>. doi: <pub-id pub-id-type="doi">10.1128/AEM.70.2.845-849.2004</pub-id>, PMID: <pub-id pub-id-type="pmid">14766563</pub-id></citation></ref>
<ref id="ref25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>He</surname> <given-names>C.</given-names></name> <name><surname>Keren</surname> <given-names>R.</given-names></name> <name><surname>Whittaker</surname> <given-names>M. L.</given-names></name> <name><surname>Farag</surname> <given-names>I. F.</given-names></name> <name><surname>Doudna</surname> <given-names>J. A.</given-names></name> <name><surname>Cate</surname> <given-names>J. H. D.</given-names></name> <etal/></person-group>. (<year>2021</year>). <article-title>Genome-resolved metagenomics reveals site-specific diversity of episymbiotic CPR bacteria and DPANN archaea in groundwater ecosystems</article-title>. <source>Nat. Microbiol.</source> <volume>6</volume>, <fpage>354</fpage>&#x2013;<lpage>365</lpage>. doi: <pub-id pub-id-type="doi">10.1038/s41564-020-00840-5</pub-id>, PMID: <pub-id pub-id-type="pmid">33495623</pub-id></citation></ref>
<ref id="ref26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Herrmann</surname> <given-names>M.</given-names></name> <name><surname>Wegner</surname> <given-names>C. E.</given-names></name> <name><surname>Taubert</surname> <given-names>M.</given-names></name> <name><surname>Geesink</surname> <given-names>P.</given-names></name> <name><surname>Lehmann</surname> <given-names>K.</given-names></name> <name><surname>Yan</surname> <given-names>L.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>Predominance of Cand. Patescibacteria in groundwater is caused by their preferential mobilization from soils and flourishing under oligotrophic conditions</article-title>. <source>Front. Microbiol.</source> <volume>10</volume>:<fpage>1407</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fmicb.2019.01407</pub-id>, PMID: <pub-id pub-id-type="pmid">31281301</pub-id></citation></ref>
<ref id="ref27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hosokawa</surname> <given-names>S.</given-names></name> <name><surname>Kuroda</surname> <given-names>K.</given-names></name> <name><surname>Narihiro</surname> <given-names>T.</given-names></name> <name><surname>Aoi</surname> <given-names>Y.</given-names></name> <name><surname>Ozaki</surname> <given-names>N.</given-names></name> <name><surname>Ohashi</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2021</year>). <article-title>Cometabolism of the Superphylum Patescibacteria with Anammox bacteria in a Long-term freshwater Anammox column reactor</article-title>. <source>Water</source> <volume>13</volume>:<fpage>208</fpage>. doi: <pub-id pub-id-type="doi">10.3390/w13020208</pub-id></citation></ref>
<ref id="ref28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hug</surname> <given-names>L. A.</given-names></name> <name><surname>Baker</surname> <given-names>B. J.</given-names></name> <name><surname>Anantharaman</surname> <given-names>K.</given-names></name> <name><surname>Brown</surname> <given-names>C. T.</given-names></name> <name><surname>Probst</surname> <given-names>A. J.</given-names></name> <name><surname>Castelle</surname> <given-names>C. J.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>A new view of the tree of life</article-title>. <source>Nat. Microbiol.</source> <volume>1</volume>:<fpage>16048</fpage>. doi: <pub-id pub-id-type="doi">10.1038/nmicrobiol.2016.48</pub-id></citation></ref>
<ref id="ref29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hugenholtz</surname> <given-names>P.</given-names></name> <name><surname>Pitulle</surname> <given-names>C.</given-names></name> <name><surname>Hershberger</surname> <given-names>K. L.</given-names></name> <name><surname>Pace</surname> <given-names>N. R.</given-names></name></person-group> (<year>1998</year>). <article-title>Novel division level bacterial diversity in a Yellowstone hot spring</article-title>. <source>J. Bacteriol.</source> <volume>180</volume>, <fpage>366</fpage>&#x2013;<lpage>376</lpage>. doi: <pub-id pub-id-type="doi">10.1128/JB.180.2.366-376.1998</pub-id>, PMID: <pub-id pub-id-type="pmid">9440526</pub-id></citation></ref>
<ref id="ref30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jaffe</surname> <given-names>A. L.</given-names></name> <name><surname>Castelle</surname> <given-names>C. J.</given-names></name> <name><surname>Matheus Carnevali</surname> <given-names>P. B.</given-names></name> <name><surname>Gribaldo</surname> <given-names>S.</given-names></name> <name><surname>Banfield</surname> <given-names>J. F.</given-names></name></person-group> (<year>2020</year>). <article-title>The rise of diversity in metabolic platforms across the candidate phyla radiation</article-title>. <source>BMC Biol.</source> <volume>18</volume>:<fpage>69</fpage>. doi: <pub-id pub-id-type="doi">10.1186/s12915-020-00804-5</pub-id>, PMID: <pub-id pub-id-type="pmid">32560683</pub-id></citation></ref>
<ref id="ref31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kandlikar</surname> <given-names>G. S.</given-names></name> <name><surname>Gold</surname> <given-names>Z. J.</given-names></name> <name><surname>Cowen</surname> <given-names>M. C.</given-names></name> <name><surname>Meyer</surname> <given-names>R. S.</given-names></name> <name><surname>Freise</surname> <given-names>A. C.</given-names></name> <name><surname>Kraft</surname> <given-names>N. J. B.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Ranacapa: An R package and shiny web app to explore environmental DNA data with exploratory statistics and interactive visualizations</article-title>. <source>F1000Research</source> <volume>7</volume>:<fpage>1734</fpage>. doi: <pub-id pub-id-type="doi">10.12688/f1000research.16680.1</pub-id>, PMID: <pub-id pub-id-type="pmid">30613396</pub-id></citation></ref>
<ref id="ref32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kang</surname> <given-names>D. D.</given-names></name> <name><surname>Li</surname> <given-names>F.</given-names></name> <name><surname>Kirton</surname> <given-names>E.</given-names></name> <name><surname>Thomas</surname> <given-names>A.</given-names></name> <name><surname>Egan</surname> <given-names>R.</given-names></name> <name><surname>An</surname> <given-names>H.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies</article-title>. <source>PeerJ</source> <volume>7</volume>:<fpage>e7359</fpage>. doi: <pub-id pub-id-type="doi">10.7717/peerj.7359</pub-id>, PMID: <pub-id pub-id-type="pmid">31388474</pub-id></citation></ref>
<ref id="ref33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kantor</surname> <given-names>R. S.</given-names></name> <name><surname>Wrighton</surname> <given-names>K. C.</given-names></name> <name><surname>Handley</surname> <given-names>K. M.</given-names></name> <name><surname>Sharon</surname> <given-names>I.</given-names></name> <name><surname>Hug</surname> <given-names>L. A.</given-names></name> <name><surname>Castelle</surname> <given-names>C. J.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Small genomes and sparse metabolisms of sediment-associated bacteria from four candidate phyla</article-title>. <source>MBio</source> <volume>4</volume>, <fpage>e00708</fpage>&#x2013;<lpage>e00713</lpage>. doi: <pub-id pub-id-type="doi">10.1128/mBio.00708-13</pub-id>, PMID: <pub-id pub-id-type="pmid">24149512</pub-id></citation></ref>
<ref id="ref34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Katoh</surname> <given-names>K.</given-names></name> <name><surname>Standley</surname> <given-names>D. M.</given-names></name></person-group> (<year>2013</year>). <article-title>MAFFT multiple sequence alignment software version 7: improvements in performance and usability</article-title>. <source>Mol. Biol. Evol.</source> <volume>30</volume>, <fpage>772</fpage>&#x2013;<lpage>780</lpage>. doi: <pub-id pub-id-type="doi">10.1093/molbev/mst010</pub-id>, PMID: <pub-id pub-id-type="pmid">23329690</pub-id></citation></ref>
<ref id="ref35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Konstantinidis</surname> <given-names>K. T.</given-names></name> <name><surname>Rossello-Mora</surname> <given-names>R.</given-names></name> <name><surname>Amann</surname> <given-names>R.</given-names></name></person-group> (<year>2017</year>). <article-title>Uncultivated microbes in need of their own taxonomy</article-title>. <source>ISME J.</source> <volume>11</volume>, <fpage>2399</fpage>&#x2013;<lpage>2406</lpage>. doi: <pub-id pub-id-type="doi">10.1038/ismej.2017.113</pub-id>, PMID: <pub-id pub-id-type="pmid">28731467</pub-id></citation></ref>
<ref id="ref36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kroeger</surname> <given-names>M. E.</given-names></name> <name><surname>Delmont</surname> <given-names>T. O.</given-names></name> <name><surname>Eren</surname> <given-names>A. M.</given-names></name> <name><surname>Meyer</surname> <given-names>K. M.</given-names></name> <name><surname>Guo</surname> <given-names>J.</given-names></name> <name><surname>Khan</surname> <given-names>K.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>New biological insights into how deforestation in Amazonia affects soil microbial communities using metagenomics and metagenome-assembled genomes</article-title>. <source>Front. Microbiol.</source> <volume>9</volume>:<fpage>1635</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fmicb.2018.01635</pub-id>, PMID: <pub-id pub-id-type="pmid">30083144</pub-id></citation></ref>
<ref id="ref37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kurtz</surname> <given-names>Z. D.</given-names></name> <name><surname>M&#x00FC;ller</surname> <given-names>C. L.</given-names></name> <name><surname>Miraldi</surname> <given-names>E. R.</given-names></name> <name><surname>Littman</surname> <given-names>D. R.</given-names></name> <name><surname>Blaser</surname> <given-names>M. J.</given-names></name> <name><surname>Bonneau</surname> <given-names>R. A.</given-names></name></person-group> (<year>2015</year>). <article-title>Sparse and compositionally robust inference of microbial ecological networks</article-title>. <source>PLoS Comput. Biol.</source> <volume>11</volume>:<fpage>e1004226</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.pcbi.1004226</pub-id>, PMID: <pub-id pub-id-type="pmid">25950956</pub-id></citation></ref>
<ref id="ref38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Langmead</surname> <given-names>B.</given-names></name> <name><surname>Salzberg</surname> <given-names>S. L.</given-names></name></person-group> (<year>2012</year>). <article-title>Fast gapped-read alignment with bowtie 2</article-title>. <source>Nat. Methods</source> <volume>9</volume>, <fpage>357</fpage>&#x2013;<lpage>359</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nmeth.1923</pub-id>, PMID: <pub-id pub-id-type="pmid">22388286</pub-id></citation></ref>
<ref id="ref39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>J.</given-names></name> <name><surname>Kim</surname> <given-names>H. S.</given-names></name> <name><surname>Jo</surname> <given-names>H. Y.</given-names></name> <name><surname>Kwon</surname> <given-names>M. J.</given-names></name></person-group> (<year>2021</year>). <article-title>Revisiting soil bacterial counting methods: optimal soil storage and pretreatment methods and comparison of culture-dependent and -independent methods</article-title>. <source>PLoS One</source> <volume>16</volume>:<fpage>e0246142</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.pone.0261737</pub-id>, PMID: <pub-id pub-id-type="pmid">34972129</pub-id></citation></ref>
<ref id="ref40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lemos</surname> <given-names>L.</given-names></name> <name><surname>Manoharan</surname> <given-names>L.</given-names></name> <name><surname>Mendes</surname> <given-names>L.</given-names></name> <name><surname>Venturini</surname> <given-names>A.</given-names></name> <name><surname>Pylro</surname> <given-names>V.</given-names></name> <name><surname>Tsai</surname> <given-names>S. M.</given-names></name></person-group> (<year>2020</year>). <article-title>Metagenome assembled-genomes reveal similar functional profiles of CPR/Patescibacteria phyla in soils</article-title>. <source>Environ. Microbiol. Rep.</source> <volume>12</volume>, <fpage>651</fpage>&#x2013;<lpage>655</lpage>. doi: <pub-id pub-id-type="doi">10.1111/1758-2229.12880</pub-id>, PMID: <pub-id pub-id-type="pmid">32815317</pub-id></citation></ref>
<ref id="ref41"><citation citation-type="other"><person-group person-group-type="author"><name><surname>Lenth</surname> <given-names>R.</given-names></name> <name><surname>Singmann</surname> <given-names>H.</given-names></name> <name><surname>Love</surname> <given-names>J.</given-names></name> <name><surname>Buerkner</surname> <given-names>P.</given-names></name> <name><surname>Herve</surname> <given-names>M.</given-names></name></person-group> (<year>2018</year>). <article-title>Emmeans: estimated marginal means, aka least-squares means. R Package Version 1.7.2.</article-title></citation></ref>
<ref id="ref42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Letunic</surname> <given-names>I.</given-names></name> <name><surname>Bork</surname> <given-names>P.</given-names></name></person-group> (<year>2019</year>). <article-title>Interactive tree of life (iTOL) v4: recent updates and new developments</article-title>. <source>Nucleic Acids Res.</source> <volume>47</volume>, <fpage>W256</fpage>&#x2013;<lpage>W259</lpage>. doi: <pub-id pub-id-type="doi">10.1093/nar/gkz239</pub-id>, PMID: <pub-id pub-id-type="pmid">30931475</pub-id></citation></ref>
<ref id="ref43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Levy-Booth</surname> <given-names>D. J.</given-names></name> <name><surname>Hashimi</surname> <given-names>A.</given-names></name> <name><surname>Roccor</surname> <given-names>R.</given-names></name> <name><surname>Liu</surname> <given-names>L. Y.</given-names></name> <name><surname>Renneckar</surname> <given-names>S.</given-names></name> <name><surname>Eltis</surname> <given-names>L. D.</given-names></name> <etal/></person-group>. (<year>2021</year>). <article-title>Genomics and metatranscriptomics of biogeochemical cycling and degradation of lignin-derived aromatic compounds in thermal swamp sediment</article-title>. <source>ISME J.</source> <volume>15</volume>, <fpage>879</fpage>&#x2013;<lpage>893</lpage>. doi: <pub-id pub-id-type="doi">10.1038/s41396-020-00820-x</pub-id>, PMID: <pub-id pub-id-type="pmid">33139871</pub-id></citation></ref>
<ref id="ref44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Durbin</surname> <given-names>R.</given-names></name></person-group> (<year>2009</year>). <article-title>Fast and accurate short read alignment with burrows-wheeler transform</article-title>. <source>Bioinformatics</source> <volume>25</volume>, <fpage>1754</fpage>&#x2013;<lpage>1760</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btp324</pub-id>, PMID: <pub-id pub-id-type="pmid">19451168</pub-id></citation></ref>
<ref id="ref45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Handsaker</surname> <given-names>B.</given-names></name> <name><surname>Wysoker</surname> <given-names>A.</given-names></name> <name><surname>Fennell</surname> <given-names>T.</given-names></name> <name><surname>Ruan</surname> <given-names>J.</given-names></name> <name><surname>Homer</surname> <given-names>N.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>The sequence alignment/map format and SAMtools</article-title>. <source>Bioinformatics</source> <volume>25</volume>, <fpage>2078</fpage>&#x2013;<lpage>2079</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btp352</pub-id>, PMID: <pub-id pub-id-type="pmid">19505943</pub-id></citation></ref>
<ref id="ref46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>D.</given-names></name> <name><surname>Liu</surname> <given-names>C. M.</given-names></name> <name><surname>Luo</surname> <given-names>R.</given-names></name> <name><surname>Sadakane</surname> <given-names>K.</given-names></name> <name><surname>Lam</surname> <given-names>T. W.</given-names></name></person-group> (<year>2015</year>). <article-title>MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph</article-title>. <source>Bioinformatics</source> <volume>31</volume>, <fpage>1674</fpage>&#x2013;<lpage>1676</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btv033</pub-id>, PMID: <pub-id pub-id-type="pmid">25609793</pub-id></citation></ref>
<ref id="ref47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lombard</surname> <given-names>V.</given-names></name> <name><surname>Golaconda Ramulu</surname> <given-names>H.</given-names></name> <name><surname>Drula</surname> <given-names>E.</given-names></name> <name><surname>Coutinho</surname> <given-names>P. M.</given-names></name> <name><surname>Henrissat</surname> <given-names>B.</given-names></name></person-group> (<year>2014</year>). <article-title>The carbohydrate-active enzymes database (CAZy) in 2013</article-title>. <source>Nucleic Acids Res.</source> <volume>42</volume>, <fpage>D490</fpage>&#x2013;<lpage>D495</lpage>. doi: <pub-id pub-id-type="doi">10.1093/nar/gkt1178</pub-id>, PMID: <pub-id pub-id-type="pmid">24270786</pub-id></citation></ref>
<ref id="ref48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Luef</surname> <given-names>B.</given-names></name> <name><surname>Frischkorn</surname> <given-names>K. R.</given-names></name> <name><surname>Wrighton</surname> <given-names>K. C.</given-names></name> <name><surname>Holman</surname> <given-names>H. Y. N.</given-names></name> <name><surname>Birarda</surname> <given-names>G.</given-names></name> <name><surname>Thomas</surname> <given-names>B. C.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Diverse uncultivated ultra-small bacterial cells in groundwater</article-title>. <source>Nat. Commun.</source> <volume>6</volume>:<fpage>6372</fpage>. doi: <pub-id pub-id-type="doi">10.1038/ncomms7372</pub-id>, PMID: <pub-id pub-id-type="pmid">25721682</pub-id></citation></ref>
<ref id="ref49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Magoc</surname> <given-names>T.</given-names></name> <name><surname>Salzberg</surname> <given-names>S. L.</given-names></name></person-group> (<year>2011</year>). <article-title>FLASH: fast length adjustment of short reads to improve genome assemblies</article-title>. <source>Bioinformatics</source> <volume>27</volume>, <fpage>2957</fpage>&#x2013;<lpage>2963</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btr507</pub-id>, PMID: <pub-id pub-id-type="pmid">21903629</pub-id></citation></ref>
<ref id="ref50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McCutcheon</surname> <given-names>J. P.</given-names></name> <name><surname>Moran</surname> <given-names>N. A.</given-names></name></person-group> (<year>2011</year>). <article-title>Extreme genome reduction in symbiotic bacteria</article-title>. <source>Nat. Rev. Microbiol.</source> <volume>10</volume>, <fpage>13</fpage>&#x2013;<lpage>26</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nrmicro2670</pub-id>, PMID: <pub-id pub-id-type="pmid">22064560</pub-id></citation></ref>
<ref id="ref51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McMurdie</surname> <given-names>P. J.</given-names></name> <name><surname>Holmes</surname> <given-names>S.</given-names></name></person-group> (<year>2013</year>). <article-title>Phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data</article-title>. <source>PLoS One</source> <volume>8</volume>:<fpage>e61217</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.pone.0061217</pub-id>, PMID: <pub-id pub-id-type="pmid">23630581</pub-id></citation></ref>
<ref id="ref52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mitchell</surname> <given-names>C. C.</given-names></name> <name><surname>Delaney</surname> <given-names>D.</given-names></name> <name><surname>Balkcom</surname> <given-names>K. S.</given-names></name></person-group> (<year>2005</year>). <article-title>Cullars rotation: the South&#x2019;s oldest continuous soil fertility experiment</article-title>. <source>Better Crops</source> <volume>89</volume>, <fpage>5</fpage>&#x2013;<lpage>9</lpage>.</citation></ref>
<ref id="ref53"><citation citation-type="other"><person-group person-group-type="author"><name><surname>Mitchell</surname> <given-names>C.C.</given-names></name> <name><surname>Delaney</surname> <given-names>D.P.</given-names></name> <name><surname>Balkcom</surname> <given-names>K.S.</given-names></name></person-group> <source>Centennial of Alabama's Cullars rotation, the South's oldest, continuous soil fertility experiment</source>. (<year>2011</year>). Alabama&#x2019;s Agricultural Experiment Station (Auburn University).</citation></ref>
<ref id="ref54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nakai</surname> <given-names>R.</given-names></name></person-group> (<year>2020</year>). <article-title>Size matters: ultra-small and filterable microorganisms in the environment</article-title>. <source>Microbes Environ.</source> <volume>35</volume>:ME20025. doi: <pub-id pub-id-type="doi">10.1264/jsme2.ME20025</pub-id>, PMID: <pub-id pub-id-type="pmid">32493880</pub-id></citation></ref>
<ref id="ref55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nicolas</surname> <given-names>A. M.</given-names></name> <name><surname>Jaffe</surname> <given-names>A. L.</given-names></name> <name><surname>Nuccio</surname> <given-names>E. E.</given-names></name> <name><surname>Taga</surname> <given-names>M. E.</given-names></name> <name><surname>Firestone</surname> <given-names>M. K.</given-names></name> <name><surname>Banfield</surname> <given-names>J. F.</given-names></name> <etal/></person-group>. (<year>2021</year>). <article-title>Soil candidate phyla radiation bacteria encode components of aerobic metabolism and co-occur with Nanoarchaea in the rare biosphere of rhizosphere grassland communities</article-title>. <source>mSystems</source> <volume>6</volume>:<fpage>e0120520</fpage>. doi: <pub-id pub-id-type="doi">10.1128/mSystems.01205-20</pub-id>, PMID: <pub-id pub-id-type="pmid">34402646</pub-id></citation></ref>
<ref id="ref56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Oksanen</surname> <given-names>J.</given-names></name> <name><surname>Blanchet</surname> <given-names>F. G.</given-names></name> <name><surname>Friendly</surname> <given-names>M.</given-names></name> <name><surname>Kindt</surname> <given-names>R.</given-names></name> <name><surname>Legendre</surname> <given-names>P.</given-names></name> <name><surname>McGlinn</surname> <given-names>D.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Package &#x2018;vegan&#x2019;</article-title>. <source>Community ecology package, version</source> <volume>2</volume>, <fpage>1</fpage>&#x2013;<lpage>295</lpage>.</citation></ref>
<ref id="ref57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Olm</surname> <given-names>M. R.</given-names></name> <name><surname>Brown</surname> <given-names>C. T.</given-names></name> <name><surname>Brooks</surname> <given-names>B.</given-names></name> <name><surname>Banfield</surname> <given-names>J. F.</given-names></name></person-group> (<year>2017</year>). <article-title>dRep: a tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication</article-title>. <source>ISME J.</source> <volume>11</volume>, <fpage>2864</fpage>&#x2013;<lpage>2868</lpage>. doi: <pub-id pub-id-type="doi">10.1038/ismej.2017.126</pub-id>, PMID: <pub-id pub-id-type="pmid">28742071</pub-id></citation></ref>
<ref id="ref58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parks</surname> <given-names>D. H.</given-names></name> <name><surname>Chuvochina</surname> <given-names>M.</given-names></name> <name><surname>Chaumeil</surname> <given-names>P. A.</given-names></name> <name><surname>Rinke</surname> <given-names>C.</given-names></name> <name><surname>Mussig</surname> <given-names>A. J.</given-names></name> <name><surname>Hugenholtz</surname> <given-names>P.</given-names></name></person-group> (<year>2020</year>). <article-title>A complete domain-to-species taxonomy for bacteria and archaea</article-title>. <source>Nat. Biotechnol.</source> <volume>38</volume>, <fpage>1079</fpage>&#x2013;<lpage>1086</lpage>. doi: <pub-id pub-id-type="doi">10.1038/s41587-020-0501-8</pub-id>, PMID: <pub-id pub-id-type="pmid">32341564</pub-id></citation></ref>
<ref id="ref59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parks</surname> <given-names>D. H.</given-names></name> <name><surname>Chuvochina</surname> <given-names>M.</given-names></name> <name><surname>Rinke</surname> <given-names>C.</given-names></name> <name><surname>Mussig</surname> <given-names>A. J.</given-names></name> <name><surname>Chaumeil</surname> <given-names>P. A.</given-names></name> <name><surname>Hugenholtz</surname> <given-names>P.</given-names></name></person-group> (<year>2022</year>). <article-title>GTDB: an ongoing census of bacterial and archaeal diversity through a phylogenetically consistent, rank normalized and complete genome-based taxonomy</article-title>. <source>Nucleic Acids Res.</source> <volume>50</volume>, <fpage>D785</fpage>&#x2013;<lpage>D794</lpage>. doi: <pub-id pub-id-type="doi">10.1093/nar/gkab776</pub-id>, PMID: <pub-id pub-id-type="pmid">34520557</pub-id></citation></ref>
<ref id="ref60"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parks</surname> <given-names>D. H.</given-names></name> <name><surname>Chuvochina</surname> <given-names>M.</given-names></name> <name><surname>Waite</surname> <given-names>D. W.</given-names></name> <name><surname>Rinke</surname> <given-names>C.</given-names></name> <name><surname>Skarshewski</surname> <given-names>A.</given-names></name> <name><surname>Chaumeil</surname> <given-names>P. A.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life</article-title>. <source>Nat. Biotechnol.</source> <volume>36</volume>, <fpage>996</fpage>&#x2013;<lpage>1004</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nbt.4229</pub-id>, PMID: <pub-id pub-id-type="pmid">30148503</pub-id></citation></ref>
<ref id="ref61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parks</surname> <given-names>D. H.</given-names></name> <name><surname>Imelfort</surname> <given-names>M.</given-names></name> <name><surname>Skennerton</surname> <given-names>C. T.</given-names></name> <name><surname>Hugenholtz</surname> <given-names>P.</given-names></name> <name><surname>Tyson</surname> <given-names>G. W.</given-names></name></person-group> (<year>2015</year>). <article-title>CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes</article-title>. <source>Genome Res.</source> <volume>25</volume>, <fpage>1043</fpage>&#x2013;<lpage>1055</lpage>. doi: <pub-id pub-id-type="doi">10.1101/gr.186072.114</pub-id>, PMID: <pub-id pub-id-type="pmid">25977477</pub-id></citation></ref>
<ref id="ref62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parks</surname> <given-names>D. H.</given-names></name> <name><surname>Rinke</surname> <given-names>C.</given-names></name> <name><surname>Chuvochina</surname> <given-names>M.</given-names></name> <name><surname>Chaumeil</surname> <given-names>P. A.</given-names></name> <name><surname>Woodcroft</surname> <given-names>B. J.</given-names></name> <name><surname>Evans</surname> <given-names>P. N.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life</article-title>. <source>Nat. Microbiol.</source> <volume>2</volume>, <fpage>1533</fpage>&#x2013;<lpage>1542</lpage>. doi: <pub-id pub-id-type="doi">10.1038/s41564-017-0012-7</pub-id>, PMID: <pub-id pub-id-type="pmid">28894102</pub-id></citation></ref>
<ref id="ref63"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Paulson</surname> <given-names>J. N.</given-names></name> <name><surname>Stine</surname> <given-names>O. C.</given-names></name> <name><surname>Bravo</surname> <given-names>H. C.</given-names></name> <name><surname>Pop</surname> <given-names>M.</given-names></name></person-group> (<year>2013</year>). <article-title>Differential abundance analysis for microbial marker-gene surveys</article-title>. <source>Nat. Methods</source> <volume>10</volume>, <fpage>1200</fpage>&#x2013;<lpage>1202</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nmeth.2658</pub-id>, PMID: <pub-id pub-id-type="pmid">24076764</pub-id></citation></ref>
<ref id="ref64"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Portillo</surname> <given-names>M. C.</given-names></name> <name><surname>Leff</surname> <given-names>J. W.</given-names></name> <name><surname>Lauber</surname> <given-names>C. L.</given-names></name> <name><surname>Fierer</surname> <given-names>N.</given-names></name></person-group> (<year>2013</year>). <article-title>Cell size distributions of soil bacterial and archaeal taxa</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>79</volume>, <fpage>7610</fpage>&#x2013;<lpage>7617</lpage>. doi: <pub-id pub-id-type="doi">10.1128/AEM.02710-13</pub-id>, PMID: <pub-id pub-id-type="pmid">24077710</pub-id></citation></ref>
<ref id="ref65"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pruesse</surname> <given-names>E.</given-names></name> <name><surname>Peplies</surname> <given-names>J.</given-names></name> <name><surname>Glockner</surname> <given-names>F. O.</given-names></name></person-group> (<year>2012</year>). <article-title>SINA: accurate high-throughput multiple sequence alignment of ribosomal RNA genes</article-title>. <source>Bioinformatics</source> <volume>28</volume>, <fpage>1823</fpage>&#x2013;<lpage>1829</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/bts252</pub-id>, PMID: <pub-id pub-id-type="pmid">22556368</pub-id></citation></ref>
<ref id="ref66"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Quast</surname> <given-names>C.</given-names></name> <name><surname>Pruesse</surname> <given-names>E.</given-names></name> <name><surname>Yilmaz</surname> <given-names>P.</given-names></name> <name><surname>Gerken</surname> <given-names>J.</given-names></name> <name><surname>Schweer</surname> <given-names>T.</given-names></name> <name><surname>Yarza</surname> <given-names>P.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>The SILVA ribosomal RNA gene database project: improved data processing and web-based tools</article-title>. <source>Nucleic Acids Res.</source> <volume>41</volume>, <fpage>D590</fpage>&#x2013;<lpage>D596</lpage>. doi: <pub-id pub-id-type="doi">10.1093/nar/gks1219</pub-id>, PMID: <pub-id pub-id-type="pmid">23193283</pub-id></citation></ref>
<ref id="ref67"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rinke</surname> <given-names>C.</given-names></name> <name><surname>Schwientek</surname> <given-names>P.</given-names></name> <name><surname>Sczyrba</surname> <given-names>A.</given-names></name> <name><surname>Ivanova</surname> <given-names>N. N.</given-names></name> <name><surname>Anderson</surname> <given-names>I. J.</given-names></name> <name><surname>Cheng</surname> <given-names>J. F.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Insights into the phylogeny and coding potential of microbial dark matter</article-title>. <source>Nature</source> <volume>499</volume>, <fpage>431</fpage>&#x2013;<lpage>437</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nature12352</pub-id>, PMID: <pub-id pub-id-type="pmid">23851394</pub-id></citation></ref>
<ref id="ref68"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sanchez</surname> <given-names>R.</given-names></name> <name><surname>Serra</surname> <given-names>F.</given-names></name> <name><surname>Tarraga</surname> <given-names>J.</given-names></name> <name><surname>Medina</surname> <given-names>I.</given-names></name> <name><surname>Carbonell</surname> <given-names>J.</given-names></name> <name><surname>Pulido</surname> <given-names>L.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Phylemon 2.0: a suite of web-tools for molecular evolution, phylogenetics, phylogenomics and hypotheses testing</article-title>. <source>Nucleic Acids Res.</source> <volume>39</volume>, <fpage>W470</fpage>&#x2013;<lpage>W474</lpage>. doi: <pub-id pub-id-type="doi">10.1093/nar/gkr408</pub-id>, PMID: <pub-id pub-id-type="pmid">21646336</pub-id></citation></ref>
<ref id="ref69"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Santana-Pereira</surname> <given-names>A. L. R.</given-names></name> <name><surname>Sandoval-Powers</surname> <given-names>M.</given-names></name> <name><surname>Monsma</surname> <given-names>S.</given-names></name> <name><surname>Zhou</surname> <given-names>J.</given-names></name> <name><surname>Santos</surname> <given-names>S. R.</given-names></name> <name><surname>Mead</surname> <given-names>D. A.</given-names></name> <etal/></person-group>. (<year>2020</year>). <article-title>Discovery of novel biosynthetic gene cluster diversity from a soil metagenomic library</article-title>. <source>Front. Microbiol.</source> <volume>11</volume>:<fpage>585398</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fmicb.2020.585398</pub-id>, PMID: <pub-id pub-id-type="pmid">33365020</pub-id></citation></ref>
<ref id="ref70"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schloss</surname> <given-names>P. D.</given-names></name> <name><surname>Westcott</surname> <given-names>S. L.</given-names></name> <name><surname>Ryabin</surname> <given-names>T.</given-names></name> <name><surname>Hall</surname> <given-names>J. R.</given-names></name> <name><surname>Hartmann</surname> <given-names>M.</given-names></name> <name><surname>Hollister</surname> <given-names>E. B.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>75</volume>, <fpage>7537</fpage>&#x2013;<lpage>7541</lpage>. doi: <pub-id pub-id-type="doi">10.1128/AEM.01541-09</pub-id>, PMID: <pub-id pub-id-type="pmid">19801464</pub-id></citation></ref>
<ref id="ref71"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Segata</surname> <given-names>N.</given-names></name> <name><surname>Izard</surname> <given-names>J.</given-names></name> <name><surname>Waldron</surname> <given-names>L.</given-names></name> <name><surname>Gevers</surname> <given-names>D.</given-names></name> <name><surname>Miropolsky</surname> <given-names>L.</given-names></name> <name><surname>Garrett</surname> <given-names>W. S.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Metagenomic biomarker discovery and explanation</article-title>. <source>Genome Biol.</source> <volume>12</volume>:<fpage>R60</fpage>. doi: <pub-id pub-id-type="doi">10.1186/gb-2011-12-6-r60</pub-id>, PMID: <pub-id pub-id-type="pmid">21702898</pub-id></citation></ref>
<ref id="ref72"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shaffer</surname> <given-names>M.</given-names></name> <name><surname>Borton</surname> <given-names>M. A.</given-names></name> <name><surname>McGivern</surname> <given-names>B. B.</given-names></name> <name><surname>Zayed</surname> <given-names>A. A.</given-names></name> <name><surname>la Rosa</surname> <given-names>S. L.</given-names></name> <name><surname>Solden</surname> <given-names>L. M.</given-names></name> <etal/></person-group>. (<year>2020</year>). <article-title>DRAM for distilling microbial metabolism to automate the curation of microbiome function</article-title>. <source>Nucleic Acids Res.</source> <volume>48</volume>, <fpage>8883</fpage>&#x2013;<lpage>8900</lpage>. doi: <pub-id pub-id-type="doi">10.1093/nar/gkaa621</pub-id>, PMID: <pub-id pub-id-type="pmid">32766782</pub-id></citation></ref>
<ref id="ref73"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sharon</surname> <given-names>I.</given-names></name> <name><surname>Banfield</surname> <given-names>J. F.</given-names></name></person-group> (<year>2013</year>). <article-title>Microbiology. Genomes from metagenomics</article-title>. <source>Science</source> <volume>342</volume>, <fpage>1057</fpage>&#x2013;<lpage>1058</lpage>. doi: <pub-id pub-id-type="doi">10.1126/science.1247023</pub-id>, PMID: <pub-id pub-id-type="pmid">24288324</pub-id></citation></ref>
<ref id="ref74"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sharrar</surname> <given-names>A. M.</given-names></name> <name><surname>Crits-Christoph</surname> <given-names>A.</given-names></name> <name><surname>M&#x00E9;heust</surname> <given-names>R.</given-names></name> <name><surname>Diamond</surname> <given-names>S.</given-names></name> <name><surname>Starr</surname> <given-names>E. P.</given-names></name> <name><surname>Banfield</surname> <given-names>J. F.</given-names></name></person-group> (<year>2020</year>). <article-title>Bacterial secondary metabolite biosynthetic potential in soil varies with phylum, depth, and vegetation type</article-title>. <source>MBio</source> <volume>11</volume>:e00416-20. doi: <pub-id pub-id-type="doi">10.1128/mBio.00416-20</pub-id>, PMID: <pub-id pub-id-type="pmid">32546614</pub-id></citation></ref>
<ref id="ref75"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sieber</surname> <given-names>C. M. K.</given-names></name> <name><surname>Probst</surname> <given-names>A. J.</given-names></name> <name><surname>Sharrar</surname> <given-names>A.</given-names></name> <name><surname>Thomas</surname> <given-names>B. C.</given-names></name> <name><surname>Hess</surname> <given-names>M.</given-names></name> <name><surname>Tringe</surname> <given-names>S. G.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Recovery of genomes from metagenomes via a dereplication, aggregation and scoring strategy</article-title>. <source>Nat. Microbiol.</source> <volume>3</volume>, <fpage>836</fpage>&#x2013;<lpage>843</lpage>. doi: <pub-id pub-id-type="doi">10.1038/s41564-018-0171-1</pub-id>, PMID: <pub-id pub-id-type="pmid">29807988</pub-id></citation></ref>
<ref id="ref76"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Smoot</surname> <given-names>M. E.</given-names></name> <name><surname>Ono</surname> <given-names>K.</given-names></name> <name><surname>Ruscheinski</surname> <given-names>J.</given-names></name> <name><surname>Wang</surname> <given-names>P. L.</given-names></name> <name><surname>Ideker</surname> <given-names>T.</given-names></name></person-group> (<year>2011</year>). <article-title>Cytoscape 2.8: new features for data integration and network visualization</article-title>. <source>Bioinformatics</source> <volume>27</volume>, <fpage>431</fpage>&#x2013;<lpage>432</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btq675</pub-id>, PMID: <pub-id pub-id-type="pmid">21149340</pub-id></citation></ref>
<ref id="ref77"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stamatakis</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies</article-title>. <source>Bioinformatics</source> <volume>30</volume>, <fpage>1312</fpage>&#x2013;<lpage>1313</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btu033</pub-id>, PMID: <pub-id pub-id-type="pmid">24451623</pub-id></citation></ref>
<ref id="ref78"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Starr</surname> <given-names>E. P.</given-names></name> <name><surname>Shi</surname> <given-names>S.</given-names></name> <name><surname>Blazewicz</surname> <given-names>S. J.</given-names></name> <name><surname>Probst</surname> <given-names>A. J.</given-names></name> <name><surname>Herman</surname> <given-names>D. J.</given-names></name> <name><surname>Firestone</surname> <given-names>M. K.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Stable isotope informed genome-resolved metagenomics reveals that Saccharibacteria utilize microbially-processed plant-derived carbon</article-title>. <source>Microbiome</source> <volume>6</volume>:<fpage>122</fpage>. doi: <pub-id pub-id-type="doi">10.1186/s40168-018-0499-z</pub-id>, PMID: <pub-id pub-id-type="pmid">29970182</pub-id></citation></ref>
<ref id="ref79"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Taubert</surname> <given-names>M.</given-names></name> <name><surname>Stahly</surname> <given-names>J.</given-names></name> <name><surname>Kolb</surname> <given-names>S.</given-names></name> <name><surname>Kusel</surname> <given-names>K.</given-names></name></person-group> (<year>2019</year>). <article-title>Divergent microbial communities in groundwater and overlying soils exhibit functional redundancy for plant-polysaccharide degradation</article-title>. <source>PLoS One</source> <volume>14</volume>:<fpage>e0212937</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.pone.0212937</pub-id>, PMID: <pub-id pub-id-type="pmid">30865693</pub-id></citation></ref>
<ref id="ref80"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Torsvik</surname> <given-names>V.</given-names></name> <name><surname>Ovreas</surname> <given-names>L.</given-names></name></person-group> (<year>2002</year>). <article-title>Microbial diversity and function in soil: from genes to ecosystems</article-title>. <source>Curr. Opin. Microbiol.</source> <volume>5</volume>, <fpage>240</fpage>&#x2013;<lpage>245</lpage>. doi: <pub-id pub-id-type="doi">10.1016/S1369-5274(02)00324-7</pub-id>, PMID: <pub-id pub-id-type="pmid">12057676</pub-id></citation></ref>
<ref id="ref81"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wagner</surname> <given-names>D.</given-names></name> <name><surname>Kobabe</surname> <given-names>S.</given-names></name> <name><surname>Liebner</surname> <given-names>S.</given-names></name></person-group> (<year>2009</year>). <article-title>Bacterial community structure and carbon turnover in permafrost-affected soils of the Lena Delta, northeastern Siberia</article-title>. <source>Can. J. Microbiol.</source> <volume>55</volume>, <fpage>73</fpage>&#x2013;<lpage>83</lpage>. doi: <pub-id pub-id-type="doi">10.1139/W08-121</pub-id>, PMID: <pub-id pub-id-type="pmid">19190703</pub-id></citation></ref>
<ref id="ref82"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Woodcroft</surname> <given-names>B. J.</given-names></name> <name><surname>Singleton</surname> <given-names>C. M.</given-names></name> <name><surname>Boyd</surname> <given-names>J. A.</given-names></name> <name><surname>Evans</surname> <given-names>P. N.</given-names></name> <name><surname>Emerson</surname> <given-names>J. B.</given-names></name> <name><surname>Zayed</surname> <given-names>A. A. F.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Genome-centric view of carbon processing in thawing permafrost</article-title>. <source>Nature</source> <volume>560</volume>, <fpage>49</fpage>&#x2013;<lpage>54</lpage>. doi: <pub-id pub-id-type="doi">10.1038/s41586-018-0338-1</pub-id>, PMID: <pub-id pub-id-type="pmid">30013118</pub-id></citation></ref>
<ref id="ref83"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wrighton</surname> <given-names>K. C.</given-names></name> <name><surname>Castelle</surname> <given-names>C. J.</given-names></name> <name><surname>Varaljay</surname> <given-names>V. A.</given-names></name> <name><surname>Satagopan</surname> <given-names>S.</given-names></name> <name><surname>Brown</surname> <given-names>C. T.</given-names></name> <name><surname>Wilkins</surname> <given-names>M. J.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>RubisCO of a nucleoside pathway known from archaea is found in diverse uncultivated phyla in bacteria</article-title>. <source>ISME J.</source> <volume>10</volume>, <fpage>2702</fpage>&#x2013;<lpage>2714</lpage>. doi: <pub-id pub-id-type="doi">10.1038/ismej.2016.53</pub-id>, PMID: <pub-id pub-id-type="pmid">27137126</pub-id></citation></ref>
<ref id="ref84"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wrighton</surname> <given-names>K. C.</given-names></name> <name><surname>Castelle</surname> <given-names>C. J.</given-names></name> <name><surname>Wilkins</surname> <given-names>M. J.</given-names></name> <name><surname>Hug</surname> <given-names>L. A.</given-names></name> <name><surname>Sharon</surname> <given-names>I.</given-names></name> <name><surname>Thomas</surname> <given-names>B. C.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Metabolic interdependencies between phylogenetically novel fermenters and respiratory organisms in an unconfined aquifer</article-title>. <source>ISME J.</source> <volume>8</volume>, <fpage>1452</fpage>&#x2013;<lpage>1463</lpage>. doi: <pub-id pub-id-type="doi">10.1038/ismej.2013.249</pub-id>, PMID: <pub-id pub-id-type="pmid">24621521</pub-id></citation></ref>
<ref id="ref85"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wrighton</surname> <given-names>K. C.</given-names></name> <name><surname>Thomas</surname> <given-names>B. C.</given-names></name> <name><surname>Sharon</surname> <given-names>I.</given-names></name> <name><surname>Miller</surname> <given-names>C. S.</given-names></name> <name><surname>Castelle</surname> <given-names>C. J.</given-names></name> <name><surname>VerBerkmoes</surname> <given-names>N. C.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Fermentation, hydrogen, and sulfur metabolism in multiple uncultivated bacterial phyla</article-title>. <source>Science</source> <volume>337</volume>, <fpage>1661</fpage>&#x2013;<lpage>1665</lpage>. doi: <pub-id pub-id-type="doi">10.1126/science.1224041</pub-id>, PMID: <pub-id pub-id-type="pmid">23019650</pub-id></citation></ref>
<ref id="ref86"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>Y. W.</given-names></name> <name><surname>Simmons</surname> <given-names>B. A.</given-names></name> <name><surname>Singer</surname> <given-names>S. W.</given-names></name></person-group> (<year>2016</year>). <article-title>MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets</article-title>. <source>Bioinformatics</source> <volume>32</volume>, <fpage>605</fpage>&#x2013;<lpage>607</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btv638</pub-id>, PMID: <pub-id pub-id-type="pmid">26515820</pub-id></citation></ref>
<ref id="ref87"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yeates</surname> <given-names>C.</given-names></name> <name><surname>Holmes</surname> <given-names>A. J.</given-names></name> <name><surname>Gillings</surname> <given-names>M. R.</given-names></name></person-group> (<year>2000</year>). <article-title>Novel forms of ring-hydroxylating dioxygenases are widespread in pristine and contaminated soils</article-title>. <source>Environ. Microbiol.</source> <volume>2</volume>, <fpage>644</fpage>&#x2013;<lpage>653</lpage>. doi: <pub-id pub-id-type="doi">10.1046/j.1462-2920.2000.00147.x</pub-id>, PMID: <pub-id pub-id-type="pmid">11214797</pub-id></citation></ref>
<ref id="ref88"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yuan</surname> <given-names>Q.</given-names></name> <name><surname>Xueyu</surname> <given-names>P.</given-names></name> <name><surname>Wei</surname> <given-names>J.</given-names></name> <name><surname>Lianqing</surname> <given-names>C.</given-names></name> <name><surname>Zhilin</surname> <given-names>Y.</given-names></name></person-group> (<year>2018</year>). <article-title>Comparison of four extraction methods of soil microbiome in poplar plantation</article-title>. <source>Scientia Silvae Sinicae</source> <volume>54</volume>, <fpage>169</fpage>&#x2013;<lpage>176</lpage>.</citation></ref>
<ref id="ref89"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhao</surname> <given-names>C.</given-names></name> <name><surname>Fu</surname> <given-names>S.</given-names></name> <name><surname>Mathew</surname> <given-names>R. P.</given-names></name> <name><surname>Lawrence</surname> <given-names>K. S.</given-names></name> <name><surname>Feng</surname> <given-names>Y.</given-names></name></person-group> (<year>2015</year>). <article-title>Soil microbial community structure and activity in a 100-year-old fertilization and crop rotation experiment</article-title>. <source>J. Plant Ecol.</source> <volume>8</volume>, <fpage>rtv007</fpage>&#x2013;<lpage>rtv632</lpage>. doi: <pub-id pub-id-type="doi">10.1093/jpe/rtv007</pub-id></citation></ref></ref-list>
<fn-group>
<fn id="fn0001"><p><sup>1</sup><ext-link xlink:href="https://www.geneious.com" ext-link-type="uri">https://www.geneious.com</ext-link></p></fn>
</fn-group>
</back>
</article>