<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="review-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Nutr.</journal-id>
<journal-title>Frontiers in Nutrition</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Nutr.</abbrev-journal-title>
<issn pub-type="epub">2296-861X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fnut.2016.00026</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Nutrition</subject>
<subj-group>
<subject>Review</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Considerations For Optimizing Microbiome Analysis Using a Marker Gene</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>de la Cuesta-Zuluaga</surname> <given-names>Jacobo</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://frontiersin.org/people/u/348324"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Escobar</surname> <given-names>Juan S.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="cor1">&#x0002A;</xref>
<uri xlink:href="http://frontiersin.org/people/u/358808"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Vidarium &#x02013; Nutrition, Health and Wellness Research Center, Grupo Empresarial Nutresa</institution>, <addr-line>Medell&#x000ED;n</addr-line>, <country>Colombia</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Fernando Lizcano, Universidad de La Sabana, Colombia</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: David Lin, Cornell University, USA; Sharon Ross, National Cancer Institute, USA</p></fn>
<corresp content-type="corresp" id="cor1">&#x0002A;Correspondence: Juan S. Escobar, <email>jsescobar&#x00040;serviciosnutresa.com</email></corresp>
<fn fn-type="other" id="fn001"><p>Specialty section: This article was submitted to Nutrigenomics, a section of the journal Frontiers in Nutrition</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>08</day>
<month>08</month>
<year>2016</year>
</pub-date>
<pub-date pub-type="collection">
<year>2016</year>
</pub-date>
<volume>3</volume>
<elocation-id>26</elocation-id>
<history>
<date date-type="received">
<day>25</day>
<month>05</month>
<year>2016</year>
</date>
<date date-type="accepted">
<day>26</day>
<month>07</month>
<year>2016</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2016 de la Cuesta-Zuluaga and Escobar.</copyright-statement>
<copyright-year>2016</copyright-year>
<copyright-holder>de la Cuesta-Zuluaga and Escobar</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Next-generation sequencing technologies have found a widespread use in the study of host&#x02013;microbe interactions due to the increase in their throughput and their ever-decreasing costs. The analysis of human-associated microbial communities using a marker gene, particularly the 16S rRNA, has been greatly benefited from these technologies &#x02013; the human gut microbiome research being a remarkable example of such analysis that has greatly expanded our understanding of microbe-mediated human health and disease, metabolism, and food absorption. 16S studies go through a series of <italic>in vitro</italic> and <italic>in silico</italic> steps that can greatly influence their outcomes. However, the lack of a standardized workflow has led to uncertainties regarding the transparency and reproducibility of gut microbiome studies. We, here, discuss the most common challenges in the archetypical 16S rRNA workflow, including the extraction of total DNA, its use as template in PCR with primers that amplify specific hypervariable regions of the gene, amplicon sequencing, the denoising and removal of low-quality reads, the detection and removal of chimeric sequences, the clustering of high-quality sequences into operational taxonomic units, and their taxonomic classification. We recommend the essential technical information that should be conveyed in publications for reproducibility of results and encourage non-experts to include procedures and available tools that mitigate most of the problems encountered in microbiome analysis.</p>
</abstract>
<kwd-group>
<kwd>gut microbiome</kwd>
<kwd>16S rRNA</kwd>
<kwd>next-generation sequencing</kwd>
<kwd>personalized medicine</kwd>
<kwd>personalized nutrition</kwd>
</kwd-group>
<counts>
<fig-count count="1"/>
<table-count count="2"/>
<equation-count count="0"/>
<ref-count count="140"/>
<page-count count="12"/>
<word-count count="10363"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="introduction">
<title>Introduction</title>
<p>The gut microbiome, our &#x0201C;second genome,&#x0201D; is the most intimate connection we have with the environment. During the last decade, the study of the gut microbiome has revolutionized our understanding of human health and disease, metabolism, and food absorption. This research field has gone beyond being a mere object of study and is now recognized as an object of intervention (<xref ref-type="bibr" rid="B1">1</xref>) that may eventually assist in personalized diagnostic assessment, risk stratification, disease prevention, treatment decision-making, and patients&#x02019; follow-up (<xref ref-type="bibr" rid="B2">2</xref>).</p>
<p>The gut microbiome is the target of therapies for gastrointestinal diseases, such as infection by <italic>Clostridium difficile</italic> or inflammatory bowel disease, metabolic conditions, such as obesity and diabetes, and non-gastrointestinal pathologies, like allergy and autism (<xref ref-type="bibr" rid="B3">3</xref>&#x02013;<xref ref-type="bibr" rid="B5">5</xref>). Dietary manipulation through supplementation with pre- and probiotics, and the modulation of the microbial community with antibiotics or fecal matter transplants have been studied (<xref ref-type="bibr" rid="B6">6</xref>, <xref ref-type="bibr" rid="B7">7</xref>) and successfully applied (<xref ref-type="bibr" rid="B8">8</xref>). <italic>In vitro</italic> models that simulate the gastrointestinal tract and that allow the fine tuning of physicochemical conditions have been developed to test the effect of different substances on particular bacterial species or even the whole microbial community (<xref ref-type="bibr" rid="B9">9</xref>, <xref ref-type="bibr" rid="B10">10</xref>).</p>
<p>However, understanding how the gut microbiome contributes to the pathogenesis of complex disorders or to nutrient absorption will critically depend upon the accuracy with which we characterize this microbial community. Next-generation sequencing (NGS) technologies (<xref ref-type="bibr" rid="B11">11</xref>&#x02013;<xref ref-type="bibr" rid="B13">13</xref>) are currently of wide use to this end because of their capacity to measure non-cultivable organisms, relatively low cost, and high throughput. NGS platforms have allowed measuring microbial diversity with an ever-increasing throughput and read length (<xref ref-type="bibr" rid="B14">14</xref>, <xref ref-type="bibr" rid="B15">15</xref>) and at a constantly decreasing cost (<xref ref-type="bibr" rid="B16">16</xref>), which has granted the possibility for a new wave of researchers to get involved in projects of considerable size and complexity, to carry sophisticated quantitative evaluations and to study low-abundance microorganisms. The outstanding increase in the number of publications in recent years (2,319 papers published in 2015; source: Scopus) is a proof of this. It raises, nonetheless, questions about how aware all these researchers are about pitfalls in microbiome analyses.</p>
<p>One of the most used ways to examine the gut microbiome is to use a marker gene or barcode to identify microorganisms and reconstruct their phylogenetic relationships; the 16S rRNA gene is the most used for that purpose, although others have been proposed and used (<xref ref-type="bibr" rid="B17">17</xref>&#x02013;<xref ref-type="bibr" rid="B19">19</xref>). As shown in Figure <xref ref-type="fig" rid="F1">1</xref>, most 16S studies follow a common workflow (<xref ref-type="bibr" rid="B20">20</xref>): total DNA is extracted from a sample (e.g., feces in the case of the gut microbiome) and used as template in PCR with primers that amplify specific regions of the 16S rRNA gene; the PCR products are sequenced using any technology (formerly Sanger but more recently NGS platforms, such as Roche 454, Illumina, Ion Torrent, PacBio) and raw sequences are processed using bioinformatic pipelines that include the denoising and removal of low-quality reads, the detection and removal of chimeric sequences, the clustering of the curated sequences into operational taxonomic units (OTUs), and their taxonomic classification. The output data can then be used to perform ecological and statistical tests (e.g., &#x003B1; and &#x003B2; diversity analyses). A careless execution of any single procedure in the workflow and the cumulative effect of the inherent bias of each step, which can be reduced but not totally eradicated as we shall see, can result in a biased representation of the microbial community under study or erroneous estimations of the changes induced by interventions.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p><bold>Schematic view of the archetypical workflow in 16S rRNA studies, and some of the problems associated with each step</bold>. Dotted lines link the workflow with steps beyond the scope of the review, and dashed lines represent non-standard steps.</p></caption>
<graphic xlink:href="fnut-03-00026-g001.tif"/>
</fig>
<p>The unification of analysis procedures and the implementation of standardized workflows in order to minimize the variation introduced to the results have been recurrent topics on symposia (<xref ref-type="bibr" rid="B21">21</xref>), editorials (<xref ref-type="bibr" rid="B22">22</xref>), and opinion papers (<xref ref-type="bibr" rid="B23">23</xref>, <xref ref-type="bibr" rid="B24">24</xref>). We, here, go over each step in the workflow of an archetypical 16S study, from DNA extraction to the generation and classification of OTUs, briefly explain their principles, draw attention to their potential biases and propose some solutions to (reasonably) mitigate them, including available software tools. In addition, we highlight instances where direct comparisons between studies are discouraged and recommend the essential information that should be included when describing a microbiome study for reproducibility of results.</p>
<p>While some of the issues discussed here have been separately reviewed elsewhere [benefits and problems of barcode sequencing (<xref ref-type="bibr" rid="B36">36</xref>), primer selection (<xref ref-type="bibr" rid="B37">37</xref>), DNA extraction and PCR biases (<xref ref-type="bibr" rid="B38">38</xref>), sequence curation (<xref ref-type="bibr" rid="B39">39</xref>), taxonomic classification (<xref ref-type="bibr" rid="B40">40</xref>)], they have frequently been overlooked in publications of original datasets. We wish to encourage newcomer scientists to implement rigorous analyses so that they get confident results that better represent the microbial communities under scrutiny. Upstream and downstream procedures, namely, experimental design and sample collection, calculation of diversity indices, rarefaction curves, hypothesis testing, and other ecological and statistical analyses are of the uttermost importance; however, they vary between different kinds of studies and are beyond the scope of this paper. They have been reviewed elsewhere (<xref ref-type="bibr" rid="B41">41</xref>&#x02013;<xref ref-type="bibr" rid="B45">45</xref>).</p>
<sec id="S1-1">
<title>DNA Extraction</title>
<p>The first step, once the samples are collected, is the extraction of total DNA, which will then be used as template for PCR amplification of the marker gene. After the DNA is extracted and purified, the workflow for most 16S studies becomes roughly the same. Fecal samples are composed of microorganisms that differ in characteristics, such as size and cell wall composition, and that are present in different proportions. This can make the purification of a DNA sample that accurately represents the original community (i.e., that keeps all species and their abundances at the same relative proportions) a challenge, as different sample handling and DNA extraction protocols can yield samples with different bacterial ratios. It has been shown, for instance, that frozen fecal samples yield a higher amount of DNA from Gram-positive than from Gram-negative bacteria, probably due to the effect that the freeze&#x02013;thaw cycle can have over the Gram-positive cell wall (<xref ref-type="bibr" rid="B46">46</xref>).</p>
<p>Differences in gut microbial community patterns can also arise due to the principles of the genetic material extraction protocols, causing the over or underrepresentation of the same microbial group in DNA extracted from subsamples of the same source (<xref ref-type="bibr" rid="B47">47</xref>). Some DNA extraction kits use bead-containing lysing matrices and vigorous shaking steps that contribute to the disruption of the cell wall, whereas others rely on chemical lysis (<xref ref-type="bibr" rid="B48">48</xref>). Several studies have consistently demonstrated that protocols that involve a bead-beating step yield higher quantities of bacterial DNA, and, most importantly, these samples tend to be a more comprehensive representation of the microbial community, regardless of the source material and analysis method (<xref ref-type="bibr" rid="B49">49</xref>&#x02013;<xref ref-type="bibr" rid="B51">51</xref>). The differences between subsamples extracted with different kits can even be statistically significant, which is why it has been suggested that data from studies using different extraction methods should not be compared (<xref ref-type="bibr" rid="B52">52</xref>). Opportunely, studies are increasingly using similar DNA extraction protocols. For instance, the PowerSoil<sup>&#x000AE;</sup> DNA isolation kit (MoBio) has become popular because it performs well in a wide variety of samples, including human feces. Although using the same extraction protocol does not guarantee accurate representation of the microbial community under study, it allows comparison among studies.</p>
<p>Another issue with DNA extraction is that, due to the non-specificity of marker gene and metagenomic assays, they are highly sensitive to contamination with foreign microbial DNA. The presence of bacterial DNA from sources other than the original sample can alter the outcome of the analysis in a way that it no longer mirrors the original community it is supposed to reflect. Contamination sources may include the PCR reagents (<xref ref-type="bibr" rid="B53">53</xref>, <xref ref-type="bibr" rid="B54">54</xref>), ultra pure water (<xref ref-type="bibr" rid="B55">55</xref>, <xref ref-type="bibr" rid="B56">56</xref>), and, even, the DNA extraction reagents (<xref ref-type="bibr" rid="B57">57</xref>, <xref ref-type="bibr" rid="B58">58</xref>). The genetic material extracted from samples with low biomass is more prone to being drowned by contaminant DNA (<xref ref-type="bibr" rid="B59">59</xref>, <xref ref-type="bibr" rid="B60">60</xref>), and the contamination profile varies between laboratories, extraction kits, and batches from the same kit (<xref ref-type="bibr" rid="B60">60</xref>). Procedures to reduce the effect of contamination include the maximization of starting biomass from which DNA is extracted, the randomization of the order in which samples are to be processed, the collection, processing and sequencing of technical controls of the reagents to be used (storage media, DNA extraction kits, and PCR kits), the recording of the kit lots as additional metadata, and the quantification of negative-control sequences (<xref ref-type="bibr" rid="B60">60</xref>).</p>
<p>Today, there is no standard procedure on how to deal with sequences showing up in technical controls. One suggestion would be to compare the abundance in real samples and controls: if an OTU has similar relative abundance in samples and controls, it is likely a contaminant; otherwise, it probably is not. This approach has the drawback that the threshold in which the abundance of an OTU is considered a contaminant is subjective (<xref ref-type="bibr" rid="B61">61</xref>). Another method involves the removal of OTUs whose abundance is negatively correlated with amplicon concentration, as it is assumed that the signal from contaminant sequences in low biomass samples is less likely to be drowned by the signal of real data (<xref ref-type="bibr" rid="B61">61</xref>). In any case, it is necessary to be aware of taxa that are present in negative controls, taxa statistically associated with a particular batch of reagents, and taxa biologically unexpected in the treated samples.</p>
</sec>
<sec id="S1-2">
<title>Multi-Template PCR</title>
<p>In marker gene studies, total DNA is used as template for the PCR amplification of the barcode region. As in single-template PCR, the efficiency of multi-template PCR is influenced by the GC content of the target region (<xref ref-type="bibr" rid="B62">62</xref>), the DNA concentration (<xref ref-type="bibr" rid="B63">63</xref>), and the thermocycling conditions (<xref ref-type="bibr" rid="B64">64</xref>). However, because of the multiorganismal origin of the gut microbiome, a series of particular difficulties and artifacts, such as primer mismatches, gene copy number variation (CNV), chimeras, heteroduplex, and skewed template-to-product ratios, are encountered and can distort the diversity measures. Primer selection, CNV normalization, and chimeric sequence removal are discussed below; for a detailed discussion of reagents and PCR conditions in multi-template assays, see Ref. (<xref ref-type="bibr" rid="B65">65</xref>).</p>
<sec id="S1-2-1">
<title>16S rRNA Gene Hypervariable Regions</title>
<p>Due to its ubiquity in prokaryotes, low horizontal gene transfer, and ability to differentiate closely related organisms, the 16S rRNA gene has been used for decades in the study of diversity and ecology of microorganisms (<xref ref-type="bibr" rid="B66">66</xref>&#x02013;<xref ref-type="bibr" rid="B68">68</xref>). However, most NGS platforms are not capable of covering the full length of the gene (ca. 1,500&#x02009;bp) (<xref ref-type="bibr" rid="B68">68</xref>). This is why short regions within the gene (e.g., hypervariable V1&#x02013;V9 regions) have been prioritized with the advent of these newer technologies (<xref ref-type="bibr" rid="B69">69</xref>). Hypervariable regions are supposed to act as proxies of the complete gene. Actually, there is correlation between the phylogenies generated using different hypervariable regions or combinations thereof and the phylogenies generated with the whole gene (<xref ref-type="bibr" rid="B69">69</xref>), but the strength of these correlations varies among regions (<xref ref-type="bibr" rid="B70">70</xref>) because their different evolutionary rates limit their capacity to serve as surrogates of full-length sequences (<xref ref-type="bibr" rid="B71">71</xref>, <xref ref-type="bibr" rid="B72">72</xref>). Because of these disparities, the OTU count of different 16S regions can be inconsistent (<xref ref-type="bibr" rid="B70">70</xref>, <xref ref-type="bibr" rid="B73">73</xref>), which, in turn, makes studies using different hypervariable regions incomparable (<xref ref-type="bibr" rid="B71">71</xref>). Currently, there is no consensus of which region best reflects the gut microbial community (<xref ref-type="bibr" rid="B69">69</xref>, <xref ref-type="bibr" rid="B74">74</xref>, <xref ref-type="bibr" rid="B75">75</xref>). While read length increases in newer NGS technologies, one empirical way to overcome comparability between studies would be to sequence the same hypervariable region. This is, indeed, what is seen in many gut microbiome studies today: since the Illumina MiSeq platform gives one of the bests value for money of all NGS, most microbiome researchers are moving to sequence the V4 region since its size (ca. 250&#x02009;bp) fits well the read size of this platform at its current version.</p>
</sec>
<sec id="S1-2-2">
<title>Primer Selection</title>
<p>In order to amplify the selected 16S hypervariable region, a set of broad-range primers (so-called &#x0201C;universal primers&#x0201D;) must be used. These primers are usually designed to hybridize with the conserved regions flanking the sequence of interest. Universal primers work under the assumption that the flanking regions are conserved among a wide range of microbial groups, which allows the correct annealing and amplification of the desired PCR product (<xref ref-type="bibr" rid="B76">76</xref>). The rationale behind this approach is as good as possible but it still has problems, as mutations also occur within the flanking regions. The use of primers with a suboptimal coverage rate can lead to selective amplification of the template DNA, that is, the sub-representation or selection against a given microbial group (<xref ref-type="bibr" rid="B77">77</xref>). Thus, the relative content of sequences may be modified, resulting in a deviation from the true gut-community composition (<xref ref-type="bibr" rid="B77">77</xref>&#x02013;<xref ref-type="bibr" rid="B79">79</xref>).</p>
<p>In short, studies evaluating biases introduced by primer selection have demonstrated that there is no such thing as a truly &#x0201C;universal primer,&#x0201D; since there is no single pair of primers that can be used to amplify all prokaryotic or even bacterial groups. Genome evolution being what it is, the practical way to overcome this limitation and compare results among studies is to use similar pairs of primers and allow for degenerate sites in them. This is the preferred approach in some recent studies that make extensive use of modified 515F (5&#x02032; GTGYCAGCMGCCGCGGTAA 3&#x02032;) and 806R (5&#x02032; GGACTACNVGGGTWTCTAAT 3&#x02032;) primers that amplify the V4 region (<xref ref-type="bibr" rid="B80">80</xref>&#x02013;<xref ref-type="bibr" rid="B82">82</xref>).</p>
</sec>
</sec>
<sec id="S1-3">
<title>Amplicon Sequencing by NGS</title>
<p>Next-generation sequencing technologies refer to various strategies that rely on a combination of template preparation, sequencing and imaging, and genome alignment and assembly methods (<xref ref-type="bibr" rid="B83">83</xref>). The major advance offered by NGS is the ability to produce an enormous volume of data cheaply and fast. The transition from Sanger to NGS has opened new horizons in the gut-microbiome field by making it possible to collect millions of sequences, spanning hundreds of samples (<xref ref-type="bibr" rid="B80">80</xref>). A good example of this is the Human Microbiome Project, which used NGS to characterize the diversity of bacteria, archaea, and viruses that inhabit various areas of the human body in several hundreds healthy individuals (<xref ref-type="bibr" rid="B84">84</xref>). In the last decade, the throughput of NGS technologies has dramatically increased, and the operation cost has reduced, which, in turn, has boosted its use in microbial studies. However, the major drawback of all NGS technologies is that they raise concerns regarding the quality of data.</p>
<p>When sequencing genomes, multiple reads are used to construct a consensus and the error rate, defined as the number of errors per total base call (<xref ref-type="bibr" rid="B25">25</xref>), is, thus, reduced since each nucleotide in the original sequence is called several times by different reads. Such approach cannot be used when sequencing marker gene amplicons, such as the 16S rRNA, because each individual read is considered an identifier of an independent organism (e.g., a bacterium), and it is not possible to assemble the amplicon sequences (<xref ref-type="bibr" rid="B34">34</xref>); hence, the reduction of the error rate by other means becomes imperative.</p>
<p>One strategy to determine how many errors are introduced at each NGS run consists of sequencing a synthetic mixture of genomic DNA (mock community), comprising several known bacterial species, along with the samples. Reads are compared with a reference database of the marker gene, and errors are identified in pairwise alignments of each experimentally generated sequence relative to the closest reference sequence (<xref ref-type="bibr" rid="B25">25</xref>, <xref ref-type="bibr" rid="B32">32</xref>, <xref ref-type="bibr" rid="B34">34</xref>). Sequencing mock communities to assess the error rate of each individual amplicon sequencing run should become a standard step in microbial community analysis (see <uri xlink:href="http://www.hmpdacc.org/HMMC/">http://www.hmpdacc.org/HMMC/</uri>) (<xref ref-type="bibr" rid="B25">25</xref>, <xref ref-type="bibr" rid="B29">29</xref>).</p>
<p>Currently, Roche 454 GS-FLX, Illumina MiSeq, Ion Torrent PGM, and PacBio SMRT are the most used platforms for the study of the gut microbiome (<xref ref-type="bibr" rid="B35">35</xref>, <xref ref-type="bibr" rid="B85">85</xref>&#x02013;<xref ref-type="bibr" rid="B87">87</xref>). However, each technology performs differently in the trade-off between read length, sequence throughput, and error rate (Table <xref ref-type="table" rid="T1">1</xref>). As mentioned above, since hypervariable regions correlate differently with the whole 16S rRNA gene (<xref ref-type="bibr" rid="B88">88</xref>, <xref ref-type="bibr" rid="B89">89</xref>), it is arguably better to sequence shorter reads at greater depths and with lower error rates (e.g., Illumina, Ion Torrent) than longer reads with higher error rates (e.g., PacBio) (<xref ref-type="bibr" rid="B34">34</xref>). The former allows the detection of low-abundance microorganisms (<xref ref-type="bibr" rid="B90">90</xref>, <xref ref-type="bibr" rid="B91">91</xref>) and the avoidance of unnecessary greater computing times due to the description of non-existent organisms caused by artifactual sequences. Although increased read length usually improves classification, platforms such as PacBio are currently limited by their high sequencing error and low yield of sequencing data relative to the other platforms (<xref ref-type="bibr" rid="B34">34</xref>).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p><bold>Specifications of the most commonly used sequencing platforms in microbial community characterization studies</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left">Platform</th>
<th valign="top" align="center">Raw ER<xref ref-type="table-fn" rid="tfn1"><sup>a</sup></xref> (%)</th>
<th valign="top" align="center">ER after denoise<xref ref-type="table-fn" rid="tfn1"><sup>a</sup></xref> (%)</th>
<th valign="top" align="center">Read length (bp)</th>
<th valign="top" align="center">Throughput (Gb/run)</th>
<th valign="top" align="center">Cost/Gb (USD)</th>
<th valign="top" align="left">Known problems</th>
<th valign="top" align="center">Reference</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">454 FLX Titanium</td>
<td align="center" valign="top">1.0&#x02013;2.0</td>
<td align="center" valign="top">&#x0003C;0.02</td>
<td align="center" valign="top">450</td>
<td align="center" valign="top">0.4</td>
<td align="center" valign="top">15,500</td>
<td align="left" valign="top">High error rate in homopolymer regions. Sequence quality decreases in a lengthwise fashion. Soon to be phased out</td>
<td align="center" valign="top">(<xref ref-type="bibr" rid="B16">16</xref>, <xref ref-type="bibr" rid="B25">25</xref>&#x02013;<xref ref-type="bibr" rid="B28">28</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Illumina MiSeq v2</td>
<td align="center" valign="top">0.8&#x02013;1.0</td>
<td align="center" valign="top">&#x0003C;0.02</td>
<td align="center" valign="top">2&#x02009;&#x000D7;&#x02009;250</td>
<td align="center" valign="top">7.5</td>
<td align="center" valign="top">142</td>
<td align="left" valign="top">Sequence quality decreases in a lengthwise fashion. The second read has a higher error rate than the first read. Increased single-base errors in association with GGC motifs</td>
<td align="center" valign="top">(<xref ref-type="bibr" rid="B16">16</xref>, <xref ref-type="bibr" rid="B26">26</xref>, <xref ref-type="bibr" rid="B29">29</xref>&#x02013;<xref ref-type="bibr" rid="B31">31</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Ion Torrent PGM 316 chip</td>
<td align="center" valign="top">1.5</td>
<td align="center" valign="top">NA<xref ref-type="table-fn" rid="tfn2"><sup>b</sup></xref></td>
<td align="center" valign="top">400</td>
<td align="center" valign="top">1</td>
<td align="center" valign="top">674</td>
<td align="left" valign="top">Premature sequence truncation caused by organism- and orientation-dependent biases. Low accuracy in homopolymer regions</td>
<td align="center" valign="top">(<xref ref-type="bibr" rid="B16">16</xref>, <xref ref-type="bibr" rid="B31">31</xref>&#x02013;<xref ref-type="bibr" rid="B33">33</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">PacBio RS II</td>
<td align="center" valign="top">1.8</td>
<td align="center" valign="top">0.3</td>
<td align="center" valign="top">10,000</td>
<td align="center" valign="top">0.1</td>
<td align="center" valign="top">1,100</td>
<td align="left" valign="top">Systematic and non-random errors; G and C are more likely to be deleted than A and C. Preferential loading of shorter sequences into zero-mode waveguides</td>
<td align="center" valign="top">(<xref ref-type="bibr" rid="B16">16</xref>, <xref ref-type="bibr" rid="B27">27</xref>, <xref ref-type="bibr" rid="B31">31</xref>, <xref ref-type="bibr" rid="B34">34</xref>, <xref ref-type="bibr" rid="B35">35</xref>)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn1"><p><italic><sup>a</sup>Error rate calculated by sequencing of 16S amplicons from mock bacterial communities</italic>.</p></fn>
<fn id="tfn2"><p><italic><sup>b</sup>To the best of our knowledge, there are no available studies assessing the error rate of Ion Torrent sequences after bioinformatic curation</italic>.</p></fn></table-wrap-foot></table-wrap>
</sec>
<sec id="S1-4">
<title>Culling of Dubious Sequences</title>
<p>Up to this point, procedures in the archetypical workflow described in Figure <xref ref-type="fig" rid="F1">1</xref> take place <italic>in vitro</italic>. Hereafter, treatment of raw DNA sequences occurs <italic>in silico</italic>. To reduce sequencing error rates, it has become mandatory to apply stringent sequence curation and denoising algorithms. Inadequate cleaning of reads can have many negative effects including limited ability to identify chimeras and inflation of &#x003B1; and &#x003B2; diversity metrics (<xref ref-type="bibr" rid="B92">92</xref>). Low-quality sequences, artifacts, and contamination can compromise the downstream analyses and, thus, must be removed from the dataset.</p>
<p>The first step is the removal of reads with ambiguous base calls (N) in the barcode or in the marker gene amplicon, as it is not possible to determine the true nucleotide sequence (<xref ref-type="bibr" rid="B93">93</xref>). On the other hand, mismatches in the primers and barcodes are usually allowed up to a certain number; the removal of sequences with less than three mismatches has little effect on the reduction of the error rate (<xref ref-type="bibr" rid="B93">93</xref>). Emulsion-PCR-based platforms (e.g., 454, Ion Torrent) are known for producing homopolymer-associated indel errors (<xref ref-type="bibr" rid="B33">33</xref>); these artifacts have been shown to account for a large proportion of errors in benchmark studies using mock communities and to be associated with low-quality scores (<xref ref-type="bibr" rid="B92">92</xref>). Therefore, reads with homopolymers longer than eight nucleotides should be culled (<xref ref-type="bibr" rid="B25">25</xref>).</p>
<p>In addition, in most sequencing platforms (e.g., 454, Illumina, Ion Torrent), quality scores reduce in a lengthwise fashion, and it is possible to identify breakpoints where the quality criteria are not met. Sequences can be trimmed to those breakpoints to reduce the overall error rate. Two trimming approaches have been widely used: a &#x0201C;hard cutoff&#x0201D; method trims the sequences at the first nucleotide with a quality score below a given threshold (<xref ref-type="bibr" rid="B94">94</xref>); this minimizes the error rate but also reduces the average sequence length. Another method, called &#x0201C;sliding window,&#x0201D; calculates the average quality score within a sequence window (or substring) and trims when the average quality score within that window drops below a threshold; the latter method has the advantage that reduces the overall error rate without reducing the average sequence length (<xref ref-type="bibr" rid="B25">25</xref>). Reads with anomalous lengths (well above or below the expected value for a given technology) are also removed, as they likely represent PCR or sequencing errors, or become not informative as a result of the quality trimming (<xref ref-type="bibr" rid="B93">93</xref>).</p>
<p>The use of a pre-clustering algorithm has also been shown to reduce the number of sequences that are the result of sequencing errors and to predict with higher accuracy the number of expected OTUs in template preparations of known taxonomic composition (<xref ref-type="bibr" rid="B95">95</xref>). It assumes that rare sequences are more likely to derive from abundant sequences and can, therefore, be merged if they are within a specified similarity threshold. This threshold must always be lower than the value used for OTU clustering, usually 1% (<xref ref-type="bibr" rid="B25">25</xref>).</p>
<p>Also, contaminant sequences must be removed from the dataset. Due to the nature of the 16S rRNA gene, mitochondria, chloroplast (<xref ref-type="bibr" rid="B96">96</xref>), and other eukaryotic sequences are likely to be amplified and should be identified and discarded, along with sequences unclassified at the domain level; according to the scope of the study and the primers used, bacterial or archaeal sequences would also be needed to get removed.</p>
</sec>
<sec id="S1-5">
<title>Chimera Removal</title>
<p>Sequences composed of two or more parents are named chimeras. Chimeras are a serious concern in studies of the gut microbiome because they can lead to the description of non-existent organisms and inflate diversity metrics. This kind of artifact arises from errors during PCR, and several factors influence its appearance, such as DNA damage (<xref ref-type="bibr" rid="B97">97</xref>), the amplification of highly similar sequences (<xref ref-type="bibr" rid="B98">98</xref>), a high number of cycles, and short elongation times (<xref ref-type="bibr" rid="B99">99</xref>). This suggests that prematurely terminated amplicons that anneal to a homologous template to prime the next PCR cycle are likely to be the major cause of chimera formation.</p>
<p>The detection of chimeras in libraries of 16S amplicons is particularly challenging, as sequences are short and highly similar. There are multiple algorithms designed to detect and remove chimeric sequences (<xref ref-type="bibr" rid="B100">100</xref>&#x02013;<xref ref-type="bibr" rid="B107">107</xref>), which follow the same basic principle: substrings or fragments of the query sequence are compared to a set of reference sequences in order to establish if the said substrings match different references. Once a chimera is identified, it is removed from the dataset. Some algorithms use allegedly chimera-free 16S sequence databases as reference, including Chimera Slayer (<xref ref-type="bibr" rid="B105">105</xref>) and DECIPHER (<xref ref-type="bibr" rid="B108">108</xref>). Others [e.g., Perseus (<xref ref-type="bibr" rid="B106">106</xref>), UCHIME (<xref ref-type="bibr" rid="B107">107</xref>)] use a database-free approach that assumes that the most abundant sequences from the query dataset are unlikely to be chimeric and can, therefore, be used as reference. Database dependency influences the ability of different algorithms to identify and remove chimeras (<xref ref-type="bibr" rid="B109">109</xref>). Database-independent algorithms have the advantage of being able to detect them even if the studied community is poorly described (<xref ref-type="bibr" rid="B25">25</xref>). In contrast, database-dependent algorithms rely on reference collections that only contain gene sequences from cultured bacteria and are not expected to perform as well on samples that contain sequences from yet uncultured organisms (<xref ref-type="bibr" rid="B24">24</xref>), something very common in studies of the gut microbiome. Thus, the use of algorithms that do not rely on databases should be preferred in order to minimize the inflation of diversity caused by chimeras, especially when dealing with poorly characterized gut microbial communities.</p>
</sec>
<sec id="S1-6">
<title>OTU Clustering and Taxonomy Assignment</title>
<sec id="S1-6-1">
<title>Sequence Grouping</title>
<p>In order to describe and compare gut microbiomes or shifts in the gut microbiome following intervention, diversity metrics should be estimated (e.g., Chao-1, UniFrac), which requires information about the composition and abundance of organisms in said communities. Currently, two approaches are used to characterize microbial communities: taxonomic-dependent (also called phylotype analysis) and OTU-based methods (<xref ref-type="bibr" rid="B110">110</xref>).</p>
<p>The taxonomic-dependent methods rely on reference databases of full-length 16S rRNA gene sequences from cultured microorganisms (i.e., with a known taxonomy). Some popular reference databases are Greengenes (<xref ref-type="bibr" rid="B111">111</xref>), SILVA (<xref ref-type="bibr" rid="B112">112</xref>), and RDP (<xref ref-type="bibr" rid="B113">113</xref>). Query sequences are compared against the reference database and assigned to the organism of the best-matched reference (<xref ref-type="bibr" rid="B114">114</xref>). While this approach is computationally fast and allows the straightforward taxonomic labeling of a query sequence, indicating its relationship to previously characterized microorganisms, it is hindered by the lack of well-annotated or incomplete databases (<xref ref-type="bibr" rid="B115">115</xref>). This is exacerbated when working with genes other than the 16S rRNA or with sequences from hard-to-culture or yet uncultured organisms, as is usually the case of colonic microbes, making them inherently limited (<xref ref-type="bibr" rid="B116">116</xref>).</p>
<p>On the other hand, OTU-based methods do not rely on reference databases; they calculate a distance matrix among all query sequences and group them based on their similarity at a given threshold. Since grouping does not require previous taxonomic information, these OTU-based methods perform very well with poorly characterized microorganisms. OTU-based methods are not without faults, however. They are usually computationally exigent and prone to overestimation due to low-quality sequences, contamination, chimeras, etc. (<xref ref-type="bibr" rid="B117">117</xref>).</p>
<p>In turn, most OTU-clustering algorithms fall into two broad categories, hierarchical clustering (HC) and greedy heuristic clustering (GHC). HC and GHC differ in the methods for comparison of sequences and clustering into OTUs, their computational requirements, and the accuracy of the result. HC methods start by generating distance matrices that measure the distance between each pair of sequences in the dataset, either by multiple [e.g., Mothur (<xref ref-type="bibr" rid="B118">118</xref>)] or pairwise [e.g., ESPRIT (<xref ref-type="bibr" rid="B119">119</xref>)] sequence alignments, and then apply standard HC (single, complete or average linkage clustering) to group OTUs at a given threshold (usually, 97%). While debated (<xref ref-type="bibr" rid="B120">120</xref>), the use of multiple sequence alignment is preferred over pairwise alignments because it preserves positional homology across all sequences (<xref ref-type="bibr" rid="B121">121</xref>). The incorporation of the secondary structure of the 16S rRNA molecule into the alignment provides additional biological information that strengthens the confidence that positional homology is being conserved (<xref ref-type="bibr" rid="B122">122</xref>, <xref ref-type="bibr" rid="B123">123</xref>). HC methods are computationally complex; however, several approaches have been devised to reduce their complexity and computer memory requirements (<xref ref-type="bibr" rid="B116">116</xref>, <xref ref-type="bibr" rid="B119">119</xref>, <xref ref-type="bibr" rid="B121">121</xref>), and software such as Mothur (from version 1.27.0) performs well with reasonable computer capacities.</p>
<p>Yet, computational requirements of HC algorithms can be a real headache in the analysis of many fecal samples; GHC algorithms have been developed to this end. They process input sequences one at a time, hence, avoiding the comparison of all pairs of sequences and the construction of a distance matrix (<xref ref-type="bibr" rid="B115">115</xref>). In GHC, the query sequence is compared against a set of seed sequences (or centroids) that are representative of existing clusters; if the similarity of the query and the seed sequences is above a given threshold (usually, 97%), the query sequence is assigned to the existing cluster, otherwise it becomes the seed of a new cluster or it is discarded. The seed sequences can be obtained either by generating them <italic>de novo</italic> [e.g., CD-HIT (<xref ref-type="bibr" rid="B124">124</xref>) or UCLUST (<xref ref-type="bibr" rid="B125">125</xref>)] or from a database of predefined centroids [e.g., UCLUST as implemented by QIIME (<xref ref-type="bibr" rid="B126">126</xref>, <xref ref-type="bibr" rid="B127">127</xref>)]; the latter approach has the same limitations of other database-dependent methods, as discussed above. Furthermore, the centroid databases are constructed by clustering full-length sequences at a defined threshold; when used to cluster partial sequences, problems may arise. Some taxa may have identical sequences within a specific 16S sub-region, yet, they can be below the predefined threshold when the full-length sequence is considered; the opposite would also be true.</p>
<p>As with other steps in the workflow discussed here, there is a trade-off between complexity and accuracy. Different clustering methods can yield different results from identical datasets; their performance varies according to the complexity and the abundance ratio of the sequences in the dataset and the selected similarity threshold (<xref ref-type="bibr" rid="B117">117</xref>). Benchmark studies have consistently shown that methods such as complete linkage (HC), average linkage (HC), and CD-HIT (GHC) are robust to changing OTU thresholds and produce consistent clusters. On the other hand, single linkage (HC) produces OTUs that are not homogeneous and together with UCLUST (GHC) and UPARSE (GHC) have been shown to be very sensitive to threshold definitions and to have reproducibility issues, thus, in our opinion, their use should be less encouraged (<xref ref-type="bibr" rid="B115">115</xref>, <xref ref-type="bibr" rid="B128">128</xref>, <xref ref-type="bibr" rid="B129">129</xref>).</p>
</sec>
<sec id="S1-6-2">
<title>Taxonomic Assignment</title>
<p>In order to establish the biological significance of any intervention on the gut microbiome, it is usually desired to give a taxonomic classification to the previously detected OTUs. Several methods for the taxonomic assignment of 16S rRNA gene sequences are available and are based on different principles, such as k-mer count [SINA (<xref ref-type="bibr" rid="B130">130</xref>), RDP Bayesian classifier (<xref ref-type="bibr" rid="B88">88</xref>)], multiple sequence alignment [NAST (<xref ref-type="bibr" rid="B131">131</xref>)], BLAST [TUIT (<xref ref-type="bibr" rid="B132">132</xref>)], and machine learning algorithms [16S classifier (<xref ref-type="bibr" rid="B133">133</xref>)], among others. Although new algorithms continue to be developed, the RDP Bayesian classifier remains the most widely used tool for taxonomic assignment of 16S sequences; it provides taxonomic assignments from domain to genus, with confidence estimates for each assignment. The misclassification rate of short sequences varies approximately from 16 to 20% according to the dataset used to train the algorithm and the 16S rRNA gene region (<xref ref-type="bibr" rid="B114">114</xref>). As with others database-dependent methods, flaws in the databases will unavoidably lead to flaws in classification; fortunately, the approach used to label OTUs can reduce the error.</p>
<p>Regardless of the algorithm, OTUs can be classified either by assigning them the taxonomy of a representative sequence (<xref ref-type="bibr" rid="B127">127</xref>) or by classifying every sequence in the OTU and assigning the taxonomy by majority consensus (<xref ref-type="bibr" rid="B116">116</xref>). The former method can yield a less robust classification; if an OTU is composed of related sequences but with divergent taxonomies, the classification of a single sequence can lead to an erroneous classification of the entire OTU. Therefore, we recommended using majority-consensus taxonomy to the cost of a less detailed classification (genus, species).</p>
</sec>
</sec>
<sec id="S1-7">
<title>Copy Number Variation</title>
<p>A problem that arises when studying the gut microbiome is the difference in the number of copies of the 16S rRNA gene among species, which can range from a single copy up to 15 (<xref ref-type="bibr" rid="B134">134</xref>). This variation can lead to erroneous abundance assessment; at equal number of cells, taxa with few copies of the 16S rRNA gene have lower amplicon counts than taxa with more copies of the gene. Therefore, CNV can result in over or underestimation of microbial abundance. CNV has not deserved full attention; yet, it is of utmost importance since it can result in a biased description of the microbial community. Indeed, it has been suggested that bacterial diversity could be overestimated by a factor of 3 due to 16S CNV (<xref ref-type="bibr" rid="B135">135</xref>).</p>
<p>In microorganisms with known 16S rRNA gene copy number, CNV could be corrected by weighting read counts by the inverse of its gene copy number. However, the problem is more difficult to deal with in cases where the gene copy number is unknown. A possible solution in these cases is to use the value of a closely related organism (<xref ref-type="bibr" rid="B136">136</xref>). Another possibility is to place 16S reads on a phylogenetic tree and calculate gene copy number using phylogenetically independent contrasts (<xref ref-type="bibr" rid="B137">137</xref>, <xref ref-type="bibr" rid="B138">138</xref>). While these methods have been shown to improve the measures of diversity and abundance of microbial communities, they rely on databases of 16S and sequenced genomes, which, as with phylotype-based clustering, lack information of uncultured and poorly cultured organisms. In cases of poorly studied deep evolutionary lineages (say, rare phyla), CNV correction is definitely an unsolved issue.</p>
<p>Although CNV can move away estimates of diversity from reality, it must be noted that researchers usually want to compare these estimates between treatments (e.g., obese vs. lean, vaginal delivery vs. C-section, probiotic vs. placebo). In other words, we look for relative changes in the abundance of OTUs A, B, and C; even if they would be badly estimated due to the assumption that they only have one 16S rRNA gene each, what is important is to see how populations change under different tested conditions. The take-home message from CNV is that we should emphasize more comparisons of the same OTU among samples than comparisons among OTUs within samples.</p>
</sec>
<sec id="S1-8">
<title>Essential Information That Should Be Included when Describing a Microbiome Study</title>
<p>In order to guarantee reproducibility of results, we encourage researchers and journals to explicitly include and require the following technical information in microbiome publications: (I) DNA extraction method, including the type of extraction kit if one was used and modifications to the standard protocol proposed by the manufacturer; (II) description of how DNA contamination was controlled for (e.g., DNA extraction of negative controls); (III) 16S rRNA hypervariable region targeted including the nucleotide sequences of the primers used; (IV) sequencing technology employed; (V) description of sequencing error-rate assessment (e.g., was a mock community sequenced in parallel with the samples?); and (VI) description of <italic>in silico</italic> analyses (culling of dubious sequences, removal of chimeras, OTU clustering and taxonomy assignment, copy number variation correction), including the code or command lines with parameters used if appropriate.</p>
</sec>
</sec>
<sec id="S2">
<title>Conclusion</title>
<p>The study of the gut microbiome is revolutionizing medicine and science by allowing understanding how microbes are intimately involved in many physiological processes. The gut microbiome is shifting from an appealing object of study to a precision medicine target. NGS have enabled the possibility to gather the most impressive amount of microbiome data at costs and speeds that were unthinkable a decade ago. However, these technologies have introduced new challenges in data analysis that researchers must take care of. We have, here, discussed some of these challenges and suggested ways to control them using available tools (see Table <xref ref-type="table" rid="T2">2</xref> for our recommendations to reduce the impact of these pitfalls). Our hope is that, while a minimum information standard that unifies the procedures of microbiome studies is established, researchers implement rigorous analyses so that their results better represent the microbial communities under scrutiny. Only by making as stringent as possible analyses and by guaranteeing the transparency and reproducibility of microbiome analyses (<xref ref-type="bibr" rid="B139">139</xref>) we will give the field its first dose of &#x0201C;healthy skepticism&#x0201D; (<xref ref-type="bibr" rid="B140">140</xref>).</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p><bold>Recommendations to reduce the impact of biases introduced in the different steps of the analysis of microbial communities using the 16S rRNA gene</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left">Step</th>
<th valign="top" align="left">Main challenge</th>
<th valign="top" align="left">Possible solution</th>
<th valign="top" align="left">Importance</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top" rowspan="3">DNA extraction</td>
<td align="left" valign="top">Uneven representation of the microbial community under scrutiny.</td>
<td align="left" valign="top">The use of a DNA extraction method that includes a bead-beating step results in a more comprehensive representation of the microbial community.</td>
<td align="left" valign="top">Moderate</td>
</tr>
<tr>
<td align="left" valign="top">Differential representation of microbial communities due to differences in DNA extraction kits.</td>
<td align="left" valign="top">Direct comparisons should be carried only between studies using the same DNA extraction kit.</td>
<td align="left" valign="top">Moderate</td>
</tr>
<tr>
<td align="left" valign="top">Contamination by microbial DNA from the DNA extraction and PCR reagents.</td>
<td align="left" valign="top">In order to reduce the risk of contamination, the starting biomass should be maximized. To control it, the samples must be processed in random order, the kit lots must be included as metadata and technical controls from the reagents must be sequenced.</td>
<td align="left" valign="top">Moderate</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="4">Multi-template PCR</td>
<td align="left" valign="top">Differences in the estimated phylogenetic diversity between hypervariable regions of the 16S rRNA gene.</td>
<td align="left" valign="top">The region that best approximates the phylogenetic diversity given by the whole gene should be selected. The V4 region has been shown to approximate the phylogenetic diversity given by the whole gene and to result in best taxonomy labeling.</td>
<td align="left" valign="top">Moderate</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="3">Uneven coverage of different microbial taxa by the PCR primers.</td>
<td align="left" valign="top">Bioinformatic tools, such as SILVA TestPrime, allow the evaluation of primers, and the ones with the highest coverage rate for the taxa known to be present in the microbial community of interest should be selected.</td>
<td align="left" valign="top">Moderate</td>
</tr>
<tr>
<td align="left" valign="top">The microbial coverage is maximized by using degenerate primers.</td>
<td align="left" valign="top">High</td>
</tr>
<tr>
<td align="left" valign="top">Direct comparisons should be carried only between studies using the same set of primers.</td>
<td align="left" valign="top">Moderate</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="2">Amplicon sequencing by NGS</td>
<td align="left" valign="top">Sequencing platform selection.</td>
<td align="left" valign="top">The selection of the sequencing platform should be made prioritizing error rate over sequencing depth and read length.</td>
<td align="left" valign="top">High</td>
</tr>
<tr>
<td align="left" valign="top">Assessment of the quality of the sequencing run.</td>
<td align="left" valign="top">The sequencing of a mock community allows the quality assessment of each individual amplicon sequencing run.</td>
<td align="left" valign="top">High</td>
</tr>
<tr>
<td align="left" valign="top">Culling of dubious sequences</td>
<td align="left" valign="top">Overestimation of diversity caused by spurious sequences.</td>
<td align="left" valign="top">Apply a stringent sequence denoising and curation procedures and assess their effectiveness by determining the final error rate using a sequenced mock community.</td>
<td align="left" valign="top">High</td>
</tr>
<tr>
<td align="left" valign="top">Chimera removal</td>
<td align="left" valign="top">Overestimation of diversity caused by non-existent organisms (chimeric sequences).</td>
<td align="left" valign="top">The use of database-free approaches, especially when studying poorly characterized environments, is encouraged.</td>
<td align="left" valign="top">Moderate</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="3">OTU clustering and taxonomy assignment</td>
<td align="left" valign="top" rowspan="2">Overestimation of diversity caused by clustering algorithms.</td>
<td align="left" valign="top">Database-free OTU-based methods should be preferred over taxonomic-dependent (phylotyping) approaches.</td>
<td align="left" valign="top">Moderate</td>
</tr>
<tr>
<td align="left" valign="top">If computationally possible, the use of hierarchical methods such as average or complete linkage should be used, otherwise, a heuristic method such as CD-HIT is suggested.</td>
<td align="left" valign="top">Moderate</td>
</tr>
<tr>
<td align="left" valign="top">Erroneous taxonomic classification of OTUs.</td>
<td align="left" valign="top">The taxonomic assignment should be carried by majority consensus of the sequences within the OTU.</td>
<td align="left" valign="top">Moderate</td>
</tr>
<tr>
<td align="left" valign="top">Copy number variation</td>
<td align="left" valign="top">Over- or underestimation of diversity caused by erroneous abundance assessment.</td>
<td align="left" valign="top">While algorithms that correct CNV exist, they depend on whole genome sequence data, which may not be available for poorly described microorganisms, thus, their use is not encouraged</td>
<td align="left" valign="top">Low</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="S3">
<title>Author Contributions</title>
<p>JC-Z and JE devised, wrote, and made corrections to the manuscript.</p>
</sec>
<sec id="S4">
<title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<ack>
<p>We thank Colciencias for funding the Young Researcher internship of JC-Z and CABBIO-Colciencias for sponsoring the attendance of JC-Z to the CABBIO course organized by the Environmental Microbiology Lab of &#x0201C;Centro Nacional Patag&#x000F3;nico&#x0201D; in Puerto Madryn, Chubut, Argentina. We also thank Roberto Jim&#x000E9;nez-Cardona for insightful comments on the manuscript.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1"><label>1</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Br&#x000FC;ssow</surname> <given-names>H</given-names></name></person-group>. <article-title>Human microbiota: &#x0201C;the philosophers have only interpreted the world in various ways. The point, however, is to change it&#x0201D;</article-title>. <source>Microb Biotechnol</source> (<year>2015</year>) <volume>8</volume>(<issue>1</issue>):<fpage>11</fpage>&#x02013;<lpage>2</lpage>.<pub-id pub-id-type="doi">10.1111/1751-7915.12259</pub-id></citation></ref>
<ref id="B2"><label>2</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zmora</surname> <given-names>N</given-names></name> <name><surname>Zeevi</surname> <given-names>D</given-names></name> <name><surname>Korem</surname> <given-names>T</given-names></name> <name><surname>Segal</surname> <given-names>E</given-names></name> <name><surname>Elinav</surname> <given-names>E</given-names></name></person-group>. <article-title>Taking it personally: personalized utilization of the human microbiome in health and disease</article-title>. <source>Cell Host Microbe</source> (<year>2016</year>) <volume>19</volume>(<issue>1</issue>):<fpage>12</fpage>&#x02013;<lpage>20</lpage>.<pub-id pub-id-type="doi">10.1016/j.chom.2015.12.016</pub-id><pub-id pub-id-type="pmid">26764593</pub-id></citation></ref>
<ref id="B3"><label>3</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Foxx-Orenstein</surname> <given-names>AE</given-names></name> <name><surname>Chey</surname> <given-names>WD</given-names></name></person-group>. <article-title>Manipulation of the gut microbiota as a novel treatment strategy for gastrointestinal disorders</article-title>. <source>Am J Gastroenterol Suppl</source> (<year>2012</year>) <volume>1</volume>(<issue>1</issue>):<fpage>41</fpage>&#x02013;<lpage>6</lpage>.<pub-id pub-id-type="doi">10.1038/ajgsup.2012.8</pub-id></citation></ref>
<ref id="B4"><label>4</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>He</surname> <given-names>C</given-names></name> <name><surname>Shan</surname> <given-names>Y</given-names></name> <name><surname>Song</surname> <given-names>W</given-names></name></person-group>. <article-title>Targeting gut microbiota as a possible therapy for diabetes</article-title>. <source>Nutr Res</source> (<year>2015</year>) <volume>35</volume>(<issue>5</issue>):<fpage>361</fpage>&#x02013;<lpage>7</lpage>.<pub-id pub-id-type="doi">10.1016/j.nutres.2015.03.002</pub-id><pub-id pub-id-type="pmid">25818484</pub-id></citation></ref>
<ref id="B5"><label>5</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Butel</surname> <given-names>M-J</given-names></name></person-group>. <article-title>Probiotics, gut microbiota and health</article-title>. <source>M&#x000E9;decine Mal Infect.</source> (<year>2014</year>) <volume>44</volume>(<issue>1</issue>):<fpage>1</fpage>&#x02013;<lpage>8</lpage>.<pub-id pub-id-type="doi">10.1016/j.medmal.2013.10.002</pub-id></citation></ref>
<ref id="B6"><label>6</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cammarota</surname> <given-names>G</given-names></name> <name><surname>Ianiro</surname> <given-names>G</given-names></name> <name><surname>Bibb&#x000F2;</surname> <given-names>S</given-names></name> <name><surname>Gasbarrini</surname> <given-names>A</given-names></name></person-group>. <article-title>Gut microbiota modulation: probiotics, antibiotics or fecal microbiota transplantation?</article-title> <source>Intern Emerg Med</source> (<year>2014</year>) <volume>9</volume>(<issue>4</issue>):<fpage>365</fpage>&#x02013;<lpage>73</lpage>.<pub-id pub-id-type="doi">10.1007/s11739-014-1069-4</pub-id><pub-id pub-id-type="pmid">24664520</pub-id></citation></ref>
<ref id="B7"><label>7</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Walsh</surname> <given-names>CJ</given-names></name> <name><surname>Guinane</surname> <given-names>CM</given-names></name> <name><surname>O&#x02019;Toole</surname> <given-names>PW</given-names></name> <name><surname>Cotter</surname> <given-names>PD</given-names></name></person-group>. <article-title>Beneficial modulation of the gut microbiota</article-title>. <source>FEBS Lett</source> (<year>2014</year>) <volume>588</volume>(<issue>22</issue>):<fpage>4120</fpage>&#x02013;<lpage>30</lpage>.<pub-id pub-id-type="doi">10.1016/j.febslet.2014.03.035</pub-id><pub-id pub-id-type="pmid">24681100</pub-id></citation></ref>
<ref id="B8"><label>8</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>van Nood</surname> <given-names>E</given-names></name> <name><surname>Vrieze</surname> <given-names>A</given-names></name> <name><surname>Nieuwdorp</surname> <given-names>M</given-names></name> <name><surname>Fuentes</surname> <given-names>S</given-names></name> <name><surname>Zoetendal</surname> <given-names>EG</given-names></name> <name><surname>de Vos</surname> <given-names>WM</given-names></name> <etal/></person-group> <article-title>Duodenal infusion of donor feces for recurrent <italic>Clostridium difficile</italic></article-title>. <source>N Engl J Med</source> (<year>2013</year>) <volume>368</volume>(<issue>5</issue>):<fpage>407</fpage>&#x02013;<lpage>15</lpage>.<pub-id pub-id-type="doi">10.1056/NEJMoa1205037</pub-id><pub-id pub-id-type="pmid">23323867</pub-id></citation></ref>
<ref id="B9"><label>9</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Alander</surname> <given-names>M</given-names></name> <name><surname>De Smet</surname> <given-names>I</given-names></name> <name><surname>Nollet</surname> <given-names>L</given-names></name> <name><surname>Verstraete</surname> <given-names>W</given-names></name> <name><surname>von Wright</surname> <given-names>A</given-names></name> <name><surname>Mattila-Sandholm</surname> <given-names>T</given-names></name></person-group>. <article-title>The effect of probiotic strains on the microbiota of the simulator of the human intestinal microbial ecosystem (SHIME)</article-title>. <source>Int J Food Microbiol</source> (<year>1999</year>) <volume>46</volume>(<issue>1</issue>):<fpage>71</fpage>&#x02013;<lpage>9</lpage>.<pub-id pub-id-type="doi">10.1016/S0168-1605(98)00182-2</pub-id><pub-id pub-id-type="pmid">10050686</pub-id></citation></ref>
<ref id="B10"><label>10</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chung</surname> <given-names>WSF</given-names></name> <name><surname>Walker</surname> <given-names>AW</given-names></name> <name><surname>Louis</surname> <given-names>P</given-names></name> <name><surname>Parkhill</surname> <given-names>J</given-names></name> <name><surname>Vermeiren</surname> <given-names>J</given-names></name> <name><surname>Bosscher</surname> <given-names>D</given-names></name> <etal/></person-group> <article-title>Modulation of the human gut microbiota by dietary fibres occurs at the species level</article-title>. <source>BMC Biol</source> (<year>2016</year>) <volume>14</volume>(<issue>1</issue>):<fpage>3</fpage>.<pub-id pub-id-type="doi">10.1186/s12915-015-0224-3</pub-id><pub-id pub-id-type="pmid">26754945</pub-id></citation></ref>
<ref id="B11"><label>11</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kovacs</surname> <given-names>A</given-names></name> <name><surname>Ben-Jacob</surname> <given-names>N</given-names></name> <name><surname>Tayem</surname> <given-names>H</given-names></name> <name><surname>Halperin</surname> <given-names>E</given-names></name> <name><surname>Iraqi</surname> <given-names>FA</given-names></name> <name><surname>Gophna</surname> <given-names>U</given-names></name></person-group>. <article-title>Genotype is a stronger determinant than sex of the mouse gut microbiota</article-title>. <source>Microb Ecol</source> (<year>2011</year>) <volume>61</volume>(<issue>2</issue>):<fpage>423</fpage>&#x02013;<lpage>8</lpage>.<pub-id pub-id-type="doi">10.1007/s00248-010-9787-2</pub-id><pub-id pub-id-type="pmid">21181142</pub-id></citation></ref>
<ref id="B12"><label>12</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tang</surname> <given-names>K</given-names></name> <name><surname>Liu</surname> <given-names>K</given-names></name> <name><surname>Jiao</surname> <given-names>N</given-names></name> <name><surname>Zhang</surname> <given-names>Y</given-names></name> <name><surname>Chen</surname> <given-names>C-TA</given-names></name></person-group>. <article-title>Functional metagenomic investigations of microbial communities in a shallow-sea hydrothermal system</article-title>. <source>PLoS One</source> (<year>2013</year>) <volume>8</volume>(<issue>8</issue>):<fpage>e72958</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0072958</pub-id><pub-id pub-id-type="pmid">23940820</pub-id></citation></ref>
<ref id="B13"><label>13</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Afshinnekoo</surname> <given-names>E</given-names></name> <name><surname>Meydan</surname> <given-names>C</given-names></name> <name><surname>Chowdhury</surname> <given-names>S</given-names></name> <name><surname>Jaroudi</surname> <given-names>D</given-names></name> <name><surname>Boyer</surname> <given-names>C</given-names></name> <name><surname>Bernstein</surname> <given-names>N</given-names></name> <etal/></person-group> <article-title>Geospatial resolution of human and bacterial diversity with city-scale metagenomics</article-title>. <source>Cell Syst</source> (<year>2015</year>) <volume>1</volume>(<issue>1</issue>):<fpage>72</fpage>&#x02013;<lpage>87</lpage>.<pub-id pub-id-type="doi">10.1016/j.cels.2015.01.001</pub-id></citation></ref>
<ref id="B14"><label>14</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>L</given-names></name> <name><surname>Li</surname> <given-names>Y</given-names></name> <name><surname>Li</surname> <given-names>S</given-names></name> <name><surname>Hu</surname> <given-names>N</given-names></name> <name><surname>He</surname> <given-names>Y</given-names></name> <name><surname>Pong</surname> <given-names>R</given-names></name> <etal/></person-group> <article-title>Comparison of next-generation sequencing systems</article-title>. <source>J Biomed Biotechnol</source> (<year>2012</year>) <volume>2012</volume>:<fpage>251364</fpage>.<pub-id pub-id-type="doi">10.1155/2012/251364</pub-id><pub-id pub-id-type="pmid">22829749</pub-id></citation></ref>
<ref id="B15"><label>15</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Frey</surname> <given-names>KG</given-names></name> <name><surname>Herrera-Galeano</surname> <given-names>JE</given-names></name> <name><surname>Redden</surname> <given-names>CL</given-names></name> <name><surname>Luu</surname> <given-names>TV</given-names></name> <name><surname>Servetas</surname> <given-names>SL</given-names></name> <name><surname>Mateczun</surname> <given-names>AJ</given-names></name> <etal/></person-group> <article-title>Comparison of three next-generation sequencing platforms for metagenomic sequencing and identification of pathogens in blood</article-title>. <source>BMC Genomics</source> (<year>2014</year>) <volume>15</volume>(<issue>1</issue>):<fpage>96</fpage>.<pub-id pub-id-type="doi">10.1186/1471-2164-15-96</pub-id><pub-id pub-id-type="pmid">24495417</pub-id></citation></ref>
<ref id="B16"><label>16</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Glenn</surname> <given-names>TC</given-names></name></person-group>. <article-title>Field guide to next-generation DNA sequencers</article-title>. <source>Mol Ecol Resour</source> (<year>2011</year>) <volume>11</volume>(<issue>5</issue>):<fpage>759</fpage>&#x02013;<lpage>69</lpage>.<pub-id pub-id-type="doi">10.1111/j.1755-0998.2011.03024.x</pub-id><pub-id pub-id-type="pmid">21592312</pub-id></citation></ref>
<ref id="B17"><label>17</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Case</surname> <given-names>RJ</given-names></name> <name><surname>Boucher</surname> <given-names>Y</given-names></name> <name><surname>Dahll&#x000F6;f</surname> <given-names>I</given-names></name> <name><surname>Holmstr&#x000F6;m</surname> <given-names>C</given-names></name> <name><surname>Doolittle</surname> <given-names>WF</given-names></name> <name><surname>Kjelleberg</surname> <given-names>S</given-names></name></person-group>. <article-title>Use of 16S rRNA and rpoB genes as molecular markers for microbial ecology studies</article-title>. <source>Appl Environ Microbiol</source> (<year>2007</year>) <volume>73</volume>(<issue>1</issue>):<fpage>278</fpage>&#x02013;<lpage>88</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.01177-06</pub-id><pub-id pub-id-type="pmid">17071787</pub-id></citation></ref>
<ref id="B18"><label>18</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>D</given-names></name> <name><surname>Jospin</surname> <given-names>G</given-names></name> <name><surname>Eisen</surname> <given-names>JA</given-names></name></person-group>. <article-title>Systematic identification of gene families for use as markers for phylogenetic and phylogeny-driven ecological studies of bacteria and archaea and their major subgroups</article-title>. <source>PLoS One</source> (<year>2013</year>) <volume>8</volume>(<issue>10</issue>):<fpage>e77033</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0077033</pub-id><pub-id pub-id-type="pmid">24146954</pub-id></citation></ref>
<ref id="B19"><label>19</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gaby</surname> <given-names>JC</given-names></name> <name><surname>Buckley</surname> <given-names>DH</given-names></name></person-group>. <article-title>A comprehensive aligned nifH gene database: a multipurpose tool for studies of nitrogen-fixing bacteria</article-title>. <source>Database (Oxford)</source> (<year>2014</year>) <volume>2014</volume>(<issue>0</issue>):<fpage>bau001</fpage>.<pub-id pub-id-type="doi">10.1093/database/bau001</pub-id><pub-id pub-id-type="pmid">24501396</pub-id></citation></ref>
<ref id="B20"><label>20</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Barriuso</surname> <given-names>J</given-names></name> <name><surname>Valverde</surname> <given-names>JR</given-names></name> <name><surname>Mellado</surname> <given-names>RP</given-names></name></person-group>. <article-title>Estimation of bacterial diversity using next generation sequencing of 16S rDNA: a comparison of different workflows</article-title>. <source>BMC Bioinformatics</source> (<year>2011</year>) <volume>12</volume>(<issue>1</issue>):<fpage>473</fpage>.<pub-id pub-id-type="doi">10.1186/1471-2105-12-473</pub-id><pub-id pub-id-type="pmid">22168258</pub-id></citation></ref>
<ref id="B21"><label>21</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ravel</surname> <given-names>J</given-names></name> <name><surname>Blaser</surname> <given-names>MJ</given-names></name> <name><surname>Braun</surname> <given-names>J</given-names></name> <name><surname>Brown</surname> <given-names>E</given-names></name> <name><surname>Bushman</surname> <given-names>FD</given-names></name> <name><surname>Chang</surname> <given-names>EB</given-names></name> <etal/></person-group> <article-title>Human microbiome science: vision for the future, Bethesda, MD, July 24 to 26, 2013</article-title>. <source>Microbiome</source> (<year>2014</year>) <volume>2</volume>(<issue>1</issue>):<fpage>16</fpage>.<pub-id pub-id-type="doi">10.1186/2049-2618-2-16</pub-id></citation></ref>
<ref id="B22"><label>22</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Frick</surname> <given-names>J-S</given-names></name> <name><surname>Haller</surname> <given-names>D</given-names></name></person-group>. <article-title>Intestinal microbiota: from sequencing to function</article-title>. <source>Int J Med Microbiol</source> (<year>2016</year>).<pub-id pub-id-type="doi">10.1016/j.ijmm.2016.02.007</pub-id></citation></ref>
<ref id="B23"><label>23</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pekkala</surname> <given-names>S</given-names></name> <name><surname>Munukka</surname> <given-names>E</given-names></name> <name><surname>Rintala</surname> <given-names>A</given-names></name> <name><surname>Huovinen</surname> <given-names>P</given-names></name></person-group>. <article-title>The microbiome studies in metabolic diseases have advanced but are poorly standardized and lack a mechanistic perspective</article-title>. <source>J Diabetes Metab</source> (<year>2015</year>) <volume>6</volume>:<fpage>480</fpage>.<pub-id pub-id-type="doi">10.4172/2155-6156.1000480</pub-id></citation></ref>
<ref id="B24"><label>24</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Avershina</surname> <given-names>E</given-names></name> <name><surname>Rudi</surname> <given-names>K</given-names></name></person-group>. <article-title>Confusion about the species richness of human gut microbiota</article-title>. <source>Benef Microbes</source> (<year>2015</year>) <volume>6</volume>(<issue>5</issue>):<fpage>657</fpage>&#x02013;<lpage>9</lpage>.<pub-id pub-id-type="doi">10.3920/BM2015.0007</pub-id><pub-id pub-id-type="pmid">26036144</pub-id></citation></ref>
<ref id="B25"><label>25</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schloss</surname> <given-names>PD</given-names></name> <name><surname>Gevers</surname> <given-names>D</given-names></name> <name><surname>Westcott</surname> <given-names>SL</given-names></name></person-group>. <article-title>Reducing the effects of PCR amplification and sequencing artifacts on 16S rRNA-based studies</article-title>. <source>PLoS One</source> (<year>2011</year>) <volume>6</volume>(<issue>12</issue>):<fpage>e27310</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0027310</pub-id><pub-id pub-id-type="pmid">22194782</pub-id></citation></ref>
<ref id="B26"><label>26</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Luo</surname> <given-names>C</given-names></name> <name><surname>Tsementzi</surname> <given-names>D</given-names></name> <name><surname>Kyrpides</surname> <given-names>N</given-names></name> <name><surname>Read</surname> <given-names>T</given-names></name> <name><surname>Konstantinidis</surname> <given-names>KT</given-names></name></person-group>. <article-title>Direct comparisons of Illumina vs. Roche 454 sequencing technologies on the same microbial community DNA sample</article-title>. <source>PLoS One</source> (<year>2012</year>) <volume>7</volume>(<issue>2</issue>):<fpage>e30087</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0030087</pub-id><pub-id pub-id-type="pmid">22347999</pub-id></citation></ref>
<ref id="B27"><label>27</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mosher</surname> <given-names>JJ</given-names></name> <name><surname>Bernberg</surname> <given-names>EL</given-names></name> <name><surname>Shevchenko</surname> <given-names>O</given-names></name> <name><surname>Kan</surname> <given-names>J</given-names></name> <name><surname>Kaplan</surname> <given-names>LA</given-names></name></person-group>. <article-title>Efficacy of a 3rd generation high-throughput sequencing platform for analyses of 16S rRNA genes from environmental samples</article-title>. <source>J Microbiol Methods</source> (<year>2013</year>) <volume>95</volume>(<issue>2</issue>):<fpage>175</fpage>&#x02013;<lpage>81</lpage>.<pub-id pub-id-type="doi">10.1016/j.mimet.2013.08.009</pub-id><pub-id pub-id-type="pmid">23999276</pub-id></citation></ref>
<ref id="B28"><label>28</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nederbragt</surname> <given-names>AJ</given-names></name></person-group>. <article-title>On the middle ground between open source and commercial software &#x02013; the case of the Newbler program</article-title>. <source>Genome Biol</source> (<year>2014</year>) <volume>15</volume>(<issue>4</issue>):<fpage>113</fpage>.<pub-id pub-id-type="doi">10.1186/gb4173</pub-id></citation></ref>
<ref id="B29"><label>29</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kozich</surname> <given-names>JJ</given-names></name> <name><surname>Westcott</surname> <given-names>SL</given-names></name> <name><surname>Baxter</surname> <given-names>NT</given-names></name> <name><surname>Highlander</surname> <given-names>SK</given-names></name> <name><surname>Schloss</surname> <given-names>PD</given-names></name></person-group>. <article-title>Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the MiSeq Illumina sequencing platform</article-title>. <source>Appl Environ Microbiol</source> (<year>2013</year>) <volume>79</volume>(<issue>17</issue>):<fpage>5112</fpage>&#x02013;<lpage>20</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.01043-13</pub-id><pub-id pub-id-type="pmid">23793624</pub-id></citation></ref>
<ref id="B30"><label>30</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schr&#x000F6;der</surname> <given-names>J</given-names></name> <name><surname>Bailey</surname> <given-names>J</given-names></name> <name><surname>Conway</surname> <given-names>T</given-names></name> <name><surname>Zobel</surname> <given-names>J</given-names></name></person-group>. <article-title>Reference-free validation of short read data</article-title>. <source>PLoS One</source> (<year>2010</year>) <volume>5</volume>(<issue>9</issue>):<fpage>e12681</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0012681</pub-id><pub-id pub-id-type="pmid">20877643</pub-id></citation></ref>
<ref id="B31"><label>31</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Quail</surname> <given-names>MA</given-names></name> <name><surname>Smith</surname> <given-names>M</given-names></name> <name><surname>Coupland</surname> <given-names>P</given-names></name> <name><surname>Otto</surname> <given-names>TD</given-names></name> <name><surname>Harris</surname> <given-names>SR</given-names></name> <name><surname>Connor</surname> <given-names>TR</given-names></name> <etal/></person-group> <article-title>A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers</article-title>. <source>BMC Genomics</source> (<year>2012</year>) <volume>13</volume>(<issue>1</issue>):<fpage>341</fpage>.<pub-id pub-id-type="doi">10.1186/1471-2164-13-341</pub-id><pub-id pub-id-type="pmid">22827831</pub-id></citation></ref>
<ref id="B32"><label>32</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Salipante</surname> <given-names>SJ</given-names></name> <name><surname>Kawashima</surname> <given-names>T</given-names></name> <name><surname>Rosenthal</surname> <given-names>C</given-names></name> <name><surname>Hoogestraat</surname> <given-names>DR</given-names></name> <name><surname>Cummings</surname> <given-names>LA</given-names></name> <name><surname>Sengupta</surname> <given-names>DJ</given-names></name> <etal/></person-group> <article-title>Performance comparison of Illumina and Ion Torrent next-generation sequencing platforms for 16S rRNA-based bacterial community profiling</article-title>. <source>Appl Environ Microbiol</source> (<year>2014</year>) <volume>80</volume>(<issue>24</issue>):<fpage>7583</fpage>&#x02013;<lpage>91</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.02206-14</pub-id><pub-id pub-id-type="pmid">25261520</pub-id></citation></ref>
<ref id="B33"><label>33</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Loman</surname> <given-names>NJ</given-names></name> <name><surname>Misra</surname> <given-names>RV</given-names></name> <name><surname>Dallman</surname> <given-names>TJ</given-names></name> <name><surname>Constantinidou</surname> <given-names>C</given-names></name> <name><surname>Gharbia</surname> <given-names>SE</given-names></name> <name><surname>Wain</surname> <given-names>J</given-names></name> <etal/></person-group> <article-title>Performance comparison of benchtop high-throughput sequencing platforms</article-title>. <source>Nat Biotechnol</source> (<year>2012</year>) <volume>30</volume>(<issue>5</issue>):<fpage>434</fpage>&#x02013;<lpage>9</lpage>.<pub-id pub-id-type="doi">10.1038/nbt.2198</pub-id><pub-id pub-id-type="pmid">22522955</pub-id></citation></ref>
<ref id="B34"><label>34</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schloss</surname> <given-names>PD</given-names></name> <name><surname>Jenior</surname> <given-names>ML</given-names></name> <name><surname>Koumpouras</surname> <given-names>CC</given-names></name> <name><surname>Westcott</surname> <given-names>SL</given-names></name> <name><surname>Highlander</surname> <given-names>SK</given-names></name></person-group>. <article-title>Sequencing 16S rRNA gene fragments using the PacBio SMRT DNA sequencing system</article-title>. <source>PeerJ</source> (<year>2016</year>) <volume>4</volume>:<fpage>e1869</fpage>.<pub-id pub-id-type="doi">10.7717/peerj.1869</pub-id><pub-id pub-id-type="pmid">27069806</pub-id></citation></ref>
<ref id="B35"><label>35</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fichot</surname> <given-names>EB</given-names></name> <name><surname>Norman</surname> <given-names>RS</given-names></name></person-group>. <article-title>Microbial phylogenetic profiling with the Pacific Biosciences sequencing platform</article-title>. <source>Microbiome</source> (<year>2013</year>) <volume>1</volume>(<issue>1</issue>):<fpage>10</fpage>.<pub-id pub-id-type="doi">10.1186/2049-2618-1-10</pub-id><pub-id pub-id-type="pmid">24450498</pub-id></citation></ref>
<ref id="B36"><label>36</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>J</given-names></name> <name><surname>He</surname> <given-names>Z</given-names></name> <name><surname>Yang</surname> <given-names>Y</given-names></name> <name><surname>Deng</surname> <given-names>Y</given-names></name> <name><surname>Tringe</surname> <given-names>SG</given-names></name> <name><surname>Alvarez-Cohen</surname> <given-names>L</given-names></name></person-group>. <article-title>High-throughput metagenomic technologies for complex microbial community analysis: open and closed formats</article-title>. <source>MBio</source> (<year>2015</year>) <volume>6</volume>(<issue>1</issue>):<fpage>e02288</fpage>&#x02013;<lpage>14</lpage>.<pub-id pub-id-type="doi">10.1128/mBio.02288-14</pub-id><pub-id pub-id-type="pmid">25626903</pub-id></citation></ref>
<ref id="B37"><label>37</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tremblay</surname> <given-names>J</given-names></name> <name><surname>Singh</surname> <given-names>K</given-names></name> <name><surname>Fern</surname> <given-names>A</given-names></name> <name><surname>Kirton</surname> <given-names>ES</given-names></name> <name><surname>He</surname> <given-names>S</given-names></name> <name><surname>Woyke</surname> <given-names>T</given-names></name> <etal/></person-group> <article-title>Primer and platform effects on 16S rRNA tag sequencing</article-title>. <source>Front Microbiol</source> (<year>2015</year>) <volume>6</volume>:<fpage>771</fpage>.<pub-id pub-id-type="doi">10.3389/fmicb.2015.00771</pub-id><pub-id pub-id-type="pmid">26300854</pub-id></citation></ref>
<ref id="B38"><label>38</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hazen</surname> <given-names>TC</given-names></name> <name><surname>Rocha</surname> <given-names>AM</given-names></name> <name><surname>Techtmann</surname> <given-names>SM</given-names></name></person-group>. <article-title>Advances in monitoring environmental microbes</article-title>. <source>Curr Opin Biotechnol</source> (<year>2013</year>) <volume>24</volume>(<issue>3</issue>):<fpage>526</fpage>&#x02013;<lpage>33</lpage>.<pub-id pub-id-type="doi">10.1016/j.copbio.2012.10.020</pub-id><pub-id pub-id-type="pmid">23183250</pub-id></citation></ref>
<ref id="B39"><label>39</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Preheim</surname> <given-names>SP</given-names></name> <name><surname>Perrotta</surname> <given-names>AR</given-names></name> <name><surname>Friedman</surname> <given-names>J</given-names></name> <name><surname>Smilie</surname> <given-names>C</given-names></name> <name><surname>Brito</surname> <given-names>I</given-names></name> <name><surname>Smith</surname> <given-names>MB</given-names></name> <etal/></person-group> <article-title>Computational methods for high-throughput comparative analyses of natural microbial communities</article-title>. <source>Methods Enzymol</source> (<year>2013</year>) <volume>531</volume>:<fpage>353</fpage>&#x02013;<lpage>70</lpage>.<pub-id pub-id-type="doi">10.1016/B978-0-12-407863-5.00018-6</pub-id><pub-id pub-id-type="pmid">24060130</pub-id></citation></ref>
<ref id="B40"><label>40</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jovel</surname> <given-names>J</given-names></name> <name><surname>Patterson</surname> <given-names>J</given-names></name> <name><surname>Wang</surname> <given-names>W</given-names></name> <name><surname>Hotte</surname> <given-names>N</given-names></name> <name><surname>O&#x02019;Keefe</surname> <given-names>S</given-names></name> <name><surname>Mitchel</surname> <given-names>T</given-names></name> <etal/></person-group> <article-title>Characterization of the gut microbiome using 16S or shotgun metagenomics</article-title>. <source>Front Microbiol</source> (<year>2016</year>) <volume>7</volume>:<fpage>459</fpage>.<pub-id pub-id-type="doi">10.3389/fmicb.2016.00459</pub-id><pub-id pub-id-type="pmid">27148170</pub-id></citation></ref>
<ref id="B41"><label>41</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schloss</surname> <given-names>PD</given-names></name></person-group>. <article-title>Evaluating different approaches that test whether microbial communities have the same structure</article-title>. <source>ISME J</source> (<year>2008</year>) <volume>2</volume>(<issue>3</issue>):<fpage>265</fpage>&#x02013;<lpage>75</lpage>.<pub-id pub-id-type="doi">10.1038/ismej.2008.5</pub-id><pub-id pub-id-type="pmid">18239608</pub-id></citation></ref>
<ref id="B42"><label>42</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Prosser</surname> <given-names>JI</given-names></name></person-group>. <article-title>Replicate or lie</article-title>. <source>Environ Microbiol</source> (<year>2010</year>) <volume>12</volume>(<issue>7</issue>):<fpage>1806</fpage>&#x02013;<lpage>10</lpage>.<pub-id pub-id-type="doi">10.1111/j.1462-2920.2010.02201.x</pub-id></citation></ref>
<ref id="B43"><label>43</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lennon</surname> <given-names>JT</given-names></name></person-group>. <article-title>Replication, lies and lesser-known truths regarding experimental design in environmental microbiology</article-title>. <source>Environ Microbiol</source> (<year>2011</year>) <volume>13</volume>(<issue>6</issue>):<fpage>1383</fpage>&#x02013;<lpage>6</lpage>.<pub-id pub-id-type="doi">10.1111/j.1462-2920.2011.02445.x</pub-id><pub-id pub-id-type="pmid">21392200</pub-id></citation></ref>
<ref id="B44"><label>44</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Knight</surname> <given-names>R</given-names></name> <name><surname>Jansson</surname> <given-names>J</given-names></name> <name><surname>Field</surname> <given-names>D</given-names></name> <name><surname>Fierer</surname> <given-names>N</given-names></name> <name><surname>Desai</surname> <given-names>N</given-names></name> <name><surname>Fuhrman</surname> <given-names>JA</given-names></name> <etal/></person-group> <article-title>Unlocking the potential of metagenomics through replicated experimental design</article-title>. <source>Nat Biotechnol</source> (<year>2012</year>) <volume>30</volume>(<issue>6</issue>):<fpage>513</fpage>&#x02013;<lpage>20</lpage>.<pub-id pub-id-type="doi">10.1038/nbt.2235</pub-id><pub-id pub-id-type="pmid">22678395</pub-id></citation></ref>
<ref id="B45"><label>45</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McMurdie</surname> <given-names>PJ</given-names></name> <name><surname>Holmes</surname> <given-names>S</given-names></name></person-group>. <article-title>Waste not, want not: why rarefying microbiome data is inadmissible</article-title>. <source>PLoS Comput Biol</source> (<year>2014</year>) <volume>10</volume>(<issue>4</issue>):<fpage>e1003531</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pcbi.1003531</pub-id><pub-id pub-id-type="pmid">24699258</pub-id></citation></ref>
<ref id="B46"><label>46</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bahl</surname> <given-names>MI</given-names></name> <name><surname>Bergstr&#x000F6;m</surname> <given-names>A</given-names></name> <name><surname>Licht</surname> <given-names>TR</given-names></name></person-group>. <article-title>Freezing fecal samples prior to DNA extraction affects the firmicutes to bacteroidetes ratio determined by downstream quantitative PCR analysis</article-title>. <source>FEMS Microbiol Lett</source> (<year>2012</year>) <volume>329</volume>(<issue>2</issue>):<fpage>193</fpage>&#x02013;<lpage>7</lpage>.<pub-id pub-id-type="doi">10.1111/j.1574-6968.2012.02523.x</pub-id><pub-id pub-id-type="pmid">22325006</pub-id></citation></ref>
<ref id="B47"><label>47</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Guo</surname> <given-names>F</given-names></name> <name><surname>Zhang</surname> <given-names>T</given-names></name></person-group>. <article-title>Biases during DNA extraction of activated sludge samples revealed by high throughput sequencing</article-title>. <source>Appl Microbiol Biotechnol</source> (<year>2013</year>) <volume>97</volume>(<issue>10</issue>):<fpage>4607</fpage>&#x02013;<lpage>16</lpage>.<pub-id pub-id-type="doi">10.1007/s00253-012-4244-4</pub-id><pub-id pub-id-type="pmid">22760785</pub-id></citation></ref>
<ref id="B48"><label>48</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ariefdjohan</surname> <given-names>MW</given-names></name> <name><surname>Savaiano</surname> <given-names>DA</given-names></name> <name><surname>Nakatsu</surname> <given-names>CH</given-names></name></person-group>. <article-title>Comparison of DNA extraction kits for PCR-DGGE analysis of human intestinal microbial communities from fecal specimens</article-title>. <source>Nutr J</source> (<year>2010</year>) <volume>9</volume>:<fpage>23</fpage>.<pub-id pub-id-type="doi">10.1186/1475-2891-9-23</pub-id><pub-id pub-id-type="pmid">20492702</pub-id></citation></ref>
<ref id="B49"><label>49</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Feinstein</surname> <given-names>LM</given-names></name> <name><surname>Sul</surname> <given-names>WJ</given-names></name> <name><surname>Blackwood</surname> <given-names>CB</given-names></name></person-group>. <article-title>Assessment of bias associated with incomplete extraction of microbial DNA from soil</article-title>. <source>Appl Environ Microbiol</source> (<year>2009</year>) <volume>75</volume>(<issue>16</issue>):<fpage>5428</fpage>&#x02013;<lpage>33</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.00120-09</pub-id><pub-id pub-id-type="pmid">19561189</pub-id></citation></ref>
<ref id="B50"><label>50</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Smith</surname> <given-names>B</given-names></name> <name><surname>Li</surname> <given-names>N</given-names></name> <name><surname>Andersen</surname> <given-names>AS</given-names></name> <name><surname>Slotved</surname> <given-names>HC</given-names></name> <name><surname>Krogfelt</surname> <given-names>KA</given-names></name></person-group>. <article-title>Optimising bacterial DNA extraction from faecal samples: comparison of three methods</article-title>. <source>Open Microbiol J</source> (<year>2011</year>) <volume>5</volume>:<fpage>14</fpage>&#x02013;<lpage>7</lpage>.<pub-id pub-id-type="doi">10.2174/1874285801105010014</pub-id><pub-id pub-id-type="pmid">21643498</pub-id></citation></ref>
<ref id="B51"><label>51</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wesolowska-Andersen</surname> <given-names>A</given-names></name> <name><surname>Bahl</surname> <given-names>MI</given-names></name> <name><surname>Carvalho</surname> <given-names>V</given-names></name> <name><surname>Kristiansen</surname> <given-names>K</given-names></name> <name><surname>Sicheritz-Pont&#x000E9;n</surname> <given-names>T</given-names></name> <name><surname>Gupta</surname> <given-names>R</given-names></name> <etal/></person-group> <article-title>Choice of bacterial DNA extraction method from fecal material influences community structure as evaluated by metagenomic analysis</article-title>. <source>Microbiome</source> (<year>2014</year>) <volume>2</volume>(<issue>1</issue>):<fpage>19</fpage>.<pub-id pub-id-type="doi">10.1186/2049-2618-2-19</pub-id><pub-id pub-id-type="pmid">24949196</pub-id></citation></ref>
<ref id="B52"><label>52</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Henderson</surname> <given-names>G</given-names></name> <name><surname>Cox</surname> <given-names>F</given-names></name> <name><surname>Kittelmann</surname> <given-names>S</given-names></name> <name><surname>Miri</surname> <given-names>VH</given-names></name> <name><surname>Zethof</surname> <given-names>M</given-names></name> <name><surname>Noel</surname> <given-names>SJ</given-names></name> <etal/></person-group> <article-title>Effect of DNA extraction methods and sampling techniques on the apparent structure of cow and sheep rumen microbial communities</article-title>. <source>PLoS One</source> (<year>2013</year>) <volume>8</volume>(<issue>9</issue>):<fpage>e74787</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0074787</pub-id><pub-id pub-id-type="pmid">24040342</pub-id></citation></ref>
<ref id="B53"><label>53</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rand</surname> <given-names>KH</given-names></name> <name><surname>Houck</surname> <given-names>H</given-names></name></person-group>. <article-title>Taq polymerase contains bacterial DNA of unknown origin</article-title>. <source>Mol Cell Probes</source> (<year>1990</year>) <volume>4</volume>(<issue>6</issue>):<fpage>445</fpage>&#x02013;<lpage>50</lpage>.<pub-id pub-id-type="doi">10.1016/0890-8508(90)90003-I</pub-id><pub-id pub-id-type="pmid">2087233</pub-id></citation></ref>
<ref id="B54"><label>54</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shen</surname> <given-names>H</given-names></name> <name><surname>Rogelj</surname> <given-names>S</given-names></name> <name><surname>Kieft</surname> <given-names>TL</given-names></name></person-group>. <article-title>Sensitive, real-time PCR detects low-levels of contamination by <italic>Legionella pneumophila</italic> in commercial reagents</article-title>. <source>Mol Cell Probes</source> (<year>2006</year>) <volume>20</volume>(<issue>3&#x02013;4</issue>):<fpage>147</fpage>&#x02013;<lpage>53</lpage>.<pub-id pub-id-type="doi">10.1016/j.mcp.2005.09.007</pub-id><pub-id pub-id-type="pmid">16632318</pub-id></citation></ref>
<ref id="B55"><label>55</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kulakov</surname> <given-names>LA</given-names></name> <name><surname>McAlister</surname> <given-names>MB</given-names></name> <name><surname>Ogden</surname> <given-names>KL</given-names></name> <name><surname>Larkin</surname> <given-names>MJ</given-names></name> <name><surname>O&#x02019;Hanlon</surname> <given-names>JF</given-names></name></person-group>. <article-title>Analysis of bacteria contaminating ultrapure water in industrial systems</article-title>. <source>Appl Environ Microbiol</source> (<year>2002</year>) <volume>68</volume>(<issue>4</issue>):<fpage>1548</fpage>&#x02013;<lpage>55</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.68.4.1548-1555.2002</pub-id><pub-id pub-id-type="pmid">11916667</pub-id></citation></ref>
<ref id="B56"><label>56</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McAlister</surname> <given-names>MB</given-names></name> <name><surname>Kulakov</surname> <given-names>LA</given-names></name> <name><surname>O&#x02019;Hanlon</surname> <given-names>JF</given-names></name> <name><surname>Larkin</surname> <given-names>MJ</given-names></name> <name><surname>Ogden</surname> <given-names>KL</given-names></name></person-group>. <article-title>Survival and nutritional requirements of three bacteria isolated from ultrapure water</article-title>. <source>J Ind Microbiol Biotechnol</source> (<year>2002</year>) <volume>29</volume>(<issue>2</issue>):<fpage>75</fpage>&#x02013;<lpage>82</lpage>.<pub-id pub-id-type="doi">10.1038/sj.jim.7000273</pub-id><pub-id pub-id-type="pmid">12161774</pub-id></citation></ref>
<ref id="B57"><label>57</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tanner</surname> <given-names>MA</given-names></name> <name><surname>Goebel</surname> <given-names>BM</given-names></name> <name><surname>Dojka</surname> <given-names>MA</given-names></name> <name><surname>Pace</surname> <given-names>NR</given-names></name></person-group>. <article-title>Specific ribosomal DNA sequences from diverse environmental settings correlate with experimental contaminants</article-title>. <source>Appl Envir Microbiol</source> (<year>1998</year>) <volume>64</volume>(<issue>8</issue>):<fpage>3110</fpage>&#x02013;<lpage>3</lpage>.<pub-id pub-id-type="pmid">9687486</pub-id></citation></ref>
<ref id="B58"><label>58</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Corless</surname> <given-names>CE</given-names></name> <name><surname>Guiver</surname> <given-names>M</given-names></name> <name><surname>Borrow</surname> <given-names>R</given-names></name> <name><surname>Edwards-Jones</surname> <given-names>V</given-names></name> <name><surname>Kaczmarski</surname> <given-names>EB</given-names></name> <name><surname>Fox</surname> <given-names>AJ</given-names></name></person-group>. <article-title>Contamination and sensitivity issues with a real-time universal 16S rRNA PCR</article-title>. <source>J Clin Microbiol</source> (<year>2000</year>) <volume>38</volume>(<issue>5</issue>):<fpage>1747</fpage>&#x02013;<lpage>52</lpage>.<pub-id pub-id-type="pmid">10790092</pub-id></citation></ref>
<ref id="B59"><label>59</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Grahn</surname> <given-names>N</given-names></name> <name><surname>Olofsson</surname> <given-names>M</given-names></name> <name><surname>Ellnebo-Svedlund</surname> <given-names>K</given-names></name> <name><surname>Monstein</surname> <given-names>H-J</given-names></name> <name><surname>Jonasson</surname> <given-names>J</given-names></name></person-group>. <article-title>Identification of mixed bacterial DNA contamination in broad-range PCR amplification of 16S rDNA V1 and V3 variable regions by pyrosequencing of cloned amplicons</article-title>. <source>FEMS Microbiol Lett</source> (<year>2003</year>) <volume>219</volume>(<issue>1</issue>):<fpage>87</fpage>&#x02013;<lpage>91</lpage>.<pub-id pub-id-type="doi">10.1016/S0378-1097(02)01190-4</pub-id><pub-id pub-id-type="pmid">12594028</pub-id></citation></ref>
<ref id="B60"><label>60</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Salter</surname> <given-names>SJ</given-names></name> <name><surname>Cox</surname> <given-names>MJ</given-names></name> <name><surname>Turek</surname> <given-names>EM</given-names></name> <name><surname>Calus</surname> <given-names>ST</given-names></name> <name><surname>Cookson</surname> <given-names>WO</given-names></name> <name><surname>Moffatt</surname> <given-names>MF</given-names></name> <etal/></person-group> <article-title>Reagent and laboratory contamination can critically impact sequence-based microbiome analyses</article-title>. <source>BMC Biol</source> (<year>2014</year>) <volume>12</volume>(<issue>1</issue>):<fpage>87</fpage>.<pub-id pub-id-type="doi">10.1186/s12915-014-0087-z</pub-id><pub-id pub-id-type="pmid">25387460</pub-id></citation></ref>
<ref id="B61"><label>61</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jervis-Bardy</surname> <given-names>J</given-names></name> <name><surname>Leong</surname> <given-names>LEX</given-names></name> <name><surname>Marri</surname> <given-names>S</given-names></name> <name><surname>Smith</surname> <given-names>RJ</given-names></name> <name><surname>Choo</surname> <given-names>JM</given-names></name> <name><surname>Smith-Vaughan</surname> <given-names>HC</given-names></name> <etal/></person-group> <article-title>Deriving accurate microbiota profiles from human samples with low bacterial content through post-sequencing processing of Illumina MiSeq data</article-title>. <source>Microbiome</source> (<year>2015</year>) <volume>3</volume>(<issue>1</issue>):<fpage>19</fpage>.<pub-id pub-id-type="doi">10.1186/s40168-015-0083-8</pub-id><pub-id pub-id-type="pmid">25969736</pub-id></citation></ref>
<ref id="B62"><label>62</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pinto</surname> <given-names>AJ</given-names></name> <name><surname>Raskin</surname> <given-names>L</given-names></name></person-group>. <article-title>PCR biases distort bacterial and archaeal community structure in pyrosequencing datasets</article-title>. <source>PLoS One</source> (<year>2012</year>) <volume>7</volume>(<issue>8</issue>):<fpage>e43093</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0043093</pub-id><pub-id pub-id-type="pmid">22905208</pub-id></citation></ref>
<ref id="B63"><label>63</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Polz</surname> <given-names>MF</given-names></name> <name><surname>Cavanaugh</surname> <given-names>CM</given-names></name></person-group>. <article-title>Bias in template-to-product ratios in multitemplate PCR</article-title>. <source>Appl Environ Microbiol</source> (<year>1998</year>) <volume>64</volume>(<issue>10</issue>):<fpage>3724</fpage>&#x02013;<lpage>30</lpage>.<pub-id pub-id-type="pmid">9758791</pub-id></citation></ref>
<ref id="B64"><label>64</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ishii</surname> <given-names>K</given-names></name> <name><surname>Fukui</surname> <given-names>M</given-names></name></person-group>. <article-title>Optimization of annealing temperature to reduce bias caused by a primer mismatch in multitemplate PCR</article-title>. <source>Appl Environ Microbiol</source> (<year>2001</year>) <volume>67</volume>(<issue>8</issue>):<fpage>3753</fpage>&#x02013;<lpage>5</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.67.8.3753-3755.2001</pub-id><pub-id pub-id-type="pmid">11472961</pub-id></citation></ref>
<ref id="B65"><label>65</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kalle</surname> <given-names>E</given-names></name> <name><surname>Gulevich</surname> <given-names>A</given-names></name> <name><surname>Rensing</surname> <given-names>C</given-names></name></person-group>. <article-title>External and semi-internal controls for PCR amplification of homologous sequences in mixed templates</article-title>. <source>J Microbiol Methods</source> (<year>2013</year>) <volume>95</volume>(<issue>2</issue>):<fpage>285</fpage>&#x02013;<lpage>94</lpage>.<pub-id pub-id-type="doi">10.1016/j.mimet.2013.09.014</pub-id><pub-id pub-id-type="pmid">24076226</pub-id></citation></ref>
<ref id="B66"><label>66</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Woese</surname> <given-names>CR</given-names></name> <name><surname>Fox</surname> <given-names>GE</given-names></name></person-group>. <article-title>Phylogenetic structure of the prokaryotic domain: the primary kingdoms</article-title>. <source>Proc Natl Acad Sci U S A.</source> (<year>1977</year>) <volume>74</volume>(<issue>11</issue>):<fpage>5088</fpage>&#x02013;<lpage>90</lpage>.<pub-id pub-id-type="doi">10.1073/pnas.74.11.5088</pub-id><pub-id pub-id-type="pmid">270744</pub-id></citation></ref>
<ref id="B67"><label>67</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Head</surname> <given-names>IM</given-names></name> <name><surname>Saunders</surname> <given-names>JR</given-names></name> <name><surname>Pickup</surname> <given-names>RW</given-names></name></person-group>. <article-title>Microbial evolution, diversity, and ecology: a decade of ribosomal RNA analysis of uncultivated microorganisms</article-title>. <source>Microb Ecol</source> (<year>1998</year>) <volume>35</volume>(<issue>1</issue>):<fpage>1</fpage>&#x02013;<lpage>21</lpage>.<pub-id pub-id-type="doi">10.1007/s002489900056</pub-id></citation></ref>
<ref id="B68"><label>68</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tringe</surname> <given-names>SG</given-names></name> <name><surname>Hugenholtz</surname> <given-names>P</given-names></name></person-group>. <article-title>A renaissance for the pioneering 16S rRNA gene</article-title>. <source>Curr Opin Microbiol</source> (<year>2008</year>) <volume>11</volume>(<issue>5</issue>):<fpage>442</fpage>&#x02013;<lpage>6</lpage>.<pub-id pub-id-type="doi">10.1016/j.mib.2008.09.011</pub-id><pub-id pub-id-type="pmid">18817891</pub-id></citation></ref>
<ref id="B69"><label>69</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Youssef</surname> <given-names>N</given-names></name> <name><surname>Sheik</surname> <given-names>CS</given-names></name> <name><surname>Krumholz</surname> <given-names>LR</given-names></name> <name><surname>Najar</surname> <given-names>FZ</given-names></name> <name><surname>Roe</surname> <given-names>BA</given-names></name> <name><surname>Elshahed</surname> <given-names>MS</given-names></name></person-group>. <article-title>Comparison of species richness estimates obtained using nearly complete fragments and simulated pyrosequencing-generated fragments in 16S rRNA gene-based environmental surveys</article-title>. <source>Appl Environ Microbiol</source> (<year>2009</year>) <volume>75</volume>(<issue>16</issue>):<fpage>5227</fpage>&#x02013;<lpage>36</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.00592-09</pub-id><pub-id pub-id-type="pmid">19561178</pub-id></citation></ref>
<ref id="B70"><label>70</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hamp</surname> <given-names>TJ</given-names></name> <name><surname>Jones</surname> <given-names>WJ</given-names></name> <name><surname>Fodor</surname> <given-names>AA</given-names></name></person-group>. <article-title>Effects of experimental choices and analysis noise on surveys of the &#x0201C;rare biosphere&#x0201D;</article-title>. <source>Appl Environ Microbiol</source> (<year>2009</year>) <volume>75</volume>(<issue>10</issue>):<fpage>3263</fpage>&#x02013;<lpage>70</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.01931-08</pub-id></citation></ref>
<ref id="B71"><label>71</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schloss</surname> <given-names>PD</given-names></name></person-group>. <article-title>The effects of alignment quality, distance calculation method, sequence filtering, and region on the analysis of 16S rRNA gene-based studies</article-title>. <source>PLoS Comput Biol</source> (<year>2010</year>) <volume>6</volume>(<issue>7</issue>):<fpage>e1000844</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pcbi.1000844</pub-id><pub-id pub-id-type="pmid">20628621</pub-id></citation></ref>
<ref id="B72"><label>72</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yarza</surname> <given-names>P</given-names></name> <name><surname>Yilmaz</surname> <given-names>P</given-names></name> <name><surname>Pruesse</surname> <given-names>E</given-names></name> <name><surname>Gl&#x000F6;ckner</surname> <given-names>FO</given-names></name> <name><surname>Ludwig</surname> <given-names>W</given-names></name> <name><surname>Schleifer</surname> <given-names>K-H</given-names></name> <etal/></person-group> <article-title>Uniting the classification of cultured and uncultured bacteria and archaea using 16S rRNA gene sequences</article-title>. <source>Nat Rev Microbiol</source> (<year>2014</year>) <volume>12</volume>(<issue>9</issue>):<fpage>635</fpage>&#x02013;<lpage>45</lpage>.<pub-id pub-id-type="doi">10.1038/nrmicro3330</pub-id><pub-id pub-id-type="pmid">25118885</pub-id></citation></ref>
<ref id="B73"><label>73</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cai</surname> <given-names>L</given-names></name> <name><surname>Ye</surname> <given-names>L</given-names></name> <name><surname>Tong</surname> <given-names>AHY</given-names></name> <name><surname>Lok</surname> <given-names>S</given-names></name> <name><surname>Zhang</surname> <given-names>T</given-names></name></person-group>. <article-title>Biased diversity metrics revealed by bacterial 16S pyrotags derived from different primer sets</article-title>. <source>PLoS One</source> (<year>2013</year>) <volume>8</volume>(<issue>1</issue>):<fpage>e53649</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0053649</pub-id><pub-id pub-id-type="pmid">23341963</pub-id></citation></ref>
<ref id="B74"><label>74</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chakravorty</surname> <given-names>S</given-names></name> <name><surname>Helb</surname> <given-names>D</given-names></name> <name><surname>Burday</surname> <given-names>M</given-names></name> <name><surname>Connell</surname> <given-names>N</given-names></name> <name><surname>Alland</surname> <given-names>D</given-names></name></person-group>. <article-title>A detailed analysis of 16S ribosomal RNA gene segments for the diagnosis of pathogenic bacteria</article-title>. <source>J Microbiol Methods</source> (<year>2007</year>) <volume>69</volume>(<issue>2</issue>):<fpage>330</fpage>&#x02013;<lpage>9</lpage>.<pub-id pub-id-type="doi">10.1016/j.mimet.2007.02.005</pub-id><pub-id pub-id-type="pmid">17391789</pub-id></citation></ref>
<ref id="B75"><label>75</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kumar</surname> <given-names>PS</given-names></name> <name><surname>Brooker</surname> <given-names>MR</given-names></name> <name><surname>Dowd</surname> <given-names>SE</given-names></name> <name><surname>Camerlengo</surname> <given-names>T</given-names></name></person-group>. <article-title>Target region selection is a critical determinant of community fingerprints generated by 16S pyrosequencing</article-title>. <source>PLoS One</source> (<year>2011</year>) <volume>6</volume>(<issue>6</issue>):<fpage>e20956</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0020956</pub-id><pub-id pub-id-type="pmid">21738596</pub-id></citation></ref>
<ref id="B76"><label>76</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>Y</given-names></name> <name><surname>Qian</surname> <given-names>P.-Y</given-names></name></person-group>. <article-title>Conserved regions in 16S ribosome RNA sequences and primer design for studies of environmental microbes</article-title>. In: <source>Encyclopedia of Metagenomics</source>. <publisher-loc>Boston, MA</publisher-loc>: <publisher-name>Springer US</publisher-name> (<year>2015</year>). p. <fpage>106</fpage>&#x02013;<lpage>10</lpage>.<pub-id pub-id-type="doi">10.1007/978-1-4899-7478-5_772</pub-id></citation></ref>
<ref id="B77"><label>77</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Klindworth</surname> <given-names>A</given-names></name> <name><surname>Pruesse</surname> <given-names>E</given-names></name> <name><surname>Schweer</surname> <given-names>T</given-names></name> <name><surname>Peplies</surname> <given-names>J</given-names></name> <name><surname>Quast</surname> <given-names>C</given-names></name> <name><surname>Horn</surname> <given-names>M</given-names></name> <etal/></person-group> <article-title>Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studies</article-title>. <source>Nucleic Acids Res</source> (<year>2013</year>) <volume>41</volume>(<issue>1</issue>):<fpage>e1</fpage>.<pub-id pub-id-type="doi">10.1093/nar/gks808</pub-id><pub-id pub-id-type="pmid">22933715</pub-id></citation></ref>
<ref id="B78"><label>78</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mao</surname> <given-names>D-P</given-names></name> <name><surname>Zhou</surname> <given-names>Q</given-names></name> <name><surname>Chen</surname> <given-names>C-Y</given-names></name> <name><surname>Quan</surname> <given-names>Z-X</given-names></name></person-group>. <article-title>Coverage evaluation of universal bacterial primers using the metagenomic datasets</article-title>. <source>BMC Microbiol</source> (<year>2012</year>) <volume>12</volume>(<issue>1</issue>):<fpage>66</fpage>.<pub-id pub-id-type="doi">10.1186/1471-2180-12-66</pub-id><pub-id pub-id-type="pmid">22554309</pub-id></citation></ref>
<ref id="B79"><label>79</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>J-H</given-names></name> <name><surname>Hong</surname> <given-names>P-Y</given-names></name> <name><surname>Liu</surname> <given-names>W-T</given-names></name></person-group>. <article-title>Quantitative effects of position and type of single mismatch on single base primer extension</article-title>. <source>J Microbiol Methods</source> (<year>2009</year>) <volume>77</volume>(<issue>3</issue>):<fpage>267</fpage>&#x02013;<lpage>75</lpage>.<pub-id pub-id-type="doi">10.1016/j.mimet.2009.03.001</pub-id><pub-id pub-id-type="pmid">19285527</pub-id></citation></ref>
<ref id="B80"><label>80</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Caporaso</surname> <given-names>JG</given-names></name> <name><surname>Lauber</surname> <given-names>CL</given-names></name> <name><surname>Walters</surname> <given-names>WA</given-names></name> <name><surname>Berg-Lyons</surname> <given-names>D</given-names></name> <name><surname>Lozupone</surname> <given-names>CA</given-names></name> <name><surname>Turnbaugh</surname> <given-names>PJ</given-names></name> <etal/></person-group> <article-title>Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample</article-title>. <source>Proc Natl Acad Sci U S A.</source> (<year>2011</year>) <volume>108</volume>(<issue>Suppl</issue>):<fpage>4516</fpage>&#x02013;<lpage>22</lpage>.<pub-id pub-id-type="doi">10.1073/pnas.1000080107</pub-id><pub-id pub-id-type="pmid">20534432</pub-id></citation></ref>
<ref id="B81"><label>81</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yatsunenko</surname> <given-names>T</given-names></name> <name><surname>Rey</surname> <given-names>FE</given-names></name> <name><surname>Manary</surname> <given-names>MJ</given-names></name> <name><surname>Trehan</surname> <given-names>I</given-names></name> <name><surname>Dominguez-Bello</surname> <given-names>MG</given-names></name> <name><surname>Contreras</surname> <given-names>M</given-names></name> <etal/></person-group> <article-title>Human gut microbiome viewed across age and geography</article-title>. <source>Nature</source> (<year>2012</year>) <volume>486</volume>(<issue>7402</issue>):<fpage>222</fpage>&#x02013;<lpage>7</lpage>.<pub-id pub-id-type="doi">10.1038/nature11053</pub-id><pub-id pub-id-type="pmid">22699611</pub-id></citation></ref>
<ref id="B82"><label>82</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fierer</surname> <given-names>N</given-names></name> <name><surname>Leff</surname> <given-names>JW</given-names></name> <name><surname>Adams</surname> <given-names>BJ</given-names></name> <name><surname>Nielsen</surname> <given-names>UN</given-names></name> <name><surname>Bates</surname> <given-names>ST</given-names></name> <name><surname>Lauber</surname> <given-names>CL</given-names></name> <etal/></person-group> <article-title>Cross-biome metagenomic analyses of soil microbial communities and their functional attributes</article-title>. <source>Proc Natl Acad Sci U S A.</source> (<year>2012</year>) <volume>109</volume>(<issue>52</issue>):<fpage>21390</fpage>&#x02013;<lpage>5</lpage>.<pub-id pub-id-type="doi">10.1073/pnas.1215210110</pub-id><pub-id pub-id-type="pmid">23236140</pub-id></citation></ref>
<ref id="B83"><label>83</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Metzker</surname> <given-names>ML</given-names></name></person-group>. <article-title>Sequencing technologies &#x02013; the next generation</article-title>. <source>Nat Rev Genet</source> (<year>2010</year>) <volume>11</volume>(<issue>1</issue>):<fpage>31</fpage>&#x02013;<lpage>46</lpage>.<pub-id pub-id-type="doi">10.1038/nrg2626</pub-id></citation></ref>
<ref id="B84"><label>84</label><citation citation-type="journal"><collab>The Human Microbiome Project Consortium</collab>. <article-title>A framework for human microbiome research</article-title>. <source>Nature</source> (<year>2012</year>) <volume>486</volume>(<issue>7402</issue>):<fpage>215</fpage>&#x02013;<lpage>21</lpage>.<pub-id pub-id-type="doi">10.1038/nature11209</pub-id><pub-id pub-id-type="pmid">22699610</pub-id></citation></ref>
<ref id="B85"><label>85</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Turnbaugh</surname> <given-names>PJ</given-names></name> <name><surname>Hamady</surname> <given-names>M</given-names></name> <name><surname>Yatsunenko</surname> <given-names>T</given-names></name> <name><surname>Cantarel</surname> <given-names>BL</given-names></name> <name><surname>Duncan</surname> <given-names>A</given-names></name> <name><surname>Ley</surname> <given-names>RE</given-names></name> <etal/></person-group> <article-title>A core gut microbiome in obese and lean twins</article-title>. <source>Nature</source> (<year>2009</year>) <volume>457</volume>(<issue>7228</issue>):<fpage>480</fpage>&#x02013;<lpage>4</lpage>.<pub-id pub-id-type="doi">10.1038/nature07540</pub-id><pub-id pub-id-type="pmid">19043404</pub-id></citation></ref>
<ref id="B86"><label>86</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Candon</surname> <given-names>S</given-names></name> <name><surname>Perez-Arroyo</surname> <given-names>A</given-names></name> <name><surname>Marquet</surname> <given-names>C</given-names></name> <name><surname>Valette</surname> <given-names>F</given-names></name> <name><surname>Foray</surname> <given-names>A-P</given-names></name> <name><surname>Pelletier</surname> <given-names>B</given-names></name> <etal/></person-group> <article-title>Antibiotics in early life alter the gut microbiome and increase disease incidence in a spontaneous mouse model of autoimmune insulin-dependent diabetes</article-title>. <source>PLoS One</source> (<year>2015</year>) <volume>10</volume>(<issue>5</issue>):<fpage>e0125448</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0125448</pub-id></citation></ref>
<ref id="B87"><label>87</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cobaugh</surname> <given-names>KL</given-names></name> <name><surname>Schaeffer</surname> <given-names>SM</given-names></name> <name><surname>DeBruyn</surname> <given-names>JM</given-names></name></person-group>. <article-title>Functional and structural succession of soil microbial communities below decomposing human cadavers</article-title>. <source>PLoS One</source> (<year>2015</year>) <volume>10</volume>(<issue>6</issue>):<fpage>e0130201</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0130201</pub-id><pub-id pub-id-type="pmid">26067226</pub-id></citation></ref>
<ref id="B88"><label>88</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>Q</given-names></name> <name><surname>Garrity</surname> <given-names>GM</given-names></name> <name><surname>Tiedje</surname> <given-names>JM</given-names></name> <name><surname>Cole</surname> <given-names>JR</given-names></name></person-group>. <article-title>Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy</article-title>. <source>Appl Environ Microbiol</source> (<year>2007</year>) <volume>73</volume>(<issue>16</issue>):<fpage>5261</fpage>&#x02013;<lpage>7</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.00062-07</pub-id><pub-id pub-id-type="pmid">17586664</pub-id></citation></ref>
<ref id="B89"><label>89</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>Z</given-names></name> <name><surname>DeSantis</surname> <given-names>TZ</given-names></name> <name><surname>Andersen</surname> <given-names>GL</given-names></name> <name><surname>Knight</surname> <given-names>R</given-names></name></person-group>. <article-title>Accurate taxonomy assignments from 16S rRNA sequences produced by highly parallel pyrosequencers</article-title>. <source>Nucleic Acids Res</source> (<year>2008</year>) <volume>36</volume>(<issue>18</issue>):<fpage>e120</fpage>.<pub-id pub-id-type="doi">10.1093/nar/gkn491</pub-id><pub-id pub-id-type="pmid">18723574</pub-id></citation></ref>
<ref id="B90"><label>90</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lundin</surname> <given-names>D</given-names></name> <name><surname>Severin</surname> <given-names>I</given-names></name> <name><surname>Logue</surname> <given-names>JB</given-names></name> <name><surname>Ostman</surname> <given-names>O</given-names></name> <name><surname>Andersson</surname> <given-names>AF</given-names></name> <name><surname>Lindstr&#x000F6;m</surname> <given-names>ES</given-names></name></person-group>. <article-title>Which sequencing depth is sufficient to describe patterns in bacterial &#x003B1;- and &#x003B2;-diversity?</article-title> <source>Environ Microbiol Rep</source> (<year>2012</year>) <volume>4</volume>(<issue>3</issue>):<fpage>367</fpage>&#x02013;<lpage>72</lpage>.<pub-id pub-id-type="doi">10.1111/j.1758-2229.2012.00345.x</pub-id><pub-id pub-id-type="pmid">23760801</pub-id></citation></ref>
<ref id="B91"><label>91</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pedr&#x000F3;s-Ali&#x000F3;</surname> <given-names>C</given-names></name></person-group>. <article-title>The rare bacterial biosphere</article-title>. <source>Ann Rev Mar Sci</source> (<year>2012</year>) <volume>4</volume>:<fpage>449</fpage>&#x02013;<lpage>66</lpage>.<pub-id pub-id-type="doi">10.1146/annurev-marine-120710-100948</pub-id><pub-id pub-id-type="pmid">22457983</pub-id></citation></ref>
<ref id="B92"><label>92</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kunin</surname> <given-names>V</given-names></name> <name><surname>Engelbrektson</surname> <given-names>A</given-names></name> <name><surname>Ochman</surname> <given-names>H</given-names></name> <name><surname>Hugenholtz</surname> <given-names>P</given-names></name></person-group>. <article-title>Wrinkles in the rare biosphere: pyrosequencing errors can lead to artificial inflation of diversity estimates</article-title>. <source>Environ Microbiol</source> (<year>2010</year>) <volume>12</volume>(<issue>1</issue>):<fpage>118</fpage>&#x02013;<lpage>23</lpage>.<pub-id pub-id-type="doi">10.1111/j.1462-2920.2009.02051.x</pub-id><pub-id pub-id-type="pmid">19725865</pub-id></citation></ref>
<ref id="B93"><label>93</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huse</surname> <given-names>SM</given-names></name> <name><surname>Huber</surname> <given-names>JA</given-names></name> <name><surname>Morrison</surname> <given-names>HG</given-names></name> <name><surname>Sogin</surname> <given-names>ML</given-names></name> <name><surname>Welch</surname> <given-names>DM</given-names></name></person-group>. <article-title>Accuracy and quality of massively parallel DNA pyrosequencing</article-title>. <source>Genome Biol</source> (<year>2007</year>) <volume>8</volume>(<issue>7</issue>):<fpage>R143</fpage>.<pub-id pub-id-type="doi">10.1186/gb-2007-8-7-r143</pub-id><pub-id pub-id-type="pmid">17659080</pub-id></citation></ref>
<ref id="B94"><label>94</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Caporaso</surname> <given-names>JG</given-names></name> <name><surname>Lauber</surname> <given-names>CL</given-names></name> <name><surname>Walters</surname> <given-names>WA</given-names></name> <name><surname>Berg-Lyons</surname> <given-names>D</given-names></name> <name><surname>Huntley</surname> <given-names>J</given-names></name> <name><surname>Fierer</surname> <given-names>N</given-names></name> <etal/></person-group> <article-title>Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms</article-title>. <source>ISME J</source> (<year>2012</year>) <volume>6</volume>(<issue>8</issue>):<fpage>1621</fpage>&#x02013;<lpage>4</lpage>.<pub-id pub-id-type="doi">10.1038/ismej.2012.8</pub-id><pub-id pub-id-type="pmid">22402401</pub-id></citation></ref>
<ref id="B95"><label>95</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huse</surname> <given-names>SM</given-names></name> <name><surname>Welch</surname> <given-names>DM</given-names></name> <name><surname>Morrison</surname> <given-names>HG</given-names></name> <name><surname>Sogin</surname> <given-names>ML</given-names></name></person-group>. <article-title>Ironing out the wrinkles in the rare biosphere through improved OTU clustering</article-title>. <source>Environ Microbiol</source> (<year>2010</year>) <volume>12</volume>(<issue>7</issue>):<fpage>1889</fpage>&#x02013;<lpage>98</lpage>.<pub-id pub-id-type="doi">10.1111/j.1462-2920.2010.02193.x</pub-id><pub-id pub-id-type="pmid">20236171</pub-id></citation></ref>
<ref id="B96"><label>96</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hanshew</surname> <given-names>AS</given-names></name> <name><surname>Mason</surname> <given-names>CJ</given-names></name> <name><surname>Raffa</surname> <given-names>KF</given-names></name> <name><surname>Currie</surname> <given-names>CR</given-names></name></person-group>. <article-title>Minimization of chloroplast contamination in 16S rRNA gene pyrosequencing of insect herbivore bacterial communities</article-title>. <source>J Microbiol Methods</source> (<year>2013</year>) <volume>95</volume>(<issue>2</issue>):<fpage>149</fpage>&#x02013;<lpage>55</lpage>.<pub-id pub-id-type="doi">10.1016/j.mimet.2013.08.007</pub-id><pub-id pub-id-type="pmid">23968645</pub-id></citation></ref>
<ref id="B97"><label>97</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>P&#x000E4;&#x000E4;bo</surname> <given-names>S</given-names></name> <name><surname>Irwin</surname> <given-names>DM</given-names></name> <name><surname>Wilson</surname> <given-names>AC</given-names></name></person-group>. <article-title>DNA damage promotes jumping between templates during enzymatic amplification</article-title>. <source>J Biol Chem</source> (<year>1990</year>) <volume>265</volume>(<issue>8</issue>):<fpage>4718</fpage>&#x02013;<lpage>21</lpage>.<pub-id pub-id-type="pmid">2307682</pub-id></citation></ref>
<ref id="B98"><label>98</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>G</given-names></name> <name><surname>Wang</surname> <given-names>Y</given-names></name></person-group>. <article-title>Frequency of formation of chimeric molecules as a consequence of PCR coamplification of 16S rRNA genes from mixed bacterial genomes</article-title>. <source>Appl Envir Microbiol</source> (<year>1997</year>) <volume>63</volume>(<issue>12</issue>):<fpage>4645</fpage>&#x02013;<lpage>50</lpage>.<pub-id pub-id-type="pmid">9406382</pub-id></citation></ref>
<ref id="B99"><label>99</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>GC</given-names></name> <name><surname>Wang</surname> <given-names>Y</given-names></name></person-group>. <article-title>The frequency of chimeric molecules as a consequence of PCR co-amplification of 16S rRNA genes from different bacterial species</article-title>. <source>Microbiology</source> (<year>1996</year>) <volume>142</volume>(<issue>Pt 5</issue>):<fpage>1107</fpage>&#x02013;<lpage>14</lpage>.<pub-id pub-id-type="doi">10.1099/13500872-142-5-1107</pub-id><pub-id pub-id-type="pmid">8704952</pub-id></citation></ref>
<ref id="B100"><label>100</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huber</surname> <given-names>T</given-names></name> <name><surname>Faulkner</surname> <given-names>G</given-names></name> <name><surname>Hugenholtz</surname> <given-names>P</given-names></name></person-group>. <article-title>Bellerophon: a program to detect chimeric sequences in multiple sequence alignments</article-title>. <source>Bioinformatics</source> (<year>2004</year>) <volume>20</volume>(<issue>14</issue>):<fpage>2317</fpage>&#x02013;<lpage>9</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/bth226</pub-id><pub-id pub-id-type="pmid">15073015</pub-id></citation></ref>
<ref id="B101"><label>101</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ashelford</surname> <given-names>KE</given-names></name> <name><surname>Chuzhanova</surname> <given-names>NA</given-names></name> <name><surname>Fry</surname> <given-names>JC</given-names></name> <name><surname>Jones</surname> <given-names>AJ</given-names></name> <name><surname>Weightman</surname> <given-names>AJ</given-names></name></person-group>. <article-title>At least 1 in 20 16S rRNA sequence records currently held in public repositories is estimated to contain substantial anomalies</article-title>. <source>Appl Environ Microbiol</source> (<year>2005</year>) <volume>71</volume>(<issue>12</issue>):<fpage>7724</fpage>&#x02013;<lpage>36</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.71.12.7724-7736.2005</pub-id><pub-id pub-id-type="pmid">16332745</pub-id></citation></ref>
<ref id="B102"><label>102</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gonzalez</surname> <given-names>JM</given-names></name> <name><surname>Zimmermann</surname> <given-names>J</given-names></name> <name><surname>Saiz-Jimenez</surname> <given-names>C</given-names></name></person-group>. <article-title>Evaluating putative chimeric sequences from PCR-amplified products</article-title>. <source>Bioinformatics</source> (<year>2005</year>) <volume>21</volume>(<issue>3</issue>):<fpage>333</fpage>&#x02013;<lpage>7</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/bti008</pub-id><pub-id pub-id-type="pmid">15347575</pub-id></citation></ref>
<ref id="B103"><label>103</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ashelford</surname> <given-names>KE</given-names></name> <name><surname>Chuzhanova</surname> <given-names>NA</given-names></name> <name><surname>Fry</surname> <given-names>JC</given-names></name> <name><surname>Jones</surname> <given-names>AJ</given-names></name> <name><surname>Weightman</surname> <given-names>AJ</given-names></name></person-group>. <article-title>New screening software shows that most recent large 16S rRNA gene clone libraries contain chimeras</article-title>. <source>Appl Environ Microbiol</source> (<year>2006</year>) <volume>72</volume>(<issue>9</issue>):<fpage>5734</fpage>&#x02013;<lpage>41</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.00556-06</pub-id><pub-id pub-id-type="pmid">16957188</pub-id></citation></ref>
<ref id="B104"><label>104</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gontcharova</surname> <given-names>V</given-names></name> <name><surname>Youn</surname> <given-names>E</given-names></name> <name><surname>Wolcott</surname> <given-names>RD</given-names></name> <name><surname>Hollister</surname> <given-names>EB</given-names></name> <name><surname>Gentry</surname> <given-names>TJ</given-names></name> <name><surname>Dowd</surname> <given-names>SE</given-names></name></person-group>. <article-title>Black Box Chimera Check (B2C2): a windows-based software for batch depletion of chimeras from bacterial 16S rRNA gene datasets</article-title>. <source>Open Microbiol J</source> (<year>2010</year>) <volume>4</volume>:<fpage>47</fpage>&#x02013;<lpage>52</lpage>.<pub-id pub-id-type="doi">10.2174/1874285801004010047</pub-id><pub-id pub-id-type="pmid">21339894</pub-id></citation></ref>
<ref id="B105"><label>105</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Haas</surname> <given-names>BJ</given-names></name> <name><surname>Gevers</surname> <given-names>D</given-names></name> <name><surname>Earl</surname> <given-names>AM</given-names></name> <name><surname>Feldgarden</surname> <given-names>M</given-names></name> <name><surname>Ward</surname> <given-names>DV</given-names></name> <name><surname>Giannoukos</surname> <given-names>G</given-names></name> <etal/></person-group> <article-title>Chimeric 16S rRNA sequence formation and detection in Sanger and 454-pyrosequenced PCR amplicons</article-title>. <source>Genome Res</source> (<year>2011</year>) <volume>21</volume>(<issue>3</issue>):<fpage>494</fpage>&#x02013;<lpage>504</lpage>.<pub-id pub-id-type="doi">10.1101/gr.112730.110</pub-id><pub-id pub-id-type="pmid">21212162</pub-id></citation></ref>
<ref id="B106"><label>106</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Quince</surname> <given-names>C</given-names></name> <name><surname>Lanzen</surname> <given-names>A</given-names></name> <name><surname>Davenport</surname> <given-names>RJ</given-names></name> <name><surname>Turnbaugh</surname> <given-names>PJ</given-names></name></person-group>. <article-title>Removing noise from pyrosequenced amplicons</article-title>. <source>BMC Bioinformatics</source> (<year>2011</year>) <volume>12</volume>(<issue>1</issue>):<fpage>38</fpage>.<pub-id pub-id-type="doi">10.1186/1471-2105-12-38</pub-id><pub-id pub-id-type="pmid">21276213</pub-id></citation></ref>
<ref id="B107"><label>107</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Edgar</surname> <given-names>RC</given-names></name> <name><surname>Haas</surname> <given-names>BJ</given-names></name> <name><surname>Clemente</surname> <given-names>JC</given-names></name> <name><surname>Quince</surname> <given-names>C</given-names></name> <name><surname>Knight</surname> <given-names>R</given-names></name></person-group>. <article-title>UCHIME improves sensitivity and speed of chimera detection</article-title>. <source>Bioinformatics</source> (<year>2011</year>) <volume>27</volume>(<issue>16</issue>):<fpage>2194</fpage>&#x02013;<lpage>200</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btr381</pub-id></citation></ref>
<ref id="B108"><label>108</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wright</surname> <given-names>ES</given-names></name> <name><surname>Yilmaz</surname> <given-names>LS</given-names></name> <name><surname>Noguera</surname> <given-names>DR</given-names></name></person-group>. <article-title>DECIPHER, a search-based approach to chimera identification for 16S rRNA sequences</article-title>. <source>Appl Environ Microbiol</source> (<year>2012</year>) <volume>78</volume>(<issue>3</issue>):<fpage>717</fpage>&#x02013;<lpage>25</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.06516-11</pub-id><pub-id pub-id-type="pmid">22101057</pub-id></citation></ref>
<ref id="B109"><label>109</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pible</surname> <given-names>O</given-names></name> <name><surname>Armengaud</surname> <given-names>J</given-names></name></person-group>. <article-title>Improving the quality of genome, protein sequence, and taxonomy databases: a prerequisite for microbiome meta-omics 2.0</article-title>. <source>Proteomics</source> (<year>2015</year>) <volume>15</volume>(<issue>20</issue>):<fpage>3418</fpage>&#x02013;<lpage>23</lpage>.<pub-id pub-id-type="doi">10.1002/pmic.201500104</pub-id><pub-id pub-id-type="pmid">26038180</pub-id></citation></ref>
<ref id="B110"><label>110</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chang</surname> <given-names>Q</given-names></name> <name><surname>Luan</surname> <given-names>Y</given-names></name> <name><surname>Chen</surname> <given-names>T</given-names></name> <name><surname>Fuhrman</surname> <given-names>JA</given-names></name> <name><surname>Sun</surname> <given-names>F</given-names></name></person-group>. <article-title>Computational methods for the analysis of tag sequences in metagenomics studies</article-title>. <source>Front Biosci (Schol Ed)</source> (<year>2012</year>) <volume>4</volume>:<fpage>1333</fpage>&#x02013;<lpage>43</lpage>.<pub-id pub-id-type="pmid">22652875</pub-id></citation></ref>
<ref id="B111"><label>111</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>DeSantis</surname> <given-names>TZ</given-names></name> <name><surname>Hugenholtz</surname> <given-names>P</given-names></name> <name><surname>Larsen</surname> <given-names>N</given-names></name> <name><surname>Rojas</surname> <given-names>M</given-names></name> <name><surname>Brodie</surname> <given-names>EL</given-names></name> <name><surname>Keller</surname> <given-names>K</given-names></name> <etal/></person-group> <article-title>Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB</article-title>. <source>Appl Environ Microbiol</source> (<year>2006</year>) <volume>72</volume>(<issue>7</issue>):<fpage>5069</fpage>&#x02013;<lpage>72</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.03006-05</pub-id><pub-id pub-id-type="pmid">16820507</pub-id></citation></ref>
<ref id="B112"><label>112</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Quast</surname> <given-names>C</given-names></name> <name><surname>Pruesse</surname> <given-names>E</given-names></name> <name><surname>Yilmaz</surname> <given-names>P</given-names></name> <name><surname>Gerken</surname> <given-names>J</given-names></name> <name><surname>Schweer</surname> <given-names>T</given-names></name> <name><surname>Yarza</surname> <given-names>P</given-names></name> <etal/></person-group> <article-title>The SILVA ribosomal RNA gene database project: improved data processing and web-based tools</article-title>. <source>Nucleic Acids Res</source> (<year>2013</year>) <volume>41</volume>(<issue>Database issue</issue>):<fpage>D590</fpage>&#x02013;<lpage>6</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gks1219</pub-id><pub-id pub-id-type="pmid">23193283</pub-id></citation></ref>
<ref id="B113"><label>113</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cole</surname> <given-names>JR</given-names></name> <name><surname>Wang</surname> <given-names>Q</given-names></name> <name><surname>Fish</surname> <given-names>JA</given-names></name> <name><surname>Chai</surname> <given-names>B</given-names></name> <name><surname>McGarrell</surname> <given-names>DM</given-names></name> <name><surname>Sun</surname> <given-names>Y</given-names></name> <etal/></person-group> <article-title>Ribosomal database project: data and tools for high throughput rRNA analysis</article-title>. <source>Nucleic Acids Res</source> (<year>2014</year>) <volume>42</volume>(<issue>Database issue</issue>):<fpage>D633</fpage>&#x02013;<lpage>42</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkt1244</pub-id><pub-id pub-id-type="pmid">24288368</pub-id></citation></ref>
<ref id="B114"><label>114</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vinje</surname> <given-names>H</given-names></name> <name><surname>Liland</surname> <given-names>KH</given-names></name> <name><surname>Alm&#x000F8;y</surname> <given-names>T</given-names></name> <name><surname>Snipen</surname> <given-names>L</given-names></name></person-group>. <article-title>Comparing K-mer based methods for improved classification of 16S sequences</article-title>. <source>BMC Bioinformatics</source> (<year>2015</year>) <volume>16</volume>(<issue>1</issue>):<fpage>205</fpage>.<pub-id pub-id-type="doi">10.1186/s12859-015-0647-4</pub-id><pub-id pub-id-type="pmid">26130333</pub-id></citation></ref>
<ref id="B115"><label>115</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sun</surname> <given-names>Y</given-names></name> <name><surname>Cai</surname> <given-names>Y</given-names></name> <name><surname>Huse</surname> <given-names>SM</given-names></name> <name><surname>Knight</surname> <given-names>R</given-names></name> <name><surname>Farmerie</surname> <given-names>WG</given-names></name> <name><surname>Wang</surname> <given-names>X</given-names></name> <etal/></person-group> <article-title>A large-scale benchmark study of existing algorithms for taxonomy-independent microbial community analysis</article-title>. <source>Brief Bioinform</source> (<year>2012</year>) <volume>13</volume>(<issue>1</issue>):<fpage>107</fpage>&#x02013;<lpage>21</lpage>.<pub-id pub-id-type="doi">10.1093/bib/bbr009</pub-id><pub-id pub-id-type="pmid">21525143</pub-id></citation></ref>
<ref id="B116"><label>116</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schloss</surname> <given-names>PD</given-names></name> <name><surname>Westcott</surname> <given-names>SL</given-names></name></person-group>. <article-title>Assessing and improving methods used in operational taxonomic unit-based approaches for 16S rRNA gene sequence analysis</article-title>. <source>Appl Environ Microbiol</source> (<year>2011</year>) <volume>77</volume>(<issue>10</issue>):<fpage>3219</fpage>&#x02013;<lpage>26</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.02810-10</pub-id><pub-id pub-id-type="pmid">21421784</pub-id></citation></ref>
<ref id="B117"><label>117</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>W</given-names></name> <name><surname>Zhang</surname> <given-names>CK</given-names></name> <name><surname>Cheng</surname> <given-names>Y</given-names></name> <name><surname>Zhang</surname> <given-names>S</given-names></name> <name><surname>Zhao</surname> <given-names>H</given-names></name></person-group>. <article-title>A comparison of methods for clustering 16S rRNA sequences into OTUs</article-title>. <source>PLoS One</source> (<year>2013</year>) <volume>8</volume>(<issue>8</issue>):<fpage>e70837</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0070837</pub-id><pub-id pub-id-type="pmid">23967117</pub-id></citation></ref>
<ref id="B118"><label>118</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schloss</surname> <given-names>PD</given-names></name> <name><surname>Westcott</surname> <given-names>SL</given-names></name> <name><surname>Ryabin</surname> <given-names>T</given-names></name> <name><surname>Hall</surname> <given-names>JR</given-names></name> <name><surname>Hartmann</surname> <given-names>M</given-names></name> <name><surname>Hollister</surname> <given-names>EB</given-names></name> <etal/></person-group> <article-title>Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities</article-title>. <source>Appl Environ Microbiol</source> (<year>2009</year>) <volume>75</volume>(<issue>23</issue>):<fpage>7537</fpage>&#x02013;<lpage>41</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.01541-09</pub-id><pub-id pub-id-type="pmid">19801464</pub-id></citation></ref>
<ref id="B119"><label>119</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sun</surname> <given-names>Y</given-names></name> <name><surname>Cai</surname> <given-names>Y</given-names></name> <name><surname>Liu</surname> <given-names>L</given-names></name> <name><surname>Yu</surname> <given-names>F</given-names></name> <name><surname>Farrell</surname> <given-names>ML</given-names></name> <name><surname>McKendree</surname> <given-names>W</given-names></name> <etal/></person-group> <article-title>ESPRIT: estimating species richness using large collections of 16S rRNA pyrosequences</article-title>. <source>Nucleic Acids Res</source> (<year>2009</year>) <volume>37</volume>(<issue>10</issue>):<fpage>e76</fpage>.<pub-id pub-id-type="doi">10.1093/nar/gkp285</pub-id><pub-id pub-id-type="pmid">19417062</pub-id></citation></ref>
<ref id="B120"><label>120</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>X</given-names></name> <name><surname>Cai</surname> <given-names>Y</given-names></name> <name><surname>Sun</surname> <given-names>Y</given-names></name> <name><surname>Knight</surname> <given-names>R</given-names></name> <name><surname>Mai</surname> <given-names>V</given-names></name></person-group>. <article-title>Secondary structure information does not improve OTU assignment for partial 16s rRNA sequences</article-title>. <source>ISME J</source> (<year>2012</year>) <volume>6</volume>(<issue>7</issue>):<fpage>1277</fpage>&#x02013;<lpage>80</lpage>.<pub-id pub-id-type="doi">10.1038/ismej.2011.187</pub-id></citation></ref>
<ref id="B121"><label>121</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schloss</surname> <given-names>PD</given-names></name></person-group>. <article-title>A high-throughput DNA sequence aligner for microbial ecology studies</article-title>. <source>PLoS One</source> (<year>2009</year>) <volume>4</volume>(<issue>12</issue>):<fpage>e8230</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0008230</pub-id><pub-id pub-id-type="pmid">20011594</pub-id></citation></ref>
<ref id="B122"><label>122</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Keller</surname> <given-names>A</given-names></name> <name><surname>F&#x000F6;rster</surname> <given-names>F</given-names></name> <name><surname>M&#x000FC;ller</surname> <given-names>T</given-names></name> <name><surname>Dandekar</surname> <given-names>T</given-names></name> <name><surname>Schultz</surname> <given-names>J</given-names></name> <name><surname>Wolf</surname> <given-names>M</given-names></name></person-group>. <article-title>Including RNA secondary structures improves accuracy and robustness in reconstruction of phylogenetic trees</article-title>. <source>Biol Direct</source> (<year>2010</year>) <volume>5</volume>:<fpage>4</fpage>.<pub-id pub-id-type="doi">10.1186/1745-6150-5-4</pub-id><pub-id pub-id-type="pmid">20078867</pub-id></citation></ref>
<ref id="B123"><label>123</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schloss</surname> <given-names>PD</given-names></name></person-group>. <article-title>Secondary structure improves OTU assignments of 16S rRNA gene sequences</article-title>. <source>ISME J</source> (<year>2013</year>) <volume>7</volume>(<issue>3</issue>):<fpage>457</fpage>&#x02013;<lpage>60</lpage>.<pub-id pub-id-type="doi">10.1038/ismej.2012.102</pub-id></citation></ref>
<ref id="B124"><label>124</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>W</given-names></name> <name><surname>Godzik</surname> <given-names>A</given-names></name></person-group>. <article-title>Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences</article-title>. <source>Bioinformatics</source> (<year>2006</year>) <volume>22</volume>(<issue>13</issue>):<fpage>1658</fpage>&#x02013;<lpage>9</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btl158</pub-id><pub-id pub-id-type="pmid">16731699</pub-id></citation></ref>
<ref id="B125"><label>125</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Edgar</surname> <given-names>RC</given-names></name></person-group>. <article-title>Search and clustering orders of magnitude faster than BLAST</article-title>. <source>Bioinformatics</source> (<year>2010</year>) <volume>26</volume>(<issue>19</issue>):<fpage>2460</fpage>&#x02013;<lpage>1</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/btq461</pub-id><pub-id pub-id-type="pmid">20709691</pub-id></citation></ref>
<ref id="B126"><label>126</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Caporaso</surname> <given-names>JG</given-names></name> <name><surname>Kuczynski</surname> <given-names>J</given-names></name> <name><surname>Stombaugh</surname> <given-names>J</given-names></name> <name><surname>Bittinger</surname> <given-names>K</given-names></name> <name><surname>Bushman</surname> <given-names>FD</given-names></name> <name><surname>Costello</surname> <given-names>EK</given-names></name> <etal/></person-group> <article-title>QIIME allows analysis of high-throughput community sequencing data</article-title>. <source>Nat Methods</source> (<year>2010</year>) <volume>7</volume>(<issue>5</issue>):<fpage>335</fpage>&#x02013;<lpage>6</lpage>.<pub-id pub-id-type="doi">10.1038/nmeth.f.303</pub-id></citation></ref>
<ref id="B127"><label>127</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kuczynski</surname> <given-names>J</given-names></name> <name><surname>Stombaugh</surname> <given-names>J</given-names></name> <name><surname>Walters</surname> <given-names>WA</given-names></name> <name><surname>Gonz&#x000E1;lez</surname> <given-names>A</given-names></name> <name><surname>Caporaso</surname> <given-names>JG</given-names></name> <name><surname>Knight</surname> <given-names>R</given-names></name></person-group>. <article-title>Using QIIME to analyze 16S rRNA gene sequences from microbial communities</article-title>. <source>Curr Protoc Bioinformatics</source> (<year>2011</year>) Chapter 10:Unit10.7.<pub-id pub-id-type="doi">10.1002/0471250953.bi1007s36</pub-id><pub-id pub-id-type="pmid">22161565</pub-id></citation></ref>
<ref id="B128"><label>128</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmidt</surname> <given-names>TSB</given-names></name> <name><surname>Matias Rodrigues</surname> <given-names>JF</given-names></name> <name><surname>von Mering</surname> <given-names>C</given-names></name></person-group>. <article-title>Ecological consistency of SSU rRNA-based operational taxonomic units at a global scale</article-title>. <source>PLoS Comput Biol</source> (<year>2014</year>) <volume>10</volume>(<issue>4</issue>):<fpage>e1003594</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pcbi.1003594</pub-id><pub-id pub-id-type="pmid">24763141</pub-id></citation></ref>
<ref id="B129"><label>129</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmidt</surname> <given-names>TSB</given-names></name> <name><surname>Matias Rodrigues</surname> <given-names>JF</given-names></name> <name><surname>von Mering</surname> <given-names>C</given-names></name></person-group>. <article-title>Limits to robustness and reproducibility in the demarcation of operational taxonomic units</article-title>. <source>Environ Microbiol</source> (<year>2015</year>) <volume>17</volume>(<issue>5</issue>):<fpage>1689</fpage>&#x02013;<lpage>706</lpage>.<pub-id pub-id-type="doi">10.1111/1462-2920.12610</pub-id><pub-id pub-id-type="pmid">25156547</pub-id></citation></ref>
<ref id="B130"><label>130</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pruesse</surname> <given-names>E</given-names></name> <name><surname>Peplies</surname> <given-names>J</given-names></name> <name><surname>Gl&#x000F6;ckner</surname> <given-names>FO</given-names></name></person-group>. <article-title>SINA: accurate high-throughput multiple sequence alignment of ribosomal RNA genes</article-title>. <source>Bioinformatics</source> (<year>2012</year>) <volume>28</volume>(<issue>14</issue>):<fpage>1823</fpage>&#x02013;<lpage>9</lpage>.<pub-id pub-id-type="doi">10.1093/bioinformatics/bts252</pub-id></citation></ref>
<ref id="B131"><label>131</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>DeSantis</surname> <given-names>TZ</given-names></name> <name><surname>Hugenholtz</surname> <given-names>P</given-names></name> <name><surname>Keller</surname> <given-names>K</given-names></name> <name><surname>Brodie</surname> <given-names>EL</given-names></name> <name><surname>Larsen</surname> <given-names>N</given-names></name> <name><surname>Piceno</surname> <given-names>YM</given-names></name> <etal/></person-group> <article-title>NAST: a multiple sequence alignment server for comparative analysis of 16S rRNA genes</article-title>. <source>Nucleic Acids Res</source> (<year>2006</year>) <volume>34</volume>(<issue>Web Server issue</issue>):<fpage>W394</fpage>&#x02013;<lpage>9</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkl244</pub-id><pub-id pub-id-type="pmid">16845035</pub-id></citation></ref>
<ref id="B132"><label>132</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tuzhikov</surname> <given-names>A</given-names></name> <name><surname>Panchin</surname> <given-names>A</given-names></name> <name><surname>Shestopalov</surname> <given-names>VI</given-names></name></person-group>. <article-title>TUIT, a BLAST-based tool for taxonomic classification of nucleotide sequences</article-title>. <source>Biotechniques</source> (<year>2014</year>) <volume>56</volume>(<issue>2</issue>):<fpage>78</fpage>&#x02013;<lpage>84</lpage>.<pub-id pub-id-type="doi">10.2144/000114135</pub-id><pub-id pub-id-type="pmid">24502797</pub-id></citation></ref>
<ref id="B133"><label>133</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chaudhary</surname> <given-names>N</given-names></name> <name><surname>Sharma</surname> <given-names>AK</given-names></name> <name><surname>Agarwal</surname> <given-names>P</given-names></name> <name><surname>Gupta</surname> <given-names>A</given-names></name> <name><surname>Sharma</surname> <given-names>VK</given-names></name></person-group>. <article-title>16S classifier: a tool for fast and accurate taxonomic classification of 16S rRNA hypervariable regions in metagenomic datasets</article-title>. <source>PLoS One. Public Library of Science</source> (<year>2015</year>) <volume>10</volume>(<issue>2</issue>):<fpage>e0116106</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0116106</pub-id><pub-id pub-id-type="pmid">25646627</pub-id></citation></ref>
<ref id="B134"><label>134</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Klappenbach</surname> <given-names>JA</given-names></name> <name><surname>Dunbar</surname> <given-names>JM</given-names></name> <name><surname>Schmidt</surname> <given-names>TM</given-names></name></person-group>. <article-title>rRNA operon copy number reflects ecological strategies of bacteria</article-title>. <source>Appl Environ Microbiol</source> (<year>2000</year>) <volume>66</volume>(<issue>4</issue>):<fpage>1328</fpage>&#x02013;<lpage>33</lpage>.<pub-id pub-id-type="doi">10.1128/AEM.66.4.1328-1333.2000</pub-id><pub-id pub-id-type="pmid">10742207</pub-id></citation></ref>
<ref id="B135"><label>135</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Acinas</surname> <given-names>SG</given-names></name> <name><surname>Marcelino</surname> <given-names>LA</given-names></name> <name><surname>Klepac-Ceraj</surname> <given-names>V</given-names></name> <name><surname>Polz</surname> <given-names>MF</given-names></name></person-group>. <article-title>Divergence and redundancy of 16S rRNA sequences in genomes with multiple rrn operons</article-title>. <source>J Bacteriol</source> (<year>2004</year>) <volume>186</volume>(<issue>9</issue>):<fpage>2629</fpage>&#x02013;<lpage>35</lpage>.<pub-id pub-id-type="doi">10.1128/JB.186.9.2629-2635.2004</pub-id><pub-id pub-id-type="pmid">15090503</pub-id></citation></ref>
<ref id="B136"><label>136</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>V&#x0011B;trovsk&#x000FD;</surname> <given-names>T</given-names></name> <name><surname>Baldrian</surname> <given-names>P</given-names></name></person-group>. <article-title>The variability of the 16S rRNA gene in bacterial genomes and its consequences for bacterial community analyses</article-title>. <source>PLoS One</source> (<year>2013</year>) <volume>8</volume>(<issue>2</issue>):<fpage>e57923</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0057923</pub-id><pub-id pub-id-type="pmid">23460914</pub-id></citation></ref>
<ref id="B137"><label>137</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kembel</surname> <given-names>SW</given-names></name> <name><surname>Wu</surname> <given-names>M</given-names></name> <name><surname>Eisen</surname> <given-names>JA</given-names></name> <name><surname>Green</surname> <given-names>JL</given-names></name></person-group>. <article-title>Incorporating 16S gene copy number information improves estimates of microbial diversity and abundance</article-title>. <source>PLoS Comput Biol</source> (<year>2012</year>) <volume>8</volume>(<issue>10</issue>):<fpage>e1002743</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pcbi.1002743</pub-id><pub-id pub-id-type="pmid">23133348</pub-id></citation></ref>
<ref id="B138"><label>138</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Angly</surname> <given-names>FE</given-names></name> <name><surname>Dennis</surname> <given-names>PG</given-names></name> <name><surname>Skarshewski</surname> <given-names>A</given-names></name> <name><surname>Vanwonterghem</surname> <given-names>I</given-names></name> <name><surname>Hugenholtz</surname> <given-names>P</given-names></name> <name><surname>Tyson</surname> <given-names>GW</given-names></name></person-group>. <article-title>CopyRighter: a rapid tool for improving the accuracy of microbial community profiles through lineage-specific gene copy number correction</article-title>. <source>Microbiome.</source> (<year>2014</year>) <volume>2</volume>:<fpage>11</fpage>.<pub-id pub-id-type="doi">10.1186/2049-2618-2-11</pub-id><pub-id pub-id-type="pmid">24708850</pub-id></citation></ref>
<ref id="B139"><label>139</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ravel</surname> <given-names>J</given-names></name> <name><surname>Wommack</surname> <given-names>KE</given-names></name></person-group>. <article-title>All hail reproducibility in microbiome research</article-title>. <source>Microbiome</source> (<year>2014</year>) <volume>2</volume>(<issue>1</issue>):<fpage>8</fpage>.<pub-id pub-id-type="doi">10.1186/2049-2618-2-8</pub-id></citation></ref>
<ref id="B140"><label>140</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hanage</surname> <given-names>WP</given-names></name></person-group>. <article-title>Microbiology: microbiome science needs a healthy dose of scepticism</article-title>. <source>Nature</source> (<year>2014</year>) <volume>512</volume>(<issue>7514</issue>):<fpage>247</fpage>&#x02013;<lpage>8</lpage>.<pub-id pub-id-type="doi">10.1038/512247a</pub-id></citation></ref>
</ref-list>
</back>
</article>
